Comparing Accumulo, Cassandra, and HBase

We’ll look at
• Architecture
• Data Model
• Security
• Query Support
• Gotchas
• Popularity

Accumulo
Apache Accumulo is a computer software project
that developed a sorted, distributed key-value store
based on the BigTable technology from Google.
Known to store on the order of 100 trillion (10^14)
entries in a single table. A single instance can run on
thousands of machines, sometimes over multiple
instances of HDFS.
http://www.pdl.cmu.edu/SDI/2013/slides/big_graph_nsa_rd_2013_56002v1.pdf

Architecture
Service
Storage
New Tablet Servers join the cluster immediately and can
begin serving requests. Data is replicated underneath
asynchronously.

Architecture
Fans of the CAP theorem will recognize that
Accumulo (like BigTable) is a CP system.
Recovery from single node failure means some
amount of unavailability while the WAL is replayed
for some number of tablets.
Each key is managed by exactly one server, ensuring
high consistency.
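
The recovery behaviour described on this slide can be illustrated with a toy model. This is a minimal sketch, not Accumulo code: `ToyTabletServer` and its methods are hypothetical names, and the log is a plain Python list rather than a file in HDFS. It shows why a tablet is briefly unavailable after a failure: the in-memory state is gone, so reads have to wait until the write-ahead log has been replayed.

```python
class ToyTabletServer:
    """Hypothetical single-tablet server: writes go to a log first, then to memory."""

    def __init__(self, wal):
        self.wal = wal          # durable write-ahead log (survives a crash)
        self.memtable = {}      # in-memory state (lost on a crash)
        self.online = False     # the tablet serves requests only when online

    def recover(self):
        # Replay every logged mutation in order before serving anything.
        for key, value in self.wal:
            self.memtable[key] = value
        self.online = True

    def write(self, key, value):
        self.wal.append((key, value))   # log first, so the write is durable
        self.memtable[key] = value

    def read(self, key):
        if not self.online:
            raise RuntimeError("tablet unavailable: WAL replay not finished")
        return self.memtable.get(key)


wal = []
first = ToyTabletServer(wal)
first.recover()
first.write("row1", "a")
first.write("row1", "b")

# Simulate a crash: a new server takes over the tablet with only the log.
second = ToyTabletServer(wal)
try:
    second.read("row1")
    unavailable = False
except RuntimeError:
    unavailable = True
second.recover()
recovered = second.read("row1")
```

Because only one server ever owns the tablet, there is no second copy that could answer differently, which is the consistency side of the trade-off.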

Architecture
Accumulo has a metadata table that can split and
grow to make it possible to keep track of a huge
number of user tablets.
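
A rough way to picture what the metadata table does is a sorted list of tablet boundaries that maps every row to exactly one server. The sketch below is an assumption-laden illustration: `split_points`, `servers`, and `locate` are hypothetical names, and a real metadata table is itself a table of tablets, not a Python list.

```python
import bisect

# Hypothetical metadata: each tablet is identified by its end row (inclusive).
# The last tablet has no end row and covers everything after the final split.
split_points = ["g", "n", "t"]                                # sorted end rows
servers = ["tserver1", "tserver2", "tserver3", "tserver1"]    # one per tablet


def locate(row):
    """Return the single server responsible for `row`."""
    # bisect_left finds the first split point >= row, i.e. the tablet whose
    # range (previous split, this split] contains the row.
    index = bisect.bisect_left(split_points, row)
    return servers[index]
```

For example, `locate("apple")` and `locate("g")` both resolve to the first tablet, while `locate("h")` falls into the second.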

Data Model
Same as BigTable’s, but adds a new component,
‘Column Visibility’, to the original model
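
To make the key structure concrete, here is a minimal sketch of a key with a visibility component and the order such keys sort in. The `Key` tuple and `sort_order` function are illustrative names, not Accumulo's Java classes. The assumption encoded here is that every component sorts ascending except the timestamp, which sorts descending so the newest version comes first.

```python
from typing import NamedTuple


class Key(NamedTuple):
    row: str
    family: str
    qualifier: str
    visibility: str
    timestamp: int


def sort_order(key):
    # All components sort ascending except the timestamp, which sorts
    # descending so the newest version of a cell comes first.
    return (key.row, key.family, key.qualifier, key.visibility, -key.timestamp)


entries = [
    Key("user2", "info", "email", "public", 5),
    Key("user1", "info", "email", "admin&audit", 7),
    Key("user1", "info", "email", "admin&audit", 9),
    Key("user1", "info", "email", "public", 3),
]
ordered = sorted(entries, key=sort_order)
```

Note that the same row and column can hold several entries that differ only in visibility, which is what makes the visibility part of the data model and not a separate access list.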

Data Model: Column Visibility
• “1st class” component in data model
• Security filtering is implemented in a system
iterator that can’t be turned off
• Column Visibilities are stored as human readable
strings – no mapping that could introduce
confusion

Data Model: Locality Groups
• Column families can be created dynamically
• Column families are put into the default locality
group until assigned otherwise
• Column Families can be reassigned to locality
groups as needed
• Provides columnar scanning optimization
• Locality groups are simply sections within RFiles
• no additional overhead on HDFS NameNode
• no restrictions on column family names
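
The columnar scanning optimization can be sketched as follows. This is a toy model under stated assumptions: the group names, `write_file`, and `scan` are hypothetical, and a real RFile is a binary format, not a dictionary. The point is that a scan for a few column families only has to read the sections whose locality group contains them.

```python
# Hypothetical table with two locality groups plus the default group.
locality_groups = {"metadata": {"title", "author"}, "content": {"body"}}


def group_of(family):
    """Return the locality group a column family is assigned to."""
    for name, families in locality_groups.items():
        if family in families:
            return name
    return "default"   # families stay here until assigned otherwise


def write_file(entries):
    """Partition (row, family, value) entries into per-group sections."""
    sections = {}
    for row, family, value in sorted(entries):
        sections.setdefault(group_of(family), []).append((row, family, value))
    return sections


def scan(sections, families):
    """Read only the sections that can contain the requested families."""
    wanted = {group_of(f) for f in families}
    touched = [name for name in sections if name in wanted]
    hits = [e for name in touched for e in sections[name] if e[1] in families]
    return touched, hits


entries = [
    ("doc1", "title", "Intro"), ("doc1", "body", "long text ..."),
    ("doc1", "tags", "new"), ("doc2", "title", "Next"),
    ("doc2", "body", "more text ..."),
]
sections = write_file(entries)
touched, hits = scan(sections, {"title"})
```

Here the `tags` family was never declared, so it lands in the default group, and the scan for `title` never touches the large `body` values.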

Security
• On by default
• User, group/role, access control
• “Cell-level” security via Column Visibilities
• Fail-safe
• Security label operators include & (and) and | (or)
A common problem for new users is writing data they can’t
even see.
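
The way a label built from & and | is checked against a user's authorizations can be sketched with a small evaluator. This is an illustrative sketch only: `visible` is a hypothetical function, not the Accumulo implementation, it accepts only simple alphanumeric tokens, and it does little validation of malformed input. It assumes that mixing & and | at the same level requires parentheses and that an empty label is visible to everyone.

```python
import re


def visible(expression, authorizations):
    """Evaluate a visibility expression such as 'admin&(audit|ops)'."""
    tokens = re.findall(r"[A-Za-z0-9_]+|[&|()]", expression)
    position = 0

    def parse_term():
        nonlocal position
        token = tokens[position]
        position += 1
        if token == "(":
            result = parse_expression()
            position += 1              # skip the closing ")"
            return result
        return token in authorizations

    def parse_expression():
        nonlocal position
        values = [parse_term()]
        operator = None
        while position < len(tokens) and tokens[position] in "&|":
            if operator not in (None, tokens[position]):
                raise ValueError("mixing & and | requires parentheses")
            operator = tokens[position]
            position += 1
            values.append(parse_term())
        return all(values) if operator == "&" else any(values)

    if not tokens:
        return True                    # an empty label is visible to everyone
    return parse_expression()
```

The "can't see your own data" problem falls out directly: a user holding only `public` who writes an entry labelled `secret` gets `visible("secret", {"public"})`, which is `False`, on the next scan.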

Query Support
• No secondary indexing capabilities built in
• Several secondary indexing patterns are well
supported
• Support for scanning long rows
• BatchScanner for fetching results
• Iterators enable storing index entries with records
• No official query language

Some Gotchas
• Usually requires tuning beyond what distros provide
• Not balancing clients and tablet servers
• Having many small tables vs few large tables
• Fewer free online resources such as blog posts,
tutorials, and forums
• Larger individual servers mean that server failure can
result in a large amount of data needing to be
replicated. Accumulo only needs to process recent
write-ahead log entries, however, before everything is
back online.

Popularity
• 4th most popular ‘wide column store’, behind
Cassandra, HBase, and Microsoft Azure Cosmos DB
• About a 16th as popular as HBase
• 60th most popular DB overall, higher than:
• Cloudant
• MemSQL
• Apache Drill
• Oracle NoSQL
• Amazon Simple DB
• LevelDB
• VoltDB
• Google Cloud BigTable
• MapR-DB
https://db-engines.com/en/system/Accumulo


Recent Improvements in 1.7
• Client Authentication with Kerberos
• Data-Center Replication
• User-Initiated Compaction Strategies
• API Clarification
• Faster Startup via Configurable Threadpool Size for Assignments
• Group-Commit Threshold as a Factor of Data Size
• Balancing Groups of Tablets
• User-specified Durability
• Hadoop Metrics2 Support
• Distributed Tracing with HTrace
• Per-Table Volume Chooser
• Table and namespace custom properties

Recent Improvements in 1.8
• Speed up WAL roll overs
• User level API for RFile
• Suspend Tablet assignment for rolling restarts
• Run multiple Tablet Servers on one node
• Rate limiting Major Compactions
• Table Sampling

HBase
“Use Apache HBase™ when you need random, real-time
read/write access to your Big Data. This
project's goal is the hosting of very large tables --
billions of rows X millions of columns -- atop clusters
of commodity hardware.”
Billions x Millions = Trillions, 10^12

Architecture
Basically the same as Accumulo’s.
HBase now stores its root table in ZooKeeper. The
hbase:meta table doesn’t split.

Architecture
Facebook and others didn’t like the recovery time
associated with reading recent write-ahead log
entries, so in 2015 (0.98) read replication was
introduced.
Hortonworks says this improves availability from 99.9% to 99.99%.[1]
Facebook claims its HydraBase design reaches 99.999%.[2]
1. https://hortonworks.com/blog/apache-hbase-high-availability-next-level/
2. https://code.facebook.com/posts/321111638043166/hydrabase-the-evolution-of-hbase-facebook/

Architecture
“With read replicas enabled, the HMaster distributes
read-only copies of regions (replicas) to different
RegionServers in the cluster. One RegionServer
services the default or primary replica, which is the
only replica which can service write requests. If the
RegionServer servicing the primary replica is down,
writes will fail.”
Stale reads are also now possible and unavoidable
https://www.cloudera.com/documentation/enterprise/5-4-x/topics/admin_hbase_read_replicas.html
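
The trade-off in the quoted passage can be shown with a toy region. This is a sketch under assumptions, not HBase code: `ToyRegion`, `refresh_replica`, and the `allow_stale` flag are hypothetical names, and the replica here is refreshed by hand instead of by a background process. It shows the two effects named above: writes fail while the primary is down, and reads served by a replica may be stale.

```python
class ToyRegion:
    """Hypothetical region with one writable primary and one read-only replica."""

    def __init__(self):
        self.primary = {}
        self.replica = {}
        self.primary_up = True

    def write(self, key, value):
        if not self.primary_up:
            raise RuntimeError("primary replica is down: writes fail")
        self.primary[key] = value

    def refresh_replica(self):
        # The replica catches up with the primary only from time to time.
        self.replica = dict(self.primary)

    def read(self, key, allow_stale=False):
        if self.primary_up:
            return self.primary.get(key)
        if allow_stale:
            return self.replica.get(key)   # may be older than the last write
        raise RuntimeError("primary replica is down")


region = ToyRegion()
region.write("k", "v1")
region.refresh_replica()
region.write("k", "v2")          # the replica has not seen this write yet
region.primary_up = False        # the primary's RegionServer goes down

stale = region.read("k", allow_stale=True)
try:
    region.write("k", "v3")
    write_failed = False
except RuntimeError:
    write_failed = True
```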

Architecture
“... make sure to account for their increased heap
memory requirements. Although no additional
copies of HFile data are created, read-only replicas
regions have the same memory footprint as normal
regions and need to be considered when calculating
the amount of increased heap memory required.”
https://www.cloudera.com/documentation/enterprise/5-4-x/topics/admin_hbase_read_replicas.html

Data Model
Same as BigTable’s.
But HBase does not implement BigTable’s Locality
Group feature. Each column family is a directory in
HDFS, effectively separating storage similarly to a
locality group.
Column families must be declared via DDL
statements before use, and their names must be printable

Data Model: Column Families
A blog from 2009 recommends fewer than 100 column
families:
"While the number of rows and columns is
theoretically unbound the number of column
families is not. This is a design trade-off but does not
impose too much restrictions if the tables and key
are designed accordingly.”
http://www.larsgeorge.com/2009/11/hbase-vs-bigtable-comparison.html

“In Cloud Bigtable, unlike in HBase, you can use up to
~100 column families while maintaining excellent
performance.”
https://cloud.google.com/bigtable/docs/schema-design

"HBase currently does not do well with anything
above two or three column families so keep the
number of column families in your schema low."
http://hbase.apache.org/book.html#table_schema_rules_of_thumb

"Try to make do with one column family if you can in
your schemas."
http://hbase.apache.org/book.html#table_schema_rules_of_thumb

Data Model: Value Sizes
"storing 10-50MB objects in HBase would probably
be too much to ask”[1]
“Aim to have cells no larger than 10 MB, or 50 MB if
you use MOB”[2]
1. http://hbase.apache.org/0.94/book/supported.datatypes.html
2. http://hbase.apache.org/book.html#table_schema_rules_of_thumb

Security
• Typical group / role based access control
• Cell-level control was added in 2014, using the
coprocessor mechanism

Security
• To use cell level security
• Ensure HBase is configured to use HFile version 3 storage
• VisibilityController must be added to the list of
coprocessors
• Setup Hadoop Group Mapping mechanism
• By default, visibility labels are lost on replication
• ! (not) is included as an operator, which makes it more
important to ensure that clients can’t drop user
authorization tokens, to avoid elevation of privilege
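
Why a ! operator changes the threat model can be shown in a few lines. The function `visible_with_not` is a hypothetical single-term evaluator written for this example, not HBase's label parser. With only positive terms, presenting fewer tokens can only hide data. With negation, presenting fewer tokens can reveal data.

```python
def visible_with_not(label, authorizations):
    """Evaluate a single term, where a leading '!' negates it."""
    if label.startswith("!"):
        return label[1:] not in authorizations
    return label in authorizations


full = {"employee", "contractor"}
dropped = full - {"contractor"}      # the client omits one of its own tokens

# Without negation, dropping a token can only hide data.
positive = (visible_with_not("contractor", full),
            visible_with_not("contractor", dropped))

# With negation, dropping a token reveals data that was hidden before.
negative = (visible_with_not("!contractor", full),
            visible_with_not("!contractor", dropped))
```

In this sketch `positive` goes from visible to hidden, while `negative` goes from hidden to visible, so the server has to be the one that decides which tokens a request carries.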

Query Support
Despite several attempts, it appears secondary
indexing is handled most often outside of HBase,
using Solr
https://www.cloudera.com/documentation/enterprise/5-6-x/topics/search.html

Gotchas
Covered above

Popularity
• 2nd most popular ‘wide columnar store’
• About half as popular as Cassandra
• Still growing in popularity, but growth has slowed
https://db-engines.com/en/system/HBase


Cassandra
“The Apache Cassandra database is the right choice
when you need scalability and high availability
without compromising performance. Linear
scalability and proven fault-tolerance on commodity
hardware or cloud infrastructure make it the perfect
platform for mission-critical data. Cassandra's
support for replicating across multiple datacenters is
best-in-class, providing lower latency for your users
and the peace of mind of knowing that you can
survive regional outages”

Architecture
• Tries to combine parts of BigTable and Amazon’s
Dynamo
• Designed to span data centers, allows users to
choose between CP and AP
• Every node is the same, no masters, no ZooKeeper,
storage is coupled with service
• Each server still uses a memtable, SSTable files on
disk, compaction, sorting, etc.
• Uses the ‘gossip’ peer-to-peer protocol

Architecture: Consistent Hashing
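
As a reminder of what consistent hashing means, here is a minimal sketch of a hash ring. It is a generic textbook version under assumptions: `token`, `Ring`, and `owner` are hypothetical names, MD5 is used only as a convenient stand-in hash, and Cassandra's actual partitioner and token assignment differ. Each node owns the arc of the ring that ends at its token, so adding a node only takes keys away from existing nodes and never shuffles keys between them.

```python
import bisect
import hashlib


def token(value):
    """Map a string to a position on the ring (an integer derived from MD5)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class Ring:
    def __init__(self, nodes):
        self.entries = sorted((token(node), node) for node in nodes)

    def owner(self, key):
        """Return the first node clockwise from the key's position."""
        tokens = [t for t, _ in self.entries]
        index = bisect.bisect_right(tokens, token(key)) % len(self.entries)
        return self.entries[index][1]


keys = [f"user{i}" for i in range(1000)]
before = Ring(["node1", "node2", "node3"])
after = Ring(["node1", "node2", "node3", "node4"])

# Keys whose owner changed when node4 joined the ring.
moved = [k for k in keys if before.owner(k) != after.owner(k)]
```

Every key in `moved` now belongs to `node4`, and all other keys stay where they were.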

Data Model
• Same as BigTable, but interaction with the data
model directly has been completely eclipsed by a
more table-like abstraction as part of CQL
• Table schemas have to be declared up front
• Now features ‘partition keys’, ‘clustering columns’,
and other, regular columns

Data Model
• ‘Partition keys’ determine to which partition and
server a row belongs
• ‘Clustering columns’ determine how other columns
are grouped within a partition

Data Model
Videos by User
user_id K
uploaded_timestamp C
video_id C
email
first_name
last_name
title
description

Data Model
Column              Role
user_id             partition key
uploaded_timestamp  clustering column
video_id            clustering column
email               column
first_name          column
last_name           column
title               column
description         column
Row Col Value
hash(user_id)::uploaded_timestamp::video_id email …
hash(user_id)::uploaded_timestamp::video_id first_name …
hash(user_id)::uploaded_timestamp::video_id last_name …
hash(user_id)::uploaded_timestamp::video_id title …
hash(user_id)::uploaded_timestamp::video_id description …
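
The mapping from one table row to the underlying cells can be sketched as below. This is an illustration of the layout shown on the slide, not Cassandra's storage engine: `partition_token` and `to_cells` are hypothetical names, the `::` separator simply follows the slide, and the truncated MD5 digest is a stand-in for whatever hash the cluster is configured with.

```python
import hashlib


def partition_token(value):
    # Stand-in hash for illustration only.
    return hashlib.md5(value.encode("utf-8")).hexdigest()[:8]


def to_cells(row, partition_key, clustering_columns):
    """Flatten one table row into (storage row, column, value) cells."""
    prefix = "::".join(
        [partition_token(row[partition_key])]
        + [str(row[name]) for name in clustering_columns]
    )
    key_columns = {partition_key, *clustering_columns}
    return [(prefix, name, value) for name, value in row.items()
            if name not in key_columns]


video = {
    "user_id": "u42", "uploaded_timestamp": "2017-01-01T00:00:00",
    "video_id": "v7", "email": "a@example.com", "first_name": "Ada",
    "last_name": "Lovelace", "title": "Demo", "description": "A test upload",
}
cells = to_cells(video, "user_id", ["uploaded_timestamp", "video_id"])
```

Each regular column becomes its own cell, and all five cells share the same storage row built from the hashed partition key and the clustering columns.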

Data Model
• All dynamism appears to be absent from the data
model
• All data has to be modeled up front
• Columns are even typed

Security
• Cassandra supports a security model similar to that
of relational databases, which supports controlling
access to keyspaces, tables, and rows.[1]
• Row level access relies on exact string matches.[2]
• Column level permissions are not yet
implemented.[3]
1 http://docs.datastax.com/en/dse/5.1/dse-admin/datastax_enterprise/security/secPermissions.html
2 http://docs.datastax.com/en/dse/5.1/dse-admin/datastax_enterprise/security/secRlac.html
3 https://issues.apache.org/jira/browse/CASSANDRA-12859

Query Support
• Cassandra features a SQL-like language called CQL
• Used to specify table schemas
• Used for querying
• Also features built-in secondary indexing

Query Support: Index distribution

Query Support
• hash(user_id) implies no range scans over partition
keys
• only lookups on partition keys or clustering
columns are efficient without secondary indexing
Row Col Value
hash(user_id)::uploaded_timestamp::video_id email …
hash(user_id)::uploaded_timestamp::video_id first_name …
hash(user_id)::uploaded_timestamp::video_id last_name …
hash(user_id)::uploaded_timestamp::video_id title …
hash(user_id)::uploaded_timestamp::video_id description …

Query Support: Caveats
• If a query does not include the partition keys, it has to
filter (not allowed by default)
• Possible to order partition keys to allow range
scans, but not on by default
• A query without all the clustering columns is
rejected unless filtering is on
• Can do some range scans on clustering columns

Query Support: Caveats
• Queries on secondary index restricted to equality
or ‘contains’
• Non-indexed columns can be part of a query with
additional filtering
• When querying secondary index, Cassandra must
query all partitions but avoids naively querying all
at once

Query Support: rounds strategy

Query Support: Indexing Caveats
Avoid very low cardinality indexes.
“e.g. index where the number of distinct values is
very low. A good example is an index on the gender
of an user. On each node, the whole user population
will be distributed on only 2 different partitions for
the index: MALE & FEMALE. If the number of users
per node is very dense (e.g. millions) we’ll have very
wide partitions for MALE & FEMALE index, which is
bad”
https://www.datastax.com/dev/blog/cassandra-native-secondary-index-deep-dive

Avoid very high cardinality indexes.
“For example, indexing user by their email address is
a very bad idea. Generally an email address is used
by at most 1 user. So there are as many distinct index
values (email addresses) as there are users. When
searching user by email, in the best case the
coordinator will hit 1 node and find the user by
chance. The worst case is when the coordinator hits
all primary replicas without finding any answer (0
rows for querying N/RF nodes !)”

Avoid indexing a column which is updated (or
removed then created) frequently.
By design the index data are stored in a Cassandra
table and Cassandra data structure is designed for
immutability. Indexing frequently updated data will
increase write amplification (for the base table + for
the index table)
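
The two cardinality extremes can be made concrete by building a per-node index in memory. This is a sketch only: `build_local_index` is a hypothetical helper and the 10,000 users are generated data, not measurements. It counts how many index partitions each choice creates on one node and how wide the widest one is.

```python
from collections import defaultdict


def build_local_index(rows, column):
    """Group primary keys by indexed value: one index partition per value."""
    index = defaultdict(list)
    for primary_key, row in rows.items():
        index[row[column]].append(primary_key)
    return index


# Hypothetical users stored on a single node.
users = {
    f"user{i}": {"gender": "FEMALE" if i % 2 else "MALE",
                 "email": f"user{i}@example.com"}
    for i in range(10_000)
}

by_gender = build_local_index(users, "gender")
by_email = build_local_index(users, "email")

widest_gender = max(len(keys) for keys in by_gender.values())
widest_email = max(len(keys) for keys in by_email.values())
```

The gender index ends up with 2 partitions of 5,000 entries each, while the email index has 10,000 partitions of one entry each, so a lookup by email has nothing local to narrow down and must ask node after node.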

Other Gotchas
• Adding a replacement node can take a long time
(days) because data must replicate; using more, smaller
servers alleviates this
• Hinted handoffs can be problematic
• A large number of tables can be problematic
• People who run 9000 nodes (Netflix) spread them
across hundreds of clusters, averaging 100 nodes per
cluster

Popularity
• Most popular ‘wide columnar store’
• 8th most popular database overall
• Popularity has plateaued somewhat
https://db-engines.com/en/system/Cassandra


[Chart: Relative Usage by Declaration, Cassandra vs. HBase, scale 0 to 18. Categories: ad, built on, cloud, enterprise, gaming, marketing, mobile, social, web.]
https://en.wikipedia.org/wiki/Apache_Cassandra
https://hbase.apache.org/poweredbyhbase.html

[Chart: Relative Usage By Job Postings, 0% to 100%, for Accumulo, HBase, and Cassandra. Sectors: web, cloud, retail, telecom, enterprise, finance, defense.]
https://www.indeed.com/

Ideas for Accumulo
• Security is still good, but might not be a strong
enough differentiator
• Grow community support
• Offer secondary indexing with a high-level query
language?

Thanks!
Aaron Cordova, Koverse
@aaroncordova
