HBase Deep Dive

HBase
Amandeep Khurana
University of California, Santa Cruz
Twitter: @amansk
amansk@gmail.com
www.amandeepkhurana.com

Tuesday, August 17, 2010

How did it start?

• At Google
• Lots of semi structured data

• Commodity hardware

• Horizontal scalability

• Tight integration with MapReduce

2

Why NoSQL?

• RDBMS don’t scale
• Typically large monolithic systems

• Hard to shard

• Specialized hardware.. expensive!

• Buzzword!

3

Google BigTable
• Distributed multi level map
• Fault tolerant, persistent
• Scalable
• Runs on commodity hardware
• Self managing
• Large number of read/write ops
• Fast scans

4

HBase

• Open source BigTable
• HDFS as underlying DFS
• ZooKeeper as lock service
• Tight integration with Hadoop MapReduce

5

HBase
• Data model
• Architecture, implementation
• Regions, Region Servers etc

• API
• Current status and future direction
• Use cases
• How to think HBase (or NoSQL)?

6

Data Model
• Sparse, multi dimensional map
(row, column, timestamp) cell

• Column = Column Family:Column Qualiﬁer
Columns
Fam1:Qual1

Rows
t1
AK v1
Timestamps

7

Data Model
• Sparse, multi dimensional map
(row, column, timestamp) cell

• Column = Column Family:Column Qualiﬁer
Columns
Fam1:Qual1

Rows
t1
AK v1 t2
v2
Timestamps

t2>t1
7

Regions

• Region: Contiguous set of lexicographically
sorted rows
• hbase.hregion.max.ﬁlesize (default 256MB)

• Regions hosted by Region Servers

8

Regions and Splitting
row1

row256

row257

row600

9

row1

row256

row257

row600
Writes

9

row1

row256

row257

row400

row401

row600

9

System Structure

M
a
p Region Servers Master
R
e
d
u
c
e HDFS ZooKeeper

10

Master

• Region splitting
• Load balancing
• Metadata operations
• Multiple masters for failover

11

ZooKeeper

• Master election
• Locate -ROOT- region
• Region Server membership

12

Where is my row?
• 3 level hierarchical lookup scheme MyTable
.META.
MyRow

-ROOT-

ZooKeeper

13

Where is my row?
.META.
MyRow

-ROOT-

ZooKeeper

Row per META region

13

Where is my row?
.META.
MyRow

-ROOT-

ZooKeeper

Row per META region

Row per table region

13

Region

Memstore

HLog
(Append only
WAL on HDFS)
HFile HFile
(on HDFS) (on HDFS)
(Sequence File)
(one per RS)

Region

HFile: Immutable sorted map (byte[] byte[])
(row, column, timestamp) cell value

14

Region
Write
Memstore

HLog
(Append only
WAL on HDFS)
HFile HFile
(on HDFS) (on HDFS)
(Sequence File)
(one per RS)

Region


14

Region

Memstore Flush

HLog
(Append only Small
WAL on HDFS)
HFile HFile
(Sequence File)
(on HDFS) (on HDFS) HFile
(one per RS)

Region


14

Region

Memstore

HLog
(Append only Small
WAL on HDFS)
HFile HFile
(Sequence File)
(one per RS)

Region


14

Region

Memstore

HLog
(Append only Small
WAL on HDFS)
HFile HFile
(Sequence File)
(one per RS) Compaction
Region


14

Region

Memstore

HLog
(Append only
WAL on HDFS)
(Sequence File)
(one per RS) Compaction
Region


14

Region

Memstore

HLog
(Append only
WAL on HDFS)
HFile
(on HDFS)
(Sequence File)
(one per RS)

Region


14

Region

Memstore

HLog
(Append only
WAL on HDFS)
HFile HFile HFile
(on HDFS) (on HDFS) (on HDFS)
(Sequence File)
(one per RS)

Region

15

Region

Read
Memstore

HLog
(Append only
WAL on HDFS)
HFile HFile HFile
(on HDFS) (on HDFS) (on HDFS)
(Sequence File)
(one per RS)

Region

15

Ways to access
• Java
• REST
• Thrift
• Scala
• Jython
• Groovy DSL
• Ruby shell
• Java MR, Cascading, Pig, Hive

16

Java API
• Get
• Put
• Delete
• Scan
• IncrementColumnValue
• TableInputFormat - MapReduce Source
• TableOutputFormat - MapReduce Sink

17

Other Features
• Compression
• In memory column families
• Multiple masters
• Rolling restart
• Bloom ﬁlters
• Efﬁcient bulk loads
• Source and sink for Hive, Pig, Cascading

18

Things being worked on

• Master rewrite
• Move more stuff into ZooKeeper

• Column family based access control
• Inter cluster replication (managed by ZK)
• Store Lucene indexes (HBasene)
19

Use Cases


HBase @ SU*
• Backend for su.pr
• Real time serving + MR analytics (separate clusters)

• 50% cascading, 50% java MR

• Prod cluster (~20 nodes) serves 20k requests/sec

• All new features are backed by HBase
• Hardware: 2xi7, 24GB RAM, 4x1TB
*Source: Personal communication with
21
J-D Cryans, StumbleUpon

HBase @ Mozilla*
• Socorro - crash reporting system
• Catch, process and present crash info for Firefox, Thunderbird,
Fennec, Camino, Seamonkey

• 1.5m crash reports/day

• Earlier: NFS, PostgreSQL
• 17 node production cluster
• Dual Quad Core + 24GB RAM + 4x1TB

• Some user facing reports still served by PostgreSQL.
Being ported to HBase in next Socorro version

*Source: http://blog.mozilla.com/webdev/2010/07/26/
22
moving-socorro-to-hbase/

Data Integration*
• Multiple heterogenous data sources
• Notion of connected data
• Think RDF

• Graph connecting data elements across
systems
• Store in HBase, build transitive closures
• Pattern mining
*Source: ClouDFuse - Scalable data integration in the
23
cloud, MS Project, Amandeep Khurana, UC Santa Cruz

HBase @ Trend Micro*

• Store threat information - Smart Protection
Network
• Open source cloud computing initiative -
TCloud
• Primarily run off EC2

*Source: https://hbase.s3.amazonaws.com/hbase/HBase-
24
Trend-HUG10.pdf

HBase @ Yahoo*

• Content optimization
• Meta-data about content stored in HBase
• Used for extracting item features
• Used in conjunction with PNUTS, Hadoop
• Process 100s of GB in each run
*Source: http://www.slideshare.net/ydn/7-online-
25
contentoptimizationhadoopsummit2010

HBase @ Twitter*
• 7TB/day incoming data, increasing
• Analytics
• People search
• Building new solutions on HBase
• Part of a much larger scheme of things
• Scribe, Crane, Pig, MySQL, Cassandra, Oink,
Elephant Bird, Birdbrain, Hadoop
*Sources: http://www.slideshare.net/kevinweil/nosql-at-
twitter-nosql-eu-2010
http://www.slideshare.net/ydn/3-hadoop-
pigattwitterhadoopsummit2010 26

Others
• Facebook
• Powerset
• Flurry
• WorldLingo
• Adobe
• Lily
• Runa
• Drawn To Scale
• GumGum
• RapLeaf
• Openplaces
• ...
• Meetup.com

27

How to think in HBase?


HBase v/s RDBMS

• Neither solves all problems
• It’s really a wrong comparison

• But puts things in context

29

HBase v/s RDBMS
HBase RDBMS
Column oriented Row oriented (mostly)
Flexible schema, add columns on the
Fixed schema
ﬂy
Good with sparse tables Not optimized for sparse tables
No query language SQL
Wide tables Narrow tables
Optimized for joins (small, fast ones
Joins using MR - not optimized
too!)
Tight integration with MR Not really...

30

HBase v/s RDBMS
HBase RDBMS
De-normalize your data Normalize as you can
Horizontal scalability. Just add
Hard to shard and scale
hardware
Consistent Consistent
No transactions Transactional
Good for semi structured data as well
Good for structured data
as structured data

31

HBase v/s RDBMS

32

HBase v/s RDBMS

Rule:You probably don’t need HBase if your
data can easily ﬁt and be processed on a single
RDBMS box.

32

HBase v/s RDBMS

Rule:You probably don’t need HBase if your
data can easily ﬁt and be processed on a single
RDBMS box.

But then, you are at Hadoop Day, so it probably
can’t!

32

Q&A


HBase Deep Dive

Recommended

Recommended

More Related Content

Viewers also liked

Viewers also liked (16)

Similar to HBase Deep Dive

Similar to HBase Deep Dive (20)

HBase Deep Dive