0
HBase	
  status:	
  0.94,	
  0.96,	
  0.98,	
  and	
  future	
  releases
Ma6eo.Bertozzi	
  |	
  	
  @Cloudera	
  	
  
Spea...
What	
  is	
  HBase?
App

ZooKeeper
!2

MR

HDFS

Apache	
  HBase	
  is

an	
  Open	
  Source,

distributed,	
  consistent...
Open	
  Source	
  -­‐	
  Developer	
  Community

•  Vibrant,)Highly)

Ac1ve)community!))
•  We’re)Growing!)
!3

What	
  is...
non-­‐relaQonal
•

Key:Column/Value	
  Interface	
  
• Dynamic	
  columns	
  (qualifiers),	
  “no	
  schema	
  required”	
 ...
Distributed
create,	
  delete	
  table	
  opera=ons

HMaster

put,	
  get,	
  scan

Client/App

ZooKeeper

•

Region	
  Se...
Apache	
  HBase
Timeline	
  and	
  Features

!6
Apache	
  HBase	
  Timeline
Nov$’06:$
Google$$
BigTable$OSDI$‘06$
Q3$‘14:$1.0$

2006$

2007$

Apr$‘07:$First$
Apache$HBase...
Apache	
  HBase	
  0.94
Create/Delete	
  Tables	
  
• Table	
  Insert,	
  Update,	
  Delete,	
  Get,	
  Scan	
  
• Import/...
Apache	
  HBase	
  0.96
(The	
  Latest	
  Release)

!9
0.96:	
  Major	
  Changes,	
  Minimal	
  disturbance
…more	
  than	
  a	
  year	
  in	
  the	
  making	
  
• Lots	
  of	
 ...
0.96:	
  New	
  Features
Online	
  Region	
  Merge	
  
• Online	
  “Schema”	
  Change	
  
• Snapshots	
  
• MTTR	
  
• Fav...
Namespaces
AbstracQon	
  for	
  mulQple	
  tenants	
  to	
  create	
  and	
  manage	
  their	
  own	
  
tables	
  within	
...
Mean	
  Time	
  to	
  Recovery	
  (MTTR)
Region'available''
for'RW'

Region'
unavailable'

detect'

split'
hdfs'

hdfs'

r...
Apache	
  HBase	
  0.98
(The	
  Next	
  Release)

!14
0.98:	
  HBase
Wire	
  CompaQble	
  with	
  0.96	
  
• “No	
  binary	
  guarantee	
  with	
  0.96”	
  
• 0.94	
  -­‐>	
  0...
HBase	
  Security
•

0.90+	
  Kerberos	
  (RPC	
  Level)	
  

•

0.92+	
  Access	
  Control	
  List	
  (aka	
  ACL)	
  

•...
“

What’s	
  Next?	
  
Apache	
  HBase	
  1.0	
  and	
  beyond

	
  	
  	
  	
  	
  	
  	
  

!17

”
QuesQons?
17	
  February	
  2014	
  (HBase	
  London	
  Meetup)

Speaker	
  Name	
  or	
  Subhead	
  Goes	
  Here

!18
Upcoming SlideShare
Loading in...5
×

HBase lon meetup

166

Published on

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
166
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
5
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "HBase lon meetup"

  1. 1. HBase  status:  0.94,  0.96,  0.98,  and  future  releases Ma6eo.Bertozzi  |    @Cloudera     Speaker  Name  or  Subhead  Goes  Here 17  February  2014  (HBase  London  Meetup) !1
  2. 2. What  is  HBase? App ZooKeeper !2 MR HDFS Apache  HBase  is
 an  Open  Source,
 distributed,  consistent,
 non-­‐relaQonal  database
 that  provides  low-­‐latency,
 random  read/write  operaQons   on  top  of  HDFS
  3. 3. Open  Source  -­‐  Developer  Community •  Vibrant,)Highly) Ac1ve)community!)) •  We’re)Growing!) !3 What  is  HBase?
  4. 4. non-­‐relaQonal • Key:Column/Value  Interface   • Dynamic  columns  (qualifiers),  “no  schema  required”   • “Fixed”  column  groups  (families)   • table[row:family:column]  =  value Key Qualifier Value User-A info name Theo User-A info address 3 Abbey Rd - London NW8 9AY User-B info name Dave User-C !4 Family info . . . . . .
  5. 5. Distributed create,  delete  table  opera=ons HMaster put,  get,  scan Client/App ZooKeeper • Region  Server   • • • Server  that  contains  a  set  of  Regions   Handle  reads  and  writes  requests   Region   • Basic  unit  of  scalability   Region  Server Region  Server Region  Server • Subset  of  the  table’s  data   Region Region Region • Region Region Region ConQguous,  sorted  range  of  rows   stored  together   Region Region Region HDFS !5 What  is  HBase? • Master   • Coordinate  the  cluster  (e.g.  Balancing)   • Admin  Ops  (create/delete  table,  …)
  6. 6. Apache  HBase Timeline  and  Features !6
  7. 7. Apache  HBase  Timeline Nov$’06:$ Google$$ BigTable$OSDI$‘06$ Q3$‘14:$1.0$ 2006$ 2007$ Apr$‘07:$First$ Apache$HBase$ commit$as$ Hadoop$contrib$ project$ !7 2008$ 2009$ 2010$ 2011$ 2012$ 2013$ 2014$ Apr’11:'' 0.90.1' Jan‘08:$Promoted$ to$Hadoop$ subproject$ Apr$‘10:$Apache$ HBase$becomes$ top$level$project$ May$‘12:$0.94.0$ Jan$‘12:$0.92.0$ Oct$‘13:$0.96.0$ Feb$‘14:$0.98.0$
  8. 8. Apache  HBase  0.94 Create/Delete  Tables   • Table  Insert,  Update,  Delete,  Get,  Scan   • Import/Export  tools   • Map-­‐Reduce  job  helpers   • Kerberos  &  ACLs   •… • !8
  9. 9. Apache  HBase  0.96 (The  Latest  Release) !9
  10. 10. 0.96:  Major  Changes,  Minimal  disturbance …more  than  a  year  in  the  making   • Lots  of  changes  under  the  hood   • HadoopWritables  replaced  with  protobuf  (RPC,  metadata,  …)   • -­‐ROOT-­‐  Table  Removed   • /hbase  dir  Layout  Changes   • Minimal  disturbance  to  the  API   • Improved  stability   • Mean  Time  To  Recovery  (MTTR) • !10
  11. 11. 0.96:  New  Features Online  Region  Merge   • Online  “Schema”  Change   • Snapshots   • MTTR   • Favored  Nodes   • New  Balancers   • Namespaces • h6ps://blogs.apache.org/hbase/entry/hbase_0_96_0_released !11
  12. 12. Namespaces AbstracQon  for  mulQple  tenants  to  create  and  manage  their  own   tables  within  a  large  HBase  instance. Separate  ACLs   • Performance  IsolaQon  *   • Region  Server  groups    * • RSG$blue$ !12 Namespace(blue( RSG$green$orange$ Namespace(green( Namespace(orange(
  13. 13. Mean  Time  to  Recovery  (MTTR) Region'available'' for'RW' Region' unavailable' detect' split' hdfs' hdfs' replay' assign' recovered' hdfs' Machine  failures  happen  in  distributed  systems   • Repair  ==  split,  assign,  replay   • Distributed  log  replay  with  fast  write  recovery   • Writes  in  HBase  do  not  incur  reads.   • regions  open  for  write,  during  distributed  log  replay • !13
  14. 14. Apache  HBase  0.98 (The  Next  Release) !14
  15. 15. 0.98:  HBase Wire  CompaQble  with  0.96   • “No  binary  guarantee  with  0.96”   • 0.94  -­‐>  0.98  upgrade  is  possible   • • • • • • !15 Map-­‐Reduce  over  Snapshots   Stripe  Compac=on  (Pluggable  Compac=on  Algo)   Improved  WAL  write  throughput   Reverse  Scan   Per-­‐Cell  ACLs,  Visibility  Labels,  Encryp=on
  16. 16. HBase  Security • 0.90+  Kerberos  (RPC  Level)   • 0.92+  Access  Control  List  (aka  ACL)   • 0.98+  Per-­‐Cell  ACLs   • 0.98+  Visibility  Labels  (aka  Tags)   • 0.98+  Transparent  Table/CF  encrypQon  (HBASE-­‐7544)   • !16 Java  KeyStore  support
  17. 17. “ What’s  Next?   Apache  HBase  1.0  and  beyond               !17 ”
  18. 18. QuesQons? 17  February  2014  (HBase  London  Meetup) Speaker  Name  or  Subhead  Goes  Here !18
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×