HBase lon meetup

  1. HBase status: 0.94, 0.96, 0.98, and future releases. Matteo Bertozzi | @Cloudera. 17 February 2014 (HBase London Meetup)
  2. What is HBase? Apache HBase is an open source, distributed, consistent, non-relational database that provides low-latency, random read/write operations on top of HDFS. (Diagram labels: App, MR, ZooKeeper, HDFS.)
  3. What is HBase? Open Source - Developer Community
     • Vibrant, highly active community!
     • We're growing!
  4. Non-relational
     • Key:Column/Value interface
     • Dynamic columns (qualifiers), "no schema required"
     • "Fixed" column groups (families)
     • table[row:family:column] = value

     Key     Family  Qualifier  Value
     User-A  info    name       Theo
     User-A  info    address    3 Abbey Rd - London NW8 9AY
     User-B  info    name       Dave
     User-C  info    ...        ...
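
The table[row:family:column] = value model above can be sketched with a small in-memory stand-in. This is a toy illustration, not the HBase client API: the class `ToyTable` and its methods are hypothetical names, and the sample data is taken from the slide's table.

```python
from collections import defaultdict


class ToyTable:
    """In-memory stand-in for an HBase table (illustrative only, not the HBase API)."""

    def __init__(self, families):
        # Column families are fixed when the table is created.
        self.families = set(families)
        # row key -> {(family, qualifier): value}
        self.rows = defaultdict(dict)

    def put(self, row, family, qualifier, value):
        if family not in self.families:
            raise KeyError(f"unknown column family: {family}")
        # Qualifiers are dynamic: any new qualifier is accepted.
        self.rows[row][(family, qualifier)] = value

    def get(self, row, family, qualifier):
        # Returns None when the row or the cell does not exist.
        return self.rows.get(row, {}).get((family, qualifier))

    def scan(self, start_row="", stop_row=None):
        """Yield (row, cells) in sorted row-key order; start inclusive, stop exclusive."""
        for row in sorted(self.rows):
            if row < start_row:
                continue
            if stop_row is not None and row >= stop_row:
                break
            yield row, dict(self.rows[row])


users = ToyTable(families=["info"])
users.put("User-A", "info", "name", "Theo")
users.put("User-A", "info", "address", "3 Abbey Rd - London NW8 9AY")
users.put("User-B", "info", "name", "Dave")
```

Writing to a new qualifier such as `info:email` needs no schema change, while writing to an undeclared family raises an error, which mirrors the "dynamic columns, fixed families" split on the slide.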
  5. What is HBase? Distributed
     (Diagram: Client/App, ZooKeeper, HMaster [create, delete table operations], three Region Servers each hosting several Regions [put, get, scan], HDFS.)
     • Region Server
       • Server that contains a set of Regions
       • Handles read and write requests
     • Region
       • Basic unit of scalability
       • Subset of the table's data
       • Contiguous, sorted range of rows stored together
     • Master
       • Coordinates the cluster (e.g. balancing)
       • Admin ops (create/delete table, …)
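
Because each region holds a contiguous, sorted range of rows, finding the region for a row key amounts to a binary search over the regions' start keys. The sketch below assumes made-up region boundaries and server names; it only illustrates the idea and is not how the HBase client is implemented.

```python
import bisect

# Hypothetical region boundaries: each region starts at its key and ends
# just before the next region's start key. "" means "from the first row".
REGION_START_KEYS = ["", "User-G", "User-N", "User-T"]
# Made-up host names; one region server can host several regions.
REGION_SERVERS = ["rs1", "rs2", "rs3", "rs1"]


def locate_region(row_key):
    """Return (region index, region server) whose key range contains row_key."""
    # bisect_right finds the first start key greater than row_key;
    # the region just before it is the one that contains the row.
    index = bisect.bisect_right(REGION_START_KEYS, row_key) - 1
    return index, REGION_SERVERS[index]
```

For example, `locate_region("User-A")` falls in the first region, and a key equal to a start key such as `"User-G"` belongs to the region that starts there.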
  6. Apache HBase: Timeline and Features
  7. Apache HBase Timeline
     • Nov '06: Google BigTable OSDI '06
     • Apr '07: First Apache HBase commit as Hadoop contrib project
     • Jan '08: Promoted to Hadoop subproject
     • Apr '10: Apache HBase becomes top level project
     • Apr '11: 0.90.1
     • Jan '12: 0.92.0
     • May '12: 0.94.0
     • Oct '13: 0.96.0
     • Feb '14: 0.98.0
     • Q3 '14: 1.0
  8. Apache HBase 0.94
     • Create/Delete Tables
     • Table Insert, Update, Delete, Get, Scan
     • Import/Export tools
     • Map-Reduce job helpers
     • Kerberos & ACLs
     • …
  9. Apache HBase 0.96 (The Latest Release)
  10. 0.96: Major Changes, Minimal Disturbance
     • …more than a year in the making
     • Lots of changes under the hood
     • Hadoop Writables replaced with protobuf (RPC, metadata, …)
     • -ROOT- table removed
     • /hbase dir layout changes
     • Minimal disturbance to the API
     • Improved stability
     • Mean Time To Recovery (MTTR)
  11. 0.96: New Features
     • Online Region Merge
     • Online "Schema" Change
     • Snapshots
     • MTTR
     • Favored Nodes
     • New Balancers
     • Namespaces
     https://blogs.apache.org/hbase/entry/hbase_0_96_0_released
  12. Namespaces
     Abstraction for multiple tenants to create and manage their own tables within a large HBase instance.
     • Separate ACLs
     • Performance Isolation *
     • Region Server groups *
     (Diagram: RSG blue hosts Namespace blue; RSG green orange hosts Namespace green and Namespace orange.)
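
As a small illustration of how namespaces partition tables between tenants: HBase qualifies table names as `namespace:table`, and a name without a prefix belongs to the `default` namespace. The helper names below are hypothetical, the namespaces come from the slide's diagram, and the table names are invented for the example.

```python
DEFAULT_NAMESPACE = "default"  # namespace used when a table name has no prefix


def split_table_name(full_name):
    """Split 'namespace:table' into (namespace, table)."""
    namespace, sep, table = full_name.partition(":")
    if not sep:
        # No ':' present, so the table lives in the default namespace.
        return DEFAULT_NAMESPACE, full_name
    return namespace, table


def tables_by_namespace(full_names):
    """Group table names by the namespace that owns them."""
    grouped = {}
    for name in full_names:
        namespace, table = split_table_name(name)
        grouped.setdefault(namespace, []).append(table)
    return grouped


# Namespaces from the slide's diagram; the table names are made up.
tenants = tables_by_namespace(["blue:events", "green:users", "orange:users", "audit"])
```

Two tenants can each own a table called `users` without clashing, because the namespace is part of the full table name.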
  13. Mean Time to Recovery (MTTR)
     (Timeline diagram: region available for RW, region unavailable, recovered; phases: detect, split, assign, replay; hdfs.)
     • Machine failures happen in distributed systems
     • Repair == split, assign, replay
     • Distributed log replay with fast write recovery
     • Writes in HBase do not incur reads
     • Regions open for write during distributed log replay
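
The "repair == split, assign, replay" sequence can be sketched as follows. This is a simplified model under stated assumptions: the log is a list of hypothetical `(region, row, value)` edits, assignment is plain round-robin, and the function names are invented. It does not model the 0.96 distributed log replay, in which regions already accept writes while replay is still running.

```python
from collections import defaultdict


def split_log(wal_entries):
    """Group a failed server's log edits by region, preserving write order."""
    per_region = defaultdict(list)
    for region, row, value in wal_entries:
        per_region[region].append((row, value))
    return per_region


def recover(wal_entries, live_servers):
    """Split the log, assign each region to a live server, then replay its edits."""
    assignments, stores = {}, {}
    regions = sorted(split_log(wal_entries).items())
    for i, (region, edits) in enumerate(regions):
        # Assign: round-robin over the surviving servers (an assumption).
        assignments[region] = live_servers[i % len(live_servers)]
        # Replay: apply edits in their original order, so later writes win.
        store = stores.setdefault(region, {})
        for row, value in edits:
            store[row] = value
    return assignments, stores


# Edits left behind by a hypothetical failed region server.
wal = [("r1", "a", 1), ("r2", "x", 9), ("r1", "a", 2)]
assignments, stores = recover(wal, ["rs2", "rs3"])
```

Replay only applies writes in order and never has to read existing data first, which is what makes it reasonable to open regions for writes early.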
  14. Apache HBase 0.98 (The Next Release)
  15. 0.98: HBase
     • Wire compatible with 0.96
     • "No binary guarantee with 0.96"
     • 0.94 -> 0.98 upgrade is possible
     • Map-Reduce over Snapshots
     • Stripe Compaction (Pluggable Compaction Algo)
     • Improved WAL write throughput
     • Reverse Scan
     • Per-Cell ACLs, Visibility Labels, Encryption
  16. HBase Security
     • 0.90+ Kerberos (RPC Level)
     • 0.92+ Access Control List (aka ACL)
     • 0.98+ Per-Cell ACLs
     • 0.98+ Visibility Labels (aka Tags)
     • 0.98+ Transparent Table/CF encryption (HBASE-7544)
       • Java KeyStore support
  17. What's Next? Apache HBase 1.0 and beyond
  18. Questions? 17 February 2014 (HBase London Meetup)
