Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
CC 2.0 by Bryan Pearson | http://flic.kr/p/RUfEt
11.                                   April                                   2013                                    2   ...
11.                                        April                                        2013                              ...
11.                                                                                 April                                 ...
11.                                            info Column Family                    April                                ...
11.                                                                April                                                  ...
11.                                             April                                             2013                    ...
CC 2.0 by Aurelien Guichard | http://flic.kr/p/cjg9yw
11.                                    April                                    2013                                     9...
11.                                                                                                                      A...
Upcoming SlideShare
Loading in …5
×

Apache HBase: Introduction to a column-oriented data store

1,594 views

Published on

Published in: Technology
  • Be the first to comment

  • Be the first to like this

Apache HBase: Introduction to a column-oriented data store

  1. 1. CC 2.0 by Bryan Pearson | http://flic.kr/p/RUfEt
  2. 2. 11. April 2013 2 HBase is an open source, distributed, column-oriented data store modeled after Google’s BigTableHBase IntroductionAbout HBase
  3. 3. 11. April 2013 3•  Sorted map data store•  Table consists of rows, each has a row key (primary key)•  Each row may have any number of columns (Map<byte[], byte []>)•  Rows are sorted lexicographically based on row keyHBase IntroductionData Model
  4. 4. 11. April 2013 Different types of data separated 4 into different “column families” Data is all byte[] Row key Data amuller info: { ‘height’: ‘2.0m’, ‘state’: ‘ZH’ } roles: { ‘IBM’: ‘Sales Manager’ } cguegi info: { ‘height’: ‘1.85m’, ‘state’: ‘BE’ } roles: { ‘Sentric’: ‘Architect’@ts=2011, ‘Sentric’: ‘Mentor’@ts=2012, ‘SBDUG’: ‘Founder’ } Different rows may have different sets of columns (table is sparse) Single cell may have different values at different timestampesHBase IntroductionSorted Map (Logical View)
  5. 5. 11. info Column Family April 2013 Row key Column key Timestamp Value 5 amuller info:height 1333883187 2.0m amuller info:state 1273871824 ZHSorted on disk by row key,column key, descending ts cguegi info:height 1325755229 1.85m cguegi info:state 1325751049 TG roles Column Family Row key Column key Timestamp Value amuller roles:IBM 1320105636 Developer cguegi roles:SBDUG 1330561785 Founder cguegi roles:Sentric 1325376723 Mentor cguegi roles:Sentric 1293840959 Architect Unix timestamp HBase Introduction Sorted Map (Physical View)
  6. 6. 11. April 2013 6 HBase API RegionServer Master HFile Memstore Write-Ahead Log HDFS ZooKeeper [HBase: The Definitive Guide]HBase IntroductionHBase Architecture
  7. 7. 11. April 2013 7•  Favors Consistency over Availability•  Great Hadoop integration•  Ordered range partitions•  Automatically shards/scales•  Sparse column storageHBase IntroductionHBase vs other “NoSQL”
  8. 8. CC 2.0 by Aurelien Guichard | http://flic.kr/p/cjg9yw
  9. 9. 11. April 2013 9•  http://hbase.apache.org•  http://www.sentric.ch•  http://bigdata-usergroup.ch•  http://about.me/cguegiHBase IntroductionResources
  10. 10. 11. April 2013 10Source: http://blogs.the451group.com/information_management/2013/02/04/updated-database-lanscape-map-february-2013/HBase IntroductionDatabase Landscape Map

×