SlideShare a Scribd company logo
1 of 40
Download to read offline
1
HBase	
  0.96+	
  	
  
A	
  Report	
  on	
  the	
  Current	
  Status	
  
Lars	
  George	
  |	
  EMEA	
  Chief	
  Architect	
  
About	
  Me	
  
•  EMEA	
  Chief	
  Architect	
  @	
  Cloudera	
  
•  ConsulDng	
  on	
  Hadoop	
  projects	
  (everywhere)	
  
•  Apache	
  CommiLer	
  
•  HBase	
  and	
  Whirr	
  
•  O’Reilly	
  Author	
  
•  HBase	
  –	
  The	
  DefiniDve	
  Guide	
  
•  Now	
  in	
  Japanese!	
  
•  Contact	
  
•  lars@cloudera.com	
  
•  @larsgeorge	
  
日本語版も出ました!	
  
The	
  Content...	
  
•  Version	
  History	
  
•  Overview	
  of	
  new	
  Features	
  
•  Summary	
  
CONFIDENTIAL	
  -­‐	
  RESTRICTED	
  
Version	
  History	
  
A	
  Timeline	
  Overview	
  
HBase	
  Releases	
  
5
URL:	
  hLp://s.apache.org/hbase-­‐releases	
  
HBase	
  Releases	
  –	
  Issues	
  Closed	
  (JIRA)	
  
6
URL:	
  hLp://s.apache.org/hbase-­‐releases	
  
HBase	
  Releases	
  –	
  Issues	
  Closed	
  (DisDnct)	
  
7
URL:	
  hLp://s.apache.org/hbase-­‐releases	
  
HBase	
  Book?	
  
I	
  targeted	
  0.92.0	
  but…	
  
	
  
r1130336 | stack | 2011-06-02 00:52:45 +0200 ⤦
(Thu, 02 Jun 2011) | 1 line
Add link to meet up
...
r1234894 | stack | 2012-01-23 17:50:43 +0100 ⤦
(Mon, 23 Jan 2012) | 1 line
Move version on past 0.92.0 to 0.92.1-SNAPSHOT
$ svn log -r 1130336:1234894 | grep "^r" | wc -l
807
8
HBase	
  Book?	
  
I	
  am	
  trailing	
  0.92.0	
  by	
  800+	
  commits,	
  including	
  for	
  
example	
  	
  
r1153634 | tedyu | 2011-08-03 21:59:48 +0200 ⤦
(Wed, 03 Aug 2011) | 2 lines
HBASE-3857 Change the HFile Format (Mikhail & Liyin)
…which	
  is	
  not	
  “unimportant”.	
  J	
  
	
  
I	
  am	
  working	
  on	
  an	
  update!	
  
9
10
Coprocessors	
  and	
  more…	
  
HBase	
  0.92	
  
HBase	
  0.92	
  -­‐	
  Highlights	
  
•  682	
  issues	
  addressed	
  
•  811	
  issues	
  total	
  in	
  0.92.x	
  line	
  
•  New	
  logo!	
   	
  (HBASE-­‐4312)	
  
•  HFile	
  v2	
   	
  (HBASE-­‐3857)	
  
•  Distributed	
  Log	
  Splilng	
   	
  (HBASE-­‐1364)	
  
•  Enhanced	
  Master	
  UI	
  
•  Major	
  compacDon	
  progress	
   	
  (HBASE-­‐3900)	
  
•  Regions	
  in	
  transiDon	
   	
  (HBASE-­‐4291)	
  
•  Tasks	
   	
  (HBASE-­‐3839)	
  
•  Slow	
  Query	
  Metrics	
   	
  (HBASE-­‐4117)	
  
11
HBase	
  0.92	
  -­‐	
  Highlights	
  
•  Coprocessors 	
  (HBASE-­‐2000)	
  
•  Oneap	
  cache 	
  (HBASE-­‐4027)	
  
•  Online	
  Table	
  Schema	
  Change 	
  (HBASE-­‐1730)	
  
•  Regions	
  Size	
  from	
  256MB	
  to	
  1GB 	
  (HBASE-­‐4374)	
  
•  Hadoop	
  1	
  Support 	
  (HBASE-­‐5125)	
  
•  Snappy	
  Support 	
  (HBASE-­‐3691)	
  
•  Keep	
  last	
  version	
  with	
  TTL 	
  (HBASE-­‐4071)	
  
•  MulDthreaded	
  CompacDons 	
  (HBASE-­‐4572)	
  
12
HFile	
  v1	
  –	
  HBase	
  0.90	
  
13
•  Previously	
  the	
  file	
  layout	
  was	
  data	
  blocks,	
  meta	
  blocks	
  
and	
  then	
  file	
  metadata	
  like	
  indexes.	
  
•  Each	
  data	
  block	
  held	
  a	
  magic	
  header	
  and	
  then	
  the	
  
actual	
  data	
  sequenDally.	
  
HFile	
  v2	
  –	
  HBase	
  0.92+	
  
The	
  2nd	
  version	
  of	
  HFile	
  splits	
  the	
  indexes	
  and	
  Bloom	
  
filters	
  up	
  into	
  a	
  hierarchy	
  and	
  interleaves	
  those	
  with	
  
data	
  blocks.	
  
14
The	
  data	
  block	
  header	
  now	
  holds	
  addiDonal	
  info	
  on	
  
the	
  block	
  itself.	
  	
  
	
  
Source:	
  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/	
  
Coprocessors:	
  Observers	
  
15
Coprocessors:	
  RPC	
  Calls	
  
16
Slab	
  Cache	
  –	
  Off-­‐heap	
  Block	
  Cache	
  
17 hLp://blog.cloudera.com/blog/2012/01/caching-­‐in-­‐hbase-­‐slabcache/	
  
•  The	
  off-­‐heap	
  cache	
  uses	
  Java	
  NIO’s	
  Direct	
  ByteBuffer	
  
structures	
  
•  Uses	
  its	
  on	
  slab	
  allocaDon	
  handling	
  
•  Does	
  copy-­‐on-­‐read	
  
during	
  access	
  of	
  data	
  
•  Uses	
  L2	
  cache	
  and	
  
replaces	
  OS	
  buffer	
  
cache	
  
18
Performance	
  Tuning…	
  
HBase	
  0.94	
  
HBase	
  0.94	
  -­‐	
  Highlights	
  
•  420	
  issues	
  addressed	
  
•  1394	
  issues	
  total	
  in	
  0.94.x	
  line	
  
•  Read	
  Caching	
  Improvements 	
  (HBASE-­‐5074)	
  
•  Seek	
  OpDmizaDon	
  
•  Bloom	
  Filter	
  for	
  Delete	
  Family 	
  (HBASE-­‐4532)	
  
•  Lazy	
  Seeks 	
  (HBASE-­‐4465)	
  
•  Write	
  to	
  WAL	
  OpDmizaDons 	
  	
  
•  WAL	
  Compression 	
  (HBASE-­‐4608)	
  
•  Data	
  Block	
  Encoding	
  of	
  KeyValues	
   	
  (HBASE-­‐4218)	
  
•  Improved	
  HBaseFsck 	
  (HBASE-­‐5128)	
  
19
HBase	
  0.94	
  -­‐	
  Highlights	
  
•  Simplified	
  Region	
  Sizing 	
  (HBASE-­‐4365)	
  
•  Smarter	
  TransacDon	
  SemanDcs 	
  	
  
•  Atomic	
  Put&Delete	
  in	
  One	
  Call 	
  (HBASE-­‐3584)	
  
•  Snapshots	
  (0.94.6) 	
  (HBASE-­‐7360)	
  
•  Atomic	
  Appends 	
  (HBASE-­‐4102)	
  
•  MulD	
  Increment	
  and	
  Append 	
  (HBASE-­‐2947)	
  
•  More	
  Aggressive	
  Off-­‐Peak	
  CompacDons	
  (HBASE-­‐4463)	
  
20
HBase	
  0.94	
  -­‐	
  Highlights	
  
•  Per	
  Column	
  Family	
  Metrics 	
  (HBASE-­‐4219)	
  
•  MulD-­‐row	
  local	
  transacDons 	
  (HBASE-­‐5229)	
  
•  Pluggable	
  Split	
  Key	
  Policy 	
  (HBASE-­‐5304)	
  
•  Load	
  balance	
  regions	
  by	
  table 	
  (HBASE-­‐3373)	
  
•  Also	
  backported	
  to	
  0.92.1 	
  	
  
•  Make	
  CompacDon	
  Code	
  Pluggable 	
  (HBASE-­‐6427)	
  
•  Deprecate	
  HTablePool	
  (0.94.11) 	
  (HBASE-­‐6580)	
  
•  Canary	
  Test	
  Tool 	
  (HBASE-­‐4393)	
  
21
Block	
  Encoding	
  
•  Allows	
  to	
  reduce	
  data	
  footprint	
  in	
  memory	
  
•  Only	
  encodes	
  the	
  key	
  porDon	
  of	
  a	
  key/value	
  pair	
  
•  Encoded	
  keys	
  stay	
  encoded	
  also	
  during	
  flushes	
  
•  Compression	
  on	
  top	
  of	
  encoding	
  takes	
  care	
  of	
  the	
  
values	
  and	
  remainder	
  of	
  key	
  data	
  
	
  	
  
Example:	
  
•  Key	
  length:	
  90B	
  
•  Value	
  length:	
  8B	
  
22
Type	
   Ra0o	
  
Key	
  Compression	
   92%	
  
Total	
  Compression	
   85%	
  
LZO	
  on	
  same	
  data	
   85%	
  
LZO	
  axer	
  encoding	
   91%	
  
hLps://issues.apache.org/jira/browse/HBASE-­‐4218	
  
Block	
  Encoding:	
  None	
  
23
Source:	
  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/	
  
•  With	
  no	
  encoding	
  the	
  Key/Value	
  structures	
  are	
  stored	
  
verbaDm	
  (with	
  some	
  overhead	
  for	
  lengths)	
  
•  In	
  the	
  past	
  you	
  were	
  advised	
  to	
  keep	
  the	
  “keys”	
  short	
  
for	
  that	
  reason	
  
	
  
Block	
  Encoding:	
  Prefix	
  Encoding	
  
24
Source:	
  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/	
  
•  The	
  encoding	
  patch	
  added	
  a	
  new	
  Cell	
  abstracDon	
  that	
  
allows	
  for	
  extra	
  fields	
  in	
  a	
  Key/Value	
  
•  The	
  fields	
  are	
  used	
  to	
  track	
  necessary	
  details	
  for	
  the	
  
encoding	
  
Block	
  Encoding:	
  Diff	
  Encoding	
  
25
Source:	
  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/	
  
•  Apart	
  from	
  the	
  prefix	
  encoding	
  there	
  are	
  other	
  ways	
  
of	
  encoding	
  the	
  keys	
  
•  The	
  diff	
  encoding	
  is	
  one	
  of	
  such	
  approaches	
  
Block	
  Encoding	
  
•  Advantage	
  of	
  block	
  encoding	
  is	
  faster	
  decompression/
decoding	
  	
  
•  20-­‐80%	
  faster	
  than	
  LZO	
  
•  Also	
  it	
  allows	
  to	
  seek	
  data	
  sDll,	
  which	
  is	
  not	
  possible	
  
with	
  compressed	
  data	
  
•  Penalty	
  is	
  a	
  slightly	
  slower	
  read	
  performance	
  
compared	
  to	
  non-­‐encoded	
  keys	
  
•  Important	
  is	
  to	
  watch	
  the	
  sizes	
  and	
  repeDDon	
  of	
  key	
  
data,	
  encoding	
  might	
  not	
  be	
  useful	
  for	
  random	
  data	
  
26 hLps://issues.apache.org/jira/browse/HBASE-­‐4218	
  
27
The	
  Singularity	
  
HBase	
  0.96	
  
HBase	
  0.96	
  -­‐	
  Highlights	
  
•  1219	
  issues	
  addressed	
  
•  2243	
  issues	
  total	
  in	
  0.96.x	
  line	
  
•  Improved	
  Stability 	
  (HBASE-­‐6241/6201)	
  
•  ZK	
  based	
  Read/Write	
  locks	
  for	
  table	
  operaDons 	
  (HBASE-­‐7305)	
  
•  Scalability	
  Improvements 	
  (HBASE-­‐8877)	
  
•  Schema	
  Storage 	
  (HBASE-­‐8778)	
  
•  Log	
  Cleaner	
  for	
  ReplicaDon	
  Speed	
  Up 	
  (HBASE-­‐9208)	
  
•  Mean-­‐Time-­‐To-­‐Recovery	
  (MTTR)	
  Improvements
	
  (HBASE-­‐5844/5926)	
  
•  Distributed	
  Log	
  Replay 	
  (HBASE-­‐7006)	
  
•  Dedicated	
  WAL	
  for	
  System	
  Table 	
  (HBASE-­‐7213/8631)	
  
28
HBase	
  0.96	
  -­‐	
  Highlights	
  
•  Operability	
  Improvements	
  
•  Hooks	
  for	
  Health	
  Scripts 	
  (HBASE-­‐7399/7406)	
  
•  Trace	
  Lagging	
  Calls	
  with	
  HTrace 	
  (HBASE-­‐9121)	
  
•  Versioned	
  RPCs	
  and	
  Metadata	
  (Protobufs)	
   	
  (HBASE-­‐3505)	
  
•  Parallel	
  Seeks	
  in	
  Stores 	
  (HBASE-­‐7495)	
  
•  Hadoop	
  1	
  and	
  2	
  Support 	
  	
  
•  Secure	
  Short	
  Circuit	
  Reads	
  on	
  H2 	
  (HBASE-­‐6783)	
  
•  Namespaces	
  Support 	
  (HBASE-­‐8015)	
  
•  New	
  Metrics	
  v2 	
  (HBASE-­‐4050)	
  
•  Cell	
  Interface	
  vs	
  KeyValue 	
  (HBASE-­‐7162)	
  
29
HBase	
  0.96	
  -­‐	
  Highlights	
  
•  No	
  more	
  ROOT	
  table 	
  (HBASE-­‐3171)	
  
•  Remove	
  HFile	
  v1 	
  (HBASE-­‐7660)	
  
•  Trie	
  Data	
  Block	
  Encoding	
   	
  (HBASE-­‐4676)	
  
•  Remove	
  Client-­‐side	
  Row	
  Locks 	
  (HBASE-­‐7263/7315)	
  
•  CompacDon	
  and	
  Flush	
  Improvements
	
  (HBASE-­‐7516/7763/6466/7678)	
  
	
  (HBASE-­‐7667/7110/7603/7519/7842)	
  
•  Improved	
  Default	
  ConfiguraDon 	
  (HBASE-­‐4657?)	
  
•  Client-­‐side	
  Type	
  Library 	
  (HBASE-­‐8089)	
  
	
  
30
HBase	
  0.96	
  -­‐	
  Highlights	
  
•  Online	
  Region	
  Merging 	
  (HBASE-­‐7403/8219)	
  
•  Bucket	
  Cache	
  Support 	
  (HBASE-­‐7404)	
  
•  Remove	
  older	
  ICV	
  Calls 	
  (HBASE-­‐7032)	
  
•  New	
  “Bootstrap”	
  based	
  UIs!	
   	
  (HBASE-­‐6135)	
  
•  Remove	
  Client-­‐side	
  Row	
  Locks 	
  (HBASE-­‐7263/7315)	
  
•  CompacDon	
  and	
  Flush	
  Improvements
	
  (HBASE-­‐7516/7763/6466/7678)	
  
	
  (HBASE-­‐7667/7110/7603/7519/7842)	
  
	
  
31
32
—	
  Michael	
  Stack,	
  HBase	
  PMC	
  Chair	
  
Mean-­‐Time-­‐To-­‐Recovery	
  (MTTR)	
  
•  Lot‘s	
  of	
  effort	
  put	
  into	
  improve	
  how	
  long	
  data	
  might	
  
not	
  be	
  accessible	
  during	
  a	
  region	
  move	
  
•  The	
  offline	
  period	
  is	
  made	
  up	
  of	
  phases:	
  	
  
•  a	
  detecDon	
  phase,	
  	
  
•  a	
  repair	
  phase,	
  	
  
•  reassignment,	
  and	
  finally,	
  	
  
•  clients	
  noDcing	
  the	
  data	
  available	
  in	
  its	
  new	
  locaDon	
  
•  Improvements	
  in	
  many	
  of	
  those	
  areas	
  
•  Faster	
  detecDon,	
  efficient	
  repair,	
  parallel	
  replay	
  
•  Dedicated	
  WAL	
  for	
  system	
  tables	
  
33 hLps://blog.cloudera.com/blog/2013/10/hbase-­‐0-­‐96-­‐0-­‐released/	
  
34
Cell	
  Level	
  Security	
  
HBase	
  0.98	
  
HBase	
  0.98	
  -­‐	
  Highlights	
  
•  1303	
  issues	
  addressed	
  
•  1458	
  issues	
  total	
  in	
  0.98.x	
  line	
  
•  Cell	
  Level	
  Security 	
  (HBASE-­‐6222/7663/7662)	
  
•  Server-­‐side	
  EncrypDon 	
  (HBASE-­‐7544)	
  
•  WAL	
  Throughput	
  Improvements 	
  (HBASE-­‐8755)	
  
•  Reverse	
  Scanner 	
  (HBASE-­‐4811)	
  
•  MapReduce	
  over	
  Snapshot	
  Files 	
  (HBASE-­‐8369)	
  
•  Striped	
  CompacDons 	
  (HBASE-­‐7667)	
  
•  ThroLle	
  ReplicaDon 	
  (HBASE-­‐9501)	
  
35
Cell	
  Level	
  Security	
  
•  Added	
  HFile	
  v3	
  which	
  can	
  store	
  arbitrary	
  metadata	
  in	
  
a	
  cell,	
  called	
  tags	
  
•  Also	
  extended	
  ACL	
  checks	
  to	
  apply	
  to	
  cell	
  levels	
  
•  With	
  this	
  visibility	
  labels	
  can	
  be	
  stored	
  in	
  tags	
  
•  An	
  API	
  and	
  CLI	
  tools	
  are	
  provided	
  that	
  are	
  akin	
  to	
  
Accumulo’s,	
  axer	
  which	
  it	
  is	
  modeled	
  
•  AddiDonal	
  encrypDon	
  of	
  data	
  at	
  rest	
  ensures	
  further	
  
security	
  of	
  sensiDve	
  data	
  
36 hLps://blogs.apache.org/hbase/entry/hbase_cell_security	
  
Visibility	
  Labels	
  
The	
  API	
  allows	
  to	
  set	
  visibility	
  by	
  using	
  expressions	
  with	
  
“&”,	
  “|”,	
  and	
  “!”,	
  as	
  well	
  as	
  “(“	
  and	
  “)”,	
  e.g.	
  label	
  set	
  of	
  
{	
  confidenDal,	
  secret,	
  topsecret,	
  probaDonary	
  }	
  could	
  
be	
  combined	
  as	
  	
  
( secret | topsecret ) & !probationary
	
  
At	
  runDme	
  the	
  expressions	
  are	
  evaluated	
  against	
  a	
  user	
  
and	
  then	
  applied	
  to	
  each	
  cell.	
  
37
38
The	
  Future…	
  
HBase	
  0.??	
  
HBase	
  Future	
  
•  Not	
  much	
  is	
  wriDng	
  in	
  stone	
  yet	
  
•  Master	
  gets	
  rewriLen	
  and	
  also	
  META	
  table	
  handling	
  
•  Build	
  in	
  consensus 	
  (HBASE-­‐10296)	
  
•  Co-­‐locate	
  Master	
  and	
  META 	
  (HBASE-­‐10569)	
  
•  MTTR	
  is	
  further	
  extended	
  into	
  interesDng	
  areas	
  
•  Read	
  replicas 	
  (HBASE-­‐10070)	
  
It	
  has	
  to	
  be	
  seen	
  when	
  1.0.0	
  is	
  released	
  and	
  what	
  it	
  
contains.	
  Your	
  opinion	
  counts!	
  
39
40
Ques0ons?	
  
@larsgeorge	
  

More Related Content

What's hot

HBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and SparkHBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and SparkHBaseCon
 
Large-scale Web Apps @ Pinterest
Large-scale Web Apps @ PinterestLarge-scale Web Apps @ Pinterest
Large-scale Web Apps @ PinterestHBaseCon
 
Cloudera Impala: A Modern SQL Engine for Apache Hadoop
Cloudera Impala: A Modern SQL Engine for Apache HadoopCloudera Impala: A Modern SQL Engine for Apache Hadoop
Cloudera Impala: A Modern SQL Engine for Apache HadoopCloudera, Inc.
 
Taming the Elephant: Efficient and Effective Apache Hadoop Management
Taming the Elephant: Efficient and Effective Apache Hadoop ManagementTaming the Elephant: Efficient and Effective Apache Hadoop Management
Taming the Elephant: Efficient and Effective Apache Hadoop ManagementDataWorks Summit/Hadoop Summit
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in HadoopBackup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadooplarsgeorge
 
HBase Sizing Guide
HBase Sizing GuideHBase Sizing Guide
HBase Sizing Guidelarsgeorge
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoopmarkgrover
 
Intro to HBase - Lars George
Intro to HBase - Lars GeorgeIntro to HBase - Lars George
Intro to HBase - Lars GeorgeJAX London
 
Introduction to Impala
Introduction to ImpalaIntroduction to Impala
Introduction to Impalamarkgrover
 
HBaseCon 2012 | HBase, the Use Case in eBay Cassini
HBaseCon 2012 | HBase, the Use Case in eBay Cassini HBaseCon 2012 | HBase, the Use Case in eBay Cassini
HBaseCon 2012 | HBase, the Use Case in eBay Cassini Cloudera, Inc.
 
HBaseCon 2013: Compaction Improvements in Apache HBase
HBaseCon 2013: Compaction Improvements in Apache HBaseHBaseCon 2013: Compaction Improvements in Apache HBase
HBaseCon 2013: Compaction Improvements in Apache HBaseCloudera, Inc.
 
Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity
Harmonizing Multi-tenant HBase Clusters for Managing Workload DiversityHarmonizing Multi-tenant HBase Clusters for Managing Workload Diversity
Harmonizing Multi-tenant HBase Clusters for Managing Workload DiversityHBaseCon
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseenissoz
 
In-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteIn-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteDataWorks Summit
 
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetHBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetCloudera, Inc.
 
Efficient in situ processing of various storage types on apache tajo
Efficient in situ processing of various storage types on apache tajoEfficient in situ processing of various storage types on apache tajo
Efficient in situ processing of various storage types on apache tajoHyunsik Choi
 
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...Yahoo Developer Network
 

What's hot (20)

HBase in Practice
HBase in Practice HBase in Practice
HBase in Practice
 
HBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and SparkHBaseCon 2015: HBase and Spark
HBaseCon 2015: HBase and Spark
 
Large-scale Web Apps @ Pinterest
Large-scale Web Apps @ PinterestLarge-scale Web Apps @ Pinterest
Large-scale Web Apps @ Pinterest
 
Cloudera Impala: A Modern SQL Engine for Apache Hadoop
Cloudera Impala: A Modern SQL Engine for Apache HadoopCloudera Impala: A Modern SQL Engine for Apache Hadoop
Cloudera Impala: A Modern SQL Engine for Apache Hadoop
 
Taming the Elephant: Efficient and Effective Apache Hadoop Management
Taming the Elephant: Efficient and Effective Apache Hadoop ManagementTaming the Elephant: Efficient and Effective Apache Hadoop Management
Taming the Elephant: Efficient and Effective Apache Hadoop Management
 
Backup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in HadoopBackup and Disaster Recovery in Hadoop
Backup and Disaster Recovery in Hadoop
 
HBase Sizing Guide
HBase Sizing GuideHBase Sizing Guide
HBase Sizing Guide
 
Applications on Hadoop
Applications on HadoopApplications on Hadoop
Applications on Hadoop
 
Intro to HBase - Lars George
Intro to HBase - Lars GeorgeIntro to HBase - Lars George
Intro to HBase - Lars George
 
Introduction to Impala
Introduction to ImpalaIntroduction to Impala
Introduction to Impala
 
HBaseCon 2012 | HBase, the Use Case in eBay Cassini
HBaseCon 2012 | HBase, the Use Case in eBay Cassini HBaseCon 2012 | HBase, the Use Case in eBay Cassini
HBaseCon 2012 | HBase, the Use Case in eBay Cassini
 
HBaseCon 2013: Compaction Improvements in Apache HBase
HBaseCon 2013: Compaction Improvements in Apache HBaseHBaseCon 2013: Compaction Improvements in Apache HBase
HBaseCon 2013: Compaction Improvements in Apache HBase
 
Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity
Harmonizing Multi-tenant HBase Clusters for Managing Workload DiversityHarmonizing Multi-tenant HBase Clusters for Managing Workload Diversity
Harmonizing Multi-tenant HBase Clusters for Managing Workload Diversity
 
NoSQL: Cassadra vs. HBase
NoSQL: Cassadra vs. HBaseNoSQL: Cassadra vs. HBase
NoSQL: Cassadra vs. HBase
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
 
In-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great TasteIn-memory Caching in HDFS: Lower Latency, Same Great Taste
In-memory Caching in HDFS: Lower Latency, Same Great Taste
 
Apache HBase: State of the Union
Apache HBase: State of the UnionApache HBase: State of the Union
Apache HBase: State of the Union
 
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring BudgetHBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
HBaseCon 2012 | Building a Large Search Platform on a Shoestring Budget
 
Efficient in situ processing of various storage types on apache tajo
Efficient in situ processing of various storage types on apache tajoEfficient in situ processing of various storage types on apache tajo
Efficient in situ processing of various storage types on apache tajo
 
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
Apache Hadoop India Summit 2011 talk "Searching Information Inside Hadoop Pla...
 

Viewers also liked

Big Data is not Rocket Science
Big Data is not Rocket ScienceBig Data is not Rocket Science
Big Data is not Rocket Sciencelarsgeorge
 
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv larsgeorge
 
Near-realtime analytics with Kafka and HBase
Near-realtime analytics with Kafka and HBaseNear-realtime analytics with Kafka and HBase
Near-realtime analytics with Kafka and HBasedave_revell
 
Realtime Analytics with Hadoop and HBase
Realtime Analytics with Hadoop and HBaseRealtime Analytics with Hadoop and HBase
Realtime Analytics with Hadoop and HBaselarsgeorge
 
HBase: Extreme makeover
HBase: Extreme makeoverHBase: Extreme makeover
HBase: Extreme makeoverbigbase
 
Oracle 12c r1 installation on solaris 11.1
Oracle 12c r1 installation on solaris 11.1Oracle 12c r1 installation on solaris 11.1
Oracle 12c r1 installation on solaris 11.1Laurent Leturgez
 
Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)
Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)
Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)Laurent Leturgez
 
Ysance conference - cloud computing - aws - 3 mai 2010
Ysance   conference - cloud computing - aws - 3 mai 2010Ysance   conference - cloud computing - aws - 3 mai 2010
Ysance conference - cloud computing - aws - 3 mai 2010Ysance
 
Social Networks and the Richness of Data
Social Networks and the Richness of DataSocial Networks and the Richness of Data
Social Networks and the Richness of Datalarsgeorge
 
Oracle 12c in memory en action
Oracle 12c in memory en actionOracle 12c in memory en action
Oracle 12c in memory en actionLaurent Leturgez
 
Oracle Database In-Memory Option in Action
Oracle Database In-Memory Option in ActionOracle Database In-Memory Option in Action
Oracle Database In-Memory Option in ActionTanel Poder
 
Hadoop is dead - long live Hadoop | BiDaTA 2013 Genoa
Hadoop is dead - long live Hadoop | BiDaTA 2013 GenoaHadoop is dead - long live Hadoop | BiDaTA 2013 Genoa
Hadoop is dead - long live Hadoop | BiDaTA 2013 Genoalarsgeorge
 
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012larsgeorge
 
Introduction sur les problématiques d'une architecture distribuée
Introduction sur les problématiques d'une architecture distribuéeIntroduction sur les problématiques d'une architecture distribuée
Introduction sur les problématiques d'une architecture distribuéeKhanh Maudoux
 
HBase Sizing Notes
HBase Sizing NotesHBase Sizing Notes
HBase Sizing Noteslarsgeorge
 
Phoenix - A High Performance Open Source SQL Layer over HBase
Phoenix - A High Performance Open Source SQL Layer over HBasePhoenix - A High Performance Open Source SQL Layer over HBase
Phoenix - A High Performance Open Source SQL Layer over HBaseSalesforce Developers
 
Modern Linux Performance Tools for Application Troubleshooting
Modern Linux Performance Tools for Application TroubleshootingModern Linux Performance Tools for Application Troubleshooting
Modern Linux Performance Tools for Application TroubleshootingTanel Poder
 

Viewers also liked (20)

Big Data is not Rocket Science
Big Data is not Rocket ScienceBig Data is not Rocket Science
Big Data is not Rocket Science
 
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
Data Pipelines in Hadoop - SAP Meetup in Tel Aviv
 
Near-realtime analytics with Kafka and HBase
Near-realtime analytics with Kafka and HBaseNear-realtime analytics with Kafka and HBase
Near-realtime analytics with Kafka and HBase
 
Realtime Analytics with Hadoop and HBase
Realtime Analytics with Hadoop and HBaseRealtime Analytics with Hadoop and HBase
Realtime Analytics with Hadoop and HBase
 
HBase: Extreme makeover
HBase: Extreme makeoverHBase: Extreme makeover
HBase: Extreme makeover
 
Hanganalyze presentation
Hanganalyze presentationHanganalyze presentation
Hanganalyze presentation
 
Oracle 12c r1 installation on solaris 11.1
Oracle 12c r1 installation on solaris 11.1Oracle 12c r1 installation on solaris 11.1
Oracle 12c r1 installation on solaris 11.1
 
Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)
Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)
Ukoug15 SIMD outside and inside Oracle 12c (12.1.0.2)
 
Ysance conference - cloud computing - aws - 3 mai 2010
Ysance   conference - cloud computing - aws - 3 mai 2010Ysance   conference - cloud computing - aws - 3 mai 2010
Ysance conference - cloud computing - aws - 3 mai 2010
 
Hadoop unit
Hadoop unitHadoop unit
Hadoop unit
 
Social Networks and the Richness of Data
Social Networks and the Richness of DataSocial Networks and the Richness of Data
Social Networks and the Richness of Data
 
Oracle 12c in memory en action
Oracle 12c in memory en actionOracle 12c in memory en action
Oracle 12c in memory en action
 
Oracle Database In-Memory Option in Action
Oracle Database In-Memory Option in ActionOracle Database In-Memory Option in Action
Oracle Database In-Memory Option in Action
 
Hadoop is dead - long live Hadoop | BiDaTA 2013 Genoa
Hadoop is dead - long live Hadoop | BiDaTA 2013 GenoaHadoop is dead - long live Hadoop | BiDaTA 2013 Genoa
Hadoop is dead - long live Hadoop | BiDaTA 2013 Genoa
 
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012
From Batch to Realtime with Hadoop - Berlin Buzzwords - June 2012
 
Introduction sur les problématiques d'une architecture distribuée
Introduction sur les problématiques d'une architecture distribuéeIntroduction sur les problématiques d'une architecture distribuée
Introduction sur les problématiques d'une architecture distribuée
 
Présentation Club STORM
Présentation Club STORMPrésentation Club STORM
Présentation Club STORM
 
HBase Sizing Notes
HBase Sizing NotesHBase Sizing Notes
HBase Sizing Notes
 
Phoenix - A High Performance Open Source SQL Layer over HBase
Phoenix - A High Performance Open Source SQL Layer over HBasePhoenix - A High Performance Open Source SQL Layer over HBase
Phoenix - A High Performance Open Source SQL Layer over HBase
 
Modern Linux Performance Tools for Application Troubleshooting
Modern Linux Performance Tools for Application TroubleshootingModern Linux Performance Tools for Application Troubleshooting
Modern Linux Performance Tools for Application Troubleshooting
 

Similar to HBase Report on Current Status and Version 0.96+ Features

Hbase status quo apache-con europe - nov 2012
Hbase status quo   apache-con europe - nov 2012Hbase status quo   apache-con europe - nov 2012
Hbase status quo apache-con europe - nov 2012Chris Huang
 
HBase New Features
HBase New FeaturesHBase New Features
HBase New Featuresrxu
 
What's new in hadoop 3.0
What's new in hadoop 3.0What's new in hadoop 3.0
What's new in hadoop 3.0Heiko Loewe
 
Facebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconFacebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconYiwei Ma
 
支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统yongboy
 
Facebook Messages & HBase
Facebook Messages & HBaseFacebook Messages & HBase
Facebook Messages & HBase强 王
 
Hadoop hbase introduction
Hadoop hbase introductionHadoop hbase introduction
Hadoop hbase introductionJakub Stransky
 
Scaling HDFS to Manage Billions of Files
Scaling HDFS to Manage Billions of FilesScaling HDFS to Manage Billions of Files
Scaling HDFS to Manage Billions of FilesHaohui Mai
 
Scaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value StoresScaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value StoresDataWorks Summit
 
Apache HBase: Where We've Been and What's Upcoming
Apache HBase: Where We've Been and What's UpcomingApache HBase: Where We've Been and What's Upcoming
Apache HBase: Where We've Been and What's Upcominghuguk
 
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsBig Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsEsther Kundin
 
hbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBasehbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBaseMichael Stack
 
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsBig Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsEsther Kundin
 
Etu Solution Day 2014 Track-D: 掌握Impala和Spark
Etu Solution Day 2014 Track-D: 掌握Impala和SparkEtu Solution Day 2014 Track-D: 掌握Impala和Spark
Etu Solution Day 2014 Track-D: 掌握Impala和SparkJames Chen
 
SQL on Hadoop
SQL on HadoopSQL on Hadoop
SQL on Hadoopnvvrajesh
 
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the CloudSpeed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloudgluent.
 
Cloudera Impala: A Modern SQL Engine for Hadoop
Cloudera Impala: A Modern SQL Engine for HadoopCloudera Impala: A Modern SQL Engine for Hadoop
Cloudera Impala: A Modern SQL Engine for HadoopCloudera, Inc.
 

Similar to HBase Report on Current Status and Version 0.96+ Features (20)

Hbase status quo apache-con europe - nov 2012
Hbase status quo   apache-con europe - nov 2012Hbase status quo   apache-con europe - nov 2012
Hbase status quo apache-con europe - nov 2012
 
HBase New Features
HBase New FeaturesHBase New Features
HBase New Features
 
Hbase 20141003
Hbase 20141003Hbase 20141003
Hbase 20141003
 
What's new in hadoop 3.0
What's new in hadoop 3.0What's new in hadoop 3.0
What's new in hadoop 3.0
 
Facebook keynote-nicolas-qcon
Facebook keynote-nicolas-qconFacebook keynote-nicolas-qcon
Facebook keynote-nicolas-qcon
 
支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统支撑Facebook消息处理的h base存储系统
支撑Facebook消息处理的h base存储系统
 
Facebook Messages & HBase
Facebook Messages & HBaseFacebook Messages & HBase
Facebook Messages & HBase
 
Hadoop hbase introduction
Hadoop hbase introductionHadoop hbase introduction
Hadoop hbase introduction
 
Scaling HDFS to Manage Billions of Files
Scaling HDFS to Manage Billions of FilesScaling HDFS to Manage Billions of Files
Scaling HDFS to Manage Billions of Files
 
Scaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value StoresScaling HDFS to Manage Billions of Files with Key-Value Stores
Scaling HDFS to Manage Billions of Files with Key-Value Stores
 
Apache HBase: Where We've Been and What's Upcoming
Apache HBase: Where We've Been and What's UpcomingApache HBase: Where We've Been and What's Upcoming
Apache HBase: Where We've Been and What's Upcoming
 
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsBig Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
 
hbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBasehbaseconasia2019 BDS: A data synchronization platform for HBase
hbaseconasia2019 BDS: A data synchronization platform for HBase
 
Hbase: an introduction
Hbase: an introductionHbase: an introduction
Hbase: an introduction
 
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry TrendsBig Data and Hadoop - History, Technical Deep Dive, and Industry Trends
Big Data and Hadoop - History, Technical Deep Dive, and Industry Trends
 
Etu Solution Day 2014 Track-D: 掌握Impala和Spark
Etu Solution Day 2014 Track-D: 掌握Impala和SparkEtu Solution Day 2014 Track-D: 掌握Impala和Spark
Etu Solution Day 2014 Track-D: 掌握Impala和Spark
 
SQL on Hadoop
SQL on HadoopSQL on Hadoop
SQL on Hadoop
 
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the CloudSpeed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
Speed Up Your Queries with Hive LLAP Engine on Hadoop or in the Cloud
 
Cloudera Impala: A Modern SQL Engine for Hadoop
Cloudera Impala: A Modern SQL Engine for HadoopCloudera Impala: A Modern SQL Engine for Hadoop
Cloudera Impala: A Modern SQL Engine for Hadoop
 
SQL on Hadoop
SQL on HadoopSQL on Hadoop
SQL on Hadoop
 

Recently uploaded

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxnull - The Open Security Community
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

HBase Report on Current Status and Version 0.96+ Features

  • 1. 1 HBase  0.96+     A  Report  on  the  Current  Status   Lars  George  |  EMEA  Chief  Architect  
  • 2. About  Me   •  EMEA  Chief  Architect  @  Cloudera   •  ConsulDng  on  Hadoop  projects  (everywhere)   •  Apache  CommiLer   •  HBase  and  Whirr   •  O’Reilly  Author   •  HBase  –  The  DefiniDve  Guide   •  Now  in  Japanese!   •  Contact   •  lars@cloudera.com   •  @larsgeorge   日本語版も出ました!  
  • 3. The  Content...   •  Version  History   •  Overview  of  new  Features   •  Summary  
  • 4. CONFIDENTIAL  -­‐  RESTRICTED   Version  History   A  Timeline  Overview  
  • 5. HBase  Releases   5 URL:  hLp://s.apache.org/hbase-­‐releases  
  • 6. HBase  Releases  –  Issues  Closed  (JIRA)   6 URL:  hLp://s.apache.org/hbase-­‐releases  
  • 7. HBase  Releases  –  Issues  Closed  (DisDnct)   7 URL:  hLp://s.apache.org/hbase-­‐releases  
  • 8. HBase  Book?   I  targeted  0.92.0  but…     r1130336 | stack | 2011-06-02 00:52:45 +0200 ⤦ (Thu, 02 Jun 2011) | 1 line Add link to meet up ... r1234894 | stack | 2012-01-23 17:50:43 +0100 ⤦ (Mon, 23 Jan 2012) | 1 line Move version on past 0.92.0 to 0.92.1-SNAPSHOT $ svn log -r 1130336:1234894 | grep "^r" | wc -l 807 8
  • 9. HBase  Book?   I  am  trailing  0.92.0  by  800+  commits,  including  for   example     r1153634 | tedyu | 2011-08-03 21:59:48 +0200 ⤦ (Wed, 03 Aug 2011) | 2 lines HBASE-3857 Change the HFile Format (Mikhail & Liyin) …which  is  not  “unimportant”.  J     I  am  working  on  an  update!   9
  • 10. 10 Coprocessors  and  more…   HBase  0.92  
  • 11. HBase  0.92  -­‐  Highlights   •  682  issues  addressed   •  811  issues  total  in  0.92.x  line   •  New  logo!    (HBASE-­‐4312)   •  HFile  v2    (HBASE-­‐3857)   •  Distributed  Log  Splilng    (HBASE-­‐1364)   •  Enhanced  Master  UI   •  Major  compacDon  progress    (HBASE-­‐3900)   •  Regions  in  transiDon    (HBASE-­‐4291)   •  Tasks    (HBASE-­‐3839)   •  Slow  Query  Metrics    (HBASE-­‐4117)   11
  • 12. HBase  0.92  -­‐  Highlights   •  Coprocessors  (HBASE-­‐2000)   •  Oneap  cache  (HBASE-­‐4027)   •  Online  Table  Schema  Change  (HBASE-­‐1730)   •  Regions  Size  from  256MB  to  1GB  (HBASE-­‐4374)   •  Hadoop  1  Support  (HBASE-­‐5125)   •  Snappy  Support  (HBASE-­‐3691)   •  Keep  last  version  with  TTL  (HBASE-­‐4071)   •  MulDthreaded  CompacDons  (HBASE-­‐4572)   12
  • 13. HFile  v1  –  HBase  0.90   13 •  Previously  the  file  layout  was  data  blocks,  meta  blocks   and  then  file  metadata  like  indexes.   •  Each  data  block  held  a  magic  header  and  then  the   actual  data  sequenDally.  
  • 14. HFile  v2  –  HBase  0.92+   The  2nd  version  of  HFile  splits  the  indexes  and  Bloom   filters  up  into  a  hierarchy  and  interleaves  those  with   data  blocks.   14 The  data  block  header  now  holds  addiDonal  info  on   the  block  itself.       Source:  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/  
  • 17. Slab  Cache  –  Off-­‐heap  Block  Cache   17 hLp://blog.cloudera.com/blog/2012/01/caching-­‐in-­‐hbase-­‐slabcache/   •  The  off-­‐heap  cache  uses  Java  NIO’s  Direct  ByteBuffer   structures   •  Uses  its  on  slab  allocaDon  handling   •  Does  copy-­‐on-­‐read   during  access  of  data   •  Uses  L2  cache  and   replaces  OS  buffer   cache  
  • 19. HBase  0.94  -­‐  Highlights   •  420  issues  addressed   •  1394  issues  total  in  0.94.x  line   •  Read  Caching  Improvements  (HBASE-­‐5074)   •  Seek  OpDmizaDon   •  Bloom  Filter  for  Delete  Family  (HBASE-­‐4532)   •  Lazy  Seeks  (HBASE-­‐4465)   •  Write  to  WAL  OpDmizaDons     •  WAL  Compression  (HBASE-­‐4608)   •  Data  Block  Encoding  of  KeyValues    (HBASE-­‐4218)   •  Improved  HBaseFsck  (HBASE-­‐5128)   19
  • 20. HBase  0.94  -­‐  Highlights   •  Simplified  Region  Sizing  (HBASE-­‐4365)   •  Smarter  TransacDon  SemanDcs     •  Atomic  Put&Delete  in  One  Call  (HBASE-­‐3584)   •  Snapshots  (0.94.6)  (HBASE-­‐7360)   •  Atomic  Appends  (HBASE-­‐4102)   •  MulD  Increment  and  Append  (HBASE-­‐2947)   •  More  Aggressive  Off-­‐Peak  CompacDons  (HBASE-­‐4463)   20
  • 21. HBase  0.94  -­‐  Highlights   •  Per  Column  Family  Metrics  (HBASE-­‐4219)   •  MulD-­‐row  local  transacDons  (HBASE-­‐5229)   •  Pluggable  Split  Key  Policy  (HBASE-­‐5304)   •  Load  balance  regions  by  table  (HBASE-­‐3373)   •  Also  backported  to  0.92.1     •  Make  CompacDon  Code  Pluggable  (HBASE-­‐6427)   •  Deprecate  HTablePool  (0.94.11)  (HBASE-­‐6580)   •  Canary  Test  Tool  (HBASE-­‐4393)   21
  • 22. Block  Encoding   •  Allows  to  reduce  data  footprint  in  memory   •  Only  encodes  the  key  porDon  of  a  key/value  pair   •  Encoded  keys  stay  encoded  also  during  flushes   •  Compression  on  top  of  encoding  takes  care  of  the   values  and  remainder  of  key  data       Example:   •  Key  length:  90B   •  Value  length:  8B   22 Type   Ra0o   Key  Compression   92%   Total  Compression   85%   LZO  on  same  data   85%   LZO  axer  encoding   91%   hLps://issues.apache.org/jira/browse/HBASE-­‐4218  
  • 23. Block  Encoding:  None   23 Source:  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/   •  With  no  encoding  the  Key/Value  structures  are  stored   verbaDm  (with  some  overhead  for  lengths)   •  In  the  past  you  were  advised  to  keep  the  “keys”  short   for  that  reason    
  • 24. Block  Encoding:  Prefix  Encoding   24 Source:  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/   •  The  encoding  patch  added  a  new  Cell  abstracDon  that   allows  for  extra  fields  in  a  Key/Value   •  The  fields  are  used  to  track  necessary  details  for  the   encoding  
  • 25. Block  Encoding:  Diff  Encoding   25 Source:  hLp://blog.cloudera.com/blog/2012/06/hbase-­‐io-­‐hfile-­‐input-­‐output/   •  Apart  from  the  prefix  encoding  there  are  other  ways   of  encoding  the  keys   •  The  diff  encoding  is  one  of  such  approaches  
  • 26. Block  Encoding   •  Advantage  of  block  encoding  is  faster  decompression/ decoding     •  20-­‐80%  faster  than  LZO   •  Also  it  allows  to  seek  data  sDll,  which  is  not  possible   with  compressed  data   •  Penalty  is  a  slightly  slower  read  performance   compared  to  non-­‐encoded  keys   •  Important  is  to  watch  the  sizes  and  repeDDon  of  key   data,  encoding  might  not  be  useful  for  random  data   26 hLps://issues.apache.org/jira/browse/HBASE-­‐4218  
  • 28. HBase  0.96  -­‐  Highlights   •  1219  issues  addressed   •  2243  issues  total  in  0.96.x  line   •  Improved  Stability  (HBASE-­‐6241/6201)   •  ZK  based  Read/Write  locks  for  table  operaDons  (HBASE-­‐7305)   •  Scalability  Improvements  (HBASE-­‐8877)   •  Schema  Storage  (HBASE-­‐8778)   •  Log  Cleaner  for  ReplicaDon  Speed  Up  (HBASE-­‐9208)   •  Mean-­‐Time-­‐To-­‐Recovery  (MTTR)  Improvements  (HBASE-­‐5844/5926)   •  Distributed  Log  Replay  (HBASE-­‐7006)   •  Dedicated  WAL  for  System  Table  (HBASE-­‐7213/8631)   28
  • 29. HBase  0.96  -­‐  Highlights   •  Operability  Improvements   •  Hooks  for  Health  Scripts  (HBASE-­‐7399/7406)   •  Trace  Lagging  Calls  with  HTrace  (HBASE-­‐9121)   •  Versioned  RPCs  and  Metadata  (Protobufs)    (HBASE-­‐3505)   •  Parallel  Seeks  in  Stores  (HBASE-­‐7495)   •  Hadoop  1  and  2  Support     •  Secure  Short  Circuit  Reads  on  H2  (HBASE-­‐6783)   •  Namespaces  Support  (HBASE-­‐8015)   •  New  Metrics  v2  (HBASE-­‐4050)   •  Cell  Interface  vs  KeyValue  (HBASE-­‐7162)   29
  • 30. HBase  0.96  -­‐  Highlights   •  No  more  ROOT  table  (HBASE-­‐3171)   •  Remove  HFile  v1  (HBASE-­‐7660)   •  Trie  Data  Block  Encoding    (HBASE-­‐4676)   •  Remove  Client-­‐side  Row  Locks  (HBASE-­‐7263/7315)   •  CompacDon  and  Flush  Improvements  (HBASE-­‐7516/7763/6466/7678)    (HBASE-­‐7667/7110/7603/7519/7842)   •  Improved  Default  ConfiguraDon  (HBASE-­‐4657?)   •  Client-­‐side  Type  Library  (HBASE-­‐8089)     30
  • 31. HBase  0.96  -­‐  Highlights   •  Online  Region  Merging  (HBASE-­‐7403/8219)   •  Bucket  Cache  Support  (HBASE-­‐7404)   •  Remove  older  ICV  Calls  (HBASE-­‐7032)   •  New  “Bootstrap”  based  UIs!    (HBASE-­‐6135)   •  Remove  Client-­‐side  Row  Locks  (HBASE-­‐7263/7315)   •  CompacDon  and  Flush  Improvements  (HBASE-­‐7516/7763/6466/7678)    (HBASE-­‐7667/7110/7603/7519/7842)     31
  • 32. 32 —  Michael  Stack,  HBase  PMC  Chair  
  • 33. Mean-­‐Time-­‐To-­‐Recovery  (MTTR)   •  Lot‘s  of  effort  put  into  improve  how  long  data  might   not  be  accessible  during  a  region  move   •  The  offline  period  is  made  up  of  phases:     •  a  detecDon  phase,     •  a  repair  phase,     •  reassignment,  and  finally,     •  clients  noDcing  the  data  available  in  its  new  locaDon   •  Improvements  in  many  of  those  areas   •  Faster  detecDon,  efficient  repair,  parallel  replay   •  Dedicated  WAL  for  system  tables   33 hLps://blog.cloudera.com/blog/2013/10/hbase-­‐0-­‐96-­‐0-­‐released/  
  • 34. 34 Cell  Level  Security   HBase  0.98  
  • 35. HBase  0.98  -­‐  Highlights   •  1303  issues  addressed   •  1458  issues  total  in  0.98.x  line   •  Cell  Level  Security  (HBASE-­‐6222/7663/7662)   •  Server-­‐side  EncrypDon  (HBASE-­‐7544)   •  WAL  Throughput  Improvements  (HBASE-­‐8755)   •  Reverse  Scanner  (HBASE-­‐4811)   •  MapReduce  over  Snapshot  Files  (HBASE-­‐8369)   •  Striped  CompacDons  (HBASE-­‐7667)   •  ThroLle  ReplicaDon  (HBASE-­‐9501)   35
  • 36. Cell  Level  Security   •  Added  HFile  v3  which  can  store  arbitrary  metadata  in   a  cell,  called  tags   •  Also  extended  ACL  checks  to  apply  to  cell  levels   •  With  this  visibility  labels  can  be  stored  in  tags   •  An  API  and  CLI  tools  are  provided  that  are  akin  to   Accumulo’s,  axer  which  it  is  modeled   •  AddiDonal  encrypDon  of  data  at  rest  ensures  further   security  of  sensiDve  data   36 hLps://blogs.apache.org/hbase/entry/hbase_cell_security  
  • 37. Visibility  Labels   The  API  allows  to  set  visibility  by  using  expressions  with   “&”,  “|”,  and  “!”,  as  well  as  “(“  and  “)”,  e.g.  label  set  of   {  confidenDal,  secret,  topsecret,  probaDonary  }  could   be  combined  as     ( secret | topsecret ) & !probationary   At  runDme  the  expressions  are  evaluated  against  a  user   and  then  applied  to  each  cell.   37
  • 39. HBase  Future   •  Not  much  is  wriDng  in  stone  yet   •  Master  gets  rewriLen  and  also  META  table  handling   •  Build  in  consensus  (HBASE-­‐10296)   •  Co-­‐locate  Master  and  META  (HBASE-­‐10569)   •  MTTR  is  further  extended  into  interesDng  areas   •  Read  replicas  (HBASE-­‐10070)   It  has  to  be  seen  when  1.0.0  is  released  and  what  it   contains.  Your  opinion  counts!   39