SlideShare a Scribd company logo
1 of 1
Download to read offline
HBase is a NoSQL (Not Only SQL) database. HBase store data
rows in labeled tables, each row have sortable key and an
arbitrary number of columns.
v HBase is linear and modular scaling
v It have automatic region server failover
v Tables are distributed on the cluster via
regions which supports automatic
sharding
v Hadoop/HDFS integration
v Supports MPP (massively parallelized
processing)
v Support Thrift and REST API
v Supports Block Cache and Bloom Filters
for high volume query optimization
v Provides build-in web pages for
operational insight
v Supports strongly consistent
reads/writes
v No joins supported as RDBMS
v Supports Tables with column family,
rows and columns
v All columns belong to column family
v Have table cells with intersection of row
and column coordinates and are
versioned {row, column, version}
v You can run get command to select row,
put to insert or update a row, scan to do
a loop for multiple rows and Delete to
delete record
By: Milind Zodge
v If you have millions or billions of rows,
then HBase is a good candidate
v You do not need advanced query
language SQL and can leave without
secondary indexes and typed columns
v You have enough hardware available as
even HDFS doesn’t do well with
anything less than 5 Data Node so you
will at least need 5 nodes cluster
v If your application has a variable
schema where each row is slightly
different
v If you data I stored in collections
v If you need key based access to data
when storing or retrieving
About HBase When to use HBase

More Related Content

Similar to H base one page

Similar to H base one page (20)

HBase.pptx
HBase.pptxHBase.pptx
HBase.pptx
 
Introduction to HBase
Introduction to HBaseIntroduction to HBase
Introduction to HBase
 
Introduction To HBase
Introduction To HBaseIntroduction To HBase
Introduction To HBase
 
Hbase
HbaseHbase
Hbase
 
Hbase
HbaseHbase
Hbase
 
Apache h base
Apache h baseApache h base
Apache h base
 
HBASE Overview
HBASE OverviewHBASE Overview
HBASE Overview
 
Apache hadoop hbase
Apache hadoop hbaseApache hadoop hbase
Apache hadoop hbase
 
Hadoop - Apache Hbase
Hadoop - Apache HbaseHadoop - Apache Hbase
Hadoop - Apache Hbase
 
Advance Hive, NoSQL Database (HBase) - Module 7
Advance Hive, NoSQL Database (HBase) - Module 7Advance Hive, NoSQL Database (HBase) - Module 7
Advance Hive, NoSQL Database (HBase) - Module 7
 
Nyc hadoop meetup introduction to h base
Nyc hadoop meetup   introduction to h baseNyc hadoop meetup   introduction to h base
Nyc hadoop meetup introduction to h base
 
CCS334 BIG DATA ANALYTICS UNIT 5 PPT ELECTIVE PAPER
CCS334 BIG DATA ANALYTICS UNIT 5 PPT  ELECTIVE PAPERCCS334 BIG DATA ANALYTICS UNIT 5 PPT  ELECTIVE PAPER
CCS334 BIG DATA ANALYTICS UNIT 5 PPT ELECTIVE PAPER
 
Hbase
HbaseHbase
Hbase
 
4. hbase overview
4. hbase overview4. hbase overview
4. hbase overview
 
Apache HBase - Just the Basics
Apache HBase - Just the BasicsApache HBase - Just the Basics
Apache HBase - Just the Basics
 
Hbase
HbaseHbase
Hbase
 
Techincal Talk Hbase-Ditributed,no-sql database
Techincal Talk Hbase-Ditributed,no-sql databaseTechincal Talk Hbase-Ditributed,no-sql database
Techincal Talk Hbase-Ditributed,no-sql database
 
Hbasepreso 111116185419-phpapp02
Hbasepreso 111116185419-phpapp02Hbasepreso 111116185419-phpapp02
Hbasepreso 111116185419-phpapp02
 
Introduction to Apache HBase
Introduction to Apache HBaseIntroduction to Apache HBase
Introduction to Apache HBase
 
H base
H baseH base
H base
 

H base one page

  • 1. HBase is a NoSQL (Not Only SQL) database. HBase store data rows in labeled tables, each row have sortable key and an arbitrary number of columns. v HBase is linear and modular scaling v It have automatic region server failover v Tables are distributed on the cluster via regions which supports automatic sharding v Hadoop/HDFS integration v Supports MPP (massively parallelized processing) v Support Thrift and REST API v Supports Block Cache and Bloom Filters for high volume query optimization v Provides build-in web pages for operational insight v Supports strongly consistent reads/writes v No joins supported as RDBMS v Supports Tables with column family, rows and columns v All columns belong to column family v Have table cells with intersection of row and column coordinates and are versioned {row, column, version} v You can run get command to select row, put to insert or update a row, scan to do a loop for multiple rows and Delete to delete record By: Milind Zodge v If you have millions or billions of rows, then HBase is a good candidate v You do not need advanced query language SQL and can leave without secondary indexes and typed columns v You have enough hardware available as even HDFS doesn’t do well with anything less than 5 Data Node so you will at least need 5 nodes cluster v If your application has a variable schema where each row is slightly different v If you data I stored in collections v If you need key based access to data when storing or retrieving About HBase When to use HBase