-
1.
1 hbasecon.com
-
2.
The State of HBase
Andrew Purtell, Enis Söztutar, Michael Stack
-
3.
About Us
Andrew Purtell
Salesforce
Release Manager for 0.98
@akpurtell
Enis Söztutar
Hortonworks
Release Manager for 1.0
@enissoz
2.0
Michael Stack
Cloudera
-
@saintstack
-
4.
Outline
● State of the Project
● State of the Software
● State of the Ecosystem
2.0
-
5.
Outline
● State of the Project
● State of the Software
● State of the Ecosystem
2.0
-
6.
State of the Project
● Backing medium- and high- scale services
o Hundreds of enterprises
o Some of the largest Internet companies in the world
● Well established, mature codebase
o >100 contributors, 4.2M lines of code, 1200+ man-
years of total effort*
● Runs on HDFS, MapR, Gluster, GPFS, etc.
● As a service: AWS EMR, HDInsight, etc.
2.0
*Source: OpenHub https://www.openhub.net/p/hbase
-
7.
Project: Vision
Simple, steady, and powerful: “A first class high
performance horizontally scalable data storage
engine for Big Data, suitable as the store of
record for mission critical data.”
2.0
-
8.
Project: Goals
● Availability: Always more, always faster
● Stability and Operability
o Continuous Improvement
● Scaling (up and down)
● Readying for NextGen ‘commodity’ hardware
● Multi-tenancy
● Diversifying our ecosystem
o Come talk to us if you’re building a Big Data product
2.0
-
9.
Project: Delivered 2.0
1.0.0
2.0.0 by HBaseCon 2016!
Released Feb 24, 2015
-
10.
● Eight new committers
Zhang Duo (duozhang), Andrey Stepachev (octo47),
Liu Shaohui (liushaohui), Virag Kothari (virag),
Sean Busbey (busbey), Srikanth Srungarapu (ssrungarapu), Jing
Chen (Jerry) He (jerryjch),
Misty Stanley-Jones (misty)
● Now 43 committers! from a diverse group of companies
including Cask, Cloudera, Facebook, HortonWorks,
IBM, Intel, Salesforce, Xiaomi, Yahoo!, and Google
2.0Project: Committers
-
11.
Project: PMC
● First chair rotation in the project lifetime
Michael Stack (stack), outgoing
Andrew Purtell (apurtell), incoming
● Four new members
Sean Busbey (busbey)
Matteo Bertozzi (mbertozzi)
Nick Dimiduk (ndimiduk)
Jeffrey Zhong (jefferyz)
2.0
-
12.
Project: dev@hbase.a.o
-
13.
Project: user@hbase.a.o
-
14.
Outline
● State of the Project
● State of the Software
● State of the Ecosystem
2.0
-
15.
Software: Semantic Versioning
MAJOR.MINOR.PATCH[-identifiers]
PATCH: only BC bug fixes.
MINOR: BC new features
MAJOR: Incompatible changes
2.0
-
16.
Software: Releases
-
17.
● 0.94.x
o Eight releases: 0.94.20 - 0.94.27*
● 0.98.x
o Twelve releases: 0.98.2 - 0.98.11
● 1.0.x
o Two releases: 1.0.0 - 1.0.1
● 1.1.x
o Release candidate
2.0Software: Releases
-
18.
Software: Issues
13K+ issues created
12K issues resolved!
-
19.
Software: Issues
~3K issues resolved last year
-
20.
Software: Semantic Versioning
Starting with 1.0.0, HBase is working towards
Semantic Versioning* of releases…
* http://semver.org/
2.0
-
21.
Software: Semantic Versioning
MAJOR.MINOR.PATCH[-identifiers]
2.0
-
22.
Software: Semantic Versioning
MAJOR.MINOR.PATCH[-identifiers]
PATCH: only BC bug fixes.
2.0
-
23.
Software: Semantic Versioning
MAJOR.MINOR.PATCH[-identifiers]
PATCH: only BC bug fixes.
MINOR: BC new features
2.0
-
24.
Software: Semantic Versioning
1.0.0
1.0.1
1.1.0
2.0.0-alpha
2.0.0-beta
2.0
-
25.
Software : Semantic Versioning
Client / Server API cleanup continuing
Dependency isolation / shading
Goal is for full semver compliance
HBase-1.0 talk and HBase-2.0 panel for more
-
26.
Software: Focus
● Smaller regions, more regions (scaling)
o Less write amplification
o 1M+ region clusters
● Stability
o Procedure Version2
o Assignment improvements/stability
● Scanners
o Chunking, Heartbeating, ‘Parking’, Streaming
2.0
-
27.
Software: Focus
● Adaption: Work Loads
o HBase as Medium Object Store (MOB)
● Tunable Consistency
o TIMELINE Consistency
● Improving coprocessor API supportability
● Profile-driven optimization
● Improved GC-friendliness, use more RAM
o Offheaping
2.0
-
28.
Software: Focus
● Multitenancy
o Table groups
o Quotas
o Priorities
● Using all of the machine
o RAM
o iops
o All of the CPUs
2.0
-
29.
Outline
● State of the Project
● State of the Software
● State of the Ecosystem
2.0
-
30.
Ecosystem
● OpenTSDB
● Transaction Managers
o Themis, Tephra, Omid2
● Lots-o-Graphs-on-HBase
● SQL
● Hadoop dogfooding Hbase
● Google Cloud Bigtable (keynote follows)
2.0
-
31.
Ecosystem: SQL
o Phoenix 4.4.0RC for HBase 1.0.0
o SQL over raw HBase tables
● Trafodion
o Trafodion 1.1.0 announced
o Heading for Apache Incubator!
● & LeanXcale
2.0
-
32.
2.0Ecosytem: Dogfooding
● YARN-2928 Application Timeline Service
● HIVE-9452 HBase to store Hive metadata
● AMBARI-5707 Ambari Metrics System
-
33.
Get Involved!
Follow us on Twitter
@HBase
Follow us on Facebook
Follow our Blog
https://blogs.apache.org/hbase/
Join our mailing lists
user-subscribe@hbase.apache.org
dev-subscribe@hbase.apache.org
2.0
-
34.
35 hbasecon.com
Welcome everyone! Today is going to be fantastic and we have quite the agenda for you. In the interest of time, lets dive right in.
Deploys you will hear about today.
Not compete with C*. MTTR stuff. Our replication is better than theirs. Master-master, large sequential scans. We should not have to do these C* vs HBase fights anymore… No advertising, no coherent leadership, no PM… Doesn’t help in sales. No one talks about it.
New logo
We won’t leave you behind. 0.94 might be finished. We want you to move up to newer versions…. Counts are since last hbasecon
Add and delete column family while table is online.
See the ecosystem track and use cases for sampling of what is going on in hbase ecosystem these times.
See the ecosystem track and use cases for sampling of what is going on in hbase ecosystem these times. Be sure attend SQL SmackDown
Ambari shipping
Welcome everyone! Today is going to be fantastic and we have quite the agenda for you. In the interest of time, lets dive right in.