4. Έ{¨
HBase
distributed database modeled on Google’s Bigtable1
tables of column-oriented rows
scalable data store (scales horizontally !!!)
billions of rows X millions of columns X thousands of
versions
running on top of “commodity” hardware
Apache Hadoop subproject since 2008
1
Bigtable: A Distributed Storage System for Structured Data
Rong-En Fan (rafan) HBase Apr 19, 2009 4 / 13
5. §‚D‹—¨
simple key-value
(table, row, family:column, timestamp) → value
table: rows are sorted
column family: any number of columns, columns are
sorted and stored together
timestamp: one column can have multiple versions
value: just byte array!
Rong-En Fan (rafan) HBase Apr 19, 2009 5 / 13
10. 1Ý9‚9¡Õ9b 3à¨
Powerset, a Microsoft company
Trend Micro: Advanced Threats Research
HBase PoweredBy list
Mahalo (the first human-powered search engine)
Streamy (realtime social news site)
WorldLingo (multilingual archive)
Rong-En Fan (rafan) HBase Apr 19, 2009 10 / 13