HBase @ OSDC.TW 2009

HBase talk @ OSDC.TW 2009

Transcript

  • 1. HBase: Distributed, column-oriented database on Hadoop. Rong-En Fan, rafan@infor.org. OSDC.TW 2009. CC-SA 2.5. Rong-En Fan (rafan) HBase Apr 19, 2009 1 / 13
  • 2. [Slide text unrecoverable due to encoding damage. Legible terms: HBase, Open Source, RDBMS, 2009, FreeBSD ports.]
  • 3. RDBMS and HBase. RDBMS features named: relational, joins, types, SQL, query engine, ...; also transaction, secondary indices; MySQL. [Connecting text unrecoverable due to encoding damage.]
  • 4. HBase
      • distributed database modeled on Google's Bigtable [1]
      • tables of column-oriented rows
      • scalable data store (scales horizontally!)
      • billions of rows × millions of columns × thousands of versions
      • running on top of "commodity" hardware
      • Apache Hadoop subproject since 2008
    [1] Bigtable: A Distributed Storage System for Structured Data
  • 5. Simple key-value: (table, row, family:column, timestamp) → value
      • table: rows are sorted
      • column family: any number of columns; columns are sorted and stored together
      • timestamp: one column can have multiple versions
      • value: just a byte array!
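The key-value mapping on this slide can be pictured with a small in-memory model. This is only a sketch in plain Python, not HBase code: the class `ToyTable` and its methods `put`, `get`, and `rows` are hypothetical names made up for illustration, and returning the newest version by default is an assumption.

```python
from collections import defaultdict


class ToyTable:
    """Toy model of one table: (row, "family:column", timestamp) -> bytes.

    Hypothetical illustration only; this is not the HBase API.
    """

    def __init__(self):
        # row -> "family:column" -> {timestamp: value}
        self._cells = defaultdict(lambda: defaultdict(dict))

    def put(self, row, column, timestamp, value):
        # Values are opaque byte arrays; the store attaches no type to them.
        if not isinstance(value, bytes):
            raise TypeError("value must be bytes")
        self._cells[row][column][timestamp] = value

    def get(self, row, column, timestamp=None):
        # Look up without creating empty entries for unknown rows/columns.
        versions = self._cells.get(row, {}).get(column, {})
        if not versions:
            return None
        if timestamp is None:
            # Assumption: default to the newest version.
            timestamp = max(versions)
        return versions.get(timestamp)

    def rows(self):
        # Rows are exposed in sorted key order.
        return sorted(self._cells)


table = ToyTable()
table.put(b"row2", "info:name", 1, b"second")
table.put(b"row1", "info:name", 1, b"old")
table.put(b"row1", "info:name", 2, b"new")
```

Here `table.get(b"row1", "info:name")` returns the newest version, while passing `timestamp=1` retrieves the older one.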
  • 6. table → regions; one master w/ many region servers. [Remaining text unrecoverable due to encoding damage.]
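One way to picture "table → regions": because rows are sorted, a table can be cut into contiguous row-key ranges, and the range holding a given row can be found by binary search. The sketch below assumes each region is identified by its start key and that the first region starts at the empty key; `find_region` and `REGION_START_KEYS` are hypothetical names, not HBase internals.

```python
import bisect

# Hypothetical split points: three regions covering
# [b"", b"g"), [b"g", b"p"), and [b"p", end of table).
REGION_START_KEYS = [b"", b"g", b"p"]


def find_region(start_keys, row):
    """Return the index of the region whose key range contains `row`.

    `start_keys` must be sorted and begin with b"" so every row has a home.
    """
    # bisect_right counts the start keys <= row; the last of those owns the row.
    return bisect.bisect_right(start_keys, row) - 1
```

With these split points, `find_region(REGION_START_KEYS, b"apple")` falls in region 0 and `find_region(REGION_START_KEYS, b"zebra")` in region 2.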
  • 7. Java API; Thrift for PHP, Ruby, Python, C, ...; REST (JSON or XML); no ACL or encryption for now
  • 8. Create, Read, Update, Delete (CRUD); (table) Scan. [Remaining text unrecoverable due to encoding damage. Legible terms: primary key, scan, web server, memcached.]
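Since rows are kept sorted by key, a table scan can be pictured as a slice over a sorted list of row keys. This is a minimal sketch under assumptions, not the HBase scanner API: the function `scan`, its parameters, and the inclusive-start, exclusive-stop convention are all chosen here for illustration.

```python
import bisect


def scan(sorted_rows, start_row=b"", stop_row=None):
    """Return the (row, value) pairs with start_row <= row < stop_row.

    `sorted_rows` is a list of (row, value) tuples already sorted by row key.
    """
    keys = [row for row, _ in sorted_rows]
    lo = bisect.bisect_left(keys, start_row)
    hi = len(keys) if stop_row is None else bisect.bisect_left(keys, stop_row)
    return sorted_rows[lo:hi]


# Hypothetical rows keyed by reversed domain name.
ROWS = [
    (b"com.example/a", b"1"),
    (b"com.example/b", b"2"),
    (b"org.example/a", b"3"),
]
```

Calling `scan(ROWS, b"com.", b"com/")` returns only the two `com.example` rows, because `/` sorts immediately after `.` in byte order.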
  • 9. [Slide text unrecoverable due to encoding damage. Legible terms: crawler, webtable, index, log, Google Analytics.]
  • 10. Powerset, a Microsoft company; Trend Micro: Advanced Threats Research; HBase PoweredBy list: Mahalo (the first human-powered search engine), Streamy (realtime social news site), WorldLingo (multilingual archive)
  • 11. [Slide text mostly unrecoverable due to encoding damage. Legible terms: HBase, committers, contributors, Jim Kellerman (@Powerset), 0.20.]
  • 12. [Slide text mostly unrecoverable due to encoding damage. Legible terms: 2007, 0.19.1, 0.20.]
  • 13. HBase IRC channel: #hbase on freenode