• Email
  • Like
  • Save
  • Private Content
  • Embed
 

Low Latency OLAP with Hadoop and HBase

by on Jun 19, 2012

  • 4,714 views

We use "SaasBase Analytics" to incrementally process large heterogeneous data sets into pre-aggregated, indexed views, stored in HBase to be queried in realtime. The requirement we started from was to ...

We use "SaasBase Analytics" to incrementally process large heterogeneous data sets into pre-aggregated, indexed views, stored in HBase to be queried in realtime. The requirement we started from was to get large amounts of data available in near realtime (minutes) to large amounts of users for large amounts of (different) queries that take milliseconds to execute. This set our problem apart from classical solutions such as Hive and PIG. In this talk I`ll go through the design of the solution and the strategies (and hacks) to achieve low latency and scalability from theoretical model to the entire process of ETL to warehousing and queries.

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Adobe PDF

Usage Rights

© All Rights Reserved

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

8 Embeds 96

http://blog.newitfarmer.com 60
http://charlesxieyupeng.blogspot.com 15
http://eventifier.co 9
http://www.twylah.com 7
http://www.scoop.it 2
http://charlesxieyupeng.blogspot.in 1
https://twitter.com 1
http://matrixpp.blogspot.com 1

More...

Statistics

Likes
19
Downloads
0
Comments
0
Embed Views
96
Views on SlideShare
4,618
Total Views
4,714
Post Comment
Edit your comment

Low Latency OLAP with Hadoop and HBase Low Latency OLAP with Hadoop and HBase Presentation Transcript