SlideShare is now on Android. 15 million presentations at your fingertips.  Get the app

×
  • Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
 

Oct 2012 HUG: Project Panthera: Better Analytics with SQL, MapReduce, and HBase

by on Oct 19, 2012

  • 1,772 views

Project Panthera is an open source effort that showcases better data analytics capabilities on Hadoop/HBase (e.g., better integration with existing infrastructure using SQL, better query processing on ...

Project Panthera is an open source effort that showcases better data analytics capabilities on Hadoop/HBase (e.g., better integration with existing infrastructure using SQL, better query processing on HBase, and efficiently utilizing new HW platform technologies). In this talk, we will discusses two new capabilities that we are currently working on under Project Panthera: (1) a SQL Engine for MapReduce (built on top of Hive) that supports common SQL constructs used in analytic queries, including some important features (e.g., sub-query in WHERE clauses, multiple-table SELECT statement, etc.) that are not supported in Hive today; (2) a Document-Oriented Store on HBase for better Hive/SQL query processing, which brings up-to 3x reduction in table storage and up-to 1.8x speedup in query processing.

Presenter: Jason Dai, Principal Engineer, Intel Software and Services Group

Statistics

Views

Total Views
1,772
Views on SlideShare
1,772
Embed Views
0

Actions

Likes
3
Downloads
19
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
Post Comment
Edit your comment

Oct 2012 HUG: Project Panthera: Better Analytics with SQL, MapReduce, and HBase Oct 2012 HUG: Project Panthera: Better Analytics with SQL, MapReduce, and HBase Presentation Transcript