Cloudera Impala: A Modern SQL Engine for Hadoop

Posted on Jan 10, 2013

This is a technical deep dive into Cloudera Impala, the project that makes scalable parallel database technology available to the Hadoop community for the first time. Impala is an open-source code base that lets users issue low-latency queries against data stored in HDFS and Apache HBase using familiar SQL operators.

Presenter Marcel Kornacker, creator of Impala, begins with an overview of Impala from the user's perspective, continues with an overview of Impala's architecture and implementation, and concludes with a comparison of Impala with Dremel, Apache Hive, commercial MapReduce alternatives, and traditional data warehouse infrastructure.

  • gemini5201314 alex gemini, Data Engineer at SNDA @Carl Compared with Impala, Hive uses many more resources to complete a job, so I think 'high latency, low throughput' for Hive is correct. 1 year ago
  • cwsteinbach Carl Steinbach at Cloudera On slide 20 Hive is described as 'high latency, low throughput.' Shouldn't that be 'high latency, high throughput'? 1 year ago

Cloudera Impala: A Modern SQL Engine for Hadoop Presentation Transcript