×
  • Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
 

20130912 YTC_Reynold Xin_Spark and Shark

by on Sep 17, 2013

  • 2,199 views

In this talk, we present two emerging, popular open source projects: Spark and Shark. Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and ...

In this talk, we present two emerging, popular open source projects: Spark and Shark. Spark is an open source cluster computing system that aims to make data analytics fast — both fast to run and fast to write. It outperform Hadoop by up to 100x in many real-world applications. Spark programs are often much shorter than their MapReduce counterparts thanks to its high-level APIs and language integration in Java, Scala, and Python. Shark is an analytic query engine built on top of Spark that is compatible with Hive. It can run Hive queries much faster in existing Hive warehouses without modifications.

These systems have been adopted by many organizations large and small (e.g. Yahoo, Intel, Adobe, Alibaba, Tencent) to implement data intensive applications such as ETL, interactive SQL, and machine learning.

Statistics

Views

Total Views
2,199
Views on SlideShare
1,036
Embed Views
1,163

Actions

Likes
10
Downloads
80
Comments
0

13 Embeds 1,163

http://www.andretw.com 947
http://docs.sigmoidanalytics.com 70
http://it.tomtang.idv.tw 69
https://www.google.com.tw 49
https://www.google.com.hk 11
http://4783512064634308801_299975f142ad7f5c8f001cc0f014e665a0a4e49b.blogspot.com 5
http://cloud.feedly.com 4
https://www.google.com 3
http://plus.url.google.com 1
http://tw.mobi.yahoo.com 1
http://webcache.googleusercontent.com 1
http://www.google.com.hk 1
http://www.google.com.tw 1
More...

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
Post Comment
Edit your comment

20130912 YTC_Reynold Xin_Spark and Shark 20130912 YTC_Reynold Xin_Spark and Shark Presentation Transcript