×
  • Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
 

Should I Use Scalding or Scoobi or Scrunch?

by on Jul 09, 2013

  • 4,529 views

In the past year there has been a tremendous amount of activity on Scala APIs for Hadoop. In this talk we`ll talk about writing Map/Reduce jobs in a more functional manner and explore the three most ...

In the past year there has been a tremendous amount of activity on Scala APIs for Hadoop. In this talk we`ll talk about writing Map/Reduce jobs in a more functional manner and explore the three most popular Scala packages for Hadoop: Scalding, Scoobi and Scrunch. Detailed usage examples will be provided for each along with some real world use cases.

Statistics

Views

Total Views
4,529
Views on SlideShare
3,757
Embed Views
772

Actions

Likes
16
Downloads
0
Comments
1

4 Embeds 772

http://www.scoop.it 549
http://www.linkedin.com 111
https://twitter.com 58
http://www.cnblogs.com 54

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel

11 of 1 previous next

  • robertmetzger1 Robert Metzger You should also have a look at Stratosphere (stratosphere.eu).
    If also offers a Scala API that is similar to the ones presented here (but the execution is probably faster since stratosphere natively supports operators such as join or union and data flow graphs)

    Similar to Spark, Stratosphere does not run on top of MapReduce, it has its own operators. But you can use your existing YARN cluster (and HDFS) and Stratosphere also has a Map and Reduce operator.
    3 months ago
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Should I Use Scalding or Scoobi or Scrunch? Should I Use Scalding or Scoobi or Scrunch? Presentation Transcript