Hadoop Summit 2012 | Optimizing MapReduce Job Performance

Uploaded on Jun 18, 2012

Optimizing MapReduce job performance is often seen as something of a black art. To maximize performance, developers need to understand the inner workings of the MapReduce execution framework and how they are affected by various configuration parameters and MR design patterns. The talk will illustrate the underlying mechanics of job and task execution, including the map-side sort/spill, the shuffle, and the reduce-side merge, and then explain how different job configuration parameters and job design strategies affect the performance of these operations. Though the talk will cover internals, it will also provide practical tips, guidelines, and rules of thumb for better job performance. The talk is primarily targeted at developers who use the MapReduce API directly, though it will also include some tips for users of higher-level frameworks.

Presentation Transcript