Map Reduce

MapReduce A Gentle Introduction, In Four Acts

[object Object],What is Map >> l = (1..10) => 1..10 >> l.map { |i| i + 1 } => [2, 3, 4, 5, 6, 7, 8, 9, 10, 11]

[object Object],[object Object],What is Reduce >> l = (1..10) => 1..10 >> l.inject {|i, j| i + j } => 55

[object Object],[object Object],[object Object],What Is MapReduce

But There Is Some Order <html> <head> <title> Marmots I’ve Loved </title> </head> <body> <h1> Marmot List </h1> <ul> <li> Marcy </li> <li> Stacy </li> </ul> </body> </html> 12:00:23 GET /marmots/index.html 12:00:55 GET /marmots/stacy.jpg 12:00:67 GET /marmots/marcy.jpg

[object Object],[object Object],[object Object],[object Object],[object Object],But What To Do With It?

Act II Enter Stage Left – MapReduce

[object Object],[object Object],[object Object],[object Object],What Is Map, Part Deux

[object Object],[object Object],[object Object],[object Object],What Is Reduce, Part Deux

[object Object],[object Object],What Is Reduce, Part Deux, Part Deux

MapReduce Pseudocode Distributed Word Count* *This example is legally required to be in all introductions to MapReduce map(record) words = split(record, ‘ ‘) for word in words emit(word, 1) reduce(key, values) int count = 0 for value in values count += 1 emit(key, count)

Act III Hadoop (Streaming Mode)

Hadoop! ,[object Object],[object Object],[object Object]

MapReduce Mapper Distributed Word Count* *This example is legally required to be in all introductions to MapReduce #!/usr/bin/ruby STDIN.each_line do |line| words = line.split(' ') words.each { |word| puts "#{word} 1" } end

MapReduce Reducer Distributed Word Count* *This example is legally required to be in all introductions to MapReduce #!/usr/bin/ruby count = 0 current_word = nil STDIN.each_line do |line| key, value = line.split("") current_word = key if nil == current_word if (key != current_word) then puts "#{current_word}#{count}" count = 0 current_word = key end count += value.to_i end puts "#{current_word}#{count}"

Streaming Mode ,[object Object],[object Object],[object Object],[object Object],[object Object]

Act IV Amazon Elastic Map Reduce

So I’ve Got This Pile Of Data, Now What?

Elastic Map Reduce ,[object Object],[object Object],[object Object],[object Object]

Tips! ,[object Object],[object Object],[object Object]

Map Reduce

Recommended

Recommended

More Related Content

What's hot

What's hot (19)

Viewers also liked

Viewers also liked (8)

Similar to Map Reduce

Similar to Map Reduce (20)

Recently uploaded

Recently uploaded (20)

Map Reduce