Be the first to like this
A lightning talk highlighting the difference in power between the Hadoop Streaming and Java apis. In short, the former may handle simpler operations, many more complex operations will require you to combine several map-reduce jobs in a data flow. The power of the multi-job paradigm enables you to tackle large problems, such as large graph traversal operations.
Originally delivered at the 2009 Goruco conference in New York.