Introduction to SARA's Hadoop Hackathon - dec 7th 2010

1,391
-1

Published on

This was the first of two introduction presentations to the first Hadoop Hackathon at SARA, the Dutch center for High Performance Computing and Networking.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,391
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
11
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Introduction to SARA's Hadoop Hackathon - dec 7th 2010

  1. 1. SARA Hadoop Hackathon Evert.Lammerts@sara.nl December 7, 2010
  2. 2. DJOERD HIEMSTRA (UTwente)EDGAR MEIJ (UvA) SARA Hadoop Hackathon, December 7, 2010
  3. 3. 2002 2004 2006Nutch* MR/GFS** Hadoop*  http://nutch.apache.org/** http://labs.google.com/papers/mapreduce.html   http://labs.google.com/papers/gfs.html SARA Hadoop Hackathon, December 7, 2010
  4. 4. 2010: A Hype in Productionhttp://wiki.apache.org/hadoop/PoweredBy SARA Hadoop Hackathon, December 7, 2010
  5. 5. Super computingCloud computing Grid computing Cluster computing GPU computing http://www.sara.nl/ SARA Hadoop Hackathon, December 7, 2010
  6. 6. :-( Data Expensive! Computation :-) Data Cheaper! ComputationRef: Luiz André Barroso and Urs Hölzle, Google Inc.   The Datacenter as a Computer: An Introduction to the Design of Warehouse­Scale Machines SARA Hadoop Hackathon, December 7, 2010
  7. 7. NameNode JobTrackerDN TT DN TT DN TT DN TTDN TT DN TT DN TT DN TT DN DataNode TT TaskTracker SARA Hadoop Hackathon, December 7, 2010
  8. 8. File Map Shuffle Reduce Output $ echo “${email#*@}, ${name}” $ sort $ wc ­l ewi.utwente.nl, 1 gmail.com,      2 nbic.nl,        1 nikhef.nl,      3 sara.nl,        1 SARA Hadoop Hackathon, December 7, 2010
  9. 9. From: Hadoop, The Definitive Guide (2nd Edition), Tom White SARA Hadoop Hackathon, December 7, 2010
  10. 10. Today09.30 - 09.50 Welcome & Introduction09.50 - 10.15 Map/Reduce @ University of Twente10.15 - 10.30 Kick-off hackathon14.00 - 15.00 Optional: SARA tour10.30 - 17.00 Hackathon17.00 - 17.30 Results and closing SARA Hadoop Hackathon, December 7, 2010
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×