0
SARA Hadoop Hackathon   Evert.Lammerts@sara.nl   December 7, 2010
DJOERD HIEMSTRA                                             (UTwente)EDGAR MEIJ     (UvA)             SARA Hadoop Hackatho...
2002             2004                   2006Nutch*           MR/GFS**               Hadoop*  http://nutch.apache.org/** ht...
2010: A Hype in Productionhttp://wiki.apache.org/hadoop/PoweredBy                SARA Hadoop Hackathon, December 7, 2010
Super computingCloud computing                               Grid computing     Cluster computing              GPU computi...
:-(                       Data         Expensive!                                                         Computation     ...
NameNode              JobTrackerDN   TT   DN      TT             DN        TT        DN     TTDN   TT   DN      TT        ...
File   Map                              Shuffle         Reduce           Output       $ echo “${email#*@}, ${name}”     $ ...
From: Hadoop, The Definitive Guide (2nd Edition), Tom White           SARA Hadoop Hackathon, December 7, 2010
Today09.30 - 09.50   Welcome & Introduction09.50 - 10.15   Map/Reduce @ University of Twente10.15 - 10.30   Kick-off hacka...
Upcoming SlideShare
Loading in...5
×

Introduction to SARA's Hadoop Hackathon - dec 7th 2010

1,317

Published on

This was the first of two introduction presentations to the first Hadoop Hackathon at SARA, the Dutch center for High Performance Computing and Networking.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,317
On Slideshare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
11
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "Introduction to SARA's Hadoop Hackathon - dec 7th 2010"

  1. 1. SARA Hadoop Hackathon Evert.Lammerts@sara.nl December 7, 2010
  2. 2. DJOERD HIEMSTRA (UTwente)EDGAR MEIJ (UvA) SARA Hadoop Hackathon, December 7, 2010
  3. 3. 2002 2004 2006Nutch* MR/GFS** Hadoop*  http://nutch.apache.org/** http://labs.google.com/papers/mapreduce.html   http://labs.google.com/papers/gfs.html SARA Hadoop Hackathon, December 7, 2010
  4. 4. 2010: A Hype in Productionhttp://wiki.apache.org/hadoop/PoweredBy SARA Hadoop Hackathon, December 7, 2010
  5. 5. Super computingCloud computing Grid computing Cluster computing GPU computing http://www.sara.nl/ SARA Hadoop Hackathon, December 7, 2010
  6. 6. :-( Data Expensive! Computation :-) Data Cheaper! ComputationRef: Luiz André Barroso and Urs Hölzle, Google Inc.   The Datacenter as a Computer: An Introduction to the Design of Warehouse­Scale Machines SARA Hadoop Hackathon, December 7, 2010
  7. 7. NameNode JobTrackerDN TT DN TT DN TT DN TTDN TT DN TT DN TT DN TT DN DataNode TT TaskTracker SARA Hadoop Hackathon, December 7, 2010
  8. 8. File Map Shuffle Reduce Output $ echo “${email#*@}, ${name}” $ sort $ wc ­l ewi.utwente.nl, 1 gmail.com,      2 nbic.nl,        1 nikhef.nl,      3 sara.nl,        1 SARA Hadoop Hackathon, December 7, 2010
  9. 9. From: Hadoop, The Definitive Guide (2nd Edition), Tom White SARA Hadoop Hackathon, December 7, 2010
  10. 10. Today09.30 - 09.50 Welcome & Introduction09.50 - 10.15 Map/Reduce @ University of Twente10.15 - 10.30 Kick-off hackathon14.00 - 15.00 Optional: SARA tour10.30 - 17.00 Hackathon17.00 - 17.30 Results and closing SARA Hadoop Hackathon, December 7, 2010
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×