Apache Con Eu2008 Hadoop Tour Tom White

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

2 comments

Comments 1 - 2 of 2 previous next Post a comment

Post a comment
Embed Video
Edit your comment Cancel

11 Favorites

Apache Con Eu2008 Hadoop Tour Tom White - Presentation Transcript

  1. A Tour of Apache Hadoop Tom White Lexeme Ltd www.lexemetech.com tomwhite@apache.org
  2. Itinerary • What is Hadoop? • Components – Distributed File System – MapReduce – HBase • Related Projects
  3. What is Hadoop?
  4. The Problem • Existing tools are struggling to process today's large datasets • How long to grep 1TB of log files? • Why is this a problem for me?
  5. How Does Hadoop Help? • Hadoop provides a framework for storing and processing petabytes of data. • Storage: HDFS, HBase • Processing: MapReduce
  6. A Brief History of Hadoop • Feb 2003 – First MapReduce library written at Google • Oct 2003 – Google File System paper published • Dec 2004 – Google MapReduce paper published • Jul 2005 – Doug Cutti

+ tomwhitetomwhite, 2 years ago

custom

3959 views, 11 favs, 2 embeds more stats

Slides of my talk on Hadoop at ApacheCon EU 2008. S more

More Info

© All Rights Reserved

Go to text version
  • Total Views 3959
    • 3805 on SlideShare
    • 154 from embeds
  • Comments 2
  • Favorites 11
  • Downloads 212
Most viewed embeds
  • 153 views on http://www.lexemetech.com
  • 1 views on http://ydn.corp.yahoo.com:8080

more

All embeds
  • 153 views on http://www.lexemetech.com
  • 1 views on http://ydn.corp.yahoo.com:8080

less

Flagged as inappropriate Flag as inappropriate
Flag as innappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

Categories