Your SlideShare is downloading. ×
  • Like
Apache Con Eu2008 Hadoop Tour Tom White
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×

Now you can save presentations on your phone or tablet

Available for both IPhone and Android

Text the download link to your phone

Standard text messaging rates apply

Apache Con Eu2008 Hadoop Tour Tom White

  • 8,142 views
Published

Slides of my talk on Hadoop at ApacheCon EU 2008. See my blog at http://www.lexemetech.com/2008/04/hadoop-at-apachecon-europe.html

Slides of my talk on Hadoop at ApacheCon EU 2008. See my blog at http://www.lexemetech.com/2008/04/hadoop-at-apachecon-europe.html

Published in Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
8,142
On SlideShare
0
From Embeds
0
Number of Embeds
3

Actions

Shares
Downloads
417
Comments
0
Likes
13

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. A Tour of Apache Hadoop Tom White Lexeme Ltd www.lexemetech.com tomwhite@apache.org
  • 2. Itinerary • What is Hadoop? • Components – Distributed File System – MapReduce – HBase • Related Projects
  • 3. What is Hadoop?
  • 4. The Problem • Existing tools are struggling to process today's large datasets • How long to grep 1TB of log files? • Why is this a problem for me?
  • 5. How Does Hadoop Help? • Hadoop provides a framework for storing and processing petabytes of data. • Storage: HDFS, HBase • Processing: MapReduce