Apache Con Eu2008 Hadoop Tour Tom White

  • 8,122 views
Uploaded on

Slides of my talk on Hadoop at ApacheCon EU 2008. See my blog at http://www.lexemetech.com/2008/04/hadoop-at-apachecon-europe.html

Slides of my talk on Hadoop at ApacheCon EU 2008. See my blog at http://www.lexemetech.com/2008/04/hadoop-at-apachecon-europe.html

More in: Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
8,122
On Slideshare
0
From Embeds
0
Number of Embeds
3

Actions

Shares
Downloads
417
Comments
0
Likes
13

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. A Tour of Apache Hadoop Tom White Lexeme Ltd www.lexemetech.com tomwhite@apache.org
  • 2. Itinerary • What is Hadoop? • Components – Distributed File System – MapReduce – HBase • Related Projects
  • 3. What is Hadoop?
  • 4. The Problem • Existing tools are struggling to process today's large datasets • How long to grep 1TB of log files? • Why is this a problem for me?
  • 5. How Does Hadoop Help? • Hadoop provides a framework for storing and processing petabytes of data. • Storage: HDFS, HBase • Processing: MapReduce