Hug India Jul10 Hadoop Map

Hadoop Ecosystem Map

  • How it started: huge volumes of data on the web. Nutch was built to crawl this web data.
  • That data had to be saved, so HDFS was born.
  • How to use this data: the MapReduce framework was built for coding and running analytics, in Java or in any language through Streaming and Pipes.
  • How to get unstructured data in (web logs, click streams, Apache logs, server logs): FUSE, WebDAV, Chukwa, Flume, Scribe.
  • RDBMS, the next data source: hiho and Sqoop for loading data into HDFS.
  • High-level interfaces are required over low-level MapReduce programming: Pig, Hive, Jaql.
  • Hive offers a SQL-like interface. BI tools with advanced UI reporting (drill-down etc.): Intellicus.
  • Workflow tools over MapReduce processes and high-level languages.
  • Monitor and manage Hadoop, run jobs/Hive, view HDFS at a high level: Hue, Karmasphere, Eclipse plugin.
  • Support frameworks: Avro (serialization), ZooKeeper (coordination).
  • More high-level interfaces/uses: Mahout, Elastic MapReduce.
  • OLTP is also possible: HBase.
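
The "any language" point about Streaming in the notes above can be illustrated with a small word count. This is a minimal sketch, not taken from the slides: it assumes the usual Streaming convention of tab-separated key/value lines, and the names `map_lines`, `reduce_lines` and `run_local` are hypothetical helpers chosen for illustration. The local sort in `run_local` only stands in for the shuffle that Hadoop would perform between the two steps.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper step: emit one tab-separated "word<TAB>1" record per word."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reduce_lines(sorted_lines):
    """Reducer step: sum the counts for each word.

    Assumes the input is already sorted by key, which is how a
    streaming reducer receives its records.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def run_local(lines):
    """Hypothetical local dry run: map, sort (stand-in for the shuffle), reduce."""
    return list(reduce_lines(sorted(map_lines(lines))))
```

On a cluster, the two steps would typically be wrapped as scripts that read standard input and write standard output, and handed to Hadoop Streaming through its `-mapper` and `-reducer` options; `run_local` is only a convenience for trying the logic without a cluster.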

Transcript

  • 1. HUG - July 2010: Hadoop Today! Presenter: Sanjay Sharma
  • 2. Hadoop Ecosystem Map (diagram labels): Java Applications; Web Data; RDBMS; Scribe; OLTP; Structured Data; hiho; Sqoop; File system; Engine + Logic; Unstructured Data; High Level Interfaces; Workflow; Cascading; Monitor/manage Hadoop ecosystem; Support; Cascading; More High Level Interfaces
  • 3. Q & A at the end of the sessions. Thank you.
    • http://code.google.com/p/hadoop-toolkit/
    • Hadoop performance monitoring tool
    • Hadoop cluster setup tool