Your SlideShare is downloading. ×
0
Hug India Jul10 Hadoop Map
Hug India Jul10 Hadoop Map
Hug India Jul10 Hadoop Map
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Hug India Jul10 Hadoop Map

3,339

Published on

Hadoop Ecosystem Map

Hadoop Ecosystem Map

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
3,339
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
10
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide
  • How did it start- huge data on the web Nutch built to crawl this web data Huge data had to saved- HDFS was born How to use this data Map reduce framework built for coding and running analytics – java, any language-streaming, pipes How to get in unstructured data – Web logs, Click streams, Apache logs, Server logs- fuse,webdav, chukwa, flume, Scribe RDBMS – the next data sources Hiho and sqoop for loading data into HDFS High level interfaces required over low level map reduce programming– Pig, Hive, Jaql Hive -SQL like interface- BI tools with advanced UI reporting- drilldown etc- Intellicus Workflow tools over Map reduce processes and High level languages Monitor and manage hadoop, run jobs/hive, view HDFS – high level view- Hue, karmasphere, eclipse plugin Support frameworks- Avro (Serialization), Zookeeper (Coordination) More High level interfaces/uses- Mahout, Elastic map Reduce OLTP- also possible - Hbase
  • Transcript

    • 1. HUG - July 2010 Hadoop Today! Presenter- Sanjay Sharma
    • 2. Hadoop Ecosystem Map Java Applications Web Data RDBMS Scribe OLTP Structured Data hiho Sqoop File system Engine + Logic Unstructured Data High Level Interfaces Workflow Cascading Monitor/manage Hadoop ecosystem Support Cascading More High Level Interfaces
    • 3. Q & A At the end of sessions Thank You <ul><li>http://code.google.com/p/hadoop-toolkit/ </li></ul><ul><li>Hadoop performance monitoring tool </li></ul><ul><li>Hadoop cluster setup tool </li></ul>

    ×