• Email
  • Like
  • Save
  • Private Content
  • Embed
 

Chicago Data Summit: Keynote - Data Processing with Hadoop: Scalable and Cost Effective

by on Apr 26, 2011

  • 2,776 views

Hadoop is a new paradigm for data processing that scales near linearly to petabytes of data. Commodity hardware running open source software provides unprecedented cost effectiveness. It is ...

Hadoop is a new paradigm for data processing that scales near linearly to petabytes of data. Commodity hardware running open source software provides unprecedented cost effectiveness. It is affordable to save large, raw datasets, unfiltered, in Hadoop's file system. Together with Hadoop's computational power, this facilitates operations such as ad hoc analysis and retroactive schema changes. An extensive open source tool-set is being built around these capabilities, making it easy to integrate Hadoop into many new application areas.

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Adobe PDF

Usage Rights

© All Rights Reserved

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

5 Embeds 1,374

http://www.cloudera.com 1309
http://lanyrd.com 60
http://blog.cloudera.com 3
http://static.slidesharecdn.com 1
http://test.cloudera.com 1

Statistics

Likes
5
Downloads
92
Comments
0
Embed Views
1,374
Views on SlideShare
1,402
Total Views
2,776
Post Comment
Edit your comment

Chicago Data Summit: Keynote - Data Processing with Hadoop: Scalable and Cost Effective Chicago Data Summit: Keynote - Data Processing with Hadoop: Scalable and Cost Effective Presentation Transcript