Chicago Data Summit: Keynote - Data Processing with Hadoop: Scalable and Cost Effective
by Cloudera, Inc. on Apr 26, 2011
- 2,776 views
Hadoop is a new paradigm for data processing that scales near linearly to petabytes of data. Commodity hardware running open source software provides unprecedented cost effectiveness. It is ...
Hadoop is a new paradigm for data processing that scales near linearly to petabytes of data. Commodity hardware running open source software provides unprecedented cost effectiveness. It is affordable to save large, raw datasets, unfiltered, in Hadoop's file system. Together with Hadoop's computational power, this facilitates operations such as ad hoc analysis and retroactive schema changes. An extensive open source tool-set is being built around these capabilities, making it easy to integrate Hadoop into many new application areas.
Accessibility
Categories
Upload Details
Uploaded via SlideShare as Adobe PDF
Usage Rights
© All Rights Reserved
Statistics
- Likes
- 5
- Downloads
- 92
- Comments
- 0
- Embed Views
- Views on SlideShare
- 1,402
- Total Views
- 2,776