2. What is BigData ?
What is Hadoop ?
How Hadoop helps to achieve Bigdata
problem ?
3. Big data is a broad term for data sets so large
or complex that traditional data
processing applications are inadequate.
Challenges include analysis,
capture, curation, search, sharing, storage,
transfer, visualization, and information
privacy.
** credits : www.wikipedia.com**
4.
5.
6.
7.
8. How to store huge data sets ?
Processing Speed ?
Cost of processing frame work ?
Cost of Infrastructure ?
Need of Simple Abstract modules ?
Ease of development.
Need of an unified Framework Hadoop
9.
10. • Hadoop:
• An open-source software framework that supports data-
intensive distributed applications, licensed under the Apache
v2 license.
• Goals / Requirements:
• Abstract and facilitate the storage and processing of large
and/or rapidly growing data sets
• Structured and non-structured data
• Simple programming models
• High scalability and availability
• Use commodity (cheap!) hardware with little redundancy
• Fault-tolerance
• Move computation rather than data
17. bench
S S S S S S
bench
S S S S S S
bench
S S S S S S
bench
S S S S S S
bench
S S S S S S
Class Room Cluster
Rack
DNNN DN DN DN DN
Rack
DNDN DN DN DN DN
Rack
DNDN DN DN DN DN
Rack
DNDN DN DN DN DN
Rack
DNDN DN DN DN DN
Switch
Student
Leader/Master