6. Big Data
Big Data is an umbrella term for
a collection of technologies and tools
that are combined differently depending on the scenario, for example:
(Hadoop + HDFS + HCatalog + Flume + Power View)
(Hortonworks + Power View)
7. What can you do with Big Data?
Fetching
Processing
Visualizing
8. How Big is Big Data?
Byte of data: one grain of rice
Kilobyte: cup of rice
Megabyte: 8 bags of rice
Gigabyte: 3 container lorries
Terabyte: 2 container ships
Petabyte: covers Mumbai
Exabyte: covers India
Zettabyte: fills Indian Ocean
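The rice picture is only a loose analogy. The units themselves follow a simple rule: each step up is a factor of 1000. A minimal sketch of that arithmetic, assuming decimal (SI) units rather than the binary 1024-based ones; the names `UNITS` and `bytes_in` are made up for this illustration:

```python
# Decimal (SI) storage units, smallest to largest.
# Assumption: powers of 1000, not the binary 1024-based KiB/MiB/... units.
UNITS = ["byte", "kilobyte", "megabyte", "gigabyte",
         "terabyte", "petabyte", "exabyte", "zettabyte"]


def bytes_in(unit: str) -> int:
    """Return how many bytes make up one `unit`."""
    return 1000 ** UNITS.index(unit)


if __name__ == "__main__":
    # Print each unit with its size as a power of ten.
    for position, unit in enumerate(UNITS):
        print(f"1 {unit:<9} = 10^{3 * position} bytes")
```

So one zettabyte is a thousand exabytes, a million petabytes, and so on down the list.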
11. MapReduce
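•MapReduce is a processing technique and a programming
model for distributed computing. In Hadoop it is implemented in Java.
•A MapReduce job consists of two main tasks,
namely Map and Reduce: Map turns input records into key/value pairs,
and Reduce combines the values that share a key.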
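To make the Map and Reduce tasks concrete, here is a small word-count sketch. It runs on a single machine in plain Python and does not use the Hadoop Java API. The function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are chosen for this illustration only, and the `shuffle` step stands in for the grouping that a real framework would do between the two tasks.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key (done by the framework in a real cluster)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values collected for one key."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data is big", "data is everywhere"]
    print(word_count(sample))
```

In a distributed setting, each call to `map_phase` could run on a different node holding a different slice of the input, and each key's `reduce_phase` could run independently, which is what lets the model scale.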