SlideShare a Scribd company logo
Under the guidance of,
Sir Sourabh Bhattacharya.
Presented by:
Akash Das
Akansha Kumari
Aishik Das
Adriza Bera
WHAT ACTUALLY BIG DATA IS…?
Now, on our day to day lives, we do many things after we wake up in the morning, like
As per the records, it is one of the major things highly done by people nowaday
Checking out the social media
Reading NEWS
People with a working background do stay updated with news everyday
w, have you ever thought of the suggestions you get, while you’re trying to google somethi
Or when you visit a website, these are kind of hoardings which we see on buildings right?
Yeah, it’s the same thing, but in an electronic format, and much advanced manner.
Whatever you’ve been searching quite frequently, would be appearing to you?
Ever noticed that?
So what do we get in common?
This thing right in our hand, does much more things we can just think of, just under our noses.
Yeah you guessed it quite right, anything you’re surfing on the internet is not just your property, its being collected ever
So if you think your data is being collected, then there are over 100 crore people in India itself.
HOW IS THIS HAPPENING…???
How and where do we store so much of data?
• That’s what Big Data is, and managing the Big Data is a very tough task.
• For storing the data we need storage devices isn’t it?
• How about a storage device to store 100 crore people’s data?
• Imagine the size of Hard Drive you would be requiring!!!
The large infrastructure, looks more like a factory, isn’t it?
But it isn’t, it’s the Facebook’s data centre in Denmark.
They store the large Terabytes and Petabytes of people’s data he
Now, what does a data centre contain?
In movies, you might’ve seen, these kind of shelves with blinking lights,
They are actually shelves with high storage capacity drives,
Just like our hard drives, they are used to store the peoples data
from all around the world.
In a highly facilitated place with temperatures to be maintained, so that
they don’t get fused with heat
Now, we’re well aware of what a data centre is, but how to manage such enormous amount of data,
So, considering the numerous amount of festivals in a country like India, you might be observing the change in
your feed showing you the best deals and other things related to the festival.
Analysing the data
• How is the data being analysed?
• A very big question in every person’s head.
• Well, there are many people who are trying to build tools with which we can analyse the enormous data.
• Lets have a look at some reknowned applications:
Now talking about the first application,
Hadoop: It contains
• Basically enormous amount of storage is required to store enormous amount of data.
• So, in here we’ve a master-slave design,
• Master or NameNode: It keeps the track of the different types of changes made where each component is stor
• Slave or DataNode: It stores the data sent by the master and sends back the data when required, as a slave do
• From the above NameNode or DataNode, all are basically computers only, which are sharing their hard drive
for distributed storage.
• We also know that there’s a master for the slave, who keeps a track of the activities going on, inside the
slaves, how much space is left, and other stats.
• Now, basically what HDFS master or NameNode does is distributing a certain file in equal sizes and storing
each in different DataNodes or slaves.
• Means it’ll distribute the data into certain parts of equal sizes and store into the DataNodes, hence the name
distributed storage.
• Now, what if the master dies or NameNode stops working, the data would be lost as the data entry registry is
maintained by it, hence, we could suffer a data loss, so to overcome such conditions we’ve a secondary
master or NameNode, which comes into action when the primary master dies.
• Therefore, during making any changes into the filesystem, the changes are noted down and the image of the
updated registry is passed on to the secondary one.
• Now, MapReduce helps us in computing the data we got in HDFS, the data might be unsorted or we need to
sort in a particular way.
• Here also, we’ve two parts such as:
• Mapper: The mapper processes the data and creates several small chunks of data.
• Reducer: The Reducer’s job is to process the data that comes from the mapper. After processing, it produces a new
set of output, which will be stored in the HDFS.
• Mapper − Mapper maps the input key/value pairs to a set of intermediate key/value pair.
• JobTracker − Schedules jobs and tracks the assign jobs to Task tracker.
• Task Tracker − Tracks the task and reports status to JobTracker.
• During a MapReduce job, Hadoop sends the Map and Reduce tasks to the appropriate servers in the
• Most of the computing takes place on nodes with data on local disks that reduces the network traffic.
• After completion of the given tasks, the cluster collects and reduces the data to form an appropriate result,
and sends it back to the Hadoop server.
So till now, we’ve been how the Big Data works and how it is collected and analyzed, but how do Big Data
help us in
real life.
1. Help us predict weather
forecasts.
2. Help us prevent cyberattacks.
3. Help us predict diseases.
THANK YOU
For your valuable time
Once we know something, we find it hard to imagine what it was like not
to know it.

More Related Content

Similar to Big data

Big data explanation with real time use case
 Big data explanation with real time use case Big data explanation with real time use case
Big data explanation with real time use case
N.Jagadish Kumar
 
Big data for the rest of us with hadoop
Big data for the rest of us with hadoopBig data for the rest of us with hadoop
Big data for the rest of us with hadoop
Dhaval Anjaria
 
Big data & hadoop Introduction
Big data & hadoop IntroductionBig data & hadoop Introduction
Big data & hadoop Introduction
Mohit Shukla
 
Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
Dr.K.Sreenivas Rao
 
bigdata.pptx
bigdata.pptxbigdata.pptx
bigdata.pptx
VIJAYAPRABAP
 
bigdata.pdf
bigdata.pdfbigdata.pdf
bigdata.pdf
AnjaliKumari301316
 
Big data technology
Big data technology Big data technology
Big data technology
omer mohamed abd alrhman
 
Hadoop for the Absolute Beginner
Hadoop for the Absolute BeginnerHadoop for the Absolute Beginner
Hadoop for the Absolute Beginner
Ike Ellis
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
ch adnan
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
MaulikLakhani
 
Session 10 handling bigger data
Session 10 handling bigger dataSession 10 handling bigger data
Session 10 handling bigger data
bodaceacat
 
Session 10 handling bigger data
Session 10 handling bigger dataSession 10 handling bigger data
Session 10 handling bigger data
Sara-Jayne Terp
 
Big data-denis-rothman
Big data-denis-rothmanBig data-denis-rothman
Big data-denis-rothman
Denis Rothman
 
Big Data
Big DataBig Data
Big Data
Priyanka Tuteja
 
Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP
vinoth kumar
 
Video 1 big data
Video 1 big dataVideo 1 big data
Video 1 big data
BrushUpYourSkills
 
Big Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with AzureBig Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with Azure
Christos Charmatzis
 
5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer
Caserta
 
A gentle introduction to the world of BigData and Hadoop
A gentle introduction to the world of BigData and HadoopA gentle introduction to the world of BigData and Hadoop
A gentle introduction to the world of BigData and Hadoop
Stefano Paluello
 
What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...
What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...
What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...
Simplilearn
 

Similar to Big data (20)

Big data explanation with real time use case
 Big data explanation with real time use case Big data explanation with real time use case
Big data explanation with real time use case
 
Big data for the rest of us with hadoop
Big data for the rest of us with hadoopBig data for the rest of us with hadoop
Big data for the rest of us with hadoop
 
Big data & hadoop Introduction
Big data & hadoop IntroductionBig data & hadoop Introduction
Big data & hadoop Introduction
 
Data analytics & its Trends
Data analytics & its TrendsData analytics & its Trends
Data analytics & its Trends
 
bigdata.pptx
bigdata.pptxbigdata.pptx
bigdata.pptx
 
bigdata.pdf
bigdata.pdfbigdata.pdf
bigdata.pdf
 
Big data technology
Big data technology Big data technology
Big data technology
 
Hadoop for the Absolute Beginner
Hadoop for the Absolute BeginnerHadoop for the Absolute Beginner
Hadoop for the Absolute Beginner
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
 
Session 10 handling bigger data
Session 10 handling bigger dataSession 10 handling bigger data
Session 10 handling bigger data
 
Session 10 handling bigger data
Session 10 handling bigger dataSession 10 handling bigger data
Session 10 handling bigger data
 
Big data-denis-rothman
Big data-denis-rothmanBig data-denis-rothman
Big data-denis-rothman
 
Big Data
Big DataBig Data
Big Data
 
Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP Introduction to Bigdata and HADOOP
Introduction to Bigdata and HADOOP
 
Video 1 big data
Video 1 big dataVideo 1 big data
Video 1 big data
 
Big Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with AzureBig Data Analytics: Finding diamonds in the rough with Azure
Big Data Analytics: Finding diamonds in the rough with Azure
 
5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer5 Things that Make Hadoop a Game Changer
5 Things that Make Hadoop a Game Changer
 
A gentle introduction to the world of BigData and Hadoop
A gentle introduction to the world of BigData and HadoopA gentle introduction to the world of BigData and Hadoop
A gentle introduction to the world of BigData and Hadoop
 
What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...
What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...
What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop...
 

Recently uploaded

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
nscud
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
ArpitMalhotra16
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
ewymefz
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
ewymefz
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
2023240532
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
ukgaet
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
Opendatabay
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
MaleehaSheikh2
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
enxupq
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
g4dpvqap0
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
yhkoc
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Linda486226
 

Recently uploaded (20)

Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Adjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTESAdjusting primitives for graph : SHORT REPORT / NOTES
Adjusting primitives for graph : SHORT REPORT / NOTES
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
standardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghhstandardisation of garbhpala offhgfffghh
standardisation of garbhpala offhgfffghh
 
一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单一比一原版(BU毕业证)波士顿大学毕业证成绩单
一比一原版(BU毕业证)波士顿大学毕业证成绩单
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
Quantitative Data AnalysisReliability Analysis (Cronbach Alpha) Common Method...
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
FP Growth Algorithm and its Applications
FP Growth Algorithm and its ApplicationsFP Growth Algorithm and its Applications
FP Growth Algorithm and its Applications
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单一比一原版(QU毕业证)皇后大学毕业证成绩单
一比一原版(QU毕业证)皇后大学毕业证成绩单
 
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
一比一原版(爱大毕业证书)爱丁堡大学毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
一比一原版(CU毕业证)卡尔顿大学毕业证成绩单
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdfSample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
Sample_Global Non-invasive Prenatal Testing (NIPT) Market, 2019-2030.pdf
 

Big data

  • 1. Under the guidance of, Sir Sourabh Bhattacharya. Presented by: Akash Das Akansha Kumari Aishik Das Adriza Bera
  • 2. WHAT ACTUALLY BIG DATA IS…? Now, on our day to day lives, we do many things after we wake up in the morning, like As per the records, it is one of the major things highly done by people nowaday Checking out the social media Reading NEWS People with a working background do stay updated with news everyday
  • 3. w, have you ever thought of the suggestions you get, while you’re trying to google somethi
  • 4. Or when you visit a website, these are kind of hoardings which we see on buildings right? Yeah, it’s the same thing, but in an electronic format, and much advanced manner. Whatever you’ve been searching quite frequently, would be appearing to you? Ever noticed that?
  • 5. So what do we get in common? This thing right in our hand, does much more things we can just think of, just under our noses.
  • 6. Yeah you guessed it quite right, anything you’re surfing on the internet is not just your property, its being collected ever So if you think your data is being collected, then there are over 100 crore people in India itself. HOW IS THIS HAPPENING…??? How and where do we store so much of data? • That’s what Big Data is, and managing the Big Data is a very tough task. • For storing the data we need storage devices isn’t it? • How about a storage device to store 100 crore people’s data? • Imagine the size of Hard Drive you would be requiring!!!
  • 7. The large infrastructure, looks more like a factory, isn’t it? But it isn’t, it’s the Facebook’s data centre in Denmark. They store the large Terabytes and Petabytes of people’s data he Now, what does a data centre contain? In movies, you might’ve seen, these kind of shelves with blinking lights, They are actually shelves with high storage capacity drives, Just like our hard drives, they are used to store the peoples data from all around the world. In a highly facilitated place with temperatures to be maintained, so that they don’t get fused with heat
  • 8. Now, we’re well aware of what a data centre is, but how to manage such enormous amount of data, So, considering the numerous amount of festivals in a country like India, you might be observing the change in your feed showing you the best deals and other things related to the festival. Analysing the data • How is the data being analysed? • A very big question in every person’s head. • Well, there are many people who are trying to build tools with which we can analyse the enormous data. • Lets have a look at some reknowned applications:
  • 9. Now talking about the first application, Hadoop: It contains
  • 10. • Basically enormous amount of storage is required to store enormous amount of data. • So, in here we’ve a master-slave design, • Master or NameNode: It keeps the track of the different types of changes made where each component is stor • Slave or DataNode: It stores the data sent by the master and sends back the data when required, as a slave do
  • 11. • From the above NameNode or DataNode, all are basically computers only, which are sharing their hard drive for distributed storage. • We also know that there’s a master for the slave, who keeps a track of the activities going on, inside the slaves, how much space is left, and other stats. • Now, basically what HDFS master or NameNode does is distributing a certain file in equal sizes and storing each in different DataNodes or slaves. • Means it’ll distribute the data into certain parts of equal sizes and store into the DataNodes, hence the name distributed storage. • Now, what if the master dies or NameNode stops working, the data would be lost as the data entry registry is maintained by it, hence, we could suffer a data loss, so to overcome such conditions we’ve a secondary master or NameNode, which comes into action when the primary master dies. • Therefore, during making any changes into the filesystem, the changes are noted down and the image of the updated registry is passed on to the secondary one.
  • 12. • Now, MapReduce helps us in computing the data we got in HDFS, the data might be unsorted or we need to sort in a particular way. • Here also, we’ve two parts such as: • Mapper: The mapper processes the data and creates several small chunks of data. • Reducer: The Reducer’s job is to process the data that comes from the mapper. After processing, it produces a new set of output, which will be stored in the HDFS.
  • 13. • Mapper − Mapper maps the input key/value pairs to a set of intermediate key/value pair. • JobTracker − Schedules jobs and tracks the assign jobs to Task tracker. • Task Tracker − Tracks the task and reports status to JobTracker. • During a MapReduce job, Hadoop sends the Map and Reduce tasks to the appropriate servers in the • Most of the computing takes place on nodes with data on local disks that reduces the network traffic. • After completion of the given tasks, the cluster collects and reduces the data to form an appropriate result, and sends it back to the Hadoop server.
  • 14. So till now, we’ve been how the Big Data works and how it is collected and analyzed, but how do Big Data help us in real life. 1. Help us predict weather forecasts. 2. Help us prevent cyberattacks. 3. Help us predict diseases.
  • 15. THANK YOU For your valuable time Once we know something, we find it hard to imagine what it was like not to know it.