Hadoop Ecosystem: From Big Data to Big Results
Name: Tazeen Gulrez Sayed
Class: TE - A
Roll number: 62
Subject: Data Science and Big Data Analytics
INDEX
1. Introduction to Hadoop Ecosystem
2. HDFS - Hadoop Distributed File System
3. YARN - Yet Another Resource Negotiator
4. MapReduce
5. Other Components of Hadoop Ecosystem
6. Conclusion
Introduction to Hadoop Ecosystem
Hadoop is an open-source software framework for distributed storage and processing of big data. It was created by Doug Cutting and Mike Cafarella in 2005, and it has since become one of the most popular big data processing platforms in the world.
The Hadoop ecosystem consists of several components, including HDFS (Hadoop Distributed File System), YARN (Yet Another Resource Negotiator), and MapReduce. These components work together to provide a scalable, fault-tolerant platform for processing large amounts of data.
HDFS - Hadoop Distributed File System
HDFS is the storage layer of Hadoop. It splits each file into large blocks and spreads them across the machines in a cluster. A NameNode keeps track of where every block lives, while DataNodes hold the block data itself.
Each block is replicated on several DataNodes (three copies by default), so the data stays available when a single machine fails. This design lets HDFS store very large files on commodity hardware and underpins the scalable, fault-tolerant processing that the rest of the ecosystem provides.
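As a rough illustration of block-based, replicated storage in HDFS, the sketch below splits a file into blocks and assigns each block to several DataNodes. It is a toy model, not HDFS code: the function names, the DataNode names, and the round-robin placement are assumptions made for the example (real HDFS placement is rack-aware and decided by the NameNode). The 128 MB block size and replication factor of 3 are the commonly used defaults.

```python
# Toy model only: real HDFS placement is rack-aware and handled by the NameNode.
BLOCK_SIZE = 128 * 1024 * 1024  # 128 MB, the usual default block size
REPLICATION = 3                 # the usual default replication factor


def split_into_blocks(file_size, block_size=BLOCK_SIZE):
    """Return the size in bytes of each block a file of file_size would occupy."""
    full, rest = divmod(file_size, block_size)
    return [block_size] * full + ([rest] if rest else [])


def place_replicas(num_blocks, datanodes, replication=REPLICATION):
    """Assign each block to `replication` distinct DataNodes, round-robin."""
    if replication > len(datanodes):
        raise ValueError("need at least as many DataNodes as replicas")
    placement = {}
    for block_id in range(num_blocks):
        placement[block_id] = [
            datanodes[(block_id + i) % len(datanodes)] for i in range(replication)
        ]
    return placement


# A hypothetical 300 MB file on a hypothetical four-node cluster.
blocks = split_into_blocks(300 * 1024 * 1024)
layout = place_replicas(len(blocks), ["dn1", "dn2", "dn3", "dn4"])
```

Because every block lives on three different nodes in this model, losing any one node still leaves two copies of each block it held.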
YARN - Yet Another Resource Negotiator
YARN is the resource management layer of Hadoop. It is responsible for managing the compute resources of a Hadoop cluster, chiefly memory and CPU (virtual cores), and for scheduling applications onto them. YARN allows multiple applications to run on the same cluster without interfering with each other.
YARN also enables dynamic allocation of resources, allowing applications to request additional resources as needed. This makes it possible to run complex big data applications that require significant amounts of resources.
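The idea of several applications sharing one cluster and asking for resources as they go can be sketched with a toy scheduler. This is not the YARN API: `Node`, `ToyResourceManager`, the first-fit policy, and the job names are all hypothetical, chosen only to show the request-and-grant pattern.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    free_mem_mb: int
    free_vcores: int


class ToyResourceManager:
    """Hypothetical stand-in for a cluster scheduler; not the real YARN API."""

    def __init__(self, nodes):
        self.nodes = list(nodes)
        self.containers = []  # (app, node name, mem_mb, vcores)

    def request(self, app, mem_mb, vcores):
        """Grant a container on the first node with enough free capacity."""
        for node in self.nodes:
            if node.free_mem_mb >= mem_mb and node.free_vcores >= vcores:
                node.free_mem_mb -= mem_mb
                node.free_vcores -= vcores
                self.containers.append((app, node.name, mem_mb, vcores))
                return node.name
        return None  # no capacity right now; a real scheduler would queue the request


rm = ToyResourceManager([Node("node1", 8192, 4), Node("node2", 8192, 4)])
first = rm.request("etl-job", 6144, 2)      # fits on node1
second = rm.request("report-job", 4096, 2)  # node1 has too little memory left, so node2
third = rm.request("etl-job", 2048, 2)      # etl-job asks for more later; node1 still fits
fourth = rm.request("big-job", 16384, 1)    # larger than any single node, so not granted
```

Each application only ever receives capacity that the scheduler has set aside for it, which is what keeps jobs on a shared cluster from interfering with one another.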
MapReduce
MapReduce is a programming model used for processing large datasets in parallel. It works by breaking down a large dataset into smaller chunks, which are then processed in parallel across multiple nodes in a cluster. MapReduce consists of two main functions: map and reduce.
The map function takes input data and converts it into key-value pairs. The framework then groups these pairs by key, and the reduce function combines the values for each key into a smaller set of key-value pairs. MapReduce is highly scalable and fault-tolerant, making it well suited to batch processing of large amounts of data.
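The classic word-count example shows the two functions at work. The sketch below runs in a single Python process and only imitates the model; in Hadoop the map and reduce calls would run on different nodes and the grouping step would be done by the framework. The function names are chosen for this example and are not part of any Hadoop API.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single pair."""
    return key, sum(values)


def word_count(lines):
    """Run map on every line, group by word, then reduce each group."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(mapped).items())


counts = word_count(["big data big results", "data to results"])
```

Since each line is mapped independently and each word is reduced independently, both phases can be spread across as many machines as the input requires.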
Other Components of Hadoop Ecosystem
In addition to HDFS, YARN, and MapReduce, the Hadoop ecosystem includes several other components that provide additional functionality. These include Hive, Pig, HBase, and Spark.
Hive is a data warehouse system that provides SQL-like querying capabilities for Hadoop. Pig is a high-level platform for creating MapReduce programs using a scripting language called Pig Latin. HBase is a NoSQL database that provides real-time read and write access to data stored in Hadoop. Spark is a fast, in-memory data processing engine that can be used with Hadoop to perform near-real-time analytics.
CONCLUSION
The Hadoop ecosystem is a powerful platform for processing large amounts of data. With its distributed architecture, fault tolerance, and scalability, Hadoop has become a widely used solution for big data processing.
By understanding the various components of the Hadoop ecosystem, businesses and organizations can take advantage of its capabilities to gain insights and make informed decisions based on their data.