SlideShare a Scribd company logo
1 of 9
To understand the major components of Hadoop
1.0 architecture
∙
∙ Name node
∙ Secondary name node
∙ Job Tracker
∙ Task tracker
∙ Data nodes
Agenda
Some key terms used while discussing Hadoop
●Commodity hardware: PCs which can be used to make a cluster
●Cluster/grid: Interconnection of systems in a network
●Node: A single instance of a computer
●Distributed System: A system composed of multiple autonomous
computers that communicate through a computer network
●ASF: Apache Software Foundation
●HA: High Availability
●Hot standby : Uninterrupted failover
Key Terms
Master-Slave Architecture
3
Machines in MASTER
MODE
Name
ode
JOB
tracker
Secondary name
node
Data nodes in slave
mode
HADOOP supports 3 configuration modes when its is implemented
on commodity hardware:
Standalone mode : All services run locally on single machine on
a single JVM (seldom used)
Pseudo distributed mode : All services run on the same machine
but on a different JVM (development and testing purpose)
Fully distributed mode: Each service runs on a seperate
hardware (a dedicated server). Used in production setup.
Note: Service here reffers to namenode, secondary name node, job
tracker and data node.
HADOOP Deployment Modes
4
∙ Master nodes
∙ Name node : Central file system manager
–Secondary name node : Data backup of name node (not hot standby)
–Job tracker : Centralized job scheduler
–
∙ Slave nodes and deamons/software services
–Data node : Machine where files gets stored and processed
–Task tracker : A software service which monitors the state of job tracker
–
–
–Note : Every slave node keeps sending a heart beat signal to the name node once in
every 3 seconds to state that its alive. What happens when a data node goes down
would be discussed in the subsequent slides.
–
Functionality of Each Component of HADOOP
5
What is a job in HADOOP eco system?
� A job usually is some task submitted by the user to the
Hadoop cluster.
� The job is in the form of a program or collection of
programs (a JAR file) which needs to be executed.
� A job would have the following attributes to it
1)The actual program/s
2)Input data to the program (a file or a
collection of files in a directory)
3)The output directory where the results
of execution is collected in a file/s
6
Core features of Apache Hadoop are:
∙ HDFS (Hadoop Distributed File System) – data storage
∙ MapReduce Framework – compute in distributed environment
–A Java framework responsible for processing jobs in distributed mode
–User-defined map phase, which is a parallel, share-nothing processing of
input
–User-defined reduce phase aggregates of the output of the map phase
Apache HADOOP Core Features
7
Submitting and executing a job in a hadoop cluster
8
Data nodes
The engineer/analyst's machine is not a part of Hadoop
cluster. Usually Hadoop would be installed in pseudo-
distributed mode on his/her machine. The job ( program/s )
would be submitted to the gateway machine
The gateway machine would have the necessary configuration
to communicate to the name node and job tracker.
Job gets submitted to the name node and eventually job
tracker is responsible for scheduling the execution of the job on
the data nodes in the cluster.
Gateway Name node Job tracker
Engineer/analyst
9
Summary
An overview of the core components of HADOOP
1.0
Basic understanding of the components
namenode, secondary name node, job tracker,
data node and task tracker
What is a job in a HADOOP eco system ?
How to submit a job in a HADOOP cluster ?
9
9

More Related Content

Similar to 2 Hadoop 1.x presentation in understading .pptx

Hadoop architecture-tutorial
Hadoop  architecture-tutorialHadoop  architecture-tutorial
Hadoop architecture-tutorialvinayiqbusiness
 
Big data processing using hadoop poster presentation
Big data processing using hadoop poster presentationBig data processing using hadoop poster presentation
Big data processing using hadoop poster presentationAmrut Patil
 
Introduction to Hadoop part1
Introduction to Hadoop part1Introduction to Hadoop part1
Introduction to Hadoop part1Giovanna Roda
 
Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component rebeccatho
 
Introduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopIntroduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopGERARDO BARBERENA
 
Hadoop architecture-tutorial
Hadoop  architecture-tutorialHadoop  architecture-tutorial
Hadoop architecture-tutorialvinayiqbusiness
 
Distributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptxDistributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptxUttara University
 
July 2010 Triangle Hadoop Users Group - Chad Vawter Slides
July 2010 Triangle Hadoop Users Group - Chad Vawter SlidesJuly 2010 Triangle Hadoop Users Group - Chad Vawter Slides
July 2010 Triangle Hadoop Users Group - Chad Vawter Slidesryancox
 
Report Hadoop Map Reduce
Report Hadoop Map ReduceReport Hadoop Map Reduce
Report Hadoop Map ReduceUrvashi Kataria
 
Intro to Apache Hadoop
Intro to Apache HadoopIntro to Apache Hadoop
Intro to Apache HadoopSufi Nawaz
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introductionChirag Ahuja
 
Learn to setup a Hadoop Multi Node Cluster
Learn to setup a Hadoop Multi Node ClusterLearn to setup a Hadoop Multi Node Cluster
Learn to setup a Hadoop Multi Node ClusterEdureka!
 

Similar to 2 Hadoop 1.x presentation in understading .pptx (20)

Unit 1
Unit 1Unit 1
Unit 1
 
Hadoop architecture-tutorial
Hadoop  architecture-tutorialHadoop  architecture-tutorial
Hadoop architecture-tutorial
 
Big data processing using hadoop poster presentation
Big data processing using hadoop poster presentationBig data processing using hadoop poster presentation
Big data processing using hadoop poster presentation
 
Introduction to Hadoop part1
Introduction to Hadoop part1Introduction to Hadoop part1
Introduction to Hadoop part1
 
BD-zero lecture.pptx
BD-zero lecture.pptxBD-zero lecture.pptx
BD-zero lecture.pptx
 
BD-zero lecture.pptx
BD-zero lecture.pptxBD-zero lecture.pptx
BD-zero lecture.pptx
 
Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component
 
Introduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to HadoopIntroduccion a Hadoop / Introduction to Hadoop
Introduccion a Hadoop / Introduction to Hadoop
 
Hadoop architecture-tutorial
Hadoop  architecture-tutorialHadoop  architecture-tutorial
Hadoop architecture-tutorial
 
Distributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptxDistributed Systems Hadoop.pptx
Distributed Systems Hadoop.pptx
 
Hadoop overview.pdf
Hadoop overview.pdfHadoop overview.pdf
Hadoop overview.pdf
 
July 2010 Triangle Hadoop Users Group - Chad Vawter Slides
July 2010 Triangle Hadoop Users Group - Chad Vawter SlidesJuly 2010 Triangle Hadoop Users Group - Chad Vawter Slides
July 2010 Triangle Hadoop Users Group - Chad Vawter Slides
 
Anju
AnjuAnju
Anju
 
Report Hadoop Map Reduce
Report Hadoop Map ReduceReport Hadoop Map Reduce
Report Hadoop Map Reduce
 
Intro to Apache Hadoop
Intro to Apache HadoopIntro to Apache Hadoop
Intro to Apache Hadoop
 
Hadoop introduction
Hadoop introductionHadoop introduction
Hadoop introduction
 
Learn to setup a Hadoop Multi Node Cluster
Learn to setup a Hadoop Multi Node ClusterLearn to setup a Hadoop Multi Node Cluster
Learn to setup a Hadoop Multi Node Cluster
 
Getting started big data
Getting started big dataGetting started big data
Getting started big data
 
Hadoop seminar
Hadoop seminarHadoop seminar
Hadoop seminar
 
Hadoop description
Hadoop descriptionHadoop description
Hadoop description
 

More from Kishanhari3

how-to-write-a-technical-report-presentatin.ppt
how-to-write-a-technical-report-presentatin.ppthow-to-write-a-technical-report-presentatin.ppt
how-to-write-a-technical-report-presentatin.pptKishanhari3
 
RTA Process PPT-NEXA presentastion .pptx
RTA Process PPT-NEXA presentastion .pptxRTA Process PPT-NEXA presentastion .pptx
RTA Process PPT-NEXA presentastion .pptxKishanhari3
 
MachineLearning_Brick and Mortar Store Layout Design.pptx
MachineLearning_Brick and Mortar Store Layout Design.pptxMachineLearning_Brick and Mortar Store Layout Design.pptx
MachineLearning_Brick and Mortar Store Layout Design.pptxKishanhari3
 
Basic Asanas Part_1-3_Aurabindo Asharam.pptx
Basic Asanas Part_1-3_Aurabindo Asharam.pptxBasic Asanas Part_1-3_Aurabindo Asharam.pptx
Basic Asanas Part_1-3_Aurabindo Asharam.pptxKishanhari3
 
4 Hadoop 2.0 presentaion techaings in ppt.pptx
4 Hadoop 2.0 presentaion techaings in ppt.pptx4 Hadoop 2.0 presentaion techaings in ppt.pptx
4 Hadoop 2.0 presentaion techaings in ppt.pptxKishanhari3
 
20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.ppt
20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.ppt20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.ppt
20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.pptKishanhari3
 
Burj Al Arab presentation in english.pptx
Burj Al Arab presentation in english.pptxBurj Al Arab presentation in english.pptx
Burj Al Arab presentation in english.pptxKishanhari3
 

More from Kishanhari3 (7)

how-to-write-a-technical-report-presentatin.ppt
how-to-write-a-technical-report-presentatin.ppthow-to-write-a-technical-report-presentatin.ppt
how-to-write-a-technical-report-presentatin.ppt
 
RTA Process PPT-NEXA presentastion .pptx
RTA Process PPT-NEXA presentastion .pptxRTA Process PPT-NEXA presentastion .pptx
RTA Process PPT-NEXA presentastion .pptx
 
MachineLearning_Brick and Mortar Store Layout Design.pptx
MachineLearning_Brick and Mortar Store Layout Design.pptxMachineLearning_Brick and Mortar Store Layout Design.pptx
MachineLearning_Brick and Mortar Store Layout Design.pptx
 
Basic Asanas Part_1-3_Aurabindo Asharam.pptx
Basic Asanas Part_1-3_Aurabindo Asharam.pptxBasic Asanas Part_1-3_Aurabindo Asharam.pptx
Basic Asanas Part_1-3_Aurabindo Asharam.pptx
 
4 Hadoop 2.0 presentaion techaings in ppt.pptx
4 Hadoop 2.0 presentaion techaings in ppt.pptx4 Hadoop 2.0 presentaion techaings in ppt.pptx
4 Hadoop 2.0 presentaion techaings in ppt.pptx
 
20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.ppt
20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.ppt20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.ppt
20081121-positive-thinking-sdfdsfds38s-ati-1227445729031902-9.ppt
 
Burj Al Arab presentation in english.pptx
Burj Al Arab presentation in english.pptxBurj Al Arab presentation in english.pptx
Burj Al Arab presentation in english.pptx
 

Recently uploaded

Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 

Recently uploaded (20)

Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 

2 Hadoop 1.x presentation in understading .pptx

  • 1. To understand the major components of Hadoop 1.0 architecture ∙ ∙ Name node ∙ Secondary name node ∙ Job Tracker ∙ Task tracker ∙ Data nodes Agenda
  • 2. Some key terms used while discussing Hadoop ●Commodity hardware: PCs which can be used to make a cluster ●Cluster/grid: Interconnection of systems in a network ●Node: A single instance of a computer ●Distributed System: A system composed of multiple autonomous computers that communicate through a computer network ●ASF: Apache Software Foundation ●HA: High Availability ●Hot standby : Uninterrupted failover Key Terms
  • 3. Master-Slave Architecture 3 Machines in MASTER MODE Name ode JOB tracker Secondary name node Data nodes in slave mode
  • 4. HADOOP supports 3 configuration modes when its is implemented on commodity hardware: Standalone mode : All services run locally on single machine on a single JVM (seldom used) Pseudo distributed mode : All services run on the same machine but on a different JVM (development and testing purpose) Fully distributed mode: Each service runs on a seperate hardware (a dedicated server). Used in production setup. Note: Service here reffers to namenode, secondary name node, job tracker and data node. HADOOP Deployment Modes 4
  • 5. ∙ Master nodes ∙ Name node : Central file system manager –Secondary name node : Data backup of name node (not hot standby) –Job tracker : Centralized job scheduler – ∙ Slave nodes and deamons/software services –Data node : Machine where files gets stored and processed –Task tracker : A software service which monitors the state of job tracker – – –Note : Every slave node keeps sending a heart beat signal to the name node once in every 3 seconds to state that its alive. What happens when a data node goes down would be discussed in the subsequent slides. – Functionality of Each Component of HADOOP 5
  • 6. What is a job in HADOOP eco system? � A job usually is some task submitted by the user to the Hadoop cluster. � The job is in the form of a program or collection of programs (a JAR file) which needs to be executed. � A job would have the following attributes to it 1)The actual program/s 2)Input data to the program (a file or a collection of files in a directory) 3)The output directory where the results of execution is collected in a file/s 6
  • 7. Core features of Apache Hadoop are: ∙ HDFS (Hadoop Distributed File System) – data storage ∙ MapReduce Framework – compute in distributed environment –A Java framework responsible for processing jobs in distributed mode –User-defined map phase, which is a parallel, share-nothing processing of input –User-defined reduce phase aggregates of the output of the map phase Apache HADOOP Core Features 7
  • 8. Submitting and executing a job in a hadoop cluster 8 Data nodes The engineer/analyst's machine is not a part of Hadoop cluster. Usually Hadoop would be installed in pseudo- distributed mode on his/her machine. The job ( program/s ) would be submitted to the gateway machine The gateway machine would have the necessary configuration to communicate to the name node and job tracker. Job gets submitted to the name node and eventually job tracker is responsible for scheduling the execution of the job on the data nodes in the cluster. Gateway Name node Job tracker Engineer/analyst
  • 9. 9 Summary An overview of the core components of HADOOP 1.0 Basic understanding of the components namenode, secondary name node, job tracker, data node and task tracker What is a job in a HADOOP eco system ? How to submit a job in a HADOOP cluster ? 9 9