SlideShare a Scribd company logo
Hadoop
MapReduce
HELLO!
I am Yogender Singh
Working with Entrench Electronic and penthao
as big data and spark developer.
Map reduce
Introduction
Agenda For Today’s Session
◉ What is hadoop MapReduce ?
◉ MapReduce In Nutshell
◉ Two Advantages of MapReduce
◉ Hadoop MapReduce Approach with an Example
What is mapreduce?
Components of mapreduce
Storage
Hadoop Components
Processing
2 main Hadoop Components
MapReduce Data Processing and
Programming
◉ MapReduce is the processing components
of Apache Hadoop
◉ It process data parallelly in distributed
environment
Result
MapReduce In Nutshell
2 Biggest Adavantages
Of MapReduce
Advantage 1 : Parallel Processing
◉ Data is processed in parallel.
◉ Processing become fast.
Data
Advantage 2 : Data Locality – Processing to Storage
◉ Moving Data to processing is very
costly.
◉ In MapReduce , we move processing to
data.
Data
Traditional vs Mapreduce
Election votes counting : Traditional way
Election Vote Casting
◉ vote is stored at different Booths
◉ Result Center has the details of
all the Booths
Election votes counting : MapReduce way
Counting - MapReduce Approach
◉ Votes are counted at individual booths.
◉ Booth-wise result are send back to the result
Centre.
◉ Final Result is declared easily and quickly
using this way.
MapReduce way :
INPUT
OUTPUT
Map()
Map()
Map()
Reduce()
Reduce()
Anatomy of a MapReduce Program
MapReduce
ValueKey
(k1,v1)
List(k3,v3)(k2,list(v2))
List(K2,v2)
Map :
Reduce :
Example (MapReduce Problem)
MapReduce way – Word Count Process
The overall MapReduce Word Count Process
Input Split
Deer Bear River
Car Car River
Deer Car Bear
Deer Bear River
Car Car River
Splitting
MapReduce way – Word Count Process
The overall MapReduce Word Count Process
Input Split
Deer Bear River
Car Car River
Deer Car Bear
Deer Bear River
Car Car River
Deer Car Bear
Splitting
Deer,1
Bear,1
River,1
Car,1
Car,1
River,1
Mapping
MapReduce way – Word Count Process
The overall MapReduce Word Count Process
Input Split
Deer Bear River
Car Car River
Deer Car Bear
Deer Bear River
Car Car River
Deer Car Bear
Splitting
Deer,1
Bear,1
River,1
Car,1
Car,1
River,1
Deer,1
Car,1
Bear,1
Bear,(1,1)
Car,(1,1,1)
Dear,(1,1)
Mapping Shuffling
MapReduce way – Word Count Process
The overall MapReduce Word Count Process
Input Split
Deer Bear River
Car Car River
Deer Car Bear
Deer Bear River
Car Car River
Deer Car Bear
Splitting
Deer,1
Bear,1
River,1
Car,1
Car,1
River,1
Deer,1
Car,1
Bear,1
Bear,(1,1)
Car,(1,1,1)
Dear,(1,1)
River,(1,1)
Mapping
Bear,2
Car,3
Dear,2
Shuffling Reducing
MapReduce way – Word Count Process
The overall MapReduce Word Count Process
Input Split
Deer Bear River
Car Car River
Deer Car Bear
Deer Bear River
Car Car River
Deer Car Bear
Splitting
Deer,1
Bear,1
River,1
Car,1
Car,1
River,1
Deer,1
Car,1
Bear,1
Bear,(1,1)
Car,(1,1,1)
Dear,(1,1)
River,(1,1)
Mapping
Bear,2
Car,3
Dear,2
River,2
Shuffling Reducing Final Result
Bear,2
Car,3
Dear,2
River,2
THANKS!
Any questions?
You can reach me
yash991314@gmail.com / yogi@entrench.org

More Related Content

What's hot

EuroPython 2019: GeoSpatial Analysis using Python and JupyterHub
EuroPython 2019: GeoSpatial Analysis using Python and JupyterHubEuroPython 2019: GeoSpatial Analysis using Python and JupyterHub
EuroPython 2019: GeoSpatial Analysis using Python and JupyterHubMartin Christen
 
MapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy ResourcesMapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy ResourcesPrakher Hajela Saxena
 
Plugins in QGIS and its uses
Plugins in QGIS and its usesPlugins in QGIS and its uses
Plugins in QGIS and its usesMayuresh Padalkar
 
LIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3DLIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3DPrakher Hajela Saxena
 
Exploring Spatial data in GIS Environment
Exploring Spatial data in GIS Environment Exploring Spatial data in GIS Environment
Exploring Spatial data in GIS Environment NAXA-Developers
 
Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...
Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...
Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...Kohei Otsuka
 
How to empower community by using GIS lecture 2
How to empower community by using GIS lecture 2How to empower community by using GIS lecture 2
How to empower community by using GIS lecture 2wang yaohui
 
FME and the BGS in 2016/2017
FME and the BGS in 2016/2017FME and the BGS in 2016/2017
FME and the BGS in 2016/2017Sterling Geo
 
FME World Tour 2015 Belfast - Donegal County Council - Daragh McDonough
FME World Tour 2015 Belfast - Donegal County Council - Daragh McDonoughFME World Tour 2015 Belfast - Donegal County Council - Daragh McDonough
FME World Tour 2015 Belfast - Donegal County Council - Daragh McDonoughIMGS
 
A vision to make OSM data the backbone of history across time and space - Int...
A vision to make OSM data the backbone of history across time and space - Int...A vision to make OSM data the backbone of history across time and space - Int...
A vision to make OSM data the backbone of history across time and space - Int...Kohei Otsuka
 
R programming language in spatial analysis
R programming language in spatial analysisR programming language in spatial analysis
R programming language in spatial analysisAbhiram Kanigolla
 
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015Prakher Hajela Saxena
 
An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)
An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)
An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)Rich Harris
 
Creating and indoor routable network with QGIS and pgRouting
Creating and indoor routable network with QGIS and pgRoutingCreating and indoor routable network with QGIS and pgRouting
Creating and indoor routable network with QGIS and pgRoutingRoss McDonald
 
Datamine Discover3D - LIDAR-Drone Applications
Datamine Discover3D - LIDAR-Drone ApplicationsDatamine Discover3D - LIDAR-Drone Applications
Datamine Discover3D - LIDAR-Drone ApplicationsPrakher Hajela Saxena
 
Pgrouting_foss4guk_ross_mcdonald
Pgrouting_foss4guk_ross_mcdonaldPgrouting_foss4guk_ross_mcdonald
Pgrouting_foss4guk_ross_mcdonaldRoss McDonald
 
How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1wang yaohui
 

What's hot (20)

Mapreduce
MapreduceMapreduce
Mapreduce
 
EuroPython 2019: GeoSpatial Analysis using Python and JupyterHub
EuroPython 2019: GeoSpatial Analysis using Python and JupyterHubEuroPython 2019: GeoSpatial Analysis using Python and JupyterHub
EuroPython 2019: GeoSpatial Analysis using Python and JupyterHub
 
MapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy ResourcesMapInfo Discover 3D for Wind Energy Resources
MapInfo Discover 3D for Wind Energy Resources
 
Plugins in QGIS and its uses
Plugins in QGIS and its usesPlugins in QGIS and its uses
Plugins in QGIS and its uses
 
LIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3DLIDAR and Drone Data - Datamine Discover3D
LIDAR and Drone Data - Datamine Discover3D
 
Introduction to GIS
Introduction to GISIntroduction to GIS
Introduction to GIS
 
Exploring Spatial data in GIS Environment
Exploring Spatial data in GIS Environment Exploring Spatial data in GIS Environment
Exploring Spatial data in GIS Environment
 
Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...
Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...
Introduction of HTGCL (Historical Topographic Ground Control Line) - New para...
 
How to empower community by using GIS lecture 2
How to empower community by using GIS lecture 2How to empower community by using GIS lecture 2
How to empower community by using GIS lecture 2
 
FME and the BGS in 2016/2017
FME and the BGS in 2016/2017FME and the BGS in 2016/2017
FME and the BGS in 2016/2017
 
Map Reduce introduction
Map Reduce introductionMap Reduce introduction
Map Reduce introduction
 
FME World Tour 2015 Belfast - Donegal County Council - Daragh McDonough
FME World Tour 2015 Belfast - Donegal County Council - Daragh McDonoughFME World Tour 2015 Belfast - Donegal County Council - Daragh McDonough
FME World Tour 2015 Belfast - Donegal County Council - Daragh McDonough
 
A vision to make OSM data the backbone of history across time and space - Int...
A vision to make OSM data the backbone of history across time and space - Int...A vision to make OSM data the backbone of history across time and space - Int...
A vision to make OSM data the backbone of history across time and space - Int...
 
R programming language in spatial analysis
R programming language in spatial analysisR programming language in spatial analysis
R programming language in spatial analysis
 
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
 
An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)
An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)
An Introduction to Mapping, GIS and Spatial Modelling in R (presentation)
 
Creating and indoor routable network with QGIS and pgRouting
Creating and indoor routable network with QGIS and pgRoutingCreating and indoor routable network with QGIS and pgRouting
Creating and indoor routable network with QGIS and pgRouting
 
Datamine Discover3D - LIDAR-Drone Applications
Datamine Discover3D - LIDAR-Drone ApplicationsDatamine Discover3D - LIDAR-Drone Applications
Datamine Discover3D - LIDAR-Drone Applications
 
Pgrouting_foss4guk_ross_mcdonald
Pgrouting_foss4guk_ross_mcdonaldPgrouting_foss4guk_ross_mcdonald
Pgrouting_foss4guk_ross_mcdonald
 
How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1How to empower community by using GIS lecture 1
How to empower community by using GIS lecture 1
 

Similar to Mapreduce introduction

Brief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and CassandraBrief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and CassandraSomnath Mazumdar
 
Introduction to map reduce
Introduction to map reduceIntroduction to map reduce
Introduction to map reduceBhupesh Chawda
 
Apache Hadoop - A Deep Dive (Part 2 - MapReduce)
Apache Hadoop - A Deep Dive (Part 2 - MapReduce)Apache Hadoop - A Deep Dive (Part 2 - MapReduce)
Apache Hadoop - A Deep Dive (Part 2 - MapReduce)Debarchan Sarkar
 
MAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptxMAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptxHARIKRISHNANU13
 
Big Data Analytics Chapter3-6@2021.pdf
Big Data Analytics Chapter3-6@2021.pdfBig Data Analytics Chapter3-6@2021.pdf
Big Data Analytics Chapter3-6@2021.pdfWasyihunSema2
 
Comparing Distributed Indexing To Mapreduce or Not?
Comparing Distributed Indexing To Mapreduce or Not?Comparing Distributed Indexing To Mapreduce or Not?
Comparing Distributed Indexing To Mapreduce or Not?TerrierTeam
 
Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduceDavid Gleich
 
Behm Shah Pagerank
Behm Shah PagerankBehm Shah Pagerank
Behm Shah Pagerankgothicane
 
Automatic Scaling Iterative Computations
Automatic Scaling Iterative ComputationsAutomatic Scaling Iterative Computations
Automatic Scaling Iterative ComputationsGuozhang Wang
 
Stratosphere with big_data_analytics
Stratosphere with big_data_analyticsStratosphere with big_data_analytics
Stratosphere with big_data_analyticsAvinash Pandu
 
Introduction to Spark on Hadoop
Introduction to Spark on HadoopIntroduction to Spark on Hadoop
Introduction to Spark on HadoopCarol McDonald
 
High Performance Computing on NYC Yellow Taxi Data Set
High Performance Computing on NYC Yellow Taxi Data SetHigh Performance Computing on NYC Yellow Taxi Data Set
High Performance Computing on NYC Yellow Taxi Data SetParag Ahire
 

Similar to Mapreduce introduction (20)

Brief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and CassandraBrief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
Brief introduction on Hadoop,Dremel, Pig, FlumeJava and Cassandra
 
Introduction to map reduce
Introduction to map reduceIntroduction to map reduce
Introduction to map reduce
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Apache Hadoop - A Deep Dive (Part 2 - MapReduce)
Apache Hadoop - A Deep Dive (Part 2 - MapReduce)Apache Hadoop - A Deep Dive (Part 2 - MapReduce)
Apache Hadoop - A Deep Dive (Part 2 - MapReduce)
 
Mypreson 27
Mypreson 27Mypreson 27
Mypreson 27
 
MAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptxMAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptx
 
Big Data Analytics Chapter3-6@2021.pdf
Big Data Analytics Chapter3-6@2021.pdfBig Data Analytics Chapter3-6@2021.pdf
Big Data Analytics Chapter3-6@2021.pdf
 
Hadoop Map Reduce
Hadoop Map ReduceHadoop Map Reduce
Hadoop Map Reduce
 
Comparing Distributed Indexing To Mapreduce or Not?
Comparing Distributed Indexing To Mapreduce or Not?Comparing Distributed Indexing To Mapreduce or Not?
Comparing Distributed Indexing To Mapreduce or Not?
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduce
 
T180304125129
T180304125129T180304125129
T180304125129
 
Using MapReduce for Large–scale Medical Image Analysis
Using MapReduce for Large–scale Medical Image AnalysisUsing MapReduce for Large–scale Medical Image Analysis
Using MapReduce for Large–scale Medical Image Analysis
 
Behm Shah Pagerank
Behm Shah PagerankBehm Shah Pagerank
Behm Shah Pagerank
 
Automatic Scaling Iterative Computations
Automatic Scaling Iterative ComputationsAutomatic Scaling Iterative Computations
Automatic Scaling Iterative Computations
 
Stratosphere with big_data_analytics
Stratosphere with big_data_analyticsStratosphere with big_data_analytics
Stratosphere with big_data_analytics
 
Introduction to Spark on Hadoop
Introduction to Spark on HadoopIntroduction to Spark on Hadoop
Introduction to Spark on Hadoop
 
High Performance Computing on NYC Yellow Taxi Data Set
High Performance Computing on NYC Yellow Taxi Data SetHigh Performance Computing on NYC Yellow Taxi Data Set
High Performance Computing on NYC Yellow Taxi Data Set
 
Hadoop
HadoopHadoop
Hadoop
 
Hadoop ecosystem
Hadoop ecosystemHadoop ecosystem
Hadoop ecosystem
 

Recently uploaded

一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单nscud
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单ewymefz
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单vcaxypu
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsalex933524
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单nscud
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单vcaxypu
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIAlejandraGmez176757
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单ewymefz
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBAlireza Kamrani
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxbenishzehra469
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxzahraomer517
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单ewymefz
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...correoyaya
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Domenico Conte
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单ukgaet
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundOppotus
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单ewymefz
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?DOT TECH
 

Recently uploaded (20)

一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
一比一原版(CBU毕业证)卡普顿大学毕业证成绩单
 
一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单一比一原版(NYU毕业证)纽约大学毕业证成绩单
一比一原版(NYU毕业证)纽约大学毕业证成绩单
 
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
一比一原版(RUG毕业证)格罗宁根大学毕业证成绩单
 
Tabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflowsTabula.io Cheatsheet: automate your data workflows
Tabula.io Cheatsheet: automate your data workflows
 
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
一比一原版(CBU毕业证)不列颠海角大学毕业证成绩单
 
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
一比一原版(ArtEZ毕业证)ArtEZ艺术学院毕业证成绩单
 
Business update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMIBusiness update Q1 2024 Lar España Real Estate SOCIMI
Business update Q1 2024 Lar España Real Estate SOCIMI
 
Opendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptxOpendatabay - Open Data Marketplace.pptx
Opendatabay - Open Data Marketplace.pptx
 
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
一比一原版(UPenn毕业证)宾夕法尼亚大学毕业证成绩单
 
Using PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDBUsing PDB Relocation to Move a Single PDB to Another Existing CDB
Using PDB Relocation to Move a Single PDB to Another Existing CDB
 
Empowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptxEmpowering Data Analytics Ecosystem.pptx
Empowering Data Analytics Ecosystem.pptx
 
Introduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxxIntroduction-to-Cybersecurit57hhfcbbcxxx
Introduction-to-Cybersecurit57hhfcbbcxxx
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单
 
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
Innovative Methods in Media and Communication Research by Sebastian Kubitschk...
 
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
Professional Data Engineer Certification Exam Guide  _  Learn  _  Google Clou...
 
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
一比一原版(UVic毕业证)维多利亚大学毕业证成绩单
 
Q1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year ReboundQ1’2024 Update: MYCI’s Leap Year Rebound
Q1’2024 Update: MYCI’s Leap Year Rebound
 
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
一比一原版(UofM毕业证)明尼苏达大学毕业证成绩单
 
How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?How can I successfully sell my pi coins in Philippines?
How can I successfully sell my pi coins in Philippines?
 

Mapreduce introduction

  • 2. HELLO! I am Yogender Singh Working with Entrench Electronic and penthao as big data and spark developer.
  • 4. Agenda For Today’s Session ◉ What is hadoop MapReduce ? ◉ MapReduce In Nutshell ◉ Two Advantages of MapReduce ◉ Hadoop MapReduce Approach with an Example
  • 7. MapReduce Data Processing and Programming ◉ MapReduce is the processing components of Apache Hadoop ◉ It process data parallelly in distributed environment Result
  • 10. Advantage 1 : Parallel Processing ◉ Data is processed in parallel. ◉ Processing become fast. Data
  • 11. Advantage 2 : Data Locality – Processing to Storage ◉ Moving Data to processing is very costly. ◉ In MapReduce , we move processing to data. Data
  • 13. Election votes counting : Traditional way Election Vote Casting ◉ vote is stored at different Booths ◉ Result Center has the details of all the Booths
  • 14. Election votes counting : MapReduce way Counting - MapReduce Approach ◉ Votes are counted at individual booths. ◉ Booth-wise result are send back to the result Centre. ◉ Final Result is declared easily and quickly using this way.
  • 16. Anatomy of a MapReduce Program MapReduce ValueKey (k1,v1) List(k3,v3)(k2,list(v2)) List(K2,v2) Map : Reduce :
  • 18. MapReduce way – Word Count Process The overall MapReduce Word Count Process Input Split Deer Bear River Car Car River Deer Car Bear Deer Bear River Car Car River Splitting
  • 19. MapReduce way – Word Count Process The overall MapReduce Word Count Process Input Split Deer Bear River Car Car River Deer Car Bear Deer Bear River Car Car River Deer Car Bear Splitting Deer,1 Bear,1 River,1 Car,1 Car,1 River,1 Mapping
  • 20. MapReduce way – Word Count Process The overall MapReduce Word Count Process Input Split Deer Bear River Car Car River Deer Car Bear Deer Bear River Car Car River Deer Car Bear Splitting Deer,1 Bear,1 River,1 Car,1 Car,1 River,1 Deer,1 Car,1 Bear,1 Bear,(1,1) Car,(1,1,1) Dear,(1,1) Mapping Shuffling
  • 21. MapReduce way – Word Count Process The overall MapReduce Word Count Process Input Split Deer Bear River Car Car River Deer Car Bear Deer Bear River Car Car River Deer Car Bear Splitting Deer,1 Bear,1 River,1 Car,1 Car,1 River,1 Deer,1 Car,1 Bear,1 Bear,(1,1) Car,(1,1,1) Dear,(1,1) River,(1,1) Mapping Bear,2 Car,3 Dear,2 Shuffling Reducing
  • 22. MapReduce way – Word Count Process The overall MapReduce Word Count Process Input Split Deer Bear River Car Car River Deer Car Bear Deer Bear River Car Car River Deer Car Bear Splitting Deer,1 Bear,1 River,1 Car,1 Car,1 River,1 Deer,1 Car,1 Bear,1 Bear,(1,1) Car,(1,1,1) Dear,(1,1) River,(1,1) Mapping Bear,2 Car,3 Dear,2 River,2 Shuffling Reducing Final Result Bear,2 Car,3 Dear,2 River,2
  • 23. THANKS! Any questions? You can reach me yash991314@gmail.com / yogi@entrench.org