SlideShare a Scribd company logo
1 of 8
Download to read offline
MapReduce: Simplified Data
Processing on Large Clusters
          Rob Keisler
           CSCI 638
         Summer 2011
Outline

● Background

● Model

● Examples

● Execution

● Conclusions
Background

● Transformation operations are conceptually straightforward
   ○ Until data is large and the computation must be
     distributed over hundred or thousands of machines

● So, Google created MapReduce

● MapReduce is a programming abstraction
   ○ Expresses simple computations
   ○ Hides complexity details
Model

● Utilizes higher-order shaping functions Map and Reduce to
  take a set of input key/value pairs and produce a set of
  output key/value pairs

● Map
   ○ Takes an input key/value pair and produces a set of
     intermediate key/value pairs

● Reduce
   ○ Accepts an intermediate key I and a set of values for
     that key, and merges those values to form possibly
     smaller sets of values
Examples

● Distributed Grep

● Count of URL Access Frequency

● Reverse Web-Link Graph

● Term-Vector per Host

● Inverted Index

● Distributed Sort
Execution Overview
Conclusions

● The MapReduce programming model proved to be a useful
  abstraction for many different purposes
   ○ Easy to use
       ■ even for programmers without experience with
         parallel and distributed systems
   ○ A large variety of problems are easily expressible as
     MapReduce computations
   ○ The implementation scales to large clusters of machines

● Greatly simplifies large-scale computations at Google
Questions?

http://labs.google.com/papers/mapreduce.html

More Related Content

What's hot

FME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the treesFME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the treesSafe Software
 
Using FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a CityUsing FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a CitySafe Software
 
Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017Safe Software
 
FME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility IdeasFME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility IdeasSafe Software
 
Supporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME ServerSupporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME ServerSafe Software
 
Prepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your RequirementsPrepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your RequirementsSafe Software
 
Using FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas CompaniesUsing FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas CompaniesSafe Software
 
Using GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissionsUsing GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissionsniket_narang
 
Map Reduce
Map ReduceMap Reduce
Map Reducemsgroner
 
KDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FMEKDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FMESafe Software
 
Gain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud DataGain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud DataSafe Software
 
Creating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of BarrieCreating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of BarrieSafe Software
 
Some of my favourite QGIS plugins
Some of my favourite QGIS pluginsSome of my favourite QGIS plugins
Some of my favourite QGIS pluginsMark Owen
 
Essential NumPy By ZekeLabs
Essential NumPy By ZekeLabsEssential NumPy By ZekeLabs
Essential NumPy By ZekeLabsAwantik Das
 
Dr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISDr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISShaun Lewis
 
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...I3E Technologies
 
Tilemill gwu-wboykinm
Tilemill gwu-wboykinmTilemill gwu-wboykinm
Tilemill gwu-wboykinmBill Morris
 
From Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue MappingFrom Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue MappingSafe Software
 

What's hot (20)

FME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the treesFME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the trees
 
Using FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a CityUsing FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a City
 
Map Reduce Presentation
Map Reduce PresentationMap Reduce Presentation
Map Reduce Presentation
 
Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017
 
FME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility IdeasFME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility Ideas
 
Supporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME ServerSupporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME Server
 
Prepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your RequirementsPrepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your Requirements
 
Using FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas CompaniesUsing FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas Companies
 
Om
OmOm
Om
 
Using GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissionsUsing GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissions
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
KDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FMEKDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FME
 
Gain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud DataGain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud Data
 
Creating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of BarrieCreating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of Barrie
 
Some of my favourite QGIS plugins
Some of my favourite QGIS pluginsSome of my favourite QGIS plugins
Some of my favourite QGIS plugins
 
Essential NumPy By ZekeLabs
Essential NumPy By ZekeLabsEssential NumPy By ZekeLabs
Essential NumPy By ZekeLabs
 
Dr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISDr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GIS
 
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
 
Tilemill gwu-wboykinm
Tilemill gwu-wboykinmTilemill gwu-wboykinm
Tilemill gwu-wboykinm
 
From Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue MappingFrom Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue Mapping
 

Similar to MapReduce

Big data processing systems research
Big data processing systems researchBig data processing systems research
Big data processing systems researchVasia Kalavri
 
Superworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueSuperworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueDatabricks
 
Hadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. ElephantHadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. ElephantAkshay Rai
 
MapReduce Programming Model
MapReduce Programming ModelMapReduce Programming Model
MapReduce Programming ModelAdarshaDhakal
 
Introduction to Machine Learning with Spark
Introduction to Machine Learning with SparkIntroduction to Machine Learning with Spark
Introduction to Machine Learning with Sparkdatamantra
 
My mapreduce1 presentation
My mapreduce1 presentationMy mapreduce1 presentation
My mapreduce1 presentationNoha Elprince
 
Hadoop Map Reduce OS
Hadoop Map Reduce OSHadoop Map Reduce OS
Hadoop Map Reduce OSVedant Mane
 
An Introduction to MapReduce
An Introduction to MapReduce An Introduction to MapReduce
An Introduction to MapReduce Sina Ebrahimi
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...areej qasrawi
 
Mapreduce2008 cacm
Mapreduce2008 cacmMapreduce2008 cacm
Mapreduce2008 cacmlmphuong06
 
Map reduce presentation
Map reduce presentationMap reduce presentation
Map reduce presentationAhmad El Tawil
 
Big Data processing with Apache Spark
Big Data processing with Apache SparkBig Data processing with Apache Spark
Big Data processing with Apache SparkLucian Neghina
 
Managed Cluster Services
Managed Cluster ServicesManaged Cluster Services
Managed Cluster ServicesAdam Doyle
 
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data AnalyticsFugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data AnalyticsDatabricks
 
Netflix machine learning
Netflix machine learningNetflix machine learning
Netflix machine learningAmer Ather
 
Software Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale AutomationSoftware Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale AutomationHao Xu
 
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdfmodule3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdfTSANKARARAO
 

Similar to MapReduce (20)

Big data processing systems research
Big data processing systems researchBig data processing systems research
Big data processing systems research
 
Superworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueSuperworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and Fugue
 
Hadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. ElephantHadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. Elephant
 
MapReduce Programming Model
MapReduce Programming ModelMapReduce Programming Model
MapReduce Programming Model
 
Introduction to Machine Learning with Spark
Introduction to Machine Learning with SparkIntroduction to Machine Learning with Spark
Introduction to Machine Learning with Spark
 
Main map reduce
Main map reduceMain map reduce
Main map reduce
 
My mapreduce1 presentation
My mapreduce1 presentationMy mapreduce1 presentation
My mapreduce1 presentation
 
Hadoop Map Reduce OS
Hadoop Map Reduce OSHadoop Map Reduce OS
Hadoop Map Reduce OS
 
An Introduction to MapReduce
An Introduction to MapReduce An Introduction to MapReduce
An Introduction to MapReduce
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
 
Mapreduce2008 cacm
Mapreduce2008 cacmMapreduce2008 cacm
Mapreduce2008 cacm
 
Map reduce presentation
Map reduce presentationMap reduce presentation
Map reduce presentation
 
Big Data processing with Apache Spark
Big Data processing with Apache SparkBig Data processing with Apache Spark
Big Data processing with Apache Spark
 
try
trytry
try
 
Managed Cluster Services
Managed Cluster ServicesManaged Cluster Services
Managed Cluster Services
 
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data AnalyticsFugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
 
Netflix machine learning
Netflix machine learningNetflix machine learning
Netflix machine learning
 
Software Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale AutomationSoftware Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale Automation
 
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdfmodule3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
 
Big Data.pptx
Big Data.pptxBig Data.pptx
Big Data.pptx
 

Recently uploaded

ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 

Recently uploaded (20)

ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 

MapReduce

  • 1. MapReduce: Simplified Data Processing on Large Clusters Rob Keisler CSCI 638 Summer 2011
  • 2. Outline ● Background ● Model ● Examples ● Execution ● Conclusions
  • 3. Background ● Transformation operations are conceptually straightforward ○ Until data is large and the computation must be distributed over hundred or thousands of machines ● So, Google created MapReduce ● MapReduce is a programming abstraction ○ Expresses simple computations ○ Hides complexity details
  • 4. Model ● Utilizes higher-order shaping functions Map and Reduce to take a set of input key/value pairs and produce a set of output key/value pairs ● Map ○ Takes an input key/value pair and produces a set of intermediate key/value pairs ● Reduce ○ Accepts an intermediate key I and a set of values for that key, and merges those values to form possibly smaller sets of values
  • 5. Examples ● Distributed Grep ● Count of URL Access Frequency ● Reverse Web-Link Graph ● Term-Vector per Host ● Inverted Index ● Distributed Sort
  • 7. Conclusions ● The MapReduce programming model proved to be a useful abstraction for many different purposes ○ Easy to use ■ even for programmers without experience with parallel and distributed systems ○ A large variety of problems are easily expressible as MapReduce computations ○ The implementation scales to large clusters of machines ● Greatly simplifies large-scale computations at Google