An Introduction to Hadoop
Class Presentation by
Damon A. Runion
MIS 2321 - Spring 2017
Hello and welcome to An Introduction to Hadoop
Data Everywhere
“Every two days now we create as much information as we did
from the dawn of civilization up until 2003”
Eric Schmidt
then CEO of Google
Aug 4, 2010
Read this quote. That data is something like 4 exabytes.
The Hadoop Project
Originally based on papers published by Google in 2003 and 2004
Hadoop started in 2006 at Yahoo!
Top-level Apache Foundation project
Large, active user base, user groups
Very active development, strong development team
One way to do that analysis is through Hadoop
Who Uses Hadoop?
Rackspace for log processing. Netflix for recommendations.
LinkedIn for social graph. SU for page recommendations.
Hadoop Components
Storage: self-healing, high-bandwidth clustered storage (HDFS)
Processing: fault-tolerant distributed processing (MapReduce)
HDFS cluster/healing. MapReduce
HDFS Basics
HDFS is a filesystem written in Java
Sits on top of a native filesystem
Provides redundant storage for massive amounts of data
Use cheap(ish), unreliable computers
Let’s talk about HDFS
HDFS Data
Data is split into blocks and stored on multiple nodes in the cluster
Each block is usually 64 MB or 128 MB (configurable)
Each block is replicated multiple times (configurable)
Replicas are stored on different data nodes
Best suited to large files, 100 MB+
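The block and replica arithmetic above can be sketched in a few lines of Python. This is only an illustration, not part of Hadoop: the function name `hdfs_footprint` is hypothetical, and the defaults (128 MB blocks, replication factor 3) are assumptions standing in for whatever the cluster is actually configured to use.

```python
MB = 1024 * 1024


def hdfs_footprint(file_size_bytes, block_size_bytes=128 * MB, replication=3):
    """Return (number of blocks, total bytes stored across the cluster).

    Hypothetical helper for illustration; the defaults are assumptions.
    """
    if file_size_bytes < 0 or block_size_bytes <= 0 or replication < 1:
        raise ValueError("sizes must be non-negative and replication at least 1")
    # Integer ceiling division: a partial final block still counts as a block.
    num_blocks = (file_size_bytes + block_size_bytes - 1) // block_size_bytes
    # Every byte of the file is stored once per replica.
    raw_bytes = file_size_bytes * replication
    return num_blocks, raw_bytes


blocks, raw = hdfs_footprint(1024 * MB)
print(f"1 GB file: {blocks} blocks, {raw // MB} MB stored in total")
```

Under these assumptions a 1 GB file becomes 8 blocks and takes about 3 GB of raw disk across the cluster, which is the cost of the redundancy that lets HDFS survive node failures.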
What is MapReduce?
MapReduce is a method for distributing a task across multiple nodes
Automatic parallelization and distribution
Each node processes data stored on that node (processing goes to the data, unlike databases, where data is brought to the query engine)
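To make the idea concrete, here is a minimal single-process sketch of the map, shuffle, and reduce steps using a word count. It does not use the Hadoop API; the function names and the sample `node_chunks` list are hypothetical, with each string standing in for the data held on one node.

```python
from collections import defaultdict


def map_phase(chunk):
    """Emit a (word, 1) pair for each word in one node's local chunk."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(mapped_outputs):
    """Group the emitted values by key across every node's map output."""
    grouped = defaultdict(list)
    for pairs in mapped_outputs:
        for key, value in pairs:
            grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(chunks):
    # Each chunk is mapped independently, as if on the node that stores it.
    mapped = [map_phase(chunk) for chunk in chunks]
    return reduce_phase(shuffle(mapped))


# Hypothetical sample data: one string per node.
node_chunks = ["hadoop stores data", "hadoop processes data", "data everywhere"]
counts = word_count(node_chunks)
print(counts)
```

In a real cluster the map calls would run in parallel on the nodes holding each block, and only the much smaller intermediate pairs would travel over the network to the reducers.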
