Big Data and Hadoop
By Ujjwal Kumar Gupta
Contents
Why Big Data & Hadoop
Drawbacks of Traditional Database
Hadoop History
What is Hadoop & How it Works
Hadoop Cluster
Hadoop Ecosystem
Why Big Data & Hadoop?
Following are the reasons why Big Data is needed:
● 90% of the data in the world today has been created in the last two years alone.
● 80% of the data is unstructured or exists in widely varying structures, which are difficult to analyze.
● Structured formats have some limitations with respect to handling large quantities of data.
● It is difficult to integrate information distributed across multiple systems.
● Most business users do not know what should be analyzed.
● Potentially valuable data is dormant or discarded.
● It is too expensive to justify the integration of large volumes of unstructured data.
● A lot of information has a short, useful lifespan.
● Context adds meaning to the existing information.
Drawbacks of Traditional Database
Expensive - Out of reach for small and mid-size companies
Scalability - As data grows, expanding the system is a challenging task
Time consuming - It takes a lot of time to store and process data
What is Hadoop
● Open source framework designed for storage and processing of large-scale data on clusters of commodity hardware.
● Created by Doug Cutting and Mike Cafarella; it was split out of the Nutch project in 2006.
● Cutting named the program after his son’s toy elephant.
How Hadoop Works
● When data is loaded onto the system, it is divided into blocks, typically 64MB or 128MB.
● Tasks are divided into two phases:
    ● Map tasks, which are done on small portions of data, where the data is stored
    ● Reduce tasks, which combine data to produce the final output
● A master program allocates work to individual nodes.
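The flow above can be illustrated with a minimal single-process sketch in plain Python. It is not Hadoop's API: the function names (`split_into_blocks`, `map_task`, `reduce_task`, `run_job`) and the word-count task are assumptions chosen for illustration, and blocks here are a few lines of text rather than 64MB or 128MB of bytes.

```python
from collections import Counter


def split_into_blocks(lines, lines_per_block):
    """Divide the input into fixed-size blocks, as happens when data is loaded.

    Hypothetical simplification: real blocks are sized in bytes (64MB or 128MB).
    """
    return [lines[i:i + lines_per_block]
            for i in range(0, len(lines), lines_per_block)]


def map_task(block):
    """Map phase: count words using only the data in one block."""
    counts = Counter()
    for line in block:
        counts.update(line.lower().split())
    return counts


def reduce_task(partial_counts):
    """Reduce phase: combine the per-block counts into the final output."""
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total


def run_job(lines, lines_per_block=2):
    """Stand-in for the master program: hand each block to a map task,
    then pass all map outputs to a reduce task."""
    blocks = split_into_blocks(lines, lines_per_block)
    partial_counts = [map_task(block) for block in blocks]
    return reduce_task(partial_counts)


if __name__ == "__main__":
    data = ["big data", "hadoop big", "data data"]
    print(dict(run_job(data)))
```

On a real cluster the map tasks would run in parallel on the nodes that hold each block, and map output would be grouped by key before reaching the reducers; this sketch runs everything sequentially in one process to show only the two-phase structure.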
3 V’s of Big Data (volume, velocity, variety)
Big Data Sources
The sources of Big Data are:
● web logs;
● sensor networks;
● social media;
● internet text and documents;
● internet pages;
● search index data;
● atmospheric science, astronomy, biochemical and medical records;
● scientific research;
● military surveillance; and
● photography archives.
Hadoop Cluster
Who Uses Hadoop
Use Cases of Hadoop
