www.beinghadoop.com
Big Data:
Big data is an all-encompassing term for any collection of data
sets so large and complex that it becomes difficult to process
using on-hand data management tools or traditional data
processing applications.
Big data volumes are typically measured in terabytes, petabytes,
or exabytes.
The data can be structured, semi-structured, or unstructured.
BIG DATA CAN BE
1. Petabytes/exabytes of data
2. Millions/billions of people
3. Billions/trillions of records
4. Loosely structured and often distributed data
5. Flat schemas with few complex interrelationships
6. Often involving time-stamped events
7. Often made up of incomplete data
8. Often including connections between data elements that
must be probabilistically inferred
DATA REPRESENTATION
1 byte = 8 bits
1 kilobyte (KB) = 1024 bytes
1 megabyte (MB) = 1024 kilobytes, or about 1,000,000 bytes
1 gigabyte (GB) = 1024 megabytes, or about 1,000,000,000 bytes
1 terabyte (TB) = 1024 gigabytes, or about 1,000,000,000,000 bytes
1 petabyte (PB) = 1024 terabytes, or about 1,000,000,000,000,000 bytes
1 exabyte (EB) = 1024 petabytes, or about 1,000,000,000,000,000,000 bytes
1 zettabyte (ZB) = 1024 exabytes, or about 1,000,000,000,000,000,000,000 bytes
1 yottabyte (YB) = 1024 zettabytes, or about 1,000,000,000,000,000,000,000,000 bytes
The 1024-based values follow the binary convention; the round figures are the
decimal approximations, and the two are not exactly equal.
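The table above gives each unit both as a power of 1024 and as a round decimal figure. The following is a minimal Python sketch of how far those two readings drift apart as the units grow. The function names (`binary_bytes`, `decimal_bytes`, `gap_percent`) are illustrative and not part of the original slides.

```python
# Unit names in ascending order; index 0 is the first power above bytes.
UNITS = ["KB", "MB", "GB", "TB", "PB", "EB", "ZB", "YB"]


def binary_bytes(unit: str) -> int:
    """Bytes in one unit under the binary convention (powers of 1024)."""
    return 1024 ** (UNITS.index(unit) + 1)


def decimal_bytes(unit: str) -> int:
    """Bytes in one unit under the decimal convention (powers of 1000)."""
    return 1000 ** (UNITS.index(unit) + 1)


def gap_percent(unit: str) -> float:
    """How much larger the binary value is than the decimal one, in percent."""
    return (binary_bytes(unit) / decimal_bytes(unit) - 1) * 100


if __name__ == "__main__":
    for unit in UNITS:
        print(
            f"1 {unit}: binary={binary_bytes(unit):,} "
            f"decimal={decimal_bytes(unit):,} "
            f"gap={gap_percent(unit):.1f}%"
        )
```

The gap is small for kilobytes but compounds with every step, so at the scales discussed in these slides it matters which convention a storage figure uses.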
DATA SIZE    Gigabytes                    Petabytes
ACCESS       Interactive and batch        Batch
UPDATE       Read and write many times    Write once, read many times
STRUCTURE    Static schema                Dynamic schema
INTEGRITY    High                         Low
SCALING      Nonlinear                    Linear
APACHE HADOOP:
Apache Hadoop is a scalable framework for storing and processing data on a
cluster of commodity hardware nodes. Hadoop is designed to scale up from a
single node to thousands of nodes. Hadoop has two main components: a computing
framework and the Hadoop Distributed File System (HDFS). HDFS uses the
commodity server nodes and JBOD (Just a Bunch Of Disks) storage drives to store
the data and provide large aggregated I/O bandwidth to data
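The point about aggregated I/O bandwidth can be made concrete with a rough back-of-the-envelope calculation. The sketch below is an idealized upper bound only: the cluster size, disks per node, and per-disk throughput are hypothetical numbers chosen for illustration, and it ignores replication, network, and CPU limits.

```python
def aggregate_bandwidth(nodes: int, disks_per_node: int, disk_mb_per_s: float) -> float:
    """Idealized aggregate sequential bandwidth in MB/s, all disks in parallel."""
    return nodes * disks_per_node * disk_mb_per_s


def scan_seconds(data_bytes: float, bandwidth_mb_per_s: float) -> float:
    """Seconds needed to read data_bytes once at the given bandwidth (decimal MB)."""
    return data_bytes / (bandwidth_mb_per_s * 1_000_000)


PETABYTE = 10**15  # decimal petabyte, in bytes

# Hypothetical figures: one disk at 100 MB/s versus 100 nodes with 12 disks each.
single_disk = scan_seconds(PETABYTE, 100.0)
cluster = scan_seconds(PETABYTE, aggregate_bandwidth(100, 12, 100.0))

if __name__ == "__main__":
    print(f"single disk: {single_disk / 86_400:.1f} days")
    print(f"cluster:     {cluster / 3_600:.1f} hours")
```

Under these assumed numbers a single disk would need months to read a petabyte once, while the cluster's disks working in parallel would need a few hours, which is the motivation for spreading data across many plain drives.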
Hadoop Use Cases
MANUFACTURING:
Use Apache Hadoop to Increase Production, Reduce Costs & Improve Quality
Assure Just-In-Time Delivery of Raw Materials
Control Quality with Real-Time & Historical Assembly Line Data
Avoid Stoppages with Proactive Equipment Maintenance
Increase Yields in Drug Manufacturing
Channel 
HEALTH CARE:
Use Apache Hadoop to Save Lives While Delivering More Efficient Care
Access Genomic Data for Medical Trials
Monitor Patient Vitals in Real-Time
Track Equipment and Medicines with RFID Data
Improve Prescription Adherence
RETAILERS:
Build a 360° View of the Customer
Analyze Brand Sentiment
Localize & Personalize Promotions
Optimize Websites
Optimize Store Layouts
TELECOM:
Use Apache Hadoop to Improve Service & Launch New Products
Analyze Call Detail Records (CDRs)
Service Equipment Proactively
Rationalize Infrastructure Investments
Recommend Next Product to Buy (NPTB)
Allocate Bandwidth in Real-Time
Develop New Products
Introduction to Bigdata & Hadoop
