Machine
Learning Basics
An Introduction
On a farm far away…
Jack harvests grapes and sells them in the nearby town
After harvesting, he stores the produce in a storage room
Soon there is high demand for other fruits, so he starts harvesting apples and oranges as well
He then realizes that it is time consuming and difficult to harvest all the fruits by himself
So he hires 2 more people to work with him. With this, harvesting is done simultaneously
Now the storage room becomes a bottleneck, because everyone stores and accesses all the fruits in a single storage area
Jack decides to distribute the storage area and give each of them a separate storage space
"Hello, I want a fruit basket of 3 grapes, 2 apples and 3 oranges"
To complete the order on time, all of them work in parallel, each with their own storage space
This solution helps them complete the order on time without any hassle
Fruit basket
All of them are happy, and they are prepared for an increase in demand in the future
So, how does this story
relate to Big Data?
The rise of Big Data
Structured data
Earlier, with limited data, only one processor and one storage unit were needed
Semi-structured data
Unstructured data
Soon, data generation increased, leading to a high volume of data in different formats
A single processor was not enough to process such a high volume of different kinds of data, as it was very time consuming
Hence, multiple processors were used to process the high volume of data, and this saved time
The single storage unit then became the bottleneck, which generated network overhead
The solution was to use distributed storage for each processor. This made it easy to store and access data
This method worked, and the network overhead of the single storage unit was avoided
This is known as parallel processing with distributed storage
Parallel processing
Distributed storage
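The idea of parallel processing with distributed storage can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `partitions` data, the `process_partition` function and the use of threads are all hypothetical stand-ins. In a real cluster each partition would live on a separate machine.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical data: each inner list stands for the records held in one
# worker's own storage unit (a partition), not in a shared central store.
partitions = [
    [3, 8, 1],      # storage unit of worker 0
    [7, 2],         # storage unit of worker 1
    [5, 5, 4, 6],   # storage unit of worker 2
]

def process_partition(records):
    """Work done by one processor, reading only its local partition."""
    return sum(records)

def process_in_parallel(parts):
    """Run one task per partition, then combine the partial results."""
    with ThreadPoolExecutor(max_workers=len(parts)) as pool:
        partial_results = list(pool.map(process_partition, parts))
    return partial_results, sum(partial_results)

partial, total = process_in_parallel(partitions)
print(partial, total)  # [12, 9, 20] 41
```

Threads are used here only to keep the sketch short. The point is the shape of the computation: every worker touches its own data, and only the small partial results are combined.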
What’s in it for you?
1. Big Data and its challenges
2. Hadoop as a solution
3. What is Hadoop?
4. Components of Hadoop
5. Use case of Hadoop
What is Big Data?
A massive amount of data that cannot be stored, processed and analyzed using traditional methods
BIG DATA
VOLUME
VELOCITY
VARIETY
VALUE
VERACITY
Big Data challenges and solution
Challenge: Single central storage
Solution: Distributed storage
Challenge: Serial processing (input A, process, output)
Solution: Parallel processing (inputs A and B, process, output)
Challenge: Lack of ability to process unstructured data
Solution: Ability to process every type of data
Hadoop as a solution
Hadoop provides each of the solutions listed above
What is Hadoop?
Big Data: Storing, Processing, Analyzing
Hadoop is a framework that stores big data in a distributed way and processes it in parallel
Components of Hadoop
HDFS: Storage unit of Hadoop
MapReduce: Processing unit of Hadoop
What is HDFS?
Hadoop Distributed File System (HDFS) is specially designed for storing huge datasets on commodity hardware
HDFS has two core components: NameNode and DataNode
There is only one NameNode
There can be multiple DataNodes
Master/slave nodes typically form the HDFS cluster
Master/NameNode
Slave/DataNode Slave/DataNode Slave/DataNode
The NameNode maintains and manages the DataNodes. It also stores the metadata
DataNodes store the actual data and handle reading, writing and processing. They also perform replication
Heartbeat is the signal that each DataNode periodically sends to the NameNode.
This signal shows the status of the DataNode
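The heartbeat mechanism can be illustrated with a toy monitor. This is a minimal sketch, not HDFS code: the `NameNodeMonitor` class, the node names and the 10-second timeout are hypothetical values chosen for the example.

```python
class NameNodeMonitor:
    """Toy stand-in for the NameNode's view of DataNode liveness."""

    def __init__(self, timeout_seconds):
        self.timeout_seconds = timeout_seconds
        self.last_heartbeat = {}  # DataNode id -> time of last heartbeat

    def receive_heartbeat(self, datanode_id, now):
        """Record that a DataNode reported in at time `now`."""
        self.last_heartbeat[datanode_id] = now

    def live_datanodes(self, now):
        """DataNodes heard from within the timeout window."""
        return sorted(
            node for node, seen in self.last_heartbeat.items()
            if now - seen <= self.timeout_seconds
        )

    def dead_datanodes(self, now):
        """DataNodes that have been silent longer than the timeout."""
        return sorted(
            node for node, seen in self.last_heartbeat.items()
            if now - seen > self.timeout_seconds
        )

monitor = NameNodeMonitor(timeout_seconds=10)  # hypothetical timeout
monitor.receive_heartbeat("datanode-1", now=0)
monitor.receive_heartbeat("datanode-2", now=0)
monitor.receive_heartbeat("datanode-1", now=8)  # datanode-2 goes silent
print(monitor.live_datanodes(now=12))  # ['datanode-1']
print(monitor.dead_datanodes(now=12))  # ['datanode-2']
```

Once a DataNode is considered dead, a real NameNode would arrange for its blocks to be re-replicated elsewhere. That step is left out of this sketch.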
In HDFS, data is stored in a distributed manner
30 TB file
NameNode: 30 TB of data is loaded
Data is divided into blocks of 128 MB each
DataNodes: Blocks are then replicated among the DataNodes
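To get a feel for the numbers, the arithmetic for the 30 TB file can be worked out directly. The file size and block size come from the slides. The replication factor of 3 is an assumption (it is the usual HDFS default, but the slides do not state it), and binary units (1 TB = 1024^4 bytes) are assumed as well.

```python
MB = 1024 ** 2
TB = 1024 ** 4

file_size = 30 * TB        # from the slides
block_size = 128 * MB      # from the slides
replication_factor = 3     # assumption: common HDFS default, not in the slides

# Ceiling division, so a partly filled last block still counts as a block.
num_blocks = (file_size + block_size - 1) // block_size
num_replicas = num_blocks * replication_factor
raw_storage_tb = file_size * replication_factor // TB

print(num_blocks)       # 245760
print(num_replicas)     # 737280
print(raw_storage_tb)   # 90
```

So the single 30 TB file becomes 245,760 blocks, and under the assumed replication factor the cluster stores about 90 TB of raw data spread across the DataNodes.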
Features of HDFS
Provides distributed storage
Implemented on commodity hardware
Provides data security
Highly fault tolerant
What is MapReduce?
Hadoop MapReduce is a programming technique in which huge datasets are processed in a parallel and distributed fashion
Big Data -> Processor -> Output
MapReduce is used for parallel processing of the Big Data that is stored in HDFS
In the MapReduce approach, processing is done at the slave nodes and the final result is sent to the master node
Traditional approach – Data is processed at the Master node
MapReduce approach – Data is processed at the Slave nodes
(Diagram: one Master node connected to four Slave nodes, shown once for each approach)
Input
Bus Car Train
Ship Ship Train
Bus Ship Car
Split: The input dataset is first split into chunks of data
Bus Car Train
Ship Ship Train
Bus Ship Car
Map phase: These chunks of data are then processed by map tasks in parallel
Bus, 1  Car, 1  Train, 1
Ship, 1  Ship, 1  Train, 1
Bus, 1  Ship, 1  Car, 1
Shuffle and sort: Pairs with the same key are grouped together
Bus, 1  Bus, 1
Car, 1  Car, 1
Ship, 1  Ship, 1  Ship, 1
Train, 1  Train, 1
Reduce phase: At the reduce task, the aggregation takes place and the final output is obtained
Bus, 2
Car, 2
Ship, 3
Train, 2
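The word-count flow on these slides can be mimicked in plain Python. This is a sketch of the four stages running on a single machine, not the Hadoop MapReduce API. The function names are hypothetical, and the "one chunk per input line" split is an assumption made to match the slide.

```python
from collections import defaultdict

input_lines = ["Bus Car Train", "Ship Ship Train", "Bus Ship Car"]

def split_input(lines):
    """Split: one chunk per input line (assumption matching the slide)."""
    return [[line] for line in lines]

def map_task(chunk):
    """Map: emit a (word, 1) pair for every word in the chunk."""
    return [(word, 1) for line in chunk for word in line.split()]

def shuffle_and_sort(mapped_chunks):
    """Shuffle and sort: group values by key, with keys in sorted order."""
    groups = defaultdict(list)
    for pairs in mapped_chunks:
        for key, value in pairs:
            groups[key].append(value)
    return dict(sorted(groups.items()))

def reduce_task(key, values):
    """Reduce: aggregate all values that share a key."""
    return key, sum(values)

chunks = split_input(input_lines)
mapped = [map_task(chunk) for chunk in chunks]  # parallel on a real cluster
grouped = shuffle_and_sort(mapped)
word_counts = dict(reduce_task(k, v) for k, v in grouped.items())
print(word_counts)  # {'Bus': 2, 'Car': 2, 'Ship': 3, 'Train': 2}
```

The map tasks are independent of each other, which is why they can run on different nodes. Only the shuffle step needs to move data between them.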
Components of Hadoop version 2.0
HDFS: Storage unit of Hadoop
MapReduce: Processing unit of Hadoop
YARN: Resource management unit of Hadoop
What is YARN?
YARN – Yet Another Resource Negotiator
Acts like an OS to Hadoop 2
Does job scheduling
Responsible for managing cluster resources
Client: Submits the job request
Resource Manager: Responsible for resource allocation and management
Node Manager: Manages the nodes and monitors resource usage
Container: A collection of physical resources such as RAM and CPU
App Master: Requests containers from the Resource Manager and works with the Node Managers to run them
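How these YARN roles interact can be sketched as a small simulation. Everything here is hypothetical: the class names mirror the YARN roles but this is not the YARN API, the node sizes are made up, and the first-fit placement is an assumption (real YARN uses pluggable schedulers). In this sketch the App Master asks the Resource Manager for containers, and the containers are carved out of a Node Manager's free RAM and CPU.

```python
from dataclasses import dataclass, field

@dataclass
class NodeManager:
    """Tracks the free resources of one node."""
    name: str
    free_memory_mb: int
    free_vcores: int
    containers: list = field(default_factory=list)

@dataclass
class Container:
    """A bundle of RAM and CPU reserved on one node."""
    node: str
    memory_mb: int
    vcores: int

class ResourceManager:
    """Allocates containers across the cluster (first-fit, an assumption)."""

    def __init__(self, node_managers):
        self.node_managers = node_managers

    def allocate(self, memory_mb, vcores):
        """Return a Container on the first node with room, else None."""
        for nm in self.node_managers:
            if nm.free_memory_mb >= memory_mb and nm.free_vcores >= vcores:
                nm.free_memory_mb -= memory_mb
                nm.free_vcores -= vcores
                container = Container(nm.name, memory_mb, vcores)
                nm.containers.append(container)
                return container
        return None

class AppMaster:
    """Requests the containers that one job needs."""

    def __init__(self, resource_manager):
        self.resource_manager = resource_manager

    def request_containers(self, count, memory_mb, vcores):
        """Ask for `count` containers and keep the ones that were granted."""
        granted = []
        for _ in range(count):
            container = self.resource_manager.allocate(memory_mb, vcores)
            if container is not None:
                granted.append(container)
        return granted

nodes = [NodeManager("node-1", 4096, 2), NodeManager("node-2", 2048, 2)]
app_master = AppMaster(ResourceManager(nodes))
granted = app_master.request_containers(count=4, memory_mb=2048, vcores=1)
print([c.node for c in granted])  # ['node-1', 'node-1', 'node-2']
```

The fourth request is not granted because no node has 2048 MB of free memory left. That is the kind of situation in which a real scheduler would queue the request until resources free up.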
Hadoop use case – Combating fraudulent activities
Challenge
Detecting fraudulent transactions is one of the many problems any bank faces
Zions’ main challenge was to combat the fraudulent activities that were taking place
Approaches used by Zions’ security team to combat fraudulent activities
Security information management – SIM tools
Problem: They were based on RDBMS and were unable to store the huge volume of data that needed to be analyzed
Parallel processing system
Problem: Analyzing unstructured data was not possible
How Hadoop solved the problems
Storing: Zions could now store massive amounts of data using Hadoop
Processing: Processing of unstructured data (like server logs, customer data, customer transactions) was now possible
Analyzing: In-depth analysis of different data formats became easy and time efficient
Detecting: The team could now detect everything from malware and spear phishing attempts to account takeovers
Key Takeaways
What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop Tutorial | Simplilearn

 

More from Simplilearn (20)

ChatGPT in Cybersecurity
ChatGPT in CybersecurityChatGPT in Cybersecurity
ChatGPT in Cybersecurity
 
Whatis SQL Injection.pptx
Whatis SQL Injection.pptxWhatis SQL Injection.pptx
Whatis SQL Injection.pptx
 
Top 5 High Paying Cloud Computing Jobs in 2023
 Top 5 High Paying Cloud Computing Jobs in 2023  Top 5 High Paying Cloud Computing Jobs in 2023
Top 5 High Paying Cloud Computing Jobs in 2023
 
Types Of Cloud Jobs In 2024
Types Of Cloud Jobs In 2024Types Of Cloud Jobs In 2024
Types Of Cloud Jobs In 2024
 
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
Top 12 AI Technologies To Learn 2024 | Top AI Technologies in 2024 | AI Trend...
 
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
What is LSTM ?| Long Short Term Memory Explained with Example | Deep Learning...
 
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
Top 10 Chat GPT Use Cases | ChatGPT Applications | ChatGPT Tutorial For Begin...
 
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
React JS Vs Next JS - What's The Difference | Next JS Tutorial For Beginners ...
 
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
Backpropagation in Neural Networks | Back Propagation Algorithm with Examples...
 
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
How to Become a Business Analyst ?| Roadmap to Become Business Analyst | Simp...
 
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
Career Opportunities In Artificial Intelligence 2023 | AI Job Opportunities |...
 
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
Programming for Beginners | How to Start Coding in 2023? | Introduction to Pr...
 
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
Best IDE for Programming in 2023 | Top 8 Programming IDE You Should Know | Si...
 
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
React 18 Overview | React 18 New Features and Changes | React 18 Tutorial 202...
 
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
What Is Next JS ? | Introduction to Next JS | Basics of Next JS | Next JS Tut...
 
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
How To Become an SEO Expert In 2023 | SEO Expert Tutorial | SEO For Beginners...
 
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
WordPress Tutorial for Beginners 2023 | What Is WordPress and How Does It Wor...
 
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
Blogging For Beginners 2023 | How To Create A Blog | Blogging Tutorial | Simp...
 
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
How To Start A Blog In 2023 | Pros And Cons Of Blogging | Blogging Tutorial |...
 
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
How to Increase Website Traffic ? | 10 Ways To Increase Website Traffic in 20...
 

Recently uploaded

Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxsqpmdrvczh
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationAadityaSharma884161
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxLigayaBacuel1
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 

Recently uploaded (20)

Romantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptxRomantic Opera MUSIC FOR GRADE NINE pptx
Romantic Opera MUSIC FOR GRADE NINE pptx
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint Presentation
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 

What Is Hadoop? | What Is Big Data & Hadoop | Introduction To Hadoop | Hadoop Tutorial | Simplilearn

  • 2. On a farm far away…
  • 3. Jack harvests grapes and then sells them in the nearby town
  • 4. After harvesting, he stores the produce in a storage room
  • 5. Soon there was high demand for other fruits, so he started harvesting apples and oranges as well
  • 6. He then realizes that it is time consuming and difficult to harvest all the fruits by himself
  • 7. So, he hires 2 more people to work with him. With this, harvesting is done simultaneously
  • 8. Now, the single storage room becomes a bottleneck: everyone has to store and access all the fruits in one storage area
  • 9. Jack now decides to distribute the storage area and give each one of them a separate storage space
  • 10. Customer order: "Hello, I want a fruit basket of 3 grapes, 2 apples and 3 oranges"
  • 11. To complete the order on time, all of them work in parallel, each with their own storage space
  • 12. This solution helps them to complete the fruit basket order on time without any hassles
  • 13. All of them are happy and they are prepared for an increase in demand in the future
  • 14. So, how does this story relate to Big Data?
  • 15. The rise of Big Data. Structured data: earlier, with limited data, only one processor and one storage unit were needed
  • 16. Structured, semi-structured and unstructured data: soon, data generation increased, leading to a high volume of data in different formats
  • 17. A single processor was not enough to process such a high volume of different kinds of data, as it was very time consuming
  • 18. Hence, multiple processors were used to process the high volume of data, and this saved time
  • 19. The single storage unit then became the bottleneck, which generated network overhead
  • 20. The solution was to use distributed storage for each processor. This made it easy to store and access data
  • 21. This method worked, and the network overhead caused by the single storage unit was no longer generated
  • 22-24. This is known as parallel processing with distributed storage
  • 26-30. What’s in it for you? (1) Big Data and its challenges (2) Hadoop as a solution (3) What is Hadoop? (4) Components of Hadoop (5) Use case of Hadoop
  • 31-32. What is Big Data? A massive amount of data that cannot be stored, processed and analyzed using traditional methods
  • 33-37. The five V's of Big Data: Volume, Velocity, Variety, Value, Veracity
  • 38-39. Big Data challenges and solutions. Challenge: single central storage. Solution: distributed storage
  • 40-41. Challenge: serial processing, where a single process turns one input into the output. Solution: parallel processing, where several inputs are processed at the same time
  • 42-43. Challenge: lack of ability to process unstructured data. Solution: ability to process every type of data
  • 44. Hadoop as a solution: Hadoop provides all three, namely distributed storage, parallel processing, and the ability to process every type of data
  • 46. What is Hadoop? Hadoop is a framework that manages big data storage in a distributed way and processes it in parallel. It covers storing, processing and analyzing Big Data
  • 48-49. Components of Hadoop: HDFS is the storage unit of Hadoop; MapReduce is the processing unit of Hadoop
  • 51. What is HDFS? Hadoop Distributed File System (HDFS) is specially designed for storing huge datasets on commodity hardware, using distributed storage
  • 52-54. HDFS has two core components, NameNode and DataNode. There is only one NameNode; there can be multiple DataNodes
  • 55-56. Master/slave nodes typically form the HDFS cluster: the NameNode is the master and the DataNodes are the slaves
  • 57. The NameNode maintains and manages the DataNodes. It also stores the metadata
  • 58-59. The DataNodes store the actual data and handle reading, writing and processing. They also perform replication
  • 60. Heartbeat is the signal that each DataNode continuously sends to the NameNode. This signal shows the status of the DataNode
  • 61-62. In HDFS, data is stored in a distributed manner. Example: a 30 TB file is loaded into a cluster with a NameNode and several DataNodes
  • 63. The data is divided into blocks of 128 MB each
  • 64-65. The blocks are then replicated among the DataNodes
  • 66-69. Features of HDFS: provides distributed storage; implemented on commodity hardware; provides data security; highly fault tolerant
  • 70. Components of Hadoop: storage unit of Hadoop (HDFS) and processing unit of Hadoop (MapReduce)
  • 72-74. What is MapReduce? Hadoop MapReduce is a programming technique in which huge data is processed in a parallel and distributed fashion
  • 75-76. MapReduce is used for parallel processing of the Big Data stored in HDFS, and produces the output
  • 77. In the MapReduce approach, processing is done at the slave nodes and the final result is sent to the master node
  • 78. Traditional approach: data is processed at the master node
  • 79. MapReduce approach: data is processed at the slave nodes
  • 80. MapReduce example. Input: Bus Car Train / Ship Ship Train / Bus Ship Car
  • 81. Split: the input dataset is first split into chunks of data: [Bus Car Train], [Ship Ship Train], [Bus Ship Car]
  • 82. Map phase: these chunks are then processed by map tasks in parallel, each emitting (word, 1) pairs: [Bus, 1; Car, 1; Train, 1], [Ship, 1; Ship, 1; Train, 1], [Bus, 1; Ship, 1; Car, 1]
  • 83. Shuffle and sort: the pairs are grouped by key: [Bus, 1; Bus, 1], [Car, 1; Car, 1], [Ship, 1; Ship, 1; Ship, 1], [Train, 1; Train, 1]
  • 84. Reduce phase: at the reduce task, the aggregation takes place and the final output is obtained: Bus, 2; Car, 2; Ship, 3; Train, 2
  • 85. Components of Hadoop version 2.0: storage unit of Hadoop, processing unit of Hadoop, and resource management unit of Hadoop
  • 87. What is YARN? YARN (Yet Another Resource Negotiator) acts like an OS for Hadoop 2. It is responsible for managing cluster resources and does job scheduling
  • 89. Clients submit job requests
  • 91. The Resource Manager is responsible for resource allocation and management
  • 92-93. Each Node Manager manages its node and monitors resource usage
  • 94. A container is a collection of physical resources such as RAM and CPU
  • 95-96. The App Master requests containers from the Resource Manager, and the Node Managers launch them on their nodes
  • 97-98. Hadoop use case: combating fraudulent activities. Detecting fraudulent transactions is one of the many problems any bank faces
  • 99. Challenge: Zions’ main challenge was to combat the fraudulent activities that were taking place
  • 100-101. Approaches used by Zions’ security team to combat fraudulent activities. First approach: security information management (SIM) tools. Problem: they were based on RDBMS and unable to store the huge volume of data that needed to be analyzed
  • 102-103. Second approach: a parallel processing system. Problem: analyzing unstructured data was not possible
  • 104. How Hadoop solved the problems. Storing: Zions could now store massive amounts of data using Hadoop
  • 105. Processing: processing of unstructured data (like server logs, customer data, customer transactions) was now possible
  • 106. Analyzing: in-depth analysis of different data formats became easy and time efficient
  • 107. Detecting: the team could now detect everything from malware and spear phishing attempts to account takeovers
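
The word-count walkthrough on slides 80 to 84 (split, map, shuffle and sort, reduce) can be sketched in a few lines of plain Python. This is only an illustration of the idea, not Hadoop's actual API: the function names map_task, shuffle_and_sort, reduce_task and word_count are hypothetical, and a local thread pool stands in for map tasks running in parallel on separate nodes.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# The three input splits shown on slide 81.
splits = ["Bus Car Train", "Ship Ship Train", "Bus Ship Car"]

def map_task(chunk):
    """Map phase: emit a (word, 1) pair for every word in one split."""
    return [(word, 1) for word in chunk.split()]

def shuffle_and_sort(mapped):
    """Shuffle and sort: group all emitted values by key, keys in sorted order."""
    groups = defaultdict(list)
    for pairs in mapped:
        for word, count in pairs:
            groups[word].append(count)
    return dict(sorted(groups.items()))

def reduce_task(word, counts):
    """Reduce phase: aggregate the values collected for one key."""
    return word, sum(counts)

def word_count(chunks):
    # Assumption: threads stand in for map tasks on separate worker nodes.
    with ThreadPoolExecutor() as pool:
        mapped = list(pool.map(map_task, chunks))
    grouped = shuffle_and_sort(mapped)
    return dict(reduce_task(word, counts) for word, counts in grouped.items())

result = word_count(splits)
print(result)  # {'Bus': 2, 'Car': 2, 'Ship': 3, 'Train': 2}
```

The result matches the final output on slide 84. In a real cluster each stage runs on different machines and the shuffle moves data over the network, which this single-process sketch does not model.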

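
The HDFS example on slides 61 to 65 (a 30 TB file divided into 128 MB blocks that are then replicated) implies some simple arithmetic, sketched below. Two things here are assumptions rather than statements from the slides: sizes are treated as binary units (1 TB = 1024^4 bytes), and the replication factor of 3 is the usual HDFS default, which the slides do not state. The helper names are hypothetical.

```python
MB = 1024 ** 2
TB = 1024 ** 4

def hdfs_block_count(file_size_bytes, block_size_bytes=128 * MB):
    """Number of blocks needed for a file; the last block may be partly filled."""
    return -(-file_size_bytes // block_size_bytes)  # integer ceiling division

def raw_storage_bytes(file_size_bytes, replication=3):
    """Total bytes stored across the DataNodes once every block is replicated.

    Assumption: replication=3 is the common HDFS default, not a value from the slides.
    """
    return file_size_bytes * replication

file_size = 30 * TB
blocks = hdfs_block_count(file_size)
replicas = blocks * 3
print(blocks)                              # 245760 blocks of 128 MB
print(replicas)                            # 737280 block replicas at replication factor 3
print(raw_storage_bytes(file_size) // TB)  # 90 TB of raw storage
```

Under these assumptions the NameNode has to track 245,760 blocks for this one file, which is why it holds only metadata while the DataNodes hold the data itself.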