www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
What is Hadoop?
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
What are we going to learn?
Big Data Introduction
Big Data Domains
Big Data Job Trends
Edureka Big Data
Certification Courses
1
2
6
Big Data Learning Path
5
Big Data Career Path
3 4
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
E x p l o d i n g G l o b a l D a t a
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Fun Facts about Global Data
6.1 Billion Global Smartphone Users by 2020
In 5 yearsthere will be over 50 Billion Smart Connected Devices in the World
2.5 Quintillion Bytes of Data is Created Everyday
Exabyte
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Data Generated Every Minute
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
W h a t i s B i g D a t a ?
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
“23 Exabytes of information was
recorded and replicated in 2002.
We now record and transfer that
much information every 7 days”
“Big data is the term for a collection
of data sets so large and complex
that it becomes difficult to process
using on-hand database
management tools or traditional
data processing applications”
Traditional System
What is Big Data?
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
5 V ’s D e f i n i t i o n
of
B i g D a t a
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
5 V’s of Big Data
Volume
Variety
Velocity
Value
Veracity
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Volume
Variety
Velocity
Value
Veracity
Different kinds of data is being generated from various sources
5 V’s of Big Data
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Volume
Variety
Velocity
Value
Veracity
Data is being generated at an alarming rate
5 V’s of Big Data
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Volume
Variety
Velocity
Value
Veracity
Value
?
Mechanism to bring the correct meaning out of the data
5 V’s of Big Data
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Volume
Variety
Velocity
Value
Veracity
Uncertainty and inconsistencies in the data
5 V’s of Big Data
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
B i g D a t a D o m a i n s
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Domains
B I G D ATA A P P L I C AT I O N D O M A I N S
Web & E - Tailing Tele - Communication Government
Healthcare Finance & Banking Retail
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Domains
Web and E-tailing
Search Quality
Recommendation Engines
Ad Targeting
Abuse & Click
Fraud Detection
Search Quality
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Domains
Telecommunications
Search Quality
Customer Churn
Prevention
Calling Data
Record Analysis
Analyzing Network
to Predict Failure
Network Performance
Optimization
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Domains
Government
Political Campaigns
Welfare Schemes
Fraud Detection &
Cyber Security
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Domains
Healthcare & Life Sciences
Search Quality
Health Information Exchange
Healthcare Service
Quality Improvements
Drug Safety
Gene Sequencing
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Domains
Banks and Financial services
Search Quality
Fraud Detection
Modeling True Risk
Credit Scoring and Analysis
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Domains
Welfare Schemes
Sentiment Analysis
Customer Churn Analysis
Point of Sales
Transaction Analysis
Retail
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
C o m p a n i e s L e v e r a g i n g B i g D a t a
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Companies using Big Data Technologies
 30 nodes running HDFS
 Hadoop and HBase on both
production & development
 15 nodes cluster
 Each node has 8 cores 16 GB
RAM & 1.4 TB storage
 532 nodes & 5.3 PB storage
 Uses MapReduce, Pig, Hive
& HBase
 Uses Hadoop to store internal log
and dimension data sources
 Currently has 2 clusters:
a) 1100 nodes – 12 PB storage
b) 300 nodes – 3 PB Storage
 Uses Hadoop to store and process
tweets, log files, etc.
 Uses Pig for scheduled and ad-hoc
jobs
 More than 40,000 computers
running Hadoop
 Used to support research for Ad
Systems and Web Search
C O M P A N I E S L E V E R A G I N G B I G D A T A & H A D O O P
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
B i g D a t a J o b Tr e n d s
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Job Trends
Source: Google Trends
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Job Trends
Source: Google Trends
Big Data Developer Big Data Architect
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data & Hadoop Job Salary
0 50000 100000 150000
Hadoop Developer
Big Data Developer
Hadoop Administrator
Big Data Engineer
Hadoop Developer…
Big Data Architect
Data Scientist
Hadoop Architect
Data Analytics Engineer
Avg. Salary (USD)
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
B i g D a t a Te c h n o l o g i e s I n D e m a n d
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data Technologies in Demand
Apache Spark
Hadoop Admin
Apache Kafka
Apache Cassandra
Mongo DB
Data Science
Informatica
Talend
Hadoop Developer
Big Data
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
C a r e e r s i n B i g D a t a & H a d o o p
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Careers in Big Data & Hadoop
Big Data
Big Data Developer
Data Analyst
Data Scientist
Hadoop
Administration
Big Data Architect
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Careers in Big Data & Hadoop
Big Data
Big Data Developer
Data Analyst
Data Scientist
Hadoop
Administration
Big Data Architect
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Careers in Big Data & Hadoop
Big Data
Big Data Developer
Data Analyst
Data Scientist
Hadoop
Administration
Big Data Architect
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Careers in Big Data & Hadoop
Big Data
Big Data Developer
Data Analyst
Data Scientist
Hadoop
Administration
Big Data Architect
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Careers in Big Data & Hadoop
Big Data
Big Data Developer
Data Analyst
Data Scientist
Hadoop
Administration
Big Data Architect
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data
Architect
Data Scientist
Hadoop
Admin
Data Analyst
Big Data
Developer
Design, implement and
integrate Big Data
solutions within the IT
enterprise
Careers in Big Data & Hadoop
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data
Architect
Data ScientistHadoop
Admin
Data Analyst
Big Data
Developer
Design, implement and
integrate Big Data
solutions within the IT
enterprise
Analyze and interpret
complex digital data in
order to assist a business
in its decision-making
Careers in Big Data & Hadoop
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data
Architect
Data Scientist
Hadoop
Admin
Data Analyst
Big Data
Developer
Design, implement and
integrate Big Data
solutions within the IT
enterprise
Analyze and interpret
complex digital data in
order to assist a business
in its decision-making
Administers and manages
Hadoop clusters and all other
resources in the entire
Hadoop ecosystem
Careers in Big Data & Hadoop
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data
Architect
Data Scientist
Hadoop
Admin
Data Analyst
Big Data
Developer
Design, implement and
integrate Big Data
solutions within the IT
enterprise
Analyze and interpret
complex digital data in
order to assist a business
in its decision-making
Administers and manages
Hadoop clusters and all other
resources in the entire
Hadoop ecosystem
Analyses different
types of data and
relationships among
data elements within
a system
Careers in Big Data & Hadoop
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Big Data
Architect
Data Scientist
Hadoop
Admin
Data Analyst
Big Data
Developer
Design, implement and
integrate Big Data
solutions within the IT
enterprise
Analyze and interpret
complex digital data in
order to assist a business
in its decision-making
Administers and manages
Hadoop clusters and all other
resources in the entire
Hadoop ecosystem
Analyses different
types of data and
relationships among
data elements within
a system
Develops applications
that deal with big data
Careers in Big Data & Hadoop
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Why you should learn Big Data & Hadoop?
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
B i g D a t a L e a r n i n g Pa t h
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
• Java/Python/Ruby
• Hadoop Eco-system
• NoSQL DB
• Spark
• Linux Administration
• Cluster Management
• Cluster Performance
• Virtualization
• Statistics Skills
• Data Science
• Hadoop Essentials
• Expertise in R
Developer/Testing
Administration
Data Analyst
Big Data and Hadoop
MapReduce
Design Patterns
Apache
Spark & Scala
Apache Cassandra
Linux Administration Hadoop Administration
Data Science
Business Analytics
Using R
Advance Predictive
Modelling in R
Talend for Big Data
Data Visualization
Using Tableau
Big Data Learning Path
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
E d u r e k a
B i g D a t a C e r t i f i c a t i o n C o u r s e s
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Data Analytics with R
Certification Training
Big Data Hadoop
C Certification Training
Data Science Certification
Training
 Real Time Projects
 Hadoop Ecosystem Tools
 Hands-on Assignment
Hadoop Administration
Ce Certification Training
 Cluster Setup Projects
 Hadoop Architecture
 Hands-on Assignment
 Data Mining Techniques
 Data Analytics & R
 Real Time Projects
 Data Transformation
Tools
 Hadoop, Machine
Learning & R
 Real Time Projects
 Real Time Projects
 Spark & Scala Concepts
 Hands-on Assignment
Apache Spark
Certification Training
Edureka Big Data Certification Courses
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Salient Features
Live Online Class
Hands-on Experience
24x7 Support
Real Time Projects
Module Wise Assessment
www.edureka.co/big-data-and-hadoopEDUREKA HADOOP CERTIFICATION TRAINING
Thank You…
Questions/Queries/Feedback

Big Data Career Path | Big Data Learning Path | Hadoop Tutorial | Edureka