www.edureka.co/big-data-and-hadoop
Is It the right time for me to Learn Hadoop ?
Find Out!
Slide 2Slide 2Slide 2 www.edureka.co/big-data-and-hadoop
At the end of the session, you will be able to:
 Understand Why Learn Hadoop?
 Know Advantages of Hadoop & its Predictions for 2015
 Discover Hadoop Career Path
 Understand how Companies are using Hadoop?
Agenda
Slide 3Slide 3Slide 3 www.edureka.co/big-data-and-hadoop
Why Hadoop?
Slide 4Slide 4Slide 4 www.edureka.co/big-data-and-hadoop
Rise of Big Data
 By 2020, IDC (International Data Corporation) predicts the number will have reached 40,000 EB, or 40 Zettabytes
(ZB)
The world’s information is doubling every two years. By 2020, there will be 5,200 GB of data for every person on
Earth.
0
1000
2000
3000
4000
5000
6000
7000
2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015
Unstructured Data
Structured Data Un-structured Data
Slide 5Slide 5Slide 5 www.edureka.co/big-data-and-hadoop
Application of Big Data
Source: Twitter
Slide 6Slide 6Slide 6 www.edureka.co/big-data-and-hadoop
Application of Big Data
Amazon handles 15 million
customer click stream user
data per day to
recommend products.
Stock market generates about one
terabyte of new trade data per day
to perform stock trading analytics to
determine trends for optimal trades.
294 billion emails sent every
day. Services analyse this data
to find the spams.
Systems / Enterprises generate huge amount of data from Terabytes to Petabytes of information
Slide 7Slide 7Slide 7 www.edureka.co/big-data-and-hadoop
Can’t use Big Data without Hadoop
Current Scenario:
 Unstructured Data is Exploding
 Organizations take fact based decisions
 The Bigger the data, accurate is the decision!
Conclusion:
The use of Big Data is essential
 To Enable the use of Big Data one needs “Hadoop”
Slide 8Slide 8Slide 8 www.edureka.co/big-data-and-hadoop
Advantages of Hadoop & its Predictions
Slide 9Slide 9Slide 9 www.edureka.co/big-data-and-hadoop
Advantages of Hadoop
F  Fast
F  Flexible
S  Scalable
CE  Cost Effective
FT  Fault Tolerant
F
CE
FT
S
F
Slide 10Slide 10Slide 10 www.edureka.co/big-data-and-hadoop
Feature Comparision
Structured Data Types Multi and Unstructured
Limited, No Data Processing Processing Processing coupled with Data
Standards & Structured Governance Loosely Structured
Required On Write Schema Required On Read
Reads are Fast Speed Writes are Fast
Software License Cost Support Only
Known Entity Resources Growing, Complexities, Wide
OLTP
Complex ACID Transactions
Operational Data Store
Best Fit Use Data Discovery
Processing Unstructured Data
Massive Storage/Processing
RDBMS HADOOP
Slide 11Slide 11Slide 11 www.edureka.co/big-data-and-hadoop
2015 Predictions for Hadoop!
Hadoop has been found not
guilty of being an over-
hyped open source platform!
Source: Forrester
Hadooponomics makes enterprise adoption mandatory
Enterprise
Adoption
The Hadoop skills shortage will disappear
Enterprise
Developers
Hadoop will become SQL enabled
SQL Featured
Hadoop
Integration with enterprise softwares – SAS, Teradata,
Talent etc.
Large Enterprise
Adoption
Hadoop Clusters in the cloud
Scalable Hadoop
Cluster
Beyond Analytics, it will become Application Platform
Expanding Horizon
More Hadoop Distributions will emerge by large
enterprise vendors like SAS, Oracle, IBM etc.
Increasing
Competition
Slide 12Slide 12Slide 12 www.edureka.co/big-data-and-hadoop
Hadoop Career Path
Slide 13Slide 13Slide 13 www.edureka.co/big-data-and-hadoop
Hadoop Career Path
• Java / Python / Ruby
• Hadoop Eco-system
• NoSQL DB
• Spark
• Linux Administration
• Cluster Management
• Cluster Performance
• Virtualization
• Statistics Skills
• Machine Learning
• Hadoop Essentials
• Expertise in R
Developers/Testers
Administrators
Data Analyst
Hadoop Developer
Hadoop Administrator
Data Scientist
Slide 14Slide 14Slide 14 www.edureka.co/big-data-and-hadoop
Job Trends
Slide 15Slide 15Slide 15 www.edureka.co/big-data-and-hadoop
Major Companies Using Hadoop
Slide 16Slide 16Slide 16 www.edureka.co/big-data-and-hadoop
How Companies are using Hadoop?
Slide 17Slide 17Slide 17 www.edureka.co/big-data-and-hadoop
Common Big Data Customer Scenarios
 Web and e-tailing
» Recommendation Engines
» Ad Targeting
» Search Quality
» Abuse and Click Fraud Detection
 Telecommunications
» Customer Churn Prevention
» Network Performance Optimization
» Calling Data Record (CDR) Analysis
» Analysing Network to Predict Failure
http://wiki.apache.org/hadoop/PoweredBy
Slide 18Slide 18Slide 18 www.edureka.co/big-data-and-hadoop
Common Big Data Customer Scenarios
 Government
» Fraud Detection and Cyber Security
» Welfare Schemes
» Justice
 Healthcare and Life Sciences
» Health Information Exchange
» Gene Sequencing
» Serialization
» Healthcare Service Quality Improvements
» Drug Safety
http://wiki.apache.org/hadoop/PoweredBy
www.edureka.co/big-data-and-hadoop
Demo
Slide 20Slide 20Slide 20 www.edureka.co/big-data-and-hadoop
The Big Question!
Is it the right time for me to learn Hadoop?
Slide 21Slide 21Slide 21 www.edureka.co/big-data-and-hadoop
The Big Question!
Is it the right time for me to learn Hadoop?
Answer – Yes, it’s Now or Never!
Reasons:
1. Hadoop has proved its worth
2. Large Enterprises are adopting Hadoop
3. Hadoop skill Shortage will disappear. Learn Before its too late
4. Handsome paid opportunities
Questions
Slide 22 www.edureka.co/big-data-and-hadoop
Slide 23
Your feedback is important to us, be it a compliment, a suggestion or a complaint. It helps us to make
the course better!
Please spare few minutes to take the survey after the webinar.
www.edureka.co/big-data-and-hadoop
Survey
Is It A Right Time For Me To Learn Hadoop. Find out ?

Is It A Right Time For Me To Learn Hadoop. Find out ?

  • 1.
    www.edureka.co/big-data-and-hadoop Is It theright time for me to Learn Hadoop ? Find Out!
  • 2.
    Slide 2Slide 2Slide2 www.edureka.co/big-data-and-hadoop At the end of the session, you will be able to:  Understand Why Learn Hadoop?  Know Advantages of Hadoop & its Predictions for 2015  Discover Hadoop Career Path  Understand how Companies are using Hadoop? Agenda
  • 3.
    Slide 3Slide 3Slide3 www.edureka.co/big-data-and-hadoop Why Hadoop?
  • 4.
    Slide 4Slide 4Slide4 www.edureka.co/big-data-and-hadoop Rise of Big Data  By 2020, IDC (International Data Corporation) predicts the number will have reached 40,000 EB, or 40 Zettabytes (ZB) The world’s information is doubling every two years. By 2020, there will be 5,200 GB of data for every person on Earth. 0 1000 2000 3000 4000 5000 6000 7000 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 Unstructured Data Structured Data Un-structured Data
  • 5.
    Slide 5Slide 5Slide5 www.edureka.co/big-data-and-hadoop Application of Big Data Source: Twitter
  • 6.
    Slide 6Slide 6Slide6 www.edureka.co/big-data-and-hadoop Application of Big Data Amazon handles 15 million customer click stream user data per day to recommend products. Stock market generates about one terabyte of new trade data per day to perform stock trading analytics to determine trends for optimal trades. 294 billion emails sent every day. Services analyse this data to find the spams. Systems / Enterprises generate huge amount of data from Terabytes to Petabytes of information
  • 7.
    Slide 7Slide 7Slide7 www.edureka.co/big-data-and-hadoop Can’t use Big Data without Hadoop Current Scenario:  Unstructured Data is Exploding  Organizations take fact based decisions  The Bigger the data, accurate is the decision! Conclusion: The use of Big Data is essential  To Enable the use of Big Data one needs “Hadoop”
  • 8.
    Slide 8Slide 8Slide8 www.edureka.co/big-data-and-hadoop Advantages of Hadoop & its Predictions
  • 9.
    Slide 9Slide 9Slide9 www.edureka.co/big-data-and-hadoop Advantages of Hadoop F  Fast F  Flexible S  Scalable CE  Cost Effective FT  Fault Tolerant F CE FT S F
  • 10.
    Slide 10Slide 10Slide10 www.edureka.co/big-data-and-hadoop Feature Comparision Structured Data Types Multi and Unstructured Limited, No Data Processing Processing Processing coupled with Data Standards & Structured Governance Loosely Structured Required On Write Schema Required On Read Reads are Fast Speed Writes are Fast Software License Cost Support Only Known Entity Resources Growing, Complexities, Wide OLTP Complex ACID Transactions Operational Data Store Best Fit Use Data Discovery Processing Unstructured Data Massive Storage/Processing RDBMS HADOOP
  • 11.
    Slide 11Slide 11Slide11 www.edureka.co/big-data-and-hadoop 2015 Predictions for Hadoop! Hadoop has been found not guilty of being an over- hyped open source platform! Source: Forrester Hadooponomics makes enterprise adoption mandatory Enterprise Adoption The Hadoop skills shortage will disappear Enterprise Developers Hadoop will become SQL enabled SQL Featured Hadoop Integration with enterprise softwares – SAS, Teradata, Talent etc. Large Enterprise Adoption Hadoop Clusters in the cloud Scalable Hadoop Cluster Beyond Analytics, it will become Application Platform Expanding Horizon More Hadoop Distributions will emerge by large enterprise vendors like SAS, Oracle, IBM etc. Increasing Competition
  • 12.
    Slide 12Slide 12Slide12 www.edureka.co/big-data-and-hadoop Hadoop Career Path
  • 13.
    Slide 13Slide 13Slide13 www.edureka.co/big-data-and-hadoop Hadoop Career Path • Java / Python / Ruby • Hadoop Eco-system • NoSQL DB • Spark • Linux Administration • Cluster Management • Cluster Performance • Virtualization • Statistics Skills • Machine Learning • Hadoop Essentials • Expertise in R Developers/Testers Administrators Data Analyst Hadoop Developer Hadoop Administrator Data Scientist
  • 14.
    Slide 14Slide 14Slide14 www.edureka.co/big-data-and-hadoop Job Trends
  • 15.
    Slide 15Slide 15Slide15 www.edureka.co/big-data-and-hadoop Major Companies Using Hadoop
  • 16.
    Slide 16Slide 16Slide16 www.edureka.co/big-data-and-hadoop How Companies are using Hadoop?
  • 17.
    Slide 17Slide 17Slide17 www.edureka.co/big-data-and-hadoop Common Big Data Customer Scenarios  Web and e-tailing » Recommendation Engines » Ad Targeting » Search Quality » Abuse and Click Fraud Detection  Telecommunications » Customer Churn Prevention » Network Performance Optimization » Calling Data Record (CDR) Analysis » Analysing Network to Predict Failure http://wiki.apache.org/hadoop/PoweredBy
  • 18.
    Slide 18Slide 18Slide18 www.edureka.co/big-data-and-hadoop Common Big Data Customer Scenarios  Government » Fraud Detection and Cyber Security » Welfare Schemes » Justice  Healthcare and Life Sciences » Health Information Exchange » Gene Sequencing » Serialization » Healthcare Service Quality Improvements » Drug Safety http://wiki.apache.org/hadoop/PoweredBy
  • 19.
  • 20.
    Slide 20Slide 20Slide20 www.edureka.co/big-data-and-hadoop The Big Question! Is it the right time for me to learn Hadoop?
  • 21.
    Slide 21Slide 21Slide21 www.edureka.co/big-data-and-hadoop The Big Question! Is it the right time for me to learn Hadoop? Answer – Yes, it’s Now or Never! Reasons: 1. Hadoop has proved its worth 2. Large Enterprises are adopting Hadoop 3. Hadoop skill Shortage will disappear. Learn Before its too late 4. Handsome paid opportunities
  • 22.
  • 23.
    Slide 23 Your feedbackis important to us, be it a compliment, a suggestion or a complaint. It helps us to make the course better! Please spare few minutes to take the survey after the webinar. www.edureka.co/big-data-and-hadoop Survey