Copyright © 2017, edureka and/or its affiliates. All rights reserved.
Agenda
What Is Artificial Intelligence?
What Is Machine Learning?
Limitations Of Machine Learning
Deep Learning To The Rescue
What Is Deep Learning?
Deep Learning Applications
Agenda
Hadoop Introduction
Hadoop Ecosystem
Hadoop Use-cases
Demo
Hadoop Introduction
Hadoop is a framework that allows us to store and process large data sets in a parallel and distributed fashion.
HDFS (Storage): allows us to dump any kind of data across the cluster
YARN (Processing): allows parallel processing of the data stored in HDFS
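The split between storage and parallel processing can be illustrated with a small local sketch. It does not use Hadoop at all: it only imitates a MapReduce-style word count in plain Python, with a list of strings standing in for data blocks spread across a cluster. The names `blocks`, `map_block`, and `reduce_counts` are invented for this illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce

# Hypothetical stand-in for data blocks stored on different nodes.
blocks = [
    "big data needs big storage",
    "hadoop stores data in blocks",
    "blocks are processed in parallel",
]

def map_block(block: str) -> Counter:
    """Map phase: count the words inside one block."""
    return Counter(block.split())

def reduce_counts(left: Counter, right: Counter) -> Counter:
    """Reduce phase: merge two partial counts into one."""
    return left + right

# Each block is handed to a worker, mimicking per-block parallelism.
with ThreadPoolExecutor() as pool:
    partial_counts = list(pool.map(map_block, blocks))

word_counts = reduce(reduce_counts, partial_counts, Counter())
print(word_counts.most_common(3))
```

On a real cluster the blocks would live in HDFS and the workers would be scheduled across nodes; here threads on one machine play that role.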
Hadoop Ecosystem
Use-Cases
Recommendations
Managing Reviews using NLP
ISIS Tweet Network Analysis
Netflix Use-Case
Netflix
Recommendation Engine
80% of views come from recommendations
Recommendations are driven by Machine Learning Algorithms
Continuous A/B Testing
Transformers
The Item Transformer
➢ Extends Spark ML Transformer
➢ Accepts DMC-12 DataFrame with contextual information
➢ Transforms DataFrame at the item level
Processes Using DataFrames
➢ Multithreaded Model Training
➢ Distributed Model Training
TripAdvisor Use-Case
TripAdvisor
➢ Covers almost all parts of the world
➢ One of the best platforms for hotel reviews
TripAdvisor
➢ Dataset Generation
➢ Training
➢ Application
ISIS Tweet Use-Case
ISIS Tweets
Goals
➢ Social Network Cluster Analysis
➢ Keyword Analysis
➢ Data Categorization of Links
➢ Sentiment Analysis
➢ Timeline View
ISIS Tweet Analysis
Transforming Data
Filtration
Visualizations
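The transform, filter, and analyse steps above can be sketched on a few made-up records. This is a minimal illustration in plain Python, not the pipeline used in the demo: the record layout, the stop-word list, and the tracked keywords are all assumptions, and a printed keyword count stands in for the visualization step.

```python
import re
from collections import Counter

# Hypothetical sample records; the real dataset's columns are not given in the slides.
tweets = [
    {"user": "a", "text": "Breaking NEWS: report published http://example.com/1"},
    {"user": "b", "text": "RT @a: Breaking news report"},
    {"user": "c", "text": "Weather is nice today"},
]

STOPWORDS = {"is", "rt", "the", "a"}  # assumed stop-word list

def transform(text):
    """Transforming data: lower-case, drop URLs and mentions, split into words."""
    cleaned = re.sub(r"https?://\S+|@\w+", " ", text.lower())
    return re.findall(r"[a-z]+", cleaned)

def keep(words, keywords):
    """Filtration: keep only tweets that mention a tracked keyword."""
    return any(word in keywords for word in words)

tracked = {"news", "report"}  # assumed keywords of interest
tokenised = [transform(tweet["text"]) for tweet in tweets]
filtered = [words for words in tokenised if keep(words, tracked)]

# Keyword analysis: word frequencies over the filtered tweets.
keyword_counts = Counter(
    word for words in filtered for word in words if word not in STOPWORDS
)
print(keyword_counts.most_common(3))
```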
Demo: Travel Sector Use-Case
Travel Sector
Find the top 20 most frequently travelled destinations
Find the top 20 locations people travel from
Find the top 20 destinations by air revenue
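The three queries above all reduce to "group, aggregate, take the top N". A minimal sketch in plain Python follows, assuming each booking is an `(origin, destination, air_revenue)` tuple; that layout, the sample values, and the helper `top_n` are hypothetical and not taken from the demo dataset.

```python
from collections import Counter

# Hypothetical booking records: (origin, destination, air_revenue).
bookings = [
    ("DEL", "BOM", 120.0),
    ("BLR", "BOM", 95.0),
    ("DEL", "GOI", 150.0),
    ("BOM", "GOI", 80.0),
    ("DEL", "BLR", 60.0),
]

def top_n(counter, n=20):
    """Return the n keys with the highest totals, largest first."""
    return counter.most_common(n)

destination_trips = Counter()    # trips arriving at each destination
origin_trips = Counter()         # trips leaving from each location
destination_revenue = Counter()  # air revenue summed per destination

for origin, destination, revenue in bookings:
    destination_trips[destination] += 1
    origin_trips[origin] += 1
    destination_revenue[destination] += revenue

print(top_n(destination_trips))
print(top_n(origin_trips))
print(top_n(destination_revenue))
```

On a cluster the same grouping would run as a distributed job over data in HDFS; the aggregation logic stays the same.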
Data Warehouse
➢ A Data Warehouse is like a relational database designed for analytical needs.
➢ It functions on the basis of OLAP (Online Analytical Processing).
➢ It is a central location where consolidated data from multiple locations (databases) is stored.
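An OLAP-style query over consolidated data can be sketched with Python's built-in `sqlite3` module. SQLite is not a data warehouse; it is used here only because it runs anywhere, and the `sales` table, its columns, and the sample rows are hypothetical.

```python
import sqlite3

# In-memory database standing in for a warehouse (an assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, year INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("north", 2016, 100.0),  # rows consolidated from one source database
        ("north", 2017, 150.0),
        ("south", 2016, 80.0),   # rows consolidated from another source database
        ("south", 2017, 120.0),
    ],
)

# Roll-up: total sales per region across all years.
rollup = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
print(rollup)
conn.close()
```

The query aggregates across many rows rather than reading or updating a single record, which is the access pattern analytical stores are designed for.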

Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | Edureka
