Predictive Analytics - Big Data & Artificial Intelligence

Demystify the following buzzwords:
• Big Data
• Artificial Intelligence (AI)
• Natural Language Processing (NLP)
• Predictive Analytics
Ultimate Goal: Predictive Analytics
Predict what users will want to buy.
Example: a consumer searches for a TV and, based on the data, we show a product that also has a high probability of being bought.
Evolution of Data Analytics
• Excel: What Happened?
• Business Intelligence (BI): What’s Happening?
• Big Data & Artificial Intelligence (2015 and beyond): What Will Happen? Big Data is where the data is stored; AI algorithms process that data to detect patterns, running on Central Processing Units (CPUs) / Graphics Processing Units (GPUs).
How Did We Get Here?
Traditional systems:
• Relational databases
• Gigabytes in size
• Low latency
Big data systems:
• Terabytes in size
• Custom hardware
Supervised Learning
We know what we are trying to predict. We use examples whose answers both we and the model know to “train” the model; it can then generate predictions for examples we don’t know the answer to.
Example: Predict the price of a house based on the size of the house.
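The house-price example is the classic supervised setup. Below is a minimal sketch, assuming invented (size, price) training pairs: a straight line is fit by ordinary least squares, then used to predict a size the model has not seen.

```python
# Minimal supervised-learning sketch: fit price = slope * size + intercept
# on labeled examples, then predict for an unseen size.
# All numbers are invented for illustration.

def fit_line(xs, ys):
    """Ordinary least squares for a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return slope, intercept

sizes = [1000, 1500, 2000, 2500]               # sq ft (training inputs)
prices = [200_000, 300_000, 400_000, 500_000]  # known answers ("labels")

slope, intercept = fit_line(sizes, prices)
predicted = slope * 1800 + intercept  # price for an unseen 1800 sq ft house
print(predicted)  # → 360000.0
```

Real models use many features and held-out test data, but the train-then-predict shape is the same.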
Unsupervised Learning
We don’t know what we are trying to predict. We are trying to identify naturally occurring patterns in the data which may be informative.
Example: Try to identify “clusters” of customers based on the data we have.
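A minimal sketch of the clustering idea: one-dimensional k-means with k = 2 on invented annual customer spend. The algorithm is given no labels; it only looks for groups.

```python
# Minimal unsupervised-learning sketch: 1-D k-means clustering.
# Spend figures and starting centers are invented for illustration.

def kmeans_1d(values, centers, iters=10):
    """Alternate assigning points to the nearest center and
    moving each center to the mean of its assigned points."""
    for _ in range(iters):
        groups = {c: [] for c in centers}
        for v in values:
            nearest = min(centers, key=lambda c: abs(v - c))
            groups[nearest].append(v)
        centers = [sum(g) / len(g) if g else c for c, g in groups.items()]
    return sorted(centers)

spend = [120, 150, 130, 900, 950, 1000]  # invented annual spend per customer
centers = kmeans_1d(spend, centers=[100, 1000])
print(centers)  # two cluster centers: low spenders vs. high spenders
```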
What is Deep Learning?
• Deep Learning and Neural Networks are often used synonymously
• It’s a branch of machine learning based on a set of algorithms that attempt to model high-level abstractions in data by using a deep graph with multiple processing layers, composed of multiple linear and non-linear transformations
[Figure: what we see vs. what the computer “sees”: the same image as a grid of numeric pixel values]
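The point of the comparison is that, to a computer, an image is only numbers. A tiny sketch with an invented 4×4 grayscale image:

```python
# What the computer "sees": a grayscale image is just a grid of pixel
# intensities (0 = black, 255 = white). Values are invented.
image = [
    [  0,   0, 255, 255],
    [  0,   0, 255, 255],
    [255, 255,   0,   0],
    [255, 255,   0,   0],
]

# A model consumes these raw numbers, often flattened into one vector.
flat = [px for row in image for px in row]
print(len(flat))  # → 16
```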
Tools of The Trade
Named after the yellow toy elephant of Doug Cutting’s son. In 2006, while working at Yahoo, Doug came up with the Hadoop framework. In 2008 it became a project of the open source Apache Software Foundation, hence the official name Apache Hadoop.
Hadoop to the Rescue
“an open source framework written in Java for storing and
processing massive amounts of data in a distributed manner”
2 Key Components of the Framework:
1. Storage – Hadoop Distributed File System (HDFS): a scalable file system that distributes and stores data across many machines in a cluster.
2. Analysis – MapReduce: a framework for processing that data in parallel across the cluster.
Hadoop can run on cheap commodity hardware, on premises or in the cloud.
HDFS stores files in large blocks (64 MB) across multiple machines for fault tolerance. By default, each block is stored on 3 separate machines.
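The storage arithmetic can be sketched directly. The 64 MB block size and 3-way replication come from the text; the 200 MB file size is invented:

```python
import math

BLOCK_MB = 64  # HDFS block size used in this deck
REPLICAS = 3   # default replication factor

def block_count(file_mb):
    """How many HDFS blocks a file of file_mb megabytes occupies."""
    return math.ceil(file_mb / BLOCK_MB)

file_mb = 200                  # invented example file
blocks = block_count(file_mb)  # 200 MB / 64 MB -> 4 blocks
copies = blocks * REPLICAS     # each block lives on 3 machines
print(blocks, copies)  # → 4 12
```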
MapReduce breaks large data processing problems into multiple steps, namely map and reduce tasks, that can be worked on in parallel on multiple machines. (A Job Tracker schedules the tasks; Task Trackers execute them on the worker machines, which also host the HDFS Data Nodes.)
MapReduce Store Sales Data
[Figure: a MapReduce job over store sales. Mappers running on the Data Nodes emit records keyed by city (LA, NYC); a shuffle-and-sort step groups records by key; Reducers aggregate each group. The Name Node tracks block locations, and the Job Tracker assigns work to Task Trackers.]
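The flow above can be sketched in memory: map emits (city, amount) pairs, shuffle-and-sort groups them by key, and reduce aggregates each group. The sales records are invented:

```python
from collections import defaultdict

# Invented (city, sale amount) records from two stores.
records = [("LA", 100), ("NYC", 250), ("LA", 75), ("NYC", 25), ("LA", 50)]

# Map: emit (key, value) pairs; here each record already is one.
mapped = [(city, amount) for city, amount in records]

# Shuffle and sort: group values by key, as the framework does
# between the map and reduce phases.
grouped = defaultdict(list)
for city, amount in mapped:
    grouped[city].append(amount)

# Reduce: aggregate each key's values (total sales per city).
totals = {city: sum(amounts) for city, amounts in grouped.items()}
print(totals)  # → {'LA': 225, 'NYC': 275}
```

In real Hadoop the map and reduce tasks run on different machines and the shuffle moves data over the network, but the logical phases are the same.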