SlideShare a Scribd company logo
1 of 35
Download to read offline
1
Oct 2016
NVIDIA DEEP LEARNING
2
ENTERPRISE AUTOGAMING DATA CENTERPRO VISUALIZATION
THE WORLD LEADER IN VISUAL COMPUTING
3
THE BIG BANG IN MACHINE LEARNING
DNN GPUBIG DATA
100 hours of video
uploaded every
minute
350 millions
images uploaded
per day
2.5 Petabytes of
customer data
hourly
0.0
0.5
1.0
1.5
2.0
2.5
3.0
2008 2009 2010 2011 2012 2013 2014
NVIDIA GPU x86 CPU
TFLOPS
4
BIG DATA & ANALYTICS
AUTOMOTIVE
Auto sensors reporting
location, problems
COMMUNICATIONS
Location-based advertising
CONSUMER PACKAGED GOODS
Sentiment analysis of
what’s hot, problems
$
FINANCIAL SERVICES
Risk & portfolio analysis
New products
EDUCATION & RESEARCH
Experiment sensor analysis
HIGH TECHNOLOGY /
INDUSTRIAL MFG.
Mfg. quality
Warranty analysis
LIFE SCIENCES
Clinical trials
MEDIA/ENTERTAINMENT
Viewers / advertising
effectiveness
ON-LINE SERVICES /
SOCIAL MEDIA
People & career matching
HEALTH CARE
Patient sensors,
monitoring, EHRs
OIL & GAS
Drilling exploration sensor
analysis
RETAIL
Consumer sentiment
TRAVEL &
TRANSPORTATION
Sensor analysis for
optimal traffic flows
UTILITIES
Smart Meter analysis
for network capacity,
LAW ENFORCEMENT
& DEFENSE
Threat analysis - social media
monitoring, photo analysis
5
EXPONENTIAL DATA GROWTH
INCREASING DATA VARIETY
Search
Marketing
Behavioral
Targeting
Dynamic
Funnels
User
Generated
Content
Mobile Web
SMS/MMS
Sentiment
HD Video
Speech To
Text
Product/
Service Logs
Social
Network
Business
Data Feeds
User Click
Stream
Sensors Infotainment
Systems
Wearable
Devices
Cyber
Security Logs
Connected
Vehicles
Machine
Data
IoT Data
Dynamic
Pricing
Payment
Record
Purchase
Detail
Purchase
Record
Support
Contacts
Segmentation
Offer
Details
Web
Logs
Offer
History
A/B
Testing
BUSINESS
PROCESS
PETABYTESTERABYTESGIGABYTESEXABYTESZETTABYTES
Streaming
Video
Natural
Language
Processing
WEB
DIGITAL
AI
90% of the world’s
data created in the
last year - IBM
6
7
WHAT IS DEEP LEARNING?
ARTIFICAL
INTELLIGENCE MACHINE
LEARNING
DEEP LEARNINGPerception
Reasoning
Planning
Optimization
Computational
Statistics
Supervised and
Unsupervised Learning
Neural networks
Distributed Representations
Hierarchical Explanatory Factors
Unsupervised Feature Engineering
8
DEEP LEARNING FUELING DISCOVERY
Classify Satellite Images for
Carbon Monitoring
Analyze Obituaries on the Web for
Cancer-related Discoveries
Determine Drug Treatments to Increase
Child’s Chance of Survival
NASA AMES
9
DEEP LEARNING FOR EVERY APPLICATION
Visual search for
e-commerce
Visual Search in
Geoinformatics
Improving Agriculture:
LettuceBot only
sprays weeds
10
Language Classification
Deep Learning CNN
Super-Human Language Translation
DEEP LEARNING FOR EVERY APPLICATION
11
DEEP LEARNING FOR EVERY APPLICATION
12
CONSUMERS LOVE DEEP LEARNING
13
MORE THAN 1,500 AI START UPS
AROUND THE WORLD
Deep Learning
for Art
Deep Learning for
Cybersecurity
Deep Learning for
Genomics
Deep Learning for
Self-Driving Cars
14
IMAGENET CHALLENGE
Where it all started … again
bird
frog
person
hammer
flower pot
power drill
person
car
helmet
motorcycle
person
dog
chair
1.2M training images • 1000 object categories
Challenge
15
ACHIEVING SUPERHUMAN PERFORMANCE
2012: Deep Learning
researchers
worldwide discover GPUs
2016: Microsoft achieves
speech recognition
milestone
2015: ImageNet — Deep
Learning achieves
superhuman image
recognition
16
DEEP LEARNING ADOPTION IS EXPONENTIAL
# of Organizations Using Deep Learning
Source: Jeff Dean, Spark Summit 2016
17
MASSIVE COMPUTING CHALLENGE
SPEECH RECOGNITION
2014
Deep Speech 1
80 GFLOP
7,000 hrs of Data
~8% Error
465 GFLOP
12,000 hrs of
Data
~5% Error
2015
Deep Speech 2
10X
Training Ops
IMAGE RECOGNITION
2012
AlexNet
8 Layers
1.4 GFLOP
~16% Error
152 Layers
22.6 GFLOP
~3.5% Error
2015
ResNet
16X
Model
18
Device
NVIDIA DEEP LEARNING PLATFORM
TRAINING
DIGITS Training System
Deep Learning Frameworks
Tesla P100, DGX1
DATACENTER INFERENCING
DeepStream SDK
TensorRT
Tesla P40 & P4
19
Device
NVIDIA DEEP LEARNING PLATFORM
TRAINING DATACENTER INFERENCING
Training: comparing to Kepler GPU in 2013 using Caffe, Inference: comparing img/sec/watt to CPU: Intel E5-2697v4 using AlexNet
65Xin 3 years
Tesla P100
40Xvs CPU
Tesla P4
20
40x Efficient vs CPU, 8x Efficient vs FPGA
0
50
100
150
200
AlexNet
CPU FPGA 1x M4 (FP32) 1x P4 (INT8)
Images/Sec/Watt
Maximum Efficiency for Scale-out Servers
TESLA P4
5.5 TFLOPS
0
20,000
40,000
60,000
80,000
100,000
GoogLeNet AlexNet
8x M40 (FP32) 8x P40 (INT8)TESLA P40
Highest Throughput for Scale-up Servers
Images/Sec
4x Boost in Less than One Year
21
INTRODUCING TESLA P100
Page Migration Engine
Virtually Unlimited Memory
CoWoS HBM2
3D Stacked Memory (i.e fast!)
NVLink
GPU Interconnect for
Maximum Scalability
22
NVIDIA DGX-1
AI Supercomputer-in-a-Box
170 TFLOPS | 8x Tesla P100 16GB | NVLink Hybrid Cube Mesh
2x Xeon | 8 TB RAID 0 | Quad IB 100Gbps, Dual 10GbE | 3U — 3200W
23
Instant productivity — plug-and-
play, supports every AI framework
Performance optimized across
the entire stack
Always up-to-date via the cloud
Mixed framework environments
—containerized
Direct access to NVIDIA experts
DGX STACK
Fully integrated Deep Learning platform
24
NVIDIA POWERS DEEP LEARNING
Every major DL framework leverages NVIDIA SDKs
Mocha.jl
NVIDIA DEEP LEARNING SDK
COMPUTER VISION SPEECH & AUDIO NATURAL LANGUAGE PROCESSING
OBJECT
DETECTION
IMAGE
CLASSIFICATION
VOICE
RECOGNITION
LANGUAGE
TRANSLATION
RECOMMENDATION
ENGINES
SENTIMENT
ANALYSIS
25
NVIDIA DIGITS
Interactive Deep Learning GPU Training System
Interactive deep neural network development
environment for image classification and object
detection
Schedule, monitor, and manage neural network training
jobs
Analyze accuracy and loss in real time
Track datasets, results, and trained neural networks
Scale training jobs across multiple GPUs automatically
26
NVIDIA cuDNN
Accelerating Deep Learning
High performance building blocks for deep learning
frameworks
Drop-in acceleration for widely used deep learning
frameworks such as Caffe, CNTK, Tensorflow, Theano,
Torch and others
Accelerates industry vetted deep learning algorithms, such
as convolutions, LSTM, fully connected, and pooling layers
Fast deep learning training performance tuned for NVIDIA
GPUs
Deep Learning Training Performance
Caffe AlexNet
Speed-upofImages/SecvsK40in2013
K40 K80 +
cuDN…
M40 +
cuDNN4
P100 +
cuDNN5
0x
10x
20x
30x
40x
50x
60x
70x
80x
“ NVIDIA has improved the speed of cuDNN
with each release while extending the
interface to more operations and devices
at the same time.”
— Evan Shelhamer, Lead Caffe Developer, UC Berkeley
AlexNet training throughput on CPU: 1x E5-2680v3 12 Core 2.5GHz.
128GB System Memory, Ubuntu 14.04
M40 bar: 8x M40 GPUs in a node, P100: 8x P100 NVLink-enabled
27
0 50 100 150 200 250 300
P40
P4
1x CPU (14 cores)
Inference Execution Time (ms)
11 ms
6 ms
User Experience: Instant Response
45x Faster with Pascal + TensorRT
Faster, more responsive AI-powered services such as voice recognition, speech translation
Efficient inference on images, video, & other data in hyperscale production data centers
INTRODUCING NVIDIA TensorRT
High Performance Inference Engine
260 ms
Training
Device
Datacenter
28
NVIDIA DEEPSTREAM SDK
Delivering Video Analytics at Scale
Inference
Preprocess
Hardware
Decode
“Boy playing soccer”
Simple, high performance API for analyzing video
Decode H.264, HEVC, MPEG-2, MPEG-4, VP9
CUDA-optimized resize and scale
TensorRT
0
20
40
60
80
100
1x Tesla P4 Server +
DeepStream SDK
13x E5-2650 v4 Servers
ConcurrentVideoStreams
Concurrent Video Streams Analyzed
29
“Billions of intelligent devices will take advantage of deep learning to provide
personalization and localization as GPUs become faster and faster over the next
several years.” — Tractica
BILLIONS OF INTELLIGENT DEVICES
30
SMART CITIES OF THE FUTURE
“Pittsburgh's "predictive policing" program … police car laptops will display maps
showing locations where crime is likely to occur, based on data-crunching
algorithms developed by scientists at Carnegie Mellon University — Science
31
ACCELERATED ANALYTICS TECHNOLOGY
32
GPU-ACCELERATION HAS NO LIMITS
MapD
MapD is 55x to 1,000x faster than
comparable CPU databases on billion+
row datasets
Kinetica
Hardware costs that are 1⁄10 that of
standard in-memory databases
BlazeGraph
200-300x speed-up
Graphistry
See 100x more data at millisecond
speed
SQream
The supercomputing powers of the GPU combined with SQream’s patented
technology, results in up to 100 times faster analytics performance on terabyte-
petabyte scale data sets
33
MASSIVE SCALE GPU ACCELERATED ANALYTICS
DEA theft of Silk Road bitcoinsSIEM attack escalationTwitter botnet deconstruction
34
GETTING STARTED WITH DEEP LEARNING
developer.nvidia.com/deep-learning
35
Thank you!

More Related Content

What's hot

What's hot (20)

LLMs_talk_March23.pdf
LLMs_talk_March23.pdfLLMs_talk_March23.pdf
LLMs_talk_March23.pdf
 
Deep learning ppt
Deep learning pptDeep learning ppt
Deep learning ppt
 
AI Hardware Landscape 2021
AI Hardware Landscape 2021AI Hardware Landscape 2021
AI Hardware Landscape 2021
 
An introduction to Deep Learning
An introduction to Deep LearningAn introduction to Deep Learning
An introduction to Deep Learning
 
Deep Learning Explained
Deep Learning ExplainedDeep Learning Explained
Deep Learning Explained
 
Generative AI
Generative AIGenerative AI
Generative AI
 
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
 
Deep neural networks
Deep neural networksDeep neural networks
Deep neural networks
 
Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2Latent diffusions vs DALL-E v2
Latent diffusions vs DALL-E v2
 
Intro to deep learning
Intro to deep learning Intro to deep learning
Intro to deep learning
 
FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning FPGA Hardware Accelerator for Machine Learning
FPGA Hardware Accelerator for Machine Learning
 
1.Introduction to deep learning
1.Introduction to deep learning1.Introduction to deep learning
1.Introduction to deep learning
 
Generative adversarial networks
Generative adversarial networksGenerative adversarial networks
Generative adversarial networks
 
Deep Learning Workflows: Training and Inference
Deep Learning Workflows: Training and InferenceDeep Learning Workflows: Training and Inference
Deep Learning Workflows: Training and Inference
 
What is Deep Learning | Deep Learning Simplified | Deep Learning Tutorial | E...
What is Deep Learning | Deep Learning Simplified | Deep Learning Tutorial | E...What is Deep Learning | Deep Learning Simplified | Deep Learning Tutorial | E...
What is Deep Learning | Deep Learning Simplified | Deep Learning Tutorial | E...
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative Models
 
Deep learning presentation
Deep learning presentationDeep learning presentation
Deep learning presentation
 
Deep learning
Deep learningDeep learning
Deep learning
 
Exploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion ModelsExploring Generating AI with Diffusion Models
Exploring Generating AI with Diffusion Models
 
Neural networks and deep learning
Neural networks and deep learningNeural networks and deep learning
Neural networks and deep learning
 

Similar to Introduction to Deep Learning (NVIDIA)

abelbrownnvidiarakuten2016-170208065814 (1).pptx
abelbrownnvidiarakuten2016-170208065814 (1).pptxabelbrownnvidiarakuten2016-170208065814 (1).pptx
abelbrownnvidiarakuten2016-170208065814 (1).pptx
gopikahari7
 
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
Willy Marroquin (WillyDevNET)
 

Similar to Introduction to Deep Learning (NVIDIA) (20)

abelbrownnvidiarakuten2016-170208065814 (1).pptx
abelbrownnvidiarakuten2016-170208065814 (1).pptxabelbrownnvidiarakuten2016-170208065814 (1).pptx
abelbrownnvidiarakuten2016-170208065814 (1).pptx
 
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
 
GTC China 2016
GTC China 2016GTC China 2016
GTC China 2016
 
Introduction to multi gpu deep learning with DIGITS 2 - Mike Wang
Introduction to multi gpu deep learning with DIGITS 2 - Mike WangIntroduction to multi gpu deep learning with DIGITS 2 - Mike Wang
Introduction to multi gpu deep learning with DIGITS 2 - Mike Wang
 
NVIDIA Deep Learning Institute 2017 基調講演
NVIDIA Deep Learning Institute 2017 基調講演NVIDIA Deep Learning Institute 2017 基調講演
NVIDIA Deep Learning Institute 2017 基調講演
 
DataArt
DataArtDataArt
DataArt
 
Simplifying AI Infrastructure: Lessons in Scaling on DGX Systems
Simplifying AI Infrastructure: Lessons in Scaling on DGX SystemsSimplifying AI Infrastructure: Lessons in Scaling on DGX Systems
Simplifying AI Infrastructure: Lessons in Scaling on DGX Systems
 
GTC 2016 Opening Keynote
GTC 2016 Opening KeynoteGTC 2016 Opening Keynote
GTC 2016 Opening Keynote
 
Fueling the AI Revolution with Gaming
Fueling the AI Revolution with GamingFueling the AI Revolution with Gaming
Fueling the AI Revolution with Gaming
 
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
Alison B Lowndes - Fueling the Artificial Intelligence Revolution with Gaming...
 
Enabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. LowndesEnabling Artificial Intelligence - Alison B. Lowndes
Enabling Artificial Intelligence - Alison B. Lowndes
 
Data Science Week 2016. NVIDIA. "Платформы и инструменты для реализации систе...
Data Science Week 2016. NVIDIA. "Платформы и инструменты для реализации систе...Data Science Week 2016. NVIDIA. "Платформы и инструменты для реализации систе...
Data Science Week 2016. NVIDIA. "Платформы и инструменты для реализации систе...
 
Aplicações Potenciais de Deep Learning à Indústria do Petróleo
Aplicações Potenciais de Deep Learning à Indústria do PetróleoAplicações Potenciais de Deep Learning à Indústria do Petróleo
Aplicações Potenciais de Deep Learning à Indústria do Petróleo
 
Dell and NVIDIA for Your AI workloads in the Data Center
Dell and NVIDIA for Your AI workloads in the Data CenterDell and NVIDIA for Your AI workloads in the Data Center
Dell and NVIDIA for Your AI workloads in the Data Center
 
AI in the Financial Services Industry
AI in the Financial Services IndustryAI in the Financial Services Industry
AI in the Financial Services Industry
 
Introduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI PlatformIntroduction to PowerAI - The Enterprise AI Platform
Introduction to PowerAI - The Enterprise AI Platform
 
The Revolution of Deep Learning
The Revolution of Deep LearningThe Revolution of Deep Learning
The Revolution of Deep Learning
 
Accelerate AI w/ Synthetic Data using GANs
Accelerate AI w/ Synthetic Data using GANsAccelerate AI w/ Synthetic Data using GANs
Accelerate AI w/ Synthetic Data using GANs
 
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...TECHNICAL OVERVIEW NVIDIA DEEP  LEARNING PLATFORM Giant Leaps in Performance ...
TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance ...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 

More from Rakuten Group, Inc.

More from Rakuten Group, Inc. (20)

コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
 
楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり
 
What Makes Software Green?
What Makes Software Green?What Makes Software Green?
What Makes Software Green?
 
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
 
DataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組みDataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組み
 
大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開
 
楽天における大規模データベースの運用
楽天における大規模データベースの運用楽天における大規模データベースの運用
楽天における大規模データベースの運用
 
楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー
 
楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割
 
Rakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdfRakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdf
 
The Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdfThe Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdf
 
Supporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdfSupporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdf
 
Making Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdfMaking Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdf
 
How We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdfHow We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdf
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
OWASPTop10_Introduction
OWASPTop10_IntroductionOWASPTop10_Introduction
OWASPTop10_Introduction
 
Introduction of GORA API Group technology
Introduction of GORA API Group technologyIntroduction of GORA API Group technology
Introduction of GORA API Group technology
 
100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情
 
社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー
 

Recently uploaded

Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 

Recently uploaded (20)

Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
Observability Concepts EVERY Developer Should Know (DevOpsDays Seattle)
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
The Metaverse: Are We There Yet?
The  Metaverse:    Are   We  There  Yet?The  Metaverse:    Are   We  There  Yet?
The Metaverse: Are We There Yet?
 
Generative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdfGenerative AI Use Cases and Applications.pdf
Generative AI Use Cases and Applications.pdf
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Intro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptxIntro to Passkeys and the State of Passwordless.pptx
Intro to Passkeys and the State of Passwordless.pptx
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
Portal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russePortal Kombat : extension du réseau de propagande russe
Portal Kombat : extension du réseau de propagande russe
 
Intro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджераIntro in Product Management - Коротко про професію продакт менеджера
Intro in Product Management - Коротко про професію продакт менеджера
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 

Introduction to Deep Learning (NVIDIA)

  • 2. 2 ENTERPRISE AUTOGAMING DATA CENTERPRO VISUALIZATION THE WORLD LEADER IN VISUAL COMPUTING
  • 3. 3 THE BIG BANG IN MACHINE LEARNING DNN GPUBIG DATA 100 hours of video uploaded every minute 350 millions images uploaded per day 2.5 Petabytes of customer data hourly 0.0 0.5 1.0 1.5 2.0 2.5 3.0 2008 2009 2010 2011 2012 2013 2014 NVIDIA GPU x86 CPU TFLOPS
  • 4. 4 BIG DATA & ANALYTICS AUTOMOTIVE Auto sensors reporting location, problems COMMUNICATIONS Location-based advertising CONSUMER PACKAGED GOODS Sentiment analysis of what’s hot, problems $ FINANCIAL SERVICES Risk & portfolio analysis New products EDUCATION & RESEARCH Experiment sensor analysis HIGH TECHNOLOGY / INDUSTRIAL MFG. Mfg. quality Warranty analysis LIFE SCIENCES Clinical trials MEDIA/ENTERTAINMENT Viewers / advertising effectiveness ON-LINE SERVICES / SOCIAL MEDIA People & career matching HEALTH CARE Patient sensors, monitoring, EHRs OIL & GAS Drilling exploration sensor analysis RETAIL Consumer sentiment TRAVEL & TRANSPORTATION Sensor analysis for optimal traffic flows UTILITIES Smart Meter analysis for network capacity, LAW ENFORCEMENT & DEFENSE Threat analysis - social media monitoring, photo analysis
  • 5. 5 EXPONENTIAL DATA GROWTH INCREASING DATA VARIETY Search Marketing Behavioral Targeting Dynamic Funnels User Generated Content Mobile Web SMS/MMS Sentiment HD Video Speech To Text Product/ Service Logs Social Network Business Data Feeds User Click Stream Sensors Infotainment Systems Wearable Devices Cyber Security Logs Connected Vehicles Machine Data IoT Data Dynamic Pricing Payment Record Purchase Detail Purchase Record Support Contacts Segmentation Offer Details Web Logs Offer History A/B Testing BUSINESS PROCESS PETABYTESTERABYTESGIGABYTESEXABYTESZETTABYTES Streaming Video Natural Language Processing WEB DIGITAL AI 90% of the world’s data created in the last year - IBM
  • 6. 6
  • 7. 7 WHAT IS DEEP LEARNING? ARTIFICAL INTELLIGENCE MACHINE LEARNING DEEP LEARNINGPerception Reasoning Planning Optimization Computational Statistics Supervised and Unsupervised Learning Neural networks Distributed Representations Hierarchical Explanatory Factors Unsupervised Feature Engineering
  • 8. 8 DEEP LEARNING FUELING DISCOVERY Classify Satellite Images for Carbon Monitoring Analyze Obituaries on the Web for Cancer-related Discoveries Determine Drug Treatments to Increase Child’s Chance of Survival NASA AMES
  • 9. 9 DEEP LEARNING FOR EVERY APPLICATION Visual search for e-commerce Visual Search in Geoinformatics Improving Agriculture: LettuceBot only sprays weeds
  • 10. 10 Language Classification Deep Learning CNN Super-Human Language Translation DEEP LEARNING FOR EVERY APPLICATION
  • 11. 11 DEEP LEARNING FOR EVERY APPLICATION
  • 13. 13 MORE THAN 1,500 AI START UPS AROUND THE WORLD Deep Learning for Art Deep Learning for Cybersecurity Deep Learning for Genomics Deep Learning for Self-Driving Cars
  • 14. 14 IMAGENET CHALLENGE Where it all started … again bird frog person hammer flower pot power drill person car helmet motorcycle person dog chair 1.2M training images • 1000 object categories Challenge
  • 15. 15 ACHIEVING SUPERHUMAN PERFORMANCE 2012: Deep Learning researchers worldwide discover GPUs 2016: Microsoft achieves speech recognition milestone 2015: ImageNet — Deep Learning achieves superhuman image recognition
  • 16. 16 DEEP LEARNING ADOPTION IS EXPONENTIAL # of Organizations Using Deep Learning Source: Jeff Dean, Spark Summit 2016
  • 17. 17 MASSIVE COMPUTING CHALLENGE SPEECH RECOGNITION 2014 Deep Speech 1 80 GFLOP 7,000 hrs of Data ~8% Error 465 GFLOP 12,000 hrs of Data ~5% Error 2015 Deep Speech 2 10X Training Ops IMAGE RECOGNITION 2012 AlexNet 8 Layers 1.4 GFLOP ~16% Error 152 Layers 22.6 GFLOP ~3.5% Error 2015 ResNet 16X Model
  • 18. 18 Device NVIDIA DEEP LEARNING PLATFORM TRAINING DIGITS Training System Deep Learning Frameworks Tesla P100, DGX1 DATACENTER INFERENCING DeepStream SDK TensorRT Tesla P40 & P4
  • 19. 19 Device NVIDIA DEEP LEARNING PLATFORM TRAINING DATACENTER INFERENCING Training: comparing to Kepler GPU in 2013 using Caffe, Inference: comparing img/sec/watt to CPU: Intel E5-2697v4 using AlexNet 65Xin 3 years Tesla P100 40Xvs CPU Tesla P4
  • 20. 20 40x Efficient vs CPU, 8x Efficient vs FPGA 0 50 100 150 200 AlexNet CPU FPGA 1x M4 (FP32) 1x P4 (INT8) Images/Sec/Watt Maximum Efficiency for Scale-out Servers TESLA P4 5.5 TFLOPS 0 20,000 40,000 60,000 80,000 100,000 GoogLeNet AlexNet 8x M40 (FP32) 8x P40 (INT8)TESLA P40 Highest Throughput for Scale-up Servers Images/Sec 4x Boost in Less than One Year
  • 21. 21 INTRODUCING TESLA P100 Page Migration Engine Virtually Unlimited Memory CoWoS HBM2 3D Stacked Memory (i.e fast!) NVLink GPU Interconnect for Maximum Scalability
  • 22. 22 NVIDIA DGX-1 AI Supercomputer-in-a-Box 170 TFLOPS | 8x Tesla P100 16GB | NVLink Hybrid Cube Mesh 2x Xeon | 8 TB RAID 0 | Quad IB 100Gbps, Dual 10GbE | 3U — 3200W
  • 23. 23 Instant productivity — plug-and- play, supports every AI framework Performance optimized across the entire stack Always up-to-date via the cloud Mixed framework environments —containerized Direct access to NVIDIA experts DGX STACK Fully integrated Deep Learning platform
  • 24. 24 NVIDIA POWERS DEEP LEARNING Every major DL framework leverages NVIDIA SDKs Mocha.jl NVIDIA DEEP LEARNING SDK COMPUTER VISION SPEECH & AUDIO NATURAL LANGUAGE PROCESSING OBJECT DETECTION IMAGE CLASSIFICATION VOICE RECOGNITION LANGUAGE TRANSLATION RECOMMENDATION ENGINES SENTIMENT ANALYSIS
  • 25. 25 NVIDIA DIGITS Interactive Deep Learning GPU Training System Interactive deep neural network development environment for image classification and object detection Schedule, monitor, and manage neural network training jobs Analyze accuracy and loss in real time Track datasets, results, and trained neural networks Scale training jobs across multiple GPUs automatically
  • 26. 26 NVIDIA cuDNN Accelerating Deep Learning High performance building blocks for deep learning frameworks Drop-in acceleration for widely used deep learning frameworks such as Caffe, CNTK, Tensorflow, Theano, Torch and others Accelerates industry vetted deep learning algorithms, such as convolutions, LSTM, fully connected, and pooling layers Fast deep learning training performance tuned for NVIDIA GPUs Deep Learning Training Performance Caffe AlexNet Speed-upofImages/SecvsK40in2013 K40 K80 + cuDN… M40 + cuDNN4 P100 + cuDNN5 0x 10x 20x 30x 40x 50x 60x 70x 80x “ NVIDIA has improved the speed of cuDNN with each release while extending the interface to more operations and devices at the same time.” — Evan Shelhamer, Lead Caffe Developer, UC Berkeley AlexNet training throughput on CPU: 1x E5-2680v3 12 Core 2.5GHz. 128GB System Memory, Ubuntu 14.04 M40 bar: 8x M40 GPUs in a node, P100: 8x P100 NVLink-enabled
  • 27. 27 0 50 100 150 200 250 300 P40 P4 1x CPU (14 cores) Inference Execution Time (ms) 11 ms 6 ms User Experience: Instant Response 45x Faster with Pascal + TensorRT Faster, more responsive AI-powered services such as voice recognition, speech translation Efficient inference on images, video, & other data in hyperscale production data centers INTRODUCING NVIDIA TensorRT High Performance Inference Engine 260 ms Training Device Datacenter
  • 28. 28 NVIDIA DEEPSTREAM SDK Delivering Video Analytics at Scale Inference Preprocess Hardware Decode “Boy playing soccer” Simple, high performance API for analyzing video Decode H.264, HEVC, MPEG-2, MPEG-4, VP9 CUDA-optimized resize and scale TensorRT 0 20 40 60 80 100 1x Tesla P4 Server + DeepStream SDK 13x E5-2650 v4 Servers ConcurrentVideoStreams Concurrent Video Streams Analyzed
  • 29. 29 “Billions of intelligent devices will take advantage of deep learning to provide personalization and localization as GPUs become faster and faster over the next several years.” — Tractica BILLIONS OF INTELLIGENT DEVICES
  • 30. 30 SMART CITIES OF THE FUTURE “Pittsburgh's "predictive policing" program … police car laptops will display maps showing locations where crime is likely to occur, based on data-crunching algorithms developed by scientists at Carnegie Mellon University — Science
  • 32. 32 GPU-ACCELERATION HAS NO LIMITS MapD MapD is 55x to 1,000x faster than comparable CPU databases on billion+ row datasets Kinetica Hardware costs that are 1⁄10 that of standard in-memory databases BlazeGraph 200-300x speed-up Graphistry See 100x more data at millisecond speed SQream The supercomputing powers of the GPU combined with SQream’s patented technology, results in up to 100 times faster analytics performance on terabyte- petabyte scale data sets
  • 33. 33 MASSIVE SCALE GPU ACCELERATED ANALYTICS DEA theft of Silk Road bitcoinsSIEM attack escalationTwitter botnet deconstruction
  • 34. 34 GETTING STARTED WITH DEEP LEARNING developer.nvidia.com/deep-learning