SlideShare a Scribd company logo
1 of 41
PowerAI
The Enterprise AI Platform
Indrajit Poddar (I.P)
ipoddar@us.ibm.com
STSM, IBM Systems Technical Strategy
July, 2017
2
3
$-
$2,000.00
$4,000.00
$6,000.00
$8,000.00
$10,000.00
$12,000.00
$14,000.00
$16,000.00
$18,000.00
2017 2018 2019 2020
Deep Learning Hardware Revenue: $1.8B-$15-6B
Cognitive Market Spend(2019)
Software Services Hardware Other
5
$31 Billion
IDC Market Data
6
Artificial
Intelligence &
Cognitive
Applications
Big
Data
Machine
Learning
Deep
Learning
(Neural Nets)
The Cognitive Landscape is Evolving
Core concepts in Machine Learning:
Training  Inference
Training
• Data intensive:
historical data sets
• Compute intensive:
100% accelerated
• Develop a model for
use on the edge as
inference
Inference
• Enables the computer
to act in real time
• Low Power
• Out at the edge
9
Input Result
Earlier Layers
Detect Edges
Later Layers Detect
Features like Eyes,
Nose, Mouth
person
car
helmet
motorcycle
bird
frog
person
dog
chair
person
hammer
flower pot
power drill
13
Transmission Line
Inspection
14
15
To build a team with deep learning
expertise : 2 months ~ 1 year
To prepare massive training
data : ~ 10 man month(s)
To train a new
model : 1 hour ~
week
To give an AI
inference result :
< 1s
Challenges in creating an AI infrastructure
Time needed to:
• Find skills
• Handle large data-sets
• Hi-res images, video feed..
• Continuously train models
• Run inferencing at scale
• Handle rapidly evolving open
source components
CPUs are not getting faster as rapidly as before
- Moore’s law is dying
Resulting in unprecedented demand for :
• Offloaded computation, accelerators, and higher
memory bandwidth systems
• Easy to use software that works with open source and
scales
PowerAI: Enterprise Class, Ease of Use, Faster Training
Enterprise Software
Distribution
Binary Package of Major
Deep Learning Frameworks
with Enterprise Support
Tools for Ease of
Development
Graphical tools to Enhance
Data Scientist Developer
Experience
Faster Training Times
for Data Scientists
Performance Optimized for
Single Node & Distributed
Computing Scaling
17
Data Lake
Transform & Prep
Data (ETL)
Trained Model
Images of
Damaged
Components
ModelTraining
Transform & Prep
Data (ETL)
Off-Line
Training
Production
LiveVideo
Financial Services
Retailers
Internal Business Processes
Chatbots, Call Center Automation
Transportation
Text Analytics of Social Media, Call
Center Phone Logs
18
MEDIA/ENTERTAINMENT
RETAIL
Reco. Engines,
Precision Mktg
COMMUNICATIONS
Location-based
advertising
LIFE SCIENCES
Sequence Analysis,
Radiology
UTILITIES
Smart Meter analysis,
Capacity planning
$
FINANCIAL SERVICES
Risk analysis
Fraud detection
CUSTOMER SERVICE
Chatbots, Helpdesk
Automated Expenses
LAW & DEFENSE
Threat analysis - social
media monitoring
RESEARCH
Physics
Modeling
HEALTH CARE
Patient sensors,
monitoring, EHRs
TRANSPORTATION
Optimal traffic flows,
Route planning
CONSUMER GOODS
Sentiment
analysis
Advertising
effectiveness
OIL & GAS
Exploration,
sensor analysis
AUTOMOTIVE
ADAS,
Maintenance
MANUFACTURING
Line inspection,
Defect analysis
20
21
AI Strategy: Ease of Use & Performance
Open Frameworks
Developer Ease-of-Use Tools
Performance Optimizations:
Software & Hardware
Caffe NVCaffe TorchIBMCaffe
Distributed
TensorFlowTensorFlow
OpenBLAS
Theano
Deep Learning
Frameworks
Accelerated
Servers and
Infrastructure
for Scaling
Spectrum Scale:
High-Speed Parallel
File System
Scale to
Cloud
Cluster of NVLink
Servers
Bazel DIGITSNCCL
Distributed
Communications
Supporting
Libraries
Chainer
PowerAI
DL Frameworks + Libraries
(TensorFlow, Caffe, ..)
IBM Data Science
Experience (DSX)
Distributed Computing
with Spark & MPI
DL Developer Tools
Spectrum Scale High-Speed
File System via HDFS APIs
Cluster of NVLink Servers
PowerAI Enterprise (Coming soon)
IBM Enterprise
Support
Application Dev
Services
Enterprise Support & Services
to Augment Enterprise
Expertise
Packaged, Pre-Compiled Deep
Learning Frameworks
(TensorFlow, Caffe, Torch, ..)
Optimized for Scaling &
Fast Training Time
Data Scientists Productivity
Tools Targeted to DL
Developers
IBM Confidential
PowerAI: Making AI More Accessible to Developers
• AI Vision: Targeted at Application Developers
• Data Extraction, Transformation and Preparation tool
• DL Insight
• Distributed Deep Learning
Multi-tenant, Enterprise-ready Deep Learning Platform for Data Scientists
24
caffe-bvlc: install cuda,cuDNN, install openblas, install protobuf, clone, build and install opencv, install python, install python-dev, install
libgflags, install libgoogle-glog-dev, install liblmdb-dev, edit make file to enable CuDNN, make all, make distribution
Torch: complicated on Power as luaJIT has mixed support for OpenPOWER. We use a luaJIT fork to build.
caffe-nv: same dependencies as caffe-bvlc; separate upstream repo for caffe-nv, specific versions are needed for newer versions of Nvidia’s
DIGITS tool.
caffe-ibm: same dependencies as caffe-bvlc, separate build stream; versions; updates
Tensorflow: in PIP for x86, but it is often recommended to build from source: upgrade pip, install Bazel, install many dependencies including
java, configure the build, compile, pip install whl, upgrade protobuf
Theano: install python, numpy, scipy, openBLAS, python-dev, nose, Sphinx, cuda, pycuda, clone, build and install libgpuarray
DIGITS: clone digits from repo, install dependencies (PIP)
Life without PowerAI:
With PowerAI:
PowerAI: install cuda, cuDNN; sudo apt-get install power-mldl
DL Frameworks
(TF, Caffe, etc)
Data Prep & ETL via
Spectrum Conductor
with Spark
Input
Data
Deep Learning GUI
Data & Model
Management, ETLTools,
Monitor,Visualize,
Advise
DL Insight
Tuning Engine
AIVision
ComputerVisionApp
DevelopmentToolkit
IBM Spectrum Conductor with Spark
System mgmt, Distributed ETL, DistributedTraining, Hyper-Parameter Optimization
DistributedTraining
27
Data Lake & Data Stores
Distributed Computing
Machine & Deep Learning
Libraries & Frameworks
CognitiveAPIs
(Eg:Watson)
In-House
CognitiveAPIs
Applications
Hadoop HDFS,
NoSQL DBs
Spark, MPI
TensorFlow, Caffe,
SparkML
Speech,Vision,
NLP, Sentiment
Segment Specific:
Finance, Retail,
Healthcare, etc.
Accelerated Servers Storage
Accelerated
Infrastructure
Transform & Prep
Data (ETL)
https://mc.jarvice.com/
28
ATLAS
Automatically Tuned Linear Algebra
Software)
https://power.jarvice.com/
29
Deep Learning Training + Inference
Accelerators
Clustering frameworks
Workload
Aware
Scheduling
Shared
Resource
Management
Emerging
Workloads
Dev Ops & Micro Services
High Performance
Computing
Design / Simulation / Modeling
‘New-gen
Workloads’
Hadoop, Spark, Containers
with Spark
IBM
Cloud
private
Ne
w
High Performance
Analytics
Trade / Risk Analytics
Containers and images
IBM Data
Science
Experience
30
31
IBM OpenPOWER Moves on Deep Learning with a Vengeance
“In short, IBM kicked some butt today”
Rob Enderle
Industry Analyst
IBM brings Google's AI tools to its powerful computers
Google has cool technology to recognize images and speech, and IBM's
hardware can diagnose diseases and beat humans in Jeopardy.
Combine the two, and you get a powerful computer with serious brains.
OpenPOWER: Open Hardware for High Performance
32
Systems designed for
big data analytics
and superior cloud economics
Upto:
10 cores per cpu
96 hardware threads per cpu
1/2 TB RAM
7.6Tb/s combined I/O Bandwidth
OpenPOWER
Traditional
Intel x86
http://www.softlayer.com/POWER-SERVERS
https://power.jarvice.com/landing
Accelerated AI: Chip and Servers
POWER8 + coherent CAPI +
novel NVlink
for high BW coherent
CPU/GPU acceleration
S822LC-hpc:
• 2 POWER8 10 Core CPUs
• 4 NVIDIA P100 ”Pascal” GPUs
• 256 GB System Memory
• 2 SSD storage devices
• High-speed interconnect
(IB or Ethernet, depending on
infrastructure)
• Optional:
• Up to 1 TB System Memory
• PCIe attached NVMe storage
“POWER8 with NVLink”
S821LC:
High Density 2-Socket 1U
S822LC for Big Data
S822LC for High
Performance Computing
Power
Linux Servers
M.Gschwind, Bringing the Deep Learning Revolution into the Enterprise
Accelerated AI
Accelerator X
33
Introducing 822LC Power System for HPC:
First Custom-Built GPU Accelerator Server with NVLink and NVidia P100 GPUs
M.Gschwind, Bringing the Deep Learning Revolution into the Enterprise
▪ Custom-built GPU Accelerator Server
▪ High-Speed NVLink Connections between
CPUs & GPUs and among GPUs
▪ Features novel NVIDIA P100 Pascal GPU
accelerator
NVIDIA P100 Pascal GPU
2.5x Faster CPU-GPU Data Communication
via NVLink
NVLink
80 GB/s
GPU
P8
GPU GPU
P8
GPU
POWER8 NVLink Server
PCIe
32 GB/s
GPU GPU GPU GPU
No NVLink between CPU & GPU for x86
Servers: PCIe Bottleneck
x86 Servers with PCIe
x86 x86
34
Higher Performance with Power8 CPU-P100 GPU NVLink
P100
GPU
POWER8
CPU
GPU
Memory
System
Memory
P100
GPU
80 GB/s
GPU
Memory
NVLink
115 GB/s
P100
GPU
POWER8
CPU
GPU
Memory
System
Memory
P100
GPU
80 GB/s
GPU
Memory
NVLink
115 GB/s
0
50
100
150
200
250
300
S822LC - Optimized E5-2640v4
Images Processed (Images/Sec)
(TensorFlow, Inception v3)
36
IBM S822LC 20-cores 2.86GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs / Ubuntu 16.04 /
CUDA 8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch)
Intel Broadwell E5-2640v4 20-core 2.6 GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs/ Ubuntu 16.04 /
CUDA 8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch)
Power8 “Minsky” Server Intel x86-Based Server
Minsky: 30% Faster
PowerAI vs DGX-1: 1.6xTensorFlowThroughput / Dollar
(lower cost is better)
37
• TensorFlow 0.12 on the IBM PowerAI
platform takes advantage of the full
capabilities of NVLink
• For image classification and analysis this
means a 1.6X price performance advantage
relative to the NVIDIA DGX-1
System Images /
Second
List Price $ / Image /
Second
NVIDIA DGX-1
(8 P100 GPU,
512GB Mem)
330 $129,000 $390
PowerAI (4 P100
GPU, 512 GB
Mem)
273 $67,000 $241
38
PowerAI Trial Configurations in a public cloud:
• Docker container builds and comes up in minutes
• Single P100 GPUs
• 30 days with 60 hrs standard (120 for Sales referral)
• 128GB RAM, 32 CPU threads, 1TB shared storage
• Quad P100 GPUs
• 30 days with 120hrs standard (more by request)
• 512GB RAM, 128 CPU threads, 1TB shared storage
Contact: Michael Boros
Nimbix Cloud Advantages
• Easier to use
• Highest Performance
• Ultra Fast Launch Times
• Lower Cost
• Faster time to Value
• Bare-Metal Acceleration
• Enterprise Accounting
• Application Marketplace
• Private Apps
https://www.slideshare.net/IndrajitPoddar/fast-scalable-easy-machine-
learning-with-openpower-gpus-and-docker
Experience performance
with productivity
A superior integrated stack and
adequate hardware resources for
deep learning insights
40
Launch deep learning
training by one-clickData labeling
Monitor the training
progress
Deploy the inference API to
data center
Generate and deploy the DL
inference accelerator onto FPGA
DL Engineer could get
optimized model parameters
DL Insight
DL Engineer
DSX
Inject the designed DL
network into AI Vision
AI Vision
Develop the DL
neural network via
the interactive GUI
Solution
developer
PowerAI
Inference Engine
Test
engineer
Error results will be looped back
to trigger new training task
"Easier Insights with Data Science Experience and PowerAI Deep Learning" -
https://ibm.box.com/s/m7ooeoi738rs7dq9l9v0i9iir79t4xmd
Analytics Signature Moment Event in Munich:
https://www.ibm.com/analytics/us/en/events/machine-learning/
• a 10x increase in
inspections/day
• a 90%decrease in
inspection time
• a Significant reduction
in worker accidents
Example value realized by an Asian
Utility company using PowerAI

More Related Content

What's hot

Intel's Machine Learning Strategy
Intel's Machine Learning StrategyIntel's Machine Learning Strategy
Intel's Machine Learning Strategyinside-BigData.com
 
Affordable AI Connects To A Better Life
Affordable AI Connects To A Better LifeAffordable AI Connects To A Better Life
Affordable AI Connects To A Better LifeNVIDIA Taiwan
 
OpenPOWER Foundation Overview
OpenPOWER Foundation OverviewOpenPOWER Foundation Overview
OpenPOWER Foundation OverviewNVIDIA Taiwan
 
AWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI ResearchAWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI ResearchIntel® Software
 
Accelerated Computing: The Path Forward
Accelerated Computing: The Path ForwardAccelerated Computing: The Path Forward
Accelerated Computing: The Path ForwardNVIDIA
 
Shattering AI Performance Records
Shattering AI Performance RecordsShattering AI Performance Records
Shattering AI Performance RecordsNVIDIA
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA
 
Tesla Accelerated Computing Platform
Tesla Accelerated Computing PlatformTesla Accelerated Computing Platform
Tesla Accelerated Computing Platforminside-BigData.com
 
5 biggest hpc trends 2021
5 biggest hpc trends 20215 biggest hpc trends 2021
5 biggest hpc trends 2021Sandeep Mishra
 
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA Taiwan
 
Opening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual ComputingOpening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual ComputingNVIDIA
 
A Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate ArraysA Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate ArraysTaylor Riggan
 
NVIDIA 深度學習教育機構 (DLI): Neural network deployment
NVIDIA 深度學習教育機構 (DLI): Neural network deploymentNVIDIA 深度學習教育機構 (DLI): Neural network deployment
NVIDIA 深度學習教育機構 (DLI): Neural network deploymentNVIDIA Taiwan
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooJason Dai
 
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY
 
GPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 KeynoteGPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 KeynoteNVIDIA
 
Nvidia Deep Learning Solutions - Alex Sabatier
Nvidia Deep Learning Solutions - Alex SabatierNvidia Deep Learning Solutions - Alex Sabatier
Nvidia Deep Learning Solutions - Alex SabatierSri Ambati
 

What's hot (20)

WML OpenPOWER presentation
WML OpenPOWER presentationWML OpenPOWER presentation
WML OpenPOWER presentation
 
Intel's Machine Learning Strategy
Intel's Machine Learning StrategyIntel's Machine Learning Strategy
Intel's Machine Learning Strategy
 
Affordable AI Connects To A Better Life
Affordable AI Connects To A Better LifeAffordable AI Connects To A Better Life
Affordable AI Connects To A Better Life
 
OpenPOWER Foundation Overview
OpenPOWER Foundation OverviewOpenPOWER Foundation Overview
OpenPOWER Foundation Overview
 
AWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI ResearchAWS & Intel Webinar Series - Accelerating AI Research
AWS & Intel Webinar Series - Accelerating AI Research
 
Accelerated Computing: The Path Forward
Accelerated Computing: The Path ForwardAccelerated Computing: The Path Forward
Accelerated Computing: The Path Forward
 
Shattering AI Performance Records
Shattering AI Performance RecordsShattering AI Performance Records
Shattering AI Performance Records
 
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
NVIDIA CEO Jensen Huang Presentation at Supercomputing 2019
 
Tesla Accelerated Computing Platform
Tesla Accelerated Computing PlatformTesla Accelerated Computing Platform
Tesla Accelerated Computing Platform
 
5 biggest hpc trends 2021
5 biggest hpc trends 20215 biggest hpc trends 2021
5 biggest hpc trends 2021
 
NVIDIA Keynote #GTC21
NVIDIA Keynote #GTC21 NVIDIA Keynote #GTC21
NVIDIA Keynote #GTC21
 
AI + E-commerce
AI + E-commerceAI + E-commerce
AI + E-commerce
 
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習NVIDIA DGX-1 超級電腦與人工智慧及深度學習
NVIDIA DGX-1 超級電腦與人工智慧及深度學習
 
Opening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual ComputingOpening Keynote at GTC 2015: Leaps in Visual Computing
Opening Keynote at GTC 2015: Leaps in Visual Computing
 
A Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate ArraysA Primer on FPGAs - Field Programmable Gate Arrays
A Primer on FPGAs - Field Programmable Gate Arrays
 
NVIDIA 深度學習教育機構 (DLI): Neural network deployment
NVIDIA 深度學習教育機構 (DLI): Neural network deploymentNVIDIA 深度學習教育機構 (DLI): Neural network deployment
NVIDIA 深度學習教育機構 (DLI): Neural network deployment
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
 
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
HPC DAY 2017 | Accelerating tomorrow's HPC and AI workflows with Intel Archit...
 
GPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 KeynoteGPU Technology Conference 2014 Keynote
GPU Technology Conference 2014 Keynote
 
Nvidia Deep Learning Solutions - Alex Sabatier
Nvidia Deep Learning Solutions - Alex SabatierNvidia Deep Learning Solutions - Alex Sabatier
Nvidia Deep Learning Solutions - Alex Sabatier
 

Similar to Introduction to PowerAI - The Enterprise AI Platform

Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...E-Commerce Brasil
 
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataBig Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataMatt Stubbs
 
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Intel® Software
 
End to End Machine Learning Open Source Solution Presented in Cisco Developer...
End to End Machine Learning Open Source Solution Presented in Cisco Developer...End to End Machine Learning Open Source Solution Presented in Cisco Developer...
End to End Machine Learning Open Source Solution Presented in Cisco Developer...Manish Harsh
 
Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Lablup Inc.
 
Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceAlison B. Lowndes
 
AI Scalability for the Next Decade
AI Scalability for the Next DecadeAI Scalability for the Next Decade
AI Scalability for the Next DecadePaula Koziol
 
[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...
[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...
[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...Naoki (Neo) SATO
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...James Serra
 
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...Infoshare
 
InTech Event | Cognitive Infrastructure for Enterprise AI
InTech Event | Cognitive Infrastructure for Enterprise AIInTech Event | Cognitive Infrastructure for Enterprise AI
InTech Event | Cognitive Infrastructure for Enterprise AIInTTrust S.A.
 
GPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersGPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersRommel Garcia
 
Power AI introduction
Power AI introductionPower AI introduction
Power AI introductionSnowy Chen
 
RAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceRAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceData Works MD
 
Deep learning for FinTech
Deep learning for FinTechDeep learning for FinTech
Deep learning for FinTechgeetachauhan
 
Intel Powered AI Applications for Telco
Intel Powered AI Applications for TelcoIntel Powered AI Applications for Telco
Intel Powered AI Applications for TelcoMichelle Holley
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Intel® Software
 

Similar to Introduction to PowerAI - The Enterprise AI Platform (20)

Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
Fórum E-Commerce Brasil | Tecnologias NVIDIA aplicadas ao e-commerce. Muito a...
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
 
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast DataBig Data LDN 2017: BI Converges with AI - GPUs for Fast Data
Big Data LDN 2017: BI Converges with AI - GPUs for Fast Data
 
Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)Introduction to Software Defined Visualization (SDVis)
Introduction to Software Defined Visualization (SDVis)
 
End to End Machine Learning Open Source Solution Presented in Cisco Developer...
End to End Machine Learning Open Source Solution Presented in Cisco Developer...End to End Machine Learning Open Source Solution Presented in Cisco Developer...
End to End Machine Learning Open Source Solution Presented in Cisco Developer...
 
Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)Backend.AI Technical Introduction (19.09 / 2019 Autumn)
Backend.AI Technical Introduction (19.09 / 2019 Autumn)
 
Harnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligenceHarnessing the virtual realm for successful real world artificial intelligence
Harnessing the virtual realm for successful real world artificial intelligence
 
OpenPOWER and IBM AI overview
OpenPOWER and IBM AI  overview   OpenPOWER and IBM AI  overview
OpenPOWER and IBM AI overview
 
AI Scalability for the Next Decade
AI Scalability for the Next DecadeAI Scalability for the Next Decade
AI Scalability for the Next Decade
 
[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...
[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...
[Connect(); // Japan 2016] Microsoft の AI 開発最新アップデート ~ Cognitive Services からA...
 
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
AI for an intelligent cloud and intelligent edge: Discover, deploy, and manag...
 
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
infoShare AI Roadshow 2018 - Tomasz Kopacz (Microsoft) - jakie możliwości daj...
 
InTech Event | Cognitive Infrastructure for Enterprise AI
InTech Event | Cognitive Infrastructure for Enterprise AIInTech Event | Cognitive Infrastructure for Enterprise AI
InTech Event | Cognitive Infrastructure for Enterprise AI
 
GPU 101: The Beast In Data Centers
GPU 101: The Beast In Data CentersGPU 101: The Beast In Data Centers
GPU 101: The Beast In Data Centers
 
Power AI introduction
Power AI introductionPower AI introduction
Power AI introduction
 
RAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceRAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data Science
 
Deep learning for FinTech
Deep learning for FinTechDeep learning for FinTech
Deep learning for FinTech
 
Intel Powered AI Applications for Telco
Intel Powered AI Applications for TelcoIntel Powered AI Applications for Telco
Intel Powered AI Applications for Telco
 
Nvidia at SEMICon, Munich
Nvidia at SEMICon, MunichNvidia at SEMICon, Munich
Nvidia at SEMICon, Munich
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture
 

More from Indrajit Poddar

Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Indrajit Poddar
 
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs  Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs Indrajit Poddar
 
Fast Scalable Easy Machine Learning with OpenPOWER, GPUs and Docker
Fast Scalable Easy Machine Learning with OpenPOWER, GPUs and DockerFast Scalable Easy Machine Learning with OpenPOWER, GPUs and Docker
Fast Scalable Easy Machine Learning with OpenPOWER, GPUs and DockerIndrajit Poddar
 
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...Indrajit Poddar
 
Build FAST Learning Apps with Docker and OpenPOWER
Build FAST Learning Apps with Docker and OpenPOWERBuild FAST Learning Apps with Docker and OpenPOWER
Build FAST Learning Apps with Docker and OpenPOWERIndrajit Poddar
 
Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...
Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...
Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...Indrajit Poddar
 
Scalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUs
Scalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUsScalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUs
Scalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUsIndrajit Poddar
 
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERContinuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERIndrajit Poddar
 

More from Indrajit Poddar (8)

Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
 
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs  Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
Build FAST Deep Learning Apps with Docker on OpenPOWER and GPUs
 
Fast Scalable Easy Machine Learning with OpenPOWER, GPUs and Docker
Fast Scalable Easy Machine Learning with OpenPOWER, GPUs and DockerFast Scalable Easy Machine Learning with OpenPOWER, GPUs and Docker
Fast Scalable Easy Machine Learning with OpenPOWER, GPUs and Docker
 
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...
 
Build FAST Learning Apps with Docker and OpenPOWER
Build FAST Learning Apps with Docker and OpenPOWERBuild FAST Learning Apps with Docker and OpenPOWER
Build FAST Learning Apps with Docker and OpenPOWER
 
Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...
Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...
Enabling Cognitive Workloads on the Cloud: GPUs with Mesos, Docker and Marath...
 
Scalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUs
Scalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUsScalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUs
Scalable TensorFlow Deep Learning as a Service with Docker, OpenPOWER, and GPUs
 
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWERContinuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
Continuous Integration with Cloud Foundry Concourse and Docker on OpenPOWER
 

Recently uploaded

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Recently uploaded (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Introduction to PowerAI - The Enterprise AI Platform

  • 1. PowerAI The Enterprise AI Platform Indrajit Poddar (I.P) ipoddar@us.ibm.com STSM, IBM Systems Technical Strategy July, 2017
  • 2. 2
  • 3. 3
  • 5. Cognitive Market Spend(2019) Software Services Hardware Other 5 $31 Billion IDC Market Data
  • 6. 6
  • 8. Core concepts in Machine Learning: Training  Inference Training • Data intensive: historical data sets • Compute intensive: 100% accelerated • Develop a model for use on the edge as inference Inference • Enables the computer to act in real time • Low Power • Out at the edge
  • 9. 9
  • 10.
  • 11. Input Result Earlier Layers Detect Edges Later Layers Detect Features like Eyes, Nose, Mouth
  • 14. 14
  • 15. 15 To build a team with deep learning expertise : 2 months ~ 1 year To prepare massive training data : ~ 10 man month(s) To train a new model : 1 hour ~ week To give an AI inference result : < 1s Challenges in creating an AI infrastructure Time needed to: • Find skills • Handle large data-sets • Hi-res images, video feed.. • Continuously train models • Run inferencing at scale • Handle rapidly evolving open source components CPUs are not getting faster as rapidly as before - Moore’s law is dying Resulting in unprecedented demand for : • Offloaded computation, accelerators, and higher memory bandwidth systems • Easy to use software that works with open source and scales
  • 16. PowerAI: Enterprise Class, Ease of Use, Faster Training Enterprise Software Distribution Binary Package of Major Deep Learning Frameworks with Enterprise Support Tools for Ease of Development Graphical tools to Enhance Data Scientist Developer Experience Faster Training Times for Data Scientists Performance Optimized for Single Node & Distributed Computing Scaling
  • 17. 17 Data Lake Transform & Prep Data (ETL) Trained Model Images of Damaged Components ModelTraining Transform & Prep Data (ETL) Off-Line Training Production LiveVideo
  • 18. Financial Services Retailers Internal Business Processes Chatbots, Call Center Automation Transportation Text Analytics of Social Media, Call Center Phone Logs 18
  • 19. MEDIA/ENTERTAINMENT RETAIL Reco. Engines, Precision Mktg COMMUNICATIONS Location-based advertising LIFE SCIENCES Sequence Analysis, Radiology UTILITIES Smart Meter analysis, Capacity planning $ FINANCIAL SERVICES Risk analysis Fraud detection CUSTOMER SERVICE Chatbots, Helpdesk Automated Expenses LAW & DEFENSE Threat analysis - social media monitoring RESEARCH Physics Modeling HEALTH CARE Patient sensors, monitoring, EHRs TRANSPORTATION Optimal traffic flows, Route planning CONSUMER GOODS Sentiment analysis Advertising effectiveness OIL & GAS Exploration, sensor analysis AUTOMOTIVE ADAS, Maintenance MANUFACTURING Line inspection, Defect analysis
  • 20. 20
  • 21. 21 AI Strategy: Ease of Use & Performance Open Frameworks Developer Ease-of-Use Tools Performance Optimizations: Software & Hardware
  • 22. Caffe NVCaffe TorchIBMCaffe Distributed TensorFlowTensorFlow OpenBLAS Theano Deep Learning Frameworks Accelerated Servers and Infrastructure for Scaling Spectrum Scale: High-Speed Parallel File System Scale to Cloud Cluster of NVLink Servers Bazel DIGITSNCCL Distributed Communications Supporting Libraries Chainer
  • 23. PowerAI DL Frameworks + Libraries (TensorFlow, Caffe, ..) IBM Data Science Experience (DSX) Distributed Computing with Spark & MPI DL Developer Tools Spectrum Scale High-Speed File System via HDFS APIs Cluster of NVLink Servers PowerAI Enterprise (Coming soon) IBM Enterprise Support Application Dev Services Enterprise Support & Services to Augment Enterprise Expertise Packaged, Pre-Compiled Deep Learning Frameworks (TensorFlow, Caffe, Torch, ..) Optimized for Scaling & Fast Training Time Data Scientists Productivity Tools Targeted to DL Developers IBM Confidential
  • 24. PowerAI: Making AI More Accessible to Developers • AI Vision: Targeted at Application Developers • Data Extraction, Transformation and Preparation tool • DL Insight • Distributed Deep Learning Multi-tenant, Enterprise-ready Deep Learning Platform for Data Scientists 24
  • 25. caffe-bvlc: install cuda,cuDNN, install openblas, install protobuf, clone, build and install opencv, install python, install python-dev, install libgflags, install libgoogle-glog-dev, install liblmdb-dev, edit make file to enable CuDNN, make all, make distribution Torch: complicated on Power as luaJIT has mixed support for OpenPOWER. We use a luaJIT fork to build. caffe-nv: same dependencies as caffe-bvlc; separate upstream repo for caffe-nv, specific versions are needed for newer versions of Nvidia’s DIGITS tool. caffe-ibm: same dependencies as caffe-bvlc, separate build stream; versions; updates Tensorflow: in PIP for x86, but it is often recommended to build from source: upgrade pip, install Bazel, install many dependencies including java, configure the build, compile, pip install whl, upgrade protobuf Theano: install python, numpy, scipy, openBLAS, python-dev, nose, Sphinx, cuda, pycuda, clone, build and install libgpuarray DIGITS: clone digits from repo, install dependencies (PIP) Life without PowerAI: With PowerAI: PowerAI: install cuda, cuDNN; sudo apt-get install power-mldl
  • 26. DL Frameworks (TF, Caffe, etc) Data Prep & ETL via Spectrum Conductor with Spark Input Data Deep Learning GUI Data & Model Management, ETLTools, Monitor,Visualize, Advise DL Insight Tuning Engine AIVision ComputerVisionApp DevelopmentToolkit IBM Spectrum Conductor with Spark System mgmt, Distributed ETL, DistributedTraining, Hyper-Parameter Optimization DistributedTraining
  • 27. 27 Data Lake & Data Stores Distributed Computing Machine & Deep Learning Libraries & Frameworks CognitiveAPIs (Eg:Watson) In-House CognitiveAPIs Applications Hadoop HDFS, NoSQL DBs Spark, MPI TensorFlow, Caffe, SparkML Speech,Vision, NLP, Sentiment Segment Specific: Finance, Retail, Healthcare, etc. Accelerated Servers Storage Accelerated Infrastructure Transform & Prep Data (ETL)
  • 28. https://mc.jarvice.com/ 28 ATLAS Automatically Tuned Linear Algebra Software) https://power.jarvice.com/
  • 29. 29 Deep Learning Training + Inference Accelerators Clustering frameworks Workload Aware Scheduling Shared Resource Management Emerging Workloads Dev Ops & Micro Services High Performance Computing Design / Simulation / Modeling ‘New-gen Workloads’ Hadoop, Spark, Containers with Spark IBM Cloud private Ne w High Performance Analytics Trade / Risk Analytics Containers and images IBM Data Science Experience
  • 30. 30
  • 31. 31 IBM OpenPOWER Moves on Deep Learning with a Vengeance “In short, IBM kicked some butt today” Rob Enderle Industry Analyst IBM brings Google's AI tools to its powerful computers Google has cool technology to recognize images and speech, and IBM's hardware can diagnose diseases and beat humans in Jeopardy. Combine the two, and you get a powerful computer with serious brains.
  • 32. OpenPOWER: Open Hardware for High Performance 32 Systems designed for big data analytics and superior cloud economics Upto: 10 cores per cpu 96 hardware threads per cpu 1/2 TB RAM 7.6Tb/s combined I/O Bandwidth OpenPOWER Traditional Intel x86 http://www.softlayer.com/POWER-SERVERS https://power.jarvice.com/landing
  • 33. Accelerated AI: Chip and Servers POWER8 + coherent CAPI + novel NVlink for high BW coherent CPU/GPU acceleration S822LC-hpc: • 2 POWER8 10 Core CPUs • 4 NVIDIA P100 ”Pascal” GPUs • 256 GB System Memory • 2 SSD storage devices • High-speed interconnect (IB or Ethernet, depending on infrastructure) • Optional: • Up to 1 TB System Memory • PCIe attached NVMe storage “POWER8 with NVLink” S821LC: High Density 2-Socket 1U S822LC for Big Data S822LC for High Performance Computing Power Linux Servers M.Gschwind, Bringing the Deep Learning Revolution into the Enterprise Accelerated AI Accelerator X 33
  • 34. Introducing 822LC Power System for HPC: First Custom-Built GPU Accelerator Server with NVLink and NVidia P100 GPUs M.Gschwind, Bringing the Deep Learning Revolution into the Enterprise ▪ Custom-built GPU Accelerator Server ▪ High-Speed NVLink Connections between CPUs & GPUs and among GPUs ▪ Features novel NVIDIA P100 Pascal GPU accelerator NVIDIA P100 Pascal GPU 2.5x Faster CPU-GPU Data Communication via NVLink NVLink 80 GB/s GPU P8 GPU GPU P8 GPU POWER8 NVLink Server PCIe 32 GB/s GPU GPU GPU GPU No NVLink between CPU & GPU for x86 Servers: PCIe Bottleneck x86 Servers with PCIe x86 x86 34
  • 35. Higher Performance with Power8 CPU-P100 GPU NVLink P100 GPU POWER8 CPU GPU Memory System Memory P100 GPU 80 GB/s GPU Memory NVLink 115 GB/s P100 GPU POWER8 CPU GPU Memory System Memory P100 GPU 80 GB/s GPU Memory NVLink 115 GB/s
  • 36. 0 50 100 150 200 250 300 S822LC - Optimized E5-2640v4 Images Processed (Images/Sec) (TensorFlow, Inception v3) 36 IBM S822LC 20-cores 2.86GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs / Ubuntu 16.04 / CUDA 8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch) Intel Broadwell E5-2640v4 20-core 2.6 GHz 512GB memory / 4 NVIDIA Tesla P100 GPUs/ Ubuntu 16.04 / CUDA 8.0.44 / cuDNN 5.1 / TensorFlow 0.12.0 / Inception v3 Benchmark (64 image minbatch) Power8 “Minsky” Server Intel x86-Based Server Minsky: 30% Faster
  • 37. PowerAI vs DGX-1: 1.6xTensorFlowThroughput / Dollar (lower cost is better) 37 • TensorFlow 0.12 on the IBM PowerAI platform takes advantage of the full capabilities of NVLink • For image classification and analysis this means a 1.6X price performance advantage relative to the NVIDIA DGX-1 System Images / Second List Price $ / Image / Second NVIDIA DGX-1 (8 P100 GPU, 512GB Mem) 330 $129,000 $390 PowerAI (4 P100 GPU, 512 GB Mem) 273 $67,000 $241
  • 38. 38
  • 39. PowerAI Trial Configurations in a public cloud: • Docker container builds and comes up in minutes • Single P100 GPUs • 30 days with 60 hrs standard (120 for Sales referral) • 128GB RAM, 32 CPU threads, 1TB shared storage • Quad P100 GPUs • 30 days with 120hrs standard (more by request) • 512GB RAM, 128 CPU threads, 1TB shared storage Contact: Michael Boros Nimbix Cloud Advantages • Easier to use • Highest Performance • Ultra Fast Launch Times • Lower Cost • Faster time to Value • Bare-Metal Acceleration • Enterprise Accounting • Application Marketplace • Private Apps https://www.slideshare.net/IndrajitPoddar/fast-scalable-easy-machine- learning-with-openpower-gpus-and-docker Experience performance with productivity A superior integrated stack and adequate hardware resources for deep learning insights
  • 40. 40 Launch deep learning training by one-clickData labeling Monitor the training progress Deploy the inference API to data center Generate and deploy the DL inference accelerator onto FPGA DL Engineer could get optimized model parameters DL Insight DL Engineer DSX Inject the designed DL network into AI Vision AI Vision Develop the DL neural network via the interactive GUI Solution developer PowerAI Inference Engine Test engineer Error results will be looped back to trigger new training task "Easier Insights with Data Science Experience and PowerAI Deep Learning" - https://ibm.box.com/s/m7ooeoi738rs7dq9l9v0i9iir79t4xmd Analytics Signature Moment Event in Munich: https://www.ibm.com/analytics/us/en/events/machine-learning/
  • 41. • a 10x increase in inspections/day • a 90%decrease in inspection time • a Significant reduction in worker accidents Example value realized by an Asian Utility company using PowerAI