Watson Machine Learning
Community Edition
WML-CE
—
Pradipta Ghosh
Distributed Machine learning
IBM Cognitive Systems
Agenda
WML-CE Overview
Setup Hands-on
SnapML Hands-on
Watson Machine Learning Community Edition
Open Source – Enhanced and delivered as Conda Packages
Curated, tested and pre-compiled binary software distribution that enables enterprises to quickly and easily deploy deep learning
for their data science and analytics development
SnapML
WML CE
(PowerAI)
Deep Learning Impact
(DLI) Module
Data & Model
Management, ETL,
Visualize, Advise
IBM Spectrum Conductor with Spark
Cluster Virtualization,
Dynamic Resource Orchestration,
Multiple Frameworks, Distributed Execution Engine
PowerAI: Open Source ML Frameworks
Large Model Support (LMS)
Distributed Deep Learning
(DDL – 1000s of nodes)
Auto Hyper-parameter
Tuning
WML Accelerator
(PowerAI Enterprise)
Accelerated
Infrastructure
Accelerated Servers Storage
AI for
Data Scientists and
non-Data Scientists
Distributed Deep Learning (up
to 4 nodes)
SnapML
PowerAI Vision
Auto-DL for Images & Video
Label Train Deploy
H2O Driverless AI
Auto-ML for Text & Numeric Data, NLP
Import Experiment Deploy
Train larger, more complex models
Traditional Model Support
Limited memory on GPU forces tradeoff in model size / data resolution
Diagram: CPU with DDR4 system memory connected to GPU graphics memory over PCIe ("System Bottleneck Here")
Large Model Support
Use system memory and GPU to support more complex and higher resolution data
Diagram: POWER CPU with DDR4 system memory connected to GPU graphics memory over NVLink ("POWER NVLink Data Pipe")
Diagram (two identical halves): 1 TB Memory, Power 9 CPU, two V100 GPUs; labels: 170GB/s, NVLink 150 GB/s
IBM AC922 Power System
Deep Learning Server (4-GPU Config)
Store Large Models
in System Memory
Operate on One
Layer at a Time
Fast Transfer
via NVLink
5x Faster Data Communication with Unique CPU-GPU NVLink High-Speed
Connection
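The three steps above (store the model in system memory, operate on one layer at a time, transfer over NVLink) can be sketched as simple arithmetic. This is only an illustration: the layer sizes and the GPU memory size are hypothetical, the 150 GB/s figure is the NVLink number quoted on the slide, and a real LMS implementation also has to account for activations and other tensors.

```python
# Illustrative arithmetic only; not the LMS implementation.
NVLINK_GB_PER_S = 150.0  # bandwidth figure quoted on the slide


def fits_whole_model(layer_sizes_gb, gpu_memory_gb):
    """Traditional approach: every layer must be resident on the GPU at once."""
    return sum(layer_sizes_gb) <= gpu_memory_gb


def fits_layer_at_a_time(layer_sizes_gb, gpu_memory_gb):
    """Layer-at-a-time approach: only the largest single layer must fit."""
    return max(layer_sizes_gb) <= gpu_memory_gb


def swap_in_seconds(layer_sizes_gb, bandwidth_gb_per_s=NVLINK_GB_PER_S):
    """Time to stream every layer once from system memory to the GPU."""
    return sum(layer_sizes_gb) / bandwidth_gb_per_s


layers_gb = [6.0, 12.0, 9.0, 3.0]  # hypothetical layer sizes
gpu_memory_gb = 16.0               # hypothetical GPU memory size

whole = fits_whole_model(layers_gb, gpu_memory_gb)
per_layer = fits_layer_at_a_time(layers_gb, gpu_memory_gb)
transfer_s = swap_in_seconds(layers_gb)

print(f"whole model fits on GPU: {whole}")
print(f"fits one layer at a time: {per_layer}")
print(f"one full pass over NVLink: {transfer_s:.2f} s")
```

With these made-up sizes the 30 GB model does not fit in GPU memory as a whole, but each layer does, and streaming all layers once takes a fraction of a second at the quoted bandwidth.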
Distributed Deep
Learning (DDL)
Deep learning training takes
days to weeks
Limited scaling to
multiple x86 servers
PowerAI with DDL enables
scaling to 100s of GPUs
16 Days Down to 7 Hours
58x Faster
Chart: 1 System (16 Days) vs. 64 Systems (7 Hours); ResNet-101, ImageNet-22K
Near Ideal Scaling to 256 GPUs
Chart: Speedup vs. Number of GPUs (4, 16, 64, 256), Ideal Scaling vs. DDL Actual Scaling; 95% scaling with 256 GPUs
Caffe with PowerAI DDL, Running on Minsky (S822LC) Power System
ResNet-50, ImageNet-1K
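To make the "95% scaling with 256 GPUs" figure concrete, the relationship between speedup and scaling efficiency can be written out as below. This is a minimal sketch assuming the usual definition (efficiency = measured speedup / number of GPUs); the slide does not state the exact formula used.

```python
# Assumed definition: efficiency = measured speedup / number of GPUs.
def scaling_efficiency(speedup, n_gpus):
    """Fraction of ideal (linear) speedup actually achieved."""
    return speedup / n_gpus


def implied_speedup(efficiency, n_gpus):
    """Speedup implied by a given scaling efficiency."""
    return efficiency * n_gpus


gpus = 256
actual = implied_speedup(0.95, gpus)  # 95% is the figure from the slide
print(f"ideal speedup: {gpus}x, at 95% efficiency: {actual:.1f}x")
```

Under that definition, 95% efficiency on 256 GPUs corresponds to a speedup of roughly 243x over a single GPU, compared with 256x for ideal scaling.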
WML-CE: conda distribution
What is Conda
(and why should I care?)
It’s a packaging format
It has its own packaging manager
It has its own packaging list
Integrated dependency solver
Acquires software from a repository by URL (similar to Git)
Importantly for us… it allows you to install multiple instances
https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/
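As a rough sketch of what "acquires software from a repository by URL" and "multiple instances" look like in practice, the snippet below builds the conda commands for one isolated environment that uses the channel URL above. The environment name `wmlce_env`, the Python version, and the `powerai` package name are assumptions for illustration; check the release notes for the names that apply to your version. The commands are only printed unless `dry_run=False` is passed.

```python
import shlex
import subprocess

# Channel URL taken from the slide.
CHANNEL = "https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/"


def wmlce_setup_commands(env_name="wmlce_env", python_version="3.6", package="powerai"):
    """Build conda commands for one isolated environment.

    env_name, python_version and package are illustrative assumptions.
    """
    return [
        ["conda", "config", "--prepend", "channels", CHANNEL],
        ["conda", "create", "--yes", "--name", env_name, f"python={python_version}"],
        ["conda", "install", "--yes", "--name", env_name, package],
    ]


def run(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


commands = wmlce_setup_commands()
run(commands)  # dry run: prints the commands without touching the system
```

Calling `wmlce_setup_commands` again with a different `env_name` gives a second, independent installation next to the first one.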
We haven’t forgotten Docker!
https://hub.docker.com/r/ibmcom/powerai
NEW: Images with individual frameworks
- Base repository image (no frameworks installed)
- TensorFlow based image (py36, py37)
- PyTorch based image (py36, py37)
- Caffe-ibm based image (py36, py37)
- SnapML based image (py36, py37)
- All frameworks (py36, py37)
We now provide Red Hat Universal Base Images too!
https://access.redhat.com/containers/#/product/18c03ee6ba6a3657
More choice, more flexibility, more simplicity
CE Strategy
• Freely available
https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/
• Provide as Bare Metal and Container
• Support upstream CI environments
• Contribute features and bug fixes
• Release Quarterly
• Engage with Conda Forge
• Sample data
https://github.com/IBM/powerai
WML-CE setup Hands-on
What data science methods are used at work?
Distributed Machine Learning / December, 2019 / © 2019 IBM Corporation
Source: Kaggle Data Science Survey 2019
Machine Learning, Artificial Intelligence, Deep Learning
Logistic Regression (80%)
Decision Trees / Random Forest (75%)
Gradient Boosting Machines (64%)
SVMs
CNNs (43%)
DNNs (32%)
RNNs (30%)
GANs (7%)
Bayesian Techniques (31%)
Evolutionary Approaches (7%)
scikit-learn is the most widely-used ML framework
Source: Kaggle Data Science Survey 2018
Why?
• Wide variety of ML models.
• Good documentation.
• Standardized API.
Source: Kaggle Data Science Survey 2019
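The "Standardized API" point refers to the estimator convention in which every model is configured in its constructor, trained with `fit(X, y)` and queried with `predict(X)`. The toy class below is a hypothetical, pure-Python illustration of that convention only; it is not scikit-learn or Snap ML code, and `TinyLogisticRegression` is a made-up name.

```python
import math


class TinyLogisticRegression:
    """Toy estimator following the fit/predict convention (illustration only)."""

    def __init__(self, learning_rate=0.5, n_iter=200):
        self.learning_rate = learning_rate
        self.n_iter = n_iter

    def _probability(self, row):
        z = self.intercept_ + sum(w * v for w, v in zip(self.coef_, row))
        return 1.0 / (1.0 + math.exp(-z))

    def fit(self, X, y):
        """Plain batch gradient descent on the logistic loss."""
        n_samples, n_features = len(X), len(X[0])
        self.coef_ = [0.0] * n_features
        self.intercept_ = 0.0
        for _ in range(self.n_iter):
            grad_w = [0.0] * n_features
            grad_b = 0.0
            for row, target in zip(X, y):
                error = self._probability(row) - target
                for j, value in enumerate(row):
                    grad_w[j] += error * value
                grad_b += error
            for j in range(n_features):
                self.coef_[j] -= self.learning_rate * grad_w[j] / n_samples
            self.intercept_ -= self.learning_rate * grad_b / n_samples
        return self  # returning self allows chaining, as estimators commonly do

    def predict(self, X):
        return [1 if self._probability(row) >= 0.5 else 0 for row in X]


X_train = [[-2.0], [-1.0], [1.0], [2.0]]
y_train = [0, 0, 1, 1]
model = TinyLogisticRegression().fit(X_train, y_train)
predictions = model.predict([[-1.5], [1.5]])
print(predictions)
```

Because every estimator exposes the same two methods, swapping one model or library for another that follows the convention usually means changing only the import and the constructor call.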
Snap ML: Accelerating Machine Learning
Why Fast?
Performance matters for:
• online re-training of models
• model selection and hyper-parameter tuning
• fast adaptability to changes
Why Large-Scale?
Large datasets arise in business-critical applications:
recommendation, credit fraud, advertising,
space exploration, weather, etc.
Why Resource-Savvy?
Increased Resource Utilization. Less idle time.
Less usage means savings, higher profit margin.
Why Interpretable?
Necessary feature for regulated industries where
accountability is critical.
Snap ML is a set of compute libraries that transparently
accelerate open source frameworks for training Machine
Learning (ML) Models
Its main characteristics are:
Fast
Scalable
Consumable
Interpretable
Resource-efficient
Core publication: https://arxiv.org/abs/1803.06333
Snap ML Features
Decision Trees
Random Forest
WMLCE 1.6.1 (2Q19)
Boosting Machine
WMLCE 1.6.2 (4Q19)
Linear Regression
Logistic Regression
WMLCE 1.6.0 (1Q19)
SVM
Source: Kaggle ML & DS Survey (Nov. 2019)
In February 2020 we released the 7th version of
Snap ML with IBM WML-CE 1.7.0
Snap ML offers today’s most popular ML models
Multi-Core, Multi-Socket &
GPU Acceleration
Distributed Training: Multi-CPU & Multi-GPU
GPU Accelerated
Logistic Regression
Linear Regression
Support Vector Machines
SnapBoost
Multi-Core, Multi-CPU
Decision Trees
Random Forests
CPU-GPU Memory
Management
APIs for Popular ML
Frameworks
Snap ML
Distributed High Performance Machine Learning Library
Snap Machine Learning (ML) Library
Distributed Training GPU Acceleration Sparse Data Optimization
Accelerated and Distributed ML in WML CE
Snap ML
scikit-learn fork (pai4sk)
Logistic Regression, Random Forest, Decision Tree, SVM, Ridge/Lasso Regression, SnapBoost
RAPIDS
cuDF, cuML
Watson Machine Learning CE
TensorFlow
PyTorch
Caffe
Keras
ML solutions DL solutions
DDL
LMS
DMLC XGBoost
Dask
Snap ML Value Proposition
Leadership in AI for business:
Leading performance, response time
Scalability to multi-TB datasets
Higher efficiency, translating to lower cost
Higher accuracy, translating to higher profits
Explainability for regulated industries
Leading ML framework
Strong differentiation in performance, scalability,
accuracy
Applied in use cases across FSS, Retail, Advertising:
Fraud detection, credit default prediction, stock
prediction, pricing, sales forecasting, CTR prediction
>10x faster than Scikit-learn
Chart: Training time (s) on the PRICE PREDICTION dataset: sklearn 398.9, snap-CPU 17.3 (23x), snap-1GPU 12.7 (31x)
Handling TB-scale datasets
Chart: Runtime (minutes) on the Criteo TB dataset (4.2B rows): TensorFlow on 90 x86 servers (CPU-only) 1.1 hours, Snap ML on 4 Power9 servers with GPUs 1.53 minutes (46x faster)
More accurate than XGBoost
Chart: OpenML.org, 48 binary classification datasets; regions marked "XGBoost better" and "SnapBoost better"
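The 23x and 31x labels on the price prediction chart follow directly from the training times shown there. A short check, using only the three values from the slide:

```python
def speedup(baseline_seconds, accelerated_seconds):
    """How many times faster the accelerated run is than the baseline."""
    return baseline_seconds / accelerated_seconds


# Training times in seconds, as printed on the chart.
training_time_s = {"sklearn": 398.9, "snap-CPU": 17.3, "snap-1GPU": 12.7}

baseline = training_time_s["sklearn"]
speedups = {
    name: speedup(baseline, seconds)
    for name, seconds in training_time_s.items()
    if name != "sklearn"
}

for name, value in speedups.items():
    print(f"{name}: {value:.1f}x faster than sklearn")
```

Rounded to whole numbers, the ratios come out at 23x for snap-CPU and 31x for snap-1GPU, matching the chart labels.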
SnapML Hands-on
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptxMaking_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
Making_way_through_DLL_hollowing_inspite_of_CFG_by_Debjeet Banerjee.pptx
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 

WML SNAP ML

  • 1. Watson Machine Learning Community Edition (WML-CE). Pradipta Ghosh, Distributed Machine Learning, IBM Cognitive Systems
  • 3. Watson Machine Learning Community Edition: open source, enhanced and delivered as Conda packages. A curated, tested and pre-compiled binary software distribution that enables enterprises to quickly and easily deploy deep learning for their data science and analytics development. SnapML
  • 4. WML CE (PowerAI); Deep Learning Impact (DLI) Module: Data & Model Management, ETL, Visualize, Advise; IBM Spectrum Conductor with Spark: Cluster Virtualization, Dynamic Resource Orchestration, Multiple Frameworks, Distributed Execution Engine; PowerAI: Open Source ML Frameworks; Large Model Support (LMS); Distributed Deep Learning (DDL, 1000s of nodes); Auto Hyper-parameter Tuning; WML Accelerator (PowerAI Enterprise); Accelerated Infrastructure: Accelerated Servers, Storage; AI for Data Scientists and non-Data Scientists; Distributed Deep Learning (up to 4 nodes); SnapML; PowerAI Vision: Auto-DL for Images & Video (Label, Train, Deploy); H2O Driverless AI: Auto-ML for Text & Numeric Data, NLP (Import, Experiment, Deploy)
  • 5. Train larger, more complex models. Traditional Model Support: limited memory on GPU forces a tradeoff in model size / data resolution. Large Model Support: use system memory and GPU to support more complex and higher resolution data. [Diagram, traditional: CPU, DDR4, GPU, PCIe, graphics memory, "System Bottleneck Here". Diagram, LMS: POWER CPU, DDR4, GPU, NVLink, graphics memory, "POWER NVLink Data Pipe".]
  • 6. IBM AC922 Power System, Deep Learning Server (4-GPU config). Store large models in system memory; operate on one layer at a time; fast transfer via NVLink. 5x faster data communication with unique CPU-GPU NVLink high-speed connection. [Diagram: two Power 9 CPUs, each with 1 TB memory and two V100 GPUs; labels: 170 GB/s, NVLink 150 GB/s.]
  • 7. Distributed Deep Learning (DDL). Deep learning training takes days to weeks, with limited scaling to multiple x86 servers. PowerAI with DDL enables scaling to 100s of GPUs. 1 system vs. 64 systems: 16 days down to 7 hours, 58x faster (ResNet-101, ImageNet-22K). Near-ideal scaling to 256 GPUs: 95% scaling with 256 GPUs (Caffe with PowerAI DDL, running on Minsky (S822LC) Power System; ResNet-50, ImageNet-1K). [Chart: speedup vs. number of GPUs, ideal scaling against DDL actual scaling.]
  • 8. WML-CE: conda distribution. What is Conda (and why should I care?) It's a packaging format. It has its own package manager. It has its own package list. It has an integrated dependency solver. It acquires software from a repository by URL (similar to Git). Importantly for us, it allows you to install multiple instances. https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/
  • 9. We haven't forgotten Docker! https://hub.docker.com/r/ibmcom/powerai NEW: images with individual frameworks: base repository image (no frameworks installed); TensorFlow based image (py36, py37); PyTorch based image (py36, py37); Caffe-ibm based image (py36, py37); SnapML based image (py36, py37); all frameworks (py36, py37). We now provide Red Hat Universal Base images too! https://access.redhat.com/containers/#/product/18c03ee6ba6a3657 More choice, more flexibility, more simplicity.
  • 10. CE Strategy: freely available (https://developer.ibm.com/linuxonpower/deep-learning-powerai/releases/); provide as bare metal and container; support upstream CI environments; contribute features and bug fixes; release quarterly; engage with Conda Forge; sample data (https://github.com/IBM/powerai)
  • 12. What data science methods are used at work? Distributed Machine Learning / December, 2019 / © 2019 IBM Corporation Source: Kaggle Data Science Survey 2019
  • 13. Artificial Intelligence, Machine Learning, Deep Learning. Logistic Regression (80%); Decision Trees / Random Forest (75%); Gradient Boosting Machines (64%); SVMs; CNNs (43%); DNNs (32%); RNNs (30%); GANs (7%); Bayesian Techniques (31%); Evolutionary Approaches (7%)
  • 14. scikit-learn is the most widely used ML framework. Why? Wide variety of ML models; good documentation; standardized API. Sources: Kaggle Data Science Survey 2018; Kaggle Data Science Survey 2019
  • 15. Snap ML: Accelerating Machine Learning. Snap ML is a set of compute libraries that transparently accelerate open source frameworks for training Machine Learning (ML) models. Its main characteristics are: fast, scalable, consumable, interpretable, resource-efficient. Why fast? Performance matters for online re-training of models, model selection and hyper-parameter tuning, and fast adaptability to changes. Why large-scale? Large datasets arise in business-critical applications: recommendation, credit fraud, advertising, space exploration, weather, etc. Why resource-savvy? Increased resource utilization and less idle time; less usage means savings and a higher profit margin. Why interpretable? It is a necessary feature for regulated industries where accountability is critical. Core publication: https://arxiv.org/abs/1803.06333
  • 16. Snap ML Features. Snap ML offers today's most popular ML models. Decision Trees, Random Forest: WMLCE 1.6.1 (2Q19). Boosting Machine: WMLCE 1.6.2 (4Q19). Linear Regression, Logistic Regression: WMLCE 1.6.0 (1Q19). SVM. In February 2020 we released the 7th version of Snap ML with IBM WML-CE 1.7.0. Source: Kaggle ML & DS Survey (Nov. 2019)
  • 17. Snap ML: Distributed High Performance Machine Learning Library. Multi-core, multi-socket & GPU acceleration. Distributed training: multi-CPU & multi-GPU. GPU accelerated: Logistic Regression, Linear Regression, Support Vector Machines, SnapBoost. Multi-core, multi-CPU: Decision Trees, Random Forests. CPU-GPU memory management. APIs for popular ML frameworks. Snap Machine Learning (ML) Library: distributed training, GPU acceleration, sparse data optimization.
  • 18. Accelerated and Distributed ML in WML CE. Watson Machine Learning CE. ML solutions: Snap ML (Logistic Regression, Random Forest, Decision Tree, SVM, Ridge/Lasso Regression, SnapBoost); scikit-learn fork (pai4sk); RAPIDS (cuDF, cuML); DMLC XGBoost; Dask. DL solutions: TensorFlow, PyTorch, Caffe, Keras; DDL; LMS.
  • 19. Snap ML Value Proposition. Leadership in AI for business: leading performance and response time; scalability to multi-TB datasets; higher efficiency, translating to lower cost; higher accuracy, translating to higher profits; explainability for regulated industries. Leading ML framework: strong differentiation in performance, scalability, accuracy. Applied in use cases across FSS, Retail, Advertising: fraud detection, credit default prediction, stock prediction, pricing, sales forecasting, CTR prediction. >10x faster than scikit-learn: on the PRICE PREDICTION dataset, training time is 398.9 s for sklearn, 17.3 s for snap-CPU (23x) and 12.7 s for snap-1GPU (31x). Handling TB-scale datasets: on the Criteo TB dataset (4.2B rows), TensorFlow on 90 x86 servers (CPU-only) takes 1.1 hours and Snap ML on 4 Power9 servers with GPUs takes 1.53 minutes (46x faster). More accurate than XGBoost: [chart: "XGBoost better" vs. "SnapBoost better" on 48 binary classification datasets from OpenML.org].
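
The speedup labels on slide 19 can be checked against the training times quoted there. The sketch below is only an illustration: the helper names `speedup` and `effective_speedup` are hypothetical, and reading the "95% scaling with 256 GPUs" figure from slide 7 as speedup divided by GPU count is an assumption, not something the slides state.

```python
# Sanity-check arithmetic for figures quoted in the slides.
# Helper names are hypothetical; they are not part of Snap ML or WML-CE.

def speedup(baseline_s: float, accelerated_s: float) -> float:
    """Return how many times faster the accelerated run is than the baseline."""
    if baseline_s <= 0 or accelerated_s <= 0:
        raise ValueError("times must be positive")
    return baseline_s / accelerated_s


def effective_speedup(n_workers: int, efficiency: float) -> float:
    """Speedup over one worker, ASSUMING efficiency = speedup / n_workers."""
    return n_workers * efficiency


# Training times in seconds on the PRICE PREDICTION dataset (slide 19).
TRAIN_TIME_S = {"sklearn": 398.9, "snap-CPU": 17.3, "snap-1GPU": 12.7}

cpu_speedup = speedup(TRAIN_TIME_S["sklearn"], TRAIN_TIME_S["snap-CPU"])
gpu_speedup = speedup(TRAIN_TIME_S["sklearn"], TRAIN_TIME_S["snap-1GPU"])

# Slide 7 quotes 95% scaling with 256 GPUs; under the assumption above this
# corresponds to the effective speedup computed here.
ddl_speedup = effective_speedup(256, 0.95)

print(f"snap-CPU vs sklearn:  {cpu_speedup:.1f}x")
print(f"snap-1GPU vs sklearn: {gpu_speedup:.1f}x")
print(f"DDL on 256 GPUs:      {ddl_speedup:.1f}x over one GPU (assumed definition)")
```

Rounded to whole numbers, the first two ratios match the 23x and 31x labels on the chart.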

Editor's Notes

  1. FREE. CAN TRY TODAY
  2. 4