World's Fastest Machine Learning With GPUs


Presented during the "Introduction to H2O4GPU and Driverless AI" webinar on April 11th, 2018. Watch the recording here: https://attendee.gotowebinar.com/register/6156356209443281667?source=SlideshareH2O4GPU


World's Fastest
Machine Learning
With GPUs
http://github.com/h2oai/h2o4gpu
Speaker: Jonathan C. McKinney

H2O4GPU TEAM
Mateusz Erin Navdeep Rory Terry
Karen Arno Jonathan Steve

RISE OF GPU COMPUTING
[Figure: log-scale performance chart, axis from 10^2 to 10^7. GPU-Computing perf: 1.5X per year, 1000X by 2025. Single-threaded perf: 1.5X per year, then 1.1X per year. Stack labels: Applications, Systems, Algorithms, CUDA, Architecture.]
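The growth rates on this slide can be checked with a little compound-growth arithmetic. The sketch below is only an illustration: the helper names are hypothetical, and the slide text does not state the baseline year, so the code computes only how many years of 1.5X annual growth are needed to reach 1000X and what 1.1X annual growth yields over the same span.

```python
import math


def years_to_reach(factor: float, rate_per_year: float) -> float:
    """Years of compound growth at rate_per_year needed to reach factor."""
    return math.log(factor) / math.log(rate_per_year)


def growth_after(years: float, rate_per_year: float) -> float:
    """Total growth factor after the given number of years."""
    return rate_per_year ** years


# Hypothetical helper names; the rates and the 1000X target come from the slide.
years = years_to_reach(1000.0, 1.5)
print(f"1.5X per year reaches 1000X after about {years:.1f} years")
print(f"1.1X per year over the same span gives about {growth_after(years, 1.1):.1f}X")
```

Under these assumptions, 1000X corresponds to roughly 17 years of 1.5X annual growth, while 1.1X annual growth over the same period yields only about a 5X gain.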

H2O4GPU
/ Open-Source: http://github.com/h2oai/h2o4gpu
/ Used within our own Driverless AI Product to boost performance 30X
/ Scikit-Learn Python API (and now R API)
/ All Scikit-Learn algorithms included
/ Important algorithms ported to GPU

K-Means
• Significantly faster than Scikit-learn implementation (50x)
• Significantly faster than other GPU implementations (5x-10x)
• Supports kmeans++/kmeans|| initialization
• Supports multiple GPUs
• Supports batching data if it exceeds GPU memory

K-Means demo notebook: https://github.com/h2oai/h2o4gpu/blob/master/examples/py/demos/H2O4GPU_KMeans_Images.ipynb
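Two of the K-Means bullets, kmeans++ initialization and batching, can be illustrated with a small CPU-only sketch. This is not the H2O4GPU implementation: it is plain Python, the function names are hypothetical, and it assumes "batching" means accumulating per-cluster sums and counts one chunk of rows at a time so that only a chunk needs to be resident at once.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_pp_init(points, k, rng):
    """k-means++ seeding: each new centre is drawn with probability
    proportional to its squared distance from the nearest chosen centre."""
    centers = [rng.choice(points)]
    while len(centers) < k:
        d2 = [min(sq_dist(p, c) for c in centers) for p in points]
        if sum(d2) == 0.0:
            # Every point coincides with a chosen centre; fall back to uniform.
            centers.append(rng.choice(points))
        else:
            centers.append(rng.choices(points, weights=d2, k=1)[0])
    return centers


def lloyd_step_batched(points, centers, batch_size):
    """One Lloyd iteration that visits batch_size points at a time and
    accumulates per-cluster sums and counts across batches."""
    k, dim = len(centers), len(centers[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k
    for start in range(0, len(points), batch_size):
        for p in points[start:start + batch_size]:
            j = min(range(k), key=lambda i: sq_dist(p, centers[i]))
            counts[j] += 1
            for d in range(dim):
                sums[j][d] += p[d]
    # Empty clusters keep their previous centre.
    return [
        tuple(s / counts[j] for s in sums[j]) if counts[j] else tuple(centers[j])
        for j in range(k)
    ]


# Synthetic data: three tight blobs (illustrative values only).
rng = random.Random(0)
points = [(rng.gauss(cx, 0.1), rng.gauss(cy, 0.1))
          for cx, cy in [(0, 0), (5, 5), (0, 5)] for _ in range(50)]
initial_centers = kmeans_pp_init(points, 3, rng)
centers = initial_centers
for _ in range(10):
    centers = lloyd_step_batched(points, centers, batch_size=32)
print(centers)
```

Because the accumulated sums and counts do not depend on where the batch boundaries fall, the batched step returns the same centres as a single pass over all rows, which is what makes this kind of batching safe when the data does not fit in memory.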

Gradient Boosting Machines
/ Based upon XGBoost
/ Raw floating point data -> Binned into Quantiles
/ Quantiles are stored in compressed form instead of as floats
/ Compressed Quantiles are efficiently transferred to GPU
/ Sparsity is handled directly with high GPU efficiency
/ Multi-GPU by sharding rows using NVIDIA NCCL AllReduce
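The binning and row-sharding ideas above can be sketched on the CPU in a few lines. This is a simplified illustration, not XGBoost's or H2O4GPU's actual code: the function names are hypothetical, the cut points use simple rank-based quantiles, "compressed" is approximated by storing one unsigned byte per value, and the AllReduce is simulated by an element-wise sum over per-shard gradient histograms rather than by NCCL.

```python
from array import array
from bisect import bisect_right


def quantile_cuts(values, n_bins):
    """Return n_bins - 1 cut points at evenly spaced ranks of the sorted values."""
    s = sorted(values)
    return [s[(i * len(s)) // n_bins] for i in range(1, n_bins)]


def to_bins(values, cuts):
    """Map each float to its bin index, stored as one unsigned byte
    (assumes at most 256 bins)."""
    return array("B", (bisect_right(cuts, v) for v in values))


def shard_histogram(bins, grads, n_bins):
    """Sum the gradients that fall into each bin for one shard of rows."""
    hist = [0.0] * n_bins
    for b, g in zip(bins, grads):
        hist[b] += g
    return hist


def all_reduce_sum(per_shard):
    """Stand-in for AllReduce: the element-wise sum every shard would receive."""
    return [sum(column) for column in zip(*per_shard)]


# Illustrative feature values and gradients (made up for this sketch).
values = [0.1, 2.5, 3.7, 0.4, 9.9, 5.0, 7.2, 1.8]
grads = [1.0, -1.0, 0.5, 0.5, 2.0, -0.5, 1.5, -2.0]
n_bins = 4

cuts = quantile_cuts(values, n_bins)
bins = to_bins(values, cuts)

# Shard the rows across two hypothetical devices, then combine histograms.
shards = [(bins[:4], grads[:4]), (bins[4:], grads[4:])]
reduced = all_reduce_sum([shard_histogram(b, g, n_bins) for b, g in shards])
print(list(bins), reduced)
```

The combined histogram matches the one computed over all rows at once, which is why sharding by rows only requires exchanging small per-bin summaries rather than the data itself.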

[Figure: benchmark chart showing the values 171 (with latest solver), 87, and 51; axis labels not recoverable. Video: https://www.youtube.com/watch?v=NkeSDrifJdg]

H2O4GPU
http://github.com/h2oai/h2o4gpu
https://stackoverflow.com/questions/tagged/h2o4gpu
https://gitter.im/h2oai/h2o4gpu
Thank You!
Questions?


3. Machine Learning c Deep Learning


8. Driverless AI

9. Driverless AI


12. 10 with latest solver


14. Principal Component Analysis (PCA)
