Presented during the "Introduction to H2O4GPU and Driverless AI" webinar on April 11th, 2018.
Watch the recording here:
https://attendee.gotowebinar.com/register/6156356209443281667?source=SlideshareH2O4GPU
5. 5
RISE OF GPU COMPUTING
GPU-Computing perf
1.5X per year
1000X
by
2025
102
103
104
105
106
107
Single-threaded perf
1.5X per year
1.1X per year
APPLICATIONS
SYSTEMS
ALGORITHMS
CUDA
ARCHITECTURE
6. H2O4GPU
/ Open-Source: http://github.com/h2oai/h2o4gpu
/ Used within our own Driverless AI Product to boost performance 30X
/ Scikit-Learn Python API (and now R API)
/ All Scikit-Learn algorithms included
/ Important algorithms ported to GPU
16. Gradient Boosting Machines
/ Based upon XGBoost
/ Raw floating point data -> Binned into Quantiles
/ Quantiles are stored as compressed instead of floats
/ Compressed Quantiles are efficiently transferred to GPU
/ Sparsity is handled directly with highly GPU efficiency
/ Multi-GPU by sharding rows using NVIDIA NCCL AllReduce