Presented at #H2OWorld 2017 in Mountain View, CA.
Enjoy the recording: https://youtu.be/NyaJ7uDroww.
Learn more about H2O.ai: https://www.h2o.ai/.
Follow @h2oai: https://www.twitter.com/h2oai.
Driverless AI - Intro + Interactive Hands-on Lab - Sri Ambati
Enjoy the webinar recording here: https://youtu.be/Lll1qwQJKVw.
Driverless AI speeds up data science workflows by automating feature engineering, model tuning, ensembling, and model deployment.
In this presentation, Arno Candel (CTO, H2O.ai) gives a quick overview and guides attendees through an interactive hands-on lab using Qwiklabs.
Driverless AI turns Kaggle-winning recipes into production-ready code and is specifically designed to avoid common mistakes such as under- or overfitting, data leakage, and improper model validation. Avoiding these pitfalls alone can save weeks or more per model and is necessary to achieve high modeling accuracy.
With Driverless AI, everyone can now train and deploy modeling pipelines with just a few clicks from the GUI. Advanced users can use the client/server API through a variety of languages such as Python, Java, C++, Go, C#, and many more. To speed up training, Driverless AI uses highly optimized C++/CUDA algorithms to take full advantage of the latest compute hardware.
For example, Driverless AI runs orders of magnitude faster on the latest NVIDIA GPU supercomputers on Intel and IBM platforms, both in the cloud and on-premises. There are two more product innovations in Driverless AI: statistically rigorous automatic data visualization and interactive model interpretation with reason codes and explanations in plain English. Both help data scientists and analysts quickly validate the data and models.
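The validation pitfalls mentioned above often come down to how the data is split. As a minimal illustration (a hand-rolled sketch with made-up data, not part of Driverless AI), a time-based holdout keeps every validation row strictly after every training row, which is one simple guard against leakage:

```python
# Sketch: leakage-safe validation for time-ordered data.
# A random split lets "future" rows leak into training; splitting
# by time keeps the validation set strictly after the training set.

def time_split(rows, valid_fraction=0.2, time_key=lambda r: r["ts"]):
    """Sort by timestamp, then hold out the most recent rows for validation."""
    ordered = sorted(rows, key=time_key)
    cut = int(len(ordered) * (1 - valid_fraction))
    return ordered[:cut], ordered[cut:]

# Hypothetical dataset: 10 daily observations.
data = [{"ts": day, "y": day * 2} for day in range(10)]
train, valid = time_split(data, valid_fraction=0.2)

assert len(train) == 8 and len(valid) == 2
# Every validation timestamp is strictly later than every training one:
assert max(r["ts"] for r in train) < min(r["ts"] for r in valid)
```

The same discipline (validation data never overlapping or preceding training data) is what automated pipelines have to enforce for you.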
World's Fastest Machine Learning With GPUs - Sri Ambati
Presented during the "Introduction to H2O4GPU and Driverless AI" webinar on April 11th, 2018.
Watch the recording here:
https://attendee.gotowebinar.com/register/6156356209443281667?source=SlideshareH2O4GPU
From H2O to Steam - Dr. Bingwei Liu, Sr. Data Engineer, Aetna - Sri Ambati
Presented at #H2OWorld 2017 in Mountain View, CA.
Enjoy the recording: https://youtu.be/l75rU63eRtM
Learn more about H2O.ai: https://www.h2o.ai/.
Follow @h2oai: https://twitter.com/h2oai.
Dr. Bingwei Liu is a Sr. Data Engineer at Aetna Inc. He works on researching and supporting new technologies in a Hadoop environment, user education, and cloud engineering.
Get Behind the Wheel with H2O Driverless AI Hands-On Training - Sri Ambati
This training took place in London on October 30th, 2018.
A hands-on training on our ground-breaking product, H2O Driverless AI, was delivered by the following makers:
1. Introduction to Driverless AI by Arno Candel
2. Feature Engineering in Driverless AI by Dmitry Larko
3. Time Series in Driverless AI by Marios Michailidis and Mathias Müller
4. NLP in Driverless AI by Sudalai Rajkumar
5. Interpretability in Driverless AI by Arno Candel
Scaling out Driverless AI with IBM Spectrum Conductor - Kevin Doyle - H2O AI ... - Sri Ambati
This talk was recorded in London on Oct 30, 2018 and can be viewed here: https://youtu.be/lk2NXurrwAA
This talk highlights the integration of Driverless AI with IBM Spectrum Conductor. The integration demonstrates how you can deploy, manage, and scale out multiple Driverless AI instances per user within your cluster, helping maximize the cluster's efficiency and security. The integration includes failover for Driverless AI instances, so users can continue to work without needing to find another host on which to start Driverless AI. In addition, the talk highlights the integration of H2O Sparkling Water with IBM Spectrum Conductor as a notebook, as well as the benefits of running H2O Sparkling Water within the cluster to maximize cluster utilization across different workloads. For both Driverless AI and H2O Sparkling Water, a demo is provided and a future plan for the integrations is outlined.
Bio: Kevin Doyle is the lead architect of IBM Spectrum Conductor at IBM, where he works with customers to deploy and manage workloads, especially Spark and deep learning workloads, on on-premises clusters. Kevin has worked on distributed computing, grid, cloud, and big data for the past five years, with a focus on the management and lifecycle of workloads.
Scalable and Automatic Machine Learning with H2O - Sri Ambati
H2O is widely used for machine learning projects. A TechCrunch article, published in January 2017 by John Mannes, reported that around 20% of Fortune 500 companies use H2O.
Talk 1: Introduction to Scalable & Automatic Machine Learning with H2O
In recent years, the demand for machine learning experts has outpaced the supply, despite the surge of people entering the field. To address this gap, there have been big strides in the development of user-friendly machine learning software that can be used by non-experts. Although H2O and other tools have made it easier for practitioners to train and deploy machine learning models at scale, there is still a fair bit of knowledge and background in data science that is required to produce high-performing machine learning models.
In this presentation, Joe will introduce the AutoML functionality in H2O. H2O's AutoML provides an easy-to-use interface which automates the process of training a large, comprehensive selection of candidate models and a stacked ensemble model which, in most cases, will be the top performing model in the AutoML Leaderboard.
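The core AutoML loop can be sketched in a few lines of plain Python (a toy illustration with made-up candidate models and data, not H2O's actual API): train each candidate, score it on a holdout set, and sort the results into a leaderboard.

```python
# Toy sketch of the AutoML idea: train several candidate models and
# rank them on a holdout set. The candidates here are deliberately
# simple stand-ins for real learners.

def fit_mean(xs, ys):
    """Baseline model: always predict the training mean."""
    m = sum(ys) / len(ys)
    return lambda x: m

def fit_line(xs, ys):
    """Ordinary least squares for y = a*x + b."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / \
        sum((x - mx) ** 2 for x in xs)
    b = my - a * mx
    return lambda x: a * x + b

def mse(model, xs, ys):
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

# Synthetic linear data; a hypothetical train/holdout split.
train_x, train_y = list(range(20)), [3 * x + 1 for x in range(20)]
hold_x, hold_y = list(range(20, 25)), [3 * x + 1 for x in range(20, 25)]

candidates = {"mean": fit_mean, "ols_line": fit_line}
leaderboard = sorted(
    ((name, mse(fit(train_x, train_y), hold_x, hold_y))
     for name, fit in candidates.items()),
    key=lambda item: item[1],
)
print(leaderboard[0][0])  # → ols_line (lowest holdout error wins)
```

Real AutoML systems add cross-validation, hyperparameter search, and stacking on top of this ranking loop, but the leaderboard principle is the same.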
Talk 2: Making Multimillion-dollar Baseball Decisions with H2O AutoML and Shiny
Joe recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Bio: Jo-fai (or Joe) Chow is a data scientist at H2O.ai. Before joining H2O, he was on the business intelligence team at Virgin Media in the UK, where he developed data products to enable quick and smart business decisions. He also worked remotely for Domino Data Lab in the US as a data science evangelist, promoting products via blogging and talks at meetups. Joe has a background in water engineering. Before his data science journey, he was an EngD research engineer at the STREAM Industrial Doctorate Centre, working on machine learning techniques for drainage design optimization. Prior to that, he was an asset management consultant specializing in data mining and constrained optimization for the utilities sector in the UK and abroad. He also holds an MSc in Environmental Management and a BEng in Civil Engineering.
This is my Deep Water talk for the TensorFlow Paris meetup.
Deep Water is H2O's integration with multiple open source deep learning libraries such as TensorFlow, MXNet, and Caffe. On top of the performance gains from GPU backends, Deep Water naturally inherits all of H2O's properties in scalability, ease of use, and deployment.
This in-depth training on H2O Driverless AI was given by Wen Phan on June 28th, 2018. He elaborated on the automatic feature engineering, machine learning interpretability, and automatic visualization components of this groundbreaking product.
Automatic and Interpretable Machine Learning in R with H2O and LIME - Jo-fai Chow
This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
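The heart of LIME can be sketched without the lime package itself (everything below, including the function name and constants, is an illustrative assumption, not the package's API): sample points near the instance being explained, weight them by proximity, and fit a weighted linear surrogate whose slope serves as the local explanation.

```python
import math
import random

# Sketch of LIME's core idea for a single numeric feature:
# perturb around the instance x0, weight samples by a Gaussian
# proximity kernel, and fit a weighted least-squares line.

def lime_slope(black_box, x0, n_samples=500, width=0.5, seed=0):
    rng = random.Random(seed)
    xs = [x0 + rng.gauss(0, width) for _ in range(n_samples)]
    ys = [black_box(x) for x in xs]
    # Proximity kernel: samples far from x0 get little weight.
    ws = [math.exp(-((x - x0) ** 2) / (2 * width ** 2)) for x in xs]
    # Weighted least squares for the surrogate y = a*x + b; return a.
    sw = sum(ws)
    mx = sum(w * x for w, x in zip(ws, xs)) / sw
    my = sum(w * y for w, y in zip(ws, ys)) / sw
    a = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys)) / \
        sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
    return a

# Black-box model f(x) = x**2; its true local slope at x0 = 3 is 6,
# and the surrogate's slope lands close to that.
slope = lime_slope(lambda x: x * x, 3.0)
```

The real LIME framework generalizes this to many features and to classifiers, and reports the surrogate's coefficients as per-feature explanations.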
How Deep Learning Will Make Us More Human Again
While deep learning is taking over the AI space, most of us are struggling to keep up with the pace of innovation. Arno Candel shares success stories and challenges in training and deploying state-of-the-art machine learning models on real-world datasets. He will also share his insights into what the future of machine learning and deep learning might look like, and how to best prepare for it.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and Shiny - Jo-fai Chow
Joe recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Scalable Machine Learning in R and Python with H2OSri Ambati
The focus of this presentation is scalable machine learning using the h2o R and Python packages. H2O is an open source, distributed machine learning platform designed for big data, with the added benefit that it's easy to use on a laptop (in addition to a multi-node Hadoop or Spark cluster). The core machine learning algorithms of H2O are implemented in high-performance Java; however, fully featured APIs are available in R, Python, Scala, REST/JSON, and through a web interface.
Since H2O's algorithm implementations are distributed, this allows the software to scale to very large datasets that may not fit into RAM on a single machine. H2O currently features distributed implementations of Generalized Linear Models, Gradient Boosting Machines, Random Forest, Deep Neural Nets, Stacked Ensembles (aka "Super Learners"), dimensionality reduction methods (PCA, GLRM), clustering algorithms (K-means), anomaly detection methods, among others.
R and Python code with H2O machine learning code examples will be demoed live and will be made available on GitHub for participants to follow along on their laptops if they choose. For those interested in running the code on a multi-node Amazon EC2 cluster, an H2O AMI is also available.
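The stacked ensemble ("super learner") mentioned above can be sketched in plain Python (a toy illustration under assumed simple base models, not H2O's implementation): each base model produces out-of-fold predictions, and a meta-learner is then trained on those predictions rather than on the raw features.

```python
# Minimal super-learner sketch:
# 1) get cross-validated (out-of-fold) predictions from each base model;
# 2) fit a meta-learner on those predictions (here: the best convex
#    weight between two base models, found by grid search).

def fit_mean(xs, ys):
    m = sum(ys) / len(ys)
    return lambda x: m

def fit_line(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / \
        sum((x - mx) ** 2 for x in xs)
    b = my - a * mx
    return lambda x: a * x + b

def oof_predictions(fit, xs, ys, k=5):
    """Each point is predicted by a model that was not trained on it."""
    preds = [0.0] * len(xs)
    for fold in range(k):
        tr = [(x, y) for i, (x, y) in enumerate(zip(xs, ys)) if i % k != fold]
        model = fit([x for x, _ in tr], [y for _, y in tr])
        for i, x in enumerate(xs):
            if i % k == fold:
                preds[i] = model(x)
    return preds

xs = list(range(30))
ys = [2 * x + 5 for x in xs]          # synthetic linear target
p1 = oof_predictions(fit_mean, xs, ys)
p2 = oof_predictions(fit_line, xs, ys)

def mse(pred):
    return sum((p - y) ** 2 for p, y in zip(pred, ys)) / len(ys)

# Meta-learner: weight w for the blend w*p2 + (1-w)*p1.
best_w = min((w / 20 for w in range(21)),
             key=lambda w: mse([w * b + (1 - w) * a for a, b in zip(p1, p2)]))
```

On this linear data the meta-learner puts all its weight on the linear base model; on messier data the blend typically beats every single base model, which is why the stacked ensemble usually tops the leaderboard.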
Author Bio:
Dr. Erin LeDell is a Machine Learning Scientist at H2O.ai, the company that produces the open source machine learning platform, H2O. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from UC Berkeley. Before joining H2O.ai, she was the Principal Data Scientist at Wise.io (acquired by GE in 2016) and Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc.
Talk presented by Pedro Mário Cruz e Silva, Solution Architect at NVIDIA, as part of the program of the VIII Semana de Inverno de Geofísica, on July 19, 2017.
Harnessing the virtual realm for successful real world artificial intelligence - Alison B. Lowndes
Artificial intelligence is impacting all areas of society, from healthcare and transportation to smart cities and energy. This talk covers how NVIDIA invests in both internal pure research and accelerated computation to enable its diverse customer base across gaming and extended reality, graphics, AI, robotics, simulation, high-performance scientific computing, healthcare, and more. You will be introduced to the GPU computing platform, shown successfully deployed real-world applications, and given a glimpse into the current state of the art across academia, enterprise, and startups.
Nvidia Deep Learning Solutions - Alex Sabatier - Sri Ambati
Alex Sabatier from NVIDIA talks about the future of deep learning from a chipmaker's perspective.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
As artificial intelligence sweeps across the technology landscape, NVIDIA unveiled today at its annual GPU Technology Conference a series of new products and technologies focused on deep learning, virtual reality and self-driving cars.
At the 2018 GPU Technology Conference in Silicon Valley, NVIDIA CEO Jensen Huang announced the new "double-sized" 32GB Volta GPU; unveiled the NVIDIA DGX-2, the power of 300 servers in a box; showed an expanded inference platform with TensorRT 4 and Kubernetes on NVIDIA GPUs; and revealed the NVIDIA GPU Cloud registry with 30 GPU-optimized containers, now available from more cloud service providers. GTC attendees also got a sneak peek of the latest NVIDIA DRIVE software stack and the next DRIVE AI car computer, "Orin," along with developments in the NVIDIA Isaac platform for robotics and Project Clara, NVIDIA's medical imaging supercomputer.
Backend.AI Technical Introduction (19.09 / 2019 Autumn) - Lablup Inc.
This slide introduces technical specs and details about Backend.AI 19.09.
* On-premise clustering / container orchestration / scaling on cloud
* Container-level fractional GPU technology that lets many containers share one physical GPU at the same time
* NVIDIA GPU Cloud integrations
* Enterprise features
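The fractional-GPU idea above can be illustrated with a toy first-fit allocator (a sketch with made-up container names and capacities, not Backend.AI's actual scheduler): containers request fractions of a GPU, and the allocator packs them onto physical GPUs so long as no GPU's total exceeds 1.0.

```python
# Toy sketch of fractional GPU packing: first-fit placement of
# container requests (fractions of one GPU) onto physical GPUs.

def allocate(requests, num_gpus):
    """Return {container: gpu_index}; each GPU holds at most 1.0 in total."""
    free = [1.0] * num_gpus
    placement = {}
    for name, frac in requests:
        for gpu, capacity in enumerate(free):
            if frac <= capacity + 1e-9:      # tolerance for float rounding
                free[gpu] -= frac
                placement[name] = gpu
                break
        else:
            raise RuntimeError(f"no room for {name} ({frac} GPU)")
    return placement

# Hypothetical workload: four containers sharing two physical GPUs.
plan = allocate(
    [("train-a", 0.5), ("train-b", 0.5), ("infer-c", 0.25), ("infer-d", 0.75)],
    num_gpus=2,
)
# train-a and train-b fill GPU 0; infer-c and infer-d fit together on GPU 1.
```

A production scheduler also tracks GPU memory and compute separately and handles preemption, but the packing constraint is the same.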
1) NVIDIA-Iguazio Accelerated Solutions for Deep Learning and Machine Learning (30 mins):
About the speaker:
Dr. Gabriel Noaje, Senior Solutions Architect, NVIDIA
http://bit.ly/GabrielNoaje
2) GPUs in Data Science Pipelines (30 mins)
- GPU as a Service for enterprise AI
- A short demo on the usage of GPUs for model training and model inferencing within a data science workflow
About the speaker:
Anant Gandhi, Solutions Engineer, Iguazio Singapore. https://www.linkedin.com/in/anant-gandhi-b5447614/
Dell and NVIDIA for Your AI Workloads in the Data Center - Renee Yao
Join us and learn how the Dell PowerEdge C4140 rack server, powered by four NVIDIA V100s, the world's most powerful GPUs, addresses training and inference for the most demanding HPC, data visualization, and AI workloads. This enables organizations to take advantage of the convergence of HPC and data analytics and realize advancements in areas including fraud detection, image processing, financial investment analysis, and personalized medicine.
In this deck from FOSDEM'19, Christoph Angerer from NVIDIA presents: Rapids - Data Science on GPUs.
"The next big step in data science will combine the ease of use of common Python APIs, but with the power and scalability of GPU compute. The RAPIDS project is the first step in giving data scientists the ability to use familiar APIs and abstractions while taking advantage of the same technology that enables dramatic increases in speed in deep learning. This session highlights the progress that has been made on RAPIDS, discusses how you can get up and running doing data science on the GPU, and provides some use cases involving graph analytics as motivation.
GPUs and GPU platforms have been responsible for the dramatic advancement of deep learning and other neural net methods in the past several years. At the same time, traditional machine learning workloads, which comprise the majority of business use cases, continue to be written in Python with heavy reliance on a combination of single-threaded tools (e.g., Pandas and Scikit-Learn) or large, multi-CPU distributed solutions (e.g., Spark and PySpark). RAPIDS, developed by a consortium of companies and available as open source code, allows for moving the vast majority of machine learning workloads from a CPU environment to GPUs. This allows for a substantial speed up, particularly on large data sets, and affords rapid, interactive work that previously was cumbersome to code or very slow to execute.
Many data science problems can be approached using a graph/network view, and much like traditional machine learning workloads, this has been either local (e.g., Gephi, Cytoscape, NetworkX) or distributed on CPU platforms (e.g., GraphX). We will present GPU-accelerated graph capabilities that, with minimal conceptual code changes, allow both graph representations and graph-based analytics to achieve similar speed ups on a GPU platform. By keeping all of these tasks on the GPU and minimizing redundant I/O, data scientists can model their data quickly and frequently, affording a higher degree of experimentation and more effective model generation. Further, keeping all of this in compatible formats allows quick movement from feature extraction, graph representation, graph analytics, and enrichment back to the original data, and visualization of results.
RAPIDS has a mission to build a platform that allows data scientists to explore data, train machine learning algorithms, and build applications while primarily staying on the GPU and GPU platforms."
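The "minimizing redundant I/O" argument can be made concrete with a toy copy counter (a pure-Python illustration; the Device class and stage functions are assumptions, and no real GPU is involved): a pipeline that round-trips data to the host at every stage pays a transfer per stage in each direction, while one that keeps intermediates resident pays only one upload and one download.

```python
# Conceptual sketch of why RAPIDS keeps data on the GPU: count the
# host<->device copies a pipeline pays when each stage round-trips
# data versus when intermediate results stay on the device.

class Device:
    """Stand-in for a GPU; only counts transfers, performs no compute."""
    def __init__(self):
        self.copies = 0
    def to_device(self, data):
        self.copies += 1
        return data
    def to_host(self, data):
        self.copies += 1
        return data

def run_pipeline(stages, data, device, keep_on_device):
    if keep_on_device:
        data = device.to_device(data)           # one upload
        for stage in stages:
            data = stage(data)                  # stays resident
        return device.to_host(data)             # one download
    for stage in stages:                        # round trip per stage
        data = device.to_host(stage(device.to_device(data)))
    return data

stages = [lambda d: [x + 1 for x in d]] * 4     # four trivial stages
dev_resident, dev_roundtrip = Device(), Device()
run_pipeline(stages, [1, 2, 3], dev_resident, keep_on_device=True)
run_pipeline(stages, [1, 2, 3], dev_roundtrip, keep_on_device=False)
# dev_resident pays 2 copies total; dev_roundtrip pays 2 per stage (8).
```

With real data sizes, each avoided copy is a PCIe transfer of the whole dataset, which is where the interactive-speed gains come from.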
Learn more: https://rapids.ai/
and
https://fosdem.org/2019/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
NVIDIA GTC 2019: Red Hat and the NVIDIA DGX: Tried, Tested, Trusted - Jeremy Eder
Red Hat and NVIDIA collaborated to bring together two of the technology industry's most popular products: Red Hat Enterprise Linux 7 and the NVIDIA DGX system. This talk will cover how the combination of RHEL's rock-solid stability with the incredible DGX hardware can deliver tremendous value to enterprise data scientists. We will also show how to leverage NVIDIA GPU Cloud container images with Kubernetes and RHEL to reap maximum benefits from this incredible hardware.
NVIDIA vGPU - Introduction to NVIDIA Virtual GPU - Lee Bushen
Lee Bushen, Senior Solutions Architect at NVIDIA covers the basics of NVIDIA Virtual GPU.
- Why vGPU?
- How does it work?
- What are the main considerations for VDI?
- Which GPU is right for me?
- Which License do I need?
TiECon Florida keynote - New opportunities for entrepreneurs using GPU & CUDA - Shanker Trivedi
This is a presentation I gave at TiEcon Florida on 20 Sept 2013 about the new opportunities emerging for entrepreneurs from the disruptive innovation potential of GPU, CUDA, and parallel computing technologies.
Enabling Artificial Intelligence - Alison B. Lowndes - WithTheBest
An overview and update of our hardware and software offering and support provided to the Machine & Deep Learning Community around the world.
Alison B. Lowndes, AI DevRel, EMEA
Building LLM Solutions using Open Source and Closed Source Solutions in Coher... - Sri Ambati
Sandeep Singh, Head of Applied AI Computer Vision, Beans.ai
H2O Open Source GenAI World SF 2023
In the modern era of machine learning, leveraging both open-source and closed-source solutions has become paramount for achieving cutting-edge results. This talk delves into the intricacies of seamlessly integrating open-source Large Language Model (LLM) solutions like Vicuna, Falcon, and Llama with industry giants such as ChatGPT and Google's Palm. As the demand for fine-tuned and specialized datasets grows, it is imperative to understand the synergy between these tools. Attendees will gain insights into best practices for building and enriching datasets tailored for fine-tuning tasks, ensuring that their LLM projects are both robust and efficient. Through real-world examples and hands-on demonstrations, this talk will equip attendees with the knowledge to harness the power of both open and closed-source tools in a coherent and effective manner.
Patrick Hall, Professor, AI Risk Management, The George Washington University
H2O Open Source GenAI World SF 2023
Language models are incredible engineering breakthroughs but require auditing and risk management before productization. These systems raise concerns about toxicity, transparency and reproducibility, intellectual property licensing and ownership, disinformation and misinformation, supply chains, and more. How can your organization leverage these new tools without taking on undue or unknown risks? While language models and associated risk management are in their infancy, a small number of best practices in governance and risk are starting to emerge. If you have a language model use case in mind, want to understand your risks, and do something about them, this presentation is for you!
Dr. Alexy Khrabrov, Open Source Science Community Director, IBM
H2O Open Source GenAI World SF 2023
In this talk, Dr. Alexy Khrabrov, recently elected Chair of the new Generative AI Commons at the Linux Foundation for AI & Data, outlines the OSS AI landscape, its challenges, and its opportunities. With new models and frameworks being unveiled weekly, one thing remains constant: community building and validation of all aspects of AI is key to the reliable and responsible AI we can use for business and societal needs. Industrial AI is one key area where such community validation can prove invaluable.
Michelle Tanco, Head of Product, H2O.ai
H2O Open Source GenAI World SF 2023
Learn how the makers at H2O.ai are building internal tools to solve real use cases using H2O Wave and h2oGPT. We will walk through an end-to-end use case and discuss how to incorporate business rules and generated content to rapidly develop custom AI apps using only Python APIs.
Applied Gen AI for the Finance Vertical Sri Ambati
Megan Kurka, Vice President, Customer Data Scientist, H2O.ai
H2O Open Source GenAI World SF 2023
Discover the transformative power of Applied Gen AI. Learn how the H2O team builds customized applications and workflows that integrate capabilities of Gen AI and AutoML specifically designed to address and enhance financial use cases. Explore real world examples, learn best practices, and witness firsthand how our innovative solutions are reshaping the landscape of finance technology.
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Sri Ambati
Pascal Pfeiffer, Principal Data Scientist, H2O.ai
H2O Open Source GenAI World SF 2023
This talk dives into the expansive ecosystem of Large Language Models (LLMs), offering practitioners an insightful guide to various relevant applications, from natural language understanding to creative content generation. While exploring use cases across different industries, it also honestly addresses the current limitations of LLMs and anticipates future advancements.
Introduction to Machine Learning with H2O-3 (1)Sri Ambati
In this virtual meetup, we give an introduction to the #1 open source machine learning platform, H2O-3, and show how you can use it to develop models for different use cases.
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...Sri Ambati
Numerai is an open, crowd-sourced hedge fund powered by predictions from data scientists around the world. In return, participants are rewarded with weekly payouts in crypto.
In this talk, Joe will give an overview of the Numerai tournament based on his own experience. He will then explain how he uses H2O Driverless AI and R to automate time-consuming tasks such as testing different modelling strategies, scoring new datasets, submitting predictions to Numerai, and monitoring model performance.
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...Sri Ambati
In this session, you will learn about what you should do after you’ve taken an AI transformation baseline. Over the span of this session, we will discuss the next steps in moving toward AI readiness through alignment of talent and tools to drive successful adoption and continuous use within an organization.
To find additional videos on AI courses, earn badges, join the courses at H2O.ai Learning Center: https://training.h2o.ai/products/ai-foundations-course
To find the Youtube video about this presentation: https://youtu.be/K1Cl3x3rd8g
Speaker:
Chemere Davis (H2O.ai - Senior Data Scientist Training Specialist)
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if something changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers, without pulling teeth or pulling your hair out. Practical tips and strategies for successful relationship building that leads to closing the deal.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank
Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices want to take full advantage of the features available on those devices, but many of those features trade security for convenience and capability. This best practices guide outlines steps users can take to better protect their personal devices and information.
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, with an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been easier to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
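The talk's full step-by-step hardening is not reproduced here, but one common early step is auditing workload manifests for risky settings. Below is a minimal, illustrative Python sketch; the field names follow the standard Kubernetes pod spec, while the rule set itself is a hypothetical subset, not an exhaustive policy:

```python
# Audit a Kubernetes pod manifest (as a Python dict) for a few common
# hardening gaps. Illustrative only: real clusters should enforce this
# with admission control (e.g. Pod Security Admission), not ad-hoc scripts.
def audit_pod(manifest: dict) -> list:
    findings = []
    for c in manifest.get("spec", {}).get("containers", []):
        sc = c.get("securityContext", {})
        if sc.get("privileged"):
            findings.append(f"{c['name']}: runs privileged")
        if not sc.get("runAsNonRoot"):
            findings.append(f"{c['name']}: may run as root")
        if sc.get("allowPrivilegeEscalation", True):
            findings.append(f"{c['name']}: privilege escalation allowed")
    return findings

pod = {"spec": {"containers": [{"name": "web", "image": "nginx",
       "securityContext": {"privileged": True}}]}}
print(audit_pod(pod))
```

Running this on the sample manifest flags all three issues, which is exactly the kind of "left for later" gap the talk warns about.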
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfPeter Spielvogel
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux tools: Libxml's xmllint, a tool for parsing XML documents, and Binutils' readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format) files. Our preliminary results show that AFL+DIAR not only discovers new paths more quickly but also achieves higher overall coverage. This work thus showcases how starting with lean, optimized seeds can lead to faster, more comprehensive fuzzing campaigns, and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
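The byte-pruning idea can be illustrated with a toy sketch. This is not the DIAR implementation: the stand-in "behaviour" function and the greedy one-byte-at-a-time pruning rule below are hypothetical simplifications of the paper's technique:

```python
# Toy illustration of pruning "uninteresting" seed bytes: drop any byte
# whose removal leaves the target's observable behaviour unchanged.
def behaviour(seed: bytes) -> tuple:
    # Hypothetical target: only cares about a magic header and digit bytes.
    return (seed[:2] == b"MZ", sum(1 for b in seed if 48 <= b <= 57))

def prune_seed(seed: bytes) -> bytes:
    lean = bytearray(seed)
    i = 0
    while i < len(lean):
        candidate = lean[:i] + lean[i + 1:]
        if behaviour(bytes(candidate)) == behaviour(bytes(lean)):
            lean = candidate  # byte was uninteresting: drop it
        else:
            i += 1            # byte matters: keep it
    return bytes(lean)

print(prune_seed(b"MZ....12..3..."))  # b'MZ123'
```

In a real campaign the behaviour oracle would be coverage feedback from the fuzzer, and the payoff is the same as DIAR's: mutations are no longer wasted on filler bytes in bloated seeds.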
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. A constant focus on speed to release software to market, combined with traditionally slow and manual security checks, has caused gaps in continuous security, an important piece of the software supply chain. Today, organizations feel more susceptible to external and internal cyber threats due to the vast attack surface across their application supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with a passion for making things work and a knack for helping others understand how things work. He has around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations on CI/CD and on application security integrated into the software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio's cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors and newer malware, including new variants and latent threats at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
3. GPU COMPUTING AT THE HEART OF AI
Big Bang of Modern AI
[Chart: 40 years of CPU trend data, 1980-2020. Single-threaded CPU performance grows about 1.1X per year, while GPU-computing performance grows about 1.5X per year: performance beyond Moore's Law, on track for 1000X by 2025. Original data up to 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten; data for 2010-2015 collected by K. Rupp.]
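The growth rates on this chart compound quickly, which is where the "1000X by 2025" projection comes from. A quick sketch (the 1.5X and 1.1X annual rates come from the chart; the 17-year window is our assumption about the baseline year):

```python
# Compound the chart's stated annual growth rates: GPU computing at ~1.5X
# per year versus single-threaded CPU performance at ~1.1X per year.
def cumulative_speedup(rate_per_year: float, years: int) -> float:
    """Total speedup after compounding an annual growth rate."""
    return rate_per_year ** years

years = 17  # e.g. 2008 -> 2025; the baseline year is an assumption
gpu = cumulative_speedup(1.5, years)
cpu = cumulative_speedup(1.1, years)

print(f"GPU: {gpu:.0f}X, CPU: {cpu:.0f}X over {years} years")
# prints: GPU: 985X, CPU: 5X over 17 years
```

At these rates the GPU curve reaches roughly 1000X in 17 years while CPUs manage about 5X, which is the gap the slide is illustrating.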
4. RISE OF NVIDIA GPU COMPUTING
[Chart: growth from 2012 to 2017. Global GTC attendees: 10X in 5 years, to 22,000. GPU developers: 16X in 5 years, to 670,000. CUDA downloads: 6X in 5 years, to 2M.]
5. ML & DL CHALLENGES
Slow processes, model complexity, and resource constraints:
• Hours to build a GBT
• 30+ days for data transformation
• Weeks for feature engineering
• Months for scoring pipelines
• <100 Grandmasters in the world
6. REVOLUTIONARY AI PERFORMANCE
Volta is the most advanced data center GPU ever built:
• Performance of up to 100 CPUs
• 21 billion transistors, 5120 CUDA cores
• New Tensor Core architecture inspired by the demands of deep learning
7. MAXIMIZING PERFORMANCE ON VOLTA
[Chart: GPU generational training scaling, K80 vs. V100 Tensor Core: greater than 10x performance. ResNet-152 training on 8x K80 (16 GPUs total) compared with 8x V100 NVLink GPUs using NVIDIA 17.10 containers.]
8. NVIDIA TENSORRT: 10X BETTER DATA CENTER TCO
160 CPU servers: 45,000 images/second at 65 kW.
9. NVIDIA TENSORRT: 10X BETTER DATA CENTER TCO
1 NVIDIA HGX with 8 Tesla V100 GPUs: 45,000 images/second at 3 kW.
1/6 the cost | 1/20 the power | 4 racks in a box
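The power claim can be sanity-checked with the figures quoted on these slides (the server counts, throughput, and wattages are from the slides; the arithmetic is ours):

```python
# Same inference throughput from one HGX node versus a 160-server CPU farm.
cpu_servers, cpu_kw = 160, 65.0   # CPU deployment: 160 servers at 65 kW
hgx_nodes, hgx_kw = 1, 3.0        # GPU deployment: 1 HGX (8x V100) at 3 kW
throughput = 45_000               # images/second, identical in both setups

power_ratio = cpu_kw / hgx_kw     # ~21.7, in line with "1/20 the power"
print(f"Power ratio: {power_ratio:.1f}x")  # prints: Power ratio: 21.7x
```

Since throughput is identical, the power ratio is also the performance-per-watt gain, which is where the roughly 20X figure comes from.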
11. NVIDIA DGX FAMILY
• AI Workstation: DGX Station, the personal AI supercomputer
• AI Datacenter: DGX-1, the essential instrument for AI research
• Cloud-Scale AI: NVIDIA GPU Cloud, cloud service with the highest deep learning efficiency
13. THREE STEPS TO NVIDIA GPU CLOUD
www.nvidia.com/ngc
1. SIGN UP: to get an NGC account, go to www.nvidia.com/ngcsignup
2. DEPLOY IMAGE: on Amazon EC2, choose a P3 instance and deploy the NVIDIA Volta Deep Learning AMI for NGC
3. PULL CONTAINER: pick your desired framework and pull the container into your instance
20. NVIDIA + H2O
Engineering: GPU Data Frame | Apache Arrow; H2O Driverless AI on DGX and NGC
Community and Business: joint sales and marketing global initiative; $40M Series C round of funding
21. FOR MORE INFORMATION
• Webinar: Interpretability with Driverless AI
• Webinar: Accelerated Machine Learning on GPUs
• Blog: GOAI: Open GPU Accelerated Data Analytics
• Blog: Crossing the Chasm of AI Interpretability
• Blog: H2O Offers GPU-Accelerated ML for Enterprise Transformation
• Solution Brief: NVIDIA and H2O: GPU-Accelerated ML
22. A NEW COMPUTING ERA
Performance beyond Moore's Law | Driverless AI on DGX and NGC | Cross-industry solutions for ML