This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and ShinyJo-fai Chow
Joe recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Automatic and Interpretable Machine Learning with H2O and LIMEJo-fai Chow
The document discusses automatic and interpretable machine learning using H2O and LIME. It provides an introduction and agenda, then discusses why interpretability is important. It introduces the LIME framework for interpreting complex machine learning models locally. It also discusses H2O AutoML for automatically training and tuning many models. Examples are provided for regression using the Boston Housing dataset, where a random forest model is trained and its predictions are explained locally using LIME.
This is my Deep Water talk for the TensorFlow Paris meetup.
Deep Water is H2O's integration with multiple open source deep learning libraries such as TensorFlow, MXNet and Caffe. On top of the performance gains from GPU backends, Deep Water naturally inherits all H2O properties in scalability. ease of use and deployment.
Automatic and Interpretable Machine Learning in R with H2O and LIME (Milan Ed...Sri Ambati
This document outlines an agenda for a workshop on automatic and interpretable machine learning using H2O and LIME. The agenda includes an introduction to H2O, AutoML, and LIME, as well as examples of regression and classification modeling. It also covers a real-world use case of applying these techniques to help make multimillion dollar decisions for a baseball team. The document provides information on installing the necessary packages and an overview of the featured machine learning techniques.
World's Fastest Machine Learning With GPUsSri Ambati
Presented during the "Introduction to H2O4GPU and Driverless AI" webinar on April 11th, 2018.
Watch the recording here:
https://attendee.gotowebinar.com/register/6156356209443281667?source=SlideshareH2O4GPU
Joe Chow gave a presentation about machine learning use cases using H2O. He introduced H2O, an open source machine learning platform that works with R, Python, and other languages. He discussed how companies like Telenor and customers in Brazil are using H2O for tasks like predictive modeling. Joe also highlighted current algorithms in H2O and how it is evolving with new features like deep learning integration. The talk provided an overview of H2O and real-world examples of how organizations are applying machine learning with the platform.
Scalable and Automatic Machine Learning with H2OSri Ambati
H2O is widely used for machine learning projects. A TechCrunch article, published in January 2017 by John Mannes, reported that around 20% of Fortune 500 companies use H2O.
Talk 1: Introduction to Scalable & Automatic Machine Learning with H2O
In recent years, the demand for machine learning experts has outpaced the supply, despite the surge of people entering the field. To address this gap, there have been big strides in the development of user-friendly machine learning software that can be used by non-experts. Although H2O and other tools have made it easier for practitioners to train and deploy machine learning models at scale, there is still a fair bit of knowledge and background in data science that is required to produce high-performing machine learning models.
In this presentation, Joe will introduce the AutoML functionality in H2O. H2O's AutoML provides an easy-to-use interface which automates the process of training a large, comprehensive selection of candidate models and a stacked ensemble model which, in most cases, will be the top performing model in the AutoML Leaderboard.
Talk 2: Making Multimillion-dollar Baseball Decisions with H2O AutoML and Shiny
Joe recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Bio : Jo-fai (or Joe) Chow is a data scientist at H2O.ai. Before joining H2O, he was in the business intelligence team at Virgin Media in UK where he developed data products to enable quick and smart business decisions. He also worked remotely for Domino Data Lab in the US as a data science evangelist promoting products via blogging and giving talks at meetups. Joe has a background in water engineering. Before his data science journey, he was an EngD research engineer at STREAM Industrial Doctorate Centre working on machine learning techniques for drainage design optimization. Prior to that, he was an asset management consultant specialized in data mining and constrained optimization for the utilities sector in the UK and abroad. He also holds an MSc in Environmental Management and a BEng in Civil Engineering.
Introduction to Machine Learning with H2O and PythonSri Ambati
This document provides an introduction and agenda for a tutorial on machine learning with H2O and Python. The introduction discusses the presenter's background and qualifications. The agenda outlines topics to be covered including an overview of H2O.ai as a company and machine learning platform, tutorials on using the H2O Python module to import data, build regression and classification models, and improve model performance through techniques like cross-validation, grid search, and stacking. Case study notebooks and examples will be used to demonstrate key machine learning concepts and the H2O framework in Python.
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and ShinyJo-fai Chow
Joe recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Automatic and Interpretable Machine Learning with H2O and LIMEJo-fai Chow
The document discusses automatic and interpretable machine learning using H2O and LIME. It provides an introduction and agenda, then discusses why interpretability is important. It introduces the LIME framework for interpreting complex machine learning models locally. It also discusses H2O AutoML for automatically training and tuning many models. Examples are provided for regression using the Boston Housing dataset, where a random forest model is trained and its predictions are explained locally using LIME.
This is my Deep Water talk for the TensorFlow Paris meetup.
Deep Water is H2O's integration with multiple open source deep learning libraries such as TensorFlow, MXNet and Caffe. On top of the performance gains from GPU backends, Deep Water naturally inherits all H2O properties in scalability. ease of use and deployment.
Automatic and Interpretable Machine Learning in R with H2O and LIME (Milan Ed...Sri Ambati
This document outlines an agenda for a workshop on automatic and interpretable machine learning using H2O and LIME. The agenda includes an introduction to H2O, AutoML, and LIME, as well as examples of regression and classification modeling. It also covers a real-world use case of applying these techniques to help make multimillion dollar decisions for a baseball team. The document provides information on installing the necessary packages and an overview of the featured machine learning techniques.
World's Fastest Machine Learning With GPUsSri Ambati
Presented during the "Introduction to H2O4GPU and Driverless AI" webinar on April 11th, 2018.
Watch the recording here:
https://attendee.gotowebinar.com/register/6156356209443281667?source=SlideshareH2O4GPU
Joe Chow gave a presentation about machine learning use cases using H2O. He introduced H2O, an open source machine learning platform that works with R, Python, and other languages. He discussed how companies like Telenor and customers in Brazil are using H2O for tasks like predictive modeling. Joe also highlighted current algorithms in H2O and how it is evolving with new features like deep learning integration. The talk provided an overview of H2O and real-world examples of how organizations are applying machine learning with the platform.
Scalable and Automatic Machine Learning with H2OSri Ambati
H2O is widely used for machine learning projects. A TechCrunch article, published in January 2017 by John Mannes, reported that around 20% of Fortune 500 companies use H2O.
Talk 1: Introduction to Scalable & Automatic Machine Learning with H2O
In recent years, the demand for machine learning experts has outpaced the supply, despite the surge of people entering the field. To address this gap, there have been big strides in the development of user-friendly machine learning software that can be used by non-experts. Although H2O and other tools have made it easier for practitioners to train and deploy machine learning models at scale, there is still a fair bit of knowledge and background in data science that is required to produce high-performing machine learning models.
In this presentation, Joe will introduce the AutoML functionality in H2O. H2O's AutoML provides an easy-to-use interface which automates the process of training a large, comprehensive selection of candidate models and a stacked ensemble model which, in most cases, will be the top performing model in the AutoML Leaderboard.
Talk 2: Making Multimillion-dollar Baseball Decisions with H2O AutoML and Shiny
Joe recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Bio : Jo-fai (or Joe) Chow is a data scientist at H2O.ai. Before joining H2O, he was in the business intelligence team at Virgin Media in UK where he developed data products to enable quick and smart business decisions. He also worked remotely for Domino Data Lab in the US as a data science evangelist promoting products via blogging and giving talks at meetups. Joe has a background in water engineering. Before his data science journey, he was an EngD research engineer at STREAM Industrial Doctorate Centre working on machine learning techniques for drainage design optimization. Prior to that, he was an asset management consultant specialized in data mining and constrained optimization for the utilities sector in the UK and abroad. He also holds an MSc in Environmental Management and a BEng in Civil Engineering.
Introduction to Machine Learning with H2O and PythonSri Ambati
This document provides an introduction and agenda for a tutorial on machine learning with H2O and Python. The introduction discusses the presenter's background and qualifications. The agenda outlines topics to be covered including an overview of H2O.ai as a company and machine learning platform, tutorials on using the H2O Python module to import data, build regression and classification models, and improve model performance through techniques like cross-validation, grid search, and stacking. Case study notebooks and examples will be used to demonstrate key machine learning concepts and the H2O framework in Python.
H2O Deep Water - Making Deep Learning Accessible to EveryoneJo-fai Chow
H2O Deep Water is a tool that integrates distributed deep learning with H2O's machine learning platform. It allows users to build, stack, and deploy deep learning models from libraries like TensorFlow, MXNet, and Caffe through a unified interface. Deep Water inherits properties from H2O like scalability, ease of use, and deployment capabilities. It also makes deep learning more accessible by supporting popular network architectures and allowing easy ensemble of deep models with other H2O algorithms.
Automatic and Interpretable Machine Learning in R with H2O and LIMEJo-fai Chow
This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
H2O.ai - Road Ahead - keynote presentation by Sri AmbatiSri Ambati
Artificial Intelligence for Business Transformation.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Introduction to Data Science with H2O- Mountain ViewSri Ambati
This document provides an overview of H2O.ai, an open source in-memory machine learning platform. It discusses that H2O.ai was founded in 2011 and is venture-backed, with a team of 37 people working on distributed systems for machine learning. It also summarizes that H2O provides easy to use APIs for Java, R, Python and other languages, and allows for scalable machine learning on large datasets using distributed algorithms to make full use of data without downsampling. Finally, it highlights how H2O works with other technologies like Spark, Hadoop, and HDFS to enable reading of large datasets for machine learning.
H2O Random Grid Search - PyData AmsterdamSri Ambati
This document discusses using H2O's random grid search for hyperparameter optimization. It begins with introductions of the presenter and H2O company. It then draws an analogy between a baker's process of making cake and a data scientist's process of building models. The document explains common techniques for hyperparameter optimization including manual search, grid search, and random grid search. It provides evidence that random search performs as well as manual/grid search in less time. Finally, it demonstrates H2O's random grid search API in Python and discusses other useful H2O features.
The document discusses H2O.ai's Driverless AI product, which aims to automate and simplify the machine learning process. It provides an overview of H2O.ai as a company, their goals of operationalizing data science. Driverless AI uses techniques like automated feature engineering, model tuning and selection, and model ensembling to build accurate models fast. It also allows for interpreting and explaining machine learning models through features like model inspection and reason codes. A demo of Driverless AI predicting credit card default risk is shown to illustrate the system.
SF Big Data Science Meetup 12.15.15
Scripts here: https://github.com/h2oai/h2o-meetups/tree/master/2015_12_15_MessyDataMeetup
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
H2O Deep Water - Making Deep Learning Accessible to EveryoneJo-fai Chow
- H2O.ai is a company that provides an open-source machine learning platform called H2O.
- Their new project "Deep Water" integrates popular deep learning frameworks like TensorFlow, MXNet, and Caffe into H2O to enable distributed deep learning on GPUs for improved performance.
- This provides a unified interface for deep learning within H2O and allows users to easily build, stack, and deploy deep learning models from different frameworks.
H2O Machine Learning and Kalman Filters for Machine Prognostics - Galvanize SFSri Ambati
Hank Roark's presentation at Galvanize SF, 02.23.16
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
H2O Machine Learning and Kalman Filters for Machine PrognosticsSri Ambati
This document discusses using machine learning and Kalman filters for machine health monitoring. It presents a case study using turbine engine degradation data from NASA to predict remaining useful life. The document walks through loading and preparing the data in H2O, including feature engineering techniques like clustering operating modes and standardizing sensor measurements. Various machine learning models like GLM are trained and evaluated on the data. Cross validation and hyperparameter tuning are also demonstrated to select the best model. Finally, the document mentions using Kalman filters for post-processing model outputs.
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIASri Ambati
Presented at #H2OWorld 2017 in Mountain View, CA.
Enjoy the recording: https://youtu.be/NyaJ7uDroww.
Learn more about H2O.ai: https://www.h2o.ai/.
Follow @h2oai: https://www.twitter.com/h2oai.
Intro to H2O in Python - Data Science LASri Ambati
Erin LeDell's presentation on Intro to H2O Machine Learning in Python at Data Science LA meetup on 1.19.16
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Introduction to data science with H2O-ChicagoSri Ambati
This document provides an overview of H2O.ai, an open source in-memory machine learning platform. It describes H2O.ai's product as an in-memory prediction engine, its team of 37 distributed systems engineers doing machine learning, and its headquarters in Mountain View, CA. It also provides details on how to use H2O with R and Python for scalable machine learning on large datasets across distributed systems.
Slides from Matt Dowle's presentation at H2O Open Tour: NYC
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
H2O Deep Water - Making Deep Learning Accessible to EveryoneSri Ambati
Deep Water is H2O's integration with multiple open source deep learning libraries such as TensorFlow, MXNet and Caffe. On top of the performance gains from GPU backends, Deep Water naturally inherits all H2O properties in scalability. ease of use and deployment. In this talk, I will go through the motivation and benefits of Deep Water. After that, I will demonstrate how to build and deploy deep learning models with or without programming experience using H2O's R/Python/Flow (Web) interfaces.
Jo-fai (or Joe) is a data scientist at H2O.ai. Before joining H2O, he was in the business intelligence team at Virgin Media in UK where he developed data products to enable quick and smart business decisions. He also worked remotely for Domino Data Lab in the US as a data science evangelist promoting products via blogging and giving talks at meetups. Joe has a background in water engineering. Before his data science journey, he was an EngD research engineer at STREAM Industrial Doctorate Centre working on machine learning techniques for drainage design optimization. Prior to that, he was an asset management consultant specialized in data mining and constrained optimization for the utilities sector in the UK and abroad. He also holds an MSc in Environmental Management and a BEng in Civil Engineering.
Driverless AI - Intro + Interactive Hands-on LabSri Ambati
Enjoy the webinar recording here: https://youtu.be/Lll1qwQJKVw.
Driverless AI speeds up data science workflows by automating feature engineering, model tuning, ensembling, and model deployment.
In this presentation, Arno Candel (CTO, H2O.ai), gives a quick overview and guide attendees through an interactive hands-on lab using Qwiklabs.
Driverless AI turns Kaggle-winning recipes into production-ready code and is specifically designed to avoid common mistakes such as under or overfitting, data leakage or improper model validation. Avoiding these pitfalls alone can save weeks or more for each model, and is necessary to achieve high modeling accuracy.
With Driverless AI, everyone can now train and deploy modeling pipelines with just a few clicks from the GUI. Advanced users can use the client/server API through a variety of languages such as Python, Java, C++, go, C# and many more. To speed up training, Driverless AI uses highly optimized C++/CUDA algorithms to take full advantage of the latest compute hardware.
For example, Driverless AI runs orders of magnitudes faster on the latest Nvidia GPU supercomputers on Intel and IBM platforms, both in the cloud or on-premise. There are two more product innovations in Driverless AI: statistically rigorous automatic data visualization and interactive model interpretation with reason codes and explanations in plain English. Both help data scientists and analysts to quickly validate the data and models.
Scalable Machine Learning in R and Python with H2OSri Ambati
This document provides an overview of scalable machine learning in R and Python using the H2O platform. It introduces H2O.ai, the company and H2O, the open source machine learning platform. Key features of the H2O platform include its distributed algorithms, APIs for R and Python, and interfaces like H2O Flow. The document outlines tutorials for using popular algorithms like deep learning and ensembles in H2O and describes ongoing developments like DeepWater and AutoML.
Deploying your Predictive Models as a Service via DominoJo-fai Chow
The document discusses how Domino Data Lab can be used to deploy predictive models as APIs. It provides examples of using Domino to build, evaluate, and deploy predictive models for the Iris dataset and stock market forecasting. Key features discussed include the web and R interfaces, code sharing, scheduled runs, automatic version control, and publishing models as APIs for other applications to access.
Driverless AI - Introduction and Live DemoSri Ambati
H2O.ai is an AI company founded in 2011 with around 90 employees based in Mountain View, CA. They are a leader in the Gartner Magic Quadrant and are VC funded including by NVIDIA. Their Driverless AI software can automate the entire machine learning process from data preparation to deployment of models in just a few hours, compared to months for experts. It provides automatic data visualization, outlier detection, model interpretation and regular updates.
Introducción al Machine Learning AutomáticoSri Ambati
¿Cómo puede llevar el aprendizaje automático a las masas? Los proyectos de Machine Learning con la búsqueda de talento, el tiempo para construir e implementar modelos y confiar en los modelos que se construyen.
¿Cómo puede tener varios equipos en su organización para crear modelos de ML precisos sin ser expertos en ciencia de datos o aprendizaje automático?
¿Se pregunta sobre los diferentes sabores de AutoML?
H2O Driverless AI emplea las técnicas de científicos expertos en datos en una aplicación fácil de usar que ayuda a escalar sus esfuerzos de ciencia de datos. La inteligencia artificial Driverless permite a los científicos de datos trabajar en proyectos más rápido utilizando la automatización y la potencia de computación de vanguardia de las GPU para realizar tareas en minutos que solían tomar meses.
Con H2O Driverless AI, todos, incluyendo expertos y científicos de datos junior, científicos de dominio e ingenieros de datos pueden desarrollar modelos confiables de aprendizaje automático. Esta plataforma de aprendizaje automático de última generación ofrece una funcionalidad única y avanzada para la visualización de datos, la ingeniería de características, la interpretabilidad del modelo y la implementación de baja latencia.
H2O Driverless AI hace:
* Visualización automática de datos
* Ingeniería automática de funciones a nivel de Grandmaster
* Selección automática del modelo
* Ajuste y capacitación automáticos del modelo
* Paralelización automática utilizando múltiples CPU o GPU
* Ensamblaje automático del modelo
*automática del Interpretaciónaprendizaje automático (MLI)
* Generación automática de código de puntuación
¿Quieres probarlo tú mismo? Puede obtener una prueba gratuita aquí: H2O Driverless AI trial.
Venga a esta sesión y descubra cómo comenzar con el Aprendizaje automático automático con AI sin conductor H2O, y cree modelos potentes con solo unos pocos clics.
¡Te veo pronto!
Acerca de H2O.ai
H2O.ai es una empresa visionaria de software de código abierto de Silicon Valley que creó y reimaginó lo que es posible. Somos una empresa de fabricantes que trajeron al mercado nuevas plataformas y tecnologías para impulsar el movimiento de inteligencia artificial. Somos los creadores de, H2O, la principal plataforma de aprendizaje de ciencia de datos de fuente abierta y de aprendizaje automático utilizada por casi la mitad de Fortune 500 y en la que confían más de 14,000 organizaciones y cientos de miles de científicos de datos de todo el mundo.
This document summarizes a presentation given by Joe Chow on machine learning using H2O.ai's platform. The presentation introduced H2O, its products like Deep Water for deep learning, and demonstrated examples of building models with R and Python. It showed how H2O provides a unified interface for TensorFlow, MXNet and Caffe, allowing users to easily build and deploy deep learning models with different frameworks. The document provided an overview of the company and platform capabilities like scalable algorithms, model export and multiple language interfaces like R and Python.
H2O Deep Water - Making Deep Learning Accessible to EveryoneJo-fai Chow
H2O Deep Water is a tool that integrates distributed deep learning with H2O's machine learning platform. It allows users to build, stack, and deploy deep learning models from libraries like TensorFlow, MXNet, and Caffe through a unified interface. Deep Water inherits properties from H2O like scalability, ease of use, and deployment capabilities. It also makes deep learning more accessible by supporting popular network architectures and allowing easy ensemble of deep models with other H2O algorithms.
Automatic and Interpretable Machine Learning in R with H2O and LIMEJo-fai Chow
This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
H2O.ai - Road Ahead - keynote presentation by Sri AmbatiSri Ambati
Artificial Intelligence for Business Transformation.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Introduction to Data Science with H2O- Mountain ViewSri Ambati
This document provides an overview of H2O.ai, an open source in-memory machine learning platform. It discusses that H2O.ai was founded in 2011 and is venture-backed, with a team of 37 people working on distributed systems for machine learning. It also summarizes that H2O provides easy to use APIs for Java, R, Python and other languages, and allows for scalable machine learning on large datasets using distributed algorithms to make full use of data without downsampling. Finally, it highlights how H2O works with other technologies like Spark, Hadoop, and HDFS to enable reading of large datasets for machine learning.
H2O Random Grid Search - PyData AmsterdamSri Ambati
This document discusses using H2O's random grid search for hyperparameter optimization. It begins with introductions of the presenter and H2O company. It then draws an analogy between a baker's process of making cake and a data scientist's process of building models. The document explains common techniques for hyperparameter optimization including manual search, grid search, and random grid search. It provides evidence that random search performs as well as manual/grid search in less time. Finally, it demonstrates H2O's random grid search API in Python and discusses other useful H2O features.
The document discusses H2O.ai's Driverless AI product, which aims to automate and simplify the machine learning process. It provides an overview of H2O.ai as a company, their goals of operationalizing data science. Driverless AI uses techniques like automated feature engineering, model tuning and selection, and model ensembling to build accurate models fast. It also allows for interpreting and explaining machine learning models through features like model inspection and reason codes. A demo of Driverless AI predicting credit card default risk is shown to illustrate the system.
SF Big Data Science Meetup 12.15.15
Scripts here: https://github.com/h2oai/h2o-meetups/tree/master/2015_12_15_MessyDataMeetup
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
H2O Deep Water - Making Deep Learning Accessible to EveryoneJo-fai Chow
- H2O.ai is a company that provides an open-source machine learning platform called H2O.
- Their new project "Deep Water" integrates popular deep learning frameworks like TensorFlow, MXNet, and Caffe into H2O to enable distributed deep learning on GPUs for improved performance.
- This provides a unified interface for deep learning within H2O and allows users to easily build, stack, and deploy deep learning models from different frameworks.
H2O Machine Learning and Kalman Filters for Machine Prognostics - Galvanize SFSri Ambati
Hank Roark's presentation at Galvanize SF, 02.23.16
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
H2O Machine Learning and Kalman Filters for Machine PrognosticsSri Ambati
This document discusses using machine learning and Kalman filters for machine health monitoring. It presents a case study using turbine engine degradation data from NASA to predict remaining useful life. The document walks through loading and preparing the data in H2O, including feature engineering techniques like clustering operating modes and standardizing sensor measurements. Various machine learning models like GLM are trained and evaluated on the data. Cross validation and hyperparameter tuning are also demonstrated to select the best model. Finally, the document mentions using Kalman filters for post-processing model outputs.
H2O World 2017 Keynote - Jim McHugh, VP & GM of Data Center, NVIDIASri Ambati
Presented at #H2OWorld 2017 in Mountain View, CA.
Enjoy the recording: https://youtu.be/NyaJ7uDroww.
Learn more about H2O.ai: https://www.h2o.ai/.
Follow @h2oai: https://www.twitter.com/h2oai.
Intro to H2O in Python - Data Science LASri Ambati
Erin LeDell's presentation on Intro to H2O Machine Learning in Python at Data Science LA meetup on 1.19.16
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Introduction to data science with H2O-ChicagoSri Ambati
This document provides an overview of H2O.ai, an open source in-memory machine learning platform. It describes H2O.ai's product as an in-memory prediction engine, its team of 37 distributed systems engineers doing machine learning, and its headquarters in Mountain View, CA. It also provides details on how to use H2O with R and Python for scalable machine learning on large datasets across distributed systems.
Slides from Matt Dowle's presentation at H2O Open Tour: NYC
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
H2O Deep Water - Making Deep Learning Accessible to EveryoneSri Ambati
Deep Water is H2O's integration with multiple open source deep learning libraries such as TensorFlow, MXNet and Caffe. On top of the performance gains from GPU backends, Deep Water naturally inherits all H2O properties in scalability. ease of use and deployment. In this talk, I will go through the motivation and benefits of Deep Water. After that, I will demonstrate how to build and deploy deep learning models with or without programming experience using H2O's R/Python/Flow (Web) interfaces.
Jo-fai (or Joe) is a data scientist at H2O.ai. Before joining H2O, he was in the business intelligence team at Virgin Media in UK where he developed data products to enable quick and smart business decisions. He also worked remotely for Domino Data Lab in the US as a data science evangelist promoting products via blogging and giving talks at meetups. Joe has a background in water engineering. Before his data science journey, he was an EngD research engineer at STREAM Industrial Doctorate Centre working on machine learning techniques for drainage design optimization. Prior to that, he was an asset management consultant specialized in data mining and constrained optimization for the utilities sector in the UK and abroad. He also holds an MSc in Environmental Management and a BEng in Civil Engineering.
Driverless AI - Intro + Interactive Hands-on LabSri Ambati
Enjoy the webinar recording here: https://youtu.be/Lll1qwQJKVw.
Driverless AI speeds up data science workflows by automating feature engineering, model tuning, ensembling, and model deployment.
In this presentation, Arno Candel (CTO, H2O.ai), gives a quick overview and guide attendees through an interactive hands-on lab using Qwiklabs.
Driverless AI turns Kaggle-winning recipes into production-ready code and is specifically designed to avoid common mistakes such as under or overfitting, data leakage or improper model validation. Avoiding these pitfalls alone can save weeks or more for each model, and is necessary to achieve high modeling accuracy.
With Driverless AI, everyone can now train and deploy modeling pipelines with just a few clicks from the GUI. Advanced users can use the client/server API through a variety of languages such as Python, Java, C++, go, C# and many more. To speed up training, Driverless AI uses highly optimized C++/CUDA algorithms to take full advantage of the latest compute hardware.
For example, Driverless AI runs orders of magnitudes faster on the latest Nvidia GPU supercomputers on Intel and IBM platforms, both in the cloud or on-premise. There are two more product innovations in Driverless AI: statistically rigorous automatic data visualization and interactive model interpretation with reason codes and explanations in plain English. Both help data scientists and analysts to quickly validate the data and models.
Scalable Machine Learning in R and Python with H2OSri Ambati
This document provides an overview of scalable machine learning in R and Python using the H2O platform. It introduces H2O.ai, the company and H2O, the open source machine learning platform. Key features of the H2O platform include its distributed algorithms, APIs for R and Python, and interfaces like H2O Flow. The document outlines tutorials for using popular algorithms like deep learning and ensembles in H2O and describes ongoing developments like DeepWater and AutoML.
Deploying your Predictive Models as a Service via DominoJo-fai Chow
The document discusses how Domino Data Lab can be used to deploy predictive models as APIs. It provides examples of using Domino to build, evaluate, and deploy predictive models for the Iris dataset and stock market forecasting. Key features discussed include the web and R interfaces, code sharing, scheduled runs, automatic version control, and publishing models as APIs for other applications to access.
Driverless AI - Introduction and Live DemoSri Ambati
H2O.ai is an AI company founded in 2011 with around 90 employees based in Mountain View, CA. They are a leader in the Gartner Magic Quadrant and are VC funded including by NVIDIA. Their Driverless AI software can automate the entire machine learning process from data preparation to deployment of models in just a few hours, compared to months for experts. It provides automatic data visualization, outlier detection, model interpretation and regular updates.
Introducción al Machine Learning AutomáticoSri Ambati
¿Cómo puede llevar el aprendizaje automático a las masas? Los proyectos de Machine Learning con la búsqueda de talento, el tiempo para construir e implementar modelos y confiar en los modelos que se construyen.
¿Cómo puede tener varios equipos en su organización para crear modelos de ML precisos sin ser expertos en ciencia de datos o aprendizaje automático?
¿Se pregunta sobre los diferentes sabores de AutoML?
H2O Driverless AI emplea las técnicas de científicos expertos en datos en una aplicación fácil de usar que ayuda a escalar sus esfuerzos de ciencia de datos. La inteligencia artificial Driverless permite a los científicos de datos trabajar en proyectos más rápido utilizando la automatización y la potencia de computación de vanguardia de las GPU para realizar tareas en minutos que solían tomar meses.
Con H2O Driverless AI, todos, incluyendo expertos y científicos de datos junior, científicos de dominio e ingenieros de datos pueden desarrollar modelos confiables de aprendizaje automático. Esta plataforma de aprendizaje automático de última generación ofrece una funcionalidad única y avanzada para la visualización de datos, la ingeniería de características, la interpretabilidad del modelo y la implementación de baja latencia.
H2O Driverless AI hace:
* Visualización automática de datos
* Ingeniería automática de funciones a nivel de Grandmaster
* Selección automática del modelo
* Ajuste y capacitación automáticos del modelo
* Paralelización automática utilizando múltiples CPU o GPU
* Ensamblaje automático del modelo
*automática del Interpretaciónaprendizaje automático (MLI)
* Generación automática de código de puntuación
¿Quieres probarlo tú mismo? Puede obtener una prueba gratuita aquí: H2O Driverless AI trial.
Venga a esta sesión y descubra cómo comenzar con el Aprendizaje automático automático con AI sin conductor H2O, y cree modelos potentes con solo unos pocos clics.
¡Te veo pronto!
Acerca de H2O.ai
H2O.ai es una empresa visionaria de software de código abierto de Silicon Valley que creó y reimaginó lo que es posible. Somos una empresa de fabricantes que trajeron al mercado nuevas plataformas y tecnologías para impulsar el movimiento de inteligencia artificial. Somos los creadores de, H2O, la principal plataforma de aprendizaje de ciencia de datos de fuente abierta y de aprendizaje automático utilizada por casi la mitad de Fortune 500 y en la que confían más de 14,000 organizaciones y cientos de miles de científicos de datos de todo el mundo.
This document summarizes a presentation given by Joe Chow on machine learning using H2O.ai's platform. The presentation introduced H2O, its products like Deep Water for deep learning, and demonstrated examples of building models with R and Python. It showed how H2O provides a unified interface for TensorFlow, MXNet and Caffe, allowing users to easily build and deploy deep learning models with different frameworks. The document provided an overview of the company and platform capabilities like scalable algorithms, model export and multiple language interfaces like R and Python.
This document summarizes a presentation given by Joe Chow on machine learning using H2O.ai's platform. The presentation covered:
1) An introduction to Joe and H2O.ai, including the company's mission to operationalize data science.
2) An overview of the H2O platform for machine learning, including its distributed algorithms, interfaces for R and Python, and model export capabilities.
3) A demonstration of deep learning using H2O's Deep Water integration with TensorFlow, MXNet, and Caffe, allowing users to build and deploy models across different frameworks.
This document introduces H2O.ai and their product H2O Driverless AI, an automated machine learning platform. It provides an overview of the company and product, including use cases across different industries where customers saw improvements in key metrics like accuracy, time savings, and performance. It also shares resources for learning more about Driverless AI through tutorials, documentation, and community forums.
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Safe Software
Learn where FME meets AI in this upcoming webinar to offer you incredible time savings. This webinar is tailored to ignite imaginations and offer solutions to your data integration challenges. As the new digital era sets sail on the winds of AI, the tangibility of its integration in our daily schema is unfolding.
Segment 1, titled “AI: The Good, the Bad and the FME” by Darren Fergus of Locus, navigates through the realms of AI, scrutinizing its pervasive impact while underscoring the symbiotic potential of FME and AI. Join in an engaging demonstration as FME and ChatGPT collaboratively orchestrate a PowerPoint narrative, epitomizing the alliance of AI with human ingenuity.
In Segment 2, “Integrating GeoAI Models in FME” by Dennis Wilhelm and Dr. Christopher Britsch of con terra GmbH, the spotlight veers towards operationalizing AI in our daily tasks through FME. A practical approach to embedding GeoAI Models into FME Workspaces is unveiled, showcasing the ease of incorporating AI-driven methodologies into your FME workflows, skyrocketing productivity levels.
To follow, Segment 3, "Unleash generative AI on your terms!" by Oliver Morris of Avineon-Tensing. While the prospects of Generative AI are thrilling, security and IT reservations, especially with 'phone home' tools, are genuine concerns. However, with open-source tools, you can locally harness large language models. In this demo, we'll unravel the magic of local AI deployment and its seamless integration into an FME workspace.
Bonus! Dmitri will join us for a fourth segment to tie us off, showcasing what he has been up to this week, including using OpenAI API for texturing in FME, amoung other projects.
Join us to explore the synergy of FME and AI: opening portals to a realm of revolutionized productivity and enriched user experiences.
H2o.ai presentation at 2nd Virtual Pydata Piraeus meetupPyData Piraeus
AI and Machine Learning have become must-haves for almost all industries and companies. H2O.ai's goal is to help companies all over the world to use Machine Learning.
H2O.ai's opensource toolset, which includes packages R, Python and Spark, starts from offering products which can accelerate the data preparation, then help with ML model building and finally make the deployment easier and platform agnostic!
Women in Automation - Intro to Studio Session 1Cristina Vidu
With this very first product training session we kick off the RPA Developer thread of the program and get you started with UiPath Studio in a completely assisted and supportive manner by our very own UiPath MVPs. From women in RPA to all the women who wish to step into the automation world.
🌺 About this event:
What is RPA (explain RPA technology)
Why RPA (explain technological benefits)
Why RPA as a career
Platform overview
Small automation demo
Install Studio demo
Q&A
Gather your courage and curiosity, and join us on March 9th!! 👩🏽🤝👩🏼
👩🏫 Your UiPath MVP trainers:
Maria Irimias, UiPath MVP, Service Delivery Manager, accesa.eu (Romania)
Nadia Ghoufa, UiPath MVP, RPA Tech Lead, Talan (France)
Igniting Next Level Productivity with AI-Infused Data Integration WorkflowsSafe Software
Learn where FME meets AI in this upcoming webinar to offer you incredible time savings. This webinar is tailored to ignite imaginations and offer solutions to your data integration challenges. As the new digital era sets sail on the winds of AI, the tangibility of its integration in our daily schema is unfolding.
Segment 1, titled “AI: The Good, the Bad and the FME” by Darren Fergus of Locus, navigates through the realms of AI, scrutinizing its pervasive impact while underscoring the symbiotic potential of FME and AI. Join in an engaging demonstration as FME and ChatGPT collaboratively orchestrate a PowerPoint narrative, epitomizing the alliance of AI with human ingenuity.
In Segment 2, “Integrating GeoAI Models in FME” by Dennis Wilhelm and Dr. Christopher Britsch of con terra GmbH, the spotlight veers towards operationalizing AI in our daily tasks through FME. A practical approach to embedding GeoAI Models into FME Workspaces is unveiled, showcasing the ease of incorporating AI-driven methodologies into your FME workflows, skyrocketing productivity levels.
To follow, Segment 3, "Unleash generative AI on your terms!" by Oliver Morris of Avineon-Tensing. While the prospects of Generative AI are thrilling, security and IT reservations, especially with 'phone home' tools, are genuine concerns. However, with open-source tools, you can locally harness large language models. In this demo, we'll unravel the magic of local AI deployment and its seamless integration into an FME workspace.
Bonus! Dmitri will join us for a fourth segment to tie us off, showcasing what he has been up to this week, including using OpenAI API for texturing in FME, amoung other projects.
Join us to explore the synergy of FME and AI: opening portals to a realm of revolutionized productivity and enriched user experiences.
GDSC USICT organized an “INFO SESSION”. In this event the leads of all the teams introduced themselves to all the students and informed them about the benefits of joining GDSC. Leads gave students a broad idea about the technologies they would be working on and how it would help the students to solve real-life problems of society and to grow themselves.
This document provides an introduction to H2O, an open source machine learning platform, and discusses potential Internet of Things (IoT) use cases for predictive maintenance and outlier detection. The document outlines Joe Chow's background and experience, provides an overview of H2O's capabilities including algorithms, interfaces, and exporting models for production. It then demonstrates how to use H2O for predictive maintenance on a dataset of sensor readings to predict equipment failures, and for outlier detection on the MNIST handwritten digits dataset to identify anomalous images.
This document discusses best practices for developing data science products at Philip Morris International (PMI). It covers:
- PMI's data science team of over 40 people across four hubs working on fraud prevention and other problems.
- Key principles for PMI's data science work, including being business-driven, investing in people, self-organizing, iterating to improve, and co-creating solutions.
- Challenges in data product development involving integrating work between data scientists and other teams, and practices like continuous integration/delivery to overcome these challenges.
- The role of data scientists in contributing code that is readable, testable, reusable, reproducible, and usable by other teams to integrate into
Introducción al Aprendizaje Automatico con H2O-3 (1)Sri Ambati
En esta reunión virtual, damos una introducción a la plataforma de aprendizaje automático de código abierto número 1, H2O-3 y te mostramos cómo puedes usarla para desarrollar modelos para resolver diferentes casos de uso.
Making Multimillion-Dollar Baseball Decisions with H2O AutoML, LIME and ShinySri Ambati
Joe recently teamed up with IBM and Aginity to create a proof of concept “Moneyball” app for the IBM Think conference in Vegas. The original goal was to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real “Moneyball” guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe’s journey to a real “Moneyball” application.
Unleashing the Power of Generative AI.pdfeoinhalpin99
Slide deck for session named "Unleashing the Power of Generative AI: Python API Integration with ChatGPT, DALL-E, and D-ID Studio" that was presented on Nov 11th at the PyCon Ireland 2023 Conference.
Similar to Automatic and Interpretable Machine Learning in R with H2O and LIME (20)
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
This document provides an overview of H2O.ai, an AI company that offers products and services to democratize AI. It mentions that H2O products are backed by 10% of the world's top data scientists from Kaggle and that H2O has customers in 7 of the top 10 banks, 4 of the top 10 insurance companies, and top manufacturing companies. It also provides details on H2O's founders, funding, customers, products, and vision to make AI accessible to more organizations.
Generative AI Masterclass - Model Risk Management.pptxSri Ambati
Here are some key points about benchmarking and evaluating generative AI models like large language models:
- Foundation models require large, diverse datasets to be trained on in order to learn broad language skills and knowledge. Fine-tuning can then improve performance on specific tasks.
- Popular benchmarks evaluate models on tasks involving things like commonsense reasoning, mathematics, science questions, generating truthful vs false responses, and more. This helps identify model capabilities and limitations.
- Custom benchmarks can also be designed using tools like Eval Studio to systematically test models on specific applications or scenarios. Both automated and human evaluations are important.
- Leaderboards like HELM aggregate benchmark results to compare how different models perform across a wide range of tests and metrics.
AI and the Future of Software Development: A Sneak Peek Sri Ambati
The document discusses developers including both in-house and outsourced developers. It also mentions dev shops and low/no code platforms. The main topics covered are developers, dev shops, and low-code platforms.
LLMOps: Match report from the top of the 5thSri Ambati
The document discusses LLMOps (Large Language Model Operations) compared to traditional MLOps. Some key points:
- LLMOps and MLOps face similar challenges across the development lifecycle, but LLMOps requires more GPU resources and integration is faster due to more models in each application. Evaluation is also less clear.
- The LLMOps field is around the 5th generation of models, with debates around proprietary vs open source models, and balancing privacy, cost and control.
- LLMOps platforms are emerging to provide solutions for tasks like prompting, embedding databases, evaluation, and governance, similar to how MLOps platforms have evolved.
Building, Evaluating, and Optimizing your RAG App for ProductionSri Ambati
The document discusses optimizing question answering systems called RAG (Retrieve-and-Generate) stacks. It outlines challenges with naive RAG approaches and proposes solutions like improved data representations, advanced retrieval techniques, and fine-tuning large language models. Table stakes optimizations include tuning chunk sizes, prompt engineering, and customizing LLMs. More advanced techniques involve small-to-big retrieval, multi-document agents, embedding fine-tuning, and LLM fine-tuning.
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Sri Ambati
Sandeep Singh, Head of Applied AI Computer Vision, Beans.ai
H2O Open Source GenAI World SF 2023
In the modern era of machine learning, leveraging both open-source and closed-source solutions has become paramount for achieving cutting-edge results. This talk delves into the intricacies of seamlessly integrating open-source Large Language Model (LLM) solutions like Vicuna, Falcon, and Llama with industry giants such as ChatGPT and Google's Palm. As the demand for fine-tuned and specialized datasets grows, it is imperative to understand the synergy between these tools. Attendees will gain insights into best practices for building and enriching datasets tailored for fine-tuning tasks, ensuring that their LLM projects are both robust and efficient. Through real-world examples and hands-on demonstrations, this talk will equip attendees with the knowledge to harness the power of both open and closed-source tools in a coherent and effective manner.
Patrick Hall, Professor, AI Risk Management, The George Washington University
H2O Open Source GenAI World SF 2023
Language models are incredible engineering breakthroughs but require auditing and risk management before productization. These systems raise concerns about toxicity, transparency and reproducibility, intellectual property licensing and ownership, disinformation and misinformation, supply chains, and more. How can your organization leverage these new tools without taking on undue or unknown risks? While language models and associated risk management are in their infancy, a small number of best practices in governance and risk are starting to emerge. If you have a language model use case in mind, want to understand your risks, and do something about them, this presentation is for you!
Dr. Alexy Khrabrov, Open Source Science Community Director, IBM
H2O Open Source GenAI World SF 2023
In this talk, Dr. Alexy Khrabrov, recently elected Chair of the new Generative AI Commons at Linux Foundation for AI & Data, outlines the OSS AI landscape, challenges, and opportunities. With new models and frameworks being unveiled weekly, one thing remains constant: community building and validation of all aspects of AI is key to reliable and responsible AI we can use for business and society needs. Industrial AI is one key area where such community validation can prove invaluable.
The document announces the launch of the H2O GenAI App Store, which provides a collection of applications that make it easier for average users to leverage large language models through custom interfaces for specific tasks like getting gardening advice or feedback on code. The app store is designed to accelerate the development of these GenAI apps using the H2O Wave platform and provides access to H2OGPTE for retrieval augmented generation and language model calls. Developers can also contribute their own apps through the GitHub repository listed.
Applied Gen AI for the Finance Vertical Sri Ambati
Megan Kurka, Vice President, Customer Data Scientist, H2O.ai
H2O Open Source GenAI World SF 2023
Discover the transformative power of Applied Gen AI. Learn how the H2O team builds customized applications and workflows that integrate capabilities of Gen AI and AutoML specifically designed to address and enhance financial use cases. Explore real world examples, learn best practices, and witness firsthand how our innovative solutions are reshaping the landscape of finance technology.
This document discusses techniques for improving language models (LLMs) discussed in recent papers. It describes building blocks of LLMs like fine-tuning, foundation training, memory, and databases. Specific techniques covered include LIMA which uses 1,000 carefully curated examples, instruction backtranslation to generate question-answer pairs, fine-tuning models on API examples like Gorilla, and reducing false answers through techniques like not agreeing with incorrect user opinions. The goal is to discuss cutting edge tricks to build better LLMs.
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Sri Ambati
Pascal Pfeiffer, Principal Data Scientist, H2O.ai
H2O Open Source GenAI World SF 2023
This talk dives into the expansive ecosystem of Large Language Models (LLMs), offering practitioners an insightful guide to various relevant applications, from natural language understanding to creative content generation. While exploring use cases across different industries, it also honestly addresses the current limitations of LLMs and anticipates future advancements.
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...Sri Ambati
This document discusses using large language models (LLMs) for text classification tasks. It begins by describing how LLMs are commonly used for text generation and question answering. For classification, models are usually trained supervised on labeled data. The document then explores using LLMs for zero-shot classification without training, and techniques like fine-tuning LLMs on tasks to improve performance. It provides an example of fine-tuning an LLM on a financial sentiment dataset. The document concludes by describing H2O.ai's LLM Studio tool for fine-tuning and a few Kaggle competitions where LLMs achieved success in text classification.
1) Generative AI (GenAI) enables the creation of novel content by learning patterns in unstructured data rather than labeling outputs like traditional AI.
2) Both traditional and generative AI models lack transparency and may contain biases, but generative models can additionally hallucinate or leak private information.
3) To interpret generative models, researchers evaluate accuracy globally by checking for hallucinations or undesirable content, and locally by confirming the quality of individual responses.
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...Sri Ambati
Numerai is an open, crowd-sourced hedge fund powered by predictions from data scientists around the world. In return, participants are rewarded with weekly payouts in crypto.
In this talk, Joe will give an overview of the Numerai tournament based on his own experience. He will then explain how he automates the time-consuming tasks such as testing different modelling strategies, scoring new datasets, submitting predictions to Numerai as well as monitoring model performance with H2O Driverless AI and R.
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...Sri Ambati
In this session, you will learn about what you should do after you’ve taken an AI transformation baseline. Over the span of this session, we will discuss the next steps in moving toward AI readiness through alignment of talent and tools to drive successful adoption and continuous use within an organization.
To find additional videos on AI courses, earn badges, join the courses at H2O.ai Learning Center: https://training.h2o.ai/products/ai-foundations-course
To find the Youtube video about this presentation: https://youtu.be/K1Cl3x3rd8g
Speaker:
Chemere Davis (H2O.ai - Senior Data Scientist Training Specialist)
AI Foundations Course Module 1 - An AI Transformation JourneySri Ambati
The chances of successfully implementing AI strategies within an organization significantly improve when you can recognize where your organization is on the maturity scale. Over this course, you will learn the keys to unlocking value with AI which include asking the right questions about the problems you are solving and ensuring you have the right cross-section of talent, tools, and resources. By the end of this module, you should be able to recognize where your organization is on the AI transformation spectrum and identify some strategies that can get you to the next stage in your journey.
To find additional videos on AI courses, earn badges, join the courses at H2O.ai Learning Center: https://training.h2o.ai/products/ai-foundations-course
To find the Youtube video about this presentation: https://youtu.be/PJgr2epM6qs
Speakers:
Chemere Davis (H2O.ai - Senior Data Scientist Training Specialist)
Ingrid Burton (H2O.ai - CMO)
How information systems are built or acquired puts information, which is what they should be about, in a secondary place. Our language adapted accordingly, and we no longer talk about information systems but applications. Applications evolved in a way to break data into diverse fragments, tightly coupled with applications and expensive to integrate. The result is technical debt, which is re-paid by taking even bigger "loans", resulting in an ever-increasing technical debt. Software engineering and procurement practices work in sync with market forces to maintain this trend. This talk demonstrates how natural this situation is. The question is: can something be done to reverse the trend?
Must Know Postgres Extension for DBA and Developer during MigrationMydbops
Mydbops Opensource Database Meetup 16
Topic: Must-Know PostgreSQL Extensions for Developers and DBAs During Migration
Speaker: Deepak Mahto, Founder of DataCloudGaze Consulting
Date & Time: 8th June | 10 AM - 1 PM IST
Venue: Bangalore International Centre, Bangalore
Abstract: Discover how PostgreSQL extensions can be your secret weapon! This talk explores how key extensions enhance database capabilities and streamline the migration process for users moving from other relational databases like Oracle.
Key Takeaways:
* Learn about crucial extensions like oracle_fdw, pgtt, and pg_audit that ease migration complexities.
* Gain valuable strategies for implementing these extensions in PostgreSQL to achieve license freedom.
* Discover how these key extensions can empower both developers and DBAs during the migration process.
* Don't miss this chance to gain practical knowledge from an industry expert and stay updated on the latest open-source database trends.
Mydbops Managed Services specializes in taking the pain out of database management while optimizing performance. Since 2015, we have been providing top-notch support and assistance for the top three open-source databases: MySQL, MongoDB, and PostgreSQL.
Our team offers a wide range of services, including assistance, support, consulting, 24/7 operations, and expertise in all relevant technologies. We help organizations improve their database's performance, scalability, efficiency, and availability.
Contact us: info@mydbops.com
Visit: https://www.mydbops.com/
Follow us on LinkedIn: https://in.linkedin.com/company/mydbops
For more details and updates, please follow up the below links.
Meetup Page : https://www.meetup.com/mydbops-databa...
Twitter: https://twitter.com/mydbopsofficial
Blogs: https://www.mydbops.com/blog/
Facebook(Meta): https://www.facebook.com/mydbops/
Essentials of Automations: Exploring Attributes & Automation ParametersSafe Software
Building automations in FME Flow can save time, money, and help businesses scale by eliminating data silos and providing data to stakeholders in real-time. One essential component to orchestrating complex automations is the use of attributes & automation parameters (both formerly known as “keys”). In fact, it’s unlikely you’ll ever build an Automation without using these components, but what exactly are they?
Attributes & automation parameters enable the automation author to pass data values from one automation component to the next. During this webinar, our FME Flow Specialists will cover leveraging the three types of these output attributes & parameters in FME Flow: Event, Custom, and Automation. As a bonus, they’ll also be making use of the Split-Merge Block functionality.
You’ll leave this webinar with a better understanding of how to maximize the potential of automations by making use of attributes & automation parameters, with the ultimate goal of setting your enterprise integration workflows up on autopilot.
"Scaling RAG Applications to serve millions of users", Kevin GoedeckeFwdays
How we managed to grow and scale a RAG application from zero to thousands of users in 7 months. Lessons from technical challenges around managing high load for LLMs, RAGs and Vector databases.
In the realm of cybersecurity, offensive security practices act as a critical shield. By simulating real-world attacks in a controlled environment, these techniques expose vulnerabilities before malicious actors can exploit them. This proactive approach allows manufacturers to identify and fix weaknesses, significantly enhancing system security.
This presentation delves into the development of a system designed to mimic Galileo's Open Service signal using software-defined radio (SDR) technology. We'll begin with a foundational overview of both Global Navigation Satellite Systems (GNSS) and the intricacies of digital signal processing.
The presentation culminates in a live demonstration. We'll showcase the manipulation of Galileo's Open Service pilot signal, simulating an attack on various software and hardware systems. This practical demonstration serves to highlight the potential consequences of unaddressed vulnerabilities, emphasizing the importance of offensive security practices in safeguarding critical infrastructure.
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyScyllaDB
Freshworks creates AI-boosted business software that helps employees work more efficiently and effectively. Managing data across multiple RDBMS and NoSQL databases was already a challenge at their current scale. To prepare for 10X growth, they knew it was time to rethink their database strategy. Learn how they architected a solution that would simplify scaling while keeping costs under control.
Have you ever been confused by the myriad of choices offered by AWS for hosting a website or an API?
Lambda, Elastic Beanstalk, Lightsail, Amplify, S3 (and more!) can each host websites + APIs. But which one should we choose?
Which one is cheapest? Which one is fastest? Which one will scale to meet our needs?
Join me in this session as we dive into each AWS hosting service to determine which one is best for your scenario and explain why!
Session 1 - Intro to Robotic Process Automation.pdfUiPathCommunity
👉 Check out our full 'Africa Series - Automation Student Developers (EN)' page to register for the full program:
https://bit.ly/Automation_Student_Kickstart
In this session, we shall introduce you to the world of automation, the UiPath Platform, and guide you on how to install and setup UiPath Studio on your Windows PC.
📕 Detailed agenda:
What is RPA? Benefits of RPA?
RPA Applications
The UiPath End-to-End Automation Platform
UiPath Studio CE Installation and Setup
💻 Extra training through UiPath Academy:
Introduction to Automation
UiPath Business Automation Platform
Explore automation development with UiPath Studio
👉 Register here for our upcoming Session 2 on June 20: Introduction to UiPath Studio Fundamentals: https://community.uipath.com/events/details/uipath-lagos-presents-session-2-introduction-to-uipath-studio-fundamentals/
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
This talk will cover ScyllaDB Architecture from the cluster-level view and zoom in on data distribution and internal node architecture. In the process, we will learn the secret sauce used to get ScyllaDB's high availability and superior performance. We will also touch on the upcoming changes to ScyllaDB architecture, moving to strongly consistent metadata and tablets.
AppSec PNW: Android and iOS Application Security with MobSFAjin Abraham
Mobile Security Framework - MobSF is a free and open source automated mobile application security testing environment designed to help security engineers, researchers, developers, and penetration testers to identify security vulnerabilities, malicious behaviours and privacy concerns in mobile applications using static and dynamic analysis. It supports all the popular mobile application binaries and source code formats built for Android and iOS devices. In addition to automated security assessment, it also offers an interactive testing environment to build and execute scenario based test/fuzz cases against the application.
This talk covers:
Using MobSF for static analysis of mobile applications.
Interactive dynamic security assessment of Android and iOS applications.
Solving Mobile app CTF challenges.
Reverse engineering and runtime analysis of Mobile malware.
How to shift left and integrate MobSF/mobsfscan SAST and DAST in your build pipeline.
inQuba Webinar Mastering Customer Journey Management with Dr Graham HillLizaNolte
HERE IS YOUR WEBINAR CONTENT! 'Mastering Customer Journey Management with Dr. Graham Hill'. We hope you find the webinar recording both insightful and enjoyable.
In this webinar, we explored essential aspects of Customer Journey Management and personalization. Here’s a summary of the key insights and topics discussed:
Key Takeaways:
Understanding the Customer Journey: Dr. Hill emphasized the importance of mapping and understanding the complete customer journey to identify touchpoints and opportunities for improvement.
Personalization Strategies: We discussed how to leverage data and insights to create personalized experiences that resonate with customers.
Technology Integration: Insights were shared on how inQuba’s advanced technology can streamline customer interactions and drive operational efficiency.
3. About Me
3
• Before H2O
• Water Engineer / Researcher /
Matlab Fan Boy @matlabulous
• Discovered R, Python, H2O …
never look back again
• Data Scientist at Virgin Media (UK),
Domino Data Lab (US)
• At H2O …
• Data Scientist / Evangelist /
• Sales Engineer / Solution Architect /
• Community Manager
(The harsh reality of startup life)
• H2O SWAG Photographer
(#AroundTheWorldWithH2Oai)
• H2O SWAG Distributor
(Love H2O? Come get some stickers!)
10. CONFIDENTIAL
Gartner names H2O as Leader with the most completeness of vision
• H2O.ai recognized as a technology
leader with most completeness of
vision
• H2O.ai was recognized for the
mindshare, partner network and
status as a quasi-industry standard
for machine learning and AI.
• H2O customers gave the highest
overall score among all the vendors
for sales relationship and account
management, customer support
(onboarding, troubleshooting, etc.)
and overall service and support.
12. Founded 2012, Series C in Nov, 2017
Products • Driverless AI – Automated Machine Learning
• H2O Open Source Machine Learning
• Sparkling Water
Mission Democratize AI. Do Good
Team ~100 employees
• Distributed Systems Engineers doing Machine Learning
• World-class visualization designers
Offices Mountain View, London, Prague
12
Company Overview
13. H2O Products
In-Memory, Distributed
Machine Learning Algorithms
with H2O Flow GUI
H2O AI Open Source Engine
Integration with Spark
Lightning Fast machine
learning on GPUs
Automatic feature
engineering, machine
learning and interpretability
Secure multi-tenant H2O clusters
14. H2O Products
In-Memory, Distributed
Machine Learning Algorithms
with H2O Flow GUI
H2O AI Open Source Engine
Integration with Spark
Lightning Fast machine
learning on GPUs
Automatic feature
engineering, machine
learning and interpretability
Secure multi-tenant H2O clusters
In-Memory, Distributed
Machine Learning Algorithms
with H2O Flow GUI
This Workshop
32. Supervised Learning
• Generalized Linear Models: Binomial,
Gaussian, Gamma, Poisson and Tweedie
• Naïve Bayes
Statistical
Analysis
Ensembles
• Distributed Random Forest: Classification
or regression models
• Gradient Boosting Machine: Produces an
ensemble of decision trees with increasing
refined approximations
Deep Neural
Networks
• Deep learning: Create multi-layer feed
forward neural networks starting with an
input layer followed by multiple layers of
nonlinear transformations
H2O-3 Algorithms Overview
Unsupervised Learning
• K-means: Partitions observations into k
clusters/groups of the same spatial size.
Automatically detect optimal k
Clustering
Dimensionality
Reduction
• Principal Component Analysis: Linearly transforms
correlated variables to independent components
• Generalized Low Rank Models: extend the idea of
PCA to handle arbitrary data consisting of numerical,
Boolean, categorical, and missing data
Anomaly
Detection
• Autoencoders: Find outliers using a
nonlinear dimensionality reduction using
deep learning
32
51. 51
Time Topics / Tasks
1:30 – 1:45 pm
Install h2o, lime, mlbench from CRAN
slides/code: bit.ly/joe_eRum_2018
1:45 – 2:00 pm Introduction (H2O, AutoML, LIME)
2:00 – 2:30 pm
Regression Example
examplesregression_...Rmd
2:30 – 3:00 pm Classification Example
3:00 – 3:30 pm ☕ 🍰 🍪
3:30 – 3:45 pm Quick Recap
3:45 – 4:15 pm Real Use-Case: Moneyball
4:15 – 4:30 pm Other H2O News + Q & A
52. Learning from Boston Housing Data
Features:
No. of rooms, Crime Rate,
Age …
Median
House
Value
(medv)
Predictions
H2O Machine Learning
Learn the Pattern
52
53. 53
Time Topics / Tasks
1:30 – 1:45 pm
Install h2o, lime, mlbench from CRAN
slides/code: bit.ly/joe_eRum_2018
1:45 – 2:00 pm Introduction (H2O, AutoML, LIME)
2:00 – 2:30 pm Regression Example
2:30 – 3:00 pm
Classification Example
examplesclassification_...Rmd
3:00 – 3:30 pm ☕ 🍰 🍪
3:30 – 3:45 pm Quick Recap
3:45 – 4:15 pm Real Use-Case: Moneyball
4:15 – 4:30 pm Other H2O News + Q & A
54. Learning from Diabetes Data
Features:
Pregnant, Age, Glucose …
Positive
/
Negative
Predictions
H2O Machine Learning
Learn the Pattern
54
64. Approach One: Learning from Lahman only
Lahman: Age, Height, Weight …
Historical Performance Stats
Home Runs
Batting Average
…
Predictions
H2O AutoML:
Learn the Pattern
Sliding Windows (Stats from previous n years)
About 300 Lahman Features
64
65. Approach Two: Learning from Lahman & AriDB
Lahman: Age, Height, Weight …
Historical Performance Stats
Home Runs
Batting Average
…
Predictions
H2O AutoML:
Learn the Pattern
Sliding Windows (Stats from previous n years)
About 300 Lahman Features +
200 AriDB Features
AriDB: Fastball, curveball,
slider, velocity …
65
69. 69
Time Topics / Tasks
1:30 – 1:45 pm
Install h2o, lime, mlbench from CRAN
slides/code: bit.ly/joe_eRum_2018
1:45 – 2:00 pm Introduction (H2O, AutoML, LIME)
2:00 – 2:30 pm Regression Example
2:30 – 3:00 pm Classification Example
3:00 – 3:30 pm ☕ 🍰 🍪
3:30 – 3:45 pm Quick Recap
3:45 – 4:15 pm Real Use-Case: Moneyball
4:15 – 4:30 pm Other H2O News + Q & A
70. H2O Products
In-Memory, Distributed
Machine Learning Algorithms
with H2O Flow GUI
H2O AI Open Source Engine
Integration with Spark
Lightning Fast machine
learning on GPUs
Automatic feature
engineering, machine
learning and interpretability
Secure multi-tenant H2O clusters
Lightning Fast machine
learning on GPUs
71. “Confidential and property of H2O.ai. All rights reserved”
Supervised Learning
• Generalized Linear Models: Binomial,
Gaussian, Gamma, Poisson and
Tweedie
• Naïve Bayes
Statistical
Analysis
Ensembles
• Distributed Random Forest:
Classification or regression models
• Gradient Boosting Machine: Produces
an ensemble of decision trees with
increasing refined approximations
Deep Neural
Networks
• Deep learning: Create multi-layer feed
forward neural networks starting with
an input layer followed by multiple
layers of nonlinear transformations
Algorithms on H2O-3 (CPU)
Unsupervised Learning
• K-means: Partitions observations into
k clusters/groups of the same spatial
size. Automatically detect optimal k
Clustering
Dimensionality
Reduction
• Principal Component Analysis: Linearly
transforms correlated variables to independent
components
• Generalized Low Rank Models: extend the idea of
PCA to handle arbitrary data consisting of
numerical, Boolean, categorical, and missing
data
Anomaly
Detection
• Autoencoders: Find outliers using a
nonlinear dimensionality reduction
using deep learning
72. “Confidential and property of H2O.ai. All rights reserved”
Supervised Learning
• Generalized Linear Models: Binomial,
Gaussian, Gamma, Poisson and
Tweedie
• Naïve Bayes
Statistical
Analysis
Ensembles
• Distributed Random Forest:
Classification or regression models
• Gradient Boosting Machine: Produces
an ensemble of decision trees with
increasing refined approximations
Deep Neural
Networks
• Deep learning: Create multi-layer feed
forward neural networks starting with
an input layer followed by multiple
layers of nonlinear transformations
Algorithms on H2O4GPU (more to come)
Unsupervised Learning
• K-means: Partitions observations into
k clusters/groups of the same spatial
size. Automatically detect optimal k
Clustering
Dimensionality
Reduction
• Principal Component Analysis: Linearly
transforms correlated variables to independent
components
• Generalized Low Rank Models: extend the idea of
PCA to handle arbitrary data consisting of
numerical, Boolean, categorical, and missing
data
Anomaly
Detection
• Autoencoders: Find outliers using a
nonlinear dimensionality reduction
using deep learning
78. Supported Formats & Data Sources
CSV
XLS
XLSX
ORC*
Hive*
SVMLight
ARFF
Parquet
Avro 1.8.0*
HDFS
S3
NFS
LOCAL
SQL
9Formats 5Sources
File type or Folder of Files
* 1. only if H2O is running as a Hadoop job
* 2. Hive files that are saved in ORC format
* 3. without multi-file parsing or column type modification
84. Distributed Algorithms
• Foundation for In-Memory Distributed Algorithm
Calculation - Distributed Data Frames and columnar
compression
• All algorithms are distributed in H2O: GBM, GLM, DRF, Deep
Learning and more. Fine-grained map-reduce iterations.
• Only enterprise-grade, open-source distributed algorithms
in the market
User Benefits
Advantageous Foundation
• “Out-of-box” functionalities for all algorithms (NO MORE
SCRIPTING) and uniform interface across all languages: R,
Python, Java
• Designed for all sizes of data sets, especially large data
• Highly optimized Java code for model exports
• In-house expertise for all algorithms
Parallel Parse into Distributed Rows
Fine Grain Map Reduce Illustration: Scalable
Distributed Histogram Calculation for GBM
Foundation for Distributed Algorithms
84