Predicting Medical Test Results using Driverless AI

•Download as PPTX, PDF•

2 likes•274 views

1. poder.IO uses AI to predict customer behavior and personalize experiences. It deploys over 100 models daily using techniques like regression, classification, text analysis and deep learning. 2. Driverless AI is currently used to benchmark models before production and for research cases. It may be used starting Q3 2018 for advertising optimization, content classification, profile matching and look-alike modeling. 3. A joint team from poder.IO and Bayer developed models to predict individual medical test results using healthcare data, without direct lab measures. This could help improve treatment strategies. They used techniques like GLM, GBM, random forest and Driverless AI to develop and compare models for a medical test, finding Driver

Alexander Gedranovich
Chief Technology Officer
poder.IO
linkedin.com/in/alexander-gedranovich-73847435
Predicting medical
tests results using
Driverless AI

Outline
1. poder.IO Introduction
2. H2O at poder.IO
3. Cases for Driverless AI
4. Predicting medical tests results

poder.IO Introduction
Our main product is cloud platform EPICA.
EPICA uses AI to predict what your audience is going to do and when
with a high degree of accuracy.
You can use these predictions to have granular understanding of
customer journeys and personalize user’s experience at individual
level across web, email, social and digital advertising.

H2O at poder.IO
We update and deploy as API 100+ models daily (POJO / MOJO)
• Regression / Classification (GBM, GLM, RandomForest)
• Text Classification (Word2Vec +)
• Time Series Patterns (iSAX)
• Deep Networks (DeepWater + Tensorflow)
• Etc.

Cases for Driverless AI
At the moment:
1. Driverless AI as a benchmark for all models before production
2. Research Department for handle clients’ cases
Planning to use in production Q3 2018:
1. Advertising Campaigns Optimization
2. Content Classification
3. Profiles Matching
4. Look-a-like models

Predicting medical tests results
Disclaimer
The research was supported by Bayer AG.
The project was completed by the joint team of Data Scientists from
RocketScience.ai and Analytics from Bayer.
Currently RocketScience.ai team is a part of poder.IO.

Predicting medical tests results: Problem
The research goal is to develop an approach to predict individual
medical test results based on longitudinal medical and pharma claims
data without direct lab measures using data-driven analytic
techniques.
Such discoveries may result in improved treatment strategies.
TODO: // Substitute to graphics

Predicting medical tests results: Problem
• Medical laboratory test, which is required for making a decision on a
patient’s treatment strategy
• The test results are not available in most healthcare databases
• There is a need to predict the results of the test for any patient at any
point of time

Predicting medical tests results: Design

Predicting medical tests results: Data
• 10 years time interval
• 11 M records
• 4 M unique patients
• Training data: 80%
• Test data: 20%
• Number of raw features: ~260

Predicting medical tests results: Prerequisites
Models / methods:
• ETL (C++, R, ggplot2)
• H2O.ai based GLM, GBM, Random Forest
• H2O.ai Driverless AI
Hardware:
• ETL, H2O models: 128Gb / 1Tb / 32 cores
• Driverless AI: AWS g3.8xlarge

Predicting medical tests results: Outcome
Model Training time RMSE R2 MAE Top features
GLM (ElasticNet) 00:13:20 16.477 0.5540 13.3785 100% original
GBM 100% original
Random Forest 100% original
Ensemble (3
models)
-
Ensemble (9
models)
-
Driverless AI 00:55:15 15.913 0.5857 12.8999 46% original
TODO: // Fill details

Predicting medical tests results: top 10 features
TODO: // Insert table with top 10 feature intersection from different
models

Predicting medical tests results: Surrogate model

Predicting medical tests results: Partial dependence

This talk was given at H2O World 2018 NYC and can be viewed here: https://youtu.be/oxLZZMR1lVY Description: Driverless AI is H2O.ai's latest flagship product for automatic machine learning. It fully automates some of the most challenging and productive tasks in applied data science such as feature engineering, model tuning, model ensembling and model deployment. Driverless AI turns Kaggle-winning grandmaster recipes into production-ready code, and is specifically designed to avoid common mistakes such as under- or overfitting, data leakage or improper model validation, some of the hardest challenges in data science. Avoiding these pitfalls alone can save weeks or more for each model, and is necessary to achieve high modeling accuracy, especially for time-series problems. With Driverless AI, data scientists of all proficiency levels can train and deploy modeling pipelines with just a few clicks from the GUI. Advanced users can use the client API from Python. Driverless AI builds hundreds or thousands of models under the hood to select the best feature engineering and modeling pipeline for every specific problem such as churn prediction, fraud detection, real-estate pricing, store sales prediction, marketing ad campaigns and many more. To speed up training, Driverless AI uses highly optimized C++/CUDA algorithms to take full advantage of the latest compute hardware. For example, Driverless AI runs orders of magnitudes faster on the latest Nvidia GPU supercomputers on Intel and IBM platforms, both in the cloud or on premise. Driverless AI is fully supported on all major cloud providers. There are two more product innovations in Driverless AI: statistically rigorous automatic data visualization and machine learning interpretability with reason codes and explanations in plain English. Both help data scientists and analysts to quickly validate the data and the models. In this talk, we explain how Driverless AI works and show how easy it is to reach top 5% rankings for several highly competitive Kaggle competitions. (edited) Speaker's Bio: Arno Candel is the Chief Technology Officer at H2O.ai. He is the main committer of H2O-3 and Driverless AI and has been designing and implementing high-performance machine-learning algorithms since 2012. Previously, he spent a decade in supercomputing at ETH and SLAC and collaborated with CERN on next-generation particle accelerators. Arno holds a PhD and Masters summa cum laude in Physics from ETH Zurich, Switzerland. He was named “2014 Big Data All-Star” by Fortune Magazine and featured by ETH GLOBE in 2015. Follow him on Twitter: @ArnoCandel.

Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...

Sri Ambati

This talk was given at H2O World 2018 NYC and can be viewed here: https://youtu.be/xc3j20Om3UM Description: Data science is indeed one of the sexy jobs of the 21st century. But it is also a lot of hard work. And the hard work is seldom about the math or the algorithms. It is about building relevant machine learning products for the real world. We will go over some of the must-haves as you take your machine learning model out of the sandbox and make it work in the big, bad world outside. Speaker's Bio: Krish Swamy is an experienced professional with deep skills in applying analytics and BigData capabilities to challenging business problems and driving customer insights. Krish's analytic experience includes marketing and pricing, credit risk, digital analytics and most recently, big data analytics and data transformation. His key experiences lie in banking and financial services, the digital customer experience domain, with a background in management consulting. Other key skills include influencing organizational change towards a data and analytics driven culture, and building teams of analysts, statisticians and data scientists.

Machine Learning with H2O

Sri Ambati

Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018

Sri Ambati

This talk was recorded in London on Oct 30, 2018 and can be viewed here: https://youtu.be/p4iAnxwC_Eg The good news is building fair, accountable, and transparent machine learning systems is possible. The bad news is it’s harder than many blogs and software package docs would have you believe. The truth is nearly all interpretable machine learning techniques generate approximate explanations, that the fields of eXplainable AI (XAI) and Fairness, Accountability, and Transparency in Machine Learning (FAT/ML) are very new, and that few best practices have been widely agreed upon. This combination can lead to some ugly outcomes! This talk aims to make your interpretable machine learning project a success by describing fundamental technical challenges you will face in building an interpretable machine learning system, defining the real-world value proposition of approximate explanations for exact models, and then outlining the following viable techniques for debugging, explaining, and testing machine learning models Mateusz is a software developer who loves all things distributed, machine learning and hates buzzwords. His favourite hobby data juggling. He obtained his M.Sc. in Computer Science from AGH UST in Krakow, Poland, during which he did an exchange at L’ECE Paris in France and worked on distributed flight booking systems. After graduation he move to Tokyo to work as a researcher at Fujitsu Laboratories on machine learning and NLP projects, where he is still currently based.

Scalable Automatic Machine Learning with H2O

Sri Ambati

In this presentation, Parul Pandey, will provide a history and overview of the field of “Automatic Machine Learning” (AutoML), followed by a detailed look inside H2O’s open source AutoML algorithm. H2O AutoML provides an easy-to-use interface which automates data pre-processing, training and tuning a large selection of candidate models (including multiple stacked ensemble models for superior model performance). The result of the AutoML run is a “leaderboard” of H2O models which can be easily exported for use in production. AutoML is available in all H2O interfaces (R, Python, Scala, web GUI) and due to the distributed nature of the H2O platform, can scale to very large datasets. The presentation will end with a demo of H2O AutoML in R and Python, including a handful of code examples to get you started using automatic machine learning on your own projects. Parul's Bio: Parul is a Data Science Evangelist here at H2O.ai. She combines Data Science, evangelism and community in her work. Her emphasis is to spread the information about H2O and Driverless AI to as many people as possible, She is also an active writer and has contributed towards various national and international publications.

Near realtime AI deployment with huge data and super low latency - Levi Brack...

Sri Ambati

Published on Nov 2, 2018 This talk was recorded in London on October 30th, 2018 and can be viewed here: https://youtu.be/erHt-1yBuUw Session: Travelport is a leading travel commerce platform that has truly huge data and many complex needs in terms of processing, performance and latency. This talk will demonstrate how we were able to harness big data technologies, H2O and cloud integration to deploy AI at scale and at low latency. The talk to cover practical advice taken from our AI journey; you will learn the successful strategies and the pitfalls of near real-time retraining ML models with streaming data and using all opensource technologies. Bio: As principal data scientist at Travelport, Levi Brackman leads a team of data scientists that are putting ML model into production. Prior to Travelport, Levi spent most of his career in the start-up world. He founded and led an organization that created innovative educational software applications and solutions used by high schools and youth organizations in the USA and Australia. Levi earned a PhD in the quantitative social sciences under the supervision of one the world's leading educational psychologists. He earned master’s degree from University College London and is author of a business book published in eight languages that was a bestseller in multiple countries. A native of North London (UK) Levi is married and has five children and now lives in Broomfield, Colorado.

H2O AutoML roadmap - Ray Peck

Sri Ambati

Building Real Time Targeting Capabilities - Ryan Zotti, Subbu Thiruppathy - C...

Sri Ambati

The document discusses building real-time targeting capabilities at Capital One. It introduces two speakers, Ryan Zotti and Subbu Thiruppathy, and describes challenges around striving for speed in everything. It then covers how to achieve fast model data, training, deployment, and scoring through techniques like using the most up-to-date data, distributed computing in the cloud, automatic model refitting, and response times under 100 milliseconds.

This document discusses guidelines for building machine learning models using Dun & Bradstreet data. It explores several hypotheses through experiments on different datasets. The main findings are: (1) ML models outperform traditional models when there are at least 1,000 "bad" records, rather than total records; (2) variable filtering before ML modeling improves performance; (3) segmenting models can boost ML performance similar to traditional models; and (4) ML models provide less lift when a few variables are much more predictive than others. The recommendations are to focus on "bad" record count, filter variables, consider segmentation, and prefer traditional models if few variables dominate predictions.

Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One

Sri Ambati

Presented at #H2OWorld 2017 in Mountain View, CA. Learn more about H2O.ai: https://www.h2o.ai/. Follow @h2oai: https://twitter.com/h2oai. - - - Effective volume anomaly detection presents unique challenges when monitoring customer transaction volumes across thousands of platforms and systems. We overcome this by using H2O, building on open source tools, and delivering machine learning anomaly detection for enterprise scale. Hear how we model, visualize then automatically alert on anomalous Mobile app volumes in real-time. Donald Gennetten has over 15 years experience supporting digital channels in the Financial Services industry. In his current role as a Data Engineer for Capital One’s Monitoring Intelligence team, he leads a cross-functional group of Data, Business, and Engineering subject matter experts to deliver Advanced Analytics solutions for real-time customer transaction monitoring and issue detection. Rahul Gupta is a Data Engineer in Capital One's Center for Machine Learning, focusing heavily on back-end development and model creation. His primary efforts include building an Algorithmic IT Operations (AIOps) platform that utilizes a combination of batch and streaming data with Machine Learning capabilities to improve the stability of Capital One services and overall customer experience.

Data Science, Machine Learning, and H2O

Sri Ambati

Dive into H2O: NYC

Sri Ambati

This session took place at New York City on November 4th, 2019. Speaker Bio: Chemere is a Senior Data Science Training Specialist for H2O.ai. Chemere has a Master's in Business Administration with focus in Marketing Analytics from the University of North Carolina at Charlotte. She is an experienced data scientist with a diverse background in transformational decision-making in various industries including Banking, Manufacturing, Logistics, and Medical Devices. Chemere joins us from Venus Concept/2two5, where she was the Lead Data Scientist focused on building predictive models with Internet of Things (IoT) data and for a subscription-based marketing product for B2B customers. Prior to that, Chemere worked as a Senior Data Scientist at Wells Fargo Bank focused on various applied predictive analytic solutions. More details about the event can be had here: https://www.eventbrite.com/e/dive-into-h2o-new-york-tickets-76351721053

Introducción al Machine Learning Automático

Sri Ambati

¿Cómo puede llevar el aprendizaje automático a las masas? Los proyectos de Machine Learning con la búsqueda de talento, el tiempo para construir e implementar modelos y confiar en los modelos que se construyen. ¿Cómo puede tener varios equipos en su organización para crear modelos de ML precisos sin ser expertos en ciencia de datos o aprendizaje automático? ¿Se pregunta sobre los diferentes sabores de AutoML? H2O Driverless AI emplea las técnicas de científicos expertos en datos en una aplicación fácil de usar que ayuda a escalar sus esfuerzos de ciencia de datos. La inteligencia artificial Driverless permite a los científicos de datos trabajar en proyectos más rápido utilizando la automatización y la potencia de computación de vanguardia de las GPU para realizar tareas en minutos que solían tomar meses. Con H2O Driverless AI, todos, incluyendo expertos y científicos de datos junior, científicos de dominio e ingenieros de datos pueden desarrollar modelos confiables de aprendizaje automático. Esta plataforma de aprendizaje automático de última generación ofrece una funcionalidad única y avanzada para la visualización de datos, la ingeniería de características, la interpretabilidad del modelo y la implementación de baja latencia. H2O Driverless AI hace: * Visualización automática de datos * Ingeniería automática de funciones a nivel de Grandmaster * Selección automática del modelo * Ajuste y capacitación automáticos del modelo * Paralelización automática utilizando múltiples CPU o GPU * Ensamblaje automático del modelo *automática del Interpretaciónaprendizaje automático (MLI) * Generación automática de código de puntuación ¿Quieres probarlo tú mismo? Puede obtener una prueba gratuita aquí: H2O Driverless AI trial. Venga a esta sesión y descubra cómo comenzar con el Aprendizaje automático automático con AI sin conductor H2O, y cree modelos potentes con solo unos pocos clics. ¡Te veo pronto! Acerca de H2O.ai H2O.ai es una empresa visionaria de software de código abierto de Silicon Valley que creó y reimaginó lo que es posible. Somos una empresa de fabricantes que trajeron al mercado nuevas plataformas y tecnologías para impulsar el movimiento de inteligencia artificial. Somos los creadores de, H2O, la principal plataforma de aprendizaje de ciencia de datos de fuente abierta y de aprendizaje automático utilizada por casi la mitad de Fortune 500 y en la que confían más de 14,000 organizaciones y cientos de miles de científicos de datos de todo el mundo.

H2O for Medicine and Intro to H2O in Python

Sri Ambati

Erin LeDell presents on machine learning for medicine using the H2O platform. She discusses how electronic health records, genomic data, medical images, and data from wearables can be used with machine learning for applications like predictive diagnostics, prognosis, and remote patient monitoring. H2O is an open source machine learning platform that provides algorithms like deep learning, random forests, and gradient boosting in an easy to use interface. It demonstrates an EEG example to predict eye state from brain signals.

Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...

Sri Ambati

This session was recorded in San Francisco on February 9th, 2019 and can be viewed here: https://youtu.be/6KY4CSA1AzU Marc Stein is the founder and CEO of Underwrite.ai. Underwrite.ai applies advances in artificial intelligence derived from genomics and particle physics to provide lenders with non-linear, dynamic models of credit risk which radically outperform traditional approaches. Marc’s career has always revolved around deep interests in artificial intelligence, quantum physics, genomics, sugar cream pie, and all ice cream flavors found at Berthillon and the challenge of how to combine all these in practical applications.

Prithvi Prabhu + Shivam Bansal, H2O.ai - Building Blocks for AI Applications ...

Sri Ambati

This session was recorded in NYC on October 22nd, 2019 and can be viewed here: https://www.youtube.com/watch?v=xAhQAYV5_PY&list=PLNtMya54qvOE3AvWRCNF2tybxNobUbAYp&index=3&t=2s Bio: Prithvi is Chief of Technology, Applications at H2O.ai. Prithvi leads the design and development of “Q”, H2O.ai’s high scale exploratory data analysis and analytical application development platform. Prithvi has been with H2O.ai since its early days and has been responsible for several products including Driverless AI (our flagship automatic machine learning platform), Steam (distributed cluster management, model management and deployment for H2O), H2O.js (Javascript transpiler for H2O’s distributed runtime), Play (on-demand cloud provisioning system for H2O), Flow (a hybrid GUI/REPL/Notebook for H2O) and Lightning (statistical graphics for H2O). Bio: Shivam Bansal is a Data Scientist at H2O.ai and Kaggle Grandmaster in Kernels Section. He is the three times winner of Kaggle’s Data Science for Good Competition and winner of multiple other offline AI and Data Science competitions. Shivam has extensive cross-industry and hands-on experience in building data science products. He has helped clients in the Insurance, Healthcare, Banking, and Retail domains to solve unstructured data science problems by building end to end pipelines and solutions.

Introducción al Aprendizaje Automatico con H2O-3 (1)

Sri Ambati

ML Model Deployment and Scoring on the Edge with Automatic ML & DF

Sri Ambati

Machine Learning Model Deployment and Scoring on the Edge with Automatic Machine Learning and Data Flow YouTube Video URL: https://youtu.be/gB0bTH-L6DE Deploying Machine Learning models to the edge can present significant ML/IoT challenges centered around the need for low latency and accurate scoring on minimal resource environments. H2O.ai's Driverless AI AutoML and Cloudera Data Flow work nicely together to solve this challenge. Driverless AI automates the building of accurate Machine Learning models, which are deployed as light footprint and low latency Java or C++ artifacts, also known as a MOJO (Model Optimized). And Cloudera Data Flow leverage Apache NiFi that offers an innovative data flow framework to host MOJOs to make predictions on data moving on the edge.

H2O.ai's Driverless AI

Sri Ambati

The document discusses H2O.ai's Driverless AI product, which aims to automate and simplify the machine learning process. It provides an overview of H2O.ai as a company, their goals of operationalizing data science. Driverless AI uses techniques like automated feature engineering, model tuning and selection, and model ensembling to build accurate models fast. It also allows for interpreting and explaining machine learning models through features like model inspection and reason codes. A demo of Driverless AI predicting credit card default risk is shown to illustrate the system.

Dealing with uncertainty in fintech using AI

Data Products Meetup

Patrick Hall, H2O.ai - Human Friendly Machine Learning - H2O World San Francisco

Sri Ambati

This document provides a blueprint for developing a human-centered machine learning framework that combines techniques from AutoML, interpretable models, fairness, and post-hoc explanations to create low-risk models. It outlines steps for data exploration, benchmarking, training interpretable models, performing post-hoc analysis, implementing human review processes, and continually iterating to improve models. Open questions are also discussed around automation levels and implementing human appeals.

Driverless AI - Intro + Interactive Hands-on Lab

Sri Ambati

Enjoy the webinar recording here: https://youtu.be/Lll1qwQJKVw. Driverless AI speeds up data science workflows by automating feature engineering, model tuning, ensembling, and model deployment. In this presentation, Arno Candel (CTO, H2O.ai), gives a quick overview and guide attendees through an interactive hands-on lab using Qwiklabs. Driverless AI turns Kaggle-winning recipes into production-ready code and is specifically designed to avoid common mistakes such as under or overfitting, data leakage or improper model validation. Avoiding these pitfalls alone can save weeks or more for each model, and is necessary to achieve high modeling accuracy. With Driverless AI, everyone can now train and deploy modeling pipelines with just a few clicks from the GUI. Advanced users can use the client/server API through a variety of languages such as Python, Java, C++, go, C# and many more. To speed up training, Driverless AI uses highly optimized C++/CUDA algorithms to take full advantage of the latest compute hardware. For example, Driverless AI runs orders of magnitudes faster on the latest Nvidia GPU supercomputers on Intel and IBM platforms, both in the cloud or on-premise. There are two more product innovations in Driverless AI: statistically rigorous automatic data visualization and interactive model interpretation with reason codes and explanations in plain English. Both help data scientists and analysts to quickly validate the data and models.

Megan Kurka, H2O.ai - AutoDoc with H2O Driverless AI - H2O World 2019 NYC

Sri Ambati

This talk was recorded in NYC on October 22nd, 2019 and can be viewed here: https://youtu.be/aJJsrQHqsGg AutoDoc with H2O Driverless AI Driverless AI with Auto Doc is the next logical step of the data science workflow by taking the final step of automatically documenting and explaining the processes used by the platform. Auto Doc frees up the user from the time consuming task of documenting and summarizing their workflow while building machine learning models. The resulting documentation provides users with insight into machine learning workflow created by Driverless AI including details about the data used, the validation schema selected, model and feature tuning, and the final model created. With this capability in Driverless AI, users can focus on model insights and results. Bio: Megan is a Customer Data Scientist at H2O. Prior to working at H2O, she worked as a Data Scientist building products driven by machine learning for B2B customers. She has experience working with customers across multiple industries, identifying common problems, and designing robust and automated solutions.

Introduction & Hands-on with H2O Driverless AI

Sri Ambati

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Sri Ambati

Presented at #H2OWorld 2017 in Mountain View, CA. Enjoy the video: https://youtu.be/-rGRHrED94Y. Learn more about H2O.ai: https://www.h2o.ai/. Follow @h2oai: https://twitter.com/h2oai. - - - Abstract: Most machine learning systems enable two essential processes: creating a model and applying the model in a repeatable and controlled fashion. These two processes are interrelated and pose technological and organizational challenges as they evolve from research to prototype to production. This presentation outlines common design patterns for tackling such challenges while implementing machine learning in a production environment. Sergei's Bio: Dr. Sergei Izrailev is Chief Data Scientist at BeeswaxIO, where he is responsible for data strategy and building AI applications powering the next generation of real-time bidding technology. Before Beeswax, Sergei led data science teams at Integral Ad Science and Collective, where he focused on architecture, development and scaling of data science based advertising technology products. Prior to advertising, Sergei was a quant/trader and developed trading strategies and portfolio optimization methodologies. Previously, he worked as a senior scientist at Johnson & Johnson, where he developed intelligent tools for structure-based drug discovery. Sergei holds a Ph.D. in Physics and Master of Computer Science degrees from the University of Illinois at Urbana-Champaign.

Real-Time AI: Designing for Low Latency and High Throughput - Dr. Sergei Izra...

Sri Ambati

This talk was recorded in London on October 30th, 2018 and can be viewed here: https://youtu.be/CeOJFynB6BE Real-Time AI: Designing for Low Latency and High Throughput Bio: Dr. Sergei Izrailev is Chief Data Scientist at Beeswax, where he is responsible for data strategy and building AI applications powering the next generation of real-time bidding technology. Before Beeswax, Sergei led data science teams at Integral Ad Science and Collective, where he focused on architecture, development, and scaling of data science-based advertising technology products. Prior to advertising, Sergei was a quant/trader and developed trading strategies and portfolio optimization methodologies. Previously, he worked as a senior scientist at Johnson & Johnson, where he developed intelligent tools for structure-based drug discovery.

Predicting medical tests results using Driverless AI

Alexander Gedranovich

1. The document discusses using Driverless AI to predict medical test results based on patient data from a medical laboratory. 2. Over 11 million patient records spanning 10 years with 260 raw features were used to train and test models to predict a specific medical test result. 3. Driverless AI was able to predict the test results with less than 1 hour of training time and achieved similar accuracy as other models that took days to train, while only using 46% of the original features.

Bring Your Own Recipes Hands-On Session

Sri Ambati

1. Driverless AI can be used across many industries like banking, healthcare, telecom, and marketing to save time and money through tasks like fraud detection, customer churn prediction, and personalized recommendations. 2. The document highlights new features in Driverless AI 1.7.1 including improved time series recipes, natural language processing features, automatic visualization, and machine learning interpretability tools. 3. Driverless AI provides fully automated machine learning through techniques such as automatic feature engineering, model tuning, standalone scoring pipelines, and massively parallel processing to find optimal solutions.

What's hot

H2O Driverless AI Workshop

Sri Ambati

Intro to Machine Learning with H2O and AWS

Sri Ambati

Mark Seiss, Dun & Bradstreet - Importance of Domain Expertise for Building ML...

Sri Ambati

Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One

Sri Ambati

Data Science, Machine Learning, and H2O

Sri Ambati

Dive into H2O: NYC

Sri Ambati

Introducción al Machine Learning Automático

Sri Ambati

H2O for Medicine and Intro to H2O in Python

Sri Ambati

Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...

Sri Ambati

Prithvi Prabhu + Shivam Bansal, H2O.ai - Building Blocks for AI Applications ...

Sri Ambati

Introducción al Aprendizaje Automatico con H2O-3 (1)

Sri Ambati

ML Model Deployment and Scoring on the Edge with Automatic ML & DF

Sri Ambati

H2O.ai's Driverless AI

Sri Ambati

Dealing with uncertainty in fintech using AI

Data Products Meetup

Patrick Hall, H2O.ai - Human Friendly Machine Learning - H2O World San Francisco

Sri Ambati

Driverless AI - Intro + Interactive Hands-on Lab

Sri Ambati

Megan Kurka, H2O.ai - AutoDoc with H2O Driverless AI - H2O World 2019 NYC

Sri Ambati

Introduction & Hands-on with H2O Driverless AI

Sri Ambati

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Sri Ambati

Real-Time AI: Designing for Low Latency and High Throughput - Dr. Sergei Izra...

Sri Ambati

What's hot (20)

H2O Driverless AI Workshop

Intro to Machine Learning with H2O and AWS

Mark Seiss, Dun & Bradstreet - Importance of Domain Expertise for Building ML...

Using H2O for Mobile Transaction Forecasting & Anomaly Detection - Capital One

Data Science, Machine Learning, and H2O

Dive into H2O: NYC

Introducción al Machine Learning Automático

H2O for Medicine and Intro to H2O in Python

Marc Stein, Underwrite.ai - Driverless AI Use Cases in Finance and Cancer Gen...

Prithvi Prabhu + Shivam Bansal, H2O.ai - Building Blocks for AI Applications ...

Introducción al Aprendizaje Automatico con H2O-3 (1)

ML Model Deployment and Scoring on the Edge with Automatic ML & DF

H2O.ai's Driverless AI

Dealing with uncertainty in fintech using AI

Patrick Hall, H2O.ai - Human Friendly Machine Learning - H2O World San Francisco

Driverless AI - Intro + Interactive Hands-on Lab

Megan Kurka, H2O.ai - AutoDoc with H2O Driverless AI - H2O World 2019 NYC

Introduction & Hands-on with H2O Driverless AI

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Real-Time AI: Designing for Low Latency and High Throughput - Dr. Sergei Izra...

Similar to Predicting Medical Test Results using Driverless AI

Predicting medical tests results using Driverless AI

Alexander Gedranovich

Bring Your Own Recipes Hands-On Session

Sri Ambati

[Webinar] Getting to Insights Faster: A Framework for Agile Big Data

Infochimps, a CSC Big Data Business

OpenPOWER/POWER9 AI webinar

Ganesan Narayanasamy

IBM AI Solutions on Power Systems is a presentation about IBM's AI solutions. It introduces IBM Visual Insights for tasks like image classification, object detection, and segmentation. A use case demo shows breast cancer classification in under one second with high accuracy. Another demo detects diabetic retinopathy in eye images. The presentation discusses open issues in medical imaging AI and IBM's response to COVID-19, including an X-ray demo to detect COVID-19 in lung images. It calls for collaboration to share medical data and models.

Applying linear regression and predictive analytics

MariaDB plc

Crossing the Analytics Chasm and Getting the Models You Developed Deployed

Robert Grossman

There are two cultures in data science and analytics - those that develop analytic models and those that deploy analytic models into operational systems. In this talk, we review the life cycle of analytic models and provide an overview of some of the approaches that have been developed for managing analytic models and workflows and for deploying them, including using analytic engines and analytic containers . We give a quick overview of languages for analytic models (PMML) and analytic workflows (PFA). We also describe the emerging discipline of AnalyticOps that has borrowed some of the techniques of DevOps.

AI at Scale in Enterprises

Ganesan Narayanasamy

A confluence of events is accelerating the growth of AI in the Enterprise - (i) The COVID pandemic is accelerating the digital transformation of enterprises, (ii) increased digital sales & digital interaction is fueling interest in operationalizing AI to drive revenue and cost efficiencies and (iii) Enterprise databases and enterprise apps are infusing AI to transparently augment predictive capabilities for clients. Enterprise Power Systems are pillars of the global economy hosting our trinity of operating systems

A Look Under the Hood of H2O Driverless AI, Arno Candel - H2O World San Franc...

Sri Ambati

This session was recorded in San Francisco on February 4th, 2019 and can be viewed here: https://youtu.be/oQfFPPUg5t8 Bio: Arno Candel is the Chief Technology Officer at H2O.ai. He is the main committer of H2O-3 and Driverless AI and has been designing and implementing high-performance machine-learning algorithms since 2012. Previously, he spent a decade in supercomputing at ETH and SLAC and collaborated with CERN on next-generation particle accelerators. Arno holds a PhD and Masters summa cum laude in Physics from ETH Zurich, Switzerland. He was named “2014 Big Data All-Star” by Fortune Magazine and featured by ETH GLOBE in 2015. Follow him on Twitter: @ArnoCandel.

A Look Under the Hood of H2O Driverless AI

Sri Ambati

Driverless AI is H2O.ai's latest flagship product for automatic machine learning. It fully automates some of the most challenging and productive tasks in applied data science such as feature engineering, model tuning, model ensembling and production deployment. Driverless AI turns Kaggle-winning grandmaster recipes into production-ready code (Java and C++), and is specifically designed to avoid common mistakes such as under- or overfitting, data leakage or improper model validation, some of the hardest challenges in data science. Other industry-leading capabilities include automatic data visualization and machine learning interpretability. With Driverless AI, data scientists of all proficiency levels can train and deploy modeling pipelines with just a few clicks from the GUI. Advanced users can use the client API from Python or R. Driverless AI builds hundreds or thousands of models under the hood to select the best feature engineering and modeling pipeline for every specific problem such as churn prediction, fraud detection, real-estate pricing, store sales prediction, marketing ad campaigns and many more. With Bring-Your-Own-Recipe, domain experts and advanced data scientists can now write their own recipes and seamlessly extend Driverless AI with their favorite tools from the rich ecosystem of open-source data science and machine learning libraries. In this talk, we explain how Driverless AI works and demonstrate it with live demos. Arno's Bio: Arno Candel is the Chief Technology Officer at H2O.ai. He is the main committer of H2O-3 and Driverless AI and has been designing and implementing high-performance machine-learning algorithms since 2012. Previously, he spent a decade in supercomputing at ETH and SLAC and collaborated with CERN on next-generation particle accelerators. Arno holds a PhD and Masters summa cum laude in Physics from ETH Zurich, Switzerland. He was named “2014 Big Data All-Star” by Fortune Magazine and featured by ETH GLOBE in 2015. Follow him on Twitter: @ArnoCandel.

AI for Software Engineering

Miroslaw Staron

Introduction to Machine Learning on IBM Power Systems

David Spurway

Jonathon Wright - Intelligent Performance Cognitive Learning (AIOps)

Neotys_Partner

Since its beginning, the Performance Advisory Council aims to promote engagement between various experts from around the world, to create relevant, value-added content sharing between members. For Neotys, to strengthen our position as a thought leader in load & performance testing. During this event, 12 participants convened in Chamonix (France) exploring several topics on the minds of today’s performance tester such as DevOps, Shift Left/Right, Test Automation, Blockchain and Artificial Intelligence.

Data Science as a Service: Intersection of Cloud Computing and Data Science

Pouria Amirian

Data Science as a Service: Intersection of Cloud Computing and Data Science

Pouria Amirian

Customer value analysis of big data products

Vikas Sardana

Knowledge Discovery

André Karpištšenko

This document summarizes the 22nd ACM SIGKDD conference on knowledge discovery and data mining. It discusses the following topics in 3 sentences or less each: - Overview of the conference with ~80 sessions and 2,700 participants - Popular business applications of data mining like recommendation systems, predictive maintenance, and customer targeting - The typical predictive modeling flow including data preparation, model training, evaluation, and deployment

Using the power of OpenAI with your own data: what's possible and how to start?

Maxim Salnikov

This document provides an overview of a talk by Maxim Salnikov and Jon Jahren at Oslo Spektrum from November 7-9. It discusses using OpenAI with your own data and how to get started. Examples of enterprise use cases for generative AI are presented, such as chatbots, document indexing, and financial analysis. Tools for prompt engineering like LangChain and Semantic Kernel are introduced. Best practices for fine-tuning models on proprietary data are covered, including data formatting, training data size, and an iterative tuning process. Responsible AI techniques like grounding responses and maintaining a positive tone are also discussed.

AI as a Service, Build Shared AI Service Platforms Based on Deep Learning Tec...

Databricks

I will share the vision and the production journey of how we build enterprise shared AI As A Service platforms with distributed deep learning technologies. Including those topics: 1) The vision of Enterprise Shared AI As A Service and typical AI services use cases at FinTech industry 2) The high level architecture design principles for AI As A Service 3) The technical evaluation journey to choose an enterprise deep learning framework with comparisons, such as why we choose Deep learning framework based on Spark ecosystem 4) Share some production AI use cases, such as how we implemented new Users-Items Propensity Models with deep learning algorithms with Spark,improve the quality , performance and accuracy of offer and campaigns design, targeting offer matching and linking etc. 5) Share some experiences and tips of using deep learning technologies on top of Spark , such as how we conduct Intel BigDL into a real production.

Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi

Felicia Haggarty

The document discusses challenges with building operational data applications on Hadoop and introduces the Cask Data Application Platform (CDAP) as a solution. It provides an agenda that covers data applications, challenges, CDAP motivation and goals, use cases, and an introduction and architecture overview of CDAP. The document aims to demonstrate how CDAP provides a unified platform that simplifies application development and lifecycle while supporting reusable data and processing patterns.

Deep Learning & AI for Healthcare and Retail

E2E Networks Limited

Similar to Predicting Medical Test Results using Driverless AI (20)

Predicting medical tests results using Driverless AI

Bring Your Own Recipes Hands-On Session

[Webinar] Getting to Insights Faster: A Framework for Agile Big Data

OpenPOWER/POWER9 AI webinar

Applying linear regression and predictive analytics

Crossing the Analytics Chasm and Getting the Models You Developed Deployed

AI at Scale in Enterprises

A Look Under the Hood of H2O Driverless AI, Arno Candel - H2O World San Franc...

A Look Under the Hood of H2O Driverless AI

AI for Software Engineering

Introduction to Machine Learning on IBM Power Systems

Jonathon Wright - Intelligent Performance Cognitive Learning (AIOps)

Data Science as a Service: Intersection of Cloud Computing and Data Science

Customer value analysis of big data products

Knowledge Discovery

Using the power of OpenAI with your own data: what's possible and how to start?

AI as a Service, Build Shared AI Service Platforms Based on Deep Learning Tec...

Whither the Hadoop Developer Experience, June Hadoop Meetup, Nitin Motgi

Deep Learning & AI for Healthcare and Retail

More from Sri Ambati

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...

Sri Ambati

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day

Sri Ambati

This document provides an overview of H2O.ai, an AI company that offers products and services to democratize AI. It mentions that H2O products are backed by 10% of the world's top data scientists from Kaggle and that H2O has customers in 7 of the top 10 banks, 4 of the top 10 insurance companies, and top manufacturing companies. It also provides details on H2O's founders, funding, customers, products, and vision to make AI accessible to more organizations.

Generative AI Masterclass - Model Risk Management.pptx

Sri Ambati

Here are some key points about benchmarking and evaluating generative AI models like large language models: - Foundation models require large, diverse datasets to be trained on in order to learn broad language skills and knowledge. Fine-tuning can then improve performance on specific tasks. - Popular benchmarks evaluate models on tasks involving things like commonsense reasoning, mathematics, science questions, generating truthful vs false responses, and more. This helps identify model capabilities and limitations. - Custom benchmarks can also be designed using tools like Eval Studio to systematically test models on specific applications or scenarios. Both automated and human evaluations are important. - Leaderboards like HELM aggregate benchmark results to compare how different models perform across a wide range of tests and metrics.

AI and the Future of Software Development: A Sneak Peek

Sri Ambati

LLMOps: Match report from the top of the 5th

Sri Ambati

The document discusses LLMOps (Large Language Model Operations) compared to traditional MLOps. Some key points: - LLMOps and MLOps face similar challenges across the development lifecycle, but LLMOps requires more GPU resources and integration is faster due to more models in each application. Evaluation is also less clear. - The LLMOps field is around the 5th generation of models, with debates around proprietary vs open source models, and balancing privacy, cost and control. - LLMOps platforms are emerging to provide solutions for tasks like prompting, embedding databases, evaluation, and governance, similar to how MLOps platforms have evolved.

Building, Evaluating, and Optimizing your RAG App for Production

Sri Ambati

The document discusses optimizing question answering systems called RAG (Retrieve-and-Generate) stacks. It outlines challenges with naive RAG approaches and proposes solutions like improved data representations, advanced retrieval techniques, and fine-tuning large language models. Table stakes optimizations include tuning chunk sizes, prompt engineering, and customizing LLMs. More advanced techniques involve small-to-big retrieval, multi-document agents, embedding fine-tuning, and LLM fine-tuning.

Building LLM Solutions using Open Source and Closed Source Solutions in Coher...

Sri Ambati

Sandeep Singh, Head of Applied AI Computer Vision, Beans.ai H2O Open Source GenAI World SF 2023 In the modern era of machine learning, leveraging both open-source and closed-source solutions has become paramount for achieving cutting-edge results. This talk delves into the intricacies of seamlessly integrating open-source Large Language Model (LLM) solutions like Vicuna, Falcon, and Llama with industry giants such as ChatGPT and Google's Palm. As the demand for fine-tuned and specialized datasets grows, it is imperative to understand the synergy between these tools. Attendees will gain insights into best practices for building and enriching datasets tailored for fine-tuning tasks, ensuring that their LLM projects are both robust and efficient. Through real-world examples and hands-on demonstrations, this talk will equip attendees with the knowledge to harness the power of both open and closed-source tools in a coherent and effective manner.

Risk Management for LLMs

Sri Ambati

Patrick Hall, Professor, AI Risk Management, The George Washington University H2O Open Source GenAI World SF 2023 Language models are incredible engineering breakthroughs but require auditing and risk management before productization. These systems raise concerns about toxicity, transparency and reproducibility, intellectual property licensing and ownership, disinformation and misinformation, supply chains, and more. How can your organization leverage these new tools without taking on undue or unknown risks? While language models and associated risk management are in their infancy, a small number of best practices in governance and risk are starting to emerge. If you have a language model use case in mind, want to understand your risks, and do something about them, this presentation is for you!

Open-Source AI: Community is the Way

Sri Ambati

Dr. Alexy Khrabrov, Open Source Science Community Director, IBM H2O Open Source GenAI World SF 2023 In this talk, Dr. Alexy Khrabrov, recently elected Chair of the new Generative AI Commons at Linux Foundation for AI & Data, outlines the OSS AI landscape, challenges, and opportunities. With new models and frameworks being unveiled weekly, one thing remains constant: community building and validation of all aspects of AI is key to reliable and responsible AI we can use for business and society needs. Industrial AI is one key area where such community validation can prove invaluable.

Building Custom GenAI Apps at H2O

Sri Ambati

The document announces the launch of the H2O GenAI App Store, which provides a collection of applications that make it easier for average users to leverage large language models through custom interfaces for specific tasks like getting gardening advice or feedback on code. The app store is designed to accelerate the development of these GenAI apps using the H2O Wave platform and provides access to H2OGPTE for retrieval augmented generation and language model calls. Developers can also contribute their own apps through the GitHub repository listed.

Applied Gen AI for the Finance Vertical

Sri Ambati

Megan Kurka, Vice President, Customer Data Scientist, H2O.ai H2O Open Source GenAI World SF 2023 Discover the transformative power of Applied Gen AI. Learn how the H2O team builds customized applications and workflows that integrate capabilities of Gen AI and AutoML specifically designed to address and enhance financial use cases. Explore real world examples, learn best practices, and witness firsthand how our innovative solutions are reshaping the landscape of finance technology.

Cutting Edge Tricks from LLM Papers

Sri Ambati

This document discusses techniques for improving language models (LLMs) discussed in recent papers. It describes building blocks of LLMs like fine-tuning, foundation training, memory, and databases. Specific techniques covered include LIMA which uses 1,000 carefully curated examples, instruction backtranslation to generate question-answer pairs, fine-tuning models on API examples like Gorilla, and reducing false answers through techniques like not agreeing with incorrect user opinions. The goal is to discuss cutting edge tricks to build better LLMs.

Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...

Sri Ambati

Pascal Pfeiffer, Principal Data Scientist, H2O.ai H2O Open Source GenAI World SF 2023 This talk dives into the expansive ecosystem of Large Language Models (LLMs), offering practitioners an insightful guide to various relevant applications, from natural language understanding to creative content generation. While exploring use cases across different industries, it also honestly addresses the current limitations of LLMs and anticipates future advancements.

Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...

Sri Ambati

KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...

Sri Ambati

This document discusses using large language models (LLMs) for text classification tasks. It begins by describing how LLMs are commonly used for text generation and question answering. For classification, models are usually trained supervised on labeled data. The document then explores using LLMs for zero-shot classification without training, and techniques like fine-tuning LLMs on tasks to improve performance. It provides an example of fine-tuning an LLM on a financial sentiment dataset. The document concludes by describing H2O.ai's LLM Studio tool for fine-tuning and a few Kaggle competitions where LLMs achieved success in text classification.

LLM Interpretability

Sri Ambati

1) Generative AI (GenAI) enables the creation of novel content by learning patterns in unstructured data rather than labeling outputs like traditional AI. 2) Both traditional and generative AI models lack transparency and may contain biases, but generative models can additionally hallucinate or leak private information. 3) To interpret generative models, researchers evaluate accuracy globally by checking for hallucinations or undesirable content, and locally by confirming the quality of individual responses.

Never Reply to an Email Again

Sri Ambati

From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...

Sri Ambati

Numerai is an open, crowd-sourced hedge fund powered by predictions from data scientists around the world. In return, participants are rewarded with weekly payouts in crypto. In this talk, Joe will give an overview of the Numerai tournament based on his own experience. He will then explain how he automates the time-consuming tasks such as testing different modelling strategies, scoring new datasets, submitting predictions to Numerai as well as monitoring model performance with H2O Driverless AI and R.

AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...

Sri Ambati

In this session, you will learn about what you should do after you’ve taken an AI transformation baseline. Over the span of this session, we will discuss the next steps in moving toward AI readiness through alignment of talent and tools to drive successful adoption and continuous use within an organization. To find additional videos on AI courses, earn badges, join the courses at H2O.ai Learning Center: https://training.h2o.ai/products/ai-foundations-course To find the Youtube video about this presentation: https://youtu.be/K1Cl3x3rd8g Speaker: Chemere Davis (H2O.ai - Senior Data Scientist Training Specialist)

AI Foundations Course Module 1 - An AI Transformation Journey

Sri Ambati

The chances of successfully implementing AI strategies within an organization significantly improve when you can recognize where your organization is on the maturity scale. Over this course, you will learn the keys to unlocking value with AI which include asking the right questions about the problems you are solving and ensuring you have the right cross-section of talent, tools, and resources. By the end of this module, you should be able to recognize where your organization is on the AI transformation spectrum and identify some strategies that can get you to the next stage in your journey. To find additional videos on AI courses, earn badges, join the courses at H2O.ai Learning Center: https://training.h2o.ai/products/ai-foundations-course To find the Youtube video about this presentation: https://youtu.be/PJgr2epM6qs Speakers: Chemere Davis (H2O.ai - Senior Data Scientist Training Specialist) Ingrid Burton (H2O.ai - CMO)

More from Sri Ambati (20)

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day

Generative AI Masterclass - Model Risk Management.pptx

AI and the Future of Software Development: A Sneak Peek

LLMOps: Match report from the top of the 5th

Building, Evaluating, and Optimizing your RAG App for Production

Building LLM Solutions using Open Source and Closed Source Solutions in Coher...

Risk Management for LLMs

Open-Source AI: Community is the Way

Building Custom GenAI Apps at H2O

Applied Gen AI for the Finance Vertical

Cutting Edge Tricks from LLM Papers

Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...

Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...

KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...

LLM Interpretability

Never Reply to an Email Again

From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...

AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...

AI Foundations Course Module 1 - An AI Transformation Journey

Recently uploaded

How to use Firebase Data Connect For Flutter

Daiki Mogmet Ito

Data structures and Algorithms in Python.pdf

TIPNGVN2

A tale of scale & speed: How the US Navy is enabling software delivery from l...

sonjaschweigert1

Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved: - Reduction in onboarding time from 5 weeks to 1 day - Improved developer experience and productivity through actionable findings and reduction of false positives - Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO) Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production. We will cover: - How to remove silos in DevSecOps - How to build efficient development pipeline roles and component templates - How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence) - How to streamline operations with automated policy checks on container images

Large Language Model (LLM) and it’s Geospatial Applications

Rohit Gautam

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...

Neo4j

Sudheer Mechineni, Head of Application Frameworks, Standard Chartered Bank Discover how Standard Chartered Bank harnessed the power of Neo4j to transform complex data access challenges into a dynamic, scalable graph database solution. This keynote will cover their journey from initial adoption to deploying a fully automated, enterprise-grade causal cluster, highlighting key strategies for modelling organisational changes and ensuring robust disaster recovery. Learn how these innovations have not only enhanced Standard Chartered Bank’s data infrastructure but also positioned them as pioneers in the banking sector’s adoption of graph technology.

みなさんこんにちはこれ何文字まで入るの？40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの？えこ...

名前です男

Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack

shyamraj55

TrustArc Webinar - 2024 Global Privacy Survey

TrustArc

How does your privacy program stack up against your peers? What challenges are privacy teams tackling and prioritizing in 2024? In the fifth annual Global Privacy Benchmarks Survey, we asked over 1,800 global privacy professionals and business executives to share their perspectives on the current state of privacy inside and outside of their organizations. This year’s report focused on emerging areas of importance for privacy and compliance professionals, including considerations and implications of Artificial Intelligence (AI) technologies, building brand trust, and different approaches for achieving higher privacy competence scores. See how organizational priorities and strategic approaches to data security and privacy are evolving around the globe. This webinar will review: - The top 10 privacy insights from the fifth annual Global Privacy Benchmarks Survey - The top challenges for privacy leaders, practitioners, and organizations in 2024 - Key themes to consider in developing and maintaining your privacy program

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Aggregage

Video Streaming: Then, Now, and in the Future

Alpen-Adria-Universität

In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.

How to Get CNIC Information System with Paksim Ga.pptx

danishmna97

PCI PIN Basics Webinar from the Controlcase Team

ControlCase

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...

Neo4j

Leonard Jayamohan, Partner & Generative AI Lead, Deloitte This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.

20240607 QFM018 Elixir Reading List May 2024

Matthew Sinclair

Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...

Zilliz

Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI

Vladimir Iglovikov, Ph.D.

Presented by Vladimir Iglovikov: - https://www.linkedin.com/in/iglovikov/ - https://x.com/viglovikov - https://www.instagram.com/ternaus/ This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation. Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners. This case study covers various aspects, including: People: The contributors and community that have supported Albumentations. Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions. Challenges: The hurdles in monetizing open-source projects and measuring user engagement. Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration. Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community. Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations. Mental Health: Maintaining balance and not feeling pressured by user demands. Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth. Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects. Explore more about Albumentations and join the community at: GitHub: https://github.com/albumentations-team/albumentations Website: https://albumentations.ai/ LinkedIn: https://www.linkedin.com/company/100504475 Twitter: https://x.com/albumentations

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf

Malak Abu Hammad

Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers: * What is Vector Search? * Importance and benefits of vector search * Practical use cases across various industries * Step-by-step implementation guide * Live demos with code snippets * Enhancing LLM capabilities with vector search * Best practices and optimization strategies Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications. #MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology

Building RAG with self-deployed Milvus vector database and Snowpark Container...

Zilliz

Climate Impact of Software Testing at Nordic Testing Days

Kari Kakkonen

My slides at Nordic Testing Days 6.6.2024 Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf

Paige Cruz

Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack. While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack. I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:

Recently uploaded (20)

How to use Firebase Data Connect For Flutter

Data structures and Algorithms in Python.pdf

A tale of scale & speed: How the US Navy is enabling software delivery from l...

Large Language Model (LLM) and it’s Geospatial Applications

GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...

Let's Integrate MuleSoft RPA, COMPOSER, APM with AWS IDP along with Slack

TrustArc Webinar - 2024 Global Privacy Survey

Generative AI Deep Dive: Advancing from Proof of Concept to Production

Video Streaming: Then, Now, and in the Future

How to Get CNIC Information System with Paksim Ga.pptx

PCI PIN Basics Webinar from the Controlcase Team

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...

20240607 QFM018 Elixir Reading List May 2024

Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...

Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf

Building RAG with self-deployed Milvus vector database and Snowpark Container...

Climate Impact of Software Testing at Nordic Testing Days

Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf

Predicting Medical Test Results using Driverless AI

1. Alexander Gedranovich Chief Technology Officer poder.IO linkedin.com/in/alexander-gedranovich-73847435 Predicting medical tests results using Driverless AI

2. Outline 1. poder.IO Introduction 2. H2O at poder.IO 3. Cases for Driverless AI 4. Predicting medical tests results

3. poder.IO Introduction Our main product is cloud platform EPICA. EPICA uses AI to predict what your audience is going to do and when with a high degree of accuracy. You can use these predictions to have granular understanding of customer journeys and personalize user’s experience at individual level across web, email, social and digital advertising.

4. H2O at poder.IO We update and deploy as API 100+ models daily (POJO / MOJO) • Regression / Classification (GBM, GLM, RandomForest) • Text Classification (Word2Vec +) • Time Series Patterns (iSAX) • Deep Networks (DeepWater + Tensorflow) • Etc.

5. Cases for Driverless AI At the moment: 1. Driverless AI as a benchmark for all models before production 2. Research Department for handle clients’ cases Planning to use in production Q3 2018: 1. Advertising Campaigns Optimization 2. Content Classification 3. Profiles Matching 4. Look-a-like models

6. Predicting medical tests results Disclaimer The research was supported by Bayer AG. The project was completed by the joint team of Data Scientists from RocketScience.ai and Analytics from Bayer. Currently RocketScience.ai team is a part of poder.IO.

7. Predicting medical tests results: Problem The research goal is to develop an approach to predict individual medical test results based on longitudinal medical and pharma claims data without direct lab measures using data-driven analytic techniques. Such discoveries may result in improved treatment strategies. TODO: // Substitute to graphics

8. Predicting medical tests results: Problem • Medical laboratory test, which is required for making a decision on a patient’s treatment strategy • The test results are not available in most healthcare databases • There is a need to predict the results of the test for any patient at any point of time

9. Predicting medical tests results: Design

10. Predicting medical tests results: Design

11. Predicting medical tests results: Data • 10 years time interval • 11 M records • 4 M unique patients • Training data: 80% • Test data: 20% • Number of raw features: ~260

12. Predicting medical tests results: Prerequisites Models / methods: • ETL (C++, R, ggplot2) • H2O.ai based GLM, GBM, Random Forest • H2O.ai Driverless AI Hardware: • ETL, H2O models: 128Gb / 1Tb / 32 cores • Driverless AI: AWS g3.8xlarge

13. Predicting medical tests results: Outcome Model Training time RMSE R2 MAE Top features GLM (ElasticNet) 00:13:20 16.477 0.5540 13.3785 100% original GBM 100% original Random Forest 100% original Ensemble (3 models) - Ensemble (9 models) - Driverless AI 00:55:15 15.913 0.5857 12.8999 46% original TODO: // Fill details

14. Predicting medical tests results: top 10 features TODO: // Insert table with top 10 feature intersection from different models

15. Predicting medical tests results: Surrogate model

16. Predicting medical tests results: Partial dependence

17. Thank you!

Predicting Medical Test Results using Driverless AI

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Predicting Medical Test Results using Driverless AI

Similar to Predicting Medical Test Results using Driverless AI (20)

More from Sri Ambati

More from Sri Ambati (20)

Recently uploaded

Recently uploaded (20)

Predicting Medical Test Results using Driverless AI