This document summarizes a presentation about how H2O AutoML, LIME, and Shiny were used to help make a multimillion-dollar decision for a baseball team based on player performance data. The problem was framed as predicting player performance over multiple years. H2O AutoML was used to generate predictions from both public baseball data and a team's proprietary data. LIME was used to explain the H2O models. The results were shared via a Shiny app and helped the team sign a player to a multimillion dollar contract. The presentation covered how the business problem was framed, the tools used, and a demo of the results.
Joe Chow (H2O.ai) recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was just to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Briefing for the lemmings.io summer program on building bots and apps on top of all major messaging platforms (Facebook Messenger, Skype, Telegram, WeChat, Kik, …).
Presentation design & structure: @@david_pfluegl
Web 2.0 for Dummies presentation. describing Web 2.0 concepts, technologies and applications. The presentation includes links to relevant URLs. For more details see my blog Web 2.0 for Dummies posts at
http://avirosenthal.blogspot.com/
Joe Chow (H2O.ai) recently teamed up with IBM and Aginity to create a proof of concept "Moneyball" app for the IBM Think conference in Vegas. The original goal was just to prove that different tools (e.g. H2O, Aginity AMP, IBM Data Science Experience, R and Shiny) could work together seamlessly for common business use-cases. Little did Joe know, the app would be used by Ari Kaplan (the real "Moneyball" guy) to validate the future performance of some baseball players. Ari recommended one player to a Major League Baseball team. The player was signed the next day with a multimillion-dollar contract. This talk is about Joe's journey to a real "Moneyball" application.
Briefing for the lemmings.io summer program on building bots and apps on top of all major messaging platforms (Facebook Messenger, Skype, Telegram, WeChat, Kik, …).
Presentation design & structure: @@david_pfluegl
Web 2.0 for Dummies presentation. describing Web 2.0 concepts, technologies and applications. The presentation includes links to relevant URLs. For more details see my blog Web 2.0 for Dummies posts at
http://avirosenthal.blogspot.com/
A closer look at the trends that will shape Digital Marketing in 2017 – and how you can leverage them to outperform your peers. Read more in this blogpost series: https://webrepublic.com/en/blog/2016/11/18/digital-marketing-trends-2017-en/.
Transform your Business with AI, Deep Learning and Machine LearningSri Ambati
Video: https://www.youtube.com/watch?v=R3IXd1iwqjc
Meetup: http://www.meetup.com/SF-Bay-ACM/events/231709894/
In this talk, Arno Candel presents a brief history of AI and how Deep Learning and Machine Learning techniques are transforming our everyday lives. Arno will introduce H2O, a scalable open-source machine learning platform, and show live demos on how to train sophisticated machine learning models on large distributed datasets. He will show how data scientists and application developers can use the Flow GUI, R, Python, Java, Scala, JavaScript and JSON to build smarter applications, and how to take them to production. He will present customer use cases from verticals including insurance, fraud, churn, fintech, and marketing.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Social & Search, the next step into the integration of the social profile into the search google box.
Here a brief introduction into Google+ a social network still in private beta.
Automatic and Interpretable Machine Learning in R with H2O and LIMEJo-fai Chow
This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
Automatic and Interpretable Machine Learning in R with H2O and LIMESri Ambati
This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
Doing Things That Don't Scale - Counter intuitive marketing for startups...Almog Koren
Counter intuitive marketing for startups...
After closing down Scoreoid now having some free time, I enrolled startup class. Part of the reading was Paul’s Graham’s do things that don’t scale.
I realized what I did for Scoreoid in the being was exactly this, and it really works….
Social & Search, the next step into the integration of the social profile into the search google box.
Here a brief introduction into Google+ a social network still in private beta.
A Multiplatform, Multi-Tenant Challenge - Droidcon Lisbon 2023Pedro Vicente
What if you had to build a multiplatform (Android & iOS) and multi-tenant app with the objective of sharing the biggest amout of code possible while having all apps being UI/UX independent?
We want to take you through the discovery trip we made while building this. From architecture to ins and outs of KMM via Gradle magic that enabled us to have a Android, iOS and Desktop app.
Also sharing our rational over each of the options we took: Why not React Native? Or Xamarin? Should we use Compose Multiplatform?
The story of how Wingify was bootstrapped from ZERO to millions of dollars in annual revenue without raising of external funding. This is also a story of the genesis of Visual Website Optimizer as Wingify's flagship product and the world's easiest A/B testing tool.
Why Just Making Great Games Is Not Enough | Anders LykkeJessica Tams
Delivered at Casual Connect Tel Aviv 2016 | One of the exciting things about the mobile games market is that it changes so fast. What worked last year may not work this year. We will take you on a journey through the market developments and emerging trends and discuss why just making a killer game is no longer enough. To conclude, we'll look at what the most successful developers are doing differently, and what others can do to make the cut.
Product Managers: Treat Your Strategy as a ProductProductPlan
Having a strategy as a guide for your teams and products is essential; we all know that. Moreover, product managers agonize over how to make these statements brief and powerful. As such, strategy has become a mad lib, fill in the blank word game that often feels abstract and disconnected from your immediate product goals.
In our this webinar, our expert panel explains the undervalued necessity of treating your strategy the same way you do your products. In other words HOW to set cause and effect metrics on your strategy and WHY this is vital.
A closer look at the trends that will shape Digital Marketing in 2017 – and how you can leverage them to outperform your peers. Read more in this blogpost series: https://webrepublic.com/en/blog/2016/11/18/digital-marketing-trends-2017-en/.
Transform your Business with AI, Deep Learning and Machine LearningSri Ambati
Video: https://www.youtube.com/watch?v=R3IXd1iwqjc
Meetup: http://www.meetup.com/SF-Bay-ACM/events/231709894/
In this talk, Arno Candel presents a brief history of AI and how Deep Learning and Machine Learning techniques are transforming our everyday lives. Arno will introduce H2O, a scalable open-source machine learning platform, and show live demos on how to train sophisticated machine learning models on large distributed datasets. He will show how data scientists and application developers can use the Flow GUI, R, Python, Java, Scala, JavaScript and JSON to build smarter applications, and how to take them to production. He will present customer use cases from verticals including insurance, fraud, churn, fintech, and marketing.
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
Social & Search, the next step into the integration of the social profile into the search google box.
Here a brief introduction into Google+ a social network still in private beta.
Automatic and Interpretable Machine Learning in R with H2O and LIMEJo-fai Chow
This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
Automatic and Interpretable Machine Learning in R with H2O and LIMESri Ambati
This is a hands-on tutorial for R beginners. I will demonstrate the use of two R packages, h2o & LIME, for automatic and interpretable machine learning. Participants will be able to follow and build regression and classification models quickly with H2O’s AutoML. They will then be able to explain the model outcomes with a framework called Local Interpretable Model-Agnostic Explanations (LIME).
Doing Things That Don't Scale - Counter intuitive marketing for startups...Almog Koren
Counter intuitive marketing for startups...
After closing down Scoreoid now having some free time, I enrolled startup class. Part of the reading was Paul’s Graham’s do things that don’t scale.
I realized what I did for Scoreoid in the being was exactly this, and it really works….
Social & Search, the next step into the integration of the social profile into the search google box.
Here a brief introduction into Google+ a social network still in private beta.
A Multiplatform, Multi-Tenant Challenge - Droidcon Lisbon 2023Pedro Vicente
What if you had to build a multiplatform (Android & iOS) and multi-tenant app with the objective of sharing the biggest amout of code possible while having all apps being UI/UX independent?
We want to take you through the discovery trip we made while building this. From architecture to ins and outs of KMM via Gradle magic that enabled us to have a Android, iOS and Desktop app.
Also sharing our rational over each of the options we took: Why not React Native? Or Xamarin? Should we use Compose Multiplatform?
The story of how Wingify was bootstrapped from ZERO to millions of dollars in annual revenue without raising of external funding. This is also a story of the genesis of Visual Website Optimizer as Wingify's flagship product and the world's easiest A/B testing tool.
Why Just Making Great Games Is Not Enough | Anders LykkeJessica Tams
Delivered at Casual Connect Tel Aviv 2016 | One of the exciting things about the mobile games market is that it changes so fast. What worked last year may not work this year. We will take you on a journey through the market developments and emerging trends and discuss why just making a killer game is no longer enough. To conclude, we'll look at what the most successful developers are doing differently, and what others can do to make the cut.
Product Managers: Treat Your Strategy as a ProductProductPlan
Having a strategy as a guide for your teams and products is essential; we all know that. Moreover, product managers agonize over how to make these statements brief and powerful. As such, strategy has become a mad lib, fill in the blank word game that often feels abstract and disconnected from your immediate product goals.
In our this webinar, our expert panel explains the undervalued necessity of treating your strategy the same way you do your products. In other words HOW to set cause and effect metrics on your strategy and WHY this is vital.
Presentato al sesto WebMeetup del Machine Learning / Data Science Meetup Roma: https://www.meetup.com/it-IT/Machine-Learning-Data-Science-Meetup/events/273089965/
Presentazione per il sesto WebMeetup del Machine Learning / Data Science Meetup Roma: https://www.meetup.com/it-IT/Machine-Learning-Data-Science-Meetup/events/273089965/
Paolo Galeone - Dissecting tf.function to discover auto graph strengths and s...MeetupDataScienceRoma
Original presentation available on GitHub: https://pgaleone.eu/tf-function-talk/
Meetup: https://www.meetup.com/it-IT/Machine-Learning-Data-Science-Meetup/events/264338606/
Multimodal AI Approach to Provide Assistive Services (Francesco Puja)MeetupDataScienceRoma
Presentazione dal Meetup del Machine Learning / Data Science Meetup di Roma - Giugno 2019:
https://www.meetup.com/it-IT/Machine-Learning-Data-Science-Meetup/events/262120815/
Presentazione dal Meetup del Machine Learning / Data Science Meetup di Roma - Giugno 2019:
https://www.meetup.com/it-IT/Machine-Learning-Data-Science-Meetup/events/262120815/
Zero, One, Many - Machine Learning in Produzione (Luca Palmieri)MeetupDataScienceRoma
Talk dal Meetup del Machine Learning / Data Science Meetup di Roma - Giugno 2019:
https://www.meetup.com/it-IT/Machine-Learning-Data-Science-Meetup/events/262120815/
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
3. About this talk
• Quick Overview
• Business problem
• Solution and result
• Details
• The “Moneyball” team
• Baseball data → ML problem
• H2O AutoML, LIME, Shiny
You will learn …
• How to frame a business problem
(e.g. Moneyball) for machine
learning.
• If we have time …
• How to use H2O AutoML with R
interface.
• How to use LIME to explain H2O
models.
3
4. About Me
4
• Before H2O
• Water Engineer / EngD Researcher /
Matlab Fan Boy
(wonder why @matlabulous?)
• Discovered R, Python, H2O …
never look back again
• Data Scientist at Virgin Media (UK),
Domino Data Lab (US)
• At H2O …
• Data Scientist / Evangelist /
• Sales Engineer / Solution Architect /
• Community Manager
… The harsh reality of startup life …
• H2O SWAG Photographer
#AroundTheWorldWithH2Oai
Love H2O? Get some stickers!
Barcelona
Berlin
Brussels Paris
Milan (2016)
Milan (2018)
8. Founded 2012, Series C in Nov, 2017
Products • Driverless AI – Automated Machine Learning
• H2O Open Source Machine Learning
• Sparkling Water
Mission Democratize AI. Do Good
Team ~100 employees
• Distributed Systems Engineers doing Machine Learning
• World-class visualization designers
Offices Mountain View, London, Prague
8
Company Overview
9. H2O Products
In-Memory, Distributed
Machine Learning Algorithms
with H2O Flow GUI
H2O AI Open Source Engine
Integration with Spark
Lightning Fast machine
learning on GPUs
Automatic feature
engineering, machine
learning and interpretability
Secure multi-tenant H2O clusters
11. Gartner names H2O as Leader with the most completeness of vision
• H2O.ai recognized as a technology
leader with most completeness of
vision
• H2O.ai was recognized for the
mindshare, partner network and
status as a quasi-industry standard
for machine learning and AI.
• H2O customers gave the highest
overall score among all the vendors
for sales relationship and account
management, customer support
(onboarding, troubleshooting, etc.)
and overall service and support.
12. Platforms with H2O integration
H2O + KNIME Talk
at KNIME Summit
Mar 2017
16. Our Solution
16
• Open data – Lahman Database.
• Proprietary data from Ari Kaplan – our real
Moneyball guy.
• Enriched Lahman data with Ari’s Data
• Framed data for machine learning.
• Used H2O AutoML to make predictions for
next three years.
25. Lahman Data Framed as a ML problem Player
Attributes
One of the Targets
Past
Performance
Sliding
Windows
+
Other
Stats
Training
Validation
Forecast
No data. Used 2017 value. Not perfect (a quick hack).
26. Approach One: Learning from Lahman only
Lahman: Age, Height, Weight …
Historical Performance Stats
Home Runs
Batting Average
…
Predictions
H2O AutoML:
Learn the Pattern
Sliding Windows (Stats from previous n years)
About 300 Lahman Features
26
27. Approach Two: Learning from Lahman & AriDB
Lahman: Age, Height, Weight …
Historical Performance Stats
Home Runs
Batting Average
…
Predictions
H2O AutoML:
Learn the Pattern
Sliding Windows (Stats from previous n years)
About 300 Lahman Features +
200 AriDB Features
AriDB: Fastball, curveball,
slider, velocity …
27
32. Supervised Learning
• Generalized Linear Models: Binomial,
Gaussian, Gamma, Poisson and Tweedie
• Naïve Bayes
Statistical
Analysis
Ensembles
• Distributed Random Forest: Classification
or regression models
• Gradient Boosting Machine: Produces an
ensemble of decision trees with increasing
refined approximations
Deep Neural
Networks
• Deep learning: Create multi-layer feed
forward neural networks starting with an
input layer followed by multiple layers of
nonlinear transformations
H2O-3 Algorithms Overview
Unsupervised Learning
• K-means: Partitions observations into k
clusters/groups of the same spatial size.
Automatically detect optimal k
Clustering
Dimensionality
Reduction
• Principal Component Analysis: Linearly transforms
correlated variables to independent components
• Generalized Low Rank Models: extend the idea of
PCA to handle arbitrary data consisting of numerical,
Boolean, categorical, and missing data
Anomaly
Detection
• Autoencoders: Find outliers using a
nonlinear dimensionality reduction using
deep learning
32
35. Supervised Learning
• Generalized Linear Models: Binomial,
Gaussian, Gamma, Poisson and Tweedie
• Naïve Bayes
Statistical
Analysis
Ensembles
• Distributed Random Forest: Classification
or regression models
• Gradient Boosting Machine: Produces an
ensemble of decision trees with increasing
refined approximations
Deep Neural
Networks
• Deep learning: Create multi-layer feed
forward neural networks starting with an
input layer followed by multiple layers of
nonlinear transformations
H2O-3 Algorithms in AutoML
Unsupervised Learning
• K-means: Partitions observations into k
clusters/groups of the same spatial size.
Automatically detect optimal k
Clustering
Dimensionality
Reduction
• Principal Component Analysis: Linearly transforms
correlated variables to independent components
• Generalized Low Rank Models: extend the idea of
PCA to handle arbitrary data consisting of numerical,
Boolean, categorical, and missing data
Anomaly
Detection
• Autoencoders: Find outliers using a
nonlinear dimensionality reduction using
deep learning
35
47. H2O Products
In-Memory, Distributed
Machine Learning Algorithms
with H2O Flow GUI
H2O AI Open Source Engine
Integration with Spark
Lightning Fast machine
learning on GPUs
Automatic feature
engineering, machine
learning and interpretability
Secure multi-tenant H2O clusters
Lightning Fast machine
learning on GPUs
48. “Confidential and property of H2O.ai. All rights reserved”
Supervised Learning
• Generalized Linear Models: Binomial,
Gaussian, Gamma, Poisson and
Tweedie
• Naïve Bayes
Statistical
Analysis
Ensembles
• Distributed Random Forest:
Classification or regression models
• Gradient Boosting Machine: Produces
an ensemble of decision trees with
increasing refined approximations
Deep Neural
Networks
• Deep learning: Create multi-layer feed
forward neural networks starting with
an input layer followed by multiple
layers of nonlinear transformations
Algorithms on H2O-3 (CPU)
Unsupervised Learning
• K-means: Partitions observations into
k clusters/groups of the same spatial
size. Automatically detect optimal k
Clustering
Dimensionality
Reduction
• Principal Component Analysis: Linearly
transforms correlated variables to independent
components
• Generalized Low Rank Models: extend the idea of
PCA to handle arbitrary data consisting of
numerical, Boolean, categorical, and missing
data
Anomaly
Detection
• Autoencoders: Find outliers using a
nonlinear dimensionality reduction
using deep learning
49. “Confidential and property of H2O.ai. All rights reserved”
Supervised Learning
• Generalized Linear Models: Binomial,
Gaussian, Gamma, Poisson and
Tweedie
• Naïve Bayes
Statistical
Analysis
Ensembles
• Distributed Random Forest:
Classification or regression models
• Gradient Boosting Machine: Produces
an ensemble of decision trees with
increasing refined approximations
Deep Neural
Networks
• Deep learning: Create multi-layer feed
forward neural networks starting with
an input layer followed by multiple
layers of nonlinear transformations
Algorithms on H2O4GPU (more to come)
Unsupervised Learning
• K-means: Partitions observations into
k clusters/groups of the same spatial
size. Automatically detect optimal k
Clustering
Dimensionality
Reduction
• Principal Component Analysis: Linearly
transforms correlated variables to independent
components
• Generalized Low Rank Models: extend the idea of
PCA to handle arbitrary data consisting of
numerical, Boolean, categorical, and missing
data
Anomaly
Detection
• Autoencoders: Find outliers using a
nonlinear dimensionality reduction
using deep learning
52. Coming Up
52
Date City Talk / Workshop
25 June Milan H2O + LIME Workshop
28 June Rome H2O Intro + Use Cases
Mid-July London Next London Meetup (T.B.C.)
Mid-Oct London H2O World London
13-14 Nov London
Big Data LDN
Keynote by Sri (CEO) + Meetup