Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai

Patrick Hall
Data Science Products
H2O.ai
@jpatrickhall
https://www.linkedin.com/in/jpatrickhall/
Practical Tips for Interpreting Machine Learning Models
What is Machine Learning Interpretability?
“The ability to explain or to present in understandable terms to a
human.”
-- Finale Doshi-Velez and Been Kim. “Towards a rigorous science of
interpretable machine learning.” arXiv preprint. 2017.
https://arxiv.org/pdf/1702.08608.pdf
• FAT*: https://www.fatml.org/resources/principles-for-accountable-algorithms
• XAI: https://www.darpa.mil/program/explainable-artificial-intelligence
Why Should You Care About Machine Learning Interpretability?
“The now-contemplated field of data science amounts to a superset of the fields of statistics and
machine learning, which adds some technology for “scaling up” to “big data.” This chosen superset
is motivated by commercial rather than intellectual developments. Choosing in this way is likely
to miss out on the really important intellectual event of the next 50 years.”
-- David Donoho. “50 years of Data Science.” Tukey Centennial Workshop, 2015. http://bit.ly/2GQOh1J
Social Motivation: Interpretability plays a critical role in the increased convenience, automation, and organization
in our day-to-day lives promised by AI.
Commercial Motivation: Interpretability is required for regulated industry to adopt machine learning.
• Check and balance against accidental or intentional discrimination.
• “Right to explanation.”
• Hacking and adversarial attacks.
• Improved revenue, e.g. Equifax NeuroDecision: https://www.youtube.com/watch?v=9Z_GW9WDS2c
How is Machine Learning Interpretability Practiced?
• Reason codes: https://github.com/slundberg/shap
• Interpretable models: https://web.stanford.edu/~hastie/ElemStatLearn/printings/ESLII_print12.pdf
• Surrogate models: https://github.com/jphall663/interpretable_machine_learning_with_python
• Model visualizations: https://github.com/jphall663/interpretable_machine_learning_with_python
… and many more!
Tip 1: Test Your ML Models with Sensitivity Analysis and Random Data Attacks
If you are using a machine learning model, you should probably be conducting sensitivity analysis.
http://www.vias.org/tmdatanaleng/
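As a rough illustration of this tip, the sketch below sweeps one input while holding the others fixed (sensitivity analysis) and then scores uniformly random rows, including out-of-range values, to look for invalid predictions (a random data attack). `toy_model`, the feature names, and the value ranges are hypothetical stand-ins, not taken from the talk; in practice you would pass in your own trained model's scoring function.

```python
import random

def toy_model(row):
    """Hypothetical stand-in for a trained model: returns a score in [0, 1]."""
    score = 0.3 + 0.00001 * row["balance"] - 0.1 * row["late_payments"]
    return min(1.0, max(0.0, score))

def sensitivity_sweep(model, row, feature, values):
    """Vary one feature over `values`, holding the others fixed."""
    results = []
    for v in values:
        probe = dict(row)  # copy so the original row is untouched
        probe[feature] = v
        results.append((v, model(probe)))
    return results

def random_data_attack(model, ranges, n_rows, seed=0):
    """Score uniformly random rows and collect predictions outside [0, 1].

    NaN predictions are also collected, because the range check fails for NaN.
    """
    rng = random.Random(seed)
    failures = []
    for _ in range(n_rows):
        row = {name: rng.uniform(lo, hi) for name, (lo, hi) in ranges.items()}
        pred = model(row)
        if not (0.0 <= pred <= 1.0):
            failures.append((row, pred))
    return failures

baseline = {"balance": 20000.0, "late_payments": 1.0}
sweep = sensitivity_sweep(toy_model, baseline, "late_payments", [0, 1, 2, 3, 4])

# Assumed ranges are deliberately wider than anything seen in training.
failures = random_data_attack(
    toy_model,
    {"balance": (-1e6, 1e6), "late_payments": (-10, 50)},
    n_rows=1000,
)
```

Reviewing `sweep` shows how the score responds to one input; a non-empty `failures` list would point to inputs that push the model into invalid territory.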
Tip 2: Test Your Explanatory Software
By human quality assessment or …
Simulated data
You can use simulated data with known characteristics to test explanations. For instance, models trained on totally
random data with no relationship between a number of input variables and a prediction target should not give strong
weight to any input variable nor generate compelling local explanations or reason codes. Conversely, you can use
simulated data with a known signal generating function to test that explanations accurately represent that known
function.
https://github.com/h2oai/mli-resources/tree/master/lime_shap_treeint_compare
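One way to sketch the known-signal test: build simulated data where only one input drives the target, run an explanation method, and check that it credits that input and nothing else. Permutation importance is used here only because it is easy to write with the standard library; it is a swapped-in technique, not the comparison performed in the linked repository, and `known_signal` is a hypothetical signal-generating function.

```python
import random

def known_signal(row):
    """Known signal-generating function: only feature 0 matters."""
    return 3.0 * row[0]

def permutation_importance(model, X, y, seed=0):
    """Increase in mean squared error when one column is shuffled."""
    rng = random.Random(seed)

    def mse(rows):
        return sum((model(r) - t) ** 2 for r, t in zip(rows, y)) / len(y)

    base = mse(X)
    importances = []
    for j in range(len(X[0])):
        column = [r[j] for r in X]
        rng.shuffle(column)
        # Rebuild each row with the shuffled value in position j.
        shuffled = [r[:j] + [v] + r[j + 1:] for r, v in zip(X, column)]
        importances.append(mse(shuffled) - base)
    return importances

rng = random.Random(42)
X = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(500)]
y = [known_signal(r) for r in X]

# The "model" is the known function itself, so any importance assigned to
# features 1 or 2 would indicate a defect in the explanation code.
importances = permutation_importance(known_signal, X, y)
```

Here feature 0 should receive a clearly positive importance and the two noise features should receive none.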
Explanation stability under data perturbation
Trustworthy explanations likely should not change drastically for minor changes in input data. You can set and test
thresholds for allowable explanation value changes automatically by perturbing input data. (Explanations or reason code
values can also be averaged across a number of models to create more stable explanations.)
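A perturbation test along these lines might be sketched as follows. The explanation method (resetting one feature at a time to a baseline value), `toy_model`, the noise size `epsilon`, and the `THRESHOLD` are all assumptions chosen for illustration; substitute your own explainer and a tolerance that suits the application.

```python
import math
import random

def toy_model(row):
    """Hypothetical smooth model used only for illustration."""
    return math.tanh(0.5 * row[0] - 0.25 * row[1] + 0.1 * row[0] * row[2])

def occlusion_explanation(model, row, baseline):
    """Contribution of feature j = prediction change when j is reset to baseline."""
    full = model(row)
    contributions = []
    for j in range(len(row)):
        probe = list(row)
        probe[j] = baseline[j]
        contributions.append(full - model(probe))
    return contributions

def max_explanation_shift(model, row, baseline, epsilon, n_trials, seed=0):
    """Largest absolute change in any contribution under small uniform noise."""
    rng = random.Random(seed)
    reference = occlusion_explanation(model, row, baseline)
    worst = 0.0
    for _ in range(n_trials):
        noisy = [v + rng.uniform(-epsilon, epsilon) for v in row]
        perturbed = occlusion_explanation(model, noisy, baseline)
        shift = max(abs(a - b) for a, b in zip(reference, perturbed))
        worst = max(worst, shift)
    return worst

row = [1.0, 2.0, -0.5]
baseline = [0.0, 0.0, 0.0]
THRESHOLD = 0.1  # assumed tolerance for allowable explanation change

worst_shift = max_explanation_shift(toy_model, row, baseline, epsilon=0.01, n_trials=200)
is_stable = worst_shift <= THRESHOLD
```

A check like `is_stable` can run automatically in a test suite, failing whenever minor input changes move any explanation value past the threshold.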
Explanation stability with increased prediction accuracy
If previously known, accurate explanations or reason codes from a simpler linear model are available, you can use them as
a reference for the accuracy of explanations from a related, but more complex and hopefully more accurate, model. You
can perform tests to see how accurate a model can become before the reason codes for its predictions veer away from
known standards.
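One simple way to quantify how far reason codes have drifted from a trusted reference is a rank correlation between the two sets of values for the same row. The sketch below uses Spearman's coefficient, which is an assumed choice of metric, and the reason-code values are hypothetical numbers made up for illustration.

```python
def ranks(values):
    """Rank positions (0 = smallest); assumes no ties for simplicity."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, i in enumerate(order):
        result[i] = rank
    return result

def spearman(a, b):
    """Spearman rank correlation for two equal-length lists without ties."""
    n = len(a)
    ra, rb = ranks(a), ranks(b)
    d2 = sum((x - y) ** 2 for x, y in zip(ra, rb))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))

# Hypothetical reason-code magnitudes for one row, same feature order.
reference_codes = [0.40, 0.25, 0.10, 0.05]  # trusted linear model
candidate_codes = [0.35, 0.30, 0.02, 0.08]  # more complex model

agreement = spearman(reference_codes, candidate_codes)
```

Tracking `agreement` as model complexity increases gives a rough signal of when the explanations start to depart from the reference.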
Tip 3: Consider Deployment
[Figure labels: Training Environment; R/Py/Etc.; Production Scoring Environment]
Tip 4: Beware of Uninterpretable Features
Example of an uninterpretable engineered feature: ClusterTE:ClusterID20:LIMIT_BAL:PAY_3:PAY_AMT2
Try:
• Sparse PCA
• Simple Interactions
• Weight of Evidence
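Of the alternatives listed, weight of evidence (WoE) is the easiest to show in a few lines: each category is replaced by the log ratio of its share of events to its share of non-events. Sign conventions for WoE vary between references; the one below is an assumption, as are the column names and the small hypothetical data set.

```python
import math
from collections import Counter

def weight_of_evidence(categories, targets):
    """WoE per category: ln(share of events / share of non-events).

    Assumes binary targets (1 = event) and that every category has at
    least one event and one non-event; real data usually needs smoothing.
    """
    events = Counter(c for c, t in zip(categories, targets) if t == 1)
    non_events = Counter(c for c, t in zip(categories, targets) if t == 0)
    total_events = sum(events.values())
    total_non_events = sum(non_events.values())
    return {
        c: math.log((events[c] / total_events) / (non_events[c] / total_non_events))
        for c in set(categories)
    }

# Hypothetical data: a payment-status bucket and a default flag.
pay_status = ["late", "late", "late", "on_time", "on_time", "on_time", "on_time", "late"]
defaulted = [1, 1, 0, 0, 0, 1, 0, 1]

woe = weight_of_evidence(pay_status, defaulted)
```

With this convention a positive value means the category is over-represented among events, which keeps the encoded feature directly readable.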
Tip 5: Use Global and Local Explanatory Techniques Together
Tip 6: Use Shapley Explanations for Tree-Based Models
Tip 7: The Most Direct Path to an Interpretable ML Model Today is …
• Monotonic XGBoost + Shapley Explanations
OR
• H2O Driverless AI ;)
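To make concrete what a Shapley explanation reports, the sketch below computes exact Shapley values by brute-force enumeration of feature subsets for a tiny hypothetical model. This is not the Tree SHAP algorithm used by the shap package for tree ensembles; it is a small reference computation, with absent features assumed to take a fixed baseline value, and it only scales to a handful of inputs.

```python
from itertools import combinations
from math import factorial

def toy_model(row):
    """Hypothetical model with an interaction between features 0 and 2."""
    return 2.0 * row[0] + row[1] + row[0] * row[2]

def shapley_values(model, row, baseline):
    """Exact Shapley values; absent features take their baseline value.

    Cost grows as 2^n, so this is only feasible for a few features.
    """
    n = len(row)

    def value(subset):
        probe = [row[j] if j in subset else baseline[j] for j in range(n)]
        return model(probe)

    phis = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        phi = 0.0
        for size in range(n):
            # Standard Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                members = set(subset)
                phi += weight * (value(members | {i}) - value(members))
        phis.append(phi)
    return phis

row = [1.0, 3.0, 2.0]
baseline = [0.0, 0.0, 0.0]
phis = shapley_values(toy_model, row, baseline)
```

The contributions sum to the difference between the prediction for `row` and the prediction for `baseline`, and the interaction term is split evenly between the two features involved.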
Tip 8: Combine Decision Tree Surrogate, Partial Dependence, and ICE to Understand High-Degree Interactions
Together, these techniques provide complementary 2-D visualizations of a trained model that help you understand the
high-degree interactions it has learned.
Decision Tree Surrogate Models
Partial Dependence and Individual Conditional Expectation
https://github.com/jphall663/interpretable_machine_learning_with_python
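The relationship between partial dependence and ICE can be sketched in a few lines: each ICE curve traces one row's prediction as a single feature sweeps over a grid, and partial dependence is the average of those curves. `toy_model` and the data are hypothetical and chosen so that the interaction cancels out in the average.

```python
def toy_model(row):
    """Hypothetical model with an interaction between features 0 and 1."""
    return row[0] * row[1]

def ice_curves(model, X, feature, grid):
    """One curve per row: predictions as `feature` sweeps over `grid`."""
    curves = []
    for row in X:
        curve = []
        for v in grid:
            probe = list(row)
            probe[feature] = v
            curve.append(model(probe))
        curves.append(curve)
    return curves

def partial_dependence(curves):
    """Partial dependence is the pointwise average of the ICE curves."""
    n = len(curves)
    return [sum(c[k] for c in curves) / n for k in range(len(curves[0]))]

# Half the rows have feature 1 = -1 and half have feature 1 = +1.
X = [[0.0, -1.0], [0.0, 1.0], [0.0, -1.0], [0.0, 1.0]]
grid = [-1.0, 0.0, 1.0]

ice = ice_curves(toy_model, X, feature=0, grid=grid)
pd_curve = partial_dependence(ice)
```

In this example the partial dependence curve is flat even though every ICE curve has a slope, which is the kind of hidden interaction that plotting the two together can expose.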
Tip 9: LIME …
• LIME can give an indication of its own trustworthiness using fit statistics.
• LIME can fail, particularly in the presence of extreme nonlinearity or high-degree interactions.
• LIME is difficult to deploy, but there are highly deployable variants, e.g. H2O’s K-LIME, Wells
Fargo LIME-SUP.
• Reason codes are offsets from a local intercept.
• Note that the intercept in LIME can account for the most important local phenomena.
• Generated LIME samples can contain large proportions of out-of-range data that can lead
to unrealistically high or low intercept values.
• Try LIME on discretized input features and on manually constructed interactions.
• Use cross-validation to construct standard deviations or even confidence intervals for reason
code values.
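The first two points above can be illustrated with a LIME-style local surrogate written from scratch: sample around a point, weight samples by proximity, fit a weighted line, and report a weighted R² as the fit statistic. This is a one-feature sketch under assumed kernel and sampling choices, not the lime package's implementation; `smooth` and `wiggly` are hypothetical models.

```python
import math
import random

def local_linear_surrogate(model, x0, width, n_samples=500, seed=0):
    """Fit a kernel-weighted line to `model` around x0 (one input feature).

    Returns (intercept, slope, weighted R^2).
    """
    rng = random.Random(seed)
    xs = [x0 + rng.gauss(0.0, width) for _ in range(n_samples)]
    ys = [model(x) for x in xs]
    # Gaussian proximity kernel: samples near x0 count more.
    ws = [math.exp(-((x - x0) ** 2) / (2.0 * width ** 2)) for x in xs]

    w_sum = sum(ws)
    x_bar = sum(w * x for w, x in zip(ws, xs)) / w_sum
    y_bar = sum(w * y for w, y in zip(ws, ys)) / w_sum
    s_xy = sum(w * (x - x_bar) * (y - y_bar) for w, x, y in zip(ws, xs, ys))
    s_xx = sum(w * (x - x_bar) ** 2 for w, x in zip(ws, xs))
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar

    ss_res = sum(w * (y - (intercept + slope * x)) ** 2 for w, x, y in zip(ws, xs, ys))
    ss_tot = sum(w * (y - y_bar) ** 2 for w, y in zip(ws, ys))
    return intercept, slope, 1.0 - ss_res / ss_tot

def smooth(x):
    return 2.0 * x + 1.0  # exactly linear

def wiggly(x):
    return math.sin(25.0 * x)  # strongly nonlinear at this sampling width

_, smooth_slope, smooth_r2 = local_linear_surrogate(smooth, x0=0.5, width=1.0)
_, _, wiggly_r2 = local_linear_surrogate(wiggly, x0=0.5, width=1.0)
```

A high R² for `smooth` and a low one for `wiggly` shows how the fit statistic can flag explanations that should not be trusted.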
Resources
• https://github.com/h2oai/mli-resources
• https://github.com/jphall663/interpretable_machine_learning_with_python
1 of 14

Recommended

Interpretable machine learning by
Interpretable machine learningInterpretable machine learning
Interpretable machine learningSri Ambati
5K views22 slides
Unified Approach to Interpret Machine Learning Model: SHAP + LIME by
Unified Approach to Interpret Machine Learning Model: SHAP + LIMEUnified Approach to Interpret Machine Learning Model: SHAP + LIME
Unified Approach to Interpret Machine Learning Model: SHAP + LIMEDatabricks
2.8K views18 slides
Achieving Algorithmic Transparency with Shapley Additive Explanations (H2O Lo... by
Achieving Algorithmic Transparency with Shapley Additive Explanations (H2O Lo...Achieving Algorithmic Transparency with Shapley Additive Explanations (H2O Lo...
Achieving Algorithmic Transparency with Shapley Additive Explanations (H2O Lo...Sri Ambati
1.6K views28 slides
Ideas on Machine Learning Interpretability by
Ideas on Machine Learning InterpretabilityIdeas on Machine Learning Interpretability
Ideas on Machine Learning InterpretabilitySri Ambati
815 views23 slides
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We... by
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...
Helping data scientists escape the seduction of the sandbox - Krish Swamy, We...Sri Ambati
346 views16 slides
Philip Rathle- Graph Boosted Artificial Intelligence by
Philip Rathle- Graph Boosted Artificial IntelligencePhilip Rathle- Graph Boosted Artificial Intelligence
Philip Rathle- Graph Boosted Artificial IntelligenceNeo4j
536 views30 slides

More Related Content

What's hot

Driverless AI - Arno Candel, H2O.ai by
Driverless AI - Arno Candel, H2O.aiDriverless AI - Arno Candel, H2O.ai
Driverless AI - Arno Candel, H2O.aiSri Ambati
526 views14 slides
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ... by
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...Dataconomy Media
739 views50 slides
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig... by
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig..."How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...Data Science Milan
1.3K views22 slides
Demystifying Data Science by
Demystifying Data ScienceDemystifying Data Science
Demystifying Data ScienceData Science Milan
4.3K views17 slides
Building a performing Machine Learning model from A to Z by
Building a performing Machine Learning model from A to ZBuilding a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to ZCharles Vestur
31.4K views127 slides
Explainable AI - making ML and DL models more interpretable by
Explainable AI - making ML and DL models more interpretableExplainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretableAditya Bhattacharya
205 views39 slides

What's hot(20)

Driverless AI - Arno Candel, H2O.ai by Sri Ambati
Driverless AI - Arno Candel, H2O.aiDriverless AI - Arno Candel, H2O.ai
Driverless AI - Arno Candel, H2O.ai
Sri Ambati526 views
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ... by Dataconomy Media
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
DN18 | Demystifying the Buzz in Machine Learning! (This Time for Real) | Dat ...
Dataconomy Media739 views
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig... by Data Science Milan
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig..."How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
"How Pirelli uses Domino and Plotly for Smart Manufacturing" by Alberto Arrig...
Data Science Milan1.3K views
Building a performing Machine Learning model from A to Z by Charles Vestur
Building a performing Machine Learning model from A to ZBuilding a performing Machine Learning model from A to Z
Building a performing Machine Learning model from A to Z
Charles Vestur31.4K views
Explainable AI - making ML and DL models more interpretable by Aditya Bhattacharya
Explainable AI - making ML and DL models more interpretableExplainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretable
DN18 | The Evolution and Future of Graph Technology: Intelligent Systems | Ax... by Dataconomy Media
DN18 | The Evolution and Future of Graph Technology: Intelligent Systems | Ax...DN18 | The Evolution and Future of Graph Technology: Intelligent Systems | Ax...
DN18 | The Evolution and Future of Graph Technology: Intelligent Systems | Ax...
Dataconomy Media139 views
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to... by Aditya Bhattacharya
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
Aditya Bhattacharya - Enterprise DL - Accelerating Deep Learning Solutions to...
End-to-End Machine Learning Project by Eng Teong Cheah
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning Project
Eng Teong Cheah2.5K views
Data Science as a Service: Intersection of Cloud Computing and Data Science by Pouria Amirian
Data Science as a Service: Intersection of Cloud Computing and Data ScienceData Science as a Service: Intersection of Cloud Computing and Data Science
Data Science as a Service: Intersection of Cloud Computing and Data Science
Pouria Amirian1.1K views
Top Machine Learning Tools and Frameworks for Beginners | Edureka by Edureka!
Top Machine Learning Tools and Frameworks for Beginners | EdurekaTop Machine Learning Tools and Frameworks for Beginners | Edureka
Top Machine Learning Tools and Frameworks for Beginners | Edureka
Edureka!447 views
Feature Importance Analysis with XGBoost in Tax audit by Michael BENESTY
Feature Importance Analysis with XGBoost in Tax auditFeature Importance Analysis with XGBoost in Tax audit
Feature Importance Analysis with XGBoost in Tax audit
Michael BENESTY28.4K views
An Interactive Visual Analytics Dashboard for the Employment Situation Report by Benjamin Bengfort
An Interactive Visual Analytics Dashboard for the Employment Situation ReportAn Interactive Visual Analytics Dashboard for the Employment Situation Report
An Interactive Visual Analytics Dashboard for the Employment Situation Report
Benjamin Bengfort1.4K views
Predicting the Future with Azure Machine Learning by Paul Prae
Predicting the Future with Azure Machine LearningPredicting the Future with Azure Machine Learning
Predicting the Future with Azure Machine Learning
Paul Prae2.4K views
Guide to end end machine learning projects by Skyl.ai
Guide to end end machine learning projectsGuide to end end machine learning projects
Guide to end end machine learning projects
Skyl.ai769 views
Brian James Hu resume 2016 7-5 by Brian Hu
Brian James Hu resume 2016 7-5Brian James Hu resume 2016 7-5
Brian James Hu resume 2016 7-5
Brian Hu96 views

Similar to Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai

Interpretable Machine Learning by
Interpretable Machine LearningInterpretable Machine Learning
Interpretable Machine LearningSri Ambati
1.4K views19 slides
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018 by
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018Sri Ambati
460 views18 slides
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym! by
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!Sri Ambati
1.2K views31 slides
ODSC APAC 2022 - Explainable AI by
ODSC APAC 2022 - Explainable AIODSC APAC 2022 - Explainable AI
ODSC APAC 2022 - Explainable AIAditya Bhattacharya
188 views37 slides
Human in the loop: Bayesian Rules Enabling Explainable AI by
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIPramit Choudhary
851 views39 slides
Self Study Business Approach to DS_01022022.docx by
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docxShanmugasundaram M
169 views11 slides

Similar to Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai(20)

Interpretable Machine Learning by Sri Ambati
Interpretable Machine LearningInterpretable Machine Learning
Interpretable Machine Learning
Sri Ambati1.4K views
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018 by Sri Ambati
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Machine Learning Interpretability - Mateusz Dymczyk - H2O AI World London 2018
Sri Ambati460 views
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym! by Sri Ambati
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!
Sri Ambati1.2K views
Human in the loop: Bayesian Rules Enabling Explainable AI by Pramit Choudhary
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AI
Pramit Choudhary851 views
Self Study Business Approach to DS_01022022.docx by Shanmugasundaram M
Self Study Business Approach to DS_01022022.docxSelf Study Business Approach to DS_01022022.docx
Self Study Business Approach to DS_01022022.docx
Shanmugasundaram M169 views
The importance of model fairness and interpretability in AI systems by Francesca Lazzeri, PhD
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
Learning to Learn Model Behavior ( Capital One: data intelligence conference ) by Pramit Choudhary
Learning to Learn Model Behavior ( Capital One: data intelligence conference )Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Learning to Learn Model Behavior ( Capital One: data intelligence conference )
Pramit Choudhary315 views
Reasoning over big data by OSTHUS
Reasoning over big dataReasoning over big data
Reasoning over big data
OSTHUS810 views
Building Data Ecosystems for Accelerated Discovery by adamkraut
Building Data Ecosystems for Accelerated DiscoveryBuilding Data Ecosystems for Accelerated Discovery
Building Data Ecosystems for Accelerated Discovery
adamkraut33 views
Designing a Generative AI QnA solution with Proprietary Enterprise Business K... by IRJET Journal
Designing a Generative AI QnA solution with Proprietary Enterprise Business K...Designing a Generative AI QnA solution with Proprietary Enterprise Business K...
Designing a Generative AI QnA solution with Proprietary Enterprise Business K...
IRJET Journal28 views
Train, explain, acclaim. Build a good model in three steps by Przemek Biecek
Train, explain, acclaim.  Build a good model in three stepsTrain, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim. Build a good model in three steps
Przemek Biecek300 views
Driverless AI Hands-on Focused on Machine Learning Interpretability - H2O.ai by Sri Ambati
Driverless AI Hands-on Focused on Machine Learning Interpretability - H2O.aiDriverless AI Hands-on Focused on Machine Learning Interpretability - H2O.ai
Driverless AI Hands-on Focused on Machine Learning Interpretability - H2O.ai
Sri Ambati817 views
FAIR data_ Superior data visibility and reuse without warehousing.pdf by Alan Morrison
FAIR data_ Superior data visibility and reuse without warehousing.pdfFAIR data_ Superior data visibility and reuse without warehousing.pdf
FAIR data_ Superior data visibility and reuse without warehousing.pdf
Alan Morrison27 views

More from Sri Ambati

AI and the Future of Software Development: A Sneak Peek by
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek Sri Ambati
20 views12 slides
LLMOps: Match report from the top of the 5th by
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thSri Ambati
6 views36 slides
Building, Evaluating, and Optimizing your RAG App for Production by
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionSri Ambati
21 views30 slides
Building LLM Solutions using Open Source and Closed Source Solutions in Coher... by
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Sri Ambati
7 views17 slides
Open-Source AI: Community is the Way by
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the WaySri Ambati
6 views12 slides
Building Custom GenAI Apps at H2O by
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OSri Ambati
17 views9 slides

More from Sri Ambati(20)

AI and the Future of Software Development: A Sneak Peek by Sri Ambati
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek
Sri Ambati20 views
LLMOps: Match report from the top of the 5th by Sri Ambati
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5th
Sri Ambati6 views
Building, Evaluating, and Optimizing your RAG App for Production by Sri Ambati
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
Sri Ambati21 views
Building LLM Solutions using Open Source and Closed Source Solutions in Coher... by Sri Ambati
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Building LLM Solutions using Open Source and Closed Source Solutions in Coher...
Sri Ambati7 views
Open-Source AI: Community is the Way by Sri Ambati
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the Way
Sri Ambati6 views
Building Custom GenAI Apps at H2O by Sri Ambati
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2O
Sri Ambati17 views
Applied Gen AI for the Finance Vertical by Sri Ambati
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical
Sri Ambati11 views
Cutting Edge Tricks from LLM Papers by Sri Ambati
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM Papers
Sri Ambati9 views
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren... by Sri Ambati
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Sri Ambati12 views
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and... by Sri Ambati
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Sri Ambati15 views
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C... by Sri Ambati
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
Sri Ambati8 views
LLM Interpretability by Sri Ambati
LLM Interpretability LLM Interpretability
LLM Interpretability
Sri Ambati7 views
Never Reply to an Email Again by Sri Ambati
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email Again
Sri Ambati9 views
Introducción al Aprendizaje Automatico con H2O-3 (1) by Sri Ambati
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)
Sri Ambati2.7K views
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use... by Sri Ambati
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
Sri Ambati1.7K views
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo... by Sri Ambati
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
Sri Ambati1.1K views
AI Foundations Course Module 1 - An AI Transformation Journey by Sri Ambati
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
Sri Ambati842 views
ML Model Deployment and Scoring on the Edge with Automatic ML & DF by Sri Ambati
ML Model Deployment and Scoring on the Edge with Automatic ML & DFML Model Deployment and Scoring on the Edge with Automatic ML & DF
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
Sri Ambati683 views
Scaling & Managing Production Deployments with H2O ModelOps by Sri Ambati
Scaling & Managing Production Deployments with H2O ModelOpsScaling & Managing Production Deployments with H2O ModelOps
Scaling & Managing Production Deployments with H2O ModelOps
Sri Ambati556 views
Automatic Model Documentation with H2O by Sri Ambati
Automatic Model Documentation with H2OAutomatic Model Documentation with H2O
Automatic Model Documentation with H2O
Sri Ambati1.5K views

Recently uploaded

Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...ShapeBlue
86 views25 slides
Data Integrity for Banking and Financial Services by
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesPrecisely
76 views26 slides
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ... by
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...ShapeBlue
114 views12 slides
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...ShapeBlue
113 views18 slides
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueShapeBlue
63 views15 slides
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ... by
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...ShapeBlue
121 views15 slides

Recently uploaded(20)

Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit... by ShapeBlue
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
Transitioning from VMware vCloud to Apache CloudStack: A Path to Profitabilit...
ShapeBlue86 views
Data Integrity for Banking and Financial Services by Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely76 views
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ... by ShapeBlue
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
Backup and Disaster Recovery with CloudStack and StorPool - Workshop - Venko ...
ShapeBlue114 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue113 views
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
ShapeBlue63 views
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ... by ShapeBlue
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
Backroll, News and Demo - Pierre Charton, Matthias Dhellin, Ousmane Diarra - ...
ShapeBlue121 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue128 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ... by ShapeBlue
Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...Import Export Virtual Machine for KVM Hypervisor - Ayush Pandey - University ...

Practical Tips for Interpreting Machine Learning Models - Patrick Hall, H2O.ai

  • 1. Patrick Hall Data Science Products H2O.ai @jpatrickhall https://www.linkedin.com/in/jpatrickhall/ Practical Tips for Interpreting Machine Learning Models
  • 2. What is Machine Learning Interpretability? “The ability to explain or to present in understandable terms to a human.” -- Finale Doshi-Velez and Been Kim. “Towards a rigorous science of interpretable machine learning.” arXiv preprint. 2017. https://arxiv.org/pdf/1702.08608.pdf • FAT*: https://www.fatml.org/resources/principles-for-accountable-algorithms • XAI: https://www.darpa.mil/program/explainable-artificial-intelligence
  • 3. Why Should You Care About Machine Learning Interpretability? “The now-contemplated field of data science amounts to a superset of the fields of statistics and machine learning, which adds some technology for “scaling up” to “big data.” This chosen superset is motivated by commercial rather than intellectual developments. Choosing in this way is likely to miss out on the really important intellectual event of the next 50 years.” -- David Donoho. “50 years of Data Science.” Tukey Centennial Workshop, 2015. http://bit.ly/2GQOh1J Social Motivation: Interpretability plays a critical role in the increased convenience, automation, and organization in our day-to-day lives promised by AI. Commercial Motivation: Interpretability is required for regulated industry to adopt machine learning. • Check and balance against accidental or intentional discrimination. • “Right to explanation.” • Hacking and adversarial attacks. • Improved revenue, e.g. Equifax NeuroDecision: https://www.youtube.com/watch?v=9Z_GW9WDS2c
  • 4. How is Machine Learning Interpretability Practiced? Reason Codes: https://github.com/slundberg/shap Interpretable Models: https://web.stanford.edu/~hastie/ElemStatLearn/printings/ESLII_print12.pdf Surrogate Models: https://github.com/jphall663/interpretable_machine_learning_with_python Model Visualizations: https://github.com/jphall663/interpretable_machine_learning_with_python … and many more!
  • 5. Tip 1: Test Your ML Models with Sensitivity Analysis and Random Data Attacks If you are using a machine learning model, you should probably be conducting sensitivity analysis. http://www.vias.org/tmdatanaleng/
  • 6. Tip 2: Test Your Explanatory Software. By human quality assessment or … Simulated data: You can use simulated data with known characteristics to test explanations. For instance, models trained on totally random data, with no relationship between the input variables and the prediction target, should not give strong weight to any input variable nor generate compelling local explanations or reason codes. Conversely, you can use simulated data with a known signal-generating function to test that explanations accurately represent that known function. https://github.com/h2oai/mli-resources/tree/master/lime_shap_treeint_compare Explanation stability under data perturbation: Trustworthy explanations likely should not change drastically for minor changes in input data. You can set and test thresholds for allowable explanation value changes automatically by perturbing input data. (Explanations or reason code values can also be averaged across a number of models to create more stable explanations.) Explanation stability with increased prediction accuracy: If previously known, accurate explanations or reason codes from a simpler linear model are available, you can use them as a reference for the accuracy of explanations from a related, but more complex and hopefully more accurate, model. You can perform tests to see how accurate a model can become before the reason codes for its predictions veer away from known standards.
  • 7. Tip 3: Consider Deployment. Diagram labels: Production Scoring Environment; R/Py/Etc. Training Environment
  • 8. Tip 4: Beware of Uninterpretable Features ClusterTE:ClusterID20:LIMIT_BAL:PAY_3:PAY_AMT2 Try: • Sparse PCA • Simple Interactions • Weight of Evidence
  • 9. Tip 5: Use Global and Local Explanatory Techniques Together
  • 10. Tip 6: Use Shapley Explanations for Tree-Based Models
  • 11. Tip 7: Most Direct Path to Interpretable ML Model Today is … • Monotonic XGBoost + Shapley Explanations OR • H2O Driverless AI ;)
  • 12. Tip 8: Combine Decision Tree Surrogate, Partial Dependence, and ICE to Understand High-Degree Interactions, with complementary 2-D visualizations of trained models that enable understanding of learned high-degree interactions. Decision Tree Surrogate Models; Partial Dependence and Individual Conditional Expectation. https://github.com/jphall663/interpretable_machine_learning_with_python
  • 13. Tip 9: LIME … • LIME can give an indication of its own trustworthiness using fit statistics. • LIME can fail, particularly in the presence of extreme nonlinearity or high-degree interactions. • LIME is difficult to deploy, but there are highly deployable variants, e.g. H2O’s K-LIME, Wells Fargo LIME-SUP. • Reason codes are offsets from a local intercept. • Note that the intercept in LIME can account for the most important local phenomena. • Generated LIME samples can contain large proportions of out-of-range data that can lead to unrealistically high or low intercept values. • Try LIME on discretized input features and on manually constructed interactions. • Use cross-validation to construct standard deviations or even confidence intervals for reason code values.
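
Tip 9 notes that LIME can report its own trustworthiness through fit statistics, that reason codes are offsets from a local intercept, and that LIME can fail under extreme nonlinearity. The sketch below illustrates those three points with a deliberately simplified, single-feature, LIME-style local linear surrogate written with the Python standard library only. It is not the lime package or K-LIME; the function names, the Gaussian kernel, the kernel width, and both toy models are assumptions made for illustration.

```python
import math
import random


def local_linear_surrogate(f, x0, width=1.0, n_samples=500, seed=0):
    """Fit a kernel-weighted line to f around x0.

    Returns (intercept, slope, weighted R^2). Simplified LIME-style
    sketch for one numeric feature; all settings are assumptions.
    """
    rng = random.Random(seed)
    xs = [rng.gauss(x0, width) for _ in range(n_samples)]
    ys = [f(x) for x in xs]
    # Samples closer to x0 receive more weight (Gaussian kernel).
    ws = [math.exp(-((x - x0) ** 2) / (2.0 * width ** 2)) for x in xs]
    w_sum = sum(ws)
    mx = sum(w * x for w, x in zip(ws, xs)) / w_sum
    my = sum(w * y for w, y in zip(ws, ys)) / w_sum
    # Closed-form weighted least squares for a single feature.
    sxx = sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    # Weighted R^2: the surrogate's own fit statistic.
    ss_res = sum(w * (y - (intercept + slope * x)) ** 2
                 for w, x, y in zip(ws, xs, ys))
    ss_tot = sum(w * (y - my) ** 2 for w, y in zip(ws, ys))
    return intercept, slope, 1.0 - ss_res / ss_tot


def smooth_model(x):
    """Hypothetical model that is linear in its input."""
    return 3.0 * x + 1.0


def wiggly_model(x):
    """Hypothetical model with extreme local nonlinearity."""
    return math.sin(20.0 * x)


x0 = 0.5
smooth_fit = local_linear_surrogate(smooth_model, x0)
wiggly_fit = local_linear_surrogate(wiggly_model, x0)

for name, (intercept, slope, r2) in [("smooth", smooth_fit), ("wiggly", wiggly_fit)]:
    reason_code = slope * x0  # offset from the local intercept
    print(f"{name}: intercept={intercept:.3f} reason_code={reason_code:.3f} R^2={r2:.3f}")
```

For the linear model the surrogate fits almost perfectly and the intercept plus the reason code recovers the prediction at x0. For the oscillating model the weighted R² should come out low, which is the signal that the local explanation should not be trusted.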
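
The "explanation stability under data perturbation" test from Tip 2 can be sketched in a few lines. The example below is a minimal illustration, assuming a hypothetical three-input model, exact Shapley values computed against an all-zeros baseline as the explanation method, uniform noise of ±0.01, and a tolerance of 0.1. None of these choices come from the slides. Exact Shapley enumeration is only feasible for a handful of features; it is used here to keep the sketch dependency-free.

```python
import itertools
import math
import random


def model(x):
    """Hypothetical stand-in for a trained model with three inputs."""
    return 2.0 * x[0] + x[1] * x[2] + math.sin(x[0])


def shapley_values(f, x, baseline):
    """Exact Shapley values of f at x relative to a baseline row."""
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for k in range(n):  # subset sizes 0 .. n-1
            weight = (math.factorial(k) * math.factorial(n - k - 1)
                      / math.factorial(n))
            for subset in itertools.combinations(others, k):
                with_i = [x[j] if (j in subset or j == i) else baseline[j]
                          for j in range(n)]
                without_i = [x[j] if j in subset else baseline[j]
                             for j in range(n)]
                phi[i] += weight * (f(with_i) - f(without_i))
    return phi


def max_explanation_shift(f, x, baseline, noise=0.01, trials=50, seed=0):
    """Largest absolute change in any Shapley value under small input noise."""
    rng = random.Random(seed)
    reference = shapley_values(f, x, baseline)
    worst = 0.0
    for _ in range(trials):
        perturbed = [v + rng.uniform(-noise, noise) for v in x]
        phi = shapley_values(f, perturbed, baseline)
        worst = max(worst, max(abs(a - b) for a, b in zip(phi, reference)))
    return reference, worst


row = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
THRESHOLD = 0.1  # hypothetical tolerance for allowable explanation change

reference, worst_shift = max_explanation_shift(model, row, baseline)
stable = worst_shift <= THRESHOLD
print("reference explanation:", [round(v, 4) for v in reference])
print("worst shift:", round(worst_shift, 4), "stable:", stable)
```

The same harness could be pointed at a model trained on purely random data, where no input should receive a large contribution, or run as an automated check that fails whenever the worst shift exceeds the chosen threshold.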