This session was recorded in NYC on October 22nd, 2019 and can be viewed here: https://youtu.be/ngOBhhINWb8
Explainable Machine Learning with Shapley Values
Shapley values are a popular approach for explaining predictions made by complex machine learning models. In this talk I will discuss what problems Shapley values solve, give an intuitive presentation of what they mean, and show examples of how they can be used through the 'shap' Python package.
Bio: I am a senior researcher at Microsoft Research. Before joining Microsoft, I did my Ph.D. studies at the Paul G. Allen School of Computer Science & Engineering of the University of Washington working with Su-In Lee. My work focuses on explainable artificial intelligence and its application to problems in medicine and healthcare. This has led to the development of broadly applicable methods and tools for interpreting complex machine learning models that are now used in banking, logistics, sports, manufacturing, cloud services, economics, and many other areas.
3. [Figure: John, a typical bank customer, applies for a loan. A predictive model scores him at a 22% chance of repayment problems, so the bank makes no loan. John asks "Why was I denied?", the bank asks "What is our financial risk?", and the data scientist asks "How do I debug?" For the bank, accuracy = $.]
5. Complex models are inherently complex! But a single prediction involves only a small piece of that complexity.
[Figure: a single input value mapped through the model to an output value.]
6. How did we get here?
[Figure: from the base rate E[f(X)] = 16% to the prediction for John, f(x0) = 22%.]
7. Walking from the base rate to John's prediction, adding one feature at a time (using interventional expectations*):
- Base rate: E[f(X)] = 16% (φ0)
- Income not verified: E[f(X) | do(X1 = x1)] = 18.2% (φ1)
- No recent account openings: E[f(X) | do(X1,2 = x1,2)] = 21% (φ2)
- DTI = 30: E[f(X) | do(X1,2,3 = x1,2,3)] = 22.5% (φ3)
- Delinquent 10 months ago: E[f(X) | do(X1,2,3,4 = x1,2,3,4)] = 18.5% (φ4)
- 46 years of credit history: f(x0) = 22% (φ5)
The order matters!
*Janzing et al. 2019
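The order dependence can be reproduced with a tiny sketch. The model, features, and numbers below are hypothetical (not the bank model from the talk): when a model has an interaction, the credit for the prediction depends on which feature is introduced first.

```python
# Minimal sketch of order-dependent attributions (hypothetical toy model).
# Risk is high only when BOTH hypothetical risk flags are set, so the
# ordering decides which flag gets the credit.

def f(x1, x2):
    return x1 * x2

# Introduce x1 first, then x2:
phi1_a = f(1, 0) - f(0, 0)   # x1 gets 0
phi2_a = f(1, 1) - f(1, 0)   # x2 gets 1

# Introduce x2 first, then x1:
phi2_b = f(0, 1) - f(0, 0)   # x2 gets 0
phi1_b = f(1, 1) - f(0, 1)   # x1 gets 1

print((phi1_a, phi2_a))  # (0, 1)
print((phi1_b, phi2_b))  # (1, 0)
```

Both orderings sum to the same total, f(1, 1) - f(0, 0) = 1, but they split the credit differently; averaging over orderings is what removes this ambiguity.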
8. [Figure: the same walk from E[f(X)] to f(x0) with attributions φ0–φ5, but with the features introduced in a different order (no recent account openings, 46 years of credit history, …), which changes the attributions.]
The order matters! Lloyd Shapley (Nobel Prize in 2012) showed how to resolve this ordering problem.
9. [Figure: attributions φ0–φ5 between E[f(X)] and f(x0).]
Shapley properties
1. Additivity (local accuracy): the sum of the local feature attributions equals the difference between the base rate and the model output.
10. [Figure: attributions φ0–φ5 between E[f(X)] and f(x0).]
Shapley properties
2. Monotonicity (consistency): if you change the original model so that a feature has a larger impact in every possible ordering, then that feature's attribution should not decrease.
Violating consistency means you can't trust feature orderings based on your attributions…
…even within the same model!
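Consistency can be checked directly on a toy pair of models. This is a hypothetical sketch (not from the talk): if model B gives a feature a strictly larger marginal contribution than model A in every ordering, its exact Shapley attribution must not be smaller.

```python
import itertools

def shapley(f, n):
    """Exact Shapley values for binary features flipped from 0 to 1,
    averaged over all n! orderings (only feasible for tiny n)."""
    phi = [0.0] * n
    perms = list(itertools.permutations(range(n)))
    for order in perms:
        x = [0] * n
        prev = f(x)
        for i in order:
            x[i] = 1          # introduce feature i
            cur = f(x)
            phi[i] += cur - prev
            prev = cur
    return [p / len(perms) for p in phi]

# In model B, feature 0 has a larger marginal impact than in model A
# under every ordering, so consistency requires phi_b[0] >= phi_a[0].
f_a = lambda x: 1.0 * x[0] + x[1]
f_b = lambda x: 2.0 * x[0] + x[1]

phi_a = shapley(f_a, 2)   # [1.0, 1.0]
phi_b = shapley(f_b, 2)   # [2.0, 1.0]
```

Some popular heuristic attribution methods violate this property; exact Shapley values satisfy it by construction.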
11. [Figure: attributions φ0–φ5 between E[f(X)] and f(x0).]
Shapley values result from averaging the attributions over all N! possible orderings.
(Computing them exactly is NP-hard.)
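The averaging over orderings can be sketched in a few lines, with a hypothetical toy model and a fixed baseline standing in for E[f(X)]: enumerate all N! orderings, record each feature's marginal contribution, and average.

```python
import itertools

def shapley_values(f, baseline, instance):
    """Exact Shapley values: average each feature's marginal contribution
    over all N! orderings in which features switch from the baseline
    value to the instance value. Exponential cost -- tiny N only."""
    n = len(instance)
    phi = [0.0] * n
    perms = list(itertools.permutations(range(n)))
    for order in perms:
        x = list(baseline)
        prev = f(x)
        for i in order:
            x[i] = instance[i]
            cur = f(x)
            phi[i] += cur - prev
            prev = cur
    return [p / len(perms) for p in phi]

# Toy interaction model: averaging splits the interaction credit evenly.
f = lambda x: x[0] * x[1]
phi = shapley_values(f, [0, 0], [1, 1])
print(phi)  # [0.5, 0.5]

# Additivity (local accuracy): attributions sum to the difference between
# the model output for the instance and the baseline output.
assert abs(sum(phi) - (f([1, 1]) - f([0, 0]))) < 1e-12
```

Packages like shap avoid the factorial cost with model-specific algorithms (e.g., exact polynomial-time computation for trees) or sampling.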
13. ex = shap.TreeExplainer(model, …)
shap_values = ex.shap_values(X)
shap.force_plot(ex.expected_value, shap_values[john_ind,:], X.iloc[john_ind,:])
Why does 46 years of credit history increase
the risk of payment problems?
14. shap.dependence_plot("Months of credit history", shap_values, X)
The model is identifying
retirement-age individuals based
on their long credit histories!
Explain and
debug your models!
15. Explainable AI in practice
- Model development: debugging/exploration, monitoring
20. Explainable AI in practice
- Model development: debugging/exploration, encoding prior beliefs, monitoring
- Human/AI collaboration: customer retention, decision support, risk management, human risk oversight
- Regulatory compliance: consumer explanations, anti-discrimination
- Scientific discovery: pattern discovery, population subtyping, signal recovery
22. [Figure: global feature importance vs. a local explanation summary (panel A) for a mortality risk model; the x-axis is the log relative risk of mortality, and sex is coded F/M.]
28. Reveal rare high-magnitude mortality effects
[Figure: the same global feature importance vs. local explanation summary comparison.] Global feature importance conflates the prevalence of an effect with the magnitude of an effect.
30. [Figure: the local explanation summary, annotated to highlight rare high-magnitude effects.]
31. [Figure: the same comparison, annotated: lots of ways to die young… not many ways to live longer…]
Editor's Notes
We'll start with a simple motivational example. This is John, a typical bank customer. Like many consumers today, when he applies for a loan, information about him is sent through a predictive model.
This model is designed to calculate the risk that John will have repayment problems, which, unfortunately for John, is 55%. Because his risk is high, the bank declines his loan application.
A natural first question John has is "Why?" The bank also wants to know why, because these are key business decisions. But unfortunately for the data scientist who built this model, she used a complex model, and so these questions are hard to answer.
The reason the data scientist used a complex model is that complex models are often very accurate on large data sets, but that same complexity also makes them hard to interpret. In contrast, simple models are easy to interpret but are often less accurate. This leads to a trade-off between interpretability and accuracy.
This trade-off is particularly painful for the bank, since accuracy directly corresponds to profitability, while interpretability has important implications for customer satisfaction and even legality. This trade-off affects a wide range of applications, so many recent methods have been developed to address it.
These methods do not try to make an entire complex model interpretable, because there is inherently too much to succinctly explain. Instead, they focus on explaining a single prediction, because mapping a single input to an output involves only a small part of the complexity of the overall model.
To see this, let's return to John and ask why his loan application was denied. To explain the denial, it is important to start with the base rate of loan repayment problems, denoted here by the expected value of the model's output.
To explain John's risk, we need to explain how we got from the base rate to his risk of 55%.