Abstract: Sophisticated machine learning models (like GBMs and neural networks) produce better predictions than simpler models (like linear or logistic regression), but they do not produce interpretable 'effects' that specify the relationship between the predictors and the outcome. This is because sophisticated models can learn non-linear, interactive, or even higher-order relationships between the predictors and the outcome without those relationships being explicitly specified. In many settings it is important to understand, as best as possible, how 'black box' models are producing predictions, because: 1. If users do not understand how a prediction is being made, they may not trust the model/prediction enough to act upon the model's suggestions. 2. Significant business value can be derived from understanding what drives an outcome of interest (e.g. purchase or churn) in order to make product changes that accentuate or minimize desired effects. 3. Understanding how predictors relate to an outcome can inform subsequent feature generation that can improve a model's predictive power. This talk will discuss two methods that have been proposed to better understand machine learning models: simulating changes in input variables (the R ICEbox package) and building a simpler model locally around specific predictions (the Python LIME package).
1. Opening the black box: Attempts to understand the results of machine learning models
Michael Tiernay, PhD
R&D Data Scientist, Edmunds.com
07/29/2017
9. Local vs. Global Interpretability
1. Local Interpretability - Focus on how a model works around a single observation or a cluster of similar observations
2. Global Interpretability - Focus on how a model works across all observations (e.g. coefficients from a linear regression)
10. Why Do We Want Local Interpretability?
Understand why a prediction is positive/negative
Trust individual predictions (i.e. the reasons for a prediction make sense to domain experts)
Provide guidance for intervening strategies (i.e. the cancer is predicted to be caused by X, which can be treated with Y)
These problems have been addressed by recent literature
11. Why Do We Want Global Interpretability?
Hypothesis Generation: Model can help generate new ideas
that can be tested experimentally
A global understanding of the ‘causes’ of an outcome can drive
significant business/product changes
This problem has not received much attention in the machine
learning literature
17. LIME (Local Interpretable Model-agnostic Explanations)
Mainly created for images and text
Model agnostic
Focus on one observation (x) at a time
Sample other observations (z), weighted by their distance to x
Compute f(z) (the predicted outcome)
Select K features with LASSO, then compute least squares
The least-squares coefficients are the 'local effects'
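The canonical implementation is the Python lime package; the minimal from-scratch sketch below just mirrors the steps listed above. The function name, Gaussian sampling scheme, kernel-width heuristic, and LASSO penalty are assumptions for illustration, not details from the talk.

```python
import numpy as np
from sklearn.linear_model import Lasso, LinearRegression

def lime_local_effects(f, x, X_background, K=5, n_samples=5000, seed=0):
    """Sketch of LIME-style local effects around one observation x.

    f            : black-box prediction function mapping (n, p) -> (n,)
    x            : the observation to explain, shape (p,)
    X_background : data used to set perturbation scale, shape (N, p)
    """
    rng = np.random.default_rng(seed)
    p = x.shape[0]

    # Center/scale so distances are comparable across features
    mu = X_background.mean(axis=0)
    sigma = X_background.std(axis=0)
    sigma[sigma == 0] = 1.0

    # Sample other observations z around x
    Z = x + rng.normal(size=(n_samples, p)) * sigma

    # Weight each z by its scaled distance to x (exponential kernel;
    # the 0.75 * sqrt(p) width is an assumed heuristic)
    d = np.sqrt((((Z - x) / sigma) ** 2).sum(axis=1))
    w = np.exp(-(d ** 2) / (0.75 * np.sqrt(p)) ** 2)

    # Compute f(z), the black-box predictions
    y = f(Z)

    # Select K features with LASSO ...
    Zs = (Z - mu) / sigma
    lasso = Lasso(alpha=0.01).fit(Zs, y, sample_weight=w)
    top_k = np.argsort(-np.abs(lasso.coef_))[:K]

    # ... then fit weighted least squares on those K features;
    # its coefficients are the 'local effects'
    ls = LinearRegression().fit(Zs[:, top_k], y, sample_weight=w)
    return dict(zip(top_k, ls.coef_))
```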
21. Lime’s Conclusions
1. Women survive at a higher rate than men
2. Being in third class has a substantially more negative effect
than being in second class for both men and women
3. The effects of being in third class and second class are the same
for men and women
4. Age has small positive effects on men and small negative
effects on women
23. Simulate Changes By Gender
[Figure: histograms of the change in predicted survival probability when each passenger's gender is flipped; panels: 'Change Female to Male' and 'Change Male to Female'; x-axis: Change In Survival Probability (−0.4 to 0.4); y-axis: count (0–100)]
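A minimal sketch of the simulation behind this figure, assuming a fitted model wrapped as a probability-prediction function and a pandas DataFrame; `model_predict`, `titanic`, and the column names/values are hypothetical, not from the talk. The same helper handles the class changes on the next two slides.

```python
import pandas as pd

def flip_binary_effect(predict_fn, X, col, from_val, to_val):
    """For every row where `col` equals `from_val`, set it to `to_val`
    and return the change in predicted probability."""
    mask = X[col] == from_val
    X_flipped = X.loc[mask].copy()
    X_flipped[col] = to_val
    return predict_fn(X_flipped) - predict_fn(X.loc[mask])

# Hypothetical usage for the two panels above:
# female_to_male = flip_binary_effect(model_predict, titanic, "sex", "female", "male")
# male_to_female = flip_binary_effect(model_predict, titanic, "sex", "male", "female")
# The class changes on slides 24-25 are the same call with col="pclass".
```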
24. Simulate Changes By Class
[Figure: histograms of the change in predicted survival probability for every pairwise class change; panels: 'Class 1 to 2', 'Class 1 to 3', 'Class 2 to 1', 'Class 2 to 3', 'Class 3 to 1', 'Class 3 to 2'; x-axis: Change In Survival Probability (−0.4 to 0.4); y-axis: count (0–200)]
25. By Class (and Gender)
[Figure: the same six class-change histograms as the previous slide, now colored by sex (Female/Male); x-axis: Change In Survival Probability (−0.4 to 0.4); y-axis: count (0–200)]
26. Same Passengers as Lime
[Figure: predicted likelihood of survival (0.2–0.8) by passenger class (Class 1, Class 2, Class 3), plotted separately for female and male passengers]
27. Simulate A 1 Year Change in Age
[Figure: histograms of the change in predicted survival probability when each passenger's age is increased by one year; panels: 'Negative' and 'Positive'; x-axis: Change In Survival Probability (−0.1 to 0.1); y-axis: Number of Passengers (0–500)]
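Under the same assumptions as the gender sketch above, the one-year age simulation is a fixed-size shift applied to every passenger:

```python
def unit_change_effect(predict_fn, X, col, delta=1.0):
    """Individual effect of increasing `col` by `delta` for every row."""
    X_shifted = X.copy()
    X_shifted[col] = X_shifted[col] + delta
    return predict_fn(X_shifted) - predict_fn(X)

# age_effects = unit_change_effect(model_predict, titanic, "age", delta=1.0)
# Plotting age_effects against age gives the figure on slide 29;
# their mean is akin to an average partial effect.
```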
29. Plot Individual Effect By Age
[Figure: each passenger's individual effect of changing age by 1 (−0.05 to 0.15) plotted against passenger age (0–80)]
30. Survival by Age for All Passengers
[Figure: predicted probability of survival (0.2–0.8) by age (0–80), with one curve per group: Female−1st, Female−2nd, Female−3rd, Male−1st, Male−2nd, Male−3rd]
31. ICEbox - Gender
[Figure: ICE plot of partial log-odds (−1 to 3) by Sex (0, 1)]
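ICEbox itself is an R package; as a hedged Python sketch of the same idea (placeholder names throughout), an ICE curve sweeps one predictor over a grid for every observation while holding the other features at their observed values:

```python
import numpy as np

def ice_curves(predict_fn, X, col, grid):
    """One curve per row of X: predictions as `col` sweeps over `grid`."""
    curves = np.empty((len(X), len(grid)))
    for j, value in enumerate(grid):
        X_mod = X.copy()
        X_mod[col] = value
        curves[:, j] = predict_fn(X_mod)
    return curves

# grid = np.linspace(titanic["age"].min(), titanic["age"].max(), 50)
# curves = ice_curves(model_predict, titanic, "age", grid)
# Each row is one passenger's curve; curves.mean(axis=0) is the
# partial-dependence curve.
```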
33. Shortcomings of Lime
A good out-of-the-box solution that requires little thought, but:
Doesn't allow control over the local space
Features need to be centered/scaled for the distance calculation
Gives different effects when moving up vs. down a binary/ordinal feature
34. My Simulation Ideas
Simple simulations reveal a lot
Calculating effects for binary predictors is trivial
Calculating effects for categorical predictors is harder (with many categories)
Numeric Predictors (see the sketch after this list):
Examine the effect of a change of a specific size across the entire population (similar to average partial effects in econometrics)
Look at one observation and see how its prediction changes across the full range of that one predictor
Simulate every possible value of a predictor for every observation (or a random subset)
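Tying the three numeric-predictor ideas to the helpers sketched earlier (`unit_change_effect` and `ice_curves`, both hypothetical), with a toy model and data standing in for the fitted Titanic model:

```python
import numpy as np
import pandas as pd

# Toy stand-ins so the sketch runs on its own (hypothetical throughout)
rng = np.random.default_rng(0)
titanic = pd.DataFrame({"age": rng.uniform(0, 80, 500),
                        "pclass": rng.integers(1, 4, 500)})

def model_predict(X):
    return 1 / (1 + np.exp(0.03 * X["age"] + X["pclass"] - 3))

# 1. Effect of a fixed-size change across the entire population
#    (its mean resembles an average partial effect)
ape = unit_change_effect(model_predict, titanic, "age", delta=1.0).mean()

# 2. One observation, swept across the full range of the predictor
grid = np.linspace(titanic["age"].min(), titanic["age"].max(), 50)
one_curve = ice_curves(model_predict, titanic.iloc[[0]], "age", grid)

# 3. Every grid value for every observation (or a random subset)
subset = titanic.sample(200, random_state=0)
all_curves = ice_curves(model_predict, subset, "age", grid)
```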