Explainable Machine Learning

Gabriel Cypriano

But...
What if we could train a Random Forest that seems to perform much better?

Random Forest Feature Importances

Decision Paths
Prediction for: RM = 3.1, LSTAT = 4.5, NOX = 0.54, DIST = 2.6
http://blog.datadive.net/interpreting-random-forests
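A minimal sketch of the decision-path idea for the instance above, assuming a tiny hand-built regression tree. The tree structure, thresholds, and node values are hypothetical and chosen only for illustration; they are not taken from the slides or from any fitted model. Each step along the path credits the change in node mean to the feature that was split on, so the prediction decomposes into the root mean (bias) plus per-feature contributions.

```python
# Hypothetical regression tree: every threshold and node value is made up.
# "value" is the mean target of the training samples that reach the node.
TREE = {
    "value": 22.5, "feature": "RM", "threshold": 6.9,
    "left": {
        "value": 19.9, "feature": "LSTAT", "threshold": 14.4,
        "left": {
            "value": 23.3, "feature": "DIST", "threshold": 1.4,
            "left": {"value": 45.6},
            "right": {"value": 23.0},
        },
        "right": {"value": 15.0},
    },
    "right": {"value": 37.2},
}


def explain(tree, x):
    """Return (prediction, bias, contributions) for one instance."""
    bias = tree["value"]          # mean at the root, before any split
    contributions = {}
    node = tree
    while "feature" in node:      # leaves have no "feature" key
        name = node["feature"]
        child = node["left"] if x[name] <= node["threshold"] else node["right"]
        # Credit the change in node mean to the feature used for the split.
        contributions[name] = contributions.get(name, 0.0) + child["value"] - node["value"]
        node = child
    return node["value"], bias, contributions


instance = {"RM": 3.1, "LSTAT": 4.5, "NOX": 0.54, "DIST": 2.6}
prediction, bias, contributions = explain(TREE, instance)
print(prediction, bias, contributions)
```

Features that never appear on the path (NOX here) get no contribution. For a Random Forest, the same decomposition would be computed per tree and then averaged.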

treeinterpreter — Interpreting predictions with Decision Paths

treeinterpreter — Explaining difference between 2 datasets with Decision Paths
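One way to read this slide: because every prediction is bias plus the sum of its feature contributions, the gap between the mean predictions of two datasets equals the gap between their mean contributions (the bias cancels). The sketch below assumes per-instance contributions have already been computed; the numbers and the `mean_contributions` helper are hypothetical.

```python
def mean_contributions(rows):
    """Average each feature's contribution over a list of instances."""
    features = rows[0].keys()
    return {f: sum(r[f] for r in rows) / len(rows) for f in features}


# Hypothetical per-instance contributions for two datasets.
dataset_a = [{"RM": 1.0, "LSTAT": -2.0}, {"RM": 3.0, "LSTAT": 0.0}]
dataset_b = [{"RM": 0.5, "LSTAT": 1.0}, {"RM": 0.5, "LSTAT": 2.0}]

mean_a = mean_contributions(dataset_a)
mean_b = mean_contributions(dataset_b)

# Per-feature share of the difference in mean prediction between A and B.
diff = {f: mean_a[f] - mean_b[f] for f in mean_a}
print(diff, sum(diff.values()))
```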

treeinterpreter — What about feature interactions?

treeinterpreter — Classification on the Iris dataset

treeinterpreter / Pivotal — Explanation on steroids — give me some dataviz

treeinterpreter / Pivotal — Contribution vs Feature Value (single Decision Tree)

treeinterpreter / Pivotal — Contribution vs Feature Value (Random Forest)

treeinterpreter / Pivotal — Classification
Feature Contributions Violin Plot (for class “Infant”)

treeinterpreter / Pivotal — Classification
Contributions to each class vs Feature Value

What about Boosted Trees?
Instead of averaging contributions across trees, we just have to sum them.
Available with:
● ELI5 (e.g., for XGBoost, LightGBM)
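The averaging-versus-summing point can be sketched as follows. The `combine` helper and the per-tree numbers are hypothetical, and this is not how ELI5 is implemented internally. In a real boosted model each tree's output is typically already scaled by the learning rate, and for classification the sum lives in margin (log-odds) space.

```python
def combine(per_tree, mode):
    """Combine per-tree (bias, contributions) pairs.

    mode="average" mimics a Random Forest, mode="sum" mimics boosting.
    """
    scale = 1.0 / len(per_tree) if mode == "average" else 1.0
    bias = scale * sum(b for b, _ in per_tree)
    total = {}
    for _, contribs in per_tree:
        for feature, value in contribs.items():
            total[feature] = total.get(feature, 0.0) + scale * value
    return bias, total


# Hypothetical per-tree decompositions for a single instance.
trees = [
    (10.0, {"age": 2.0, "fare": -1.0}),
    (12.0, {"age": 4.0}),
    (8.0, {"fare": 3.0}),
]

forest_bias, forest_contribs = combine(trees, "average")
boost_bias, boost_contribs = combine(trees, "sum")
print(forest_bias, forest_contribs)
print(boost_bias, boost_contribs)
```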

ELI5 — XGBoost — Feature Importances (Titanic dataset)

ELI5 — XGBoost prediction — Titanic dataset

Model-agnostic explanations
e.g., for non-tree based models

Lime
● Local approximations
● Model agnostic
● Can select a set of representative instances with explanations
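A rough sketch of what "local approximation" means, written in plain Python rather than with the Lime library. It assumes a hypothetical two-feature black-box function, perturbs the instance, weights each perturbation by its proximity to the instance, and fits a weighted linear model whose coefficients serve as the explanation. A fixed symmetric grid stands in for random sampling so the result is reproducible; all names here are illustrative.

```python
import math


def black_box(x):
    # Hypothetical opaque model: nonlinear in x[0], linear in x[1].
    return x[0] ** 2 + 3.0 * x[1]


def solve(a, b):
    """Solve a small dense linear system by Gaussian elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def local_linear_explanation(predict, instance, offsets, kernel_width):
    """Fit a proximity-weighted linear surrogate around a 2-feature instance."""
    rows, targets, weights = [], [], []
    for d0 in offsets:
        for d1 in offsets:
            z = [instance[0] + d0, instance[1] + d1]   # perturbed sample
            rows.append([1.0, d0, d1])                 # intercept + offsets
            targets.append(predict(z))
            # Exponential kernel: nearby perturbations count more.
            weights.append(math.exp(-(d0 * d0 + d1 * d1) / kernel_width ** 2))
    k = 3
    xtwx = [[sum(w * r[i] * r[j] for w, r in zip(weights, rows))
             for j in range(k)] for i in range(k)]
    xtwy = [sum(w * r[i] * y for w, r, y in zip(weights, rows, targets))
            for i in range(k)]
    return solve(xtwx, xtwy)  # weighted least squares via normal equations


intercept, coef_x0, coef_x1 = local_linear_explanation(
    black_box, [2.0, 1.0], [-0.5, -0.25, 0.0, 0.25, 0.5], kernel_width=0.5
)
print(intercept, coef_x0, coef_x1)
```

Around the point (2.0, 1.0) the surrogate's slopes match the local behaviour of the black box, even though the model is not linear globally.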

Lime — creating superpixels to aid in image recognition

More use cases
● Learn the right features (dogs vs wolves)
● Understand whether the model overfits a particular feature
● Identify data leakage
● Dataset shift (training data different from test data)
● Pneumonia/asthma case
Amazon, Netflix

● Not only useful for when things aren't working
● Different costs for making mistakes

References
Interpreting Random Forests
Random forest interpretation with scikit-learn
Random forest interpretation – conditional feature contributions
Interpreting Decision Trees and Random Forests
XGBoost Decision Paths
Explaining XGBoost predictions on the Titanic dataset
“Why Should I Trust You?” Explaining the Predictions of Any Classifier

Tools
treeinterpreter
Lime
ELI5

Thank you!
gabrielcs.me
vagas.creditas.com.br
somostera.com
