The harder the question we are trying to solve, the more sophisticated the machine learning models tend to become, which makes them almost impossible to interpret. This can mean more features, more complex algorithms, or more complex patterns.
Explainable AI (XAI) has been a trending topic recently, aiming to explain the outcomes of these models, mostly from the point of view of the end user. In research, however, we tend to treat our machine learning models as black boxes, because we usually focus on performance and rarely need to explain the predictions to anyone else.
In this talk, I will cover why explainability of a model (specifically using SHAP values) is important for the research phase as well, and how it can help not just the end user but also us, the data scientists building the models. We will see how several different ways of looking at a model or its predictions can help us improve performance even before the production phase.
Melih is a data scientist at Riskified, which he joined almost 2.5 years ago. Today, he works mainly on the research and improvement of the ATO product.
Originally from Turkey and coming from an engineering background, he pivoted his way into the Data Science/Machine Learning world to follow his passion for data and AI.
He believes in constant learning and endless curiosity. When not doing DS/ML, you can find him doing any kind of sports or tasting new whisky.
2. Melih Bahar
➔ Data Scientist, Riskified
➔ Born in Turkey
➔ Loves whiskey!
melih.bahar@riskified.com
@melih_bhr
About Me
3. Riskified by the numbers
650+ Global team, nearly 50% in R&D
180+ Countries across the globe
$60B+ Online volume reviewed in 2020
50+ Publicly held companies among our clients
98% Client retention rate in 2020
As of August 2021
4. Account Takeover (ATO)
A quick glance...
An ATO is when a bad actor gains access to another party’s legitimate account.
9. Why
Data Scientist POV
“If you can’t explain it simply, you don’t understand it well enough.”
Albert Einstein
[Diagram: data (x1 … xn) → model → interpretability methods → humans; validation goes beyond performance to trust, informativeness, transferability, and fairness]
11. Explainability vs. Interpretability
Explainability
An explainable model is a function that is too complicated for a human to understand; an additional method is needed to understand how the model works.
Interpretability
A model is interpretable if it is capable of being understood by humans on its own.
12. Explainability
… is NOT causality!
Source: https://tuowang.rbind.io/project/causal-inference-notes/
14. Shapley Values
A Short Introduction
The average of the marginal contributions across all permutations.
Source: https://towardsdatascience.com/explain-your-model-with-the-shap-values-bc36aac4de3d
Order matters! e.g. the ordering (Ann, Beth, Cindy) vs. (Beth, Ann, Cindy)
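A toy sketch of the idea (the coalition payoffs below are made up purely for illustration): each player's Shapley value is their marginal contribution, averaged over every possible joining order.

```python
from itertools import permutations

# Toy coalition game with made-up payoffs (illustrative numbers only).
payoff = {
    frozenset(): 0,
    frozenset({"Ann"}): 10,
    frozenset({"Beth"}): 20,
    frozenset({"Cindy"}): 25,
    frozenset({"Ann", "Beth"}): 40,
    frozenset({"Ann", "Cindy"}): 45,
    frozenset({"Beth", "Cindy"}): 50,
    frozenset({"Ann", "Beth", "Cindy"}): 90,
}
players = ["Ann", "Beth", "Cindy"]

# Shapley value = marginal contribution averaged over all joining orders.
orderings = list(permutations(players))
shapley = {p: 0.0 for p in players}
for order in orderings:
    coalition = set()
    for player in order:
        before = payoff[frozenset(coalition)]
        coalition.add(player)
        shapley[player] += (payoff[frozenset(coalition)] - before) / len(orderings)

print(shapley)  # the three values always sum to the full-coalition payoff (90)
```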
15. SHAP
SHapley Additive exPlanations
• Proposed by Lundberg and Lee (2017), based on Shapley values
• A unified approach to explaining the output of any machine learning model: it is model agnostic (see the usage sketch below)
Properties
• Local Accuracy
• Consistency
• Missingness
Assumptions
• Independent Features
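A minimal usage sketch of the shap package, assuming a fitted model and a feature DataFrame X (the names `model` and `X` are placeholders):

```python
import shap  # https://github.com/shap/shap

# Assuming `model` is a fitted tree-based classifier (e.g. XGBoost / LightGBM)
# and `X` is the DataFrame of features we want to explain.
explainer = shap.TreeExplainer(model)      # fast, exact algorithm for tree models
shap_values = explainer.shap_values(X)     # one contribution per feature per row

# For arbitrary models, the model-agnostic KernelExplainer can be used instead,
# usually with a small background sample to keep the runtime reasonable:
# background = shap.sample(X, 100)
# explainer = shap.KernelExplainer(model.predict_proba, background)
```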
16. Global vs. Local
Global Interpretability
Provides explanations about the general behavior of the model over the entire population.
Local Interpretability
Provides explanations for a specific prediction of the model.
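A hedged sketch of how the two views look with the shap plotting helpers, reusing the `explainer`, `shap_values`, and `X` assumed above:

```python
import shap

# Global: distribution of SHAP values per feature over the whole dataset.
shap.summary_plot(shap_values, X)

# Local: explanation of a single prediction (row i). For binary classifiers,
# shap_values / expected_value may be per-class lists; use the positive class.
i = 0
shap.force_plot(explainer.expected_value, shap_values[i], X.iloc[i])
```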
18. Model Comparison
• Not all models provide out-of-the-box feature importances the way tree-based models (e.g. random forests) do.
- SHAP creates a common ground for comparison (see the sketch below).
• Adds “explanation” on top of “performance”
Tree-based Model (Boosting)
PRAUC: 0.829
Kernel-based Model (SVM)
PRAUC: 0.838
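One possible way to put the two models on the same scale is to compare mean absolute SHAP values per feature; `shap_values_gbm` and `shap_values_svm` below are assumed to have been computed for the same X (e.g. via TreeExplainer for the boosting model and KernelExplainer for the SVM).

```python
import numpy as np
import pandas as pd

def mean_abs_shap(shap_vals, feature_names):
    """Mean |SHAP| per feature: a model-agnostic 'importance' on a common scale."""
    return pd.Series(np.abs(shap_vals).mean(axis=0), index=feature_names)

comparison = pd.DataFrame({
    "boosting": mean_abs_shap(shap_values_gbm, X.columns),
    "svm": mean_abs_shap(shap_values_svm, X.columns),
}).sort_values("boosting", ascending=False)
print(comparison.head(10))
```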
19. Model Debugging
Error Analysis on Instances
Looking at individual instances makes it easier to detect when a selected feature has a different effect than the one it should have.
False Positive False Negative
20. Model Debugging
Error Analysis on Incorrect Predictions
We can take the subset of our model's errors and see which features contribute the most to these incorrect classifications.
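A minimal sketch of that error analysis, assuming `y_true` and `y_pred` are aligned with the `X` and `shap_values` from before:

```python
import numpy as np
import shap

errors = np.asarray(y_pred) != np.asarray(y_true)

# Summary plot restricted to the misclassified rows shows which features
# contribute the most to the incorrect predictions.
shap.summary_plot(shap_values[errors], X[errors])

# The same idea works for false positives or false negatives separately:
# fp = (np.asarray(y_pred) == 1) & (np.asarray(y_true) == 0)
# shap.summary_plot(shap_values[fp], X[fp])
```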
21. Model Debugging
Adding Domain Experts to the Loop
Feedback between the analysts and the model (in both directions) helps us get more accurate labels and find logic flaws in the model.
ATO ? Legit ?
22. Feature Selection
• Wrapper-based methods (Boruta, RFE, etc.) may result in suboptimal performance.
• The standard feature importance method of decision trees tends to overestimate the importance of continuous or high-cardinality categorical variables. SHAP reduces this bias (see the sketch below).
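A sketch of SHAP-based selection under these assumptions (`shap_values` and `X` as before; the cutoff k is arbitrary):

```python
import numpy as np

# Rank features by mean absolute SHAP value instead of impurity-based importance.
importance = np.abs(shap_values).mean(axis=0)
ranking = np.argsort(importance)[::-1]

top_k = 20  # arbitrary cutoff for illustration
selected_features = X.columns[ranking[:top_k]]
print(list(selected_features))
```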
23. Feature Selection
Effect of Specific Features
• Helps see how the values of a specific feature affect the predictions.
• Helps decide if a specific feature is helpful or not.
24. Feature Selection
Effect of Specific Features
An unusually high effect of a single feature (relative to the other features) may indicate that it is too strongly correlated with the label.
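The plot used on these slides is SHAP's dependence plot (as the notes below mention); a minimal sketch, where the feature name "ca_14" mirrors the example from the notes:

```python
import shap

# SHAP value of one feature vs. its raw value across the dataset,
# optionally colored by the feature it interacts with most.
shap.dependence_plot("ca_14", shap_values, X)
```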
25. Feature Prioritization
Helps us decide which features (or groups of features) are “more” important to develop in production right now and which ones can wait.
For Production
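One way to turn this prioritization into numbers is to sum the mean |SHAP| per candidate feature group; the group names and their members below are hypothetical.

```python
import numpy as np
import pandas as pd

# Hypothetical grouping of the new features into candidate dev tasks.
feature_groups = {
    "group2": ["new_feat_a", "new_feat_b"],
    "group4": ["new_feat_c", "new_feat_d", "new_feat_e"],
}

mean_abs = pd.Series(np.abs(shap_values).mean(axis=0), index=X.columns)

# Total importance per group: a rough way to decide what to productionize first.
group_importance = {g: mean_abs[cols].sum() for g, cols in feature_groups.items()}
print(sorted(group_importance.items(), key=lambda kv: kv[1], reverse=True))
```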
30. The Catch
• Computationally expensive and time-consuming for large numbers of features.
• Explanations are generated too late in the machine learning pipeline.
• Provide no guarantee that your model will behave as expected in the future for new data.
We do fraud prevention in ecommerce
It can be at all stages of a transaction - during the login, at the checkout (order), after the order (abuse)
Riskified tries to solve fraud from different sides, ATO (account takeover) is one of them and for this meetup we will mostly focus on that.
Two decisions: challenge or allow. ATO is one of them; we are trying to detect ATOs using internal and external data.
We built the best model, carefully engineered features, checked several algorithms, hyperparameter search and everything… model goes live, everything’s looking great…
But suddenly we get messages from a specific merchant that we are challenging good logins (ones they use for testing). It could have been that the model was right to challenge them, but it turned out these were test logins.
The harder the question we are trying to solve, the more sophisticated the machine learning models tend to become, which makes them almost impossible to interpret. This can mean more features, more complex algorithms, or more complex patterns.
Should we build an accurate model, or sacrifice accuracy and build an interpretable model? The solution is a model simple enough that you can explain it, but accurate enough to meet your needs.
Story of John - he gets declined for a loan and asks why...
Explainability enables tech developers to troubleshoot, debug and upgrade their models, as well as innovate new functionalities.
Without explanations, if the model makes lots of bad predictions then it remains a mystery as to why.
We’ll talk about 2 different terms but mostly they are used interchangeably. Same same but different…
Where do ensembles fit into this graph? They are less interpretable!
Keep it short!
Explainability methods make the correlations picked up by ML models transparent… As a result, explaining the model will not reveal causal effects.
All predictive models implicitly assume that everyone will keep behaving the same way in the future, and therefore correlation patterns will stay constant. To understand what happens if someone starts behaving differently, we need to build causal models, which requires making assumptions and using the tools of causal analysis.
I won’t get into the theory; I can answer theoretical questions afterwards or point you to the relevant references.
When applied to machine learning, we treat each feature as a player in the game, with all of them working together to produce the prediction.
A contribution of 0 means the feature/player doesn’t contribute.
In machine learning, a coalition is any subset of the features (only one feature, only two, none, etc.).
Local accuracy - the feature contributions must add up to the difference between the prediction and the average of all predictions (the baseline); see the formula below.
Consistency - if a model changes so that the marginal contribution of a feature value increases or stays the same, the Shapley value also increases or stays the same.
Missingness - a missing feature gets an attribution of zero.
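In symbols, local accuracy (the additivity property) can be written roughly as:

```latex
f(x) = \phi_0 + \sum_{i=1}^{M} \phi_i ,
\qquad \phi_0 = \mathbb{E}[f(X)]
```

where phi_i is the SHAP value of feature i for the instance x, phi_0 is the baseline (average prediction), and M is the number of features.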
Independent features - without getting too much into the theory of how it’s all actually calculated: if features are dependent, this may lead to putting too much weight on unlikely feature combinations because of the way SHAP values are calculated. The packages, though, take care of this at least for tree-based models.
Shapley values for feature contributions do not directly come from a local regression model. In regression models, the coefficients represent the effect of a feature assuming all the other features are already in the model. It is well-known that the values of the regression coefficients highly depend on the collinearity of the feature of interest with the other features that are included in the model. To eliminate this bias, Shapley values calculate feature contributions by averaging across all permutations of the features joining the model.
I won’t be getting into feature importance too much as Liran will be talking about it in a more detailed way.
Notice that in the kernel-based model there are no categorical features (gray).
Different models with a similar performance can base their predictions on completely different relations extracted from the same data. Despite the differences, the explanations are very useful.
Ca_14 for example was built to detect ATOs (the higher the value - the riskier)
Just look at FP or FN to understand the main pattern.
We can also look at them separately - only FP or only FN to catch more specific mistakes
The model used for feature selection may differ (in parameter configuration or in the type) from the one used for final fitting and prediction. This may result in suboptimal performances.
The standard methods tend to overestimate the importance of continuous or high-cardinality categorical variables.
SHAP helps when we perform feature selection. Instead of using the default tree-based importance, we select the best features as the ones with the highest SHAP values. It reduces the bias!
We still do it recursively (just instead of impurity or the Gini index, we use SHAP values).
Good for manual feature selection
It’s called a dependence plot
For example, here we can see that the same value (about 0) affects the prediction both highly positively and highly negatively.
We worked hard and built 50 features, but it takes time for the dev team to implement them all in production right away.
You need to choose wisely since their capacity and time are limited, so how do you decide?
Here we can see the feature importance including the existing and new features..
You can also check the feature importance of only the new features...
Only specific features? Or you want the whole group2 or 4?
Some domain knowledge and prior information are needed to choose an accurate number of clusters.
The advantage of using SHAP values for clustering is that the SHAP values of all features are on the same scale and have the same unit, unlike the raw features, which have different scales and are harder to compare or compute distances on (see the sketch below).
This also shows that the model sees those “segments” differently!
It shouldn’t be done blindly; the scores/scales should be checked and handled if needed.
You just need to find the right approach for your use case and your values to get the most accurate results out of it.
We can see that different segments have completely different sets of features that affect the predictions!
The importance can be explained using different segment sets (instead of just using the existing training set).
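A minimal sketch of clustering in the explanation space, assuming the `shap_values` and `X` from before (the number of clusters is a placeholder):

```python
import shap
from sklearn.cluster import KMeans

# SHAP values of all features share the same unit (contribution to the output),
# so Euclidean distances between rows of shap_values are meaningful.
n_clusters = 4  # needs domain knowledge / prior information, as noted above
segments = KMeans(n_clusters=n_clusters, random_state=0).fit_predict(shap_values)

# A per-segment summary plot shows which features drive each segment.
for seg in range(n_clusters):
    mask = segments == seg
    shap.summary_plot(shap_values[mask], X[mask])
```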
Other possibilities I can think of but haven’t tried yet…
Anomaly/Outlier Detection ?
Exponentially increasing number of permutations
Post-model analysis only
The data might change/drift
In theory, SHAP is the better approach as it provides mathematical guarantees for the accuracy and consistency of explanations.