SlideShare a Scribd company logo
1 of 45
Download to read offline
1/45
Explaining Machine Learning Decisions
Grégoire Montavon, TU Berlin
Joint work with: Wojciech Samek, Klaus-Robert Müller,
Sebastian Lapuschkin, Alexander Binder
18/09/2018 Intl. Workshop ML & AI, Telecom ParisTech
2/45
From ML Successes to Applications
Autonomous Driving
Medical Diagnosis
Networks (smart grids, etc.)
Visual Reasoning
AlphaGo beats Go
human champ
Deep Net outperforms
humans in image
classification
3/45
Can we interpret
what a ML model has
learned?
4/45
First, we need to define
what we want from
interpretable ML.
5/45
Understanding Deep Nets: Two Views
Understanding what
mechanism the network
uses to solve a problem or
implement a function.
Understanding how the
network relates the input
to the output variables.
6/45
interpreting
predicted classes
explaining
individual decisions
Interpreting Predicted Classes
Image from Symonian’13
Example: “How does a goose typically look like
according to the neural network?”
goose
non-goose
8/45
Explaining Individual Decisions
Images from Lapuschkin’16
Example: “Why is a given image classified as a
sheep?”
sheep
non-sheep
9/45
Example: Autonomous Driving [Bojarski’17]
Input:
Decision
Bojarski et al. 2017 “Explaining How a Deep Neural Network Trained with End-to-
End Learning Steers a Car”
PilotNet
Explanation:
10/45
Example: Pascal VOC Classification [Lapuschkin’16]
Comparing Performance on Pascal VOC 2007
(Fisher Vector Classifier vs. DeepNet pretrained on ImageNet)
Fisher classifier (pretrained) deep net
Lapuschkin et al. 2016. Analyzing Classifiers: Fisher Vectors and Deep Neural
Networks
11/45
Example: Pascal VOC Classification [Lapuschkin’16]
Lapuschkin et al. 2016. Analyzing Classifiers: Fisher Vectors and Deep Neural
Networks
12/45
Example: Medical Diagnosis [Binder’18]
A: Invasive breast cancer, H&E stain; B: Normal mammary glands and fibrous tissue,
H&E stain; C: Diffuse carcinoma infiltrate in fibrous tissue, Hematoxylin stain
Binder et al. 2018 “Towards computational fluorescence microscopy: Machine learning-
based integrated prediction of morphological and molecular tumor profiles”
13/45
Example: Quantum Chemistry [Schütt’17]
DFT calculation of
the stationary
Schrödinger
Equation
DTNN,
Schütt’17
molecular structure (e.g. atoms positions)
molecular electronic properties (e.g. atomization energy)
PBE0,
Pedrew’86
interpretable insight
Schütt et al. 2017: Quantum-Chemical Insights from Deep Tensor Neural Networks
14/45
Examples of Explanation Methods
15/45
Explaining by Decomposing
Importance of a variable is the share of the function score
that can be attributed to it.
Decomposition property:
input
DNN
16/45
Explaning Linear Models
A simple method:
17/45
Explaning Linear Models
Taylor decomposition approach:
Insight: explanation depends
on the root point.
18/45
Explaining Nonlinear Models
second-order terms are
hard to interpret and
can be very large
19/45
Explaining Nonlinear Models by Propagation
Layer-Wise Relevance
Propagation (LRP) [Bach’15]
20/45
Explaining Nonlinear Models by Propagation
Is there an
underlying
mathematical
framework?
21/45
Deep Taylor Decomposition (DTD) [Montavon’17]
Question: Suppose that we have propagated LRP scores
(“relevance”) until a given layer. How should it be
propagated one layer further?
Key idea: Let’s use Taylor expansions for this.
22/45
DTD Step 1: The Structure of Relevance
Observation: Relevance at each layer is a product of
the activation and an approximately constant term.
23/45
DTD Step 1: The Stucture of Relevance
24/45
DTD Step 2: Taylor Expansion
25/45
DTD Step 2: Taylor Expansion
Taylor expansion at root point:
Relevance can now be
backward propagated
26/45
DTD Step 3: Choosing the Root Point
(same as LRP-α1
β0
)
(Deep Taylor generic)
✔
1. nearest root
2. rescaled excitations
Choice of root point
27/45
DTD: Choosing the Root Point
(Deep Taylor generic)
Pixels domain
Choice of root point
28/45
DTD: Choosing the Root Point
(Deep Taylor generic)
Choice of root point
Embedding:
image source:
Tensorflow tutorial
29/45
DTD: Application to Pooling Layers
A sum-pooling layer over positive activations is equivalent to a ReLU
layer with weights 1.
A p-norm pooling layer can be approximated as a sum-pooling layer
multiplied by a ratio of norms that we treat as constant [Montavon’17].
→ Treat pooling layers as ReLU detection layers
30/45
Deep Taylor Decomposition on ConvNets
LRP-α1
β0
DTDfor
pixels
LRP-α1
β0
LRP-α1
β0
LRP-α1
β0
LRP-α1
β0
LRP-α1
β0
*
* For top-layers, other rules may improve selectivity
forward pass
backward pass
31/45
Implementing Propagation Rules
Sequence of element-wise
computations
Sequence of vector
computations
Example: LRP-α1
β0
:
32/45
Implementing Propagation Rules
Example: LRP-α1
β0
:
Code that reuses forward and gradient computations:
33/45
How Deep Taylor / LRP Scales
34/45
Implementation on Large-Scale Models [Alber’18]
https://github.com/albermax/innvestigate
35/45
DTD for Kernel Models [Kauffmann’18]
1. Build a neural network equivalent of the One-Class SVM:
Gaussian/Laplace Kernel
Student Kernel
2. Computes its deep Taylor
decomposition
Outlier score
36/45
DTD: Choosing the Root Point (Revisited)
✔
✔
1. nearest root
3. rescaled activation
2. rescaled excitations
Choice of root point
2. LRP-α1
β0
(Deep Taylor generic)
3. Another rule
37/45
Selecting the Explanation Technique
How to select the
best root points?
38/45
Selecting the Explanation Technique
Which rule leads to the
best explanation?
39/45
Selecting the Explanation Technique
What axioms should
an explanation satisfy?
40/45
Selection based on Axioms
division by zero →
scores explode.
LRP-α1
β0
41/45
Selection based on Axioms
discontinuous
step function
for bj
= 0
LRP-α1
β0
42/45
Explainable ML: Challenges
Validating
Explanations
Underlying
mathematical
framework
Axioms of an
explanation
Perturbation
analysis [Samek’17]
Similarity to
ground truth
Human
perception
43/45
Explainable ML: Opportunities
Using
Explanations
Human
interaction
Designing better
ML algorithms?
Extracting new
domain knowledge
Detecting unexpected
ML behavior
Finding weaknesses
of a dataset
44/45
Check our webpage
with interactive demos, software, tutorials, ...
and our tutorial paper:
Montavon, G., Samek, W., Müller, K.-R. Methods for Interpreting and Understanding Deep
Neural Networks, Digital Signal Processing, 2018
45/45
References
● [Alber’18] Alber, M. Lapuschkin, S., Seegerer, P., Hägele, M., Schütt, K., Montavon, G., Samek, W., Müller, K.-R.,
Dähne, S., Kindermans. P.-J. iNNvestigate neural networks. CoRR abs/1808.04260, 2018
● [Bach’15] Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.-R., Samek, W. On pixel-wise explanations
for nonlinear classifier decisions by layer-wise relevance propagation. PLOS ONE 10 (7), 2015
● [Binder’18] Binder, A., Bockmayr, M., …, Müller, K.-R., Klauschen, F. Towards computational fluorescence
microscopy: Machine learning-based integrated prediction of morphological and molecular tumor profiles CoRR
abs/1805.11178, 2018
● [Bojarski’17] Bojarski, M., Yeres, P., Choromanska, A., Choromanski, K., Firner, B., Jackel, L. D., Muller, U.
Explaining how a deep neural network trained with end-to-end learning steers a car. CoRR abs/1704.07911, 2017
● [Kauffmann’18] Kauffmann, J. Müller, K.-R., Montavon, G., Towards Explaining Anomalies: A Deep Taylor
Decomposition of One-Class Models CoRR abs/1805.06230, 2018
● [Lapuschkin’16] Lapuschkin, S., Binder, A., Montavon, G., Müller, K.-R., Samek, W. Analyzing classifiers: Fisher
vectors and deep neural networks. CVPR 2016
● [Montavon’17] Montavon, G., Lapuschkin, S., Binder, A., Samek, W., Müller, K.-R. Explaining nonlinear
classification decisions with deep Taylor decomposition. Pattern Recognition 65, 211–222, 2017
● [Samek’17] Samek, W., Binder, A., Montavon, G., Lapuschkin, S., Müller, K.-R. Evaluating the visualization of
what a deep neural network has learned. IEEE Transactions on Neural Networks and Learning Systems, 2017
● [Schütt’17] Schütt, K. T., Arbabzadah, F., Chmiela, S., Müller, K.-R., Tkatchenko, A. Quantum-Chemical Insights
from Deep Tensor Neural Networks, Nature Communications 8, 13890, 2017
● [Symonian’13] Symonian, K. Vedaldi, A. Zisserman, A. Deep Inside Convolutional Networks: Visualising Image
Classification Models and Saliency Maps. ArXiv 2013

More Related Content

Recently uploaded

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 

Recently uploaded (20)

Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

Featured

How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...DevGAMM Conference
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationErica Santiago
 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellSaba Software
 
Introduction to C Programming Language
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming LanguageSimplilearn
 

Featured (20)

How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
 
Introduction to C Programming Language
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming Language
 

Machine Learning Models: Explaining their Decisions, and Validating the Explanations

  • 1. 1/45 Explaining Machine Learning Decisions Grégoire Montavon, TU Berlin Joint work with: Wojciech Samek, Klaus-Robert Müller, Sebastian Lapuschkin, Alexander Binder 18/09/2018 Intl. Workshop ML & AI, Telecom ParisTech
  • 2. 2/45 From ML Successes to Applications Autonomous Driving Medical Diagnosis Networks (smart grids, etc.) Visual Reasoning AlphaGo beats Go human champ Deep Net outperforms humans in image classification
  • 3. 3/45 Can we interpret what a ML model has learned?
  • 4. 4/45 First, we need to define what we want from interpretable ML.
  • 5. 5/45 Understanding Deep Nets: Two Views Understanding what mechanism the network uses to solve a problem or implement a function. Understanding how the network relates the input to the output variables.
  • 7. Interpreting Predicted Classes Image from Symonian’13 Example: “How does a goose typically look like according to the neural network?” goose non-goose
  • 8. 8/45 Explaining Individual Decisions Images from Lapuschkin’16 Example: “Why is a given image classified as a sheep?” sheep non-sheep
  • 9. 9/45 Example: Autonomous Driving [Bojarski’17] Input: Decision Bojarski et al. 2017 “Explaining How a Deep Neural Network Trained with End-to- End Learning Steers a Car” PilotNet Explanation:
  • 10. 10/45 Example: Pascal VOC Classification [Lapuschkin’16] Comparing Performance on Pascal VOC 2007 (Fisher Vector Classifier vs. DeepNet pretrained on ImageNet) Fisher classifier (pretrained) deep net Lapuschkin et al. 2016. Analyzing Classifiers: Fisher Vectors and Deep Neural Networks
  • 11. 11/45 Example: Pascal VOC Classification [Lapuschkin’16] Lapuschkin et al. 2016. Analyzing Classifiers: Fisher Vectors and Deep Neural Networks
  • 12. 12/45 Example: Medical Diagnosis [Binder’18] A: Invasive breast cancer, H&E stain; B: Normal mammary glands and fibrous tissue, H&E stain; C: Diffuse carcinoma infiltrate in fibrous tissue, Hematoxylin stain Binder et al. 2018 “Towards computational fluorescence microscopy: Machine learning- based integrated prediction of morphological and molecular tumor profiles”
  • 13. 13/45 Example: Quantum Chemistry [Schütt’17] DFT calculation of the stationary Schrödinger Equation DTNN, Schütt’17 molecular structure (e.g. atoms positions) molecular electronic properties (e.g. atomization energy) PBE0, Pedrew’86 interpretable insight Schütt et al. 2017: Quantum-Chemical Insights from Deep Tensor Neural Networks
  • 15. 15/45 Explaining by Decomposing Importance of a variable is the share of the function score that can be attributed to it. Decomposition property: input DNN
  • 17. 17/45 Explaning Linear Models Taylor decomposition approach: Insight: explanation depends on the root point.
  • 18. 18/45 Explaining Nonlinear Models second-order terms are hard to interpret and can be very large
  • 19. 19/45 Explaining Nonlinear Models by Propagation Layer-Wise Relevance Propagation (LRP) [Bach’15]
  • 20. 20/45 Explaining Nonlinear Models by Propagation Is there an underlying mathematical framework?
  • 21. 21/45 Deep Taylor Decomposition (DTD) [Montavon’17] Question: Suppose that we have propagated LRP scores (“relevance”) until a given layer. How should it be propagated one layer further? Key idea: Let’s use Taylor expansions for this.
  • 22. 22/45 DTD Step 1: The Structure of Relevance Observation: Relevance at each layer is a product of the activation and an approximately constant term.
  • 23. 23/45 DTD Step 1: The Stucture of Relevance
  • 24. 24/45 DTD Step 2: Taylor Expansion
  • 25. 25/45 DTD Step 2: Taylor Expansion Taylor expansion at root point: Relevance can now be backward propagated
  • 26. 26/45 DTD Step 3: Choosing the Root Point (same as LRP-α1 β0 ) (Deep Taylor generic) ✔ 1. nearest root 2. rescaled excitations Choice of root point
  • 27. 27/45 DTD: Choosing the Root Point (Deep Taylor generic) Pixels domain Choice of root point
  • 28. 28/45 DTD: Choosing the Root Point (Deep Taylor generic) Choice of root point Embedding: image source: Tensorflow tutorial
  • 29. 29/45 DTD: Application to Pooling Layers A sum-pooling layer over positive activations is equivalent to a ReLU layer with weights 1. A p-norm pooling layer can be approximated as a sum-pooling layer multiplied by a ratio of norms that we treat as constant [Montavon’17]. → Treat pooling layers as ReLU detection layers
  • 30. 30/45 Deep Taylor Decomposition on ConvNets LRP-α1 β0 DTDfor pixels LRP-α1 β0 LRP-α1 β0 LRP-α1 β0 LRP-α1 β0 LRP-α1 β0 * * For top-layers, other rules may improve selectivity forward pass backward pass
  • 31. 31/45 Implementing Propagation Rules Sequence of element-wise computations Sequence of vector computations Example: LRP-α1 β0 :
  • 32. 32/45 Implementing Propagation Rules Example: LRP-α1 β0 : Code that reuses forward and gradient computations:
  • 33. 33/45 How Deep Taylor / LRP Scales
  • 34. 34/45 Implementation on Large-Scale Models [Alber’18] https://github.com/albermax/innvestigate
  • 35. 35/45 DTD for Kernel Models [Kauffmann’18] 1. Build a neural network equivalent of the One-Class SVM: Gaussian/Laplace Kernel Student Kernel 2. Computes its deep Taylor decomposition Outlier score
  • 36. 36/45 DTD: Choosing the Root Point (Revisited) ✔ ✔ 1. nearest root 3. rescaled activation 2. rescaled excitations Choice of root point 2. LRP-α1 β0 (Deep Taylor generic) 3. Another rule
  • 37. 37/45 Selecting the Explanation Technique How to select the best root points?
  • 38. 38/45 Selecting the Explanation Technique Which rule leads to the best explanation?
  • 39. 39/45 Selecting the Explanation Technique What axioms should an explanation satisfy?
  • 40. 40/45 Selection based on Axioms division by zero → scores explode. LRP-α1 β0
  • 41. 41/45 Selection based on Axioms discontinuous step function for bj = 0 LRP-α1 β0
  • 42. 42/45 Explainable ML: Challenges Validating Explanations Underlying mathematical framework Axioms of an explanation Perturbation analysis [Samek’17] Similarity to ground truth Human perception
  • 43. 43/45 Explainable ML: Opportunities Using Explanations Human interaction Designing better ML algorithms? Extracting new domain knowledge Detecting unexpected ML behavior Finding weaknesses of a dataset
  • 44. 44/45 Check our webpage with interactive demos, software, tutorials, ... and our tutorial paper: Montavon, G., Samek, W., Müller, K.-R. Methods for Interpreting and Understanding Deep Neural Networks, Digital Signal Processing, 2018
  • 45. 45/45 References ● [Alber’18] Alber, M. Lapuschkin, S., Seegerer, P., Hägele, M., Schütt, K., Montavon, G., Samek, W., Müller, K.-R., Dähne, S., Kindermans. P.-J. iNNvestigate neural networks. CoRR abs/1808.04260, 2018 ● [Bach’15] Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.-R., Samek, W. On pixel-wise explanations for nonlinear classifier decisions by layer-wise relevance propagation. PLOS ONE 10 (7), 2015 ● [Binder’18] Binder, A., Bockmayr, M., …, Müller, K.-R., Klauschen, F. Towards computational fluorescence microscopy: Machine learning-based integrated prediction of morphological and molecular tumor profiles CoRR abs/1805.11178, 2018 ● [Bojarski’17] Bojarski, M., Yeres, P., Choromanska, A., Choromanski, K., Firner, B., Jackel, L. D., Muller, U. Explaining how a deep neural network trained with end-to-end learning steers a car. CoRR abs/1704.07911, 2017 ● [Kauffmann’18] Kauffmann, J. Müller, K.-R., Montavon, G., Towards Explaining Anomalies: A Deep Taylor Decomposition of One-Class Models CoRR abs/1805.06230, 2018 ● [Lapuschkin’16] Lapuschkin, S., Binder, A., Montavon, G., Müller, K.-R., Samek, W. Analyzing classifiers: Fisher vectors and deep neural networks. CVPR 2016 ● [Montavon’17] Montavon, G., Lapuschkin, S., Binder, A., Samek, W., Müller, K.-R. Explaining nonlinear classification decisions with deep Taylor decomposition. Pattern Recognition 65, 211–222, 2017 ● [Samek’17] Samek, W., Binder, A., Montavon, G., Lapuschkin, S., Müller, K.-R. Evaluating the visualization of what a deep neural network has learned. IEEE Transactions on Neural Networks and Learning Systems, 2017 ● [Schütt’17] Schütt, K. T., Arbabzadah, F., Chmiela, S., Müller, K.-R., Tkatchenko, A. Quantum-Chemical Insights from Deep Tensor Neural Networks, Nature Communications 8, 13890, 2017 ● [Symonian’13] Symonian, K. Vedaldi, A. Zisserman, A. Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps. ArXiv 2013