SlideShare a Scribd company logo
Interactive and Interpretable
Machine Learning Models
for Human Machine Collaboration
Been Kim
Nov 2015
Vision
Harness the relative strength of
humans and machine learning models
2
Human
Machine
Learning
Models
http://blogs.teradata.com/
Research objectives
Develop machine learning models inspired by how
humans think that can…
3
Human
Machine
Learning
Models
http://blogs.teradata.com/
Go#here#
4
Human
Machine
Learning
Models
infer decisions
of humans
Develop machine learning models inspired by how
humans think that can…
http://blogs.teradata.com/
Research objectives
5
Human
Machine
Learning
Models
make sense
to humansinfer decisions
of humans
Develop machine learning models inspired by how
humans think that can…
http://blogs.teradata.com/
Research objectives
6
Human
Machine
Learning
Models
infer decisions
of humans
interact with
humans
make sense
to humans
Develop machine learning models inspired by how
humans think that can…
http://blogs.teradata.com/
Research objectives
Develop machine learning models inspired by how
humans think that can…
1. Infer human team
decisions from
team planning
conversation
2. Communication
from machine to
human:
provide intuitive
explanations
3. Communication
from human to
machine:
incorporate
feedback
Go#here#
infer decisions
of humans
make sense to
humans
interact with
humans
7
Research objectives
Road map
8
2. Communication
from machine to
human:
provide intuitive
explanations
3. Communication
from human to
machine:
incorporate
feedback
make sense to
humans
interact with
humans
1. Infer human team
decisions from
team planning
conversation
Go#here#
infer decisions
of humans
Road map
1. Infer human team
decisions from
team planning
conversation
9
2. Communication
from machine to
human:
provide intuitive
explanations
3. Communication
from human to
machine:
incorporate
feedback
infer decisions
of humans
make sense to
humans
interact with
humans
Go#here#
• Human’s tactical decision is based on
exemplar-based reasoning (matching and
prototyping) [Cohen 96, Newell 72]
• Skilled fire fighters use recognition-primed
decision making — a situation is matched
to typical cases [Klein 89]
• Machines can better support peoples’
decision-making by representing data in
the same way
Mirror the way humans think
10
Case-based reasoning and
interpretable models
11
Case-based reasoning
• Applied to various applications thanks
to its intuitive power
[Aamodt 94, Slade 91, Bekkerman 06]
Limitations
• Always require labels (supervised)
• Does not scale to complex problems
• Does not leverage global patterns of
data
Interpretable models
• Decision tree [De`ath 00]
• Sparse linear classifiers
[Tibshirani 96, Ustun 14]
• Prototype-based [Graf 09]
Limitations
• Sparsity is not enough [Freitas 14]
• Linear models or supervised
Our approach:
Bayesian Case Model (BCM)
*
Bayesian generative models
Case-based reasoning
Bayesian Case Model (BCM)
• Leverage the power of examples (prototypes) and
subspaces (hot features) to explain machine
learning results
prototypes
subspaces
Explain
complicated
concepts using
examples
12
[Kim, Rudin, Shah NIPS 2014]
Bayesian Case Model (BCM)
13
• A general framework for Bayesian case-based reasoning
• Joint inference on prototypes, subspaces and cluster labels
Cluster A
Bayesian Case Model (BCM)
…
Cluster B Cluster C
prototypes subspaces cluster labels
14
subspaces
prototypes
Explanations provided by
Bayesian Case Model (BCM)
salsa
sour cream
avocado
salt, pepper, taco
shell, lettuce, oil
Taco
flour
egg
water, salt, milk,
butter
Basic crepe
chocolate
strawberry
pie crust, whipping cream,
kirsch, almonds
Chocolate berry tart
Cluster A Cluster B Cluster C
15
Prototype
Quintessential observation
that best represents the cluster
Subspace
sets of important features
in characterizing clusters
• A general framework for Bayesian case-based reasoning
• Joint inference on cluster labels, prototypes and subspaces
Bayesian Case Model (BCM)
salsa
sour cream
avocado
Taco
Explain Cluster A
1. clustering
2. learning
explanation
prototypes subspacescluster labels
16
It is a crepe, since it has flour and egg.
It is inspired by Mexican food, because
it has avocado, salsa and sour cream.
Cluster labels:
• Admixture model for modeling the underlying distributions
Cluster A Cluster B Cluster C
= [A, B, A]mexican_crepe
Bayesian Case Model (BCM)
1. Clustering part
17
It is a crepe, since it has flour and egg.
It is sweet crepe that is like chocolate
and berry dessert.
• Admixture model for modeling the underlying distributions
Cluster A Cluster B Cluster C
= [B, C, C]chocolate_crepe
Bayesian Case Model (BCM)
1. Clustering part
18
• Cluster distribution + supervised classification methods can be
used for evaluating the clustering performance[1]
• Hyper parameter can be used to control how many different cluster
labels within one data point
The concentration
parameter:
Cluster distribution
of the data point
[1] D. Blei, A. Ng, M. Jordan 2003
Bayesian Case Model (BCM)
1. Clustering part
19
Subspaces
binary variable
1 for important
features
• Each cluster is characterized by a prototype and subspaces
Prototype
Bayesian Case Model (BCM)
2. Learning explanation part
20
A prototype is an
actual data point that
exists in the dataset
Prototype
Bayesian Case Model (BCM)
2. Learning explanation part
• prototype: quintessential observation that best represents the cluster
21
Bayesian Case Model (BCM)
2. Learning explanation part
• subspace: sets of important features in characterizing clusters
Subspaces
binary variable
1 for important
features
22
Subspaces
binary variable
1 for important
features
• subspace: sets of important features in characterizing clusters
Any similarity measure can be used.
For example, using any loss function:
The feature j of
cluster s is an
important feature
(i.e., subspace)
The value of feature j
is identical to the
value of the
prototype of clusters
Bayesian Case Model (BCM)
2. Learning explanation part
23
Results
Challenges of interpretable models
1. Do the learned prototypes and subspaces make
sense?
2. Are we sacrificing performance for the interpretability?
3. Do learned prototypes and subspaces help humans’
understanding?
24
Data from computer cooking contest: liris/cnrs.fr/ccc/ccc2014
• Unsupervised clustering on a subset of recipe data
1. Do the learned prototypes and subspaces make sense?
BCM on recipe data
25
sesam
e
1. Do the learned prototypes and subspaces make sense?
BCM on digit data
http://www.cs.nyu.edu/~roweis/data.html
26
Learned cluster D
Gibbs sampling iteration
1. Do the learned prototypes and subspaces make sense?
BCM on digit data
27
2. Are we sacrificing anything for the interpretability?
Maintain accuracy
Handdigit
dataset
20Newsgroups
dataset
Sensitivity
Analysis
BCM BCM
28
2. Are we sacrificing anything for the interpretability?
Joint inference on prototypes,
subspaces and cluster labels is the key
Posterior distribution
Level set
Another solution
that clusters data equally well
and
has better interpretability
—- BCM gives higher score for
this solution
One solution
that clusters
data well
29
Collapsed Gibbs sampling
for inference
• Observed to converge quickly in admixture models
• Integrating out and for efficient inference
30
[Kim, Rudin, Shah NIPS 2014]
3. Does the model make sense to humans?
Objective measure of human understanding
Accuracy of human classifier
a new data
point to be
classified
• Participant’s task is to assign
the ingredients of a specific
dish (a new data point) to a
cluster
• Each cluster is explained
using either BCM or LDA
31
• 384 classification questions asked to 24 people
• Statistically significantly better performance with BCM
(85.9% v.s. 71.3%)
a new data
point to be
classified
Explanations of clusters
Clusters explained
using
1. BCM :
ingredients of the
prototype recipe
2. LDA:
representative
ingredients of
each cluster
3. Does the model make sense to humans?
Objective measure of human understanding
Accuracy of human classifier
32
[Kim, Rudin, Shah NIPS 2014]
sesam
e
Road map
1. Infer human team
decisions from
team planning
conversation
33
2. Communication
from machine to
human:
provide intuitive
explanations
3. Communication
from human to
machine:
incorporate
feedback
make sense to
humans
interact with
humans
Go#here#
infer decisions
of humans
Why interactive?
34
Why interactive?
35
Why interactive?
36
Related work on
interactive machine learning
• Interact via multiple model parameter
settings [Patel 10, Amershi 15]
• Design smart interfaces [Amershi 11]
and visualization [Chaney 12, Gou 03]
• Interact via simplified medium of
interaction [Kapoor 10, Ware 01]
Prototypes
and
Subspaces!
37
interactive BCM (iBCM)
38
BCM iBCM
Double circled nodes
represent interacted
latent variables —
Node that get
information from both
user feedback and
information obtained
from data points
interactive BCM (iBCM)
39
BCM iBCM
Double circled nodes
represent interacted
latent variables —
Node that get
information from both
user feedback and
information obtained
from data points
interactive BCM (iBCM)
internal mechanism
40
3. Listen to
Data
Key: Balance between what the data indicates and
what makes most sense to the user
Our approach: Decompose Gibbs sampling steps to
1) adjust the feedback propagation depending on user’s confidence
2) accelerate the inference by rearranging latent variables
2. Propagate
Users feedback
to accelerate
inference
1. Listen to
Users
User’s workflow with iBCM
abstract domain
41
click to change
to
to
click to promote
any items to be
prototype
Experiment procedure
1. Subjects are asked how they want to
group items
2. Subjects view results from BCM
• Essentially shows one of the
optimal clustering
3. Subjects indicate how well the results
matched their preferred clustering
4. Subjects interact with iBCM
5. Subjects indicate how well the results
matches with what they want
42
Compare 24 participants, 192 questions
Experiment results
1. Subjects are asked how they want to
group items
2. Subjects view results from BCM
• Essentially shows one of the
optimal clustering
3. Subjects indicate how well the results
matched their preferred clustering
4. Subjects interact with iBCM
5. Subjects indicate how well the results
matches with what they want
43
24 participants, 192 questions
Participants agreed more
strongly that final clusters
matched their preferences
compared to the initial
clusters
Wilcoxon signed rank test
iBCM for introductory
programming education
44
• Why education?
• Current teachers’ workflow for creating grading rubric:
randomly pick 4-5 assignments and Hodgepodge Grading
[Cross 99]
• Understanding this variation is important for providing
appropriate, tailored feedback to students [Basu13, Huang 13]
• What are the challenges?
• Extracting right features — OverCode [Glassman 15]
iBCM + OverCode system
45submissions from MIT introductory python classes
iBCM + OverCode system
46
Select/unselect
subspaces
(keywords)
Promote/demote
prototypes
iBCM experiment with
domain experts results
Click here to get a
new grouping
V.S.
Task: Explore the full spectrum of students’ submissions and
write down `discovery list’ for a recitation
47
d
48
Experiment with
domain experts results
• 48 problems explored by 12
subjects who previously
taught introductory python
class
• participants agreed more
strongly to the following
compared to BCM ( )
Were more satisfied
Better explored the full spectrum of
students’ submissions
Better identified important features to
expand discovery list
Important features and prototypes are
useful𝑝 < 0.001
49
with iBCM, they…
Wilcoxon signed rank test
Experiment with
domain experts results
• 48 problems explored by 12
subjects who previously
taught introductory python
class
• participants agreed more
strongly to the following
compared to BCM ( )
Were more satisfied
Better explored the full spectrum of
students’ submissions
Better identified important features to
expand discovery list
Important features and prototypes are
useful𝑝 < 0.001
50
with iBCM, they…
“[iBCM enabled me to] go in depth
as to how students could do”
“ [iBCM] is useful with large datasets
where brute-force would not be practical.”
Wilcoxon signed rank test
Summary
51
[Kim, Chacha, Shah AAAI13]
[Kim, Chacha, Shah JAIR15]
Communication from
machine to human:
provide intuitive
explanations
make sense to
humans
interact with
humans
[Kim, Rudin, Shah NIPS 2014]
[Kim, Glassman, Johnson, Shah submitted*]
[Kim, Patel, Rostamizadeh, Shah AAAI 2015]
Inspiration: how humans
make decisions
Approach: case-based
Bayesian model
Results: provided intuitive
explanations while
maintaining performance
Approach: enable
interaction by
decomposing sampling
inference steps
Results: implemented and
validated the approach in
education domain
Communication from
human to machine:
incorporate feedback
miss-classified data
Next steps
• Interpretability for data
exploration: visualization
• Domain specific interpretability:
learning features that
distinguishes clusters
• Interactive machine learning for
debugging models or hyper
parameter explorations
predicted:
politics
Doc id #24
True label: medicine
[Kim, Patel, Rostamizadeh, Shah AAAI 2015]
52
[Kim, Doshi-Velez, Shah NIPS 2015]
Next steps at AI2
• Extend interpretability for initially
uninterpretable features (neural nets)
53
4th grade
science
exam
question
Q&A
[Kim, Chacha, Shah AAAI13]
[Kim, Chacha, Shah JAIR15]
Communication from
machine to human:
provide intuitive
explanations
make sense to
humans
interact with
humans
[Kim, Rudin, Shah NIPS 2014]
[Kim, Glassman, Johnson, Shah submitted*]
[Kim, Patel, Rostamizadeh, Shah AAAI 2015]
Inspiration: how humans
make decisions
Approach: case-based
Bayesian model
Results: provided intuitive
explanations while
maintaining performance
Approach: enable
interaction by
decomposing sampling
inference steps
Results: implemented and
validated the approach in
education domain
Communication from
human to machine:
incorporate feedback
[Kim, Doshi-Velez, Shah NIPS 2015]
AI2 is hiring
research interns
any time of the year.
Shoot me an email
if interested!
beenk@allenai.org

More Related Content

What's hot

Unified Approach to Interpret Machine Learning Model: SHAP + LIME
Unified Approach to Interpret Machine Learning Model: SHAP + LIMEUnified Approach to Interpret Machine Learning Model: SHAP + LIME
Unified Approach to Interpret Machine Learning Model: SHAP + LIMEDatabricks
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsFrancesca Lazzeri, PhD
 
ML Interpretability Inside Out
ML Interpretability Inside OutML Interpretability Inside Out
ML Interpretability Inside OutMara Graziani
 
How can pre-training help to solve the cold start problem?
How can pre-training help to solve the cold start problem?How can pre-training help to solve the cold start problem?
How can pre-training help to solve the cold start problem?Lokesh Vadlamudi
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine LearningKnoldus Inc.
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learningUmmeSalmaM1
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt Poojamanic
 
A Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningA Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningHaptik
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)butest
 
Explainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretableExplainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretableAditya Bhattacharya
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learningbutest
 
Ideas on Machine Learning Interpretability
Ideas on Machine Learning InterpretabilityIdeas on Machine Learning Interpretability
Ideas on Machine Learning InterpretabilitySri Ambati
 
Machine learning
Machine learningMachine learning
Machine learninghplap
 
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...Jerrin George
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionGirish Gore
 
Hot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesisHot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesisWriteMyThesis
 
Machine Learning
Machine LearningMachine Learning
Machine LearningRahul Kumar
 

What's hot (20)

Unified Approach to Interpret Machine Learning Model: SHAP + LIME
Unified Approach to Interpret Machine Learning Model: SHAP + LIMEUnified Approach to Interpret Machine Learning Model: SHAP + LIME
Unified Approach to Interpret Machine Learning Model: SHAP + LIME
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
 
ML Interpretability Inside Out
ML Interpretability Inside OutML Interpretability Inside Out
ML Interpretability Inside Out
 
How can pre-training help to solve the cold start problem?
How can pre-training help to solve the cold start problem?How can pre-training help to solve the cold start problem?
How can pre-training help to solve the cold start problem?
 
Introduction To Machine Learning
Introduction To Machine LearningIntroduction To Machine Learning
Introduction To Machine Learning
 
Lecture1 introduction to machine learning
Lecture1 introduction to machine learningLecture1 introduction to machine learning
Lecture1 introduction to machine learning
 
Machine learning
Machine learningMachine learning
Machine learning
 
Machine learning ppt
Machine learning ppt Machine learning ppt
Machine learning ppt
 
A Friendly Introduction to Machine Learning
A Friendly Introduction to Machine LearningA Friendly Introduction to Machine Learning
A Friendly Introduction to Machine Learning
 
Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)Lecture #1: Introduction to machine learning (ML)
Lecture #1: Introduction to machine learning (ML)
 
Explainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretableExplainable AI - making ML and DL models more interpretable
Explainable AI - making ML and DL models more interpretable
 
Brief Tour of Machine Learning
Brief Tour of Machine LearningBrief Tour of Machine Learning
Brief Tour of Machine Learning
 
Ideas on Machine Learning Interpretability
Ideas on Machine Learning InterpretabilityIdeas on Machine Learning Interpretability
Ideas on Machine Learning Interpretability
 
Machine learning
Machine learningMachine learning
Machine learning
 
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...Identification of Relevant Sections in Web Pages Using a Machine Learning App...
Identification of Relevant Sections in Web Pages Using a Machine Learning App...
 
Machine learning
Machine learningMachine learning
Machine learning
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
 
Hot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesisHot Topics in Machine Learning For Research and thesis
Hot Topics in Machine Learning For Research and thesis
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Machine Can Think
Machine Can ThinkMachine Can Think
Machine Can Think
 

Similar to Been Kim - Interpretable machine learning, Nov 2015

[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language ModelsDataScienceConferenc1
 
Ch 6 only 1. Distinguish between a purpose statement, research p
Ch 6 only 1. Distinguish between a purpose statement, research pCh 6 only 1. Distinguish between a purpose statement, research p
Ch 6 only 1. Distinguish between a purpose statement, research pMaximaSheffield592
 
Ch 6 only 1. distinguish between a purpose statement, research p
Ch 6 only 1. distinguish between a purpose statement, research pCh 6 only 1. distinguish between a purpose statement, research p
Ch 6 only 1. distinguish between a purpose statement, research pnand15
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsAxel de Romblay
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation SystemsAxel de Romblay
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIPramit Choudhary
 
Train, explain, acclaim. Build a good model in three steps
Train, explain, acclaim.  Build a good model in three stepsTrain, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim. Build a good model in three stepsPrzemek Biecek
 
LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4Hendrik Drachsler
 
My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningDaniel Tunkelang
 
Machine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and ApplicationsMachine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and ApplicationsQuantUniversity
 
ICMT 2016: Search-Based Model Transformations with MOMoT
ICMT 2016: Search-Based Model Transformations with MOMoTICMT 2016: Search-Based Model Transformations with MOMoT
ICMT 2016: Search-Based Model Transformations with MOMoTMartin Fleck
 
​​Explainability in AI and Recommender systems: let’s make it interactive!
​​Explainability in AI and Recommender systems: let’s make it interactive!​​Explainability in AI and Recommender systems: let’s make it interactive!
​​Explainability in AI and Recommender systems: let’s make it interactive!Eindhoven University of Technology / JADS
 
Interpretable Machine Learning
Interpretable Machine LearningInterpretable Machine Learning
Interpretable Machine LearningSri Ambati
 
When UX (guy) Meets Operations
When UX (guy) Meets OperationsWhen UX (guy) Meets Operations
When UX (guy) Meets OperationsTim Sheiner
 
Optimizing Market Segmentation
Optimizing Market SegmentationOptimizing Market Segmentation
Optimizing Market SegmentationRobert Colner
 
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM RecommendersYONG ZHENG
 
On building more human query answering systems
On building more human query answering systemsOn building more human query answering systems
On building more human query answering systemsINRIA-OAK
 
deep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptdeep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptPerumalPitchandi
 

Similar to Been Kim - Interpretable machine learning, Nov 2015 (20)

[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
[DSC Europe 23] Dmitry Ustalov - Design and Evaluation of Large Language Models
 
Ch 6 only 1. Distinguish between a purpose statement, research p
Ch 6 only 1. Distinguish between a purpose statement, research pCh 6 only 1. Distinguish between a purpose statement, research p
Ch 6 only 1. Distinguish between a purpose statement, research p
 
Ch 6 only 1. distinguish between a purpose statement, research p
Ch 6 only 1. distinguish between a purpose statement, research pCh 6 only 1. distinguish between a purpose statement, research p
Ch 6 only 1. distinguish between a purpose statement, research p
 
Ai in finance
Ai in financeAi in finance
Ai in finance
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation Systems
 
[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems[UPDATE] Udacity webinar on Recommendation Systems
[UPDATE] Udacity webinar on Recommendation Systems
 
Patterns Overview
Patterns OverviewPatterns Overview
Patterns Overview
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AI
 
Train, explain, acclaim. Build a good model in three steps
Train, explain, acclaim.  Build a good model in three stepsTrain, explain, acclaim.  Build a good model in three steps
Train, explain, acclaim. Build a good model in three steps
 
LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4LinkedUp kickoff meeting session 4
LinkedUp kickoff meeting session 4
 
My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine Learning
 
Machine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and ApplicationsMachine Learning and AI: Core Methods and Applications
Machine Learning and AI: Core Methods and Applications
 
ICMT 2016: Search-Based Model Transformations with MOMoT
ICMT 2016: Search-Based Model Transformations with MOMoTICMT 2016: Search-Based Model Transformations with MOMoT
ICMT 2016: Search-Based Model Transformations with MOMoT
 
​​Explainability in AI and Recommender systems: let’s make it interactive!
​​Explainability in AI and Recommender systems: let’s make it interactive!​​Explainability in AI and Recommender systems: let’s make it interactive!
​​Explainability in AI and Recommender systems: let’s make it interactive!
 
Interpretable Machine Learning
Interpretable Machine LearningInterpretable Machine Learning
Interpretable Machine Learning
 
When UX (guy) Meets Operations
When UX (guy) Meets OperationsWhen UX (guy) Meets Operations
When UX (guy) Meets Operations
 
Optimizing Market Segmentation
Optimizing Market SegmentationOptimizing Market Segmentation
Optimizing Market Segmentation
 
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
[CIKM 2014] Deviation-Based Contextual SLIM Recommenders
 
On building more human query answering systems
On building more human query answering systemsOn building more human query answering systems
On building more human query answering systems
 
deep_Visualization in Data mining.ppt
deep_Visualization in Data mining.pptdeep_Visualization in Data mining.ppt
deep_Visualization in Data mining.ppt
 

More from Seattle DAML meetup

Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016Seattle DAML meetup
 
Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016
Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016
Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016Seattle DAML meetup
 
Understanding disparities using the American Community Survey - Sean Green, M...
Understanding disparities using the American Community Survey - Sean Green, M...Understanding disparities using the American Community Survey - Sean Green, M...
Understanding disparities using the American Community Survey - Sean Green, M...Seattle DAML meetup
 
Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016
Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016
Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016Seattle DAML meetup
 
Frequent Pattern Mining - Krishna Sridhar, Feb 2016
Frequent Pattern Mining - Krishna Sridhar, Feb 2016Frequent Pattern Mining - Krishna Sridhar, Feb 2016
Frequent Pattern Mining - Krishna Sridhar, Feb 2016Seattle DAML meetup
 
Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016Seattle DAML meetup
 
Hunting criminals with hybrid analytics -- October 2015
Hunting criminals with hybrid analytics -- October 2015Hunting criminals with hybrid analytics -- October 2015
Hunting criminals with hybrid analytics -- October 2015Seattle DAML meetup
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Seattle DAML meetup
 
Adventures in Data Visualization - Jeff Heer, May 2015
Adventures in Data Visualization - Jeff Heer, May 2015Adventures in Data Visualization - Jeff Heer, May 2015
Adventures in Data Visualization - Jeff Heer, May 2015Seattle DAML meetup
 
The Road to Data Science - Joel Grus, June 2015
The Road to Data Science - Joel Grus, June 2015The Road to Data Science - Joel Grus, June 2015
The Road to Data Science - Joel Grus, June 2015Seattle DAML meetup
 
Scaling decision trees - George Murray, July 2015
Scaling decision trees - George Murray, July 2015Scaling decision trees - George Murray, July 2015
Scaling decision trees - George Murray, July 2015Seattle DAML meetup
 

More from Seattle DAML meetup (11)

Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016Karin Strauss - DNA Storage, July 2016
Karin Strauss - DNA Storage, July 2016
 
Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016
Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016
Alex Korbonits, "AUC at what costs?" Seattle DAML June 2016
 
Understanding disparities using the American Community Survey - Sean Green, M...
Understanding disparities using the American Community Survey - Sean Green, M...Understanding disparities using the American Community Survey - Sean Green, M...
Understanding disparities using the American Community Survey - Sean Green, M...
 
Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016
Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016
Towards Automatic Moderation of Online Hate Speech - Emily Spahn, March 2016
 
Frequent Pattern Mining - Krishna Sridhar, Feb 2016
Frequent Pattern Mining - Krishna Sridhar, Feb 2016Frequent Pattern Mining - Krishna Sridhar, Feb 2016
Frequent Pattern Mining - Krishna Sridhar, Feb 2016
 
Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016Streaming Hypothesis Reasoning - William Smith, Jan 2016
Streaming Hypothesis Reasoning - William Smith, Jan 2016
 
Hunting criminals with hybrid analytics -- October 2015
Hunting criminals with hybrid analytics -- October 2015Hunting criminals with hybrid analytics -- October 2015
Hunting criminals with hybrid analytics -- October 2015
 
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
Machine Learning in Biology and Why It Doesn't Make Sense - Theo Knijnenburg,...
 
Adventures in Data Visualization - Jeff Heer, May 2015
Adventures in Data Visualization - Jeff Heer, May 2015Adventures in Data Visualization - Jeff Heer, May 2015
Adventures in Data Visualization - Jeff Heer, May 2015
 
The Road to Data Science - Joel Grus, June 2015
The Road to Data Science - Joel Grus, June 2015The Road to Data Science - Joel Grus, June 2015
The Road to Data Science - Joel Grus, June 2015
 
Scaling decision trees - George Murray, July 2015
Scaling decision trees - George Murray, July 2015Scaling decision trees - George Murray, July 2015
Scaling decision trees - George Murray, July 2015
 

Recently uploaded

SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSELF-EXPLANATORY
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxmuralinath2
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONChetanK57
 
NuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent UniversityNuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent Universitypablovgd
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxAlguinaldoKong
 
INSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere UniversityINSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere UniversitySteffi Friedrichs
 
FAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesFAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesAlex Henderson
 
Seminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptxSeminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptxRUDYLUMAPINET2
 
Transport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSETransport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSEjordanparish425
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...muralinath2
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsMichel Dumontier
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Sérgio Sacani
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxmuralinath2
 
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Sérgio Sacani
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard Gill
 
The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...Sérgio Sacani
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptsreddyrahul
 
A Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthA Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthSérgio Sacani
 
SAMPLING.pptx for analystical chemistry sample techniques
SAMPLING.pptx for analystical chemistry sample techniquesSAMPLING.pptx for analystical chemistry sample techniques
SAMPLING.pptx for analystical chemistry sample techniquesrodneykiptoo8
 
National Biodiversity protection initiatives and Convention on Biological Di...
National Biodiversity protection initiatives and  Convention on Biological Di...National Biodiversity protection initiatives and  Convention on Biological Di...
National Biodiversity protection initiatives and Convention on Biological Di...PABOLU TEJASREE
 

Recently uploaded (20)

SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
NuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent UniversityNuGOweek 2024 full programme - hosted by Ghent University
NuGOweek 2024 full programme - hosted by Ghent University
 
EY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptxEY - Supply Chain Services 2018_template.pptx
EY - Supply Chain Services 2018_template.pptx
 
INSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere UniversityINSIGHT Partner Profile: Tampere University
INSIGHT Partner Profile: Tampere University
 
FAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS imagesFAIRSpectra - Towards a common data file format for SIMS images
FAIRSpectra - Towards a common data file format for SIMS images
 
Seminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptxSeminar on Halal AGriculture and Fisheries.pptx
Seminar on Halal AGriculture and Fisheries.pptx
 
Transport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSETransport in plants G1.pptx Cambridge IGCSE
Transport in plants G1.pptx Cambridge IGCSE
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
Gliese 12 b: A Temperate Earth-sized Planet at 12 pc Ideal for Atmospheric Tr...
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...The importance of continents, oceans and plate tectonics for the evolution of...
The importance of continents, oceans and plate tectonics for the evolution of...
 
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynypptAerodynamics. flippatterncn5tm5ttnj6nmnynyppt
Aerodynamics. flippatterncn5tm5ttnj6nmnynyppt
 
A Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on EarthA Giant Impact Origin for the First Subduction on Earth
A Giant Impact Origin for the First Subduction on Earth
 
SAMPLING.pptx for analystical chemistry sample techniques
SAMPLING.pptx for analystical chemistry sample techniquesSAMPLING.pptx for analystical chemistry sample techniques
SAMPLING.pptx for analystical chemistry sample techniques
 
National Biodiversity protection initiatives and Convention on Biological Di...
National Biodiversity protection initiatives and  Convention on Biological Di...National Biodiversity protection initiatives and  Convention on Biological Di...
National Biodiversity protection initiatives and Convention on Biological Di...
 

Been Kim - Interpretable machine learning, Nov 2015

  • 1. Interactive and Interpretable Machine Learning Models for Human Machine Collaboration Been Kim Nov 2015
  • 2. Vision Harness the relative strength of humans and machine learning models 2 Human Machine Learning Models http://blogs.teradata.com/
  • 3. Research objectives Develop machine learning models inspired by how humans think that can… 3 Human Machine Learning Models http://blogs.teradata.com/
  • 4. Go#here# 4 Human Machine Learning Models infer decisions of humans Develop machine learning models inspired by how humans think that can… http://blogs.teradata.com/ Research objectives
  • 5. 5 Human Machine Learning Models make sense to humansinfer decisions of humans Develop machine learning models inspired by how humans think that can… http://blogs.teradata.com/ Research objectives
  • 6. 6 Human Machine Learning Models infer decisions of humans interact with humans make sense to humans Develop machine learning models inspired by how humans think that can… http://blogs.teradata.com/ Research objectives
  • 7. Develop machine learning models inspired by how humans think that can… 1. Infer human team decisions from team planning conversation 2. Communication from machine to human: provide intuitive explanations 3. Communication from human to machine: incorporate feedback Go#here# infer decisions of humans make sense to humans interact with humans 7 Research objectives
  • 8. Road map 8 2. Communication from machine to human: provide intuitive explanations 3. Communication from human to machine: incorporate feedback make sense to humans interact with humans 1. Infer human team decisions from team planning conversation Go#here# infer decisions of humans
  • 9. Road map 1. Infer human team decisions from team planning conversation 9 2. Communication from machine to human: provide intuitive explanations 3. Communication from human to machine: incorporate feedback infer decisions of humans make sense to humans interact with humans Go#here#
  • 10. • Human’s tactical decision is based on exemplar-based reasoning (matching and prototyping) [Cohen 96, Newell 72] • Skilled fire fighters use recognition-primed decision making — a situation is matched to typical cases [Klein 89] • Machines can better support peoples’ decision-making by representing data in the same way Mirror the way humans think 10
  • 11. Case-based reasoning and interpretable models 11 Case-based reasoning • Applied to various applications thanks to its intuitive power [Aamodt 94, Slade 91, Bekkerman 06] Limitations • Always require labels (supervised) • Does not scale to complex problems • Does not leverage global patterns of data Interpretable models • Decision tree [De`ath 00] • Sparse linear classifiers [Tibshirani 96, Ustun 14] • Prototype-based [Graf 09] Limitations • Sparsity is not enough [Freitas 14] • Linear models or supervised
  • 12. Our approach: Bayesian Case Model (BCM) * Bayesian generative models Case-based reasoning Bayesian Case Model (BCM) • Leverage the power of examples (prototypes) and subspaces (hot features) to explain machine learning results prototypes subspaces Explain complicated concepts using examples 12 [Kim, Rudin, Shah NIPS 2014]
  • 14. • A general framework for Bayesian case-based reasoning • Joint inference on prototypes, subspaces and cluster labels Cluster A Bayesian Case Model (BCM) … Cluster B Cluster C prototypes subspaces cluster labels 14
  • 15. subspaces prototypes Explanations provided by Bayesian Case Model (BCM) salsa sour cream avocado salt, pepper, taco shell, lettuce, oil Taco flour egg water, salt, milk, butter Basic crepe chocolate strawberry pie crust, whipping cream, kirsch, almonds Chocolate berry tart Cluster A Cluster B Cluster C 15
  • 16. Prototype Quintessential observation that best represents the cluster Subspace sets of important features in characterizing clusters • A general framework for Bayesian case-based reasoning • Joint inference on cluster labels, prototypes and subspaces Bayesian Case Model (BCM) salsa sour cream avocado Taco Explain Cluster A 1. clustering 2. learning explanation prototypes subspacescluster labels 16
  • 17. It is a crepe, since it has flour and egg. It is inspired by Mexican food, because it has avocado, salsa and sour cream. Cluster labels: • Admixture model for modeling the underlying distributions Cluster A Cluster B Cluster C = [A, B, A]mexican_crepe Bayesian Case Model (BCM) 1. Clustering part 17
  • 18. It is a crepe, since it has flour and egg. It is sweet crepe that is like chocolate and berry dessert. • Admixture model for modeling the underlying distributions Cluster A Cluster B Cluster C = [B, C, C]chocolate_crepe Bayesian Case Model (BCM) 1. Clustering part 18
  • 19. • Cluster distribution + supervised classification methods can be used for evaluating the clustering performance[1] • Hyper parameter can be used to control how many different cluster labels within one data point The concentration parameter: Cluster distribution of the data point [1] D. Blei, A. Ng, M. Jordan 2003 Bayesian Case Model (BCM) 1. Clustering part 19
  • 20. Subspaces binary variable 1 for important features • Each cluster is characterized by a prototype and subspaces Prototype Bayesian Case Model (BCM) 2. Learning explanation part 20
  • 21. A prototype is an actual data point that exists in the dataset Prototype Bayesian Case Model (BCM) 2. Learning explanation part • prototype: quintessential observation that best represents the cluster 21
  • 22. Bayesian Case Model (BCM) 2. Learning explanation part • subspace: sets of important features in characterizing clusters Subspaces binary variable 1 for important features 22
  • 23. Subspaces binary variable 1 for important features • subspace: sets of important features in characterizing clusters Any similarity measure can be used. For example, using any loss function: The feature j of cluster s is an important feature (i.e., subspace) The value of feature j is identical to the value of the prototype of clusters Bayesian Case Model (BCM) 2. Learning explanation part 23
  • 24. Results Challenges of interpretable models 1. Do the learned prototypes and subspaces make sense? 2. Are we sacrificing performance for the interpretability? 3. Do learned prototypes and subspaces help humans’ understanding? 24
  • 25. Data from computer cooking contest: liris/cnrs.fr/ccc/ccc2014 • Unsupervised clustering on a subset of recipe data 1. Do the learned prototypes and subspaces make sense? BCM on recipe data 25 sesam e
  • 26. 1. Do the learned prototypes and subspaces make sense? BCM on digit data http://www.cs.nyu.edu/~roweis/data.html 26
  • 27. Learned cluster D Gibbs sampling iteration 1. Do the learned prototypes and subspaces make sense? BCM on digit data 27
  • 28. 2. Are we sacrificing anything for the interpretability? Maintain accuracy Handdigit dataset 20Newsgroups dataset Sensitivity Analysis BCM BCM 28
  • 29. 2. Are we sacrificing anything for the interpretability? Joint inference on prototypes, subspaces and cluster labels is the key Posterior distribution Level set Another solution that clusters data equally well and has better interpretability —- BCM gives higher score for this solution One solution that clusters data well 29
  • 30. Collapsed Gibbs sampling for inference • Observed to converge quickly in admixture models • Integrating out and for efficient inference 30 [Kim, Rudin, Shah NIPS 2014]
  • 31. 3. Does the model make sense to humans? Objective measure of human understanding Accuracy of human classifier a new data point to be classified • Participant’s task is to assign the ingredients of a specific dish (a new data point) to a cluster • Each cluster is explained using either BCM or LDA 31
  • 32. • 384 classification questions asked to 24 people • Statistically significantly better performance with BCM (85.9% v.s. 71.3%) a new data point to be classified Explanations of clusters Clusters explained using 1. BCM : ingredients of the prototype recipe 2. LDA: representative ingredients of each cluster 3. Does the model make sense to humans? Objective measure of human understanding Accuracy of human classifier 32 [Kim, Rudin, Shah NIPS 2014] sesam e
  • 33. Road map 1. Infer human team decisions from team planning conversation 33 2. Communication from machine to human: provide intuitive explanations 3. Communication from human to machine: incorporate feedback make sense to humans interact with humans Go#here# infer decisions of humans
  • 37. Related work on interactive machine learning • Interact via multiple model parameter settings [Patel 10, Amershi 15] • Design smart interfaces [Amershi 11] and visualization [Chaney 12, Gou 03] • Interact via simplified medium of interaction [Kapoor 10, Ware 01] Prototypes and Subspaces! 37
  • 38. interactive BCM (iBCM) 38 BCM iBCM Double circled nodes represent interacted latent variables — Node that get information from both user feedback and information obtained from data points
  • 39. interactive BCM (iBCM) 39 BCM iBCM Double circled nodes represent interacted latent variables — Node that get information from both user feedback and information obtained from data points
  • 40. interactive BCM (iBCM) internal mechanism 40 3. Listen to Data Key: Balance between what the data indicates and what makes most sense to the user Our approach: Decompose Gibbs sampling steps to 1) adjust the feedback propagation depending on user’s confidence 2) accelerate the inference by rearranging latent variables 2. Propagate Users feedback to accelerate inference 1. Listen to Users
  • 41. User’s workflow with iBCM abstract domain 41 click to change to to click to promote any items to be prototype
  • 42. Experiment procedure 1. Subjects are asked how they want to group items 2. Subjects view results from BCM • Essentially shows one of the optimal clustering 3. Subjects indicate how well the results matched their preferred clustering 4. Subjects interact with iBCM 5. Subjects indicate how well the results matches with what they want 42 Compare 24 participants, 192 questions
  • 43. Experiment results 1. Subjects are asked how they want to group items 2. Subjects view results from BCM • Essentially shows one of the optimal clustering 3. Subjects indicate how well the results matched their preferred clustering 4. Subjects interact with iBCM 5. Subjects indicate how well the results matches with what they want 43 24 participants, 192 questions Participants agreed more strongly that final clusters matched their preferences compared to the initial clusters Wilcoxon signed rank test
  • 44. iBCM for introductory programming education 44 • Why education? • Current teachers’ workflow for creating grading rubric: randomly pick 4-5 assignments and Hodgepodge Grading [Cross 99] • Understanding this variation is important for providing appropriate, tailored feedback to students [Basu13, Huang 13] • What are the challenges? • Extracting right features — OverCode [Glassman 15]
  • 45. iBCM + OverCode system 45submissions from MIT introductory python classes
  • 46. iBCM + OverCode system 46 Select/unselect subspaces (keywords) Promote/demote prototypes
  • 47. iBCM experiment with domain experts results Click here to get a new grouping V.S. Task: Explore the full spectrum of students’ submissions and write down `discovery list’ for a recitation 47
  • 48. d 48
  • 49. Experiment with domain experts results • 48 problems explored by 12 subjects who previously taught introductory python class • participants agreed more strongly to the following compared to BCM ( ) Were more satisfied Better explored the full spectrum of students’ submissions Better identified important features to expand discovery list Important features and prototypes are useful𝑝 < 0.001 49 with iBCM, they… Wilcoxon signed rank test
  • 50. Experiment with domain experts results • 48 problems explored by 12 subjects who previously taught introductory python class • participants agreed more strongly to the following compared to BCM ( ) Were more satisfied Better explored the full spectrum of students’ submissions Better identified important features to expand discovery list Important features and prototypes are useful𝑝 < 0.001 50 with iBCM, they… “[iBCM enabled me to] go in depth as to how students could do” “ [iBCM] is useful with large datasets where brute-force would not be practical.” Wilcoxon signed rank test
  • 51. Summary 51 [Kim, Chacha, Shah AAAI13] [Kim, Chacha, Shah JAIR15] Communication from machine to human: provide intuitive explanations make sense to humans interact with humans [Kim, Rudin, Shah NIPS 2014] [Kim, Glassman, Johnson, Shah submitted*] [Kim, Patel, Rostamizadeh, Shah AAAI 2015] Inspiration: how humans make decisions Approach: case-based Bayesian model Results: provided intuitive explanations while maintaining performance Approach: enable interaction by decomposing sampling inference steps Results: implemented and validated the approach in education domain Communication from human to machine: incorporate feedback
  • 52. miss-classified data Next steps • Interpretability for data exploration: visualization • Domain specific interpretability: learning features that distinguishes clusters • Interactive machine learning for debugging models or hyper parameter explorations predicted: politics Doc id #24 True label: medicine [Kim, Patel, Rostamizadeh, Shah AAAI 2015] 52 [Kim, Doshi-Velez, Shah NIPS 2015]
  • 53. Next steps at AI2 • Extend interpretability for initially uninterpretable features (neural nets) 53 4th grade science exam question
  • 54. Q&A [Kim, Chacha, Shah AAAI13] [Kim, Chacha, Shah JAIR15] Communication from machine to human: provide intuitive explanations make sense to humans interact with humans [Kim, Rudin, Shah NIPS 2014] [Kim, Glassman, Johnson, Shah submitted*] [Kim, Patel, Rostamizadeh, Shah AAAI 2015] Inspiration: how humans make decisions Approach: case-based Bayesian model Results: provided intuitive explanations while maintaining performance Approach: enable interaction by decomposing sampling inference steps Results: implemented and validated the approach in education domain Communication from human to machine: incorporate feedback [Kim, Doshi-Velez, Shah NIPS 2015] AI2 is hiring research interns any time of the year. Shoot me an email if interested! beenk@allenai.org