SlideShare a Scribd company logo
1 of 60
Download to read offline
An Empirical Study of Knowledge Tradeoffs in Case-Based
Reasoning
Devi Ganesan and Sutanu Chakraborti
Department of Computer Science & Engineering
{gdevi, sutanuc}@cse.iitm.ac.in
Indian Institute of Technology Madras
1 / 29
What is Case-Based Reasoning?
Case-Based Reasoning (CBR) is a problem solving paradigm that uses
past experiences to solve new problems.
2 / 29
Experiences → Cases
Experiences are captured as Problem-Solution pairs
3 / 29
Experiences → Cases
Experiences are captured as Problem-Solution pairs
Stored experiences are called Cases
3 / 29
Knowledge Containers 1
in a Case-Based Reasoner
1
Richter, M.M. 1995. The knowledge contained in similarity measures. Invited talk at ICCBR-95, Sesimbra,4 / 29
Knowledge Containers 1
in a Case-Based Reasoner
Unlike most machine learning algorithms, CBR provides a framework for integrating domain
knowledge with data.
1
Richter, M.M. 1995. The knowledge contained in similarity measures. Invited talk at ICCBR-95, Sesimbra,4 / 29
Interplay
Knowledge containers are not independent of each other
5 / 29
Interplay
Problem Solving Cycle
5 / 29
Why is Interplay between Knowledge Containers Important?
Design
Maintenance
Learning and Evolution
6 / 29
Our Contributions
We propose a novel measure to quantify the knowledge tradeoffs between
containers. The proposed measure is evaluated on synthetic and real-world datasets.
7 / 29
Our Contributions
We propose a novel measure to quantify the knowledge tradeoffs between
containers. The proposed measure is evaluated on synthetic and real-world datasets.
The tradeoffs are visually illustrated using parallel coordinate plots.
7 / 29
Our Contributions
We propose a novel measure to quantify the knowledge tradeoffs between
containers. The proposed measure is evaluated on synthetic and real-world datasets.
The tradeoffs are visually illustrated using parallel coordinate plots.
We have also studied the influence of the following two factors on knowledge tradeoffs:
Nature of underlying domain
User demands on solution quality
7 / 29
Proposed Measure
For Quantifying Knowledge Tradeoffs
Footprint Size Reduction
8 / 29
Footprint Set 2
Footprint set is a minimal set of cases that has the same problem solving
ability as the entire case base.
2
Smyth, Barry, and Elizabeth McKenna. ”Footprint-based retrieval.” ICCBR. Vol. 1650. 1999.
9 / 29
Hypothesis 1
Knowledge contained in footprint set is as good as the knowledge
contained in entire case base.
10 / 29
Hypothesis 1
Knowledge contained in footprint set is as good as the knowledge
contained in entire case base.
Hence, we use footprint size to quantify the knowledge contained in case
base.
10 / 29
From CBR Literature
Iglezakis and Roth-Berghofer (2000) discuss the centrality of case base in maintenance
activities and one of their hypotheses is cases are natural crystallization points for the
knowledge in case-based reasoning systems.
11 / 29
From CBR Literature
Iglezakis and Roth-Berghofer (2000) discuss the centrality of case base in maintenance
activities and one of their hypotheses is cases are natural crystallization points for the
knowledge in case-based reasoning systems.
Further, Smyth and McKenna (1998) argue that competence group is a fundamental unit of
competence in a case base.
11 / 29
From CBR Literature
Iglezakis and Roth-Berghofer (2000) discuss the centrality of case base in maintenance
activities and one of their hypotheses is cases are natural crystallization points for the
knowledge in case-based reasoning systems.
Further, Smyth and McKenna (1998) argue that competence group is a fundamental unit of
competence in a case base.
These works support our way of quantifying the knowledge contained in case base by footprint
size.
11 / 29
Hypothesis 2
Any addition of novel knowledge to the Vocabulary/Similarity/Adaptation container will lead
to a reduction in the size of footprint set
12 / 29
Motivation for Hypothesis 2
Solves function in footprint algorithm connects the four containers.
13 / 29
Motivation for Hypothesis 2
Solves function in footprint algorithm connects the four containers.
A case c is said to solve a target problem t iff c can be retrieved and adapted
to solve the target problem.
13 / 29
Motivation for Hypothesis 2
Solves function in footprint algorithm connects the four containers.
13 / 29
Hypotheses 1 and 2 motivate the idea of using reduction in footprint size
as a measure of quantifying the knowledge tradeoffs between containers.
14 / 29
Formulation
Let V , CB, S, R represent the Vocabulary, Case Base, Similarity and Adaptation
containers and |FP(V ,CB,S,R)| be the size of footprint set.
15 / 29
Formulation
Let V , CB, S, R represent the Vocabulary, Case Base, Similarity and Adaptation
containers and |FP(V ,CB,S,R)| be the size of footprint set.
The knowledge added by changing V to V , S to S and R to R can be quantified as
below.
∆Knowledge ≈ |FP(V ,CB,S,R)| − |FP(V ,CB,S ,R )| (1)
15 / 29
Illustration of Tradeoffs on Synthetic Dataset
16 / 29
Illustration of Tradeoffs on Synthetic Dataset
Underlying Domain Theory : D = 6A + 3B + C.
16 / 29
Visualization of Tradeoffs
Parallel Coordinates
17 / 29
Visualization of Tradeoffs
Parallel Coordinates
17 / 29
Knowledge Configurations
18 / 29
Knowledge Configurations
18 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Case Base
19 / 29
Visualization of Tradeoffs
Similarity versus Adaptation
20 / 29
Visualization of Tradeoffs
Similarity versus Adaptation
20 / 29
Visualization of Tradeoffs
Similarity versus Adaptation
20 / 29
Visualization of Tradeoffs
Similarity versus Adaptation
20 / 29
Real World Datasets
We have evaluated the proposed measure on many UCI datasets.
21 / 29
Real World Datasets
We have evaluated the proposed measure on many UCI datasets.
Iris
Boston Housing
Auto-MPG
20 Newsgroup - Relpol and Hardware
21 / 29
Characterization of Domains
CBR is intended to operate over ill-defined domains.
22 / 29
Characterization of Domains
CBR is intended to operate over ill-defined domains.
But, the general CBR paradigm does not place any restrictions on modeling a wide
spectrum of domains.
22 / 29
Characterization of Domains
CBR is intended to operate over ill-defined domains.
But, the general CBR paradigm does not place any restrictions on modeling a wide
spectrum of domains.
We have attempted a quantitative characterization of domains using the proposed
measure.
22 / 29
Characterization of Domains
Simulation
23 / 29
Characterization of Domains
Simulation
Benefit of domain knowledge is defined as the maximum reduction in footprint size.
23 / 29
Characterization of Domains
Impact of User Demands
Results
24 / 29
Impact of User Demands
Results
25 / 29
Summary
Proposed a novel measure to quantify knowledge tradeoffs between CBR containers.
26 / 29
Summary
Proposed a novel measure to quantify knowledge tradeoffs between CBR containers.
Visual illustration of tradeoffs using parallel coordinates.
26 / 29
Summary
Proposed a novel measure to quantify knowledge tradeoffs between CBR containers.
Visual illustration of tradeoffs using parallel coordinates.
Studied the impact of underlying domain and user demands on knowledge tradeoffs.
26 / 29
Summary
Proposed a novel measure to quantify knowledge tradeoffs between CBR containers.
Visual illustration of tradeoffs using parallel coordinates.
Studied the impact of underlying domain and user demands on knowledge tradeoffs.
We expect that our interpretation of footprint cases and knowledge transfers will streamline
and motivate new avenues of cross-container maintenance activities in CBR.
26 / 29
If you find our work to be interesting, please visit our poster in Hall B
during coffee break...
27 / 29
Acknowledgements
Travel Grant
28 / 29
Thank you all for your attention!
29 / 29

More Related Content

What's hot

Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical models
butest
 
B colouring
B colouringB colouring
B colouring
xs76250
 
Chapter 11-modeling financialreportingconcepts
Chapter 11-modeling financialreportingconceptsChapter 11-modeling financialreportingconcepts
Chapter 11-modeling financialreportingconcepts
jps619
 
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
Erlangen Artificial Intelligence & Machine Learning Meetup
 

What's hot (12)

Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
 
Customer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclustCustomer Segmentation with R - Deep Dive into flexclust
Customer Segmentation with R - Deep Dive into flexclust
 
Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical models
 
Financial Networks VI - Correlation Networks
Financial Networks VI - Correlation NetworksFinancial Networks VI - Correlation Networks
Financial Networks VI - Correlation Networks
 
Mit203 analysis and design of algorithms
Mit203  analysis and design of algorithmsMit203  analysis and design of algorithms
Mit203 analysis and design of algorithms
 
B colouring
B colouringB colouring
B colouring
 
Mca 4040 analysis and design of algorithm
Mca 4040  analysis and design of algorithmMca 4040  analysis and design of algorithm
Mca 4040 analysis and design of algorithm
 
Mca 4040 analysis and design of algorithm
Mca 4040  analysis and design of algorithmMca 4040  analysis and design of algorithm
Mca 4040 analysis and design of algorithm
 
Chapter 11-modeling financialreportingconcepts
Chapter 11-modeling financialreportingconceptsChapter 11-modeling financialreportingconcepts
Chapter 11-modeling financialreportingconcepts
 
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
XGBoostLSS - An extension of XGBoost to probabilistic forecasting, Alexander ...
 
Random Features Strengthen Graph Neural Networks
Random Features Strengthen Graph Neural NetworksRandom Features Strengthen Graph Neural Networks
Random Features Strengthen Graph Neural Networks
 
An Abstract Framework for Agent-Based Explanations in AI
An Abstract Framework for Agent-Based Explanations in AIAn Abstract Framework for Agent-Based Explanations in AI
An Abstract Framework for Agent-Based Explanations in AI
 

Similar to An Empirical Study of Knowledge Tradeoffs in Case-Based Reasoning - IJCAI-ECAI-18

Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017 Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
MLconf
 
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
acijjournal
 

Similar to An Empirical Study of Knowledge Tradeoffs in Case-Based Reasoning - IJCAI-ECAI-18 (20)

A Parallel Depth First Search Branch And Bound Algorithm For The Quadratic As...
A Parallel Depth First Search Branch And Bound Algorithm For The Quadratic As...A Parallel Depth First Search Branch And Bound Algorithm For The Quadratic As...
A Parallel Depth First Search Branch And Bound Algorithm For The Quadratic As...
 
main
mainmain
main
 
E1062530
E1062530E1062530
E1062530
 
SVD BASED LATENT SEMANTIC INDEXING WITH USE OF THE GPU COMPUTATIONS
SVD BASED LATENT SEMANTIC INDEXING WITH USE OF THE GPU COMPUTATIONSSVD BASED LATENT SEMANTIC INDEXING WITH USE OF THE GPU COMPUTATIONS
SVD BASED LATENT SEMANTIC INDEXING WITH USE OF THE GPU COMPUTATIONS
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
A MODEL-BASED APPROACH MACHINE LEARNING TO SCALABLE PORTFOLIO SELECTION
A MODEL-BASED APPROACH MACHINE LEARNING TO SCALABLE PORTFOLIO SELECTIONA MODEL-BASED APPROACH MACHINE LEARNING TO SCALABLE PORTFOLIO SELECTION
A MODEL-BASED APPROACH MACHINE LEARNING TO SCALABLE PORTFOLIO SELECTION
 
factorization methods
factorization methodsfactorization methods
factorization methods
 
Srikanta Mishra
Srikanta MishraSrikanta Mishra
Srikanta Mishra
 
FSRM 582 Project
FSRM 582 ProjectFSRM 582 Project
FSRM 582 Project
 
Improve the Performance of Clustering Using Combination of Multiple Clusterin...
Improve the Performance of Clustering Using Combination of Multiple Clusterin...Improve the Performance of Clustering Using Combination of Multiple Clusterin...
Improve the Performance of Clustering Using Combination of Multiple Clusterin...
 
A Novel Approach to Mathematical Concepts in Data Mining
A Novel Approach to Mathematical Concepts in Data MiningA Novel Approach to Mathematical Concepts in Data Mining
A Novel Approach to Mathematical Concepts in Data Mining
 
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
 
CONFLICT-AWARE WEIGHTED BIPARTITE B-MATCHING AND ITS APPLICATION TO E-COMMERCE
CONFLICT-AWARE WEIGHTED BIPARTITE B-MATCHING AND ITS APPLICATION TO E-COMMERCECONFLICT-AWARE WEIGHTED BIPARTITE B-MATCHING AND ITS APPLICATION TO E-COMMERCE
CONFLICT-AWARE WEIGHTED BIPARTITE B-MATCHING AND ITS APPLICATION TO E-COMMERCE
 
ODSC West 2021 – Composition in ML
ODSC West 2021 – Composition in MLODSC West 2021 – Composition in ML
ODSC West 2021 – Composition in ML
 
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
TOPIC EXTRACTION OF CRAWLED DOCUMENTS COLLECTION USING CORRELATED TOPIC MODEL...
 
Design and Optimization of Supply Chain Network with Nonlinear Log-Space Mode...
Design and Optimization of Supply Chain Network with Nonlinear Log-Space Mode...Design and Optimization of Supply Chain Network with Nonlinear Log-Space Mode...
Design and Optimization of Supply Chain Network with Nonlinear Log-Space Mode...
 
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017 Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
Byron Galbraith, Chief Data Scientist, Talla, at MLconf SEA 2017
 
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
 
clustering.pptx
clustering.pptxclustering.pptx
clustering.pptx
 
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
COMPARATIVE PERFORMANCE ANALYSIS OF RNSC AND MCL ALGORITHMS ON POWER-LAW DIST...
 

Recently uploaded

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...Stronger Together: Developing an Organizational Strategy for Accessible Desig...
Stronger Together: Developing an Organizational Strategy for Accessible Desig...
 
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
AI+A11Y 11MAY2024 HYDERBAD GAAD 2024 - HelloA11Y (11 May 2024)
 
How to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cfHow to Check CNIC Information Online with Pakdata cf
How to Check CNIC Information Online with Pakdata cf
 
Modernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using BallerinaModernizing Legacy Systems Using Ballerina
Modernizing Legacy Systems Using Ballerina
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
UiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overviewUiPath manufacturing technology benefits and AI overview
UiPath manufacturing technology benefits and AI overview
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Introduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDMIntroduction to use of FHIR Documents in ABDM
Introduction to use of FHIR Documents in ABDM
 

An Empirical Study of Knowledge Tradeoffs in Case-Based Reasoning - IJCAI-ECAI-18