Department of Business
Information Systems II
Informed Machine Learning for Improved
Similarity Assessment in
Process-Oriented Case-Based Reasoning
Maximilian Hoffmann [1,2] and Ralph Bergmann [1,2]
[1] Department of Business Information Systems II,
University of Trier, Germany
[2] German Research Center for Artificial Intelligence (DFKI),
Branch University of Trier, Germany
Introduction & Motivation
• Deep Learning (DL) components within Case-Based Reasoning
(CBR) applications are gaining popularity
• Both components create a synergy:
• DL provides powerful offline learning capabilities for core CBR tasks,
e.g., similarity assessment
• CBR provides, among others, structured knowledge about the case
representation or the definition of similarity measures
• Many current approaches lack a comprehensive integration of the
CBR-provided knowledge into the DL components, resulting in
unused potential for improved quality and performance
• Informed Machine Learning targets this shortcoming
Goal: Investigate the possibilities of Informed ML for similarity learning
in Process-Oriented CBR (POCBR)
Foundations – NEST Graphs
Source: Bergmann, R., Gil, Y.: Similarity assessment and efficient retrieval of semantic workflows.
Information Systems 40, pp. 115–127 (2014)
Foundations – Similarity
Assessment of Semantic Graphs
• Local-global principle by Richter is used to compute sim(Q, C)
between query graph Q and case graph C:
– Local pairwise similarities between nodes and edges of the same type
are calculated
– Global similarity results from aggregation of local similarities according to
the most similar mapping of nodes and edges
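The local-global principle above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the similarity measure used by the authors: only nodes are considered (edges are omitted), the local measure `local_sim` is a toy function, the global aggregation is a plain mean, and the best mapping is found by exhaustive search, which is only feasible for very small graphs. All function names are hypothetical.

```python
from itertools import permutations


def local_sim(q_node, c_node):
    """Local similarity of two nodes given as (type, label) tuples."""
    q_type, q_label = q_node
    c_type, c_label = c_node
    # Nodes of different types are never compared.
    if q_type != c_type:
        return 0.0
    # Toy local measure (assumption): identical labels score 1.0, others 0.5.
    return 1.0 if q_label == c_label else 0.5


def global_sim(query, case):
    """Mean local similarity under the best injective node mapping."""
    if not query:
        return 1.0
    # Pad the case so every query node can be assigned; padding scores 0.0.
    padded = list(case) + [None] * max(0, len(query) - len(case))
    best = 0.0
    for mapping in permutations(padded, len(query)):
        total = sum(local_sim(q, c) for q, c in zip(query, mapping) if c is not None)
        best = max(best, total / len(query))
    return best


query = [("task", "mix"), ("data", "flour")]
case = [("task", "bake"), ("data", "flour")]
print(global_sim(query, case))
```

The exhaustive loop makes the cost of finding the most similar mapping visible: the number of candidate mappings grows factorially with graph size, which is one motivation for learning the similarity instead.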
Similarity Learning with Graph
Neural Networks (GNNs)
Source: Hoffmann, M., Malburg, L., Klein, P., Bergmann, R. (2020). Using Siamese Graph Neural Networks for Similarity-Based
Retrieval in Process-Oriented Case-Based Reasoning. In: ICCBR 2020, 12311, pp. 229-244. Springer.
[Figure: GEM and GMN architectures]
Similarity Assessment with Informed
ML Methods
Goal: Improve similarity assessment in terms of
higher quality and reduced time effort
Extension | Approach | Knowledge source | Suited for GNN variant
First Extension | Novel tree-based encoding procedure for semantic descriptions | Case representation, domain model | GEM, GMN
Second Extension | Novel constraint-based matching procedure of the GMN | Graph-based similarity measure | GMN
First Extension: Tree-based encoding
• Semantic description is
composed of composite
and atomic parts
• Previous encoding
methods built a sequence
of atomic encodings
• Limitations:
– No encoding of composites
and hierarchical relations
– Sequence processing in
neural networks can be slow
First Extension: Tree-based encoding (2)
• Encoding of semantic
descriptions as tree structures:
– Maintains hierarchical
structure
– Explicitly encodes composite
types
– Enables parallel processing
• Tree structures are processed
by a local GNN:
– Information propagation
between nodes along the
edges
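As a rough illustration of the tree-based encoding, the sketch below turns a nested semantic description into typed tree nodes and parent-child edges, then runs one round of information propagation along those edges. It is a simplification under stated assumptions: the description is modeled as nested Python dicts and lists, node features are one-hot vectors of the node kind, and propagation is an unweighted mean instead of a learned GNN layer. The names `build_tree` and `propagate` are hypothetical.

```python
def build_tree(value, name="root", nodes=None, edges=None, parent=None):
    """Flatten a nested description into tree nodes and (parent, child) edges."""
    if nodes is None:
        nodes, edges = [], []
    index = len(nodes)
    if isinstance(value, dict):
        kind, children = "composite", list(value.items())
    elif isinstance(value, (list, tuple)):
        kind, children = "composite", [(f"{name}[{i}]", v) for i, v in enumerate(value)]
    else:
        kind, children = "atomic", []
    # Composite nodes are encoded explicitly instead of being dropped.
    nodes.append({"kind": kind, "name": name, "value": value if kind == "atomic" else None})
    if parent is not None:
        edges.append((parent, index))
    for child_name, child_value in children:
        build_tree(child_value, child_name, nodes, edges, index)
    return nodes, edges


def propagate(features, edges):
    """One round of mean aggregation over each node and its tree neighbours."""
    neighbours = {i: [] for i in range(len(features))}
    for parent, child in edges:
        neighbours[parent].append(child)
        neighbours[child].append(parent)
    updated = []
    # Every node reads only the old features, so all updates are independent.
    for i, own in enumerate(features):
        group = [own] + [features[j] for j in neighbours[i]]
        updated.append([sum(column) / len(group) for column in zip(*group)])
    return updated


description = {"name": "bake", "duration": 30, "tools": ["oven", "tray"]}
nodes, edges = build_tree(description)
# Assumed initial features: one-hot encoding of the node kind.
features = [[1.0, 0.0] if n["kind"] == "composite" else [0.0, 1.0] for n in nodes]
print(edges)
print(propagate(features, edges))
```

Because each node update depends only on the previous features, all nodes can be updated at once, in contrast to a sequence encoder that has to consume the atomic encodings one step after another.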
Second Extension: Constraint Matching
• Propagation of GMN is inspired by graph matching
algorithms that determine graph similarities
• Propagation between nodes within a graph is extended to
propagation between graphs
• Problem:
– The graph matching algorithm defines which node types are compatible
– The GMN is currently not able to use this information
Second Extension: Constraint Matching (2)
• Integration of compatible types of nodes into the cross-graph
matching component
• Compatibility is expressed by defining a cosine similarity of 0
for incompatible pairs of nodes
• Propagation of vector information between incompatible pairs
of nodes is stopped
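The constraint described above can be sketched as a masked cross-graph similarity. This is a minimal illustration, not the GMN implementation: nodes are (type, vector) pairs, compatibility is assumed to mean identical node types, and messages are similarity-weighted averages with negative similarities clipped to zero. The weighting scheme and all function names are assumptions made for this example.

```python
import math


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def same_type(q_type, c_type):
    """Assumed compatibility rule: only nodes of the same type may match."""
    return q_type == c_type


def masked_similarity(query_nodes, case_nodes, compatible=same_type):
    """Cross-graph cosine similarities, forced to 0.0 for incompatible pairs."""
    return [
        [cosine(qv, cv) if compatible(qt, ct) else 0.0 for ct, cv in case_nodes]
        for qt, qv in query_nodes
    ]


def cross_graph_messages(query_nodes, case_nodes, compatible=same_type):
    """Similarity-weighted average of case vectors for each query node."""
    dim = len(case_nodes[0][1]) if case_nodes else 0
    messages = []
    for row in masked_similarity(query_nodes, case_nodes, compatible):
        weights = [max(s, 0.0) for s in row]  # clipping is an assumption
        total = sum(weights)
        if total == 0.0:
            # No compatible partner: nothing is propagated to this node.
            messages.append([0.0] * dim)
            continue
        messages.append(
            [sum(w * cv[d] for w, (_, cv) in zip(weights, case_nodes)) / total
             for d in range(dim)]
        )
    return messages


query = [("task", [1.0, 0.0]), ("data", [0.0, 1.0])]
case = [("task", [1.0, 0.0]), ("data", [1.0, 0.0])]
# The first query node and the second case node have identical vectors,
# but their similarity is 0.0 because their types are incompatible.
print(masked_similarity(query, case))
print(cross_graph_messages(query, case))
```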
Experimental Evaluation – Setup
• Implementation of our approach in the
open-source CBR system ProCAKE,
using TensorFlow for the DL models
• Comparison of base GEM and GMN with extended variants (nine
different combinations of both extensions) for similarity learning
• Two domains:
• CB-I: 800 cooking workflows (660 training, 60 validation, 80 test)
• CB-II: 609 data mining workflows (509 training, 40 validation, 60 test)
• Hyperparameters are tuned for base models in each domain and
used by all model variants
procake.uni-trier.de
Source: Bergmann, R., Grumbach, L., Malburg, L., Zeyen, C. (2019). ProCAKE: A Process-Oriented Case-Based Reasoning
Framework. In: Workshops Proc. of the 27th Int. Conf. on Case-Based Reasoning (ICCBR 2019), 2567, pp. 156–161. CEUR-WS.org
Experimental Results
• Training time (in milliseconds) and MAE are measured for
both domains (CB-I and CB-II)
• GEM and GMN:
• Subscript stands for extension
• Combinations of extension also examined
• Superscript stands for reuse of layers for extension 1
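For reference, the mean absolute error (MAE) can be computed as sketched below. The sketch assumes that the error is taken between the similarities predicted by a model and the corresponding ground-truth similarities; the function name and the input values are hypothetical and are not results from the evaluation.

```python
def mean_absolute_error(predicted, target):
    """Average absolute difference between predicted and target similarities."""
    if len(predicted) != len(target) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for p, t in zip(predicted, target)) / len(predicted)


# Hypothetical similarity values, for illustration only.
print(mean_absolute_error([0.5, 1.0], [0.0, 1.0]))
```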
Experimental Results (2)
• Quality:
• No significant positive influence of extensions on GEM
• Base model shows best quality for GEM
• For CB-II, quality is even reduced
• GMN shows quality improvements with use of extensions
• Max. decrease in MAE of approx. 10%
• Effects are not consistent across both domains and all variants
Experimental Results (3)
• Training time:
• Positive influence of extensions on GEM (decrease of up to 23%)
• Negative effects also occur (increase of up to 44%)
• GMN shows similar results to GEM
• Max. decrease in training time of approx. 24%
• Effects are not consistent across both domains and all variants
• Some extensions decrease both training time and MAE (e.g., GMN_1^R)
Experimental Results – Discussion
• Effects of extensions on quality or training time are very inconsistent
(dependent on domain and model)
• Careful choice of extensions and tuning of hyperparameters
is necessary
• Individual optimization of extended variants might further improve the
shown results
Conclusion & Future Work
• Presented extensions introduce a form of Informed ML for
DL models in POCBR
• Extended GNN models are capable of reducing prediction errors
and training time
• Whether a benefit can be achieved depends on the
target domain and the underlying model architecture
• Future work:
– More in-depth analysis of Informed ML in the context of CBR with
challenges, opportunities, and guidelines
– Examination and evaluation of Informed ML methods for other
phases of the CBR cycle such as the reuse phase
Contact
Maximilian Hoffmann
Business Information Systems II
University of Trier, Germany
German Research Center for Artificial
Intelligence (DFKI),
Branch University of Trier, Germany
hoffmannm@uni-trier.de
maximilian.hoffmann@dfki.de
procake.uni-trier.de

Presentation Vikram Lander by Vedansh Gupta.pptx
 

Informed Machine Learning for Improved Similarity Assessment in Process-Oriented Case-Based Reasoning

  • 1. Department of Business Information Systems II
      Informed Machine Learning for Improved Similarity Assessment in Process-Oriented Case-Based Reasoning
      Maximilian Hoffmann (1,2) and Ralph Bergmann (1,2)
      (1) Department of Business Information Systems II, University of Trier, Germany
      (2) German Research Center for Artificial Intelligence (DFKI), Branch University of Trier, Germany
  • 2. Introduction & Motivation
      • Deep Learning (DL) components within Case-Based Reasoning (CBR) applications are gaining popularity
      • Both components create a synergy:
          • DL provides powerful offline learning capabilities for core CBR tasks, e.g., similarity assessment
          • CBR provides, among others, structured knowledge about the case representation or the definition of similarity measures
      • Many current approaches lack a comprehensive integration of the CBR-provided knowledge into the DL components, resulting in unused potential for improved quality and performance
      • Informed Machine Learning targets this shortcoming
      Goal: Investigate the possibilities of Informed ML for similarity learning in Process-Oriented CBR (POCBR)
  • 3. Foundations – NEST Graphs
      Source: Bergmann, R., Gil, Y.: Similarity assessment and efficient retrieval of semantic workflows. Information Systems 40, pp. 115–127 (2014)
  • 4. Foundations – Similarity Assessment of Semantic Graphs
      • Local-global principle by Richter is used to compute sim(Q, C) between query graph Q and case graph C:
          – Local pairwise similarities between nodes and edges of the same type are calculated
          – Global similarity results from aggregation of local similarities according to the most similar mapping of nodes and edges
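The local-global principle on slide 4 can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the node dictionaries, the `local_sim` measure, and the brute-force search over mappings are hypothetical and not the measure used in the slides. Edges are left out for brevity, and the global value is taken as the mean of the local similarities under the best injective mapping.

```python
from itertools import permutations


def local_sim(q_node, c_node):
    """Hypothetical local measure: 0 across types, label equality within a type."""
    if c_node is None or q_node["type"] != c_node["type"]:
        return 0.0
    return 1.0 if q_node["label"] == c_node["label"] else 0.5


def global_sim(query_nodes, case_nodes):
    """Mean local similarity under the best injective node mapping (brute force)."""
    if not query_nodes:
        return 1.0
    # Pad with None so that every query node can be assigned, possibly to nothing.
    missing = max(0, len(query_nodes) - len(case_nodes))
    padded = list(case_nodes) + [None] * missing
    best = 0.0
    for assignment in permutations(padded, len(query_nodes)):
        total = sum(local_sim(q, c) for q, c in zip(query_nodes, assignment))
        best = max(best, total / len(query_nodes))
    return best


# Illustrative data only, not taken from the evaluated case bases.
query = [{"type": "task", "label": "boil"}, {"type": "data", "label": "water"}]
case = [
    {"type": "data", "label": "water"},
    {"type": "task", "label": "boil"},
    {"type": "task", "label": "fry"},
]
similarity = global_sim(query, case)
```

Enumerating all mappings grows factorially with the graph size, so this only works for toy graphs; it is meant to show why learning the similarity with a neural network is attractive.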
  • 5. Similarity Learning with Graph Neural Networks (GNNs)
      [Figure labels: GEM, GMN]
      Source: Hoffmann, M., Malburg, L., Klein, P., Bergmann, R. (2020). Using Siamese Graph Neural Networks for Similarity-Based Retrieval in Process-Oriented Case-Based Reasoning. In: ICCBR 2020, 12311, pp. 229-244. Springer.
  • 6. Similarity Assessment with Informed ML Methods
      Goal: Improving similarity assessment by means of increased quality and reduced time effort
      • First Extension
          – Approach: Novel tree-based encoding procedure for semantic descriptions
          – Knowledge source: Case representation, domain model
          – Suited for GNN variant: GEM, GMN
      • Second Extension
          – Approach: Novel constraint-based matching procedure of the GMN
          – Knowledge source: Graph-based similarity measure
          – Suited for GNN variant: GMN
  • 7. First Extension: Tree-based encoding
      • Semantic description is composed of composite and atomic parts
      • Previous encoding methods built a sequence of atomic encodings
      • Limitations:
          – No encoding of composites and hierarchical relations
          – Sequence processing in neural networks can be slow
  • 8. First Extension: Tree-based encoding (2)
      • Encoding of semantic descriptions as tree structures:
          – Maintains hierarchical structure
          – Explicitly encodes composite types
          – Enables parallel processing
      • Tree structures are processed by a local GNN:
          – Information propagation between nodes along the edges
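The tree-based encoding on slide 8 might look roughly like the sketch below. It assumes that a semantic description is available as nested Python dictionaries and lists, which is a hypothetical representation chosen for illustration. `build_tree` keeps composite parts as explicit nodes, and `propagate` performs one round of plain mean aggregation along the edges as a stand-in for the local GNN, whose actual layers are not described in the slides.

```python
def build_tree(value, name="root", nodes=None, edges=None, parent=None):
    """Turn a nested description into node and (parent, child) edge lists."""
    if nodes is None:
        nodes, edges = [], []
    index = len(nodes)
    composite = isinstance(value, (dict, list))
    nodes.append({
        "name": name,
        "kind": "composite" if composite else "atomic",
        "value": None if composite else value,
    })
    if parent is not None:
        edges.append((parent, index))
    if isinstance(value, dict):
        for key, child in value.items():
            build_tree(child, key, nodes, edges, index)
    elif isinstance(value, list):
        for position, child in enumerate(value):
            build_tree(child, f"{name}[{position}]", nodes, edges, index)
    return nodes, edges


def propagate(states, edges):
    """One round of mean aggregation over the undirected tree edges."""
    neighbours = {i: [] for i in range(len(states))}
    for parent, child in edges:
        neighbours[parent].append(child)
        neighbours[child].append(parent)
    return [
        (states[i] + sum(states[j] for j in neighbours[i])) / (1 + len(neighbours[i]))
        for i in range(len(states))
    ]


# Hypothetical semantic description of a cooking task.
description = {"name": "bake", "duration": 30, "tools": ["oven", "tray"]}
nodes, edges = build_tree(description)
# Toy scalar states: 1.0 for composite nodes, 0.0 for atomic ones.
states = [1.0 if node["kind"] == "composite" else 0.0 for node in nodes]
updated = propagate(states, edges)
```

Because each node only reads its direct neighbours, all node updates within a round are independent of each other, which is what makes the tree form easier to parallelize than a sequence.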
  • 9. Second Extension: Constraint Matching
      • Propagation of GMN is inspired by graph matching algorithms that determine graph similarities
      • Propagation between nodes within a graph is extended to propagation between graphs
      • Problems:
          – Graph matching algorithm defines compatible types of nodes
          – GMN is currently not able to use this information
  • 10. Second Extension: Constraint Matching (2)
      • Integration of compatible types of nodes into the cross-graph matching component
      • Compatibility is expressed by defining a cosine similarity of 0 for incompatible pairs of nodes
      • Propagation of vector information between incompatible pairs of nodes is stopped
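The constraint described on slide 10 can be sketched in a few lines. The cosine similarity of 0 for incompatible pairs comes from the slide. The rest is an assumption for illustration: compatibility is reduced to equal node types, and propagation is stopped by giving incompatible pairs an attention weight of exactly 0 in a masked softmax. The function names are hypothetical and this is not the TensorFlow implementation in ProCAKE.

```python
import math


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def constrained_attention(h_q, types_q, h_c, types_c):
    """Cross-graph similarities and attention weights under a type constraint."""
    sims, attention = [], []
    for u, type_u in zip(h_q, types_q):
        compatible = [type_u == type_v for type_v in types_c]
        # Incompatible pairs get a cosine similarity of 0, as stated on the slide.
        row = [cosine(u, v) if ok else 0.0 for v, ok in zip(h_c, compatible)]
        # Assumption: masked softmax, so incompatible pairs carry no information.
        weights = [math.exp(s) if ok else 0.0 for s, ok in zip(row, compatible)]
        total = sum(weights)
        attention.append([w / total if total > 0 else 0.0 for w in weights])
        sims.append(row)
    return sims, attention


# Toy node embeddings and types, chosen only for illustration.
h_query = [[1.0, 0.0], [0.0, 1.0]]
t_query = ["task", "data"]
h_case = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
t_case = ["task", "data", "data"]
sims, attention = constrained_attention(h_query, t_query, h_case, t_case)
```

Note that setting the similarity to 0 alone would not stop propagation under a standard softmax, since exp(0) is still positive; the explicit mask is one way to enforce it.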
  • 11. Experimental Evaluation – Setup
      • Implementation of our approach in the open-source CBR system ProCAKE (procake.uni-trier.de) with use of TensorFlow DL models
      • Comparison of base GEM and GMN with extended variants (nine different combinations of both extensions) for similarity learning
      • Two domains:
          • CB-I: 800 cooking workflows (660 training, 60 validation, 80 test)
          • CB-II: 609 data mining workflows (509 training, 40 validation, 60 test)
      • Hyperparameters are tuned for base models in each domain and used by all model variants
      Source: Bergmann, R., Grumbach, L., Malburg, L., Zeyen, C. (2019). ProCAKE: A Process-Oriented Case-Based Reasoning Framework. In: Workshops Proc. of the 27th Int. Conf. on Case-Based Reasoning (ICCBR 2019), 2567, pp. 156–161. CEUR-WS.org
  • 12. Experimental Results
      • Training time (in milliseconds) and MAE are measured for both domains (CB-I and CB-II)
      • GEM and GMN:
          • Subscript stands for the extension
          • Combinations of extensions are also examined
          • Superscript stands for reuse of layers for extension 1
  • 13. Experimental Results (2)
      • Quality:
          • No significant positive influence of extensions on GEM
          • Base model shows best quality for GEM
          • For CB-II, quality is even reduced
          • GMN shows quality improvements with use of extensions
          • Max. decrease in MAE of approx. 10%
          • Effects are not consistent across both domains and all variants
  • 14. Experimental Results (3)
      • Training time:
          • Positive influence of extensions on GEM (decrease of up to 23%)
          • Also negative effects occur (increase of up to 44%)
          • GMN shows similar results as GEM
          • Max. decrease in training time of approx. 24%
          • Effects are not consistent across both domains and all variants
          • Some extensions decrease training time and MAE (e.g., GMN with subscript 1 and superscript R)
  • 15. Experimental Results – Discussion
      • Effects of extensions on quality or training time are very inconsistent (dependent on domain and model)
      • Careful choice of extensions and tuning of hyperparameters are necessary
      • Individual optimization of extended variants might further improve the shown results
  • 16. Conclusion & Future Work
      • Presented extensions introduce a form of Informed ML for DL models in POCBR
      • Extended GNN models are capable of reducing prediction errors and training time
      • Whether a benefit can be achieved depends on the target domain and the underlying model architecture
      • Future work:
          – More in-depth analysis of Informed ML in the context of CBR with challenges, opportunities, and guidelines
          – Examination and evaluation of Informed ML methods for other phases of the CBR cycle such as the reuse phase
  • 17. Contact
      Maximilian Hoffmann
      Business Information Systems II, University of Trier, Germany
      German Research Center for Artificial Intelligence (DFKI), Branch University of Trier, Germany
      hoffmannm@uni-trier.de
      maximilian.hoffmann@dfki.de
      procake.uni-trier.de