SlideShare a Scribd company logo
1 of 16
Download to read offline
Department of Business
Information Systems II
Using Siamese Graph Neural Networks for
Similarity-Based Retrieval in Process-Oriented
Case-Based Reasoning
Maximilian Hoffmann, Lukas Malburg, Patrick Klein,
and Ralph Bergmann
Department of Business Information Systems II
University of Trier, Germany
www.wi2.uni-trier.de
hoffmannm@uni-trier.de
Funded by (BE 1373/3-3 and 375342983)
Department of Business
Information Systems II
Outline
• Motivation
• Related Work
• Approach for Learning Similarities of Workflow Graphs via Siamese
Graph Neural Network (GNNs)
• Experimental Evaluation
• Conclusion and Future Work
- 2 -
Department of Business
Information Systems II
Motivation
• Semantic graphs as cases in Process-Oriented CBR:
– Similarity assessment: kind of inexact sub-graph matching
– Similarity considers structure of nodes and edges and their semantic annotations
– High complexity leads to long retrieval times
• Two-phase MAC/FAC retrieval
– MAC: similarity measure for a fast pre-selection
– FAC: graph matching is applied
– Existing automatically-learned POCBR MAC/FAC approaches use embeddings …
• Advantage: Usage of fast and simple vector similarity measures
– … but complex semantics are handled insufficiently
• No consideration of graph structure and semantic annotations
• Decreased utility for rather complex domains
• Advantages of embedding representation should be adapted for other
approaches
- 3 -
Goal: Similarity assessment of complex graphs via siamese GNNs
Department of Business
Information Systems II
Related Work
1. Embedding-based approach by Klein et al. (2019)
• Automatically-learned low-dimensional embedding vectors for graph nodes
• Similarity of two graphs given by aggregating node vectors and applying a
vector similarity measure
• Very fast method, but lacks proper integration of graph structure and semantic
annotations into embedding vectors
2. Cluster-based approach by Müller and Bergmann (2014)
• based on the cluster structure of the case base
• finding clusters that are similar to the query
• no additional modeling effort
• not reaching performance of the feature-based approach
3. Domain specific feature-based case representation by Bergmann and
Stromer (2013)
• simplified version of native representation
• similarity computation only uses feature representation and not the graph
• requires additional modeling effort (manually modeled)
- 4 -
Department of Business
Information Systems II
Graph Similarity Assessment with
Siamese GNNs
• Challenges:
1. Transforming all relevant information of semantic graphs into a
format, interpretable by neural networks
2. Finding a siamese GNN that is capable of producing meaningful
graph similarities and short retrieval times
• Contributions:
– Encoding method of semantic graphs
• Supports complex semantic descriptions, including nested entries
• Supports the complete graph structure with all nodes and edges
– Two siamese GNN architectures by Li et al. (2019)
• Different levels of computational complexity
• Modified for retrieving semantic graphs
- 5 -
Department of Business
Information Systems II
Encoding Semantic Graphs for
Similarity Learning
• Composition of node and edge encodings:
– Encoding of node and edge type
– Encoding of semantic annotations and ProCAKE data types such
as key-value pairs, lists, numerics, strings, taxonomies or sets.
• Sequence of vectors can be transformed to a matrix
• Matrix is processable by neural networks
- 6 -
Department of Business
Information Systems II
Graph Embedding Model (GEM)
- 7 -
• Encoder:
− Transforms raw encoded data
into node and edge embeddings
− Usage of Feed-Forward- and
Recurrent Neural Networks
• Propagation Layer:
− Propagates embeddings of con-
nected nodes in both graphs
− Captures information on the
neighborhood of individual nodes
• Aggregator:
− Merges final node embeddings
to a whole-graph embedding
− Pairwise vector similarity
Complexity O(n)
Department of Business
Information Systems II
Graph Matching Network (GMN)
- 8 -
• Encoder:
− Transforms raw encoded data
into node and edge embeddings
− Usage of Feed-Forward- and
Recurrent Neural Networks
• Propagation Layer:
− Same goal as propagation layer
in GEM but different scope
− Propagates information across
graphs via attention mechanism
• Aggregator:
− Merges final node embeddings
to a whole-graph embedding
− Pairwise similarity using a Feed-
Forward Neural Network
Complexity O(n²)
Department of Business
Information Systems II
Graph Retrieval using GEM or GMN
• Retrieval utilizes one of the neural networks (GEM/GMN):
– Input data: Encoded semantic graphs
– Output data: pairwise graph similarities
– Prediction of the similarity of each case with the query
– K-most similar cases are determined
– Both neural networks are also applicable in a MAC/FAC setup
• Implementation uses ProCAKE and Tensorflow
- 9 -
(https://procake.uni-trier.de)
Department of Business
Information Systems II
Experimental Evaluation
• Setup:
– Comparison of GEM and GMN to the embedding-based retriever (EBR)
[1] and the feature-based retriever (FBR) [2] in two experiments
1. MAC/FAC retrieval with different filter sizes and values of k (similar to [1])
2. Approximation of the A-Star retriever (A*R) by Bergmann and Gil (2014) [3]
– Experiments for two workflow domains:
• CB-I: Simple cooking recipes (680 training and 120 testing cases)
• CB-II: Complex data mining workflows (529 training and 80 testing cases)
– Examination of retrieval quality and performance
• Hypotheses:
– H1: Using GEM and GMN as a MAC retriever of a MAC/FAC retrieval
leads to better retrieval results than using EBR as MAC retriever.
– H2: The GMN retriever is able to approximate the ground-truth graph
similarities better than A*R, using parameter settings such that the
retrieval time of both retrievers is comparable.
- 10 -
Department of Business
Information Systems II
Experimental Results (MAC/FAC)
• CB-I:
– FBR has the best qualities and GEM and EBR are the fastest retrievers
– GEM and GMN are not able to outperform EBR
• CB-II:
– FBR still performs very well, GEM and EBR are still the fastest retrievers
– GEM and GMN now outperform EBR
• H1 is partly confirmed: Rejected for CB-I and accepted for CB-II
- 11 -
Department of Business
Information Systems II
Experimental Results (A-Star
Approximation)
• CB-I: GMN has lowest MAE
and A*R highest correctness
• CB-II: GMN has lowest MAE
and highest correctness
• H2: Clearly accepted for CB-II
and partly accepted for CB-I
- 12 -
Department of Business
Information Systems II
Conclusion and Future Work
Conclusion
• GEM and GMN show high potential in graph retrieval scenarios, especially in
more complex domains
– GEM as a MAC measure outperforms other machine-learned approaches
– GMN as a FAC measure outperforms an A-Star-based graph matching measure
Future Work
• Optimization of the presented neural networks
– Usage of a differentiable ranking loss function for GEM
– Optimized encoding scheme for more data types and other graph structures
• Evaluation on other domains, e.g., argument graphs, and other types of
complex graph similarity measures
• Investigation of suitable methods for explaining the results of the neural
networks in the context of Explainable Artificial Intelligence (e.g., see XAI
workshop of ICCBR 2019)
- 13 -
Department of Business
Information Systems II - 14 -
Thank you for your
attention!
Department of Business
Information Systems II
Department of Business
Information Systems II
References
[1] Klein, P., Malburg, L., Bergmann, R.: Learning Workflow Embeddings to Improve
the Performance of Similarity-Based Retrieval for Process-Oriented Case-Based
Reasoning. In: Case-Based Reasoning Research and Development: 27th Inter-
Conference, ICCBR 2019, Germany, pp. 188–203. Springer. (2019)
[2] Bergmann, R., Stromer, A.: MAC/FAC Retrieval of Semantic Workflows. In:
Boonthum-Denecke, C., Youngblood, G.M. (eds.) Proceedings of the Twenty-
Sixth International Florida Artificial Intelligence Research Society Conference,
FLAIRS 2013. AAAI Press (2013)
[3] Bergmann, R., Gil, Y.: Similarity assessment and efficient retrieval of semantic
workflows. Information Systems 40, pp. 115–127 (2014)
[4] Li, Y., Gu, C., Dullien, T., Vinyals, O., Kohli, P.: Graph Matching Networks for
Learning the Similarity of Graph Structured Objects. In: Chaudhuri, K.,
Salakhutdinov, R. (eds.) Proc. of the 36th Int. Conf. on Machine Learning, ICML
2019, USA. Proc. of Machine Learning Research, vol. 97, pp. 3835–3845. PMLR
- 16 -

More Related Content

What's hot

Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...lauratoni4
 
Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite eSAT Journals
 
Efficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multipliersEfficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multiplierseSAT Publishing House
 
Learning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLLearning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLlauratoni4
 
Interior Dual Optimization Software Engineering with Applications in BCS Elec...
Interior Dual Optimization Software Engineering with Applications in BCS Elec...Interior Dual Optimization Software Engineering with Applications in BCS Elec...
Interior Dual Optimization Software Engineering with Applications in BCS Elec...BRNSS Publication Hub
 
Locally densest subgraph discovery
Locally densest subgraph discoveryLocally densest subgraph discovery
Locally densest subgraph discoveryaftab alam
 
Parallel Machine Learning
Parallel Machine LearningParallel Machine Learning
Parallel Machine LearningJanani C
 
A Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphsA Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphsaftab alam
 
Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...Databricks
 
Applying Neural Networks and Analogous Estimating to Determine the Project Bu...
Applying Neural Networks and Analogous Estimating to Determine the Project Bu...Applying Neural Networks and Analogous Estimating to Determine the Project Bu...
Applying Neural Networks and Analogous Estimating to Determine the Project Bu...Ricardo Viana Vargas
 
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...lauratoni4
 
Transformation and dynamic visualization of images from computer through an F...
Transformation and dynamic visualization of images from computer through an F...Transformation and dynamic visualization of images from computer through an F...
Transformation and dynamic visualization of images from computer through an F...TELKOMNIKA JOURNAL
 
A mathematical model and a heuristic memory allocation problem
A mathematical model and a heuristic memory allocation problemA mathematical model and a heuristic memory allocation problem
A mathematical model and a heuristic memory allocation problemDiego Montero
 
An Adaptive Load Balancing Middleware for Distributed Simulation
An Adaptive Load Balancing Middleware for Distributed SimulationAn Adaptive Load Balancing Middleware for Distributed Simulation
An Adaptive Load Balancing Middleware for Distributed SimulationGabriele D'Angelo
 
Distributed graph summarization
Distributed graph summarizationDistributed graph summarization
Distributed graph summarizationaftab alam
 
Integration of a Predictive, Continuous Time Neural Network into Securities M...
Integration of a Predictive, Continuous Time Neural Network into Securities M...Integration of a Predictive, Continuous Time Neural Network into Securities M...
Integration of a Predictive, Continuous Time Neural Network into Securities M...Chris Kirk, PhD, FIAP
 

What's hot (18)

Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
 
Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite Efficient implementation of bit parallel finite
Efficient implementation of bit parallel finite
 
Efficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multipliersEfficient implementation of bit parallel finite field multipliers
Efficient implementation of bit parallel finite field multipliers
 
Learning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RLLearning Graph Representation for Data-Efficiency RL
Learning Graph Representation for Data-Efficiency RL
 
Interior Dual Optimization Software Engineering with Applications in BCS Elec...
Interior Dual Optimization Software Engineering with Applications in BCS Elec...Interior Dual Optimization Software Engineering with Applications in BCS Elec...
Interior Dual Optimization Software Engineering with Applications in BCS Elec...
 
Locally densest subgraph discovery
Locally densest subgraph discoveryLocally densest subgraph discovery
Locally densest subgraph discovery
 
Parallel Machine Learning
Parallel Machine LearningParallel Machine Learning
Parallel Machine Learning
 
A Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphsA Graph Summarization: A Survey | Summarizing and understanding large graphs
A Graph Summarization: A Survey | Summarizing and understanding large graphs
 
HOCSA: AN EFFICIENT DOWNLINK BURST ALLOCATION ALGORITHM TO ACHIEVE HIGH FRAME...
HOCSA: AN EFFICIENT DOWNLINK BURST ALLOCATION ALGORITHM TO ACHIEVE HIGH FRAME...HOCSA: AN EFFICIENT DOWNLINK BURST ALLOCATION ALGORITHM TO ACHIEVE HIGH FRAME...
HOCSA: AN EFFICIENT DOWNLINK BURST ALLOCATION ALGORITHM TO ACHIEVE HIGH FRAME...
 
Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...Recurrent Neural Networks for Recommendations and Personalization with Nick P...
Recurrent Neural Networks for Recommendations and Personalization with Nick P...
 
Applying Neural Networks and Analogous Estimating to Determine the Project Bu...
Applying Neural Networks and Analogous Estimating to Determine the Project Bu...Applying Neural Networks and Analogous Estimating to Determine the Project Bu...
Applying Neural Networks and Analogous Estimating to Determine the Project Bu...
 
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
 
Transformation and dynamic visualization of images from computer through an F...
Transformation and dynamic visualization of images from computer through an F...Transformation and dynamic visualization of images from computer through an F...
Transformation and dynamic visualization of images from computer through an F...
 
A mathematical model and a heuristic memory allocation problem
A mathematical model and a heuristic memory allocation problemA mathematical model and a heuristic memory allocation problem
A mathematical model and a heuristic memory allocation problem
 
An Adaptive Load Balancing Middleware for Distributed Simulation
An Adaptive Load Balancing Middleware for Distributed SimulationAn Adaptive Load Balancing Middleware for Distributed Simulation
An Adaptive Load Balancing Middleware for Distributed Simulation
 
Distributed graph summarization
Distributed graph summarizationDistributed graph summarization
Distributed graph summarization
 
Integration of a Predictive, Continuous Time Neural Network into Securities M...
Integration of a Predictive, Continuous Time Neural Network into Securities M...Integration of a Predictive, Continuous Time Neural Network into Securities M...
Integration of a Predictive, Continuous Time Neural Network into Securities M...
 
Jf3515881595
Jf3515881595Jf3515881595
Jf3515881595
 

Similar to Using Siamese Graph Neural Networks for Similarity-Based Retrieval in Process-Oriented Case-Based Reasoning

RNNs for Recommendations and Personalization
RNNs for Recommendations and PersonalizationRNNs for Recommendations and Personalization
RNNs for Recommendations and PersonalizationNick Pentreath
 
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving SystemsPRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving SystemsNECST Lab @ Politecnico di Milano
 
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...ssuser4b1f48
 
IRJET- Recommendation System based on Graph Database Techniques
IRJET- Recommendation System based on Graph Database TechniquesIRJET- Recommendation System based on Graph Database Techniques
IRJET- Recommendation System based on Graph Database TechniquesIRJET Journal
 
STOCK MARKET PREDICTION USING NEURAL NETWORKS
STOCK MARKET PREDICTION USING NEURAL NETWORKSSTOCK MARKET PREDICTION USING NEURAL NETWORKS
STOCK MARKET PREDICTION USING NEURAL NETWORKSIRJET Journal
 
Supervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking systemSupervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking systemMarsan Ma
 
Parallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingParallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingIRJET Journal
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Predictionmiyurud
 
Deep learning for 3 d point clouds presentation
Deep learning for 3 d point clouds presentationDeep learning for 3 d point clouds presentation
Deep learning for 3 d point clouds presentationVijaylaxmiNagurkar
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender SystemsNick Pentreath
 
IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...
IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...
IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...IRJET Journal
 
[White paper] Maintain-Accurate-Network-Diagrams
[White paper] Maintain-Accurate-Network-Diagrams[White paper] Maintain-Accurate-Network-Diagrams
[White paper] Maintain-Accurate-Network-DiagramsNetBrain Technologies
 
Measuring method complexity of the case management modeling and notation (CMMN)
Measuring method complexity of the case management modeling and notation (CMMN)Measuring method complexity of the case management modeling and notation (CMMN)
Measuring method complexity of the case management modeling and notation (CMMN)Mike Marin
 
Optimal Chain Matrix Multiplication Big Data Perspective
Optimal Chain Matrix Multiplication Big Data PerspectiveOptimal Chain Matrix Multiplication Big Data Perspective
Optimal Chain Matrix Multiplication Big Data Perspectiveপল্লব রায়
 
IRJET- Semantics based Document Clustering
IRJET- Semantics based Document ClusteringIRJET- Semantics based Document Clustering
IRJET- Semantics based Document ClusteringIRJET Journal
 

Similar to Using Siamese Graph Neural Networks for Similarity-Based Retrieval in Process-Oriented Case-Based Reasoning (20)

RNNs for Recommendations and Personalization
RNNs for Recommendations and PersonalizationRNNs for Recommendations and Personalization
RNNs for Recommendations and Personalization
 
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving SystemsPRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems
 
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...
NS-CUK Seminar: S.T.Nguyen, Review on "Do We Really Need Complicated Model Ar...
 
IRJET- Recommendation System based on Graph Database Techniques
IRJET- Recommendation System based on Graph Database TechniquesIRJET- Recommendation System based on Graph Database Techniques
IRJET- Recommendation System based on Graph Database Techniques
 
GRID COMPUTING
GRID COMPUTINGGRID COMPUTING
GRID COMPUTING
 
STOCK MARKET PREDICTION USING NEURAL NETWORKS
STOCK MARKET PREDICTION USING NEURAL NETWORKSSTOCK MARKET PREDICTION USING NEURAL NETWORKS
STOCK MARKET PREDICTION USING NEURAL NETWORKS
 
Supervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking systemSupervised embedding techniques in search ranking system
Supervised embedding techniques in search ranking system
 
Parallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive IndexingParallel KNN for Big Data using Adaptive Indexing
Parallel KNN for Big Data using Adaptive Indexing
 
MLIntro_ADA.pptx
MLIntro_ADA.pptxMLIntro_ADA.pptx
MLIntro_ADA.pptx
 
D04573033
D04573033D04573033
D04573033
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
 
Deep learning for 3 d point clouds presentation
Deep learning for 3 d point clouds presentationDeep learning for 3 d point clouds presentation
Deep learning for 3 d point clouds presentation
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...
IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...
IRJET- Efficient Auto Annotation for Tag and Image based Searching Over Large...
 
[White paper] Maintain-Accurate-Network-Diagrams
[White paper] Maintain-Accurate-Network-Diagrams[White paper] Maintain-Accurate-Network-Diagrams
[White paper] Maintain-Accurate-Network-Diagrams
 
Measuring method complexity of the case management modeling and notation (CMMN)
Measuring method complexity of the case management modeling and notation (CMMN)Measuring method complexity of the case management modeling and notation (CMMN)
Measuring method complexity of the case management modeling and notation (CMMN)
 
Optimal Chain Matrix Multiplication Big Data Perspective
Optimal Chain Matrix Multiplication Big Data PerspectiveOptimal Chain Matrix Multiplication Big Data Perspective
Optimal Chain Matrix Multiplication Big Data Perspective
 
IRJET- Semantics based Document Clustering
IRJET- Semantics based Document ClusteringIRJET- Semantics based Document Clustering
IRJET- Semantics based Document Clustering
 
ICIECA 2014 Paper 05
ICIECA 2014 Paper 05ICIECA 2014 Paper 05
ICIECA 2014 Paper 05
 
Icbai 2018 ver_1
Icbai 2018 ver_1Icbai 2018 ver_1
Icbai 2018 ver_1
 

Recently uploaded

Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 

Recently uploaded (20)

Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 

Using Siamese Graph Neural Networks for Similarity-Based Retrieval in Process-Oriented Case-Based Reasoning

  • 1. Department of Business Information Systems II Using Siamese Graph Neural Networks for Similarity-Based Retrieval in Process-Oriented Case-Based Reasoning Maximilian Hoffmann, Lukas Malburg, Patrick Klein, and Ralph Bergmann Department of Business Information Systems II University of Trier, Germany www.wi2.uni-trier.de hoffmannm@uni-trier.de Funded by (BE 1373/3-3 and 375342983)
  • 2. Department of Business Information Systems II Outline • Motivation • Related Work • Approach for Learning Similarities of Workflow Graphs via Siamese Graph Neural Network (GNNs) • Experimental Evaluation • Conclusion and Future Work - 2 -
  • 3. Department of Business Information Systems II Motivation • Semantic graphs as cases in Process-Oriented CBR: – Similarity assessment: kind of inexact sub-graph matching – Similarity considers structure of nodes and edges and their semantic annotations – High complexity leads to long retrieval times • Two-phase MAC/FAC retrieval – MAC: similarity measure for a fast pre-selection – FAC: graph matching is applied – Existing automatically-learned POCBR MAC/FAC approaches use embeddings … • Advantage: Usage of fast and simple vector similarity measures – … but complex semantics are handled insufficiently • No consideration of graph structure and semantic annotations • Decreased utility for rather complex domains • Advantages of embedding representation should be adapted for other approaches - 3 - Goal: Similarity assessment of complex graphs via siamese GNNs
  • 4. Department of Business Information Systems II Related Work 1. Embedding-based approach by Klein et al. (2019) • Automatically-learned low-dimensional embedding vectors for graph nodes • Similarity of two graphs given by aggregating node vectors and applying a vector similarity measure • Very fast method, but lacks proper integration of graph structure and semantic annotations into embedding vectors 2. Cluster-based approach by Müller and Bergmann (2014) • based on the cluster structure of the case base • finding clusters that are similar to the query • no additional modeling effort • not reaching performance of the feature-based approach 3. Domain specific feature-based case representation by Bergmann and Stromer (2013) • simplified version of native representation • similarity computation only uses feature representation and not the graph • requires additional modeling effort (manually modeled) - 4 -
  • 5. Department of Business Information Systems II Graph Similarity Assessment with Siamese GNNs • Challenges: 1. Transforming all relevant information of semantic graphs into a format, interpretable by neural networks 2. Finding a siamese GNN that is capable of producing meaningful graph similarities and short retrieval times • Contributions: – Encoding method of semantic graphs • Supports complex semantic descriptions, including nested entries • Supports the complete graph structure with all nodes and edges – Two siamese GNN architectures by Li et al. (2019) • Different levels of computational complexity • Modified for retrieving semantic graphs - 5 -
  • 6. Department of Business Information Systems II Encoding Semantic Graphs for Similarity Learning • Composition of node and edge encodings: – Encoding of node and edge type – Encoding of semantic annotations and ProCAKE data types such as key-value pairs, lists, numerics, strings, taxonomies or sets. • Sequence of vectors can be transformed to a matrix • Matrix is processable by neural networks - 6 -
  • 7. Department of Business Information Systems II Graph Embedding Model (GEM) - 7 - • Encoder: − Transforms raw encoded data into node and edge embeddings − Usage of Feed-Forward- and Recurrent Neural Networks • Propagation Layer: − Propagates embeddings of con- nected nodes in both graphs − Captures information on the neighborhood of individual nodes • Aggregator: − Merges final node embeddings to a whole-graph embedding − Pairwise vector similarity Complexity O(n)
  • 8. Department of Business Information Systems II Graph Matching Network (GMN) - 8 - • Encoder: − Transforms raw encoded data into node and edge embeddings − Usage of Feed-Forward- and Recurrent Neural Networks • Propagation Layer: − Same goal as propagation layer in GEM but different scope − Propagates information across graphs via attention mechanism • Aggregator: − Merges final node embeddings to a whole-graph embedding − Pairwise similarity using a Feed- Forward Neural Network Complexity O(n²)
  • 9. Department of Business Information Systems II Graph Retrieval using GEM or GMN • Retrieval utilizes one of the neural networks (GEM/GMN): – Input data: Encoded semantic graphs – Output data: pairwise graph similarities – Prediction of the similarity of each case with the query – K-most similar cases are determined – Both neural networks are also applicable in a MAC/FAC setup • Implementation uses ProCAKE and Tensorflow - 9 - (https://procake.uni-trier.de)
  • 10. Department of Business Information Systems II Experimental Evaluation • Setup: – Comparison of GEM and GMN to the embedding-based retriever (EBR) [1] and the feature-based retriever (FBR) [2] in two experiments 1. MAC/FAC retrieval with different filter sizes and values of k (similar to [1]) 2. Approximation of the A-Star retriever (A*R) by Bergmann and Gil (2014) [3] – Experiments for two workflow domains: • CB-I: Simple cooking recipes (680 training and 120 testing cases) • CB-II: Complex data mining workflows (529 training and 80 testing cases) – Examination of retrieval quality and performance • Hypotheses: – H1: Using GEM and GMN as a MAC retriever of a MAC/FAC retrieval leads to better retrieval results than using EBR as MAC retriever. – H2: The GMN retriever is able to approximate the ground-truth graph similarities better than A*R, using parameter settings such that the retrieval time of both retrievers is comparable. - 10 -
  • 11. Department of Business Information Systems II Experimental Results (MAC/FAC) • CB-I: – FBR has the best qualities and GEM and EBR are the fastest retrievers – GEM and GMN are not able to outperform EBR • CB-II: – FBR still performs very well, GEM and EBR are still the fastest retrievers – GEM and GMN now outperform EBR • H1 is partly confirmed: Rejected for CB-I and accepted for CB-II - 11 -
  • 12. Department of Business Information Systems II Experimental Results (A-Star Approximation) • CB-I: GMN has lowest MAE and A*R highest correctness • CB-II: GMN has lowest MAE and highest correctness • H2: Clearly accepted for CB-II and partly accepted for CB-I - 12 -
  • 13. Department of Business Information Systems II Conclusion and Future Work Conclusion • GEM and GMN show high potential in graph retrieval scenarios, especially in more complex domains – GEM as a MAC measure outperforms other machine-learned approaches – GMN as a FAC measure outperforms an A-Star-based graph matching measure Future Work • Optimization of the presented neural networks – Usage of a differentiable ranking loss function for GEM – Optimized encoding scheme for more data types and other graph structures • Evaluation on other domains, e.g., argument graphs, and other types of complex graph similarity measures • Investigation of suitable methods for explaining the results of the neural networks in the context of Explainable Artificial Intelligence (e.g., see XAI workshop of ICCBR 2019) - 13 -
  • 14. Department of Business Information Systems II - 14 - Thank you for your attention!
  • 16. Department of Business Information Systems II References [1] Klein, P., Malburg, L., Bergmann, R.: Learning Workflow Embeddings to Improve the Performance of Similarity-Based Retrieval for Process-Oriented Case-Based Reasoning. In: Case-Based Reasoning Research and Development: 27th Inter- Conference, ICCBR 2019, Germany, pp. 188–203. Springer. (2019) [2] Bergmann, R., Stromer, A.: MAC/FAC Retrieval of Semantic Workflows. In: Boonthum-Denecke, C., Youngblood, G.M. (eds.) Proceedings of the Twenty- Sixth International Florida Artificial Intelligence Research Society Conference, FLAIRS 2013. AAAI Press (2013) [3] Bergmann, R., Gil, Y.: Similarity assessment and efficient retrieval of semantic workflows. Information Systems 40, pp. 115–127 (2014) [4] Li, Y., Gu, C., Dullien, T., Vinyals, O., Kohli, P.: Graph Matching Networks for Learning the Similarity of Graph Structured Objects. In: Chaudhuri, K., Salakhutdinov, R. (eds.) Proc. of the 36th Int. Conf. on Machine Learning, ICML 2019, USA. Proc. of Machine Learning Research, vol. 97, pp. 3835–3845. PMLR - 16 -