SlideShare a Scribd company logo
Computer Science and Engineering
Lijun Chang
Efficient Subgraph Matching by
Postponing Cartesian Products
Lijun.Chang@unsw.edu.au
The University of New South Wales, Australia
2
Outline
Ø  Introduction & Existing Works
Ø  Challenges of Subgraph Matching
Ø  Our Approach: CFL-Match
v Core-First based Framework
v Compact Path Index (CPI) based Matching
Ø  Experiment
Ø  Conclusion
3
Introduction
Ø  Subgraph Matching
Given a query q and a large data graph G, the problem is
to extract all subgraph isomorphic embeddings of q in G.
4
Introduction
Ø  Subgraph Matching
Given a query q and a large data graph G, the problem is
to extract all subgraph isomorphic embeddings of q in G.
5
Introduction
Ø  Subgraph Matching
Given a query q and a large data graph G, the problem is
to extract all subgraph isomorphic embeddings of q in G.
6
Introduction
Ø  Subgraph Matching
Given a query q and a large data graph G, the problem is
to extract all subgraph isomorphic embeddings of q in G.
7
Introduction
Ø  Applications
§  Protein interaction network analysis
§  Social network analysis
§  Chemical compound search
8
Hardness Result
Ø  Subgraph Isomorphism Testing
Ø  Decide whether there is a subgraph of G that is isomophic to q
Ø  NP-complete
Ø  Enumerating all subgraph embeddings is harder
Ø  This is the problem we study
9
Existing Work
Ø  Ullmann’s algorithm [J.ACM’76]
§  Iteratively maps query vertices one by one, following the input order of
query vertices.
§  Example: Input order could be (u1, u2, u3, u4, u5, u6)
§  Cartesian Products between vertices’ candidates.
Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
Ø  TurboISO [SIGMOD’13]
Ø  BoostISO [VLDB’15]
10
Existing Work
Ø  Ullmann’s algorithm [J.ACM’76]
Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
§  Independently propose to enforce connectivity of the matching order to
reduce Cartesian products caused by disconnected query vertices.
§  QuickSI further removes false-positive candidates by first
processing infrequent query vertices and edges.
§  Connected order could be (u5, u1, u2, u3, u6, u4)
Ø  TurboISO [SIGMOD’13]
Ø  BoostISO [VLDB’15]
11
Existing Work
Ø  Ullmann’s algorithm [J.ACM’76]
Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
Ø  TurboISO [SIGMOD’13]
§  Merge together query vertices with the same neighborhood.
§  Reduces Cartesian product caused by similar query vertices
§  Build a data structure online to facilitate the search process.
Ø  BoostISO [VLDB’15]
12
Existing Work
Ø  Ullmann’s algorithm [J.ACM’76]
Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08]
Ø  TurboISO [SIGMOD’13]
Ø  BoostISO [VLDB’15]
§  Compress a data graph G by merging together similar vertices in G.
§  Develop query-dependent relationship between vertices in G.
It is still challenging for matching large query graphs.
13
Challenges of Subgraph Matching
Matching order of QuickSI and TurboISO : (u1,u2,u3,u4,u5,u6).
Challenge I: Redundant Cartesian Products by Dissimilar Vertices.
Cartesian products:
Ø  100 mappings (v0,v2, v1000+i, v2100 +i) (3 ≤ i ≤ 102) of (u1,u2,u3,u4)
Ø  1000 mappings (v0, vj) (3 ≤ j ≤ 1002) of (u1,u5)
105 - 100 partial mappings
are redundant.
(u1,u2,u5,u3,u4,u6)
14
Challenges of Subgraph Matching
Our Solution : Postpone Cartesian products.
Ø  Decompose q into a dense subgraph and a forest, and
process the dense subgraph first.
Challenge I: Redundant Cartesian Products by Dissimilar Vertices.
15
Challenges of Subgraph Matching
Challenge II: Exponential size of the path-based data structure in
TurboISO.
Ø  TurboISO builds a data structure that materializes all embeddings of
query paths in a data graph
1.  for generating matching order based on estimation of #candidates.
2.  for enumerating subgraph isomorphic embeddings.
Ø  Worst-case space complexity: O(|V(G)||v(q)-1|).
16
Challenges of Subgraph Matching
Challenge II: Exponential size of the path-based data structure in
TurboISO.
Our Solution: Polynomial-size data structure, compact path-index (CPI) .
17
Our Approach
Ø CFL-Match
v A Core-First based Framework
v Compact Path-Index (CPI) based Matching
18
CFL-Match
Ø  A Core-First based Framework
§  Core-Forest Decomposition
Compute the minimal connected subgraph containing all non-tree edges of
q regarding any spanning tree.
§  Forest-Leaf Decomposition
Compute the set of leaf vertices by rooting each tree at its connection vertex.
19
CFL-Match
Ø  A Core-First based Framework
1)  Core-Forest-Leaf Decomposition
2)  CPI Construction
3)  Mapping Extraction
i.  Core-Match
ii.  Forest-Match
iii.  Leaf-Match
•  Categorize leaf nodes according to label
•  Perform combination instead of enumeration among different labels.
20
Auxiliary Data Structure
Ø  Compact Path-Index (CPI)
§  Compactly store candidate embeddings of query spanning trees.
§  Serve for computing an effective matching order.
Ø  CPI Structure
§  Candidate sets
Each query node u has a candidate set u.C.
§  Edge sets
This is an edge between v ∈ u.C and v’ ∈ u’.C for adjacent query
nodes u and u’ in CPI if and only if (v, v’) exists in G.
21
Auxiliary Data Structure
Ø  Compact Path-Index (CPI)
§  Compactly store candidate embeddings of query spanning trees.
§  Serve for computing an effective matching order.
Ø  CPI Structure
§  Example
22
Auxiliary Data Structure
Ø  Soundness of CPI
For every query node u in CPI, if there is an embedding of q in G that
maps u to v, then v must be in u.C.
Given a sound CPI, all embeddings of q in G can be computed by
traversing only the CPI while G is only probed for non-tree edge
checkings.
Ø  It is NP-hard to build a minimum sound CPI.
Ø  Aim to build a small and sound CPI.
Theorem
23
CPI Construction
Ø  General Idea
§  A heuristic approach:
1) u.C is initialized to contain all vertices in G with the same label as u
2) A data vertex v is pruned from u.C ,
if ∃u’ ∈ Nq(u), such that ∄v’ ∈ NG(v) & v’ ∈ u’.C.
Ø  A two-phase CPI construction process:
§  Top-down construction, bottom-up refinement
§  Exploit the pruning power of both directions of every query edge.
§  Construct CPI of O(|E(G)| X |V(q)|) size in O(|E(G)| X |E(q)|) time
24
CPI-based Match
Ø  Compute path-based matching order using CPI
Ø  Estimate #matches for each root-to-leaf path in CPI
Ø  Add paths to the matching order in increasing order regarding #matches
Ø  Traverse CPI to find mappings for query vertices
Ø  Only probe G for non-tree edge validation
(u0,u1,u4,u3,u2, u5, u6, u7, u8, u9, u10)
25
Experiment
Ø  All algorithms are implemented in C++ and run on a machine with
3.2G CPU and 8G RAM.
Ø  Datasets
§  Real Graphs
§  Synthetic Graphs
§  Randomly generate graphs with 100k vertices with average degree 8 and 50
distinct labels.
Ø  Query Graphs
§  Randomly generate by random walk
§  Two Categories:
S: sparse (average degree ≤ 3). N: non-sparse (average degree > 3).
|V| |E| |∑| Degree
HPRD 9460 37081 307 7.8
Yeast 3112 12519 71 8.1
Human 4674 86282 44 36.9
26
Comparing with Existing Techniques
Varying the size of query graph |V(q)|
CFL-Match: our proposed algorithm
27
Effectiveness of Our New Framework
Evaluating our framework
Ø  Match: subgraph matching algorithm with CPI but no query
decomposition.
Ø  CF-Match: only core-forest decomposition with CPI.
Ø  CFL-Match: our best algorithm.
28
Scalability Testing
29
Conclusion
Ø  We proposed a core-first framework for subgraph matching by
postponing Cartesian products
Ø  We proposed a new polynomial-size path-based auxiliary data
structure CPI, and proposed efficient and effective technique for
constructing a small CPI
Ø  We proposed efficient algorithms for subgraph matching based on
the core-first framework and the CPI
Ø  Extensive empirical studies on real and synthetic graphs
demonstrate that our technique outperforms the state-of-the-art
algorithms.
30
Thank you!
Questions?
Lijun.Chang@unsw.edu.au

More Related Content

What's hot

20131216 Stat Journal
20131216 Stat Journal20131216 Stat Journal
20131216 Stat Journal
Med_KU
 
Day 2 examples u6w14
Day 2 examples u6w14Day 2 examples u6w14
Day 2 examples u6w14jchartiersjsd
 
An Improved Optimization Techniques for Parallel Prefix Adder using FPGA
An Improved Optimization Techniques for Parallel Prefix Adder using FPGAAn Improved Optimization Techniques for Parallel Prefix Adder using FPGA
An Improved Optimization Techniques for Parallel Prefix Adder using FPGA
IJMER
 
Lecture 28
Lecture 28Lecture 28
Lecture 28
Shani729
 
Design and implementation of Parallel Prefix Adders using FPGAs
Design and implementation of Parallel Prefix Adders using FPGAsDesign and implementation of Parallel Prefix Adders using FPGAs
Design and implementation of Parallel Prefix Adders using FPGAs
IOSR Journals
 
Kaggle boschコンペ振り返り
Kaggle boschコンペ振り返りKaggle boschコンペ振り返り
Kaggle boschコンペ振り返り
Keisuke Hosaka
 
Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...
Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...
Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...Computer Science Club
 
Kaggle bosch presentation material for Kaggle Tokyo Meetup #2
Kaggle bosch presentation material for Kaggle Tokyo Meetup #2Kaggle bosch presentation material for Kaggle Tokyo Meetup #2
Kaggle bosch presentation material for Kaggle Tokyo Meetup #2
Keisuke Hosaka
 
Area–delay–power efficient carry select adder
Area–delay–power efficient carry select adderArea–delay–power efficient carry select adder
Area–delay–power efficient carry select adder
LogicMindtech Nologies
 
Advanced Analytics and Data Visualisation in Forest Management and Planning
Advanced Analytics and Data Visualisation in  Forest Management and PlanningAdvanced Analytics and Data Visualisation in  Forest Management and Planning
Advanced Analytics and Data Visualisation in Forest Management and Planning
Noli Sicad
 
Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...
Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...
Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...
Neo4j
 
Data_Visualization_LP Result_Dashboard_Using_R_Graphics
Data_Visualization_LP Result_Dashboard_Using_R_GraphicsData_Visualization_LP Result_Dashboard_Using_R_Graphics
Data_Visualization_LP Result_Dashboard_Using_R_GraphicsNoli Sicad
 
R and Visualization: A match made in Heaven
R and Visualization: A match made in HeavenR and Visualization: A match made in Heaven
R and Visualization: A match made in Heaven
Edureka!
 
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
The Statistical and Applied Mathematical Sciences Institute
 
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
Asai Masataro
 
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AES
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AESCFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AES
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AES
idescitation
 
Karnaugh Map (K-map)
Karnaugh Map (K-map)Karnaugh Map (K-map)
Karnaugh Map (K-map)
Saimur Swarnab
 
THoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property GraphsTHoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property Graphs
Giacomo Bergami
 

What's hot (20)

20131216 Stat Journal
20131216 Stat Journal20131216 Stat Journal
20131216 Stat Journal
 
Day 2 examples u6w14
Day 2 examples u6w14Day 2 examples u6w14
Day 2 examples u6w14
 
An Improved Optimization Techniques for Parallel Prefix Adder using FPGA
An Improved Optimization Techniques for Parallel Prefix Adder using FPGAAn Improved Optimization Techniques for Parallel Prefix Adder using FPGA
An Improved Optimization Techniques for Parallel Prefix Adder using FPGA
 
Lecture 28
Lecture 28Lecture 28
Lecture 28
 
Design and implementation of Parallel Prefix Adders using FPGAs
Design and implementation of Parallel Prefix Adders using FPGAsDesign and implementation of Parallel Prefix Adders using FPGAs
Design and implementation of Parallel Prefix Adders using FPGAs
 
Kaggle boschコンペ振り返り
Kaggle boschコンペ振り返りKaggle boschコンペ振り返り
Kaggle boschコンペ振り返り
 
Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...
Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...
Andrew Goldberg. Highway Dimension and Provably Efficient Shortest Path Algor...
 
3 facility layout
3 facility layout3 facility layout
3 facility layout
 
Kaggle bosch presentation material for Kaggle Tokyo Meetup #2
Kaggle bosch presentation material for Kaggle Tokyo Meetup #2Kaggle bosch presentation material for Kaggle Tokyo Meetup #2
Kaggle bosch presentation material for Kaggle Tokyo Meetup #2
 
Area–delay–power efficient carry select adder
Area–delay–power efficient carry select adderArea–delay–power efficient carry select adder
Area–delay–power efficient carry select adder
 
Advanced Analytics and Data Visualisation in Forest Management and Planning
Advanced Analytics and Data Visualisation in  Forest Management and PlanningAdvanced Analytics and Data Visualisation in  Forest Management and Planning
Advanced Analytics and Data Visualisation in Forest Management and Planning
 
Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...
Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...
Graphing Enterprise IT – Representing IT Infrastructure and Business Processe...
 
Data_Visualization_LP Result_Dashboard_Using_R_Graphics
Data_Visualization_LP Result_Dashboard_Using_R_GraphicsData_Visualization_LP Result_Dashboard_Using_R_Graphics
Data_Visualization_LP Result_Dashboard_Using_R_Graphics
 
R and Visualization: A match made in Heaven
R and Visualization: A match made in HeavenR and Visualization: A match made in Heaven
R and Visualization: A match made in Heaven
 
distance_matrix_ch
distance_matrix_chdistance_matrix_ch
distance_matrix_ch
 
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
GDRR Opening Workshop - Bayesian Inference for Common Cause Failure Rate Base...
 
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
[AAAI-16] Tiebreaking Strategies for A* Search: How to Explore the Final Fron...
 
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AES
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AESCFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AES
CFA based SBOX and Modified Mixcolumn Implementation of 8 Bit Datapath for AES
 
Karnaugh Map (K-map)
Karnaugh Map (K-map)Karnaugh Map (K-map)
Karnaugh Map (K-map)
 
THoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property GraphsTHoSP: an Algorithm for Nesting Property Graphs
THoSP: an Algorithm for Nesting Property Graphs
 

Similar to 8th TUC Meeting | Lijun Chang (University of New South Wales). Efficient Subgraph Matching by Postponing Cartesian Products.

Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection
aftab alam
 
Distributed graph summarization
Distributed graph summarizationDistributed graph summarization
Distributed graph summarization
aftab alam
 
SCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATION
SCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATIONSCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATION
SCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATION
aftab alam
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
Kyong-Ha Lee
 
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
Ryan Rossi
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
Ben Mabey
 
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
Xiaoyu Shi
 
Markov Chain Monitoring - Application to demand prediction in bike sharing sy...
Markov Chain Monitoring - Application to demand prediction in bike sharing sy...Markov Chain Monitoring - Application to demand prediction in bike sharing sy...
Markov Chain Monitoring - Application to demand prediction in bike sharing sy...
Harshal Chaudhari
 
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
Nesreen K. Ahmed
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
Nesreen K. Ahmed
 
List intersection for web search: Algorithms, Cost Models, and Optimizations
List intersection for web search: Algorithms, Cost Models, and OptimizationsList intersection for web search: Algorithms, Cost Models, and Optimizations
List intersection for web search: Algorithms, Cost Models, and Optimizations
Sunghwan Kim
 
Project seminar ppt_steelcasting
Project seminar ppt_steelcastingProject seminar ppt_steelcasting
Project seminar ppt_steelcasting
Rudra Narayan Paul
 
4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...
4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...
4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...GISRUK conference
 
15 unionfind
15 unionfind15 unionfind
15 unionfind
Carlos andré dantas
 
Algorithms, Union Find
Algorithms, Union FindAlgorithms, Union Find
Algorithms, Union Find
Nikita Shpilevoy
 
On the value of Sampling and Pruning for SBSE
On the value of Sampling and Pruning for SBSEOn the value of Sampling and Pruning for SBSE
On the value of Sampling and Pruning for SBSE
Jianfeng Chen
 
Fast and Scalable NUMA-based Thread Parallel Breadth-first Search
Fast and Scalable NUMA-based Thread Parallel Breadth-first SearchFast and Scalable NUMA-based Thread Parallel Breadth-first Search
Fast and Scalable NUMA-based Thread Parallel Breadth-first Search
Yuichiro Yasui
 

Similar to 8th TUC Meeting | Lijun Chang (University of New South Wales). Efficient Subgraph Matching by Postponing Cartesian Products. (20)

Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection Compressing Graphs and Indexes with Recursive Graph Bisection
Compressing Graphs and Indexes with Recursive Graph Bisection
 
Distributed graph summarization
Distributed graph summarizationDistributed graph summarization
Distributed graph summarization
 
SCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATION
SCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATIONSCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATION
SCALABLE PATTERN MATCHING OVER COMPRESSED GRAPHS VIA DE-DENSIFICATION
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
 
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks Leveraging Multiple GPUs and CPUs for  Graphlet Counting in Large Networks
Leveraging Multiple GPUs and CPUs for Graphlet Counting in Large Networks
 
SVD and the Netflix Dataset
SVD and the Netflix DatasetSVD and the Netflix Dataset
SVD and the Netflix Dataset
 
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
CSBP: A Fast Circuit Similarity-Based Placement for FPGA Incremental Design a...
 
Markov Chain Monitoring - Application to demand prediction in bike sharing sy...
Markov Chain Monitoring - Application to demand prediction in bike sharing sy...Markov Chain Monitoring - Application to demand prediction in bike sharing sy...
Markov Chain Monitoring - Application to demand prediction in bike sharing sy...
 
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
The Power of Motif Counting Theory, Algorithms, and Applications for Large Gr...
 
Ch06 multalign
Ch06 multalignCh06 multalign
Ch06 multalign
 
High-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and ModelingHigh-Performance Graph Analysis and Modeling
High-Performance Graph Analysis and Modeling
 
List intersection for web search: Algorithms, Cost Models, and Optimizations
List intersection for web search: Algorithms, Cost Models, and OptimizationsList intersection for web search: Algorithms, Cost Models, and Optimizations
List intersection for web search: Algorithms, Cost Models, and Optimizations
 
Project seminar ppt_steelcasting
Project seminar ppt_steelcastingProject seminar ppt_steelcasting
Project seminar ppt_steelcasting
 
Lecture13
Lecture13Lecture13
Lecture13
 
post119s1-file3
post119s1-file3post119s1-file3
post119s1-file3
 
4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...
4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...
4A_ 3_Parallel k-means clustering using gp_us for the geocomputation of real-...
 
15 unionfind
15 unionfind15 unionfind
15 unionfind
 
Algorithms, Union Find
Algorithms, Union FindAlgorithms, Union Find
Algorithms, Union Find
 
On the value of Sampling and Pruning for SBSE
On the value of Sampling and Pruning for SBSEOn the value of Sampling and Pruning for SBSE
On the value of Sampling and Pruning for SBSE
 
Fast and Scalable NUMA-based Thread Parallel Breadth-first Search
Fast and Scalable NUMA-based Thread Parallel Breadth-first SearchFast and Scalable NUMA-based Thread Parallel Breadth-first Search
Fast and Scalable NUMA-based Thread Parallel Breadth-first Search
 

More from LDBC council

8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...
8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...
8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...
LDBC council
 
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
LDBC council
 
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
LDBC council
 
8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...
8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...
8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...
LDBC council
 
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
LDBC council
 
8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...
8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...
8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...
LDBC council
 
Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...
Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...
Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...
LDBC council
 
8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status
8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status
8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status
LDBC council
 
8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...
8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...
8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...
LDBC council
 
8th TUC Meeting -
8th TUC Meeting - 8th TUC Meeting -
8th TUC Meeting -
LDBC council
 
8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...
8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...
8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...
LDBC council
 
8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...
8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...
8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...
LDBC council
 
8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...
8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...
8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...
LDBC council
 
LDBC 8th TUC Meeting: Introduction and status update
LDBC 8th TUC Meeting: Introduction and status updateLDBC 8th TUC Meeting: Introduction and status update
LDBC 8th TUC Meeting: Introduction and status update
LDBC council
 
LDBC 6th TUC Meeting conclusions
LDBC 6th TUC Meeting conclusionsLDBC 6th TUC Meeting conclusions
LDBC 6th TUC Meeting conclusions
LDBC council
 
Parallel and incremental materialisation of RDF/DATALOG in RDFOX
Parallel and incremental materialisation of RDF/DATALOG in RDFOXParallel and incremental materialisation of RDF/DATALOG in RDFOX
Parallel and incremental materialisation of RDF/DATALOG in RDFOX
LDBC council
 
MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service Selection
LDBC council
 
E-Commerce and Graph-driven Applications: Experiences and Optimizations while...
E-Commerce and Graph-driven Applications: Experiences and Optimizations while...E-Commerce and Graph-driven Applications: Experiences and Optimizations while...
E-Commerce and Graph-driven Applications: Experiences and Optimizations while...
LDBC council
 
LDBC SNB Benchmark Auditing
LDBC SNB Benchmark AuditingLDBC SNB Benchmark Auditing
LDBC SNB Benchmark Auditing
LDBC council
 
Social Network Benchmark Interactive Workload
Social Network Benchmark Interactive WorkloadSocial Network Benchmark Interactive Workload
Social Network Benchmark Interactive Workload
LDBC council
 

More from LDBC council (20)

8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...
8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...
8th TUC Meeting - Juan Sequeda (Capsenta). Integrating Data using Graphs and ...
 
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...8th TUC Meeting -  Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
8th TUC Meeting - Zhe Wu (Oracle USA). Bridging RDF Graph and Property Graph...
 
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
8th TUC Meeting – Yinglong Xia (Huawei), Big Graph Analytics Engine
 
8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...
8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...
8th TUC Meeting – George Fletcher (TU Eindhoven), gMark: Schema-driven data a...
 
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
8th TUC Meeting – Marcus Paradies (SAP) Social Network Benchmark
 
8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...
8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...
8th TUC Meeting - Sergey Edunov (Facebook). Generating realistic trillion-edg...
 
Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...
Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...
Weining Qian (ECNU). On Statistical Characteristics of Real-Life Knowledge Gr...
 
8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status
8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status
8th TUC Meeting - Peter Boncz (CWI). Query Language Task Force status
 
8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...
8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...
8th TUC Meeting - Eugene I. Chong (Oracle USA). Balancing Act to improve RDF ...
 
8th TUC Meeting -
8th TUC Meeting - 8th TUC Meeting -
8th TUC Meeting -
 
8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...
8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...
8th TUC Meeting - David Meibusch, Nathan Hawes (Oracle Labs Australia). Frapp...
 
8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...
8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...
8th TUC Meeting - Martin Zand University of Rochester Clinical and Translatio...
 
8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...
8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...
8th TUC Meeting - Tim Hegeman (TU Delft). Social Network Benchmark, Analytics...
 
LDBC 8th TUC Meeting: Introduction and status update
LDBC 8th TUC Meeting: Introduction and status updateLDBC 8th TUC Meeting: Introduction and status update
LDBC 8th TUC Meeting: Introduction and status update
 
LDBC 6th TUC Meeting conclusions
LDBC 6th TUC Meeting conclusionsLDBC 6th TUC Meeting conclusions
LDBC 6th TUC Meeting conclusions
 
Parallel and incremental materialisation of RDF/DATALOG in RDFOX
Parallel and incremental materialisation of RDF/DATALOG in RDFOXParallel and incremental materialisation of RDF/DATALOG in RDFOX
Parallel and incremental materialisation of RDF/DATALOG in RDFOX
 
MODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service SelectionMODAClouds Decision Support System for Cloud Service Selection
MODAClouds Decision Support System for Cloud Service Selection
 
E-Commerce and Graph-driven Applications: Experiences and Optimizations while...
E-Commerce and Graph-driven Applications: Experiences and Optimizations while...E-Commerce and Graph-driven Applications: Experiences and Optimizations while...
E-Commerce and Graph-driven Applications: Experiences and Optimizations while...
 
LDBC SNB Benchmark Auditing
LDBC SNB Benchmark AuditingLDBC SNB Benchmark Auditing
LDBC SNB Benchmark Auditing
 
Social Network Benchmark Interactive Workload
Social Network Benchmark Interactive WorkloadSocial Network Benchmark Interactive Workload
Social Network Benchmark Interactive Workload
 

Recently uploaded

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
UiPathCommunity
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
Globus
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
Pierluigi Pugliese
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 

Recently uploaded (20)

Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
Enhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZEnhancing Performance with Globus and the Science DMZ
Enhancing Performance with Globus and the Science DMZ
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024By Design, not by Accident - Agile Venture Bolzano 2024
By Design, not by Accident - Agile Venture Bolzano 2024
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 

8th TUC Meeting | Lijun Chang (University of New South Wales). Efficient Subgraph Matching by Postponing Cartesian Products.

  • 1. Computer Science and Engineering Lijun Chang Efficient Subgraph Matching by Postponing Cartesian Products Lijun.Chang@unsw.edu.au The University of New South Wales, Australia
  • 2. 2 Outline Ø  Introduction & Existing Works Ø  Challenges of Subgraph Matching Ø  Our Approach: CFL-Match v Core-First based Framework v Compact Path Index (CPI) based Matching Ø  Experiment Ø  Conclusion
  • 3. 3 Introduction Ø  Subgraph Matching Given a query q and a large data graph G, the problem is to extract all subgraph isomorphic embeddings of q in G.
  • 4. 4 Introduction Ø  Subgraph Matching Given a query q and a large data graph G, the problem is to extract all subgraph isomorphic embeddings of q in G.
  • 5. 5 Introduction Ø  Subgraph Matching Given a query q and a large data graph G, the problem is to extract all subgraph isomorphic embeddings of q in G.
  • 6. 6 Introduction Ø  Subgraph Matching Given a query q and a large data graph G, the problem is to extract all subgraph isomorphic embeddings of q in G.
  • 7. 7 Introduction Ø  Applications §  Protein interaction network analysis §  Social network analysis §  Chemical compound search
  • 8. 8 Hardness Result Ø  Subgraph Isomorphism Testing Ø  Decide whether there is a subgraph of G that is isomophic to q Ø  NP-complete Ø  Enumerating all subgraph embeddings is harder Ø  This is the problem we study
  • 9. 9 Existing Work Ø  Ullmann’s algorithm [J.ACM’76] §  Iteratively maps query vertices one by one, following the input order of query vertices. §  Example: Input order could be (u1, u2, u3, u4, u5, u6) §  Cartesian Products between vertices’ candidates. Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08] Ø  TurboISO [SIGMOD’13] Ø  BoostISO [VLDB’15]
  • 10. 10 Existing Work Ø  Ullmann’s algorithm [J.ACM’76] Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08] §  Independently propose to enforce connectivity of the matching order to reduce Cartesian products caused by disconnected query vertices. §  QuickSI further removes false-positive candidates by first processing infrequent query vertices and edges. §  Connected order could be (u5, u1, u2, u3, u6, u4) Ø  TurboISO [SIGMOD’13] Ø  BoostISO [VLDB’15]
  • 11. 11 Existing Work Ø  Ullmann’s algorithm [J.ACM’76] Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08] Ø  TurboISO [SIGMOD’13] §  Merge together query vertices with the same neighborhood. §  Reduces Cartesian product caused by similar query vertices §  Build a data structure online to facilitate the search process. Ø  BoostISO [VLDB’15]
  • 12. 12 Existing Work Ø  Ullmann’s algorithm [J.ACM’76] Ø  VF2 [IEEE Trans’04] and QuickSI [VLDB’08] Ø  TurboISO [SIGMOD’13] Ø  BoostISO [VLDB’15] §  Compress a data graph G by merging together similar vertices in G. §  Develop query-dependent relationship between vertices in G. It is still challenging for matching large query graphs.
  • 13. 13 Challenges of Subgraph Matching Matching order of QuickSI and TurboISO : (u1,u2,u3,u4,u5,u6). Challenge I: Redundant Cartesian Products by Dissimilar Vertices. Cartesian products: Ø  100 mappings (v0,v2, v1000+i, v2100 +i) (3 ≤ i ≤ 102) of (u1,u2,u3,u4) Ø  1000 mappings (v0, vj) (3 ≤ j ≤ 1002) of (u1,u5) 105 - 100 partial mappings are redundant. (u1,u2,u5,u3,u4,u6)
  • 14. 14 Challenges of Subgraph Matching Our Solution : Postpone Cartesian products. Ø  Decompose q into a dense subgraph and a forest, and process the dense subgraph first. Challenge I: Redundant Cartesian Products by Dissimilar Vertices.
  • 15. 15 Challenges of Subgraph Matching Challenge II: Exponential size of the path-based data structure in TurboISO. Ø  TurboISO builds a data structure that materializes all embeddings of query paths in a data graph 1.  for generating matching order based on estimation of #candidates. 2.  for enumerating subgraph isomorphic embeddings. Ø  Worst-case space complexity: O(|V(G)||v(q)-1|).
  • 16. 16 Challenges of Subgraph Matching Challenge II: Exponential size of the path-based data structure in TurboISO. Our Solution: Polynomial-size data structure, compact path-index (CPI) .
  • 17. 17 Our Approach Ø CFL-Match v A Core-First based Framework v Compact Path-Index (CPI) based Matching
  • 18. 18 CFL-Match Ø  A Core-First based Framework §  Core-Forest Decomposition Compute the minimal connected subgraph containing all non-tree edges of q regarding any spanning tree. §  Forest-Leaf Decomposition Compute the set of leaf vertices by rooting each tree at its connection vertex.
  • 19. 19 CFL-Match Ø  A Core-First based Framework 1)  Core-Forest-Leaf Decomposition 2)  CPI Construction 3)  Mapping Extraction i.  Core-Match ii.  Forest-Match iii.  Leaf-Match •  Categorize leaf nodes according to label •  Perform combination instead of enumeration among different labels.
  • 20. 20 Auxiliary Data Structure Ø  Compact Path-Index (CPI) §  Compactly store candidate embeddings of query spanning trees. §  Serve for computing an effective matching order. Ø  CPI Structure §  Candidate sets Each query node u has a candidate set u.C. §  Edge sets This is an edge between v ∈ u.C and v’ ∈ u’.C for adjacent query nodes u and u’ in CPI if and only if (v, v’) exists in G.
  • 21. 21 Auxiliary Data Structure Ø  Compact Path-Index (CPI) §  Compactly store candidate embeddings of query spanning trees. §  Serve for computing an effective matching order. Ø  CPI Structure §  Example
  • 22. 22 Auxiliary Data Structure Ø  Soundness of CPI For every query node u in CPI, if there is an embedding of q in G that maps u to v, then v must be in u.C. Given a sound CPI, all embeddings of q in G can be computed by traversing only the CPI while G is only probed for non-tree edge checkings. Ø  It is NP-hard to build a minimum sound CPI. Ø  Aim to build a small and sound CPI. Theorem
  • 23. 23 CPI Construction Ø  General Idea §  A heuristic approach: 1) u.C is initialized to contain all vertices in G with the same label as u 2) A data vertex v is pruned from u.C , if ∃u’ ∈ Nq(u), such that ∄v’ ∈ NG(v) & v’ ∈ u’.C. Ø  A two-phase CPI construction process: §  Top-down construction, bottom-up refinement §  Exploit the pruning power of both directions of every query edge. §  Construct CPI of O(|E(G)| X |V(q)|) size in O(|E(G)| X |E(q)|) time
  • 24. 24 CPI-based Match Ø  Compute path-based matching order using CPI Ø  Estimate #matches for each root-to-leaf path in CPI Ø  Add paths to the matching order in increasing order regarding #matches Ø  Traverse CPI to find mappings for query vertices Ø  Only probe G for non-tree edge validation (u0,u1,u4,u3,u2, u5, u6, u7, u8, u9, u10)
  • 25. 25 Experiment Ø  All algorithms are implemented in C++ and run on a machine with 3.2G CPU and 8G RAM. Ø  Datasets §  Real Graphs §  Synthetic Graphs §  Randomly generate graphs with 100k vertices with average degree 8 and 50 distinct labels. Ø  Query Graphs §  Randomly generate by random walk §  Two Categories: S: sparse (average degree ≤ 3). N: non-sparse (average degree > 3). |V| |E| |∑| Degree HPRD 9460 37081 307 7.8 Yeast 3112 12519 71 8.1 Human 4674 86282 44 36.9
  • 26. 26 Comparing with Existing Techniques Varying the size of query graph |V(q)| CFL-Match: our proposed algorithm
  • 27. 27 Effectiveness of Our New Framework Evaluating our framework Ø  Match: subgraph matching algorithm with CPI but no query decomposition. Ø  CF-Match: only core-forest decomposition with CPI. Ø  CFL-Match: our best algorithm.
  • 29. 29 Conclusion Ø  We proposed a core-first framework for subgraph matching by postponing Cartesian products Ø  We proposed a new polynomial-size path-based auxiliary data structure CPI, and proposed efficient and effective technique for constructing a small CPI Ø  We proposed efficient algorithms for subgraph matching based on the core-first framework and the CPI Ø  Extensive empirical studies on real and synthetic graphs demonstrate that our technique outperforms the state-of-the-art algorithms.