SlideShare a Scribd company logo
1 of 26
Ho-Beom Kim
Network Science Lab
Dept. of Mathematics
The Catholic University of Korea
E-mail: hobeom2001@catholic.ac.kr
2023 / 06 / 26
OU, Mingdong, et al.
2
 Introduction
• Contributions
 Methodology
 Related work
 Experiments
 Results
 Discussion
 Conclusion
3
1. Introduction
Problem Statements
• Nowadays, more and more applications are based on larger and larger networks and graphs.
• Most of the existing graph embedding methods target on undirected graphs
• We can not straightforwardly apply undirected graph embedding methods on directed grpahs.
4
1. Introduction
Contributions
• Propose a high-order proximity preserved embedding (HOPE) method to solve the challenging
problem of asymmetric transitivity in directed graph embedding
• Derive a general form covering multiple commonly used high-order proximities, enabling the scalable
solution of HOPE with generalized SVD
• Provide an upper bound on the approximation error of HOPE
• Extensive experiments are conducted to verify the usefulness and generality of the learned embedding
in various applications
5
1. Introduction
Framework of Asymmetric Transitivity Preserving Graph Embedding
6
2. Related Work
Graph Embedding
• Propose a high-order proximity preserved embedding (HOPE) method to solve the challenging
problem of asymmetric transitivity in directed graph embedding
• Derive a general form covering multiple commonly used high-order proximities, enabling the scalable
solution of HOPE with generalized SVD
• Provide an upper bound on the approximation error of HOPE
• Extensive experiments are conducted to verify the usefulness and generality of the learned embedding
in various applications
7
3. High-Order Proximity Preserved Embedding
Notations
• 𝐴 ∶ 𝑎𝑑𝑗𝑎𝑐𝑒𝑛𝑐𝑦 𝑚𝑎𝑡𝑟𝑖𝑥
• 𝑠𝑖𝑗 ∶ 𝑝𝑟𝑜𝑥𝑖𝑚𝑖𝑡𝑦 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 𝑣𝑖, 𝑣𝑗
• 𝑈 = 𝑈𝑠, 𝑈𝑡 : 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑚𝑎𝑡𝑟𝑖𝑥, 𝑈𝑠 ∶ 𝑠𝑜𝑢𝑟𝑐𝑒 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑣𝑒𝑐𝑡𝑜𝑟𝑠, 𝑈𝑡: 𝑡𝑎𝑟𝑔𝑒𝑡 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑣𝑒𝑐𝑡𝑜𝑟𝑠
• 𝐾 ∶ 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑑𝑖𝑚𝑒𝑛𝑠𝑖𝑜𝑛𝑠
8
3. High-Order Proximity Preserved Embedding
Problem Definition
• m𝑖𝑛 ∥ 𝑆 − 𝑈𝑠
∙ 𝑈𝑡𝑇
∥𝐹
2
: loss function
9
3. High-Order Proximity Preserved Embedding
High order proximities
• 𝑆 = 𝑀𝑔
−1
∙ 𝑀𝑙
• Katz Index : An ensemble of all paths, which is a weighted summation over the path set between two
vertexes
• Weight of a path : exponential function of its length
• 𝑆𝐾𝑎𝑡𝑧
= 𝑙=1
∞
𝛽 ∙ 𝐴𝑙
= 𝛽 ∙ 𝐴 ∙ 𝑆𝐾𝑎𝑡𝑧
+ 𝛽 ∙ 𝐴
• 𝛽 : decay parameter
• 𝑆𝐾𝑎𝑡𝑧
= (𝐼 − 𝛽 ∙ A)
−1
∙ 𝛽 ∙A
• 𝑀𝑔 = 𝐼 − 𝛽 ∙ A
• 𝑀𝑙 = 𝛽 ∙ A
10
3. High-Order Proximity Preserved Embedding
High order proximities
• 𝑆 = 𝑀𝑔
−1
∙ 𝑀𝑙
• 𝑀𝑔, 𝑀𝑙 : polynomial matrices
• Katz Index : An ensemble of all paths, which is a weighted summation over the path set between two
vertexes
• Weight of a path : exponential function of its length
• 𝑆𝐾𝑎𝑡𝑧 = 𝑙=1
∞
𝛽 ∙ 𝐴𝑙 = 𝛽 ∙ 𝐴 ∙ 𝑆𝐾𝑎𝑡𝑧 + 𝛽 ∙ 𝐴
• 𝛽 : decay parameter
• 𝑆𝐾𝑎𝑡𝑧
= (𝐼 − 𝛽 ∙ A)
−1
∙ 𝛽 ∙A
• 𝑀𝑔 = 𝐼 − 𝛽 ∙ A
• 𝑀𝑙 = 𝛽 ∙ A
11
3. High-Order Proximity Preserved Embedding
High order proximities
• Rooted PageRank (RPR)
• 𝑆𝑖𝑗
𝑅𝑃𝑅
∶ 𝑡ℎ𝑒 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑡ℎ𝑎𝑡 𝑎 𝑟𝑎𝑛𝑑𝑜𝑚 𝑤𝑎𝑙𝑘 𝑓𝑟𝑜𝑚 𝑛𝑜𝑑𝑒 𝑣𝑖 𝑤𝑖𝑙𝑙 𝑙𝑜𝑐𝑎𝑡𝑒 𝑎𝑡 𝑣𝑗𝑖𝑛 𝑡ℎ𝑒 𝑠𝑡𝑒𝑎𝑑𝑦 𝑠𝑡𝑎𝑡𝑒
• 𝛼 : jumps to one of the neighbor of current node with probability , jumps back probability
• 𝑃 ∶ 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑡𝑟𝑎𝑛𝑠𝑖𝑡𝑖𝑜𝑛 𝑚𝑎𝑡𝑟𝑖𝑥 𝑠𝑎𝑡𝑖𝑠𝑓𝑦𝑖𝑛𝑔 𝑖=1
𝑁
𝑃𝑖𝑗 = 1
• 𝑆𝑖𝑗
𝑅𝑃𝑅
= 1 − 𝛼 ∙ 𝐼 − 𝛼𝑃 −1
• 𝑀𝑔 = 𝐼 − 𝛼𝑃
• 𝑀𝑙 = (1 − 𝛼) ∙ I
12
3. High-Order Proximity Preserved Embedding
High order proximities
• Common Neighbors (CN)
• 𝑆𝑖𝑗
𝐶𝑁
∶ 𝐶𝑜𝑢𝑛𝑡𝑠 𝑡ℎ𝑒 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑣𝑏𝑒𝑟𝑡𝑒𝑥𝑒𝑠 𝑐𝑜𝑛𝑛𝑒𝑐𝑡𝑖𝑛𝑔 𝑡𝑜 𝑏𝑜𝑡ℎ 𝑣𝑖𝑎𝑛𝑑 𝑣𝑗
: The number of vertexes which is target of an edge from 𝑣𝑖 and the source of an edge to 𝑣𝑗
• 𝑆𝐶𝑁
= 𝐴2
• 𝑀𝑔 = 𝐼
• 𝑀𝑙 = 𝐴2
13
3. High-Order Proximity Preserved Embedding
High order proximities
• Adamic-Adar (AA)
• AA is a variant of common neighbors.
• AA assigns each neighbor a weight, that is the reciprocal of the degree of the neighbor
• The more vertexes one vertex connected to, the less important it is on evaluating the proximity
between a pair of vertex
• 𝑆𝐴𝐴
= 𝐴 ∙D ∙A
• D : diagonal matrix
• 𝐷𝑖𝑖 = 1/ 𝑗(𝐴𝑖𝑗 + 𝐴𝑗𝑖)
• 𝑀𝑔 = 𝐼
• 𝑀𝑙 = 𝐴 ∙ D ∙ A
14
3. High-Order Proximity Preserved Embedding
Approximation of High-Order Proximity
• Objective Equation aims to find an optimal rank-K approximation of the proximity matrix S
• 𝑠 = 𝑖=1
𝑁
𝜎𝑖𝑉𝑖
𝑆
𝑉𝑖
𝑡𝑇
• Make the SVD on S very expensive, the solution is not feasible for large scale graphs
15
3. High-Order Proximity Preserved Embedding
Approximation of High-Order Proximity
• Adopt a partial generalized SVD algorithm (JDGSVD) : an iterative Jacobi-Davidson type algorithm which is
very scalable when k<< N
16
3. High-Order Proximity Preserved Embedding
Approximation Error
• {𝜎𝑖} 𝑎𝑟𝑒 𝑡ℎ𝑒 𝑠𝑖𝑛𝑔𝑢𝑙𝑎𝑟 𝑣𝑎𝑙𝑢𝑒𝑠 𝑜𝑓 𝑆 𝑖𝑛 𝑑𝑒𝑠𝑐𝑒𝑛𝑑 𝑜𝑟𝑑𝑒𝑟
• The lower the rank of S is, the smaller the error is
17
4. Experiments
Datasets
• Synthetic Data (Syn)
• Generate the synthetic data by the forest fire model
• Can generate powerlaw graphs
• As the time com- plexity of computation of some high-order proximities (e.g. Katz, RPR) is too
high, we generate small-scaled synthetic data to allow for the computation of high- order
proximities
• Randomly generate the synthetic datasets
• Cora
• Citation network of academic papers
• Vertexes : academic papers, Directed edges : the citation relationship between papers
18
4. Experiments
Datasets
• Twitter Social Network (SN-Twitter)
• A subnetwork of Twitter
• Vertexes : users of Twitter, Directed edges : following relationships between users
• Tencent Weibo Social Network (SN-Tweibo)
• Contains a subnetwork of the social network in Tencent Weibo, a Twitter-style social network in
China
• Vertexes : Users, Directed edges : following relationship between users
19
4. Experiments
Baseline Methods
• LINE : Preserves the first-order and second-order proximity between vertexes.
• DeepWalk : Randomly walks on the graph, and assumes a pair of vertexes similar if they are close in
the random path. Then, the embedding is learned to preserve these pairwise similarities in the
embedding
• PPE(Partial Proximity Embedding) : First selects a small subset of vertexes as landmarks, and learns
the embedding by approximating the proximity between vertexes and landmarks
• Common Neighbors : Rank the links by the number of common neighbors. Use it for link prediction
and vertex recommendation
• Adamic-Adar : Rank the links by the Adamic-Adar values Use it for link prediction and vertex
recommendation
20
4. Experiments
Baseline Methods
21
4. Experiments
Evaluation Metrics
22
4. Experiments
High-order Proximity Approximation
23
4. Experiments
Graph Reconstruction
24
4. Experiments
Graph Reconstruction
25
4. Experiments
Graph Reconstruction
26
4. Experiments
Conclusion
• we aim to preserve asymmetric transitiv- ity in directed graphs, and propose to preserve asymmetric
transitivity by approximating high-order proximities.
• We propose a scalable approximation algorithm , called High- Order Proximity preserved Embedding
(HOPE).
• we first derive a general formulation of a class of high-order proximity measurements, then apply
generalized SVD to the general formulation, whose time complexity is linear with the size of graph.
• The empirical study demon- strates the superiority of asymmetric transitivity and our proposed
algorithm, HOPE.
• Our future direction is to de- velop a nonlinear model to better capture the complex struc- ture of
directed graphs.

More Related Content

Similar to NS-CUK Seminar:H.B.Kim, Review on "Asymmetric transitivity preserving graph embedding", KDD 2016

Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
KamleshKumar394
 

Similar to NS-CUK Seminar:H.B.Kim, Review on "Asymmetric transitivity preserving graph embedding", KDD 2016 (20)

Temporal Superpixels Based on Proximity-Weighted Patch Matching
Temporal Superpixels Based on Proximity-Weighted Patch MatchingTemporal Superpixels Based on Proximity-Weighted Patch Matching
Temporal Superpixels Based on Proximity-Weighted Patch Matching
 
Adaptive Geographical Search in Networks
Adaptive Geographical Search in NetworksAdaptive Geographical Search in Networks
Adaptive Geographical Search in Networks
 
"Sparse Graph Attention Networks", IEEE Transactions on Knowledge and Data En...
"Sparse Graph Attention Networks", IEEE Transactions on Knowledge and Data En..."Sparse Graph Attention Networks", IEEE Transactions on Knowledge and Data En...
"Sparse Graph Attention Networks", IEEE Transactions on Knowledge and Data En...
 
IJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsIJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphs
 
NS-CUK Seminar: H.E.Lee, Review on "Structural Deep Embedding for Hyper-Netw...
NS-CUK Seminar: H.E.Lee,  Review on "Structural Deep Embedding for Hyper-Netw...NS-CUK Seminar: H.E.Lee,  Review on "Structural Deep Embedding for Hyper-Netw...
NS-CUK Seminar: H.E.Lee, Review on "Structural Deep Embedding for Hyper-Netw...
 
Data Mining Lecture_9.pptx
Data Mining Lecture_9.pptxData Mining Lecture_9.pptx
Data Mining Lecture_9.pptx
 
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
Weisfeiler and Leman Go Neural: Higher-order Graph Neural Networks, arXiv e-...
 
cutnpeel_wsdm2022_slide.pdf
cutnpeel_wsdm2022_slide.pdfcutnpeel_wsdm2022_slide.pdf
cutnpeel_wsdm2022_slide.pdf
 
Random graph models
Random graph modelsRandom graph models
Random graph models
 
Financial Networks III. Centrality and Systemic Importance
Financial Networks III. Centrality and Systemic ImportanceFinancial Networks III. Centrality and Systemic Importance
Financial Networks III. Centrality and Systemic Importance
 
Graph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear AlgebraGraph Analysis Beyond Linear Algebra
Graph Analysis Beyond Linear Algebra
 
Machine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional ManagersMachine Learning Foundations for Professional Managers
Machine Learning Foundations for Professional Managers
 
is anyone_interest_in_auto-encoding_variational-bayes
is anyone_interest_in_auto-encoding_variational-bayesis anyone_interest_in_auto-encoding_variational-bayes
is anyone_interest_in_auto-encoding_variational-bayes
 
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
 
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
 
Paper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satPaper study: Learning to solve circuit sat
Paper study: Learning to solve circuit sat
 
GIS Data Quality
GIS Data QualityGIS Data Quality
GIS Data Quality
 
ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...
ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...
ICDE-2015 Shortest Path Traversal Optimization and Analysis for Large Graph C...
 
Sketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignmentSketching and locality sensitive hashing for alignment
Sketching and locality sensitive hashing for alignment
 
LSH
LSHLSH
LSH
 

More from ssuser4b1f48

More from ssuser4b1f48 (20)

NS-CUK Seminar: V.T.Hoang, Review on "GOAT: A Global Transformer on Large-sca...
NS-CUK Seminar: V.T.Hoang, Review on "GOAT: A Global Transformer on Large-sca...NS-CUK Seminar: V.T.Hoang, Review on "GOAT: A Global Transformer on Large-sca...
NS-CUK Seminar: V.T.Hoang, Review on "GOAT: A Global Transformer on Large-sca...
 
NS-CUK Seminar: J.H.Lee, Review on "Graph Propagation Transformer for Graph R...
NS-CUK Seminar: J.H.Lee, Review on "Graph Propagation Transformer for Graph R...NS-CUK Seminar: J.H.Lee, Review on "Graph Propagation Transformer for Graph R...
NS-CUK Seminar: J.H.Lee, Review on "Graph Propagation Transformer for Graph R...
 
NS-CUK Seminar: H.B.Kim, Review on "Cluster-GCN: An Efficient Algorithm for ...
NS-CUK Seminar: H.B.Kim,  Review on "Cluster-GCN: An Efficient Algorithm for ...NS-CUK Seminar: H.B.Kim,  Review on "Cluster-GCN: An Efficient Algorithm for ...
NS-CUK Seminar: H.B.Kim, Review on "Cluster-GCN: An Efficient Algorithm for ...
 
NS-CUK Seminar: H.E.Lee, Review on "Weisfeiler and Leman Go Neural: Higher-O...
NS-CUK Seminar: H.E.Lee,  Review on "Weisfeiler and Leman Go Neural: Higher-O...NS-CUK Seminar: H.E.Lee,  Review on "Weisfeiler and Leman Go Neural: Higher-O...
NS-CUK Seminar: H.E.Lee, Review on "Weisfeiler and Leman Go Neural: Higher-O...
 
NS-CUK Seminar:V.T.Hoang, Review on "GRPE: Relative Positional Encoding for G...
NS-CUK Seminar:V.T.Hoang, Review on "GRPE: Relative Positional Encoding for G...NS-CUK Seminar:V.T.Hoang, Review on "GRPE: Relative Positional Encoding for G...
NS-CUK Seminar:V.T.Hoang, Review on "GRPE: Relative Positional Encoding for G...
 
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
 
Aug 22nd, 2023: Case Studies - The Art and Science of Animation Production)
Aug 22nd, 2023: Case Studies - The Art and Science of Animation Production)Aug 22nd, 2023: Case Studies - The Art and Science of Animation Production)
Aug 22nd, 2023: Case Studies - The Art and Science of Animation Production)
 
Aug 17th, 2023: Case Studies - Examining Gamification through Virtual/Augment...
Aug 17th, 2023: Case Studies - Examining Gamification through Virtual/Augment...Aug 17th, 2023: Case Studies - Examining Gamification through Virtual/Augment...
Aug 17th, 2023: Case Studies - Examining Gamification through Virtual/Augment...
 
Aug 10th, 2023: Case Studies - The Power of eXtended Reality (XR) with 360°
Aug 10th, 2023: Case Studies - The Power of eXtended Reality (XR) with 360°Aug 10th, 2023: Case Studies - The Power of eXtended Reality (XR) with 360°
Aug 10th, 2023: Case Studies - The Power of eXtended Reality (XR) with 360°
 
Aug 8th, 2023: Case Studies - Utilizing eXtended Reality (XR) in Drones)
Aug 8th, 2023: Case Studies - Utilizing eXtended Reality (XR) in Drones)Aug 8th, 2023: Case Studies - Utilizing eXtended Reality (XR) in Drones)
Aug 8th, 2023: Case Studies - Utilizing eXtended Reality (XR) in Drones)
 
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
NS-CUK Seminar: J.H.Lee, Review on "Learnable Structural Semantic Readout for...
 
NS-CUK Seminar: H.E.Lee, Review on "Gated Graph Sequence Neural Networks", I...
NS-CUK Seminar: H.E.Lee,  Review on "Gated Graph Sequence Neural Networks", I...NS-CUK Seminar: H.E.Lee,  Review on "Gated Graph Sequence Neural Networks", I...
NS-CUK Seminar: H.E.Lee, Review on "Gated Graph Sequence Neural Networks", I...
 
NS-CUK Seminar:V.T.Hoang, Review on "Augmentation-Free Self-Supervised Learni...
NS-CUK Seminar:V.T.Hoang, Review on "Augmentation-Free Self-Supervised Learni...NS-CUK Seminar:V.T.Hoang, Review on "Augmentation-Free Self-Supervised Learni...
NS-CUK Seminar:V.T.Hoang, Review on "Augmentation-Free Self-Supervised Learni...
 
NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based ...
NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based ...NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based ...
NS-CUK Journal club: H.E.Lee, Review on " A biomedical knowledge graph-based ...
 
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
NS-CUK Seminar: H.E.Lee,  Review on "PTE: Predictive Text Embedding through L...NS-CUK Seminar: H.E.Lee,  Review on "PTE: Predictive Text Embedding through L...
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
 
NS-CUK Seminar: H.B.Kim, Review on "Inductive Representation Learning on Lar...
NS-CUK Seminar: H.B.Kim,  Review on "Inductive Representation Learning on Lar...NS-CUK Seminar: H.B.Kim,  Review on "Inductive Representation Learning on Lar...
NS-CUK Seminar: H.B.Kim, Review on "Inductive Representation Learning on Lar...
 
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
NS-CUK Seminar: H.E.Lee,  Review on "PTE: Predictive Text Embedding through L...NS-CUK Seminar: H.E.Lee,  Review on "PTE: Predictive Text Embedding through L...
NS-CUK Seminar: H.E.Lee, Review on "PTE: Predictive Text Embedding through L...
 
NS-CUK Seminar: J.H.Lee, Review on "Relational Self-Supervised Learning on Gr...
NS-CUK Seminar: J.H.Lee, Review on "Relational Self-Supervised Learning on Gr...NS-CUK Seminar: J.H.Lee, Review on "Relational Self-Supervised Learning on Gr...
NS-CUK Seminar: J.H.Lee, Review on "Relational Self-Supervised Learning on Gr...
 
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
NS-CUK Seminar: H.B.Kim,  Review on "metapath2vec: Scalable representation le...NS-CUK Seminar: H.B.Kim,  Review on "metapath2vec: Scalable representation le...
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
 
NS-CUK Seminar: H.E.Lee, Review on "Graph Star Net for Generalized Multi-Tas...
NS-CUK Seminar: H.E.Lee,  Review on "Graph Star Net for Generalized Multi-Tas...NS-CUK Seminar: H.E.Lee,  Review on "Graph Star Net for Generalized Multi-Tas...
NS-CUK Seminar: H.E.Lee, Review on "Graph Star Net for Generalized Multi-Tas...
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

NS-CUK Seminar:H.B.Kim, Review on "Asymmetric transitivity preserving graph embedding", KDD 2016

  • 1. Ho-Beom Kim Network Science Lab Dept. of Mathematics The Catholic University of Korea E-mail: hobeom2001@catholic.ac.kr 2023 / 06 / 26 OU, Mingdong, et al.
  • 2. 2  Introduction • Contributions  Methodology  Related work  Experiments  Results  Discussion  Conclusion
  • 3. 3 1. Introduction Problem Statements • Nowadays, more and more applications are based on larger and larger networks and graphs. • Most of the existing graph embedding methods target on undirected graphs • We can not straightforwardly apply undirected graph embedding methods on directed grpahs.
  • 4. 4 1. Introduction Contributions • Propose a high-order proximity preserved embedding (HOPE) method to solve the challenging problem of asymmetric transitivity in directed graph embedding • Derive a general form covering multiple commonly used high-order proximities, enabling the scalable solution of HOPE with generalized SVD • Provide an upper bound on the approximation error of HOPE • Extensive experiments are conducted to verify the usefulness and generality of the learned embedding in various applications
  • 5. 5 1. Introduction Framework of Asymmetric Transitivity Preserving Graph Embedding
  • 6. 6 2. Related Work Graph Embedding • Propose a high-order proximity preserved embedding (HOPE) method to solve the challenging problem of asymmetric transitivity in directed graph embedding • Derive a general form covering multiple commonly used high-order proximities, enabling the scalable solution of HOPE with generalized SVD • Provide an upper bound on the approximation error of HOPE • Extensive experiments are conducted to verify the usefulness and generality of the learned embedding in various applications
  • 7. 7 3. High-Order Proximity Preserved Embedding Notations • 𝐴 ∶ 𝑎𝑑𝑗𝑎𝑐𝑒𝑛𝑐𝑦 𝑚𝑎𝑡𝑟𝑖𝑥 • 𝑠𝑖𝑗 ∶ 𝑝𝑟𝑜𝑥𝑖𝑚𝑖𝑡𝑦 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 𝑣𝑖, 𝑣𝑗 • 𝑈 = 𝑈𝑠, 𝑈𝑡 : 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑚𝑎𝑡𝑟𝑖𝑥, 𝑈𝑠 ∶ 𝑠𝑜𝑢𝑟𝑐𝑒 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑣𝑒𝑐𝑡𝑜𝑟𝑠, 𝑈𝑡: 𝑡𝑎𝑟𝑔𝑒𝑡 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑣𝑒𝑐𝑡𝑜𝑟𝑠 • 𝐾 ∶ 𝑒𝑚𝑏𝑒𝑑𝑑𝑖𝑛𝑔 𝑑𝑖𝑚𝑒𝑛𝑠𝑖𝑜𝑛𝑠
  • 8. 8 3. High-Order Proximity Preserved Embedding Problem Definition • m𝑖𝑛 ∥ 𝑆 − 𝑈𝑠 ∙ 𝑈𝑡𝑇 ∥𝐹 2 : loss function
  • 9. 9 3. High-Order Proximity Preserved Embedding High order proximities • 𝑆 = 𝑀𝑔 −1 ∙ 𝑀𝑙 • Katz Index : An ensemble of all paths, which is a weighted summation over the path set between two vertexes • Weight of a path : exponential function of its length • 𝑆𝐾𝑎𝑡𝑧 = 𝑙=1 ∞ 𝛽 ∙ 𝐴𝑙 = 𝛽 ∙ 𝐴 ∙ 𝑆𝐾𝑎𝑡𝑧 + 𝛽 ∙ 𝐴 • 𝛽 : decay parameter • 𝑆𝐾𝑎𝑡𝑧 = (𝐼 − 𝛽 ∙ A) −1 ∙ 𝛽 ∙A • 𝑀𝑔 = 𝐼 − 𝛽 ∙ A • 𝑀𝑙 = 𝛽 ∙ A
  • 10. 10 3. High-Order Proximity Preserved Embedding High order proximities • 𝑆 = 𝑀𝑔 −1 ∙ 𝑀𝑙 • 𝑀𝑔, 𝑀𝑙 : polynomial matrices • Katz Index : An ensemble of all paths, which is a weighted summation over the path set between two vertexes • Weight of a path : exponential function of its length • 𝑆𝐾𝑎𝑡𝑧 = 𝑙=1 ∞ 𝛽 ∙ 𝐴𝑙 = 𝛽 ∙ 𝐴 ∙ 𝑆𝐾𝑎𝑡𝑧 + 𝛽 ∙ 𝐴 • 𝛽 : decay parameter • 𝑆𝐾𝑎𝑡𝑧 = (𝐼 − 𝛽 ∙ A) −1 ∙ 𝛽 ∙A • 𝑀𝑔 = 𝐼 − 𝛽 ∙ A • 𝑀𝑙 = 𝛽 ∙ A
  • 11. 11 3. High-Order Proximity Preserved Embedding High order proximities • Rooted PageRank (RPR) • 𝑆𝑖𝑗 𝑅𝑃𝑅 ∶ 𝑡ℎ𝑒 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑡ℎ𝑎𝑡 𝑎 𝑟𝑎𝑛𝑑𝑜𝑚 𝑤𝑎𝑙𝑘 𝑓𝑟𝑜𝑚 𝑛𝑜𝑑𝑒 𝑣𝑖 𝑤𝑖𝑙𝑙 𝑙𝑜𝑐𝑎𝑡𝑒 𝑎𝑡 𝑣𝑗𝑖𝑛 𝑡ℎ𝑒 𝑠𝑡𝑒𝑎𝑑𝑦 𝑠𝑡𝑎𝑡𝑒 • 𝛼 : jumps to one of the neighbor of current node with probability , jumps back probability • 𝑃 ∶ 𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑖𝑡𝑦 𝑡𝑟𝑎𝑛𝑠𝑖𝑡𝑖𝑜𝑛 𝑚𝑎𝑡𝑟𝑖𝑥 𝑠𝑎𝑡𝑖𝑠𝑓𝑦𝑖𝑛𝑔 𝑖=1 𝑁 𝑃𝑖𝑗 = 1 • 𝑆𝑖𝑗 𝑅𝑃𝑅 = 1 − 𝛼 ∙ 𝐼 − 𝛼𝑃 −1 • 𝑀𝑔 = 𝐼 − 𝛼𝑃 • 𝑀𝑙 = (1 − 𝛼) ∙ I
  • 12. 12 3. High-Order Proximity Preserved Embedding High order proximities • Common Neighbors (CN) • 𝑆𝑖𝑗 𝐶𝑁 ∶ 𝐶𝑜𝑢𝑛𝑡𝑠 𝑡ℎ𝑒 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑣𝑏𝑒𝑟𝑡𝑒𝑥𝑒𝑠 𝑐𝑜𝑛𝑛𝑒𝑐𝑡𝑖𝑛𝑔 𝑡𝑜 𝑏𝑜𝑡ℎ 𝑣𝑖𝑎𝑛𝑑 𝑣𝑗 : The number of vertexes which is target of an edge from 𝑣𝑖 and the source of an edge to 𝑣𝑗 • 𝑆𝐶𝑁 = 𝐴2 • 𝑀𝑔 = 𝐼 • 𝑀𝑙 = 𝐴2
  • 13. 13 3. High-Order Proximity Preserved Embedding High order proximities • Adamic-Adar (AA) • AA is a variant of common neighbors. • AA assigns each neighbor a weight, that is the reciprocal of the degree of the neighbor • The more vertexes one vertex connected to, the less important it is on evaluating the proximity between a pair of vertex • 𝑆𝐴𝐴 = 𝐴 ∙D ∙A • D : diagonal matrix • 𝐷𝑖𝑖 = 1/ 𝑗(𝐴𝑖𝑗 + 𝐴𝑗𝑖) • 𝑀𝑔 = 𝐼 • 𝑀𝑙 = 𝐴 ∙ D ∙ A
  • 14. 14 3. High-Order Proximity Preserved Embedding Approximation of High-Order Proximity • Objective Equation aims to find an optimal rank-K approximation of the proximity matrix S • 𝑠 = 𝑖=1 𝑁 𝜎𝑖𝑉𝑖 𝑆 𝑉𝑖 𝑡𝑇 • Make the SVD on S very expensive, the solution is not feasible for large scale graphs
  • 15. 15 3. High-Order Proximity Preserved Embedding Approximation of High-Order Proximity • Adopt a partial generalized SVD algorithm (JDGSVD) : an iterative Jacobi-Davidson type algorithm which is very scalable when k<< N
  • 16. 16 3. High-Order Proximity Preserved Embedding Approximation Error • {𝜎𝑖} 𝑎𝑟𝑒 𝑡ℎ𝑒 𝑠𝑖𝑛𝑔𝑢𝑙𝑎𝑟 𝑣𝑎𝑙𝑢𝑒𝑠 𝑜𝑓 𝑆 𝑖𝑛 𝑑𝑒𝑠𝑐𝑒𝑛𝑑 𝑜𝑟𝑑𝑒𝑟 • The lower the rank of S is, the smaller the error is
  • 17. 17 4. Experiments Datasets • Synthetic Data (Syn) • Generate the synthetic data by the forest fire model • Can generate powerlaw graphs • As the time com- plexity of computation of some high-order proximities (e.g. Katz, RPR) is too high, we generate small-scaled synthetic data to allow for the computation of high- order proximities • Randomly generate the synthetic datasets • Cora • Citation network of academic papers • Vertexes : academic papers, Directed edges : the citation relationship between papers
  • 18. 18 4. Experiments Datasets • Twitter Social Network (SN-Twitter) • A subnetwork of Twitter • Vertexes : users of Twitter, Directed edges : following relationships between users • Tencent Weibo Social Network (SN-Tweibo) • Contains a subnetwork of the social network in Tencent Weibo, a Twitter-style social network in China • Vertexes : Users, Directed edges : following relationship between users
  • 19. 19 4. Experiments Baseline Methods • LINE : Preserves the first-order and second-order proximity between vertexes. • DeepWalk : Randomly walks on the graph, and assumes a pair of vertexes similar if they are close in the random path. Then, the embedding is learned to preserve these pairwise similarities in the embedding • PPE(Partial Proximity Embedding) : First selects a small subset of vertexes as landmarks, and learns the embedding by approximating the proximity between vertexes and landmarks • Common Neighbors : Rank the links by the number of common neighbors. Use it for link prediction and vertex recommendation • Adamic-Adar : Rank the links by the Adamic-Adar values Use it for link prediction and vertex recommendation
  • 26. 26 4. Experiments Conclusion • we aim to preserve asymmetric transitiv- ity in directed graphs, and propose to preserve asymmetric transitivity by approximating high-order proximities. • We propose a scalable approximation algorithm , called High- Order Proximity preserved Embedding (HOPE). • we first derive a general formulation of a class of high-order proximity measurements, then apply generalized SVD to the general formulation, whose time complexity is linear with the size of graph. • The empirical study demon- strates the superiority of asymmetric transitivity and our proposed algorithm, HOPE. • Our future direction is to de- velop a nonlinear model to better capture the complex struc- ture of directed graphs.

Editor's Notes

  1. Mg is highly related to global asymmetric transitivity. Mg has a formulation like I−α·B, where α is a parameter and B is a transition matrix. The larger the alpha is, the easier the observed relationship (the edges of the graph) is to be transited in the graph. When α = 0, the observed relationship can just transit in a sub- graph whose range is limited by the order of Ml.
  2. First evaluate the high-order proximity approximation error on two small datasets(Syn, Cora) conduct other appli- cation experiments on two large datasets, i.e. SN-Twitter and SN-TWeibo,
  3. In graph reconstruction, link prediction and vertex rec- ommendation experiments, we adopt Katz as the target high-order proximity for HOPE and PPE methods, as Katz has shown superior performance in related tasks in previ- ous works, to approximate for HOPE and PPE. For PPE, we sample 1000 landmarks. For LINE and DeepWalk, we search the parameters grid search.
  4. RMSE is used to evaluate the approximation error of the proximity approximation algorithms, including HOPE and PPE. The formula of RMSE in our problem NRMSE (Normalized RMSE) is used to evaluate the rel- ative error of the proximity approximation algorithms. Precision@k is used to evaluate the performance of link prediction, which measures the prediction precision of top k links. where Ep is the set of top k predicted links, Eo is the set of observed links and | · | represents the size of set. MAP is used to evaluate the performance of vertex rec- ommendation, which measures the rank accuracy of recom- mended vertex list.
  5. 2 High-order Proximity Approximation As we preserve asymmetric transitivity by approximating high-order proximity, the error of approximation can evalu- ate how well we preserve asymmetric transitivity.
  6. As the representation of a graph, embedding vectors are expected to well reconstruct the graph. We evaluate the reconstruction ability on training sets of SN-TWeibo and SN- Twitter. We reconstruct the graph edges based on the recon- structed proximity between vertexes.
  7. As asymmetric transitivity captures the correlation among edges, we can use it to predict missing edges in graphs. On the other hand, the performance of HOPE on link prediction can also reflect how well the asymmetric transitivity is pre- served. This experiment is conducted on SN-TWeibo and SN-TWitter. We train the embedding vectors on training graphs, and evaluate the prediction performance on testing graphs.
  8. The setting of training procedure in this experiment is the same as link prediction. But, we evaluate the per- formance of algorithms from the vertex view, i.e. vertex recommendation performance.