Easing embedding learning by comprehensive transcription of heterogeneous information networks

Easing Embedding Learning by
Comprehensive Transcription of
Heterogeneous Information Network
Yu Shi, Qi Zhu, Fang Guo, Chao Zhang, Jiawei Han
University of Illinois at Urbana-Champaign, Urbana, IL, USA
Presenter: Zhiwei (Jim) Liu
Easing Embedding Learning by Comprehensive Transcription of Heterogeneous Information Networks, KDD’18

Road Map
• Background: Network Embedding + HIN
• Preliminary
• Proposed Model
• Experiment
• Conclusion and Future work
• Q&A

Background: Network Embedding + HIN

𝐺 = (𝑉, 𝐸)
𝜙 𝑣 : 𝑉 → 𝑇𝑉
𝜓 𝑒 : 𝐸 → 𝑇𝐸
𝑇𝑦𝑝𝑒 𝑀𝑎𝑝𝑝𝑖𝑛𝑔 𝐹𝑢𝑛𝑐𝑡𝑖𝑜𝑛𝑠：
|𝑇𝑉 = 1 𝑎𝑛𝑑 |𝑇𝐸 = 1 ∶ 𝐻𝑜𝑚𝑒𝑔𝑒𝑛𝑒𝑜𝑢𝑠
|𝑇𝑉 > 1 𝑜𝑟 |𝑇𝐸 > 1 ∶ 𝐻𝑒𝑡𝑒𝑟𝑜𝑔𝑒𝑛𝑒𝑜𝑢𝑠

Network Embedding
[1] W. Zachary. An information flow model for conflict and fission in small groups1. Journal of anthropological
research, 33(4):452–473, 1977.

DeepWalk
• Algorithm: Random Walk + Skip-gram Model
[1] B. Perozzi, R. Al-Rfou, and S. Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD
international conference on Knowledge discovery and data mining, pages 701–710. ACM, 2014.

LINE
• Algorithm: First-order + Second-order Proximity
[1] J. Tang, M. Qu, M. Wang, M. Zhang, J. Yan, and Q. Mei. LINE: Large-scale Information Network Embedding. In WWW,
2015.
• First-order Proximity:
Local Pairwise Similarity
• Second-order Proximity:
Neighborhood structure
similarity

node2vec
• Algorithm: Random Walk with two balance parameters
[1] Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable Feature Learning for Networks. In ACM SIGKDD.
• Return parameter: p
• In-out parameter: q

Heterogeneous Information Network (HIN)
[1] metapath2vec: Scalable Representation Learning for Heterogeneous Networks

Homogeneous Network Embedding
• No type structure
• No side information
• Types are always compatible?
• …
Heterogeneous Information Network

DeepWalk
• Algorithm: Random Walk + Skip-gram Model
[1] B. Perozzi, R. Al-Rfou, and S. Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD
international conference on Knowledge discovery and data mining, pages 701–710. ACM, 2014.
• Random Walk over the connection
• Only one type of connection
• Only one type of node

Meta-path on HIN
[1] Y. Sun, J. Han, X. Yan, P. S. Yu, and T. Wu, “Pathsim: Meta path- based top-k similarity search in heterogeneous
information networks,” Proceedings of the VLDB Endowment, vol. 4, no. 11, pp. 992–1003, 2011.

Metapath2vec
• Homogeneous Skip-Gram
• Heterogeneous Skip-Gram

Metapath2vec(++)

Incompatibility
• Similar nodes via different meta-paths (connections)
• Jaccard Coefficient

Incompatibility
• Closeness under different
metric
• User-director and user-genre
type is incompatible
• Incompatible connections
cannot be close at the same
time in one metric space

HEER model
• Comprehensive transcription of HINs in embedding learning
• Dealing with the semantic incompatibility of connection in HINs
• Leveraging the edge representation and heterogeneous metrics
• And neural network model for learning both node and edge
representation

Preliminary: Notation and Definition

𝐺 = (𝑉, 𝐸)
𝜙 𝑣 : 𝑉 → 𝑇𝑉
𝜓 𝑒 : 𝐸 → 𝑇𝐸
𝑇𝑦𝑝𝑒 𝑀𝑎𝑝𝑝𝑖𝑛𝑔 𝐹𝑢𝑛𝑐𝑡𝑖𝑜𝑛𝑠：
|𝑇𝑉 = 1 𝑎𝑛𝑑 |𝑇𝐸 = 1 ∶ 𝐻𝑜𝑚𝑒𝑔𝑒𝑛𝑒𝑜𝑢𝑠
|𝑇𝑉 > 1 𝑜𝑟 |𝑇𝐸 > 1 ∶ 𝐻𝑒𝑡𝑒𝑟𝑜𝑔𝑒𝑛𝑒𝑜𝑢𝑠
Preliminary

Preliminary
• Network:𝐺 = (𝒱 , ℰ; 𝜑, 𝜓)
• Network Schema:G~
= (𝒯, ℛ)

Notations
• only one node type can be associated with a certain end of an edge
type
Edge type 𝑟
E.g., Director Fatih Akin living in Germany,
Movie In the Fade being produced in Germany

HIN Embedding Definition
• Given an HIN, 𝐺 = (𝒱 , ℰ; 𝜑, 𝜓), 𝑣 ∈ 𝒱, 𝑢, 𝑣 ∈ ℰ;
• Learning a node embedding mapping, 𝑓 𝑣 : 𝒱 → ℝ 𝑑 𝒱
• Learning an edge embedding mapping, 𝑔(𝑢, 𝑣): 𝒱 × 𝒱 → ℝ 𝑑ℰ
• A node pair can be of multi-type, 𝑔 𝑢, 𝑣 encapsulate such
information

Typed closeness
• Node pair, 𝑢, 𝑣 , edge embedding g 𝑢𝑣,
• 𝜇 𝑟 is an edge-type-specific vector to be inferred which
represents the metric coupled with this type
• Compatible edge types share similar 𝜇 𝑟

Objective Function
• KL-divergence between the original weights and embedding similarity
• Overall objective function

Details in the HEER Model
• Edge embedding
• Node embedding

• Type filter can distinguish the compatibility between edge types

• Type filter can distinguish the compatibility between edge types
• Negative sampling

Experiment: Reconstruction + Case study

Dataset
• DBLP[1]: Bibliographical network
• Five types of nodes: author, paper, key term, venue, and year
• Edge types: author—paper, term—paper, year—paper, venue—paper,
paper—>paper (directed)
• YAGO[2]: Large scale knowledge graph
• Seven types of nodes:person, location, organization, piece ofwork, prize,
position, and event;
• 24 Edge types.
[1] Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. Arnet- miner: extraction and mining of academic social networks. In KDD.
[2] Fabian M Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2007. Yago: a core of semantic knowledge. In WWW

Baselines
• LINE
• AspEm: Old version of HEER, embeddings learned independently for
each aspect(metric)
• Metapath2vec++
• Pretrained + logit: logistic regression model for each edge type
[1] Yu Shi, Huan Gui, Qi Zhu, Lance Kaplan, and Jiawei Han. 2018. AspEm: Embed- ding Learning by Aspects in Heterogeneous
Information Networks.. In SDM.

Edge Reconstruction
• Evaluation method: Mean Reciprocal Rank (MRR)
• Task Goal: Knock-out + Associated by type-𝑟 edge

Experiment analysis
• Modeling Incompatibility benefits embedding quality
• YAGO has much more (sophistic) incompatible types
• Heterogeneous metrics helps improving embedding quality
• HEER more prone to suffering from over-fitting at knock-out rate=0.8

Learned Heterogeneous Metrics (DBLP)

Learned Heterogeneous Metrics (YAGO)

Future Work
• Different metrics but not exact represented
• Heat map: reference with term and the term year relationship

Future Work
• Incompatibility need designing manually

Future Work
• Incompatibility learned from network? Not just “drop-out”
• Edge embedding function is too weak to maintain the edge
information
• More experiment to verify the embedding
• Meta-path incompatibility? (YAGO)
• …

Open discussion
• How to build an graph embedding model leveraging the meta-path
incompatibility?
• Random Walk over meta-paths?
• Probability distribution? E.g. Skip-gram model

Easing embedding learning by comprehensive transcription of heterogeneous information networks

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Easing embedding learning by comprehensive transcription of heterogeneous information networks

Similar to Easing embedding learning by comprehensive transcription of heterogeneous information networks (20)

Recently uploaded

Recently uploaded (20)

Easing embedding learning by comprehensive transcription of heterogeneous information networks