DeepWalk: Online Learning of Social Representations

•Download as PPTX, PDF•

0 likes•883 views

SOYEON KIM

Paper review "DeepWalk: Online Learning of Social Representations", ACM SIG-KDD 2014

Data & Analytics

DeepWalk: Online Learning of
Social Representations
Bryan Perozzi, Rami Al-Rfou, Steven Skiena
ACM SIG-KDD 2014
1

Outline
• Introduction: Graphs as Features
• Language Modeling
• DeepWalk
• Evaluation: Network Classification
• Conclusions & Future Work
2

Introduction
• Deep Learning을 이용해 graph의 인접 행렬의
latent representation(social relations-> vector
space)을 학습함
3

Introduction
• 소셜 네트워크(커뮤니티) 상의 유저간의
상호작용을 vector space에서 표현하여 학습
모델에 적용
4
Zachary’s Karate Network

Language modeling
• Corpus 안에서 특정 word
sequence가 얼마나 나오는지
• 문서상에서 word의 latent
representation을
학습함(word co-occurrence):
– word2vec:
• 단어의 semantic한 의미를
나타낼 수 있다
5

From language modeling to graphs
• 자연어 corpus에서 단어의
빈도가 power law를 따른다
• scale-free graph의 Random
walk에서 vertex frequency
역시 power law를 따른다
6

From language modeling to graphs
• 각각의 random walk는 한
문장을 나타낸다
• Short random walks =
sentences
7

Random Walks
1. 각각의 vertex(root)에서 𝛾 개의 random
walks 를 생성한다
2. 각 random walk의 길이는 𝑡 이다
3. vertex의 이웃들로부터 다음으로 방문할
vertex를 선택한다(uniformly sample)
9

Representation Mapping
• 𝑣1 을 representation
vector에 mapping한다
-> Φ(𝑣1)
• 이웃 vertex들을 vector
space상에 mapping한다.
11
Maximize: Pr 𝑣3 Φ 𝑣1
Pr 𝑣5 Φ 𝑣1

Skip-gram model
12
한 문장에서 window size (w) 내에 등장하는
단어들의 동시 발생 확률을 최대화하는
언어 모델
Φ(𝑉1)
Maximize: Pr 𝑣3 Φ 𝑣1
Pr 𝑣5 Φ 𝑣1

Hierarchical Softmax
• Pr 𝑣3 Φ 𝑣1 을 계산하는 데에는 vertex의
수 만큼의 operation이 필요하다 -> 𝑂(𝑉)
14
• vertex를 binary tree에 표현
• Pr 𝑣3 Φ 𝑣1 를
최대화한다 = root에서
node까지 가는 path의 발생
확률을 최대화한다
• 𝑪 𝟏, 𝑪 𝟐, 𝑪 𝟑는 logistic binary
classifier
• 따라서, 𝑂(𝑉) -> 𝑂 log 𝑉

Learning
• Learned parameters:
– Vertex representations
– Tree binary classifiers weights
• vertex representation은 먼저 random하게
초기화한다.
• 𝑪 𝟏, 𝑪 𝟐, 𝑪 𝟑와 같은 classifier가 loss function을
계산한다
• Stochastic Gradient Descent(SGD) 를 이용해
parameter를 동시에 update한다.
15

Experiments
• Node Classification
– graph의 node중 일부만 label을 가지고 있을 때,
label이 없는 node들을 예측한다
• DataSet
– BlogCatalog
– Flickr
– YouTube
• Baselines
– SpectralClustering, MaxModularity, EdgeCluster(k-
means), weighted vote Relational Neighbor(wvRN)
17

Results: BlogCatalog
• DeepWalk 는 label이 있는 node가 적은 data에서도 잘
동작한다
18

Results: Youtube
• Scalable on very large graph!
19

Parallelization
• 모델 학습 과정에서 parameter를 공유하지
않으므로 각 부분을 병렬처리해도 성능에 영향을
주지 않는다
20

Conclusions
• Network로 표현되는 데이터를 continuous
vector space 상에서 표현하여 학습이
가능하다.
• Word sequence들을 graph로 표현하여
language model에 사용 가능하다.
• label이 부족한 경우에도 잘 작동하다.
• 큰 graph에도 Scalable하기 때문에 online
learning에 사용 가능하다
21

What's hot

30th コンピュータビジョン勉強会@関東 DynamicFusionHiroki Mizuno

MIRU2013チュートリアル：SIFTとそれ以降のアプローチHironobu Fujiyoshi

십분딥러닝_17_DIM(Deep InfoMax)HyunKyu Jeon

[DL輪読会]Deep High-Resolution Representation Learning for Human Pose EstimationDeep Learning JP

MediaPipeの紹介emakryo

transformer解説~Chat-GPTの源流~MasayoshiTsutsui

Recurrent Neural NetworksSeiya Tokui

[DL輪読会]NeRF: Representing Scenes as Neural Radiance Fields for View SynthesisDeep Learning JP

Neural scene representation and rendering の解説（第3回3D勉強会@関東）Masaya Kaneko

ICCV 2019 論文紹介 (26 papers)Hideki Okada

[DL輪読会]Life-Long Disentangled Representation Learning with Cross-Domain Laten...Deep Learning JP

【DL輪読会】HexPlaneとK-PlanesDeep Learning JP

[DL輪読会]End-to-end Recovery of Human Shape and PoseDeep Learning JP

Soft Rasterizer: A Differentiable Renderer for Image-based 3D ReasoningKohei Nishimura

[DL輪読会]BANMo: Building Animatable 3D Neural Models from Many Casual VideosDeep Learning JP

딥러닝을 이용한 얼굴 인식if kakao

スパースモデリング入門Hideo Terada

（文献紹介）深層学習による動被写体ロバストなカメラの動き推定Morpho, Inc.

【輪読会】Learning Continuous Image Representation with Local Implicit Image Funct...Deep Learning JP

[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video Processing (NeRF...Deep Learning JP

What's hot (20)

30th コンピュータビジョン勉強会@関東 DynamicFusion

MIRU2013チュートリアル：SIFTとそれ以降のアプローチ

십분딥러닝_17_DIM(Deep InfoMax)

[DL輪読会]Deep High-Resolution Representation Learning for Human Pose Estimation

MediaPipeの紹介

transformer解説~Chat-GPTの源流~

Recurrent Neural Networks

[DL輪読会]NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

Neural scene representation and rendering の解説（第3回3D勉強会@関東）

ICCV 2019 論文紹介 (26 papers)

[DL輪読会]Life-Long Disentangled Representation Learning with Cross-Domain Laten...

【DL輪読会】HexPlaneとK-Planes

[DL輪読会]End-to-end Recovery of Human Shape and Pose

Soft Rasterizer: A Differentiable Renderer for Image-based 3D Reasoning

[DL輪読会]BANMo: Building Animatable 3D Neural Models from Many Casual Videos

딥러닝을 이용한 얼굴 인식

スパースモデリング入門

（文献紹介）深層学習による動被写体ロバストなカメラの動き推定

【輪読会】Learning Continuous Image Representation with Local Implicit Image Funct...

[DL輪読会]Neural Radiance Flow for 4D View Synthesis and Video Processing (NeRF...

Viewers also liked

Convolutional Neural Networks on Graphs with Fast Localized Spectral FilteringSOYEON KIM

Visual-Textual Joint Relevance Learning for Tag-Based Social Image SearchSOYEON KIM

A survey of heterogeneous information network analysisSOYEON KIM

Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...SOYEON KIM

Deep learning based multi-omics integration, a surveySOYEON KIM

Semi-automatic ground truth generation using unsupervised clustering and limi...SOYEON KIM

Translated learningSOYEON KIM

Self taught clusteringSOYEON KIM

TAG BASED IMAGE SEARCH BY SOCIAL RE-RANKINGNexgen Technology

Image re ranking systemveningstonk

[DL輪読会]Unsupervised Learning of 3D Structure from ImagesDeep Learning JP

[DL輪読会]Learning What and Where to Draw (NIPS’16)Deep Learning JP

[DL輪読会]Learning convolutional neural networks for graphsDeep Learning JP

[DL輪読会]TREE-STRUCTURED VARIATIONAL AUTOENCODERDeep Learning JP

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...Deep Learning JP

[DL輪読会]Image-to-Image Translation with Conditional Adversarial NetworksDeep Learning JP

Text extraction from natural scene image, a surveySOYEON KIM

[DL輪読会]QUASI-RECURRENT NEURAL NETWORKSDeep Learning JP

[DL輪読会]Exploiting Cyclic Symmetry in Convolutional Neural NetworksDeep Learning JP

[DL輪読会]Semi supervised qa with generative domain-adaptive netsDeep Learning JP

Viewers also liked (20)

Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering

Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search

A survey of heterogeneous information network analysis

Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...

Deep learning based multi-omics integration, a survey

Semi-automatic ground truth generation using unsupervised clustering and limi...

Translated learning

Self taught clustering

TAG BASED IMAGE SEARCH BY SOCIAL RE-RANKING

Image re ranking system

[DL輪読会]Unsupervised Learning of 3D Structure from Images

[DL輪読会]Learning What and Where to Draw (NIPS’16)

[DL輪読会]Learning convolutional neural networks for graphs

[DL輪読会]TREE-STRUCTURED VARIATIONAL AUTOENCODER

[DL輪読会]Combining Fully Convolutional and Recurrent Neural Networks for 3D Bio...

[DL輪読会]Image-to-Image Translation with Conditional Adversarial Networks

Text extraction from natural scene image, a survey

[DL輪読会]QUASI-RECURRENT NEURAL NETWORKS

[DL輪読会]Exploiting Cyclic Symmetry in Convolutional Neural Networks

[DL輪読会]Semi supervised qa with generative domain-adaptive nets

Similar to DeepWalk: Online Learning of Social Representations

소프트웨어 2.0을 활용한 게임 어뷰징 검출정주 김

Oop design principleRyan Park

자바 직렬화 (Java serialization)중선 곽

Scala, ScalabilityDongwook Lee

ScalabilityDongwook Lee

Spark overview 이상훈(SK C&C)_스파크 사용자 모임_20141106SangHoon Lee

Attention is all you needHoon Heo

AWS CLOUD 2018- Amazon Neptune, 신규 그래프 데이터베이스 서비스 (김상필 솔루션즈 아키텍트)Amazon Web Services Korea

Papago/N2MT 개발이야기NAVER D2

1st cardano korea_meetupWangjea Lee

5일차.map reduce 활용주영 송

Imagination-Augmented Agents for Deep Reinforcement Learning성재 최

Mahout주영 송

알기쉬운 Variational autoencoder홍배 김

[14.10.21] Far Cry and DX9 번역(shaderstudy)해강

Ndc12 이창희 render_pipelinechangehee lee

NDC08_실시간비주얼그래프편집noerror

[224] backend 개발자의 neural machine translation 개발기 김상경NAVER D2

Uml 세미나Daniel Shin

파이콘 한국 2019 튜토리얼 - LRP (Part 2)XAIC

Similar to DeepWalk: Online Learning of Social Representations (20)

소프트웨어 2.0을 활용한 게임 어뷰징 검출

Oop design principle

자바 직렬화 (Java serialization)

Scala, Scalability

Scalability

Spark overview 이상훈(SK C&C)_스파크 사용자 모임_20141106

Attention is all you need

AWS CLOUD 2018- Amazon Neptune, 신규 그래프 데이터베이스 서비스 (김상필 솔루션즈 아키텍트)

Papago/N2MT 개발이야기

1st cardano korea_meetup

5일차.map reduce 활용

Imagination-Augmented Agents for Deep Reinforcement Learning

Mahout

알기쉬운 Variational autoencoder

[14.10.21] Far Cry and DX9 번역(shaderstudy)

Ndc12 이창희 render_pipeline

NDC08_실시간비주얼그래프편집

[224] backend 개발자의 neural machine translation 개발기 김상경

Uml 세미나

파이콘 한국 2019 튜토리얼 - LRP (Part 2)

More from SOYEON KIM

Network-based machine learning approach for aggregating multi-modal dataSOYEON KIM

Revealing disease-associated pathways by network integration of untargeted me...SOYEON KIM

Systems genetics approaches to understand complex traitsSOYEON KIM

Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...SOYEON KIM

Network embeddingSOYEON KIM

Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...SOYEON KIM

Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...SOYEON KIM

Opinion Fraud Detection in Online Reviews by Network EffectsSOYEON KIM

Evaluating color descriptors for object and scene recognitionSOYEON KIM

Outcome-guided mutual information networks for investigating gene-gene intera...SOYEON KIM

Spectral clusteringSOYEON KIM

Sentiwordnet: A publicly available lexical resource for opinion miningSOYEON KIM

Opinion spam and analysisSOYEON KIM

Investigating the Effectiveness of E-mail Spam Image Data for Phone Spam Imag...SOYEON KIM

Graph-based KNN Algorithm for Spam SMS DetectionSOYEON KIM

Deep belief networks for spam filteringSOYEON KIM

A study on the spacio temporal trend of brand index using twitter messages se...SOYEON KIM

A method to improve survival prediction using mutual information based networkSOYEON KIM

More from SOYEON KIM (18)

Network-based machine learning approach for aggregating multi-modal data

Revealing disease-associated pathways by network integration of untargeted me...

Systems genetics approaches to understand complex traits

Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...

Network embedding

Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...

Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...

Opinion Fraud Detection in Online Reviews by Network Effects

Evaluating color descriptors for object and scene recognition

Outcome-guided mutual information networks for investigating gene-gene intera...

Spectral clustering

Sentiwordnet: A publicly available lexical resource for opinion mining

Opinion spam and analysis

Investigating the Effectiveness of E-mail Spam Image Data for Phone Spam Imag...

Graph-based KNN Algorithm for Spam SMS Detection

Deep belief networks for spam filtering

A study on the spacio temporal trend of brand index using twitter messages se...

A method to improve survival prediction using mutual information based network

DeepWalk: Online Learning of Social Representations

1. DeepWalk: Online Learning of Social Representations Bryan Perozzi, Rami Al-Rfou, Steven Skiena ACM SIG-KDD 2014 1

2. Outline • Introduction: Graphs as Features • Language Modeling • DeepWalk • Evaluation: Network Classification • Conclusions & Future Work 2

3. Introduction • Deep Learning을 이용해 graph의 인접 행렬의 latent representation(social relations-> vector space)을 학습함 3

4. Introduction • 소셜 네트워크(커뮤니티) 상의 유저간의 상호작용을 vector space에서 표현하여 학습 모델에 적용 4 Zachary’s Karate Network

5. Language modeling • Corpus 안에서 특정 word sequence가 얼마나 나오는지 • 문서상에서 word의 latent representation을 학습함(word co-occurrence): – word2vec: • 단어의 semantic한 의미를 나타낼 수 있다 5

6. From language modeling to graphs • 자연어 corpus에서 단어의 빈도가 power law를 따른다 • scale-free graph의 Random walk에서 vertex frequency 역시 power law를 따른다 6

7. From language modeling to graphs • 각각의 random walk는 한 문장을 나타낸다 • Short random walks = sentences 7

8. Framework 8

9. Random Walks 1. 각각의 vertex(root)에서 𝛾 개의 random walks 를 생성한다 2. 각 random walk의 길이는 𝑡 이다 3. vertex의 이웃들로부터 다음으로 방문할 vertex를 선택한다(uniformly sample) 9

10. Framework 10

11. Representation Mapping • 𝑣1 을 representation vector에 mapping한다 -> Φ(𝑣1) • 이웃 vertex들을 vector space상에 mapping한다. 11 Maximize: Pr 𝑣3 Φ 𝑣1 Pr 𝑣5 Φ 𝑣1

12. Skip-gram model 12 한 문장에서 window size (w) 내에 등장하는 단어들의 동시 발생 확률을 최대화하는 언어 모델 Φ(𝑉1) Maximize: Pr 𝑣3 Φ 𝑣1 Pr 𝑣5 Φ 𝑣1

13. Framework 13

14. Hierarchical Softmax • Pr 𝑣3 Φ 𝑣1 을 계산하는 데에는 vertex의 수 만큼의 operation이 필요하다 -> 𝑂(𝑉) 14 • vertex를 binary tree에 표현 • Pr 𝑣3 Φ 𝑣1 를 최대화한다 = root에서 node까지 가는 path의 발생 확률을 최대화한다 • 𝑪 𝟏, 𝑪 𝟐, 𝑪 𝟑는 logistic binary classifier • 따라서, 𝑂(𝑉) -> 𝑂 log 𝑉

15. Learning • Learned parameters: – Vertex representations – Tree binary classifiers weights • vertex representation은 먼저 random하게 초기화한다. • 𝑪 𝟏, 𝑪 𝟐, 𝑪 𝟑와 같은 classifier가 loss function을 계산한다 • Stochastic Gradient Descent(SGD) 를 이용해 parameter를 동시에 update한다. 15

16. Framework 16

17. Experiments • Node Classification – graph의 node중 일부만 label을 가지고 있을 때, label이 없는 node들을 예측한다 • DataSet – BlogCatalog – Flickr – YouTube • Baselines – SpectralClustering, MaxModularity, EdgeCluster(k- means), weighted vote Relational Neighbor(wvRN) 17

18. Results: BlogCatalog • DeepWalk 는 label이 있는 node가 적은 data에서도 잘 동작한다 18

19. Results: Youtube • Scalable on very large graph! 19

20. Parallelization • 모델 학습 과정에서 parameter를 공유하지 않으므로 각 부분을 병렬처리해도 성능에 영향을 주지 않는다 20

21. Conclusions • Network로 표현되는 데이터를 continuous vector space 상에서 표현하여 학습이 가능하다. • Word sequence들을 graph로 표현하여 language model에 사용 가능하다. • label이 부족한 경우에도 잘 작동하다. • 큰 graph에도 Scalable하기 때문에 online learning에 사용 가능하다 21

22. Thank you! Q & A 22

DeepWalk: Online Learning of Social Representations

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to DeepWalk: Online Learning of Social Representations

Similar to DeepWalk: Online Learning of Social Representations (20)

More from SOYEON KIM

More from SOYEON KIM (18)

DeepWalk: Online Learning of Social Representations