SlideShare a Scribd company logo
Understanding
Large social network
LINE: Large-scale Information Network Embedding
Why learn representation of nodes?
Tasks like node classification , link prediction, node visualization can be
solved easily using simple machine learning techniques once we
analyze the large scale graphs using the deep learning techniques.
Why LINE?
Most existing graph embedding algorithms are designed to preserve this first-
order proximity, e.g., IsoMap and Laplacian eigenmap , even if they do not
scale.
As a complement, we explore the second-order proximity between the vertices,
which is not determined through the observed tie strength but through the
shared neighborhood structures of the vertices.
The Line optimizes
● a carefully designed objective function that preserves both the local and
global network structures.
● An edge-sampling algorithm is proposed that addresses the limitation of the
classical stochastic gradient descent and improves both the effectiveness and
the efficiency of the inference.
Empirical experiments prove the effectiveness of the LINE on a variety of real-
world information networks,
Some Definitions.
(Information Network) An information network is defined as G = (V, E), where V is
the set of vertices, each representing a data object and E is the set of edges
between the vertices, each representing a relationship between two data objects.
Each edge e ∈ E is an ordered pair e = (u, v) and is associated with a weight w uv
> 0, which indicates the strength of the relation. If G is undirected, we have (u, v) ≡
(v, u) and w uv ≡ w vu ; if G is directed, we have (u, v) ≡ (v, u) and w uv ≡ w vu .
Some Definitions
(First-order Proximity) The first-order proximity in a network is the local pairwise
proximity between two vertices. For each pair of vertices linked by an edge (u,
v), the weight on that edge, wuv, indicates the first-order proximity between u
and v. If no edge is observed between u and v, their first-order proximity is 0.
(Second-order Proximity) The second-order proximity between a pair of vertices
(u, v) in a network is the similarity between their neighborhood network
structures. Mathematically, let pu = (wu,1, . . . , wu,|V |) denote the first-order
proximity of u with all the other vertices, then the second-order proximity
between u and v is determined by the similarity between pu and pv. If no
vertex is linked from/to both u and v, the second-order proximity between u
and v is 0.
LINE
(Large-scale Information Network Embedding) Given a large network G = (V, E),
the problem of Large-scale Information Network Embedding aims to represent
each vertex v ∈ V into a low-dimensional space R d , i.e., learning a function fG : V
→ R d , where d ≪ |V |. In the space R d , both the first-order proximity and the
second-order proximity between the vertices are preserved.
Related Work
Various methods of graph embedding have been proposed in the machine learning literature. They
generally perform well on smaller networks. The problem becomes much more challenging when a
real world information network is concerned, which typically contains millions of nodes and billions of
edges. For example, the Twitter followee-follower network contains 175 million active users and
around twenty billion edges in 2012 . Most existing graph embedding algorithms do not scale for
networks of this size. The time complexity of classical graph embedding algorithms such as MDS,
IsoMap, Laplacian eigenmap are at least quadratic to the number of vertices, which is too
expensive for networks with millions of nodes.
Another parallel work is deep walk, DeepWalk uses random walks to expand
the neighborhood of a vertex, which is analogical to a depth-first search. We
use a breadth- first search strategy, which is a more reasonable approach to
the second-order proximity
Model Description.
LINE with First-order Proximity
Where
p1(·, ·) -> a distribution over the space V × V
ui ∈ Rd is the low-dimensional vector representation of vertex v i .
The empirical probability of p1 would be :
To preserve the first-order proximity, a straightforward way is to minimize the
distance between two distributions. We choose to minimize the KL-divergence of
two probability distributions
LINE with Second-order Proximity:
Where
u i is the representation of v i when it is treated as a vertex while u ′ i is the representation of v i when it is treated
as a specific “context”.
|V | is the number of vertices or “contexts.”
w ij is the weight of the edge (i, j)
d i is the outdegree of vertex i,
The empirical probability of p2 would be :
To preserve the second-order proximity, a straightforward way is to minimize the
distance between two distributions. We choose to minimize the KL-divergence of
two probability distributions
Optimizing the second order Proximity
The approach of negative sampling
σ(x) = 1/(1 + exp(−x)) is the sigmoid function
We adopt the asynchronous stochastic gradient algorithm (ASGD) for optimizing the above Eqn. . In each
step, the ASGD algorithm samples a mini-batch of edges and then updates the model parameters. If an
edge (i, j) is sampled, the gradient w.r.t. the embedding vector u i of vertex i will
be calculated as:
Combining the first order and second order
Proximities
To embed the networks by preserving both the first-order and second-order
proximity, a simple and effective way we find in practice is to train the LINE model
which preserves the first-order proximity and second-order proximity separately
and then concatenate the embeddings trained by the two methods for each vertex.
Links.
Github :
https://github.com/AbhishekKumarSingh/graphEmbeddings/
Youtube :
https://www.youtube.com/playlist?list=PLjh56Mrcs5QxGQgV8JpdKTBXOn_ek3VJm

More Related Content

What's hot

Progressive Meshes
Progressive MeshesProgressive Meshes
Progressive Meshes
Roopesh Jhurani
 
overviewPCA
overviewPCAoverviewPCA
overviewPCA
Edwin Heredia
 
Spacey random walks from Householder Symposium XX 2017
Spacey random walks from Householder Symposium XX 2017Spacey random walks from Householder Symposium XX 2017
Spacey random walks from Householder Symposium XX 2017
Austin Benson
 
Bayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco Scutari
Bayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco ScutariBayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco Scutari
Bayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco Scutari
Bayes Nets meetup London
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
SiddhantVerma49
 
Spectral Clustering
Spectral ClusteringSpectral Clustering
Spectral Clustering
ssusered887b
 
International journal of applied sciences and innovation vol 2015 - no 1 - ...
International journal of applied sciences and innovation   vol 2015 - no 1 - ...International journal of applied sciences and innovation   vol 2015 - no 1 - ...
International journal of applied sciences and innovation vol 2015 - no 1 - ...
sophiabelthome
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction models
tuxette
 
Massive parallelism with gpus for centrality ranking in complex networks
Massive parallelism with gpus for centrality ranking in complex networksMassive parallelism with gpus for centrality ranking in complex networks
Massive parallelism with gpus for centrality ranking in complex networks
ijcsit
 
Spectral graph theory
Spectral graph theorySpectral graph theory
Spectral graph theory
Danushka Bollegala
 
properties, application and issues of support vector machine
properties, application and issues of support vector machineproperties, application and issues of support vector machine
properties, application and issues of support vector machine
Dr. Radhey Shyam
 
Self-Assembling Hyper-heuristics: a proof of concept
Self-Assembling Hyper-heuristics: a proof of conceptSelf-Assembling Hyper-heuristics: a proof of concept
Self-Assembling Hyper-heuristics: a proof of concept
German Terrazas
 
Optics
OpticsOptics
Optics
RohitPaul52
 
a paper reading of table recognition
a paper reading of table recognitiona paper reading of table recognition
a paper reading of table recognition
Ning Lu
 
Transfer learningforclp
Transfer learningforclpTransfer learningforclp
Transfer learningforclp
Danushka Bollegala
 
Topology-conform segmented volume meshing of volume images (Oct 2012)
Topology-conform segmented volume meshing of volume images (Oct 2012)Topology-conform segmented volume meshing of volume images (Oct 2012)
Topology-conform segmented volume meshing of volume images (Oct 2012)
Christian Kehl
 
1
11
Associative Memory Model について
Associative Memory Model についてAssociative Memory Model について
Associative Memory Model について
ohken
 
A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:
A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:
A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:
Sean Golliher
 
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Universitat Politècnica de Catalunya
 

What's hot (20)

Progressive Meshes
Progressive MeshesProgressive Meshes
Progressive Meshes
 
overviewPCA
overviewPCAoverviewPCA
overviewPCA
 
Spacey random walks from Householder Symposium XX 2017
Spacey random walks from Householder Symposium XX 2017Spacey random walks from Householder Symposium XX 2017
Spacey random walks from Householder Symposium XX 2017
 
Bayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco Scutari
Bayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco ScutariBayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco Scutari
Bayes Nets Meetup Sept 29th 2016 - Bayesian Network Modelling by Marco Scutari
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
 
Spectral Clustering
Spectral ClusteringSpectral Clustering
Spectral Clustering
 
International journal of applied sciences and innovation vol 2015 - no 1 - ...
International journal of applied sciences and innovation   vol 2015 - no 1 - ...International journal of applied sciences and innovation   vol 2015 - no 1 - ...
International journal of applied sciences and innovation vol 2015 - no 1 - ...
 
A short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction modelsA short and naive introduction to using network in prediction models
A short and naive introduction to using network in prediction models
 
Massive parallelism with gpus for centrality ranking in complex networks
Massive parallelism with gpus for centrality ranking in complex networksMassive parallelism with gpus for centrality ranking in complex networks
Massive parallelism with gpus for centrality ranking in complex networks
 
Spectral graph theory
Spectral graph theorySpectral graph theory
Spectral graph theory
 
properties, application and issues of support vector machine
properties, application and issues of support vector machineproperties, application and issues of support vector machine
properties, application and issues of support vector machine
 
Self-Assembling Hyper-heuristics: a proof of concept
Self-Assembling Hyper-heuristics: a proof of conceptSelf-Assembling Hyper-heuristics: a proof of concept
Self-Assembling Hyper-heuristics: a proof of concept
 
Optics
OpticsOptics
Optics
 
a paper reading of table recognition
a paper reading of table recognitiona paper reading of table recognition
a paper reading of table recognition
 
Transfer learningforclp
Transfer learningforclpTransfer learningforclp
Transfer learningforclp
 
Topology-conform segmented volume meshing of volume images (Oct 2012)
Topology-conform segmented volume meshing of volume images (Oct 2012)Topology-conform segmented volume meshing of volume images (Oct 2012)
Topology-conform segmented volume meshing of volume images (Oct 2012)
 
1
11
1
 
Associative Memory Model について
Associative Memory Model についてAssociative Memory Model について
Associative Memory Model について
 
A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:
A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:
A Unifying Probabilistic Perspective for Spectral Dimensionality Reduction:
 
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
 

Viewers also liked

Coursera_PMP
Coursera_PMPCoursera_PMP
Coursera_PMP
Aditya Grover
 
Writing Test Cases with PHPUnit
Writing Test Cases with PHPUnitWriting Test Cases with PHPUnit
Writing Test Cases with PHPUnit
Shouvik Chatterjee
 
NodeJS Microservices, Built it Now, Scale it Later!
NodeJS Microservices, Built it Now, Scale it Later!NodeJS Microservices, Built it Now, Scale it Later!
NodeJS Microservices, Built it Now, Scale it Later!
Lalit Shandilya
 
Php psr standard 2014 01-22
Php psr standard 2014 01-22Php psr standard 2014 01-22
Php psr standard 2014 01-22
Võ Duy Tuấn
 
React introduction
React introductionReact introduction
React introduction
Võ Duy Tuấn
 
Scale with Microservices
Scale with MicroservicesScale with Microservices
Scale with Microservices
Võ Duy Tuấn
 
Microservices and docker
Microservices and dockerMicroservices and docker
Microservices and docker
Võ Duy Tuấn
 
Node Architecture Implications for In-Memory Data Analytics on Scale-in Clusters
Node Architecture Implications for In-Memory Data Analytics on Scale-in ClustersNode Architecture Implications for In-Memory Data Analytics on Scale-in Clusters
Node Architecture Implications for In-Memory Data Analytics on Scale-in Clusters
Ahsan Javed Awan
 
Chatbot in Sale Management
Chatbot in Sale ManagementChatbot in Sale Management
Chatbot in Sale Management
Võ Duy Tuấn
 
Business Intelligence in Retail Industry
Business Intelligence in Retail IndustryBusiness Intelligence in Retail Industry
Business Intelligence in Retail Industry
Võ Duy Tuấn
 
Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!
Roelof Pieters
 
Percona XtraDB Cluster
Percona XtraDB ClusterPercona XtraDB Cluster
Percona XtraDB Cluster
Kenny Gryp
 
Real World Lessons on the Pain Points of Node.JS Application
Real World Lessons on the Pain Points of Node.JS ApplicationReal World Lessons on the Pain Points of Node.JS Application
Real World Lessons on the Pain Points of Node.JS Application
Ben Hall
 
China organosilicon industry market demand prospects and investment strategy ...
China organosilicon industry market demand prospects and investment strategy ...China organosilicon industry market demand prospects and investment strategy ...
China organosilicon industry market demand prospects and investment strategy ...
Qianzhan Intelligence
 
Interbrand vianey maya
Interbrand  vianey mayaInterbrand  vianey maya
Interbrand vianey maya
vianey Maya Tello
 
Ostern in griechenland von panagiotis sagali und raphael zarko
Ostern in griechenland von panagiotis sagali und raphael zarkoOstern in griechenland von panagiotis sagali und raphael zarko
Ostern in griechenland von panagiotis sagali und raphael zarkoEine kleine Deutschkiste
 
Национальная Программа "Лучшие социальные проекты России"
Национальная Программа "Лучшие социальные проекты России"Национальная Программа "Лучшие социальные проекты России"
Национальная Программа "Лучшие социальные проекты России"AlexandraObydenova
 
Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008
Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008
Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008
MarcBachrach
 
Audience Reach Measurement Guidelines 1.0
Audience Reach Measurement Guidelines 1.0Audience Reach Measurement Guidelines 1.0
Audience Reach Measurement Guidelines 1.0
Leonardo Naressi
 
Mitosis
MitosisMitosis

Viewers also liked (20)

Coursera_PMP
Coursera_PMPCoursera_PMP
Coursera_PMP
 
Writing Test Cases with PHPUnit
Writing Test Cases with PHPUnitWriting Test Cases with PHPUnit
Writing Test Cases with PHPUnit
 
NodeJS Microservices, Built it Now, Scale it Later!
NodeJS Microservices, Built it Now, Scale it Later!NodeJS Microservices, Built it Now, Scale it Later!
NodeJS Microservices, Built it Now, Scale it Later!
 
Php psr standard 2014 01-22
Php psr standard 2014 01-22Php psr standard 2014 01-22
Php psr standard 2014 01-22
 
React introduction
React introductionReact introduction
React introduction
 
Scale with Microservices
Scale with MicroservicesScale with Microservices
Scale with Microservices
 
Microservices and docker
Microservices and dockerMicroservices and docker
Microservices and docker
 
Node Architecture Implications for In-Memory Data Analytics on Scale-in Clusters
Node Architecture Implications for In-Memory Data Analytics on Scale-in ClustersNode Architecture Implications for In-Memory Data Analytics on Scale-in Clusters
Node Architecture Implications for In-Memory Data Analytics on Scale-in Clusters
 
Chatbot in Sale Management
Chatbot in Sale ManagementChatbot in Sale Management
Chatbot in Sale Management
 
Business Intelligence in Retail Industry
Business Intelligence in Retail IndustryBusiness Intelligence in Retail Industry
Business Intelligence in Retail Industry
 
Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!Deep Learning & NLP: Graphs to the Rescue!
Deep Learning & NLP: Graphs to the Rescue!
 
Percona XtraDB Cluster
Percona XtraDB ClusterPercona XtraDB Cluster
Percona XtraDB Cluster
 
Real World Lessons on the Pain Points of Node.JS Application
Real World Lessons on the Pain Points of Node.JS ApplicationReal World Lessons on the Pain Points of Node.JS Application
Real World Lessons on the Pain Points of Node.JS Application
 
China organosilicon industry market demand prospects and investment strategy ...
China organosilicon industry market demand prospects and investment strategy ...China organosilicon industry market demand prospects and investment strategy ...
China organosilicon industry market demand prospects and investment strategy ...
 
Interbrand vianey maya
Interbrand  vianey mayaInterbrand  vianey maya
Interbrand vianey maya
 
Ostern in griechenland von panagiotis sagali und raphael zarko
Ostern in griechenland von panagiotis sagali und raphael zarkoOstern in griechenland von panagiotis sagali und raphael zarko
Ostern in griechenland von panagiotis sagali und raphael zarko
 
Национальная Программа "Лучшие социальные проекты России"
Национальная Программа "Лучшие социальные проекты России"Национальная Программа "Лучшие социальные проекты России"
Национальная Программа "Лучшие социальные проекты России"
 
Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008
Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008
Marc Bachrach’S Hypnotism Show Pre Show Sjsu 12 2008
 
Audience Reach Measurement Guidelines 1.0
Audience Reach Measurement Guidelines 1.0Audience Reach Measurement Guidelines 1.0
Audience Reach Measurement Guidelines 1.0
 
Mitosis
MitosisMitosis
Mitosis
 

Similar to Line

240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
thanhdowork
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
SiddhantVerma49
 
Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...
Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...
Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...
csandit
 
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKSA NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
gerogepatton
 
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKSA NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
ijaia
 
GraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaperGraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaper
Chiraz Nafouki
 
Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...
Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...
Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...
Vladimir Kulyukin
 
Using spectral radius ratio for node degree
Using spectral radius ratio for node degreeUsing spectral radius ratio for node degree
Using spectral radius ratio for node degree
IJCNCJournal
 
Siegel
SiegelSiegel
Siegel
Joel Siegel
 
Colloquium.pptx
Colloquium.pptxColloquium.pptx
Colloquium.pptx
Mythili680896
 
Spectral Clustering Report
Spectral Clustering ReportSpectral Clustering Report
Spectral Clustering Report
Miaolan Xie
 
Graceful labelings
Graceful labelingsGraceful labelings
Graceful labelings
radhikamathy
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
SiddhantVerma49
 
Face recognition using laplacianfaces (synopsis)
Face recognition using laplacianfaces (synopsis)Face recognition using laplacianfaces (synopsis)
Face recognition using laplacianfaces (synopsis)
Mumbai Academisc
 
ON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRY
ON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRYON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRY
ON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRY
Fransiskeran
 
Lecture 5b graphs and hashing
Lecture 5b graphs and hashingLecture 5b graphs and hashing
Lecture 5b graphs and hashing
Victor Palmar
 
On algorithmic problems concerning graphs of higher degree of symmetry
On algorithmic problems concerning graphs of higher degree of symmetryOn algorithmic problems concerning graphs of higher degree of symmetry
On algorithmic problems concerning graphs of higher degree of symmetry
graphhoc
 
SkNoushadddoja_28100119039.pptx
SkNoushadddoja_28100119039.pptxSkNoushadddoja_28100119039.pptx
SkNoushadddoja_28100119039.pptx
PrakasBhowmik
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
26 spanning
26 spanning26 spanning
26 spanning
Aŕŷa Ķĥaŕel
 

Similar to Line (20)

240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
240401_JW_labseminar[LINE: Large-scale Information Network Embeddin].pptx
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
 
Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...
Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...
Effect of Rigidity on Trilateration Technique for Localization in Wireless Se...
 
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKSA NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
 
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKSA NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
A NEW GENERALIZATION OF EDGE OVERLAP TO WEIGHTED NETWORKS
 
GraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaperGraphSignalProcessingFinalPaper
GraphSignalProcessingFinalPaper
 
Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...
Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...
Connect-the-Dots in a Graph and Buffon's Needle on a Chessboard: Two Problems...
 
Using spectral radius ratio for node degree
Using spectral radius ratio for node degreeUsing spectral radius ratio for node degree
Using spectral radius ratio for node degree
 
Siegel
SiegelSiegel
Siegel
 
Colloquium.pptx
Colloquium.pptxColloquium.pptx
Colloquium.pptx
 
Spectral Clustering Report
Spectral Clustering ReportSpectral Clustering Report
Spectral Clustering Report
 
Graceful labelings
Graceful labelingsGraceful labelings
Graceful labelings
 
Deepwalk vs Node2vec
Deepwalk vs Node2vecDeepwalk vs Node2vec
Deepwalk vs Node2vec
 
Face recognition using laplacianfaces (synopsis)
Face recognition using laplacianfaces (synopsis)Face recognition using laplacianfaces (synopsis)
Face recognition using laplacianfaces (synopsis)
 
ON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRY
ON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRYON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRY
ON ALGORITHMIC PROBLEMS CONCERNING GRAPHS OF HIGHER DEGREE OF SYMMETRY
 
Lecture 5b graphs and hashing
Lecture 5b graphs and hashingLecture 5b graphs and hashing
Lecture 5b graphs and hashing
 
On algorithmic problems concerning graphs of higher degree of symmetry
On algorithmic problems concerning graphs of higher degree of symmetryOn algorithmic problems concerning graphs of higher degree of symmetry
On algorithmic problems concerning graphs of higher degree of symmetry
 
SkNoushadddoja_28100119039.pptx
SkNoushadddoja_28100119039.pptxSkNoushadddoja_28100119039.pptx
SkNoushadddoja_28100119039.pptx
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
26 spanning
26 spanning26 spanning
26 spanning
 

Recently uploaded

Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
co23btech11018
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
gerogepatton
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
IJECEIAES
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
JamalHussainArman
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Christina Lin
 
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
amsjournal
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
NazakatAliKhoso2
 
john krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptxjohn krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptx
Madan Karki
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
gerogepatton
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
171ticu
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
LAXMAREDDY22
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
IJECEIAES
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
Dr Ramhari Poudyal
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
Addu25809
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
MiscAnnoy1
 
Hematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood CountHematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood Count
shahdabdulbaset
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 

Recently uploaded (20)

Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
 
International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...International Conference on NLP, Artificial Intelligence, Machine Learning an...
International Conference on NLP, Artificial Intelligence, Machine Learning an...
 
Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...Advanced control scheme of doubly fed induction generator for wind turbine us...
Advanced control scheme of doubly fed induction generator for wind turbine us...
 
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptxML Based Model for NIDS MSc Updated Presentation.v2.pptx
ML Based Model for NIDS MSc Updated Presentation.v2.pptx
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming PipelinesHarnessing WebAssembly for Real-time Stateless Streaming Pipelines
Harnessing WebAssembly for Real-time Stateless Streaming Pipelines
 
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
UNLOCKING HEALTHCARE 4.0: NAVIGATING CRITICAL SUCCESS FACTORS FOR EFFECTIVE I...
 
Textile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdfTextile Chemical Processing and Dyeing.pdf
Textile Chemical Processing and Dyeing.pdf
 
john krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptxjohn krisinger-the science and history of the alcoholic beverage.pptx
john krisinger-the science and history of the alcoholic beverage.pptx
 
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODELDEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
DEEP LEARNING FOR SMART GRID INTRUSION DETECTION: A HYBRID CNN-LSTM-BASED MODEL
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
 
BRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdfBRAIN TUMOR DETECTION for seminar ppt.pdf
BRAIN TUMOR DETECTION for seminar ppt.pdf
 
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
Redefining brain tumor segmentation: a cutting-edge convolutional neural netw...
 
Literature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptxLiterature Review Basics and Understanding Reference Management.pptx
Literature Review Basics and Understanding Reference Management.pptx
 
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENTNATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
NATURAL DEEP EUTECTIC SOLVENTS AS ANTI-FREEZING AGENT
 
Introduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptxIntroduction to AI Safety (public presentation).pptx
Introduction to AI Safety (public presentation).pptx
 
Hematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood CountHematology Analyzer Machine - Complete Blood Count
Hematology Analyzer Machine - Complete Blood Count
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 

Line

  • 1. Understanding Large social network LINE: Large-scale Information Network Embedding
  • 2. Why learn representation of nodes? Tasks like node classification , link prediction, node visualization can be solved easily using simple machine learning techniques once we analyze the large scale graphs using the deep learning techniques.
  • 3. Why LINE? Most existing graph embedding algorithms are designed to preserve this first- order proximity, e.g., IsoMap and Laplacian eigenmap , even if they do not scale. As a complement, we explore the second-order proximity between the vertices, which is not determined through the observed tie strength but through the shared neighborhood structures of the vertices.
  • 4. The Line optimizes ● a carefully designed objective function that preserves both the local and global network structures. ● An edge-sampling algorithm is proposed that addresses the limitation of the classical stochastic gradient descent and improves both the effectiveness and the efficiency of the inference. Empirical experiments prove the effectiveness of the LINE on a variety of real- world information networks,
  • 5. Some Definitions. (Information Network) An information network is defined as G = (V, E), where V is the set of vertices, each representing a data object and E is the set of edges between the vertices, each representing a relationship between two data objects. Each edge e ∈ E is an ordered pair e = (u, v) and is associated with a weight w uv > 0, which indicates the strength of the relation. If G is undirected, we have (u, v) ≡ (v, u) and w uv ≡ w vu ; if G is directed, we have (u, v) ≡ (v, u) and w uv ≡ w vu .
  • 6. Some Definitions (First-order Proximity) The first-order proximity in a network is the local pairwise proximity between two vertices. For each pair of vertices linked by an edge (u, v), the weight on that edge, wuv, indicates the first-order proximity between u and v. If no edge is observed between u and v, their first-order proximity is 0. (Second-order Proximity) The second-order proximity between a pair of vertices (u, v) in a network is the similarity between their neighborhood network structures. Mathematically, let pu = (wu,1, . . . , wu,|V |) denote the first-order proximity of u with all the other vertices, then the second-order proximity between u and v is determined by the similarity between pu and pv. If no vertex is linked from/to both u and v, the second-order proximity between u and v is 0.
  • 7. LINE (Large-scale Information Network Embedding) Given a large network G = (V, E), the problem of Large-scale Information Network Embedding aims to represent each vertex v ∈ V into a low-dimensional space R d , i.e., learning a function fG : V → R d , where d ≪ |V |. In the space R d , both the first-order proximity and the second-order proximity between the vertices are preserved.
  • 8. Related Work Various methods of graph embedding have been proposed in the machine learning literature. They generally perform well on smaller networks. The problem becomes much more challenging when a real world information network is concerned, which typically contains millions of nodes and billions of edges. For example, the Twitter followee-follower network contains 175 million active users and around twenty billion edges in 2012 . Most existing graph embedding algorithms do not scale for networks of this size. The time complexity of classical graph embedding algorithms such as MDS, IsoMap, Laplacian eigenmap are at least quadratic to the number of vertices, which is too expensive for networks with millions of nodes. Another parallel work is deep walk, DeepWalk uses random walks to expand the neighborhood of a vertex, which is analogical to a depth-first search. We use a breadth- first search strategy, which is a more reasonable approach to the second-order proximity
  • 9. Model Description. LINE with First-order Proximity Where p1(·, ·) -> a distribution over the space V × V ui ∈ Rd is the low-dimensional vector representation of vertex v i . The empirical probability of p1 would be :
  • 10. To preserve the first-order proximity, a straightforward way is to minimize the distance between two distributions. We choose to minimize the KL-divergence of two probability distributions
  • 11. LINE with Second-order Proximity: Where u i is the representation of v i when it is treated as a vertex while u ′ i is the representation of v i when it is treated as a specific “context”. |V | is the number of vertices or “contexts.” w ij is the weight of the edge (i, j) d i is the outdegree of vertex i, The empirical probability of p2 would be :
  • 12. To preserve the second-order proximity, a straightforward way is to minimize the distance between two distributions. We choose to minimize the KL-divergence of two probability distributions
  • 13. Optimizing the second order Proximity The approach of negative sampling σ(x) = 1/(1 + exp(−x)) is the sigmoid function We adopt the asynchronous stochastic gradient algorithm (ASGD) for optimizing the above Eqn. . In each step, the ASGD algorithm samples a mini-batch of edges and then updates the model parameters. If an edge (i, j) is sampled, the gradient w.r.t. the embedding vector u i of vertex i will be calculated as:
  • 14. Combining the first order and second order Proximities To embed the networks by preserving both the first-order and second-order proximity, a simple and effective way we find in practice is to train the LINE model which preserves the first-order proximity and second-order proximity separately and then concatenate the embeddings trained by the two methods for each vertex.