SlideShare a Scribd company logo
Higher-order spectral graph
clustering with motifs
Austin R. Benson
Cornell SCAN seminar
September 18, 2017
Joint work with
David Gleich (Purdue),
Jure Leskovec (Stanford), &
Hao Yin (Stanford)
2
Brains
nodes are neurons
edges are synapses
Social networks
nodes are people
edges are
friendships
Electrical grid
nodes are power plants
edges are transmission
linesTim Meko, Washington Post
Currency
nodes are accounts
edges are transactions
Background. Networks are sets of nodes and edges (graphs)
that model real-world systems.
3
G = (V, E) A To study
relationships in the
network
• centrality
• reachability
• clustering
• and more…
we often use matrix
computations
Just like most other things in life,
I often think about networks as matrices.
Given an undirected graph G = (V, E)
Instead of using its symmetric
adjacency matrix A
consider using the weighted matrix
Hadamard /
element-wise
Given a directed graph G = (V, E)
Instead of using its non-symmetric
adjacency matrix A = B + U
consider using a weighted matrix
what
why
where
The matrix counts the number
of triangles that edges participate in.
The matrix comes up in our higher-order
network clustering framework when using
triangles as the motif.
We often get better empirical results in
real-world networks and better numerical
properties in model partitioning problems
(stochastic block model).
7
Key insight [Flake 2000; Newman 2004, 2006; many others…].
Networks for real-world systems have modules, clusters, communities.
• We want algorithms to uncover the clusters automatically.
• Main idea has been to optimize metrics involving the number of
nodes and edges in a cluster. Conductance, modularity, density, ratio cut,
…
Co-author network
Brain network, de Reus et al., RSTB,
2014.
Background. Networks are sets of nodes and edges (graphs)
that model real-world systems.
8
Key insight [Milo+02].
Networks modelling real-world systems
contain certain small subgraphs patterns
way more frequently than expected.
[Mangan+ 2003; Alon 2007]
Signed feed-forward loops
in genetic transcription.
Triangles in social
relationships.
[Simmel 1908;
Rapoport 1953;
Granovetter 1973]
Bi-directed length-2 paths
in brain networks.
[Sporns-Kötter 2004;
Sporns+ 2007; Honey+ 2007]
We call these small subgraph patterns motifs.
Background. Networks are sets of nodes and edges (graphs)
that model real-world systems.
9
Motifs are the fundamental units of
complex networks.
We should design our
clustering algorithms around motifs.
Network
Motif
Different motifs give different clusters.
10
Higher-order graph clustering is our technique for finding
coherent groups of nodes based on motifs.
Conductance is one of the most important cluster quality scores [Schaeffer
2007]
used in Markov chain theory, spectral clustering, bioinformatics, vision, etc.
The conductance of a set of vertices S is the ratio of
edges leaving to total edges
small conductance  good cluster
(edges leaving S)
(edge end points in S)
11
S S
Background. Graph clustering and conductance.
Cheeger Inequality
Finding the best conductance set is NP-hard. 
• Cheeger realized the eigenvalues of the Laplacian
provided surface area to volume bounds in
manifolds.
• Alon and Milman independently realized the same
thing for a graph (conductance)!
Laplacian
12
[Cheeger 1970; Alon-Milman 1985]
Background. Spectral clustering has theoretical guarantees.
We can find a set S that achieves the
Cheeger bound.
1. Compute the eigenvector z associated with
λ2 and scale to f = D-1/2z
2. Sort the vertices by their values in f:
σ1, σ2, …, σn
3. Let Sr = {σ1, …, σr} and compute the
conductance of φ(Sr) of each Sr.
4. Pick the set Sm with minimum
conductance.
13
[Mihail 1989; Chung 1992]
0 20 40
0
0.2
0.4
0.6
0.8
1
Si
fi
Background. The sweep cut realizes the guarantee.
0 20 40
0
0.2
0.4
0.6
0.8
1
Si
fi
14
Background. The sweep cut visualized.
15
Signed feed-forward loops in genetic transcription
[Mangan+03]
• Gene X activates transcription in gene Y.
• Gene X suppresses transcription in gene Z.
• Gene Y suppresses transcription in gene Z.X
Y
Z
Spectral clustering is theoretically justified for undirected graphs
• Various extensions to multiple clusters
[Ng-Jordan-Weiss 2002; Dhillon-Guan-Kulis 2004; Lee-Gharan-Trevisan
2014]
• Weighted graphs are okay (weighted cut, weighted volume)
• Approximate eigenvectors are okay [Mihail89; Fairbanks+ 2017]
Current network models are more richly annotated
• directed, signed, colored, layered, multiplex, higher-order, etc.
Modern datasets are much more rich than the simple networks
for which spectral clustering is justified.
[Benson-Gleich-Leskovec 2016; Yin-Benson-Leskovec-Gleich 2017]
16
• A generalized conductance metric for motifs.
• A “new” spectral clustering algorithm to
minimize the generalized conductance.
• AND an associated motif Cheeger inequality
guarantee.
• Several applications
Aquatic layers in food webs, functional groups in
genetic regulation Hub structure in transportation.
• Extensions to localized clustering.
Our contributions.
New and preliminary! Studies on stochastic block models.
Need new notions of cut and volume…
17
M = triangle motif
Motif conductance.
Motif M
18
9
10
3
5
6
7
8
1
2
4
9
10
3
5
6
7
8
1
2
4
There is a symmetric matrix that is appropriate for studying motif
conductance—even if the network and motif are directed.
1
11
1
1
1
3
1
1
1
1
1
2
1
1
19
9
10
3
5
6
7
8
1
2
4
motif M
graph G weighted graph W(M)
bidirectional edges
unidirectional links
There is a symmetric matrix that is appropriate for studying motif
conductance—even if the network and motif are directed.
20
Key insight. [Benson-Gleich-Leskovec
2016]
Classical spectral clustering on
weighted graph W(M) finds clusters of
low motif conductance.
Using matrix tools for higher-order clustering.
1
11
1
1
1
3
1
1
1
1
1
2
1
1
weighted graph W(M)
1. Pick your favorite motif.
2. Form weighted matrix W(M).
3. Compute the Fiedler eigenvector f(M)
associated with λ2 of the normalized
Laplacian matrix of W(M).
4. Run a sweep on f(M)
21
f(M)
Higher-order spectral clustering.
22
(there are faster algorithms to
compute these matrices, but this
is a useful jumping off point.)
There are nice matrix computations for 3-node motifs.
bidirection
al
unidirection
al
23
There are nice matrix computations for 3-node motifs.
Theorem.
If the motif has three nodes, then
the sweep procedure on the
weighted graph finds a set S of
nodes for which
24
Key Proof Step
Implication.
Just run spectral clustering with
the weighted matrix coming from
your favorite motif.
The three-node motif Cheeger inequality.
25
Awesome advantages!
• Works for arbitrary non-negative combos
of motifs and weighted motifs, too.
• We inherit 40+ years of research!
• Fast algorithms (ARPACK, etc.)
• Local methods!
[Yin-Benson-Leskovec-Gleich 2017]
• Overlapping!
via [Whang-Dillon-Gleich 2015]
• Easy to implement 
• Scalable (1.4B edges graphs
are not a problem)
bit.ly/motif-spectral-julia
26
1. We do not know the motif of interest.
food webs and new applications
2. We know the motif of interest from domain knowledge.
yeast transcription regulation networks, connectome, social networks
3. We seek richer information from our data.
transportation networks and new applications
Lots of fun applications.
Florida bay food web
• Nodes are species.
• Edges represent carbon exchange
i → j if j eats i.
• Motifs represent energy flow patterns.
27
http://marinebio.org/oceans/marine-zones/
Application 1. Food webs.
Which motif clusters the food web?
Our approach
• Run motif spectral clustering for all 3-node motifs as well as
for just edges.
• Look at sweep cuts to see which motif gives the best clusters.
28
Application 1. Food webs.
29
Our finding.
Motif M6 organizes the food
web into good clusters
Application 1. Food webs.
Micronutrient
sources
Pelagic fishes
and benthic
prey
Benthic
macro-
invertebrates
Benthic Fishes
Motif M6
reveals aquatic
layers
61% accuracy vs.
48% with edge-
based methods
30
Application 1. Food webs.
• Nodes are groups of genes.
• Edge i → j means i regulates transcription to j.
• Sign + / - denotes activation / suppression.
• Coherent feedforward loops encode biological function.
[Mangan-Zaslaver-Alon 2003; Mangan-Alon 2003; Alon 2007]
31
Application 2. Yeast transcription regulation networks.
Clustering based on coherent
feedforward loops identifies
functions studied individually
by biologists [Mangan+ 2003]
97% accuracy vs.
68–82% with edge-based methods
32
Application 2. Yeast transcription regulation networks.
• North American air
transport network.
• Nodes are cites.
• i → j if you can travel from
i to j in < 8 hours.
[Frey-Dueck 2007]
33
Application 3. Transportation networks.
Important motifs from literature
[Rosvall+ 2014]
Weighted adjacency matrix already reveals hub-like structure.
34
Application 3. Transportation networks.
Top 8
U.S. hubs
East coast non-hubs
West coast non-
hubs
Primary spectral coordinate
Atlanta, the top hub, is
next to Salina, a non-hub.
MOTIF SPECTRAL
EMBEDDING
EDGE SPECTRAL EMBEDDING
35
Application 3. Transportation networks.
36
In localized clustering,
we aim to find a small
set of nodes of low
conductance containing
a particular seed node
without looking at the
entire graph.
Seed
node
Local cluster
A major benefit of our theory is that it easily generalizes to related
problems.
37
Random walk procedure
• With probability a, hop to a random neighbor.
• With probability 1- a, jump back to the seed node.
• Stationary distribution tells us what is “close” to the seed.
If the seed is in a good cluster, the stationary distribution is
localized and we can approximate it in sublinear time using
results of [Andersen-Chung-Lang 2006]. Uses sweep cut on approx.
x.
Background. Localized clustering works in theory and practice
with the (fast) personalized PageRank method.
38
We use our same trick and generalize the theory to
triangles (and other motifs).
Now random walks look like
 With probability a, hop to a random neighbor adjacent triangle and then
to a random endpoint of that triangle (not the one we came from).
 With probability 1- a, go back to the seed node.
Higher-order localized clustering.
[Yin-Benson-Leskovec-Gleich 2017]
39
 Nodes are people at a research institute. Each person is in a department.
 Edges are email correspondence.
 Using each person as a seed, can we recover their department?
Average F1 0.40 0.50
Using the triangle motif in higher-order localized clustering better
captures community structure in email data.
what
why
where
The matrix counts the number
of triangles that edges participate in.
The matrix comes up in our higher-order
network clustering framework when using
triangles as the motif.
We often get better empirical results in
real-world networks and better numerical
properties in model partitioning problems
(stochastic block model).
41
SSBM
m = 200, k = 5,
p = 0.3, q = 0.13
The symmetric stochastic block model
(SSBM)
• k blocks, each of size m-by-m
• within-block edges exist with prob. p
• between-block edges with prob. q
The task. Given a graph that is an SSBM
and given m, k, p, q, find the k blocks.
Theory (lots!). See in-prep. survey
“Community Detection and the Stochastic
Block Model” from E. Abbe.
• Exact recovery (get all correct)
• Detectability (find a non-trivial portion)
• Uses non-backtracking random walks.
The stochastic block model is a standard model used to study
recovery of planted cluster structure.
We are currently looking at the SSBM using our motif-
weighting based on triangles.
42
Based also on Tsourakakis, Pachocki, & Mitzenmacher, WWW,
2017.
→ triangle conductance of a block < edge-conductance of the
block.
We use a mixing parameter
thought of as the expected fraction of neighbors outside of a block
43
From the eyeball test, just using the motif weighting highlights the
blocks better for a range of parameters.
Exp. details
Randomly sample
SSBM(50, 10, p=0.2, q)
for varying q. Plot 2D
histogram (density).
Detectability
Exact recovery
Exp. details.
We take normalized
Laplacian and shift to
reverse the spectrum.
Then we deflate given
knowledge of the
leading eigenvector.
Accuracy is the most
accurate block in the
extremal m (size of
block) entries.
Note: threshold is for
all block recovery.
Accuracy
The power method identifies a cluster better using the motif
weighting than the adjacency.
Most accurate block
Nodes whose value is
greater than the smallest
green node value
Nodes whose value is
smaller than the smallest
green node value.
The power method identifies a cluster better using the motif
weighting than the adjacency.
46
We don’t converge faster for the usual reasons that we would
think of the power method converging faster.
47
There is a gap deeper in the spectrum that could explain what
is going on.
48
semi-circle law (not a Marchenko-Pastur law)
The motif weighting shifts all eigenvalues down, but it pushes the
lowest ones down the most.
49
• Eigenvalues show that we’ll
converge to the “cluster
subspace” faster.
• Conjecture.
Higher accuracy for motifs
because the eigenvectors we
converge to are more
localized—or sharper—around
the clusters.
• But we don’t know why they
are sharper!
We would like a numerical explanation for why we get better
results with motifs.
Top non-trivial
eigenvectors of
normalized
Laplacian fighting
less with motif
weighting?
50
The iterative algorithm for personalized PageRank tends to
localize more.
Nodes whose
value is greater
than the smallest
node value in the
seed’s block.
Rest of the nodes.
51
Papers
• “Higher-order organization of complex networks.”
Benson, Gleich, and Leskovec. Science, 2016.
• “Local higher-order graph clustering.”
Yin, Benson, Leskovec, and Gleich. KDD, 2017.
1. A generalized conductance metric for
motifs.
2. A “new” spectral clustering algorithm to
minimize the generalized conductance.
3. AND an associated motif Cheeger inequality
guarantee.
4. Extensions to localized clustering.
5. Applications with food webs, genetic
regulation networks, transportation systems,
social networks.
6. Eigenvalues with motifs in SSBMs.
Open questions
• What is the distribution
law for the eigenvalues of
the Laplacian of A2 ⊙ A?
• Relationship between
convergence of the power
method and localization?
• How to work with element-
wise prods like matvecs
for
http://cs.cornell.edu/~arb
@austinbenson
arb@cs.cornell.edu
Thanks!
Austin Benson
Code & Data
snap.stanford.edu/higher-order
github.com/arbenson/higher-order-organization-julia
bit.ly/motif-spectral-julia
github.com/dgleich/motif-ssbm
9 10
8
7
2
0
4
3
11
6
5
1

More Related Content

What's hot

μαθηματικα γ΄γυμνασιου θεματα απολυτηριων εξετασεων
μαθηματικα γ΄γυμνασιου θεματα  απολυτηριων εξετασεωνμαθηματικα γ΄γυμνασιου θεματα  απολυτηριων εξετασεων
μαθηματικα γ΄γυμνασιου θεματα απολυτηριων εξετασεων
Μάκης Χατζόπουλος
 
Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...
Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...
Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...
MLconf
 
Yasunori Futamura
Yasunori FutamuraYasunori Futamura
Yasunori Futamura
Suurist
 
Επανάληψη θεωρίας 2016 για την Άλγεβρα της Β Λυκείου
Επανάληψη θεωρίας 2016 για την Άλγεβρα της Β ΛυκείουΕπανάληψη θεωρίας 2016 για την Άλγεβρα της Β Λυκείου
Επανάληψη θεωρίας 2016 για την Άλγεβρα της Β Λυκείου
Μάκης Χατζόπουλος
 
anisotikes
anisotikesanisotikes
anisotikesperi2005
 
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
Brocade
 
Επανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ Λυκείου
Επανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ ΛυκείουΕπανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ Λυκείου
Επανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ Λυκείου
HOME
 
7. lauzta līnija un daudzstūris
7. lauzta līnija un daudzstūris7. lauzta līnija un daudzstūris
7. lauzta līnija un daudzstūrisMaija Liepa
 
Αστρονομία - Εξέλιξη του Σύμπαντος-Μεγάλη Έκρηξη
Αστρονομία  - Εξέλιξη του Σύμπαντος-Μεγάλη ΈκρηξηΑστρονομία  - Εξέλιξη του Σύμπαντος-Μεγάλη Έκρηξη
Αστρονομία - Εξέλιξη του Σύμπαντος-Μεγάλη Έκρηξη
4o Lykeio Alex/polis
 
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
NextMove Software
 
Διαγνωστικό τεστ από την Α΄ στη Β΄ λυκείου
Διαγνωστικό τεστ από την Α΄ στη Β΄ λυκείουΔιαγνωστικό τεστ από την Α΄ στη Β΄ λυκείου
Διαγνωστικό τεστ από την Α΄ στη Β΄ λυκείου
Μάκης Χατζόπουλος
 

What's hot (11)

μαθηματικα γ΄γυμνασιου θεματα απολυτηριων εξετασεων
μαθηματικα γ΄γυμνασιου θεματα  απολυτηριων εξετασεωνμαθηματικα γ΄γυμνασιου θεματα  απολυτηριων εξετασεων
μαθηματικα γ΄γυμνασιου θεματα απολυτηριων εξετασεων
 
Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...
Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...
Tamara G. Kolda, Distinguished Member of Technical Staff, Sandia National Lab...
 
Yasunori Futamura
Yasunori FutamuraYasunori Futamura
Yasunori Futamura
 
Επανάληψη θεωρίας 2016 για την Άλγεβρα της Β Λυκείου
Επανάληψη θεωρίας 2016 για την Άλγεβρα της Β ΛυκείουΕπανάληψη θεωρίας 2016 για την Άλγεβρα της Β Λυκείου
Επανάληψη θεωρίας 2016 για την Άλγεβρα της Β Λυκείου
 
anisotikes
anisotikesanisotikes
anisotikes
 
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
Recent Advances in Machine Learning: Bringing a New Level of Intelligence to ...
 
Επανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ Λυκείου
Επανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ ΛυκείουΕπανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ Λυκείου
Επανάληψη ΄΄ΕΝ ΟΛΙΓΟΙΣ΄΄ Ορμής-Κρούσης Β΄ Λυκείου
 
7. lauzta līnija un daudzstūris
7. lauzta līnija un daudzstūris7. lauzta līnija un daudzstūris
7. lauzta līnija un daudzstūris
 
Αστρονομία - Εξέλιξη του Σύμπαντος-Μεγάλη Έκρηξη
Αστρονομία  - Εξέλιξη του Σύμπαντος-Μεγάλη ΈκρηξηΑστρονομία  - Εξέλιξη του Σύμπαντος-Μεγάλη Έκρηξη
Αστρονομία - Εξέλιξη του Σύμπαντος-Μεγάλη Έκρηξη
 
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
Chemical similarity using multi-terabyte graph databases: 68 billion nodes an...
 
Διαγνωστικό τεστ από την Α΄ στη Β΄ λυκείου
Διαγνωστικό τεστ από την Α΄ στη Β΄ λυκείουΔιαγνωστικό τεστ από την Α΄ στη Β΄ λυκείου
Διαγνωστικό τεστ από την Α΄ στη Β΄ λυκείου
 

Similar to Higher-order spectral graph clustering with motifs

Spectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structuresSpectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structures
David Gleich
 
Higher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIHigher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoI
Austin Benson
 
Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)
Tin180 VietNam
 
Analytic tools for higher-order data
Analytic tools for higher-order dataAnalytic tools for higher-order data
Analytic tools for higher-order data
Austin Benson
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
Austin Benson
 
Higher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsHigher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifs
Austin Benson
 
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Devansh16
 
Higher-order Link Prediction GraphEx
Higher-order Link Prediction GraphExHigher-order Link Prediction GraphEx
Higher-order Link Prediction GraphEx
Austin Benson
 
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
NS-CUK Seminar: H.B.Kim,  Review on "metapath2vec: Scalable representation le...NS-CUK Seminar: H.B.Kim,  Review on "metapath2vec: Scalable representation le...
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
ssuser4b1f48
 
Network Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systemsNetwork Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systems
Ganesh Bagler
 
Simplicial closure and higher-order link prediction (SIAMNS18)
Simplicial closure and higher-order link prediction (SIAMNS18)Simplicial closure and higher-order link prediction (SIAMNS18)
Simplicial closure and higher-order link prediction (SIAMNS18)
Austin Benson
 
O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...
O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...
O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...
csandit
 
Networks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and BeyondNetworks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and Beyond
Mason Porter
 
Presentation2 2000
Presentation2 2000Presentation2 2000
Presentation2 2000suvobgd
 
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphsUse of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
csandit
 
Higher-order graph clustering at AMS Spring Western Sectional
Higher-order graph clustering at AMS Spring Western SectionalHigher-order graph clustering at AMS Spring Western Sectional
Higher-order graph clustering at AMS Spring Western Sectional
Austin Benson
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)CSCJournals
 
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
The Statistical and Applied Mathematical Sciences Institute
 
08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)
dnac
 

Similar to Higher-order spectral graph clustering with motifs (20)

Spectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structuresSpectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structures
 
Higher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoIHigher-order clustering coefficients at Purdue CSoI
Higher-order clustering coefficients at Purdue CSoI
 
Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)
 
Analytic tools for higher-order data
Analytic tools for higher-order dataAnalytic tools for higher-order data
Analytic tools for higher-order data
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
 
Higher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsHigher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifs
 
Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...Paper Explained: Understanding the wiring evolution in differentiable neural ...
Paper Explained: Understanding the wiring evolution in differentiable neural ...
 
Higher-order Link Prediction GraphEx
Higher-order Link Prediction GraphExHigher-order Link Prediction GraphEx
Higher-order Link Prediction GraphEx
 
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
NS-CUK Seminar: H.B.Kim,  Review on "metapath2vec: Scalable representation le...NS-CUK Seminar: H.B.Kim,  Review on "metapath2vec: Scalable representation le...
NS-CUK Seminar: H.B.Kim, Review on "metapath2vec: Scalable representation le...
 
Network Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systemsNetwork Biology: A paradigm for modeling biological complex systems
Network Biology: A paradigm for modeling biological complex systems
 
Simplicial closure and higher-order link prediction (SIAMNS18)
Simplicial closure and higher-order link prediction (SIAMNS18)Simplicial closure and higher-order link prediction (SIAMNS18)
Simplicial closure and higher-order link prediction (SIAMNS18)
 
O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...
O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...
O N T HE D ISTRIBUTION OF T HE M AXIMAL C LIQUE S IZE F OR T HE V ERTICES IN ...
 
Networks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and BeyondNetworks in Space: Granular Force Networks and Beyond
Networks in Space: Granular Force Networks and Beyond
 
Presentation2 2000
Presentation2 2000Presentation2 2000
Presentation2 2000
 
mlss
mlssmlss
mlss
 
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphsUse of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
 
Higher-order graph clustering at AMS Spring Western Sectional
Higher-order graph clustering at AMS Spring Western SectionalHigher-order graph clustering at AMS Spring Western Sectional
Higher-order graph clustering at AMS Spring Western Sectional
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)
 
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
Climate Extremes Workshop - Networks and Extremes: Review and Further Studies...
 
08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)08 Exponential Random Graph Models (ERGM)
08 Exponential Random Graph Models (ERGM)
 

More from Austin Benson

Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)
Austin Benson
 
Spectral embeddings and evolving networks
Spectral embeddings and evolving networksSpectral embeddings and evolving networks
Spectral embeddings and evolving networks
Austin Benson
 
Computational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data AnalysisComputational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data Analysis
Austin Benson
 
Higher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modelingHigher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modeling
Austin Benson
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
Austin Benson
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
Austin Benson
 
Higher-order link prediction
Higher-order link predictionHigher-order link prediction
Higher-order link prediction
Austin Benson
 
Three hypergraph eigenvector centralities
Three hypergraph eigenvector centralitiesThree hypergraph eigenvector centralities
Three hypergraph eigenvector centralities
Austin Benson
 
Semi-supervised learning of edge flows
Semi-supervised learning of edge flowsSemi-supervised learning of edge flows
Semi-supervised learning of edge flows
Austin Benson
 
Choosing to grow a graph
Choosing to grow a graphChoosing to grow a graph
Choosing to grow a graph
Austin Benson
 
Link prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structureLink prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structure
Austin Benson
 
Higher-order Link Prediction Syracuse
Higher-order Link Prediction SyracuseHigher-order Link Prediction Syracuse
Higher-order Link Prediction Syracuse
Austin Benson
 
Random spatial network models for core-periphery structure
Random spatial network models for core-periphery structureRandom spatial network models for core-periphery structure
Random spatial network models for core-periphery structure
Austin Benson
 
Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.
Austin Benson
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
Austin Benson
 
Simplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusionsSimplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusions
Austin Benson
 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
Austin Benson
 
Set prediction three ways
Set prediction three waysSet prediction three ways
Set prediction three ways
Austin Benson
 
Sequences of Sets KDD '18
Sequences of Sets KDD '18Sequences of Sets KDD '18
Sequences of Sets KDD '18
Austin Benson
 
Simplicial closure and higher-order link prediction --- SIAMNS18
Simplicial closure and higher-order link prediction --- SIAMNS18Simplicial closure and higher-order link prediction --- SIAMNS18
Simplicial closure and higher-order link prediction --- SIAMNS18
Austin Benson
 

More from Austin Benson (20)

Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)
 
Spectral embeddings and evolving networks
Spectral embeddings and evolving networksSpectral embeddings and evolving networks
Spectral embeddings and evolving networks
 
Computational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data AnalysisComputational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data Analysis
 
Higher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modelingHigher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modeling
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
 
Higher-order link prediction
Higher-order link predictionHigher-order link prediction
Higher-order link prediction
 
Three hypergraph eigenvector centralities
Three hypergraph eigenvector centralitiesThree hypergraph eigenvector centralities
Three hypergraph eigenvector centralities
 
Semi-supervised learning of edge flows
Semi-supervised learning of edge flowsSemi-supervised learning of edge flows
Semi-supervised learning of edge flows
 
Choosing to grow a graph
Choosing to grow a graphChoosing to grow a graph
Choosing to grow a graph
 
Link prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structureLink prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structure
 
Higher-order Link Prediction Syracuse
Higher-order Link Prediction SyracuseHigher-order Link Prediction Syracuse
Higher-order Link Prediction Syracuse
 
Random spatial network models for core-periphery structure
Random spatial network models for core-periphery structureRandom spatial network models for core-periphery structure
Random spatial network models for core-periphery structure
 
Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
 
Simplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusionsSimplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusions
 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
 
Set prediction three ways
Set prediction three waysSet prediction three ways
Set prediction three ways
 
Sequences of Sets KDD '18
Sequences of Sets KDD '18Sequences of Sets KDD '18
Sequences of Sets KDD '18
 
Simplicial closure and higher-order link prediction --- SIAMNS18
Simplicial closure and higher-order link prediction --- SIAMNS18Simplicial closure and higher-order link prediction --- SIAMNS18
Simplicial closure and higher-order link prediction --- SIAMNS18
 

Recently uploaded

一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
slg6lamcq
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
axoqas
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
u86oixdj
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Subhajit Sahu
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
AnirbanRoy608946
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
74nqk8xf
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 

Recently uploaded (20)

一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
一比一原版(UniSA毕业证书)南澳大学毕业证如何办理
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
哪里卖(usq毕业证书)南昆士兰大学毕业证研究生文凭证书托福证书原版一模一样
 
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
原版制作(Deakin毕业证书)迪肯大学毕业证学位证一模一样
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptxData_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
Data_and_Analytics_Essentials_Architect_an_Analytics_Platform.pptx
 
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
一比一原版(Coventry毕业证书)考文垂大学毕业证如何办理
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 

Higher-order spectral graph clustering with motifs

  • 1. Higher-order spectral graph clustering with motifs Austin R. Benson Cornell SCAN seminar September 18, 2017 Joint work with David Gleich (Purdue), Jure Leskovec (Stanford), & Hao Yin (Stanford)
  • 2. 2 Brains nodes are neurons edges are synapses Social networks nodes are people edges are friendships Electrical grid nodes are power plants edges are transmission linesTim Meko, Washington Post Currency nodes are accounts edges are transactions Background. Networks are sets of nodes and edges (graphs) that model real-world systems.
  • 3. 3 G = (V, E) A To study relationships in the network • centrality • reachability • clustering • and more… we often use matrix computations Just like most other things in life, I often think about networks as matrices.
  • 4. Given an undirected graph G = (V, E) Instead of using its symmetric adjacency matrix A consider using the weighted matrix Hadamard / element-wise
  • 5. Given a directed graph G = (V, E) Instead of using its non-symmetric adjacency matrix A = B + U consider using a weighted matrix
  • 6. what why where The matrix counts the number of triangles that edges participate in. The matrix comes up in our higher-order network clustering framework when using triangles as the motif. We often get better empirical results in real-world networks and better numerical properties in model partitioning problems (stochastic block model).
  • 7. 7 Key insight [Flake 2000; Newman 2004, 2006; many others…]. Networks for real-world systems have modules, clusters, communities. • We want algorithms to uncover the clusters automatically. • Main idea has been to optimize metrics involving the number of nodes and edges in a cluster. Conductance, modularity, density, ratio cut, … Co-author network Brain network, de Reus et al., RSTB, 2014. Background. Networks are sets of nodes and edges (graphs) that model real-world systems.
  • 8. 8 Key insight [Milo+02]. Networks modelling real-world systems contain certain small subgraphs patterns way more frequently than expected. [Mangan+ 2003; Alon 2007] Signed feed-forward loops in genetic transcription. Triangles in social relationships. [Simmel 1908; Rapoport 1953; Granovetter 1973] Bi-directed length-2 paths in brain networks. [Sporns-Kötter 2004; Sporns+ 2007; Honey+ 2007] We call these small subgraph patterns motifs. Background. Networks are sets of nodes and edges (graphs) that model real-world systems.
  • 9. 9 Motifs are the fundamental units of complex networks. We should design our clustering algorithms around motifs.
  • 10. Network Motif Different motifs give different clusters. 10 Higher-order graph clustering is our technique for finding coherent groups of nodes based on motifs.
  • 11. Conductance is one of the most important cluster quality scores [Schaeffer 2007] used in Markov chain theory, spectral clustering, bioinformatics, vision, etc. The conductance of a set of vertices S is the ratio of edges leaving to total edges small conductance  good cluster (edges leaving S) (edge end points in S) 11 S S Background. Graph clustering and conductance.
  • 12. Cheeger Inequality Finding the best conductance set is NP-hard.  • Cheeger realized the eigenvalues of the Laplacian provided surface area to volume bounds in manifolds. • Alon and Milman independently realized the same thing for a graph (conductance)! Laplacian 12 [Cheeger 1970; Alon-Milman 1985] Background. Spectral clustering has theoretical guarantees.
  • 13. We can find a set S that achieves the Cheeger bound. 1. Compute the eigenvector z associated with λ2 and scale to f = D-1/2z 2. Sort the vertices by their values in f: σ1, σ2, …, σn 3. Let Sr = {σ1, …, σr} and compute the conductance of φ(Sr) of each Sr. 4. Pick the set Sm with minimum conductance. 13 [Mihail 1989; Chung 1992] 0 20 40 0 0.2 0.4 0.6 0.8 1 Si fi Background. The sweep cut realizes the guarantee.
  • 15. 15 Signed feed-forward loops in genetic transcription [Mangan+03] • Gene X activates transcription in gene Y. • Gene X suppresses transcription in gene Z. • Gene Y suppresses transcription in gene Z.X Y Z Spectral clustering is theoretically justified for undirected graphs • Various extensions to multiple clusters [Ng-Jordan-Weiss 2002; Dhillon-Guan-Kulis 2004; Lee-Gharan-Trevisan 2014] • Weighted graphs are okay (weighted cut, weighted volume) • Approximate eigenvectors are okay [Mihail89; Fairbanks+ 2017] Current network models are more richly annotated • directed, signed, colored, layered, multiplex, higher-order, etc. Modern datasets are much more rich than the simple networks for which spectral clustering is justified.
  • 16. [Benson-Gleich-Leskovec 2016; Yin-Benson-Leskovec-Gleich 2017] 16 • A generalized conductance metric for motifs. • A “new” spectral clustering algorithm to minimize the generalized conductance. • AND an associated motif Cheeger inequality guarantee. • Several applications Aquatic layers in food webs, functional groups in genetic regulation Hub structure in transportation. • Extensions to localized clustering. Our contributions. New and preliminary! Studies on stochastic block models.
  • 17. Need new notions of cut and volume… 17 M = triangle motif Motif conductance.
  • 18. Motif M 18 9 10 3 5 6 7 8 1 2 4 9 10 3 5 6 7 8 1 2 4 There is a symmetric matrix that is appropriate for studying motif conductance—even if the network and motif are directed.
  • 19. 1 11 1 1 1 3 1 1 1 1 1 2 1 1 19 9 10 3 5 6 7 8 1 2 4 motif M graph G weighted graph W(M) bidirectional edges unidirectional links There is a symmetric matrix that is appropriate for studying motif conductance—even if the network and motif are directed.
  • 20. 20 Key insight. [Benson-Gleich-Leskovec 2016] Classical spectral clustering on weighted graph W(M) finds clusters of low motif conductance. Using matrix tools for higher-order clustering. 1 11 1 1 1 3 1 1 1 1 1 2 1 1 weighted graph W(M)
  • 21. 1. Pick your favorite motif. 2. Form weighted matrix W(M). 3. Compute the Fiedler eigenvector f(M) associated with λ2 of the normalized Laplacian matrix of W(M). 4. Run a sweep on f(M) 21 f(M) Higher-order spectral clustering.
  • 22. 22 (there are faster algorithms to compute these matrices, but this is a useful jumping off point.) There are nice matrix computations for 3-node motifs.
  • 23. bidirection al unidirection al 23 There are nice matrix computations for 3-node motifs.
  • 24. Theorem. If the motif has three nodes, then the sweep procedure on the weighted graph finds a set S of nodes for which 24 Key Proof Step Implication. Just run spectral clustering with the weighted matrix coming from your favorite motif. The three-node motif Cheeger inequality.
  • 25. 25 Awesome advantages! • Works for arbitrary non-negative combos of motifs and weighted motifs, too. • We inherit 40+ years of research! • Fast algorithms (ARPACK, etc.) • Local methods! [Yin-Benson-Leskovec-Gleich 2017] • Overlapping! via [Whang-Dillon-Gleich 2015] • Easy to implement  • Scalable (1.4B edges graphs are not a problem) bit.ly/motif-spectral-julia
  • 26. 26 1. We do not know the motif of interest. food webs and new applications 2. We know the motif of interest from domain knowledge. yeast transcription regulation networks, connectome, social networks 3. We seek richer information from our data. transportation networks and new applications Lots of fun applications.
  • 27. Florida bay food web • Nodes are species. • Edges represent carbon exchange i → j if j eats i. • Motifs represent energy flow patterns. 27 http://marinebio.org/oceans/marine-zones/ Application 1. Food webs.
  • 28. Which motif clusters the food web? Our approach • Run motif spectral clustering for all 3-node motifs as well as for just edges. • Look at sweep cuts to see which motif gives the best clusters. 28 Application 1. Food webs.
  • 29. 29 Our finding. Motif M6 organizes the food web into good clusters Application 1. Food webs.
  • 30. Micronutrient sources Pelagic fishes and benthic prey Benthic macro- invertebrates Benthic Fishes Motif M6 reveals aquatic layers 61% accuracy vs. 48% with edge- based methods 30 Application 1. Food webs.
  • 31. • Nodes are groups of genes. • Edge i → j means i regulates transcription to j. • Sign + / - denotes activation / suppression. • Coherent feedforward loops encode biological function. [Mangan-Zaslaver-Alon 2003; Mangan-Alon 2003; Alon 2007] 31 Application 2. Yeast transcription regulation networks.
  • 32. Clustering based on coherent feedforward loops identifies functions studied individually by biologists [Mangan+ 2003] 97% accuracy vs. 68–82% with edge-based methods 32 Application 2. Yeast transcription regulation networks.
  • 33. • North American air transport network. • Nodes are cites. • i → j if you can travel from i to j in < 8 hours. [Frey-Dueck 2007] 33 Application 3. Transportation networks.
  • 34. Important motifs from literature [Rosvall+ 2014] Weighted adjacency matrix already reveals hub-like structure. 34 Application 3. Transportation networks.
  • 35. Top 8 U.S. hubs East coast non-hubs West coast non- hubs Primary spectral coordinate Atlanta, the top hub, is next to Salina, a non-hub. MOTIF SPECTRAL EMBEDDING EDGE SPECTRAL EMBEDDING 35 Application 3. Transportation networks.
  • 36. 36 In localized clustering, we aim to find a small set of nodes of low conductance containing a particular seed node without looking at the entire graph. Seed node Local cluster A major benefit of our theory is that it easily generalizes to related problems.
  • 37. 37 Random walk procedure • With probability a, hop to a random neighbor. • With probability 1- a, jump back to the seed node. • Stationary distribution tells us what is “close” to the seed. If the seed is in a good cluster, the stationary distribution is localized and we can approximate it in sublinear time using results of [Andersen-Chung-Lang 2006]. Uses sweep cut on approx. x. Background. Localized clustering works in theory and practice with the (fast) personalized PageRank method.
  • 38. 38 We use our same trick and generalize the theory to triangles (and other motifs). Now random walks look like  With probability a, hop to a random neighbor adjacent triangle and then to a random endpoint of that triangle (not the one we came from).  With probability 1- a, go back to the seed node. Higher-order localized clustering. [Yin-Benson-Leskovec-Gleich 2017]
  • 39. 39  Nodes are people at a research institute. Each person is in a department.  Edges are email correspondence.  Using each person as a seed, can we recover their department? Average F1 0.40 0.50 Using the triangle motif in higher-order localized clustering better captures community structure in email data.
  • 40. what why where The matrix counts the number of triangles that edges participate in. The matrix comes up in our higher-order network clustering framework when using triangles as the motif. We often get better empirical results in real-world networks and better numerical properties in model partitioning problems (stochastic block model).
  • 41. 41 SSBM m = 200, k = 5, p = 0.3, q = 0.13 The symmetric stochastic block model (SSBM) • k blocks, each of size m-by-m • within-block edges exist with prob. p • between-block edges with prob. q The task. Given a graph that is an SSBM and given m, k, p, q, find the k blocks. Theory (lots!). See in-prep. survey “Community Detection and the Stochastic Block Model” from E. Abbe. • Exact recovery (get all correct) • Detectability (find a non-trivial portion) • Uses non-backtracking random walks. The stochastic block model is a standard model used to study recovery of planted cluster structure.
  • 42. We are currently looking at the SSBM using our motif- weighting based on triangles. 42 Based also on Tsourakakis, Pachocki, & Mitzenmacher, WWW, 2017. → triangle conductance of a block < edge-conductance of the block.
  • 43. We use a mixing parameter thought of as the expected fraction of neighbors outside of a block 43 From the eyeball test, just using the motif weighting highlights the blocks better for a range of parameters. Exp. details Randomly sample SSBM(50, 10, p=0.2, q) for varying q. Plot 2D histogram (density).
  • 44. Detectability Exact recovery Exp. details. We take normalized Laplacian and shift to reverse the spectrum. Then we deflate given knowledge of the leading eigenvector. Accuracy is the most accurate block in the extremal m (size of block) entries. Note: threshold is for all block recovery. Accuracy The power method identifies a cluster better using the motif weighting than the adjacency.
  • 45. Most accurate block Nodes whose value is greater than the smallest green node value Nodes whose value is smaller than the smallest green node value. The power method identifies a cluster better using the motif weighting than the adjacency.
  • 46. 46 We don’t converge faster for the usual reasons that we would think of the power method converging faster.
  • 47. 47 There is a gap deeper in the spectrum that could explain what is going on.
  • 48. 48 semi-circle law (not a Marchenko-Pastur law) The motif weighting shifts all eigenvalues down, but it pushes the lowest ones down the most.
  • 49. 49 • Eigenvalues show that we’ll converge to the “cluster subspace” faster. • Conjecture. Higher accuracy for motifs because the eigenvectors we converge to are more localized—or sharper—around the clusters. • But we don’t know why they are sharper! We would like a numerical explanation for why we get better results with motifs. Top non-trivial eigenvectors of normalized Laplacian fighting less with motif weighting?
  • 50. 50 The iterative algorithm for personalized PageRank tends to localize more. Nodes whose value is greater than the smallest node value in the seed’s block. Rest of the nodes.
  • 51. 51 Papers • “Higher-order organization of complex networks.” Benson, Gleich, and Leskovec. Science, 2016. • “Local higher-order graph clustering.” Yin, Benson, Leskovec, and Gleich. KDD, 2017. 1. A generalized conductance metric for motifs. 2. A “new” spectral clustering algorithm to minimize the generalized conductance. 3. AND an associated motif Cheeger inequality guarantee. 4. Extensions to localized clustering. 5. Applications with food webs, genetic regulation networks, transportation systems, social networks. 6. Eigenvalues with motifs in SSBMs. Open questions • What is the distribution law for the eigenvalues of the Laplacian of A2 ⊙ A? • Relationship between convergence of the power method and localization? • How to work with element- wise prods like matvecs for http://cs.cornell.edu/~arb @austinbenson arb@cs.cornell.edu Thanks! Austin Benson Code & Data snap.stanford.edu/higher-order github.com/arbenson/higher-order-organization-julia bit.ly/motif-spectral-julia github.com/dgleich/motif-ssbm 9 10 8 7 2 0 4 3 11 6 5 1

Editor's Notes

  1. Title: medium 30 Text: light 26 Subtext: light 22 References: thin 22 Annotation: thin 20 blue
  2. What is this weighted matrix? Why do I should look at it? How do you do this?
  3. \phi(S) =  \frac{ \text{cut}(S) }{ \raisebox{-0.1cm}{ min($\text{vol}(S)$, $\text{vol}(S)$) } }
  4. \phi(S) =  \frac{ \text{cut}(S) }{ \raisebox{-0.1cm}{ min($\text{vol}(S)$, $\text{vol}(S)$) } } text{vol}_M(S) = \#(\text{motif end points in $S$}) \text{cut}_M(S) = \#(\text{motifs cut}) \text{cut}(S) = \#(\text{edges cut}) \text{vol}(S) = \#(\text{edge end points in $S$})
  5. $\phi(S) = \frac{\text{cut}(S)}{\min(\text{vol}(S), \text{vol}(\bar{S}))}$
  6. D &= \text{diag}(W^{(M)}e) \\ \mathcal{L}^{(M)} &= D^{-1/2}(D - W^{(M)})D^{-1/2} \\ \mathcal{L}^{(M)}z &= \lambda_2 z \\ f^{(M)} &= D^{-1/2}z
  7. $\phi(S) = \frac{\text{cut}(S)}{\min(\text{vol}(S), \text{vol}(\bar{S}))}$
  8. $\phi(S) = \frac{\text{cut}(S)}{\min(\text{vol}(S), \text{vol}(\bar{S}))}$
  9. $\phi(S) = \frac{\text{cut}(S)}{\min(\text{vol}(S), \text{vol}(\bar{S}))}$
  10. $\phi(S) = \frac{\text{cut}(S)}{\min(\text{vol}(S), \text{vol}(\bar{S}))}$
  11. What is this weighted matrix? Why do I should look at it? How do you do this?
  12. Mu = 0.41 in this Y-axis is value of Bottom plot is sorted vector