SlideShare a Scribd company logo
1 of 76
Download to read offline
Introduction to Network
Introduction to Network
Theory
Theory
What is a Network?
What is a Network?

 Network = graph
Network = graph

 Informally a
Informally a graph
graph is a set of nodes joined by a set of lines or
is a set of nodes joined by a set of lines or
arrows.
arrows.
1 1
2 3
4 4
5 5
6 6
2 3
Graph-based representations
 Representing a problem as a graph can
provide a different point of view
 Representing a problem as a graph can
make a problem much simpler
 More accurately, it can provide the
appropriate tools for solving the problem
What is network theory?
 Network theory provides a set of
techniques for analysing graphs
 Complex systems network theory provides
techniques for analysing structure in a
system of interacting agents, represented
as a network
 Applying network theory to a system
means using a graph-theoretic
representation
What makes a problem graph-like?
 There are two components to a graph
 Nodes and edges
 In graph-like problems, these components
have natural correspondences to problem
elements
 Entities are nodes and interactions between
entities are edges
 Most complex systems are graph-like
Friendship Network
Scientific collaboration network
Business ties in US biotech-
industry
Genetic interaction network
Protein-Protein Interaction
Networks
Transportation Networks
Internet
Ecological Networks
Graph Theory - History
Graph Theory - History
Leonhard
Leonhard Euler's paper on
Euler's paper on “
“Seven
Seven
Bridges of
Bridges of Königsberg
Königsberg”
” ,
,
published in 1736.
published in 1736.
Graph Theory - History
Graph Theory - History
Cycles in Polyhedra
Thomas P. Kirkman William R. Hamilton
Hamiltonian cycles in Platonic graphs
Graph Theory - History
Graph Theory - History
Gustav Kirchhoff
Trees in Electric Circuits
Graph Theory - History
Graph Theory - History
Arthur Cayley James J. Sylvester George Polya
Enumeration of Chemical Isomers
Graph Theory - History
Graph Theory - History
Francis Guthrie Auguste DeMorgan
Four Colors of Maps
Definition: Graph
Definition: Graph

 G is an ordered triple G:=(V, E, f)
G is an ordered triple G:=(V, E, f)

 V is a set of nodes, points, or vertices.
V is a set of nodes, points, or vertices.

 E is a set, whose elements are known as edges or lines.
E is a set, whose elements are known as edges or lines.

 f is a function
f is a function

 maps each element of E
maps each element of E

 to an unordered pair of vertices in V.
to an unordered pair of vertices in V.
Definitions
Definitions

 Vertex
Vertex

 Basic Element
Basic Element

 Drawn as a
Drawn as a node
node or a
or a dot
dot.
.

 V
Vertex set
ertex set of
of G
G is usually denoted by
is usually denoted by V
V(
(G
G), or
), or V
V

 Edge
Edge

 A set of two elements
A set of two elements

 Drawn as a line connecting two vertices, called end vertices, or
Drawn as a line connecting two vertices, called end vertices, or
endpoints.
endpoints.

 The edge set of G is usually denoted by E(G), or E.
The edge set of G is usually denoted by E(G), or E.
Example

 V:={1,2,3,4,5,6}
V:={1,2,3,4,5,6}

 E:={{1,2},{1,5},{2,3},{2,5},{3,4},{4,5},{4,6}}
E:={{1,2},{1,5},{2,3},{2,5},{3,4},{4,5},{4,6}}
Simple Graphs
Simple graphs
Simple graphs are graphs without multiple edges or self-loops.
are graphs without multiple edges or self-loops.
Directed Graph (digraph)
Directed Graph (digraph)

 Edges have directions
Edges have directions

 An edge is an
An edge is an ordered
ordered pair of nodes
pair of nodes
loop
node
multiple arc
arc
Weighted graphs
1 2 3
4 5 6
.5
1.2
.2
.5
1.5
.3
1
4 5 6
2 3
2
1
3
5

 is a graph for which each edge has an associated
is a graph for which each edge has an associated weight
weight, usually
, usually
given by a
given by a weight function
weight function w: E
w: E →
→ R
R.
.
Structures and structural
metrics
 Graph structures are used to isolate
interesting or important sections of a
graph
 Structural metrics provide a measurement
of a structural property of a graph
 Global metrics refer to a whole graph
 Local metrics refer to a single node in a graph
Graph structures
 Identify interesting sections of a graph
 Interesting because they form a significant
domain-specific structure, or because they
significantly contribute to graph properties
 A subset of the nodes and edges in a
graph that possess certain characteristics,
or relate to each other in particular ways
Connectivity

 a graph is
a graph is connected
connected if
if

 you can get from any node to any other by following a sequence of edges
you can get from any node to any other by following a sequence of edges
OR
OR

 any two nodes are connected by a path.
any two nodes are connected by a path.

 A directed graph is
A directed graph is strongly connected
strongly connected if there is a directed path from
if there is a directed path from
any node to any other node.
any node to any other node.
Component
Component

 Every disconnected graph can be split up into a number of
Every disconnected graph can be split up into a number of
connected
connected components
components.
.
Degree
Degree

 Number of edges incident on a node
Number of edges incident on a node
The degree of 5 is 3
Degree (Directed Graphs)
Degree (Directed Graphs)

 In-degree: Number of edges entering
In-degree: Number of edges entering

 Out-degree: Number of edges leaving
Out-degree: Number of edges leaving

 Degree =
Degree = indeg
indeg +
+ outdeg
outdeg
outdeg(1)=2
indeg(1)=0
outdeg(2)=2
indeg(2)=2
outdeg(3)=1
indeg(3)=4
Degree: Simple Facts

 If
If G
G is a graph with
is a graph with m
m edges, then
edges, then
Σ
Σ deg(
deg(v
v) = 2
) = 2m
m = 2 |
= 2 |E
E |
|

 If
If G
G is a digraph then
is a digraph then
Σ
Σ indeg
indeg(
(v
v)=
)=Σ
Σ outdeg
outdeg(
(v
v)
) =
= |
|E
E |
|

 Number of Odd degree Nodes is even
Number of Odd degree Nodes is even
Walks
A walk of length k in a graph is a succession of k
(not necessarily different) edges of the form
uv,vw,wx,…,yz.
This walk is denote by uvwx…xz, and is referred to
as a walk between u and z.
A walk is closed is u=z.
Path
Path

 A
A path
path is a walk in which all the edges and all the nodes are different.
is a walk in which all the edges and all the nodes are different.
Walks and Paths
1,2,5,2,3,4 1,2,5,2,3,2,1 1,2,3,4,6
walk of length 5 CW of length 6 path of length 4
Cycle

 A
A cycle
cycle is a closed path in which all the edges are different.
is a closed path in which all the edges are different.
1,2,5,1 2,3,4,5,2
3-cycle 4-cycle
Special Types of Graphs

 Empty Graph / Edgeless graph
Empty Graph / Edgeless graph

 No edge
No edge

 Null graph
Null graph

 No nodes
No nodes

 Obviously no edge
Obviously no edge
Trees
Trees

 Connected Acyclic Graph
Connected Acyclic Graph

 Two nodes have
Two nodes have exactly
exactly one path
one path
between them
between them
Special Trees
Special Trees
Paths
Stars
Connected Graph
All nodes have the same
degree
Regular
Special Regular Graphs: Cycles
C3 C4 C5
Bipartite
Bipartite graph
graph

 V
V can be partitioned into 2 sets
can be partitioned into 2 sets V
V1
1
and
and V
V2
2
such that (
such that (u
u,
,v
v)
)∈
∈E
E implies
implies

 either
either u
u ∈
∈V
V1
1 and
and v
v ∈
∈V
V2
2

 OR
OR v
v ∈
∈V
V1
1 and
and u
u∈
∈V
V2.
2.
Complete Graph
Complete Graph

 Every pair of vertices are adjacent
Every pair of vertices are adjacent

 Has n(n-1)/2 edges
Has n(n-1)/2 edges
Complete Bipartite Graph
Complete Bipartite Graph

 Bipartite Variation of Complete Graph
Bipartite Variation of Complete Graph

 Every node of one set is connected to every other node on the
Every node of one set is connected to every other node on the
other set
other set
Stars
Planar Graphs
Planar Graphs

 Can be drawn on a plane such that no two edges intersect
Can be drawn on a plane such that no two edges intersect

 K
K4
4 is the largest complete graph that is planar
is the largest complete graph that is planar
Subgraph
Subgraph

 Vertex and edge sets are subsets of those of G
Vertex and edge sets are subsets of those of G

 a
a supergraph
supergraph of a graph G is a graph that contains G as a
of a graph G is a graph that contains G as a
subgraph
subgraph.
.
Special
Special Subgraphs
Subgraphs: Cliques
: Cliques
A clique is a maximum complete
connected subgraph.
.
A B
D
H
F
E
C
I
G
Spanning
Spanning subgraph
subgraph

 Subgraph
Subgraph H has the same vertex set as G.
H has the same vertex set as G.

 Possibly not all the edges
Possibly not all the edges

 “
“H spans G
H spans G”
”.
.
Spanning tree
Spanning tree

 Let G be a connected graph. Then a
Let G be a connected graph. Then a
spanning tree
spanning tree in G is a
in G is a subgraph
subgraph of G
of G
that includes every node and is also a
that includes every node and is also a
tree.
tree.
Isomorphism
Isomorphism

 Bijection
Bijection, i.e., a one-to-one mapping:
, i.e., a one-to-one mapping:
f : V(G) -> V(H)
f : V(G) -> V(H)
u and v from G are adjacent if and only if f(u) and f(v) are
u and v from G are adjacent if and only if f(u) and f(v) are
adjacent in H.
adjacent in H.

 If an isomorphism can be constructed between two graphs, then
If an isomorphism can be constructed between two graphs, then
we say those graphs are
we say those graphs are isomorphic
isomorphic.
.
Isomorphism Problem
Isomorphism Problem

 Determining whether two graphs are
Determining whether two graphs are
isomorphic
isomorphic

 Although these graphs look very different,
Although these graphs look very different,
they are isomorphic; one isomorphism
they are isomorphic; one isomorphism
between them is
between them is
f(a)=1 f(b)=6 f(c)=8 f(d)=3
f(a)=1 f(b)=6 f(c)=8 f(d)=3
f(g)=5 f(h)=2 f(i)=4 f(j)=7
f(g)=5 f(h)=2 f(i)=4 f(j)=7
Representation (Matrix)
Representation (Matrix)

 Incidence Matrix
Incidence Matrix

 V x E
V x E

 [vertex, edges] contains the edge's data
[vertex, edges] contains the edge's data

 Adjacency Matrix
Adjacency Matrix

 V x V
V x V

 Boolean values (adjacent or not)
Boolean values (adjacent or not)

 Or Edge Weights
Or Edge Weights
Matrices
Matrices
1
0
0
0
0
0
0
6
0
1
0
1
0
1
0
5
1
1
1
0
0
0
0
4
0
0
1
0
1
0
0
3
0
0
0
1
1
0
1
2
0
0
0
0
0
1
1
1
6
,
4
5
,
4
4
,
3
5
,
2
3
,
2
5
,
1
2
,
1
0
0
1
0
0
0
6
0
0
1
0
1
1
5
1
1
0
1
0
0
4
0
0
1
0
1
0
3
0
1
0
1
0
1
2
0
1
0
0
1
0
1
6
5
4
3
2
1
Representation (List)
Representation (List)

 Edge List
Edge List

 pairs (ordered if directed) of vertices
pairs (ordered if directed) of vertices

 Optionally weight and other data
Optionally weight and other data

 Adjacency List (node list)
Adjacency List (node list)
Implementation of a Graph.
Implementation of a Graph.

 Adjacency-list representation
Adjacency-list representation

 an array of |
an array of |V
V | lists, one for each vertex in
| lists, one for each vertex in V
V.
.

 For each
For each u
u ∈
∈ V
V ,
, ADJ
ADJ [
[ u
u ] points to all its adjacent vertices.
] points to all its adjacent vertices.
Edge and Node Lists
Edge and Node Lists
Edge List
1 2
1 2
2 3
2 5
3 3
4 3
4 5
5 3
5 4
Node List
1 2 2
2 3 5
3 3
4 3 5
5 3 4
Edge List
1 2 1.2
2 4 0.2
4 5 0.3
4 1 0.5
5 4 0.5
6 3 1.5
Edge Lists for Weighted
Edge Lists for Weighted
Graphs
Graphs
Topological Distance
A shortest path is the minimum path
A shortest path is the minimum path
connecting two nodes.
connecting two nodes.
The number of edges in the shortest path
The number of edges in the shortest path
connecting
connecting p
p and
and q
q is the
is the topological
topological
distance
distance between these two nodes,
between these two nodes, d
dp
p,q
,q
|
|V
V | x |
| x |V |
V | matrix D
matrix D = (
= ( d
dij
ij )
) such that
such that
d
dij
ij is the topological distance between
is the topological distance between i
i and
and j
j.
.
0
2
1
2
3
3
6
2
0
1
2
1
1
5
1
1
0
1
2
2
4
2
2
1
0
1
2
3
3
1
2
1
0
1
2
3
1
2
2
1
0
1
6
5
4
3
2
1
Distance Matrix
Distance Matrix
Random Graphs
N
N nodes
nodes
A pair of nodes has probability
A pair of nodes has probability p
p of
of
being connected.
being connected.
Average degree,
Average degree, k
k ≈
≈ pN
pN
What interesting things can be said for
What interesting things can be said for
different values of p or k ?
different values of p or k ?
(that are true as N
(that are true as N 
 ∞
∞)
)
Erdős and Renyi (1959)
p = 0.0 ; k = 0
N = 12
p = 0.09 ; k = 1
p = 1.0 ; k ≈ ½N2
Random Graphs
Erdős and Renyi (1959)
p = 0.0 ; k = 0
p = 0.09 ; k = 1
p = 1.0 ; k ≈ ½N2
p = 0.045 ; k = 0.5
Let’s look at…
Size of the largest connected cluster
Diameter (maximum path length between nodes) of the largest cluster
Average path length between nodes (if a path exists)
Random Graphs
Erdős and Renyi (1959)
p = 0.0 ; k = 0 p = 0.09 ; k = 1 p = 1.0 ; k ≈ ½N2
p = 0.045 ; k = 0.5
Size of largest component
Diameter of largest component
Average path length between nodes
1 5 11 12
0 4 7 1
0.0 2.0 4.2 1.0
Random Graphs
If
If k
k < 1:
< 1:

 small, isolated clusters
small, isolated clusters

 small diameters
small diameters

 short path lengths
short path lengths
At k = 1:
At k = 1:

 a
a giant component
giant component appears
appears

 diameter peaks
diameter peaks

 path lengths are high
path lengths are high
For k > 1:
For k > 1:

 almost all nodes connected
almost all nodes connected

 diameter shrinks
diameter shrinks

 path lengths shorten
path lengths shorten
Erdős and Renyi (1959)
Percentage
of
nodes
in
largest
component
Diameter
of
largest
component
(not
to
scale)
1.0
0
k
1.0
phase transition
Random Graphs
What does this mean?
What does this mean?

 If connections between people can be modeled as a random graph, then
If connections between people can be modeled as a random graph, then…
…

 Because the average person easily knows more than one person (k >> 1),
Because the average person easily knows more than one person (k >> 1),

 We live in a
We live in a “
“small world
small world”
” where within a few links, we are connected to anyone in the world.
where within a few links, we are connected to anyone in the world.

 Erd
Erdő
ős
s and
and Renyi
Renyi showed that average
showed that average
path length between connected nodes is
path length between connected nodes is
Erdős and Renyi (1959)
David
Mumford Peter
Belhumeur
Kentaro
Toyama
Fan
Chung
Random Graphs
What does this mean?
What does this mean?

 If connections between people can be modeled as a random graph, then
If connections between people can be modeled as a random graph, then…
…

 Because the average person easily knows more than one person (k >> 1),
Because the average person easily knows more than one person (k >> 1),

 We live in a
We live in a “
“small world
small world”
” where within a few links, we are connected to anyone in the world.
where within a few links, we are connected to anyone in the world.

 Erd
Erdő
ős
s and
and Renyi
Renyi computed average
computed average
path length between connected nodes to be:
path length between connected nodes to be:
Erdős and Renyi (1959)
David
Mumford Peter
Belhumeur
Kentaro
Toyama
Fan
Chung
BIG “IF”!!!
The Alpha Model
The people you know aren
The people you know arenʼ
ʼt randomly chosen.
t randomly chosen.
People tend to get to know those who are two
People tend to get to know those who are two
links away (
links away (Rapoport
Rapoport *
*, 1957).
, 1957).
The real world exhibits a lot of
The real world exhibits a lot of clustering.
clustering.
Watts (1999)
* Same Anatol Rapoport, known for TIT FOR TAT!
The Personal Map
by MSR Redmond’s Social Computing Group
The Alpha Model
Watts (1999)
α
α model: Add edges to nodes, as in random
model: Add edges to nodes, as in random
graphs, but makes links more likely when
graphs, but makes links more likely when
two nodes have a common friend.
two nodes have a common friend.
For a range of
For a range of α
α values:
values:

 The world is small (average path length is
The world is small (average path length is
short), and
short), and

 Groups tend to form (high clustering
Groups tend to form (high clustering
coefficient).
coefficient).
Probability of linkage as a function
of number of mutual friends
(α is 0 in upper left,
1 in diagonal,
and ∞ in bottom right curves.)
The Alpha Model
Watts (1999)
α
Clustering
coefficient
/
Normalized
path
length
Clustering coefficient (C) and
average path length (L)
plotted against α
α
α model: Add edges to nodes, as in random
model: Add edges to nodes, as in random
graphs, but makes links more likely when
graphs, but makes links more likely when
two nodes have a common friend.
two nodes have a common friend.
For a range of
For a range of α
α values:
values:

 The world is small (average path length is
The world is small (average path length is
short), and
short), and

 Groups tend to form (high clustering
Groups tend to form (high clustering
coefficient).
coefficient).
The Beta Model
Watts and Strogatz (1998)
β = 0 β = 0.125 β = 1
People know
others at
random.
Not clustered,
but “small world”
People know
their neighbors,
and a few distant people.
Clustered and
“small world”
People know
their neighbors.
Clustered, but
not a “small world”
The Beta Model
First five random links reduce the average path
First five random links reduce the average path
length of the network by half, regardless of
length of the network by half, regardless of N
N!
!
Both
Both α
α and
and β
β models reproduce short-path results
models reproduce short-path results
of random graphs, but also allow for clustering.
of random graphs, but also allow for clustering.
Small-world phenomena occur at threshold
Small-world phenomena occur at threshold
between order and chaos.
between order and chaos.
Watts and Strogatz (1998) Nobuyuki
Hanaki
Jonathan
Donner
Kentaro
Toyama
Clustering
coefficient
/
Normalized
path
length
Clustering coefficient (C) and average
path length (L) plotted against β
Power Laws
Albert and Barabasi (1999)
Degree distribution of a random graph,
N = 10,000 p = 0.0015 k = 15.
(Curve is a Poisson curve, for comparison.)
What
Whatʼ
ʼs the degree (number of edges) distribution
s the degree (number of edges) distribution
over a graph, for real-world graphs?
over a graph, for real-world graphs?
Random-graph model results in Poisson
Random-graph model results in Poisson
distribution.
distribution.
But, many real-world networks exhibit a
But, many real-world networks exhibit a power-law
power-law
distribution.
distribution.
Power Laws
Albert and Barabasi (1999)
Typical shape of a power-law distribution.
What
Whatʼ
ʼs the degree (number of edges) distribution
s the degree (number of edges) distribution
over a graph, for real-world graphs?
over a graph, for real-world graphs?
Random-graph model results in Poisson
Random-graph model results in Poisson
distribution.
distribution.
But, many real-world networks exhibit a
But, many real-world networks exhibit a power-law
power-law
distribution.
distribution.
Power Laws
Albert and Barabasi (1999)
Power-law distributions are straight lines in log-log
Power-law distributions are straight lines in log-log
space.
space.
How should random graphs be generated to create
How should random graphs be generated to create
a power-law distribution of node degrees?
a power-law distribution of node degrees?
Hint:
Hint:
Pareto
Paretoʼ
ʼs
s*
* Law: Wealth distribution follows a
Law: Wealth distribution follows a
power law.
power law.
Power laws in real networks:
(a) WWW hyperlinks
(b) co-starring in movies
(c) co-authorship of physicists
(d) co-authorship of neuroscientists
* Same Velfredo Pareto, who defined Pareto optimality in game theory.
Power Laws
“
“The rich get richer!
The rich get richer!”
”
Power-law distribution of node distribution arises if
Power-law distribution of node distribution arises if

 Number of nodes grow;
Number of nodes grow;

 Edges are added in proportion to the number of edges
Edges are added in proportion to the number of edges
a node already has.
a node already has.
Additional variable fitness coefficient allows for some
Additional variable fitness coefficient allows for some
nodes to grow faster than others.
nodes to grow faster than others.
Albert and Barabasi (1999) Jennifer
Chayes
Anandan
Kentaro
Toyama
“Map of the Internet” poster
Searchable Networks
Just because a short path exists, doesn
Just because a short path exists, doesnʼ
ʼt mean
t mean
you can easily find it.
you can easily find it.
You don
You donʼ
ʼt know all of the people whom your
t know all of the people whom your
friends know.
friends know.
Under what conditions is a network
Under what conditions is a network searchable
searchable?
?
Kleinberg (2000)
Searchable Networks
a)
a) Variation of
Variation of Watts
Wattsʼ
ʼs
s β
β model:
model:

 Lattice is
Lattice is d
d-dimensional (
-dimensional (d
d=2).
=2).

 One random link per node.
One random link per node.

 Parameter
Parameter α
α controls probability of random link
controls probability of random link –
– greater for closer nodes.
greater for closer nodes.
b)
b) For
For d
d=2, dip in time-to-search at
=2, dip in time-to-search at α
α=2
=2

 For low
For low α
α, random graph; no
, random graph; no “
“geographic
geographic”
” correlation in links
correlation in links

 For high
For high α
α, not a small world; no short paths to be found.
, not a small world; no short paths to be found.
c)
c) Searchability
Searchability dips at
dips at α
α=2, in simulation
=2, in simulation
Kleinberg (2000)
Searchable Networks
Watts,
Watts, Dodds
Dodds, Newman (2002) show that for
, Newman (2002) show that for d
d = 2
= 2
or 3, real networks are quite searchable.
or 3, real networks are quite searchable.
Killworth
Killworth and Bernard (1978) found that people
and Bernard (1978) found that people
tended to search their networks by
tended to search their networks by d
d = 2:
= 2:
geography and profession.
geography and profession.
Kleinberg (2000) Ramin
Zabih
Kentaro
Toyama
The Watts-Dodds-Newman model
closely fitting a real-world experiment
References

ldous & Wilson, Graphs and Applications. An
Introductory Approach, Springer, 2000.

Wasserman & Faust, Social Network Analysis,
Cambridge University Press, 2008.

More Related Content

Similar to graph_theory_1-11.pdf___________________

Lecture 5b graphs and hashing
Lecture 5b graphs and hashingLecture 5b graphs and hashing
Lecture 5b graphs and hashing
Victor Palmar
 

Similar to graph_theory_1-11.pdf___________________ (20)

Lecture 5b graphs and hashing
Lecture 5b graphs and hashingLecture 5b graphs and hashing
Lecture 5b graphs and hashing
 
Introduction to Graph Theory
Introduction to Graph TheoryIntroduction to Graph Theory
Introduction to Graph Theory
 
Graphs.pdf
Graphs.pdfGraphs.pdf
Graphs.pdf
 
Spanningtreesppt
SpanningtreespptSpanningtreesppt
Spanningtreesppt
 
graph theory
graph theorygraph theory
graph theory
 
Graphs.pptx
Graphs.pptxGraphs.pptx
Graphs.pptx
 
Graphs and eularian circuit & path with c++ program
Graphs and eularian circuit & path with c++ programGraphs and eularian circuit & path with c++ program
Graphs and eularian circuit & path with c++ program
 
Graph Introduction.ppt
Graph Introduction.pptGraph Introduction.ppt
Graph Introduction.ppt
 
Graphs
GraphsGraphs
Graphs
 
Ass. (3)graph d.m
Ass. (3)graph d.mAss. (3)graph d.m
Ass. (3)graph d.m
 
Graph in Data Structure
Graph in Data StructureGraph in Data Structure
Graph in Data Structure
 
graph.pptx
graph.pptxgraph.pptx
graph.pptx
 
FREQUENT SUBGRAPH MINING ALGORITHMS - A SURVEY AND FRAMEWORK FOR CLASSIFICATION
FREQUENT SUBGRAPH MINING ALGORITHMS - A SURVEY AND FRAMEWORK FOR CLASSIFICATIONFREQUENT SUBGRAPH MINING ALGORITHMS - A SURVEY AND FRAMEWORK FOR CLASSIFICATION
FREQUENT SUBGRAPH MINING ALGORITHMS - A SURVEY AND FRAMEWORK FOR CLASSIFICATION
 
Cs6702 2marks rejinpaul
Cs6702 2marks rejinpaulCs6702 2marks rejinpaul
Cs6702 2marks rejinpaul
 
Cs6702 GCC
Cs6702 GCCCs6702 GCC
Cs6702 GCC
 
Unit 2: All
Unit 2: AllUnit 2: All
Unit 2: All
 
graph ASS (1).ppt
graph ASS (1).pptgraph ASS (1).ppt
graph ASS (1).ppt
 
Data structure - Graph
Data structure - GraphData structure - Graph
Data structure - Graph
 
Graph ASS DBATU.pptx
Graph ASS DBATU.pptxGraph ASS DBATU.pptx
Graph ASS DBATU.pptx
 
Data Structures and Agorithm: DS 21 Graph Theory.pptx
Data Structures and Agorithm: DS 21 Graph Theory.pptxData Structures and Agorithm: DS 21 Graph Theory.pptx
Data Structures and Agorithm: DS 21 Graph Theory.pptx
 

Recently uploaded

一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
pyhepag
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
Amil baba
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
ju0dztxtn
 
edited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdfedited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdf
great91
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
zifhagzkk
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
pwgnohujw
 

Recently uploaded (20)

一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
社内勉強会資料  Mamba - A new era or ephemeral
社内勉強会資料   Mamba - A new era or ephemeral社内勉強会資料   Mamba - A new era or ephemeral
社内勉強会資料  Mamba - A new era or ephemeral
 
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
Jual Obat Aborsi Bandung (Asli No.1) Wa 082134680322 Klinik Obat Penggugur Ka...
 
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI  MANAJEMEN OF PENYAKIT TETANUS.pptMATERI  MANAJEMEN OF PENYAKIT TETANUS.ppt
MATERI MANAJEMEN OF PENYAKIT TETANUS.ppt
 
Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor NetworksSensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
Sensing the Future: Anomaly Detection and Event Prediction in Sensor Networks
 
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital AgeCredit Card Fraud Detection: Safeguarding Transactions in the Digital Age
Credit Card Fraud Detection: Safeguarding Transactions in the Digital Age
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
 
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting TechniquesPredictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
NO1 Best Kala Jadu Expert Specialist In Germany Kala Jadu Expert Specialist I...
 
Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024Northern New England Tableau User Group (TUG) May 2024
Northern New England Tableau User Group (TUG) May 2024
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
 
edited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdfedited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdf
 
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
如何办理(Dalhousie毕业证书)达尔豪斯大学毕业证成绩单留信学历认证
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
原件一样(UWO毕业证书)西安大略大学毕业证成绩单留信学历认证
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data Analytics
 
社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction社内勉強会資料_Object Recognition as Next Token Prediction
社内勉強会資料_Object Recognition as Next Token Prediction
 

graph_theory_1-11.pdf___________________

  • 1. Introduction to Network Introduction to Network Theory Theory
  • 2. What is a Network? What is a Network?   Network = graph Network = graph   Informally a Informally a graph graph is a set of nodes joined by a set of lines or is a set of nodes joined by a set of lines or arrows. arrows. 1 1 2 3 4 4 5 5 6 6 2 3
  • 3. Graph-based representations  Representing a problem as a graph can provide a different point of view  Representing a problem as a graph can make a problem much simpler  More accurately, it can provide the appropriate tools for solving the problem
  • 4. What is network theory?  Network theory provides a set of techniques for analysing graphs  Complex systems network theory provides techniques for analysing structure in a system of interacting agents, represented as a network  Applying network theory to a system means using a graph-theoretic representation
  • 5. What makes a problem graph-like?  There are two components to a graph  Nodes and edges  In graph-like problems, these components have natural correspondences to problem elements  Entities are nodes and interactions between entities are edges  Most complex systems are graph-like
  • 8. Business ties in US biotech- industry
  • 14. Graph Theory - History Graph Theory - History Leonhard Leonhard Euler's paper on Euler's paper on “ “Seven Seven Bridges of Bridges of Königsberg Königsberg” ” , , published in 1736. published in 1736.
  • 15. Graph Theory - History Graph Theory - History Cycles in Polyhedra Thomas P. Kirkman William R. Hamilton Hamiltonian cycles in Platonic graphs
  • 16. Graph Theory - History Graph Theory - History Gustav Kirchhoff Trees in Electric Circuits
  • 17. Graph Theory - History Graph Theory - History Arthur Cayley James J. Sylvester George Polya Enumeration of Chemical Isomers
  • 18. Graph Theory - History Graph Theory - History Francis Guthrie Auguste DeMorgan Four Colors of Maps
  • 19. Definition: Graph Definition: Graph   G is an ordered triple G:=(V, E, f) G is an ordered triple G:=(V, E, f)   V is a set of nodes, points, or vertices. V is a set of nodes, points, or vertices.   E is a set, whose elements are known as edges or lines. E is a set, whose elements are known as edges or lines.   f is a function f is a function   maps each element of E maps each element of E   to an unordered pair of vertices in V. to an unordered pair of vertices in V.
  • 20. Definitions Definitions   Vertex Vertex   Basic Element Basic Element   Drawn as a Drawn as a node node or a or a dot dot. .   V Vertex set ertex set of of G G is usually denoted by is usually denoted by V V( (G G), or ), or V V   Edge Edge   A set of two elements A set of two elements   Drawn as a line connecting two vertices, called end vertices, or Drawn as a line connecting two vertices, called end vertices, or endpoints. endpoints.   The edge set of G is usually denoted by E(G), or E. The edge set of G is usually denoted by E(G), or E.
  • 22. Simple Graphs Simple graphs Simple graphs are graphs without multiple edges or self-loops. are graphs without multiple edges or self-loops.
  • 23. Directed Graph (digraph) Directed Graph (digraph)   Edges have directions Edges have directions   An edge is an An edge is an ordered ordered pair of nodes pair of nodes loop node multiple arc arc
  • 24. Weighted graphs 1 2 3 4 5 6 .5 1.2 .2 .5 1.5 .3 1 4 5 6 2 3 2 1 3 5   is a graph for which each edge has an associated is a graph for which each edge has an associated weight weight, usually , usually given by a given by a weight function weight function w: E w: E → → R R. .
  • 25. Structures and structural metrics  Graph structures are used to isolate interesting or important sections of a graph  Structural metrics provide a measurement of a structural property of a graph  Global metrics refer to a whole graph  Local metrics refer to a single node in a graph
  • 26. Graph structures  Identify interesting sections of a graph  Interesting because they form a significant domain-specific structure, or because they significantly contribute to graph properties  A subset of the nodes and edges in a graph that possess certain characteristics, or relate to each other in particular ways
  • 27. Connectivity   a graph is a graph is connected connected if if   you can get from any node to any other by following a sequence of edges you can get from any node to any other by following a sequence of edges OR OR   any two nodes are connected by a path. any two nodes are connected by a path.   A directed graph is A directed graph is strongly connected strongly connected if there is a directed path from if there is a directed path from any node to any other node. any node to any other node.
  • 28. Component Component   Every disconnected graph can be split up into a number of Every disconnected graph can be split up into a number of connected connected components components. .
  • 29. Degree Degree   Number of edges incident on a node Number of edges incident on a node The degree of 5 is 3
  • 30. Degree (Directed Graphs) Degree (Directed Graphs)   In-degree: Number of edges entering In-degree: Number of edges entering   Out-degree: Number of edges leaving Out-degree: Number of edges leaving   Degree = Degree = indeg indeg + + outdeg outdeg outdeg(1)=2 indeg(1)=0 outdeg(2)=2 indeg(2)=2 outdeg(3)=1 indeg(3)=4
  • 31. Degree: Simple Facts   If If G G is a graph with is a graph with m m edges, then edges, then Σ Σ deg( deg(v v) = 2 ) = 2m m = 2 | = 2 |E E | |   If If G G is a digraph then is a digraph then Σ Σ indeg indeg( (v v)= )=Σ Σ outdeg outdeg( (v v) ) = = | |E E | |   Number of Odd degree Nodes is even Number of Odd degree Nodes is even
  • 32. Walks A walk of length k in a graph is a succession of k (not necessarily different) edges of the form uv,vw,wx,…,yz. This walk is denote by uvwx…xz, and is referred to as a walk between u and z. A walk is closed is u=z.
  • 33. Path Path   A A path path is a walk in which all the edges and all the nodes are different. is a walk in which all the edges and all the nodes are different. Walks and Paths 1,2,5,2,3,4 1,2,5,2,3,2,1 1,2,3,4,6 walk of length 5 CW of length 6 path of length 4
  • 34. Cycle   A A cycle cycle is a closed path in which all the edges are different. is a closed path in which all the edges are different. 1,2,5,1 2,3,4,5,2 3-cycle 4-cycle
  • 35. Special Types of Graphs   Empty Graph / Edgeless graph Empty Graph / Edgeless graph   No edge No edge   Null graph Null graph   No nodes No nodes   Obviously no edge Obviously no edge
  • 36. Trees Trees   Connected Acyclic Graph Connected Acyclic Graph   Two nodes have Two nodes have exactly exactly one path one path between them between them
  • 38. Connected Graph All nodes have the same degree Regular
  • 39. Special Regular Graphs: Cycles C3 C4 C5
  • 40. Bipartite Bipartite graph graph   V V can be partitioned into 2 sets can be partitioned into 2 sets V V1 1 and and V V2 2 such that ( such that (u u, ,v v) )∈ ∈E E implies implies   either either u u ∈ ∈V V1 1 and and v v ∈ ∈V V2 2   OR OR v v ∈ ∈V V1 1 and and u u∈ ∈V V2. 2.
  • 41. Complete Graph Complete Graph   Every pair of vertices are adjacent Every pair of vertices are adjacent   Has n(n-1)/2 edges Has n(n-1)/2 edges
  • 42. Complete Bipartite Graph Complete Bipartite Graph   Bipartite Variation of Complete Graph Bipartite Variation of Complete Graph   Every node of one set is connected to every other node on the Every node of one set is connected to every other node on the other set other set Stars
  • 43. Planar Graphs Planar Graphs   Can be drawn on a plane such that no two edges intersect Can be drawn on a plane such that no two edges intersect   K K4 4 is the largest complete graph that is planar is the largest complete graph that is planar
  • 44. Subgraph Subgraph   Vertex and edge sets are subsets of those of G Vertex and edge sets are subsets of those of G   a a supergraph supergraph of a graph G is a graph that contains G as a of a graph G is a graph that contains G as a subgraph subgraph. .
  • 45. Special Special Subgraphs Subgraphs: Cliques : Cliques A clique is a maximum complete connected subgraph. . A B D H F E C I G
  • 46. Spanning Spanning subgraph subgraph   Subgraph Subgraph H has the same vertex set as G. H has the same vertex set as G.   Possibly not all the edges Possibly not all the edges   “ “H spans G H spans G” ”. .
  • 47. Spanning tree Spanning tree   Let G be a connected graph. Then a Let G be a connected graph. Then a spanning tree spanning tree in G is a in G is a subgraph subgraph of G of G that includes every node and is also a that includes every node and is also a tree. tree.
  • 48. Isomorphism Isomorphism   Bijection Bijection, i.e., a one-to-one mapping: , i.e., a one-to-one mapping: f : V(G) -> V(H) f : V(G) -> V(H) u and v from G are adjacent if and only if f(u) and f(v) are u and v from G are adjacent if and only if f(u) and f(v) are adjacent in H. adjacent in H.   If an isomorphism can be constructed between two graphs, then If an isomorphism can be constructed between two graphs, then we say those graphs are we say those graphs are isomorphic isomorphic. .
  • 49. Isomorphism Problem Isomorphism Problem   Determining whether two graphs are Determining whether two graphs are isomorphic isomorphic   Although these graphs look very different, Although these graphs look very different, they are isomorphic; one isomorphism they are isomorphic; one isomorphism between them is between them is f(a)=1 f(b)=6 f(c)=8 f(d)=3 f(a)=1 f(b)=6 f(c)=8 f(d)=3 f(g)=5 f(h)=2 f(i)=4 f(j)=7 f(g)=5 f(h)=2 f(i)=4 f(j)=7
  • 50. Representation (Matrix) Representation (Matrix)   Incidence Matrix Incidence Matrix   V x E V x E   [vertex, edges] contains the edge's data [vertex, edges] contains the edge's data   Adjacency Matrix Adjacency Matrix   V x V V x V   Boolean values (adjacent or not) Boolean values (adjacent or not)   Or Edge Weights Or Edge Weights
  • 52. Representation (List) Representation (List)   Edge List Edge List   pairs (ordered if directed) of vertices pairs (ordered if directed) of vertices   Optionally weight and other data Optionally weight and other data   Adjacency List (node list) Adjacency List (node list)
  • 53. Implementation of a Graph. Implementation of a Graph.   Adjacency-list representation Adjacency-list representation   an array of | an array of |V V | lists, one for each vertex in | lists, one for each vertex in V V. .   For each For each u u ∈ ∈ V V , , ADJ ADJ [ [ u u ] points to all its adjacent vertices. ] points to all its adjacent vertices.
  • 54. Edge and Node Lists Edge and Node Lists Edge List 1 2 1 2 2 3 2 5 3 3 4 3 4 5 5 3 5 4 Node List 1 2 2 2 3 5 3 3 4 3 5 5 3 4
  • 55. Edge List 1 2 1.2 2 4 0.2 4 5 0.3 4 1 0.5 5 4 0.5 6 3 1.5 Edge Lists for Weighted Edge Lists for Weighted Graphs Graphs
  • 56. Topological Distance A shortest path is the minimum path A shortest path is the minimum path connecting two nodes. connecting two nodes. The number of edges in the shortest path The number of edges in the shortest path connecting connecting p p and and q q is the is the topological topological distance distance between these two nodes, between these two nodes, d dp p,q ,q
  • 57. | |V V | x | | x |V | V | matrix D matrix D = ( = ( d dij ij ) ) such that such that d dij ij is the topological distance between is the topological distance between i i and and j j. . 0 2 1 2 3 3 6 2 0 1 2 1 1 5 1 1 0 1 2 2 4 2 2 1 0 1 2 3 3 1 2 1 0 1 2 3 1 2 2 1 0 1 6 5 4 3 2 1 Distance Matrix Distance Matrix
  • 58. Random Graphs N N nodes nodes A pair of nodes has probability A pair of nodes has probability p p of of being connected. being connected. Average degree, Average degree, k k ≈ ≈ pN pN What interesting things can be said for What interesting things can be said for different values of p or k ? different values of p or k ? (that are true as N (that are true as N   ∞ ∞) ) Erdős and Renyi (1959) p = 0.0 ; k = 0 N = 12 p = 0.09 ; k = 1 p = 1.0 ; k ≈ ½N2
  • 59. Random Graphs Erdős and Renyi (1959) p = 0.0 ; k = 0 p = 0.09 ; k = 1 p = 1.0 ; k ≈ ½N2 p = 0.045 ; k = 0.5 Let’s look at… Size of the largest connected cluster Diameter (maximum path length between nodes) of the largest cluster Average path length between nodes (if a path exists)
  • 60. Random Graphs Erdős and Renyi (1959) p = 0.0 ; k = 0 p = 0.09 ; k = 1 p = 1.0 ; k ≈ ½N2 p = 0.045 ; k = 0.5 Size of largest component Diameter of largest component Average path length between nodes 1 5 11 12 0 4 7 1 0.0 2.0 4.2 1.0
  • 61. Random Graphs If If k k < 1: < 1:   small, isolated clusters small, isolated clusters   small diameters small diameters   short path lengths short path lengths At k = 1: At k = 1:   a a giant component giant component appears appears   diameter peaks diameter peaks   path lengths are high path lengths are high For k > 1: For k > 1:   almost all nodes connected almost all nodes connected   diameter shrinks diameter shrinks   path lengths shorten path lengths shorten Erdős and Renyi (1959) Percentage of nodes in largest component Diameter of largest component (not to scale) 1.0 0 k 1.0 phase transition
  • 62. Random Graphs What does this mean? What does this mean?   If connections between people can be modeled as a random graph, then If connections between people can be modeled as a random graph, then… …   Because the average person easily knows more than one person (k >> 1), Because the average person easily knows more than one person (k >> 1),   We live in a We live in a “ “small world small world” ” where within a few links, we are connected to anyone in the world. where within a few links, we are connected to anyone in the world.   Erd Erdő ős s and and Renyi Renyi showed that average showed that average path length between connected nodes is path length between connected nodes is Erdős and Renyi (1959) David Mumford Peter Belhumeur Kentaro Toyama Fan Chung
  • 63. Random Graphs What does this mean? What does this mean?   If connections between people can be modeled as a random graph, then If connections between people can be modeled as a random graph, then… …   Because the average person easily knows more than one person (k >> 1), Because the average person easily knows more than one person (k >> 1),   We live in a We live in a “ “small world small world” ” where within a few links, we are connected to anyone in the world. where within a few links, we are connected to anyone in the world.   Erd Erdő ős s and and Renyi Renyi computed average computed average path length between connected nodes to be: path length between connected nodes to be: Erdős and Renyi (1959) David Mumford Peter Belhumeur Kentaro Toyama Fan Chung BIG “IF”!!!
  • 64. The Alpha Model The people you know aren The people you know arenʼ ʼt randomly chosen. t randomly chosen. People tend to get to know those who are two People tend to get to know those who are two links away ( links away (Rapoport Rapoport * *, 1957). , 1957). The real world exhibits a lot of The real world exhibits a lot of clustering. clustering. Watts (1999) * Same Anatol Rapoport, known for TIT FOR TAT! The Personal Map by MSR Redmond’s Social Computing Group
  • 65. The Alpha Model Watts (1999) α α model: Add edges to nodes, as in random model: Add edges to nodes, as in random graphs, but makes links more likely when graphs, but makes links more likely when two nodes have a common friend. two nodes have a common friend. For a range of For a range of α α values: values:   The world is small (average path length is The world is small (average path length is short), and short), and   Groups tend to form (high clustering Groups tend to form (high clustering coefficient). coefficient). Probability of linkage as a function of number of mutual friends (α is 0 in upper left, 1 in diagonal, and ∞ in bottom right curves.)
  • 66. The Alpha Model Watts (1999) α Clustering coefficient / Normalized path length Clustering coefficient (C) and average path length (L) plotted against α α α model: Add edges to nodes, as in random model: Add edges to nodes, as in random graphs, but makes links more likely when graphs, but makes links more likely when two nodes have a common friend. two nodes have a common friend. For a range of For a range of α α values: values:   The world is small (average path length is The world is small (average path length is short), and short), and   Groups tend to form (high clustering Groups tend to form (high clustering coefficient). coefficient).
  • 67. The Beta Model Watts and Strogatz (1998) β = 0 β = 0.125 β = 1 People know others at random. Not clustered, but “small world” People know their neighbors, and a few distant people. Clustered and “small world” People know their neighbors. Clustered, but not a “small world”
  • 68. The Beta Model First five random links reduce the average path First five random links reduce the average path length of the network by half, regardless of length of the network by half, regardless of N N! ! Both Both α α and and β β models reproduce short-path results models reproduce short-path results of random graphs, but also allow for clustering. of random graphs, but also allow for clustering. Small-world phenomena occur at threshold Small-world phenomena occur at threshold between order and chaos. between order and chaos. Watts and Strogatz (1998) Nobuyuki Hanaki Jonathan Donner Kentaro Toyama Clustering coefficient / Normalized path length Clustering coefficient (C) and average path length (L) plotted against β
  • 69. Power Laws Albert and Barabasi (1999) Degree distribution of a random graph, N = 10,000 p = 0.0015 k = 15. (Curve is a Poisson curve, for comparison.) What Whatʼ ʼs the degree (number of edges) distribution s the degree (number of edges) distribution over a graph, for real-world graphs? over a graph, for real-world graphs? Random-graph model results in Poisson Random-graph model results in Poisson distribution. distribution. But, many real-world networks exhibit a But, many real-world networks exhibit a power-law power-law distribution. distribution.
  • 70. Power Laws Albert and Barabasi (1999) Typical shape of a power-law distribution. What Whatʼ ʼs the degree (number of edges) distribution s the degree (number of edges) distribution over a graph, for real-world graphs? over a graph, for real-world graphs? Random-graph model results in Poisson Random-graph model results in Poisson distribution. distribution. But, many real-world networks exhibit a But, many real-world networks exhibit a power-law power-law distribution. distribution.
  • 71. Power Laws Albert and Barabasi (1999) Power-law distributions are straight lines in log-log Power-law distributions are straight lines in log-log space. space. How should random graphs be generated to create How should random graphs be generated to create a power-law distribution of node degrees? a power-law distribution of node degrees? Hint: Hint: Pareto Paretoʼ ʼs s* * Law: Wealth distribution follows a Law: Wealth distribution follows a power law. power law. Power laws in real networks: (a) WWW hyperlinks (b) co-starring in movies (c) co-authorship of physicists (d) co-authorship of neuroscientists * Same Velfredo Pareto, who defined Pareto optimality in game theory.
  • 72. Power Laws “ “The rich get richer! The rich get richer!” ” Power-law distribution of node distribution arises if Power-law distribution of node distribution arises if   Number of nodes grow; Number of nodes grow;   Edges are added in proportion to the number of edges Edges are added in proportion to the number of edges a node already has. a node already has. Additional variable fitness coefficient allows for some Additional variable fitness coefficient allows for some nodes to grow faster than others. nodes to grow faster than others. Albert and Barabasi (1999) Jennifer Chayes Anandan Kentaro Toyama “Map of the Internet” poster
  • 73. Searchable Networks Just because a short path exists, doesn Just because a short path exists, doesnʼ ʼt mean t mean you can easily find it. you can easily find it. You don You donʼ ʼt know all of the people whom your t know all of the people whom your friends know. friends know. Under what conditions is a network Under what conditions is a network searchable searchable? ? Kleinberg (2000)
  • 74. Searchable Networks a) a) Variation of Variation of Watts Wattsʼ ʼs s β β model: model:   Lattice is Lattice is d d-dimensional ( -dimensional (d d=2). =2).   One random link per node. One random link per node.   Parameter Parameter α α controls probability of random link controls probability of random link – – greater for closer nodes. greater for closer nodes. b) b) For For d d=2, dip in time-to-search at =2, dip in time-to-search at α α=2 =2   For low For low α α, random graph; no , random graph; no “ “geographic geographic” ” correlation in links correlation in links   For high For high α α, not a small world; no short paths to be found. , not a small world; no short paths to be found. c) c) Searchability Searchability dips at dips at α α=2, in simulation =2, in simulation Kleinberg (2000)
  • 75. Searchable Networks Watts, Watts, Dodds Dodds, Newman (2002) show that for , Newman (2002) show that for d d = 2 = 2 or 3, real networks are quite searchable. or 3, real networks are quite searchable. Killworth Killworth and Bernard (1978) found that people and Bernard (1978) found that people tended to search their networks by tended to search their networks by d d = 2: = 2: geography and profession. geography and profession. Kleinberg (2000) Ramin Zabih Kentaro Toyama The Watts-Dodds-Newman model closely fitting a real-world experiment
  • 76. References  ldous & Wilson, Graphs and Applications. An Introductory Approach, Springer, 2000.  Wasserman & Faust, Social Network Analysis, Cambridge University Press, 2008.