The document describes an algorithm for drawing large graphs using divisive hierarchical k-means clustering. It first discusses the idea behind the algorithm, which is to arrange the vertices of a graph G into partitions at different levels of hierarchy to minimize the within-cluster sum of squares. It then describes implementing the algorithm using k-means clustering in an iterative fashion, and proves that the value being minimized decreases monotonically at each iteration.
Algorithmic Aspects of Vertex Geo-dominating Sets and Geonumber in Graphs (IJERA Editor)
In this paper we study the x-geodominating set, geodetic set, geo-set and geo-number of a graph G. We study the
binary operation, link vectors and some results required to develop algorithms. First we design two algorithms:
one to check whether a given set is an x-geodominating set, and one to find a minimum x-geodominating set of a graph.
Finally we present two further algorithms: one to check whether a given vertex is a geo-vertex, and one to find the
geo-number of a graph.
A Note on Non Split Locating Equitable Domination (ijdmtaiir)
Let G = (V, E) be a simple, undirected, finite
nontrivial graph. A non-empty set D ⊆ V of vertices in a graph
G is a dominating set if every vertex in V − D is adjacent to
some vertex in D. The domination number γ(G) of G is the
minimum cardinality of a dominating set. A dominating set D
is called a non split locating equitable dominating set if for
any two vertices u, w ∈ V − D, N(u) ∩ D ≠ N(w) ∩ D,
|N(u) ∩ D| = |N(w) ∩ D|, and the induced subgraph ⟨V − D⟩ is
connected. The minimum cardinality of a non split locating
equitable dominating set is called the non split locating
equitable domination number of G and is denoted by γnsle(G).
In this paper, bounds for γnsle(G) and exact values for some
particular classes of graphs are found.
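To make the definition concrete, here is a small sketch (my own illustrative code, not from the paper) that checks the three conditions for a given set D: domination, the locating equitable condition on neighbourhoods in D, and connectivity of the induced subgraph on V − D:

```python
from itertools import combinations

def is_nsle_dominating(adj, D):
    """Check whether D is a non split locating equitable dominating set
    of the graph given as adjacency sets (vertex -> set of neighbours)."""
    V = set(adj)
    rest = V - D
    # Dominating: every vertex outside D has a neighbour in D.
    if any(not (adj[v] & D) for v in rest):
        return False
    # Locating equitable: for u != w outside D, N(u) ∩ D != N(w) ∩ D
    # while |N(u) ∩ D| == |N(w) ∩ D|.
    for u, w in combinations(rest, 2):
        if adj[u] & D == adj[w] & D or len(adj[u] & D) != len(adj[w] & D):
            return False
    # Non split: the induced subgraph on V - D is connected (DFS check).
    if rest:
        seen, stack = set(), [next(iter(rest))]
        while stack:
            v = stack.pop()
            if v not in seen:
                seen.add(v)
                stack.extend(adj[v] & (rest - seen))
        if seen != rest:
            return False
    return True
```

For the 4-cycle, for example, {0} fails (vertex 2 is undominated) while {0, 1} satisfies all three conditions.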
Kruskal's algorithm finds a minimum-cost spanning tree of a connected weighted undirected graph. It first appeared in Proceedings of the American Mathematical Society in 1956 and was written by Joseph Kruskal.
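A minimal union-find sketch of the idea (illustrative, not Kruskal's original formulation): scan the edges in increasing weight order and keep each edge that joins two different components.

```python
def kruskal_mst(n, edges):
    """Minimum-cost spanning tree of a connected weighted undirected graph.

    n: number of vertices, labeled 0..n-1
    edges: list of (weight, u, v) tuples
    Returns the list of MST edges.
    """
    parent = list(range(n))

    def find(x):  # union-find root with path halving
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    mst = []
    for w, u, v in sorted(edges):      # edges in increasing weight
        ru, rv = find(u), find(v)
        if ru != rv:                   # keep the edge iff it merges two trees
            parent[ru] = rv
            mst.append((w, u, v))
    return mst
```

For example, `kruskal_mst(4, [(1, 0, 1), (2, 1, 2), (3, 0, 2), (4, 2, 3)])` keeps the edges of weight 1, 2 and 4 and skips the weight-3 edge, which would close a cycle.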
The degree equitable connected cototal dominating graph D_c^e(G) of a graph G = (V, E) is the graph with V(D_c^e(G)) = V(G) ∪ F, where F is the set of all minimal degree equitable connected cototal dominating sets of G, and in which two vertices u, v ∈ V(D_c^e(G)) are adjacent if u ∈ V(G) and v = D ∈ F is a minimal degree equitable connected cototal dominating set of G containing u. In this paper we introduce this new graph-valued function and obtain some results.
LADDER AND SUBDIVISION OF LADDER GRAPHS WITH PENDANT EDGES ARE ODD GRACEFUL (Fransiskeran)
The ladder graph plays an important role in many applications in electronics, electrical engineering and wireless
communication. The aim of this work is to present a new class of odd graceful labelings for the ladder
graph. In particular, we show that the ladder graph L_n with m pendant edges, L_n ⊙ mK_1, is odd graceful. We also
show that the subdivision of the ladder graph L_n with m pendant edges, S(L_n) ⊙ mK_1, is odd graceful. Finally, we
prove that all subdivisions of triangular snakes (Δ_k-snakes) with pendant edges,
S(Δ_k-snake) ⊙ mK_1, are odd graceful.
We study the communication cost of computing functions when inputs are distributed among k processors, each of which is located at one vertex of a network/graph called a terminal. Every other node of the network also has a processor, with no input. The communication is point-to-point and the cost is the total number of bits exchanged by the protocol, in the worst case, on all edges. Our results show the effect of the topology of the network on the total communication cost. We prove tight bounds for simple functions like Element-Distinctness (ED), which depend on the 1-median of the graph. On the other hand, we show that for a large class of natural functions like Set-Disjointness the communication cost is essentially n times the cost of the optimal Steiner tree connecting the terminals. Further, we show that for natural composed functions like ED∘XOR and XOR∘ED, the naive protocols suggested by their definitions are optimal for general networks. Interestingly, the bounds for these functions depend on more involved topological parameters that are a combination of Steiner tree and 1-median costs. To obtain our results, we use tools such as metric embeddings and linear programming whose use in the context of communication complexity is novel as far as we know. (Based on joint works with Jaikumar Radhakrishnan and Atri Rudra)
In [8] Liang and Bai have shown that the kC_4-snake graph is an odd harmonious graph for each k ≥ 1.
In this paper we generalize this result to cycles by showing that the kC_n-snake with string 1,1,…,1 is
an odd harmonious graph when n ≡ 0 (mod 4). Also we show that the kC_4-snake with m pendant edges is odd harmonious for
each k, m ≥ 1 (for the linear case and for the general case). Moreover, we show that all subdivisions of the 2mΔ_k-snake
are odd harmonious for each k, m ≥ 1. Finally we present some examples to illustrate the proposed
theories.
In [1] Abdel-Aal introduced the notions of m-shadow graphs and n-splitting graphs, for all m, n ≥ 1.
In this paper, we prove that the m-shadow graphs of paths and complete bipartite graphs are odd
harmonious graphs for all m ≥ 1. Also, we prove that the n-splitting graphs of paths, stars and the symmetric
product of paths and null graphs are odd harmonious graphs for all n ≥ 1. In addition, we present
some examples to illustrate the proposed theories. Moreover, we show that some families of graphs admit an
odd harmonious labeling.
On the Odd Gracefulness of Cyclic Snakes With Pendant Edges (GiselleginaGloria)
Gracefulness and odd gracefulness of a graph are two entirely different concepts. A graph may possess one or both of these, or neither. We present four new families of odd graceful graphs. In particular, we show an odd graceful labeling of the linear kC_4-snake ⊙ mK_1, and then introduce an odd graceful labeling of the kC_4-snake ⊙ mK_1 (for the general case). We prove that the subdivision of the linear kC_3-snake is odd graceful. We also prove that the subdivision of the linear kC_3-snake with m pendant edges is odd graceful. Finally, we present an odd graceful labeling of the crown graph P_n ⊙ mK_1.
A review of one of the most popular methods of clustering, part of what is known as unsupervised learning: k-means. Here, we go from the basic heuristic used to attack the NP-hard problem to an approximation algorithm for k-centers. Additionally, we look at variations coming from fuzzy-set ideas. In the future, we will add more about on-line algorithms in the line of stochastic-gradient ideas.
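As a concrete companion to the k-centers step, here is a sketch of Gonzalez's farthest-first traversal, the classic 2-approximation for the k-center objective (the function name and data are illustrative, not taken from the review):

```python
import math

def k_centers(points, k):
    """Gonzalez's farthest-first traversal: 2-approximation for k-center.

    Repeatedly add the point that is farthest from the centers chosen
    so far, so the largest point-to-center distance shrinks greedily.
    """
    centers = [points[0]]            # arbitrary first center
    # dist[i] = distance from points[i] to its nearest chosen center
    dist = [math.dist(p, centers[0]) for p in points]
    while len(centers) < k:
        i = max(range(len(points)), key=dist.__getitem__)
        centers.append(points[i])
        dist = [min(d, math.dist(p, points[i])) for p, d in zip(points, dist)]
    return centers
```

On the line of points (0,0), (1,0), (10,0), (11,0) with k = 2, the traversal picks (0,0) and then the farthest point (11,0), correctly separating the two pairs.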
E-Cordial Labeling of Some Mirror Graphs (Waqas Tariq)
Let G be a bipartite graph with partite sets V1 and V2, and let G′ be a copy of G with corresponding partite sets V1′ and V2′. The mirror graph M(G) of G is obtained from G and G′ by joining each vertex of V2 to its corresponding vertex in V2′ by an edge. Here we investigate E-cordial labelings of some mirror graphs. We prove that the mirror graphs of the even cycle Cn, the even path Pn and the hypercube Qk are E-cordial graphs.
Introduction:
RNA interference (RNAi) or Post-Transcriptional Gene Silencing (PTGS) is an important biological process for modulating eukaryotic gene expression.
It is a highly conserved process of post-transcriptional gene silencing by which double-stranded RNA (dsRNA) causes sequence-specific degradation of mRNA sequences.
dsRNA-induced gene silencing (RNAi) has been reported in a wide range of eukaryotes, including worms, insects, mammals and plants.
This process mediates resistance to both endogenous parasitic and exogenous pathogenic nucleic acids, and regulates the expression of protein-coding genes.
What are small ncRNAs?
microRNA (miRNA)
short interfering RNA (siRNA)
Properties of small non-coding RNA:
Involved in silencing mRNA transcripts.
Called “small” because they are usually only about 21-24 nucleotides long.
Synthesized by first cutting up longer precursor sequences (like the 61nt one that Lee discovered).
Silence an mRNA by base pairing with some sequence on the mRNA.
Discovery of siRNA?
The first small RNA:
In 1993 Rosalind Lee (Victor Ambros lab) was studying a non-coding gene in C. elegans, lin-4, that was involved in silencing of another gene, lin-14, at the appropriate time in the
development of the worm C. elegans.
Two small transcripts of lin-4 (22nt and 61nt) were found to be complementary to a sequence in the 3' UTR of lin-14.
Because lin-4 encoded no protein, she deduced that it must be these transcripts that are causing the silencing by RNA-RNA interactions.
Types of RNAi (non-coding RNA)
miRNA
Length: 23-25 nt
Trans-acting
Binds its target mRNA with mismatches
Translation inhibition
siRNA
Length: 21 nt
Cis-acting
Binds its target mRNA with a perfectly complementary sequence
piRNA (Piwi-interacting RNA)
Length: 25 to 36 nt
Expressed in germ cells
Regulates transposon activity
MECHANISM OF RNAI:
First the double-stranded RNA teams up with a protein complex named Dicer, which cuts the long RNA into short pieces.
Then another protein complex called RISC (RNA-induced silencing complex) discards one of the two RNA strands.
The RISC-docked, single-stranded RNA then pairs with the homologous mRNA and destroys it.
THE RISC COMPLEX:
RISC is a large (>500 kDa) multi-protein RNA-binding complex which triggers mRNA degradation, guided by the siRNA.
Unwinding of the double-stranded siRNA by an ATP-independent helicase.
The active component of RISC is the Ago proteins (endonucleases), which cleave the target mRNA.
DICER: endonuclease (RNase III family)
Argonaute: Central Component of the RNA-Induced Silencing Complex (RISC)
One strand of the dsRNA produced by Dicer is retained in the RISC complex in association with Argonaute
ARGONAUTE PROTEIN:
1. PAZ (PIWI/Argonaute/Zwille): recognition of the target mRNA.
2. PIWI (P-element induced wimpy testis): breaks the phosphodiester bond of the mRNA (RNase H activity).
miRNA:
The double-stranded RNAs are naturally produced in eukaryotic cells during development, and they have a key role in regulating gene expression.
Brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
Cancer Cell Metabolism: Special Reference to the Lactate Pathway (AADYARAJPANDEY1)
Normal Cell Metabolism:
Cellular respiration describes the series of steps that cells use to break down sugar and other chemicals to get the energy we need to function.
Energy is stored in the bonds of glucose and when glucose is broken down, much of that energy is released.
Cells utilize energy in the form of ATP.
The first step of respiration is called glycolysis. In a series of steps, glycolysis breaks glucose into two smaller molecules - a chemical called pyruvate. A small amount of ATP is formed during this process.
Most healthy cells continue the breakdown in a second process, called the Krebs cycle. The Krebs cycle allows cells to “burn” the pyruvate made in glycolysis to get more ATP.
The last step in the breakdown of glucose is called oxidative phosphorylation (Ox-Phos).
It takes place in specialized cell structures called mitochondria. This process produces a large amount of ATP. Importantly, cells need oxygen to complete oxidative phosphorylation.
If a cell completes only glycolysis, only 2 molecules of ATP are made per glucose. However, if the cell completes the entire respiration process (glycolysis, Krebs cycle, oxidative phosphorylation), about 36 molecules of ATP are created, giving it much more energy to use.
IN CANCER CELL:
Unlike healthy cells that "burn" the entire molecule of sugar to capture a large amount of energy as ATP, cancer cells are wasteful.
Cancer cells only partially break down sugar molecules. They overuse the first step of respiration, glycolysis. They frequently do not complete the second step, oxidative phosphorylation.
This results in only 2 molecules of ATP per each glucose molecule instead of the 36 or so ATPs healthy cells gain. As a result, cancer cells need to use a lot more sugar molecules to get enough energy to survive.
Introduction to the Warburg phenomenon:
WARBURG EFFECT: Usually, cancer cells are highly glycolytic (glucose addiction) and take up more glucose from outside than normal cells do.
Otto Heinrich Warburg (8 October 1883 – 1 August 1970) was awarded the Nobel Prize in Physiology or Medicine in 1931 for his "discovery of the nature and mode of action of the respiratory enzyme".
Warburg effect: the tendency of cancer cells under aerobic (well-oxygenated) conditions to metabolize glucose to lactate (aerobic glycolysis) is known as the Warburg effect. Warburg made the observation that tumor slices consume glucose and secrete lactate at a higher rate than normal tissues.
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini... (Scintica Instrumentation)
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed-tissue imaging, IVM allows for ultra-fast, high-resolution imaging of cellular processes over time and space in their natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provides insights into the progression of disease, responses to treatments, and developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM Technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system's unique features and user-friendly software enable researchers to probe fast, dynamic biological processes such as immune cell tracking and cell-cell interaction, as well as vascularization and tumor metastasis, in exceptional detail. This webinar also gives an overview of IVM being utilized in drug development, offering a view into the intricate interactions between drugs/nanoparticles and tissues in vivo and allowing for the evaluation of therapeutic interventions in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancement of novel therapeutic strategies.
1. Drawing Large Graphs Using Divisive Hierarchical k-means
Barbara Ikica
22. 11. 2012
Barbara Ikica
Drawing Large Graphs Using Divisive Hierarchical k-means
2. Abstract
Idea behind the algorithm
Implementation
Hierarchical clustering (Divisive Hierarchical k-means)
Graph representation
Identifying connected components
Diffusion kernels on graphs
Drawing
Time complexity
Random projections
11. Idea behind the algorithm
[Figure, slides 11–13: an example graph on the vertices 0–8, drawn first as a whole and then with its vertices grouped into the clusters {0, 1, 2}, {3, 4, 5, 6} and {7, 8}.]
14. Idea behind the algorithm
Objective: Arrangement of the (sets of the) vertices of a graph G = (V(G), E(G)) at the level of hierarchy n
1. Determining the partition of V(G): P_n = {M_i}_{i=1}^{m}
2. Determining the drawing area for each M_i ∈ P_n
18. Implementation – Hierarchical clustering
k-means clustering
M := {v_i}_{i=1}^{n}, v_i ∈ R^d for all i, 1 ≤ i ≤ n.
We aim to partition the set M into k clusters P = {S_1, S_2, ..., S_k} so as to minimize the within-cluster sum of squares
    min_P Σ_{i=1}^{k} Σ_{v_j ∈ S_i} ‖v_j − µ_i‖²,
where µ_i := (1/|S_i|) Σ_{v_j ∈ S_i} v_j is the centroid vector of the cluster S_i.
19. Implementation – Hierarchical clustering
k-means clustering
NP-hard problem
If k and d are fixed, the problem can be solved exactly in O(n^{dk+1} log n) steps.
21. Implementation – Hierarchical clustering
k-means clustering
Recall that we aim to partition the set M into k clusters P = {S_1, S_2, ..., S_k} so as to minimize the within-cluster sum of squares
    min_P Σ_{i=1}^{k} Σ_{v_j ∈ S_i} ‖v_j − µ_i‖².
Lemma 1
Choose arbitrary S ⊂ R^d and z ∈ R^d. Then
    Σ_{v ∈ S} ‖v − z‖² = Σ_{v ∈ S} ‖v − µ‖² + |S| ‖z − µ‖²,
where µ is the centroid vector of the cluster S, i.e. µ = (1/|S|) Σ_{v ∈ S} v.
Corollary 1
    min_{z ∈ R^d} Σ_{v ∈ S} ‖v − z‖² = Σ_{v ∈ S} ‖v − µ‖².
Lemma 2
Let X denote an arbitrary random variable with values in R^d. For any z ∈ R^d the following holds:
    E(‖X − z‖²) = E(‖X − E(X)‖²) + ‖z − E(X)‖².
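A short expansion shows why Lemma 1 (and hence Corollary 1) holds; the cross term vanishes because µ is the mean of S:

```latex
\begin{aligned}
\sum_{v\in S}\|v-z\|^2
  &= \sum_{v\in S}\|(v-\mu)+(\mu-z)\|^2\\
  &= \sum_{v\in S}\|v-\mu\|^2
   + 2\Big\langle\sum_{v\in S}(v-\mu),\,\mu-z\Big\rangle
   + |S|\,\|\mu-z\|^2\\
  &= \sum_{v\in S}\|v-\mu\|^2 + |S|\,\|z-\mu\|^2,
  \qquad\text{since }\sum_{v\in S}(v-\mu)=0.
\end{aligned}
```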
24. Implementation – Hierarchical clustering
k-means clustering – iterative algorithm
Randomly pick k vectors from M = {v_1, v_2, ..., v_n} as the centroids µ_i^{(0)} for each i = 1, 2, ..., k.
For t = 0, 1, 2, ... repeat (for each i = 1, 2, ..., k)
1. S_i^{(t)} = { v_p : ‖v_p − µ_i^{(t)}‖² ≤ ‖v_p − µ_j^{(t)}‖² for all j, 1 ≤ j ≤ k } and
2. µ_i^{(t+1)} = (1/|S_i^{(t)}|) Σ_{v_j ∈ S_i^{(t)}} v_j
until there is no further change in the assignments of the vectors to the clusters S_i^{(t)} (for each i) in the partition P.
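The two steps above translate directly into code. The following NumPy sketch (illustrative, with simplified initialization and empty-cluster handling, not the presentation's implementation) alternates the assignment and update steps until the assignments stop changing:

```python
import numpy as np

def kmeans(M, k, seed=0, max_iter=100):
    """Lloyd's algorithm: alternate the assignment and update steps
    until the cluster assignments stop changing."""
    rng = np.random.default_rng(seed)
    # Randomly pick k distinct vectors from M as the initial centroids.
    mu = M[rng.choice(len(M), size=k, replace=False)].astype(float)
    labels = None
    for t in range(max_iter):
        # Step 1: assign each v_p to its nearest centroid
        # (squared Euclidean distance).
        dists = ((M[:, None, :] - mu[None, :, :]) ** 2).sum(axis=2)
        new_labels = dists.argmin(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break  # no assignment changed: converged
        labels = new_labels
        # Step 2: mu_i <- centroid (mean) of the vectors assigned to S_i.
        for i in range(k):
            members = M[labels == i]
            if len(members) > 0:  # guard against empty clusters
                mu[i] = members.mean(axis=0)
    return labels, mu
```

On two well-separated pairs of points, the algorithm recovers the pairs and returns their exact means as centroids.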
28. Implementation – Hierarchical clustering
k-means clustering – iterative algorithm

Lemma 3
The value of the expression
\[ \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t)}} \|v_j - \mu_i^{(t)}\|^2 \]
decreases monotonically during the iteration.

Proof.
First,
\[ \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t)}} \|v_j - \mu_i^{(t)}\|^2 \le \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t-1)}} \|v_j - \mu_i^{(t)}\|^2, \tag{1} \]
since
\[ S_i^{(t)} = \left\{ v_p : \|v_p - \mu_i^{(t)}\|^2 \le \|v_p - \mu_j^{(t)}\|^2 \ \forall j : 1 \le j \le k \right\}, \]
i.e. the assignment step moves every vector to its nearest centroid.
Second,
\[ \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t)}} \|v_j - \mu_i^{(t+1)}\|^2 \le \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t)}} \|v_j - \mu_i^{(t)}\|^2, \tag{2} \]
since
\[ \mu_i^{(t+1)} = \frac{1}{|S_i^{(t)}|} \sum_{v_j \in S_i^{(t)}} v_j \quad\text{and}\quad \min_{z \in \mathbb{R}^d} \sum_{v \in S} \|v - z\|^2 = \sum_{v \in S} \|v - \mu\|^2, \]
where μ denotes the mean of the set S.
Thus, applying (1) with t replaced by t + 1 and then (2),
\[ \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t+1)}} \|v_j - \mu_i^{(t+1)}\|^2 \le \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t)}} \|v_j - \mu_i^{(t+1)}\|^2 \le \sum_{i=1}^{k} \sum_{v_j \in S_i^{(t)}} \|v_j - \mu_i^{(t)}\|^2. \]
∎
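Lemma 3 can also be observed numerically: recording the objective after every assignment step of a hypothetical k-means run should yield a non-increasing sequence. A self-contained sketch (the grid of test points is an invented example):

```python
import random

def kmeans_wcss_trace(vectors, k, iters=20, seed=1):
    """Run k-means, recording the within-cluster sum of squares
    sum_i sum_{v in S_i^(t)} ||v - mu_i^(t)||^2 after each assignment step."""
    rng = random.Random(seed)
    centroids = rng.sample(vectors, k)
    trace = []
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in vectors:
            i = min(range(k), key=lambda j: sum((x - m) ** 2 for x, m in zip(v, centroids[j])))
            clusters[i].append(v)
        trace.append(sum(
            sum((x - m) ** 2 for x, m in zip(v, centroids[i]))
            for i in range(k) for v in clusters[i]
        ))
        # Update step: centroid = mean of its cluster (kept if the cluster is empty).
        for i in range(k):
            if clusters[i]:
                centroids[i] = tuple(sum(c) / len(clusters[i]) for c in zip(*clusters[i]))
    return trace

pts = [(float(a), float(b)) for a in range(4) for b in range(4)]
trace = kmeans_wcss_trace(pts, k=3)
# The objective never increases from one iteration to the next (Lemma 3).
assert all(t1 >= t2 - 1e-9 for t1, t2 in zip(trace, trace[1:]))
```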
37. Implementation – Hierarchical clustering
Hierarchical clustering
[Figure: schematic representation of the hierarchical partition of the set M into A1 and A2, with A1 split into A11 (further split into A111, A112) and A12, and A2 split into A21 (further split into A211, A212) and A22 (further split into A221, A222).]
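The schematic partition above arises from applying the clustering recursively: split M in two, then split each part again, down to some minimum size. A hypothetical sketch follows; the deterministic `split_in_two` routine and the `min_size` threshold are stand-ins for illustration, not the thesis code.

```python
def split_in_two(points):
    """2-means with the two mutually farthest points as initial centroids
    (a deterministic stand-in for the random initialisation)."""
    d2 = lambda p, q: sum((x - y) ** 2 for x, y in zip(p, q))
    c = list(max(((p, q) for p in points for q in points), key=lambda pq: d2(*pq)))
    for _ in range(10):                      # a few Lloyd iterations
        parts = ([], [])
        for p in points:
            parts[0 if d2(p, c[0]) <= d2(p, c[1]) else 1].append(p)
        for i in (0, 1):
            if parts[i]:
                c[i] = tuple(sum(x) / len(parts[i]) for x in zip(*parts[i]))
    return parts

def divisive(points, label="A", min_size=2, tree=None):
    """Recursively bisect `points`, labelling children like A1, A2, A11, A12, ..."""
    if tree is None:
        tree = {}
    tree[label] = points
    if len(points) > min_size:
        left, right = split_in_two(points)
        if left and right:                   # only recurse on a proper split
            divisive(left, label + "1", min_size, tree)
            divisive(right, label + "2", min_size, tree)
    return tree

pts = [(0.0, 0.0), (0.0, 1.0), (5.0, 0.0), (5.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
tree = divisive(pts, label="M")
# The children of every node partition their parent, as in the schematic.
assert set(tree["M1"]) | set(tree["M2"]) == set(pts)
```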
78. Implementation – Diffusion kernels on graphs
[Figure: an example graph on the vertices 0, 1, 2, 3.]

79. Implementation – Diffusion kernels on graphs
Hierarchical clustering on the row vectors of the adjacency matrix MG gives:
S1 = {1, 2}
S2 = {0, 3}

80. Implementation – Diffusion kernels on graphs
Hierarchical clustering on the row vectors of ? gives:
S1 = {0, 1, 2}
S2 = {3}
81. Implementation – Diffusion kernels on graphs
The adjacency matrix MG has the following property:
\[ [(M_G)^k]_{ij} = \text{the number of walks of length } k \text{ between vertices } i \text{ and } j. \]
We modify the algorithm by replacing MG with the kernel matrix KG:
\[ K_G := \sum_{k=0}^{\infty} \frac{\alpha^k (M_G)^k}{k!} = \exp(\alpha M_G). \]
83. Implementation – Diffusion kernels on graphs
k : Ω × Ω → R is a similarity measure if k(x, y) characterizes the similarity of x, y ∈ Ω.
k : Ω × Ω → R is a kernel if:
1. k(x, y) = k(y, x) for all x, y ∈ Ω,
2. k is positive semidefinite: the kernel matrix K ∈ R^{n×n} with [K]_{ij} = k(x_i, x_j) is positive semidefinite for all x_1, x_2, . . . , x_n ∈ Ω.
Given a kernel k, there exist a Hilbert space H_k and a map φ : Ω → H_k such that ⟨φ(x), φ(y)⟩_{H_k} = k(x, y) for all x, y ∈ Ω.
86. Implementation – Diffusion kernels on graphs
\[ K_G := \sum_{k=0}^{\infty} \frac{\alpha^k (M_G)^k}{k!} = \exp(\alpha M_G) \]
indeed is a kernel matrix: since MG is symmetric, it diagonalizes orthogonally as MG = UDU^T, so
\[ K_G = \exp(\alpha M_G) = \exp(\alpha U D U^T) = U \exp(\alpha D) U^T, \]
and the diagonal matrix exp(αD) has strictly positive diagonal entries.
⟹ KG is a symmetric positive semidefinite matrix.
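Symmetry and positive semidefiniteness of KG are easy to verify numerically. The example graph in the slides survives only as residue, so the path graph 0–1–2–3 below is an assumed stand-in:

```python
import numpy as np

# Hypothetical adjacency matrix of a path graph 0-1-2-3 (assumed example).
M = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]], dtype=float)
alpha = 1.0

# K_G = exp(alpha * M) via the eigendecomposition M = U D U^T,
# so K_G = U exp(alpha D) U^T as derived above.
eigvals, U = np.linalg.eigh(M)
K = U @ np.diag(np.exp(alpha * eigvals)) @ U.T

assert np.allclose(K, K.T)              # symmetric
assert np.linalg.eigvalsh(K).min() > 0  # eigenvalues exp(alpha*lambda_i) > 0
```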
91. Implementation – Hierarchical clustering on KG
Example
[Figure: an example graph on the vertices 0–11; the successive slides highlight the clusters found at each level, with vertices labelled by codes such as "1#1", "2#11", . . . , "2#2222" recording their position in the hierarchical partition.]
96. Implementation – Drawing
[Figure: determining the drawing areas for the sets in the partitions P0, P1, P2 and P3 of V(G), shown alongside the hierarchical partition tree of V(G) into A1, A2 and their subsets A11, A12, A21, A22 and A111, A112, A211, A212, A221, A222.]
112. Implementation – Time complexity
Efficient computation of KG = exp(αMG)
Computation of KG is needed in the algorithm:
to determine the centroids μj,
to minimize ||vi − μj||²:
\[ \|v_i - \mu_j\|^2 = \langle v_i - \mu_j, v_i - \mu_j \rangle = \langle v_i, v_i \rangle - 2\langle v_i, \mu_j \rangle + \langle \mu_j, \mu_j \rangle. \]
The question is thus: how can we efficiently multiply the matrix KG with an arbitrary vector, and how can we efficiently compute the inner products ⟨vi, vi⟩ = ||vi||²?
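The expansion above is what lets the squared distances be computed from inner products alone; a quick numerical check on arbitrary vectors:

```python
import numpy as np

rng = np.random.default_rng(0)
v, mu = rng.standard_normal(5), rng.standard_normal(5)

lhs = np.sum((v - mu) ** 2)              # ||v - mu||^2 directly
rhs = v @ v - 2 * (v @ mu) + mu @ mu     # via inner products only
assert np.isclose(lhs, rhs)
```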
115. Implementation – Time complexity
Multiplying the matrix KG with an arbitrary vector v
\[ K_G v = Iv + \frac{\alpha}{1} M_G v + \frac{\alpha^2}{2!} (M_G)^2 v + \frac{\alpha^3}{3!} (M_G)^3 v + \dots \]
\[ = v + \frac{\alpha}{1}(M_G v) + \frac{\alpha}{2} M_G \left( \frac{\alpha}{1}(M_G v) \right) + \frac{\alpha}{3} M_G \left( \frac{\alpha}{2} M_G \left( \frac{\alpha}{1}(M_G v) \right) \right) + \dots \]
Each term is obtained from the previous one by a single (sparse) matrix–vector multiplication with MG and a scalar rescaling, so the series can be evaluated without ever forming KG explicitly.
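The nested form can be sketched directly: each series term is the previous one multiplied by MG and scaled by α/k. This truncates the series at a fixed number of terms, which is an assumption for illustration; the check compares against a dense exp(αM) built from an eigendecomposition.

```python
import numpy as np

def kernel_matvec(M, v, alpha=1.0, terms=30):
    """Compute exp(alpha*M) @ v via the truncated exponential series,
    using term_k = (alpha/k) * M @ term_{k-1} as in the nested form."""
    term = v.copy()              # k = 0 term: I v
    result = v.copy()
    for k in range(1, terms):
        term = (alpha / k) * (M @ term)   # one matvec per term
        result = result + term
    return result

# Check against exp(alpha*M) computed densely on a small symmetric matrix.
rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
M = (A + A.T) / 2
v = rng.standard_normal(5)

w, U = np.linalg.eigh(M)
K = U @ np.diag(np.exp(w)) @ U.T
assert np.allclose(kernel_matvec(M, v), K @ v)
```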
122. Implementation – Random projections
Random projections
Corollary of the Johnson–Lindenstrauss lemma:

Theorem 1
Let P be an arbitrary set of n points in R^d, represented as an n × d matrix A ∈ R^{n×d}. Given ε > 0, β > 0, let
\[ k_0 = \frac{4 + 2\beta}{\varepsilon^2/2 - \varepsilon^3/3} \log n. \]
For an integer k ≥ k0, let R ∈ R^{d×k} be a random matrix with elements r_ij, where the r_ij are independent random variables drawn from either one of the following two probability distributions:
\[ r_{ij} = \begin{cases} +1 & \text{with probability } 1/2, \\ -1 & \text{with probability } 1/2, \end{cases} \qquad\text{or}\qquad r_{ij} = \begin{cases} +1 & \text{with probability } 1/6, \\ \phantom{+}0 & \text{with probability } 2/3, \\ -1 & \text{with probability } 1/6. \end{cases} \]
Let E = (1/√k) AR and let f : R^d → R^k map the i-th row of A to the i-th row of E. Then
\[ (1 - \varepsilon)\,\|u - v\|^2 \le \|f(u) - f(v)\|^2 \le (1 + \varepsilon)\,\|u - v\|^2 \]
holds for all u, v ∈ P with probability at least 1 − n^{−β}.
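A sketch of the projection with ±1 entries from Theorem 1; since the distance guarantee is probabilistic, the check below only asserts a deliberately loose version of the distortion bound (the data and dimensions are invented for the demonstration):

```python
import numpy as np

def random_project(A, k, seed=0):
    """Map the rows of A from R^d to R^k via E = (1/sqrt(k)) * A @ R,
    where R has i.i.d. entries +1/-1 with probability 1/2 each."""
    rng = np.random.default_rng(seed)
    d = A.shape[1]
    R = rng.choice([-1.0, 1.0], size=(d, k))
    return A @ R / np.sqrt(k)

# n points in high dimension, projected down to k dimensions.
rng = np.random.default_rng(1)
A = rng.standard_normal((20, 1000))
E = random_project(A, k=400)

# Pairwise squared distances are preserved up to a modest relative error.
for i in range(len(A)):
    for j in range(i + 1, len(A)):
        orig = np.sum((A[i] - A[j]) ** 2)
        proj = np.sum((E[i] - E[j]) ** 2)
        assert 0.5 * orig <= proj <= 1.5 * orig
```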