Rank-Awareness in Compressed Sensing
Caleb Leedy and Yimin Wu
MAP: Rank Aware Hard Thresholding for Sparse Approximation
Dr. Blanchard
February 9, 2016
What is Compressed Sensing?
The goal is to recover a signal from as few measurements as possible.
We represent this signal-processing problem with matrices:
Original Signal: X
Measurement Matrix: A
Observation Matrix: Y
AX = Y
Signal Example
Suppose we have the following signal:
A = [1 0; 0 2], X = [x1; x2], Y = [1; 4], with AX = Y.
In practice we can only observe Y and A. However, A is square and full rank, hence invertible, so we can recover X:
A⁻¹Y = [1 0; 0 1/2][1; 4] = [1; 2] = X.
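A minimal numpy sketch of this toy recovery (our own illustration; the values match the example above):

```python
import numpy as np

A = np.array([[1.0, 0.0],
              [0.0, 2.0]])   # measurement matrix: square and full rank
Y = np.array([1.0, 4.0])     # observed measurements

X = np.linalg.solve(A, Y)    # recover the signal by inverting A
print(X)                     # [1. 2.]
```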
Understanding the Matrices
Dimensions of the matrices:
X ∈ R^(n×r)
A ∈ R^(m×n)
Y ∈ R^(m×r)
In the previous example:
X ∈ R^(2×1) = [1; 2]
A ∈ R^(2×2) = [1 0; 0 2]
Y ∈ R^(2×1) = [1; 4]
Understanding the Measurement Matrix
If we think of how matrix multiplication works, each row of the measurement matrix is one measurement:
[1 0; 0 2][1; 2] = [1(1) + 0(2); 0(1) + 2(2)] = [1; 4]
Compressed Sensing
The real problem in signal processing: measurement.
Is there a better way to recover the original data with fewer measurements?
What is the smallest number of measurements we need to recover the original signal?
Compressed Sensing
We can still recover matrices, but NOT with traditional linear algebra techniques:
A becomes a nonsquare matrix, hence not invertible.
We get an underdetermined system of equations with m < n:
a1,1 x1 + a1,2 x2 + · · · + a1,n xn = c1
a2,1 x1 + a2,2 x2 + · · · + a2,n xn = c2
⋮
am,1 x1 + am,2 x2 + · · · + am,n xn = cm
Pseudoinverse
The pseudoinverse A† is defined as A† = (AᵀA)⁻¹Aᵀ, where Aᵀ is the transpose of A.
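A short numpy check of the pseudoinverse formula (our own illustration; note that (AᵀA)⁻¹ exists only when A has full column rank, which is the situation in the decoding step below, where the formula is applied to a tall column submatrix A_T):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((10, 3))         # tall matrix with full column rank

A_dag = np.linalg.inv(A.T @ A) @ A.T     # A-dagger = (A^T A)^-1 A^T
print(np.allclose(A_dag, np.linalg.pinv(A)))   # True: matches numpy's pseudoinverse
```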
Sparsity Assumption
Definition: Jointly k-sparse Matrix
A matrix A is jointly k-sparse if there exist no more than k rows in A that have nonzero entries. For example:
[ 1  3  0 ]
[ 2 −1  2 ]
[ 0  0  0 ]
[ 0  0  0 ]
[ 5  2 −6 ]
A jointly 3-sparse matrix
[ 0  0  0 ]
[ 2 −4 −2 ]
[ 0  9  2 ]
[ 4  1  1 ]
A jointly 3-sparse matrix
Many matrices are well approximated by k-sparse matrices after a change of basis.
We assume k ≥ r, which means the rank of X is determined by its column space.
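A small helper (our own, with a hypothetical name) that checks joint k-sparsity by counting nonzero rows:

```python
import numpy as np

def is_jointly_k_sparse(X, k):
    """True if X has at most k rows containing any nonzero entry."""
    nonzero_rows = np.count_nonzero(np.any(X != 0, axis=1))
    return nonzero_rows <= k

X = np.array([[1, 3, 0],
              [2, -1, 2],
              [0, 0, 0],
              [0, 0, 0],
              [5, 2, -6]])
print(is_jointly_k_sparse(X, 3))   # True: only rows 0, 1, and 4 are nonzero
```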
Compressed Sensing
Theorem: Donoho 2004, Candès and Tao 2004
Using randomness in the matrix A, we can recover the k-sparse signal X from only a fraction of the measurements.
Decoding Algorithms
Algorithm 1 Thresholding
Input: A, Y, k
Output: A k-sparse approximation X̂ of the target signal X
1: X = A*Y (rough approximation of the inverse)
2: T = PrincipalSupport_k(X) (estimate of the support set)
3: X̂ = A†_T Y (projection onto the subspace defined by T)
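A minimal numpy sketch of Algorithm 1 (our own reading; in particular, scoring rows of A*Y by their ℓ2 norm as the principal-support rule is an assumption):

```python
import numpy as np

def thresholding(A, Y, k):
    """Sketch of Algorithm 1: jointly k-sparse approximation of X from Y = AX."""
    Z = A.T @ Y                                       # step 1: rough inverse A*Y
    T = np.argsort(np.linalg.norm(Z, axis=1))[-k:]    # step 2: k rows with largest l2 norm
    X_hat = np.zeros((A.shape[1], Y.shape[1]))
    X_hat[T] = np.linalg.pinv(A[:, T]) @ Y            # step 3: project Y onto the columns in T
    return X_hat
```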
Thresholding (figure slides)
Thresholding Coherence
Theorem: Thresholding
Let A ∈ R^(m×n) be a matrix with entries drawn i.i.d. from N(0, m⁻¹) and let X ∈ R^(n×r) be a jointly k-sparse matrix. Let Y ∈ R^(m×r) be a matrix such that Y = AX. If µ is the coherence of A, then the Thresholding Algorithm can recover the jointly k-sparse matrix X from the matrices Y and A as long as
k < (1/2) (µ⁻¹ √(2ν∞ + 1) − 1),
where ν∞ is defined as
ν∞ = min_{i∈S} ‖x^(i)‖₂² / Σ_{ℓ=1}^{r} ‖x_ℓ‖∞².
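For reference, the coherence µ in the theorem is the largest absolute inner product between distinct normalized columns of A; a small sketch (our own) of computing it:

```python
import numpy as np

def coherence(A):
    """mu(A) = max over i != j of |<a_i, a_j>| / (||a_i|| ||a_j||)."""
    G = A / np.linalg.norm(A, axis=0)   # normalize the columns
    gram = np.abs(G.T @ G)
    np.fill_diagonal(gram, 0.0)         # ignore the diagonal (i = j)
    return gram.max()

rng = np.random.default_rng(0)
A = rng.normal(scale=1 / np.sqrt(200), size=(200, 1000))  # entries ~ N(0, m^-1), m = 200
print(coherence(A))                     # empirical coherence of a random Gaussian matrix
```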
Thresholding Coherence
Corollary
The Thresholding Algorithm recovers a jointly k-sparse matrix X ∈ R^(n×r) as long as
m > 4k² / (2ν∞ + 1).
Rank Aware Thresholding
Algorithm 2 Rank Aware Thresholding
Input: A, Y, k
Output: A k-sparse approximation X̂ of the target signal X
1: U = orth(Y) (U is an orthonormal basis for the column space of Y)
2: X = A*U (rough approximation of the inverse)
3: T = PrincipalSupport_k(X) (estimate of the support set)
4: X̂ = A†_T Y (projection onto the subspace defined by T)
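The same sketch with the rank-aware orthogonalization step added; scipy's orth is one convenient way to compute an orthonormal column basis (an assumption, any such basis would do):

```python
import numpy as np
from scipy.linalg import orth

def rank_aware_thresholding(A, Y, k):
    """Sketch of Algorithm 2: orthogonalize Y before estimating the support."""
    U = orth(Y)                                       # step 1: orthonormal basis for col(Y)
    Z = A.T @ U                                       # step 2: rough inverse A*U
    T = np.argsort(np.linalg.norm(Z, axis=1))[-k:]    # step 3: k rows with largest l2 norm
    X_hat = np.zeros((A.shape[1], Y.shape[1]))
    X_hat[T] = np.linalg.pinv(A[:, T]) @ Y            # step 4: project Y onto the columns in T
    return X_hat
```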
Rank Aware Thresholding
Theorem (BLW): Rank Aware Thresholding
Let A ∈ R^(m×n) be a matrix with entries drawn i.i.d. from N(0, m⁻¹) and let X ∈ R^(n×r) be a jointly k-sparse matrix. Let Y ∈ R^(m×r) be a matrix such that Y = AX. Define U ∈ R^(m×r) to be a matrix with orthonormal columns such that UΣVᵀ is the singular value decomposition of Y. Then the Rank Aware Thresholding Algorithm can recover the matrix X from the matrices Y and A with probability 1 − δ as long as
δ ≥ (n − k) e^(−C(mν₂r/k − 4r)),
where
ν₂ = min_{i∈S} ‖(A*U)^(i)‖₂² / max_{i∈S} ‖(A*U)^(i)‖₂²
and (A*U)^(i) denotes the i-th row of A*U.
Rank Aware Thresholding
Corollary
The Rank Aware Thresholding Algorithm correctly selects the support of a jointly k-sparse matrix X ∈ R^(n×r) if
m ≥ (Ck / ν₂) (1 + (1/r) ln(n/δ)).
Importance of the Corollary:
m ∼ Ck
Eliminates the “square-root bottleneck”
Tells us how many measurements we need to recover a jointly k-sparse signal
Rank Aware Thresholding is actually rank aware
For comparison, the Thresholding Corollary: the Thresholding Algorithm recovers a jointly k-sparse matrix X ∈ R^(n×r) as long as m > 4k² / (2ν∞ + 1).
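A rough worked comparison (our own, treating C, ν∞, and ν₂ as order-one constants): with k = 100 nonzero rows, the Thresholding bound m > 4k²/(2ν∞ + 1) asks for on the order of 4·100² = 40,000 measurements, while the rank-aware bound m ≥ (Ck/ν₂)(1 + (1/r) ln(n/δ)) asks for on the order of a few hundred, i.e. linear rather than quadratic growth in k.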
Figures:
Thresholding Algorithm (rank = 5)
Thresholding Algorithm (rank = 20)
Thresholding Algorithm (rank = 32)
Rank Aware Thresholding Algorithm (rank = 5)
Rank Aware Thresholding Algorithm (rank = 20)
Rank Aware Thresholding Algorithm (rank = 32)
Modeling Process
Predict the behavior of decoding algorithms for problems with large dimensions.
New model:
Typical tests are problems on the scale of 2⁷ or 2⁸.
We want to know the behavior of problems of size at least 2¹².
It is hard to run many tests of this size.
Model the region of the 50% Success Curve.
Relate the 50% Success Curve to the Fraction of Correct Support.
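A minimal Monte Carlo sketch of how one point of an empirical success curve could be estimated (entirely our own illustration; the trial count, tolerance, and the sweep over k are assumptions, not the authors' modeling procedure):

```python
import numpy as np

def success_rate(decoder, m, n, k, r, trials=50, tol=1e-4, seed=0):
    """Fraction of random trials in which `decoder` recovers a jointly k-sparse X."""
    rng = np.random.default_rng(seed)
    hits = 0
    for _ in range(trials):
        A = rng.normal(scale=1 / np.sqrt(m), size=(m, n))        # entries ~ N(0, m^-1)
        X = np.zeros((n, r))
        X[rng.choice(n, size=k, replace=False)] = rng.standard_normal((k, r))
        X_hat = decoder(A, A @ X, k)
        hits += np.linalg.norm(X_hat - X) <= tol * np.linalg.norm(X)
    return hits / trials

# Sweeping k for fixed (m, n, r) and taking the largest k with success_rate >= 0.5
# gives one point on an empirical 50% Success Curve (using, for example, the
# thresholding sketch shown earlier as the decoder).
```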
Figures:
50% Success and Fraction of Correct Support
50% Success and Number of Missed Entries
Modeling Thresholding
Modeling Rank Aware Thresholding
Iterative Algorithms
Examples of iterative algorithms in compressed sensing:
Orthogonal Matching Pursuit (OMP)
Compressive Sampling Matching Pursuit (CoSaMP)
Iterative Hard Thresholding (IHT)
Normalized Iterative Hard Thresholding (NIHT)
Conjugate Gradient Iterative Hard Thresholding (CGIHT)
OMP
Algorithm 3 Orthogonal Matching Pursuit (OMP)
Input: A, y, k
Output: A k-sparse approximation x̂ of the target signal x
Initialization: Set x0 = 0, r0 = y, T0 = {}.
Iteration:
1. for j = 1, 2, . . . , k
2.   i = argmax |A*r_{j−1}| (identify the column of A most correlated with the residual)
3.   T_j = T_{j−1} ∪ {i} (add the new column index to the index set)
4.   x_j = A†_{T_j} y (project the measurements onto the subspace defined by T_j)
5.   r_j = y − A x_j (update the residual)
6. end for
7. return x̂ = x_k (the k-sparse vector x_k)
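A compact numpy sketch of Algorithm 3 for a single measurement vector (our own illustration):

```python
import numpy as np

def omp(A, y, k):
    """Sketch of Algorithm 3 (OMP) for a single measurement vector y = A x."""
    r, T = y.copy(), []
    for _ in range(k):
        i = int(np.argmax(np.abs(A.T @ r)))      # column most correlated with the residual
        T.append(i)
        x_T = np.linalg.pinv(A[:, T]) @ y        # least-squares fit on the chosen columns
        r = y - A[:, T] @ x_T                    # update the residual
    x_hat = np.zeros(A.shape[1])
    x_hat[T] = x_T
    return x_hat
```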
CoSaMP
Algorithm 4 Compressive Sampling Matching Pursuit (CoSaMP)
Input: A, y, k
Output: A k-sparse approximation x̂ of the target signal x
Iteration:
1. T_n = {indices of the 2k largest-in-modulus entries of A*(y − A x_n)}
2. U_n = T_n ∪ S_n, where S_n = supp(x_n)
3. u_n = argmin {‖y − Az‖₂ : supp(z) ⊆ U_n}
4. x_{n+1} = H_k(u_n), where H_k keeps the k largest entries of u_n
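A compact numpy sketch of Algorithm 4 (our own; the fixed iteration count is an assumption, since the slide does not state a stopping rule):

```python
import numpy as np

def cosamp(A, y, k, iters=20):
    """Sketch of Algorithm 4 (CoSaMP) for a single measurement vector y = A x."""
    n = A.shape[1]
    x = np.zeros(n)
    for _ in range(iters):
        proxy = A.T @ (y - A @ x)
        T = np.argsort(np.abs(proxy))[-2 * k:]     # 2k largest-in-modulus entries
        U = np.union1d(T, np.flatnonzero(x))       # merge with the current support
        u = np.zeros(n)
        u[U] = np.linalg.pinv(A[:, U]) @ y         # least squares on the merged support
        x = np.zeros(n)
        keep = np.argsort(np.abs(u))[-k:]          # hard-threshold to the k largest entries
        x[keep] = u[keep]
    return x
```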
Conjecture
1. We know Thresholding is Rank Aware.
2. We know OMP and CoSaMP are Rank Aware.
3. We know OMP and CoSaMP contain Thresholding at every iteration.
Conjecture
The rank awareness of OMP and CoSaMP comes from the Thresholding step in their implementation.
Modeling Iterative Algorithms
We want to model the iterative algorithms the same way we modeled Thresholding.
If this approach works for OMP and CoSaMP, then we have evidence supporting our conjecture.
Figures:
Modeling OMP
Modeling CoSaMP