diagonal matrix and its transpose as follows:

\[
\tilde{A} \;=\;
\begin{pmatrix}
T_1 & & & \\
& T_2 & & \\
& & \ddots & \\
& & & T_m
\end{pmatrix}
\frac{1}{2}A
\begin{pmatrix}
T_1^T & & & \\
& T_2^T & & \\
& & \ddots & \\
& & & T_m^T
\end{pmatrix}
\;=\;
\begin{pmatrix}
I + S_1 & I & \cdots & I \\
I & I + S_2 & & \vdots \\
\vdots & & \ddots & I \\
I & \cdots & I & I + S_m
\end{pmatrix},
\]

where $S_i = T_i D_i T_i^T$, $i = 1, 2, \ldots, m$. In the remainder of this section, we consider what a good preconditioner for $\tilde{A}$ should be.

We considered three possible preconditioners:

1. a preconditioner from complete Cholesky factorization,
2. a preconditioner from sparse incomplete factorization of the inverse [9],
3. a block diagonal preconditioner [4, 5, 31].

Our main finding is that preconditioners 1 and 2 are not "optimal". Here "optimal" means that matrix-vector multiplication with the matrix associated with the preconditioner $L^{-H}$ should still have fast algorithms, and these fast algorithms should be based on the fast algorithms for matrix-vector multiplication with the matrices $T_1, T_2, \ldots, T_m$. So the block diagonal preconditioner is the only one we are going to use. We describe the results about the block diagonal preconditioner here, and postpone the discussion of the preconditioners in cases 1 and 2 to Section 5.4.

The optimal block diagonal preconditioner for $\tilde{A}$ is

\[
\begin{pmatrix}
(I + S_1)^{-1/2} & & & \\
& (I + S_2)^{-1/2} & & \\
& & \ddots & \\
& & & (I + S_m)^{-1/2}
\end{pmatrix}.
\tag{5.7}
\]

A striking result is due to Demmel [78, Page 168, Theorem 10.5.3]. The key idea is that among all the block diagonal preconditioners, the one that takes the Cholesky factor of each diagonal submatrix as its corresponding diagonal submatrix is nearly optimal. Here "nearly" means that the resulting condition number is no more than $m$ times the best condition number achievable by any block diagonal preconditioner. Obviously, we have fast algorithms to multiply with the matrices $(I + S_i)^{-1/2}$, $i = 1, 2, \ldots, m$.
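In symbols, and as our paraphrase rather than a quotation of the cited theorem, the near-optimality guarantee reads: with $\kappa(\cdot)$ the spectral condition number, $M = \operatorname{diag}(I + S_1, \ldots, I + S_m)$ the block diagonal part of $\tilde{A}$, and $\mathcal{D}$ the set of symmetric positive definite matrices with the same block diagonal structure,

\[
\kappa\bigl(M^{-1/2}\,\tilde{A}\,M^{-1/2}\bigr)
\;\le\;
m \cdot \min_{B \in \mathcal{D}} \kappa\bigl(B^{-1/2}\,\tilde{A}\,B^{-1/2}\bigr).
\]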
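The "fast algorithms" claim can be made concrete. Assuming each $T_i$ is orthogonal ($T_i T_i^T = I$), which is what makes the off-diagonal blocks of $\tilde{A}$ above equal to $I$, we have $I + S_i = T_i (I + D_i) T_i^T$ and hence $(I + S_i)^{-1/2} = T_i (I + D_i)^{-1/2} T_i^T$: applying the preconditioner costs one analysis transform, one diagonal scaling, and one synthesis transform per block. The sketch below illustrates this under that assumption; the names `analysis`, `synthesis`, and `d` are ours, not the thesis's.

```python
import numpy as np

def apply_block_preconditioner(blocks, v_blocks):
    """Apply diag((I + S_1)^(-1/2), ..., (I + S_m)^(-1/2)) to a block vector.

    blocks:   list of (analysis, synthesis, d) triples, where
              analysis(v)  computes T_i^T v  (fast forward transform),
              synthesis(w) computes T_i w    (fast inverse transform),
              and d is the diagonal of D_i (so S_i = T_i D_i T_i^T).
    v_blocks: the m subvectors of the vector being preconditioned.
    """
    out = []
    for (analysis, synthesis, d), v in zip(blocks, v_blocks):
        w = analysis(v)              # T_i^T v
        w = w / np.sqrt(1.0 + d)     # multiply by (I + D_i)^(-1/2), elementwise
        out.append(synthesis(w))     # T_i (...)
    return out
```

Each block touches its subvector through two fast transforms and one elementwise scaling, so the preconditioner inherits the cost of the underlying fast transforms.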
5.2 LSQR

5.2.1 What is LSQR?

LSQR [117, 116] solves the following two least-squares (LS) problems, depending on whether the damping parameter $\alpha$ is zero or not.

[N] When the damping parameter is equal to zero ($\alpha = 0$), solve $Ax = b$ or $\operatorname*{minimize}_x \|b - Ax\|_2$.

[R] When the damping parameter is not equal to zero ($\alpha \neq 0$), solve

\[
\operatorname*{minimize}_{x}\;
\left\|
\begin{pmatrix} b \\ 0 \end{pmatrix}
-
\begin{pmatrix} A \\ \alpha I \end{pmatrix} x
\right\|_2^2 .
\]

Here [N] is a nonregularized problem and [R] is a regularized problem.

The problem in (5.1) can be rewritten as follows:

\[
\left[\, 2\Phi^T\Phi + \lambda
\begin{pmatrix}
\bar{\rho}''(x_1^k) & & \\
& \ddots & \\
& & \bar{\rho}''(x_N^k)
\end{pmatrix}
\right] d
\;=\;
2\Phi^T(y - \Phi x^k) - \lambda
\begin{pmatrix}
\bar{\rho}'(x_1^k) \\
\vdots \\
\bar{\rho}'(x_N^k)
\end{pmatrix},
\]

where $d$ is the Newton direction that we want to solve for and the remaining variables are defined in the previous chapter. The above equation is equivalent to solving an LS problem:

\[
(\mathrm{LS}):\qquad
\operatorname*{minimize}_{d}\;
\left\|
\begin{bmatrix} \Phi \\ D(x^k) \end{bmatrix} d
-
\begin{bmatrix} y - \Phi x^k \\ -D^{-1}(x^k)\, g(x^k) \end{bmatrix}
\right\|_2 .
\]
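For concreteness, SciPy's implementation of LSQR (`scipy.sparse.linalg.lsqr`) exposes the damping parameter directly: `damp=0` corresponds to [N] and `damp=alpha > 0` to [R]. The sketch below solves the (LS) problem above by wrapping the stacked matrix $[\Phi;\, D(x^k)]$ as a `LinearOperator` built from fast transforms; the names `phi`, `phi_t`, `d_diag`, and `g` are illustrative stand-ins for $\Phi$, $\Phi^T$, the diagonal of $D(x^k)$, and $g(x^k)$, not code from the thesis.

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator, lsqr

# [R] for a plain matrix/operator A: lsqr(A, b, damp=alpha) minimizes
# ||b - A x||_2^2 + alpha^2 ||x||_2^2, i.e., the damped problem above.

def newton_direction(phi, phi_t, y, x_k, d_diag, g):
    """Solve (LS): minimize_d || [Phi; D] d - [y - Phi x_k; -D^{-1} g] ||_2,
    given fast operators phi(v) = Phi v and phi_t(r) = Phi^T r, the diagonal
    d_diag of D(x^k), and the vector g = g(x^k)."""
    n, N = len(y), len(x_k)

    def matvec(dvec):                # [Phi; D] d, stacked as one long vector
        return np.concatenate([phi(dvec), d_diag * dvec])

    def rmatvec(r):                  # [Phi; D]^T r = Phi^T r_top + D r_bottom
        return phi_t(r[:n]) + d_diag * r[n:]

    A = LinearOperator((n + N, N), matvec=matvec, rmatvec=rmatvec, dtype=float)
    rhs = np.concatenate([y - phi(x_k), -g / d_diag])
    return lsqr(A, rhs)[0]           # lsqr returns a tuple; [0] is the solution
```

Because both $\Phi$ and $\Phi^T$ enter only through matrix-vector products, this keeps LSQR's per-iteration cost at one forward and one adjoint application of the combined transforms.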