sparse image representation via combined transforms - Convex ...
sparse image representation via combined transforms - Convex ... sparse image representation via combined transforms - Convex ...
70 CHAPTER 3. IMAGE TRANSFORMS AND IMAGE FEATURES where the constant c is a positive real number. It can be proven that a transform W c ,for c>0, is isometric in L 2 (R). We also know that the Fourier transform is isometric in L 2 (R) (Parseval). Suppose we have an orthonormal wavelet basis W = {w j,k : j, k ∈ Z}. We first apply a Fourier transform on the basis W , then apply an axis warping transform, then apply the inverse Fourier transform. Because each step is an isometric transform, the result must be another orthonormal basis in L 2 (R). Thus we get a new orthonormal basis, called a fan basis in [7]. Figure 3.7(d) gives an idealized tiling of wavelets on the time frequency plane. Figure 3.7(f) gives an idealized tiling of the fan basis. Cosine Packets Here we describe cosine packets. We review some landmarks in the historic development of time frequency localized orthonormal basis. Real-valued time frequency localized atoms not only have some intuitive optimality, but also have been the tool to circumvent the barrier brought by traditional Gabor analysis. The content of this subsubsection is more abundant than its title suggests. Collection of localized cosine and sine functions. Cosine packets are a collection of localized cosine and sine functions. A readily observable optimality of cosine and sine functions is that they are real-valued. Typically, an analyzed signal is a real-valued signal, so localized cosine and sine functions are closer to a signal than these complex-valued functions do. We also want a basis function to be localized. The localization should be in both time and frequency domains: in both time and frequency (Fourier) domain, a function must either have a finite support, or decays faster than any inverse of polynomials. Orthonormal basis for L 2 (R). In Gabor analysis, finding an orthonormal basis for L 2 (R) was a topic that has been intensively studied. The Balian-Low theorem says that if we choose a time-frequency atom following (3.36), then the atom cannot be simultaneously localized in both time domain and frequency domain—the atom must have infinite variance either in time or in frequency. To circumvent this barrier, Wilson in 1987 [34, page 120] proposed a basis function that has two peaks in frequency. In [34], Daubechies gives a construction of an orthonormal basis for L 2 (R). In her construction, basis functions have exponential decay in both time and frequency. From [34], the key idea to ideal time-frequency localization and orthonormality in the windowed Fourier framework is to use sine and cosine rather than complex exponentials. This is a not-so-obvious optimality of using sine and cosine functions in basis.
3.4. OTHER TRANSFORMS 71 Folding. Another way to obtain localized time-frequency basis, instead of using the constructive method in Daubechies [34], is to apply folding to an existing orthonormal basis for periodic functions on a fixed interval. The idea of folding is described in [141] and [142]. The folding operation is particularly important in discrete algorithms because if the basis function is from the folding of a known orthonormal basis function (e.g., Fourier basis function), since the coefficient of the transform associated with the basis is simply the inner product of the signal with the basis function, we can apply the adjoint of the folding operator to the signal, then calculate its inner product with the original basis function. If there is a fast algorithm to do the original transform (for example, for DFT, there is a FFT), then there is a fast algorithm to do the transform associated with the new time-frequency atoms. Overcomplete dictionary. For a fixed real constant, we can construct an orthonormal basis whose support of basis functions has length exactly equal to this constant. But we can choose different lengths for support. In the discrete case, for the simplicity of implementation, we usually choose the length of support equal to a power of 2. The collection of all the basis functions make an overcomplete dictionary. A consequential question is how to find a subset of this dictionary. The subset should construct an orthonormal basis, and, at the same time, be the best representation for a particular class of signal. Best orthogonal basis. The best orthogonal basis (BOB) algorithm is proposed to achieve the objective just mentioned. The key idea is to assign a probability distribution on a class of signals. Usually, the class of signals is a subspace of L 2 (R). Without loss of generality, we consider signals that are restricted on the interval [0, 1). Suppose we consider only functions whose support are dyadic intervals that have forms [2 −l (k − 1), 2 −l k), for l ∈ N,k =1, 2,... ,2 l . At each level l, for the basis functions whose support are intervals like [2 −l (k − 1), 2 −l k), the associated coefficients can be calculated. We can compute the entropy of these coefficients. For an interval [2 −l (k−1), 2 −l k), we can consider its immediate two subintervals: [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k). (Note [2 −l (k − 1), 2 −l k)=[2 −l−1 (2k−2), 2 −l−1 (2k−1))∪[2 −l−1 (2k−1), 2 −l−1 2k).) Note in the discrete case, a set of the coefficients associated with an interval [2 −l (k − 1), 2 −l k) and a set of coefficients associated with intervals [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k) have the same cardinality. In fact, there is a binary tree structure. Each dyadic interval corresponds to a node in the tree. Suppose a node in the tree corresponds to an interval I. Two subsequent nodes in the tree should correspond to the two subintervals of the interval I.
- Page 47 and 48: 2.2. SPARSITY AND COMPRESSION 19 lo
- Page 49 and 50: 2.3. DISCUSSION 21 tail compact. Th
- Page 51 and 52: 2.4. PROOF 23 The index l does not
- Page 53 and 54: Chapter 3 Image Transforms and Imag
- Page 55 and 56: 27 Some of the figures show the bas
- Page 57 and 58: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 59 and 60: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 61 and 62: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 63 and 64: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 65 and 66: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 67 and 68: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 69 and 70: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 71 and 72: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 73 and 74: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 75 and 76: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 77 and 78: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 79 and 80: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 81 and 82: 3.2. WAVELETS AND POINT SINGULARITI
- Page 83 and 84: 3.2. WAVELETS AND POINT SINGULARITI
- Page 85 and 86: 3.2. WAVELETS AND POINT SINGULARITI
- Page 87 and 88: 3.2. WAVELETS AND POINT SINGULARITI
- Page 89 and 90: 3.2. WAVELETS AND POINT SINGULARITI
- Page 91 and 92: 3.2. WAVELETS AND POINT SINGULARITI
- Page 93 and 94: 3.3. EDGELETS AND LINEAR SINGULARIT
- Page 95 and 96: 3.4. OTHER TRANSFORMS 67 uncertaint
- Page 97: 3.4. OTHER TRANSFORMS 69 Chirplets
- Page 101 and 102: 3.4. OTHER TRANSFORMS 73 We can app
- Page 103 and 104: 3.5. DISCUSSION 75 give only a few
- Page 105 and 106: 3.7. PROOFS 77 the ijth component o
- Page 107 and 108: 3.7. PROOFS 79 Similarly, we have [
- Page 109 and 110: Chapter 4 Combined Image Representa
- Page 111 and 112: 4.2. SPARSE DECOMPOSITION 83 interi
- Page 113 and 114: 4.3. MINIMUM l 1 NORM SOLUTION 85 l
- Page 115 and 116: 4.4. LAGRANGE MULTIPLIERS 87 ρ( x
- Page 117 and 118: 4.5. HOW TO CHOOSE ρ AND λ 89 3 (
- Page 119 and 120: 4.6. HOMOTOPY 91 A way to interpret
- Page 121 and 122: 4.7. NEWTON DIRECTION 93 4.7 Newton
- Page 123 and 124: 4.9. ITERATIVE METHODS 95 1. Avoidi
- Page 125 and 126: 4.11. DISCUSSION 97 ρ(β) =‖β
- Page 127 and 128: 4.12. PROOFS 99 4.12.2 Proof of The
- Page 129 and 130: 4.12. PROOFS 101 case of (4.16). Co
- Page 131 and 132: Chapter 5 Iterative Methods This ch
- Page 133 and 134: 5.1. OVERVIEW 105 the k-th iteratio
- Page 135 and 136: 5.1. OVERVIEW 107 5.1.4 Preconditio
- Page 137 and 138: 5.2. LSQR 109 among all the block d
- Page 139 and 140: 5.2. LSQR 111 5.2.3 Algorithm LSQR
- Page 141 and 142: 5.3. MINRES 113 2. For k =1, 2,...,
- Page 143 and 144: 5.3. MINRES 115 using the precondit
- Page 145 and 146: 5.4. DISCUSSION 117 From (I + S 1 )
- Page 147 and 148: Chapter 6 Simulations Section 6.1 d
3.4. OTHER TRANSFORMS 71<br />
Folding. Another way to obtain localized time-frequency basis, instead of using the<br />
constructive method in Daubechies [34], is to apply folding to an existing orthonormal<br />
basis for periodic functions on a fixed interval. The idea of folding is described in [141]<br />
and [142]. The folding operation is particularly important in discrete algorithms because if<br />
the basis function is from the folding of a known orthonormal basis function (e.g., Fourier<br />
basis function), since the coefficient of the transform associated with the basis is simply the<br />
inner product of the signal with the basis function, we can apply the adjoint of the folding<br />
operator to the signal, then calculate its inner product with the original basis function. If<br />
there is a fast algorithm to do the original transform (for example, for DFT, there is a FFT),<br />
then there is a fast algorithm to do the transform associated with the new time-frequency<br />
atoms.<br />
Overcomplete dictionary. For a fixed real constant, we can construct an orthonormal<br />
basis whose support of basis functions has length exactly equal to this constant. But we<br />
can choose different lengths for support. In the discrete case, for the simplicity of implementation,<br />
we usually choose the length of support equal to a power of 2. The collection of<br />
all the basis functions make an overcomplete dictionary. A consequential question is how<br />
to find a subset of this dictionary. The subset should construct an orthonormal basis, and,<br />
at the same time, be the best <strong>representation</strong> for a particular class of signal.<br />
Best orthogonal basis. The best orthogonal basis (BOB) algorithm is proposed to achieve<br />
the objective just mentioned. The key idea is to assign a probability distribution on a<br />
class of signals. Usually, the class of signals is a subspace of L 2 (R). Without loss of<br />
generality, we consider signals that are restricted on the interval [0, 1). Suppose we consider<br />
only functions whose support are dyadic intervals that have forms [2 −l (k − 1), 2 −l k), for<br />
l ∈ N,k =1, 2,... ,2 l . At each level l, for the basis functions whose support are intervals<br />
like [2 −l (k − 1), 2 −l k), the associated coefficients can be calculated. We can compute the<br />
entropy of these coefficients. For an interval [2 −l (k−1), 2 −l k), we can consider its immediate<br />
two subintervals: [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k). (Note [2 −l (k −<br />
1), 2 −l k)=[2 −l−1 (2k−2), 2 −l−1 (2k−1))∪[2 −l−1 (2k−1), 2 −l−1 2k).) Note in the discrete case,<br />
a set of the coefficients associated with an interval [2 −l (k − 1), 2 −l k) and a set of coefficients<br />
associated with intervals [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k) have the<br />
same cardinality. In fact, there is a binary tree structure. Each dyadic interval corresponds<br />
to a node in the tree. Suppose a node in the tree corresponds to an interval I. Two<br />
subsequent nodes in the tree should correspond to the two subintervals of the interval I.