sparse image representation via combined transforms - Convex ...

sparse image representation via combined transforms - Convex ... sparse image representation via combined transforms - Convex ...

convexoptimization.com
from convexoptimization.com More from this publisher
10.03.2015 Views

70 CHAPTER 3. IMAGE TRANSFORMS AND IMAGE FEATURES where the constant c is a positive real number. It can be proven that a transform W c ,for c>0, is isometric in L 2 (R). We also know that the Fourier transform is isometric in L 2 (R) (Parseval). Suppose we have an orthonormal wavelet basis W = {w j,k : j, k ∈ Z}. We first apply a Fourier transform on the basis W , then apply an axis warping transform, then apply the inverse Fourier transform. Because each step is an isometric transform, the result must be another orthonormal basis in L 2 (R). Thus we get a new orthonormal basis, called a fan basis in [7]. Figure 3.7(d) gives an idealized tiling of wavelets on the time frequency plane. Figure 3.7(f) gives an idealized tiling of the fan basis. Cosine Packets Here we describe cosine packets. We review some landmarks in the historic development of time frequency localized orthonormal basis. Real-valued time frequency localized atoms not only have some intuitive optimality, but also have been the tool to circumvent the barrier brought by traditional Gabor analysis. The content of this subsubsection is more abundant than its title suggests. Collection of localized cosine and sine functions. Cosine packets are a collection of localized cosine and sine functions. A readily observable optimality of cosine and sine functions is that they are real-valued. Typically, an analyzed signal is a real-valued signal, so localized cosine and sine functions are closer to a signal than these complex-valued functions do. We also want a basis function to be localized. The localization should be in both time and frequency domains: in both time and frequency (Fourier) domain, a function must either have a finite support, or decays faster than any inverse of polynomials. Orthonormal basis for L 2 (R). In Gabor analysis, finding an orthonormal basis for L 2 (R) was a topic that has been intensively studied. The Balian-Low theorem says that if we choose a time-frequency atom following (3.36), then the atom cannot be simultaneously localized in both time domain and frequency domain—the atom must have infinite variance either in time or in frequency. To circumvent this barrier, Wilson in 1987 [34, page 120] proposed a basis function that has two peaks in frequency. In [34], Daubechies gives a construction of an orthonormal basis for L 2 (R). In her construction, basis functions have exponential decay in both time and frequency. From [34], the key idea to ideal time-frequency localization and orthonormality in the windowed Fourier framework is to use sine and cosine rather than complex exponentials. This is a not-so-obvious optimality of using sine and cosine functions in basis.

3.4. OTHER TRANSFORMS 71 Folding. Another way to obtain localized time-frequency basis, instead of using the constructive method in Daubechies [34], is to apply folding to an existing orthonormal basis for periodic functions on a fixed interval. The idea of folding is described in [141] and [142]. The folding operation is particularly important in discrete algorithms because if the basis function is from the folding of a known orthonormal basis function (e.g., Fourier basis function), since the coefficient of the transform associated with the basis is simply the inner product of the signal with the basis function, we can apply the adjoint of the folding operator to the signal, then calculate its inner product with the original basis function. If there is a fast algorithm to do the original transform (for example, for DFT, there is a FFT), then there is a fast algorithm to do the transform associated with the new time-frequency atoms. Overcomplete dictionary. For a fixed real constant, we can construct an orthonormal basis whose support of basis functions has length exactly equal to this constant. But we can choose different lengths for support. In the discrete case, for the simplicity of implementation, we usually choose the length of support equal to a power of 2. The collection of all the basis functions make an overcomplete dictionary. A consequential question is how to find a subset of this dictionary. The subset should construct an orthonormal basis, and, at the same time, be the best representation for a particular class of signal. Best orthogonal basis. The best orthogonal basis (BOB) algorithm is proposed to achieve the objective just mentioned. The key idea is to assign a probability distribution on a class of signals. Usually, the class of signals is a subspace of L 2 (R). Without loss of generality, we consider signals that are restricted on the interval [0, 1). Suppose we consider only functions whose support are dyadic intervals that have forms [2 −l (k − 1), 2 −l k), for l ∈ N,k =1, 2,... ,2 l . At each level l, for the basis functions whose support are intervals like [2 −l (k − 1), 2 −l k), the associated coefficients can be calculated. We can compute the entropy of these coefficients. For an interval [2 −l (k−1), 2 −l k), we can consider its immediate two subintervals: [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k). (Note [2 −l (k − 1), 2 −l k)=[2 −l−1 (2k−2), 2 −l−1 (2k−1))∪[2 −l−1 (2k−1), 2 −l−1 2k).) Note in the discrete case, a set of the coefficients associated with an interval [2 −l (k − 1), 2 −l k) and a set of coefficients associated with intervals [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k) have the same cardinality. In fact, there is a binary tree structure. Each dyadic interval corresponds to a node in the tree. Suppose a node in the tree corresponds to an interval I. Two subsequent nodes in the tree should correspond to the two subintervals of the interval I.

3.4. OTHER TRANSFORMS 71<br />

Folding. Another way to obtain localized time-frequency basis, instead of using the<br />

constructive method in Daubechies [34], is to apply folding to an existing orthonormal<br />

basis for periodic functions on a fixed interval. The idea of folding is described in [141]<br />

and [142]. The folding operation is particularly important in discrete algorithms because if<br />

the basis function is from the folding of a known orthonormal basis function (e.g., Fourier<br />

basis function), since the coefficient of the transform associated with the basis is simply the<br />

inner product of the signal with the basis function, we can apply the adjoint of the folding<br />

operator to the signal, then calculate its inner product with the original basis function. If<br />

there is a fast algorithm to do the original transform (for example, for DFT, there is a FFT),<br />

then there is a fast algorithm to do the transform associated with the new time-frequency<br />

atoms.<br />

Overcomplete dictionary. For a fixed real constant, we can construct an orthonormal<br />

basis whose support of basis functions has length exactly equal to this constant. But we<br />

can choose different lengths for support. In the discrete case, for the simplicity of implementation,<br />

we usually choose the length of support equal to a power of 2. The collection of<br />

all the basis functions make an overcomplete dictionary. A consequential question is how<br />

to find a subset of this dictionary. The subset should construct an orthonormal basis, and,<br />

at the same time, be the best <strong>representation</strong> for a particular class of signal.<br />

Best orthogonal basis. The best orthogonal basis (BOB) algorithm is proposed to achieve<br />

the objective just mentioned. The key idea is to assign a probability distribution on a<br />

class of signals. Usually, the class of signals is a subspace of L 2 (R). Without loss of<br />

generality, we consider signals that are restricted on the interval [0, 1). Suppose we consider<br />

only functions whose support are dyadic intervals that have forms [2 −l (k − 1), 2 −l k), for<br />

l ∈ N,k =1, 2,... ,2 l . At each level l, for the basis functions whose support are intervals<br />

like [2 −l (k − 1), 2 −l k), the associated coefficients can be calculated. We can compute the<br />

entropy of these coefficients. For an interval [2 −l (k−1), 2 −l k), we can consider its immediate<br />

two subintervals: [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k). (Note [2 −l (k −<br />

1), 2 −l k)=[2 −l−1 (2k−2), 2 −l−1 (2k−1))∪[2 −l−1 (2k−1), 2 −l−1 2k).) Note in the discrete case,<br />

a set of the coefficients associated with an interval [2 −l (k − 1), 2 −l k) and a set of coefficients<br />

associated with intervals [2 −l−1 (2k −2), 2 −l−1 (2k −1)) and [2 −l−1 (2k −1), 2 −l−1 2k) have the<br />

same cardinality. In fact, there is a binary tree structure. Each dyadic interval corresponds<br />

to a node in the tree. Suppose a node in the tree corresponds to an interval I. Two<br />

subsequent nodes in the tree should correspond to the two subintervals of the interval I.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!