sparse image representation via combined transforms - Convex ...
sparse image representation via combined transforms - Convex ... sparse image representation via combined transforms - Convex ...
2 CHAPTER 1. INTRODUCTION 1.2 Outline This thesis is organized as follows: • Chapter 2 explains why sparse decomposition may lead to a more efficient image coding and compression. It contains two parts: the first part gives a brief description of the mathematical framework of a modern communication system; the second part presents some quantitative results to explain (mainly in the asymptotic sense) why sparsity in coefficients may lead to efficiency in image coding. • Chapter 3 is a survey of existing transforms for images. It serves as a background for this project. • Chapter 4 explains our approach in finding a sparse decomposition in an overcomplete dictionary. We minimize the objective function that is a sum of the residual sum of squares and a penalty on coefficients. We apply Newton’s method to solve this minimization problem. To find the Newton’s direction, we use an iterative method— LSQR—to solve a system of linear equations, which happens to be equivalent to a least squares problem. (LSQR solves a least square problem.) With carefully chosen parameters, the solution here is close to the solution of an exact Basis Pursuit, which is the minimum l 1 norm solution with exact equality constraints. • Chapter 5 is a survey of iterative methods and explains why we choose LSQR. Some alternative approaches are discussed. • Chapter 6 presents some numerical simulations. They show that a combined approach does provide a sparser decomposition than the existing approach that uses only one transform. • Chapter 7 discusses some thoughts on future research. • Appendix A documents the details about how to implement the exact edgelet transform in a direct way. This algorithm has high complexity. Some examples are given. • Appendix B documents the details about how to implement an approximate edgelet transform in a fast way. Some examples are given. This transform is the one we used in simulations.
Chapter 2 Sparsity in Image Coding This chapter explains the connection between sparse image decomposition and efficient image coding. The technique we have developed promises to improve the efficiency of transform coding, which is ubiquitous in the world of digital signal and image processing. We start with an overview of image coding and emphasize the importance of transform coding. Then we review Donoho’s work that answers in a deterministic way the question: why does sparsity lead to good compression? Finally we review some important principles in image coding. 2.1 Image Coding This section consists of three parts. The first begins with an overview of Shannon’s mathematical formulation of communication systems, and then describes the difference between source coding and channel coding. The second part describes a way of measuring the sparsity of a sequence and the connection between sparsity and coding, concluding with an asymptotic result. Finally, we discuss some of the requirements in designing a realistic transform coding scheme. Since all the work described here can easily be found in the literature, we focus on developing a historic perspective and overview. 2.1.1 Shannon’s Mathematical Communication System Nowadays, it is almost impossible to review the field of theoretical signal processing without mentioning the contributions of Shannon. In his 1948 paper [127], Shannon writes, “The 3
- Page 1 and 2: SPARSE IMAGE REPRESENTATION VIA COM
- Page 3: I certify that I have read this dis
- Page 7 and 8: To find a sparse image representati
- Page 9: Abstract We consider sparse image d
- Page 12 and 13: xii
- Page 14 and 15: 2.3 Discussion.....................
- Page 16 and 17: 6 Simulations 119 6.1 Dictionary...
- Page 18 and 19: xviii
- Page 21 and 22: List of Figures 2.1 Shannon’s sch
- Page 23 and 24: A.3 Edgelet transform of the wood g
- Page 25 and 26: Nomenclature Special sets N .......
- Page 27 and 28: List of Abbreviations BCR .........
- Page 29: Chapter 1 Introduction 1.1 Overview
- Page 33 and 34: 2.1. IMAGE CODING 5 INFORMATION SOU
- Page 35 and 36: 2.1. IMAGE CODING 7 2.1.2 Source an
- Page 37 and 38: 2.1. IMAGE CODING 9 x ✲ T y ERROR
- Page 39 and 40: 2.1. IMAGE CODING 11 where Q stands
- Page 41 and 42: 2.2. SPARSITY AND COMPRESSION 13 Pr
- Page 43 and 44: 2.2. SPARSITY AND COMPRESSION 15 av
- Page 45 and 46: 2.2. SPARSITY AND COMPRESSION 17 wi
- Page 47 and 48: 2.2. SPARSITY AND COMPRESSION 19 lo
- Page 49 and 50: 2.3. DISCUSSION 21 tail compact. Th
- Page 51 and 52: 2.4. PROOF 23 The index l does not
- Page 53 and 54: Chapter 3 Image Transforms and Imag
- Page 55 and 56: 27 Some of the figures show the bas
- Page 57 and 58: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 59 and 60: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 61 and 62: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 63 and 64: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 65 and 66: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 67 and 68: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 69 and 70: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 71 and 72: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 73 and 74: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 75 and 76: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 77 and 78: 3.1. DCT AND HOMOGENEOUS COMPONENTS
- Page 79 and 80: 3.1. DCT AND HOMOGENEOUS COMPONENTS
Chapter 2<br />
Sparsity in Image Coding<br />
This chapter explains the connection between <strong>sparse</strong> <strong>image</strong> decomposition and efficient<br />
<strong>image</strong> coding. The technique we have developed promises to improve the efficiency of<br />
transform coding, which is ubiquitous in the world of digital signal and <strong>image</strong> processing.<br />
We start with an overview of <strong>image</strong> coding and emphasize the importance of transform<br />
coding. Then we review Donoho’s work that answers in a deterministic way the question:<br />
why does sparsity lead to good compression? Finally we review some important principles<br />
in <strong>image</strong> coding.<br />
2.1 Image Coding<br />
This section consists of three parts. The first begins with an overview of Shannon’s mathematical<br />
formulation of communication systems, and then describes the difference between<br />
source coding and channel coding. The second part describes a way of measuring the sparsity<br />
of a sequence and the connection between sparsity and coding, concluding with an<br />
asymptotic result. Finally, we discuss some of the requirements in designing a realistic<br />
transform coding scheme.<br />
Since all the work described here can easily be found in the literature, we focus on<br />
developing a historic perspective and overview.<br />
2.1.1 Shannon’s Mathematical Communication System<br />
Nowadays, it is almost impossible to review the field of theoretical signal processing without<br />
mentioning the contributions of Shannon. In his 1948 paper [127], Shannon writes, “The<br />
3