10.03.2015 Views

sparse image representation via combined transforms - Convex ...

sparse image representation via combined transforms - Convex ...

sparse image representation via combined transforms - Convex ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

8 CHAPTER 2. SPARSITY IN IMAGE CODING<br />

asymptotic result, because it based on an infinite amount of information. In practice,<br />

both the source coder that compresses the message to the entropy rate and the channel<br />

coder that transmits the bits at almost channel capacity may require intolerable amount<br />

of computation and memory. In general, the use of a joint source-channel (JSC) code can<br />

lead to the same performance but lower computational and storage requirements. Although<br />

separate source and channel coding could be less effective, its appealing property is that a<br />

separate source coder and channel coder are easy to design and implement.<br />

From now on, we consider only source coding.<br />

2.1.3 Transform Coding<br />

The purpose of source encoding is to transfer a message into a bit stream; and source<br />

decoding is essentially an inverse operation. There is no need to overstate the ubiquity<br />

of transform coding in modern digital communication systems. For example, JPEG is an<br />

industry standard for still <strong>image</strong> compression and transmission. It implements a linear<br />

transform—a two-dimensional discrete cosine transform (2-D DCT). The MPEG standard<br />

is an international standard for video compression and transmission that incorporates the<br />

JPEG standards for still <strong>image</strong> compression. The MPEG standard could be the standard<br />

for future digital television.<br />

Figure 2.3 gives a general depiction of a modern digital communication system with embedded<br />

transform coding. The input message is x. The operator T is a transform operator.<br />

Most of the time, T is a linear time invariant (LTI) transform. After the transform, we<br />

get coefficients y. Operator Q is a quantizer—it transfer continuous variables (in this case,<br />

they are coefficients after transform T ) into discrete values. Without loss of generality, we<br />

can assume each message is transferred into a bit stream of finite length. The operator E<br />

is an entropy coder. It further shortens the bit stream by using, for example, run-length<br />

coding for pictures [23], Huffman coding, arithmetic coding, or Lempel-Ziv coding, etc. Operators<br />

T , Q and E together make the encoder. We assume that the bit stream is perfectly<br />

transmitted through an error-corrected channel. The operators E −1 and Q −1 are inverse<br />

operators of the entropy coder E and the quantizer Q, respectively. The operator U is the<br />

reconstruction transform of transform T . If transform T is invertible, then the transform<br />

U should be the inverse transform of T , or equivalently, U = T −1 . The final output at the<br />

receiver is the estimation, denoted by ˆx, of the original message. Operators E −1 , Q −1 and<br />

U together form the decoder.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!