v2009.01.01 - Convex Optimization

v2009.01.01 - Convex Optimization v2009.01.01 - Convex Optimization

convexoptimization.com
from convexoptimization.com More from this publisher
10.03.2015 Views

642 APPENDIX E. PROJECTION For any shape of matrix A of any rank, and given any vector b that may or may not be in R(A) , we wish to find a best Euclidean solution x ⋆ to Ax = b (1754) (more generally, Ax ≈ b given arbitrary matrices) by solving minimize x ‖Ax − b‖ 2 (1755) Necessary and sufficient condition for optimal solution to this unconstrained optimization is the so-called normal equation that results from zeroing the convex objective’s gradient: (D.2.1) A T Ax = A T b (1756) normal because error vector b −Ax is perpendicular to R(A) ; id est, A T (b −Ax)=0. Given any matrix A and any vector b , the normal equation is solvable exactly; always so, because R(A T A)= R(A T ) and A T b∈ R(A T ). When A is skinny-or-square full-rank, normal equation (1756) can be solved exactly by inversion: x ⋆ = (A T A) −1 A T b ≡ A † b (1757) For matrix A of arbitrary rank and shape, on the other hand, A T A might not be invertible. Yet the normal equation can always be solved exactly by: (1745) x ⋆ = lim A + t I) −1 A T b = A † b (1758) t→0 +(AT invertible for any positive value of t by (1354). The exact inversion (1757) and this pseudoinverse solution (1758) each solve lim minimize t→0 + x ‖Ax − b‖ 2 + t ‖x‖ 2 (1759) simultaneously providing least squares solution to (1755) and the classical least norm solution E.1 [287, App.A.4] (confer3.1.3.0.1) arg minimize ‖x‖ 2 x subject to Ax = AA † b (1760) E.1 This means: optimal solutions of lesser norm than the so-called least norm solution (1760) can be obtained (at expense of approximation Ax ≈ b hence, of perpendicularity) by ignoring the limiting operation and introducing finite positive values of t into (1759).

E.1. IDEMPOTENT MATRICES 643 where AA † b is the orthogonal projection of vector b on R(A). Least norm solution can be interpreted as orthogonal projection of the origin 0 on affine subset A = {x |Ax=AA † b} ; (E.5.0.0.5,E.5.0.0.6) arg minimize ‖x − 0‖ 2 x subject to x ∈ A (1761) equivalently, maximization of the Euclidean ball until it kisses A ; rather, dist(0, A). E.1 Idempotent matrices Projection matrices are square and defined by idempotence, P 2 =P ; [287,2.6] [178,1.3] equivalent to the condition, P be diagonalizable [176,3.3, prob.3] with eigenvalues φ i ∈ {0, 1}. [344,4.1, thm.4.1] Idempotent matrices are not necessarily symmetric. The transpose of an idempotent matrix remains idempotent; P T P T = P T . Solely excepting P = I , all projection matrices are neither orthogonal (B.5) or invertible. [287,3.4] The collection of all projection matrices of particular dimension does not form a convex set. Suppose we wish to project nonorthogonally (obliquely) on the range of any particular matrix A∈ R m×n . All idempotent matrices projecting nonorthogonally on R(A) may be expressed: P = A(A † + BZ T ) ∈ R m×m (1762) where R(P )= R(A) , E.2 B ∈ R n×k for k ∈{1... m} is otherwise arbitrary, and Z ∈ R m×k is any matrix whose range is in N(A T ) ; id est, A T Z = A † Z = 0 (1763) Evidently, the collection of nonorthogonal projectors projecting on R(A) is an affine subset P k = { A(A † + BZ T ) | B ∈ R n×k} (1764) E.2 Proof. R(P )⊆ R(A) is obvious [287,3.6]. By (131) and (132), R(A † + BZ T ) = {(A † + BZ T )y | y ∈ R m } ⊇ {(A † + BZ T )y | y ∈ R(A)} = R(A T ) R(P ) = {A(A † + BZ T )y | y ∈ R m } ⊇ {A(A † + BZ T )y | (A † + BZ T )y ∈ R(A T )} = R(A)

E.1. IDEMPOTENT MATRICES 643<br />

where AA † b is the orthogonal projection of vector b on R(A). Least norm<br />

solution can be interpreted as orthogonal projection of the origin 0 on affine<br />

subset A = {x |Ax=AA † b} ; (E.5.0.0.5,E.5.0.0.6)<br />

arg minimize ‖x − 0‖ 2<br />

x<br />

subject to x ∈ A<br />

(1761)<br />

equivalently, maximization of the Euclidean ball until it kisses A ; rather,<br />

dist(0, A).<br />

<br />

E.1 Idempotent matrices<br />

Projection matrices are square and defined by idempotence, P 2 =P ;<br />

[287,2.6] [178,1.3] equivalent to the condition, P be diagonalizable<br />

[176,3.3, prob.3] with eigenvalues φ i ∈ {0, 1}. [344,4.1, thm.4.1]<br />

Idempotent matrices are not necessarily symmetric. The transpose of an<br />

idempotent matrix remains idempotent; P T P T = P T . Solely excepting<br />

P = I , all projection matrices are neither orthogonal (B.5) or invertible.<br />

[287,3.4] The collection of all projection matrices of particular dimension<br />

does not form a convex set.<br />

Suppose we wish to project nonorthogonally (obliquely) on the range<br />

of any particular matrix A∈ R m×n . All idempotent matrices projecting<br />

nonorthogonally on R(A) may be expressed:<br />

P = A(A † + BZ T ) ∈ R m×m (1762)<br />

where R(P )= R(A) , E.2 B ∈ R n×k for k ∈{1... m} is otherwise arbitrary,<br />

and Z ∈ R m×k is any matrix whose range is in N(A T ) ; id est,<br />

A T Z = A † Z = 0 (1763)<br />

Evidently, the collection of nonorthogonal projectors projecting on R(A) is<br />

an affine subset<br />

P k = { A(A † + BZ T ) | B ∈ R n×k} (1764)<br />

E.2 Proof. R(P )⊆ R(A) is obvious [287,3.6]. By (131) and (132),<br />

R(A † + BZ T ) = {(A † + BZ T )y | y ∈ R m }<br />

⊇ {(A † + BZ T )y | y ∈ R(A)} = R(A T )<br />

R(P ) = {A(A † + BZ T )y | y ∈ R m }<br />

⊇ {A(A † + BZ T )y | (A † + BZ T )y ∈ R(A T )} = R(A)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!