v2009.01.01 - Convex Optimization
v2009.01.01 - Convex Optimization v2009.01.01 - Convex Optimization
642 APPENDIX E. PROJECTION For any shape of matrix A of any rank, and given any vector b that may or may not be in R(A) , we wish to find a best Euclidean solution x ⋆ to Ax = b (1754) (more generally, Ax ≈ b given arbitrary matrices) by solving minimize x ‖Ax − b‖ 2 (1755) Necessary and sufficient condition for optimal solution to this unconstrained optimization is the so-called normal equation that results from zeroing the convex objective’s gradient: (D.2.1) A T Ax = A T b (1756) normal because error vector b −Ax is perpendicular to R(A) ; id est, A T (b −Ax)=0. Given any matrix A and any vector b , the normal equation is solvable exactly; always so, because R(A T A)= R(A T ) and A T b∈ R(A T ). When A is skinny-or-square full-rank, normal equation (1756) can be solved exactly by inversion: x ⋆ = (A T A) −1 A T b ≡ A † b (1757) For matrix A of arbitrary rank and shape, on the other hand, A T A might not be invertible. Yet the normal equation can always be solved exactly by: (1745) x ⋆ = lim A + t I) −1 A T b = A † b (1758) t→0 +(AT invertible for any positive value of t by (1354). The exact inversion (1757) and this pseudoinverse solution (1758) each solve lim minimize t→0 + x ‖Ax − b‖ 2 + t ‖x‖ 2 (1759) simultaneously providing least squares solution to (1755) and the classical least norm solution E.1 [287, App.A.4] (confer3.1.3.0.1) arg minimize ‖x‖ 2 x subject to Ax = AA † b (1760) E.1 This means: optimal solutions of lesser norm than the so-called least norm solution (1760) can be obtained (at expense of approximation Ax ≈ b hence, of perpendicularity) by ignoring the limiting operation and introducing finite positive values of t into (1759).
E.1. IDEMPOTENT MATRICES 643 where AA † b is the orthogonal projection of vector b on R(A). Least norm solution can be interpreted as orthogonal projection of the origin 0 on affine subset A = {x |Ax=AA † b} ; (E.5.0.0.5,E.5.0.0.6) arg minimize ‖x − 0‖ 2 x subject to x ∈ A (1761) equivalently, maximization of the Euclidean ball until it kisses A ; rather, dist(0, A). E.1 Idempotent matrices Projection matrices are square and defined by idempotence, P 2 =P ; [287,2.6] [178,1.3] equivalent to the condition, P be diagonalizable [176,3.3, prob.3] with eigenvalues φ i ∈ {0, 1}. [344,4.1, thm.4.1] Idempotent matrices are not necessarily symmetric. The transpose of an idempotent matrix remains idempotent; P T P T = P T . Solely excepting P = I , all projection matrices are neither orthogonal (B.5) or invertible. [287,3.4] The collection of all projection matrices of particular dimension does not form a convex set. Suppose we wish to project nonorthogonally (obliquely) on the range of any particular matrix A∈ R m×n . All idempotent matrices projecting nonorthogonally on R(A) may be expressed: P = A(A † + BZ T ) ∈ R m×m (1762) where R(P )= R(A) , E.2 B ∈ R n×k for k ∈{1... m} is otherwise arbitrary, and Z ∈ R m×k is any matrix whose range is in N(A T ) ; id est, A T Z = A † Z = 0 (1763) Evidently, the collection of nonorthogonal projectors projecting on R(A) is an affine subset P k = { A(A † + BZ T ) | B ∈ R n×k} (1764) E.2 Proof. R(P )⊆ R(A) is obvious [287,3.6]. By (131) and (132), R(A † + BZ T ) = {(A † + BZ T )y | y ∈ R m } ⊇ {(A † + BZ T )y | y ∈ R(A)} = R(A T ) R(P ) = {A(A † + BZ T )y | y ∈ R m } ⊇ {A(A † + BZ T )y | (A † + BZ T )y ∈ R(A T )} = R(A)
- Page 591 and 592: B.5. ORTHOGONAL MATRIX 591 B.5 Orth
- Page 593 and 594: B.5. ORTHOGONAL MATRIX 593 Figure 1
- Page 595 and 596: Appendix C Some analytical optimal
- Page 597 and 598: C.2. TRACE, SINGULAR AND EIGEN VALU
- Page 599 and 600: C.2. TRACE, SINGULAR AND EIGEN VALU
- Page 601 and 602: C.2. TRACE, SINGULAR AND EIGEN VALU
- Page 603 and 604: C.3. ORTHOGONAL PROCRUSTES PROBLEM
- Page 605 and 606: C.4. TWO-SIDED ORTHOGONAL PROCRUSTE
- Page 607 and 608: C.4. TWO-SIDED ORTHOGONAL PROCRUSTE
- Page 609 and 610: Appendix D Matrix calculus From too
- Page 611 and 612: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 613 and 614: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 615 and 616: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 617 and 618: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 619 and 620: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 621 and 622: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 623 and 624: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 625 and 626: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 627 and 628: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 629 and 630: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 631 and 632: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 633 and 634: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 635 and 636: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 637 and 638: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 639 and 640: Appendix E Projection For any A∈
- Page 641: 641 U T = U † for orthonormal (in
- Page 645 and 646: E.1. IDEMPOTENT MATRICES 645 order,
- Page 647 and 648: E.1. IDEMPOTENT MATRICES 647 When t
- Page 649 and 650: E.3. SYMMETRIC IDEMPOTENT MATRICES
- Page 651 and 652: E.3. SYMMETRIC IDEMPOTENT MATRICES
- Page 653 and 654: E.3. SYMMETRIC IDEMPOTENT MATRICES
- Page 655 and 656: E.5. PROJECTION EXAMPLES 655 E.4.0.
- Page 657 and 658: E.5. PROJECTION EXAMPLES 657 a ∗
- Page 659 and 660: E.5. PROJECTION EXAMPLES 659 E.5.0.
- Page 661 and 662: E.6. VECTORIZATION INTERPRETATION,
- Page 663 and 664: E.6. VECTORIZATION INTERPRETATION,
- Page 665 and 666: E.6. VECTORIZATION INTERPRETATION,
- Page 667 and 668: E.6. VECTORIZATION INTERPRETATION,
- Page 669 and 670: E.7. ON VECTORIZED MATRICES OF HIGH
- Page 671 and 672: E.7. ON VECTORIZED MATRICES OF HIGH
- Page 673 and 674: E.8. RANGE/ROWSPACE INTERPRETATION
- Page 675 and 676: E.9. PROJECTION ON CONVEX SET 675 A
- Page 677 and 678: E.9. PROJECTION ON CONVEX SET 677 W
- Page 679 and 680: E.9. PROJECTION ON CONVEX SET 679 P
- Page 681 and 682: E.9. PROJECTION ON CONVEX SET 681 E
- Page 683 and 684: E.9. PROJECTION ON CONVEX SET 683 T
- Page 685 and 686: E.9. PROJECTION ON CONVEX SET 685
- Page 687 and 688: E.10. ALTERNATING PROJECTION 687 E.
- Page 689 and 690: E.10. ALTERNATING PROJECTION 689 b
- Page 691 and 692: E.10. ALTERNATING PROJECTION 691 a
E.1. IDEMPOTENT MATRICES 643<br />
where AA † b is the orthogonal projection of vector b on R(A). Least norm<br />
solution can be interpreted as orthogonal projection of the origin 0 on affine<br />
subset A = {x |Ax=AA † b} ; (E.5.0.0.5,E.5.0.0.6)<br />
arg minimize ‖x − 0‖ 2<br />
x<br />
subject to x ∈ A<br />
(1761)<br />
equivalently, maximization of the Euclidean ball until it kisses A ; rather,<br />
dist(0, A).<br />
<br />
E.1 Idempotent matrices<br />
Projection matrices are square and defined by idempotence, P 2 =P ;<br />
[287,2.6] [178,1.3] equivalent to the condition, P be diagonalizable<br />
[176,3.3, prob.3] with eigenvalues φ i ∈ {0, 1}. [344,4.1, thm.4.1]<br />
Idempotent matrices are not necessarily symmetric. The transpose of an<br />
idempotent matrix remains idempotent; P T P T = P T . Solely excepting<br />
P = I , all projection matrices are neither orthogonal (B.5) or invertible.<br />
[287,3.4] The collection of all projection matrices of particular dimension<br />
does not form a convex set.<br />
Suppose we wish to project nonorthogonally (obliquely) on the range<br />
of any particular matrix A∈ R m×n . All idempotent matrices projecting<br />
nonorthogonally on R(A) may be expressed:<br />
P = A(A † + BZ T ) ∈ R m×m (1762)<br />
where R(P )= R(A) , E.2 B ∈ R n×k for k ∈{1... m} is otherwise arbitrary,<br />
and Z ∈ R m×k is any matrix whose range is in N(A T ) ; id est,<br />
A T Z = A † Z = 0 (1763)<br />
Evidently, the collection of nonorthogonal projectors projecting on R(A) is<br />
an affine subset<br />
P k = { A(A † + BZ T ) | B ∈ R n×k} (1764)<br />
E.2 Proof. R(P )⊆ R(A) is obvious [287,3.6]. By (131) and (132),<br />
R(A † + BZ T ) = {(A † + BZ T )y | y ∈ R m }<br />
⊇ {(A † + BZ T )y | y ∈ R(A)} = R(A T )<br />
R(P ) = {A(A † + BZ T )y | y ∈ R m }<br />
⊇ {A(A † + BZ T )y | (A † + BZ T )y ∈ R(A T )} = R(A)