v2009.01.01 - Convex Optimization
v2009.01.01 - Convex Optimization v2009.01.01 - Convex Optimization
624 APPENDIX D. MATRIX CALCULUS which can be proved by substitution of variables in (1698). second-order total differential due to any Y ∈R K×L is The mn th d 2 g mn (X)| dX→Y = ∑ i,j ∑ k,l ∂ 2 g mn (X) Y kl Y ij = tr (∇ X tr ( ∇g mn (X) T Y ) ) T Y ∂X kl ∂X ij (1700) = ∑ ∂g mn (X + ∆t Y ) − ∂g mn (X) lim Y ij ∆t→0 ∂X i,j ij ∆t (1701) g mn (X + 2∆t Y ) − 2g mn (X + ∆t Y ) + g mn (X) = lim ∆t→0 ∆t 2 (1702) = d2 dt 2 ∣ ∣∣∣t=0 g mn (X+ t Y ) (1703) Hence the second directional derivative, →Y dg 2 (X) ∆ = ⎡ ⎢ ⎣ ⎡ tr (∇tr ( ∇g 11 (X) T Y ) ) T Y = tr (∇tr ( ∇g 21 (X) T Y ) ) T Y ⎢ ⎣ . tr (∇tr ( ∇g M1 (X) T Y ) ) T Y d 2 g 11 (X) d 2 g 12 (X) · · · d 2 g 1N (X) d 2 g 21 (X) d 2 g 22 (X) · · · d 2 g 2N (X) . . d 2 g M1 (X) d 2 g M2 (X) · · · . d 2 g MN (X) ⎤ ⎥ ∈ R M×N ⎦ ∣ ∣ dX→Y tr (∇tr ( ∇g 12 (X) T Y ) ) T Y · · · tr (∇tr ( ∇g 1N (X) T Y ) ) ⎤ T Y tr (∇tr ( ∇g 22 (X) T Y ) ) T Y · · · tr (∇tr ( ∇g 2N (X) T Y ) ) T Y . . tr (∇tr ( ∇g M2 (X) T Y ) ) T Y · · · tr (∇tr ( ∇g MN (X) T Y ) ⎥ ) ⎦ T Y ⎡ = ⎢ ⎣ ∑ ∑ i,j k,l ∑ ∑ i,j k,l ∂ 2 g 11 (X) ∂X kl ∂X ij Y kl Y ij ∂ 2 g 21 (X) ∂X kl ∂X ij Y kl Y ij . ∑ ∑ ∂ 2 g M1 (X) ∂X kl ∂X ij Y kl Y ij i,j k,l ∑ ∑ ∂ 2 g 12 (X) ∂X kl ∂X ij Y kl Y ij · · · i,j i,j k,l ∑ ∑ ∂ 2 g 22 (X) ∂X kl ∂X ij Y kl Y ij · · · k,l . ∑ ∑ ∂ 2 g M2 (X) ∂X kl ∂X ij Y kl Y ij · · · i,j k,l ∑ ∑ ⎤ ∂ 2 g 1N (X) ∂X kl ∂X ij Y kl Y ij i,j k,l ∑ ∑ ∂ 2 g 2N (X) ∂X kl ∂X ij Y kl Y ij i,j k,l . ⎥ ∑ ∑ ∂ 2 g MN (X) ⎦ ∂X kl ∂X ij Y kl Y ij i,j k,l (1704)
D.1. DIRECTIONAL DERIVATIVE, TAYLOR SERIES 625 from which it follows →Y dg 2 (X) = ∑ i,j ∑ k,l ∂ 2 g(X) ∂X kl ∂X ij Y kl Y ij = ∑ i,j ∂ ∂X ij →Y dg(X)Y ij (1705) Yet for all X ∈ domg , any Y ∈R K×L , and some open interval of t∈R g(X+ t Y ) = g(X) + t dg(X) →Y + 1 →Y t2dg 2 (X) + o(t 3 ) (1706) 2! which is the second-order Taylor series expansion about X . [190,18.4] [128,2.3.4] Differentiating twice with respect to t and subsequent t-zeroing isolates the third term of the expansion. Thus differentiating and zeroing g(X+ t Y ) in t is an operation equivalent to individually differentiating and zeroing every entry g mn (X+ t Y ) as in (1703). So the second directional derivative of g(X) : R K×L →R M×N becomes [239,2.1,5.4.5] [31,6.3.1] →Y dg 2 (X) = d2 dt 2 ∣ ∣∣∣t=0 g(X+ t Y ) ∈ R M×N (1707) which is again simplest. (confer (1686)) Directional derivative retains the dimensions of g . D.1.6 directional derivative expressions In the case of a real function g(X) : R K×L →R , all its directional derivatives are in R : →Y dg(X) = tr ( ∇g(X) T Y ) (1708) →Y dg 2 (X) = tr (∇ X tr ( ∇g(X) T Y ) ) ( ) T →Y Y = tr ∇ X dg(X) T Y ( →Y dg 3 (X) = tr ∇ X tr (∇ X tr ( ∇g(X) T Y ) ) ) ( ) T TY →Y Y = tr ∇ X dg 2 (X) T Y (1709) (1710)
- Page 573 and 574: A.7. ZEROS 573 For X,A∈ S M + [31
- Page 575 and 576: A.7. ZEROS 575 A.7.5.0.1 Propositio
- Page 577 and 578: Appendix B Simple matrices Mathemat
- Page 579 and 580: B.1. RANK-ONE MATRIX (DYAD) 579 R(v
- Page 581 and 582: B.1. RANK-ONE MATRIX (DYAD) 581 ran
- Page 583 and 584: B.2. DOUBLET 583 R([u v ]) R(Π)= R
- Page 585 and 586: B.3. ELEMENTARY MATRIX 585 If λ
- Page 587 and 588: B.4. AUXILIARY V -MATRICES 587 the
- Page 589 and 590: B.4. AUXILIARY V -MATRICES 589 18.
- Page 591 and 592: B.5. ORTHOGONAL MATRIX 591 B.5 Orth
- Page 593 and 594: B.5. ORTHOGONAL MATRIX 593 Figure 1
- Page 595 and 596: Appendix C Some analytical optimal
- Page 597 and 598: C.2. TRACE, SINGULAR AND EIGEN VALU
- Page 599 and 600: C.2. TRACE, SINGULAR AND EIGEN VALU
- Page 601 and 602: C.2. TRACE, SINGULAR AND EIGEN VALU
- Page 603 and 604: C.3. ORTHOGONAL PROCRUSTES PROBLEM
- Page 605 and 606: C.4. TWO-SIDED ORTHOGONAL PROCRUSTE
- Page 607 and 608: C.4. TWO-SIDED ORTHOGONAL PROCRUSTE
- Page 609 and 610: Appendix D Matrix calculus From too
- Page 611 and 612: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 613 and 614: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 615 and 616: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 617 and 618: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 619 and 620: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 621 and 622: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 623: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 627 and 628: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 629 and 630: D.1. DIRECTIONAL DERIVATIVE, TAYLOR
- Page 631 and 632: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 633 and 634: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 635 and 636: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 637 and 638: D.2. TABLES OF GRADIENTS AND DERIVA
- Page 639 and 640: Appendix E Projection For any A∈
- Page 641 and 642: 641 U T = U † for orthonormal (in
- Page 643 and 644: E.1. IDEMPOTENT MATRICES 643 where
- Page 645 and 646: E.1. IDEMPOTENT MATRICES 645 order,
- Page 647 and 648: E.1. IDEMPOTENT MATRICES 647 When t
- Page 649 and 650: E.3. SYMMETRIC IDEMPOTENT MATRICES
- Page 651 and 652: E.3. SYMMETRIC IDEMPOTENT MATRICES
- Page 653 and 654: E.3. SYMMETRIC IDEMPOTENT MATRICES
- Page 655 and 656: E.5. PROJECTION EXAMPLES 655 E.4.0.
- Page 657 and 658: E.5. PROJECTION EXAMPLES 657 a ∗
- Page 659 and 660: E.5. PROJECTION EXAMPLES 659 E.5.0.
- Page 661 and 662: E.6. VECTORIZATION INTERPRETATION,
- Page 663 and 664: E.6. VECTORIZATION INTERPRETATION,
- Page 665 and 666: E.6. VECTORIZATION INTERPRETATION,
- Page 667 and 668: E.6. VECTORIZATION INTERPRETATION,
- Page 669 and 670: E.7. ON VECTORIZED MATRICES OF HIGH
- Page 671 and 672: E.7. ON VECTORIZED MATRICES OF HIGH
- Page 673 and 674: E.8. RANGE/ROWSPACE INTERPRETATION
D.1. DIRECTIONAL DERIVATIVE, TAYLOR SERIES 625<br />
from which it follows<br />
→Y<br />
dg 2 (X) = ∑ i,j<br />
∑<br />
k,l<br />
∂ 2 g(X)<br />
∂X kl ∂X ij<br />
Y kl Y ij = ∑ i,j<br />
∂<br />
∂X ij<br />
→Y<br />
dg(X)Y ij (1705)<br />
Yet for all X ∈ domg , any Y ∈R K×L , and some open interval of t∈R<br />
g(X+ t Y ) = g(X) + t dg(X) →Y<br />
+ 1 →Y<br />
t2dg 2 (X) + o(t 3 ) (1706)<br />
2!<br />
which is the second-order Taylor series expansion about X . [190,18.4]<br />
[128,2.3.4] Differentiating twice with respect to t and subsequent t-zeroing<br />
isolates the third term of the expansion. Thus differentiating and zeroing<br />
g(X+ t Y ) in t is an operation equivalent to individually differentiating and<br />
zeroing every entry g mn (X+ t Y ) as in (1703). So the second directional<br />
derivative of g(X) : R K×L →R M×N becomes [239,2.1,5.4.5] [31,6.3.1]<br />
→Y<br />
dg 2 (X) = d2<br />
dt 2 ∣<br />
∣∣∣t=0<br />
g(X+ t Y ) ∈ R M×N (1707)<br />
which is again simplest. (confer (1686)) Directional derivative retains the<br />
dimensions of g .<br />
D.1.6<br />
directional derivative expressions<br />
In the case of a real function g(X) : R K×L →R , all its directional derivatives<br />
are in R :<br />
→Y<br />
dg(X) = tr ( ∇g(X) T Y ) (1708)<br />
→Y<br />
dg 2 (X) = tr<br />
(∇ X tr ( ∇g(X) T Y ) ) ( )<br />
T<br />
→Y<br />
Y = tr ∇ X dg(X) T Y<br />
(<br />
→Y<br />
dg 3 (X) = tr ∇ X tr<br />
(∇ X tr ( ∇g(X) T Y ) ) ) ( )<br />
T TY →Y<br />
Y = tr ∇ X dg 2 (X) T Y<br />
(1709)<br />
(1710)