v2010.10.26 - Convex Optimization

v2010.10.26 - Convex Optimization v2010.10.26 - Convex Optimization

convexoptimization.com
from convexoptimization.com More from this publisher
12.07.2015 Views

686 APPENDIX D. MATRIX CALCULUSIn the case g(X) : R K →R has vector argument, they further simplify:and so on.D.1.7Taylor series→Ydg(X) = ∇g(X) T Y (1834)→Ydg 2 (X) = Y T ∇ 2 g(X)Y (1835)→Ydg 3 (X) = ∇ X(Y T ∇ 2 g(X)Y ) TY (1836)Series expansions of the differentiable matrix-valued function g(X) , ofmatrix argument, were given earlier in (1808) and (1829). Assuming g(X)has continuous first-, second-, and third-order gradients over the open setdomg , then for X ∈ domg and any Y ∈ R K×L the complete Taylor seriesis expressed on some open interval of µ∈Rg(X+µY ) = g(X) + µ dg(X) →Y+ 1 →Yµ2dg 2 (X) + 1 →Yµ3dg 3 (X) + o(µ 4 ) (1837)2! 3!or on some open interval of ‖Y ‖ 2g(Y ) = g(X) +→Y −X→Y −X→Y −Xdg(X) + 1 dg 2 (X) + 1 dg 3 (X) + o(‖Y ‖ 4 ) (1838)2! 3!which are third-order expansions about X . The mean value theorem fromcalculus is what insures finite order of the series. [219] [42,1.1] [41, App.A.5][199,0.4] These somewhat unbelievable formulae imply that a function canbe determined over the whole of its domain by knowing its value and all itsdirectional derivatives at a single point X .D.1.7.0.1 Example. Inverse-matrix function.Say g(Y )= Y −1 . From the table on page 692,→Ydg(X) = d dt∣ g(X+ t Y ) = −X −1 Y X −1 (1839)t=0→Ydg 2 (X) = d2dt 2 ∣∣∣∣t=0g(X+ t Y ) = 2X −1 Y X −1 Y X −1 (1840)→Ydg 3 (X) = d3dt 3 ∣∣∣∣t=0g(X+ t Y ) = −6X −1 Y X −1 Y X −1 Y X −1 (1841)

D.1. DIRECTIONAL DERIVATIVE, TAYLOR SERIES 687Let’s find the Taylor series expansion of g about X = I : Since g(I)= I , for‖Y ‖ 2 < 1 (µ = 1 in (1837))g(I + Y ) = (I + Y ) −1 = I − Y + Y 2 − Y 3 + ... (1842)If Y is small, (I + Y ) −1 ≈ I − Y . D.3 Now we find Taylor series expansionabout X :g(X + Y ) = (X + Y ) −1 = X −1 − X −1 Y X −1 + 2X −1 Y X −1 Y X −1 − ...(1843)If Y is small, (X + Y ) −1 ≈ X −1 − X −1 Y X −1 .D.1.7.0.2 Exercise. log det. (confer [61, p.644])Find the first three terms of a Taylor series expansion for log detY . Specifyan open interval over which the expansion holds in vicinity of X . D.1.8Correspondence of gradient to derivativeFrom the foregoing expressions for directional derivative, we derive arelationship between the gradient with respect to matrix X and the derivativewith respect to real variable t :D.1.8.1first-orderRemoving evaluation at t = 0 from (1809), D.4 we find an expression for thedirectional derivative of g(X) in direction Y evaluated anywhere along aline {X+ t Y | t ∈ R} intersecting domg→Ydg(X+ t Y ) = d g(X+ t Y ) (1844)dtIn the general case g(X) : R K×L →R M×N , from (1802) and (1805) we findtr ( ∇ X g mn (X+ t Y ) T Y ) = d dt g mn(X+ t Y ) (1845)D.3 Had we instead set g(Y )=(I + Y ) −1 , then the equivalent expansion would have beenabout X =0.D.4 Justified by replacing X with X+ tY in (1802)-(1804); beginning,dg mn (X+ tY )| dX→Y= ∑ k, l∂g mn (X+ tY )Y kl∂X kl

D.1. DIRECTIONAL DERIVATIVE, TAYLOR SERIES 687Let’s find the Taylor series expansion of g about X = I : Since g(I)= I , for‖Y ‖ 2 < 1 (µ = 1 in (1837))g(I + Y ) = (I + Y ) −1 = I − Y + Y 2 − Y 3 + ... (1842)If Y is small, (I + Y ) −1 ≈ I − Y . D.3 Now we find Taylor series expansionabout X :g(X + Y ) = (X + Y ) −1 = X −1 − X −1 Y X −1 + 2X −1 Y X −1 Y X −1 − ...(1843)If Y is small, (X + Y ) −1 ≈ X −1 − X −1 Y X −1 .D.1.7.0.2 Exercise. log det. (confer [61, p.644])Find the first three terms of a Taylor series expansion for log detY . Specifyan open interval over which the expansion holds in vicinity of X . D.1.8Correspondence of gradient to derivativeFrom the foregoing expressions for directional derivative, we derive arelationship between the gradient with respect to matrix X and the derivativewith respect to real variable t :D.1.8.1first-orderRemoving evaluation at t = 0 from (1809), D.4 we find an expression for thedirectional derivative of g(X) in direction Y evaluated anywhere along aline {X+ t Y | t ∈ R} intersecting domg→Ydg(X+ t Y ) = d g(X+ t Y ) (1844)dtIn the general case g(X) : R K×L →R M×N , from (1802) and (1805) we findtr ( ∇ X g mn (X+ t Y ) T Y ) = d dt g mn(X+ t Y ) (1845)D.3 Had we instead set g(Y )=(I + Y ) −1 , then the equivalent expansion would have beenabout X =0.D.4 Justified by replacing X with X+ tY in (1802)-(1804); beginning,dg mn (X+ tY )| dX→Y= ∑ k, l∂g mn (X+ tY )Y kl∂X kl

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!