Chapter 3 Geometry of convex functions - Meboo Publishing ...

Chapter 3 Geometry of convex functions - Meboo Publishing ... Chapter 3 Geometry of convex functions - Meboo Publishing ...

convexoptimization.com
from convexoptimization.com More from this publisher
10.03.2015 Views

244 CHAPTER 3. GEOMETRY OF CONVEX FUNCTIONS whose gradient (D.2.3) ∇ X ‖XA − I‖ 2 F = 2 ( XAA T − A T) = 0 (573) vanishes when XAA T = A T (574) When A is fat full-rank, then AA T is invertible, X ⋆ = A T (AA T ) −1 is the pseudoinverse A † , and AA † =I . Otherwise, we can make AA T invertible by adding a positively scaled identity, for any A∈ R m×n X = A T (AA T + t I) −1 (575) Invertibility is guaranteed for any finite positive value of t by (1427). Then matrix X becomes the pseudoinverse X → A † X ⋆ in the limit t → 0 + . Minimizing instead ‖AX − I‖ 2 F yields the second flavor in (1829). 3.7.0.0.3 Example. Hyperplane, line, described by affine function. Consider the real affine function of vector variable, (confer Figure 71) f(x) : R p → R = a T x + b (576) whose domain is R p and whose gradient ∇f(x)=a is a constant vector (independent of x). This function describes the real line R (its range), and it describes a nonvertical [195,B.1.2] hyperplane ∂H in the space R p × R for any particular vector a (confer2.4.2); {[ ∂H = having nonzero normal x a T x + b η = [ a −1 ] } | x∈ R p ⊂ R p ×R (577) ] ∈ R p ×R (578)

3.7. GRADIENT 245 This equivalence to a hyperplane holds only for real functions. [ ] 3.17 Epigraph R p of real affine function f(x) is therefore a halfspace in , so we have: R The real affine function is to convex functions as the hyperplane is to convex sets. Similarly, the matrix-valued affine function of real variable x , for any particular matrix A∈ R M×N , describes a line in R M×N in direction A and describes a line in R×R M×N {[ h(x) : R→R M×N = Ax + B (579) {Ax + B | x∈ R} ⊆ R M×N (580) x Ax + B ] } | x∈ R ⊂ R×R M×N (581) whose slope with respect to x is A . 3.17 To prove that, consider a vector-valued affine function f(x) : R p →R M = Ax + b having gradient ∇f(x)=A T ∈ R p×M : The affine set {[ ] } x | x∈ R p ⊂ R p ×R M Ax + b is perpendicular to η [ ∇f(x) −I ] ∈ R p×M × R M×M because η T ([ x Ax + b ] [ 0 − b ]) = 0 ∀x ∈ R p Yet η is a vector (in R p ×R M ) only when M = 1.

244 CHAPTER 3. GEOMETRY OF CONVEX FUNCTIONS<br />

whose gradient (D.2.3)<br />

∇ X ‖XA − I‖ 2 F = 2 ( XAA T − A T) = 0 (573)<br />

vanishes when<br />

XAA T = A T (574)<br />

When A is fat full-rank, then AA T is invertible, X ⋆ = A T (AA T ) −1 is the<br />

pseudoinverse A † , and AA † =I . Otherwise, we can make AA T invertible<br />

by adding a positively scaled identity, for any A∈ R m×n<br />

X = A T (AA T + t I) −1 (575)<br />

Invertibility is guaranteed for any finite positive value <strong>of</strong> t by (1427). Then<br />

matrix X becomes the pseudoinverse X → A † X ⋆ in the limit t → 0 + .<br />

Minimizing instead ‖AX − I‖ 2 F yields the second flavor in (1829). <br />

3.7.0.0.3 Example. Hyperplane, line, described by affine function.<br />

Consider the real affine function <strong>of</strong> vector variable, (confer Figure 71)<br />

f(x) : R p → R = a T x + b (576)<br />

whose domain is R p and whose gradient ∇f(x)=a is a constant vector<br />

(independent <strong>of</strong> x). This function describes the real line R (its range), and<br />

it describes a nonvertical [195,B.1.2] hyperplane ∂H in the space R p × R<br />

for any particular vector a (confer2.4.2);<br />

{[<br />

∂H =<br />

having nonzero normal<br />

x<br />

a T x + b<br />

η =<br />

[ a<br />

−1<br />

] }<br />

| x∈ R p ⊂ R p ×R (577)<br />

]<br />

∈ R p ×R (578)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!