Chapter 3 Geometry of convex functions - Meboo Publishing ...
Chapter 3 Geometry of convex functions - Meboo Publishing ... Chapter 3 Geometry of convex functions - Meboo Publishing ...
244 CHAPTER 3. GEOMETRY OF CONVEX FUNCTIONS whose gradient (D.2.3) ∇ X ‖XA − I‖ 2 F = 2 ( XAA T − A T) = 0 (573) vanishes when XAA T = A T (574) When A is fat full-rank, then AA T is invertible, X ⋆ = A T (AA T ) −1 is the pseudoinverse A † , and AA † =I . Otherwise, we can make AA T invertible by adding a positively scaled identity, for any A∈ R m×n X = A T (AA T + t I) −1 (575) Invertibility is guaranteed for any finite positive value of t by (1427). Then matrix X becomes the pseudoinverse X → A † X ⋆ in the limit t → 0 + . Minimizing instead ‖AX − I‖ 2 F yields the second flavor in (1829). 3.7.0.0.3 Example. Hyperplane, line, described by affine function. Consider the real affine function of vector variable, (confer Figure 71) f(x) : R p → R = a T x + b (576) whose domain is R p and whose gradient ∇f(x)=a is a constant vector (independent of x). This function describes the real line R (its range), and it describes a nonvertical [195,B.1.2] hyperplane ∂H in the space R p × R for any particular vector a (confer2.4.2); {[ ∂H = having nonzero normal x a T x + b η = [ a −1 ] } | x∈ R p ⊂ R p ×R (577) ] ∈ R p ×R (578)
3.7. GRADIENT 245 This equivalence to a hyperplane holds only for real functions. [ ] 3.17 Epigraph R p of real affine function f(x) is therefore a halfspace in , so we have: R The real affine function is to convex functions as the hyperplane is to convex sets. Similarly, the matrix-valued affine function of real variable x , for any particular matrix A∈ R M×N , describes a line in R M×N in direction A and describes a line in R×R M×N {[ h(x) : R→R M×N = Ax + B (579) {Ax + B | x∈ R} ⊆ R M×N (580) x Ax + B ] } | x∈ R ⊂ R×R M×N (581) whose slope with respect to x is A . 3.17 To prove that, consider a vector-valued affine function f(x) : R p →R M = Ax + b having gradient ∇f(x)=A T ∈ R p×M : The affine set {[ ] } x | x∈ R p ⊂ R p ×R M Ax + b is perpendicular to η [ ∇f(x) −I ] ∈ R p×M × R M×M because η T ([ x Ax + b ] [ 0 − b ]) = 0 ∀x ∈ R p Yet η is a vector (in R p ×R M ) only when M = 1.
- Page 1 and 2: Chapter 3 Geometry of convex functi
- Page 3 and 4: 3.1. CONVEX FUNCTION 213 f 1 (x) f
- Page 5 and 6: 3.1. CONVEX FUNCTION 215 Rf (b) f(X
- Page 7 and 8: 3.3. PRACTICAL NORM FUNCTIONS, ABSO
- Page 9 and 10: 3.3. PRACTICAL NORM FUNCTIONS, ABSO
- Page 11 and 12: 3.3. PRACTICAL NORM FUNCTIONS, ABSO
- Page 13 and 14: 3.3. PRACTICAL NORM FUNCTIONS, ABSO
- Page 15 and 16: 3.3. PRACTICAL NORM FUNCTIONS, ABSO
- Page 17 and 18: 3.4. INVERTED FUNCTIONS AND ROOTS 2
- Page 19 and 20: 3.5. AFFINE FUNCTION 229 3.4.1.3 po
- Page 21 and 22: 3.5. AFFINE FUNCTION 231 3.5.0.0.2
- Page 23 and 24: 3.6. EPIGRAPH, SUBLEVEL SET 233 q(x
- Page 25 and 26: 3.6. EPIGRAPH, SUBLEVEL SET 235 To
- Page 27 and 28: 3.6. EPIGRAPH, SUBLEVEL SET 237 con
- Page 29 and 30: 3.6. EPIGRAPH, SUBLEVEL SET 239 3.6
- Page 31 and 32: 3.7. GRADIENT 241 2 1.5 1 0.5 Y 2 0
- Page 33: 3.7. GRADIENT 243 From (1749) andD.
- Page 37 and 38: 3.7. GRADIENT 247 3.7.1.0.3 Example
- Page 39 and 40: 3.7. GRADIENT 249 f(Y ) [ ∇f(X)
- Page 41 and 42: 3.7. GRADIENT 251 meaning, the grad
- Page 43 and 44: 3.8. MATRIX-VALUED CONVEX FUNCTION
- Page 45 and 46: 3.8. MATRIX-VALUED CONVEX FUNCTION
- Page 47 and 48: 3.8. MATRIX-VALUED CONVEX FUNCTION
- Page 49 and 50: 3.9. QUASICONVEX 259 3.8.3.0.6 Exam
- Page 51 and 52: 3.9. QUASICONVEX 261 3.9.0.0.2 Defi
- Page 53: 3.10. SALIENT PROPERTIES 263 7. - N
244 CHAPTER 3. GEOMETRY OF CONVEX FUNCTIONS<br />
whose gradient (D.2.3)<br />
∇ X ‖XA − I‖ 2 F = 2 ( XAA T − A T) = 0 (573)<br />
vanishes when<br />
XAA T = A T (574)<br />
When A is fat full-rank, then AA T is invertible, X ⋆ = A T (AA T ) −1 is the<br />
pseudoinverse A † , and AA † =I . Otherwise, we can make AA T invertible<br />
by adding a positively scaled identity, for any A∈ R m×n<br />
X = A T (AA T + t I) −1 (575)<br />
Invertibility is guaranteed for any finite positive value <strong>of</strong> t by (1427). Then<br />
matrix X becomes the pseudoinverse X → A † X ⋆ in the limit t → 0 + .<br />
Minimizing instead ‖AX − I‖ 2 F yields the second flavor in (1829). <br />
3.7.0.0.3 Example. Hyperplane, line, described by affine function.<br />
Consider the real affine function <strong>of</strong> vector variable, (confer Figure 71)<br />
f(x) : R p → R = a T x + b (576)<br />
whose domain is R p and whose gradient ∇f(x)=a is a constant vector<br />
(independent <strong>of</strong> x). This function describes the real line R (its range), and<br />
it describes a nonvertical [195,B.1.2] hyperplane ∂H in the space R p × R<br />
for any particular vector a (confer2.4.2);<br />
{[<br />
∂H =<br />
having nonzero normal<br />
x<br />
a T x + b<br />
η =<br />
[ a<br />
−1<br />
] }<br />
| x∈ R p ⊂ R p ×R (577)<br />
]<br />
∈ R p ×R (578)