v2009.01.01 - Convex Optimization
v2009.01.01 - Convex Optimization v2009.01.01 - Convex Optimization
234 CHAPTER 3. GEOMETRY OF CONVEX FUNCTIONS 3.2.0.0.2 Example. Taxicab distance matrix. Consider an n-dimensional vector space R n with metric induced by the 1-norm. Then distance between points x 1 and x 2 is the norm of their difference: ‖x 1 −x 2 ‖ 1 . Given a list of points arranged columnar in a matrix X = [x 1 · · · x N ] ∈ R n×N (68) then we could define a taxicab distance matrix D 1 (X) = ∆ (I ⊗ 1 T n) | vec(X)1 T − 1 ⊗X | ∈ S N h ∩ R N×N + ⎡ ⎤ 0 ‖x 1 − x 2 ‖ 1 ‖x 1 − x 3 ‖ 1 · · · ‖x 1 − x N ‖ 1 ‖x 1 − x 2 ‖ 1 0 ‖x 2 − x 3 ‖ 1 · · · ‖x 2 − x N ‖ 1 = ‖x ⎢ 1 − x 3 ‖ 1 ‖x 2 − x 3 ‖ 1 0 ‖x 3 − x N ‖ 1 ⎥ ⎣ . . ... . ⎦ ‖x 1 − x N ‖ 1 ‖x 2 − x N ‖ 1 ‖x 3 − x N ‖ 1 · · · 0 (559) where 1 n is a vector of ones having dim1 n = n and where ⊗ represents Kronecker product. This matrix-valued function is convex with respect to the nonnegative orthant since, for each and every Y,Z ∈ R n×N and all 0≤µ≤1 D 1 (µY + (1 − µ)Z) ≼ R N×N + µD 1 (Y ) + (1 − µ)D 1 (Z) (560) 3.2.0.0.3 Exercise. 1-norm distance matrix. The 1-norm is called taxicab distance because to go from one point to another in a city by car, road distance is a sum of grid lengths. Prove (560). 3.2.1 first-order convexity condition, matrix function From the scalar-definition (3.2.0.0.1) of a convex matrix-valued function, for differentiable function g and for each and every real vector w of unit norm ‖w‖= 1, we have w T g(Y )w ≥ w T →Y −X T g(X)w + w dg(X) w (561) that follows immediately from the first-order condition (544) for convexity of a real function because →Y −X T w dg(X) w = 〈 ∇ X w T g(X)w , Y − X 〉 (562)
3.2. MATRIX-VALUED CONVEX FUNCTION 235 →Y −X where dg(X) is the directional derivative (D.1.4) of function g at X in direction Y −X . By discretized dual generalized inequalities, (2.13.5) g(Y ) − g(X) − →Y −X dg(X) ≽ 0 ⇔ 〈 g(Y ) − g(X) − →Y −X 〉 dg(X) , ww T ≥ 0 ∀ww T (≽ 0) S M + For each and every X,Y ∈ domg (confer (551)) (563) S M + g(Y ) ≽ g(X) + →Y −X dg(X) (564) S M + must therefore be necessary and sufficient for convexity of a matrix-valued function of matrix variable on open convex domain. 3.2.2 epigraph of matrix-valued function, sublevel sets We generalize the epigraph to a continuous matrix-valued function g(X) : R p×k →S M : epig ∆ = {(X , T )∈ R p×k × S M | X ∈ domg , g(X) ≼ T } (565) S M + from which it follows g convex ⇔ epi g convex (566) Proof of necessity is similar to that in3.1.7 on page 214. Sublevel sets of a matrix-valued convex function corresponding to each and every S ∈ S M (confer (496)) L S g ∆ = {X ∈ dom g | g(X) ≼ S } ⊆ R p×k (567) S M + are convex. There is no converse.
- Page 183 and 184: 2.13. DUAL CONE & GENERALIZED INEQU
- Page 185 and 186: 2.13. DUAL CONE & GENERALIZED INEQU
- Page 187 and 188: 2.13. DUAL CONE & GENERALIZED INEQU
- Page 189 and 190: 2.13. DUAL CONE & GENERALIZED INEQU
- Page 191 and 192: 2.13. DUAL CONE & GENERALIZED INEQU
- Page 193 and 194: 2.13. DUAL CONE & GENERALIZED INEQU
- Page 195 and 196: Chapter 3 Geometry of convex functi
- Page 197 and 198: 3.1. CONVEX FUNCTION 197 f 1 (x) f
- Page 199 and 200: 3.1. CONVEX FUNCTION 199 3.1.3 norm
- Page 201 and 202: 3.1. CONVEX FUNCTION 201 A B 1 Figu
- Page 203 and 204: 3.1. CONVEX FUNCTION 203 k/m 1 0.9
- Page 205 and 206: k∑ i=1 3.1. CONVEX FUNCTION 205 S
- Page 207 and 208: 3.1. CONVEX FUNCTION 207 rather x >
- Page 209 and 210: 3.1. CONVEX FUNCTION 209 rather ] x
- Page 211 and 212: 3.1. CONVEX FUNCTION 211 3.1.6.0.2
- Page 213 and 214: 3.1. CONVEX FUNCTION 213 q(x) f(x)
- Page 215 and 216: 3.1. CONVEX FUNCTION 215 3.1.7.0.2
- Page 217 and 218: 3.1. CONVEX FUNCTION 217 We learned
- Page 219 and 220: 3.1. CONVEX FUNCTION 219 Since opti
- Page 221 and 222: 3.1. CONVEX FUNCTION 221 2 1.5 1 0.
- Page 223 and 224: 3.1. CONVEX FUNCTION 223 Setting th
- Page 225 and 226: 3.1. CONVEX FUNCTION 225 Similarly,
- Page 227 and 228: 3.1. CONVEX FUNCTION 227 For vector
- Page 229 and 230: 3.1. CONVEX FUNCTION 229 This means
- Page 231 and 232: 3.1. CONVEX FUNCTION 231 f(Y ) −
- Page 233: 3.2. MATRIX-VALUED CONVEX FUNCTION
- Page 237 and 238: 3.2. MATRIX-VALUED CONVEX FUNCTION
- Page 239 and 240: 3.3. QUASICONVEX 239 exponential al
- Page 241 and 242: 3.3. QUASICONVEX 241 Unlike convex
- Page 243 and 244: 3.4. SALIENT PROPERTIES 243 6. (af
- Page 245 and 246: Chapter 4 Semidefinite programming
- Page 247 and 248: 4.1. CONIC PROBLEM 247 where K is a
- Page 249 and 250: 4.1. CONIC PROBLEM 249 4.1.1.2 Redu
- Page 251 and 252: 4.1. CONIC PROBLEM 251 In any SDP f
- Page 253 and 254: 4.1. CONIC PROBLEM 253 Proposition
- Page 255 and 256: 4.2. FRAMEWORK 255 sets are closed
- Page 257 and 258: 4.2. FRAMEWORK 257 4.2.1.1.3 Exampl
- Page 259 and 260: 4.2. FRAMEWORK 259 4.2.2 Duals The
- Page 261 and 262: 4.2. FRAMEWORK 261 When equality is
- Page 263 and 264: 4.2. FRAMEWORK 263 The pseudoinvers
- Page 265 and 266: 4.2. FRAMEWORK 265 For the data giv
- Page 267 and 268: 4.2. FRAMEWORK 267 minimizes an aff
- Page 269 and 270: 4.3. RANK REDUCTION 269 whose rank
- Page 271 and 272: 4.3. RANK REDUCTION 271 and where m
- Page 273 and 274: 4.3. RANK REDUCTION 273 4.3.3 Optim
- Page 275 and 276: 4.3. RANK REDUCTION 275 Initialize:
- Page 277 and 278: 4.4. RANK-CONSTRAINED SEMIDEFINITE
- Page 279 and 280: 4.4. RANK-CONSTRAINED SEMIDEFINITE
- Page 281 and 282: 4.4. RANK-CONSTRAINED SEMIDEFINITE
- Page 283 and 284: 4.4. RANK-CONSTRAINED SEMIDEFINITE
234 CHAPTER 3. GEOMETRY OF CONVEX FUNCTIONS<br />
3.2.0.0.2 Example. Taxicab distance matrix.<br />
Consider an n-dimensional vector space R n with metric induced by the<br />
1-norm. Then distance between points x 1 and x 2 is the norm of their<br />
difference: ‖x 1 −x 2 ‖ 1 . Given a list of points arranged columnar in a matrix<br />
X = [x 1 · · · x N ] ∈ R n×N (68)<br />
then we could define a taxicab distance matrix<br />
D 1 (X) = ∆ (I ⊗ 1 T n) | vec(X)1 T − 1 ⊗X | ∈ S N h ∩ R N×N<br />
+<br />
⎡<br />
⎤<br />
0 ‖x 1 − x 2 ‖ 1 ‖x 1 − x 3 ‖ 1 · · · ‖x 1 − x N ‖ 1<br />
‖x 1 − x 2 ‖ 1 0 ‖x 2 − x 3 ‖ 1 · · · ‖x 2 − x N ‖ 1<br />
=<br />
‖x<br />
⎢ 1 − x 3 ‖ 1 ‖x 2 − x 3 ‖ 1 0 ‖x 3 − x N ‖ 1<br />
⎥<br />
⎣ . .<br />
... . ⎦<br />
‖x 1 − x N ‖ 1 ‖x 2 − x N ‖ 1 ‖x 3 − x N ‖ 1 · · · 0<br />
(559)<br />
where 1 n is a vector of ones having dim1 n = n and where ⊗ represents<br />
Kronecker product. This matrix-valued function is convex with respect to the<br />
nonnegative orthant since, for each and every Y,Z ∈ R n×N and all 0≤µ≤1<br />
D 1 (µY + (1 − µ)Z)<br />
≼<br />
R N×N<br />
+<br />
µD 1 (Y ) + (1 − µ)D 1 (Z) (560)<br />
<br />
3.2.0.0.3 Exercise. 1-norm distance matrix.<br />
The 1-norm is called taxicab distance because to go from one point to another<br />
in a city by car, road distance is a sum of grid lengths. Prove (560). <br />
3.2.1 first-order convexity condition, matrix function<br />
From the scalar-definition (3.2.0.0.1) of a convex matrix-valued function,<br />
for differentiable function g and for each and every real vector w of unit<br />
norm ‖w‖= 1, we have<br />
w T g(Y )w ≥ w T →Y −X<br />
T<br />
g(X)w + w dg(X) w (561)<br />
that follows immediately from the first-order condition (544) for convexity of<br />
a real function because<br />
→Y −X<br />
T<br />
w dg(X) w = 〈 ∇ X w T g(X)w , Y − X 〉 (562)