15.05.2015 Views

Introduction to Unconstrained Optimization - Scilab

Introduction to Unconstrained Optimization - Scilab

Introduction to Unconstrained Optimization - Scilab

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

By hypothesis x = θy 1 + (1 − θ)y 2 , so that the inequality can be simplified again<br />

and can be written as:<br />

θf(y 1 ) + (1 − θ)f(y 2 ) ≥ f(θy 1 + (1 − θ)y 2 ), (41)<br />

which shows that the function f is convex and concludes the proof.<br />

We shall find another form of the first order condition for convexity with simple<br />

algebraic manipulations. Indeed, the inequality 31 can be written g(x) T (y − x) ≤<br />

f(y) − f(x). Multiplying both terms by −1 yields<br />

g(x) T (x − y) ≥ f(x) − f(y). (42)<br />

If we interchange the role of x and y in 42, we get g(y) T (y − x) ≥ f(y) − f(x). The<br />

left hand side can be transformed so that the term x − y appear and we find that:<br />

− g(y) T (x − y) ≥ f(y) − f(x). (43)<br />

We can now take the sum of the two inequalities 42 and 43 and finally get:<br />

(g(x) − g(y)) T (x − y) ≥ 0. (44)<br />

This last derivation proved one part of the following proposition.<br />

Proposition 2.6. (First order condition of differentiable convex function - second<br />

form) Let f be continously differentiable. Then f is convex over a convex set C if<br />

and only if<br />

for all x, y ∈ C.<br />

(g(x) − g(y)) T (x − y) ≥ 0 (45)<br />

The second part of the proof is given as an exercise. The proof is based on an<br />

auxiliary function that we are going <strong>to</strong> describe in some detail.<br />

Assume that x, y are two points in the convex set C. Since C is a convex set,<br />

the point x + θ(y − x) = θy + (1 − θ)x is also in C, for all θ so that 0 ≤ θ ≤ 1. Let<br />

us define the following function<br />

φ(θ) = f(x + θ(y − x)), (46)<br />

for all θ so that 0 ≤ θ ≤ 1.<br />

The function φ should really be denoted by φ x,y , because it actually depends<br />

on the two points x, y ∈ C. Despite this lack of accuracy, we will keep our current<br />

notation for simplicity reasons, assuming that the points x, y ∈ C are kept constant..<br />

By definition, the function φ is so that the two following equalities are satisfied:<br />

φ(0) = f(x), φ(1) = f(y). (47)<br />

The function f is convex over the convex set C if and only if the function φ<br />

satisfies the following inequality<br />

φ(θ) ≥ θφ(1) + (1 − θ)φ(0), (48)<br />

23

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!