Master Qfin at Wu Vienna Lecture Optimization: R Udiger Frey
Master Qfin at Wu Vienna Lecture Optimization: R Udiger Frey
Spring 2014
1 / 29
Admin
homework assignments
2 / 29
Useful references
2014
Bertsekas, D., Nonlinear Programming, Athena Scientic
Publishing, 1999
Griva, Nash, Sofer, Linear and Nonlinear optimization, SIAM
publishing, 2009
3 / 29
Overview
Introduction and Unconstrained Optimization Introduction Mathematical Background Unconstrained Optimization: Theory
4 / 29
Optimization problems
is a subset of Rn , and the cost Here the set of admissible points X to R. Often the admissible points function f is a function from X are further restricted by explicit inequality constraints. Note that maximization problems can be addressed by replacing f with f , as supx X f (x ) = inf x X {f (x )}.
5 / 29
1, . . . S n) Here V0 is the initial wealth of an investor; S0 = (S0 0 1 ( ), . . . S n ( )) the is the initial asset price; ST ( ) = (ST T terminal asset price; represents the portfolio strategy. The increasing and concave function u : R R is the utility function that is used to model the risk aversion of the investor.
7 / 29
(gi () gi )2 .
i =1
If gi is linear in we have a standard regression problem; otherwise one speaks of a generalized regression problem. c) Maximum likelihood methods in statistics. d) Financial mathematics. Duality results from convex analysis are crucial in nancial mathematics (rst fundamental theorem of asset pricing or superhedging duality).
8 / 29
Overview
1. Unconstrained Optimization
Necessary and sucient optimality conditions Numerical methods
2. Lagrange multiplier theory and Karush-Kuhn-Tucker theory 3. Convex optimization and duality
Convexity and separation; The dual problem Duality results and economic applications
9 / 29
10 / 29
11 / 29
f (x ) =
f f x1 (x ), . . . , xn (x )
is the gradient of f .
Jg (x ) =
g1 ( x ) x1
...
. . .
g1 (x ) xn
. . .
gm (x ) x1
gm (x ) xn
12 / 29
(2)
= 0.
Idea. The function f can be approximated locally around x by the ane mapping y f (x ) + f (x )t (y x ). Similarly, we get for a C 1 function g : U Rm that g (y ) g (x ) = Jg (x )(y x )+ R (x , y x ) with lim R (x , z ) = 0. z
z 0
13 / 29
Chain rule
Theorem. Consider C 1 functions f : Rn Rm and g : Rk Rn and let h := f g . Then h is C 1 and it holds for the Jacobi matrix that Jh(x ) = Jf (g (x ))Jg (x ), i.e. the Jacobian of the concatenation is the product of the individual Jacobi matrices. Example. (Derivative along a vector). Consider a C 1 functions f : Rn R We want to consider the function f along the straight line (t ) := x + tv , for t R, x , v Rn . We have J (t ) = v , Jf (x ) = (f (x ))t and hence d d f ((t )) = (f (x +tv ))t v , in particular f ((0)) = (f (x ))t v . dt dt
14 / 29
Second derivatives
Denition. Consider C 1 function f : U Rn R. Then the rst f (x ) order partial derivatives xi , 1 i n, are themselves functions from U to R. 1. If all partial derivatives are C 1 functions, f is called twice continuously dierentiable on U (f C 2 (U )). Fix i , j {1, . . . , n}. Then one writes
f xi (x ) 2f (x ) := xi xj xj
for the second partial derivative in direction xi and xj . 2. For f C 2 (U ) the matrix Hf with Hfij (x ) = Hessian matrix of f .
2f xi xj (x )
is the
15 / 29
It follows that the Hessian is a symmetric matrix, that is Hfij (x ) = Hfji (x ), 1 i , j n. In particular, the deniteness of Hf can be checked using eigenvalues: HF is positive semi-denite if all eigenvalues are strictly positive and positive semidenite if all eigenvalues are non-negative.
16 / 29
Example
2f 2 = 6x1 x2 + 2x2 , 2 x1
2f 2 = 2x1 + 2, 2 x2
2f 2 = 3x1 + 4x1 x2 . x1 x2
17 / 29
= 0.
Idea. f can be approximated locally around x U by the quadratic function 1 y f (x ) + f (x )t (y x ) + (y x )t Hf (x )(y x ) . 2 Locally, this is a better approximation than the rst order Taylor approximation.
18 / 29
In this section we consider problems of the form = Rn minimize f (x ) for x X Moreover, we assume that f is once or twice continuously dierentiable. is an open subset of Rn . Most results hold also in the case where X (3)
19 / 29
20 / 29
Hf (x ) negative semidenite.
Necessary conditions in general not sucient: consider
f (x ) = x 3 , x = 0.
21 / 29
(4)
f (x ) = x 4 , x = 0.
No global statements possible.
22 / 29
23 / 29
Characterizations of Convexity
Lemma. Consider an open convex set X Rn . A C 1 function f : X Rn is convex on the if and only if it holds for all x , z X that f (z ) f (x ) + f (x ) (z x ). If f is C 2 a necessary and sucient condition for the convexity of f on X is the condition that Hf (x ) is positive semi-denite for all x X. Comments
f is concave on U Hf is negative semidenite on U . Note that we may decide convexity or concavity by nding the
eigenvalues of Hf (x ).
24 / 29
Example
2 + 2x x x 2 . Is f convex, Problem. Let f (x1 , x2 ) = 2x1 x2 x1 1 2 2 concave or none of both ?
Solution. The symmetric matrix representing the quadratic part of f is 1 1 A= 1 1 An easy computation gives for the Hessian that Hf (x ) = 2A. Hence we need to check the deniteness of A. Approach via eigenvalues. The characteristic polynomial of A is P () = 2 + 2; the equation P () = 0 has solutions (eigenvalues) 2 and 0. Hence, A is negative semidenite and the function is concave.
25 / 29
Proposition. Let f : X R be a convex function on some convex set X Rn . Then 1. A local minimum of f over X is also a global minimum. If f is strictly convex, there exists at most one global minimum. 2. If X is open, the condition f (x ) = 0 is necessary and sucient for x X to be a global minimum of f .
26 / 29
f (x ) = Qx b and Hf (x ) = Q . a) Local minima. If x is a local minimum we must have f (x ) = Qx b = 0, Hf (x ) = Q positive semi-denite; hence if Q is not positive semi-denite, f has no local minima. b) If Q is positive semi-denite, f is convex. In that case we need not distinguish global and local minima, and f has a global minimum if and only if there is some x with Qx = b . c) If Q is positive denite, Q 1 exists and the unique global minimum is attained at x = Q 1 b .
27 / 29
(3) There is some R such that the level set {x X : f (x ) } is non-empty and compact. Then f has at least one global minimum and the set of all global minima is compact. Remark. The result holds also for lower semicontinous functions. (f is called lower semicontinuous if for all x X , all (x k )k N with x k x lim inf k f (x k ) f (x ).)
28 / 29