lecture_09
lecture_09
November 4, 2020
Warning: This material is not meant to be lecture notes. It only gathers the main concepts
and results from the lecture, without any additional explanation, motivation, examples, figures...
1 Convex sets
Definition 1.1 (Convex set)
A set C ⊂ Rn is convex if for all x, y ∈ C and all α ∈ [0, 1],
αx + (1 − α)y ∈ C.
Proposition 1.1
If C is convex then all convex combination of elements of C remains in C.
2 Convex functions
1
Definition 2.1
A function f : Rn → R is convex if for all x, y ∈ Rn and all α ∈ [0, 1],
We say that f is strictly convex is there is strict inequality in (1) whenever x 6= y and
α ∈ (0, 1).
A function f is concave (respectively strictly concave) if −f is convex (respectively strictly
convex).
Notice that a linear function is also a convex function since it verifies (1) with equality, but
is not strictly convex.
Exercise 2.1. Let f : Rn → R a convex function and α ∈ R. Show that the “α-sublevel set”
Cα = x ∈ Rn f (x) ≤ α
is convex.
Proposition 2.1
A differentiable function f : Rn → R is convex if and only if for all x, y ∈ Rn
Corollary 2.1
Let f : Rn → R be a differentiable convex function and x ∈ Rn . Then
x is a minimizer of f ⇐⇒ ∇f (x) = 0.
2
Proposition 2.2
Let f : Rn → R be a twice-differentiable function. We denote by Hf the Hessian matrix of
f . Then f is convex if and only if for all x ∈ Rn , Hf (x) is positive semi-definite.
g: R → R
t 7→ f (x + tv)
k k
!
X X
f αi xi ≤ αi f (xi ).
i=1 i=1
Remark 2.2. If f is concave then Proposition 2.4 holds, but with inequalities in the reverse
order.
Example 2.1 (Discrete entropy). Let Z be a random variable that take value in {1, . . . , k} and
write pi = P(Z = i). The entropy of Z is defined as
k
X
H(Z) = − pi log(pi ).
i=1
The entropy of Z is a measure of the uncertainty associated with Z. We apply Jensen’s inequality
to the concave function log:
k k
!
X X
H(Z) = pi log(1/pi ) ≤ log pi /pi = log(k).
i=1 i=1
Notice that H(Z) = log(k) when Z is uniformly distributed over {1, . . . , k}, i.e. P(Z = i) = 1/k
for all i. Conclusion: maximal entropy is achieved for the uniform distribution.
3
2.3 Operations that preserve convexity
Further reading
See [1] Chapters 2 and 3 for example of properties of convex sets/functions. See also http:
//web.stanford.edu/class/ee364a/lectures.html for nice lecture slides. The book [2] is a
great reference for convex analysis, but is mathematically more involved.
References
[1] Stephen Boyd and Lieven Vandenberghe. Convex optimization. Cambridge university press,
https://web.stanford.edu/˜boyd/cvxbook/, 2004.
[2] R Tyrrell Rockafellar. Convex analysis, volume 28. Princeton university press, 1970.