Ps 2
The exercises for this week will not count toward your grade, but you are highly encouraged to
solve them all.
Exercise 0.
Prove that if a matrix $A \in \mathbb{R}^{n \times n}$ is symmetric, then $\|A\| = \max(|\lambda_{\max}(A)|, |\lambda_{\min}(A)|)$, and give
an example of a non-symmetric matrix for which this is not true.
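Before attempting the proof, the symmetric case can be checked numerically. The sketch below is a sanity check only, assuming that $\|A\|$ denotes the spectral norm (the largest singular value); it is restricted to 2x2 matrices so that closed-form eigenvalues suffice. The helper names `sym_eigs_2x2` and `spectral_norm_2x2` are hypothetical and introduced only for this illustration.

```python
import math

def sym_eigs_2x2(a, b, d):
    """Return (lambda_min, lambda_max) of the symmetric matrix [[a, b], [b, d]]."""
    mean = (a + d) / 2.0
    radius = math.hypot((a - d) / 2.0, b)
    return mean - radius, mean + radius

def spectral_norm_2x2(m):
    """Spectral norm of a 2x2 matrix m: sqrt of the largest eigenvalue of m^T m."""
    (p, q), (r, s) = m
    # Entries of the symmetric matrix m^T m.
    g11 = p * p + r * r
    g12 = p * q + r * s
    g22 = q * q + s * s
    _, lam_max = sym_eigs_2x2(g11, g12, g22)
    return math.sqrt(max(lam_max, 0.0))

# A symmetric test matrix (chosen arbitrarily for illustration).
A = [[1.0, 2.0], [2.0, -3.0]]
lam_min, lam_max = sym_eigs_2x2(A[0][0], A[0][1], A[1][1])
norm_A = spectral_norm_2x2(A)
eig_bound = max(abs(lam_max), abs(lam_min))
print(norm_A, eig_bound)
```

The two printed values should agree up to rounding. Passing a non-symmetric matrix to `spectral_norm_2x2` is one way to search for the counterexample the exercise asks for.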
Exercise 1.
Exercise 2.
Exercise 3.
$$f(x_k) - f(x^*) \le \frac{2\beta \|x_0 - x^*\|_2^2}{k+1}.$$
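A bound of this form can be checked empirically on a simple instance. The sketch below assumes the bound refers to gradient descent with step size $1/\beta$ on a convex, $\beta$-gradient Lipschitz function; the surviving text does not state this. The test function is a diagonal quadratic $f(x) = \frac{1}{2}\sum_i a_i x_i^2$ with minimizer $x^* = 0$, and all names (`f`, `gradient_descent_gaps`, the values of `a` and `x0`) are hypothetical choices made for illustration.

```python
def f(x, a):
    """Diagonal quadratic f(x) = 0.5 * sum(a_i * x_i^2); its minimum value is 0 at x* = 0."""
    return 0.5 * sum(ai * xi * xi for ai, xi in zip(a, x))

def gradient_descent_gaps(a, x0, steps):
    """Run gradient descent with step size 1/beta and record f(x_k) - f(x*) for k = 0..steps."""
    beta = max(a)  # gradient Lipschitz constant of this quadratic
    x = list(x0)
    gaps = [f(x, a)]
    for _ in range(steps):
        # The gradient of f at x has entries a_i * x_i.
        x = [xi - (ai * xi) / beta for ai, xi in zip(a, x)]
        gaps.append(f(x, a))
    return gaps

a = [0.1, 1.0, 4.0]
x0 = [3.0, -2.0, 1.0]
beta = max(a)
dist_sq = sum(xi * xi for xi in x0)  # ||x_0 - x*||_2^2 with x* = 0
gaps = gradient_descent_gaps(a, x0, 50)
bound_holds = all(gap <= 2 * beta * dist_sq / (k + 1) for k, gap in enumerate(gaps))
print(bound_holds)
```

A passing check on one instance is evidence, not a proof; the bound is typically loose on well-conditioned quadratics.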
Exercise 4.
• Is the function $\beta$-gradient Lipschitz for some $\beta$?
• If the function is $\beta$-gradient Lipschitz, give an upper bound on $\beta$; the bound should be
within a factor 4 of the true value.
Part B. Prove that there is a value $L \in \mathbb{R}$ such that for all $x \in \mathbb{R}^n$, we have $f(x) \ge L$. In other
words, the function is bounded below.
Part C. Prove that $f$ is strictly convex as per Definition 3.2.8 in Chapter 3. Prove also that the
minimizer $x^* \in \arg\min_{x \in \mathbb{R}^n} f(x)$ of $f$ is unique.
Part D. Let $x_0$ be a given starting point and $x^*$ be the minimizer of $f$. Suppose we have
an algorithm DecentDescent which takes a starting point $x_0$ and a step count $t \in \mathbb{N}$.
DecentDescent($x_0$, $t$) runs for $t$ steps and returns $\tilde{x} \in \mathbb{R}^n$ such that
$$f(\tilde{x}) - f(x^*) \le \frac{\gamma \|x_0 - x^*\|_2^2}{t+1},$$
where $\gamma > 0$ is a positive number.
Assume that the cost of running DecentDescent for $t$ steps is $t$. Explain how, with a total cost
of at most $\frac{8\gamma}{\mu} \log(\|x_0 - x^*\|_2 / \delta)$, we can produce a point $\hat{x} \in \mathbb{R}^n$ such that $\|\hat{x} - x^*\|_2 \le \delta$ for
$\delta > 0$.
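One way to see where a cost of this shape can come from is a restart scheme, sketched below under several assumptions that the surviving text does not confirm: $f$ is $\mu$-strongly convex (suggested by the $\mu$ in the cost bound), the logarithm is base 2 up to rounding, and an upper bound on $\|x_0 - x^*\|_2$ is known. Strong convexity gives $\frac{\mu}{2}\|\tilde{x} - x^*\|_2^2 \le f(\tilde{x}) - f(x^*)$, so choosing $t + 1 \ge 8\gamma/\mu$ at least halves the distance to $x^*$ in each round. The stand-in for DecentDescent here is plain gradient descent on a diagonal quadratic with $\gamma = 2\beta$; `make_quadratic` and `restarted` are hypothetical names, not part of the exercise.

```python
import math

def make_quadratic(a):
    """Build a stand-in DecentDescent for f(x) = 0.5 * sum(a_i * x_i^2), minimized at x* = 0."""
    mu, beta = min(a), max(a)

    def decent_descent(x0, t):
        # Stand-in only: t gradient steps with step size 1/beta (assumed gamma = 2 * beta).
        x = list(x0)
        for _ in range(t):
            x = [xi - (ai * xi) / beta for ai, xi in zip(a, x)]
        return x

    return decent_descent, mu, 2.0 * beta

def restarted(decent_descent, x0, dist0, mu, gamma, delta):
    """Restart decent_descent until the guaranteed distance bound is at most delta."""
    # t + 1 >= 8 * gamma / mu, while the per-round cost t stays at most 8 * gamma / mu.
    t = math.ceil(8.0 * gamma / mu) - 1
    x, dist_bound, cost = list(x0), dist0, 0
    while dist_bound > delta:
        x = decent_descent(x, t)
        cost += t
        dist_bound /= 2.0  # each round at least halves the distance to x*
    return x, cost

a = [1.0, 2.0, 4.0]
x0 = [3.0, -2.0, 1.0]
delta = 1e-3
dd, mu, gamma = make_quadratic(a)
dist0 = math.sqrt(sum(xi * xi for xi in x0))  # exact here because x* = 0
x_hat, cost = restarted(dd, x0, dist0, mu, gamma, delta)
final_dist = math.sqrt(sum(xi * xi for xi in x_hat))
cost_bound = (8.0 * gamma / mu) * math.ceil(math.log2(dist0 / delta))
print(final_dist <= delta, cost <= cost_bound)
```

The sketch tracks only the guaranteed distance bound, not the true distance, so it never needs to know $x^*$ beyond the initial estimate `dist0`. Whether this matches the intended solution is left to the reader to verify.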
Part E. Consider a function $h : \mathbb{R}^n \to \mathbb{R}$ which is both $\mu$-strongly convex and $\beta$-gradient Lipschitz.
Give an algorithm that returns x 0 with
h(x 0 ) − h(x ∗ ) ≤