Gradient and Lagrange multipliers
Prof. S. Boyd
Derivative

Suppose f : R^n → R^m. The derivative (or Jacobian) of f at a point x ∈ R^n is the matrix Df(x) ∈ R^{m×n} with entries

\[
(Df(x))_{ij} = \left.\frac{\partial f_i}{\partial x_j}\right|_x,
\qquad i = 1, \ldots, m, \quad j = 1, \ldots, n.
\]
Example. Consider the function f : R^3 → R^2 given by f(x) = (x_1 + x_2^2, x_1 x_3). Its derivative is

\[
Df(x) = \begin{bmatrix} 1 & 2x_2 & 0 \\ x_3 & 0 & x_1 \end{bmatrix}.
\]

At the point x = (1, 0, 1), for example, we have f(x) = (1, 1) and

\[
Df(x) = \begin{bmatrix} 1 & 0 & 0 \\ 1 & 0 & 1 \end{bmatrix}.
\]
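As a quick numerical sanity check (an addition, not part of the original notes; the helper names and step size are my own choices), the sketch below approximates Df(x) by finite differences for the example above and compares it with the formula:

import numpy as np

def f(x):
    # the example function above: f(x) = (x1 + x2^2, x1*x3)
    return np.array([x[0] + x[1]**2, x[0] * x[2]])

def Df(x):
    # Jacobian from the formula above
    return np.array([[1.0, 2 * x[1], 0.0],
                     [x[2], 0.0, x[0]]])

def finite_diff_jacobian(f, x, h=1e-6):
    # (Df(x))_ij ~= (f_i(x + h*e_j) - f_i(x)) / h  (forward differences)
    m, n = f(x).size, x.size
    J = np.zeros((m, n))
    for j in range(n):
        e = np.zeros(n)
        e[j] = h
        J[:, j] = (f(x + e) - f(x)) / h
    return J

x = np.array([1.0, 0.0, 1.0])
print(Df(x))                       # [[1. 0. 0.], [1. 0. 1.]]
print(finite_diff_jacobian(f, x))  # approximately the same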
Gradient
For f : R^n → R, the gradient at x ∈ R^n is denoted ∇f(x) ∈ R^n, and it is defined as ∇f(x) = Df(x)^T, the transpose of the derivative. In terms of partial derivatives, we have

\[
(\nabla f(x))_i = \left.\frac{\partial f}{\partial x_i}\right|_x,
\qquad i = 1, \ldots, n.
\]
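As a small added illustration (this particular function is my own, not from the notes): for h : R^3 → R with h(x) = x_1 + x_2^2 x_3, the derivative is a row vector and the gradient is its transpose, a column vector:

\[
Dh(x) = \begin{bmatrix} 1 & 2x_2 x_3 & x_2^2 \end{bmatrix},
\qquad
\nabla h(x) = \begin{bmatrix} 1 \\ 2x_2 x_3 \\ x_2^2 \end{bmatrix}.
\]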
Minimizing a function
Suppose f : R^n → R, and we want to choose x so as to minimize f(x). Assuming f is differentiable, any optimal x (and it's possible that there isn't an optimal x) must satisfy ∇f(x) = 0. The converse is false: ∇f(x) = 0 does not mean that x minimizes f. Such a point is a stationary point, which could be a saddle point, a maximum of f, or a local minimum. We refer to ∇f(x) = 0 as an optimality condition for minimizing f. It is necessary, but not sufficient, for x to minimize f.

We use this result as follows. To minimize f, we find all points that satisfy ∇f(x) = 0. If there is a point that minimizes f, it must be one of these.
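A small added illustration of why the condition is not sufficient (my example, not from the notes): for f(x) = x_1^2 - x_2^2, the optimality condition

\[
\nabla f(x) = \begin{bmatrix} 2x_1 \\ -2x_2 \end{bmatrix} = 0
\]

is satisfied only at x = 0, which is a saddle point, not a minimizer; in fact f has no minimizer at all, since it is unbounded below.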
Example: Least-squares. Suppose we want to choose x ∈ R^n to minimize ‖Ax − b‖, where A ∈ R^{m×n} is skinny and full rank. This is the same as minimizing f(x) = (1/2)‖Ax − b‖^2. The optimality condition is

\[
\nabla f(x) = A^T A x - A^T b = 0.
\]

Since A is skinny and full rank, A^T A is invertible, so only one value of x satisfies this equation: x_ls = (A^T A)^{-1} A^T b.
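As an aside (a short derivation added here for completeness), the gradient formula follows by expanding

\[
f(x) = (1/2)\|Ax - b\|^2 = (1/2)\, x^T A^T A x - b^T A x + (1/2)\, b^T b
\]

and using ∇((1/2) x^T P x) = Px for symmetric P = A^T A, together with ∇(q^T x) = q for q = A^T b.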
We have to use other methods to determine that f is actually minimized (and not, say, maximized) by x_ls. Here is one method. For any z, we have

\[
(Az)^T (A x_{ls} - b) = z^T (A^T A x_{ls} - A^T b) = 0,
\]

since x_ls satisfies the optimality condition above. In other words, Az ⊥ (Ax_ls − b) for every z. Using this orthogonality, for any z we have

\[
\|A(x_{ls} + z) - b\|^2 = \|A x_{ls} - b\|^2 + \|Az\|^2 \geq \|A x_{ls} - b\|^2,
\]

so x_ls really does minimize f. With this argument, we really didn't need the optimality condition. But the optimality condition gave us a quick way to find the answer, if not verify it.
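Here is a small numerical sketch (my own, not part of the notes; it uses numpy, a random problem instance, and arbitrary variable names) that solves the normal equations and checks the optimality condition:

import numpy as np

rng = np.random.default_rng(0)
m, n = 20, 5
A = rng.standard_normal((m, n))   # skinny; full rank with probability one
b = rng.standard_normal(m)

# solve the normal equations A^T A x = A^T b
x_ls = np.linalg.solve(A.T @ A, A.T @ b)

grad = A.T @ A @ x_ls - A.T @ b   # optimality condition: should be ~0 (roundoff level)
print(np.linalg.norm(grad))

# cross-check against numpy's least-squares solver
x_ref, *_ = np.linalg.lstsq(A, b, rcond=None)
print(np.linalg.norm(x_ls - x_ref))  # should be tiny

Solving the normal equations directly is fine for a well-conditioned A; np.linalg.lstsq is the more numerically robust route and is used above only as a cross-check.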
Lagrange multipliers
Suppose we want to solve the constrained optimization problem
minimize   f(x)
subject to g(x) = 0,

where f : R^n → R and g : R^n → R^p.
Lagrange introduced an extension of the optimality condition above for problems with constraints. We first form the Lagrangian

\[
L(x, \lambda) = f(x) + \lambda^T g(x),
\]

where λ ∈ R^p is called the Lagrange multiplier. The (necessary, but not sufficient) optimality conditions are

\[
\nabla_x L(x, \lambda) = 0,
\qquad
\nabla_\lambda L(x, \lambda) = g(x) = 0.
\]
These two conditions are called the KKT (Karush-Kuhn-Tucker) equations. The second
condition is not very interesting; we already knew that the optimal x must satisfy g(x) = 0.
The first is interesting, however.
To solve the constrained problem, we attempt to solve the KKT equations. The optimal
point (if one exists) must satisfy the KKT equations.
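A tiny worked illustration (my own example, not from the notes): minimize f(x) = x_1^2 + x_2^2 subject to g(x) = x_1 + x_2 - 1 = 0. The Lagrangian is L(x, λ) = x_1^2 + x_2^2 + λ(x_1 + x_2 - 1), and the KKT equations

\[
\nabla_x L(x, \lambda) = \begin{bmatrix} 2x_1 + \lambda \\ 2x_2 + \lambda \end{bmatrix} = 0,
\qquad
x_1 + x_2 - 1 = 0
\]

give x_1 = x_2 = 1/2 and λ = -1. In this case the point found really is the minimizer: it is the closest point to the origin on the line x_1 + x_2 = 1.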
Example: Linearly constrained least-squares. Consider the linearly constrained least-squares problem (see lecture slides 8)

minimize   (1/2)‖Ax − b‖^2
subject to Cx − d = 0,

with A ∈ R^{m×n} and C ∈ R^{p×n}. The Lagrangian is

\[
\begin{aligned}
L(x, \lambda) &= (1/2)\|Ax - b\|^2 + \lambda^T (Cx - d) \\
              &= (1/2)\, x^T A^T A x - b^T A x + (1/2)\, b^T b + (C^T \lambda)^T x - \lambda^T d.
\end{aligned}
\]
The KKT equations are

\[
\nabla_x L(x, \lambda) = A^T A x - A^T b + C^T \lambda = 0,
\qquad
\nabla_\lambda L(x, \lambda) = Cx - d = 0.
\]

We can write these as a single set of linear equations in the variables x and λ:

\[
\begin{bmatrix} A^T A & C^T \\ C & 0 \end{bmatrix}
\begin{bmatrix} x \\ \lambda \end{bmatrix}
=
\begin{bmatrix} A^T b \\ d \end{bmatrix},
\]

so, assuming the coefficient matrix is invertible,

\[
\begin{bmatrix} x \\ \lambda \end{bmatrix}
=
\begin{bmatrix} A^T A & C^T \\ C & 0 \end{bmatrix}^{-1}
\begin{bmatrix} A^T b \\ d \end{bmatrix}.
\]
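Here is a numerical sketch of this formula (my own, not part of the notes; numpy only, with a random instance and arbitrary names), forming the block KKT matrix and checking both KKT equations:

import numpy as np

rng = np.random.default_rng(1)
m, n, p = 20, 6, 2
A = rng.standard_normal((m, n))
b = rng.standard_normal(m)
C = rng.standard_normal((p, n))
d = rng.standard_normal(p)

# block KKT system:  [A^T A  C^T] [x  ]   [A^T b]
#                    [  C     0 ] [lam] = [  d  ]
K = np.block([[A.T @ A, C.T],
              [C, np.zeros((p, p))]])
rhs = np.concatenate([A.T @ b, d])
sol = np.linalg.solve(K, rhs)
x, lam = sol[:n], sol[n:]

print(np.linalg.norm(A.T @ A @ x - A.T @ b + C.T @ lam))  # first KKT eq.: should be ~0
print(np.linalg.norm(C @ x - d))                          # constraint: should be ~0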
As in the least-squares example above, you have to use another argument to show that the x found this way actually minimizes f subject to Cx = d. We don't expect you to be able to come up with this argument, but here's how it goes. Suppose that z satisfies Cz = 0. Then, using the first KKT equation A^T A x − A^T b = −C^T λ,

\[
(Az)^T (Ax - b) = z^T (A^T A x - A^T b) = z^T (-C^T \lambda) = -(Cz)^T \lambda = 0,
\]

so Az ⊥ (Ax − b). Any point satisfying the constraint can be written as x + z with Cz = 0, and using exactly the same calculation as for least-squares above, we get

\[
\|A(x + z) - b\|^2 = \|Ax - b\|^2 + \|Az\|^2 \geq \|Ax - b\|^2,
\]

which shows that x does indeed minimize ‖Ax − b‖ subject to Cx = d.