Slides CO-course C4C Part I
Organization
1. Introduction.
General considerations. Motivation. Problem Formulation. Classes of problems. Issues in optimal control theory
4. Additional issues
Nonsmoothness. Degeneracy. Nondegenerate conditions with and without a priori normality assumptions.
Key References
• Pravin Varaiya, "Notes on Optimization", Van Nostrand Reinhold Company, 1972.
• Francis Clarke, "Optimization and Nonsmooth Analysis", John Wiley, 1983.
• Richard Vinter, "Optimal Control", Birkhäuser, 2000.
• João Sousa, Fernando Pereira, "A set-valued framework for coordinated motion control of networked vehicles", J. Comp. & Syst. Sci. Inter., Springer, 45, 2006, pp. 824-830.
• David Luenberger, "Optimization by Vector Space Methods", Wiley, 1969.
• Aram Arutyunov, "Optimality Conditions: Abnormal and Degenerate Problems", Kluwer Academic Publishers, 2000.
• Martino Bardi, Italo Capuzzo-Dolcetta, "Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations", Birkhäuser, 1997.
• Vladimir Girsanov, "Lectures on Mathematical Theory of Extremum Problems", Lect. Notes in Econom. and Math. Syst., 67, Springer Verlag, 1972.
• Francesco Borrelli, http://www.me.berkeley.edu/ frborrel/Courses.html
Applications
Useful for optimization problems with inter-temporal constraints.
o Management of renewable and non-renewable resources,
o Investment strategies,
o Management of financial resources,
o Resource allocation,
o Planning and control of production systems (manufacturing, chemical processes, ...),
o Planning and control of populations (cells, species),
o Definition of therapy protocols,
o Motion planning and control in autonomous mobile robotics,
o Aerospace navigation,
o Synthesis in decision support systems,
o Etc.
Fernando Lobo Pereira, João Tasso Borges de Sousa FEUP, Porto 5
Introduction to Optimal Control
(P ) Minimize g(x(1))
by choosing (x, u) : [0, 1] → IRn × IRm
satisfying : ẋ(t) = f (t, x(t), u(t)), [0, 1] L-a.e., (1)
x(0) = x0, (2)
u(t) ∈ Ω(t), [0, 1] L-a.e. (3)
A(1; (x0, 0)) - the set of points in IRn that can be reached at time 1 from x0 at time 0.
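The attainable set can be explored numerically by sampling admissible controls. A minimal sketch, assuming a hypothetical double-integrator instance of (1)-(3) with f(t, x, u) = (x2, u) and Ω(t) = [−1, 1] (this data is illustrative, not from the slides):

```python
import numpy as np

# Monte-Carlo sketch of A(1; (x0, 0)) for the hypothetical dynamics
# x_dot = (x2, u), u(t) in Omega(t) = [-1, 1] (a double integrator).
rng = np.random.default_rng(0)

def simulate(x0, u_values, dt):
    """Forward-Euler integration of x_dot = (x2, u) on [0, 1]."""
    x = np.array(x0, dtype=float)
    for u in u_values:
        x = x + dt * np.array([x[1], u])
    return x

n_steps, n_samples = 100, 500
dt = 1.0 / n_steps
x0 = [0.0, 0.0]
# Each random piecewise-constant control in [-1, 1] gives one admissible
# process; its endpoint x(1) is a point of A(1; (x0, 0)).
endpoints = np.array([simulate(x0, rng.uniform(-1, 1, n_steps), dt)
                      for _ in range(n_samples)])
print(endpoints.min(axis=0), endpoints.max(axis=0))
```

The printed componentwise bounds give a rough picture of the extent of the attainable set.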
General definitions
Dynamic System - System whose state variable conveys its past history: its future
evolution depends not only on the future inputs but also on the current value of
the state variable.
Trajectory - Solution of the differential equation (1) with the boundary condition
(2) for a given control function satisfying (3).
Admissible Control Process - A pair (x, u) satisfying the constraints (1)-(3).
Attainable set - A(1; (x0, 0)) is the set of state space points that can be reached
from x0 with admissible control strategies:
A(1; (x0, 0)) := {x(1) : (x, u) is an admissible control process}.
Boundary process - Control process whose trajectory (or a given function of it)
remains on the boundary of the attainable set.
Local/global minimum - Point at which the value of the objective function is no
greater than at any other feasible point within a neighborhood (local) or at any
other feasible point (global).
Types of Problems
o Bolza - g(x(1)) + ∫_0^1 L(s, x(s), u(s)) ds.
o Lagrange - ∫_0^1 L(s, x(s), u(s)) ds.
o Mayer - g(x(1)).
Other types of constraints besides the above:
o Mixed constraints - g(t, x(t), u(t)) ≤ 0, ∀t ∈ [0, 1].
o Isoperimetric constraints, ∫_0^1 h(s, x(s), u(s)) ds = a.
o Endpoints and intermediate state constraints, y(1) ∈ S.
o State constraints, hi(t, x(t)) ≤ 0 for all t ∈ [0, 1], i = 1, . . . , s.
Algorithms
Let V : [0, 1] × IRn → IR be a smooth function s.t., in a neighborhood of (t, x∗(t)),
V (1, z) = g(z),
V (0, x0) ≥ g(x∗(1)),
Vt(t, x∗(t)) − sup{Vx(t, x∗(t))f (t, x∗(t), u) : u ∈ Ω(t)} = 0, [0, 1] L-a.e., (7)
where x∗ is the solution to (1) with u = u∗ and x∗(0) = x0. Then the control process
(x∗, u∗) is optimal for (P ).
V - the solution to the Hamilton-Jacobi-Bellman equation (7) - is the verification
function, which under certain conditions coincides with the value function.
Although these conditions have a local character, there are results giving conditions of
a global nature.
New types of solutions - viscosity, proximal, Dini, ... - generalize the classical concept.
Maximum Principle
The control strategy u∗ is optimal for (P1) if and only if u∗(t) maximizes
v → pT (t)B(t)v, on Ω(t), [0, 1] L-a.e.,
where p : [0, 1] → IRn is an a.c. function s.t.:
−ṗT (t) = pT (t)A(t), [0, 1] L-a.e.
p(1) = c.
For this problem, the Maximum Principle is a necessary and sufficient condition.
Geometric Interpretation:
o Existence of a boundary control process associated with the optimal trajectory.
o The adjoint variable vector is perpendicular to the attainable set at the optimal
state value for all times.
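The maximum condition can be exercised numerically. A minimal sketch, assuming hypothetical time-invariant data A, B, c (not from the slides) and Ω(t) = [−1, 1], for which the maximizer of v → pT(t)Bv is the sign of pT(t)B:

```python
import numpy as np
from scipy.linalg import expm

# Hypothetical data: double integrator with terminal condition p(1) = c.
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
c = np.array([1.0, 1.0])

def adjoint(t):
    # -p_dot^T = p^T A with p(1) = c has the closed form
    # p^T(t) = c^T e^{A(1 - t)}.
    return c @ expm(A * (1.0 - t))

def u_star(t):
    # On Omega = [-1, 1], v -> p^T(t) B v is maximized by sign(p^T(t) B).
    return float(np.sign((adjoint(t) @ B)[0]))

controls = [u_star(t) for t in (0.0, 0.5, 1.0)]
print(controls)  # here p^T(t)B = 2 - t > 0, so u*(t) = 1 throughout
```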
Geometric Interpretation
[Figure: the optimal trajectory x∗(t), with adjoint vectors p(t1) and p(t2) normal to the attainable sets at x∗(t1) and x∗(t2).]
Fig. 2. Relation between the adjoint variable and the attainable set (inspired by [17]).
Proposition
Let cT x∗(1) ≥ cT z, ∀z ∈ A(1; (x0, 0)), with c ≠ 0, i.e.,
−pT (1) = c is perpendicular to A(1; (x0, 0)) at x∗(1) ∈ ∂A(1; (x0, 0)).
Then, ∀t ∈ [0, 1),
o x∗(t) ∈ ∂A(t; (x0, 0)),
o −pT (t) is perpendicular to A(t; (x0, 0)) at x∗(t).
Analytic Interpretation
It consists in showing that the switching function
σ : [0, 1] → IRm, σ(t) := −pT (t)B(t),
is the gradient of the objective functional J(u) := −cT x(1) with respect to the value
of the control function at time t, u(t).
By computing the directional derivative and using the time-response formula for the
linear dynamic system, we have:
J′(u; w) = ∫_0^1 σ(t)w(t) dt = ⟨∇uJ(u), w⟩.
Here, ∇uJ(u) : [0, 1] → IRm is the gradient of the cost functional w.r.t. the control,
and ⟨·, ·⟩ is the inner product in the function space.
o Express the optimality conditions as a function of the state variable at the final
time:
Check that {x∗(1)} and A(1; (x0, 0)) fulfill the conditions needed to apply a
separation theorem.
After showing the equivalence between optimality of the trajectory and its being
a boundary process,
observe that (cT x∗(1), x∗(1)) ∈ ∂{(z, y) : z ≥ cT y, y ∈ A(1; (x0, 0))}, and
write the condition of perpendicularity of the vector c to A(1; (x0, 0)).
o Express the conditions obtained above in terms of the control variable at each
instant of the given time interval by using the time-response formula.
In this step, the control maximum condition, the o.d.e. and the boundary
conditions satisfied by the adjoint variable are jointly obtained.
Example
Let t ∈ [0, 1], u(t) ∈ [−1, 1],
A = [ 0 1 0 ; 0 0 1 ; 6 −11 6 ], B = [ 0 ; 0 ; 1 ], and C = [ 1 0 0 ].
By writing e^{Aτ} = α0(τ)I + α1(τ)A + α2(τ)A², where τ = 1 − t, we get
σ(t) := pT (t)B = C e^{A(1−t)} B = α2(1 − t).
The eigenvalues of A - the roots of the characteristic polynomial of A,
p(λ) = det(λI − A) = 0 - are 1, 2 and 3. By the Cayley-Hamilton theorem,
α0(τ) + α1(τ) + α2(τ) = e^τ
α0(τ) + 2α1(τ) + 4α2(τ) = e^{2τ}
α0(τ) + 3α1(τ) + 9α2(τ) = e^{3τ}.
Thus,
α2(τ) = (e^{3τ} − 2e^{2τ} + e^τ)/2.
Since σ(t) = α2(1 − t) > 0 for all t ∈ [0, 1), we have u∗(t) = 1, ∀t ∈ [0, 1].
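The closed form for α2 can be checked directly against the matrix exponential; a small sketch using SciPy's expm with the example's matrices:

```python
import numpy as np
from scipy.linalg import expm

# sigma(t) = C e^{A(1-t)} B should equal alpha_2(1 - t).
A = np.array([[0.0, 1.0, 0.0],
              [0.0, 0.0, 1.0],
              [6.0, -11.0, 6.0]])
B = np.array([0.0, 0.0, 1.0])
C = np.array([1.0, 0.0, 0.0])

def alpha2(tau):
    # alpha_2(tau) = (e^{3 tau} - 2 e^{2 tau} + e^{tau}) / 2
    return (np.exp(3 * tau) - 2 * np.exp(2 * tau) + np.exp(tau)) / 2

for t in np.linspace(0.0, 1.0, 11):
    tau = 1.0 - t
    sigma = C @ expm(A * tau) @ B
    assert np.isclose(sigma, alpha2(tau))
    assert sigma >= 0.0  # nonnegative, and positive for t < 1
print("closed form confirmed")
```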
Maximum Principle
These conditions are necessary and sufficient.
Let (x∗, u∗) be an admissible control process for (P2), i.e., s.t. x∗(0) ∈ X0,
u∗(t) ∈ Ω(t) and x∗(1) ∈ X1. Then:
A) Necessity
If (x∗(0), u∗) is optimal, then ∃p : [0, 1] → IRn and λ ≥ 0 s.t.:
λ + ‖p(t)‖ ≠ 0, (8)
−ṗT (t) = pT (t)A(t), [0, 1] L-a.e., (9)
p(1) − λc is perpendicular to X1 at x∗(1), (10)
p(0) is perpendicular to X0 at x∗(0), (11)
u∗(t) maximizes the map v → pT (t)B(t)v on Ω(t), [0, 1] L-a.e. (12)
B) Sufficiency
If (8)-(12) hold with λ > 0, then (x∗(0), u∗) is optimal.
Geometric Interpretation
[Figure: the hyperplanes Sa and Sb separating the attainable set A(1; (X0, 0)) from the level sets of cT x and from the target set X1 at x∗(1).]
Fig. 2. Separation of the optimal state at the final time subject to affine constraints (inspired by [17]).
For all x ∈ A(1; (X0, 0)), we have, ∀z ∈ X0, ∀v ∈ A(1; (0, 0)):
(λc + q)T x = pT (1)x
= pT (1)[Φ(1, 0)z + v]
= pT (1)Φ(1, 0)[z − x∗(0)] + pT (1)Φ(1, 0)x∗(0) + pT (1)v
= pT (0)[z − x∗(0)] + pT (1)[Φ(1, 0)x∗(0) + v],
Note that the first term is null and that Φ(1, 0)x∗(0) + v in the second one is a point of
A(1; (x∗(0), 0)) ⊂ A(1; (X0, 0)).
Thus, (λc + q)T x ≤ pT (1)x∗(1) = (λc + q)T x∗(1).
Since q is perpendicular to X1 at x∗(1), for x ∈ A(1; (X0, 0)) ∩ X1 this yields
λcT x ≤ λcT x∗(1). Hence the sufficiency.
Example - Formulation
Minimize cT x(1) + α0 ∫_0^1 u(t) dt
where ẋ(t) = Ax(t) + Bu(t), [0, 1] L-a.e.
x1(0) + x2(0) = 0
x1(1) + 3x2(1) = 1
u(t) ∈ [0, 1], [0, 1] L-a.e.,
where α0 > 0, A = [ 0 1 ; −2 3 ], B = [ 0 ; 1 ], and c = [ 1 ; 1 ].
a) Determine the values of α0 for which there exist optimal control switches within the
time interval [0, 1].
b) Determine the switching function as a function of α0.
For a given λ ∈ {0, 1}, the system of equations - p(1) − λc is perpendicular to X1,
p(0) is perpendicular to X0, and pT (0) = pT (1)e^A - fully determines the adjoint variable.
Let λ = 1. Thus, we have
e^{At} = [ 2e^t − e^{2t} , e^{2t} − e^t ; −2(e^{2t} − e^t) , 2e^{2t} − e^t ],
p(1) = [ 1 + p1 ; 1 + 3p1 ], and p(0) = (e^A)T p(1) = [ p0 ; p0 ].
Note that these last two relations determine p0 and p1.
To put the problem in the canonical form, add a component to the state variable; the
maximum condition then becomes:
u∗(t) maximizes, over v ∈ [0, 1], the map v → [pT (1)e^{A(1−t)}B − α0]v.
There exists an interval of values of α0 for which the switching point is in (0, 1).
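The closed form of e^{At} used above can be verified numerically; a quick sketch with the example's A:

```python
import numpy as np
from scipy.linalg import expm

# A has eigenvalues 1 and 2, so e^{At} = alpha_0(t) I + alpha_1(t) A with
# alpha_0 = 2 e^t - e^{2t} and alpha_1 = e^{2t} - e^t.
A = np.array([[0.0, 1.0], [-2.0, 3.0]])

def expA(t):
    e1, e2 = np.exp(t), np.exp(2 * t)
    return np.array([[2 * e1 - e2, e2 - e1],
                     [-2 * (e2 - e1), 2 * e2 - e1]])

for t in (0.0, 0.3, 1.0):
    assert np.allclose(expm(A * t), expA(t))
print("e^{At} formula confirmed")
```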
(P3) Minimize T
by choosing (x, u) : [0, T ] → IRn × IRm such that:
ẋ(t) = A(t)x(t) + B(t)u(t), [0, T ] L-a.e.,
x(0) = x0 ∈ IRn,
x(T ) ∈ O(T ) ⊂ IRn,
u(t) ∈ Ω(t), [0, T ] L-a.e.,
where T is the final time and the multifunction O : [0, T ] → P(IRn) defines the target
to be attained in minimum time, P(IRn) being the set of subsets of IRn.
Typically, this multifunction is continuous and takes compact sets as values. For
example, O(t) = {z(t)}, where z : [0, T ] → IRn is a continuous function.
Generalization: Objective function defined by g(t0, x(t0), t1, x(t1)); Terminal
Constraints given by (t0, x(t0), t1, x(t1)) ∈ O ⊂ IR2(n+1).
Geometric Interpretation
[Figure: the optimal trajectory x∗(t) and the moving target z(t), with attainable sets A(t1; (x0, 0)) and A(t∗; (x0, 0)); at the minimum time, x∗(t∗) = z(t∗).]
The optimal state at t∗ is the intersection of sets O(t∗) and A(t∗; (x0, 0)), and, thus,
necessarily in the boundary of both sets.
Time t∗ is given by
inf{ t > 0 : O(t) ∩ A(t; (x0, 0)) = {x∗(t)}}.
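The infimum above can be approximated by scanning t. A minimal sketch for a hypothetical scalar example (not from the slides): ẋ = u with |u| ≤ 1, so A(t; (x0, 0)) = [x0 − t, x0 + t], and a fixed target O(t) = {z}:

```python
import numpy as np

# Minimum-time sketch: x_dot = u, |u| <= 1, fixed target O(t) = {z}.
x0, z = 0.0, 1.0

def target_attainable(t):
    # O(t) intersects A(t; (x0, 0)) = [x0 - t, x0 + t] iff |z - x0| <= t.
    return abs(z - x0) <= t

ts = np.linspace(0.0, 2.0, 2001)
t_star = next(t for t in ts if t > 0 and target_attainable(t))
print(t_star)  # close to the true minimum time t* = 1
```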
Maximum Principle
Let (t∗, u∗) be optimal.
Then, there exists h ∈ IRn and an a.c. p : [0, t∗] → IRn s.t.
‖p(t)‖ ≠ 0, (17)
−ṗT (t) = pT (t)A(t), [0, t∗] L-a.e., (18)
p(t∗) = h, (19)
u∗(t) maximizes v → pT (t)B(t)v on Ω(t), [0, t∗] L-a.e., (20)
x∗(t∗) maximizes z → hT z on O(t∗). (21)
Deduction: By geometric considerations, ∃h ∈ IRn, h ≠ 0, simultaneously
perpendicular to A(t∗; (x0, 0)) and to O(t∗) at x∗(t∗), i.e., ∀z ∈ O(t∗) and
∀x ∈ A(t∗; (x0, 0)),
hT z ≤ hT x∗(t∗) ≤ hT x.
From here, we obtain (21); then, by expressing x and x∗(t∗) as the state responses at
t∗ to an arbitrary admissible control and to the optimal control, respectively, we
obtain (20) and the remaining optimality conditions.
(P4) Minimize (1/2) xT (1)Sx(1) + (1/2) ∫_0^1 [ xT (t)Q(t)x(t) + uT (t)R(t)u(t) ] dt,
by choosing (x, u) : [0, 1] → IRn × IRm such that:
ẋ(t) = A(t)x(t) + B(t)u(t), [0, 1] L-a.e.,
x(0) = x0 ∈ IRn.
S ∈ IRn×n and Q(t) ∈ IRn×n are positive semi-definite, ∀t ∈ [0, 1], and
R(t) ∈ IRm×m is positive definite, ∀t ∈ [0, 1].
Optimality Conditions.
The solution to (P4) is given by
u∗(t) = −R−1(t)B T (t)S(t)x∗(t)
where S(·) is solution to the Riccati equation:
−Ṡ(t) = AT (t)S(t) + S(t)A(t) − S(t)B(t)R−1(t)B T (t)S(t) + Q(t), ∀t ∈ [0, 1], (22)
S(1) = S. (23)
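Equation (22) can be integrated backward in time from (23) with standard ODE tools. A minimal sketch for a hypothetical scalar instance (A = 0, B = Q = R = 1, S(1) = 0; not from the slides), whose analytic solution is S(t) = tanh(1 − t):

```python
import numpy as np
from scipy.integrate import solve_ivp

def riccati_rhs(t, S):
    A, B, Q, R_inv = 0.0, 1.0, 1.0, 1.0
    # -S_dot = A^T S + S A - S B R^{-1} B^T S + Q  (scalar case)
    return -(2 * A * S - (S * B) ** 2 * R_inv + Q)

# Integrate backward in time from the terminal condition S(1) = 0.
sol = solve_ivp(riccati_rhs, (1.0, 0.0), [0.0], dense_output=True,
                rtol=1e-8, atol=1e-10)
for t in (0.0, 0.5, 1.0):
    assert np.isclose(sol.sol(t)[0], np.tanh(1.0 - t), atol=1e-5)
print("S(0) =", sol.sol(0.0)[0])  # ~ tanh(1) ≈ 0.7616
```

The Kalman gain K(t) = R−1(t)BT(t)S(t) of observation (b) then follows pointwise from the computed S(·).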
Observations:
(a) The optimal control is defined as a linear state feedback law.
(b) The Kalman gain, K(t) := R−1(t)B T (t)S(t), can be computed a priori.
Exercise: Given ‖a‖²_P := aT P a, show that the cost function on [t, 1] is:
(1/2) xT (t)S(t)x(t) + (1/2) ∫_t^1 ‖R−1(s)BT (s)S(s)x(s) + u(s)‖²_{R(s)} ds.
Obviously, for u∗ the above integrand vanishes, so the optimal cost on [0, 1]
equals (1/2) x0T S(0)x0.
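This identity can also be checked numerically. A sketch on the same hypothetical scalar instance (A = 0, B = Q = R = 1, S(1) = 0, for which S(t) = tanh(1 − t); data not from the slides), with an arbitrary constant control:

```python
import numpy as np

# Verify: J(u) = (1/2) x0^2 S(0) + (1/2) ∫_0^1 (S x + u)^2 dt
# for the scalar instance A = 0, B = Q = R = 1, S(t) = tanh(1 - t).
dt = 1e-4
ts = np.arange(0.0, 1.0, dt)
x0, u = 1.0, 0.5                       # arbitrary admissible control

x = x0 + u * ts                        # x_dot = u  =>  x(t) = x0 + u t
S = np.tanh(1.0 - ts)
J = 0.5 * np.sum(x**2 + u**2) * dt     # terminal term vanishes: S(1) = 0
rhs = 0.5 * x0**2 * np.tanh(1.0) + 0.5 * np.sum((S * x + u)**2) * dt
print(J, rhs)  # the two values agree up to discretization error
```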
Example - Formulation
Let us consider the following dynamic system:
ẋ(t) = Ax(t) + Bu(t), [0, 1] L-a.e.,
x(0) = x0,
y(t) = Cx(t),
with the following objective function
(1/2) xT (T )Sx(T ) + (1/2) ∫_0^T [ ‖y(t) − yr (t)‖² + uT (t)R(t)u(t) ] dt,
where S and R(t) are symmetric, positive definite matrices.
a) Write down the maximum principle conditions for this problem.
b) Derive the optimal control in a state feedback form when yr (t) = Cxr (t),
xr (0) = x0 and ẋr (t) = Ar xr (t).
Example - Solution
a) Let
H(t, x(t), u(t), p(t)) := pT (t)[Ax(t) + Bu(t)] + (1/2)[ ‖y(t) − yr (t)‖² + uT (t)R(t)u(t) ],
where p : [0, T ] → IRn satisfies:
p(T ) = Sx∗(T ),
−ṗ(t) = AT p(t) + C T C[x∗(t) − xr (t)].
Additional Bibliography
• Pontryagin, L., Boltyanskii, V., Gamkrelidze, R., Mishchenko, E., "The Mathematical Theory of Optimal Processes", Pergamon-Macmillan, 1964.
• Anderson, B., Moore, J., "Linear Optimal Control", Prentice-Hall, 1971.
• Athans, M., Falb, P., "Optimal Control", McGraw Hill, 1966.
• Aubin, J., Cellina, A., "Differential Inclusions: Set-Valued Maps and Viability Theory", Springer-Verlag, 1984.
• Bryson, A., Ho, Y., "Applied Optimal Control", Hemisphere, 1975.
• Clarke, F.H., Ledyaev, Yu.S., Stern, R.J., Wolenski, P.R., "Nonsmooth Analysis and Control Theory", Springer, 1998.
• Grace, A., "Optimization Toolbox, User's Guide", The MathWorks Inc., 1992.
• Lewis, F., "Optimal Control", John Wiley & Sons, 1987.
• Macki, J., Strauss, A., "Introduction to Optimal Control Theory", Springer-Verlag, 1981.
• Monk, J. et al., "Control Engineering, Unit 15 - Optimal Control", 1978.
• Neustadt, L., "Optimization, A Theory of Necessary Conditions", Princeton University Press, 1976.
• Pesch, H., Bulirsch, R., "The Maximum Principle, Bellman's Equation and Carathéodory's Work", Historical Paper in J. of Optimization Theory and Applications, Vol. 80, No. 2, pp. 199-225, 1994.
• Tu, P., "Introductory Optimization Dynamics", Springer-Verlag, 1984.