Lie_group
Lie_group
Lie-group methods
in geometric numerical integration
Arieh Iserles
1
OUTLINE
4. Runge–Kutta–Munthe-Kaas methods
6. Multivariate quadrature
The linear space of all tangents at p is the tangent space TM|p, while
TM = ∪p∈MTM|p
is the tangent bundle.
6
The cotangent space T⋆M|p consists of all linear functionals acting on ele-
ments of TM|p.
The set of all vector fields over M is denoted by X(M) and, clearly, it is a
linear space.
y ′ = F (y), t ≥ 0, y(0) = y0 ∈ M,
where F ∈ X(M), evolves on the manifold M.
7
Therefore
d
F (y) = Ψt,F (y0)|t=0.
dt
In other words, F is the infinitisimal generator of the flow.
8
If flows commute then Φs,t(y) = y, but this isn’t in general true.
Φs,ts (y0 )
· ·
* i
PP
PP
Ψs,F · · ·· PPs
y0
· ·
·
··
· ··
··
s
6
··
Ψt,G
Ψt,G
Ψs,F
· · · · · · · · · · · · · · · · · · ·- s
s
9
Note that for every F, G, H ∈ X(M) and scalar α we have the following fea-
tures of the bracket operation:
Skew symmetry: [F, G] = −[G, F ];
Bilinearity:
• The projective space Pn of all lines in Rn that pass through the origin.
11
Lie groups
Lie group G is smooth manifold, endowed with group structure which is con-
tinuous with respect to the topology of the manifold.
SL(n, R): real n × n matrices with unit determinant (the special linear group);
SU(n, C): complex n × n unitary matrices with unit determinant (the special
unitary group);
E(n, R): The Euclidean group of all translations and length-preserving linear
transformations in Rn;
A(n, R): The affine group of all translations and area-preserving linear trans-
formations in Rn.
12
Why are Lie groups useful?
Lie groups are important since they provide an appropriate formalism to in-
vestigate symmetries, invariants and qualitative behaviour of differential equa-
tions.
There are many other good reasons why Lie groups are important. We’ll see one (group
actions on manifolds) but there are many others, not least in number theory.
Many physical laws are conveniently formulated with built-in Lie-group sym-
metries.
For example, laws of motion have SO(3) symmetry, equations of special rela-
tivity evolve in SO4,1 and theory of superstrings can be formulated in SU(32)
and in E(8) × E(8).
Let G be a Lie group and g = TG|I the tangent space at identity. Since G is a
group, it follows at once that
TG|X = gX for all X ∈ G.
Since (as we have already seen) there is a natural isomorphism between
(finite-dimensional) linear vector fields and square matrices, we deduce that
14
Lie algebras
We say that the abstract set g is a Lie algebra if it is a linear space, which in
addition is closed under the binary operation [ · , · ] : g × g → g which obeys
the following axioms:
1. Linearity: For every a, b, c ∈ g and scalars α, β it is true that
[αa + βb, c] = α[a, c] + β[b, c].
Lie algebras can be fairly strange objects and not all of them originate as tangent spaces
of Lie groups. A nice example is the modular group SL(n, Z) of integer matrices with unit
determinant, which is important in number theory.
15
Examples of Lie algebras
{x ∈ GL(n, R) : xpx⊤ = p}
the corresponding Lie algebra is
{a ∈ gl(n, R) : ap + pa⊤ = 0.
16
Group actions
As a matter of fact, all previous examples of manifolds are subject to transitive group action:
every point in M is reachable from any other point via the group action. In that case M is a
homogeneous manifold.
17
Differential equations and group actions
Suppose that G is a matrix group. Then, for a ∈ g the flow of λ∗, i.e.
y ′ = λ∗(a)(y), t ≥ 0, y(0) = y0 ∈ M,
can be expressed in the form
18
This can be generalized from fixed a ∈ g to a sufficiently smooth function
a : R+ × M → g: The solution of the differential equation
Therefore,
instead of solving the equation in M, i.e. finding yN +1 given yN , say, find a
group action that takes yN to yN +1.
We conclude that
if we can devise a numerical method that respects a Lie-group structure then
it can be extended to respect every homogeneous manifold structure acted
upon by that group.
19
The main paradigm of Lie-group methods
?
pN DE on a Lie group pN +1
6
? DE on a Lie algebra
ωN - ωN +1
20
Some Lie-group methods do not follow this pattern:
• The method of Crouch & Grossman represents the equation in the form
P
y ′ = dl=1 αl (t)ql , using a rigid frame ql ∈ TM| y , l = 1, . . . , d, and
composes the solution from one-dimensional ‘steps’.
21
The exponential and trivializations
An equation evolving on a homogeneous manifold M can be written in the
form
Steps 1 and 5 are already clear from our discussion, at present we focus on steps 2–4.
24
A trivialization is a smooth mapping Φ : g → G such that Φ(0) = I. The main
idea is write the solution p of a Lie-group equation in the form
Formally,
Note that a trivialization might be valid only in (hopefully, large) neighbourhood of the origin:
this represents a possible restriction on the step size of the discretization method.
25
An obvious (and most useful) trivialization is exponentiation,
p(t) = eω(t)pN .
The outcome is the dexpinv equation
∞
X Bm
ω = dexp−1
ω a= adm
ω a, ω(tN ) = 0
m=0 m!
(Hausdorff), where
m times
z }| {
m
adω a = [ω, [ω, . . . , [ω, a] · · · ]]
and {Bm}m≥0 are Bernoulli numbers,
∞
X Bm m z 1 z2 − 1 z4 + 1 z 6.
z = z ≈1−1
2 z + 12 720 30240
m=0 m! e − 1
Note that the dexpinv equation is always nonlinear and consists of an infinite
sum. However, infinite sums can be truncated and nonlinearity is a worthwhile
price to pay.
26
Specifically, the dexpinv equation is
1 [ω, a] + 1 [ω, [ω, a]] − 1 [ω, [ω, [ω, [ω, a]]]]
ω′ = a − 2 12 720
1 [ω, [ω, [ω, [ω, [ω, [ω, a]]]]]] + · · · .
+ 30240
ω′ = a − 1
2 [ω, a] + 1 ωaω.
4
The Cayley trivialization has two advantages vis-á-vis the standard exponen-
tial: the equation is much simpler and the evaluation of Φ(ω) much cheaper, in
particular for large n. On the other hand, of course, it applies only to quadratic
Lie groups.
27
Runge–Kutta–Munthe-Kaas methods
We apply an RK method to the dexpinv equation, in place of the original Lie-
group equation. E.g., instead of the familiar 3rd-order scheme
k1 = a(tN , yN )yN
k2 = a(tN +1/2, yN + 12 hk 1 )(y N + 1 hk )
2 1
k3 = a(tN+1, yN − hk1 + 2hk2)(yN − hk1 + 2hk2)
1k + 2k + 1k )
∆ = h( 6 1 3 2 6 3
yN +1 = yN + ∆,
we use
k1 = a(tN , yN )
1 hk
k2 = a(tN +1/2 , e 2 1 yN )
k3 = a(tN +1, e−hk1+2hk2 yN )
1k + 2k + 1k )
∆ = h( 6 1 3 2 6 3
1 h[∆,k ]
∆+ 6
yN +1 = e 1 yN .
This is again a 3rd-order method, except that it is guaranteed to evolve on G.
28
• The beauty of this approach is that we can take any RK method and con-
vert it into a Lie-group method without any major effort.
• Implicit RK–MK methods can be used, but they require the solution of un-
derlying nonlinear algebraic equations. Owren & Welfert have presented
an extension of Newton’s method that respects Lie group structure but,
again, it ain’t cheap.
• Similar approach is valid for other trivializations, in particular for the dcay-
inv equations.
29
Magnus and Magnus-type expansions
Consider (for simplicity) the linear equation
• Rooted tree: The pair T = (G, w), where G is a tree and w (the root) is
one of its vertices.
31
It is easy to see that all terms in a Magnus expansion of ω ′ are of the form
Z
simpler term, another simpler term .
This can be expressed by recursion (the proof follows by Picard iteration):
1. The tree u
corresponds to a(t);
Let Tm be the set of trees with m vertical lines (i.e., corresponding to terms
with m integrals). Then
∞
X X Z t
ω(t) = α(τ ) Hτ (ξ)dξ,
m=0 τ ∈Tm 0
where the constants α(τ ) are obtained recursively: Any τ ∈ Tm, m ≥ 1, can
be written uniquely in the form
τs
u u
@ @u
τ2 q
τ1 u u
q
q
u @@u
@
@u ,
where
α( u) = 1,
s
Bs Y
α(τ ) = α(τl ).
s! l=1
33
AI, Nørsett & Rasmussen: Appropriately truncated “by power”, a Magnus ex-
pansion yn+1 = Υhyn is time symmetric, i.e. Υh ◦ Υ−h = Id. Therefore
Υh = eΨh , where Ψh is an odd function and it follows that such an expan-
sion is always of an even order – if we truncate it to produce odd order, we
gain for free an extra unit of order.
2
u u +4 u + 12 u − 24 u
8 u − 24 u .
Note that we truncate the expansion by throwing away high-order terms (trun-
cation “by power”), rather than trees of sufficiently high m. This results in much
smaller number of surviving trees.
34
Convergence of the Magnus expansion
Original proof by Blanes, Casas, Oteo & Ros. Shorter proof by Moan: Integra-
tion & triangle inequality imply that
Z t Z t X
∞
−1 |Bk |
kω(t)k ≤ kdexpω(ξ)a(ξ)kdξ ≤ (2kω(ξ)k)k a(ξ)dξ
0 0 k=0 k!
Z t
= g(2kω(ξ)k)ka(ξ)kdξ,
0
where g(x) = 2 + 2 x (1 − cot x ). Bihari-type inequality: Let h, g, v ∈ C[0, t∗ )
2 Rt
positive, g nondecreasing. Then h(t) ≤ 0 g(h(ξ))v(ξ)dξ implies
Z t Z x
dξ
h(t) ≤ g̃ −1 v(ξ)dξ , g̃(x) = .
0 0 g(ξ)
Letting h(t) = 2kω(t)k, v(t) = ka(t)k completes the proof. 2
35
The Fer expansion
36
Also the Fer expansion can be expanded in rooted trees. Thus, for order 6,
just two terms are required and
ω0 : u
u
u u u u
@
u u u u u u @u
u u u u @
u u @u u u @@u
@ @ @
u @u u @u u @u
ω1 : 1 u
@
@u
u
1
+3 @
@u
1
+8 @
@u
1
+ 30 @
@u
2
37
The Cayley expansion
I+2 1 ω(t)
y(t) = cayω(t)y0 = 1
y0 , t ≥ 0,
I − 2 ω(t)
consequently
ω′ = a − 1
2 [ω, a] + 1 ωaω,
4 t ≥ 0, ω(0) = 0.
In line with the Magnus expansion, we can show that
Z t Z tZ ξ
1
ω(t) = a(ξ)dξ − 1
2 [a(ξ2), a(ξ1)]dξ2dξ1
0 0 0
Z tZ ξ Z ξ
1 1 2
+4 [[a(ξ3), a(ξ2)], a(ξ1)]dξ3dξ2dξ1
0 0 0
Z tZ ξ Z ξ
1 1
−1
4 a(ξ2)dξ2a(ξ1) a(ξ2)dξ2dξ1 + · · · .
0 0 0
This Cayley expansion can be also expanded in rooted trees, except that we
require slightly more complicated structures.
38
AI: Specifically, we have the following composition rules:
R
1. 0t a(ξ)dξ ;
u
u is a term.
2. If H ;τ is a term, then so is
Z t τ u
@
@u
[H(ξ), a(ξ)]dξ ; u .
0
We obtain u
u u
@
u @u u u
u u u u u u
@ @ @e
@u @u
ω(t) ;
u
1
−2 1
+4 −1 + ··· .
u u u
4 u
39
Multivariate quadrature
Each term in a Magnus, Fer or Cayley expansion corresponds to a multivariate
integral over a different polytope. In principle, multivariate integration is very
expensive: we need many function evaluations for every polytope. Fortunately,
everything simplifies!
All integrals in question are of the form
Z
I(h) = L(a(ξ1), a(ξ2), . . . , a(ξr ))dξ ,
hS
where L is a multilinear form and S is a polytope,
S = {x ∈ Rr : 0 ≤ xk ≤ xik , k = 1, 2, . . . , r},
with x0 = 1 and ik ≤ k − 1, x0 = 1, k = 1, 2, . . . , r. Specifically, for the first
few integrals we have the following polytopes S and multilinear forms L:
40
Tree r L(z1, . . . , zr ) S
u
u u 1 z1
u u @
@@u @
@
u u 2 [z2, z1] @
u u
@
@u
X
u u @X X
X
@ @ C
@u @ C
u
u 3 [[z3, z2], z1] @C
@
@
u u u
@
u @u @ @
@@u @ @
@
u 3 [z3, [z2, z1]] @
41
Theorem (AI & Nørsett) The order of quadrature is ν + s, where
Z 1
ζ i−1q(ζ)dζ = 0, i = 1, 2, . . . , s,
0
Qν
and q(x) = k=1(x − ck ).
The proof is long and complicated. We’ll prove instead a simpler statement.
For this (and for much future use) we require the
ℓk (ck ) = 1, ℓk (cj ) = 0, j 6= k.
Then
Z r
Y
βi = ℓik (ξk )dξ .
S k=1
GOOD NEWS
It takes just ν evaluations of the matrix a to compute all the integrals in the
expansion to order 2ν.
BAD NEWS
The numbers of commutators increases very fast, in line with the exponential
growth in the number of combinations in Cνr . Therefore, the volume of linear
algebra required in every time step becomes prohibitive for large orders.
44
Graded Lie algebras
The number of commutators can be vastly reduced by an idea due to Munthe-
Kaas & Owren.
45
Free Lie algebras
For future use we require the concept of a Free Lie algebra. Given generators
Φ = φ1, φ2, . . . , φν , we say that
F = FLA(φ1, φ2, . . . , φν )
is the free Lie algebra generated by Φ if it is the closure of the generators
under commutation and linear combinations. Trivially, it is a Lie algebra.
ν
We denote by Km the linear space of all terms in the FLA of grade m. Of
ν
course F = Sp{Km : m ≥ 1}, although we will not make much use of it.
ν
The surprisingly small dimension of Km is the key to our goal, to reduce the
number of commutators.
46
The great commutator throw-away
1. All terms of grade ≥ 2ν + 1 are not required and can be thrown out.
47
1. Then
s
ν 1 X X m/d
dim Km = µ(d) λi ,
m d|m i=1
• Eigenvalue/eigenvector decomposition:
Exceedingly expensive (in particular for large systems) and ill conditioned.
P
1. k bk = a, each bk lives in g;
3. The overall error is consistent with the order of the Lie-group method
2. Suppose that
1 21/(p+1)
α= , β=− .
2 − 21/(p+1) 2 − 21/(p+1)
Then Ψh is of order p + 2.
Let d = dim g and let {φ1, φ2, . . . , φd} be its basis. We seek scalar functions
θ1, θ2, . . . , θm s.t.
eta = eθ1(t)φ1 eθ2(t)φ2 · · · eθd(t)φd .
Such functions always exist but their practical evaluation in closed form is vir-
tually impossible for d ≥ 3 (Wei & Norman). Alternatively, we approximate the
functions θk to requisite order.
54
We have
θk (0) = 0,
θk′ (0) = αk ,
d l−1
X X
θk′′(0) = αl ckl,j αj ,
l=1 j=1
Xd l−1
X
θk′′′(0) = 2 ckl,j [θl′′(0)αj + θj′′(0)αl ]
l=1 j=1
X X j−1
d l−1 d
X X d X
X d l−1
X j
+2 cm c k αα α +
l,j i,m l j i cl,icki,j αl α2
i
l=1 j=1 i=1 m=1 l=1 j=1 i=1
and so on. More and more summation!
Assume, though, that the structure constants are sparse: they are almost all
zero. In that case the cost reduces a very great deal! This can be accom-
plished by choosing as our basis a root space decomposition of g: the number
of summations typically drops by a factor of two and the cost of a second-order
approximation in so(n) is just O(n).
55
Generalized polar decomposition (Munthe-Kaas, Quispel & Zanna)
k = {a ∈ g : dσ(a) = a},
p = {a ∈ g : dσ(a) = −a}.
While k is a Lie subalgebra, p is a Lie triple system: a linear space s.t.
It is possible to prove that the Lie group can be factorized into generalized
polar decomposition
G ∋ z = xy, σ(y) = y, σ(x) = x−1.
At the algebra level, this corresponds to
G ∋ ec = eaeb, a ∈ p, b ∈ k.
The main idea is to obtain the leading terms of a, b from p, k.
57
We have
1 [p, k]t2 − 1 [k, [p, k]]t3
a = pt − 2 6
+ ( 24 [p, [p, [p, k]]] − 24 [k, [k, [p, k]]])t + O t5 ,
1 1 4
b = kt − 12 [p, [p, k]]t + O t5 .
1 3
Suppose now that dim p is very small. In that case ea is very cheap to com-
pute. This is not the case with eb, but recall that k is a subalgebra! Hence, we
can go on splitting it!
59