2018 QFT
QUANTUM FIELDS
FRANÇOIS GELIS
2 Functional quantization
2.1 Path integral in quantum mechanics
2.2 Classical limit, Least action principle
2.3 More functional machinery
2.4 Path integral in scalar field theory
2.5 Functional determinants
2.6 Quantum effective action
2.7 Two-particle irreducible effective action
2.8 Euclidean path integral and Statistical mechanics
ii F. G ELIS – A S TROLL T HROUGH Q UANTUM F IELDS
Basics of Quantum
Field Theory
Special relativity plays a crucial role in quantum field theories1 . Various observers in
frames that are moving at a constant speed relative to each other should be able to
describe physical phenomena using the same laws of Physics. This does not imply
that the equations governing these phenomena are independent of the observer’s
frame, but that these equations transform in a constrained fashion –depending on the
nature of the objects they contain– under a change of reference frame.
Let us consider two frames $F$ and $F'$, in which the coordinates of a given event
are respectively $x^\mu$ and $x'^\mu$. A Lorentz transformation is a linear transformation such
that the interval $ds^2 \equiv dt^2 - d\boldsymbol{x}^2$ is the same in the two frames2. If we denote the
coordinate transformation by
\[ x'^\mu = \Lambda^\mu{}_\nu\, x^\nu , \tag{1.1} \]
1 An exception to this assertion is for quantum field models applied to condensed matter physics, where
the basic degrees of freedom are to a very good level of approximation described by Galilean kinematics.
2 The physical premises of special relativity require that the speed of light be the same in all inertial
frames, which implies solely that $ds^2 = 0$ be preserved in all inertial frames. The group of transformations
that achieves this is called the conformal group. In four space-time dimensions, the conformal group is 15
dimensional, and in addition to the 6 orthochronous Lorentz transformations it contains dilatations as well
as non-linear transformations called special conformal transformations.
\[ \Lambda^\mu{}_\nu = \delta^\mu{}_\nu + \omega^\mu{}_\nu \tag{1.5} \]
(with all components of $\omega$ much smaller than unity), this implies that
\[ \omega_{\mu\nu} = -\omega_{\nu\mu} \]
(with all indices down). Consequently, there are 6 independent Lorentz transformations,
three of which are ordinary rotations and three are boosts. Note that the
infinitesimal transformations (1.5) have a determinant3 equal to +1 (they are called
proper transformations), and do not change the direction of the time axis since
$\Lambda^0{}_0 = 1 \ge 0$ (they are called orthochronous). Any combination of such infinitesimal
transformations shares the same properties, and their set forms a subgroup of the full
group of transformations that preserve the Minkowski metric.
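The properties listed above (invariant interval, unit determinant, $\Lambda^0{}_0 \ge 1$) also hold for finite proper orthochronous transformations. The following sketch checks them numerically for a boost along the x axis; the velocity 0.6 and the test event are arbitrary values chosen for illustration, not taken from the text.

```python
import math

def boost_x(beta):
    """Matrix Lambda^mu_nu of a boost of velocity beta (c = 1) along the x axis."""
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return [
        [gamma, -gamma * beta, 0.0, 0.0],
        [-gamma * beta, gamma, 0.0, 0.0],
        [0.0, 0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0, 1.0],
    ]

def transform(lam, x):
    """x'^mu = Lambda^mu_nu x^nu, as in eq. (1.1)."""
    return [sum(lam[mu][nu] * x[nu] for nu in range(4)) for mu in range(4)]

def interval(x):
    """Minkowski interval t^2 - |x|^2 with signature (+, -, -, -)."""
    return x[0] ** 2 - x[1] ** 2 - x[2] ** 2 - x[3] ** 2

event = [2.0, 0.3, -1.1, 0.7]   # arbitrary test event (t, x, y, z)
lam = boost_x(0.6)
boosted = transform(lam, event)

# The matrix is block diagonal, so its determinant is that of the (t, x) block.
determinant = lam[0][0] * lam[1][1] - lam[0][1] * lam[1][0]

print(interval(event), interval(boosted), determinant, lam[0][0])
```

The two printed intervals agree up to rounding, the determinant is 1 (proper), and the time-time component is at least 1 (orthochronous).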
(The prefactor i/2 in the second term of the right hand side is conventional.) Since
the ωµν are antisymmetric, the generators Mµν can also be chosen antisymmetric.
By using eq. (1.7) for the Lorentz transformation $\Lambda^{-1}\Lambda'\Lambda$, we arrive at
indicating that Mµν transforms as a rank-2 tensor. When used with an infinitesimal
transformation Λ = 1 + ω, this identity leads to the commutation relation that defines
the Lie algebra of the Lorentz group
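\[ \big[M^{\mu\nu}, M^{\rho\sigma}\big] = i\big(g^{\mu\rho} M^{\nu\sigma} - g^{\nu\rho} M^{\mu\sigma}\big) - i\big(g^{\mu\sigma} M^{\nu\rho} - g^{\nu\sigma} M^{\mu\rho}\big) . \tag{1.10} \]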
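As a sanity check of eq. (1.10), one can test it on explicit matrices. The sketch below assumes the 4×4 matrices $(M^{\mu\nu})^\alpha{}_\beta = -i\,(g^{\mu\alpha}\delta^\nu{}_\beta - g^{\nu\alpha}\delta^\mu{}_\beta)$ acting on 4-vectors. This particular choice, including its overall sign, is an assumption made here for illustration; the text does not spell out the matrices.

```python
# Metric g^{mu nu} with signature (+, -, -, -).
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def generator(mu, nu):
    """Assumed matrices (M^{mu nu})^a_b = -i (g^{mu a} delta^nu_b - g^{nu a} delta^mu_b)."""
    return [[-1j * (G[mu][a] * (nu == b) - G[nu][a] * (mu == b))
             for b in range(4)] for a in range(4)]

def matmul(x, y):
    return [[sum(x[a][c] * y[c][b] for c in range(4)) for b in range(4)]
            for a in range(4)]

def commutator(x, y):
    xy, yx = matmul(x, y), matmul(y, x)
    return [[xy[a][b] - yx[a][b] for b in range(4)] for a in range(4)]

M = [[generator(mu, nu) for nu in range(4)] for mu in range(4)]

def rhs(mu, nu, rho, sig):
    """Right-hand side of eq. (1.10), entry by entry."""
    return [[1j * (G[mu][rho] * M[nu][sig][a][b] - G[nu][rho] * M[mu][sig][a][b])
             - 1j * (G[mu][sig] * M[nu][rho][a][b] - G[nu][sig] * M[mu][rho][a][b])
             for b in range(4)] for a in range(4)]

# Largest entry of [M, M] - rhs over all index combinations.
max_deviation = 0.0
for mu in range(4):
    for nu in range(4):
        for rho in range(4):
            for sig in range(4):
                lhs = commutator(M[mu][nu], M[rho][sig])
                target = rhs(mu, nu, rho, sig)
                for a in range(4):
                    for b in range(4):
                        max_deviation = max(max_deviation, abs(lhs[a][b] - target[a][b]))

print(max_deviation)
```

With the opposite overall sign for the matrices, the same loop would fail, which makes this a quick way to pin down sign conventions.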
When necessary, it is possible to divide the six generators Mµν into three generators
Ji for ordinary spatial rotations, and three generators Ki for the Lorentz boosts along
each of the spatial directions:
Then, we may define a generic one-particle state from those corresponding to the
reference momentum as follows
\[ \big|p,\sigma\big\rangle \equiv N_p\, U(L(p))\, \big|q,\sigma\big\rangle , \tag{1.18} \]
where Np is a numerical prefactor that may be necessary to properly normalize the
states. This definition leads to
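\[ U(\Lambda)\,\big|p,\sigma\big\rangle = N_p\, U\big(L(\Lambda p)\big)\, U\big(\underbrace{L^{-1}(\Lambda p)\,\Lambda\, L(p)}_{\Sigma}\big)\, \big|q,\sigma\big\rangle . \tag{1.19} \]
Note that the Lorentz transformation $\Sigma \equiv L^{-1}(\Lambda p)\,\Lambda\, L(p)$ maps $q^\mu$ into itself, and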
therefore belongs to the subgroup of the Lorentz group that leaves qµ invariant, called
the little group of qµ . Thus, when U(Σ) acts on the reference state, the momentum
remains unchanged and only the other quantum numbers may vary
\[ U(\Sigma)\,\big|q,\sigma\big\rangle = \sum_{\sigma'} C_{\sigma\sigma'}(\Sigma)\, \big|q,\sigma'\big\rangle . \tag{1.20} \]
Moreover, the coefficients Cσσ ′ (Σ) in the right hand side of this formula define a
representation of the little group,
\[ C_{\sigma\sigma'}(\Sigma_2\Sigma_1) = \sum_{\sigma''} C_{\sigma\sigma''}(\Sigma_2)\, C_{\sigma''\sigma'}(\Sigma_1) . \tag{1.21} \]
Massive particles : In the case of a massive particle, the little group is made of
the Lorentz transformations that leave the vector $q^\mu = (m, 0, 0, 0)$ invariant, which
is the group of all rotations in 3-dimensional space. The additional quantum number
σ is therefore a label that enumerates the possible states in a given representation of
SO(3). These representations correspond to the angular momentum, but since we are
in the rest frame of the particle, this is in fact the spin of the particle. For a spin s, the
dimension of the representation is 2s + 1, and σ takes the values −s, 1 − s, · · · , +s.
1. BASICS OF Q UANTUM F IELD T HEORY 5
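\[ U(\Sigma) \approx 1 - i\theta\, \underbrace{M^{12}}_{J_3} - i\alpha_1\, \underbrace{(M^{10} + M^{31})}_{K_1 + J_2 \equiv B_1} - i\alpha_2\, \underbrace{(M^{20} - M^{23})}_{K_2 - J_1 \equiv B_2} . \tag{1.23} \]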
Thus, the little group for massless particles is three dimensional, with generators J3
(the projection of the angular momentum in the direction of the momentum) and4
B1,2 . Using eq. (1.10), we have
\[ \big[J_3, B_1\big] = i B_2 , \quad \big[J_3, B_2\big] = -i B_1 , \quad \big[B_1, B_2\big] = 0 . \tag{1.24} \]
The last commutator implies that we may choose states that are simultaneous eigenstates
of $B_1$ and $B_2$. However, non-zero eigenvalues for $B_{1,2}$ would lead to a continuum
of states with the same momentum, that are not realized in Nature. The
remaining transformation, generated by J3 , can be viewed as a rotation about the
direction of the momentum, and the corresponding group is SO(2). Therefore, the
only eigenvalue that labels the massless states is that of J3 ,
The number σ is called the helicity of the particle. After a rotation of angle θ = 2π,
the state must return to itself (bosons) or its opposite (fermions), implying that the
helicity must be a half integer:
particle momentum, i.e. the transformations that shift the transverse velocity, vj → vj + δvj . The physical
reason for their appearance in the discussion of massless particles is time dilation: in the observer’s frame,
the transverse dynamics of a particle moving at the speed of light is infinitely slowed down by time dilation,
and is therefore non-relativistic (this intuitive idea can be further substantiated by light-cone quantization).
This formula just reflects the fact that the point x where the transformed field is
evaluated was located at the point $\Lambda^{-1}x$ before the transformation. The first derivative
$\partial_\mu\phi$ of the field transforms as a 4-vector,
where the bar in $\bar\partial_\nu$ indicates that we are differentiating with respect to the whole
argument of φ, i.e. $\Lambda^{-1}x$. Likewise, the second derivative $\partial_\mu\partial_\nu\phi$ transforms like a
rank-2 tensor, but the d’Alembertian $\Box\phi$ transforms as a scalar.
The operators creating or destroying particles with a given momentum p obey usual
commutation relations,
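\[ \big[a_{\boldsymbol p}, a_{\boldsymbol p}\big] = \big[a^\dagger_{\boldsymbol p}, a^\dagger_{\boldsymbol p}\big] = 0 , \quad \big[a_{\boldsymbol p}, a^\dagger_{\boldsymbol p}\big] \sim 1 . \tag{1.30} \]
(In the last commutator, the precise normalization will be defined later.) In contrast,
operators acting on different momenta always commute:
\[ \big[a_{\boldsymbol p}, a_{\boldsymbol q}\big] = \big[a^\dagger_{\boldsymbol p}, a^\dagger_{\boldsymbol q}\big] = \big[a_{\boldsymbol p}, a^\dagger_{\boldsymbol q}\big] = 0 . \tag{1.31} \]
(Implicit in these equations is the fact that particles are non-interacting, so that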
adding or removing a particle of momentum p does not affect the rest of the system.)
In these lectures, we will adopt the following normalization for the free Hamiltonian5 ,
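\[ H = \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_{\boldsymbol p}}\; E_{\boldsymbol p}\, \big( a^\dagger_{\boldsymbol p} a_{\boldsymbol p} + V E_{\boldsymbol p} \big) , \tag{1.34} \]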
where V is the volume of the system. To make contact with the usual treatment6 of a
harmonic oscillator in quantum mechanics, it is useful to introduce the occupation
number fp defined by,
The expectation value of $f_{\boldsymbol p}$ has the interpretation of the number of particles per unit
of phase-space (i.e. per unit of volume in coordinate space and per unit of volume
in momentum space), and the 1/2 in $f_{\boldsymbol p} + \tfrac{1}{2}$ is the ground state occupation of each
oscillator7 . Of course, this additive constant is to a large extent irrelevant since only
energy differences have a physical meaning. Given eq. (1.34), the commutation
relations (1.32) and (1.33) are fulfilled provided that
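\[ \big[a_{\boldsymbol p}, a^\dagger_{\boldsymbol q}\big] = (2\pi)^3\, 2E_{\boldsymbol p}\, \delta(\boldsymbol{p} - \boldsymbol{q}) . \tag{1.37} \]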
5 In a relativistic setting, the measure $d^3\boldsymbol{p}/\big((2\pi)^3\, 2E_{\boldsymbol p}\big)$ has the important benefit of being Lorentz
invariant. Moreover, it results naturally from the 4-dimensional momentum integration $d^4p/(2\pi)^4$
constrained by the positive energy mass-shell condition $2\pi\, \theta(p^0)\, \delta(p^2 - m^2)$.
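The Lorentz invariance of $d^3\boldsymbol{p}/E_{\boldsymbol p}$ stated in footnote 5 can be checked numerically. Under a boost along z the transverse momenta are unchanged, so invariance is equivalent to $dp'_z/dp_z = E'/E$. The sketch below tests this with a finite-difference Jacobian; the mass, transverse momentum and boost velocity are arbitrary illustrative values.

```python
import math

def energy(pz, pt2, m):
    """On-shell energy for longitudinal momentum pz and transverse momentum squared pt2."""
    return math.sqrt(m * m + pt2 + pz * pz)

def boost_pz(pz, pt2, m, beta):
    """Longitudinal momentum after a boost of velocity beta along z."""
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return gamma * (pz + beta * energy(pz, pt2, m))

m, pt2, beta = 1.0, 0.25, 0.8   # arbitrary test values
pz, h = 0.4, 1e-5

# Jacobian d(pz')/d(pz) by central finite difference.
jacobian = (boost_pz(pz + h, pt2, m, beta) - boost_pz(pz - h, pt2, m, beta)) / (2 * h)

gamma = 1.0 / math.sqrt(1.0 - beta * beta)
e_before = energy(pz, pt2, m)
e_after = gamma * (e_before + beta * pz)

# Invariance of d^3p / E is equivalent to jacobian == E' / E.
ratio = e_after / e_before
print(jacobian, ratio)
```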
6 In relativistic quantum field theory, it is customary to use a system of units in which h̄ = 1, c = 1 (and
also kB = 1 when the Boltzmann constant is needed to relate energies and temperature). In this system of
units, the action S is dimensionless. Mass, energy, momentum and temperature have the same dimension,
which is the inverse of the dimension of length and duration:
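\[ [\text{mass}] = [\text{energy}] = [\text{momentum}] = [\text{temperature}] = [\text{length}^{-1}] = [\text{duration}^{-1}] . \]
Moreover, in four dimensions, the creation and annihilation operators introduced in eq. (1.34) have the
dimension of an inverse energy:
\[ [a_{\boldsymbol p}] = [a^\dagger_{\boldsymbol p}] = [\text{energy}^{-1}] \]
(the occupation number $f_{\boldsymbol p}$ is dimensionless.)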
7 This is reminiscent of the fact that the energy of the level n in a quantized harmonic oscillator of base
energy ω is $E_n = (n + \tfrac{1}{2})\,\omega$.
Note that in quantum mechanics, a particle with a well defined momentum p is not
localized at a specific point in space, due to the uncertainty principle. Thus, when we
say that a†p creates a particle of momentum p, this production process may happen
anywhere in space and at any time since the energy is also well defined. Instead of
using the momentum basis, one may introduce an operator that depends on space-time
in order to give preeminence to the time and position at which a particle is created or
destroyed. It is possible to encapsulate all the ap , a†p into the following Hermitean
operator8
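\[ \phi(x) \equiv \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_{\boldsymbol p}} \Big[ a^\dagger_{\boldsymbol p}\, e^{+ip\cdot x} + a_{\boldsymbol p}\, e^{-ip\cdot x} \Big] , \tag{1.38} \]
where $p\cdot x \equiv p_\mu x^\mu$ with $p^0 = +E_{\boldsymbol p}$. In the following, we will also need the time
derivative of this operator, denoted Π(x),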
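\[ \Pi(x) \equiv \partial_0\phi(x) = i \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_{\boldsymbol p}}\; E_{\boldsymbol p}\, \Big[ a^\dagger_{\boldsymbol p}\, e^{+ip\cdot x} - a_{\boldsymbol p}\, e^{-ip\cdot x} \Big] . \tag{1.39} \]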
Given the commutation relation (1.37), we obtain the following equal-time commuta-
tion relations for φ and Π,
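\[ \big[\phi(x), \phi(y)\big]_{x^0=y^0} = \big[\Pi(x), \Pi(y)\big]_{x^0=y^0} = 0 , \quad \big[\phi(x), \Pi(y)\big]_{x^0=y^0} = i\,\delta(\boldsymbol{x} - \boldsymbol{y}) . \tag{1.40} \]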
These are called the canonical field commutation relations. In this approach (known
as canonical quantization), the quantization of a field theory corresponds to promot-
ing the classical Poisson bracket between a dynamical variable and its conjugate
momentum to a commutator:
\[ \big\{P_i, Q_j\big\} = \delta_{ij} \quad\to\quad \big[\hat{P}_i, \hat{Q}_j\big] = i\hbar\,\delta_{ij} . \tag{1.41} \]
In addition to these relations that hold for equal times, one may prove that φ(x) and
Π(y) commute for space-like intervals $(x - y)^2 < 0$. Physically, this is related to the
absence of causal relation between two measurements performed at space-time points
with a space-like separation.
It is possible to invert eqs. (1.38) and (1.39) in order to obtain the creation and
8 In four space-time dimensions, this field has the same dimension as energy:
\[ [\phi(x)] = [\text{energy}] . \]
annihilation operators given the operators φ and Π. These inversion formulas read
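\[ \begin{aligned}
a^\dagger_{\boldsymbol p} &= -i \int d^3\boldsymbol{x}\; e^{-ip\cdot x}\, \big[ \Pi(x) + iE_{\boldsymbol p}\,\phi(x) \big] = -i \int d^3\boldsymbol{x}\; e^{-ip\cdot x}\, \overleftrightarrow{\partial_0}\, \phi(x) , \\
a_{\boldsymbol p} &= +i \int d^3\boldsymbol{x}\; e^{+ip\cdot x}\, \big[ \Pi(x) - iE_{\boldsymbol p}\,\phi(x) \big] = +i \int d^3\boldsymbol{x}\; e^{+ip\cdot x}\, \overleftrightarrow{\partial_0}\, \phi(x) ,
\end{aligned} \tag{1.42} \]
where the operator $\overleftrightarrow{\partial_0}$ is defined as
\[ A\, \overleftrightarrow{\partial_0}\, B \equiv A\, (\partial_0 B) - (\partial_0 A)\, B . \tag{1.43} \]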
Note that these expressions, although they appear to contain $x^0$, do not actually
depend on time. Using these formulas, we can rewrite the Hamiltonian in terms of φ
and Π,
\[ H = \int d^3\boldsymbol{x}\, \Big\{ \tfrac{1}{2}\Pi^2(x) + \tfrac{1}{2}(\nabla\phi(x))^2 + \tfrac{1}{2} m^2 \phi^2(x) \Big\} . \tag{1.44} \]
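From this Hamiltonian, one may obtain equations of motion in the form of Hamilton's
equations. Formally, they read
\[ \begin{aligned}
\partial_0\phi(x) &= \frac{\delta H}{\delta\Pi(x)} = \Pi(x) , \\
\partial_0\Pi(x) &= -\frac{\delta H}{\delta\phi(x)} = \big(\nabla^2 - m^2\big)\,\phi(x) .
\end{aligned} \tag{1.45} \]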
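To see eqs. (1.44) and (1.45) at work, one can integrate the classical equations of motion on a lattice and monitor the energy. The sketch below is only an illustration under stated assumptions: one spatial dimension, a periodic lattice of spacing `a`, a leapfrog integrator, and an arbitrary sine-wave initial condition, none of which come from the text.

```python
import math

def energy(phi, pi, m, a):
    """Lattice version of eq. (1.44) in one spatial dimension (periodic boundary)."""
    n = len(phi)
    total = 0.0
    for i in range(n):
        grad = (phi[(i + 1) % n] - phi[i]) / a
        total += 0.5 * pi[i] ** 2 + 0.5 * grad ** 2 + 0.5 * m * m * phi[i] ** 2
    return a * total

def force(phi, m, a):
    """Right-hand side of d(Pi)/dt = (laplacian - m^2) phi with a lattice Laplacian."""
    n = len(phi)
    return [(phi[(i + 1) % n] - 2 * phi[i] + phi[i - 1]) / (a * a) - m * m * phi[i]
            for i in range(n)]

def evolve(phi, pi, m, a, dt, steps):
    """Leapfrog integration of d(phi)/dt = Pi, d(Pi)/dt = force."""
    phi, pi = list(phi), list(pi)
    for _ in range(steps):
        f = force(phi, m, a)
        pi = [p + 0.5 * dt * fi for p, fi in zip(pi, f)]
        phi = [x + dt * p for x, p in zip(phi, pi)]
        f = force(phi, m, a)
        pi = [p + 0.5 * dt * fi for p, fi in zip(pi, f)]
    return phi, pi

n, a, m = 32, 1.0, 1.0
phi0 = [math.sin(2 * math.pi * i / n) for i in range(n)]   # arbitrary initial field
pi0 = [0.0] * n

e0 = energy(phi0, pi0, m, a)
phi1, pi1 = evolve(phi0, pi0, m, a, dt=0.01, steps=1000)
e1 = energy(phi1, pi1, m, a)
rel_drift = abs(e1 - e0) / e0
print(e0, e1, rel_drift)
```

The energy is conserved up to the small discretization error of the integrator, as expected for a time-independent Hamiltonian.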
The missing potential term of the Lagrangian is obtained by requesting that we have
\[ H = \int d^3\boldsymbol{x}\; \Pi(x)\,\partial_0\phi(x) - L . \tag{1.48} \]
is a Lorentz scalar (this is not true of the Hamiltonian, which may be considered as
the time component of a 4-vector from the point of view of Lorentz transformations).
The Lagrangian (1.49) leads to the following Euler-Lagrange equation of motion,
\[ \big(\Box_x + m^2\big)\,\phi(x) = 0 , \tag{1.51} \]
Such an invariance is said to be continuous when it is valid for any value of the
infinitesimal parameter ε. If the Lagrangian is unchanged by this transformation, we
can write
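\[ \begin{aligned}
0 = \delta\mathcal{L} &= \frac{\partial\mathcal{L}}{\partial\phi}\,\varepsilon\Psi + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\varepsilon\,\partial_\mu\Psi \\
&= \Big[\partial_\mu \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\Big]\,\varepsilon\Psi + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\varepsilon\,\partial_\mu\Psi \\
&= \varepsilon\,\partial_\mu \underbrace{\Big(\frac{\partial\mathcal{L}}{\partial(\partial_\mu\phi)}\,\Psi\Big)}_{J^\mu} .
\end{aligned} \tag{1.53} \]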
In the second line, we have used the Euler-Lagrange equation obeyed by the field.
The 4-vector $J^\mu$ is known as the Noether current associated to this symmetry. The fact
that the variation of the Lagrangian is zero implies the following continuity equation
for this current
\[ \partial_\mu J^\mu = 0 . \tag{1.54} \]
This is the simplest case of Noether’s theorem, where the Lagrangian itself is invariant.
But for the theory to be unmodified by the transformation of eq. (1.52), it is only
necessary that the action be invariant, which is also realized if the Lagrangian is
modified by a total derivative, i.e.
\[ \delta\mathcal{L} = \varepsilon\,\partial_\mu K^\mu . \tag{1.55} \]
(The proportionality to ε follows from the fact that the variation must vanish when
ε → 0.) When the variation of the Lagrangian is a total derivative instead of zero, the
continuity equation is modified into:
\[ \partial_\mu\big(J^\mu - K^\mu\big) = 0 , \tag{1.56} \]
where Jµ is the same current as before. As we shall see later, there are situations
where a conservation equation such as (1.54) is violated by quantum effects, due to a
delicate interplay between the symmetry responsible for the conservation law and the
ultraviolet structure of the theory.
factor that will prove convenient later on. At this point, it seems that any degree n
may provide a reasonable interaction term. However, theories with an odd n have an
unstable vacuum, and theories with n > 4 are non-renormalizable in four space-time
dimensions, as we shall see later. For these reasons, n = 4 is the only case which is
widely studied in practice, and we will stick to this value in the rest of this chapter.
With this choice, the Hamiltonian and Lagrangian read
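\[ \begin{aligned}
H &= \int d^3\boldsymbol{x}\, \Big\{ \tfrac{1}{2}\Pi^2(x) + \tfrac{1}{2}(\nabla\phi(x))^2 + \tfrac{1}{2} m^2\phi^2(x) + \tfrac{\lambda}{4!}\,\phi^4(x) \Big\} , \\
L &= \int d^3\boldsymbol{x}\, \Big\{ \tfrac{1}{2}(\partial_\mu\phi(x))(\partial^\mu\phi(x)) - \tfrac{1}{2} m^2\phi^2(x) - \tfrac{\lambda}{4!}\,\phi^4(x) \Big\} ,
\end{aligned} \tag{1.58} \]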
A field operator that obeys this non-linear equation of motion can no longer be
represented as a linear superposition of plane waves such as (1.38). Let us assume
that the coupling constant is very slowly time-dependent, in such a way that
What we have in mind here is that λ goes to zero adiabatically at asymptotic times,
i.e. much slower than all the physically relevant timescales of the theory under
consideration. Therefore, at $x^0 = \pm\infty$, the theory is a free theory whose spectrum is
made of the eigenstates of the free Hamiltonian. Likewise, the field φ(x) should be
in a certain sense “close to a free field” in these limits. In the case of the $x^0 \to -\infty$
limit, let us denote this by9
where φin is a free field operator that admits a Fourier decomposition similar to
eq. (1.38),
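\[ \phi_{\rm in}(x) \equiv \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_{\boldsymbol p}} \Big[ a^\dagger_{\boldsymbol p,{\rm in}}\, e^{+ip\cdot x} + a_{\boldsymbol p,{\rm in}}\, e^{-ip\cdot x} \Big] . \tag{1.62} \]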
9 In this equation, we ignore for now the issue of field renormalization, onto which we shall come back
where
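\[ \mathcal{L}_I(\phi(x)) \equiv -\frac{\lambda}{4!}\,\phi^4(x) . \tag{1.65} \]
\[ \begin{aligned}
U(t, t) &= 1 \\
U(t_3, t_1) &= U(t_3, t_2)\, U(t_2, t_1) \quad (\text{for all } t_2) \\
U(t_1, t_2) &= U^{-1}(t_2, t_1) = U^\dagger(t_2, t_1) .
\end{aligned} \tag{1.66} \]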
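The three properties in eq. (1.66) can be illustrated on a toy model. The sketch below assumes a two-level system with Hamiltonian $H(t) = \omega(t)\,\sigma_x$ and the hypothetical profile $\omega(t) = 1/(1+t^2)$, which vanishes at large times. Because $H(t)$ commutes with itself at different times, no time-ordering is needed and $U(t_2, t_1) = \exp(-i\,\sigma_x \int_{t_1}^{t_2}\omega)$ is known in closed form. This is not the field-theoretic evolution operator of the text, only a minimal stand-in with the same group properties.

```python
import math

def angle(t2, t1):
    """Integral of the hypothetical profile omega(t) = 1 / (1 + t^2) from t1 to t2."""
    return math.atan(t2) - math.atan(t1)

def U(t2, t1):
    """exp(-i angle sigma_x) = cos(angle) - i sin(angle) sigma_x, as a 2x2 matrix."""
    c, s = math.cos(angle(t2, t1)), math.sin(angle(t2, t1))
    return [[c, -1j * s], [-1j * s, c]]

def matmul(x, y):
    return [[sum(x[a][c] * y[c][b] for c in range(2)) for b in range(2)]
            for a in range(2)]

def dagger(x):
    return [[x[b][a].conjugate() for b in range(2)] for a in range(2)]

def distance(x, y):
    """Largest entry-wise difference between two 2x2 matrices."""
    return max(abs(x[a][b] - y[a][b]) for a in range(2) for b in range(2))

identity = [[1.0, 0.0], [0.0, 1.0]]
t1, t2, t3 = -1.0, 5.0, 0.5   # t2 does not need to lie between t1 and t3

identity_dev = distance(U(0.3, 0.3), identity)
composition_dev = distance(U(t3, t1), matmul(U(t3, t2), U(t2, t1)))
inverse_dev = distance(U(t1, t2), dagger(U(t2, t1)))
unitarity_dev = distance(matmul(U(t2, t1), dagger(U(t2, t1))), identity)
print(identity_dev, composition_dev, inverse_dev, unitarity_dev)
```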
The in creation and annihilation operators can be used to define a space of eigenstates
of the free Hamiltonian, starting from a ground state (vacuum) denoted $|0_{\rm in}\rangle$. For
instance, one particle states would be defined as
The physical interpretation of these states is that they are states with a definite particle
content at $x^0 = -\infty$, before the interactions are turned on10 .
10 For an interacting system, it is not possible to enumerate the particle content of states, because of
In the same way as we have constructed in field operators, creation and annihilation
operators and states, we may construct out ones such that the field $\phi_{\rm out}(x)$ is a
free field that coincides with the interacting field φ(x) in the limit $x^0 \to +\infty$ (with
the same caveat about field renormalization). Starting from a vacuum state $|0_{\rm out}\rangle$, we
may also define a full set of states, such as $|p_{\rm out}\rangle$, that have a definite particle content
at $x^0 = +\infty$. It is crucial to observe that the in and out states are not identical:
\[ |0_{\rm out}\rangle \neq |0_{\rm in}\rangle \ \ (\text{they differ by the phase } \langle 0_{\rm out}|0_{\rm in}\rangle) , \quad |p_{\rm out}\rangle \neq |p_{\rm in}\rangle , \ \cdots \tag{1.69} \]
Taking the limit $x^0 \to +\infty$ in eq. (1.63), we first see that11
from which we deduce that the in and out states must be related by
\[ |\alpha_{\rm out}\rangle = U(-\infty, +\infty)\, |\alpha_{\rm in}\rangle . \tag{1.71} \]
The two sets of states are identical for a free theory, since the evolution operator
reduces to the identity in this case.
Firstly, we write the state $|p_{\rm in}\rangle$ as the action of a creation operator on the corresponding
vacuum state, and we replace the creation operator by its expression in terms of $\phi_{\rm in}$,
U(+∞, −∞).
Next, we use the fact that $\phi_{\rm in}$, $\Pi_{\rm in}$ are the limits when $x^0 \to -\infty$ of the interacting
fields φ, Π, and we express this limit by means of the following trick:
\[ \lim_{x^0\to-\infty} F(x^0) = \lim_{x^0\to+\infty} F(x^0) - \int_{-\infty}^{+\infty} dx^0\; \partial_{x^0} F(x^0) . \tag{1.75} \]
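Eq. (1.75) is just the fundamental theorem of calculus written in a convenient form. As a quick numerical illustration, the sketch below uses the arbitrary test function $F(t) = \tanh t$ (an assumption for the example, with limits $\mp 1$) and a large finite cutoff in place of infinity.

```python
import math

def F(t):
    """Arbitrary smooth test function with distinct limits at -inf and +inf."""
    return math.tanh(t)

def dF(t):
    """Exact derivative of F."""
    return 1.0 / math.cosh(t) ** 2

def integrate(f, lo, hi, n):
    """Composite trapezoid rule with n sub-intervals."""
    h = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for k in range(1, n):
        total += f(lo + k * h)
    return total * h

cutoff = 20.0                      # stands in for infinity
lhs = F(-cutoff)                   # "limit" at early times
rhs = F(cutoff) - integrate(dF, -cutoff, cutoff, 40000)
print(lhs, rhs)
```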
The term with the limit $x^0 \to +\infty$ produces a term identical to the r.h.s. of the first
line of eq. (1.74), but with an $a^\dagger_{\boldsymbol p,{\rm out}}$ instead of $a^\dagger_{\boldsymbol p,{\rm in}}$. At this stage we have
In the first line, we use the commutation relation between creation and annihilation
operators to obtain
This term does not involve any interaction, since the initial state particle simply goes
through to the final state (in other words, this particle just acts as a spectator in the
process). Such trivial terms always appear when expressing transition amplitudes in
terms of the field operator, and they are usually dropped since they do not carry any
interesting physical information. We can then perform explicitly the time derivative
in the second line to obtain12
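\[ \langle q_{\rm out} | p_{\rm in} \rangle \doteq i \int d^4x\; e^{-ip\cdot x}\, (\Box_x + m^2)\, \langle q_{\rm out} | \phi(x) | 0_{\rm in} \rangle , \tag{1.78} \]
where we use the symbol $\doteq$ to indicate that the trivial non-interacting terms have been
dropped.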
Next, we repeat the same procedure for the final state particle: (i) replace the
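annihilation operator $a_{\boldsymbol q,{\rm out}}$ by its expression in terms of $\phi_{\rm out}$, (ii) write $\phi_{\rm out}$ as a limit
of φ when $x^0 \to +\infty$, (iii) write this limit as an integral of a time derivative plus a
term at $x^0 \to -\infty$, that we rewrite as the annihilation operator $a_{\boldsymbol q,{\rm in}}$:
\[ \begin{aligned}
\langle q_{\rm out} | p_{\rm in} \rangle \doteq i \int d^4x\; e^{-ip\cdot x}\, (\Box_x + m^2)\, \Big\{ & \langle 0_{\rm out} | a_{\boldsymbol q,{\rm in}}\, \phi(x) | 0_{\rm in} \rangle \\
& + i \int d^4y\; \partial_{y^0} \Big[ e^{iq\cdot y}\, \langle 0_{\rm out} | \big( \Pi(y) - iE_{\boldsymbol q}\,\phi(y) \big)\, \phi(x) | 0_{\rm in} \rangle \Big] \Big\} .
\end{aligned} \tag{1.79} \]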
12 We use here the dispersion relation $p_0^2 - \boldsymbol{p}^2 = m^2$ of the incoming particle to arrive at this expression.
The mass that should enter in this formula is the physical mass of the particles. This remark will become
important when we discuss renormalization.
However, at this point we are stuck because we would like to bring the $a_{\boldsymbol q,{\rm in}}$ to the
right where it would annihilate $|0_{\rm in}\rangle$, but we do not know the commutator between
$a_{\boldsymbol q,{\rm in}}$ and the interacting field operator φ(x). The remedy is to go one step back, and
note that we are free to insert a T-product in
\[ \big( \Pi_{\rm out}(y) - iE_{\boldsymbol q}\,\phi_{\rm out}(y) \big)\, \phi(x) = T \Big( \big( \Pi(y) - iE_{\boldsymbol q}\,\phi(y) \big)\, \phi(x) \Big) \Big|_{y^0 \to +\infty} \tag{1.80} \]
since the time $y^0 \to +\infty$ is obviously larger than $x^0$. Then the boundary term at
$y^0 \to -\infty$ will automatically lead to the desired ordering $\phi(x)\, a_{\boldsymbol q,{\rm in}}$,
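\[ \begin{aligned}
\langle q_{\rm out} | p_{\rm in} \rangle \doteq i \int d^4x\; e^{-ip\cdot x}\, (\Box_x + m^2)\, \Big\{ & \underbrace{\langle 0_{\rm out} | \phi(x)\, a_{\boldsymbol q,{\rm in}} | 0_{\rm in} \rangle}_{0} \\
& + i \int d^4y\; \partial_{y^0} \Big[ e^{iq\cdot y}\, \langle 0_{\rm out} | T \big( \Pi(y) - iE_{\boldsymbol q}\,\phi(y) \big)\, \phi(x) | 0_{\rm in} \rangle \Big] \Big\} .
\end{aligned} \tag{1.81} \]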
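\[ \begin{aligned}
\langle q_1 \cdots q_n\,{}_{\rm out} | p_1 \cdots p_m\,{}_{\rm in} \rangle \doteq\ & i^{m+n} \int \prod_{i=1}^{m} d^4x_i\; e^{-ip_i\cdot x_i}\, (\Box_{x_i} + m^2) \\
& \times \int \prod_{j=1}^{n} d^4y_j\; e^{iq_j\cdot y_j}\, (\Box_{y_j} + m^2) \\
& \times \langle 0_{\rm out} | T\, \phi(x_1) \cdots \phi(x_m)\, \phi(y_1) \cdots \phi(y_n) | 0_{\rm in} \rangle .
\end{aligned} \]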
The bottom line is that an amplitude with m + n particles is related to the vacuum
expectation value of a time-ordered product of m + n interacting field operators (a
slight but important modification to this formula will be introduced in section 1.9,
in order to account for field renormalization). Note that the vacuum states on the left
and on the right of the expectation value are respectively the out and the in vacua.
All experiments in particle physics amount to a measurement that answers the follow-
ing question: given a certain setup that defines an initial state, how many reactions
of a certain type occur per unit time? The concept of “reaction of a certain type”
may vary widely depending on the number of criteria that are imposed on the final
state for the reaction to be worth counting. For instance, one may consider the re-
action e+ e− → anything, the reaction e+ e− → µ+ µ− , or even a reaction with the
same particles in the initial and final states, but where in addition the final muons
are required to have momenta in a certain range. As we have seen in the previous
section, the LSZ reduction formulas express transition amplitudes between states with
a definite particle content in terms of correlation functions of the field operators that
are calculable in quantum field theory. The missing link to connect this to experi-
mental measurements is an explicit formula relating reaction rates to these transition
amplitudes.
In this formula, the left hand side is measured experimentally, while in the right hand
side the ratio N1 N2 /S depends only on the setup of the collider13 . Therefore, the
cross-section can be obtained as the ratio of two known quantities. Note that the
cross-section in general depends on the momenta p1,2 of the particles participating in
the collision (and on the momenta of the particles in the final state F), but in a Lorentz
covariant way, i.e. only through Lorentz scalars such as (p1 + p2 )2 .
13 In practice, the beam conditions are monitored by measuring in parallel the event rate of another
In the first equality, we have inserted a complete set of position eigenstates in order to
highlight the interpretation of $\langle p_{\rm in} | p_{\rm in} \rangle$ as the integral of the square of a wavefunction.
The second equality follows from the canonical commutation relation between
creation and annihilation operators. This equation means that our convention of nor-
malization of the states corresponds to “2E particles per unit volume”. We are using
quotes here because 2E does not have the correct dimension to be a proper density of
particles. This is mostly an aesthetic problem: this convention of normalization will
cancel out eventually, since cross-sections are defined in such a way that they do not
depend on the incoming fluxes of particles.
is the number of events where the final state particles have their momenta in the
volume $d^3\boldsymbol{q}_1 \cdots d^3\boldsymbol{q}_n$ centered on $(\boldsymbol{q}_1, \cdots, \boldsymbol{q}_n)$.
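\[ \langle q_1 \cdots q_n\,{}_{\rm out} | p_1 p_2\,{}_{\rm in} \rangle \equiv (2\pi)^4\, \delta\Big( p_1 + p_2 - \sum_{j=1}^{n} q_j \Big)\; T(q_{1,\cdots,n} | p_{1,2}) , \tag{1.91} \]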
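\[ \big| \langle q_1 \cdots q_n\,{}_{\rm out} | p_1 p_2\,{}_{\rm in} \rangle \big|^2 = (2\pi)^4\, \delta\Big( p_1 + p_2 - \sum_{j=1}^{n} q_j \Big) \times \underbrace{(2\pi)^4\, \delta(0)}_{VT}\; \big| T(q_{1,\cdots,n} | p_{1,2}) \big|^2 . \tag{1.92} \]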
This expression contains the square of the delta function. One of these factors becomes
a delta of zero, which has the interpretation of space-time volume VT in which the
process takes place. Since the initial state contains a fixed number of particles of each
kind (1 and 2) per unit volume in all space, we expect the total number of events to
be extensive, because interactions may happen in all the volume at any time. This is
the meaning of the factor VT that appears in this square.
From the insight gained by studying the non-interacting theory, this square
weighted by the Lorentz invariant phase-space measure of the final state counts
the number of events in which the final state particles have momenta in the volume
$d^3\boldsymbol{q}_1 \cdots d^3\boldsymbol{q}_n$ centered on $(\boldsymbol{q}_1, \cdots, \boldsymbol{q}_n)$:
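\[ \begin{aligned}
\text{Number of events} &= \big| \langle q_1 \cdots q_n\,{}_{\rm out} | p_1 p_2\,{}_{\rm in} \rangle \big|^2\, \prod_{j=1}^{n} \frac{d^3\boldsymbol{q}_j}{(2\pi)^3\, 2E_{\boldsymbol q_j}} \\
&= VT\; \big| T(q_{1,\cdots,n} | p_{1,2}) \big|^2\; \underbrace{(2\pi)^4\, \delta\Big( p_1 + p_2 - \sum_{j=1}^{n} q_j \Big) \prod_{j=1}^{n} \frac{d^3\boldsymbol{q}_j}{(2\pi)^3\, 2E_{\boldsymbol q_j}}}_{d\Gamma_n(p_{1,2})} .
\end{aligned} \tag{1.93} \]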
(dΓn (p1,2 ) is the invariant final state measure subject to the constraint of momentum
conservation.)
Cross-section in the target frame : At this point, the relationship with the cross-
section of this transition is most easily established in the rest frame of one of the initial
state particles, e.g. the particle 2 (this frame is called the target frame). Consider a thin
[Figure 1.1: Geometry of a two-body cross-section in the target frame. The two volumes represent the particles that can take part in the reaction in the duration T. Labels in the figure: ℓ, v1 T, S, v1, and the particles 1 and 2.]
Since s is Lorentz invariant, its expression in the rest frame of the particle 2 is
$s = (m_2 + E_1')^2 - \boldsymbol{p}_1'^{\,2}$ (in this paragraph, the primes indicate kinematical variables
in the target frame). Moreover, simple kinematics show that the combination $m_2 |\boldsymbol{p}_1'|$
in the target frame becomes
\[ m_2\, |\boldsymbol{p}_1'| = \sqrt{s}\; |\boldsymbol{p}_1| \tag{1.96} \]
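The "simple kinematics" behind eq. (1.96) can be checked by explicitly boosting from the target frame to the center of momentum frame. In the sketch below the masses and the target-frame momentum are arbitrary illustrative numbers, and the motion is taken along a single axis so that momenta are signed scalars.

```python
import math

m1, m2 = 0.5, 1.5        # arbitrary masses
p1_target = 2.0          # momentum of particle 1 in the rest frame of particle 2
e1_target = math.sqrt(m1 * m1 + p1_target * p1_target)

# Invariant s evaluated in the target frame.
s = (m2 + e1_target) ** 2 - p1_target ** 2

# Boost that brings the total momentum to zero (center of momentum frame).
beta = p1_target / (e1_target + m2)
gamma = (e1_target + m2) / math.sqrt(s)

p1_cm = gamma * (p1_target - beta * e1_target)   # particle 1 in the CM frame
p2_cm = gamma * (0.0 - beta * m2)                # particle 2 in the CM frame

lhs = m2 * p1_target
rhs = math.sqrt(s) * p1_cm
print(lhs, rhs, p1_cm + p2_cm)
```

The two momenta are opposite in the boosted frame, confirming that it is the center of momentum frame, and both sides of eq. (1.96) agree.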
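14 Regarding dimensions, $\langle q_1 \cdots q_n\,{}_{\rm out} | p_1 p_2\,{}_{\rm in} \rangle \sim (\text{mass})^{-(2+n)}$, $T(q_{1,\cdots,n} | p_{1,2}) \sim (\text{mass})^{2-n}$
and $d\Gamma_n \sim (\text{mass})^{2n-4}$, and therefore this formula indeed gives an area.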
Likewise, obtaining the expression of a cross-section in a frame where the two beams
have different momenta is a simple matter of relativistic kinematics (this is useful
when the frame of the detector apparatus is neither the rest frame of one of the particles, nor the
center of momentum frame, and one counts events in terms of some kinematical
variable measured in this frame; alternatively, one may boost all the measured final
state momenta in order to convert them to momenta in one of the above two frames).
Another very common type of observable is the decay rate Γ of a particle, defined
so that Γ dt is the decay probability of a particle at rest in the time interval dt. The
decay rate can be obtained from matrix elements with a 1-particle initial state,
\[ \langle q_1 \cdots q_n\,{}_{\rm out} | p_1\,{}_{\rm in} \rangle \equiv (2\pi)^4\, \delta\Big( p_1 - \sum_{j=1}^{n} q_j \Big)\; T(q_{1,\cdots,n} | p_1) . \tag{1.98} \]
Squaring this matrix element again produces a space-time volume factor VT , and
integrating over the invariant phase space of the final state particles gives
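\[ \underbrace{\int \prod_{j=1}^{n} \frac{d^3\boldsymbol{q}_j}{(2\pi)^3\, 2E_{\boldsymbol q_j}}\; \big| \langle q_1 \cdots q_n\,{}_{\rm out} | p_1\,{}_{\rm in} \rangle \big|^2}_{\text{Total number of decays}} = VT\, \underbrace{\int d\Gamma_n(p_1)\; \big| T(q_{1,\cdots,n} | p_1) \big|^2}_{\text{Decays per unit of time and volume}} . \tag{1.99} \]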
A differential decay rate can be obtained by leaving some of the final state kinematical
variables unintegrated.
1.6.1 Definition
To facilitate the bookkeeping, it is useful to introduce a generating functional that
encapsulates all the expectation values, by defining
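\[ \begin{aligned}
Z[j] &\equiv \sum_{n=0}^{\infty} \frac{1}{n!} \int d^4x_1 \cdots d^4x_n\; ij(x_1) \cdots ij(x_n)\; \langle 0_{\rm out} | T\, \phi(x_1) \cdots \phi(x_n) | 0_{\rm in} \rangle \\
&= \langle 0_{\rm out} | T \exp\, i \int d^4x\; j(x)\,\phi(x)\, | 0_{\rm in} \rangle .
\end{aligned} \tag{1.101} \]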
Note that
in an interacting theory (but if the vacuum state is stable, then this vacuum to vacuum
transition amplitude must be a pure phase whose squared modulus is one). From this
functional, the relevant expectation values are obtained by functional differentiation
\[ \langle 0_{\rm out} | T\, \phi(x_1) \cdots \phi(x_n) | 0_{\rm in} \rangle = \left. \frac{\delta^n Z[j]}{i\delta j(x_1) \cdots i\delta j(x_n)} \right|_{j=0} . \tag{1.103} \]
The knowledge of Z[j] would therefore give access to all the transition amplitudes.
However, it is in general not possible to derive Z[j] in closed form, and we need
to resort to perturbation theory, in which the answer is obtained as an expansion in
powers of the coupling constant.
\[ \phi(x_1) \cdots \phi(x_n) = U(-\infty, x_1^0)\, \phi_{\rm in}(x_1)\, U(x_1^0, x_2^0)\, \phi_{\rm in}(x_2) \cdots \phi_{\rm in}(x_n)\, U(x_n^0, -\infty) . \tag{1.104} \]
Noticing that the formula (1.104) is true for any ordering of the times $x_i^0$ and using
the expression of the U’s as a time-ordered exponential, we have
\[ T\, \phi(x_1) \cdots \phi(x_n) = U(-\infty, +\infty)\; T\, \phi_{\rm in}(x_1) \cdots \phi_{\rm in}(x_n)\, \exp\, i \int d^4x\; \mathcal{L}_I(\phi_{\rm in}(x)) , \tag{1.106} \]
where the time-ordering in the right-hand side applies to all the operators on its right.
This leads to the following representation of the generating functional
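\[ \begin{aligned}
Z[j] &= \underbrace{\langle 0_{\rm out} | U(-\infty, +\infty)}_{\langle 0_{\rm in} |}\; T \exp\, i \int d^4x\, \Big[ j(x)\,\phi_{\rm in}(x) + \mathcal{L}_I(\phi_{\rm in}(x)) \Big]\, | 0_{\rm in} \rangle \\
&= \exp\Big( i \int d^4x\; \mathcal{L}_I\Big( \frac{\delta}{i\delta j(x)} \Big) \Big)\; \underbrace{\langle 0_{\rm in} | T \exp\, i \int d^4x\; j(x)\,\phi_{\rm in}(x)\, | 0_{\rm in} \rangle}_{Z_0[j]} .
\end{aligned} \tag{1.107} \]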
This expression of Z[j] is the most useful, since it factorizes the interactions into a
(functional) differential operator acting on Z0 [j], the generating functional for the
non-interacting theory.
It turns out that the latter is calculable analytically. The main difficulty in evaluating
Z0 [j] is to deal with the non-commuting objects contained in the exponential. A
central mathematical result that we shall need is a particular case of the Baker-
Campbell-Hausdorff formula (see the section 4.1.5 for a derivation),
\[ \text{if } [A, [A, B]] = [B, [A, B]] = 0 , \quad e^A\, e^B = e^{A+B}\, e^{\frac{1}{2}[A,B]} . \tag{1.108} \]
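Eq. (1.108) can be verified exactly on matrices whose commutator is central. The sketch below uses two strictly upper-triangular 3×3 matrices (a choice made here for illustration, not taken from the text), for which the exponential series terminates after the quadratic term.

```python
def matmul(x, y):
    return [[sum(x[a][c] * y[c][b] for c in range(3)) for b in range(3)]
            for a in range(3)]

def add(x, y):
    return [[x[a][b] + y[a][b] for b in range(3)] for a in range(3)]

def scale(c, x):
    return [[c * x[a][b] for b in range(3)] for a in range(3)]

IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

def exp_nilpotent(x):
    """Exact exponential of a strictly upper-triangular 3x3 matrix: 1 + x + x^2 / 2."""
    return add(add(IDENTITY, x), scale(0.5, matmul(x, x)))

A = [[0.0, 2.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
B = [[0.0, 0.0, 0.0], [0.0, 0.0, 3.0], [0.0, 0.0, 0.0]]

# [A, B] has a single non-zero entry in the top-right corner and commutes with A and B.
comm = add(matmul(A, B), scale(-1.0, matmul(B, A)))

lhs = matmul(exp_nilpotent(A), exp_nilpotent(B))
rhs = matmul(exp_nilpotent(add(A, B)), exp_nilpotent(scale(0.5, comm)))

max_deviation = max(abs(lhs[a][b] - rhs[a][b]) for a in range(3) for b in range(3))
print(lhs, rhs, max_deviation)
```

Dropping the factor $e^{\frac{1}{2}[A,B]}$ changes the top-right entry, which shows that the correction is needed even in this simplest non-commuting case.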
This formula is applicable here because commutators [a, a† ] are c-numbers that
commute with everything else. In order to apply it, let us slice the time axis into an
infinite number of small intervals, by writing
\[ T \exp \int_{-\infty}^{+\infty} d^4x\; O(x) = \prod_{i=-\infty}^{+\infty} T \exp \int_{x_i^0}^{x_{i+1}^0} d^4x\; O(x) , \tag{1.109} \]
where the intermediate times are ordered according to $\cdots x_i^0 < x_{i+1}^0 < \cdots$. The
product in the right hand side should be understood with the convention that the
factors are ordered from left to right when the index i decreases. When the size
$\Delta \equiv x_{i+1}^0 - x_i^0$ of these intervals goes to zero, the time-ordering can be removed in
Note that the exponential in the second line is a c-number. In the end, we will need to
evaluate the expectation value of this operator in the $|0_{\rm in}\rangle$ vacuum state. Therefore, it
is desirable to transform it in such a way that the annihilation operators are on the
right and the creation operators are on the left. This can be achieved by writing
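\[ \begin{aligned}
\phi_{\rm in}(x) &= \phi_{\rm in}^{(+)}(x) + \phi_{\rm in}^{(-)}(x) , \\
\phi_{\rm in}^{(+)}(x) &\equiv \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_{\boldsymbol p}}\; a^\dagger_{\boldsymbol p,{\rm in}}\, e^{+ip\cdot x} , \\
\phi_{\rm in}^{(-)}(x) &\equiv \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_{\boldsymbol p}}\; a_{\boldsymbol p,{\rm in}}\, e^{-ip\cdot x} ,
\end{aligned} \tag{1.112} \]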
Moreover, when ∆ → 0, the separation between any pair of points x, y with $x_i^0 < x^0, y^0 < x_{i+1}^0$ is
always space-like.
The operator that appears in the right hand side of the first line is called a normal-
ordered exponential, and is denoted by bracketing the exponential between a pair of
colons (: · · · :):
\[ : \exp\, i \int d^4x\; j(x)\,\phi_{\rm in}(x) : \ \equiv\ \exp\Big( i \int d^4x\; j(x)\,\phi_{\rm in}^{(+)}(x) \Big)\, \exp\Big( i \int d^4x\; j(x)\,\phi_{\rm in}^{(-)}(x) \Big) . \tag{1.114} \]
A crucial property of the normal ordered exponential is that its in-vacuum expectation
value is equal to unity:
\[ \langle 0_{\rm in} | : \exp\, i \int d^4x\; j(x)\,\phi_{\rm in}(x) : | 0_{\rm in} \rangle = 1 . \tag{1.115} \]
Therefore, we have proven that the generating functional of the free theory is a
Gaussian in j(x),
\[ Z_0[j] = \exp\Big( -\frac{1}{2} \int d^4x\, d^4y\; j(x)\, j(y)\; G_F^0(x, y) \Big) , \tag{1.116} \]
where G0F (x, y) is a 2-point function called the free Feynman propagator and defined
as
\[ G_F^0(x, y) = \theta(x^0 - y^0)\, \big[ \phi_{\rm in}(x), \phi_{\rm in}(y) \big] - \big[ \phi_{\rm in}^{(+)}(x), \phi_{\rm in}^{(-)}(y) \big] . \tag{1.117} \]
Since the commutators in the right hand side of eq. (1.117) are c-numbers, we can
also write
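\[ \begin{aligned}
G_F^0(x, y) &= \langle 0_{\rm in} | \, \theta(x^0 - y^0)\, \big[ \phi_{\rm in}(x), \phi_{\rm in}(y) \big] - \big[ \phi_{\rm in}^{(+)}(x), \phi_{\rm in}^{(-)}(y) \big] \, | 0_{\rm in} \rangle \\
&= \langle 0_{\rm in} | T\, \phi_{\rm in}(x)\, \phi_{\rm in}(y) | 0_{\rm in} \rangle .
\end{aligned} \tag{1.118} \]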
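The way correlators are extracted from a Gaussian generating functional can be seen in a zero-dimensional toy version, where the field and the source are single numbers and $Z_0(j) = \exp(-\tfrac{1}{2} G j^2)$. In this hedged sketch, `G` is an arbitrary number standing in for the propagator and derivatives with respect to $ij$ are taken by finite differences. The 2-point function comes out as `G` and the 4-point function as `3 G**2`, the three pairings expected for a Gaussian.

```python
import math

G = 0.7   # assumed value standing in for the propagator in this toy model

def Z0(j):
    """Zero-dimensional analogue of eq. (1.116)."""
    return math.exp(-0.5 * G * j * j)

h = 0.05
# Central finite differences for the second and fourth derivatives at j = 0.
d2 = (Z0(h) - 2 * Z0(0.0) + Z0(-h)) / h ** 2
d4 = (Z0(2 * h) - 4 * Z0(h) + 6 * Z0(0.0) - 4 * Z0(-h) + Z0(-2 * h)) / h ** 4

# Derivatives with respect to (i j), as in eq. (1.103): divide by i^n.
two_point = (d2 / (1j) ** 2).real
four_point = (d4 / (1j) ** 4).real
print(two_point, four_point)
```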
In other words, the free Feynman propagator is the in-vacuum expectation value of
the time-ordered product of two free fields. Using the Fourier mode decomposition of
φin and the commutation relation between creation and annihilation operators, the
Feynman propagator can be rewritten as follows
\[ G_F^0(x, y) = \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_{\boldsymbol p}} \Big\{ \theta(x^0 - y^0)\, e^{-ip\cdot(x-y)} + \theta(y^0 - x^0)\, e^{+ip\cdot(x-y)} \Big\} . \tag{1.119} \]
In the following, we will also make an extensive use of the Fourier transform of
this propagator (with respect to the difference of coordinates xµ − yµ , since it is
translation invariant):
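$$\widetilde{G}^0_F(k) \equiv \int d^4(x-y)\; e^{ik\cdot(x-y)}\, G^0_F(x,y) = \frac{1}{2E_{\mathbf k}} \Big\{ \int_0^{+\infty} dz^0\; e^{i(k^0 - E_{\mathbf k})z^0} + \int_{-\infty}^{0} dz^0\; e^{i(k^0 + E_{\mathbf k})z^0} \Big\} . \tag{1.120}$$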
The remaining Fourier integrals over z0 are not defined as ordinary functions. Instead,
they are distributions, that can also be viewed as the limiting value of a family of
ordinary functions. In order to see this, let us write
$$\int_0^{+\infty} dz^0\; e^{iaz^0} = \lim_{\epsilon\to 0^+} \int_0^{+\infty} dz^0\; e^{i(a+i\epsilon)z^0} = \frac{i}{a + i0^+} . \tag{1.121}$$
Likewise
$$\int_{-\infty}^{0} dz^0\; e^{iaz^0} = \lim_{\epsilon\to 0^+} \int_{-\infty}^{0} dz^0\; e^{i(a-i\epsilon)z^0} = -\frac{i}{a - i0^+} . \tag{1.122}$$
Inserting these two results in eq. (1.120), we obtain
$$\widetilde{G}^0_F(k) = \frac{i}{k^2 - m^2 + i0^+} . \tag{1.123}$$
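The step from eqs. (1.120)-(1.122) to eq. (1.123) can be checked numerically. The following is only a minimal sketch: the function names, the sample momentum and the finite regulator `eps` (standing in for $0^+$) are illustrative assumptions, not part of the text.

```python
import math

def half_line_sum(k0, energy, eps):
    """Right-hand side of eq. (1.120), with the two z0 integrals replaced by
    eqs. (1.121) and (1.122) evaluated at a small but finite eps."""
    forward = 1j / ((k0 - energy) + 1j * eps)    # integral over z0 > 0
    backward = -1j / ((k0 + energy) - 1j * eps)  # integral over z0 < 0
    return (forward + backward) / (2.0 * energy)

def feynman_propagator(k0, k_squared, mass, eps):
    """Eq. (1.123) with i0+ replaced by i*eps; k_squared is the squared 3-momentum."""
    return 1j / (k0**2 - k_squared - mass**2 + 1j * eps)

if __name__ == "__main__":
    mass, k_squared, k0, eps = 1.0, 1.0, 2.0, 1e-9
    energy = math.sqrt(k_squared + mass**2)
    # The two half-line integrals combine into a single pole term; the regulator
    # of the combined expression is 2*E*eps, which is still an infinitesimal.
    print(half_line_sum(k0, energy, eps))
    print(feynman_propagator(k0, k_squared, mass, 2.0 * energy * eps))
```

The two printed values agree up to corrections of order `eps`, which is the sense in which the positive infinitesimal $2E_{\mathbf k}\,0^+$ can again be written as $0^+$.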
Note that $\widetilde{G}^0_F(k)$ is Lorentz invariant. Hence, $G^0_F(x,y)$ is also Lorentz invariant$^{16}$.
It is sometimes useful to have a representation of eq. (1.123) in terms of distributions.
This is provided by the following identity:
$$\frac{i}{z + i0^+} = i\,\mathcal{P}\Big(\frac{1}{z}\Big) + \pi\,\delta(z) , \tag{1.124}$$
where P(1/z) is the principal value of 1/z (i.e. the distribution obtained by cutting
out –symmetrically– an infinitesimal interval around z = 0). As far as integration
over the variable z is concerned, this prescription amounts to shifting the pole slightly
below the real axis, or equivalently to going around the pole at z = 0 from above (the
16 This is somewhat obfuscated by the fact that the step functions $\theta(\pm(x^0 - y^0))$ that enter in the
definition of the time-ordered product are not Lorentz invariant. The Lorentz invariance of time-ordered
products follows from the following properties:
• if $(x - y)^2 < 0$, then the two fields commute and the time ordering is irrelevant,
• if $(x - y)^2 \ge 0$, then the sign of $x^0 - y^0$ is Lorentz invariant.
term in πδ(z) can be viewed as the result of the integral on the infinitesimally small
half-circle around the pole):
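[Figure: integration contour in the complex $z$ plane, with the pole shifted to $-i0^+$, or equivalently with the contour passing above the pole at $z = 0$.]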
From eq. (1.123), it is trivial to check that $G^0_F(x,y)$ is a Green's function of the
operator $\square_x + m^2$ (up to a normalization factor $-i$):
$$(\square_x + m^2)\, G^0_F(x,y) = -i\,\delta(x-y) . \tag{1.125}$$
Strictly speaking, the operator $\square_x + m^2$ is not invertible, since it admits as zero modes
all the plane waves $\exp(\pm ik\cdot x)$ with an on-shell momentum, $k_0^2 = \mathbf{k}^2 + m^2$. The
$i0^+$ prescription in the denominator of eq. (1.123) amounts to shifting infinitesimally
the zeroes of $k_0^2 - \mathbf{k}^2 - m^2$ in the complex $k_0$ plane, in order to have a well
defined inverse. The regularization of eq. (1.123) is specific to the time-ordered
propagator. Other regularizations would provide different propagators; for instance
the free retarded propagator is given by
$$\widetilde{G}^0_R(k) = \frac{i}{(k^0 + i0^+)^2 - (\mathbf{k}^2 + m^2)} . \tag{1.126}$$
One can easily check that its inverse Fourier transform is a function $G^0_R(x,y)$ that
satisfies
$$(\square_x + m^2)\, G^0_R(x,y) = -i\,\delta(x-y) , \qquad G^0_R(x,y) = 0 \ \ \text{if}\ \ x^0 < y^0 . \tag{1.127}$$
In other words, $G^0_R$ is also a Green's function of the operator $\square_x + m^2$, but with
boundary conditions that differ from those of $G^0_F$.
that acts on the generating functional of the free theory. The latter is a Gaussian in j,
whose variance is given by the free Feynman propagator G0F . Although not explicit,
this formula provides a straightforward method for obtaining vacuum expectation
values of T-products of fields to a given order in the coupling constant λ.
1.7.1 Examples
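Let us first illustrate this by computing to order $\lambda^1$ the following two functions:
$\langle 0_{\rm out}|0_{\rm in}\rangle$ and $\langle 0_{\rm out}|\,T\,\phi(x)\phi(y)\,|0_{\rm in}\rangle$. In order to make the notations a bit lighter, we
denote $G^0_{xy} \equiv G^0_F(x,y)$. At order one in λ, we have
$$\langle 0_{\rm out}|0_{\rm in}\rangle = Z[0] = \Big[ 1 - i\frac{\lambda}{4!}\int d^4z\, \Big(\frac{\delta}{i\delta j(z)}\Big)^4 + O(\lambda^2) \Big]\, Z_0[j]\Big|_{j=0} = 1 - i\frac{\lambda}{8}\int d^4z\; \big(G^0_{zz}\big)^2 + O(\lambda^2) , \tag{1.128}$$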
and
Although the final expressions at order one are rather simple, the intermediate steps
are quite cumbersome due to the necessity of taking a large number of functional
derivatives. Moreover, the expression of the 2-point function $\langle 0_{\rm out}|\,T\,\phi(x)\phi(y)\,|0_{\rm in}\rangle$
becomes simpler after we notice that one can factor out $Z[0]$. This property is in fact
completely general; all transition amplitudes contain a factor $Z[0]$. From the remark
made after eq. (1.102), this factor is a pure phase: its squared modulus is one,
so it has no effect on transition probabilities. Therefore, it would be desirable
to identify from the start the terms that lead to this prefactor, to avoid unnecessary
calculations.
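$G^0_{xy} \equiv$ [diagram: a line joining the points $x$ and $y$] . (1.130)
$Z[0] = 1 + \frac{1}{8}$ [diagram: figure-eight vacuum graph with its vertex at $z$] $+\, O(\lambda^2)$ ,
$\langle 0_{\rm out}|\,T\,\phi(x)\phi(y)\,|0_{\rm in}\rangle =$ [diagram: line from $x$ to $y$] $+\, \frac{1}{8}$ [diagram: line from $x$ to $y$, times a disconnected figure-eight graph at $z$] $+\, \frac{1}{2}$ [diagram: line from $x$ to $y$ through the vertex $z$, with a closed loop attached at $z$] $+\, O(\lambda^2)$ . (1.131)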
The graphs that appear in the right hand side of these equations are called Feynman
diagrams. If we add to eq. (1.130) the rule that each vertex carries a factor $-i\lambda$
and an integration over the entire space-time, these graphs are in one-to-one
correspondence with the expressions of eqs. (1.128) and (1.129). For now, we have
written the numerical prefactors (1/8, 1/2, ...) explicitly, but they can in fact be
recovered simply from the symmetries of the graphs.
In the second of eqs. (1.131), the second term of the right hand side contains a
factor which is not connected to any of the points $x$ and $y$. These disconnected graphs
are precisely the ones responsible for the factor $Z[0]$ that appears in all transition
amplitudes. We can therefore disregard this type of graph altogether.
1. Draw all the graphs (with only vertices of valence 4) that connect the n points
x1 to xn and have exactly p vertices. Graphs that contain a subgraph which is
not connected to any of the xi ’s should be ignored.
2. Each line of a graph represents a free Feynman propagator G0F .
3. Each vertex represents a factor −iλ and an integral over the space-time coordi-
nate assigned to this vertex.
4. The numerical prefactor for a given graph is the inverse of the order of its
discrete symmetry group. As an illustration, we indicate below the generators
of these symmetry groups and their order for the graphs that appear in
eqs. (1.131):
[diagram: figure-eight vacuum graph with its vertex at $z$] → order 8 → 1/8 ,
[diagram: line from $x$ to $y$ through the vertex $z$, with a closed loop attached at $z$] → order 2 → 1/2 . (1.132)
Note that this rule for obtaining the symmetry factor associated to a given graph
is correct only if the corresponding term in the Lagrangian has been properly
symmetrized. For instance, the operator φ4 should appear in the Lagrangian
with a prefactor 1/4!.
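The prefactors 1/8 and 1/2 can be cross-checked by brute force, without using the symmetry groups. The sketch below enumerates all Wick pairings of the field "legs" at order one in λ and divides by the 4! of the interaction term. The function names and the labelling of legs are hypothetical choices made for this illustration only.

```python
from fractions import Fraction
from math import factorial

def pairings(legs):
    """Yield every way of grouping the list `legs` into unordered pairs (Wick contractions)."""
    if not legs:
        yield []
        return
    first, rest = legs[0], legs[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail

def order_one_prefactors(external_points):
    """Prefactors of the order-lambda graphs, keyed by the number of propagators
    that join two external points directly (1 means the vertex z is disconnected)."""
    legs = [(name, 0) for name in external_points] + [("z", k) for k in range(4)]
    counts = {}
    for pairing in pairings(legs):
        direct = sum(1 for a, b in pairing if a[0] != "z" and b[0] != "z")
        counts[direct] = counts.get(direct, 0) + 1
    # Each pairing is weighted by the 1/4! that multiplies phi^4 in the Lagrangian.
    return {key: Fraction(n, factorial(4)) for key, n in counts.items()}

if __name__ == "__main__":
    print(order_one_prefactors(()))          # vacuum graph contributing to Z[0]
    print(order_one_prefactors(("x", "y")))  # the two order-lambda graphs of the 2-point function
```

For the 2-point function, the 3 pairings in which $x$ is contracted directly with $y$ reproduce the disconnected graph, while the remaining 12 give the graph where the line passes through $z$.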
In step 1, graphs made of several disconnected subgraphs can appear
in certain functions, provided that each subgraph is connected to at least one of the
points xi . For instance, a 4-point function contains a piece which is simply made of
the product of two 2-point functions. In addition, it contains terms that correspond to
a genuine 4-point function, not factorizable in a product of 2-point functions. The
factorizable pieces are usually less interesting because they can be recovered from
already calculated simpler building blocks17 . For this reason, it is sometimes useful
to introduce the generating function of the connected graphs, denoted W[j]. This
functional is very simply related to Z[j] by
17 Moreover, in scattering amplitudes, these disconnected contributions are not physically interesting.
+··· (1.135)
This expansion highlights how the vacuum expectation values of time-ordered
products of fields can be factorized into products of connected contributions.
Thus, these operators just produce Feynman graphs that are amputated of all their
external lines. Then, the Fourier transform can be propagated to all the internal lines
of the graph, leading to an expression that involves propagators and vertices that
depend only on momenta. The Feynman rules for obtaining directly these momentum
space expressions are:
3''. All the internal momenta that are not constrained by these delta functions
should be integrated over with a measure $d^4k/(2\pi)^4$.
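[diagram: one-loop tadpole insertion on a line of momentum $P$] $= -i\,\dfrac{\lambda}{2} \displaystyle\int \frac{d^4k}{(2\pi)^4}\; \frac{i}{k^2 - m^2 + i0^+}$ ,
[diagram: one-loop 4-point graph with external momenta $p_1, p_2, q_1, q_2$ and loop momentum $k$] $= \dfrac{(-i\lambda)^2}{2} \displaystyle\int \frac{d^4k}{(2\pi)^4}\; \frac{i}{k^2 - m^2 + i0^+}\; \frac{i}{(p_1+p_2-k)^2 - m^2 + i0^+}$ . (1.137)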
$$G \sim \lambda^{n_V} . \tag{1.138}$$
This can also be related to the number of loops of the graph, which is a better measure
of its complexity since it determines how many momentum integrals it contains. Let
us denote $n_E$ the number of external lines, $n_I$ the number of internal lines and $n_L$ the
number of loops. These parameters are related by the following two identities:
$$4 n_V = 2 n_I + n_E , \qquad n_L = n_I - n_V + 1 . \tag{1.139}$$
The first of these equations equates the number of “handles” carried by the vertices
and the number of propagator endpoints that must be attached to them. The right hand
side of the second equation counts the number of internal momenta that are not
constrained by the delta functions of momentum conservation carried by the vertices (the
+1 comes from the fact that not all these delta functions are independent: a linear
combination of them simply states that the sum of the external momenta must be
zero, and therefore does not constrain the internal ones in any way). From these two
identities, one obtains
$$n_V = n_L - 1 + \frac{n_E}{2} , \tag{1.140}$$
and the order in λ of the graph is also
$$G \sim \lambda^{n_L - 1 + n_E/2} . \tag{1.141}$$
According to this formula, the order of a graph depends only on the number of
external lines nE (i.e. on the number of particles involved in the transition amplitude
under consideration), and on the number of loops. Thus, the perturbative expansion is
also a loop expansion, with the leading order being given by tree diagrams, the first
correction in λ by one-loop graphs, etc...
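The counting identities (1.139) and (1.140) are simple enough to be turned into a few lines of code. The following sketch (with hypothetical function names, and assuming connected graphs of the $\phi^4$ theory) recovers the number of loops from the numbers of vertices and external lines, and the order in λ from the number of loops.

```python
def phi4_graph_counts(n_vertices, n_external):
    """Return (n_internal, n_loops) for a connected phi^4 graph, from eqs. (1.139)."""
    endpoints = 4 * n_vertices - n_external  # handles left for internal propagators
    if endpoints < 0 or endpoints % 2:
        raise ValueError("no phi^4 graph with these numbers of vertices and external lines")
    n_internal = endpoints // 2
    n_loops = n_internal - n_vertices + 1
    return n_internal, n_loops

def order_in_lambda(n_loops, n_external):
    """Power of lambda from eq. (1.140): n_V = n_L - 1 + n_E / 2 (n_E is even in phi^4)."""
    return n_loops - 1 + n_external // 2

if __name__ == "__main__":
    for n_v, n_e in [(1, 4), (1, 2), (2, 4), (1, 0)]:
        n_i, n_l = phi4_graph_counts(n_v, n_e)
        print(f"n_V={n_v} n_E={n_e} -> n_I={n_i} n_L={n_l} order lambda^{order_in_lambda(n_l, n_e)}")
```

For instance, the tree-level 4-point vertex has no loop, while the two graphs of eq. (1.137) both have one loop.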
It turns out that the number of loops also counts the order in the Planck constant $\hbar$
of a graph. Although we have been using a system of units in which $\hbar = 1$, it is easy
to reinstate $\hbar$ by the substitution
$$S \;\to\; \frac{S}{\hbar} = -\int d^4x\, \Big\{ \frac{1}{2}\,\phi(x)\, \frac{\square_x + m^2}{\hbar}\, \phi(x) + \frac{\lambda}{4!\,\hbar}\, \phi^4(x) \Big\} . \tag{1.142}$$
From this, we see that $\hbar$ enters in the Feynman rules as follows
$$\text{Propagator}: \ \frac{i\,\hbar}{p^2 - m^2 + i0^+} , \qquad \text{Vertex}: \ -i\,\frac{\lambda}{\hbar} , \tag{1.143}$$
Therefore, each additional loop brings a power of $\hbar$, and the loop expansion can also
be viewed as an expansion in powers of $\hbar$.
Let us consider the first of the examples given in eq. (1.137) and define
$$-i\Sigma(P) \equiv -i\,\frac{\lambda}{2} \int \frac{d^4k}{(2\pi)^4}\; \frac{i}{k^2 - m^2 + i0^+} . \tag{1.145}$$
[Figure: integration contour in the complex $k^0$ plane.]
When the integrand depends only on the norm |kE |, we can separate the radial
integration on |kE | from the angular integration over the orientation of the vector in 4-
dimensional Euclidean space. In D dimensions, the volume measure for a rotationally
invariant integrand reads
where $V_D(k_E)$ is the volume of the $D$-dimensional ball of radius $k_E$. These volumes
can be determined recursively by
$$V_1(k_E) = 2 k_E , \qquad V_D(k_E) = k_E \int_0^\pi d\theta\; \sin\theta\; V_{D-1}(k_E \sin\theta) . \tag{1.148}$$
Therefore, we have
$$V_2(k_E) = \pi k_E^2 , \qquad V_3(k_E) = \frac{4\pi}{3}\, k_E^3 , \qquad V_4(k_E) = \frac{\pi^2}{2}\, k_E^4 . \tag{1.149}$$
Although knowing V4 (kE ) is sufficient for performing a radial momentum integral in
four dimensions, it is interesting to have the formula for an arbitrary dimension, in
view of applications to dimensional regularization. More generally, we have
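$$V_{D+1}(1) = V_D(1)\; \pi^{1/2}\, \frac{\Gamma\big(\frac{D}{2} + 1\big)}{\Gamma\big(\frac{D}{2} + \frac{3}{2}\big)} \qquad\text{and}\qquad V_D(1) = \frac{2\,\pi^{D/2}}{D\; \Gamma\big(\frac{D}{2}\big)} . \tag{1.150}$$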
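As a sanity check, the recursion (1.148) can be compared numerically with the closed form (1.150). The sketch below is only illustrative: it uses the homogeneity $V_D(k_E) = V_D(1)\, k_E^D$ to reduce each step of the recursion to a one-dimensional integral of $\sin^D\theta$, evaluated with a plain midpoint rule. The function names and the number of integration steps are arbitrary choices.

```python
import math

def sin_power_integral(power, n_steps=20000):
    """Midpoint-rule estimate of the integral of sin(theta)**power over [0, pi]."""
    h = math.pi / n_steps
    return h * sum(math.sin((i + 0.5) * h) ** power for i in range(n_steps))

def unit_ball_volume_recursive(dim):
    """V_D(1) from the recursion (1.148), using V_D(k) = V_D(1) * k**D."""
    volume = 2.0  # V_1(1): length of the segment [-1, 1]
    for d in range(2, dim + 1):
        volume *= sin_power_integral(d)
    return volume

def unit_ball_volume_closed(dim):
    """Closed form of eq. (1.150): V_D(1) = 2 pi^(D/2) / (D Gamma(D/2))."""
    return 2.0 * math.pi ** (dim / 2) / (dim * math.gamma(dim / 2))

if __name__ == "__main__":
    for dim in range(1, 7):
        print(dim, unit_ball_volume_recursive(dim), unit_ball_volume_closed(dim))
```

In four dimensions both functions return $\pi^2/2$ up to the integration error, in agreement with eq. (1.149).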
Let us now consider the second diagram of eq. (1.137) (with the notation $P \equiv p_1 + p_2$),
$$-i\Gamma_4(P) \equiv \frac{(-i\lambda)^2}{2} \int \frac{d^4k}{(2\pi)^4}\; \frac{i}{k^2 - m^2 + i0^+}\; \frac{i}{(P-k)^2 - m^2 + i0^+} . \tag{1.151}$$
In this more complicated example, an extra difficulty is that the integrand is not
rotationally invariant. The following trick, known as Feynman parameterization, can
be used to rearrange the denominators$^{18}$:
$$\frac{1}{AB} = \int_0^1 \frac{dx}{[xA + (1-x)B]^2} . \tag{1.152}$$
$$x\,(k^2 - m^2 + i0^+) + (1-x)\,((P-k)^2 - m^2 + i0^+) = l^2 - m^2 - \Delta(x,P) + i0^+ , \tag{1.153}$$
where the integrand is again invariant under rotations in 4-dimensional Euclidean space.
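Both steps of the Feynman parameterization can be checked numerically. In the sketch below, eq. (1.152) is tested with a midpoint rule for complex $A$ and $B$ whose imaginary parts have the same sign (as is the case for the $i0^+$ terms), and eq. (1.153) is tested for sample 4-vectors. The definitions $l \equiv k - (1-x)P$ and $\Delta(x,P) \equiv -x(1-x)P^2$ are assumptions made here: they are the choice that makes eq. (1.153) an identity, but they are not shown in this excerpt. All function names are hypothetical.

```python
def feynman_parameter_integral(A, B, n_steps=20000):
    """Midpoint-rule estimate of the right-hand side of eq. (1.152)."""
    h = 1.0 / n_steps
    total = 0.0
    for i in range(n_steps):
        x = (i + 0.5) * h
        total += h / (x * A + (1.0 - x) * B) ** 2
    return total

def minkowski_dot(a, b):
    """a.b with metric signature (+, -, -, -)."""
    return a[0] * b[0] - sum(ai * bi for ai, bi in zip(a[1:], b[1:]))

def combined_denominator(k, P, mass, x):
    """Left-hand side of eq. (1.153), without the i0+ terms."""
    P_minus_k = [Pi - ki for Pi, ki in zip(P, k)]
    return (x * (minkowski_dot(k, k) - mass**2)
            + (1.0 - x) * (minkowski_dot(P_minus_k, P_minus_k) - mass**2))

def shifted_denominator(k, P, mass, x):
    """Right-hand side of eq. (1.153), with the assumed l and Delta."""
    l = [ki - (1.0 - x) * Pi for ki, Pi in zip(k, P)]   # assumed shift of the loop momentum
    delta = -x * (1.0 - x) * minkowski_dot(P, P)        # assumed Delta(x, P)
    return minkowski_dot(l, l) - mass**2 - delta

if __name__ == "__main__":
    A, B = 2.0 + 0.3j, -1.0 + 0.2j
    print(feynman_parameter_integral(A, B), 1.0 / (A * B))
    k, P = [0.3, -1.2, 0.5, 2.0], [1.5, 0.4, -0.7, 0.1]
    print(combined_denominator(k, P, 1.3, 0.25), shifted_denominator(k, P, 1.3, 0.25))
```

After the shift $k \to l$, the denominator depends on the loop momentum only through $l^2$, which is what makes the Euclidean integrand rotationally invariant.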
19 It is allowed because the integration axis can be rotated counterclockwise without passing through the
For each of the expectation values in the right hand side, let us insert an identity
operator between the two field operators, written in the form of a sum over all the
possible physical states,
$$1 = \sum_{\text{states } \lambda} \big|\lambda\big\rangle \big\langle\lambda\big| . \tag{1.156}$$
The states λ can be arranged into classes inside which the states differ only by a boost.
A class of states, that we will denote α, is characterized by its particle content and
by the relative momenta of these particles. Within a class, the total momentum of
the state can be varied by applying a Lorentz boost. For a class α, we will denote
$|\alpha_{\mathbf p}\rangle$ the state of total momentum $\mathbf{p}$. Each class of states has an invariant mass $m_\alpha$,
such that the total energy $p_0$ and total momentum $\mathbf{p}$ of the states in this class obey
$p_0^2 - \mathbf{p}^2 = m_\alpha^2$. In addition, it is useful to isolate the vacuum in the sum over the
states. Therefore, the identity operator can be rewritten as
$$1 = \big|0\big\rangle\big\langle 0\big| + \sum_{\text{classes } \alpha} \int \frac{d^3\mathbf{p}}{(2\pi)^3\, 2\sqrt{\mathbf{p}^2 + m_\alpha^2}}\; \big|\alpha_{\mathbf p}\big\rangle\big\langle\alpha_{\mathbf p}\big| , \tag{1.157}$$
where we have written the integral over the total momentum of the states in a Lorentz
invariant fashion. (We need not specify if we are using in or out states here.)
When we insert this identity operator between the two field operators, the vacuum
does not contribute. For instance
(φ creates or destroys a particle, and therefore has a vanishing matrix element between
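vacuum states.) Using the momentum operator $\hat{P}$, we can write
$$\big\langle 0_{\rm out}\big|\phi(x)\big|\alpha_{\mathbf p}\big\rangle = \big\langle 0_{\rm out}\big|\, e^{i\hat{P}\cdot x}\, \phi(0)\, e^{-i\hat{P}\cdot x}\, \big|\alpha_{\mathbf p}\big\rangle = \big\langle 0_{\rm out}\big|\phi(0)\big|\alpha_{\mathbf p}\big\rangle\, e^{-ip\cdot x} = \big\langle 0_{\rm out}\big|\phi(0)\big|\alpha_{\mathbf 0}\big\rangle\, e^{-ip\cdot x} . \tag{1.159}$$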
The second line uses the fact that the total momentum in the vacuum state is zero,
and is p for the state αp . In the last equality, we have applied a boost that cancels the
total momentum p, and used the fact that the vacuum is invariant, as well as the scalar
field φ(0). Therefore, we obtain the following representation for the time-ordered
2-point function
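$$\big\langle 0_{\rm out}\big|\,T\,\phi(x)\phi(y)\,\big|0_{\rm in}\big\rangle = \sum_{\text{classes } \alpha} \big\langle 0_{\rm out}\big|\phi(0)\big|\alpha_{\mathbf 0}\big\rangle \big\langle \alpha_{\mathbf 0}\big|\phi(0)\big|0_{\rm in}\big\rangle \times \underbrace{\int \frac{d^3\mathbf{p}}{(2\pi)^3\, 2\sqrt{\mathbf{p}^2 + m_\alpha^2}} \Big\{ \theta(x^0-y^0)\, e^{-ip\cdot(x-y)} + \theta(y^0-x^0)\, e^{ip\cdot(x-y)} \Big\}}_{G^0_F(x,y;\,m_\alpha^2)} , \tag{1.160}$$
where the underbraced integral, $G^0_F(x,y;m_\alpha^2)$, is the Feynman propagator for a
hypothetical scalar field of mass $m_\alpha$ (compare this integral with eq. (1.119)). It is
customary to rewrite the above representation as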
$$\big\langle 0_{\rm out}\big|\,T\,\phi(x)\phi(y)\,\big|0_{\rm in}\big\rangle = \int_0^\infty \frac{dM^2}{2\pi}\; \rho(M^2)\; G^0_F(x,y;M^2) , \tag{1.161}$$
where Z is the product of matrix elements that appear in eq. (1.162), in the case of
1-particle states. In a theory with interactions, Z in general differs from unity (in fact,
it may be infinite). Note that in this equation, m must be the physical mass of the
particles, as it would be inferred from the simultaneous measurement of their energy
and momentum. As we shall see shortly, this is not the same as the parameter we
denoted m in the Lagrangian.
20 Between the 1-particle delta function and the 2-particle continuum, there may be additional delta
functions corresponding to multi-particle bound states (to have a stable bound state, the binding energy
should decrease the mass of the state compared to the mass 2m of two free particles at rest).
Taking the Fourier transform of eq. (1.161) and using eq. (1.163) for the spectral
function, we obtain the following pole structure for the exact Feynman propagator:
$$\widetilde{G}_F(p) = \frac{i\,Z}{p^2 - m^2 + i0^+} + \text{terms without poles} . \tag{1.164}$$
Therefore, the parameter Z that appears in the spectral function has also the interpre-
tation of the residue of the single particle pole in the exact Feynman propagator.
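The fact that $Z \neq 1$ calls for a slight modification of the LSZ reduction formulas.
Eq. (1.163) implies that a factor $\sqrt{Z}$ appears in the overlap between the state $\phi(x)|0_{\rm in}\rangle$
and the 1-particle state $|\mathbf{p}_{\rm in}\rangle$. In other words, $\phi(x)$ creates a particle with probability
$Z$ rather than 1. Therefore, there should be a factor $Z^{-1/2}$ for each incoming and
outgoing particle in the LSZ reduction formulas that relate transition amplitudes to
products of fields φ:
$$\big\langle \mathbf{q}_1\cdots\mathbf{q}_{n\,{\rm out}}\big|\mathbf{p}_1\cdots\mathbf{p}_{m\,{\rm in}}\big\rangle \doteq \Big(\frac{i}{\sqrt{Z}}\Big)^{m+n} \int \prod_{i=1}^{m} d^4x_i\; e^{-ip_i\cdot x_i}\, (\square_{x_i} + m^2)\; \prod_{j=1}^{n} d^4y_j\; e^{iq_j\cdot y_j}\, (\square_{y_j} + m^2)$$
$$\qquad\qquad \times\; \big\langle 0_{\rm out}\big|\, T\, \phi(x_1)\cdots\phi(x_m)\, \phi(y_1)\cdots\phi(y_n)\, \big|0_{\rm in}\big\rangle .$$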
Until now, we have not attempted to calculate explicitly the integrals over the
Euclidean momentum $k_E$ in eqs. (1.146) and (1.154). In fact, these integrals do not
converge when $|k_E| \to \infty$, and are therefore infinite. These infinities are
called ultraviolet divergences.
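$$\Sigma(P) = -\frac{\lambda}{2}\, \frac{m^2}{(4\pi)^2}\, \Big[ \frac{1}{\epsilon} + O(1) \Big] , \qquad \Gamma_4(P) = -\frac{\lambda^2}{2}\, \frac{1}{(4\pi)^2}\, \Big[ \frac{1}{\epsilon} + O(1) \Big] . \tag{1.169}$$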
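$\widetilde{G}_F(P) \equiv$ [diagrams: free propagator of momentum $P$ + propagator with one self-energy insertion + propagator with two insertions + propagator with three insertions + ···] , (1.170)
21 $\Gamma(z)$ is analytic in the complex plane, with the exception of a discrete series of simple poles, located at
$z_n = -n$ for $n \in \mathbb{N}$, with residues $(-1)^n/n!$.
22 These examples are not completely general. As we shall see later, divergent terms proportional to $P^2$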
we obtain
$$\widetilde{G}_F(P) = \frac{i}{p_0^2 - \mathbf{p}^2 - m^2 - \Sigma + i0^+} , \tag{1.171}$$
from which it is immediate to see that this loop correction alters the location of the
pole, now given by
$$p_0^2 - \mathbf{p}^2 = \underbrace{m^2 + \Sigma}_{\text{new squared mass}} . \tag{1.172}$$
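The resummation that leads from eq. (1.170) to eq. (1.171) is a geometric series. As a sketch, assuming that each self-energy insertion contributes a factor $-i\Sigma$ (as in the definition (1.145)) and that each line is a free propagator (1.123), with $P^2 \equiv p_0^2 - \mathbf{p}^2$:

```latex
\begin{align*}
\widetilde{G}_F(P)
 &= \frac{i}{P^2-m^2+i0^+}\,\sum_{n=0}^{\infty}
    \left[(-i\Sigma)\,\frac{i}{P^2-m^2+i0^+}\right]^n
  = \frac{i}{P^2-m^2+i0^+}\;
    \frac{1}{1-\dfrac{\Sigma}{P^2-m^2+i0^+}} \\
 &= \frac{i}{P^2-m^2-\Sigma+i0^+}\, .
\end{align*}
```

Each term of the series has a pole at $P^2 = m^2$ of increasing order, but their sum has a single simple pole, displaced to $P^2 = m^2 + \Sigma$.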
Since the propagator given in eq. (1.171) includes loop corrections, its poles ought to
give a value of the mass closer to the physical one. Therefore, it is tempting to write:
Of course, since Σ is infinite, the only way this can be satisfied is that the parameter
$m^2$ that appears in the Lagrangian be itself infinite, with an opposite sign in order
to cancel the infinity from Σ. To further distinguish it from the physical mass, the
parameter $m$ in the Lagrangian is usually called the bare mass, while $m_{\rm phys}$ is the
physical (or renormalized) mass.
$$\widetilde{G}_F(P) \underset{P^2 \to m^2_{\rm phys}}{\approx} \frac{i}{\big(1 - \Sigma'(m^2_{\rm phys})\big)\, \big(P^2 - m^2_{\rm phys}\big) + i0^+} . \tag{1.176}$$
This indicates that the field renormalization factor Z cannot be equal to 1 when the
propagator is corrected by a momentum-dependent loop. Instead, we have
$$Z = \frac{1}{1 - \Sigma'(m^2_{\rm phys})} . \tag{1.177}$$
Moreover, Weinberg's theorem implies that the ultraviolet divergences of the 2-point
function $\Sigma(P^2)$ arise only in $\Sigma(m^2_{\rm phys})$ and in the first derivative $\Sigma'(m^2_{\rm phys})$, while
higher derivatives are all finite. Eqs. (1.175) and (1.177) therefore indicate that
these infinities can be “hidden” in the bare mass $m^2$ and in a multiplicative field
renormalization factor $Z$.
From the above considerations, it appears crucial that Σ has divergences only in its
0th and 1st order Taylor coefficients and Γ4 only in the 0th order, in order to be able to
absorb the divergences by a proper definition of m2 , Z and λ. A simple dimensional
argument gives plausibility to this assertion (of which Weinberg’s theorem provides a
more rigorous justification). Let us assume that we scale up all the internal momenta
of a graph by some factor ξ. In doing this, a graph G with nV vertices and nI internal
lines will scale as
$$G \sim \xi^{D n_L - 2 n_I} , \tag{1.178}$$
• ω(G) < 0 : The graph may be finite, or may contain a divergent subgraph.
More precisely, the convergence theorem states that a graph G is finite if
ω(G) < 0, and the degrees of divergence of all its subgraphs are negative as
well. Of course, subgraphs do not always satisfy this condition. But in the
renormalization process, the divergent subgraphs will have been dealt with at
an earlier stage since they occur at a lower order of the perturbative expansion.
The superficial degree of divergence signals all the n-point functions that may have
ultraviolet divergences of their own (as opposed to being divergent because of a
divergent subgraph). Using eqs. (1.139), ω(G) can be rewritten in the following way
$$\omega(G) = 4 - n_E + (D - 4)\, n_L . \tag{1.179}$$
• 2-point: up to $\Lambda^2$
• 4-point: up to $\log(\Lambda)$
Note also that if we differentiate a graph with respect to the invariant norm P2 of one
of its external momenta, we get
$$\omega\Big(\frac{\partial G}{\partial P^2}\Big) = 2 - n_E + (D - 4)\, n_L . \tag{1.180}$$
(ω further decreases by two units with each additional derivative with respect to
$P^2$.) Therefore, the momentum derivative $\Sigma'(P^2)$ of the 2-point function has $\omega = 0$
in $D = 4$, and its higher derivatives all have $\omega < 0$. The fact that only $\Gamma_4(m^2_{\rm phys})$,
$\Sigma(m^2_{\rm phys})$ and $\Sigma'(m^2_{\rm phys})$ have $\omega \ge 0$ is the reason why it is possible to get rid of all
the divergences of this theory (in 4 dimensions) by a redefinition of the parameters of
the Lagrangian. This theory is said to be renormalizable.
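The power counting above is easy to tabulate. The sketch below (hypothetical function names) evaluates eq. (1.179) and cross-checks it against the exponent $D n_L - 2 n_I$ of eq. (1.178) combined with the identities (1.139); identifying ω(G) with that exponent is an assumption made here, since the definition of ω is not shown in this excerpt.

```python
def superficial_degree(n_external, n_loops, dim=4):
    """Eq. (1.179): omega(G) = 4 - n_E + (D - 4) n_L."""
    return 4 - n_external + (dim - 4) * n_loops

def superficial_degree_from_counting(n_vertices, n_external, dim=4):
    """Exponent D n_L - 2 n_I of eq. (1.178), with n_I and n_L from eqs. (1.139)."""
    n_internal = (4 * n_vertices - n_external) // 2
    n_loops = n_internal - n_vertices + 1
    return dim * n_loops - 2 * n_internal

if __name__ == "__main__":
    # In D = 4 the result depends only on the number of external lines.
    for n_e in (0, 2, 4, 6):
        print(f"{n_e}-point function in D=4: omega = {superficial_degree(n_e, n_loops=1)}")
```

In four dimensions only the 0-, 2- and 4-point functions have $\omega \ge 0$, whatever the loop order, which is the counting that underlies the renormalizability statement above.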
that 0-point functions (vacuum graphs) have a superficial degree of divergence equal to 4, indicating that
they may contain up to quartic divergences ∼ Λ4 .
become divergent at some loop order. These theories are usually25 non renormalizable.
One may think of introducing, as they become necessary, additional operators in the
Lagrangian with a coupling constant adjusted to cancel the new divergences that arise
at a given loop order. However, an infinite number of such parameters would need
to be introduced, thereby reducing drastically the predictive power of this type of
theory26 .
As we have seen, the renormalizability of a field theory depends both on the
interaction terms it contains, and on the dimensionality of space-time. In fact, a
simpler equivalent criterion is the mass dimension of the coupling constant in front of
the interaction term:
• dim = 0 : renormalizable,
For instance, the “coupling constant” $m^2$ in front of the mass term always has a mass
dimension equal to two, and this term is therefore super-renormalizable. In contrast,
the coupling constant λ in front of a $\phi^4$ interaction has a mass dimension $4 - D$, and
is (super)renormalizable in dimensions less than or equal to four.
$$\mathcal{L} = \frac{1}{2}\, \partial_\mu\phi_b\, \partial^\mu\phi_b - \frac{1}{2}\, m_b^2\, \phi_b^2 - \frac{\lambda_b}{4!}\, \phi_b^4 , \tag{1.181}$$
(here we denote φb , mb and λb the bare field, mass and coupling, to stress that they
are not the physical ones) as the sum of a renormalized Lagrangian and a correction:
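$$\mathcal{L} = \mathcal{L}_r + \Delta\mathcal{L} ,$$
$$\mathcal{L}_r \equiv \frac{1}{2}\, \partial_\mu\phi_r\, \partial^\mu\phi_r - \frac{1}{2}\, m_r^2\, \phi_r^2 - \frac{\lambda_r}{4!}\, \phi_r^4 ,$$
$$\Delta\mathcal{L} \equiv \frac{1}{2}\, \Delta_Z\, \partial_\mu\phi_r\, \partial^\mu\phi_r - \frac{1}{2}\, \Delta_m\, \phi_r^2 - \frac{1}{4!}\, \Delta_\lambda\, \phi_r^4 . \tag{1.182}$$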
25 It may happen that an internal symmetry, such as a gauge symmetry, renders a function finite while its
where they approximate below a certain cutoff a more fundamental –possibly unknown– theory supposedly
valid above the cutoff.
$$\Delta_Z = Z - 1 , \qquad \Delta_m = Z\, m_b^2 - m_r^2 , \qquad \Delta_\lambda = Z^2 \lambda_b - \lambda_r . \tag{1.183}$$
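The counterterms (1.183) follow from a one-line rewriting of the bare Lagrangian. As a sketch, assuming the usual rescaling $\phi_b \equiv \sqrt{Z}\,\phi_r$ between the bare and renormalized fields (this relation is not shown in the present excerpt), eq. (1.181) becomes

```latex
\begin{align*}
\mathcal{L}
 &= \tfrac{1}{2}\,Z\,\partial_\mu\phi_r\,\partial^\mu\phi_r
   -\tfrac{1}{2}\,Z\,m_b^2\,\phi_r^2
   -\tfrac{1}{4!}\,Z^2\lambda_b\,\phi_r^4 \\
 &= \mathcal{L}_r
   +\tfrac{1}{2}\,(Z-1)\,\partial_\mu\phi_r\,\partial^\mu\phi_r
   -\tfrac{1}{2}\,\big(Z\,m_b^2-m_r^2\big)\,\phi_r^2
   -\tfrac{1}{4!}\,\big(Z^2\lambda_b-\lambda_r\big)\,\phi_r^4 \, ,
\end{align*}
```

and comparing the second line with eq. (1.182) term by term gives the three coefficients $\Delta_Z$, $\Delta_m$ and $\Delta_\lambda$.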
The terms in ∆L are treated as a perturbation to Lr , and one may introduce extra
Feynman rules for the various terms it contains:
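$\frac{1}{2}\, \Delta_Z\, \partial_\mu\phi_r\, \partial^\mu\phi_r - \frac{1}{2}\, \Delta_m\, \phi_r^2 \;\to\;$ [diagram: 2-point counterterm vertex on a line of momentum $P$] $= -i\, \big(\Delta_Z P^2 + \Delta_m\big)$
$-\frac{1}{4!}\, \Delta_\lambda\, \phi_r^4 \;\to\;$ [diagram: 4-point counterterm vertex] $= -i\, \Delta_\lambda$ (1.184)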
At tree level, only the term Lr is used, and by construction the physical quantities
computed at this order will depend only on physical parameters. Higher orders
involve divergent loop corrections. The counterterms ∆Z , ∆m , ∆λ should be adjusted
at every order to cancel the new divergences that arise at this order. In particular,
after having included the contribution of the counterterms, the self-energy $\Sigma(P^2)$ is
usually required to satisfy the following conditions$^{27}$:
With this choice, it is not necessary to dress the external lines with the self-energy in
the LSZ reduction formulas for transition amplitudes. Indeed, the renormalization
conditions (1.185) imply that
For each external line, the reduction formula contains an operator $i(\square_x + m_r^2)$ acting
on the corresponding external propagator. If this propagator is dressed, this gives
Therefore, all the terms are zero except the first one, and we can ignore self-energy
corrections on the external lines.
27 Strictly speaking, the only requirement is that the counterterms cancel the infinities, which does not fix
their finite part uniquely. Various renormalization schemes are possible, which differ in how these finite parts
are chosen.
The actual proof of renormalizability is more complicated than what this superficial
discussion based on power counting may suggest. Indeed, a crucial aspect is to show
that the divergences can be removed via the subtraction of local terms only, i.e. that
the divergences are polynomial in the external momenta. While this is trivial in all
the one-loop graphs we have considered, it is not obviously true beyond one-loop. As
an illustration, let us consider the following example of a two-loop contribution to the
4-point function:
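[Figure: three graphs labelled A, B and C. Graph A is a two-loop contribution to the 4-point function, with external points 1, 2, 3, 4 and loop momenta $k$ and $l$.]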
In the graph A, the loop we have represented with a thicker line is divergent, and is
multiplied by a non-polynomial function of P2 ≡ (p1 + p2 )2 coming from the rest
of the graph. The Feynman rules give the following integrand for this graph:
$$I_A = \frac{(-i\lambda)^3}{2}\; G^0_F(k)\, G^0_F(k-P)\; G^0_F(l)\, G^0_F(l+k+p_3) . \tag{1.188}$$
The superficial degree of divergence of the integration over l is ω(A; l) = 0 (at
fixed k), and therefore the boldface loop is logarithmically divergent. The diagram B
consists in subtracting from this loop a polynomial in its external momenta, whose
degree is precisely equal to its superficial degree of divergence. Since ω(A; l) = 0,
the subtraction is the zeroth order of the Taylor expansion of that loop (underlined in
the following equation):
$$I_{A+B} = \frac{(-i\lambda)^3}{2}\; G^0_F(k)\, G^0_F(k-P)\, \Big[ G^0_F(l)\, G^0_F(l+k+p_3) - \underline{G^0_F(l)\, G^0_F(l)} \Big] . \tag{1.189}$$
Now, the degree of divergence in l of the combination inside the square brackets
is ω(A + B; l) = −1, and the integration over l is therefore convergent in four
dimensions. After the momentum l has been integrated out, we are left with a function
of k whose behaviour is k0 , up to logarithms, whose integral is thus divergent. Since
the degree of divergence in k is ω(A + B; k) = 0, this overall divergence can again
be removed by subtracting the zeroth order of the Taylor expansion with respect to
the external momenta, i.e.
$$I_{A+B+C} = \frac{(-i\lambda)^3}{2}\, \Big\{ G^0_F(k)\, G^0_F(k-P)\, \Big[ G^0_F(l)\, G^0_F(l+k+p_3) - G^0_F(l)\, G^0_F(l) \Big] - G^0_F(k)\, G^0_F(k)\, \Big[ G^0_F(l)\, G^0_F(l+k) - G^0_F(l)\, G^0_F(l) \Big] \Big\} . \tag{1.190}$$
After these two successive subtractions, we have obtained a function whose integral
on both k and l is completely finite. Moreover, at each step, we have subtracted
only quantities that are polynomial in the external momenta of the corresponding
loop (with a degree equal to the superficial degree of divergence of the loop). This
recursive procedure for constructing a subtracted integrand is known as
Bogoliubov-Parasiuk-Hepp-Zimmermann (BPHZ) renormalization.
n = 2s + 1 . (1.191)
That the Pauli matrices (up to a factor 2) are generators of the Lie algebra of rotations
can be seen from
$$\big[J_i, J_j\big] = i\, \epsilon_{ijk}\, J_k \qquad\text{with}\qquad J_i \equiv \frac{\sigma_i}{2} . \tag{1.194}$$
This idea can be extended to quantum field theory in order to encompass all the
Lorentz transformations rather than just the spatial rotations. We are therefore seeking
a dimension 2 representation of the commutation relations (1.10). Firstly, let us
assume that we know a set of four n × n matrices γµ that satisfy the following
anti-commutation relation:
$$\big\{\gamma^\mu, \gamma^\nu\big\} = 2\, g^{\mu\nu}\, \mathbf{1}_{n\times n} . \tag{1.195}$$
Such matrices are called Dirac matrices. From these matrices, it is easy to check that
the matrices
$$M^{\mu\nu} \equiv \frac{i}{4}\, \big[\gamma^\mu, \gamma^\nu\big] \tag{1.196}$$
form an n-dimensional representation of the Lorentz algebra. However, an exhaustive
search indicates that the smallest matrices that fulfill eqs. (1.195) (in four space-time
dimensions, i.e. for µ, ν = 0, · · · , 3) are 4 × 4. Several unitarily equivalent choices
exist for these matrices. A possible representation (known as the Weyl or chiral
representation) is the following28
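$$\gamma^0 \equiv \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} , \qquad \gamma^i \equiv \begin{pmatrix} 0 & \sigma^i \\ -\sigma^i & 0 \end{pmatrix} . \tag{1.197}$$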
In this representation, the generators for the boosts and for the rotations are
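$$M^{0i} = -\frac{i}{2} \begin{pmatrix} \sigma^i & 0 \\ 0 & -\sigma^i \end{pmatrix} , \qquad M^{ij} = \frac{1}{2}\, \epsilon^{ijk} \begin{pmatrix} \sigma^k & 0 \\ 0 & \sigma^k \end{pmatrix} . \tag{1.198}$$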
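The algebraic statements of eqs. (1.195)-(1.198) can be verified mechanically. The following sketch builds the Weyl-representation matrices of eq. (1.197) from nested Python lists (no external library is assumed) and checks the anti-commutation relation, the explicit form of the generators, and the Hermiticity property quoted later in eq. (1.205). All helper names are hypothetical.

```python
def matmul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def add(a, b):
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def dagger(a):
    """Hermitian conjugate of a square matrix."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def block(a, b, c, d):
    """4x4 matrix [[a, b], [c, d]] assembled from 2x2 blocks."""
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

ZERO2 = [[0, 0], [0, 0]]
ONE2 = [[1, 0], [0, 1]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]  # Pauli matrices
GAMMA = [block(ZERO2, ONE2, ONE2, ZERO2)] + [
    block(ZERO2, s, scale(-1, s), ZERO2) for s in SIGMA]            # eq. (1.197)
METRIC = [1, -1, -1, -1]                                             # diagonal of g^{mu nu}
IDENTITY4 = block(ONE2, ZERO2, ZERO2, ONE2)

def commutator(a, b):
    return add(matmul(a, b), scale(-1, matmul(b, a)))

def clifford_ok():
    """Check {gamma^mu, gamma^nu} = 2 g^{mu nu} 1, eq. (1.195)."""
    for mu in range(4):
        for nu in range(4):
            anti = add(matmul(GAMMA[mu], GAMMA[nu]), matmul(GAMMA[nu], GAMMA[mu]))
            expected = scale(2 * METRIC[mu] if mu == nu else 0, IDENTITY4)
            if not close(anti, expected):
                return False
    return True

def lorentz_generator(mu, nu):
    """M^{mu nu} = (i/4) [gamma^mu, gamma^nu], eq. (1.196)."""
    return scale(0.25j, commutator(GAMMA[mu], GAMMA[nu]))

def generators_ok():
    """Compare M^{0i} and M^{ij} with the explicit blocks of eq. (1.198)."""
    for i, s in enumerate(SIGMA, start=1):
        boost = scale(-0.5j, block(s, ZERO2, ZERO2, scale(-1, s)))
        if not close(lorentz_generator(0, i), boost):
            return False
    for i, j, k in [(1, 2, 3), (2, 3, 1), (3, 1, 2)]:
        rotation = scale(0.5, block(SIGMA[k - 1], ZERO2, ZERO2, SIGMA[k - 1]))
        if not close(lorentz_generator(i, j), rotation):
            return False
    return True

def hermiticity_ok():
    """Check (gamma^mu)^dagger = gamma^0 gamma^mu gamma^0."""
    return all(close(dagger(g), matmul(GAMMA[0], matmul(g, GAMMA[0]))) for g in GAMMA)
```

Calling `clifford_ok()`, `generators_ok()` and `hermiticity_ok()` exercises the three properties in turn. Since the checks only use the matrices themselves, the same functions could be reused with any other explicit representation of the Dirac matrices, except for `generators_ok()`, whose expected blocks are specific to the Weyl representation.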
$$U_{1/2}(\Lambda) \equiv \exp\Big( -\frac{i}{2}\, \omega_{\mu\nu}\, M^{\mu\nu} \Big) . \tag{1.199}$$
A Dirac spinor is a 4-component field ψ(x) that transforms as follows:
In other words, the matrix $U_{1/2}$ defines how the four components of this field
transform under a Lorentz transformation (since these four components mix, $\psi(x)$ is
not the juxtaposition of four scalar fields). The fact that the lowest dimension for the
Dirac matrices is 4 indicates that the spinor $\psi(x)$ describes two spin-1/2 particles: a
particle and its antiparticle, that are distinct from each other.
Let us now determine an equation of motion obeyed by this field, such that it is
invariant under Lorentz transformations. Since the Mµν ’s act only on the Dirac
indices, a trivial answer could be the Klein-Gordon equation,
$$\big(\square_x + m^2\big)\, \psi(x) = 0 . \tag{1.201}$$
28 Although it is sometimes convenient to have an explicit representation of the Dirac matrices, most
manipulations only rely on the fact that they obey the anti-commutation relations (1.195).
But there is in fact a stronger equation that remains invariant when ψ is transformed
according to eq. (1.200). Notice first that
$$U^{-1}_{1/2}(\Lambda)\, \gamma^\mu\, U_{1/2}(\Lambda) = \Lambda^\mu{}_\nu\, \gamma^\nu . \tag{1.202}$$
This equation indicates that rotating the Dirac indices of γµ with U1/2 is equivalent
to transforming the µ index as one would do for a normal 4-vector. Using this identity,
we can check that under the same Lorentz transformation we have
$$\big(i\gamma^\mu\partial_\mu - m\big)\, \psi(x) \;\to\; U_{1/2}(\Lambda)\, \big(i\gamma^\mu\partial_\mu - m\big)\, \psi(\Lambda^{-1}x) . \tag{1.203}$$
is Lorentz invariant. This equation implies the Klein-Gordon equation (to see it, apply
the operator $i\gamma^\mu\partial_\mu + m$ on the left), and is therefore stronger.
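The step from the first-order equation to the Klein-Gordon equation can be written out explicitly. As a sketch, taking the first-order equation to be $(i\gamma^\mu\partial_\mu - m)\,\psi = 0$ (the form suggested by eq. (1.203)) and using only the anti-commutation relation (1.195) together with the symmetry of $\partial_\nu\partial_\mu$:

```latex
\begin{align*}
0 = (i\gamma^\nu\partial_\nu + m)\,(i\gamma^\mu\partial_\mu - m)\,\psi
 &= -\big(\gamma^\nu\gamma^\mu\,\partial_\nu\partial_\mu + m^2\big)\,\psi \\
 &= -\big(\tfrac{1}{2}\{\gamma^\nu,\gamma^\mu\}\,\partial_\nu\partial_\mu + m^2\big)\,\psi
  = -\big(\square + m^2\big)\,\psi \, .
\end{align*}
```

The cross terms proportional to $m\,\gamma^\mu\partial_\mu$ cancel, and each of the four components of $\psi$ therefore obeys eq. (1.201).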
The Dirac matrices are not Hermitean. Instead, they satisfy
$$\big(\gamma^\mu\big)^\dagger = \gamma^0\, \gamma^\mu\, \gamma^0 . \tag{1.205}$$
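$$U^\dagger_{1/2}(\Lambda) = \exp\Big( \frac{i}{2}\, \omega_{\mu\nu}\, (M^{\mu\nu})^\dagger \Big) = \gamma^0 \exp\Big( \frac{i}{2}\, \omega_{\mu\nu}\, M^{\mu\nu} \Big)\, \gamma^0 = \gamma^0\, U^{-1}_{1/2}(\Lambda)\, \gamma^0 . \tag{1.206}$$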
The solutions $u(p)$ and $v(p)$ each form a 2-dimensional linear space, and it is
customary to denote a basis by $u_s(p)$ and $v_s(p)$ (the index $s$, that takes two values
$s = \pm$, is interpreted as the two spin states for a spin 1/2 particle). A convenient
normalization of the base vectors is
where we have introduced the notation $\slashed{p} \equiv p_\mu \gamma^\mu$.
one would find a Hamiltonian which is not bounded from below. The resolution
of this paradox is that the commutation relation (1.212) is incorrect, and should be
replaced by an anti-commutation relation,
$$\big\{\psi_a(x), \psi^\dagger_b(y)\big\}_{x^0 = y^0} = \delta(\mathbf{x} - \mathbf{y})\, \delta_{ab} , \tag{1.214}$$
which leads to anti-commutation relations for the creation and annihilation operators
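$$\big\{a_{r\mathbf{p}}, a^\dagger_{s\mathbf{q}}\big\} = \big\{b_{r\mathbf{p}}, b^\dagger_{s\mathbf{q}}\big\} = (2\pi)^3\, 2E_{\mathbf p}\; \delta(\mathbf{p} - \mathbf{q})\, \delta_{rs} . \tag{1.215}$$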
(All other combinations are zero.) These anti-commutation relations imply that the
square of creation operators is zero, which means that it is not possible to have two
particles with the same momentum and spin in a quantum state. This is nothing
but the Pauli exclusion principle. This is the simplest example of the spin-statistics
theorem, which states that half-integer spin particles must obey Fermi statistics.
From eq. (1.213), we obtain the following expression for the free Feynman propagator
of the Dirac field29
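$$S^0_F(x,y) \equiv \big\langle 0\big|\, \underbrace{\theta(x^0-y^0)\, \psi_a(x)\overline{\psi}_b(y) - \theta(y^0-x^0)\, \overline{\psi}_b(y)\psi_a(x)}_{T(\psi_a(x)\overline{\psi}_b(y))}\, \big|0\big\rangle = \int \frac{d^4p}{(2\pi)^4}\; e^{-ip\cdot(x-y)}\; \underbrace{\frac{i\,(\slashed{p} + m)}{p^2 - m^2 + i0^+}}_{S^0_F(p)} . \tag{1.216}$$
$S^0_F(p) =$ [diagram: oriented fermion line carrying the momentum $p$] . (1.217)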
The LSZ reduction formula for transition amplitudes with fermions and/or anti-
fermions in the initial and final states reads:
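$$\big\langle \underbrace{\mathbf{q}\sigma\; \bar{\mathbf{q}}\bar{\sigma} \cdots}_{n \text{ particles}}{}_{\rm out} \big| \underbrace{\mathbf{p}s\; \bar{\mathbf{p}}\bar{s} \cdots}_{m \text{ particles}}{}_{\rm in} \big\rangle \doteq \Big(\frac{i}{Z^{1/2}}\Big)^{m+n} \int d^4x\; e^{-ip\cdot x} \int d^4\bar{x}\; e^{-i\bar{p}\cdot\bar{x}} \cdots \int d^4y\; e^{+iq\cdot y} \int d^4\bar{y}\; e^{+i\bar{q}\cdot\bar{y}} \cdots$$
$$\qquad\qquad \times\; \overline{v}_{\bar{s}}(\bar{\mathbf{p}})\, \big(i\, \overrightarrow{\slashed{\partial}}_{\bar{x}} - m\big) \;\; \overline{u}_{\sigma}(\mathbf{q})\, \big(-i\, \overrightarrow{\slashed{\partial}}_{y} + m\big) \cdots$$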
where we give examples for fermions and anti-fermions (indicated by a bar over the
momentum and spin), both for the initial and final states. Besides the requirement
that the external lines of the Feynman graphs should be amputated, this formula leads
29 We have introduced a minus sign in the definition of the time-ordered product of Dirac fields. One
would have to mimic the derivation of section 1.6 in order to see that this is the propagator that naturally
appears in the generating functional for the amplitudes with fermions.
Note that when writing the expression corresponding to a given Feynman graph, the
fermion lines it contains must be read in the direction opposite to the arrow carried by
the lines.
The best known spin-1 particle is the photon. In classical electrodynamics, the electric
field E and magnetic field B obey Maxwell’s equations,
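$$\nabla\cdot\mathbf{E} = \rho ,$$
$$\nabla\times\mathbf{B} - \partial_t \mathbf{E} = \mathbf{J} ,$$
$$\nabla\times\mathbf{E} + \partial_t \mathbf{B} = 0 ,$$
$$\nabla\cdot\mathbf{B} = 0 , \tag{1.219}$$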
written here in terms of charge density ρ and current J. The local conservation of
electrical charge implies the following continuity equation
∂t ρ + ∇ · J = 0 . (1.220)
The last two Maxwell’s equations are automatically satisfied if we write the E, B
fields in terms of potentials V and A,
E ≡ ∂t A + ∇V , B ≡ −∇ × A . (1.221)
This representation is not unique, since E and B are unchanged if we transform the
potentials as follows:
V → V + ∂t χ , A → A − ∇χ , (1.222)
where χ is an arbitrary function of space and time. Eq. (1.222) is called a (Abelian)
gauge transformation. Quantities that do not change under (1.222) are said to be
gauge invariant. For instance, the electrical and magnetic fields are invariant.
($F^{\mu\nu}$ is called the field strength.) Recalling that $\partial^\mu = (\partial_t, -\nabla)$, gauge transformations
take the following form
\[ A^\mu \to A^\mu + \partial^\mu\chi , \tag{1.224} \]
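\[ E^i = F^{0i} , \qquad B^i = \tfrac{1}{2}\,\epsilon_{ijk}F^{jk} . \tag{1.225} \]
\[ J^\mu \equiv (\rho, \boldsymbol{J}) , \tag{1.226} \]
the first two Maxwell's equations and the continuity equation read
\[ \mathcal{L} \equiv -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + J_\mu A^\mu . \tag{1.229} \]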
Because of the term $J_\mu A^\mu$ that couples the potential to the sources, this Lagrangian
density is not gauge invariant, but the action (integral of L over all space-time) is,
provided that the current is conserved (i.e. satisfies the continuity equation). Indeed,
we have
\[
\int d^4x\; J_\mu A^\mu \;\to\; \int d^4x\; J_\mu\,(A^\mu + \partial^\mu\chi)
= \int d^4x\; J_\mu A^\mu - \int d^4x\;\chi\,\underbrace{\partial^\mu J_\mu}_{0} + \text{boundary term} . \tag{1.230}
\]
(The boundary term is zero if we assume that there are no sources at infinity.)
Let us illustrate this procedure in Coulomb gauge30 . Firstly, let us decompose the
vector potential Ai into longitudinal and transverse components:
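\[ A^i = A^i_\parallel + A^i_\perp , \tag{1.233} \]
with
\[ A^i_\parallel \equiv \frac{\partial^i\partial^j}{\partial^2}A^j , \qquad A^i_\perp \equiv \Big(\delta^{ij} - \frac{\partial^i\partial^j}{\partial^2}\Big)A^j . \tag{1.234} \]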
30 One may start from another gauge condition, and follow a similar line of reasoning in order to derive a
quantized theory of the photon field in another gauge. However, as we shall see later, we can make the
gauge fixing much more transparent by using functional quantization.
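\[
\begin{aligned}
\mathcal{L} ={}& \frac{1}{2}(\partial_t A^i_\perp)(\partial_t A^i_\perp) - \frac{1}{2}(\partial_j A^i_\perp)(\partial_j A^i_\perp) + \frac{1}{2}(\partial_i A^0)(\partial_i A^0) \\
& + \underline{(\partial_t A^i_\perp)(\partial_i A^0)} + \underline{\frac{1}{2}(\partial_i A^j_\perp)(\partial_j A^i_\perp)} + J^0 A^0 - J^i A^i_\perp .
\end{aligned} \tag{1.235}
\]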
Note that the two underlined terms will vanish in the action, after an integration by
parts (thanks to the transversality of $A^i_\perp$). The Euler-Lagrange equation for the field
$A^0$ is
\[ \partial^2 A^0 = J^0 , \tag{1.236} \]
i.e. the Poisson equation with source term $J^0$. Note that this equation has no time
derivative. Therefore, $A^0$ reflects instantaneously the changes of the charge density
$J^0$ (this does not contradict special relativity, since $A^0$ is not an observable; only $\boldsymbol{E}$
and $\boldsymbol{B}$ are). Ignoring all the terms that would vanish in the action upon integration by
parts, we may thus rewrite the Lagrangian as
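\[ \mathcal{L} = \frac{1}{2}(\partial_t A^i_\perp)(\partial_t A^i_\perp) - \frac{1}{2}(\partial_j A^i_\perp)(\partial_j A^i_\perp) - J^i A^i_\perp + \frac{1}{2}\,J^0\frac{1}{\partial^2}J^0 , \tag{1.237} \]
and obtain the following Euler-Lagrange equation of motion for the field $A^i_\perp$:
\[ \square\, A^i_\perp = -\Big(\delta^{ij} - \frac{\partial^i\partial^j}{\partial^2}\Big)J^j , \tag{1.238} \]
i.e. a massless Klein-Gordon equation with the transverse projection of the charge
current as source term.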
In this form, electrodynamics has no redundant degrees of freedom, and can now
be quantized in the vacuum ($J^0 = J^i = 0$) in the canonical way. Firstly, we define the
momentum conjugated to $A^i_\perp$,
\[ \Pi^i_\perp(x) \equiv \frac{\delta\mathcal{L}}{\delta\,\partial_t A^i_\perp(x)} = \partial_t A^i_\perp(x) . \tag{1.239} \]
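Then, we promote $A^i_\perp$ and $\Pi^i_\perp$ to quantum operators, and we impose on them the
following canonical equal-time commutation relations,
\[
\begin{aligned}
\big[A^i_\perp(x), \Pi^j_\perp(y)\big]_{x^0=y^0} &= i\,\Big(\delta^{ij} - \frac{\partial^i\partial^j}{\partial^2}\Big)\,\delta(\boldsymbol{x}-\boldsymbol{y}) , \\
\big[A^i_\perp(x), A^j_\perp(y)\big]_{x^0=y^0} &= \big[\Pi^i_\perp(x), \Pi^j_\perp(y)\big]_{x^0=y^0} = 0 .
\end{aligned} \tag{1.240}
\]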
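(In the first of these relations, the transverse projector in the right hand side follows
from the fact that $A^i_\perp$ and $\Pi^j_\perp$ are both transverse.) These commutation relations can
be realized by decomposing $A^i_\perp$ on a basis of solutions of the Klein-Gordon equation,
i.e. plane waves:
\[
A^i_\perp(x) \equiv \sum_{\lambda=1,2}\int\frac{d^3p}{(2\pi)^3\,2|\boldsymbol{p}|}\,\Big[\epsilon^i_\lambda(\boldsymbol{p})\,a^\dagger_{\lambda\boldsymbol{p}}\,e^{+ip\cdot x} + \epsilon^{i*}_\lambda(\boldsymbol{p})\,a_{\lambda\boldsymbol{p}}\,e^{-ip\cdot x}\Big] , \tag{1.241}
\]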
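where the two vectors $\epsilon^i_{1,2}(\boldsymbol{p})$ are polarization vectors orthogonal to $\boldsymbol{p}$,
\[ \boldsymbol{p}\cdot\boldsymbol{\epsilon}_\lambda(\boldsymbol{p}) = 0 . \tag{1.242} \]
In 3 spatial dimensions, a basis of such vectors has two elements, that we have labeled
with $\lambda = 1, 2$. In addition, it is convenient to normalize the polarization vectors as
follows
\[ \boldsymbol{\epsilon}_\lambda(\boldsymbol{p})\cdot\boldsymbol{\epsilon}^*_{\lambda'}(\boldsymbol{p}) = \delta_{\lambda\lambda'} , \qquad \sum_{\lambda=1,2}\epsilon^{i*}_\lambda(\boldsymbol{p})\,\epsilon^j_\lambda(\boldsymbol{p}) = \delta^{ij} - \frac{p^ip^j}{\boldsymbol{p}^2} . \tag{1.243} \]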
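The orthogonality condition (1.242) and the completeness relation (1.243) can be verified numerically for any chosen momentum. The following is a minimal sketch using real (linear) polarization vectors; the helper names `polarization_vectors` and `transverse_projector` and the sample momentum are assumptions made for illustration.

```python
import numpy as np


def polarization_vectors(p):
    """Return two real unit vectors orthogonal to p and to each other."""
    p = np.asarray(p, dtype=float)
    p_hat = p / np.linalg.norm(p)
    # Seed Gram-Schmidt with the coordinate axis least aligned with p.
    seed = np.eye(3)[np.argmin(np.abs(p_hat))]
    e1 = seed - np.dot(seed, p_hat) * p_hat
    e1 = e1 / np.linalg.norm(e1)
    e2 = np.cross(p_hat, e1)
    return e1, e2


def transverse_projector(p):
    """Return the matrix delta^{ij} - p^i p^j / p^2."""
    p = np.asarray(p, dtype=float)
    return np.eye(3) - np.outer(p, p) / np.dot(p, p)


p = np.array([0.3, -1.2, 2.0])          # arbitrary sample momentum
e1, e2 = polarization_vectors(p)

# Left hand side of the completeness relation: sum over the two polarizations.
polarization_sum = np.outer(e1.conj(), e1) + np.outer(e2.conj(), e2)
P = transverse_projector(p)
```

With these definitions, `polarization_sum` agrees with `P`, and `P` is idempotent and annihilates `p`, as expected of a projector on the plane transverse to the momentum.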
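With this choice, the commutation relations of eqs. (1.240) are equivalent to the
following commutation relations between creation and annihilation operators:
\[
\begin{aligned}
\big[a_{\lambda\boldsymbol{p}}, a_{\lambda'\boldsymbol{q}}\big] &= \big[a^\dagger_{\lambda\boldsymbol{p}}, a^\dagger_{\lambda'\boldsymbol{q}}\big] = 0 , \\
\big[a_{\lambda\boldsymbol{p}}, a^\dagger_{\lambda'\boldsymbol{q}}\big] &= (2\pi)^3\,2|\boldsymbol{p}|\;\delta_{\lambda\lambda'}\,\delta(\boldsymbol{p}-\boldsymbol{q}) .
\end{aligned} \tag{1.244}
\]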
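With these formulas, it is easy to derive the LSZ reduction formulas for photons in
the initial and final states,
\[
\begin{aligned}
\big\langle\underbrace{\boldsymbol{q}\lambda'\cdots}_{n\text{ photons}}{}_{\rm out}\big|\underbrace{\boldsymbol{p}\lambda\cdots}_{m\text{ photons}}{}_{\rm in}\big\rangle \doteq{}& \Big(\frac{i}{Z^{1/2}}\Big)^{m+n}\int d^4x\; e^{-ip\cdot x}\,\epsilon^{i*}_\lambda(\boldsymbol{p})\,\square_x\cdots \\
&\times\int d^4y\; e^{+iq\cdot y}\,\epsilon^j_{\lambda'}(\boldsymbol{q})\,\square_y\cdots\;\big\langle 0_{\rm out}\big|T\,A^i_\perp(x)A^j_\perp(y)\cdots\big|0_{\rm in}\big\rangle .
\end{aligned} \tag{1.246}
\]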
The free Feynman propagator of the photon (in Coulomb gauge) can be read off the
quadratic part of the Lagrangian (1.237). In momentum space, it reads
\[
G^{0\,ij}_F(p) = \text{[diagram: photon line carrying momentum $p$, with indices $i$ and $j$ at its endpoints]} = \frac{i\,\Big(\delta^{ij} - \frac{p^ip^j}{\boldsymbol{p}^2}\Big)}{p^2 + i0^+} . \tag{1.247}
\]
The operator $\epsilon^i_\lambda(\boldsymbol{p})\,\square_x$ in the reduction formula simply amputates the external photon
line to which it is applied31 . Transition amplitudes with incoming and outgoing
photons are therefore given by amputated graphs, with a polarization vector contracted
to the Lorentz index of each external photon.
So far, we have derived a quantized field theory for spin 1/2 fermions and a quantized
field theory of photons (in the absence of charged sources), but they appear as
unrelated constructions. The next step is to combine the two into a quantum theory of
charged fermions that interact electromagnetically via photon exchanges.
Firstly, note that the fermion Lagrangian is invariant under the following transformation
of the fermion field
\[ \psi \to \Omega^\dagger\psi , \tag{1.248} \]
where $\Omega$ is a phase (i.e. an element of the group $U(1)$), provided that we consider
only rigid transformations (i.e. independent of the space-time point $x$). By Noether's
theorem (see section 1.2.4), this continuous symmetry corresponds to the existence
of a conserved current,
\[ J^\mu = \bar\psi\,\gamma^\mu\,\psi . \tag{1.249} \]
\[ \partial_\mu J^\mu = 0 . \tag{1.250} \]
31 Note that
\[ \Big(\delta^{ij} - \frac{p^ip^j}{\boldsymbol{p}^2}\Big)\,\epsilon^j_\lambda(\boldsymbol{p}) = \epsilon^i_\lambda(\boldsymbol{p}) . \]
Therefore, the transverse projectors attached to the external photon lines can be dropped.
The physical interpretation of this current emerges from the spatial integral of the
time component $J^0$,
\[ Q \equiv \int d^3x\; J^0(x) . \tag{1.251} \]
Using the Fourier mode decomposition (1.213) of the spinor $\psi(x)$, we obtain the
following expression:
\[
\begin{aligned}
Q &= \sum_{s=\pm}\int\frac{d^3p}{(2\pi)^3\,2E_p}\,\Big\{a_{sp}\,a^\dagger_{sp} + b^\dagger_{sp}\,b_{sp}\Big\} \\
&= \sum_{s=\pm}\int\frac{d^3p}{(2\pi)^3\,2E_p}\,\Big\{b^\dagger_{sp}\,b_{sp} - a^\dagger_{sp}\,a_{sp}\Big\} + \text{(infinite) constant} .
\end{aligned} \tag{1.252}
\]
Thus, the operator $Q$ counts the number of particles created by $b^\dagger$ minus the number
of particles created by $a^\dagger$. If we assign a charge $+1$ to the former and $-1$ to the latter,
we can interpret $Q$ as the operator that measures the total charge in the system.
analogy with the non-Abelian gauge theories that we will study later.
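Equivalently, the problem comes from the fact that the derivative $\partial_\mu\psi$ does not
transform in the same way as $\psi$ itself when $\Omega$ depends on $x$. Instead, we have
\[ \partial_\mu\psi \to \partial_\mu\big(\Omega^\dagger\psi\big) = \Omega^\dagger\,\partial_\mu\psi + (\partial_\mu\Omega^\dagger)\,\psi . \tag{1.256} \]
But we see that the second term can be connected to the variation of a photon field
under the same transformation. This suggests that the combination $(\partial_\mu - iA_\mu)\psi$ has
a simpler transformation law:
\[
\begin{aligned}
\big(\partial_\mu - iA_\mu\big)\psi &\to \Big(\partial_\mu - i\big[\Omega^\dagger A_\mu\Omega + i\,\Omega^\dagger\partial_\mu\Omega\big]\Big)\,\Omega^\dagger\psi \\
&= \Omega^\dagger\big(\partial_\mu - iA_\mu\big)\psi + \Omega^\dagger\underbrace{\big[\Omega(\partial_\mu\Omega^\dagger) + (\partial_\mu\Omega)\Omega^\dagger\big]}_{\partial_\mu(\Omega\Omega^\dagger)=0}\psi .
\end{aligned} \tag{1.257}
\]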
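The transformation law (1.257) can be checked numerically in one dimension. The sketch below assumes $\Omega = e^{i\theta(x)}$, for which the photon field transformation reduces to $A \to A - \partial_x\theta$; the test functions `theta`, `psi`, `A` and the finite-difference helper are arbitrary choices made for illustration.

```python
import numpy as np


def derivative(f, x, h=1e-5):
    """Central finite-difference approximation of df/dx."""
    return (f(x + h) - f(x - h)) / (2.0 * h)


# Arbitrary smooth test functions (assumptions for illustration only).
def theta(x):
    return 0.7 * np.sin(x)


def psi(x):
    return np.exp(-x**2) * (1.0 + 1j * x)


def A(x):
    return np.cos(2.0 * x)


def omega_dag(x):
    return np.exp(-1j * theta(x))       # Omega^dagger for Omega = exp(i theta)


def psi_new(x):
    return omega_dag(x) * psi(x)        # psi -> Omega^dagger psi


def A_new(x):
    # A -> Omega^dagger A Omega + i Omega^dagger dOmega = A - dtheta/dx
    return A(x) - derivative(theta, x)


def covariant_derivative(field, potential, x):
    return derivative(field, x) - 1j * potential(x) * field(x)


x = np.linspace(-2.0, 2.0, 41)
lhs = covariant_derivative(psi_new, A_new, x)           # transformed (d - iA) psi
rhs = omega_dag(x) * covariant_derivative(psi, A, x)    # Omega^dagger (d - iA) psi
# The ordinary derivative alone does not transform covariantly:
plain_mismatch = np.max(np.abs(derivative(psi_new, x) - omega_dag(x) * derivative(psi, x)))
```

Up to finite-difference error, `lhs` and `rhs` coincide, while `plain_mismatch` is of order one: the extra term $(\partial_\mu\Omega^\dagger)\psi$ of eq. (1.256) is compensated only when the photon field is transformed at the same time.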
This observation is the basis of (Abelian) gauge theories: the minimal change to the
Dirac Lagrangian that makes it locally gauge invariant introduces a coupling $\bar\psi A_\mu\psi$
between two fermion fields and a spin-1 field such as the photon. The complete
Lagrangian of this theory therefore reads:
\[ \mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \bar\psi\,\big(i\slashed{D} - m\big)\,\psi . \tag{1.258} \]
We already know the Feynman rules for the photon and fermion propagators, and the
prescription for external photon and fermion lines. The only additional Feynman rule
is the following interaction vertex,
\[ \text{[diagram: photon line with Lorentz index $\mu$ attached to a fermion line]} = -i\gamma^\mu , \tag{1.259} \]
parameter e that represents the (bare) electrical charge of the electron, which leads to
the following changes:
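\[
\begin{aligned}
\text{Covariant derivative :}\quad & D_\mu \equiv \partial_\mu - i\,e\,A_\mu \\
\text{Gauge transformation of the photon :}\quad & A^\mu \to \Omega^\dagger A^\mu\Omega + \frac{i}{e}\,\Omega^\dagger\partial^\mu\Omega \\
\text{Electrical current :}\quad & e\,\bar\psi\gamma^\mu\psi \\
\text{Photon-electron vertex :}\quad & -i\,e\,\gamma^\mu .
\end{aligned} \tag{1.260}
\]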
\[ Q\,|0\rangle = 0 . \tag{1.261} \]
When acting on a 1-particle state $|\alpha_{\boldsymbol{p}}\rangle$, $Q$ gives another state with the same 4-momentum,
and therefore the same invariant mass. But since single particle states are
separated from states with a higher occupancy in the spectral function of the theory,
$Q\,|\alpha_{\boldsymbol{p}}\rangle$ must in fact be proportional to $|\alpha_{\boldsymbol{p}}\rangle$ itself,
In other words, 1-particle states are eigenvectors of the charge operator. Since $Q$ is
Lorentz invariant, the eigenvalue $q_{\alpha,\boldsymbol{p}}$ cannot depend on the momentum $\boldsymbol{p}$ (nor on
the spin state of the particle), and it can only depend on the species of particle $\alpha$. We
will thus denote it $q_\alpha$, and call it the electrical charge of the particle of type $\alpha$.
In theories with 1-particle states that do not correspond to the fundamental fields
of the Lagrangian (e.g. composite bound states made of several elementary particles),
one may go a bit further. The canonical anti-commutation relations imply
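\[ \big[J^0(x), \psi(y)\big]_{x^0=y^0} = -e\,\psi(x)\,\delta(\boldsymbol{x}-\boldsymbol{y}) , \qquad \big[Q, \psi(y)\big] = -e\,\psi(y) . \tag{1.263} \]
where $n_+$ is the number of $\psi$'s in $F$ and $n_-$ the number of $\psi^\dagger$'s. If we evaluate this
identity between the vacuum and a 1-particle state $|\alpha_{\boldsymbol{p}}\rangle$, we obtain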
\[ \partial^\mu - i\,e_r\,A^\mu_r . \tag{1.267} \]
Since the field $A^\mu_r$ is related to the bare photon field $A^\mu_b$ by
\[ A^\mu_b = Z_3^{1/2}\,A^\mu_r , \tag{1.268} \]
the bare and renormalized charges must be related by
\[ e_b = Z_3^{-1/2}\,e_r . \tag{1.269} \]
In combination with eq. (1.266), this means that the charges of all 1-particle states are
renormalized by the same factor $Z_3^{-1/2}$, regardless of the species of particle contained
in the state. For this to work, cancellations between various Feynman graphs are
necessary. These cancellations are a consequence of the local gauge invariance of the
theory, and in their simplest form they can be encapsulated in the Ward-Takahashi
identities, that we shall derive now.
invariance.
where only electromagnetic currents appear inside the T-product, and all the external
charged particles are kept in the initial and final states α and β (and are therefore
on-shell).
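Let us contract the Lorentz index $\mu_1$ with the momentum $q_1^{\mu_1}$ of the first photon.
After an integration by parts, this reads
\[
\begin{aligned}
q_{1,\mu_1}\,M^{\mu_1\mu_2\cdots}(q_1, q_2, \cdots) ={}& -i\int d^4x_1\,d^4x_2\cdots\; e^{-iq_1\cdot x_1}\,e^{-iq_2\cdot x_2}\cdots \\
&\times\big\langle 0_{\rm out}\big|\,\partial_{\mu_1}T\,J^{\mu_1}(x_1)J^{\mu_2}(x_2)\cdots\big|0_{\rm in}\big\rangle .
\end{aligned} \tag{1.271}
\]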
The derivative of the T-product involves two types of terms: (i) terms where the
derivative acts directly on the current $J^{\mu_1}(x_1)$, that are zero thanks to current conservation,
and (ii) terms where it acts on the theta functions that order the times inside
the T-product. With two currents, the latter term reads34
\[ \frac{\partial}{\partial x^\mu}\,T\,J^\mu(x)J^\nu(y) = \delta(x^0-y^0)\,\big[J^0(x), J^\nu(y)\big] = 0 . \tag{1.272} \]
This generalizes to more than two currents, and we therefore have quite generally
The same property would hold for all the external photon lines of the amplitude. This
equation is known as the Ward-Takahashi identity.
A consequence of eq. (1.273) is that QED transition amplitudes are unchanged if
the photon propagators or polarization vectors are modified by terms proportional to
the momentum $p^\mu$,
This is precisely the modification of the Feynman rules one would encounter by using
a different gauge fixing in the quantization of the theory. Thus, the Ward-Takahashi
identities imply the gauge invariance of the transition amplitudes in QED.
34 This step of the argument would fail if we had kept charged field operators inside the T-product,
because their equal-time commutator with J0 is non-zero. Therefore, the Ward-Takahashi identities are
valid provided all the external charged particles are on-shell, but there is no such requirement for the neutral
external particles (e.g. the photons).
1.15.1 Introduction
Until now, our discussion of the symmetry of a theory has been limited to a study of
its Lagrangian or Hamiltonian, and we have tacitly assumed that the symmetry of
the Lagrangian implies that the physics of this system exhibits the symmetry under
consideration to its full extent. However, strictly speaking, a symmetric Lagrangian
only implies that the corresponding equations of motion are symmetric, i.e. that
a symmetry transformation applied to a solution of the equations of motion gives
another solution. In other words, the symmetry of the Lagrangian implies that the set
of the solutions of the equations of motion is symmetric, not that every individual
solution is symmetric. A spontaneously broken symmetry is a symmetry of the
Lagrangian which is not realized by the ground state.
Let us first recall a standard result of quantum mechanics, that on the surface
seems to forbid the possibility of non-symmetric ground states. Consider a quantum
system of Hamiltonian $H$, which is also invariant under a discrete symmetry $R$ such
that $R^2 = 1$ (such as a mirror symmetry). The Hamiltonian commutes with the
symmetry generator,
\[ \big[R, H\big] = 0 , \tag{1.275} \]
In order to see how this result is circumvented in quantum field theory, let us
consider a simple explicit realization of this situation by a potential made of two
infinite wells centered at $\phi = \pm\phi_*$, mirror symmetric with respect to $\phi = 0$. Let us
denote $H_0$ the Hamiltonian of this system. Each of the wells has its own ground state,
that we denote $|0_+\rangle$ and $|0_-\rangle$, respectively. They are degenerate in energy, transform
into one another by the action of $R$, and have a vanishing overlap,
\[ R\,|0_+\rangle = |0_-\rangle , \qquad \langle 0_+|0_-\rangle = 0 . \tag{1.276} \]
Then, we introduce a perturbation $V$, also mirror symmetric, such that the energy
barrier between the two wells becomes finite (this interaction acts as a kind of
coupling between the two wells). With this perturbation, we have
Let us now return to the case of a continuous symmetry. When the volume is infinite,
a ground state $|v\rangle$ is characterized by the fact that it is an eigenstate of the momentum
$P^i$ with a null eigenvalue37
\[ \boldsymbol{P}\,|v\rangle = 0 . \tag{1.278} \]
35 For this conclusion to hold, the matrix elements of $H$ between $|0_\pm\rangle$ and the excited states should
be negligible. Otherwise, the ground state of the perturbed Hamiltonian will be a more general linear
combination of the eigenstates of $H_0$.
36 The non-diagonal matrix elements of $\tilde{V}$ are zero per our assumption that $\tilde{V}$ is odd under $R$.
37 Multiparticle states whose total momentum is zero can be excluded by the fact that they are separated
There is in general a whole set of such states, that we may choose as orthogonal,
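For any matrix element $\langle u|A(x)B(0)|v\rangle$ of the equal-time product of two local
operators, we may insert a complete basis of states in order to get
\[
\begin{aligned}
\big\langle u\big|A(x)B(0)\big|v\big\rangle ={}& \sum_{\text{vacua } w}\big\langle u\big|A(0)\big|w\big\rangle\big\langle w\big|B(0)\big|v\big\rangle \\
&+ \int\frac{d^3p}{(2\pi)^3}\sum_N\big\langle u\big|A(0)\big|N,\boldsymbol{p}\big\rangle\big\langle N,\boldsymbol{p}\big|B(0)\big|v\big\rangle\; e^{-i\boldsymbol{p}\cdot\boldsymbol{x}} ,
\end{aligned} \tag{1.280}
\]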
where we have separated the ground states $|w\rangle$ from the continuum of populated
states $|N,\boldsymbol{p}\rangle$ (the label $N$, possibly continuous, distinguishes all those states that
have the same total momentum $\boldsymbol{p}$). To obtain this relationship, we have used the
translation invariance of the ground states, and the fact that $\boldsymbol{P}$ is the generator of spatial
translations. Since the states $|N,\boldsymbol{p}\rangle$ belong to a continuum of states, the integral on
the second line is smooth enough and vanishes when $|\boldsymbol{x}|\to\infty$ by the Riemann-Lebesgue lemma.
Therefore, we have:
\[ \lim_{|\boldsymbol{x}|\to+\infty}\big\langle u\big|A(x)B(0)\big|v\big\rangle = \sum_{\text{vacua } w}\big\langle u\big|A(0)\big|w\big\rangle\big\langle w\big|B(0)\big|v\big\rangle . \tag{1.281} \]
Causality implies that $A(x)B(0) = B(0)A(x)$ since the separation between the two
points is space-like, so that the matrix elements $\langle u|A(0)|v\rangle$ and $\langle u|B(0)|v\rangle$ may be
viewed as commuting Hermitean matrices, that we can diagonalize simultaneously.
Moreover, since $A$ and $B$ are arbitrary local Hermitean operators, this property
is in fact true for all such operators. By choosing properly the basis of the vacua
when the volume is infinite, all the local Hermitean operators have vanishing matrix
elements between distinct vacua:
Consequently, any local interaction term that breaks the symmetry responsible for the
degeneracy of these vacua is diagonal in this basis. Therefore, it lifts the degeneracy
and promotes one of the states $|v\rangle$ to the status of true ground state of the system
(instead of a symmetric linear combination of the $|v\rangle$'s).
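The competition between tunnelling and a symmetry-breaking term can be illustrated with a two-state toy model in the basis $(|0_+\rangle, |0_-\rangle)$. This is only a sketch under stated assumptions: the 2x2 Hamiltonian, the tunnelling amplitude `delta` and the bias `eta` are hypothetical parameters introduced here, with `delta -> 0` standing in for the infinite-volume limit in which matrix elements between distinct vacua vanish.

```python
import numpy as np


def ground_state(delta, eta, e0=0.0):
    """Lowest eigenpair of a two-state toy Hamiltonian in the basis (|0+>, |0->).

    delta : off-diagonal (tunnelling) amplitude between the two wells
    eta   : diagonal symmetry-breaking bias, +eta on |0+> and -eta on |0->
    Both parameters are hypothetical and serve only to illustrate the argument.
    """
    h = np.array([[e0 + eta, -delta],
                  [-delta, e0 - eta]])
    energies, vectors = np.linalg.eigh(h)   # eigenvalues in ascending order
    return energies[0], vectors[:, 0]


# Finite tunnelling, no bias: the ground state is the symmetric combination.
_, symmetric = ground_state(delta=0.1, eta=0.0)
weights_symmetric = symmetric**2

# Negligible tunnelling, small bias: the ground state sits in a single well.
_, localized = ground_state(delta=1e-6, eta=0.01)
weights_localized = localized**2
```

In the first case the two wells carry equal weight, while in the second case essentially all the weight sits on $|0_-\rangle$, the state favored by the bias.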
\[ \partial_\mu J^\mu_a(x) = 0 . \tag{1.284} \]
\[ \phi_i(x) \to \phi_i(x) + i\,\epsilon_a\,t^a_{ij}\,\phi_j(x) , \tag{1.285} \]
where the $t^a_{ij}$ are the generators of the Lie algebra of the group of transformations,
in the representation where the fields $\phi_i$ live. For the fields $\phi_i$ to be Hermitean, the
numbers $t^a_{ij}$ must be purely imaginary (this would be the case if the $\phi_i$ are in the
adjoint representation of the Lie algebra). From Noether's theorem, the conserved
currents read:
\[ J^\mu_a(x) = \sum_i\frac{\delta\mathcal{L}}{\delta\,\partial_\mu\phi_i(x)}\,\frac{\delta\phi_i(x)}{\delta\epsilon_a} . \tag{1.286} \]
where $\pi_i(x)$ is the canonical momentum associated with $\phi_i(x)$. Since the matrices
$t^a$ have imaginary components, these currents are Hermitean, as well as the charges
$Q_a$. Using the canonical commutation relations,
\[
\begin{aligned}
\big[\phi_i(x), \phi_j(y)\big]_{x^0=y^0} &= 0 , \\
\big[\pi_i(x), \pi_j(y)\big]_{x^0=y^0} &= 0 , \\
\big[\phi_i(x), \pi_j(y)\big]_{x^0=y^0} &= i\,\delta_{ij}\,\delta(\boldsymbol{x}-\boldsymbol{y}) ,
\end{aligned} \tag{1.289}
\]
Using also the commutation relation that defines the Lie bracket,
\[ \big[t^a, t^b\big] = i\,f^{abc}\,t^c , \tag{1.291} \]
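As a concrete instance of purely imaginary generators obeying eq. (1.291), one may take the adjoint representation of SO(3), $(t^a)_{ij} = -i\,\epsilon_{aij}$ with $f^{abc} = \epsilon_{abc}$. The group, the sample field values and the parameter values below are assumptions chosen for illustration; the text itself does not single out a group.

```python
import numpy as np


def levi_civita():
    """Totally antisymmetric tensor eps[a, b, c] in three dimensions."""
    eps = np.zeros((3, 3, 3))
    eps[0, 1, 2] = eps[1, 2, 0] = eps[2, 0, 1] = 1.0
    eps[0, 2, 1] = eps[2, 1, 0] = eps[1, 0, 2] = -1.0
    return eps


def commutator(x, y):
    return x @ y - y @ x


eps = levi_civita()
t = -1j * eps            # t[a] is the matrix (t^a)_{ij} = -i eps_{aij}

# Check [t^a, t^b] = i f^{abc} t^c with f^{abc} = eps_{abc}.
closure_ok = all(
    np.allclose(commutator(t[a], t[b]),
                1j * np.einsum("c,cij->ij", eps[a, b], t))
    for a in range(3) for b in range(3)
)

# Infinitesimal transformation phi_i -> phi_i + i eps_a t^a_{ij} phi_j
# applied to a real field multiplet (sample values).
phi = np.array([0.2, -1.0, 0.5])
epsilon = np.array([1e-3, 2e-3, -1e-3])
delta_phi = 1j * np.einsum("a,aij,j->i", epsilon, t, phi)
```

Because the `t[a]` are purely imaginary, `delta_phi` has no imaginary part: the transformed fields remain real, as required for Hermitean $\phi_i$. In this example the variation is simply the cross product of `phi` with `epsilon`, i.e. an infinitesimal rotation.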
By integrating over the positions $\boldsymbol{x}$ and $\boldsymbol{y}$, this becomes a commutator between the
conserved charges,
\[ \big[Q_a(x^0), Q_b(x^0)\big] = f^{abc}\,Q_c(x^0) . \tag{1.293} \]
In other words, the charges $Q_a(x^0)$ form a real representation of the Lie algebra. In
addition, the commutator between the conserved charges and the field operators is
given by38 :
\[
\begin{aligned}
\big[Q_a(x^0), \phi_i(x)\big] &= i\int d^3y\;\big[\pi_k(y)\,t^a_{kl}\,\phi_l(y), \phi_i(x)\big]_{x^0=y^0} \\
&= i\int d^3y\;(-i)\,\delta(\boldsymbol{x}-\boldsymbol{y})\,\delta_{ki}\,t^a_{kl}\,\phi_l(x) \\
&= t^a_{ij}\,\phi_j(x) .
\end{aligned} \tag{1.294}
\]
Note that the above commutation relations are not affected by the spontaneous
breaking of symmetry, since they follow from the properties of the field operators,
regardless of the nature of the ground state of the system.
The ground state of the system is characterized by the expectation values of the field
operators:
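In order to see whether the ground state is invariant under the action of the symmetry
transformations, let us study the variation of the quantities $\langle\phi_i\rangle$:
\[
\begin{aligned}
\delta\langle\phi_i\rangle &= \big\langle 0\big|\delta\phi_i(x)\big|0\big\rangle \\
&= i\,\epsilon_a\,t^a_{ij}\,\big\langle 0\big|\phi_j(x)\big|0\big\rangle \\
&= i\,\epsilon_a\,\big\langle 0\big|\big[Q_a(x^0), \phi_i(x)\big]\big|0\big\rangle \\
&= i\,\epsilon_a\,\big\langle 0\big|Q_a\,\phi_i(x) - \phi_i(x)\,Q_a\big|0\big\rangle .
\end{aligned} \tag{1.296}
\]
Thus, it is clear that these expectation values are invariant if the ground state is
annihilated by all the generators of the Lie algebra (i.e. if $Q_a|0\rangle = 0$ for all $a$).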
38 Since the charges are conserved, we are free to evaluate them at the same time as the field φi .
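Consider now the expectation value in the ground state of the commutator between
the conserved currents and the field operators:
\[
\begin{aligned}
\big\langle 0\big|\big[J^\mu_a(x), \phi_i(y)\big]\big|0\big\rangle &= \big\langle 0\big|\big[J^\mu_a(x-y), \phi_i(0)\big]\big|0\big\rangle \\
&= \sum_N\int d^4p\;\delta(p-p_N)\,\Big[\big\langle 0\big|J^\mu_a(x-y)\big|N\big\rangle\big\langle N\big|\phi_i(0)\big|0\big\rangle \\
&\qquad\qquad - \big\langle 0\big|\phi_i(0)\big|N\big\rangle\big\langle N\big|J^\mu_a(x-y)\big|0\big\rangle\Big] \\
&= \sum_N\int d^4p\;\delta(p-p_N)\,\Big[\big\langle 0\big|J^\mu_a(0)\big|N\big\rangle\big\langle N\big|\phi_i(0)\big|0\big\rangle\,e^{ip\cdot(x-y)} \\
&\qquad\qquad - \big\langle 0\big|\phi_i(0)\big|N\big\rangle\big\langle N\big|J^\mu_a(0)\big|0\big\rangle\,e^{ip\cdot(y-x)}\Big] .
\end{aligned} \tag{1.297}
\]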
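In the second line, we have summed over a complete set of states $|N\rangle$, arranged
according to their 4-momentum $p_N$. We have also used the translation invariance
of the ground state, and the properties of states with a definite momentum under
translations. If we define
\[
\begin{aligned}
i\,F^\mu_{a,i}(p) &\equiv (2\pi)^3\sum_N\delta(p-p_N)\,\big\langle 0\big|J^\mu_a(0)\big|N\big\rangle\big\langle N\big|\phi_i(0)\big|0\big\rangle \\
i\,\tilde{F}^\mu_{a,i}(p) &\equiv (2\pi)^3\sum_N\delta(p-p_N)\,\big\langle 0\big|\phi_i(0)\big|N\big\rangle\big\langle N\big|J^\mu_a(0)\big|0\big\rangle ,
\end{aligned} \tag{1.298}
\]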
we have
\[ F^\mu_{a,i}(p) = -\big(\tilde{F}^\mu_{a,i}(p)\big)^* , \tag{1.299} \]
since $J^\mu_a$ and $\phi_i$ are Hermitean. Moreover, Lorentz invariance implies that these
objects have the following form:
\[
\begin{aligned}
F^\mu_{a,i}(p) &= p^\mu\,\theta(p^0)\,\rho_{a,i}(p^2) , \\
\tilde{F}^\mu_{a,i}(p) &= p^\mu\,\theta(p^0)\,\tilde\rho_{a,i}(p^2) ,
\end{aligned} \tag{1.300}
\]
where $\rho_{a,i}$ and $\tilde\rho_{a,i}$ are functions (so far unspecified) depending only on the invariant
$p^2$. The factor $\theta(p^0)$ follows from the fact that the physical states $|N\rangle$ have a positive
energy. Then, by inserting a unit factor given by
\[ 1 = \int ds\;\delta(p^2-s) , \tag{1.301} \]
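we obtain
\[ \big\langle 0\big|\big[J^\mu_a(x), \phi_i(y)\big]\big|0\big\rangle = -\partial^\mu_x\int ds\,\Big[\rho_{a,i}(s)\,\Delta(x-y;s) + \tilde\rho_{a,i}(s)\,\Delta(y-x;s)\Big] , \tag{1.302} \]
where we denote
\[ \Delta(x-y;s) \equiv \int\frac{d^4p}{(2\pi)^4}\;2\pi\,\theta(p^0)\,\delta(p^2-s)\;e^{ip\cdot(x-y)} . \tag{1.303} \]
Therefore,
\[ \big\langle 0\big|\big[J^\mu_a(x), \phi_i(y)\big]\big|0\big\rangle = -\partial^\mu_x\int ds\,\Big[\rho_{a,i}(s) + \tilde\rho_{a,i}(s)\Big]\,\Delta(y-x;s) \quad\text{if } (x-y)^2 < 0 . \tag{1.306} \]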
Since the commutator in the left hand side vanishes for local operators with a space-like
separation, we get39 :
\[ \rho_{a,i}(s) + \tilde\rho_{a,i}(s) = 0 . \tag{1.307} \]
By applying the derivative $\partial^x_\mu$ to both sides of this equation, and using the Klein-Gordon
equation and the fact that the current $J^\mu_a(x)$ is conserved, we get
\[ 0 = \int ds\; s\,\rho_{a,i}(s)\,\Big[\Delta(x-y;s) - \Delta(y-x;s)\Big] , \tag{1.309} \]
39 This property, combined with eq. (1.299), implies that $\rho_{a,i}(p^2)$ is real.
which implies
Therefore, $\rho_{a,i}(s) = 0$ for all $s \neq 0$, and the only possible support of $\rho_{a,i}(s)$ is
localized at $s = 0$ (in the form of a delta function so that integrals over $s$ are non-zero).
Let us now show that it is not possible that $\rho_{a,i}(s)$ be identically zero everywhere
(including at $s = 0$) when the symmetry is spontaneously broken. By setting $\mu = 0$
and $x^0 = y^0$, and using eq. (1.303), we obtain
\[
\begin{aligned}
\big\langle 0\big|\big[J^0_a(x), \phi_i(y)\big]\big|0\big\rangle_{x^0=y^0} &= 2i\int ds\,d^4p\;\sqrt{\boldsymbol{p}^2+s}\;\rho_{a,i}(s)\;e^{ip\cdot(x-y)}\,\delta(p^2-s) \\
&= i\,\delta(\boldsymbol{x}-\boldsymbol{y})\int ds\;\rho_{a,i}(s) .
\end{aligned} \tag{1.311}
\]
Then, we can integrate over $\boldsymbol{x}$ and use the commutation relation (1.294) in order to
get
\[ t^a_{ij}\,\langle\phi_j\rangle = i\int ds\;\rho_{a,i}(s) . \tag{1.312} \]
Thus, the functions $\rho_{a,i}(s)$ that have a non-zero integral are in one-to-one correspondence
with the non-zero $t^a_{ij}\langle\phi_j\rangle$, i.e. with the fact that the ground state is not
invariant under the action of some of the symmetry generators. When this happens,
we must have
This equation is the essence of Goldstone's theorem. Note now that $\rho_{a,i}$ is a spectral
function similar to the one defined in section 1.9. Therefore, the presence of
a $\delta(s)$ in this function signals the existence of a one-particle state with zero mass
in the sum of eq. (1.298) (multiparticle states with a null total momentum would
produce a continuum extending down to $s = 0$ rather than a delta function). Moreover,
this result indicates that there are as many such massless particles (called Nambu-Goldstone
modes) as there are symmetries broken by the ground state.
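The counting of massless modes can be illustrated at the classical level with a model that is not discussed in the text: $N$ real scalar fields with the O(N)-symmetric potential $V = \frac{\lambda}{4}(\phi\cdot\phi - v^2)^2$. This choice of potential, the function names and the parameter values are assumptions made for the sketch; it only checks the tree-level mass matrix, not the non-perturbative statement proved above.

```python
import numpy as np


def hessian(phi, lam=1.0, v=1.0):
    """Matrix of second derivatives of V = lam/4 (phi.phi - v^2)^2."""
    phi = np.asarray(phi, dtype=float)
    n = phi.size
    return lam * (phi @ phi - v**2) * np.eye(n) + 2.0 * lam * np.outer(phi, phi)


def count_massless_modes(n, lam=1.0, v=1.0, tol=1e-12):
    """Number of zero eigenvalues of the mass matrix at the minimum (0,...,0,v)."""
    vacuum = np.zeros(n)
    vacuum[-1] = v
    masses_squared = np.linalg.eigvalsh(hessian(vacuum, lam, v))
    return int(np.sum(np.abs(masses_squared) < tol))


def broken_generators(n):
    """dim O(N) - dim O(N-1): generators that do not leave the minimum invariant."""
    return n * (n - 1) // 2 - (n - 1) * (n - 2) // 2


counts = {n: (count_massless_modes(n), broken_generators(n)) for n in range(2, 7)}
```

For each $N$ in this range, the number of vanishing eigenvalues of the mass matrix equals $N-1$, which is also the number of generators of O(N) that do not annihilate the chosen minimum.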
Finally, let us note that the state $\phi_i(0)|0\rangle$ is invariant under rotations, which
implies that the matrix element $\langle N|\phi_i(0)|0\rangle$ is zero unless the state $|N\rangle$ has a
vanishing helicity. Thus, only spin 0 particles can contribute to the $\delta(s)$ in the non-zero
spectral functions. Moreover, $\langle 0|J^0_a(0)|N\rangle$ vanishes for any state $|N\rangle$ whose
quantum numbers differ from those of $J^0_a$. Thus, the Nambu-Goldstone modes are
spin-0 particles that have the same internal quantum numbers as $J^0_a$.
In a unitary field theory, the S matrix is a unitary operator on the space of physical
states:
SS† = S† S = 1 . (1.315)
This property means that for a properly normalized initial physical state $|\alpha_{\rm in}\rangle$, we
have
\[ \sum_{\text{states }\beta}\big|\langle\beta_{\rm out}|\alpha_{\rm in}\rangle\big|^2 = 1 , \tag{1.316} \]
where the sum includes only physical states. In other words, in any interaction process,
the state α must evolve with probability one into other physical states. In general, one
subtracts from the S-matrix the identity operator, that corresponds to the absence of
interactions, and one writes:
S ≡ 1 + iT . (1.317)
1 = (1 + iT )(1 − iT † ) = 1 + iT − iT † + TT † , (1.318)
or equivalently
−i(T − T † ) = TT † . (1.319)
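Let us now take the expectation value of this identity in the state $|\alpha_{\rm in}\rangle$, and insert the
identity operator written as a complete sum over physical states between $T$ and $T^\dagger$ in
the right hand side. This leads to:
\[ -i\,\big\langle\alpha_{\rm in}\big|T - T^\dagger\big|\alpha_{\rm in}\big\rangle = \sum_{\text{states }\beta}\big|\langle\alpha_{\rm in}|T|\beta_{\rm in}\rangle\big|^2 . \tag{1.320} \]
\[ {\rm Im}\,\big\langle\alpha_{\rm in}\big|T\big|\alpha_{\rm in}\big\rangle = \frac{1}{2}\sum_{\text{states }\beta}\big|\langle\alpha_{\rm in}|T|\beta_{\rm in}\rangle\big|^2 . \tag{1.321} \]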
This identity is known as the optical theorem. It implies that the total probability to
scatter from the state α to any state β equals twice the imaginary part of the forward
transition amplitude α → α.
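Since eq. (1.321) relies only on the unitarity of $S$, it can be checked on any finite-dimensional unitary matrix. The sketch below builds a random unitary `s` as the exponential of a Hermitian matrix; the dimension, the random seed and the variable names are arbitrary choices made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 6                                            # size of the toy state space

# Random Hermitian generator h, and unitary S = exp(i h) via its eigenbasis.
m = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
h = (m + m.conj().T) / 2.0
w, u = np.linalg.eigh(h)
s = u @ np.diag(np.exp(1j * w)) @ u.conj().T

# Define T through S = 1 + iT.
t = -1j * (s - np.eye(n))

# Optical theorem, one entry per initial state alpha:
lhs = t.diagonal().imag                          # Im <alpha|T|alpha>
rhs = 0.5 * np.sum(np.abs(t)**2, axis=1)         # (1/2) sum_beta |<alpha|T|beta>|^2
```

The two arrays `lhs` and `rhs` agree entry by entry, for every choice of the initial state.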
Eq. (1.321) is valid to all orders in the interactions. But as we shall see it also
manifests itself in some properties of the perturbative expansion. Let us first consider
as an example a scalar field theory, with a cubic interaction in $-\frac{i}{3!}\lambda\phi^3(x)$.
Firstly, decompose the free Feynman propagator in two terms, depending on the
ordering between the times at the two endpoints:
\[ G^0_{-+}(x,y) \equiv \big\langle 0_{\rm in}\big|\phi_{\rm in}(x)\phi_{\rm in}(y)\big|0_{\rm in}\big\rangle , \qquad G^0_{+-}(x,y) \equiv \big\langle 0_{\rm in}\big|\phi_{\rm in}(y)\phi_{\rm in}(x)\big|0_{\rm in}\big\rangle . \tag{1.323} \]
where the indices $\epsilon_i = \pm$ indicate the type of the $i$-th vertex. The usual
Feynman rules thus correspond to the function $G_{++\cdots}$. These generalized integrands
are constructed according to the following rules:
\[
\begin{aligned}
+\text{ vertex :}\quad & -i\lambda , \\
-\text{ vertex :}\quad & +i\lambda , \\
\text{Propagator from } \epsilon \text{ to } \epsilon' \text{ :}\quad & G^0_{\epsilon\epsilon'}(x,y) .
\end{aligned} \tag{1.326}
\]
Let us assume that the $i$-th vertex carries the largest time among all the vertices
of the graph. Since $x_i^0$ is larger than all the other times, the propagator that
connects this vertex to an adjacent vertex of type $\epsilon$ at the position $x$ is given by
In other words, this propagator depends only on the type ǫ of the neighboring vertex,
but not on the type of the i-th vertex. Therefore, we have
where the notation [±i ] indicates that the i-th vertex has type + or − (the types of the
vertices not written explicitly are the same in the two terms, but otherwise arbitrary).
This identity, known as the largest time equation, follows from eq. (1.327) and from
the sign change when a vertex changes from + to −.
A similar identity also applies to the sum extended to all the possible assignments
of the + and − indices:
\[ \sum_{\{\epsilon_i=\pm\}} G_{\epsilon_1\epsilon_2\cdots}(x_1, x_2, \cdots) = 0 . \tag{1.329} \]
This is obtained by pairing the terms and using eq. (1.328). It is crucial to observe
that this identity is now valid for any ordering of the times at the vertices of the graph.
Therefore, it is also valid in momentum space after a Fourier transform. If we isolate
the two terms where all the vertices are of type + or all of type −, this also reads
\[ G_{++\cdots} + G_{--\cdots} = -\sum_{\{\epsilon_i=\pm\}'} G_{\epsilon_1\epsilon_2\cdots} , \tag{1.330} \]
where the symbol $\{\epsilon_i = \pm\}'$ indicates the set of all the vertex assignments, except
$++\cdots$ and $--\cdots$.
Using eq. (1.119),
\[ G^0_{++}(x,y) = \int\frac{d^3p}{(2\pi)^3\,2E_p}\,\Big[\theta(x^0-y^0)\,e^{-ip\cdot(x-y)} + \theta(y^0-x^0)\,e^{+ip\cdot(x-y)}\Big] , \tag{1.331} \]
and comparing with eq. (1.324), we can read off the following representations for
$G^0_{-+}$ and $G^0_{+-}$,
\[
\begin{aligned}
G^0_{-+}(x,y) &= \int\frac{d^3p}{(2\pi)^3\,2E_p}\;e^{-ip\cdot(x-y)} \\
G^0_{+-}(x,y) &= \int\frac{d^3p}{(2\pi)^3\,2E_p}\;e^{+ip\cdot(x-y)} .
\end{aligned} \tag{1.332}
\]
Likewise, we obtain
\[ G^0_{--}(x,y) = \int\frac{d^3p}{(2\pi)^3\,2E_p}\,\Big[\theta(x^0-y^0)\,e^{+ip\cdot(x-y)} + \theta(y^0-x^0)\,e^{-ip\cdot(x-y)}\Big] , \tag{1.333} \]
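\[
\begin{aligned}
G^0_{++}(p) &= \frac{i}{p^2 - m^2 + i0^+} , \\
G^0_{--}(p) &= \frac{-i}{p^2 - m^2 - i0^+} = \big[G^0_{++}(p)\big]^* , \\
G^0_{-+}(p) &= 2\pi\,\theta(+p^0)\,\delta(p^2 - m^2) , \\
G^0_{+-}(p) &= 2\pi\,\theta(-p^0)\,\delta(p^2 - m^2) .
\end{aligned} \tag{1.335}
\]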
Therefore, the momentum space Feynman rules for the − sector are the complex
conjugate of those for the + sector, since we have also +iλ = (−iλ)∗ . Note that for
this assertion to be true, it is crucial that the coupling constant λ be real, which is a
condition for unitarity.
The Fourier transform of an amputated Feynman graph G gives a contribution to a
transition amplitude (recall the LSZ reduction formula), i.e. a matrix element of the S
If the graph contains $N$ vertices, there are a priori $2^N - 2$ terms in the right hand side
of this equation. However, this number is considerably reduced if we notice that the
+− and −+ propagators can carry energy only in one direction (from the − vertex to
the + vertex), because of the factors θ(±p0 ). This constraint on energy flow forbids
“islands” of vertices of type + surrounded by only type − vertices, or the reverse.
From the LSZ reduction formula (1.82) and the definition (1.120) of the Fourier
transformed propagators, we see that the notation $G_{-+}(p)$ implies a momentum $p$
defined as flowing from the $+$ endpoint to the $-$ endpoint:
\[ G_{-+}(p) = \text{[diagram: propagator carrying momentum $p$ from the $+$ endpoint to the $-$ endpoint]} . \tag{1.337} \]
Thus, the proportionality $G_{-+}(p) \propto \theta(p^0)$ indicates that the energy flows from the
$+$ endpoint to the $-$ endpoint.
Let us consider the example of a very simple 1-loop two-point function40 $\Gamma(p)$,
\[ -i\,\Gamma(p) = \text{[diagram: 1-loop two-point graph with incoming momentum $p$]} . \tag{1.338} \]
Because of the constrained energy flow direction in the propagators $G_{-+}$, $G_{+-}$, if
the momentum $p$ is entering into the graph from the left with $p^0 > 0$, the only
assignments that mix $+$ and $-$ vertices must divide the graph into two connected
subgraphs: a connected part made only of $+$ vertices that comprises the vertex
where $p^0 > 0$ enters in the graph, and a connected part containing only $-$ vertices
comprising the vertex where the energy leaves the graph. For the topology shown in
eq. (1.338), there is only one possibility,
\[ -i\,\Gamma_{+-}(p) = \text{[diagram: the same 1-loop graph with a cut through the loop; the $-$ vertex is circled]} , \tag{1.339} \]
where the vertex of type − is circled in the diagrammatic representation. The division
of the graph into these two subgraphs may be materialized by drawing a line (shown
in gray above) through the graph. This line is called a cut, and the rules for calculating
40 Momentum conservation implies that it depends on a single momentum p.
the value of a graph with a given assignment of $+$ and $-$ vertices are called Cutkosky's
cutting rules. For instance, in the case of the above example, they lead immediately
to the following expression41 for the imaginary part of $\Gamma_{++}$,
\[ {\rm Im}\,\Gamma_{++}(p) = \frac{\lambda^2}{2}\,\frac{1}{2}\int\frac{d^4k}{(2\pi)^4}\;G_{-+}(k)\,G_{-+}(p-k) , \tag{1.340} \]
that can be rewritten as
\[
\begin{aligned}
{\rm Im}\,\Gamma_{++}(p) ={}& \frac{\lambda^2}{4}\int\frac{d^4k_1}{(2\pi)^4}\;2\pi\,\theta(k_1^0)\,\delta(k_1^2 - m^2) \\
&\times\int\frac{d^4k_2}{(2\pi)^4}\;2\pi\,\theta(k_2^0)\,\delta(k_2^2 - m^2)\;(2\pi)^4\,\delta(p - k_1 - k_2) .
\end{aligned} \tag{1.341}
\]
In the right hand side of this equation, we recognize the square of the transition
amplitude $\langle k_1k_{2\,\rm out}|p_{\rm in}\rangle$ (whose value at tree level is simply $\lambda$), integrated over the
(symmetrized) accessible phase-space for a 2-particle final state. We can therefore
view this equation as a perturbative realization of the optical theorem at order $\lambda^2$.
Indeed, at this order, the only states β that may be included in the sum over final
states are 2-particle states42 .
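The two-body phase-space integral in eq. (1.341) can be evaluated in closed form in the rest frame of $p$, giving ${\rm Im}\,\Gamma_{++}(p) = \frac{\lambda^2}{32\pi}\sqrt{1 - 4m^2/p^2}$ above the threshold $p^2 > 4m^2$ and zero below it. This closed form is a standard result worked out here rather than quoted from the text. The sketch below compares it with a direct numerical integration in which the energy-conserving delta function is replaced by a narrow Gaussian; the function names, the Gaussian width and the grid are assumptions.

```python
import numpy as np


def im_gamma_closed_form(s, m, lam):
    """lam^2/(32 pi) sqrt(1 - 4 m^2 / s) above threshold, zero below (s = p^2)."""
    if s <= 4.0 * m**2:
        return 0.0
    return lam**2 / (32.0 * np.pi) * np.sqrt(1.0 - 4.0 * m**2 / s)


def im_gamma_numeric(s, m, lam, sigma=1e-3, k_max=5.0, n=200001):
    """Evaluate eq. (1.341) in the rest frame of p, with a Gaussian-smeared delta.

    After using the 3-momentum delta function, the integral reduces to
    (1/(4 pi)) * int dk (k^2 / E^2) delta(sqrt(s) - 2E), with E = sqrt(k^2 + m^2).
    k_max must be large enough that 2 E(k_max) exceeds sqrt(s).
    """
    k = np.linspace(0.0, k_max, n)
    e = np.sqrt(k**2 + m**2)
    delta = np.exp(-(np.sqrt(s) - 2.0 * e)**2 / (2.0 * sigma**2)) / (sigma * np.sqrt(2.0 * np.pi))
    integrand = k**2 / e**2 * delta / (4.0 * np.pi)
    dk = k[1] - k[0]
    phase_space = np.sum(0.5 * (integrand[1:] + integrand[:-1])) * dk   # trapezoid rule
    return lam**2 / 4.0 * phase_space


closed = im_gamma_closed_form(s=9.0, m=1.0, lam=2.0)
numeric = im_gamma_numeric(s=9.0, m=1.0, lam=2.0)
```

For these sample values the two evaluations agree to well below one percent, and the closed form vanishes below the two-particle threshold, where no on-shell 2-particle final state is accessible.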
The considerations developed on this example can be generalized to the 2-point
function at any loop order. We can write
\[ {\rm Im}\,\Gamma_{++}(p) = \frac{1}{2}\sum_{\text{cuts }\gamma}\big(i\,\Gamma_\gamma(p)\big) , \tag{1.342} \]
where the sum is now reduced to a sum over all the possible cuts (with the $+$
vertices left of the cut and the $-$ vertices right of the cut). As an illustration, let us
consider the following 2-loop example, for which three cuts are possible:
[diagrams: a 2-loop two-point graph shown with each of its three possible cuts]
At this order, several contributions to the right hand side of eq. (1.321) start to appear:
the central cut corresponds to a 3-body final state, while the other two cuts correspond
to an interference between the tree level and the 1-loop correction to a 2-body decay.
41 The first factor 1/2 comes from eq. (1.336), and the second 1/2 is the symmetry factor of the graph for
a scalar loop. In the formula for Im Γ++ , it has the interpretation of the factor that symmetrizes a 2-particle
final state.
42 This result is consistent with the formula (1.100) for a decay rate, if we note that the decay rate $\Gamma$ of a
particle is related to the imaginary part of the corresponding self-energy by $\Gamma = {\rm Im}\,\Gamma_{++}(p)/E_p$. This
can be seen as follows: after resumming the self-energy $\Gamma_{++}(p)$ on the propagator, the imaginary part
makes it decay as $G_{++}(x,y) \sim \exp\!\big(-({\rm Im}\,\Gamma_{++})\,|x^0 - y^0|/2E_p\big)$, and the particle density, quadratic in the
field operator, decays as the square of the propagator.
1.16.3 Fermions
In the case of spin 1/2 fermions, the propagators connecting the various types of
vertices are given by
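\[
\begin{aligned}
S^0_{++}(p) &= \frac{i(\slashed{p}+m)}{p^2-m^2+i0^+}\; ,\\
S^0_{--}(p) &= \frac{-i(\slashed{p}+m)}{p^2-m^2-i0^+}\; ,\\
S^0_{-+}(p) &= 2\pi\,(\slashed{p}+m)\,\theta(+p^0)\,\delta(p^2-m^2)\; ,\\
S^0_{+-}(p) &= 2\pi\,(\slashed{p}+m)\,\theta(-p^0)\,\delta(p^2-m^2)\; .
\end{aligned}
\tag{1.344}
\]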
The cutting rules for fermions are therefore similar to those for scalar particles. The
possibility to interpret the cut fermion propagators in terms of on-shell final state
fermions is a consequence of the following identities:
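\[
\begin{aligned}
\slashed{p}+m &= \sum_{{\rm spin}\ s} u_s(\boldsymbol{p})\,\overline{u}_s(\boldsymbol{p})\; ,\\
\slashed{p}-m &= \sum_{{\rm spin}\ s} v_s(\boldsymbol{p})\,\overline{v}_s(\boldsymbol{p})\; ,
\end{aligned}
\tag{1.345}
\]
that are valid when p0 = √(p² + m²) > 0 (with p² the squared 3-momentum under the square root). In the case of the propagator S0−+ (p), we
may attach the spinor us (p) to the amplitude on the right of the cut, and the spinor
ūs (p) to the amplitude on the left, which are precisely the spinors required by the
LSZ formula for a fermion of momentum p in the final state. In the case of S0+− (p),
for which p0 < 0, we should first write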
1.16.4 Photons
Coulomb gauge : For photons in Coulomb gauge, the reasoning is very similar to
the case of fermions. Firstly, the four different types of propagators read
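\[
\begin{aligned}
G^{0\,ij}_{++}(p) &= \frac{i\left(\delta^{ij}-\frac{p^ip^j}{\boldsymbol{p}^2}\right)}{p^2+i0^+}\; ,\\
G^{0\,ij}_{--}(p) &= \frac{-i\left(\delta^{ij}-\frac{p^ip^j}{\boldsymbol{p}^2}\right)}{p^2-i0^+}\; ,\\
G^{0\,ij}_{-+}(p) &= 2\pi\,\theta(+p^0)\left(\delta^{ij}-\frac{p^ip^j}{\boldsymbol{p}^2}\right)\delta(p^2)\; ,\\
G^{0\,ij}_{+-}(p) &= 2\pi\,\theta(-p^0)\left(\delta^{ij}-\frac{p^ip^j}{\boldsymbol{p}^2}\right)\delta(p^2)\; .
\end{aligned}
\tag{1.347}
\]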
we see that the projector that appears in the cut propagators can be interpreted as the
polarization vectors that should be attached to the amplitudes for each final state photon.
Therefore, the cutting rules in Coulomb gauge have a direct interpretation in terms of
the optical theorem. This simplicity follows from the fact that the only propagating
modes are physical modes in Coulomb gauge.
and transverse: p_µ ǫ^µ_{1,2}(p) = 0. However, the tensor −g^{µν} that appears in the cut
photon propagators cannot be written as a sum over physical polarizations:
\[
-g^{\mu\nu} \neq \sum_{\lambda=1,2}\epsilon^\mu_\lambda(\boldsymbol{p})\,\epsilon^{\nu*}_\lambda(\boldsymbol{p})\; .
\tag{1.353}
\]
\[
\epsilon^\mu_+(\boldsymbol{p}) \equiv \frac{1}{\sqrt{2}}\,(1,0,0,1)\; ,\qquad
\epsilon^\mu_-(\boldsymbol{p}) \equiv \frac{1}{\sqrt{2}}\,(1,0,0,-1)\; ,
\tag{1.355}
\]
In other words, the physical polarization sum in the right hand side of eq. (1.353) is
equal to −gµν , plus some extra terms that are proportional to pµ or pν .
When we use Cutkosky’s cutting rules in order to calculate the imaginary part of
a graph, a cut photon line carrying the momentum pµ leads to an expression that has
the following structure:
\[
i\mathcal{M}_1^\mu(p)\;\big[-g_{\mu\nu}\big]\;\big(i\mathcal{M}_2^\nu(p)\big)^*\; ,
\tag{1.357}
\]
where iM1^µ and iM2^ν are the amplitudes on the left and on the right of the cut,
respectively. Here, we have highlighted only one of the cut photons, and the other
cut lines have not been written explicitly since they do not play any role in the
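43 For an arbitrary momentum p, these polarization vectors read:
\[
\epsilon^\mu_+(\boldsymbol{p}) \equiv \frac{1}{\sqrt{2}\,|\boldsymbol{p}|}\,(p^0,\boldsymbol{p})\; ,\qquad
\epsilon^\mu_-(\boldsymbol{p}) \equiv \frac{1}{\sqrt{2}\,|\boldsymbol{p}|}\,(p^0,-\boldsymbol{p})\; .
\tag{1.354}
\]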
argument. Moreover, only the tensor structure of the cut propagator matters, and we
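have therefore only written the factor −gµν . The above quantity can be rewritten as
\[
\begin{aligned}
& i\mathcal{M}_1^\mu(p)\,\Big[\sum_{\lambda=1,2}\epsilon_{\lambda\mu}(\boldsymbol{p})\,\epsilon^*_{\lambda\nu}(\boldsymbol{p})
-\epsilon_{+\mu}(\boldsymbol{p})\,\epsilon^*_{-\nu}(\boldsymbol{p})
-\epsilon_{-\mu}(\boldsymbol{p})\,\epsilon^*_{+\nu}(\boldsymbol{p})\Big]\,\big(i\mathcal{M}_2^\nu(p)\big)^*\\
&\qquad = i\mathcal{M}_1^\mu(p)\,\Big[\sum_{\lambda=1,2}\epsilon_{\lambda\mu}(\boldsymbol{p})\,\epsilon^*_{\lambda\nu}(\boldsymbol{p})\Big]\,\big(i\mathcal{M}_2^\nu(p)\big)^*\; .
\end{aligned}
\tag{1.358}
\]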
Indeed, the last two terms are zero thanks to the Ward identity satisfied44 by the
amplitudes iM1^µ and iM2^ν :
\[
p_\mu\,\mathcal{M}_1^\mu(p) = p_\nu\,\mathcal{M}_2^\nu(p) = 0\; ,
\tag{1.359}
\]
the calculation of which can be performed with the usual Feynman rules.
However, there is a class of more general problems that cannot be addressed by
this standard perturbation theory. One of the simplest problems of that kind is the
evaluation of the expectation value of the number operator ⟨α in| a†out (p) aout (p) |α in⟩,
that counts the particles of momentum p in the final state, given that the initial state
was the state α. To evaluate this matrix element, one needs to calculate the amplitude
⟨α in| φ(x)φ(y) |α in⟩, that has no time ordering, and where one has in states on both
sides. More generally, one sometimes needs the amplitudes
\[
\big\langle 0_{\rm in}\big|\,\overline{T}\big(\phi(x_1)\cdots\phi(x_n)\big)\;T\big(\phi(y_1)\cdots\phi(y_p)\big)\,\big|0_{\rm in}\big\rangle\; ,
\]
44 When an amplitude has external charged particles, the Ward identity is satisfied only if these particles
are on-shell. This is indeed the case here, because all the cut lines are on-shell, as well as all the incoming
particles.
As we did in the derivation of ordinary perturbation theory, let us first replace each
Heisenberg field operator by its counterpart in the interaction representation, using
eq. (1.63). After some rearrangement of the evolution operators, we get :
Here, we have exploited the fact that the factor U(−∞, +∞) that appears in these
manipulations is the anti-time ordered exponential of the interaction term, in order to
write this formula in a more symmetric way. To go further, it is useful to imagine that
the time axis is in fact a contour C made of two branches labeled + and − running
parallel to the real axis, as illustrated in figure 1.4. This contour is oriented, with
the + branch running in the direction of increasing time, followed by the − branch
running in the direction of decreasing time. Then, it is convenient to introduce a path
ordering, denoted by P and defined as a standard ordering along the contour C. In
more detail, one has
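\[
P\,A(x)B(y) =
\begin{cases}
T\,A(x)B(y) & \text{if } x^0, y^0 \in \mathcal{C}_+\; ,\\
\overline{T}\,A(x)B(y) & \text{if } x^0, y^0 \in \mathcal{C}_-\; ,\\
A(x)B(y) & \text{if } x^0 \in \mathcal{C}_-\, ,\ y^0 \in \mathcal{C}_+\; ,\\
B(y)A(x) & \text{if } x^0 \in \mathcal{C}_+\, ,\ y^0 \in \mathcal{C}_-\; .
\end{cases}
\tag{1.362}
\]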
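The four cases of the path ordering can be sketched as a single sort along the contour. The following is a minimal illustration for bosonic operators only (no sign factors); the tuple representation `(label, time, branch)` and the function names are assumptions made for this example, not notation from the text.

```python
from typing import List, Tuple

# An operator insertion: (label, time, branch), with branch "+" or "-".
Insertion = Tuple[str, float, str]


def contour_position(time: float, branch: str) -> Tuple[int, float]:
    """Sortable position along C: the + branch comes first and runs forward
    in time, the - branch comes second and runs backward in time."""
    if branch == "+":
        return (0, time)
    if branch == "-":
        return (1, -time)
    raise ValueError("branch must be '+' or '-'")


def path_order(insertions: List[Insertion]) -> List[str]:
    """Labels in path-ordered form: the latest point on the contour stands leftmost."""
    ordered = sorted(
        insertions,
        key=lambda op: contour_position(op[1], op[2]),
        reverse=True,
    )
    return [label for label, _, _ in ordered]


# Both times on the + branch: ordinary time ordering (largest time on the left).
same_plus = path_order([("A", 1.0, "+"), ("B", 2.0, "+")])
# Both times on the - branch: anti-time ordering (smallest time on the left).
same_minus = path_order([("A", 1.0, "-"), ("B", 2.0, "-")])
# Mixed branches: the operator on the - branch always stands on the left.
mixed = path_order([("A", 5.0, "+"), ("B", 1.0, "-")])
```

A single sort key reproduces all four cases because the − branch lies entirely after the + branch on the oriented contour.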
One can use this contour ordering to write the previous equations in a much more
compact way. In particular, eq. (1.361) can be generalized into :
i. A single overall path ordering takes care automatically of both the time ordering
and the anti-time ordering contained in the original formula,
ii. For this trick to work, one must (temporarily) assume that the fields on the
upper and lower branch of the contour C are distinct: φ+ and φ− respectively,
iii. The time integration in the exponential is now running over both branches of
the contour C.
The advantage of having introduced this more complicated time contour is that it
leads to expressions that are formally identical to those of ordinary perturbation
theory, provided one replaces the time ordering by the path ordering and provided
one extends the time integration from ℝ to C. In particular, one can first define a
generating functional,
\[
Z^{SK}[j] \equiv \big\langle 0_{\rm in}\big|\,P\,\exp i\int_{\mathcal{C}} d^4x\; j(x)\phi(x)\,\big|0_{\rm in}\big\rangle\; ,
\tag{1.364}
\]
that encodes all the correlators considered in this section, provided the external source
j has distinct values j+ and j− on the two branches of the contour (the superscript SK
is used to distinguish this generating functional from the standard one). As in the case
of Feynman perturbation theory, one can write this generating functional as:
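\[
Z^{SK}[j] = \exp\Big[i\int_{\mathcal{C}} d^4x\;\mathcal{L}_I\Big(\frac{\delta}{i\delta j(x)}\Big)\Big]\;
\underbrace{\big\langle 0_{\rm in}\big|\,P\,\exp i\int_{\mathcal{C}} d^4x\; j(x)\phi_{\rm in}(x)\,\big|0_{\rm in}\big\rangle}_{Z^{SK}_0[j]}\; ,
\tag{1.365}
\]
with
\[
\begin{aligned}
Z^{SK}_0[j] &= \exp\Big[-\frac{1}{2}\int_{\mathcal{C}} d^4x\,d^4y\; j(x)j(y)\;G^0_{\mathcal{C}}(x,y)\Big]\; ,\\
G^0_{\mathcal{C}}(x,y) &\equiv \big\langle 0_{\rm in}\big|\,P\,\phi_{\rm in}(x)\phi_{\rm in}(y)\,\big|0_{\rm in}\big\rangle\; .
\end{aligned}
\tag{1.366}
\]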
The free propagator G0C , defined on the contour C, is a natural extension of the
Feynman propagator (in particular, it coincides with the Feynman propagator if the
two time arguments are on the + branch of the contour). Besides the propagator,
the other change to the perturbative expansion in the Schwinger-Keldysh formalism
is that the time integration at the vertices of a diagram must run over the contour C
instead of the real axis.
The connection with Cutkosky’s cutting rules appears when we break down the
propagator into 4 components G0±± (x, y), depending on whether the times x0 , y0
are on the upper or lower branch of the contour. An explicit calculation of these free
propagators leads to
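\[
\begin{aligned}
G^0_{++}(x,y) &= \int\frac{d^4p}{(2\pi)^4}\; e^{-ip\cdot(x-y)}\;\frac{i}{p^2-m^2+i\epsilon}\; ,\\
G^0_{--}(x,y) &= \int\frac{d^4p}{(2\pi)^4}\; e^{-ip\cdot(x-y)}\;\frac{-i}{p^2-m^2-i\epsilon}\; ,\\
G^0_{+-}(x,y) &= \int\frac{d^4p}{(2\pi)^4}\; e^{-ip\cdot(x-y)}\;2\pi\,\theta(-p^0)\,\delta(p^2-m^2)\; ,\\
G^0_{-+}(x,y) &= \int\frac{d^4p}{(2\pi)^4}\; e^{-ip\cdot(x-y)}\;2\pi\,\theta(+p^0)\,\delta(p^2-m^2)\; .
\end{aligned}
\tag{1.367}
\]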
The time integration on the contour C is also split into two terms, the upper branch
corresponding to a vertex + (−iλ) and the lower branch to a vertex − (+iλ, because
of the minus sign due to integrating from +∞ to −∞).
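As a quick numerical illustration of how the four components are related, one can perform the p0 integrals of eq. (1.367) in closed form for a single mode of energy E = √(p² + m²), which gives functions of t = x0 − y0 only. The sketch below assumes these closed forms (they are standard contour integrals, not quoted from the text) and checks the linear relation G++ + G−− = G+− + G−+ at a few times; the function names are hypothetical.

```python
import cmath


def g_minus_plus(t: float, energy: float) -> complex:
    """theta(+p0) component: exp(-i E t) / (2 E)."""
    return cmath.exp(-1j * energy * t) / (2 * energy)


def g_plus_minus(t: float, energy: float) -> complex:
    """theta(-p0) component: exp(+i E t) / (2 E)."""
    return cmath.exp(+1j * energy * t) / (2 * energy)


def g_plus_plus(t: float, energy: float) -> complex:
    """Time-ordered (Feynman) component: exp(-i E |t|) / (2 E)."""
    return g_minus_plus(t, energy) if t >= 0 else g_plus_minus(t, energy)


def g_minus_minus(t: float, energy: float) -> complex:
    """Anti-time-ordered component: exp(+i E |t|) / (2 E)."""
    return g_plus_minus(t, energy) if t >= 0 else g_minus_plus(t, energy)


def identity_defect(t: float, energy: float) -> float:
    """|G++ + G-- - G+- - G-+|, which should vanish for every t."""
    total = (
        g_plus_plus(t, energy)
        + g_minus_minus(t, energy)
        - g_plus_minus(t, energy)
        - g_minus_plus(t, energy)
    )
    return abs(total)


defects = [identity_defect(t, energy=1.7) for t in (-1.3, 0.0, 0.7)]
```

The relation holds because the time-ordered and anti-time-ordered components are built from the same two Wightman functions, with the step functions summing to one.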
In the Schwinger-Keldysh formalism, the vacuum-vacuum diagrams are simpler
than in conventional perturbation theory. Here, one has
which means that all the connected vacuum-vacuum diagrams are zero. This is due to
the fact that in this formalism one is calculating correlators that have the in- vacuum
on both sides. This cancellation works individually for each diagram topology, and
results from a cancellation between the various ways of assigning the + and − indices
to the vertices of a diagram (a vacuum-vacuum diagram with a fixed assignment
of + and − vertices is not zero in general). This cancellation can be viewed as a
consequence of eq. (1.329).
Relation between the functionals Z[j] and ZSK [j] : There is a useful functional
relation between the generating functional of conventional perturbation theory Z[j],
and that of the Schwinger-Keldysh formalism :
\[
Z^{SK}[j_+,j_-] = \exp\Big[\int d^4x\,d^4y\;G^0_{+-}(x,y)\;\square_x\square_y\;\frac{\delta^2}{\delta j_+(x)\,\delta j_-(y)}\Big]\;Z[j_+]\;Z^*[j_-]\; .
\tag{1.369}
\]
(Here, in order to avoid any confusion, we write explicitly the two components +
and − of the source j in the Schwinger-Keldysh generating functional.) Thanks
to this formula, one can construct diagrams in the Schwinger-Keldysh formalism
by stitching an ordinary Feynman diagram and the complex conjugate of another
Feynman diagram. In order to prove this relation, it is sufficient to establish it for the
free theory, since the interactions are always trivially factorizable (see eqs. (1.107)
and (1.365)).
Chapter 2
Functional quantization
where |q⟩ denotes the eigenstate of the position operator with eigenvalue q. Let us
subdivide the time interval [ti , tf ] into N equal sub-intervals, by introducing:
\[
\Delta \equiv \frac{t_f-t_i}{N}\; ,\qquad t_n \equiv t_i + n\,\Delta\; .
\tag{2.4}
\]
(Therefore, we have t0 = ti and tN = tf .) The time evolution operator can be
factorized as
\[
e^{-iH(t_f-t_i)} = e^{-iH(t_N-t_{N-1})}\times e^{-iH(t_{N-1}-t_{N-2})}\times\cdots\times e^{-iH(t_1-t_0)}\; .
\tag{2.5}
\]
Between the successive factors in the right hand side, we can insert the identity
operator written as a complete sum over the position eigenstates:
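\[
1 = \int_{-\infty}^{+\infty} dq\;\big|q\big\rangle\big\langle q\big|\; ,
\tag{2.6}
\]
\[
\big\langle q_f\big|e^{-iH(t_f-t_i)}\big|q_i\big\rangle = \int\prod_{j=1}^{N-1}dq_j\;
\big\langle q_f\big|e^{-i\Delta H}\big|q_{N-1}\big\rangle\big\langle q_{N-1}\big|e^{-i\Delta H}\big|q_{N-2}\big\rangle\cdots
\big\langle q_1\big|e^{-i\Delta H}\big|q_i\big\rangle\; .
\tag{2.7}
\]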
Note that this formula, illustrated in the figure 2.1, is exact for any value of N. In the
Figure 2.1: Illustration of eq. (2.7) with 10 and 200 intermediate points. The
endpoints are fixed, while the intermediate points are integrated over. The line
segments connecting the points are just a help to guide the eye, but there is no “path”
at this stage.
Hamiltonian (2.1), the kinetic energy and potential energy terms do not commute,
which complicates the evaluation of its exponential. We can remedy this situation by
using the Baker-Campbell-Hausdorff formula, that we shall write here as follows
\[
e^{\Delta(A+B)} = e^{\Delta A}\;e^{\Delta B}\;e^{-\frac{\Delta^2}{2}[A,B]+\mathcal{O}(\Delta^3)}\; .
\tag{2.8}
\]
In the limit ∆ → 0 (i.e. N → ∞), we may neglect the last factor since the product of
all such factors goes to unity1 when N → ∞. Therefore, each elementary factor of
eq. (2.7) is rewritten as
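\[
\begin{aligned}
\big\langle q_{i+1}\big|e^{-i\Delta H}\big|q_i\big\rangle
&\approx \big\langle q_{i+1}\big|e^{-i\Delta\frac{P^2}{2m}}\;e^{-i\Delta V(Q)}\big|q_i\big\rangle\\
&= \int\frac{dp_i}{2\pi}\;\big\langle q_{i+1}\big|e^{-i\Delta\frac{P^2}{2m}}\big|p_i\big\rangle\big\langle p_i\big|e^{-i\Delta V(Q)}\big|q_i\big\rangle\; ,
\end{aligned}
\tag{2.9}
\]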
where we have introduced the identity operator, written this time as a complete sum
over momentum eigenstates:
\[
1 \equiv \int\frac{dp}{2\pi}\;\big|p\big\rangle\big\langle p\big|\; .
\tag{2.10}
\]
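The splitting of the elementary evolution operator used in eq. (2.9) can be tested numerically on a toy problem. The sketch below replaces the kinetic and potential terms by two non-commuting 2×2 Hermitian matrices (the Pauli matrices σx and σz, an arbitrary choice made for this illustration) and measures how far the product of N split factors is from the exact evolution operator.

```python
import numpy as np


def evolution(hermitian: np.ndarray, t: float) -> np.ndarray:
    """exp(-i t H) for a Hermitian matrix H, from its eigen-decomposition."""
    eigenvalues, eigenvectors = np.linalg.eigh(hermitian)
    phases = np.exp(-1j * t * eigenvalues)
    return (eigenvectors * phases) @ eigenvectors.conj().T


# Toy stand-ins for the two non-commuting pieces of the Hamiltonian.
KINETIC = np.array([[0, 1], [1, 0]], dtype=complex)     # sigma_x
POTENTIAL = np.array([[1, 0], [0, -1]], dtype=complex)  # sigma_z


def splitting_error(n_steps: int, total_time: float = 1.0) -> float:
    """Norm of [exp(-i D A) exp(-i D B)]^N - exp(-i T (A+B)), with D = T/N."""
    delta = total_time / n_steps
    one_step = evolution(KINETIC, delta) @ evolution(POTENTIAL, delta)
    approximate = np.linalg.matrix_power(one_step, n_steps)
    exact = evolution(KINETIC + POTENTIAL, total_time)
    return float(np.linalg.norm(approximate - exact))


errors = [splitting_error(n) for n in (10, 100, 1000)]
```

Each step carries an error of order Δ² and there are N = T/Δ steps, so the accumulated error is of order Δ and disappears in the limit N → ∞, which is what justifies dropping the commutator factor.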
In the two factors, the exponential operator depends only on P or Q, and the matrix
elements are trivial to evaluate by using the fact that the operators are enclosed
between momentum and position eigenstates:
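\[
\begin{aligned}
\big\langle q_{i+1}\big|e^{-i\Delta\frac{P^2}{2m}}\big|p_i\big\rangle &= e^{-i\Delta\frac{p_i^2}{2m}}\;\big\langle q_{i+1}\big|p_i\big\rangle\; ,\\
\big\langle p_i\big|e^{-i\Delta V(Q)}\big|q_i\big\rangle &= e^{-i\Delta V(q_i)}\;\big\langle p_i\big|q_i\big\rangle\; .
\end{aligned}
\tag{2.11}
\]
Using now
\[
\big\langle q\big|p\big\rangle = e^{ipq}\; ,
\tag{2.12}
\]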
and a Q-dependent term. A proper treatment should use Weyl’s prescription for defining the quantum
Hamiltonian operator from the classical Hamiltonian. In eq. (2.13), one would obtain H(pi , ½ (qi + qi+1 ))
instead of H(pi , qi ).
If we define q̇i ≡ (qi+1 − qi )/∆ the slope of the line segments in the figure 2.1, and
we take the limit N → ∞, we may write the transition amplitude as a path integral:
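\[
\big\langle q_f\big|e^{-iH(t_f-t_i)}\big|q_i\big\rangle =
\int\limits_{\substack{q(t_i)=q_i\\ q(t_f)=q_f}} Dp(t)\,Dq(t)\;
\exp\Big\{i\int_{t_i}^{t_f}dt\;\Big[p(t)\dot{q}(t)-H\big(p(t),q(t)\big)\Big]\Big\}\; .
\tag{2.14}
\]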
One should be aware of the fact that the functional measure Dq(t)Dp(t) in general
lacks solid mathematical foundations, although it allows for some powerful manip-
ulations that would be extremely cumbersome to perform at the level of quantum
operators. Note that at the boundaries ti,f the position is well defined, and therefore
the momentum is not constrained (by the uncertainty principle). A crucial aspect
of eq. (2.14) is that all the objects that appear in the right hand side are ordinary
c-numbers that commute, while the left hand side is made of quantum operators and
states. In this section, we have started from the conventional formulation of transition
amplitudes in quantum mechanics, in order to arrive at the formula (2.14). However,
one may now “forget” the canonical formalism and view the path integral expression
of transition amplitudes as another way of going from a classical Hamiltonian H to a
quantized theory.
For a Hamiltonian where the P dependence has no powers higher than quadratic,
as in the example of eq. (2.1), it is possible to perform exactly the integral over p(t).
This type of integral is called a Gaussian path integral. Gaussian path integrals can
be evaluated in the same way as their ordinary counterparts, using the following
formulas,
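\[
\int_{-\infty}^{+\infty}dx\;e^{-x^2/(2\sigma)} = \sqrt{2\pi\sigma}\; ,\qquad
\int_{-\infty}^{+\infty}dx\;e^{\pm ix^2/(2\sigma)} = e^{\pm i\frac{\pi}{4}}\sqrt{2\pi\sigma}\; ,
\tag{2.15}
\]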
and treating each p(t) as an independent variable. In the present case, we need the
integral
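\[
\int dp\;e^{i\Delta\left(p\dot{q}-\frac{p^2}{2m}\right)} =
\underbrace{e^{-i\frac{\pi}{4}}\sqrt{\frac{2\pi m}{\Delta}}}_{\substack{\text{prefactor}\\ \text{independent of } q,\dot{q}}}\;
e^{i\Delta\frac{m\dot{q}^2}{2}}\; .
\tag{2.16}
\]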
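The oscillatory Gaussian formula of eq. (2.15), on which eq. (2.16) relies, can be checked numerically if a small damping is added to make the integral absolutely convergent. The sketch below integrates exp[(i/(2σ) − ε) x²] on a grid and compares it with the analytic result √(π/(ε − i/(2σ))); the damping ε, the grid parameters and the function names are assumptions introduced only for this check.

```python
import numpy as np


def damped_fresnel_numeric(sigma: float, eps: float,
                           half_width: float = 40.0,
                           n_points: int = 400001) -> complex:
    """Riemann sum of exp[(i/(2 sigma) - eps) x^2] over [-half_width, half_width]."""
    x = np.linspace(-half_width, half_width, n_points)
    dx = x[1] - x[0]
    integrand = np.exp((1j / (2.0 * sigma) - eps) * x**2)
    return complex(np.sum(integrand) * dx)


def damped_fresnel_exact(sigma: float, eps: float) -> complex:
    """sqrt(pi / a) with a = eps - i/(2 sigma), valid for eps > 0."""
    return complex(np.sqrt(np.pi / (eps - 1j / (2.0 * sigma))))


def fresnel_limit(sigma: float) -> complex:
    """The eps -> 0 value quoted in eq. (2.15): exp(i pi/4) sqrt(2 pi sigma)."""
    return complex(np.exp(1j * np.pi / 4.0) * np.sqrt(2.0 * np.pi * sigma))


numeric = damped_fresnel_numeric(sigma=1.0, eps=0.05)
exact = damped_fresnel_exact(sigma=1.0, eps=0.05)
near_limit = damped_fresnel_exact(sigma=1.0, eps=1e-9)
```

Setting σ = m/∆ and taking the complex conjugate gives the prefactor of eq. (2.16), after the square in p has been completed.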
The (infinite in the limit ∆ → 0) prefactors can be hidden in the measure Dq(t)
since they do not depend on the path, and we are therefore led to the following
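formula:
\[
\big\langle q_f\big|e^{-iH(t_f-t_i)}\big|q_i\big\rangle
= \int\limits_{\substack{q(t_i)=q_i\\ q(t_f)=q_f}} Dq(t)\;\exp\Big\{i\int_{t_i}^{t_f}dt\;L\big(q(t)\big)\Big\}
= \int\limits_{\substack{q(t_i)=q_i\\ q(t_f)=q_f}} Dq(t)\;e^{iS[q(t)]}\; ,
\tag{2.17}
\]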
(This can be guessed a posteriori based on the fact that h̄ has the dimension of an
action.) Because of the factor i inside the exponential, this integral is wildly oscillating,
except in the immediate vicinity of the function qc (t) that realizes the extremum
of the action. Note that this function is precisely the solution of the classical Euler-
Lagrange equations of motion. Roughly speaking, the phase oscillations become
significant when
S[q(t)] − S[qc (t)] ≥ 2π h̄ , (2.20)
and paths that fulfill this inequality do not contribute to the path integral. Therefore,
in the limit h̄ → 0, the path integral is dominated by the unique path qc (t), i.e. by the
classical trajectory of the system. The path integral formalism thus provides a very
intuitive way of connecting smoothly quantum and classical mechanics.
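The statement that the classical path extremizes the action can be illustrated on a discretized free particle. The sketch below assumes L = m q̇²/2 (no potential), a straight-line classical path between fixed endpoints, and a sine-shaped deformation that vanishes at both ends; all numerical values and names are arbitrary choices made for the illustration.

```python
import numpy as np


def discretized_action(path: np.ndarray, delta: float, mass: float = 1.0) -> float:
    """Sum over segments of (m/2) * (slope)^2 * delta, for a free particle."""
    velocities = np.diff(path) / delta
    return float(np.sum(0.5 * mass * velocities**2) * delta)


N_STEPS = 1000
TOTAL_TIME = 2.0
DELTA = TOTAL_TIME / N_STEPS
times = np.linspace(0.0, TOTAL_TIME, N_STEPS + 1)

# Classical trajectory: straight line from q_i = 0.5 to q_f = 2.0.
classical_path = 0.5 + 1.5 * times / TOTAL_TIME
# Deformation that leaves the endpoints fixed.
deformation = np.sin(np.pi * times / TOTAL_TIME)


def action_excess(amplitude: float) -> float:
    """S[q_c + amplitude * deformation] - S[q_c]."""
    deformed = classical_path + amplitude * deformation
    return discretized_action(deformed, DELTA) - discretized_action(classical_path, DELTA)


excess_small = action_excess(0.1)
excess_large = action_excess(0.2)
```

The excess is quadratic in the amplitude (doubling the deformation multiplies it by four), which is the discrete counterpart of the vanishing first-order variation of S at the classical trajectory.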
Figure 2.2: Illustration of eq. (2.19). The paths whose action is far apart from
the classical extremum are plotted in fainter colours. The solid black line is the
classical trajectory.
that measures the expectation value of the position at the time t1 . In order to evaluate
this object, we need to insert on either side of the position operator Q an identity
operator written as a complete sum over position eigenstates, i.e.
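\[
Q \;\to\; \int dq\,dq'\;\big|q\big\rangle\underbrace{\big\langle q\big|Q\big|q'\big\rangle}_{q\,\delta(q-q')}\big\langle q'\big|
= \int dq\;q\;\big|q\big\rangle\big\langle q\big|\; .
\tag{2.22}
\]
\[
\big|q,t\big\rangle \equiv e^{iHt}\,\big|q\big\rangle\; ,
\tag{2.26}
\]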
(2.27)
The condition t2 > t1 is crucial here, because the left hand side would be quite
different if the times are ordered differently. In contrast, the objects q(t1 ) and q(t2 )
in the right hand side are ordinary numbers that commute. One may render this
formula true for any ordering between t1 and t2 by introducing a T-product, that
ensures that the operator with the largest time is always on the left:
\[
\big\langle q_f,t_f\big|\,T\,Q(t_1)Q(t_2)\,\big|q_i,t_i\big\rangle =
\int\limits_{\substack{q(t_i)=q_i\\ q(t_f)=q_f}} Dq(t)\;q(t_1)\,q(t_2)\;e^{iS[q(t)]}\; .
\tag{2.28}
\]
(2.29)
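\[
\frac{\partial}{\partial t_1}\big\langle q_f,t_f\big|\,T\,Q(t_1)\cdots Q(t_n)\,\big|q_i,t_i\big\rangle =
\int\limits_{\substack{q(t_i)=q_i\\ q(t_f)=q_f}} Dq(t)\;\dot{q}(t_1)\cdots q(t_n)\;e^{iS[q(t)]}\; .
\tag{2.30}
\]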
In other words, a time derivative in the integrand of the path integral also applies to
the step functions that enforce the time ordering in the left hand side.
where j(t) is some arbitrary function of time. From Zfi [j], the amplitudes can be
recovered by functional differentiation:
\[
\big\langle q_f,t_f\big|\,T\,Q(t_1)\cdots Q(t_n)\,\big|q_i,t_i\big\rangle =
\left.\frac{\delta^n Z_{fi}[j]}{i^n\,\delta j(t_1)\cdots\delta j(t_n)}\right|_{j\equiv 0}\; .
\tag{2.32}
\]
Functional derivatives obey the usual rules of differentiation, with the additional
property that the values of the function j(t) at different times should be viewed as
independent variables, i.e.
\[
\frac{\delta j(t)}{\delta j(t')} = \delta(t-t')\; .
\tag{2.33}
\]
From this formula, one may also read the dimension of a functional derivative:
\[
{\rm dim}\Big[\frac{\delta}{\delta j(t)}\Big] = -{\rm dim}\big[j(t)\big] - {\rm dim}\big[t\big]\; .
\tag{2.34}
\]
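A discretized version makes both the rule (2.33) and the dimension counting (2.34) concrete: on a time grid of spacing dt, a functional derivative is an ordinary partial derivative with respect to the value of j at one grid point, divided by dt. The sketch below assumes this lattice definition; the test functional F[j] = ∫dt j(t)², the grid and the helper names are illustrative choices.

```python
import numpy as np


def quadratic_functional(j: np.ndarray, dt: float) -> float:
    """F[j] = integral dt j(t)^2, as a Riemann sum on the grid."""
    return float(np.sum(j**2) * dt)


def functional_derivative(func, j: np.ndarray, k: int, dt: float,
                          eps: float = 1e-4) -> float:
    """Lattice delta F / delta j(t_k): central difference in j_k, divided by dt."""
    bump = np.zeros_like(j)
    bump[k] = eps
    return (func(j + bump, dt) - func(j - bump, dt)) / (2.0 * eps * dt)


DT = 0.01
grid = np.arange(0.0, 10.0, DT)
source = np.sin(grid)

# delta F / delta j(t) = 2 j(t) for the quadratic functional.
k = 123
derivative_of_f = functional_derivative(quadratic_functional, source, k, DT)

# delta j(t_m) / delta j(t_k) is 1/dt if k == m and 0 otherwise:
# the lattice form of delta(t - t').
m = 200
same_point = functional_derivative(lambda j, dt: float(j[m]), source, m, DT)
other_point = functional_derivative(lambda j, dt: float(j[m]), source, m + 1, DT)
```

The explicit factor 1/dt is the origin of the term −dim t in eq. (2.34).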
From eq. (2.29), we can derive an expression of the generating functional Zfi as a
path integral,
\[
Z_{fi}[j(t)] = \int\limits_{\substack{q(t_i)=q_i\\ q(t_f)=q_f}} Dq(t)\;
e^{iS[q(t)]+i\int_{t_i}^{t_f}dt\;j(t)q(t)}\; ,
\tag{2.35}
\]
that involves only the commuting c-number q(t) and no time-ordering. Note also
that there is a Hamiltonian version of this path integral:
\[
Z_{fi}[j(t)] = \int\limits_{\substack{q(t_i)=q_i\\ q(t_f)=q_f}} Dp(t)\,Dq(t)\;
\exp\Big\{i\int_{t_i}^{t_f}dt\;\Big[p(t)\dot{q}(t)-H\big(p(t),q(t)\big)+j(t)q(t)\Big]\Big\}\; .
\tag{2.36}
\]
situation. Let us assume for instance that the system is in a state |ψi⟩ at the time ti
and in the state |ψf⟩ at the time tf . For any operator O, the expectation value between
these two states can be related to transitions between position eigenstates by writing
\[
\big\langle\psi_f,t_f\big|O\big|\psi_i,t_i\big\rangle = \int dq_i\,dq_f\;\psi_f^*(q_f)\,\psi_i(q_i)\;
\big\langle q_f,t_f\big|O\big|q_i,t_i\big\rangle\; ,
\tag{2.37}
\]
where
\[
\psi(q) \equiv \big\langle q\big|\psi\big\rangle
\tag{2.38}
\]
is the position representation of the wavefunction of the state |ψ⟩. However, the use
of this formula is cumbersome in practice, because of the integrations over qi,f .
In the special case where the initial and final states are the ground state of the
Hamiltonian, |0⟩, and the initial and final times are −∞ and +∞, there is a trick to
circumvent this difficulty. Let us introduce the eigenstates |n⟩ of the Hamiltonian,
with eigenvalue En and eigenfunction ψn (q) ≡ ⟨q|n⟩, and write
\[
\begin{aligned}
\big|q_i,t_i\big\rangle &= e^{iHt_i}\,\big|q_i\big\rangle\\
&= \sum_{n=0}^{\infty} e^{iHt_i}\,\big|n\big\rangle\big\langle n\big|q_i\big\rangle\\
&= \sum_{n=0}^{\infty}\psi_n^*(q_i)\;e^{iE_nt_i}\,\big|n\big\rangle\; .
\end{aligned}
\tag{2.39}
\]
We will assume that the Hamiltonian is shifted by a constant so that the energy of the
ground state |0⟩ is E0 = 0. Now, we multiply the Hamiltonian by 1 − i0+ , where
0+ denotes some positive infinitesimal number. All the factors exp(i(1 − i0+ )En ti )
go to zero when ti → −∞, except for n = 0. Therefore, after this alteration of the
Hamiltonian, we have:
i.e.
\[
\big|0\big\rangle = \lim_{t_i\to-\infty}\frac{1}{\big\langle 0\big|\varphi\big\rangle}\int dq_i\;\varphi(q_i)\;\big|q_i,t_i\big\rangle\; .
\tag{2.42}
\]
Any function ϕ(q) such that the state |ϕ⟩ has a non-zero overlap with the ground
state |0⟩ is appropriate in this role, but the simplest expressions are obtained with
the constant function ϕ(q) = 1, corresponding to the momentum eigenstate |p = 0⟩.
Likewise, changing H → (1 − i0+ )H has a similar effect on the final state in the
limit tf → +∞,
\[
\lim_{t_f\to+\infty}\big\langle q_f,t_f\big| = \psi_0(q_f)\,\big\langle 0\big|\; .
\tag{2.43}
\]
From these considerations, when the initial and final states at ±∞ are the ground
state, we can write the generating functional in the following simple path integral
form:
\[
Z[j(t)] = \int Dp(t)\,Dq(t)\;
\exp\Big\{i\int dt\;\Big[p(t)\dot{q}(t)-(1-i0^+)\,H\big(p(t),q(t)\big)+j(t)q(t)\Big]\Big\}\; .
\tag{2.44}
\]
From the discussion after eq. (2.42), we see that the boundary conditions on the paths
are not important. They only affect an overall prefactor, that can be adjusted by hand
in such a way that Z[0] = 1. After performing the Gaussian functional integral over
p(t), we can rewrite this expression in Lagrangian form:
\[
Z[j(t)] = \int Dq(t)\;
\exp\Big\{i\int dt\;\Big[(1+i0^+)\,\frac{m\dot{q}^2(t)}{2}-(1-i0^+)\,V\big(q(t)\big)+j(t)q(t)\Big]\Big\}\; .
\tag{2.45}
\]
The term in (i0+ )q̇2 may be viewed as contributing to the convergence of the integral
at large velocities. Likewise, for a confining potential such that V(q) → +∞ when
q → ∞, the term in (i0+ )V(q) contributes to the convergence at large coordinates.
In other words, the Fourier conjugate of the “variable” q(t) is another function of
time, p(t). Eq. (2.46) may be inverted by
\[
F[q(t)] \equiv \int Dp(t)\;\widetilde{F}[p(t)]\;\exp\Big\{-i\int dt\;p(t)q(t)\Big\}\; .
\tag{2.47}
\]
The usual properties of ordinary Fourier transforms extend to the functional case, e.g.:
The proof of this formula consists in noticing that A[j, q; λ = 0] = B[j, q; λ = 0], and
in comparing their (ordinary) derivatives with respect to λ:
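\[
\begin{aligned}
\partial_\lambda A[j,q;\lambda] &= \lambda\int dt\;\Big(\frac{\delta}{\delta j(t)}\Big)^n A[j,q;\lambda]
= \lambda\int dt\;q^n(t)\;A[j,q;\lambda]\; ,\\
\partial_\lambda B[j,q;\lambda] &= \lambda\int dt\;q^n(t)\;B[j,q;\lambda]\; .
\end{aligned}
\tag{2.50}
\]
Therefore A[j, q; λ] and B[j, q; λ] are equal at λ = 0 and obey the same differential
equation.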
q(t) ←→ φ(x)
p(t) ←→ Π(x)
j(t) ←→ j(x)
(2.55)
The main results of the previous section, namely that time-ordered products of
operators in the canonical formalism become simple products of ordinary functions
in the path integral representation, and that the ground state at ±∞ can be obtained
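\[
\mathcal{H} = \frac{1}{2}\Pi^2 + \frac{1}{2}(\nabla\phi)\cdot(\nabla\phi) + \frac{1}{2}m^2\phi^2 + V(\phi)\; ,
\tag{2.57}
\]
it is easy to perform the (Gaussian) functional integration on Π, to obtain: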
it is easy to perform the (Gaussian) functional integration on Π, to obtain:
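\[
Z[j] = \int D\phi(x)\;\exp\Big\{i\int d^4x\;\Big[\mathcal{L}(\phi)+j(x)\phi(x)\Big]\Big\}\; ,
\tag{2.58}
\]
where
\[
\mathcal{L}(\phi) \equiv \frac{1}{2}(1+i0^+)\,\dot{\phi}^2 - \frac{1}{2}(1-i0^+)\Big[(\nabla\phi)\cdot(\nabla\phi)+m^2\phi^2\Big] - (1-i0^+)\,V(\phi)\; .
\tag{2.59}
\]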
Note that the 1 − i0+ in front of the interaction potential plays no role if we turn off
adiabatically the coupling constant when |x0 | → ∞. Using the analogue of eq. (2.49),
we can separate the interactions as follows
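\[
Z[j] = \exp\Big[-i\int d^4x\;V\Big(\frac{\delta}{i\delta j(x)}\Big)\Big]\;Z_0[j]\; ,
\tag{2.60}
\]
with
\[
\begin{aligned}
Z_0[j] &\equiv \int D\phi(x)\;\exp\Big\{i\int d^4x\;\Big[\mathcal{L}_0(\phi)+j(x)\phi(x)\Big]\Big\}\; ,\\
\mathcal{L}_0(\phi) &= \frac{1}{2}(1+i0^+)\,\dot{\phi}^2 - \frac{1}{2}(1-i0^+)\Big[(\nabla\phi)\cdot(\nabla\phi)+m^2\phi^2\Big]\; .
\end{aligned}
\tag{2.61}
\]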
The functional integral that gives Z0 [j] in eq. (2.61) is Gaussian in φ and can be
performed in a straightforward manner, giving
\[
Z_0[j] = \exp\Big\{-\frac{1}{2}\int d^4x\,d^4y\;j(x)j(y)\;G^0_F(x,y)\Big\}\; ,
\tag{2.62}
\]
Note that the terms in i0+ ensure the existence of this inverse. Going to momentum
space, we see that the Fourier transform of this inverse is
\[
\frac{i}{(1+i0^+)\,k_0^2-(1-i0^+)\,(\boldsymbol{k}^2+m^2)}\; ,
\tag{2.64}
\]
which after some rearrangement of the i0+ ’s appears to be nothing but eq. (1.123).
Although the canonical quantization of a scalar field theory was tractable, we see in
this example that the path integral approach provides a much quicker way of obtaining
the expression of the free generating functional, with the correct pole prescription for
the free Feynman propagator.
This integral can be calculated by representing the vector x in the orthonormal basis
made of the eigenvectors of A (such a basis exists, since A is symmetric). The
measure ∏i dxi is unchanged, because the diagonalization of the matrix can be done
by an orthogonal transformation. Therefore, the above integral also reads
\[
I(A) = \int\prod_{i=1}^n dy_i\;e^{-\frac{1}{2}\sum_i a_i y_i^2} = \prod_{i=1}^n\sqrt{\frac{2\pi}{a_i}}\; ,
\tag{2.67}
\]
where the numbers ai are the eigenvalues of A. This result can be written in a much
more compact form:
\[
I(A) = \frac{(2\pi)^{n/2}}{\sqrt{\det A}}\; .
\tag{2.68}
\]
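Eq. (2.68) is easy to verify by brute force in two dimensions. The sketch below sums exp(−½ xᵀAx) on a square grid for a symmetric positive-definite 2×2 matrix and compares the result with (2π)^{n/2}/√det A; the matrix entries and the grid parameters are arbitrary illustrative choices.

```python
import numpy as np


def gaussian_integral_numeric(matrix: np.ndarray,
                              half_width: float = 12.0,
                              n_points: int = 601) -> float:
    """Grid sum of exp(-x^T A x / 2) over a square, for a symmetric 2x2 matrix A."""
    axis = np.linspace(-half_width, half_width, n_points)
    dx = axis[1] - axis[0]
    x1, x2 = np.meshgrid(axis, axis, indexing="ij")
    quadratic_form = (
        matrix[0, 0] * x1**2 + 2.0 * matrix[0, 1] * x1 * x2 + matrix[1, 1] * x2**2
    )
    return float(np.sum(np.exp(-0.5 * quadratic_form)) * dx**2)


def gaussian_integral_formula(matrix: np.ndarray) -> float:
    """(2 pi)^(n/2) / sqrt(det A), as in eq. (2.68)."""
    n = matrix.shape[0]
    return float((2.0 * np.pi) ** (n / 2.0) / np.sqrt(np.linalg.det(matrix)))


SYMMETRIC_MATRIX = np.array([[2.0, 0.5], [0.5, 1.0]])
numeric_value = gaussian_integral_numeric(SYMMETRIC_MATRIX)
formula_value = gaussian_integral_formula(SYMMETRIC_MATRIX)
```

The off-diagonal entry is kept non-zero on purpose, so that the check exercises the diagonalization argument rather than a product of one-dimensional integrals.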
This reasoning can be generalized to the functional case by writing:
\[
\int D\phi(x)\;\exp\Big\{-\frac{1}{2}\int d^4x\,d^4y\;\phi(x)\,A(x,y)\,\phi(y)\Big\} = \Big[\det(A)\Big]^{-1/2}\; ,
\tag{2.69}
\]
Zeta function regularization : Despite the elegance of this formula, one should
keep in mind that the functional determinant det A is most often infinite, because the
spectrum of the operator extends to infinity. A common regularization technique for
functional determinants is based on a generalization of Riemann’s ζ function. Let the
λn be the eigenvalues of A, and define:
\[
\zeta_A(s) \equiv {\rm tr}\,A^{-s} = \sum_n\frac{1}{\lambda_n^s}\; .
\tag{2.72}
\]
(The function ζA is called the zeta function of the operator A.) The determinant of A
is related to this function by
\[
\det A = \exp\big(-\zeta_A'(0)\big)\; .
\tag{2.73}
\]
The sum over n in the definition of ζA usually converges only if Re (s) is large
enough (how large depends on the distribution of eigenvalues at large n), but not for
s = 0. However, like in the case of Riemann’s zeta function, ζA (s) can be analytically
continued to most of the complex s-plane, which provides a regularized definition of
the determinant.
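For a finite matrix, where no regularization is needed, the relation (2.73) can be checked directly: ζ_A′(0) = −Σ ln λn, so exp(−ζ_A′(0)) is the product of the eigenvalues. The sketch below evaluates the derivative by a central finite difference; the matrix and the step size are illustrative assumptions, and the analytic continuation required in the genuinely infinite-dimensional case is not attempted here.

```python
import numpy as np


def zeta(eigenvalues: np.ndarray, s: float) -> float:
    """zeta_A(s) = sum over eigenvalues of lambda^(-s), for positive eigenvalues."""
    return float(np.sum(eigenvalues ** (-s)))


def zeta_determinant(eigenvalues: np.ndarray, step: float = 1e-5) -> float:
    """exp(-zeta_A'(0)), with the derivative taken by central finite difference."""
    derivative_at_zero = (zeta(eigenvalues, step) - zeta(eigenvalues, -step)) / (2.0 * step)
    return float(np.exp(-derivative_at_zero))


POSITIVE_MATRIX = np.array([[2.0, 0.5], [0.5, 1.0]])
spectrum = np.linalg.eigvalsh(POSITIVE_MATRIX)

det_from_zeta = zeta_determinant(spectrum)
det_direct = float(np.linalg.det(POSITIVE_MATRIX))
```

In the functional case the sum defining ζ_A only converges for Re(s) large enough, and it is the continued function, not the sum itself, that must be differentiated at s = 0.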
The argument of the exponential has a simple interpretation as a 1-loop diagram made
of a line dressed with insertions of the background field, the index n being the number
of such insertions:
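\[
\frac{1}{n}\,{\rm Tr}\Big[\Big(-i\frac{\lambda}{2}\varphi^2\,G^0_F\Big)^n\Big] =
\underbrace{\big[\text{1-loop diagram}\big]}_{n\ \text{insertions}}\; .
\tag{2.79}
\]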
Each of the insertions of the background field (shown by lines terminated by a dot
in the above diagram) corresponds to a factor −i (λ/2) ϕ² . The prefactor 1/n is the
symmetry factor for the cyclic permutations of the n insertions. The argument of the
exponential is a sum of connected 1-loop diagrams. Taking the exponential to obtain
the ratio R simply produces all the multiply connected graphs made of products of
such 1-loop diagrams.
Γ2 (x1 , x2 ) is therefore the inverse of the exact propagator, Γ4 (x1 , · · · , x4 ) is the exact
4-point function (in coordinate space), etc...
Let us replace the classical action S[φ] by the quantum effective action Γ [φ] in the
previous formula, to define
\[
Z_\Gamma[j] = e^{W_\Gamma[j]} = \int D\phi(x)\;\exp\Big[i\,\Gamma[\phi(x)] + i\int d^4x\;j(x)\phi(x)\Big]\; .
\tag{2.82}
\]
This functional generates graphs whose building blocks are the exact propagator
(Γ2⁻¹ ), and the exact vertices (Γ3 , Γ4 , · · · ). From the definition of Γ [φ] as the “action”
that would generate the exact theory at tree level, we conclude that
WΓ [j]|tree = W[j] . (2.83)
In other words, the tree diagrams of WΓ [j] should be equal to the all-orders W[j]. The
tree diagrams may be isolated by reintroducing Planck’s constant in the definition of
ZΓ [j] as follows
\[
Z_\Gamma[j;\hbar] = e^{W_\Gamma[j;\hbar]} = \int D\phi(x)\;\exp\Big[\frac{i}{\hbar}\Big(\Gamma[\phi(x)] + \int d^4x\;j(x)\phi(x)\Big)\Big]\; .
\tag{2.84}
\]
\[
\hbar^{\,n_L-1}\; ,
\tag{2.85}
\]
where nL is the number of loops of the graph. Therefore, the functional WΓ [j; h̄] has
the following loop expansion:
\[
W_\Gamma[j;\hbar] = \sum_{n_L=0}^{\infty}\hbar^{\,n_L-1}\;\underbrace{W_{\Gamma,n_L}[j]}_{n_L\ \text{loops}}\; ,
\tag{2.86}
\]
and the tree level contributions in WΓ [j] are the terms that survive in the formal limit
h̄ → 0:
But from our discussion of the classical limit of path integrals in section 2.2, we know
that the limit h̄ → 0 corresponds to the extremum of the argument of the exponential,
i.e.
\[
\frac{\delta\Gamma[\phi]}{\delta\phi(x)} + j(x) = 0\; .
\tag{2.88}
\]
Note that this equation is the analogue of the usual Euler-Lagrange equation of
motion, with the quantum effective action in place of the classical action. This
equation implicitly defines φ as a function of j, that we will denote φj , in terms of
which we can write
\[
e^{W_\Gamma[j;\hbar]} \underset{\hbar\to 0}{\approx} \exp\Big[\frac{i}{\hbar}\Big(\Gamma[\phi_j(x)] + \int d^4x\;j(x)\phi_j(x)\Big)\Big]\; ,
\tag{2.89}
\]
which leads to the following relationship between the quantum effective action and
the generating functional of connected graphs:
\[
\Gamma[\phi_j] = -i\,W[j] - \int d^4x\;j(x)\phi_j(x)\; .
\tag{2.90}
\]
Therefore, Γ [φ] can be obtained as the Legendre transform of the generating functional
W[j] of the connected graphs.
Note that the “quantum equation of motion” (2.88) may also be viewed as defining
j in terms of φ, that we shall denote jφ . Eq. (2.90) may therefore also be written as
\[
\Gamma[\phi] = -i\,W[j_\phi] - \int d^4x\;j_\phi(x)\phi(x)\; .
\tag{2.91}
\]
Taking a functional derivative of this equation with respect to φ(y) and using the
chain rule, we obtain
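\[
\underbrace{\frac{\delta\Gamma[\phi]}{\delta\phi(y)}}_{-j_\phi(y)} =
-i\int d^4x\;\left.\frac{\delta W[j]}{\delta j(x)}\right|_{j=j_\phi}\frac{\delta j_\phi(x)}{\delta\phi(y)}
- j_\phi(y) - \int d^4x\;\frac{\delta j_\phi(x)}{\delta\phi(y)}\,\phi(x)\; .
\tag{2.92}
\]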
This leads to
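\[
\phi(x) = -i\left.\frac{\delta W[j]}{\delta j(x)}\right|_{j=j_\phi}\; ,\quad\text{or equivalently}\quad
\phi_j(x) = -i\,\frac{\delta W[j]}{\delta j(x)} = \big\langle\phi(x)\big\rangle_j\; .
\tag{2.93}
\]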
In other words, φj is the connected 1-point function (i.e. the vacuum expectation
value of the field) in the presence of the source j.
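The chain (2.88), (2.90), (2.93) can be followed explicitly in a zero-dimensional toy model, where the "field" is a single number φ. The sketch below assumes the Gaussian action S(φ) = −aφ²/2, for which completing the square gives W(j) = i j²/(2a) up to a j-independent constant (the convergence factor i0+ is left implicit); the function names and numerical values are hypothetical.

```python
def connected_functional(j: float, a: float) -> complex:
    """W(j) = i j^2 / (2a) for the toy action S(phi) = -a phi^2 / 2 (constant dropped)."""
    return 1j * j**2 / (2.0 * a)


def mean_field(j: float, a: float, step: float = 1e-6) -> complex:
    """phi_j = -i dW/dj, the analogue of eq. (2.93), by central finite difference."""
    derivative = (
        connected_functional(j + step, a) - connected_functional(j - step, a)
    ) / (2.0 * step)
    return -1j * derivative


def effective_action(j: float, a: float) -> complex:
    """Gamma(phi_j) = -i W(j) - j phi_j, the analogue of eq. (2.90)."""
    return -1j * connected_functional(j, a) - j * mean_field(j, a)


def classical_action(phi: complex, a: float) -> complex:
    """S(phi) = -a phi^2 / 2."""
    return -0.5 * a * phi**2


A_COEFF = 2.0
J_VALUE = 0.6
phi_j = mean_field(J_VALUE, A_COEFF)          # expected to equal j / a
gamma = effective_action(J_VALUE, A_COEFF)    # expected to equal S(phi_j)
equation_of_motion = -A_COEFF * phi_j + J_VALUE  # dGamma/dphi + j for Gamma = S
```

For a Gaussian action there are no loop corrections beyond a constant, so the Legendre transform returns the classical action itself and the quantum equation of motion reduces to the classical one.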
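\[
\begin{aligned}
\delta(x-y) &= -\frac{\delta}{\delta j(y)}\,\frac{\delta\Gamma[\phi_j]}{\delta\phi_j(x)}\\
&= -\int d^4z\;\frac{\delta\phi_j(z)}{\delta j(y)}\;\frac{\delta^2\Gamma[\phi_j]}{\delta\phi_j(x)\,\delta\phi_j(z)}\\
&= i\int d^4z\;\underbrace{\frac{\delta^2 W[j]}{\delta j(y)\,\delta j(z)}}_{G(y,z)_{\rm connected}}\;
\underbrace{\frac{\delta^2\Gamma[\phi_j]}{\delta\phi_j(z)\,\delta\phi_j(x)}}_{\Gamma_2(z,x)}\; .
\end{aligned}
\tag{2.94}
\]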
This formula shows a posteriori that (up to a factor i) the coefficient Γ2 in the expansion
(2.80) is indeed the inverse of the exact connected 2-point function, as was expected
from our request that the effective action Γ [φ] reproduces the full content of the
theory.
By parameterizing the inverse propagator in terms of a self-energy Σ as follows,
\[
G^{-1} = G_0^{-1} + i\,\Sigma\; ,
\tag{2.95}
\]
we see that the second derivative of the quantum effective action is nothing but the
self-energy. An important class of diagrams in this discussion are the one-particle
irreducible (1PI) diagrams, that are those that remain connected if one cuts any one
of their internal propagators. For instance, the first of these diagrams is 1PI while the
second one is not:
1PI diagram :
Non-1PI diagram :
The concept of 1PI diagrams is crucial in the summation of a self-energy to all orders.
Indeed, repeated insertions of a non-1PI self-energy would lead to the erroneous
multiple countings of identical graphs. To avoid this, a self-energy should only
contain 1PI graphs, and we conclude that the second derivative of the quantum
effective action is one-particle irreducible.
The quantum effective action Γ [φ] is in fact one-particle irreducible at all orders in φ,
not just at quadratic order in φ as the above argument suggests. By exponentiating
eq. (2.91) and using the path integral definition of exp(W[j]), we first obtain
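\[
\begin{aligned}
e^{i\,\Gamma[\phi]} &= \left.\int D\varphi\;\exp i\Big\{S[\varphi] + \int d^4x\;j(x)\big(\varphi(x)-\phi(x)\big)\Big\}\right|_{j=j_\phi}\\
&= \left.\int D\varphi\;\exp i\Big\{S[\varphi+\phi] + \int d^4x\;j(x)\varphi(x)\Big\}\right|_{j=j_\phi}\; .
\end{aligned}
\tag{2.96}
\]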
(In the second line, we have shifted by φ the integration variable ϕ.) Thus, the
quantum effective action can be obtained from a shifted classical action, to which is
added a source jφ that implicitly depends on φ via the quantum equation of motion
(2.88). The expansion of the shifted classical action S[ϕ + φ] leads to a number of
vertices, some of which are φ-dependent. Thus, Γ [φ] is the sum of the connected
(because we must take the logarithm in order to extract Γ [φ]) vacuum graphs built
with these φ-dependent vertices and the φ-dependent source jφ . To every line of
such a graph is associated a free propagator G0 , determined from the quadratic term
in the action.
A very important property is the fact that the expectation value of ϕ(x) with this
Note that in order to obtain the final zero, it is crucial that j be set to jφ at the end. Let
us now consider a one-particle reducible vacuum graph G that may possibly contribute
to Γ [φ]. Because it is reducible, this graph contains at least one bare propagator that
connects two subgraphs A and B, such that the two subgraphs become disconnected
when removing this propagator,
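\[
\mathcal{G}_{AB} \equiv \int_{x,y}\;\big[A\big]\;\stackrel{x\qquad y}{\bullet\!\!-\!\!\!-\!\!\!-\!\!\!-\!\!\bullet}\;\big[B\big]\; .
\tag{2.98}
\]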
thanks to the previous result on the expectation value of ϕ. Therefore, the one-particle
reducible graphs do not contribute to Γ [φ], which generalizes to all orders in φ what
we had already seen for the quadratic terms.
both depending on the renormalized field φr . We will denote Sr and ∆S the corresponding
actions. Likewise, we write the external source j = jr + ∆j, where jr is the
current that solves the following equation:
\[
\left.\frac{\delta S_r[\phi_r]}{\delta\phi_r(x)}\right|_{\varphi} + j_r(x) = 0\; ,
\tag{2.101}
\]
i.e. the current that solves at lowest order the defining equation of the effective action.
The correction ∆j is then adjusted order by order so that the expectation value of the
field remains equal to ϕ at all orders,
In the path integral representation of the generating functional Z[j], we write the field
as φr = ϕ + η:
\[
Z[j] = \int D\eta(x)\;e^{\,i\left\{S_r[\varphi+\eta]+\Delta S[\varphi+\eta]+\int d^4x\;(j_r+\Delta j)(\varphi+\eta)\right\}}\; ,
\tag{2.103}
\]
Note that the term linear in η is zero by virtue of eq. (2.101). Therefore, we may
rewrite Z[j] as follows
$$ Z[j] = e^{\,i\left\{S_r[\phi]+\Delta S[\phi]+\int d^4x\, j\phi\right\}} \int D\eta(x)\; e^{\,i\left\{S_\phi[\eta]+\Delta S_\phi[\eta]\right\}} \; , \tag{2.105} $$
where we denote
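$$ S_\phi[\eta] \equiv \frac{1}{2}\int d^4x\,d^4y\;\eta(x)\,\eta(y)\,\left.\frac{\delta^2 S_r[\varphi_r]}{\delta\varphi_r(x)\,\delta\varphi_r(y)}\right|_{\phi} + \cdots \tag{2.106} $$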
(Likewise, ∆Sϕ [η] results from the expansion in powers of η of the counter-terms.)
At one loop, it is sufficient to keep only the quadratic terms in η, and the path integral
gives a determinant:
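$$ \left[\det\left(-\frac{i}{2}\left.\frac{\delta^2 S_r[\varphi_r]}{\delta\varphi_r(x)\,\delta\varphi_r(y)}\right|_{\phi}\right)\right]^{-1/2} = \exp\left\{-\frac{1}{2}\,\mathrm{tr}\,\ln\left(-\frac{i}{2}\left.\frac{\delta^2 S_r[\varphi_r]}{\delta\varphi_r(x)\,\delta\varphi_r(y)}\right|_{\phi}\right)\right\} . \tag{2.107} $$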
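$$ \Gamma[\phi] = S_r[\phi] + \Delta S[\phi] + \frac{i}{2}\,\mathrm{tr}\,\ln\left(\left.\frac{\delta^2 S_r[\varphi_r]}{\delta\varphi_r(x)\,\delta\varphi_r(y)}\right|_{\phi}\right) + \cdots \tag{2.109} $$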
Note that the object inside the logarithm is the inverse of the propagator dressed by
the background field ϕ.
The quantum effective action Γ [φ] studied in the previous section can be extended
into a functional Γ [φ, G] that depends on a field φ and a propagator G. The starting
point of this derivation is to introduce a second source k(x, y) that couples to a pair
of fields ϕ(x)ϕ(y). The corresponding generating functional W[j, k] for connected
graphs is given by
$$ e^{W[j,k]} = \int D\phi\;\exp\left\{ i\left[ S[\phi] + \int_x j(x)\phi(x) + \frac{1}{2}\int_{x,y} k(x,y)\,\phi(x)\phi(y) \right]\right\} . \tag{2.110} $$
In terms of graphs, W[j, k] is the sum of the connected vacuum graphs built with the
bare propagator and the vertices defined by the classical action S[ϕ], with the external
source j, and with a kind of non-local mass term k(x, y). Let us denote
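$$ \frac{\delta W[j,k]}{i\,\delta j(x)} \equiv \varphi_{j,k}(x) \; , \qquad \frac{\delta W[j,k]}{i\,\delta k(x,y)} \equiv \frac{1}{2}\Big[\varphi_{j,k}(x)\,\varphi_{j,k}(y) + G_{j,k}(x,y)\Big] . \tag{2.111} $$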
In the second equation, we have separated a disconnected part φj,k (x)φj,k (y) and a
connected two-point function Gj,k (x, y). Both the field φj,k and the propagator Gj,k
depend on the sources, which we indicate by the subscript j, k. Conversely, we may
formally invert these equations to define φ, G dependent sources, jφ,G and kφ,G .
3 We have dropped a factor −i/2 inside the tr ln, since it only produces an additive constant.
Then, the Legendre transform that defined Γ [φ] from W[j] can be generalized into
$$ \Gamma[\varphi,G] = -i\,W[j_{\varphi,G},k_{\varphi,G}] - \int d^4x\; j_{\varphi,G}(x)\,\varphi(x) - \frac{1}{2}\int d^4x\,d^4y\; k_{\varphi,G}(x,y)\,\Big[\varphi(x)\varphi(y) + G(x,y)\Big] . \tag{2.112} $$
By taking derivatives with respect to φ(x) or G(x, y), we obtain the following
equations:
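$$ \frac{\delta\Gamma[\varphi,G]}{\delta\varphi(x)} + j_{\varphi,G}(x) + \int d^4y\; k_{\varphi,G}(x,y)\,\varphi(y) = 0 \; , \qquad \frac{\delta\Gamma[\varphi,G]}{\delta G(x,y)} + \frac{1}{2}\,k_{\varphi,G}(x,y) = 0 \; . \tag{2.113} $$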
Note that the first of these equations generalizes the quantum equation of motion
(2.88) with the addition of a self-energy kφ,G (x, y).
From the Legendre transform of eq. (2.112), we obtain the following path integral
representation of the functional Γ [φ, G]:
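$$ e^{i\Gamma[\varphi,G]} = \int D\phi\;\exp\left\{ i\left[ S[\phi] + \int_x j(x)\big(\phi(x)-\varphi(x)\big) + \frac{1}{2}\int_{x,y} k(x,y)\,\big(\phi(x)\phi(y) - \varphi(x)\varphi(y) - G(x,y)\big) \right]\right\} $$
$$ \phantom{e^{i\Gamma[\varphi,G]}} = \int D\phi\;\exp\left\{ i\left[ S[\phi+\varphi] + \int_x j(x)\phi(x) + \frac{1}{2}\int_{x,y} k(x,y)\,\big(\phi(x)\phi(y) + 2\phi(x)\varphi(y) - G(x,y)\big) \right]\right\} , \tag{2.114} $$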
where we have omitted the subscript φ, G on the sources j, k for the sake of brevity.
From the second equation, we first obtain
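$$ \big\langle\phi(x)\big\rangle = \int D\phi\;\phi(x)\,\exp\left\{ i\left[ S[\phi+\varphi] + \int \Big( j\phi + \frac{k}{2}\big(\phi\phi + 2\phi\varphi - G\big)\Big)\right]\right\} = e^{i\Gamma[\varphi,G]}\left[\frac{\delta W[j,k]}{i\,\delta j(x)} - \varphi(x)\right]_{j=j_{\varphi,G},\;k=k_{\varphi,G}} = 0 \; . \tag{2.115} $$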
Like in the case of the 1PI functional Γ [φ], this identity ensures that the one-particle
reducible graphs do not contribute to Γ [φ, G]. But as we shall see now, the functional
Γ [φ, G] is limited to a much more restricted set of graphs, since only the two-particle
irreducible graphs contribute, i.e. the graphs that cannot be made disconnected by
removing two arbitrary propagators. Consider a 2-particle reducible graph,
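$$ G_{AB} \equiv \int_{x,y} \big[\,A\,\big] \;\text{joined to}\; \big[\,B\,\big] \;\text{by two bare propagators, attached at } x \text{ and } y \; , \tag{2.116} $$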
in which we have exhibited the two bare propagators that would disconnect the graph
if removed. Summing over the graphs that can contribute to B, we may write this as
$$ \sum_B G_{AB} = \int d^4x\,d^4y\; A(x,y)\,\big\langle\phi(x)\phi(y)\big\rangle_c \; , \tag{2.117} $$
(the subscript c indicates that we keep only the connected part of the two point
function) with
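$$ \big\langle\phi(x)\phi(y)\big\rangle_c \equiv e^{-i\Gamma[\varphi,G]}\int D\phi\;\phi(x)\phi(y)\; e^{\,i\left\{S[\phi+\varphi]+\int\left(j\phi+\frac{k}{2}[\phi\phi+2\phi\varphi-G]\right)\right\}} $$
$$ \phantom{\big\langle\phi(x)\phi(y)\big\rangle_c} = -\varphi(x)\varphi(y) + 2\,\underbrace{e^{-i\Gamma[\varphi,G]}\; e^{-i\int\left\{j\varphi+\frac{k}{2}(G+\varphi\varphi)\right\}}}_{e^{-W[j,k]}}\;\frac{\delta e^{W[j,k]}}{i\,\delta k(x,y)} = G(x,y) \; . \tag{2.118} $$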
In the second equality, we have ignored some terms that have already been shown to
vanish when studying ⟨ϕ(x)⟩, and we have extracted the combination ϕ(x)ϕ(y) by
differentiating with respect to k(x, y). From this identity, we obtain
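$$ \sum_B \int_{x,y} \big[\,A \text{ joined to } B \text{ by two propagators at } x, y\,\big] = \int_{x,y} \big[\,A \text{ closed by a single line } G(x,y)\,\big] \; . \tag{2.119} $$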
In other words, when summing over all the possible graphs contributing to B, the
2-particle reducible block is replaced by a single propagator G(x, y), and the resulting
graph is two-particle irreducible. Thus, the functional Γ [φ, G], when expressed in
terms of the 2-point function G, is made only of 2-particle irreducible graphs, and its
derivatives with respect to the field φ are the 2-particle irreducible n-point functions.
Thus, Γ [φ, G] is the generating functional in φ of the 2PI correlation functions.
2PI functional at null field : Consider first the 2PI effective action at null field,
Γ [0, G]. At φ ≡ 0, we have
$$ -2\,\frac{\delta\Gamma[0,G]}{\delta G} = k_{0,G} \; . \tag{2.120} $$
Using the path integral representation (2.114), and replacing k0,G by the above
equality, we get4
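$$ e^{i\Gamma[0,G]} = \int D\phi\; e^{\,i\left\{S[\phi] + \mathrm{tr}\left([G-\phi\phi]\,\frac{\delta\Gamma[0,G]}{\delta G}\right) + \int j_{0,G}\,\phi\right\}} $$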
The first equality is an implicit identity obeyed by Γ [0, G]. The diagrammatic
interpretation in the second line follows from the discussion at the end of the previous
subsection. In the right hand side, the term in j0,G ϕ cancels the 1-particle reducible
tadpole contributions, while the term in G − ϕϕ is the one that eliminates the 2-
particle reducible ones (by replacing chains like G0 ΣG0 Σ · · · ΣG0 by G). Note that
the bare propagator G0 defined by the quadratic part of the classical action S[ϕ] does
not appear in the final result for Γ [0, G], since it is replaced systematically by G. Only
the interaction terms of the classical action matter, since they define the vertices that
connect the G’s in the diagrammatic representation of Γ [0, G].
where the source jφ,k now has an implicit dependence on the field φ and on the
second source k. This functional obeys:
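$$ \frac{\delta\Gamma_k[\varphi]}{\delta k(x,y)} = \frac{\delta W}{i\,\delta k(x,y)} + \int d^4z\;\underbrace{\left.\frac{\delta W}{i\,\delta j(z)}\right|_{j=j_{\varphi,k}}}_{\varphi(z)}\,\frac{\delta j_{\varphi,k}(z)}{\delta k(x,y)} - \int d^4z\;\varphi(z)\,\frac{\delta j_{\varphi,k}(z)}{\delta k(x,y)} = \frac{\delta W}{i\,\delta k(x,y)} \; . \tag{2.123} $$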
4 We use the compact notation $\int_{x,y} A(x,y)B(x,y) = \mathrm{tr}\,(AB)$.
that are identical to eqs. (2.113). This proves that the definition (2.125) of the 2PI
effective action is equivalent to the original definition (2.112).
Thus, if we denote Γk,1 [φ] ≡ Γk [φ] − Sk [φ] the terms at 1-loop and higher orders,
we have
$$ e^{i\Gamma_{k,1}[\varphi]} = \int D\phi\; e^{\,i\left\{S_k[\varphi+\phi] - S_k[\varphi] - \int \phi\,\frac{\delta(S_k[\varphi]+\Gamma_{k,1}[\varphi])}{\delta\varphi}\right\}} \; . \tag{2.130} $$
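$$ S_k[\varphi+\phi] \equiv S_k[\varphi] + \int \phi\,\frac{\delta S_k[\varphi]}{\delta\varphi} + \frac{1}{2}\int \phi\;\underbrace{\frac{\delta^2 S_k[\varphi]}{\delta\varphi\,\delta\varphi}}_{k+iG_\varphi^{-1}}\;\phi + S_{\rm int}[\varphi;\phi] \; , \tag{2.131} $$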
where Sint [φ; ϕ] denotes the terms of degree at least three in ϕ in the Taylor expansion
of S[φ + ϕ], and $G_\varphi^{-1}$ is the inverse of the tree-level propagator in the background
field φ. Therefore, Γk,1 [φ] can also be written as
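$$ e^{i\Gamma_{k,1}[\varphi]} = \int D\phi\; e^{\,i\left\{\frac{1}{2}\int \phi\,[k+iG_\varphi^{-1}]\,\phi + S_{\rm int}[\varphi;\phi] - \int \phi\,\frac{\delta\Gamma_{k,1}[\varphi]}{\delta\varphi}\right\}} \; . \tag{2.132} $$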
This equation defines Γ2 [φ, G] so that the left hand side is indeed Γ [φ, G]. The
Combining eqs. (2.125), (2.132) and (2.133), we must have
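$$ \Gamma_2[\varphi,G] = \mathrm{const} - \frac{1}{2}\,\mathrm{tr}\,\big([k_{\varphi,G} + i\,G_\varphi^{-1}]\,G\big) - \frac{i}{2}\,\mathrm{tr}\,\ln(G^{-1}) + \Gamma_{k,1}[\varphi] \; . \tag{2.134} $$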
$$ \frac{\delta\Gamma_2[\varphi,G]}{\delta G} + \frac{1}{2}\,k_{\varphi,G} + \frac{i}{2}\,\big[G_\varphi^{-1} - G^{-1}\big] = 0 \; , \tag{2.135} $$
that follows from eqs. (2.113) and (2.133). We thus obtain
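$$ e^{i\Gamma_2[\varphi,G]} = \int D\phi\; e^{\,i\left\{S_{\varphi,G}[\phi] + \mathrm{tr}\left([G-\phi\phi]\,\frac{\delta\Gamma_2}{\delta G}\right) - \int \phi\,\frac{\delta\Gamma_{k,1}[\varphi]}{\delta\varphi}\right\}} \; , \tag{2.136} $$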
with
$$ S_{\varphi,G}[\phi] \equiv \frac{i}{2}\int \phi\, G^{-1}\,\phi + S_{\rm int}[\varphi;\phi] \; . \tag{2.137} $$
Finally, using the fact that −jφ,G = δΓk,1 /δφ and comparing with eq. (2.121), we
see that Γ2 [φ, G] is the sum of the 2PI vacuum graphs built with the propagator G
and the vertices obtained from the expansion of S[φ + ϕ]. The first four terms of the
expansion of Γ2 [φ, G] in a scalar field theory with φ4 interaction are shown in the
figure 2.3.
After setting the source k = 0, the equation of motion for the propagator (2.135)
becomes
$$ -i\,G^{-1} = -i\,G_\varphi^{-1} - \underbrace{2\,\frac{\delta\Gamma_2[\varphi,G]}{\delta G}}_{-\Sigma} \; , \tag{2.138} $$
which is known as the Dyson equation, that resums the self-energy Σ on the propagator.
Convoluting this equation by G on the right gives
$$ -i\,G_\varphi^{-1}\,G + \Sigma\,G = -i \; . \tag{2.139} $$
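The statement that the Dyson equation resums the self-energy on the propagator can be illustrated with finite matrices. The sketch below assumes small random matrices `g_phi` and `sigma` as hypothetical stand-ins for the tree-level propagator and the self-energy (they are not taken from the text), solves eq. (2.139) for G, and compares the result with the geometric series G_φ Σ_k (−i Σ G_φ)^k.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 4
# Hypothetical finite-dimensional stand-ins for G_phi and Sigma.
g_phi = np.eye(n) + 0.1 * rng.normal(size=(n, n))
sigma = 0.05 * rng.normal(size=(n, n))

# Eq. (2.139): -i G_phi^{-1} G + Sigma G = -i, i.e. G = (G_phi^{-1} + i Sigma)^{-1}.
g_phi_inv = np.linalg.inv(g_phi)
g_full = np.linalg.inv(g_phi_inv + 1j * sigma)

# Residual of eq. (2.139); it should vanish up to rounding errors.
residual = -1j * g_phi_inv @ g_full + sigma @ g_full + 1j * np.eye(n)

# Resummation: G = G_phi + G_phi (-i Sigma G_phi) + G_phi (-i Sigma G_phi)^2 + ...
term = g_phi.astype(complex)
g_series = np.zeros((n, n), dtype=complex)
for _ in range(60):
    g_series += term
    term = term @ (-1j * sigma @ g_phi)

print(np.abs(residual).max(), np.abs(g_series - g_full).max())
```

The series converges here only because the chosen `sigma` is small; the closed form obtained by matrix inversion does not rely on that assumption.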
$$ i\,G_\varphi^{-1} = \frac{\delta^2 S[\varphi]}{\delta\varphi\,\delta\varphi} = -(\Box + m^2) - V''(\varphi) \; , \tag{2.140} $$
where V(φ) is the interaction potential in the Lagrangian. Therefore, the equation of
motion has the following more explicit form
$$ \big[\Box + m^2 + V''(\varphi) + \Sigma\big]\,G = -i \; . \tag{2.141} $$
$$ \Sigma = -2\,\frac{\delta\Gamma_2[\varphi,G]}{\delta G} \; . \tag{2.142} $$
Diagrammatically, this derivative amounts to opening one internal line of the graphs
that contribute to Γ2 . For instance the graphs of the figure 2.3 give the following
topologies in Σ:
Σ ∼ [self-energy diagrams obtained by opening one internal line of the graphs of figure 2.3] .
needs to substitute H → H − µQ in the definition of the partition function, where Q is the operator of the
conserved charge and µ the associated chemical potential.
where we have formally identified the density operator exp(−βH) with a time evolu-
tion operator for an imaginary time iβ. This relationship is called the Kubo-Martin-
Schwinger (KMS) identity. Although we have established it for an expectation value
of a product of two operators, it is completely general.
The identification of the density operator with an imaginary time evolution opera-
tor is at the heart of the formalism to evaluate canonical ensemble expectation values.
If we represent the trace that appears in the partition function in the coordinate basis,
$$ Z_\beta = \int dq\;\big\langle q\big|\,e^{-\beta H}\,\big|q\big\rangle \; , \tag{2.146} $$
the integrand in the right hand side is a transition amplitude similar to eq. (2.3), except
that initial and final coordinates are identical, and the time interval is imaginary. We
can nevertheless formally reproduce all the manipulations of the section 2.1, with
an initial time ti ≡ 0 and a final time tf ≡ −iβ. It is common to introduce the
Euclidean time τ ≡ it, with τ varying from 0 to β. The only change to our original
derivation of the path integral is that the path q(t) must be replaced by a path q(τ)
whose time derivative is the Euclidean velocity q̇E , related to the usual velocity q̇ by
$$ \dot q \equiv \frac{dq}{dt} = i\,\underbrace{\frac{dq}{d\tau}}_{\dot q_E} \; . \tag{2.147} $$
In the second line, we have simplified the boundary conditions of the path q(τ), since
the only constraint it must obey is to be β-periodic in imaginary time. The integration
over the momentum p(τ) is again Gaussian, and after performing it we obtain the
following expression
$$ Z_\beta = \int_{q(0)=q(\beta)} Dq(\tau)\;\exp\Big\{-\underbrace{\int_0^\beta d\tau\,\Big(\frac{m}{2}\,\dot q_E^2(\tau) + V(q(\tau))\Big)}_{S_E[q(\tau)]}\Big\} \; . \tag{2.149} $$
where the symbol Tτ denotes the time-ordering in the imaginary time τ. Likewise,
we may define a generating functional for these expectation values
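$$ \mathrm{Tr}\,\Big( e^{-\beta H}\; T_\tau \exp\int_0^\beta d\tau\; j(\tau)\,Q(\tau)\Big) = \int_{q(0)=q(\beta)} Dq(\tau)\; e^{-S_E[q(\tau)] + \int_0^\beta d\tau\; j(\tau)\,q(\tau)} \; . \tag{2.151} $$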
This formalism can be extended readily to a quantum field theory. In this context,
it can be used to calculate canonical ensemble expectation values of operators for a
system of relativistic particles. One can write directly the following generalization of
eq. (2.151),
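$$ \underbrace{\mathrm{Tr}\,\Big( e^{-\beta H}\; T_\tau \exp\int_0^\beta d^4x_E\; j(x)\,\varphi(x)\Big)}_{Z[j;\beta]} = \int_{\varphi(0,\mathbf{x})=\varphi(\beta,\mathbf{x})} D\varphi(x)\; e^{-S_E[\varphi(x)] + \int_0^\beta d^4x_E\; j(x)\,\varphi(x)} \; , \tag{2.152} $$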
where the measure d4 xE stands for dτ d3 x. Like in the case of ordinary QFT in
Minkowski space-time, we can isolate the interactions by writing:
$$ Z[j;\beta] = \exp\Big\{-\int d^4x_E\; \mathcal{L}_{E,I}\Big(\frac{\delta}{\delta j(x)}\Big)\Big\}\; Z_0[j;\beta] \; , \tag{2.153} $$
where LE,I is the interaction term in the Euclidean Lagrangian density, and Z0 [j; β]
is the generating functional of the non-interacting theory:
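$$ Z_0[j;\beta] = \int_{\varphi(0,\mathbf{x})=\varphi(\beta,\mathbf{x})} D\varphi(x)\;\exp\Big\{-\int_0^\beta d^4x_E\;\Big[\frac{1}{2}\Big((\partial_\tau\varphi)^2 + (\nabla\varphi)^2 + m^2\varphi^2\Big) - j\varphi\Big]\Big\} \; . \tag{2.154} $$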
where the free Euclidean propagator $G^0_E(x,y)$ is the inverse of the operator $m^2 - \partial_\tau^2 - \nabla^2$
over the space of functions that are β-periodic in the imaginary time variable.
Because of this periodicity, the “energy” variable, conjugate to the Euclidean time, is
discrete:
$$ \omega_n \equiv \frac{2\pi n}{\beta} \qquad (n\in\mathbb{Z}) \; . \tag{2.156} $$
In terms of these energies, called Matsubara frequencies, the free Euclidean propagator
in momentum space reads
$$ \widetilde{G}^0_E(\omega_n,\mathbf{p}) = \frac{1}{\omega_n^2 + \mathbf{p}^2 + m^2} \; . \tag{2.157} $$
Note that the denominator cannot vanish, and therefore this propagator does not
need an i0+ prescription for being fully defined. Eqs. (2.153) and (2.155) lead to a
perturbative expansion that can be cast into an expansion in terms of Feynman dia-
grams. The Feynman rules associated to these graphs are very similar to those already
encountered when calculating scattering amplitudes, with only a few modifications:
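$$ \text{Propagators}: \quad \frac{1}{\omega_n^2 + \mathbf{p}^2 + m^2} \; , \tag{2.158} $$
$$ \text{Vertices}: \quad -\lambda\; 2\pi\,\delta\Big(\sum_i \omega_{n_i}\Big)\,(2\pi)^3\,\delta\Big(\sum_i \mathbf{p}_i\Big) \; , \tag{2.159} $$
$$ \text{Loops}: \quad \frac{1}{\beta}\sum_{n\in\mathbb{Z}}\int\frac{d^3\mathbf{p}}{(2\pi)^3} \; . \tag{2.160} $$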
In other words, the main difference with the usual perturbative expansion is that
the energies are replaced by the discrete Matsubara frequencies, and that the loop
integration on p0 is replaced by a discrete sum over these frequencies.
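As a small illustration of the discrete sum that replaces the p0 integration, the sketch below evaluates the Matsubara sum of the free propagator (2.157) at fixed energy E = (p² + m²)^(1/2) and compares it with the standard closed form coth(βE/2)/(2E). The numerical values of `beta` and `energy`, the truncation `n_max` and the function names are illustrative assumptions, not quantities given in the text.

```python
import math

def matsubara_sum(beta, energy, n_max):
    """Truncated (1/beta) * sum over n of 1 / (omega_n^2 + E^2), with omega_n = 2 pi n / beta."""
    total = 0.0
    for n in range(-n_max, n_max + 1):
        omega_n = 2.0 * math.pi * n / beta
        total += 1.0 / (omega_n ** 2 + energy ** 2)
    return total / beta

def closed_form(beta, energy):
    """Standard result coth(beta E / 2) / (2 E) for the full sum."""
    return 1.0 / (2.0 * energy * math.tanh(0.5 * beta * energy))

beta, energy = 2.0, 1.5
approx = matsubara_sum(beta, energy, 200_000)
exact = closed_form(beta, energy)
print(approx, exact)
```

The truncated sum converges slowly, like 1/n_max, because the summand only decays as 1/ω_n²; this is the discrete counterpart of the ultraviolet behaviour of the p0 integral.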
Chapter 3
In the previous chapter, we have learned that the quantization of a scalar field may be
performed by means of the path integral representation. This leads to a much more
concise derivation of the generating functional, and of the free propagator, compared
to the canonical approach. In this chapter, we will therefore seek a similar path
integral formalism for other types of fields, in view of the functional quantization of
a gauge theory such as QED (and later, of non-Abelian gauge theories, for which a
canonical approach would be extremely difficult to implement).
3.1.1 Definition
A Grassmann number may be represented by a nilpotent 2 × 2 matrix, and the Grassmann algebra with N
numbers are the classical analogue of anti-commuting quantum operators. For a set
of Grassmann variables ψi (i = 1 · · · N), we have
$$ \{\psi_i,\psi_j\} = 0 \; . \tag{3.1} $$
Consider first the case N = 1. The square of a Grassmann number ψ is therefore zero,
ψ² = 0, and by induction all higher powers of ψ are also zero. The Taylor expansion
of a function of ψ is therefore limited to the first two terms,
f(ψ) = a + ψb . (3.2)
In general, we need to deal with functions f(ψ) that are themselves commuting objects.
Therefore, the coefficient a is an ordinary number, while b is another Grassmann
number, {b, b} = {b, ψ} = 0. This implies that
f(ψ) = a + ψb = a − bψ . (3.3)
Because of the non-commuting nature of b and ψ, we may define left and right
derivatives, denoted by:
$$ \overrightarrow{\partial}_\psi\, f(\psi) = b \; , \qquad f(\psi)\,\overleftarrow{\partial}_\psi = -b \; . \tag{3.4} $$
• Linearity :
$$ \int d\psi\;\alpha\, f(\psi) = \alpha\int d\psi\; f(\psi) \; , \tag{3.5} $$
generators admits a representation in terms of $2^N \times 2^N$ matrices, that may be viewed as operators acting
on the Hilbert space of N identical fermions of spin 1/2 (of dimension $2^N$ since each spin has two states).
For instance, when N = 2, one may represent the Grassmann numbers ψ1,2 as
$$ \psi_1 = \begin{pmatrix} 0&0&0&0\\ 1&0&0&0\\ 0&0&0&0\\ 0&0&1&0 \end{pmatrix} , \qquad \psi_2 = \begin{pmatrix} 0&0&0&0\\ 0&0&0&0\\ 1&0&0&0\\ 0&-1&0&0 \end{pmatrix} . $$
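The N = 2 matrix representation quoted in this footnote can be checked directly. The following minimal sketch only verifies that the two 4 × 4 matrices are nilpotent and anti-commute, as required by eq. (3.1); the helper name `anticommutator` is an illustrative choice.

```python
import numpy as np

# The two 4x4 matrices of the N = 2 representation quoted in the footnote.
psi1 = np.array([[0, 0, 0, 0],
                 [1, 0, 0, 0],
                 [0, 0, 0, 0],
                 [0, 0, 1, 0]])
psi2 = np.array([[0, 0, 0, 0],
                 [0, 0, 0, 0],
                 [1, 0, 0, 0],
                 [0, -1, 0, 0]])

def anticommutator(a, b):
    """Return {a, b} = a b + b a."""
    return a @ b + b @ a

# All anticommutators must vanish, while the product psi1 psi2 itself does not.
print(anticommutator(psi1, psi1))
print(anticommutator(psi1, psi2))
print(psi1 @ psi2)
```

The relative sign in the last row of ψ2 is what makes the two matrices anti-commute rather than commute.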
up to an overall constant that should be the same for all functions. Thus, integration
and differentiation of functions of a Grassmann variable are essentially the same thing.
In particular, the Berezin integral satisfies:
$$ \int d\psi\; 1 = 0 \; , \qquad \int d\psi\;\psi = 1 \; . \tag{3.8} $$
with implicit summations on the indices $i_n$. Terms of degree higher than N cannot
exist because they would contain the square of at least one of the ψi ’s, and therefore be
zero. We have chosen to write the Grassmann variables on the left of the coefficients
in order to simplify the calculation of the left derivatives $\overrightarrow{\partial}_\psi$. Note that the last
coefficient $C_{i_1\cdots i_N}$ must be proportional to the Levi-Civita tensor:
Integration : In order to be consistent with eqs. (3.8), the integral of f(ψ) over the
N Grassmann variables ψ1 , · · · , ψN , must be given by
$$ \int d^N\psi\; f(\psi) = \gamma \; . \tag{3.12} $$
the integral over this ψi will therefore give zero. A somewhat more explicit
formulation of an integral over N Grassmann variables is to write the measure as
$d^N\psi \equiv d\psi_N\, d\psi_{N-1}\cdots d\psi_1$ (in this order), and to perform the N integrals
successively, starting with the innermost one (i.e. dψ1 ). Therefore
$$ \int d^N\psi\;\psi_1\cdots\psi_N = \int d\psi_N \cdots \underbrace{\int d\psi_2\;\underbrace{\int d\psi_1\;\psi_1}_{1}\;\psi_2}_{1}\cdots\psi_N = 1 \; . \tag{3.13} $$
ψi ≡ Jij θj , (3.14)
where θ1 · · · θN are N Grassmann variables. The last term of the expansion of f(ψ),
the only one relevant for integration, can be rewritten as
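$$ \psi_{i_1}\cdots\psi_{i_N}\,\epsilon_{i_1\cdots i_N}\,\gamma = J_{i_1 j_1}\theta_{j_1}\cdots J_{i_N j_N}\theta_{j_N}\,\epsilon_{i_1\cdots i_N}\,\gamma = \det J\;\;\theta_{j_1}\cdots\theta_{j_N}\,\epsilon_{j_1\cdots j_N}\,\gamma \; . \tag{3.15} $$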
(Recall that functions of two Grassmann variables are in fact polynomials of degree
two.) Therefore, in the case N = 2, the Gaussian integral (3.17) reads2
$$ I(M) = \mu = \big[\det(M)\big]^{1/2} \; . \tag{3.20} $$
In the case of a general even N, the matrix M may be written in the following block
diagonal form,
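$$ M = Q\;\underbrace{\begin{pmatrix} 0 & \mu_1 & & & \\ -\mu_1 & 0 & & & \\ & & 0 & \mu_2 & \\ & & -\mu_2 & 0 & \\ & & & & \ddots \end{pmatrix}}_{D}\; Q^T \; , \tag{3.21} $$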
Contrast this with the result of a Gaussian integral in the case of ordinary real variables,
eq. (2.68), where the square root of the determinant appeared in the denominator.
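To make the statement about the square root of the determinant concrete, one can implement a tiny Grassmann algebra and evaluate the Gaussian integral by brute force. The sketch below is an illustration under stated assumptions: polynomials are stored as dictionaries mapping an ordered tuple of generator indices to a coefficient, the Berezin integral is taken to be the coefficient of ψ1 ··· ψN as in eq. (3.13), and the function names and the random 4 × 4 antisymmetric matrix are hypothetical choices.

```python
import numpy as np

def multiply_monomials(a, b):
    """Product of two ordered monomials (tuples of indices): returns (sign, monomial), sign 0 if it vanishes."""
    if set(a) & set(b):
        return 0, ()                      # a repeated generator squares to zero
    merged = list(a + b)
    sign = 1
    # Bubble sort; each transposition of two generators flips the sign.
    for i in range(len(merged)):
        for j in range(len(merged) - 1 - i):
            if merged[j] > merged[j + 1]:
                merged[j], merged[j + 1] = merged[j + 1], merged[j]
                sign = -sign
    return sign, tuple(merged)

def multiply(f, g):
    """Product of two Grassmann polynomials stored as {monomial: coefficient}."""
    out = {}
    for mono_a, coeff_a in f.items():
        for mono_b, coeff_b in g.items():
            sign, mono = multiply_monomials(mono_a, mono_b)
            if sign:
                out[mono] = out.get(mono, 0.0) + sign * coeff_a * coeff_b
    return out

def gaussian_integral(m):
    """Berezin integral of exp(1/2 psi_i M_ij psi_j) for an antisymmetric matrix M."""
    n = m.shape[0]
    # For antisymmetric M, 1/2 psi_i M_ij psi_j = sum over i < j of M_ij psi_i psi_j.
    quadratic = {(i, j): m[i, j] for i in range(n) for j in range(i + 1, n)}
    result, term = {(): 1.0}, {(): 1.0}
    for k in range(1, n // 2 + 1):        # the exponential series stops at degree N
        term = {mono: c / k for mono, c in multiply(term, quadratic).items()}
        for mono, c in term.items():
            result[mono] = result.get(mono, 0.0) + c
    return result.get(tuple(range(n)), 0.0)   # coefficient of psi_1 ... psi_N

rng = np.random.default_rng(2)
a = rng.normal(size=(4, 4))
m = a - a.T                                # random antisymmetric matrix
integral = gaussian_integral(m)
pfaffian = m[0, 1] * m[2, 3] - m[0, 2] * m[1, 3] + m[0, 3] * m[1, 2]
print(integral, pfaffian, integral ** 2, np.linalg.det(m))
```

For N = 4 the integral reproduces the Pfaffian of M, whose square is det M, in contrast with the inverse square root obtained for commuting variables.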
It is often necessary to perform a Gaussian integral in the presence of a source
that shifts the minimum of the quadratic form in the exponential,
$$ I(M,\eta) \equiv \int d^N\psi\;\exp\Big(\frac{1}{2}\,\psi_i M_{ij}\psi_j + \eta_i\psi_i\Big) \; , \tag{3.24} $$
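where ǫ(σ) is the signature of the permutation σ, and with $\tau \equiv \rho\sigma^{-1}$ in the second
line. Using ǫ(σ)ǫ(τσ) = ǫ(τ), this becomes:
$$ J(M) = (-1)^{\frac{N(N-1)}{2}}\;\underbrace{\frac{1}{N!}\sum_{\sigma\in S_N} 1}_{1}\;\;\underbrace{\sum_{\tau\in S_N}\epsilon(\tau)\, M_{1\tau(1)}\cdots M_{N\tau(N)}}_{\det(M)} \; . \tag{3.29} $$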
Note that this overall sign may be absorbed into a reordering of the measure, since:
$$ d^N\xi\, d^N\psi = (-1)^{\frac{N(N-1)}{2}}\; d\xi_N\, d\psi_N \cdots d\xi_1\, d\psi_1 \; . \tag{3.30} $$
Therefore, we have
$$ \int d\xi_N\, d\psi_N \cdots d\xi_1\, d\psi_1\;\exp\big(\psi_i M_{ij}\,\xi_j\big) = \det M \; . \tag{3.31} $$
Now, let us define complex Grassmann variables, from two of the previously defined
Grassmann variables ψ and ξ:
$$ \chi \equiv \frac{\psi + i\xi}{\sqrt{2}} \; , \qquad \bar\chi \equiv \frac{\psi - i\xi}{\sqrt{2}} \; . \tag{3.32} $$
Conversely, we have
$$ \psi = \frac{\chi + \bar\chi}{\sqrt{2}} \; , \qquad \xi = \frac{i\,(\bar\chi - \chi)}{\sqrt{2}} \; , \tag{3.33} $$
and the integrations over these variables are related by
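$$ d\xi\, d\psi = i\, d\chi\, d\bar\chi \; , \qquad \psi\,\xi = -i\,\bar\chi\,\chi \; , \qquad \int d\chi\, d\bar\chi\;\bar\chi\,\chi = \int d\xi\, d\psi\;\psi\,\xi = 1 \; . \tag{3.34} $$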
where $S^0_F(x,y)$ is the free Dirac time-ordered propagator and $\eta$ and $\bar\eta$ are a pair of
complex Grassmann-valued sources. Indeed, we have
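$$ \left.\frac{\overrightarrow{\delta}}{i\,\delta\bar\eta(x)}\; Z_0[\eta,\bar\eta]\;\frac{\overleftarrow{\delta}}{i\,\delta\eta(y)}\right|_{\eta=\bar\eta=0} = S^0_F(x,y) \; . \tag{3.39} $$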
Taking more than two derivatives (but with an equal number of derivatives with respect
to $\eta$ and with respect to $\bar\eta$) will lead to all the contributions in the free time-ordered
product of spinors, with the correct signs to account for their anti-commuting nature.
Note that using Grassmann-valued sources was necessary in order to get these signs.
Then, by comparing eqs. (3.37) and (3.38), we can represent this free generating
function as a path integral over Grassmann variables:
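$$ Z_0[\eta,\bar\eta] = \int D\psi(x)\,D\bar\psi(x)\;\exp\Big\{ i\int d^4x\;\Big[\bar\psi(x)\,(i\partial\!\!\!/ - m)\,\psi(x) + \bar\eta(x)\psi(x) + \bar\psi(x)\eta(x)\Big]\Big\} $$
$$ \phantom{Z_0[\eta,\bar\eta]} = \int D\psi(x)\,D\bar\psi(x)\; e^{iS[\psi,\bar\psi]}\; e^{\,i\int d^4x\,\left(\bar\eta(x)\psi(x) + \bar\psi(x)\eta(x)\right)} \; . \tag{3.40} $$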
We have ignored the determinant, since it is independent of the sources. Instead, one
simply adjusts the normalization of the generating functional so that Z0 [0, 0] = 1.
The second line shows that the path integral formulation of a field theory of spin 1/2
fermions takes the same form as that of scalar fields, provided we use Grassmann
variables instead of commuting c-numbers.
$$ \mathcal{L}_I = -i\,e\,\bar\psi\,\gamma^\mu A_\mu\,\psi \; . \tag{3.41} $$
As in the scalar case, this interaction can be factored out of the generating functional,
by writing:
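$$ Z[\eta,\bar\eta] = \exp\Big\{-ie\int d^4x\; A_\mu(x)\;\frac{\overrightarrow{\delta}}{i\,\delta\eta(x)}\;\gamma^\mu\;\frac{\overleftarrow{\delta}}{i\,\delta\bar\eta(x)}\Big\}\; Z_0[\eta,\bar\eta] \; . \tag{3.42} $$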
Here, we are treating the photon field as a fixed background. When we consider
the path integral representation of dynamical photons in the next section, the Aµ (x)
inside the exponential will also be replaced by a functional derivative.
In the case of photons, the difficulties encountered in the path integral formulation
are of a different nature. Since photons are bosons, we expect that they can be
represented by a functional integration over commuting functions Aµ (x). But the
gauge invariance of the theory implies that there is an unavoidable redundancy in this
representation: the naive path integral over [DAµ (x)] would integrate over infinitely
many copies of the same physical configurations. Therefore, we need a way to cut
through this redundancy, which is achieved by gauge fixing.
In order to better see the nature of this difficulty, let us assume that we can treat
Aµ (x) as four scalar fields, and write the following path integral,
$$ Z_0[j^\mu] \equiv \int DA_\mu(x)\;\exp\Big\{ i\int d^4x\;\Big(-\frac{1}{4}F^{\mu\nu}F_{\mu\nu} + j^\mu A_\mu\Big)\Big\} \; . \tag{3.43} $$
(3.44)
Performing this Gaussian integral requires the inverse of the object gµν k2 − kµ kν ,
that one may seek as a linear combination of the metric tensor gµν and kµ kν /k2 , i.e.
we are looking for coefficients α and β such that:
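$$ \underbrace{\big[g_{\mu\nu}\,k^2 - k_\mu k_\nu\big]\Big[\alpha\, g^{\nu\rho} + \beta\,\frac{k^\nu k^\rho}{k^2}\Big]}_{\alpha\,k^2\,\delta_\mu{}^\rho \;-\; \alpha\,k_\mu k^\rho} = \delta_\mu{}^\rho \; . \tag{3.45} $$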
This equation clearly has no solution, and therefore it is impossible to invert $g_{\mu\nu}k^2 - k_\mu k_\nu$.
This means that some eigenvalues of this operator are zero, and that the
quadratic form $\widetilde{A}^\mu(k)\,[g_{\mu\nu}k^2 - k_\mu k_\nu]\,\widetilde{A}^\nu(-k)$ has flat directions. Along these flat
directions, the exponential in the path integral (3.43) does not decrease, which spoils
its convergence. These flat directions correspond to the projection of $\widetilde{A}^\mu(k)$ along $k^\mu$.
Note that they also do not contribute to the linear term $j^\mu A_\mu$, for a conserved current
that satisfies $\partial_\mu j^\mu = 0$. Therefore, one should not integrate over these components of
$A_\mu$ in eq. (3.43).
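The flat direction can be made explicit numerically. The sketch below assumes the mostly-minus metric g = diag(1, −1, −1, −1) and an arbitrary off-shell momentum (both the sample momentum and the function name `kinetic_operator` are illustrative choices), and checks that the operator of eq. (3.45) annihilates k and therefore has a vanishing determinant.

```python
import numpy as np

metric = np.diag([1.0, -1.0, -1.0, -1.0])     # g_{mu nu}, mostly-minus signature

def kinetic_operator(k_up):
    """Return the matrix g_{mu nu} k^2 - k_mu k_nu for a contravariant momentum k^mu."""
    k_down = metric @ k_up                     # lower the index
    k_squared = k_up @ k_down
    return metric * k_squared - np.outer(k_down, k_down)

k_up = np.array([1.3, 0.4, -0.7, 0.2])         # arbitrary off-shell momentum
operator = kinetic_operator(k_up)

null_vector = operator @ k_up                  # contraction of the second index with k^nu
determinant = np.linalg.det(operator)
print(null_vector, determinant)
```

Since the operator has a zero eigenvalue along k for every momentum, no choice of α and β in eq. (3.45) can produce an inverse.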
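$$ A^\mu = A^\mu_\perp + A^\mu_\parallel \; , \qquad \widetilde{A}^\mu_\perp(k) \equiv \Big(g^{\mu\nu} - \frac{k^\mu k^\nu}{k^2}\Big)\widetilde{A}_\nu(k) \; , \qquad \widetilde{A}^\mu_\parallel(k) \equiv \frac{k^\mu k^\nu}{k^2}\,\widetilde{A}_\nu(k) \; . \tag{3.46} $$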
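$$ Z_0[j^\mu] \equiv \int DA^\mu_\parallel(x)\;\exp\Big\{ i\int d^4x\; j_\mu A^\mu_\parallel\Big\} \;\times\; \int DA^\mu_\perp(x)\;\exp\Big\{ i\int d^4x\;\Big(-\frac{1}{4}F^{\mu\nu}F_{\mu\nu} + j_\mu A^\mu_\perp\Big)\Big\} \; . \tag{3.48} $$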
$$ Z_0[j^\mu] = \exp\Big\{-\frac{1}{2}\int d^4x\,d^4y\; j^\mu(x)\; G^0_{F\,\mu\nu}(x,y)\; j^\nu(y)\Big\} \; , \tag{3.49} $$
2
$$ G^{0\,\mu\nu}_F(p) \equiv \frac{-i}{p^2 + i0^+}\Big(g^{\mu\nu} - \frac{p^\mu p^\nu}{p^2}\Big) \; . \tag{3.50} $$
(We have introduced the i0+ prescription that selects the ground state at x0 → ±∞,
using the same argument as in the section 2.3.3.) The procedure used here is equivalent
to imposing the gauge fixing condition ∂µ Aµ = 0, called Lorenz gauge or Landau
gauge. As one can see, the resulting propagator (3.50) differs from the Coulomb
gauge propagator given in eq. (1.247).
where ω(x) is some arbitrary function of space-time. This can be done by introducing
a functional delta function, δ[∂µ Aµ − ω], inside the path integral. However, the
introduction of the function ω(x) breaks Lorentz invariance. To mitigate this problem,
one integrates over all the functions ω(x), with a Gaussian weight. This amounts to
defining the generating functional as follows4 ,
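$$ Z_0[j^\mu] \equiv \int D\omega(x)\;\exp\Big\{-i\,\frac{\xi}{2}\int d^4x\;\omega^2(x)\Big\} \;\times\; \int DA_\mu(x)\;\delta\big[\partial_\mu A^\mu - \omega\big]\;\exp\Big\{ i\int d^4x\;\Big(-\frac{1}{4}F^{\mu\nu}F_{\mu\nu} + j^\mu A_\mu\Big)\Big\} \; , \tag{3.52} $$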
From this formula, a standard Gaussian integration tells us that the corresponding
photon propagator in momentum space should be the inverse of
$$ i\,\big[g_{\mu\nu}\,p^2 - (1-\xi)\,p_\mu p_\nu\big] \; . \tag{3.54} $$
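Looking for an inverse of the form $\alpha\, g^{\nu\rho} + \beta\, p^\nu p^\rho / p^2$, we find
$$ G^{0\,\mu\nu}_F(p) = \frac{-i\,g^{\mu\nu}}{p^2 + i0^+} + \frac{i}{p^2 + i0^+}\Big(1 - \frac{1}{\xi}\Big)\frac{p^\mu p^\nu}{p^2} \; . \tag{3.55} $$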
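The inversion leading to eq. (3.55) can be cross-checked numerically. The sketch below assumes the mostly-minus metric, an arbitrary off-shell momentum (so that the i0⁺ prescription can be dropped) and an arbitrary finite value of ξ; these sample values and the function names are illustrative assumptions. It multiplies the operator (3.54) by the propagator (3.55) and compares the product with the identity.

```python
import numpy as np

metric = np.diag([1.0, -1.0, -1.0, -1.0])      # g_{mu nu}; g^{mu nu} has the same entries

def inverse_propagator(p_up, xi):
    """i [g_{mu nu} p^2 - (1 - xi) p_mu p_nu], with lower indices, as in eq. (3.54)."""
    p_down = metric @ p_up
    p2 = p_up @ p_down
    return 1j * (metric * p2 - (1.0 - xi) * np.outer(p_down, p_down))

def propagator(p_up, xi):
    """Eq. (3.55) with upper indices, for off-shell p (the i0+ prescription is dropped)."""
    p2 = p_up @ (metric @ p_up)
    return -1j * metric / p2 + 1j * (1.0 - 1.0 / xi) * np.outer(p_up, p_up) / p2 ** 2

p_up = np.array([2.0, 0.3, -0.5, 0.9])
xi = 2.5
# Matrix product = contraction of a lower index with an upper index.
product = inverse_propagator(p_up, xi) @ propagator(p_up, xi)
feynman = propagator(p_up, 1.0)                 # Feynman gauge, xi = 1
print(np.round(product, 12))
```

At ξ = 1 the term proportional to p^µ p^ν drops out, which is the Feynman gauge propagator.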
4 Since the argument of the delta function is linear in the variable $A^\mu_\parallel$ that does not appear in the
integrand, we do not need a Jacobian. It is possible to impose non-linear gauge conditions of the form
$\delta[F(\partial_\mu A^\mu) - \omega]$, but this should be done by writing the path integral as follows
$$ \int D\omega(x)\;\exp\Big\{-i\,\frac{\xi}{2}\int d^4x\;\omega^2(x)\Big\}\int DA_\mu(x)\;\underbrace{F'(\partial_\mu A^\mu)}_{\rm Jacobian}\;\delta\big[F(\partial_\mu A^\mu) - \omega\big]\cdots $$
In general, the Jacobian cannot be ignored since it depends on the gauge field, but it can be expressed in
terms of ghost fields. Doing this would be a useless complication in QED, but is an essential step in the
quantization of non-Abelian gauge theories.
The gauge fixing parameter ξ appears in the propagator, but only in the term
proportional to $p^\mu p^\nu$. Thanks to the Ward-Takahashi identities, it has no
effect on physical results, provided that all the external charged particles are on
mass-shell. The Landau gauge of the previous subsection corresponds to ξ → ∞.
Another popular choice is the Feynman gauge, obtained for ξ = 1,
$$ \left. G^{0\,\mu\nu}_F(p)\right|_{\xi=1} = \frac{-i\,g^{\mu\nu}}{p^2 + i0^+} \; . \tag{3.56} $$
Note that one could also introduce a non Lorentz covariant condition inside the delta
function, such as δ[∂i Ai − ω], in order to derive the photon propagator in Coulomb
gauge via the path integral.
In the right hand side, φ(x) should be viewed as a dummy integration variable, and
the result of the integral should be unmodified if we change φ(x) → φ(x) + δφ(x).
This translates into
$$ 0 = \delta Z[j] = i\int D\varphi(x)\; e^{\,iS[\varphi] + i\int j\varphi}\int d^4x\;\delta\varphi(x)\,\Big[j(x) + \frac{\delta S}{\delta\varphi(x)}\Big] \; . \tag{3.58} $$
Taking n functional derivatives of this identity with respect to ij(x1 ),...,ij(xn ) and
setting then j to zero gives:
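$$ 0 = \int D\varphi(x)\; e^{iS[\varphi]}\int d^4x\;\delta\varphi(x)\,\Big[\, i\,\frac{\delta S}{\delta\varphi(x)}\,\varphi(x_1)\cdots\varphi(x_n) + \sum_{i=1}^n \delta(x-x_i)\prod_{j\neq i}\varphi(x_j)\Big] \; . \tag{3.59} $$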
Since in this discussion the variation δφ(x) is arbitrary, this implies the following
identities
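$$ 0 = \int D\varphi(x)\; e^{iS[\varphi]}\,\Big[\, i\,\frac{\delta S}{\delta\varphi(x)}\,\varphi(x_1)\cdots\varphi(x_n) + \sum_{i=1}^n \delta(x-x_i)\prod_{j\neq i}\varphi(x_j)\Big] \; , \tag{3.60} $$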
(We have used the remark following eq. (2.30) in order to let the operator $\Box + m^2$
act also on the step functions that order the operators in the time-ordered product.)
If we convolve this equation with the free Feynman propagator (i.e. the inverse of
the operator $\Box_x + m^2$), the above Schwinger-Dyson equation can be represented
diagrammatically as follows:
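[Diagrammatic equation (3.62): two graphs with external points 1, 2, · · · , n and x on the left hand side, equal to a sum over i = 1, · · · , n of contact terms in which the point i is singled out from the remaining points 1, · · · , i−1, i+1, · · · , n.]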
The Schwinger-Dyson equations have several simple consequences. When applied
to a free theory (λ = 0) in the case n = 1, we get
$$ \big(\Box_x + m^2\big)\,\big\langle 0_{\rm out}\big|\,T\,\varphi(x_1)\varphi(x)\,\big|0_{\rm in}\big\rangle = -i\,\delta(x-x_1) \; , \tag{3.63} $$
which is nothing but the equation of motion satisfied by the Feynman propagator. In
the general case, if x differs from all the xi ’s, we obtain
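$$ \big(\Box_x + m^2\big)\,\big\langle 0_{\rm out}\big|\,T\,\varphi(x_1)\cdots\varphi(x_n)\varphi(x)\,\big|0_{\rm in}\big\rangle + \frac{\lambda}{3!}\,\big\langle 0_{\rm out}\big|\,T\,\varphi(x_1)\cdots\varphi(x_n)\varphi^3(x)\,\big|0_{\rm in}\big\rangle = 0 \; . \tag{3.64} $$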
Thus, in a certain sense5 , we can say that time-ordered products of fields satisfy the
Euler-Lagrange equation of motion.
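$$ \delta\mathcal{L} = \frac{\partial\mathcal{L}}{\partial\varphi(x)}\,\delta\varphi(x) + \frac{\partial\mathcal{L}}{\partial(\partial_\mu\varphi(x))}\,\partial_\mu(\delta\varphi(x)) = \partial_\mu\Big[\frac{\partial\mathcal{L}}{\partial(\partial_\mu\varphi(x))}\,\delta\varphi(x)\Big] + \frac{\delta S}{\delta\varphi(x)}\,\delta\varphi(x) \; . \tag{3.66} $$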
Note that in some cases, a continuous symmetry does not leave the Lagrangian
density invariant, but modifies it by a total derivative,
δL = ∂µ Kµ , (3.69)
so that only the action is invariant. There is still a conserved current, given by
$$ J^\mu(x) \equiv \frac{\partial\mathcal{L}}{\partial(\partial_\mu\varphi(x))}\,\delta\varphi(x) - K^\mu(x) \; . \tag{3.70} $$
This however does not modify eqs. (3.68).
Let us consider a set of fermion fields ψn (x), that we encapsulate into a multiplet
denoted ψ(x), and assume that they interact with a gauge potential $A^a_\mu(x)$ in a
non-chiral way (this is the case of electromagnetic interactions and of strong interactions).
Consider now the following transformation of the fermion fields:
ψ(x) → U(x)ψ(x) . (3.71)
The Hermitian conjugate of ψ transforms as:
ψ† (x) → ψ† (x)U† (x) , (3.72)
so that we have
$$ \bar\psi(x) \equiv \psi^\dagger(x)\gamma^0 \;\to\; \psi^\dagger(x)\,U^\dagger(x)\,\gamma^0 = \bar\psi(x)\,\gamma^0 U^\dagger(x)\gamma^0 \; . \tag{3.73} $$
Since they are Grassmann variables, the measure should be transformed with the
inverse of the determinant of the transformation. Since the transformation under
consideration is local in x, it reads
$$ D\psi\, D\bar\psi \;\to\; \frac{1}{\det(U)\,\det(\overline{U})}\; D\psi\, D\bar\psi \; , \tag{3.74} $$
where the matrices $U$ and $\overline{U}$ carry both indices for the fermion species and space-time
indices:
where $\alpha(x) \in \mathbb{R}$ and where t is a Hermitian matrix that does not contain $\gamma^5 \equiv i\gamma^0\gamma^1\gamma^2\gamma^3$. Therefore:
and
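$$ (U\overline{U})_{xm,yn} = \int d^4z\;\sum_p U_{xm,zp}\,\overline{U}_{zp,yn} = \int d^4z\;\delta(x-z)\,\delta(z-y)\,\sum_p \big[e^{-i\alpha(z)t}\big]_{mp}\big[e^{i\alpha(z)t}\big]_{pn} = \delta_{mn}\,\delta(x-y) \; . \tag{3.78} $$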
Thus $U\overline{U} = 1$, which implies $\det U\,\det\overline{U} = 1$, and the fermion measure is invariant
under this kind of transformations. This means that this symmetry does not exhibit
quantum anomalies.
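where t is again a Hermitian matrix. Such transformations are called chiral
transformations. The matrix $\gamma^5 \equiv i\gamma^0\gamma^1\gamma^2\gamma^3$ satisfies
$$ \big(\gamma^5\big)^2 = 1 \; , \qquad \gamma^{5\dagger} = \gamma^5 \; , \qquad \{\gamma^5,\gamma^0\} = 0 \; , \tag{3.81} $$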
which implies:
$$ \gamma^0\, U^\dagger(x)\,\gamma^0 = \gamma^0\, e^{-i\alpha(x)\gamma^5 t}\,\gamma^0 = e^{\,i\alpha(x)\gamma^5 t} = U(x) \; . \tag{3.82} $$
Thus $\overline{U} = U$, and $\det\overline{U} = \det U$. Unless this determinant is equal to one, the measure
is not invariant and transforms according to:
$$ D\psi\, D\bar\psi \;\to\; \frac{1}{(\det U)^2}\; D\psi\, D\bar\psi \; . \tag{3.83} $$
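The algebraic properties used in this argument are easy to verify with explicit matrices. The sketch below assumes the Dirac representation of the γ matrices and a single fermion species (t = 1), with an arbitrary sample value of α; these choices and the function name `chiral_u` are illustrative assumptions. It checks eqs. (3.81) and (3.82), together with the trace tr(γ⁵γ⁰γ¹γ²γ³) = −4i.

```python
import numpy as np

I2, Z2 = np.eye(2), np.zeros((2, 2))
pauli = [np.array([[0, 1], [1, 0]], dtype=complex),
         np.array([[0, -1j], [1j, 0]]),
         np.array([[1, 0], [0, -1]], dtype=complex)]

# Dirac representation (assumed here): gamma^0 = diag(1, -1), gamma^i = [[0, s_i], [-s_i, 0]].
gamma0 = np.block([[I2, Z2], [Z2, -I2]]).astype(complex)
gammas = [gamma0] + [np.block([[Z2, s], [-s, Z2]]) for s in pauli]
gamma5 = 1j * gammas[0] @ gammas[1] @ gammas[2] @ gammas[3]

def chiral_u(alpha):
    """U = exp(i alpha gamma5) = cos(alpha) + i gamma5 sin(alpha), valid because gamma5^2 = 1."""
    return np.cos(alpha) * np.eye(4) + 1j * np.sin(alpha) * gamma5

alpha = 0.37
u = chiral_u(alpha)
u_bar = gamma0 @ u.conj().T @ gamma0            # gamma^0 U^dagger gamma^0
trace = np.trace(gamma5 @ gammas[0] @ gammas[1] @ gammas[2] @ gammas[3])
print(np.allclose(u_bar, u), trace)
```

Because γ⁵ anti-commutes with γ⁰, conjugation by γ⁰ undoes the sign flip produced by the Hermitian conjugation, which is why the two determinants do not compensate each other for a chiral transformation.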
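$$ (\det U)^{-2} = \exp\Big\{-2\,\mathrm{tr}\,\ln\big[1 + i\alpha(x)\,\gamma^5 t\;\delta(x-y)\big]\Big\} \;\underset{\alpha\ll 1}{\approx}\; \exp\Big\{-2i\,\mathrm{tr}\,\big[\alpha(x)\,\gamma^5 t\;\delta(x-y)\big]\Big\} = \exp\Big\{ i\int d^4x\;\alpha(x)\,\mathcal{A}(x)\Big\} \; , \tag{3.86} $$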
In this equation, the trace symbol tr denotes both a trace on the indices carried by
the Dirac matrices and a trace on the fermion species. In terms of this function, the
measure transforms as
$$ D\psi\, D\bar\psi \;\to\; e^{\,i\int d^4x\;\alpha(x)\,\mathcal{A}(x)}\; D\psi\, D\bar\psi \; . \tag{3.88} $$
The fact that this measure is not invariant under the transformation (3.80) implies
that there exist fermion loop corrections that break the invariance under chiral
transformations, even if the Dirac Lagrangian itself is invariant (this is the case when
one considers a global transformation, i.e. a constant α(x), and the fermions are
massless). The prefactor that alters the measure can be absorbed into a redefinition of
the Lagrangian,
Everything happens as if the Lagrangian itself were not invariant under this transformation.
If one integrates out the fermion fields in order to obtain an effective theory for the
other fields, the term in α(x)A(x) must be included in the Lagrangian of this effective
theory in order to correctly account for the quantum anomalies.
At first sight, the expression (3.87) of the anomaly function A(x) is very poorly
defined: the trace is zero, but it is multiplied by an infinite δ(0). In order to manipulate
finite expressions, we must first regularize the delta function. This can be done by
writing:
$$ \mathcal{A}(x) = -2\lim_{y\to x,\;M\to+\infty}\mathrm{tr}\,\Big\{\gamma^5\, t\; F\Big(-\frac{D\!\!\!\!/\,_x^{\,2}}{M^2}\Big)\Big\}\,\delta(x-y) \; , \tag{3.90} $$
F(0) = 1 ,
F(+∞) = 0 ,
s F′ (s) = 0 at s = 0 and at s = +∞ . (3.92)
which leads to
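$$ \mathcal{A}(x) = -2\lim_{y\to x,\;M\to+\infty}\int\frac{d^4k}{(2\pi)^4}\;\mathrm{tr}\,\Big\{\gamma^5\, t\; F\Big(-\frac{D\!\!\!\!/\,_x^{\,2}}{M^2}\Big)\Big\}\, e^{ik(x-y)} = -2\lim_{M\to+\infty}\int\frac{d^4k}{(2\pi)^4}\;\mathrm{tr}\,\Big\{\gamma^5\, t\; F\Big(-\frac{(i k\!\!\!/ + D\!\!\!\!/\,_x)^2}{M^2}\Big)\Big\} \; . \tag{3.94} $$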
7 We are considering here the case where the fermions are coupled to a non-Abelian gauge field. The
index a carried by $A^a_\mu$ is a “colour” index, and the $t^a$’s are the generators of the Lie algebra representation
where the fermions live. g is the coupling of the fermions to the gauge fields. See the next chapter for more
details.
The last equality is obtained by two successive integrations by parts. We also have:
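$$ D\!\!\!\!/\,_x^{\,2} = D^\mu_x D^\nu_x\,\gamma_\mu\gamma_\nu = \frac{1}{2}\,D^\mu_x D^\nu_x\,\big(\{\gamma_\mu,\gamma_\nu\} + [\gamma_\mu,\gamma_\nu]\big) = D_x^2 + \frac{1}{4}\,[D^\mu_x, D^\nu_x]\,[\gamma_\mu,\gamma_\nu] = D_x^2 - \frac{ig}{4}\, t^a F^{\mu\nu}_a\,[\gamma_\mu,\gamma_\nu] \; . \tag{3.100} $$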
Using
$$ \mathrm{tr}\,\big(\gamma^5\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma\big) = -4\,i\,\epsilon^{\mu\nu\rho\sigma} \; , \tag{3.101} $$
we obtain
$$ \mathcal{A}(x) = -\frac{g^2}{16\pi^2}\,\epsilon_{\mu\nu\rho\sigma}\, F^{\mu\nu}_a(x)\, F^{\rho\sigma}_b(x)\;\mathrm{tr}\,(t^a t^b t) \; , \tag{3.102} $$
8 In this counting, we assume that the matrix t does not contain Dirac matrices.
9 Recall that the rotationally invariant measure in 4-dimensional Euclidean space is $2\pi^2\kappa^3\, d\kappa$.
where the trace is now only on the fermion species. When t is the identity matrix,
the integral of A(x) depends only on topological properties of the gauge field con-
figuration and takes discrete values. In the context of anomalies, it is called the
Chern-Pontryagin index. Moreover, the Atiyah-Singer theorem relates this invariant
to the zero modes of the Euclidean Dirac operator in this gauge field (see the section
3.5.7).
where $J^\mu_5(x)$ is the axial current. Integrating by parts, and identifying this variation
with the term obtained in the previous section, we should have
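$$ \big\langle\partial_\mu J^\mu_5(x)\big\rangle_A = -\frac{g^2}{16\pi^2}\,\epsilon_{\mu\nu\rho\sigma}\, F^{\mu\nu}_a(x)\, F^{\rho\sigma}_b(x)\;\mathrm{tr}\,(t^a t^b t) \; , \tag{3.104} $$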
where $\langle\cdot\rangle_A$ is an average over the fermion fields, in a fixed gauge field configuration.
Strong interaction : Through the strong interactions, all quark flavours couple
identically with the gluons (i.e. all quarks belong to the same representation of the
SU(3) algebra). In other words, the matrices ta that describe this coupling do not
depend on the quark flavour (equivalently, one may say that they are proportional to
the identity in quark flavour space). The trace that appears in the anomaly function
can be factored into separate flavour and colour factors
\[ \mathrm{tr}\,(t^a t^b t) = \mathrm{tr}_{\rm colour}(t^a t^b) \times \underbrace{\mathrm{tr}_{\rm flavour}(t)}_{1-1=0} = 0 . \tag{3.107} \]
This means that the anomalies that may occur in the gluon-gluon term cancel between
the u and d flavours of quarks.
and the identity 1colour in colour space, since all the quark colours couple identically
to photons. Therefore, the trace in the anomaly function is
\[ \mathrm{tr}_{\rm flavour}(Q^2 t) \times \mathrm{tr}_{\rm colour}(\mathbf{1}_{\rm colour}) = \frac{N_c}{3} , \tag{3.109} \]
where Nc = 3 is the number of colours. This leads to
\[ A(x) = -\frac{e^2 N_c}{48\pi^2}\, \epsilon_{\mu\nu\rho\sigma}\, F^{\mu\nu}(x)\, F^{\rho\sigma}(x) , \tag{3.110} \]
where Fµν is the electromagnetic field strength.
Decay of the neutral pion in two photons : At low energy, the strong interactions
may be described by an effective theory that couples a doublet of fermions ψ (the u
and d quarks), the three pions π and a field σ. The interaction term in this model is
where σi (i = 1, 2, 3) are the Pauli matrices. Note that π3 must be the neutral pion,
since it couples diagonally to the two components of the doublet (σ3 is a diagonal
matrix). This interaction term is invariant under the transformation (3.105) provided
that the fields σ and π transform as
Moreover, the masses of nucleons are due to a spontaneous breaking of this symmetry,
in which the σ field has a non-zero expectation value in the ground state: σ = fπ .
Thus the variation of the field π3 is δπ3 = fπ α.
When photons are added to this model, there is no direct coupling between the
neutral pion and the photon. Let us now consider the theory that would result from
integrating out the quark fields. The anomaly (3.110) would produce a term
\[ \mathcal{L}_{\rm anom}(x) = -\frac{e^2 N_c}{48\pi^2}\, \epsilon_{\mu\nu\rho\sigma}\, F^{\mu\nu}(x)\, F^{\rho\sigma}(x)\, \alpha(x) \tag{3.113} \]
in the Lagrangian. This term should be canceled somehow, because we are now
talking about an effective theory of pions and photons, that should be chiral invariant.
The resolution of this issue is that this effective theory contains a coupling between
the neutral pion and two photons, of the form:
\[ \mathcal{L}_{\pi^0\gamma\gamma} = -\frac{e^2 N_c}{48\pi^2 f_\pi}\, \epsilon_{\mu\nu\rho\sigma}\, F^{\mu\nu}(x)\, F^{\rho\sigma}(x)\, \pi_3(x) . \tag{3.114} \]
The decay rate of a neutral pion into two photons can be easily determined from the
effective coupling (3.114):
This result could also be obtained by computing the transition amplitude at one loop
from a neutral pion to two photons in the effective model we started from. The present
considerations show that this decay is in fact controlled to a large extent by a quantum
anomaly.
Covariant derivatives $D_\mu = \partial_\mu - i g A^a_\mu t^a$ are anti-Hermitean, because the gauge
potential $A^a_\mu$ is real and the colour matrices $t^a$ are Hermitean (recall that an ordinary
where the index i runs from 1 to 4. Now, the Dirac matrices γi are all anti-Hermitean,
which implies that the Euclidean Dirac operator is Hermitean. It can therefore be
diagonalized in an orthonormal basis of eigenfunctions φk :
\[ \slashed{D}_x\, \phi_k(x) = \lambda_k\, \phi_k(x) , \qquad \int d^4x_E\; \phi_k^\dagger(x)\, \phi_{k'}(x) = \delta_{kk'} , \tag{3.118} \]
with real eigenvalues λk . Note also that these eigenfunctions must obey the following
completeness relation:
\[ \sum_k \phi_k(x)\, \phi_k^\dagger(y) = \delta(x-y) . \tag{3.119} \]
and use the completeness identity in order to express the delta function in the anomaly
function A(x) in eq. (3.90):
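\[
\begin{aligned}
A(x) &= -2 \lim_{y\to x,\,M\to+\infty} \mathrm{tr}\left[\gamma^5\, F\!\left(-\frac{\slashed{D}_x^2}{M^2}\right) \sum_k \phi_k(x)\, \phi_k^\dagger(y)\right] \\
&= -2 \lim_{y\to x,\,M\to+\infty} \sum_k \mathrm{tr}\left[\phi_k^\dagger(y)\, \gamma^5\, F\!\left(-\frac{\slashed{D}_x^2}{M^2}\right) \phi_k(x)\right] \\
&= -2 \lim_{M\to+\infty} \sum_k F\!\left(-\frac{\lambda_k^2}{M^2}\right) \phi_k^\dagger(x)\, \gamma^5\, \phi_k(x) .
\end{aligned}
\tag{3.121}
\]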
between an integral that involves the field strength of a gauge field configuration and
a sum over the spectrum of the Euclidean Dirac operator (in the same gauge field).
When $\lambda_k \neq 0$, the state $\phi_{k'}$ is distinct from the state $\phi_k$. Since $\slashed{D}_x$ is Hermitean,
they are in fact orthogonal:
\[ \int d^4x_E\; \phi_k^\dagger(x)\, \gamma^5\, \phi_k(x) = \int d^4x_E\; \phi_k^\dagger(x)\, \phi_{k'}(x) = 0 . \tag{3.125} \]
This implies that none of the eigenfunctions φk with a non-zero eigenvalue can
contribute to the right hand side of eq. (3.122). The only contributions to eq. (3.122)
come from the eigenfunctions for which λk = 0, i.e. the zero modes of the Euclidean
Dirac operator. Since we have assumed that F(0) = 1, we have:
\[ \frac{g^2}{32\pi^2} \int d^4x_E\; \epsilon_{ijkl}\, F^a_{ij}(x)\, F^b_{kl}(x)\; \mathrm{tr}\,(t^a t^b) = \sum_{k|\lambda_k=0} \int d^4x_E\; \phi_k^\dagger(x)\, \gamma^5\, \phi_k(x) . \tag{3.126} \]
Since $\{\gamma^5, \slashed{D}_x\} = 0$, we can choose these zero modes in such a way that they are also
eigenmodes of γ5 , with eigenvalues +1 or −1. We can thus divide the zero modes in
two families, the right-handed and the left-handed zero modes:
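\[
\begin{aligned}
\slashed{D}_x\, \phi_R(x) &= 0 , & \gamma^5 \phi_R(x) &= +\phi_R(x) , \\
\slashed{D}_x\, \phi_L(x) &= 0 , & \gamma^5 \phi_L(x) &= -\phi_L(x) .
\end{aligned}
\tag{3.127}
\]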
Using also the fact that the eigenfunctions are normalized as follows,
\[ \int d^4x_E\; \phi_R^\dagger(x)\, \phi_R(x) = 1 , \qquad \int d^4x_E\; \phi_L^\dagger(x)\, \phi_L(x) = 1 , \tag{3.128} \]
Gauge theories are quantum field theories with matter fields (usually spin 1/2 fermions,
but also possibly scalars) and gauge potentials, constructed in such a way that the
Lagrangian is invariant under the action of a local continuous transformation. Quantum
Electrodynamics is the simplest such theory, with a local U(1) invariance. Given
Ω(x) ∈ U(1), the various objects that enter in the theory transform as follows:
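\[
\begin{aligned}
\psi &\to \Omega^{-1}\, \psi , \\
A_\mu &\to A_\mu + \frac{i}{e}\, \Omega^{-1} \partial_\mu \Omega , \\
F_{\mu\nu} &\to F_{\mu\nu} , \\
D_\mu &\to \Omega^{-1} D_\mu\, \Omega = \partial_\mu - ie \left( A_\mu + \frac{i}{e}\, \Omega^{-1} \partial_\mu \Omega \right) ,
\end{aligned}
\tag{4.1}
\]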
with Dµ ≡ ∂µ − ieAµ the covariant derivative. The field strength Fµν would then
be defined as ∂µ Aν − ∂ν Aµ .
Our goal is now to generalize the concept of gauge theory to more general groups
of transformations, in view of applications to the electroweak and to the strong
interactions. In these two cases, the internal group of transformations is SU(2) and
SU(3), respectively, but we will consider in most of this chapter a general Lie group.
We thus want to construct a consistent field theory that generalizes eqs. (4.1) to the case
where Ω(x) belongs to some general Lie group G.
The matrix exponential eX is defined from the Taylor series of the exponential by
\[ e^X \equiv \sum_{n=0}^{\infty} \frac{X^n}{n!} , \tag{4.5} \]
1 The prefactor i inside the exponential is common in the quantum physics literature, but seldom used in
mathematics. Its main benefit is to make X a Hermitean matrix when the group elements are unitary.
4. N ON -A BELIAN GAUGE SYMMETRY 145
that converges for all finite size matrices X since this series has an infinite radius of
convergence. A crucial property of the matrix exponential is that
Instead, one may use Trotter’s formula (also known as the Lie product formula)2 :
\[ e^{X+Y} = \lim_{n\to\infty} \left( e^{X/n}\, e^{Y/n} \right)^n . \tag{4.7} \]
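The Taylor-series definition (4.5) and Trotter's formula (4.7) can be checked numerically. The following is only a minimal sketch in plain Python: the 2 × 2 matrices X and Y, the truncation order of the series, and all helper names (`expm`, `trotter`, ...) are illustrative choices, not part of the text.

```python
import math

def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(c, A):
    """Return c * A."""
    return [[c * a for a in row] for row in A]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def max_abs(A):
    return max(abs(a) for row in A for a in row)

def expm(X, terms=40):
    """Matrix exponential from the truncated Taylor series of eq. (4.5)."""
    result = identity(len(X))
    term = identity(len(X))
    for k in range(1, terms):
        term = scale(1.0 / k, matmul(term, X))  # term is now X^k / k!
        result = add(result, term)
    return result

def trotter(X, Y, n):
    """(e^{X/n} e^{Y/n})^n, the finite-n approximation of eq. (4.7)."""
    step = matmul(expm(scale(1.0 / n, X)), expm(scale(1.0 / n, Y)))
    result = identity(len(X))
    for _ in range(n):
        result = matmul(result, step)
    return result

# Two matrices that do not commute (an arbitrary illustrative choice).
X = [[0.0, 1.0], [0.0, 0.0]]
Y = [[0.0, 0.0], [1.0, 0.0]]

exact = expm(add(X, Y))  # X + Y is sigma_1, so this is cosh(1) + sigma_1 sinh(1)
naive_error = max_abs(add(matmul(expm(X), expm(Y)), exact, -1.0))
trotter_errors = {n: max_abs(add(trotter(X, Y, n), exact, -1.0))
                  for n in (10, 100, 1000)}

print("e^X e^Y versus e^(X+Y):", naive_error)
for n, err in trotter_errors.items():
    print("n =", n, "Trotter error:", err)
```

With these matrices the plain product e^X e^Y differs visibly from e^{X+Y}, while the Trotter error should shrink roughly like 1/n, in line with the O(n^{-1}) correction in the sketch of the proof.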
\[ X = X_a\, t^a \qquad (X_a \in \mathbb{R}) , \tag{4.8} \]
with an implicit sum on the index a. The $t^a$’s are called the generators of the algebra.
Thanks to the exponential mapping (4.4), the properties of the Lie groups listed
in eqs. (4.3) translate into specific properties of the matrices X in the corresponding
algebras:
Note that the conditions imposed on Ω in eqs. (4.3) are non-linear, in contrast with
the linear conditions obeyed by the matrices X in eqs. (4.9). This is why a Lie group
is a curved manifold, while a Lie algebra is a linear space.
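2 A sketch of the proof is the following:
\[
\begin{aligned}
e^{X/n}\, e^{Y/n} &= 1 + \frac{X+Y}{n} + O(n^{-2}) = \exp\left( \frac{X+Y}{n} + O(n^{-2}) \right) , \\
\left( e^{X/n}\, e^{Y/n} \right)^n &= \exp\left( X + Y + O(n^{-1}) \right) .
\end{aligned}
\]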
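[Figure 4.1: the curve $e^{itX}$ on the group manifold, and its tangent vector iX at the identity, which belongs to the Lie algebra g.]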
The group elements exp(itX) form a smooth curve on the group manifold (t = 0
corresponds to the identity), and iX may be viewed as the vector tangent to this curve
at the identity, as illustrated in the figure 4.1. The non-commutativity of the group is
related to the curvature of the corresponding manifold3 . Because of this curvature, a
displacement eiX followed by a displacement eiY does not lead to the same point as
the two displacements performed in reverse order. This geometrical representation
also provides an interpretation of Trotter’s formula for the exponential of a sum, as
shown in the figure 4.2.
The dimension of the Lie algebra equals the number of independent directions on
the group manifold. From the conditions listed in (4.9) on the matrices X ∈ g, it is
easy to determine the dimension of these algebras (viewed as algebras over the field
ℝ). The dimensions are listed in the table 4.1 for some common cases.
Moreover, as we shall see in the section 4.1.5, the group multiplication can be
inferred from that on the Lie algebra, via the Baker-Campbell-Hausdorff formula.
Despite these correspondences, the Lie algebra may not reflect the global properties of
the group (e.g. whether it is connected), and distinct Lie groups may have isomorphic
Lie algebras. This is for instance the case of U(1) and SO(2), SO(3) and SU(2), or
SU(2) × SU(2) and SO(4).
3 This assertion could be made more precise as follows: it is possible to define a metric tensor on the
group manifold, and the corresponding Ricci curvature tensor. This curvature may then be expressed in
terms of the constants that define the commutators between the generators of the algebra (see eq. (4.14)).
Figure 4.2: Geometrical interpretation of Trotter’s formula: the broken path, made of a succession of elementary steps $e^{itX/n}$ and $e^{itY/n}$, approximates better and better the curve $e^{it(X+Y)}$ on the group manifold as n → ∞.
where the equality follows from the Taylor series of the exponential. From the
definition of the Lie algebra, this implies that Ω−1 X Ω ∈ g. Therefore, if X, Y ∈ g
we also have
\[ e^{-itX}\; Y\; e^{itX} \in \mathfrak{g} , \tag{4.12} \]
and the derivative with respect to t at t = 0 is also an element of the algebra,
\[ -i\,[X, Y] \in \mathfrak{g} . \tag{4.13} \]
In other words, −i times the commutator of two elements of a Lie algebra is another
element of the algebra. Thus −i[·, ·] is the multiplication law4 in g (it is also called
4 In contrast, the ordinary product of two elements of the algebra is in general not in the algebra.
the Lie bracket). Therefore, the commutators between its generators can be written as
\[ [t^a, t^b] = i\, f^{abc}\, t^c , \tag{4.14} \]
where the fabc are real numbers called the structure constants. The antisymmetry of
the commutator implies that fabc = −fbac . Given three elements X, Y, Z ∈ g of the
algebra, their commutator satisfies the Jacobi identity
\[ [X,[Y,Z]] + [Y,[Z,X]] + [Z,[X,Y]] = 0 , \tag{4.15} \]
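As a concrete illustration of eqs. (4.14) and (4.15), one may take the algebra su(2) with generators $t^a = \sigma^a/2$ (an example chosen here, not taken from the text) and extract the structure constants numerically. The sketch below is plain Python; the trace formula used to isolate $f^{abc}$ assumes the normalization $\mathrm{tr}(t^a t^b) = \delta^{ab}/2$, and all helper names are hypothetical.

```python
def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(c, A):
    return [[c * a for a in row] for row in A]

def comm(A, B):
    """Commutator [A, B]."""
    return add(matmul(A, B), matmul(B, A), -1.0)

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def max_abs(A):
    return max(abs(a) for row in A for a in row)

# Generators of su(2) in the fundamental representation: t^a = sigma^a / 2.
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
t = [scale(0.5, sigma) for sigma in PAULI]
R3 = range(3)

# With tr(t^a t^b) = delta^{ab} / 2, eq. (4.14) gives
# f^{abc} = -2i tr([t^a, t^b] t^c).
f = [[[-2j * trace(matmul(comm(t[a], t[b]), t[c])) for c in R3]
      for b in R3] for a in R3]

# The structure constants should be real and antisymmetric in (a, b).
max_imag = max(abs(f[a][b][c].imag) for a in R3 for b in R3 for c in R3)
max_sym = max(abs(f[a][b][c] + f[b][a][c]) for a in R3 for b in R3 for c in R3)

def jacobi(X, Y, Z):
    """Left-hand side of the Jacobi identity (4.15)."""
    total = comm(X, comm(Y, Z))
    total = add(total, comm(Y, comm(Z, X)))
    return add(total, comm(Z, comm(X, Y)))

max_jacobi = max(max_abs(jacobi(t[a], t[b], t[c]))
                 for a in R3 for b in R3 for c in R3)

print("f^{123} =", f[0][1][2].real)
print("largest imaginary part:", max_imag)
print("largest violation of antisymmetry:", max_sym)
print("largest Jacobi residual:", max_jacobi)
```

For su(2) the structure constants obtained in this way coincide with the Levi-Civita symbol, and the Jacobi residual vanishes because the bracket is realized as a matrix commutator.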
The function adX is called the adjoint mapping at the point X. The exponential of the
adjoint mapping plays an important role, thanks to the following formula5
(This is known as Duhamel’s formula6 .) The non-trivial aspect of this formula is that
it is true even when X(t) does not commute with its derivative. Then, given X, Y ∈ g,
5 This may be proven by considering a one-parameter family of such equalities:
\[ e^{t\, \mathrm{ad}_X}\, Y = e^{-itX}\; Y\; e^{itX} , \]
and by noting that the left and right hand sides coincide at t = 0, and obey identical differential equations
with respect to the parameter t.
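6 Note that this formula is equivalent to:
\[ \frac{d}{dt}\, e^{iX(t)} = i \int_0^1 ds\; e^{i(1-s)X(t)}\; \frac{dX(t)}{dt}\; e^{isX(t)} . \]
This latter form can be proven by writing
\[ e^{iX(t+\varepsilon)} = e^{i\left( X(t) + \varepsilon X'(t) + O(\varepsilon^2) \right)} = \lim_{n\to\infty} \left( e^{\frac{i}{n} X(t)}\; e^{\frac{i\varepsilon}{n} X'(t) + O(\varepsilon^2)} \right)^n , \]
(we use Trotter’s formula to obtain the second equality) and by expanding the right hand side to first order
in ε.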
Differentiating both sides with respect to t (using eq. (4.19) for the left hand side),
we obtain
\[ \frac{dZ(t)}{dt} = \left[ \frac{e^{\mathrm{ad}_{Z(t)}} - 1}{\mathrm{ad}_{Z(t)}} \right]^{-1} Y . \tag{4.21} \]
\[ F(z) \equiv \frac{\ln(z)}{z-1} . \tag{4.24} \]
This leads to
\[ \ln\left( e^{iX}\, e^{iY} \right) = i(X+Y) - \frac{1}{2}\,[X,Y] - \frac{i}{12} \Big( [X,[X,Y]] - [Y,[X,Y]] \Big) + \cdots \tag{4.26} \]
(Explicit expressions for all the coefficients of this series are given by Dynkin’s
formula.) In applications to quantum field theory, we usually need only the first
two terms of this expansion because the commutators we encounter are commuting
numbers and all the subsequent terms are zero. Besides being an intermediate step
in the derivation of eq. (4.26), the integral form (4.23) shows that the group product
can be reconstructed from Lie algebra manipulations (since the right hand side of this
equation contains only objects that belong to the algebra).
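The first terms of the expansion (4.26) can be tested numerically by exponentiating the truncated series and comparing with the product $e^{iX} e^{iY}$. The sketch below is plain Python and makes several illustrative assumptions: X and Y are small multiples of $\sigma_1/2$ and $\sigma_2/2$, the size parameter `eps` is 0.1, and the matrix exponential is a truncated Taylor series.

```python
def matmul(A, B):
    """Product of two square matrices stored as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(c, A):
    return [[c * a for a in row] for row in A]

def comm(A, B):
    """Commutator [A, B]."""
    return add(matmul(A, B), matmul(B, A), -1.0)

def max_abs(A):
    return max(abs(a) for row in A for a in row)

def expm(X, terms=40):
    """Matrix exponential from its truncated Taylor series."""
    n = len(X)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = result
    for k in range(1, terms):
        term = scale(1.0 / k, matmul(term, X))  # X^k / k!
        result = add(result, term)
    return result

eps = 0.1  # small parameter controlling the size of X and Y (illustrative)
X = scale(0.5 * eps, [[0, 1], [1, 0]])        # eps * sigma_1 / 2
Y = scale(0.5 * eps, [[0, -1j], [1j, 0]])     # eps * sigma_2 / 2

target = matmul(expm(scale(1j, X)), expm(scale(1j, Y)))   # e^{iX} e^{iY}

c_xy = comm(X, Y)
# First two terms of eq. (4.26): i(X + Y) - [X, Y] / 2.
Z2 = add(scale(1j, add(X, Y)), c_xy, -0.5)
# Third-order term: -(i/12) ([X,[X,Y]] - [Y,[X,Y]]).
nested = add(comm(X, c_xy), comm(Y, c_xy), -1.0)
Z3 = add(Z2, nested, -1j / 12.0)

err2 = max_abs(add(expm(Z2), target, -1.0))
err3 = max_abs(add(expm(Z3), target, -1.0))

print("error keeping terms up to second order:", err2)
print("error keeping terms up to third order: ", err3)
```

Keeping the third-order term should reduce the mismatch noticeably, and both errors should decrease as `eps` is made smaller.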
4.1.6 Representations
A real representation of a Lie group G is a group homomorphism from elements of
G to elements of GL(n, ℝ), i.e. a mapping π from G to GL(n, ℝ) that preserves the
group structure:
π(1) = 1 , π(Ω2 Ω1 ) = π(Ω2 )π(Ω1 ) . (4.27)
A representation is said to be faithful if it is a one-to-one mapping.
Likewise, one may define representations of a Lie algebra as homomorphisms
from g to gl(n, ℝ), i.e. mappings π that preserve the Lie algebra structure:
\[ \pi(\alpha X + \beta Y) = \alpha\, \pi(X) + \beta\, \pi(Y) , \qquad \pi([X,Y]) = [\pi(X), \pi(Y)] . \tag{4.28} \]
Note that if we define $t^a_\pi \equiv \pi(t^a)$ the images of the generators, then they obey
$[t^a_\pi, t^b_\pi] = i f^{abc}\, t^c_\pi$ with the same structure constants as in the original Lie algebra.
Since we are focusing on matrix Lie groups, their elements are already matrices,
and one may wonder what representations are good for. In fact, it is often important
to know how a given group (e.g. the rotation group SO(3)) acts on a more general
linear space. In the example of SO(3), even though the “defining” action is on ℝ³ in
terms of 3 × 3 matrices, the group has many other matrix representations made of
objects that act on spaces other than ℝ³.
If the dimension of the Lie algebra is d, then AdΩ may be viewed as a d × d matrix.
We may also define the adjoint representation of the algebra g, as follows:
It is sufficient to know the adjoint representation of the generators ta , for which one
often uses the following notation
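\[ i\, \mathrm{ad}_{t^a} \equiv T^a_{\rm adj} . \tag{4.32} \]
Note that $T^a_{\rm adj}$ can be represented by a d × d matrix. Using Jacobi’s identity, one may
check that $[\mathrm{ad}_{t^a}, \mathrm{ad}_{t^b}] = -\mathrm{ad}_{i[t^a,t^b]} = f^{abc}\, \mathrm{ad}_{t^c}$. Therefore, the $T^a_{\rm adj}$’s fulfill the
same commutation relations as the $t^a$’s themselves:
\[ [T^a_{\rm adj}, T^b_{\rm adj}] = i\, f^{abc}\, T^c_{\rm adj} . \tag{4.33} \]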
Using eq. (4.14), we find that the components of these matrices are given by
\[ \left( T^a_{\rm adj} \right)_{bc} = -i\, f^{cab} . \tag{4.34} \]
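Eqs. (4.33) and (4.34) can be illustrated with su(2), whose structure constants are given by the Levi-Civita symbol (an example chosen here for concreteness). The following plain-Python sketch builds the three 3 × 3 adjoint matrices from eq. (4.34) and measures by how much they violate eq. (4.33); helper names such as `levi_civita` and `commutation_residual` are hypothetical.

```python
def levi_civita(a, b, c):
    """Totally antisymmetric symbol for indices in {0, 1, 2}."""
    return (a - b) * (b - c) * (c - a) // 2

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def max_abs(A):
    return max(abs(a) for row in A for a in row)

R3 = range(3)

# Structure constants of su(2): f^{abc} = epsilon^{abc}.
f = [[[levi_civita(a, b, c) for c in R3] for b in R3] for a in R3]

# Eq. (4.34): (T^a_adj)_{bc} = -i f^{cab}; T_adj[a][b][c] stores this component.
T_adj = [[[-1j * f[c][a][b] for c in R3] for b in R3] for a in R3]

def commutation_residual(a, b):
    """Largest entry of [T^a, T^b] - i f^{abc} T^c, which eq. (4.33) sets to zero."""
    lhs = add(matmul(T_adj[a], T_adj[b]), matmul(T_adj[b], T_adj[a]), -1.0)
    rhs = [[0j] * 3 for _ in R3]
    for c in R3:
        rhs = add(rhs, T_adj[c], 1j * f[a][b][c])
    return max_abs(add(lhs, rhs, -1.0))

max_residual = max(commutation_residual(a, b) for a in R3 for b in R3)

# The adjoint generators are Hermitean when f is real and totally antisymmetric.
max_non_hermitean = max(abs(T_adj[a][b][c] - T_adj[a][c][b].conjugate())
                        for a in R3 for b in R3 for c in R3)

print("largest violation of eq. (4.33):", max_residual)
print("largest deviation from Hermiticity:", max_non_hermitean)
```

The same construction applies to any algebra once its structure constants are tabulated; only the list `f` and the index range would change.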
Thus, we have
\[ \left[ e^{-iX}\; Y\; e^{iX} \right]_c = \left[ e^{i X_{\rm adj}} \right]_{cb} Y_b , \tag{4.37} \]
where the left hand side may be in any representation r. In other words, the right
and left multiplication by a group element and its inverse can be rewritten as a left
multiplication by the adjoint of this group element.
where Ω(x) is a spacetime dependent element of a Lie group G. Let us look for a
covariant derivative of the form
Dµ ≡ ∂µ − ig Aµ (x) , (4.39)
where g is a coupling constant similar to the constant e in QED and Aµ (x) a 4-vector
(in quantum field theory, this field is called a gauge field). The transformation law
(4.38) is satisfied provided that Aµ (x) transforms in a very specific way. Note first
that the ordinary derivative ∂µ is invariant (i.e. it belongs to the singlet representation).
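If we denote $A^\Omega_\mu(x)$ the transformed $A_\mu(x)$, then we must have:
\[
\begin{aligned}
\partial_\mu - i g\, A^\Omega_\mu(x) &= \Omega^{-1}(x) \left( \partial_\mu - i g\, A_\mu(x) \right) \Omega(x) \\
&= \partial_\mu + \Omega^{-1}(x)\, \partial_\mu \Omega(x) - i g\, \Omega^{-1}(x)\, A_\mu(x)\, \Omega(x) ,
\end{aligned}
\tag{4.40}
\]
\[ A_\mu(x) \;\to\; A^\Omega_\mu(x) \equiv \Omega^{-1}(x)\, A_\mu(x)\, \Omega(x) + \frac{i}{g}\, \Omega^{-1}(x)\, \partial_\mu \Omega(x) . \tag{4.41} \]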
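The transformation law (4.41) can be tested numerically in a simplified setting. The sketch below (plain Python) assumes a single coordinate x, the group SU(2) with generators $\sigma^a/2$, and arbitrarily chosen smooth functions for $\Omega(x)$, $A(x)$ and a two-component field $\psi(x)$; derivatives are central finite differences, so agreement is only expected up to discretization error. None of these choices come from the text.

```python
import cmath
import math

G = 1.3    # illustrative value of the coupling constant g
H = 1e-5   # step of the central finite differences

PAULI = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matvec(A, v):
    return [sum(a * c for a, c in zip(row, v)) for row in A]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(c, A):
    return [[c * a for a in row] for row in A]

def expm(X, terms=40):
    """Matrix exponential from its truncated Taylor series."""
    n = len(X)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = result
    for k in range(1, terms):
        term = scale(1.0 / k, matmul(term, X))
        result = add(result, term)
    return result

def algebra_element(coeffs):
    """Return sum_a coeffs[a] * sigma^a / 2, an element of su(2)."""
    total = [[0j, 0j], [0j, 0j]]
    for c, sigma in zip(coeffs, PAULI):
        total = add(total, sigma, 0.5 * c)
    return total

def omega(x, sign=1.0):
    """Omega(x) = exp(i theta_a(x) t^a); sign=-1 returns Omega^{-1}(x)."""
    theta = (math.sin(x), 0.3 * x * x, math.cos(2.0 * x))
    return expm(scale(sign * 1j, algebra_element(theta)))

def gauge_field(x):
    """An arbitrary smooth su(2)-valued potential A(x)."""
    return algebra_element((x, 1.0, math.sin(3.0 * x)))

def psi(x):
    """An arbitrary smooth two-component field."""
    return [cmath.exp(1j * x), complex(x * x, 0.5)]

def d_mat(func, x):
    return scale(1.0 / (2.0 * H), add(func(x + H), func(x - H), -1.0))

def d_vec(func, x):
    return [(u - d) / (2.0 * H) for u, d in zip(func(x + H), func(x - H))]

def transformed_gauge_field(x):
    """Eq. (4.41): Omega^{-1} A Omega + (i/g) Omega^{-1} dOmega/dx."""
    om, om_inv = omega(x), omega(x, -1.0)
    rotated = matmul(om_inv, matmul(gauge_field(x), om))
    inhomogeneous = matmul(om_inv, d_mat(omega, x))
    return add(rotated, inhomogeneous, 1j / G)

def transformed_psi(x):
    return matvec(omega(x, -1.0), psi(x))

def covariant_derivative(field, potential, x):
    """(d/dx - i g A(x)) acting on a two-component field."""
    a_field = matvec(potential(x), field(x))
    return [d - 1j * G * a for d, a in zip(d_vec(field, x), a_field)]

x0 = 0.7
expected = matvec(omega(x0, -1.0), covariant_derivative(psi, gauge_field, x0))
covariant = covariant_derivative(transformed_psi, transformed_gauge_field, x0)
untransformed = covariant_derivative(transformed_psi, gauge_field, x0)

mismatch = max(abs(a - b) for a, b in zip(covariant, expected))
mismatch_without_shift = max(abs(a - b) for a, b in zip(untransformed, expected))
print("with A transformed as in (4.41):", mismatch)
print("with A left unchanged:          ", mismatch_without_shift)
```

The first mismatch should be at the level of the finite-difference error, while the second is of order one: the derivative of $\Omega^{-1}\psi$ is covariant only if the gauge field is shifted by the inhomogeneous term of eq. (4.41).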
From eqs. (4.17), (4.18) and (4.19), we see that if Ω is an element of a Lie group
G, then Ω−1 ∂µ Ω belongs to the Lie algebra g. Thus, if the second term in the right
hand side of eq. (4.41) belongs to the representation r of the Lie algebra, the first term
should also be in this representation for consistency. The same applies to Aµ , that we
can decompose as follows:
\[ A_\mu(x) \equiv A^a_\mu(x)\, t^a_r , \tag{4.42} \]
where the $t^a_r$ are the generators of the algebra in the representation r.
7 From this transformation law, we see that field configurations of the form $i g^{-1}\, \Omega^{-1} \partial_\mu \Omega$ may be
transformed into the null field $A_\mu \equiv 0$. Such configurations are called pure gauge fields.
Infinitesimal transformations : Eq. (4.41) specifies how the field $A_\mu$ changes under
any transformation of G. However, it is sometimes useful to consider infinitesimal
transformations, i.e. Ω close to 1. This is done by writing $\Omega = \exp(i g\, \theta_a t^a_r)$, with
$|\theta_a| \ll 1$, and by expanding eq. (4.41) to order one in $\theta_a$. The variation of $A_\mu$ is
given by:
\[ \delta A_\mu = -\partial_\mu \theta_r(x) + i g\, [A_\mu(x), \theta_r(x)] = -[D_\mu, \theta_r(x)] , \tag{4.43} \]
where $D^{\rm adj}_\mu$ is the covariant derivative in the adjoint representation.
This generalizes the field strength Fµν to an arbitrary gauge group G. Note the
commutator between gauge fields, that did not exist in QED. By construction, the
field strength is an element of the algebra, in the same representation as $A_\mu$,
\[ F_{\mu\nu}(x) \equiv F^a_{\mu\nu}(x)\, t^a_r , \tag{4.47} \]
4.2.3 Lagrangian
In order to build a kinetic term for Aµ from Fµν , we must contract all the Lorentz
indices to have a Lorentz invariant Lagrangian. This forces us to have at least two
F’s, since gµν Fµν = 0. Therefore, if we restrict to objects of mass dimension 4,
this kinetic term should be quadratic in F, with a dimensionless prefactor. The most
general9 term of this kind is
\[ \mathcal{L}_A \equiv -h_{ab}\, F^{a\,\mu\nu}\, F^b_{\mu\nu} , \tag{4.51} \]
where hab is a constant real symmetric matrix in the group indices. In addition, for
this Lagrangian to define a consistent field theory, the matrix hab should be positive
definite (otherwise some parts of the kinetic term would have the wrong sign and
the energy of the system would not be bounded from below). Under an infinitesimal
gauge transformation, the variation of this Lagrangian is
\[ \delta \mathcal{L}_A = -2\, h_{ab}\, F^{a\,\mu\nu}\, \theta_c\, f^{dcb}\, F^d_{\mu\nu} , \tag{4.52} \]
and for the kinetic term to be gauge invariant we must have
\[ h_{ab}\, f^{dcb}\, \underbrace{F^{a\,\mu\nu}\, F^d_{\mu\nu}}_{\text{sym. in } a,d} = 0 . \tag{4.53} \]
This condition is satisfied for any gauge field configuration provided that
fdcb hba + facb hbd = 0 . (4.54)
Note that $h_{ab} \equiv \mathrm{tr}\,(t^a t^b)$ is a solution of this constraint, since $\mathrm{tr}\,(F_{\mu\nu} F^{\mu\nu})$ is
obviously gauge invariant given the transformation law (4.48) for the field strength,
but the positivity condition imposes some restrictions on the kind of Lie algebra we
may use.
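The constraint (4.54) is easy to test on an example. The sketch below (plain Python) uses su(2) with $t^a = \sigma^a/2$ as an illustrative algebra, computes the structure constants from traces, and evaluates the left hand side of eq. (4.54) both for $h_{ab} = \mathrm{tr}(t^a t^b)$ and for an arbitrarily chosen diagonal matrix that is not proportional to the identity. The helper names and the second matrix are assumptions made for the example.

```python
PAULI = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]
R3 = range(3)

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def comm(A, B):
    return add(matmul(A, B), matmul(B, A), -1.0)

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

# Fundamental generators of su(2).
t = [[[0.5 * x for x in row] for row in sigma] for sigma in PAULI]

# f^{abc} = -2i tr([t^a, t^b] t^c), from eq. (4.14) and tr(t^a t^b) = delta^{ab}/2.
f = [[[(-2j * trace(matmul(comm(t[a], t[b]), t[c]))).real for c in R3]
      for b in R3] for a in R3]

def constraint_violation(h):
    """Largest |f^{dcb} h_{ba} + f^{acb} h_{bd}| over the free indices a, c, d."""
    worst = 0.0
    for a in R3:
        for c in R3:
            for d in R3:
                value = sum(f[d][c][b] * h[b][a] + f[a][c][b] * h[b][d]
                            for b in R3)
                worst = max(worst, abs(value))
    return worst

# Candidate 1: h_ab = tr(t^a t^b), which equals delta_ab / 2 for su(2).
h_trace = [[complex(trace(matmul(t[a], t[b]))).real for b in R3] for a in R3]
# Candidate 2: a positive definite matrix that treats the generators unequally.
h_unequal = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]

violation_trace = constraint_violation(h_trace)
violation_unequal = constraint_violation(h_unequal)
print("h = tr(t^a t^b):  ", violation_trace)
print("h = diag(1, 2, 3):", violation_unequal)
```

The first candidate satisfies the constraint, while the second does not, which anticipates the result that within a simple sub-algebra h must be proportional to the identity.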
8 The field strength associated to a pure gauge field is zero, since there exists a transformation Ω for
where Oac is a real orthogonal matrix. Since the matrix hab is positive definite, all
the eigenvalues λc are positive, and we can define a square root of the matrix by
Note that $\Omega_{ab}$ is a real symmetric matrix. Now, let us introduce a new basis for the
algebra, defined by
\[ \tilde{t}^a \equiv \Omega^{-1}_{ab}\, t^b , \qquad t^a = \Omega_{ab}\, \tilde{t}^b . \tag{4.57} \]
This is a legitimate change of basis for a real algebra since the matrix Ω is real and
has no vanishing eigenvalue (all the eigenvalues λc are strictly positive since hab is
positive definite). The commutator of two of these new generators is
\[ [\tilde{t}^a, \tilde{t}^b] = i\, \underbrace{\Omega^{-1}_{aa'}\, \Omega^{-1}_{bb'}\, f^{a'b'c'}\, \Omega_{c'c}}_{\tilde{f}^{abc}}\; \tilde{t}^c . \tag{4.58} \]
By rewriting eq. (4.54) in terms of the new structure constants $\tilde{f}^{abc}$ and by using the
fact that Ω is invertible, we get
In other words, eq. (4.54) implies that there exists a basis in which the structure
constants are also antisymmetric under the exchange of the first and third indices.
From this, we conclude that they are in fact completely antisymmetric10 (and not just
in the first two indices, as implied by their definition in terms of the Lie bracket).
to construct an antisymmetric rank-3 tensor with indices that take less than 3 values.
Since the structure constants are real and antisymmetric, these generators are Her-
mitean matrices. Thanks to this property, there exists a basis in which all the adjoint
generators have a common block diagonal structure:
\[
T^a_{\rm adj} =
\begin{pmatrix}
D^a_{(1)} & 0 & 0 & \cdots \\
0 & D^a_{(2)} & 0 & \cdots \\
0 & 0 & D^a_{(3)} & \cdots \\
\vdots & \vdots & \vdots & \ddots
\end{pmatrix} ,
\tag{4.61}
\]
where the sizes of the blocks are the same for all the $T^a_{\rm adj}$’s. This block decomposition
can be obtained recursively, until one gets blocks that are not further reducible. If d is
the dimension of the adjoint representation (i.e. also the dimension of the Lie group),
it corresponds to a decomposition of $\mathbb{R}^d$ into orthogonal subspaces that are invariant
under the action of all the generators. Regarding the Lie algebra, this indicates that
it is a direct sum of simple sub-algebras11, and u(1) sub-algebras (if some diagonal
blocks $D^a_{(n)}$ are zero for all a’s). In addition, these simple sub-algebras are compact
because $K_{ab} \equiv \mathrm{tr}\,(T^a_{\rm adj} T^b_{\rm adj})$, restricted to the corresponding subspace, is positive
definite12. Indeed, when the structure constants are totally antisymmetric, we have
$K_{ab} X_a X_b = \sum_{c,d} (X_a f^{acd})^2 \ge 0$ for any vector X. Moreover, there is no non-zero
vector X for which this quadratic form is zero, because otherwise we would have
$T^c_{\rm adj} X = 0$ for all c, which means that this vector X would define a u(1) sub-algebra
and cannot be part of the subspace associated with a simple sub-algebra.
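The statement about $K_{ab}$ can be made concrete with the algebra su(2) ⊕ u(1), chosen here as the simplest example that contains both a simple factor and an Abelian one. In the plain-Python sketch below, the fourth generator is the u(1) one and all structure constants involving it are set to zero; the names `structure_constant` and `trace_of_product` are hypothetical.

```python
def levi_civita(a, b, c):
    """Totally antisymmetric symbol for indices in {0, 1, 2}."""
    return (a - b) * (b - c) * (c - a) // 2

DIM = 4  # three su(2) generators (indices 0, 1, 2) plus one u(1) generator (index 3)
RD = range(DIM)

def structure_constant(a, b, c):
    """f^{abc} of su(2) + u(1): Levi-Civita on the su(2) indices, zero otherwise."""
    if 3 in (a, b, c):
        return 0
    return levi_civita(a, b, c)

# Adjoint generators from eq. (4.34): (T^a_adj)_{bc} = -i f^{cab}.
T_adj = [[[-1j * structure_constant(c, a, b) for c in RD] for b in RD] for a in RD]

def trace_of_product(A, B):
    return sum(A[i][k] * B[k][i] for i in RD for k in RD)

# K_ab = tr(T^a_adj T^b_adj).
K = [[trace_of_product(T_adj[a], T_adj[b]).real for b in RD] for a in RD]

def quadratic_form(X):
    """K_ab X_a X_b for a real vector X."""
    return sum(K[a][b] * X[a] * X[b] for a in RD for b in RD)

in_su2 = quadratic_form([1.0, -2.0, 0.5, 0.0])   # direction inside the su(2) block
in_u1 = quadratic_form([0.0, 0.0, 0.0, 1.0])     # the u(1) direction

for row in K:
    print(row)
print("quadratic form on an su(2) direction:", in_su2)
print("quadratic form on the u(1) direction:", in_u1)
```

The matrix K comes out diagonal, with a positive entry for each su(2) generator and a zero for the u(1) generator, so the quadratic form is positive definite only when restricted to the simple factor.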
Standard form of the Lagrangian : Note now that the constraint (4.54) can also
be written as
\[ [T^c_{\rm adj}, h] = 0 , \quad \text{for all } c . \tag{4.62} \]
This implies that hab has the same block decomposition as the adjoint generators (see
eq. (4.61)), with diagonal blocks that are proportional to the identity (with positive
11 A Lie algebra is not simple if there exists a set of generators $T^\alpha$ (the number of which is strictly
smaller than the dimension of the algebra) which is closed under commutation with the algebra, i.e.
$[T^a_{\rm adj}, T^\alpha] = g^{a\alpha\beta}\, T^\beta$ for all a, α. If we write these new generators as linear combinations $T^\alpha \equiv V^\alpha_a\, T^a_{\rm adj}$,
the closure of the sub-algebra under commutation implies that the set of vectors $\{V^\alpha\}$ is the basis of a
subspace invariant under the action of all the $T^a_{\rm adj}$’s. Conversely, if we have an invariant subspace that
prefactors)
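\[
h =
\begin{pmatrix}
\alpha^2_{(1)}\, \mathbf{1} & 0 & 0 & \cdots \\
0 & \alpha^2_{(2)}\, \mathbf{1} & 0 & \cdots \\
0 & 0 & \alpha^2_{(3)}\, \mathbf{1} & \cdots \\
\vdots & \vdots & \vdots & \ddots
\end{pmatrix}
\tag{4.63}
\]
The prefactors $\alpha^2_{(i)}$ can be absorbed into the normalization of the gauge field and the
coupling constant of the corresponding sub-algebra, by writing
\[ \alpha_{(i)}\, F_{\mu\nu} = \partial_\mu A'_\nu - \partial_\nu A'_\mu - i g'\, [A'_\mu, A'_\nu] \tag{4.64} \]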
4.3.1 Fermions
However, useful gauge theories in particle physics must also have matter fields, i.e.
fermions. Under the action of a Lie group G, a fermion field transforms as
\[ \psi(x) \to \Omega^{-1}(x)\, \psi(x) , \qquad \overline{\psi}(x) \to \overline{\psi}(x)\, \Omega^{-1\dagger}(x) , \tag{4.67} \]
For this Lagrangian to be gauge invariant, Ω must be a unitary matrix, which restricts
to unitary representations of the gauge group (all finite dimensional representations
of compact Lie groups are equivalent to a unitary representation).
Like in electrodynamics, the necessity of using a covariant derivative in order
to have a Dirac Lagrangian invariant under local gauge transformations completely
specifies the coupling between the fermions and the gauge field Aµ :
\[ \mathcal{L}_I = -i g\; \overline{\psi}_i\, \gamma^\mu A^a_\mu \left( t^a_r \right)_{ij} \psi_j , \tag{4.70} \]
where we have written explicitly the Lie algebra indices i, j of all the objects. These
indices, that run from 1 to the size of the representation r, label the “charge” carried
by the fermions, while the index a may be viewed as the charge carried by the spin-1
particle associated to the vector field Aaµ (this index runs from 1 to the dimension of
the group).
and therefore the following Lagrangian density is invariant under local gauge trans-
formations:
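\[ \mathcal{L}_{\rm scalar} = \left( D_\mu \phi(x) \right)^\dagger \left( D^\mu \phi(x) \right) - m^2\, \phi^\dagger(x) \phi(x) - V\!\left( \phi^\dagger(x) \phi(x) \right) . \tag{4.72} \]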
(The potential should depend on the scalar field via the combination φ† φ in order to
be gauge invariant). The most important example of such a scalar in particle physics is
the Higgs boson. In the Standard Model, the potential of the Higgs field is symmetric
under the gauge transformations, but has minima at non-zero value of the field φ,
leading to spontaneous symmetry breaking. Because of its coupling to the gauge
potentials and to the fermions, the Higgs field vacuum expectation value turns them
into massive particles (see the next section for a discussion of this phenomenon).
From the Lagrangians (4.66), (4.68) and (4.72), it is straightforward to obtain the
classical Euler-Lagrange equations of motion. For the fermions, we simply obtain the
Dirac equation
\[ \left( i \slashed{D} - m \right) \psi(x) = 0 . \tag{4.73} \]
For scalar fields, the classical equation of motion is a deformation of the Klein-Gordon
equation, in which the ordinary derivatives are replaced by covariant derivatives:
\[ \left[ D_\mu D^\mu + m^2 + V'\!\left( \phi^\dagger(x) \phi(x) \right) \right] \phi(x) = 0 . \tag{4.74} \]
For the gauge field Aµ , the derivatives of the various pieces of the Lagrangian read:
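\[
\begin{aligned}
\partial_\mu \frac{\partial \mathcal{L}_A}{\partial(\partial_\mu A^a_\nu)} &= -\partial_\mu F_a^{\mu\nu} , \\
\frac{\partial \mathcal{L}_A}{\partial A^a_\nu} &= g\, f^{abc}\, A^b_\mu\, F_c^{\mu\nu} , \\
\frac{\partial \mathcal{L}_D}{\partial A^a_\nu} &= g\, \overline{\psi}\, \gamma^\nu t^a\, \psi , \\
\frac{\partial \mathcal{L}_{\rm scalar}}{\partial A^a_\nu} &= i g \left[ \phi^\dagger\, t^a \left( D^\nu \phi \right) - \left( D^\nu \phi \right)^\dagger t^a\, \phi \right] .
\end{aligned}
\tag{4.75}
\]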
known as the Yang-Mills equation. From the Dirac and Klein-Gordon equations, one
may check that the colour current $J^\nu_a$ is covariantly conserved:
\[ [D_\nu, J^\nu] = 0 . \tag{4.77} \]
The field strength also obeys another equation, known as the Bianchi identity,
\[ [D_\mu, F_{\nu\rho}] + [D_\nu, F_{\rho\mu}] + [D_\rho, F_{\mu\nu}] = 0 , \tag{4.78} \]
that follows from the Jacobi identity between covariant derivatives.
Feynman graphs relevant for the Standard Model involve manipulations of the su(N)
generators (for N = 2, 3), mostly in the fundamental representation (since all matter
fields are in this representation). In this section, we derive some useful formulas that
help in these calculations.
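Fierz identity : In the case of su(N), there are $N^2 - 1$ generators $t^a_f$, while the
linear space of all N × N Hermitean matrices has a dimension $N^2$. A basis of the
latter can be obtained by adding the identity matrix to the $t^a_f$’s. Thus, any N × N
Hermitean matrix M can be written as
\[ M = m_0\, \mathbf{1} + m_a\, t^a_f . \tag{4.79} \]
Since the $t^a_f$’s are traceless, we have
\[ m_0 = \frac{1}{N}\, \mathrm{tr}\,(M) , \qquad m_a = 2\, \mathrm{tr}\,(M\, t^a_f) . \tag{4.80} \]
In components, this decomposition reads
\[
\begin{aligned}
M_{ij} &= \frac{1}{N}\, M_{kk}\, \delta_{ij} + 2\, M_{lk} \left( t^a_f \right)_{kl} \left( t^a_f \right)_{ij} \\
&= M_{lk} \left[ \frac{1}{N}\, \delta_{kl}\, \delta_{ij} + 2 \left( t^a_f \right)_{kl} \left( t^a_f \right)_{ij} \right] .
\end{aligned}
\tag{4.81}
\]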
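\[ \left( t^a_f \right)_{ij} \left( t^a_f \right)_{kl} = [\text{diagram}] = \frac{1}{2}\, [\text{diagram: } \delta_{il}\, \delta_{jk}] - \frac{1}{2N}\, [\text{diagram: } \delta_{ij}\, \delta_{kl}] , \tag{4.84} \]
in which the solid blobs represent the $t^a_f$ matrices, and the wavy line indicates that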
the indices a are contracted. In the right hand side, the solid lines indicate how the
indices ijkl are connected by the delta symbols. By contracting the indices jk in the
Fierz identity (4.83), we obtain:
\[ \left( t^a_f\, t^a_f \right)_{il} = \frac{N^2 - 1}{2N}\, \delta_{il} . \tag{4.85} \]
The quadratic combination $t^a_f t^a_f$, called the fundamental Casimir operator, is proportional
to the identity (and therefore commutes with everything). The prefactor is
sometimes denoted $C_f \equiv (N^2 - 1)/(2N)$.
The diagrammatic representation (4.84) provides a very convenient way of obtain-
ing certain identities involving the generators of the fundamental representation. As
an illustration, let us consider the following example:
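\[
\begin{aligned}
t^a_f\, t^b_f\, t^a_f &= [\text{diagram}] = \frac{1}{2}\, [\text{diagram}] - \frac{1}{2N}\, [\text{diagram}] \\
&= \frac{1}{2}\, \mathrm{tr}\,(t^b_f)\, \mathbf{1} - \frac{1}{2N}\, t^b_f = -\frac{1}{2N}\, t^b_f .
\end{aligned}
\tag{4.86}
\]
For the first term, we have used the fact that a closed loop in this diagrammatic
representation corresponds to a trace over the colour indices, and the tracelessness of the
generators. Likewise, one would obtain
\[ t^a_f\, t^b_f\, t^c_f\, t^a_f\, t^b_f = [\text{diagram}] = \frac{1}{4} \left( 1 + \frac{1}{N^2} \right) t^c_f . \tag{4.87} \]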
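The identities (4.84) to (4.87) lend themselves to a direct numerical check. The plain-Python sketch below assumes one particular basis of su(N) generators (generalized Gell-Mann matrices divided by 2, normalized so that $\mathrm{tr}(t^a t^b) = \delta^{ab}/2$) and takes N = 3; the function names are hypothetical, and the algebraic form of the Fierz identity used in the check is the one implied by eq. (4.81).

```python
import math

N = 3  # number of colours (illustrative; the construction works for any N >= 2)
RN = range(N)

def su_n_generators(n):
    """Generalized Gell-Mann matrices over 2, with tr(t^a t^b) = delta^{ab} / 2."""
    gens = []
    for j in range(n):
        for k in range(j + 1, n):
            sym = [[0j] * n for _ in range(n)]
            sym[j][k] = sym[k][j] = 0.5
            asym = [[0j] * n for _ in range(n)]
            asym[j][k], asym[k][j] = -0.5j, 0.5j
            gens += [sym, asym]
    for l in range(1, n):
        norm = 1.0 / math.sqrt(2.0 * l * (l + 1))
        diag = [[0j] * n for _ in range(n)]
        for i in range(l):
            diag[i][i] = norm
        diag[l][l] = -l * norm
        gens.append(diag)
    return gens

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in RN) for j in RN] for i in RN]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def max_abs(A):
    return max(abs(a) for row in A for a in row)

def delta(i, j):
    return 1.0 if i == j else 0.0

t = su_n_generators(N)
ONE = [[delta(i, j) for j in RN] for i in RN]

# Fierz identity: (t^a)_{ij} (t^a)_{kl} = delta_il delta_jk / 2 - delta_ij delta_kl / (2N).
fierz_error = max(
    abs(sum(g[i][j] * g[k][l] for g in t)
        - (0.5 * delta(i, l) * delta(j, k) - delta(i, j) * delta(k, l) / (2.0 * N)))
    for i in RN for j in RN for k in RN for l in RN)

def sandwich(M):
    """Return sum_a t^a M t^a."""
    total = [[0j] * N for _ in RN]
    for g in t:
        total = add(total, matmul(g, matmul(M, g)))
    return total

# Eq. (4.85): t^a t^a = C_f 1 with C_f = (N^2 - 1) / (2N).
C_F = (N * N - 1) / (2.0 * N)
casimir_error = max_abs(add(sandwich(ONE), ONE, -C_F))

# Eq. (4.86): t^a t^b t^a = -t^b / (2N).
error_486 = max(max_abs(add(sandwich(tb), tb, 1.0 / (2.0 * N))) for tb in t)

# Eq. (4.87): t^a t^b t^c t^a t^b = (1 + 1/N^2) t^c / 4.
def double_sandwich(tc):
    total = [[0j] * N for _ in RN]
    for tb in t:
        total = add(total, matmul(sandwich(matmul(tb, tc)), tb))
    return total

coeff_487 = 0.25 * (1.0 + 1.0 / (N * N))
error_487 = max(max_abs(add(double_sandwich(tc), tc, -coeff_487)) for tc in t)

print("number of generators:", len(t))
print("Fierz:", fierz_error, " Casimir:", casimir_error)
print("(4.86):", error_486, " (4.87):", error_487)
```

All four residuals should vanish up to rounding errors. Changing `N` at the top repeats the check for another su(N), since the identities do not depend on the basis as long as the normalization of the generators is kept.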
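\[
\begin{aligned}
\psi_R &\equiv \frac{1+\gamma^5}{2}\, \psi , & \psi_L &\equiv \frac{1-\gamma^5}{2}\, \psi , \\
\overline{\psi}_R &= \psi^\dagger\, \frac{1+\gamma^5}{2}\, \gamma^0 , & \overline{\psi}_L &= \psi^\dagger\, \frac{1-\gamma^5}{2}\, \gamma^0 .
\end{aligned}
\tag{4.92}
\]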
Consider first the term of the Dirac Lagrangian that does not depend on the mass. It
can be decomposed as follows in terms of the left and right handed spinors:
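\[
\begin{aligned}
\overline{\psi}\, \slashed{D}\, \psi &= \sum_{\epsilon,\epsilon'=\pm} \psi^\dagger\, \frac{1+\epsilon\gamma^5}{2}\, \gamma^0\, \slashed{D}\, \frac{1+\epsilon'\gamma^5}{2}\, \psi \\
&= \sum_{\epsilon=\epsilon'=\pm} \psi^\dagger\, \frac{1+\epsilon\gamma^5}{2}\, \gamma^0\, \slashed{D}\, \psi = \overline{\psi}_R\, \slashed{D}\, \psi_R + \overline{\psi}_L\, \slashed{D}\, \psi_L .
\end{aligned}
\tag{4.93}
\]
Therefore, this term does not mix the left and right spinors, and is invariant under
independent gauge transformations of the two spinor helicities. In particular, it is
perfectly possible that they belong to different representations of the Lie algebra.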
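\[
\begin{aligned}
m\, \overline{\psi} \psi &= m \sum_{\epsilon,\epsilon'=\pm} \psi^\dagger\, \frac{1+\epsilon\gamma^5}{2}\, \gamma^0\, \frac{1+\epsilon'\gamma^5}{2}\, \psi \\
&= m \sum_{\epsilon=-\epsilon'=\pm} \psi^\dagger\, \frac{1+\epsilon\gamma^5}{2}\, \gamma^0\, \psi = m\, \overline{\psi}_R\, \psi_L + m\, \overline{\psi}_L\, \psi_R .
\end{aligned}
\tag{4.94}
\]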
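The chirality structure of the kinetic and mass terms can be verified with explicit matrices. The sketch below is plain Python and assumes the Dirac representation of the gamma matrices with $\gamma^5 = i\gamma^0\gamma^1\gamma^2\gamma^3$, which is one common convention and may differ from the one used elsewhere in the text; the conclusions do not depend on the representation. Helper names such as `kron` and `sandwich` are hypothetical.

```python
PAULI = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(A, B, s=1.0):
    """Return A + s * B."""
    return [[a + s * b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def scale(c, A):
    return [[c * a for a in row] for row in A]

def max_abs(A):
    return max(abs(a) for row in A for a in row)

def kron(A, B):
    """Kronecker product of two square matrices."""
    return [[a * b for a in row_a for b in row_b] for row_a in A for row_b in B]

I2 = [[1, 0], [0, 1]]
I4 = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]

# Dirac representation: gamma^0 = diag(1, 1, -1, -1), gamma^i = [[0, s_i], [-s_i, 0]].
gammas = [kron(PAULI[2], I2)] + [kron([[0, 1], [-1, 0]], s) for s in PAULI]
gamma0 = gammas[0]
gamma5 = scale(1j, matmul(gammas[0], matmul(gammas[1], matmul(gammas[2], gammas[3]))))

P_R = scale(0.5, add(I4, gamma5))         # (1 + gamma^5) / 2
P_L = scale(0.5, add(I4, gamma5, -1.0))   # (1 - gamma^5) / 2

def sandwich(P_left, middle, P_right):
    """Matrix sitting between psi^dagger and psi in a chiral bilinear."""
    return matmul(P_left, matmul(middle, P_right))

# Kinetic term: psi^dagger P gamma^0 gamma^mu P' psi.
kinetic_mixing = max(max_abs(sandwich(P_R, matmul(gamma0, g), P_L)) for g in gammas)
kinetic_same = max(max_abs(sandwich(P_R, matmul(gamma0, g), P_R)) for g in gammas)

# Mass term: psi^dagger P gamma^0 P' psi.
mass_same = max_abs(sandwich(P_R, gamma0, P_R))
mass_mixing = max_abs(sandwich(P_R, gamma0, P_L))

print("kinetic term, R with L:", kinetic_mixing, "  R with R:", kinetic_same)
print("mass term,    R with R:", mass_same, "  R with L:", mass_mixing)
```

The kinetic bilinear vanishes between opposite chiralities and the mass bilinear vanishes between identical ones, which is the content of eqs. (4.93) and (4.94).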
Let us focus on the case where the right handed fermions are singlet under the gauge
group, while the left handed ones belong to a non trivial representation. This means
that they transform as follows
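\[
\begin{aligned}
\psi_R &\to \psi_R , & \overline{\psi}_R &\to \overline{\psi}_R , \\
\psi_L &\to \Omega^{-1}\, \psi_L , & \overline{\psi}_L &\to \overline{\psi}_L\, \Omega .
\end{aligned}
\tag{4.95}
\]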
Thus, one way to construct an operator that is bilinear in the fermions, mixes the
left and right components, and does not contain derivatives is to introduce a scalar
field Φ that transforms in the same way as ψL ,
Φ → Ω−1 Φ . (4.96)
where λ is a coupling constant, r is a Dirac index, and i is the index that labels the
components of the Lie algebra representation to which ψL and Φ both belong. From
the way the indices are contracted, this term is both gauge and Lorentz invariant.
Note that since the contraction of the Dirac indices between the two spinors already
produces a Lorentz invariant object, the field Φ must be Lorentz invariant on its own,
and thus must be a scalar.
At this point, the term of eq. (4.97) is not yet a mass term, but simply a tri-linear
interaction term between fermions and the newly introduced scalar field. However, a
mass term is generated if the vacuum expectation value Φi is non-zero (as we shall
see, this is related to the spontaneous breaking of the gauge symmetry). Therefore,
we may redefine the scalar field by writing (for the sake of this example, we choose
this expectation value to point in the direction i = 1)
\[ \Phi \equiv \Phi_v + \varphi , \qquad \Phi_v \equiv \begin{pmatrix} v \\ 0 \\ \vdots \\ 0 \end{pmatrix} , \tag{4.98} \]
If the fermion in the right handed singlet matches the first component of the left
handed multiplet, then the first term in the right hand side is a Dirac mass term for
this fermion (the fermions corresponding to the other components of the multiplet
remain massless13 ). The second term in the right hand side is a genuine interaction
term between the fermions and the fluctuating part of the scalar. Interestingly, with
13 In the Standard Model, where the left handed fermions belong to the fundamental representation
of SU(2), it is possible to give a mass to the second component of the doublet. Indeed, by noting
that Ωt2 ΩT = t2 for any matrix Ω in the fundamental representation of SU(2), we see that the term
iλ ψL t2 Φ∗ ψR is gauge invariant and gives a mass λv to the second component of the left handed
doublet when the vacuum expectation value of the scalar field has a non-zero first component.
this mechanism, the strength of this interaction is proportional to the mass of the
fermion. In a theory where several fermions acquire their masses by coupling to the
expectation value of the same scalar field, this leads to a definite prediction: the ratios
of the couplings must equal the ratios of the masses (but the masses themselves are
not predicted, since the Yukawa couplings λ are free parameters).
Family mixing : When there are several families of fermions (that we label by an
extra index f in this paragraph), the Yukawa term of eq. (4.97) can be generalized into
\[ \lambda_{ff'}\; \overline{\psi}_{L\, f r i}\; \Phi_i\; \psi_{R\, f' r} , \tag{4.100} \]
without spoiling Lorentz or gauge invariance. Thus, with a non-zero vacuum expecta-
tion value of the scalar field, we get a fermion mass matrix which is in general not
diagonal in the fermion families. Note that here, we are implicitly choosing a basis
of fermion fields in which the couplings to the gauge bosons are diagonal, i.e. for
which the vertex with one gauge boson and two fermions does not mix the fermion
families. Conversely, we could choose a basis of fermion fields in which the mass
matrix is diagonal. In this alternate basis, the interactions with the gauge bosons
are no longer diagonal, i.e. the coupling to a gauge boson may change the type of
fermion. These non-diagonal interactions are described by a matrix known as the
Cabibbo-Kobayashi-Maskawa (CKM) matrix in the quark sector.
Let us denote Φv the ground state on which the system settles. The gauge group
G contains a subgroup H that leaves Φv invariant, called the stabilizer of Φv , and the
set of the minima of the potential can be identified with the coset space14 G/H. Then,
14 Given a group G and H one of its subgroups, two elements Ω and Ω ′ are said to be H-equivalent
if Ω−1 Ω ′ ∈ H. The quotient of the group G by this equivalence relationship, also called coset space
and denoted G/H, is the set of the resulting equivalence classes. When H is a normal subgroup (i.e.
ΩHΩ−1 = H for all Ω ∈ G), the coset space is itself a group.
the generators ta of the Lie algebra g can be divided in two sets: a basis of h (for
a > n), and a complementary set (for 1 ≤ a ≤ n):
$$ 1 \le a \le n:\quad t^a_{ij}\,\Phi_{vj} \neq 0\,,\qquad\qquad a > n:\quad t^a_{ij}\,\Phi_{vj} = 0\,. \tag{4.102} $$
In the cases of interest in quantum field theory, the Lie algebra g is a direct sum
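$$ \mathfrak{g} = \mathfrak{h} \oplus \mathfrak{m}\,,\quad \text{with}\quad [\mathfrak{h}, \mathfrak{m}] \subset \mathfrak{m}\,. \tag{4.103} $$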
In this representation, r(x) denotes the “radial” field variables, while the ϑa (x) are
the “angular” ones. From eqs. (4.102), the latter correspond to the generators broken
by the spontaneous symmetry breaking. Therefore, if it were not for the coupling
of Φ to the gauge fields through the covariant derivatives, we would conclude from
Goldstone’s theorem that the modes r(x) are massive while the modes ϑa (x) are
the massless Nambu-Goldstone bosons. However, this conclusion is altered by the
minimal coupling to a gauge field because it is possible to absorb the matrix Ω−1 (x),
that contains the would-be Nambu-Goldstone modes, into a gauge transformation of
that field. Indeed, we may write
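$$ D_\mu \Phi = D_\mu\, \Omega^{-1} (\Phi_v + r) = \Omega^{-1}\, \underbrace{\Omega\, D_\mu\, \Omega^{-1}}_{D'_\mu}\, (\Phi_v + r)\,, \tag{4.106} $$
with
$$ D'_\mu \equiv \partial_\mu - i g A'_\mu\,,\qquad A'_\mu \equiv \Omega A_\mu \Omega^{-1} + \frac{i}{g}\, \Omega\, \partial_\mu \Omega^{-1}\,. \tag{4.107} $$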
We see that after this gauge transformation of Aµ (this choice of gauge is known as
the unitary gauge), only the modes r(x) can still be considered as physical dynamical
modes of the scalar field, and the kinetic term of the scalar Lagrangian can thus be
rewritten as
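$$ \begin{aligned}
(D_\mu \Phi)^\dagger (D^\mu \Phi) &= \big(D'_\mu (\Phi_v + r)\big)^\dagger \big(D'^\mu (\Phi_v + r)\big) \\
&= (D'_\mu r)^\dagger (D'^\mu r) \\
&\quad + (-i g A'_\mu \Phi_v)^\dagger (D'^\mu r) + (D'_\mu r)^\dagger (-i g A'^\mu \Phi_v) \\
&\quad + \underbrace{(-i g A'_\mu \Phi_v)^\dagger (-i g A'^\mu \Phi_v)}_{\frac{1}{2} M_{ab}\, A'^a_\mu A'^{b\mu}}\,.
\end{aligned} \tag{4.108} $$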
In this expression, the last term is particularly interesting, since it provides a mass for
some of the gauge bosons. More explicitly, the mass matrix is given by
$$ M_{ab} \equiv 2\, g^2\, \Phi^\dagger_{vi}\, t^a_{ik}\, t^b_{kj}\, \Phi_{vj}\,. \tag{4.109} $$
This mass matrix is positive and has a number of flat directions equal to the number
of generators ta that annihilate Φv . From this, we conclude that the gauge bosons
that become massive via this mechanism are those that couple to the generators of the
broken symmetries.
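As a concrete check of eq. (4.109), the mass matrix can be evaluated numerically in a small example. The sketch below assumes a gauge group SU(2) × U(1) acting on a scalar doublet, with the same coupling `g` for both factors and a vacuum expectation value in the second component; these choices are illustrative assumptions, not the setup of the text. Only the part of M_ab that is symmetric in (a, b) contributes to the quadratic form in the gauge fields, so the real part is taken.

```python
import numpy as np

g, v = 1.0, 1.0  # illustrative values of the coupling and of the vev

sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
# Three SU(2) generators plus one U(1) generator (assumed normalization).
generators = [s / 2 for s in sigma] + [np.eye(2, dtype=complex) / 2]

phi_v = np.array([0.0, v], dtype=complex)  # assumed ground state

# M_ab = 2 g^2 Phi_v^dagger t^a t^b Phi_v, symmetrized in (a, b).
t_phi = [t @ phi_v for t in generators]
mass_matrix = np.array(
    [[2 * g**2 * np.vdot(ta, tb).real for tb in t_phi] for ta in t_phi]
)

eigenvalues = np.linalg.eigvalsh(mass_matrix)  # sorted in ascending order

# The combination t^3 + Y annihilates phi_v: it generates the stabilizer H.
unbroken = generators[2] + generators[3]
```

With these choices, the matrix has exactly one vanishing eigenvalue, matching the single generator that leaves the ground state invariant, while the three remaining gauge bosons acquire a mass.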
In the construction of the Lagrangian of Yang-Mills theory, we have argued that the
only dimension four gauge invariant local operator is an operator quadratic in the
field strength $F^a_{\mu\nu}$. All the Lorentz indices should be contracted in order to obtain
a Lorentz invariant Lagrangian density. An obvious possibility is $F^a_{\mu\nu} F^{a\,\mu\nu}$, which is
the combination that appears in the Yang-Mills action. However, there exists another
Lorentz invariant contraction, obtained by introducing the Levi-Civita tensor,
$$ \mathcal{L}_\theta \equiv \frac{g^2\theta}{32\pi^2}\, \epsilon^{\mu\nu\rho\sigma}\, {\rm tr}\,(F_{\mu\nu} F_{\rho\sigma})\,. \tag{4.111} $$
The prefactor 1/32π2 will prove convenient later, and the coupling constant in front
of this term is usually denoted θ. Consequently, this term is referred to as the θ-term.
Firstly, we should clarify why we have not considered this term right away when we
listed the possible gauge invariant operators that may enter in a non-Abelian gauge
theory. As we shall prove now, the θ-term is a total derivative. Therefore, it does not
enter in the field equations of motion, and has also no influence on perturbation theory.
Since our discussion has been so far centered on the perturbative expansion, this
term was irrelevant. However, the θ-term –that we cannot exclude on the grounds of
symmetries– may lead to non-perturbative effects that we shall discuss in this section.
$$ K^\mu \equiv \epsilon^{\mu\nu\rho\sigma} \Big[ A^a_\nu F^a_{\rho\sigma} - \frac{g}{3}\, f^{abc} A^a_\nu A^b_\rho A^c_\sigma \Big]\,. \tag{4.112} $$
The two terms of the third line are antisymmetric under the exchange (µν) ↔ (ρσ),
while the prefactor ǫµνρσ is symmetric under this exchange. These terms are therefore
zero after summing over the indices νρσµ. Then, the term on the second line can be
written as follows:
Each term is a trace of four factors, and is invariant under cyclic permutations of
the indices. Since cyclic permutations are odd in four dimensions, the ǫµνρσ tensor
changes sign under such a permutation, and the contraction with the trace is zero.
Therefore, we obtain:
$$ \partial_\mu K^\mu = \frac{1}{2}\, \epsilon^{\mu\nu\rho\sigma} F^a_{\mu\nu} F^a_{\rho\sigma}\,, \tag{4.115} $$
which is proportional to the θ-term. More precisely, we have
$$ \mathcal{L}_\theta = \frac{g^2\theta}{32\pi^2}\, \partial_\mu K^\mu\,. \tag{4.116} $$
The elementary proof of this result that we have presented in the previous subsection
is arguably rather cumbersome. This could have been made much more compact by
using the language of differential forms. The simplest differential forms are 1-forms,
that one may think of as the contraction of a spacetime dependent vector aµ (x) and
of the differential element dxµ , as in
$$ dx^\mu \wedge dx^\nu = -\, dx^\nu \wedge dx^\mu\,. $$
This also naturally implies that dxµ ∧ dxµ = 0, in accordance with the fact that a
quadrangle of edges (dxµ , dxµ , −dxµ , −dxµ ) is reduced to a line segment and thus
has zero area. The most general 2-form can be written as
where fµν is antisymmetric under the exchange of the µ, ν indices. 2-forms can be
integrated over a two-dimensional manifold, to give a number. In d-dimensional
space, one may iteratively construct p-forms for any p ≤ d (higher degree forms are
zero by antisymmetry of the exterior product). In particular, in four dimensions, the
volume element weighted by the fully antisymmetric tensor ǫµνρσ can be written as
d ω ≡ dxµ ∧ ∂µ ω . (4.122)
d2 ω = 0 . (4.124)
When applying the exterior derivative to the exterior product of two forms, one should
distribute the partial derivative on the two factors, and account for the fact that the
exterior derivative contains an anticommuting dxµ . Thus, if A is a p-form, we have
d A ∧ B = dA ∧ B + (−1)p A ∧ dB . (4.125)
Differential forms also provide a unified version of various formulas of vector calculus
(e.g., Kelvin–Stokes and Ostrogradsky–Gauss theorems), known as Stokes theorem.
Given a form ω and a manifold M, Stokes theorem states that
$$ \int_{\partial M} \omega = \int_{M} d\omega\,, \tag{4.126} $$
In order to cast the θ-term in the language of differential forms, let us firstly
introduce the gauge potential 1-form:
A ≡ ig Aµ dxµ . (4.127)
Then, we have
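$$ \begin{aligned}
dA &= \frac{ig}{2} \big( \partial_\mu A_\nu - \partial_\nu A_\mu \big)\, dx^\mu \wedge dx^\nu \\
A \wedge A &= -\frac{g^2}{2}\, \big[ A_\mu , A_\nu \big]\, dx^\mu \wedge dx^\nu\,,
\end{aligned} \tag{4.128} $$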
and we see that the field strength Fµν appears in the coefficients of the following
2-form,
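$$ F \equiv dA - A \wedge A = \frac{ig}{2}\, \underbrace{\big( \partial_\mu A_\nu - \partial_\nu A_\mu - i g\, [A_\mu , A_\nu] \big)}_{F_{\mu\nu}}\, dx^\mu \wedge dx^\nu\,. \tag{4.129} $$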
16 A differential form ω whose exterior derivative is zero (dω = 0) is said to be closed. A differential
form χ which is the exterior derivative of another form (χ = dω) is said to be exact.
where we have used the cyclicity of the trace and the fact that commuting dA with
other forms does not bring any sign since it is a 2-form. In order to obtain the last
line, we have used
tr A ∧ A ∧ A ∧ A = 0 , (4.133)
which is a consequence of the fact that a cyclic permutation of four objects is odd.
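The vanishing of tr (A ∧ A ∧ A ∧ A) can be checked numerically in components, where it reads ǫµνρσ tr (Aµ Aν Aρ Aσ ) = 0. The sketch below uses four random complex matrices as hypothetical stand-ins for the components Aµ (an assumption made purely for the check), and contrasts the result with the analogous contraction of three matrices, for which cyclic permutations are even and no cancellation occurs.

```python
import itertools

import numpy as np


def permutation_sign(perm):
    """Signature of a permutation, obtained by counting inversions."""
    sign = 1
    for i in range(len(perm)):
        for j in range(i + 1, len(perm)):
            if perm[i] > perm[j]:
                sign = -sign
    return sign


def epsilon_trace(matrices):
    """Sum over permutations p of sign(p) * tr(A_p0 A_p1 ... )."""
    total = 0.0
    for perm in itertools.permutations(range(len(matrices))):
        product = np.eye(matrices[0].shape[0], dtype=complex)
        for index in perm:
            product = product @ matrices[index]
        total += permutation_sign(perm) * np.trace(product)
    return total


rng = np.random.default_rng(1)
# Hypothetical stand-ins for the four matrix-valued components A_mu.
a_mu = [rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3)) for _ in range(4)]

total4 = epsilon_trace(a_mu)      # four factors: cancels identically
total3 = epsilon_trace(a_mu[:3])  # three factors: generically non-zero
```

The four-factor contraction vanishes up to rounding errors, while the three-factor one does not, which is why the cubic term survives in eq. (4.134) but the quartic one drops out.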
Eq. (4.132) is the translation in terms of forms of the fact that the θ-term is the
derivative of the vector Kµ . Thanks to Stokes theorem, the integral of the θ-term
over a four-dimensional manifold M can be rewritten as an integral over its boundary
(located at infinity if M is the entire ℝ⁴),
$$ \int_{M} {\rm tr}\, \big( F \wedge F \big) = \int_{\partial M} {\rm tr}\, \Big( A \wedge F + \frac{1}{3}\, A \wedge A \wedge A \Big)\,. \tag{4.134} $$
where the integer n is related to the chirality of the zero modes of the Dirac operator
in the gauge field configuration. When added to the Yang-Mills action, the integral of
the θ-term modifies the Euclidean path integral as follows
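$$ \begin{aligned}
\int \big[ DA_\mu \cdots \big]\, e^{-S[A,\cdots]} \;\to\; & \int \big[ DA_\mu \cdots \big]\, e^{-S[A,\cdots] - \int d^4x_E\, \mathcal{L}_\theta} \\
&= \sum_{n \in \mathbb{Z}} e^{-n\theta} \int \big[ DA_\mu \cdots \big]_n\, e^{-S[A,\cdots]}\,,
\end{aligned} \tag{4.136} $$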
where the measure DAµ n is restricted to the gauge fields of index n. Thus, the
effect of the θ-term is to reweight the gauge field configurations by a factor (e−θ )n
that depends only on θ and on the index n. Note that since n is an integer, the path
integral is periodic in θ, with a period 2iπ.
However, this does not include any CP-violating interactions, such as those that may
result from the θ-term. Its effects may be included in the effective theory by generalizing
the interaction term into
By a matching with the underlying theory, the new coupling λ can be related to the
parameter θ by the following estimate
Then, the effective theory (4.138) can be used to estimate the neutron electric dipole
moment DN (in the chiral limit where the pion mass mπ is much smaller than the
nucleon mass mN ). This leads to
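$$ D_N \approx \lambda\, \overline{\lambda}\; \frac{\ln \frac{m_N}{m_\pi}}{4\pi^2\, m_N}\; e \approx 5 \times 10^{-16}\; \theta\;\; e \cdot {\rm cm}\,. \tag{4.140} $$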
Current experimental limits on the neutron electric dipole moment indicate that
DN ≤ 3 × 10−26 e · cm , (4.141)
implying that θ ≲ 6 × 10−11 , as follows from dividing the bound (4.141) by the
coefficient of θ in eq. (4.140).
ψf −→ eiγ5 αf ψf , (4.143)
where f is an index labeling the quark flavours and the αf are real phases. Under this
transformation, the functional measure for the quarks is not invariant, but transforms
as follows
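$$ D\psi\, D\overline{\psi} \;\longrightarrow\; \exp \Big[ -\frac{i}{32\pi^2} \int d^4x\; \epsilon^{\mu\nu\rho\sigma} F^a_{\mu\nu} F^a_{\rho\sigma} \sum_f \alpha_f \Big]\; D\psi\, D\overline{\psi}\,. \tag{4.144} $$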
The same effect would have been obtained by a change of the angle θ:
$$ \theta \to \theta - 2 \sum_f \alpha_f\,. \tag{4.145} $$
For the quarks, we can write generically the following mass term17
$$ \sum_f M_f\, \overline{\psi}_f\, \frac{1 + \gamma_5}{2}\, \psi_f + \sum_f M^*_f\, \overline{\psi}_f\, \frac{1 - \gamma_5}{2}\, \psi_f\,, \tag{4.146} $$
that transforms into the following under the above chiral transformation
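$$ \sum_f e^{2i\alpha_f} M_f\, \overline{\psi}_f\, \frac{1 + \gamma_5}{2}\, \psi_f + \sum_f e^{-2i\alpha_f} M^*_f\, \overline{\psi}_f\, \frac{1 - \gamma_5}{2}\, \psi_f\,. \tag{4.147} $$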
Mf → e2iαf Mf . (4.148)
which is invariant. This discussion indicates that the θ-term has no effect if at least
one of the quarks is massless. Unfortunately, a massless up quark (the lightest quark)
does not seem consistent with existing experimental and lattice evidence.
17 If the masses are complex, then the symmetries P and CP are explicitly broken.
Using Stokes’ theorem, the integral of the θ-term over Euclidean spacetime may be
rewritten as an integral over a surface localized at infinity:
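$$ \int d^4x_E\; \mathcal{L}_\theta = \frac{g^2\theta}{32\pi^2} \int d^4x_E\; \partial_\mu K^\mu = \frac{g^2\theta}{32\pi^2} \lim_{R \to \infty} \int_{S_{3,R}} dS_\mu\, K^\mu\,, \tag{4.150} $$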
where S3,R is a 3-dimensional sphere of radius R and dSµ the measure on this surface.
Let us now assume that the coloured objects of the problem are comprised in a
finite region of space-time, so that the gauge field configuration goes to a pure gauge
at infinity. Such a field can be written as
$$ A_\mu(x) = a_\mu(x) + \frac{i}{g}\, \Omega^\dagger(\hat{x})\, \partial_\mu \Omega(\hat{x})\,, \tag{4.151} $$
where $\Omega(\hat{x})$ is an element of the gauge group that depends only on the direction of
the vector xµ , and aµ (x) is the deviation from the asymptotic pure gauge. For the
total field to be a pure gauge at infinity, this deviation must decrease faster than |x|−1 .
When |x| → +∞, Aν (x) goes to 0 as |x|−1 , while Fρσ (x) goes to 0 faster than |x|−2
(since Aν (x) goes to a pure gauge), and we have:
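$$ K^\mu \underset{|x| \to +\infty}{\longrightarrow} \frac{4ig}{3}\, \epsilon^{\mu\nu\rho\sigma}\, {\rm tr}\, (A_\nu A_\rho A_\sigma) \sim |x|^{-3}\,, \tag{4.152} $$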
and
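$$ \int d^4x_E\; \mathcal{L}_\theta = \frac{\theta}{24\pi^2} \lim_{R \to \infty} \int_{S_{3,R}} dS\; \hat{x}_\mu\, \epsilon^{\mu\nu\rho\sigma}\; {\rm tr}\, \Big[ \Omega^\dagger (\partial_\nu \Omega)\, \Omega^\dagger (\partial_\rho \Omega)\, \Omega^\dagger (\partial_\sigma \Omega) \Big]\,, \tag{4.153} $$
where we have used $dS_\mu = \hat{x}_\mu\, dS$, with dS the element of area on the 3-sphere. Note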
that the integrand decreases as R−3 because of the three derivatives, while dS ∼ R3 .
Therefore, the integral is in fact independent of the radius R and we can drop the limit:
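$$ \int d^4x_E\; \mathcal{L}_\theta = \frac{\theta}{24\pi^2} \int_{S_3} dS\; \hat{x}_\mu\, \epsilon^{\mu\nu\rho\sigma}\; {\rm tr}\, \Big[ \Omega^\dagger (\partial_\nu \Omega)\, \Omega^\dagger (\partial_\rho \Omega)\, \Omega^\dagger (\partial_\sigma \Omega) \Big]\,. \tag{4.154} $$
$$ \Omega : S_3 \longmapsto G\,. \tag{4.155} $$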
It turns out that these mappings can be grouped in equivalence classes of Ω’s that
can be deformed continuously into one another. On the contrary, Ω’s that belong to
distinct classes cannot be related by a continuous deformation. The set of these classes
possesses a group structure, and is called the third homotopy group of G, denoted
π3 (G). For all SU(N) groups with N ≥ 2, the third homotopy group is isomorphic
to (ℤ, +). The interpretation of eq. (4.154) is that the integral of the θ-term depends
only on the class to which Ω belongs, and is therefore a topological quantity that can
change only in discrete amounts. This discussion provides another point of view on
the Atiyah-Singer index theorem, where the same integral was related to the chirality
imbalance between the zero modes of the Euclidean Dirac operator in a background
gauge field.
$$ \frac{dW}{ds} \equiv \frac{d\gamma^\mu}{ds}\, D_\mu(\gamma(s))\, W = 0\,, \quad \text{with initial condition } W(0) = 1\,, \tag{4.158} $$
where the notation Dµ (γ(s)) indicates that the gauge field in the covariant derivative
must be evaluated at the point γµ (s). In other words, the covariant derivative of W,
projected along the tangent vector to the path γµ (s), is zero. From this definition, it
follows that W(s) is an element of the representation r of the gauge group if Aµ is in
the representation r of the algebra.
Note that when the gauge field Aµ is zero everywhere, then the solution is trivially
W(s) = 1. For a generic gauge field, the value of the solution18 at s = 1 is a property
of the path γµ and of the gauge potential Aµ . This object, that we will denote as
Wyx [A; γ], is called a Wilson line. Let us now study how it changes under a gauge transformation
Ω. From the transformation law of the covariant derivative, the differential equation
that defines the transformed WΩ (s) is
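$$ \frac{d\gamma^\mu}{ds}\, \Omega^\dagger(\gamma(s))\, D_\mu(\gamma(s))\, \Omega(\gamma(s))\, W^\Omega(s) = 0\,, \quad \text{with initial condition } W^\Omega(0) = 1\,. \tag{4.160} $$
$$ \frac{d\gamma^\mu}{ds}\, D_\mu(\gamma(s))\, Z(s) = 0\,, \quad \text{with initial condition } Z(0) = \Omega(x)\,. \tag{4.161} $$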
Comparing this equation with the original equation (4.158), we obtain
Looking now at the point s = 1, we see that the Wilson line transforms as
Thus, the Wilson line transforms precisely as we wanted in eq. (4.156), and we
conclude that the operator ψ(y)Wyx [A; γ]ψ(x) is gauge invariant. Note that the
Wilson line Wyx [A; γ], solution of eq. (4.158) at s = 1, can also be written as a
path-ordered exponential,
$$ W_{yx}[A; \gamma] = P \exp \Big( i g \int_\gamma dx^\mu\, A_\mu(x) \Big)\,. \tag{4.164} $$
Although this compact notation is suggestive, it is often useful to return to the defining
differential equation (4.158).
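The gauge covariance of the Wilson line is easy to verify on a discretized path, where the path-ordered exponential becomes an ordered product of group elements attached to the successive segments. The sketch below is a minimal illustration under assumptions that are not from the text: the gauge group is SU(2), the segment factors `links` are random group elements, and a segment going from site k to site k+1 is assumed to transform as U → G(k+1) U G(k)†, where G stands for whichever of Ω or Ω† the chosen convention places at a site.

```python
import numpy as np

rng = np.random.default_rng(2)


def random_su2():
    """Random SU(2) matrix built from a unit quaternion."""
    a, b, c, d = (q := rng.normal(size=4)) / np.linalg.norm(q)
    return np.array([[a + 1j * d, c + 1j * b], [-c + 1j * b, a - 1j * d]])


def wilson_line(link_list):
    """Ordered product of the links; later segments multiply on the left."""
    w = np.eye(2, dtype=complex)
    for u in link_list:
        w = u @ w
    return w


n_links = 6
links = [random_su2() for _ in range(n_links)]      # hypothetical path segments
gauge = [random_su2() for _ in range(n_links + 1)]  # one group element per site

# Assumed transformation law of a segment from site k to site k+1.
transformed = [
    gauge[k + 1] @ links[k] @ gauge[k].conj().T for k in range(n_links)
]

w = wilson_line(links)
w_transformed = wilson_line(transformed)

# Closed loop: the last site is identified with the first one.
closed_gauge = gauge[:-1] + [gauge[0]]
closed_links = [
    closed_gauge[k + 1] @ links[k] @ closed_gauge[k].conj().T
    for k in range(n_links)
]
trace_before = np.trace(w)
trace_after = np.trace(wilson_line(closed_links))
```

All the intermediate gauge factors cancel pairwise, so the transformed line depends only on the group elements at the two endpoints, and the trace is unchanged when the path closes on itself.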
By inserting a Wilson line between the points x and y, we can construct a gauge
invariant non-local operator ψ(y) · · · ψ(x). However, in doing so, we have introduced
18 Note that if the initial condition is W(0) = Ω0 instead of 1, then the solution would be changed as
follows: W(s) → W(s)Ω0 .
a path γ, for which there are infinitely many possible choices since only its endpoints
are fixed. It turns out that in general, the Wilson line depends on the path γ, i.e.
This implies that, although we may define gauge invariant non-local bilinear operators,
their definition is not unique and each choice of the path connecting the two points
leads to a different operator.
When the gauge potential is a pure gauge field, there exists a function Ω(x) such that
$$ A^\Omega_\mu(x) = \frac{i}{g}\, \Omega^\dagger(x)\, \partial_\mu \Omega(x)\,. \tag{4.166} $$
Since this field is a gauge transformation of the null field Aµ ≡ 0, Wilson lines in
this pure gauge field are given by
In other words, in a pure gauge field, the Wilson lines depend only on their endpoints,
but not on the path chosen to connect them. This is the only exception to the remark
of the previous paragraph.
Conversely, a gauge potential Aµ (x) in which the Wilson lines depend only on
the endpoints is a pure gauge. A function Ω(x) that gives this gauge potential through
eq. (4.166) can be constructed as a Wilson line from x to some arbitrary base point
x0 :
A Wilson loop is a special kind of Wilson line, where the initial point and endpoint
are identical, x = y, and therefore the path γ is a closed loop:
$$ W[A; \gamma] = P \exp \Big( i g \oint_\gamma dx^\mu\, A_\mu(x) \Big)\,. \tag{4.169} $$
Note that they are a property of the closed loop γ, and do not depend on the choice of
the starting point x. Because they have identical endpoints, the trace of a Wilson loop
is gauge invariant. From the result of the previous paragraph, they are equal to the
identity in a pure gauge field, but they depend non-trivially on the path in a generic
gauge field19 .
In Abelian gauge theories, the Wilson loop can be rewritten in terms of the integral
of the field strength Fµν over a surface Σ of boundary γ, by using Stokes theorem:
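$$ \exp \Big( i g \oint_\gamma dx^\mu\, A_\mu(x) \Big) \underset{\rm Abelian}{=} \exp \Big( i\, \frac{g}{2} \int_\Sigma dx^\mu \wedge dx^\nu\, F_{\mu\nu}(x) \Big)\,. \tag{4.170} $$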
Generalizations of this formula to the non-Abelian case exist, that involve a path-
ordering in the left hand side (thus giving a Wilson loop) and a surface-ordering in
the right hand side. For infinitesimally small closed loops, a more direct connection
to the field strength may be established. Consider for instance a small square closed
path in the (12) plane,
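γ = [figure: square loop of side a, based at the point x]
$$ = 1_r + i\, \epsilon\, \alpha^a t^a_r + \epsilon^2\, \beta^a t^a_r - \frac{\epsilon^2}{2}\, \alpha^a \alpha^b\, t^a_r t^b_r + O(\epsilon^3)\,, \tag{4.173} $$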
19 Wilson loops are extensively used in lattice gauge theories. Moreover, Giles’ theorem states that all
the gauge invariant information contained in a gauge potential Aµ can be reconstructed from the trace of
Wilson loops (assuming we know Wilson loops for arbitrary loops).
$$ {\rm tr}\, (W[A; \gamma]) = {\rm tr}\, (1_r) - \frac{g^2 a^4}{2}\, F^a_{12}(x)\, F^b_{12}(x)\; {\rm tr}\, (t^a_r t^b_r) + O(a^6)\,, \tag{4.174} $$
where we have used the fact that the generators $t^a_r$ are traceless for the su(N) algebra.
Eq. (4.174) is the basis of the discretization of the Yang-Mills action, the first step in
the formulation of lattice gauge theories.
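Eq. (4.174) can be tested numerically on a simple configuration. The sketch below assumes an SU(2) gauge field in the fundamental representation with constant, non-commuting components A1 = c1 t¹ and A2 = c2 t² (a hypothetical test configuration, for which the derivative terms of F12 vanish and only the commutator contributes). The four sides of the square are then exact exponentials, and their ordered product can be compared with the small-a expansion.

```python
import numpy as np


def exp_i(h):
    """Return exp(i h) for a Hermitian matrix h, via its eigen-decomposition."""
    vals, vecs = np.linalg.eigh(h)
    return (vecs * np.exp(1j * vals)) @ vecs.conj().T


t1 = np.array([[0, 1], [1, 0]], dtype=complex) / 2
t2 = np.array([[0, -1j], [1j, 0]], dtype=complex) / 2

g, a = 1.0, 0.1      # coupling and side of the square (illustrative values)
c1, c2 = 1.0, 1.0    # amplitudes of the constant potential (assumption)
A1, A2 = c1 * t1, c2 * t2

# F_12 = d_1 A_2 - d_2 A_1 - i g [A_1, A_2]; the derivatives vanish here.
F12 = -1j * g * (A1 @ A2 - A2 @ A1)

# Sides of the square in path order (+1, +2, -1, -2), later sides on the left.
loop = (
    exp_i(-g * a * A2) @ exp_i(-g * a * A1) @ exp_i(g * a * A2) @ exp_i(g * a * A1)
)

trace_exact = np.trace(loop).real
trace_expansion = 2.0 - 0.5 * g**2 * a**4 * np.trace(F12 @ F12).real
```

For a = 0.1 the two numbers agree up to a difference of order a⁶, while the deviation of the trace from tr 1 = 2 is of order a⁴, in line with eq. (4.174).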
Wilson lines also appear in the high energy limit of scattering by an external potential,
known as the eikonal limit. Consider the following S-matrix element,
for the transition between two arbitrary states made of quarks, antiquarks and gluons,
α and β. In the second equality, U(+∞, −∞) is the evolution operator from the
initial to the final state. It can be expressed as the time ordered exponential of the
interaction part of the Lagrangian,
$$ U(+\infty, -\infty) = T \exp \Big[ i \int d^4x\; \mathcal{L}_I(\phi_{\rm in}(x)) \Big]\,, \tag{4.176} $$
where φin denotes generically the fields in the interaction picture. In this discussion,
LI contains both the self-interactions of the fields, and their interaction with the
external field. Consider now the high energy limit of this scattering amplitude,
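$$ S^{(\infty)}_{\beta\alpha} \equiv \lim_{\omega \to +\infty} \big\langle \beta_{\rm in} \big|\, e^{-i\omega K^3}\, U(+\infty, -\infty)\, \underbrace{e^{+i\omega K^3} \big| \alpha_{\rm in} \big\rangle}_{\text{boosted state}} \tag{4.177} $$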
the projectile and the external field was via a scalar exchange, then the conclusion
would be that the scattering amplitude vanishes in the high energy limit (in other
words, S-matrix elements would go to unity). However, interactions with a colour
field involve a vector exchange, i.e. the external field couples to a four-vector Jµ that
represents the colour current carried by the projectile, by a term of the form Aµ Jµ . At
high energy, the longitudinal component of this four-vector increases proportionally
to the energy, and compensates the small time spent in the interaction zone. Thus, for
states that interact via a vector exchange20 , we expect that scattering amplitudes have
a finite high energy limit (neither zero nor infinite).
This calculation is best done using light-cone coordinates. For any four-vector
aµ , one defines
$$ a^+ \equiv \frac{a^0 + a^3}{\sqrt{2}}\,, \qquad a^- \equiv \frac{a^0 - a^3}{\sqrt{2}}\,. \tag{4.178} $$
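$$ \begin{aligned}
x \cdot y &= x^+ y^- + x^- y^+ - x_\perp \cdot y_\perp \\
d^4x &= dx^+\, dx^-\, d^2x_\perp \\
\Box &= 2\, \partial^+ \partial^- - \nabla^2_\perp \quad \text{with}\quad \partial^+ \equiv \frac{\partial}{\partial x^-}\,,\;\; \partial^- \equiv \frac{\partial}{\partial x^+}\,.
\end{aligned} \tag{4.179} $$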
Note also that the non-zero components of the metric tensor are
For a highly boosted projectile in the +z direction, x+ plays the role of the time,
and the Hamiltonian is the P− component of the momentum. The generator of
longitudinal boosts in light-cone coordinates is
K3 = M+− . (4.181)
Using the commutation relations of the Poincaré algebra, this leads to the following
identities:
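$$ \begin{aligned}
e^{-i\omega K^3}\, P^-\, e^{i\omega K^3} &= e^{-\omega}\, P^- \\
e^{-i\omega K^3}\, P^+\, e^{i\omega K^3} &= e^{+\omega}\, P^+ \\
e^{-i\omega K^3}\, P^j\, e^{i\omega K^3} &= P^j\,.
\end{aligned} \tag{4.182} $$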
20 By the same reasoning, gravitational interactions, that involve a spin two exchange, would lead to
They express the fact that, under longitudinal boosts, the components P± of a four-
vector are simply rescaled, while the transverse components are left unchanged.
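The light-cone decomposition of the scalar product in eq. (4.179) and the rescaling of the ± components under a longitudinal boost can be checked on explicit four-vectors. In the sketch below, the helper names `light_cone`, `minkowski_dot` and `boost_z` are introduced only for this illustration, and the boost is parameterized by its rapidity ω along the +z direction.

```python
import numpy as np


def light_cone(a):
    """Map (a0, a1, a2, a3) to (a_plus, a_minus, a_perp), as in eq. (4.178)."""
    a0, a1, a2, a3 = a
    return (a0 + a3) / np.sqrt(2), (a0 - a3) / np.sqrt(2), np.array([a1, a2])


def minkowski_dot(x, y):
    """Scalar product with the metric (+, -, -, -)."""
    return x[0] * y[0] - x[1] * y[1] - x[2] * y[2] - x[3] * y[3]


def boost_z(a, omega):
    """Longitudinal boost of rapidity omega along the +z direction."""
    a0, a1, a2, a3 = a
    ch, sh = np.cosh(omega), np.sinh(omega)
    return np.array([ch * a0 + sh * a3, a1, a2, sh * a0 + ch * a3])


x = np.array([1.3, -0.4, 2.0, 0.7])   # arbitrary test vectors
y = np.array([0.2, 1.1, -0.5, -1.6])
omega = 0.8

xp, xm, xt = light_cone(x)
yp, ym, yt = light_cone(y)
dot_light_cone = xp * ym + xm * yp - xt @ yt

bp, bm, bt = light_cone(boost_z(x, omega))
```

The boosted components satisfy `bp = exp(omega) * xp` and `bm = exp(-omega) * xm`, with the transverse part untouched, which is the classical counterpart of the operator identities (4.182).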
Likewise, states, creation operators and field operators are transformed as follows,
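$$ \begin{aligned}
e^{i\omega K^3} \big| p \cdots {}_{\rm in} \big\rangle &= \big| (e^{\omega} p^+, p_\perp) \cdots {}_{\rm in} \big\rangle \\
e^{i\omega K^3}\, a^\dagger_{\rm in}(q)\, e^{-i\omega K^3} &= a^\dagger_{\rm in}(e^{\omega} q^+, e^{-\omega} q^-, q_\perp) \\
e^{i\omega K^3}\, \phi_{\rm in}(x)\, e^{-i\omega K^3} &= \phi_{\rm in}(e^{-\omega} x^+, e^{\omega} x^-, x_\perp)\,.
\end{aligned} \tag{4.183} $$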
Note that the last equation is valid only for a scalar field, or for the transverse
components of a vector field. In addition, the ± components of a vector field receive
an overall rescaling by a factor e±ω . Moreover, since a longitudinal boost does not
alter the time ordering, we can also write
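$$ e^{-i\omega K^3}\, U(+\infty, -\infty)\, e^{i\omega K^3} = T \exp i \int d^4x\; \mathcal{L}_I \big( e^{-i\omega K^3}\, \phi_{\rm in}(x)\, e^{i\omega K^3} \big)\,. \tag{4.184} $$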
The components of the vector current that couples to the target field transform as
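$$ \begin{aligned}
e^{-i\omega K^3}\, J^i(x)\, e^{i\omega K^3} &= J^i(e^{-\omega} x^+, e^{\omega} x^-, x_\perp) \\
e^{-i\omega K^3}\, J^-(x)\, e^{i\omega K^3} &= e^{-\omega}\, J^-(e^{-\omega} x^+, e^{\omega} x^-, x_\perp) \\
e^{-i\omega K^3}\, J^+(x)\, e^{i\omega K^3} &= e^{\omega}\, J^+(e^{-\omega} x^+, e^{\omega} x^-, x_\perp)\,.
\end{aligned} \tag{4.185} $$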
Naturally, the target field Aµ does not change when we boost the projectile. For
simplicity, let us assume that Aµ is confined in the region −L ≤ x+ ≤ +L. We can
thus split the evolution operator into three factors,
The factors U(+∞, +L) and U(−L, −∞) do not contain the external potential. For
these two factors, the change of variables e−ω x+ → x+ , eω x− → x− leads to
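$$ \begin{aligned}
\lim_{\omega \to +\infty} e^{-i\omega K^3}\, U(+\infty, +L)\, e^{i\omega K^3} &= U_0(+\infty, 0) \\
\lim_{\omega \to +\infty} e^{-i\omega K^3}\, U(-L, -\infty)\, e^{i\omega K^3} &= U_0(0, -\infty)\,,
\end{aligned} \tag{4.187} $$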
where U0 is the same as U, but defined with the self-interactions only (since these
two factors correspond to the evolution of the projectile while outside of the target
field). For the factor U(+L, −L), the change eω x− → x− gives
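$$ \lim_{\omega \to +\infty} e^{-i\omega K^3}\, U(+L, -L)\, e^{i\omega K^3} = \exp \Big[ i \int d^2x_\perp\; \chi(x_\perp)\, \rho(x_\perp) \Big]\,, \tag{4.188} $$
with
$$ \chi(x_\perp) \equiv \int dx^+\; A^-(x^+, 0, x_\perp)\,, \qquad \rho(x_\perp) \equiv \int dx^-\; J^+(0, x^-, x_\perp)\,. \tag{4.189} $$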
This formula is an exact result in the limit ω → +∞. One may also note the following
important properties:
• Only the A− component of the external vector potential, integrated along the
trajectory of the projectile, matters.
• The self-interactions and the interactions with the external potential are fac-
torized into three separate factors – this is a generic property of high energy
scattering. The role of the longitudinal boost in this factorization is illustrated
in the figure 4.4.
Figure 4.4: Illustration of the role of kinematics in the factorization of eq. (4.190).
Left: before the boost is applied, quantum fluctuations of the incoming projectile
may occur in the region of the external field. Right: after the boost, the region of
the external field shrinks due to Lorentz contraction (in the frame of the projectile),
and the effect of quantum fluctuations inside this region goes to zero.
Eq. (4.190) is an operator formula that still contains the self-interactions of the fields
to all orders. In order to evaluate it, one must insert the identity operator written as a
sum over a complete set of states on each side of the exponential,
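$$ \begin{aligned}
S^{(\infty)}_{\beta\alpha} = \sum_{\gamma, \delta}\; & \big\langle \beta_{\rm in} \big|\, U_0(+\infty, 0)\, \big| \gamma_{\rm in} \big\rangle \\
& \times \big\langle \gamma_{\rm in} \big| \exp \Big[ i \int d^2x_\perp\; \chi(x_\perp)\, \rho(x_\perp) \Big] \big| \delta_{\rm in} \big\rangle \\
& \times \big\langle \delta_{\rm in} \big|\, U_0(0, -\infty)\, \big| \alpha_{\rm in} \big\rangle\,.
\end{aligned} \tag{4.191} $$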
The factor
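$$ \sum_{\delta} \big| \delta_{\rm in} \big\rangle \big\langle \delta_{\rm in} \big|\, U_0(0, -\infty)\, \big| \alpha_{\rm in} \big\rangle \tag{4.192} $$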
is the Fock expansion of the initial state: it accounts for the fact that the state α
prepared at x+ = −∞ may have fluctuated into another state δ before it interacts
with the external potential. The matrix elements of U0 that appear in this expansion
can be calculated perturbatively to any desired order. There is a similar factor for the
final state evolution.
The interactions with the external field are in the central factor, ⟨γin | exp[· · · ] |δin ⟩.
In order to rewrite it into a more intuitive form, let us first rewrite the operator ρ in
terms of creation and annihilation operators. For instance, the fermionic part of the
current gives
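$$ \begin{aligned}
\rho^a(x_\perp) = g\, t^a_{f\, ij} \int \frac{dp^+}{4\pi p^+}\, \frac{d^2p_\perp}{(2\pi)^2}\, \frac{d^2q_\perp}{(2\pi)^2}\; \Big\{ & b^\dagger_{si}(p^+, p_\perp)\, b_{sj}(p^+, q_\perp)\; e^{i(p_\perp - q_\perp) \cdot x_\perp} \\
& - d^\dagger_{si}(p^+, p_\perp)\, d_{sj}(p^+, q_\perp)\; e^{-i(p_\perp - q_\perp) \cdot x_\perp} \Big\}\,,
\end{aligned} \tag{4.193} $$
where the $t^a_f$ are the generators of the fundamental representation of the su(N)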
algebra and b, d, b† , d† are the annihilation and creation operators for quarks and
antiquarks. ρa also receives a contribution from gluons, not written here, obtained
with the generators in the adjoint representation and the annihilation and creation
operators for gluons instead. This formula captures the essence of eikonal scattering:
• The colours and transverse momenta of the constituents of the state may change
during the scattering.
Scattering amplitudes in the eikonal limit take a very simple form if one trades
transverse momentum for a transverse position by a Fourier transform. For each
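intermediate state $|\delta_{\rm in}\rangle \equiv |\{k^+_i, k_{i\perp}\}\rangle$, we first define the corresponding light-cone
wave function by:
$$ \Psi_{\delta\alpha}(\{k^+_i, x_{i\perp}\}) \equiv \prod_{i \in \delta} \int \frac{d^2k_{i\perp}}{(2\pi)^2}\; e^{-i k_{i\perp} \cdot x_{i\perp}}\; \big\langle \delta_{\rm in} \big|\, U_0(0, -\infty)\, \big| \alpha_{\rm in} \big\rangle\,, \tag{4.194} $$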
where the index i runs over all the constituents of the state δ. Then, each charged
particle going through the external field acquires an SU(N) phase that depends on
the representation in which it lives
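$$ \begin{aligned}
\Psi_{\delta\alpha}(\{k^+_i, x_{i\perp}\}) \;&\longrightarrow\; \Psi_{\delta\alpha}(\{k^+_i, x_{i\perp}\}) \prod_{i \in \delta} U_i(x_\perp) \\
U_i(x_\perp) &\equiv T \exp \Big[ i g \int dx^+\; A^-_a(x^+, 0, x_{i\perp})\; t^a_{r_i} \Big]\,,
\end{aligned} \tag{4.195} $$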
Quantization of
Yang-Mills theory
5.1 Introduction
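$$ \begin{aligned}
\mathcal{L} \equiv\; & \big( D_\mu \phi(x) \big)^\dagger \big( D^\mu \phi(x) \big) - m^2\, \phi^\dagger(x) \phi(x) - V\big( \phi^\dagger(x) \phi(x) \big) \\
& + \overline{\psi}(x) \big( i \slashed{D}_x - m \big) \psi(x) \\
& - \tfrac{1}{4}\, F^a_{\mu\nu}(x)\, F^{a\,\mu\nu}(x)\,.
\end{aligned} \tag{5.1} $$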
The local non-Abelian gauge invariance of this Lagrangian does not change anything
to the quantization of the scalar field φ and of the spinor ψ, for which we may use
the standard canonical or path integral approaches, with the result that the usual
Feynman rules still apply. The main complication resides in the pure Yang-Mills part
(third term) of this Lagrangian, i.e. with the quantization of the gauge potential Aµ .
The identification of the degrees of freedom that are made redundant by the gauge
symmetry is much more complicated than in QED, and a lot more care is necessary
in order to isolate the genuine dynamical variables of the theory.
In order to get a sense of the difficulty, let us try to mimic the QED case in order
to guess the Feynman rules for non-Abelian gauge fields. Using the explicit form of
the field strength,
$$ F^a_{\mu\nu} = \partial_\mu A^a_\nu - \partial_\nu A^a_\mu + g\, f^{abc} A^b_\mu A^c_\nu\,, \tag{5.2} $$
where we have anticipated an integration by parts in the first (kinetic) term. Note
that the kinetic term is formally identical to the kinetic term of a photon, except for
the colour index a carried by the gauge potential. Therefore, one may be tempted
to generalize the QED Feynman rules to a non-Abelian gauge boson. As in the
QED case, the quadratic part of the Lagrangian (5.3) poses a difficulty when trying
to determine the free propagator, because the operator between Aµ · · · Aν is not
invertible. If we take for granted that a similar gauge fixing procedure (more on this
later, as this is in fact the heart of the problem) can be applied here, we may assume
that the free gauge boson propagator1 in Feynman gauge is
$$ G^{0\,\mu\nu}_{F\, ab}(p) = \frac{-i\, g^{\mu\nu}\, \delta_{ab}}{p^2 + i0^+}\,, \tag{5.4} $$
and one may read off directly from the Lagrangian (5.3) the following 3-gluon and
4-gluon vertices:
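3-gluon vertex (legs $a\mu$, $b\nu$, $c\rho$ carrying the momenta $k$, $p$, $q$ respectively):
$$ g\, f^{abc} \Big\{ g^{\mu\nu} (k - p)^\rho + g^{\nu\rho} (p - q)^\mu + g^{\rho\mu} (q - k)^\nu \Big\} \tag{5.5} $$
4-gluon vertex (legs $a\mu$, $b\nu$, $c\rho$, $d\sigma$):
$$ -i\, g^2 \Big\{ f^{abe} f^{cde} \big( g^{\mu\rho} g^{\nu\sigma} - g^{\mu\sigma} g^{\nu\rho} \big) + f^{ace} f^{bde} \big( g^{\mu\nu} g^{\rho\sigma} - g^{\mu\sigma} g^{\nu\rho} \big) + f^{ade} f^{bce} \big( g^{\mu\nu} g^{\rho\sigma} - g^{\mu\rho} g^{\nu\sigma} \big) \Big\} \tag{5.6} $$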
All this seems fine, except for a rather subtle problem that would appear when
using this perturbation theory: these Feynman rules lead to amplitudes that do not
1 In this chapter, we use the diagrammatic convention of QCD, where the gauge bosons (gluons) are
represented as springs in Feynman diagrams. In the electroweak theory, it is more common to represent
them as wavy lines, like the photon in QED.
fulfill Ward identities, even when all the external coloured particles are on their mass-
shell. From the discussion of perturbative unitarity for amplitudes with external gauge
bosons in section 1.16.4, the lack of Ward identities seems to imply a violation of unitarity in
perturbation theory. Since unitarity is one of the cornerstones of any quantum theory,
this is not a conclusion we are ready to accept, and we must conclude that something
is missing in the above Feynman rules.
$$ A_\mu(x) \;\to\; A^\Omega_\mu(x) \equiv \Omega^\dagger(x)\, A_\mu(x)\, \Omega(x) + \frac{i}{g}\, \Omega^\dagger(x)\, \partial_\mu \Omega(x)\,, \tag{5.8} $$
leave the action and the observable unchanged. Moreover, the functional measure is
also invariant, since
$$ \big[ DA^{\Omega\, a}_\mu(x) \big] = \big[ DA^a_\mu(x) \big]\; \det \left( \frac{\delta A^{\Omega\, a}_\mu(x)}{\delta A^b_\nu(y)} \right)\,, \tag{5.9} $$
where the determinant is the Jacobian of the change of coordinates. Using eq. (4.37),
this determinant can be rewritten as follows
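$$ \det \left( \frac{\delta A^{\Omega\, a}_\mu(x)}{\delta A^b_\nu(y)} \right) = \det \Big( \delta_\mu{}^\nu\; \delta(x - y)\; \Omega^{\rm adj}_{ab}(x) \Big) = 1\,, \tag{5.10} $$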
since the group element Ωadj is a unitary matrix. Therefore, there is a large amount of
redundancy in the above path integral, and it is in fact infinite. By applying a gauge
transformation, each field configuration Aµ develops into a gauge orbit (see the figure
5.1), along which the physics is invariant. In order to eliminate this redundancy, we
Figure 5.1: Illustration of the gauge fixing procedure. The lines represent the
gauge field configurations spanned when varying Ω. The shaded surface is the
manifold where the gauge condition is satisfied, and the black dots are the gauge-
fixed field configurations.
would like to impose a condition at every space-time point x on the gauge fields,
$$ G^a\big( A_\mu(x) \big) = 0\,, \tag{5.11} $$
in order to select a unique2 field configuration along each orbit. Geometrically, the
gauge condition (5.11) defines a manifold that intersects each orbit, as shown in
the figure 5.1, and we choose this intersection as the representative of this field
configuration.
∆[Aµ ] is the determinant of the derivative of the constraint G(Aµ ) with respect to
the gauge transformation Ω, at the point where G(Aµ ) = 0,
$$ \Delta[A_\mu] = \det \left( \frac{\delta G^a}{\delta \Omega} \right)_{G^a(A^\Omega_\mu) = 0}\,. \tag{5.13} $$
In QED, for linear gauge fixing conditions, this derivative (and therefore the determi-
nant) is independent of the gauge field, and can be trivially factored out of the path
integral. This is not the case in non-Abelian gauge theories, and this determinant
is the source of significant complications. One can first prove that the determinant
∆[Aµ ] is gauge invariant. Indeed, changing Aµ → AΘ µ , we have:
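$$ \begin{aligned}
\Delta^{-1}[A^\Theta_\mu] &= \int \big[ D\Omega(x) \big]\; \delta\big[ G^a(A^{\Theta\Omega}_\mu) \big] \qquad (\Omega' \equiv \Theta\, \Omega) \\
&= \int \big[ D(\Theta^\dagger(x)\, \Omega'(x)) \big]\; \delta\big[ G^a(A^{\Omega'}_\mu) \big] \\
&= \int \big[ D\Omega'(x) \big]\; \delta\big[ G^a(A^{\Omega'}_\mu) \big] = \Delta^{-1}[A_\mu]\,.
\end{aligned} \tag{5.14} $$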
2 It turns out that this is not possible, due to the Gribov ambiguity: all gauge conditions of the form
(5.11) have several solutions, called Gribov copies. However, only one of these solutions is a “small field”,
while the others are proportional to the inverse coupling g−1 . Since perturbation theory is an expansion
around the vacuum (i.e. in the small field regime), these non-perturbatively large copies do not play any
role in perturbation theory.
Here, we have used the fact that there exists a group invariant integration measure on
a Lie group. By inserting
$$ 1 = \Delta[A_\mu] \int \big[ D\Omega(x) \big]\; \delta\big[ G^a(A^\Omega_\mu) \big] \tag{5.15} $$
At this point, the second integral does not contain the gauge transformation Ω
anymore, and therefore we have managed to factorize the “integral along the orbits”
in the form of the first integral over [DΩ]. Dropping this constant factor, we can therefore
write an integral free of any redundancy:
$$ \big\langle O \big\rangle = \int \big[ DA^a_\mu(x) \big]\; \Delta[A_\mu]\; \delta\big[ G^a(A_\mu) \big]\; O(A_\mu)\; e^{i S_{\rm YM}[A_\mu]}\,. \tag{5.19} $$
In the above formula, the determinant ∆[Aµ ] depends on the gauge field and must
therefore have an effect on the Feynman rules. The Faddeev-Popov method consists
in rewriting this determinant as a path integral. Note that since ∆[Aµ ] appears in the
numerator, we need Grassmann variables in order to represent it as a path integral3 ,
according to eq. (3.36):
\[
\det(i M) = \int D\chi_a(x)\, D\bar\chi_a(x)\; \exp\Big\{ i \int d^4x\, d^4y\; \bar\chi_a(x)\, M_{ab}(x,y)\, \chi_b(y) \Big\} . \tag{5.20}
\]
3 The factor i in det i M has been included for aesthetic reasons, but does not change anything. In
fact any rescaling M → κ M would leave the results unchanged. Indeed, such a change would alter
the ghost propagator according to S → κ−1 S, and the ghost-gauge boson vertex by V → κV. Since the
ghosts appear only in closed loops, that contain an equal number of propagators and vertices, these factors
κ would cancel out.
An extra generalization, that we have already used in the path integral quantization
of the photon (see eq. (3.52)), is to shift the gauge condition from Ga (A) = 0 to
Ga (A) = ωa and to perform a Gaussian integration over ωa . The final result takes
the following form:
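\[
\langle O\rangle = \int DA^a_\mu(x)\, D\chi_a(x)\, D\bar\chi_a(x)\; O(A_\mu)\;
\exp\Big\{ i \int d^4x\, \Big( \underbrace{-\frac{1}{4} F^a_{\mu\nu} F^{a\,\mu\nu}}_{\mathcal{L}_{\rm YM}}
\; \underbrace{-\,\frac{\xi}{2} \big(G^a(A_\mu)\big)^2}_{\mathcal{L}_{\rm GF}}
\; + \underbrace{\bar\chi_a M_{ab} \chi_b}_{\mathcal{L}_{\rm FPG}} \Big) \Big\} , \tag{5.21}
\]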
where $M_{ab}$ is the derivative of $G^a(A^\Omega)$ with respect to the gauge transformation Ω,
at the point Ω = 1 (here, we use the fact that the determinant is gauge invariant to
choose freely the Ω at which we compute the derivative). The unphysical Grassmann
fields $\chi$ and $\bar\chi$ introduced as a trick to express the determinant are called Fadeev-Popov
ghosts, or simply ghosts. Although physical observables do not depend on these
fictitious fields, there is in general a coupling between the ghosts and the gauge fields,
because the matrix $M_{ab}$ may contain the gauge field. This implies that the ghosts
may appear in the form of loop corrections in the perturbative expansion. As we
shall see shortly, they are in fact crucial for the consistency of perturbation theory in
non-Abelian gauge theories. In particular, the ghosts ensure that the theory is unitary.
In covariant gauges, the gauge fixing function is chosen as
\[
G^a(A) \equiv \partial^\mu A^a_\mu - \omega^a(x) . \tag{5.22}
\]
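With this gauge fixing, the free gauge boson propagator is
\[
G^{0\,\mu\nu}_{F\,ab}(p) = \frac{-i\, g^{\mu\nu}\, \delta_{ab}}{p^2 + i0^+} + \frac{i\, \delta_{ab}}{p^2 + i0^+} \Big(1 - \frac{1}{\xi}\Big) \frac{p^\mu p^\nu}{p^2} . \tag{5.23}
\]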
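The covariant-gauge propagator can be checked numerically. The sketch below assumes that the quadratic operator obtained from the Yang-Mills term plus the gauge fixing term is, in momentum space, $K^{\mu\nu} = g^{\mu\nu}p^2 - (1-\xi)\,p^\mu p^\nu$ (this operator is not written out in the text, so it is an assumption of the example), and verifies with exact rational arithmetic that eq. (5.23), stripped of its factor $-i$ and of the colour delta, is its inverse. The helper names `kinetic` and `propagator` are illustrative only.

```python
from fractions import Fraction

ETA = [1, -1, -1, -1]  # diagonal of the metric g = diag(+1, -1, -1, -1)

def metric():
    return [[Fraction(ETA[m]) if m == n else Fraction(0) for n in range(4)]
            for m in range(4)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def square(p):
    return sum(ETA[m] * p[m] * p[m] for m in range(4))

def kinetic(p, xi):
    # Assumed quadratic operator: K^{mu nu} = g^{mu nu} p^2 - (1 - xi) p^mu p^nu
    g, p2 = metric(), square(p)
    return [[g[m][n] * p2 - (1 - xi) * p[m] * p[n] for n in range(4)]
            for m in range(4)]

def propagator(p, xi):
    # Eq. (5.23) without the factor -i: [g^{mu nu} - (1 - 1/xi) p^mu p^nu / p^2] / p^2
    g, p2 = metric(), square(p)
    return [[(g[m][n] - (1 - 1 / xi) * p[m] * p[n] / p2) / p2 for n in range(4)]
            for m in range(4)]

p = [Fraction(3), Fraction(1), Fraction(-2), Fraction(5)]  # arbitrary off-shell momentum
xi = Fraction(7, 3)                                        # arbitrary gauge parameter

# K^{mu alpha} g_{alpha beta} D^{beta nu} should reproduce g^{mu nu}
inverse_check = matmul(matmul(kinetic(p, xi), metric()), propagator(p, xi))
print(inverse_check == metric())

# Longitudinal contraction: p_mu D^{mu nu} = p^nu / (xi p^2)
contraction = [sum(ETA[m] * p[m] * propagator(p, xi)[m][n] for m in range(4))
               for n in range(4)]
print(contraction == [p[n] / (xi * square(p)) for n in range(4)])
```

The second check shows that the longitudinal part of the propagator is the only place where the gauge parameter survives after contraction with the momentum.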
(The simplest form is obtained in the limit ξ → 1, giving the Feynman gauge4 .) The
matrix Mab can be calculated by applying an infinitesimal gauge transformation
Ω = exp(iθa ta ) to Aµ . The variation of the gauge field is
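\[
\delta A^a_\mu(x) = g f^{abc}\, \theta_b(x)\, A^c_\mu(x) - \partial_\mu \theta_a(x) , \tag{5.24}
\]
and the variation of $G^a(A)$ at the point x is
\[
\delta G^a = g f^{abc}\, \big(\partial^\mu \theta_b(x)\big) A^c_\mu(x) + g f^{abc}\, \theta_b(x)\, \partial^\mu A^c_\mu(x) - \square\, \theta_a(x) . \tag{5.25}
\]
Therefore, we have
\[
M_{ab} = \frac{\delta G^a(A)}{\delta \theta^b} = g f^{abc}\, \big(\partial^\mu A^c_\mu(x)\big) + g f^{abc}\, A^c_\mu(x)\, \partial^\mu - \delta_{ab}\, \square , \tag{5.26}
\]
and the terms that depend on the Fadeev-Popov ghosts can be encapsulated in the
following effective Lagrangian:
\[
\mathcal{L}_{\rm FPG} = \bar\chi_a \Big( -\delta_{ab}\, \square + g f^{abc}\, \big(\partial^\mu A^c_\mu(x)\big) + g f^{abc}\, A^c_\mu(x)\, \partial^\mu \Big) \chi_b \tag{5.27}
\]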
The first term leads to the following propagator for the ghosts:
\[
G^0_F(p) = \frac{i\, \delta_{ab}}{p^2 + i0^+} . \tag{5.28}
\]
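Note that it has the form of a scalar propagator, although the ghosts are anti-commuting
Grassmann variables. The vertex between ghosts and gauge bosons reads
[Diagram: ghost-gluon vertex, with ghost lines of colours a (momentum r) and b (momentum p), and a gluon line of colour c, Lorentz index µ and momentum q]
\[
= g f^{abc}\, (p_\mu + q_\mu) = g f^{abc}\, r_\mu . \tag{5.29}
\]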
The Feynman rules for non-Abelian gauge theories in covariant gauge are summarized
in figure 5.2, where we have added for completeness the rules relative to fermions.
4 Another popular choice is the Landau gauge, obtained in the limit ξ → +∞, that corresponds to a
strict enforcement of the condition $\partial^\mu A_\mu = 0$. Indeed, in this limit the exponential of $i\,\frac{\xi}{2}(\partial^\mu A_\mu)^2$ in the
gauge fixed Lagrangian oscillates wildly (and produces cancellations) unless $\partial^\mu A_\mu = 0$. Equivalently,
the Gaussian distribution for the function $\omega^a(x)$ has a vanishing width in this limit, which forces the strict
equality $\partial^\mu A^a_\mu = 0$.
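Figure 5.2: Feynman rules for non-Abelian gauge theories in covariant gauge (diagrams not reproduced).
• Gluon propagator (momentum p):
\[
\frac{-i\, g^{\mu\nu}\, \delta_{ab}}{p^2 + i0^+} + \frac{i\, \delta_{ab}}{p^2 + i0^+} \Big(1 - \frac{1}{\xi}\Big) \frac{p^\mu p^\nu}{p^2}
\]
• Quark propagator (momentum p):
\[
\frac{i\, \delta_{ij}}{\slashed{p} - m + i0^+}
\]
• Ghost propagator (momentum p):
\[
\frac{i\, \delta_{ab}}{p^2 + i0^+}
\]
• Three-gluon vertex (lines aµ, bν, cρ with momenta k, p, q):
\[
g f^{abc} \big[ g^{\mu\nu} (k - p)^\rho + g^{\nu\rho} (p - q)^\mu + g^{\rho\mu} (q - k)^\nu \big]
\]
• Four-gluon vertex (lines aµ, bν, cρ, dσ):
\[
-i g^2 \big[ f^{abe} f^{cde} (g^{\mu\rho} g^{\nu\sigma} - g^{\mu\sigma} g^{\nu\rho}) + f^{ace} f^{bde} (g^{\mu\nu} g^{\rho\sigma} - g^{\mu\sigma} g^{\nu\rho}) + f^{ade} f^{bce} (g^{\mu\nu} g^{\rho\sigma} - g^{\mu\rho} g^{\nu\sigma}) \big]
\]
• Quark-gluon vertex (quark colours i, j, gluon aµ):
\[
-i g\, \gamma^\mu\, (t^a_r)_{ij}
\]
• Ghost-gluon vertex (ghost colours a, b with momenta r, p, gluon cµ with momentum q):
\[
g f^{abc}\, (p_\mu + q_\mu) = g f^{abc}\, r_\mu
\]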
\[
G^a(A) \equiv n^\mu A^a_\mu - \omega^a(x) . \tag{5.30}
\]
After gauge fixing, the quadratic part of the effective Lagrangian reads
\[
\frac{1}{2}\, A^a_\mu \big( g^{\mu\nu}\, \square - \partial^\mu \partial^\nu - \xi\, n^\mu n^\nu \big) A^a_\nu , \tag{5.31}
\]
and the free gauge boson propagator is obtained in momentum space by inverting
gµν p2 − pµ pν + ξ nµ nν . (5.32)
(This is the most general symmetric tensor that one may construct with gµν , pµ and
nµ .) This leads to the following propagator
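\[
G^{0\,\mu\nu}_{F\,ab}(p) = \frac{-i\, \delta_{ab}}{p^2 + i0^+} \Big[ g^{\mu\nu} - \frac{p^\mu n^\nu + p^\nu n^\mu}{p\cdot n} + \big( n^2 + \xi^{-1} p^2 \big) \frac{p^\mu p^\nu}{(p\cdot n)^2} \Big] . \tag{5.34}
\]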
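As a cross-check of the axial-gauge result, the following sketch verifies with exact rational arithmetic that the bracket of eq. (5.34), divided by $p^2$, is the inverse of the tensor (5.32). The momentum, the vector $n^\mu$ and the value of ξ are arbitrary illustrative choices, and the function names are hypothetical.

```python
from fractions import Fraction

ETA = [1, -1, -1, -1]  # metric g = diag(+1, -1, -1, -1)

def g(m, n):
    return Fraction(ETA[m]) if m == n else Fraction(0)

def dot(a, b):
    return sum(ETA[m] * a[m] * b[m] for m in range(4))

def axial_kinetic(p, n, xi):
    # Tensor of eq. (5.32): g^{mu nu} p^2 - p^mu p^nu + xi n^mu n^nu
    return [[g(m, r) * dot(p, p) - p[m] * p[r] + xi * n[m] * n[r]
             for r in range(4)] for m in range(4)]

def axial_propagator(p, n, xi):
    # Bracket of eq. (5.34) divided by p^2 (the overall factor -i is dropped)
    pn, p2, n2 = dot(p, n), dot(p, p), dot(n, n)
    return [[(g(m, r) - (p[m] * n[r] + p[r] * n[m]) / pn
              + (n2 + p2 / xi) * p[m] * p[r] / pn ** 2) / p2
             for r in range(4)] for m in range(4)]

p = [Fraction(3), Fraction(1), Fraction(-2), Fraction(5)]
n = [Fraction(0), Fraction(0), Fraction(0), Fraction(1)]  # space-like axial vector
xi = Fraction(2, 5)

K, D = axial_kinetic(p, n, xi), axial_propagator(p, n, xi)
# K^{mu alpha} g_{alpha beta} D^{beta nu}; the metric is diagonal, so one sum suffices
product = [[sum(K[m][a] * ETA[a] * D[a][r] for a in range(4)) for r in range(4)]
           for m in range(4)]
is_inverse = product == [[g(m, r) for r in range(4)] for m in range(4)]
print(is_inverse)
```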
Note that this propagator does not vanish as $p^{-2}$ at large momentum, because of the
term proportional to $\xi^{-1}$. With this gauge fixing, the variation of the gauge fixing
function under an infinitesimal gauge transformation is given by
which leads to the following expressions for the ghost propagator and its coupling to
the gauge boson:
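\[
G^0_F(p) = -\frac{\delta_{ab}}{p\cdot n + i0^+}
\]
[Diagram: ghost-gluon vertex, with ghost lines of colours a (momentum r) and b (momentum p), and a gluon line of colour c, Lorentz index µ and momentum q]
\[
= i g f^{abc}\, n^\mu . \tag{5.38}
\]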
In Yang-Mills theory, it turns out that the identity (5.41) is in general not satisfied.
Instead, it is replaced by a different on-shell identity, discovered by ’t Hooft. In order
to derive it, let us consider a generalized covariant gauge condition of the form
\[
\partial_\mu A^\mu_a(x) - \zeta_a(x) = \omega_a(x) . \tag{5.42}
\]
where $\tilde\zeta_a$ is the Fourier transform of $\zeta_a$. This source is always contracted into an
external gluon propagator, leading to the combination5
\[
i\, \xi\, k_\mu\, \tilde\zeta_a(k)\, G^{0\,\mu\nu}_{F\,ab}(k) = \frac{\tilde\zeta_b(k)\, k^\nu}{k^2} . \tag{5.46}
\]
(The propagator is given in eq. (5.23).) In this contraction, the external gluon prop-
agator is replaced by a factor kν directly contracted into the amputated correlation
function, independently of the gauge parameter.
Since the function ζa has been introduced as part of our choice of gauge fixing
condition, gauge invariant quantities should not depend on it. Consequently, the
sum of the graphs contributing to gauge invariant quantities with a given non-zero
number of insertions of the source ζa must be zero. Consider S-matrix elements,
i.e. transition amplitudes between physical states. The graphs contributing to such a
matrix element have a number of external gluons corresponding to the in- and out-
states of the amplitude, plus possibly some insertions of the source ζa :
while in QED it is sufficient to contract the self-energy with a single k (even off-shell)
to obtain zero. Note that these identities are not sufficient to obtain a unitary S-matrix
with only internal gluons, because the tensor structure of the internal cut gluon
propagators also involves polarizations which are neither physical nor proportional
to qν . The Zinn-Justin equation, that we shall derive in the next chapter, may be
viewed as a generalization of these Ward identities to off-shell momenta and arbitrary
polarizations.
i. There are no Ward identities similar to those of QED, that could be used to
prove unitarity.
ii. Higher order graphs in general have ghost loops, whose interpretation is at the
moment unclear when such loops are cut.
As we shall see, these two issues are in fact related: the cut ghost lines precisely
cancel the unphysical polarizations of the cut gluons. Let us first work out an explicit
example that illustrates this assertion: the tree level annihilation of a quark and an
antiquark into two gluons in QCD. The corresponding diagrams are the following:
We denote by p and q the momenta of the incoming quark and antiquark, respectively, and
k1,2 the momenta of the outgoing gluons (with Lorentz indices µ, ν and colours a, b,
respectively).
The contribution of the first two graphs is very similar to that of the analogous
graphs in QED for the emission of two photons, except for the extra colour matrices
at the quark-gluon vertices:
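\[
i\mathcal{M}^{\mu\nu}_{ab}\big|_{1+2}(p,q|k_1,k_2) = (i g)^2\, \bar v(q) \Big[ \gamma^\mu t^a\, \frac{i}{\slashed{k}_1 - \slashed{q} - m}\, \gamma^\nu t^b + \gamma^\nu t^b\, \frac{i}{\slashed{p} - \slashed{k}_1 - m}\, \gamma^\mu t^a \Big] u(p) . \tag{5.49}
\]
\[
k_{1\mu}\, i\mathcal{M}^{\mu\nu}_{ab}\big|_{1+2}(p,q|k_1,k_2) = (i g)^2\, \bar v(q) \Big[ \slashed{k}_1 t^a\, \frac{i}{\slashed{k}_1 - \slashed{q} - m}\, \gamma^\nu t^b + \gamma^\nu t^b\, \frac{i}{\slashed{p} - \slashed{k}_1 - m}\, \slashed{k}_1 t^a \Big] u(p) . \tag{5.50}
\]
\[
\begin{aligned}
\slashed{k}_1 &= (\slashed{k}_1 - \slashed{q} - m) + (\slashed{q} + m) , \\
\slashed{k}_1 &= (\slashed{p} - m) - (\slashed{p} - \slashed{k}_1 - m) ,
\end{aligned} \tag{5.51}
\]
\[
(\slashed{p} - m)\, u(p) = 0 , \tag{5.52}
\]
which leads to
\[
k_{1\mu}\, i\mathcal{M}^{\mu\nu}_{ab}\big|_{1+2}(p,q|k_1,k_2) = i\, (i g)^2\, \bar v(q)\, \gamma^\nu\, [t^a, t^b]\, u(p) . \tag{5.53}
\]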
related to the third graph, that contains a 3-gluon vertex. If we use the Feynman gauge
for the internal gluon propagator, its contribution can be written as
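\[
i\mathcal{M}^{\mu\nu}_{ab}\big|_{3}(p,q|k_1,k_2) = i g\, \bar v(q) \gamma_\rho t^c u(p)\; \frac{-i}{k_3^2}\; g f^{abc} \big[ g^{\mu\nu} (k_2 - k_1)^\rho + g^{\nu\rho} (k_3 - k_2)^\mu + g^{\rho\mu} (k_1 - k_3)^\nu \big] , \tag{5.54}
\]
\[
k_{1\mu}\, i\mathcal{M}^{\mu\nu}_{ab}\big|_{3}(p,q|k_1,k_2) = i g\, \bar v(q) \gamma_\rho t^c u(p)\; \frac{-i}{k_3^2}\; g f^{abc} \big[ g^{\nu\rho} k_2^2 - k_2^\nu k_2^\rho - g^{\nu\rho} k_3^2 + k_3^\nu k_3^\rho \big] . \tag{5.55}
\]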
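The step from eq. (5.54) to eq. (5.55) is a purely algebraic contraction of the three-gluon tensor with $k_{1\mu}$. The sketch below checks it on integer momenta, assuming the convention $k_1 + k_2 + k_3 = 0$ for the three momenta entering the vertex (an assumption of this example, chosen because it reproduces the bracket of eq. (5.55)). The function names are illustrative.

```python
ETA = [1, -1, -1, -1]  # metric g = diag(+1, -1, -1, -1)

def g(m, n):
    return ETA[m] if m == n else 0

def dot(a, b):
    return sum(ETA[m] * a[m] * b[m] for m in range(4))

def vertex(k1, k2, k3, mu, nu, rho):
    # Tensor structure of the three-gluon vertex as it appears in eq. (5.54)
    return (g(mu, nu) * (k2[rho] - k1[rho])
            + g(nu, rho) * (k3[mu] - k2[mu])
            + g(rho, mu) * (k1[nu] - k3[nu]))

def contracted(k1, k2, k3, nu, rho):
    # k_{1 mu} V^{mu nu rho}; the index of k1 is lowered with the metric
    return sum(ETA[mu] * k1[mu] * vertex(k1, k2, k3, mu, nu, rho) for mu in range(4))

def expected(k2, k3, nu, rho):
    # Bracket of eq. (5.55)
    return (g(nu, rho) * (dot(k2, k2) - dot(k3, k3))
            - k2[nu] * k2[rho] + k3[nu] * k3[rho])

k1 = [5, 3, 4, 0]                       # light-like: 25 - 9 - 16 = 0
k2 = [2, 1, -1, 3]                      # arbitrary
k3 = [-(a + b) for a, b in zip(k1, k2)]  # assumed momentum conservation

identity_holds = all(contracted(k1, k2, k3, nu, rho) == expected(k2, k3, nu, rho)
                     for nu in range(4) for rho in range(4))
print(identity_holds)
```

In this form the identity only uses momentum conservation; the on-shell conditions enter afterwards, when individual terms of the bracket are discarded.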
In this equation, the term in $k_3^\nu k_3^\rho$ vanishes once contracted with $\gamma_\rho$, since we can
write
\[
\bar v(q) \gamma_\rho t^c u(p)\, k_3^\rho = -\bar v(q) \big[ (\slashed{p} - m) + (\slashed{q} + m) \big] t^c u(p) = 0 . \tag{5.56}
\]
However, this is not sufficient for (5.55) to fully cancel (5.53).
Setting $k_2^2 = 0$ kills another term in eq. (5.55). The term in $k_2^\nu k_2^\rho$ would be
canceled if in addition we contract the amplitudes with a transverse polarization
vector $\epsilon_{1,2\,\nu}(k_2)$, since $k_2^\nu\, \epsilon_{1,2\,\nu}(k_2) = 0$. We indeed have:
\[
k_{1\mu}\, \epsilon_{1,2\,\nu}(k_2) \Big[ i\mathcal{M}^{\mu\nu}_{ab}\big|_{1+2}(p,q|k_1,k_2) + i\mathcal{M}^{\mu\nu}_{ab}\big|_{3}(p,q|k_1,k_2) \Big]_{k_2^2 = 0} = 0 . \tag{5.57}
\]
Except for a graph with a quark loop that does not play any role in the present
discussion (since it does not give any 2-gluon final state when cut), the complete list
of graphs contributing to the qq̄ → qq̄ forward amplitude at one loop is shown in
figure 5.3. The contribution of the first 5 graphs (i.e. those with gluon internal lines) to the
optical theorem can be calculated easily by noting that it can be expressed in terms of
the amplitude we have just calculated:
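\[
i\mathcal{M}^{\mu\nu}_{ab}(p,q|k_1,k_2) \equiv i\mathcal{M}^{\mu\nu}_{ab}\big|_{1+2}(p,q|k_1,k_2) + i\mathcal{M}^{\mu\nu}_{ab}\big|_{3}(p,q|k_1,k_2) , \tag{5.59}
\]
as follows6
\[
\begin{aligned}
\frac{1}{2} \int \frac{d^4k_1}{(2\pi)^4} \frac{d^4k_2}{(2\pi)^4}\; & (2\pi)^4\, \delta^{(4)}(p + q - k_1 - k_2) \\
& \times 2\pi (-g_{\mu\rho})\, \theta(k_1^0)\, \delta(k_1^2 - m^2)\;\; 2\pi (-g_{\nu\sigma})\, \theta(k_2^0)\, \delta(k_2^2 - m^2) \\
& \times i\mathcal{M}^{\mu\nu}_{ab}(p,q|k_1,k_2)\, \big( i\mathcal{M}^{\rho\sigma}_{ab}(p,q|k_1,k_2) \big)^* .
\end{aligned} \tag{5.60}
\]
where $\epsilon^\mu_\pm(k)$ are unphysical polarizations (with $\epsilon^\mu_+(k)$ proportional to $k^\mu$). After this
substitution, several terms are not problematic: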
6 The factor 1/2 is a symmetry factor due to the presence of two identical gluons in the final state.
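\[
\frac{1}{2} \Big[ \big( i\mathcal{M}^{\mu\nu}_{ab}\, \epsilon_{-\mu} \epsilon_{+\nu} \big) \big( i\mathcal{M}^{\rho\sigma}_{ab}\, \epsilon_{+\rho} \epsilon_{-\sigma} \big)^* + \big( i\mathcal{M}^{\mu\nu}_{ab}\, \epsilon_{+\mu} \epsilon_{-\nu} \big) \big( i\mathcal{M}^{\rho\sigma}_{ab}\, \epsilon_{-\rho} \epsilon_{+\sigma} \big)^* \Big] , \tag{5.62}
\]
integrated over the on-shell momenta $k_1$ and $k_2$. Using $\epsilon^\mu_+(k) = k^\mu / (\sqrt{2}\, |\mathbf{k}|)$ and
eqs. (5.53) and (5.55), we obtain
\[
\epsilon_{+\mu}(k_1)\, i\mathcal{M}^{\mu\nu}_{ab} = -\frac{g^2}{\sqrt{2}\, |\mathbf{k}_1|}\, \frac{1}{k_3^2}\; \bar v(q)\, \slashed{k}_2\, k_2^\nu\, f^{abc} t^c\, u(p) . \tag{5.63}
\]
Likewise with the other gluon, we have
\[
\epsilon_{+\nu}(k_2)\, i\mathcal{M}^{\mu\nu}_{ab} = \frac{g^2}{\sqrt{2}\, |\mathbf{k}_2|}\, \frac{1}{k_3^2}\; \bar v(q)\, \slashed{k}_1\, k_1^\mu\, f^{abc} t^c\, u(p) . \tag{5.64}
\]
Using then $\epsilon^\mu_-(k) = (k^0, -\mathbf{k}) / (\sqrt{2}\, |\mathbf{k}|)$, we get
\[
\begin{aligned}
\epsilon_{-\nu}(k_2)\, \epsilon_{+\mu}(k_1)\, i\mathcal{M}^{\mu\nu}_{ab} &= -g^2\, \frac{|\mathbf{k}_2|}{|\mathbf{k}_1|}\, \frac{1}{k_3^2}\; \bar v(q)\, \slashed{k}_2\, f^{abc} t^c\, u(p) , \\
\epsilon_{+\nu}(k_2)\, \epsilon_{-\mu}(k_1)\, i\mathcal{M}^{\mu\nu}_{ab} &= +g^2\, \frac{|\mathbf{k}_1|}{|\mathbf{k}_2|}\, \frac{1}{k_3^2}\; \bar v(q)\, \slashed{k}_1\, f^{abc} t^c\, u(p) .
\end{aligned} \tag{5.65}
\]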
Squaring this amplitude, and including the − sign7 associated to a ghost loop8 , the
contribution of the last graph of fig. 5.3 to the optical theorem becomes
\[
-g^4\, \frac{1}{(k_3^2)^2}\; \big[ \bar v(q)\, \slashed{k}_1\, f^{abc} t^c\, u(p) \big] \big[ \bar v(q)\, \slashed{k}_1\, f^{abd} t^d\, u(p) \big] , \tag{5.69}
\]
that exactly cancels the unphysical gluon contribution of eq. (5.67). In other words,
the optical theorem is satisfied with only physical modes in the final state sum, thanks
to a crucial cancellation that involves ghosts.
The cancellation that occurred in the previous example is in fact general: for every
gluon loop, there is a graph of identical topology where this loop is replaced by a
ghost loop, that cancels the contribution from the unphysical gluon polarizations in the
optical theorem. However, it is difficult to turn the calculation of the previous subsec-
tion into a general proof. It turns out that this cancellation originates from a residual
symmetry of the gauge fixed Lagrangian: although the gauge fixing term explicitly
breaks the gauge symmetry, the effective Lagrangian that appears in eq. (5.21) has a
remnant of the original gauge symmetry, known as the Becchi-Rouet-Stora-Tyutin
symmetry (BRST).
Under an infinitesimal gauge transformation parameterized by θa (x), the gauge
field and fermion field vary by
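\[
\begin{aligned}
\delta A^a_\mu(x) &= -\big( D^{\rm adj}_\mu \big)_{ab}\, \theta_b(x) \\
\delta\psi(x) &= -i g\, \theta_a(x)\, t^a_r\, \psi(x) ,
\end{aligned} \tag{5.70}
\]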
Eqs. (5.71) do not tell how ghost and antighost fields transform under BRST. For
reasons that will become clear later, we shall impose that the BRST transformation
is nilpotent, i.e. that Q2BRST = 0 when applied to any of the fields of the theory. This
requirement constrains the BRST transformation of the ghosts. Indeed, a double
BRST transformation applied to fermions reads
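\[
\begin{aligned}
Q^2_{\rm BRST}\, \psi(x) &= i g \Big[ \big( Q_{\rm BRST}\, \chi_a(x) \big)\, t^a_r\, \psi(x) - \chi_a(x)\, t^a_r\, Q_{\rm BRST}\, \psi(x) \Big] \\
&= i g \big( Q_{\rm BRST}\, \chi_a(x) \big)\, t^a_r\, \psi(x) + g^2\, \chi_a(x) \chi_b(x)\, t^a_r t^b_r\, \psi(x) .
\end{aligned} \tag{5.73}
\]
(The BRST generator is an anti-commuting object, which leads to a minus sign in the
second term of the first line when we push it through the Grassmann field $\chi_a$.) Since
$\chi_a$ and $\chi_b$ anti-commute, we can replace $t^a_r t^b_r$ by $\frac{1}{2} [t^a_r, t^b_r] = \frac{i}{2} f^{abc} t^c_r$. We see that
eq. (5.73) will identically vanish provided that
\[
Q_{\rm BRST}\, \chi_a(x) = -\frac{1}{2}\, g f^{abc}\, \chi_b(x)\, \chi_c(x) . \tag{5.74}
\]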
Then, we can calculate the action of a double BRST transformation on the gauge
field,
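\[
\begin{aligned}
Q^2_{\rm BRST}\, A^a_\mu &= \big( D^{\rm adj}_\mu \big)_{ab}\, Q_{\rm BRST}\, \chi_b - g f^{abc}\, \big( Q_{\rm BRST}\, A^c_\mu \big)\, \chi_b \\
&= \big( D^{\rm adj}_\mu \big)_{ab} \Big( -\frac{g}{2}\, f^{bcd}\, \chi_c(x) \chi_d(x) \Big) - g f^{abc} \big( \partial_\mu \chi_c - g f^{cde} A^e_\mu \chi_d \big) \chi_b .
\end{aligned} \tag{5.75}
\]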
The terms linear in the gauge field cancel by using the anti-commuting nature of the
χ’s and the Jacobi identity satisfied by the structure constants:
\[
Q^2_{\rm BRST}\, \chi_a = \frac{g^2}{4}\, \underbrace{\big( f^{abc} f^{bde} + f^{acb} f^{bde} \big)}_{0}\, \chi_c \chi_d \chi_e \tag{5.78}
\]
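The two algebraic facts used in these cancellations, the antisymmetry of the structure constants and the Jacobi identity, can be checked by brute force on a concrete algebra. The sketch below uses su(2), whose structure constants are the Levi-Civita symbol; this choice of group is an illustrative assumption, since the argument in the text holds for any Lie algebra.

```python
from itertools import product

def f(a, b, c):
    # su(2) structure constants: Levi-Civita symbol on the indices 0, 1, 2
    return (a - b) * (b - c) * (c - a) // 2

def jacobi(a, c, d, e):
    # sum over b of f^{abc} f^{bde} + f^{abd} f^{bec} + f^{abe} f^{bcd}
    return sum(f(a, b, c) * f(b, d, e)
               + f(a, b, d) * f(b, e, c)
               + f(a, b, e) * f(b, c, d)
               for b in range(3))

def bracket_578(a, c, d, e):
    # Combination under the brace of eq. (5.78): sum over b of (f^{abc} + f^{acb}) f^{bde}
    return sum((f(a, b, c) + f(a, c, b)) * f(b, d, e) for b in range(3))

indices = list(product(range(3), repeat=4))
jacobi_holds = all(jacobi(*idx) == 0 for idx in indices)
bracket_vanishes = all(bracket_578(*idx) == 0 for idx in indices)
print(jacobi_holds, bracket_vanishes)
```

The Jacobi combination is the part of $f^{abc} f^{bde}$ that is totally antisymmetric in (c, d, e), which is the only part that survives when it multiplies three anti-commuting ghost fields.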
Therefore, the prescription (5.74) for the BRST transformation of a ghost field leads
to
\[
Q^2_{\rm BRST}\, \psi = 0 , \qquad Q^2_{\rm BRST}\, A^a_\mu = 0 , \qquad Q^2_{\rm BRST}\, \chi_a = 0 . \tag{5.79}
\]
We need now to specify the BRST transformation of the antighost field. Note that in
the path integral that gives the Fadeev-Popov determinant, the ghost and antighost
fields are treated as independent; therefore the BRST transformation of the antighost
does not have to be related to that of the ghost. Let us denote:
where Ba (x) is a commuting field. For QBRST to be nilpotent, we must have in addition:
where ξ is a parameter and Ga (A) is the gauge fixing function. We can write10
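\[
\begin{aligned}
Q_{\rm BRST}\, \Xi &= \big( Q_{\rm BRST}\, \bar\chi_a \big) \Big[ \frac{1}{2\xi} B^a + G^a \Big] - \bar\chi_a \Big[ \frac{1}{2\xi}\, Q_{\rm BRST} B^a + \frac{\partial G^a}{\partial A^b_\mu}\, Q_{\rm BRST} A^b_\mu \Big] \\
&= \frac{1}{2\xi} B^a B^a + B^a G^a + \underbrace{\bar\chi_a\, \frac{\partial G^a}{\partial A^b_\mu} \big( -D^{\rm adj}_\mu \big)_{bc}\, \chi_c}_{\mathcal{L}_{\rm FPG}} .
\end{aligned} \tag{5.84}
\]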
10 Note that a minus sign arises when moving QBRST through the anti-commuting field χb .
Note that the last term is nothing but the Fadeev-Popov part of the Lagrangian we
have derived earlier in this chapter. Moreover, the field Ba enters only quadratically
in this Lagrangian. Therefore, the path integral on Ba can be performed trivially11 ,
\[
\int DB^a(x)\; \exp\Big\{ i \int d^4x\, \Big( \frac{1}{2\xi} B^a B^a + B^a G^a \Big) \Big\} = \exp\Big\{ -i\, \frac{\xi}{2} \int d^4x\; G^a G^a \Big\} . \tag{5.85}
\]
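The Gaussian integration above amounts to completing the square in $B^a$. As a short worked step (the constant produced by the shifted Gaussian integral is independent of the fields and is dropped, as in the text):

```latex
\[
\frac{1}{2\xi}\, B^a B^a + B^a G^a
  = \frac{1}{2\xi}\, \big( B^a + \xi G^a \big) \big( B^a + \xi G^a \big) - \frac{\xi}{2}\, G^a G^a ,
\]
\[
\frac{\partial}{\partial B^a} \Big[ \frac{1}{2\xi}\, B^b B^b + B^b G^b \Big] = 0
  \quad\Longrightarrow\quad B^a = -\xi\, G^a .
\]
```

After the shift $B^a \to B^a - \xi G^a$, the integral over $B^a$ no longer depends on $G^a$, and what remains in the exponent is the value of the quadratic form at its stationary point, $-\frac{\xi}{2} G^a G^a$.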
Therefore, after integrating out the auxiliary field Ba , the resulting theory has exactly
the same effective Lagrangian as the one resulting from the Fadeev-Popov procedure:
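\[
\mathcal{L}_{\rm eff} = \mathcal{L}_{\rm YM} + \mathcal{L}_{\rm D} - \frac{\xi}{2}\, G^a G^a + \bar\chi_a\, \frac{\partial G^a}{\partial A^b_\mu} \big( -D^{\rm adj}_\mu \big)_{bc}\, \chi_c . \tag{5.86}
\]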
The formal construction we have followed in this section proves that Leff is BRST
invariant, but in a somewhat obfuscated manner after the auxiliary field Ba has been
integrated out. The BRST invariance of eq. (5.86) is realized if we define the BRST
variation of the antighost field as follows
\[
Q_{\rm BRST}\, \bar\chi_a = -\xi\, G^a , \tag{5.87}
\]
which is reminiscent of the relationship between $B^a$ and $G^a$ when we do the Gaussian
integration on $B^a$.
• Global gauge invariance (because all the colour indices are contracted).
• BRST invariance.
• Ghost number conservation, if we assign a ghost number +1 to $\chi$’s and −1 to
$\bar\chi$’s.
From the 0-th component of this current, we may obtain the BRST charge
\[
Q_{\rm BRST} \equiv \int d^3x\; J^0_{\rm BRST}(x^0, \mathbf{x}) . \tag{5.89}
\]
11 Note that this is equivalent to evaluating the argument of the exponential at the stationary point
In fact, this charge generates the BRST transformation in the following sense:
\[
i\, \big[ Q_{\rm BRST}, \Phi \big]_\pm = Q_{\rm BRST}\, \Phi \qquad \big( \Phi \in \{ A_\mu, \psi, \chi, \bar\chi, B \} \big) , \tag{5.90}
\]
we obtain
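\[
\begin{aligned}
\big[ Q_{\rm BRST}, a^{a\dagger}_{\lambda \mathbf{p}} \big]_\pm &\propto \delta_{\lambda +}\, \alpha^\dagger_{a \mathbf{p}} , \\
\big[ Q_{\rm BRST}, \alpha_{a \mathbf{p}} \big]_\pm &= 0 , \\
\big[ Q_{\rm BRST}, \beta_{a \mathbf{p}} \big]_\pm &\propto a^{a\dagger}_{- \mathbf{p}} , \\
\big[ Q_{\rm BRST}, b^\dagger_{s \mathbf{p}} \big]_\pm &= \big[ Q_{\rm BRST}, d^\dagger_{s \mathbf{p}} \big]_\pm = 0 .
\end{aligned} \tag{5.92}
\]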
The fact that the BRST charge is nilpotent, $Q^2_{\rm BRST} = 0$, has profound implications on
the states of the system. The kernel of $Q_{\rm BRST}$ is the set of states annihilated by $Q_{\rm BRST}$,
\[
{\rm Ker}\, Q_{\rm BRST} \equiv \big\{ |\psi\rangle \;\big|\; Q_{\rm BRST} |\psi\rangle = 0 \big\} . \tag{5.93}
\]
The set of states that can be obtained by the action of $Q_{\rm BRST}$ on another state is called
the image of $Q_{\rm BRST}$,
\[
{\rm Im}\, Q_{\rm BRST} \equiv \big\{ Q_{\rm BRST} |\psi\rangle \big\} . \tag{5.94}
\]
Note that states in the image cannot be physical states, because they have a null norm:
Consider now the following equivalence relationship between states in the kernel:
two states are considered equivalent if their difference is in the image,
\[
|\psi\rangle \sim |\psi'\rangle \quad \text{if} \quad |\psi\rangle - |\psi'\rangle \in {\rm Im}\, Q_{\rm BRST} . \tag{5.97}
\]
The cohomology of $Q_{\rm BRST}$ is the set of classes of equivalent states,
\[
H(Q_{\rm BRST}) \equiv {\rm Ker}\, Q_{\rm BRST} \, / \, {\rm Im}\, Q_{\rm BRST} . \tag{5.98}
\]
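To make the definitions of kernel, image and cohomology concrete, here is a finite-dimensional toy model (not the actual Fock space of the theory): a nilpotent matrix Q acting on a four-dimensional space, in which two basis vectors form an unphysical pair related by Q and the other two are annihilated by Q without being in its image. The matrix and the helper functions are hypothetical and serve only to illustrate the counting $\dim H = \dim {\rm Ker} - \dim {\rm Im}$.

```python
from fractions import Fraction

def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]

def rank(matrix):
    # Rank by Gauss-Jordan elimination with exact rational arithmetic
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c] / rows[r][c]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r

# Column j holds the image of the basis vector e_j.
# Toy choice: Q e1 = e2, while e0, e2 and e3 are annihilated.
Q = [[0, 0, 0, 0],
     [0, 0, 0, 0],
     [0, 1, 0, 0],
     [0, 0, 0, 0]]

is_nilpotent = matmul(Q, Q) == [[0] * 4 for _ in range(4)]
dim_image = rank(Q)                        # e2 spans the image
dim_kernel = len(Q) - dim_image            # e0, e2, e3 span the kernel
dim_cohomology = dim_kernel - dim_image    # classes of e0 and e3
print(is_nilpotent, dim_image, dim_kernel, dim_cohomology)
```

In this toy model e1 is not in the kernel and e2 is in the image, so neither contributes to the cohomology, which mirrors the way the unphysical quanta drop out in the text.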
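It turns out that the physical states are the elements of the cohomology with non-zero
norm12 . Indeed, using eqs. (5.92), it is easy to prove that if $|\psi\rangle$ is a state in the
cohomology, then
\[
\begin{aligned}
a^{a\dagger}_{\{1,2\} \mathbf{p}} |\psi\rangle &\in H(Q_{\rm BRST}) \\
b^\dagger_{s \mathbf{p}} |\psi\rangle &\in H(Q_{\rm BRST}) \\
d^\dagger_{s \mathbf{p}} |\psi\rangle &\in H(Q_{\rm BRST}) ,
\end{aligned} \tag{5.99}
\]
while
\[
\begin{aligned}
a^{a\dagger}_{\pm \mathbf{p}} |\psi\rangle &\notin H(Q_{\rm BRST}) \\
\alpha^\dagger_{\mathbf{p}} |\psi\rangle &\notin H(Q_{\rm BRST}) \\
\beta^\dagger_{\mathbf{p}} |\psi\rangle &\notin H(Q_{\rm BRST}) .
\end{aligned} \tag{5.100}
\]
In other words, adding to the state a physical particle (gluon with a physical polarization,
or quark or antiquark) gives another state in the cohomology, while adding
to the state a nonphysical quantum (gluon with a non-physical polarization, ghost or
antighost) takes the state out of the cohomology.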
12 This
restriction is necessary, because one of the classes in H QBRST is Im QBRST itself, that we
know has only zero-norm states.
Chapter 6
Renormalization of gauge theories
• n4 four-gluon vertices,
• nL loops.
The first equation states that each vertex must have all its “handles” attached to the
endpoint of a propagator, and the second equation counts the number of internal
momenta that are not determined by energy momentum conservation. In terms of
these parameters, the ultraviolet degree of divergence of this graph (in four space-time
dimensions) is
Note that each trivalent vertex contains one power of momentum and therefore
contributes +1 to this counting. Adding eq. (6.1) and four times eq. (6.2), we obtain
ω(G) = 4 − nE , (6.4)
that does not depend on any of the internal details of the graph. Moreover, the only
functions that have intrinsic ultraviolet divergences are the 2-point, 3-point and 4-point
functions, which suggests that Yang-Mills theories may indeed be renormalizable.
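The result ω(G) = 4 − n_E can be checked by brute force. The sketch below assumes the standard forms of the relations described in words above, which are not reproduced in this text: $n_E + 2 n_I = 3 n_3 + 4 n_4$ for the attachment of lines to vertices, $n_L = n_I - (n_3 + n_4) + 1$ for the number of loops of a connected graph, and $\omega = 4 n_L - 2 n_I + n_3$ for the degree of divergence, with $n_I$ the number of internal lines and $n_3$ the number of trivalent vertices (ghost-gluon vertices included). These three formulas and all variable names are assumptions of the example.

```python
def superficial_degree(n_loops, n_internal, n3):
    # 4 powers of momentum per loop, -2 per propagator, +1 per trivalent vertex
    return 4 * n_loops - 2 * n_internal + n3

checked = 0
for n3 in range(7):
    for n4 in range(5):
        for n_ext in range(9):
            twice_internal = 3 * n3 + 4 * n4 - n_ext    # assumed form of eq. (6.1)
            if twice_internal < 0 or twice_internal % 2:
                continue                                # no graph with these counts
            n_internal = twice_internal // 2
            n_loops = n_internal - (n3 + n4) + 1        # assumed form of eq. (6.2)
            if n_loops < 0:
                continue
            # The degree depends only on the number of external lines
            assert superficial_degree(n_loops, n_internal, n3) == 4 - n_ext
            checked += 1

print("configurations checked:", checked)
```

For instance, a one-loop gluon self-energy built from two three-gluon vertices has n_L = 1, n_I = 2, n_3 = 2, hence ω = 2 = 4 − n_E with n_E = 2.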
However, a Yang-Mills theory is not simply the addition of gluon and ghost kinetic
terms, 3- and 4-gluon vertices, and a ghost-antighost-gluon vertex: all these terms
of the Lagrangian are tightly constrained by gauge symmetry. For instance (but this
is not the only constraint), all the vertices depend on a unique coupling constant g.
Therefore, in order to establish the renormalizability of Yang-Mills theories, one
needs to prove that the structure of the divergences in the above listed functions is
such that they can be absorbed into a redefinition of the classical Lagrangian that does
not upset these tight constraints (up to a renormalization of the fields).
Although the local gauge invariance of the Yang-Mills Lagrangian is now broken (this
was precisely the goal of the gauge fixing procedure), this effective Lagrangian has a
number of symmetries. One of them is the BRST symmetry, that we have exhibited
in the previous chapter. In addition, Leff has the following symmetries:
For these three symmetries, the infinitesimal variation of the fields is linear in the fields
(which is not the case of the BRST symmetry). These linearly realized symmetries of
the classical action are inherited directly by the quantum effective action.
In order to prove this assertion, let us consider a generic infinitesimal
transformation of the fields
where φ1 , φ2 , · · · denote the various fields of the theory (gauge fields, ghosts, ...) and
Fn [x; φ] is a local function of the fields (for now, we do not assume that it is linear in
the fields). We assume that both the classical action and the functional measure are
invariant under this symmetry. Consider now the generating functional Z[j],
\[
Z[j] \equiv \int \big[ D\phi_n(x) \big]\; e^{\, i \left( S[\phi_n] + \int d^4x\; j_n(x) \phi_n(x) \right)} , \tag{6.8}
\]
where there is one external source jn for each field φn . Since φn (x) is a dummy
integration variable in this path integral, we should obtain the same result after
performing the change of variable (6.7). Using the fact that this transformation
preserves the measure and the classical action, this implies that
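\[
\begin{aligned}
Z[j] &= \int \big[ D\phi_n(x) \big]\; e^{\, i \left( S[\phi_n] + \int d^4x\; j_n(x) \phi_n(x) + \varepsilon \int d^4x\; j_n(x) F_n[x;\phi] \right)} \\
&\approx Z[j] + i \varepsilon \int \big[ D\phi_n(x) \big]\; e^{\, i \left( S[\phi_n] + \int d^4x\; j_n(x) \phi_n(x) \right)} \int d^4x\; j_n(x)\, F_n[x;\phi] .
\end{aligned} \tag{6.9}
\]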
now satisfied for any fields $\phi_n$. This is known as the Slavnov-Taylor identity. In
other words, the functional Γ[φ] is invariant under the transformation
\[
\phi_n(x) \to \phi_n(x) + \varepsilon\, \big\langle F_n[x;\phi] \big\rangle_{j_\phi} . \tag{6.14}
\]
It is crucial to note that, because the quantum average in the right hand side is
performed with the external source $j_{n;\phi}$ that depends implicitly on the fields $\phi_n$, this
is a priori not the same transformation as in eq. (6.7).
Let us now consider the special case of a transformation of type (6.7) which is
linear in the fields. In this case, we may write
\[
F_n[x;\phi] = \int d^4y\; f_{nm}(x,y)\, \phi_m(y) . \tag{6.15}
\]
(In most practical cases, the transformation will be local and the coefficients propor-
tional to δ(x − y), but this restriction is not necessary for the following argument.)
For such a linear transformation, we have
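\[
\big\langle F_n[x;\phi] \big\rangle_{j_\phi} = \int d^4y\; f_{nm}(x,y)\, \big\langle \phi_m(y) \big\rangle_{j_\phi} . \tag{6.16}
\]
Recalling that $j_\phi$ is the configuration of the source j such that the quantum average
$\langle \phi(x) \rangle_j$ precisely equals φ(x), this in fact reads
\[
\big\langle F_n[x;\phi] \big\rangle_{j_\phi} = F_n[x;\phi] . \tag{6.17}
\]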
It is this last step that fails when Fn is nonlinear in the fields. From eq. (6.17), we
see that the transformations (6.14) and (6.7) are identical. We have thus proven that
all linearly realized symmetries of the classical action are also symmetries of the
quantum effective action.
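The role of linearity in this argument can be illustrated with an ordinary weighted average standing in for the quantum average. In the toy sketch below, a single variable φ takes three values with given weights (all numbers are arbitrary choices); the average of a linear function equals that function of the average, while this fails for a quadratic one.

```python
from fractions import Fraction

# Toy "field": one variable with a discrete probability distribution
values = [Fraction(-1), Fraction(0), Fraction(2)]
weights = [Fraction(1, 4), Fraction(1, 4), Fraction(1, 2)]

def average(func):
    return sum(w * func(v) for w, v in zip(weights, values))

def linear(phi):
    return 3 * phi          # stands for a linear F[phi]

def quadratic(phi):
    return phi * phi        # stands for a nonlinear F[phi]

mean = average(lambda phi: phi)

linear_commutes = average(linear) == linear(mean)
quadratic_commutes = average(quadratic) == quadratic(mean)
print(linear_commutes, quadratic_commutes)
```

The mismatch in the quadratic case is the variance of the distribution, which is the analogue of the connected correlations that prevent a nonlinear symmetry such as BRST from being inherited in the same simple way.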
Since an infinitesimal BRST variation is not linear in the fields, the BRST symmetry
of the classical action is not inherited so simply by the quantum effective action.
Instead, it leads to a set of identities that may be viewed as the analogue of Ward
identities for the BRST invariance. Their derivation follows the method of the section
3.4.2. Since we need to apply a BRST transformation to the Yang-Mills path integral,
we should first study how this transformation affects the measure $[DA_\mu\, D\chi\, D\bar\chi]$.
Under such a transformation, the fields transform into
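\[
\begin{aligned}
A^a_\mu &\to A^{a\prime}_\mu \equiv A^a_\mu + \vartheta\, \big( D^{\rm adj}_\mu \big)_{ab}\, \chi_b = A^a_\mu + \vartheta\, \big( \partial_\mu \delta_{ab} + g f^{abc} A^c_\mu \big) \chi_b \\
\chi_a &\to \chi'_a \equiv \chi_a - \frac{\vartheta}{2}\, g f^{abc}\, \chi_b \chi_c \\
\bar\chi_a &\to \bar\chi'_a \equiv \bar\chi_a + \vartheta\, B^a = \bar\chi_a - \xi\, \vartheta\, G^a ,
\end{aligned} \tag{6.18}
\]
where ϑ is a Grassmann constant. The Jacobian matrix has the following block
structure:
\[
\frac{\partial \big( A^{a\prime}_\mu, \chi'_a, \bar\chi'_a \big)}{\partial \big( A^b_\nu, \chi_b, \bar\chi_b \big)} = \delta(x - y)
\begin{pmatrix}
\delta_\mu{}^\nu \big( \delta_{ab} - g \vartheta f^{abc} \chi_c \big) & * & 0 \\
0 & \delta_{ab} + g \vartheta f^{abc} \chi_c & 0 \\
-\xi \vartheta\, \dfrac{\partial G^a}{\partial A^b_\nu} & 0 & \delta_{ab}
\end{pmatrix} , \tag{6.19}
\]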
where the ∗ denotes a non-zero element that we do not need to calculate because it
does not contribute to the determinant. From this structure, we see that the determinant
is given by the product of the diagonal elements, and is therefore equal to 1 (recall
that ϑ2 = 0).
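In the derivation, it is convenient to introduce sources $j^\mu_a$, $\bar\eta_a$, $\eta_a$ that couple
respectively to $A^a_\mu$, $\chi_a$, $\bar\chi_a$, but also two extra sources that couple directly to $Q_{\rm BRST} A^a_\mu$
and $Q_{\rm BRST} \chi_a$:
\[
\begin{aligned}
Z[j, \bar\eta, \eta; \zeta, \kappa] &\equiv \int \big[ DA_\mu D\chi D\bar\chi \big] \exp\Big\{ i \int d^4x\, \Big( \mathcal{L}_{\rm eff} + j^\mu_a A^a_\mu + \bar\eta_a \chi_a + \bar\chi_a \eta_a + \zeta^\mu_a\, Q_{\rm BRST} A^a_\mu - \kappa_a\, Q_{\rm BRST} \chi_a \Big) \Big\} \\
&= \int \big[ DA_\mu D\chi D\bar\chi \big] \exp\Big\{ i \int d^4x\; \mathcal{L}_{\rm tot} \Big\} ,
\end{aligned} \tag{6.20}
\]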
where we use the shorthand Ltot for the sum of terms inside the exponential. Note
that the coefficients of the new sources ζµa and κa are BRST invariant since the
BRST transformation is nilpotent. Let us now perform a BRST transformation of the
integration variables inside the path integral. This is just a change of variables, that
does not change the value of the path integral. Using the fact that the measure and the
Lagrangian Leff are BRST invariant, we obtain
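\[
\begin{aligned}
Z[j, \bar\eta, \eta; \zeta, \kappa] &= \int \big[ DA_\mu D\chi D\bar\chi \big] \exp\Big\{ i \int d^4x\; \mathcal{L}_{\rm tot} \Big\} \\
&\qquad \times \Big[ 1 + i \int d^4x\, \Big( j^\mu_a\, \vartheta\, Q_{\rm BRST} A^a_\mu + \bar\eta_a\, \vartheta\, Q_{\rm BRST} \chi_a + \vartheta\, \big( Q_{\rm BRST} \bar\chi_a \big)\, \eta_a \Big) \Big] \\
&= Z[j, \bar\eta, \eta; \zeta, \kappa] + i\, \vartheta \int d^4x\, \Big( j^\mu_a(x)\, \frac{\delta Z}{i \delta \zeta^\mu_a(x)} + \bar\eta_a(x)\, \frac{\delta Z}{i \delta \kappa_a(x)} - \xi\, G^a\Big( \frac{\delta Z}{i \delta j(x)} \Big)\, \eta_a(x) \Big) .
\end{aligned} \tag{6.21}
\]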
This is one of the forms of the conservation identities. In this derivation, we see that
having introduced sources specifically coupled to the BRST variation of the gauge
field $A^a_\mu$ and of the ghost $\chi_a$ avoided the need for terms with higher order derivatives
(indeed, these variations are non-linear in the fields, and would have required more
derivatives to be expressed as functional derivatives with respect to sources coupled
to elementary fields). By writing Z = exp(W), we see that the same identity applies
to W,
\[
\int d^4x\, \Big( j^\mu_a(x)\, \frac{\delta W}{i \delta \zeta^\mu_a(x)} + \bar\eta_a(x)\, \frac{\delta W}{i \delta \kappa_a(x)} - \xi\, G^a\Big( \frac{\delta W}{i \delta j(x)} \Big)\, \eta_a(x) \Big) = 0 . \tag{6.23}
\]
(Here, we have assumed that the gauge fixing function is linear in the gauge field.)
The next step is to convert this into an identity for the quantum effective action
Γ that generates the 1PI graphs. In this transformation, we will keep the auxiliary
sources $\zeta^\mu_a$ and $\kappa_a$ unmodified, as parameters. Thus, Γ and W are related by
\[
-i\, W[j, \bar\eta, \eta; \zeta, \kappa] = \Gamma[A, \chi, \bar\chi; \zeta, \kappa] + \int d^4x\, \Big( j^\mu_a(x) A^a_\mu(x) + \bar\chi_a(x)\, \eta^a(x) + \bar\eta^a(x)\, \chi_a(x) \Big) . \tag{6.24}
\]
Fields and sources are related by the following quantum equations of motion:
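\[
\frac{\delta \Gamma}{\delta A^a_\mu(x)} + j^\mu_a(x) = 0 , \qquad
\frac{\delta \Gamma}{\delta \chi_a(x)} + \bar\eta_a(x) = 0 , \qquad
\frac{\delta \Gamma}{\delta \bar\chi_a(x)} + \eta_a(x) = 0 , \tag{6.25}
\]
\[
\frac{\delta W}{\delta j^\mu_a(x)} = i\, A^a_\mu(x) , \qquad
\frac{\delta W}{\delta \zeta^\mu_a(x)} = i\, \frac{\delta \Gamma}{\delta \zeta^\mu_a(x)} , \qquad
\frac{\delta W}{\delta \kappa_a(x)} = i\, \frac{\delta \Gamma}{\delta \kappa_a(x)} . \tag{6.26}
\]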
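This equation can be simplified a bit as follows. By inserting a derivative $\delta / \delta \bar\chi_a(x)$
under the integral in the definition (6.21) of Z, we obtain zero since we now have the
integral of a total derivative. Recalling that the Fadeev-Popov term in the effective
Lagrangian is
\[
\mathcal{L}_{\rm FPG} = \bar\chi_a\, \frac{\partial G^a}{\partial A^b_\mu} \big( -D^{\rm adj}_\mu \big)_{bc}\, \chi_c , \tag{6.28}
\]
\[
\eta_a(x) + i\, \frac{\partial G^a}{\partial A^b_\mu}\, \frac{\delta W}{\delta \zeta^\mu_b(x)} = 0 , \qquad
\frac{\delta \Gamma}{\delta \bar\chi_a(x)} + \frac{\partial G^a}{\partial A^b_\mu}\, \frac{\delta \Gamma}{\delta \zeta^\mu_b(x)} = 0 . \tag{6.31}
\]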
from which any explicit reference to the gauge fixing function Ga (A) has disappeared,
as well as the coupling constant g.
Eq. (6.33) applies to the full quantum effective action, that encapsulates the results
from all-order perturbation theory. In the next section, we will show that this identity
(combined with the other symmetries of the effective action) completely constrains
the structure of its local terms of dimension less than or equal to four, forcing them to
be identical to those in the classical action (up to a rescaling of the fields and of the
coupling constant).
6.3 Renormalizability
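By taking the h̄ → 0 limit in eq. (6.33), one immediately concludes that it is also
satisfied by the classical action, S, supplemented with ghosts as well as the sources
$\zeta^\mu_a$ and $\kappa_a$:
\[
S[A, \chi, \bar\chi; \zeta, \kappa] = \int d^4x\, \Big[ -\frac{1}{4} F^{\mu\nu}_a F^a_{\mu\nu} + \big( \zeta^\mu_a + \partial^\mu \bar\chi_a \big) \big( D^{\rm adj}_\mu \big)_{ab}\, \chi_b + \frac{g}{2}\, f^{abc}\, \kappa_a \chi_b \chi_c \Big] . \tag{6.34}
\]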
we therefore have
\[
\big( S, S \big) = 0 , \qquad \big( \Gamma, \Gamma \big) = 0 . \tag{6.36}
\]
The first equation may be viewed as a constraint on the terms that can appear in the
classical action, while the second equation constrains which divergences may appear
in higher orders.
Let us now write the effective action as a loop expansion,
\[
\Gamma \equiv S + \sum_{l=1}^{\infty} \Gamma_l , \tag{6.37}
\]
where S is given in eq. (6.34), and the subsequent terms $\Gamma_l$ are of order l in h̄. The
Zinn-Justin equation at order L thus reads
\[
\sum_{p+q=L} \big( \Gamma_p, \Gamma_q \big) = 0 . \tag{6.38}
\]
\[
S \to S^{(L)} , \tag{6.39}
\]
such that $S^{(L)}$ contains counterterms up to order L, and gives finite $\Gamma_l$’s for l ≤ L (but
in general not beyond the order L).
The first step is to prove that it is possible to find counterterms such that the
equation $(S, S) = 0$ is preserved at every order. Let us assume that we have achieved
this up to the order L − 1. All $\Gamma_l$ for l ≤ L − 1 are now finite, while $\Gamma_L$ still contains
a divergent part, that we denote $\Gamma_{L,{\rm div}}$. We can rewrite the Zinn-Justin equation at
order L as follows,
\[
\big( S, \Gamma_L \big) + \big( \Gamma_L, S \big) = -\sum_{l=1}^{L-1} \big( \Gamma_l, \Gamma_{L-l} \big) . \tag{6.40}
\]
Only the left hand side contains divergences, and we therefore have
\[
\big( S, \Gamma_{L,{\rm div}} \big) + \big( \Gamma_{L,{\rm div}}, S \big) = 0 , \tag{6.41}
\]
which constrains the structure of the divergences at order L. A natural candidate for
the counterterm at order L is to simply add $-\Gamma_{L,{\rm div}}$ to the classical action,
\[
S \to S - \Gamma_{L,{\rm div}} , \tag{6.42}
\]
field             $A^a_\mu$   $\chi_a$   $\bar\chi_a$   $\zeta^\mu_a$   $\kappa_a$
mass dimension        1           1            1              2              2
ghost number          0          +1           −1             −1             −2
In addition, eq. (6.31) implies that the $\bar\chi$ and ζ dependences come in the form of a
dependence on the combination
\[
\zeta^\mu - \bar\chi\, \frac{\partial G}{\partial A_\mu} = \zeta^\mu + \partial^\mu \bar\chi , \tag{6.44}
\]
where in the right hand side we have assumed the covariant gauge condition $G(A) = \partial^\mu A_\mu$
and anticipated an integration by parts. Finally, the Zinn-Justin equation
$(S, S) = 0$ must be satisfied.
Since the sources $\zeta^\mu_a$ and $\kappa_a$ have mass dimension 2, at most two of them may
appear. However, terms with two such sources cannot contain any other field since
the mass dimension 4 is already reached, and they cannot have ghost number zero.
Therefore, S can only contain terms that have degree 0 or 1 in $\zeta^\mu_a$ and $\kappa_a$.
The source $\zeta^\mu_a$ must be combined with another combination of fields that have
one Lorentz index, one colour index, mass dimension at most 2, and ghost number
+1. The only operators that fulfill these conditions are
\[
f^{abc}\, \zeta^\mu_a A^b_\mu \chi_c \qquad \text{and} \qquad \zeta^\mu_a\, \partial_\mu \chi_a . \tag{6.45}
\]
Once the dependence on $\zeta^\mu_a$ is fixed, the dependence on the antighosts will be completely
known from eq. (6.44). Likewise, $\kappa_a$ must be combined with an object that
has one colour index, mass dimension at most 2 and ghost number +2. The only
possibility is
\[
f^{abc}\, \kappa_a \chi_b \chi_c . \tag{6.46}
\]
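The dimension and ghost-number bookkeeping used in this argument is mechanical, and can be sketched as follows. The dictionary encodes the table of mass dimensions and ghost numbers given above, plus a derivative counted as dimension 1 and ghost number 0; the labels (`"A"`, `"chi"`, `"d"`, ...) are hypothetical names chosen for the example.

```python
from itertools import combinations_with_replacement

# (mass dimension, ghost number); "d" stands for a derivative
QUANTUM_NUMBERS = {
    "A": (1, 0),
    "chi": (1, +1),
    "chibar": (1, -1),
    "zeta": (2, -1),
    "kappa": (2, -2),
    "d": (1, 0),
}

def totals(factors):
    dimension = sum(QUANTUM_NUMBERS[name][0] for name in factors)
    ghost_number = sum(QUANTUM_NUMBERS[name][1] for name in factors)
    return dimension, ghost_number

# Operators of eqs. (6.45) and (6.46): dimension 4 and ghost number 0
allowed = [("zeta", "A", "chi"), ("zeta", "d", "chi"), ("kappa", "chi", "chi")]
allowed_totals = [totals(op) for op in allowed]

# Products of two sources already have dimension 4 but a non-zero ghost number
two_sources = {pair: totals(pair)
               for pair in combinations_with_replacement(("zeta", "kappa"), 2)}

print(allowed_totals)
print(two_sources)
```

This only checks the counting; Lorentz and colour index contractions, which select the specific operators listed in the text, are not modelled here.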
From the information gathered so far, the classical action must have the following
general form:
\[
S[A, \chi, \bar\chi; \zeta, \kappa] = \Sigma[A] + \int d^4x\, \Big[ g\alpha\, f^{abc} \big( \zeta^\mu_a + \partial^\mu \bar\chi_a \big) A^b_\mu \chi_c + \beta\, \big( \zeta^\mu_a + \partial^\mu \bar\chi_a \big)\, \partial_\mu \chi_a + \frac{\gamma}{2}\, f^{abc}\, \kappa_a \chi_b \chi_c \Big] , \tag{6.47}
\]
where α, β, γ are three arbitrary constants. The term Σ cannot depend on the sources
$\zeta^\mu_a$ and $\kappa_a$ because we have already constructed explicitly all the allowed terms that
contain these sources, and cannot depend on $\bar\chi$ because the antighost dependence is
already encapsulated in the combination $\zeta^\mu_a + \partial^\mu \bar\chi_a$. A dependence on $\chi$ in Σ is also
forbidden because $\chi$ would be the only field in Σ with a non-zero ghost number. Our
next step is to constrain the coefficients gα, β, γ and the functional Σ[A] in order to
satisfy the Zinn-Justin equation (6.33). The functional derivatives that enter in (6.33)
are given by:
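\[
\begin{aligned}
\frac{\delta S}{\delta A^\mu_a} &= \frac{\delta \Sigma}{\delta A^\mu_a} - g\alpha\, f^{abc} \big( \zeta_{\mu b} + \partial_\mu \bar\chi_b \big) \chi_c , \\
\frac{\delta S}{\delta \zeta_{\mu a}} &= g\alpha\, f^{ade} A^\mu_d \chi_e + \beta\, \partial^\mu \chi_a , \\
\frac{\delta S}{\delta \chi_a} &= g\alpha\, f^{abc} \big( \zeta_{\mu b} + \partial_\mu \bar\chi_b \big) A^\mu_c + \beta\, \big( \zeta_{\mu a} + \partial_\mu \bar\chi_a \big)\, \partial^\mu + \gamma\, f^{abc}\, \kappa_b \chi_c , \\
\frac{\delta S}{\delta \kappa_a} &= \frac{\gamma}{2}\, f^{ade}\, \chi_d \chi_e .
\end{aligned} \tag{6.48}
\]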
Using the Jacobi identity satisfied by the structure constants, one may first check
that the last term, in κχχχ, is identically zero, and therefore does not provide any
constraint. Consider now the terms in ζAχχ:
\[
g\alpha\, \zeta_{\mu b} \Big[ -g\alpha\, f^{abc} f^{ade}\, A^\mu_d \chi_c \chi_e + \frac{\gamma}{2}\, \underbrace{f^{abc} f^{ade}}_{-f^{abd} f^{aec} - f^{abe} f^{acd}}\, A^\mu_c \chi_d \chi_e \Big]
\]
Since this is the only term containing this combination of fields, it cannot be canceled
by other terms, and therefore we must have
gα = γ . (6.51)
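Thus, the cancellation of this term does not bring any additional constraint beyond
eq. (6.51). At this point, all the terms containing $\zeta^\mu_a$ have been canceled (and by
extension also the terms with $\partial^\mu \bar\chi_a$), and the Zinn-Justin equation reduces to
\[
0 = \int d^4x\; \frac{\delta \Sigma}{\delta A^\mu_a} \Big[ g\alpha\, f^{acb} A^\mu_c \chi_b + \beta\, \partial^\mu \chi_a \Big] . \tag{6.53}
\]
and note that it has the structure of an adjoint covariant derivative acting on $\chi_b$,
\[
\big( D^\mu_{\rm adj} \big)_{ab} \equiv \partial^\mu \delta_{ab} - i\, g\alpha \beta^{-1} \big( A^\mu_{\rm adj} \big)_{ab} . \tag{6.55}
\]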
where we have introduced a constant Grassmann variable ϑ to make the second term
a commuting object. Therefore, for the integral to be zero for an arbitrary χb (x), the
functional Σ[A] must be invariant under this transformation. Recalling our discussion
of the local gauge invariant operators of mass dimension four or less, we conclude
that the only possible form for Σ is
$$
\Sigma[A] = -\frac{\delta}{4}\int d^4x\; F^{\mu\nu}_a F^a_{\mu\nu}\,, \tag{6.58}
$$
where $F^{\mu\nu}$ is the field strength constructed with the covariant derivative $D^\mu$ and δ
another constant. Given all the above constraints, we must have
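$$
S[A,\chi,\bar\chi;\zeta,\kappa] = \int d^4x\,\Big[ -\frac{\delta}{4}\, F^{\mu\nu}_a F^a_{\mu\nu}
+ \beta\,\big(\zeta^a_\mu + \partial_\mu\bar\chi_a\big)\,\big(D^\mu_{\rm adj}\big)_{ab}\,\chi_b
+ \frac{g\alpha}{2}\, f^{abc}\,\kappa_a\chi_b\chi_c \Big]\,. \tag{6.59}
$$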
Up to rescalings of the various fields and of the coupling constant g, this is structurally
identical to the bare classical action of eq. (6.34). Note that this equation implies
that the field renormalization factors for the gauge field $A^\mu_a$ and for the source $\kappa_a$ are
equal, $Z_A = Z_\kappa$.
In this section, we describe the calculation of the one-loop quantum corrections to the
coupling constant by a method based on the quantum effective action combined with
the so-called background field method.
The first step of this method is to rescale the gauge field by the inverse of the
coupling constant:
g Aµ → Aµ . (6.60)
By doing this, the various objects that appear in the Yang-Mills action are transformed
as follows:
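$$
\begin{aligned}
F^{\mu\nu} &\to \frac{1}{g}\,\big( \partial^\mu A^\nu - \partial^\nu A^\mu - i\,[A^\mu, A^\nu] \big)\,,\\
D^\mu &\to \partial^\mu - i\, A^\mu\,.
\end{aligned}
\tag{6.61}
$$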
In other words, up to a rescaling in the case of the field strength Fµν , these objects
are transformed into their counterparts for a coupling equal to unity. In the rest of this
section, the notation Aµ , Dµ , Fµν will refer to the rescaled quantities. In terms of the
rescaled fields, the Yang-Mills action simply reads
$$
S_{\rm YM} = -\frac{1}{4g^2}\int d^4x\; \underbrace{F^{\mu\nu}_a F^a_{\mu\nu}}_{\text{no } g}\,, \tag{6.62}
$$
where all the dependence on the coupling constant appears now in the prefactor $g^{-2}$.
This action has a local non-Abelian gauge invariance analogous to the original one,
but with g = 1:
$$
A_\mu \to A^\Omega_\mu \equiv \Omega^\dagger A_\mu\, \Omega + i\, \Omega^\dagger \partial_\mu \Omega\,. \tag{6.63}
$$
$$
a^\mu_a \to a^\mu_a - \big(D^\mu_{\rm adj}\big)_{ab}\,\theta_b + f^{abc}\,\theta_b\, a^\mu_c\,. \tag{6.68}
$$
This invariance leads to the same pathologies as in the original theory, and we
must fix the gauge in order to have a well defined path integral. The background field
gauge corresponds to the following condition on $a^a_\mu$,
$$
G^a(A) \equiv \big(D^\mu_{\rm adj}\big)_{ab}\, a^b_\mu = \omega^a\,. \tag{6.69}
$$
Let us recall that a gauge fixing function Ga (A) leads to the following terms in the
effective Lagrangian:
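$$
\begin{aligned}
\mathcal{L}_{\rm GF} &= -\frac{\xi}{2g^2}\, G^a(A)\, G^a(A) && \text{(gauge fixing term)}\\
\mathcal{L}_{\rm FPG} &= -\bar\chi_a\, \frac{\partial G^a}{\partial A^b_\mu}\, \big(D^{\rm adj}_\mu\big)_{bc}\, \chi_c && \text{(Faddeev-Popov ghosts)}\,.
\end{aligned}
\tag{6.70}
$$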
where in the second equality we have anticipated an integration by parts and used the
notation $(D^{\rm adj}_\mu \chi)_a \equiv (D^{\rm adj}_\mu)_{ab}\,\chi_b$ (and a similar notation for $(D^\mu_{\rm adj}\bar\chi)_a$).
Aµ → Ω† Aµ Ω + i Ω† ∂µ Ω ,
Aµ → Ω† Aµ Ω + i Ω† ∂µ Ω . (6.72)
aµ → Ω† aµ Ω ,
Dµ → Ω† Dµ Ω ,
Dµ → Ω† Dµ Ω ,
χ → Ω† χ ,
χ → χΩ ,
G(A) → Ω† G(A) Ω . (6.73)
From this, we conclude that the gauge fixing Lagrangian $\mathcal{L}_{\rm GF}$ and the Faddeev-Popov
Lagrangian $\mathcal{L}_{\rm FPG}$ are both invariant under this transformation, as is the Yang-Mills
Lagrangian. Since the path integration measure over $a_\mu$, $\chi$, $\bar\chi$ is also invariant under
this transformation, the result of the path integral must be invariant under local gauge
transformations of the background field Aµ .
Let us now turn to the calculation of the quantum effective action at one loop. For
this, we use the results of section 2.6.5, where we have shown that these one-loop
corrections are obtained by expanding the classical action to quadratic order
in deviations with respect to a background field, and by performing the resulting
Gaussian path integration with respect to the deviations (which gives a functional
determinant).
The first step is to expand the three terms of the gauge fixed Lagrangian to second
order in the deviation aµ . In this calculation, we choose the gauge fixing parameter
ξ = 1. The quadratic terms in the combined Yang-Mills and gauge fixing terms read
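$$
\begin{aligned}
\mathcal{L}_{\rm YM} + \mathcal{L}_{\rm GF}
&= -\frac{1}{2g^2}\,\Big\{ \frac{1}{2}\Big[ (D^\mu_{\rm adj} a^\nu)_a - (D^\nu_{\rm adj} a^\mu)_a \Big]^2
+ f^{abc}\, F^{\mu\nu}_a\, a^b_\mu a^c_\nu + \Big[ (D^\mu_{\rm adj} a_\mu)_a \Big]^2 \Big\}\\
&= -\frac{1}{2g^2}\; a^a_\mu\, \Big[ -\big(D^2_{\rm adj}\big)_{ac}\, g^{\mu\nu} - 2 f^{abc}\, F^{\mu\nu}_b \Big]\, a^c_\nu\\
&= -\frac{1}{2g^2}\; a^a_\mu\, \Big[ -\big(D^2_{\rm adj}\big)_{ac}\, g^{\mu\nu} + \big(F^{\rho\sigma}_{\rm adj}\big)_{ac}\, \big(M^{(1)}_{\rho\sigma}\big)^{\mu\nu} \Big]\, a^c_\nu\,,
\end{aligned}
\tag{6.74}
$$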
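where we have introduced $(M^{(1)}_{\rho\sigma})^{\mu\nu} \equiv i(\delta_\rho{}^\mu \delta_\sigma{}^\nu - \delta_\rho{}^\nu \delta_\sigma{}^\mu)$, the generators of the
Lorentz transformations for 4-vectors (the Lorentz transformation corresponding to
the transformation parameters $\omega^{\rho\sigma}$ reads $\Lambda^{\mu\nu} = \exp\big(\tfrac{i}{2}\,\omega^{\rho\sigma} (M^{(1)}_{\rho\sigma})^{\mu\nu}\big)$). For the
ghost term, the quadratic part is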
$$
\mathcal{L}_{\rm FPG} = \bar\chi_a\, \Big[ -\big(D^2_{\rm adj}\big)_{ab} \Big]\, \chi_b\,. \tag{6.75}
$$
Note that the operator that appears between the two ghost fields is the spin-0 analogue
of the one that appears in eq. (6.74), since the generators of Lorentz transformations
for spin-0 objects are identically zero ($M^{(0)}_{\rho\sigma} \equiv 0$). Although we have not considered
fermions so far in this chapter, the Dirac Lagrangian would give a contribution equal
to the determinant of $i\slashed{D}$, or equivalently the square root of the determinant of $(i\slashed{D})^2$.
Note that
$$
\big(i\slashed{D}\big)^2 = -D^2 + i\,\frac{i}{2}\,[\gamma^\mu, \gamma^\nu]\, D_\mu D_\nu
= -D^2 + F^{\rho\sigma}\, M^{(1/2)}_{\rho\sigma}\,, \tag{6.76}
$$
where the $M^{(1/2)}_{\rho\sigma} \equiv \frac{i}{4}[\gamma_\rho, \gamma_\sigma]$ are the generators of Lorentz transformations for
spin-1/2 fields. Here, the covariant derivatives and the field strength are in
the fundamental representation (assuming fermions that transform according to the
fundamental representation, like quarks). Therefore, for each of the fields that appear
in the quantum effective action (gauge fields, ghosts, fermions), we get a determinant
$\Delta_{r,s}$ of an operator containing $-D^2$ (in the representation r corresponding to the field
under consideration) plus a “spin connection”² made of the contraction of the field
strength with the Lorentz generators corresponding to the spin s of the field:
$$
\begin{aligned}
\text{gauge fields}: \quad & \Delta_{{\rm adj},s=1} \equiv \det\Big( -D^2_{\rm adj} + F^{\rho\sigma}_{\rm adj}\, M^{(1)}_{\rho\sigma} \Big)\\
\text{ghosts}: \quad & \Delta_{{\rm adj},s=0} \equiv \det\Big( -D^2_{\rm adj} + \underbrace{F^{\rho\sigma}_{\rm adj}\, M^{(0)}_{\rho\sigma}}_{=0} \Big)\\
\text{fermions}: \quad & \Delta_{f,s=1/2} \equiv \det\Big( -D^2_f + F^{\rho\sigma}_f\, M^{(1/2)}_{\rho\sigma} \Big)\,.
\end{aligned}
\tag{6.77}
$$
$$
C_{r,s} = c_{r,s}\, \ln\frac{\Lambda^2}{\kappa^2}\,, \tag{6.80}
$$
where Λ is an ultraviolet scale and κ the typical scale of inhomogeneities of the background
field. After combining them with the counterterms from ∆S, the ultraviolet
2 This term describes the coupling between the magnetic moment of the particle and the background
$$
C_{r,s} \to C_{r,s} = c_{r,s}\, \ln\frac{\mu^2}{\kappa^2}\,. \tag{6.81}
$$
From eq. (6.78), we see that the 1-loop renormalized coupling at the scale µ and the
bare coupling must be related by
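$$
\begin{aligned}
\frac{1}{g_b^2} &= \frac{1}{g_r^2(\mu)} + \frac{1}{2}\, C_{{\rm adj},1} - \frac{n_f}{2}\, C_{f,1/2} - C_{{\rm adj},0}\\
&= \frac{1}{g_r^2(\mu)} + \Big( \frac{1}{2}\, c_{{\rm adj},1} - \frac{n_f}{2}\, c_{f,1/2} - c_{{\rm adj},0} \Big) \ln\frac{\mu^2}{\kappa^2}\,.
\end{aligned}
\tag{6.82}
$$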
The explicit calculation of the constants $c_{r,s}$ requires expanding the logarithm of the
functional determinants to second order in the background field strength $F^{\mu\nu}$. Thanks
to the organization of eqs. (6.77), this calculation needs to be performed only once,
for generic gauge group and Lorentz representations. This leads to
$$
c_{r,s} = \frac{1}{(4\pi)^2}\, \Big[ \frac{1}{3}\, d(s) - 4\, C(s) \Big]\, N(r)\,, \tag{6.83}
$$
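where d(s) is the number of spin components (respectively 1, 4, 4 for scalars, fermions,
and vector particles), C(s) is the normalization of the trace of two Lorentz
generators³,
$$
{\rm tr}\,\big( M^{(s)}_{\rho\sigma}\, M^{(s)}_{\alpha\beta} \big) = C(s)\, \big( g_{\rho\alpha}\, g_{\sigma\beta} - g_{\rho\beta}\, g_{\sigma\alpha} \big)\,, \tag{6.84}
$$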
and N(r) is the normalization of the trace of two generators of the Lie algebra in
representation r,
$$
{\rm tr}\,\big( t^a_r\, t^b_r \big) = N(r)\, \delta_{ab}\,. \tag{6.85}
$$
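The normalization (6.84) can be checked explicitly for the vector representation. The sketch below assumes the mixed-index form of the spin-1 generators, $(M_{\rho\sigma})^\mu{}_\nu = i(\delta_\rho{}^\mu g_{\sigma\nu} - \delta_\sigma{}^\mu g_{\rho\nu})$, obtained from the definition quoted earlier by lowering one index with a metric of signature (+, −, −, −); all names are illustrative.

```python
from itertools import product

# Minkowski metric with signature (+, -, -, -) (assumed convention).
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def generator(rho, sigma):
    """Matrix (M_{rho sigma})^mu_nu = i (delta_rho^mu g_{sigma nu} - delta_sigma^mu g_{rho nu})."""
    return [[1j * ((rho == mu) * G[sigma][nu] - (sigma == mu) * G[rho][nu])
             for nu in range(4)] for mu in range(4)]

def trace_of_product(a, b):
    """tr(a b) for two 4x4 matrices stored as nested lists."""
    return sum(a[mu][nu] * b[nu][mu] for mu in range(4) for nu in range(4))

def max_deviation(c_s):
    """Largest violation of tr(M_{rs} M_{ab}) = c_s (g_ra g_sb - g_rb g_sa)."""
    worst = 0.0
    for r, s, a, b in product(range(4), repeat=4):
        lhs = trace_of_product(generator(r, s), generator(a, b))
        rhs = c_s * (G[r][a] * G[s][b] - G[r][b] * G[s][a])
        worst = max(worst, abs(lhs - rhs))
    return worst

# The value C(1) = 2 quoted in the footnote should give a vanishing deviation.
deviation = max_deviation(2)
```

Calling `max_deviation` with any other constant gives a non-zero result, which is a simple way to see that the normalization is fixed once the generators are.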
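For the fundamental and adjoint representations of su(N), we have N(f) = 1/2 and
N(adj) = N. Therefore, the constants involved in the 1-loop running coupling are
$$
c_{{\rm adj},0} = \frac{N}{3(4\pi)^2}\,, \qquad c_{{\rm adj},1} = -\frac{20\, N}{3(4\pi)^2}\,, \qquad c_{f,1/2} = -\frac{4}{3(4\pi)^2}\,, \tag{6.86}
$$
$$
\frac{1}{g_r^2(\mu)} = \frac{1}{g_b^2} + \frac{1}{(4\pi)^2}\, \underbrace{\Big( \frac{11}{3}\, N - \frac{2}{3}\, n_f \Big)}_{>0 \text{ for } n_f \le \frac{11 N}{2}}\, \ln\frac{\mu^2}{\kappa^2}\,. \tag{6.87}
$$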
3 For spin-0, 1/2 and 1, this constant is respectively 0, 1 and 2.
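The arithmetic that leads from eq. (6.83) to eqs. (6.86) and (6.87) is easy to reproduce with exact fractions. This is a minimal sketch; the dictionary and function names are illustrative, and the overall factor $(4\pi)^2$ is stripped from all quantities.

```python
from fractions import Fraction

# Inputs quoted in the text: number of spin components d(s) and Lorentz trace normalization C(s).
D_SPIN = {"0": 1, "1/2": 4, "1": 4}
C_SPIN = {"0": 0, "1/2": 1, "1": 2}

def group_norm(rep, n_colors):
    """N(r) for su(N): 1/2 for the fundamental ("f"), N for the adjoint ("adj")."""
    return Fraction(1, 2) if rep == "f" else Fraction(n_colors)

def c_coeff(rep, spin, n_colors):
    """(4 pi)^2 * c_{r,s}, following eq. (6.83)."""
    return (Fraction(D_SPIN[spin], 3) - 4 * C_SPIN[spin]) * group_norm(rep, n_colors)

def log_coefficient(n_colors, n_flavors):
    """(4 pi)^2 times the coefficient of ln(mu^2/kappa^2) in 1/g_r^2(mu), as in eq. (6.87)."""
    combo = (Fraction(1, 2) * c_coeff("adj", "1", n_colors)
             - Fraction(n_flavors, 2) * c_coeff("f", "1/2", n_colors)
             - c_coeff("adj", "0", n_colors))
    return -combo  # the sign flips when eq. (6.82) is solved for 1/g_r^2(mu)

qcd_coefficient = log_coefficient(3, 6)  # SU(3) with 6 flavours
```

For any N and $n_f$, `log_coefficient` returns $\frac{11}{3}N - \frac{2}{3}n_f$, so that the sign of the running is controlled by the competition between the gauge and ghost loops on one side and the fermion loops on the other.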
Given two scales µ and µ0 , the renormalized couplings at these scales are related by
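$$
\frac{1}{g_r^2(\mu)} - \frac{1}{g_r^2(\mu_0)} = \frac{1}{(4\pi)^2}\, \Big( \frac{11}{3}\, N - \frac{2}{3}\, n_f \Big)\, \ln\frac{\mu^2}{\mu_0^2}\,, \tag{6.88}
$$
or equivalently
$$
g_r^2(\mu) = \frac{g_r^2(\mu_0)}{1 + \dfrac{g_r^2(\mu_0)}{(4\pi)^2}\, \Big( \dfrac{11}{3}\, N - \dfrac{2}{3}\, n_f \Big)\, \ln\dfrac{\mu^2}{\mu_0^2}}\,. \tag{6.89}
$$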
In quantum chromodynamics, where the gauge group is SU(3) (i.e. N = 3) and where
there are 6 flavours of quarks in the fundamental representation, the coefficient in
front of the logarithm is positive, which indicates that the coupling constant decreases
as the scale µ increases. The coupling constant in fact goes to zero when µ → ∞, a
property known as asymptotic freedom. Thanks to the formula (6.83), it would have
been easy to determine the one-loop running of the coupling in the presence of matter
fields in arbitrary representations.
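The one-loop running of eq. (6.89) is simple to evaluate numerically. The following sketch uses illustrative input values (a coupling $g^2 = 1.5$ at a reference scale of 100 GeV) that are assumptions made for the example, not numbers taken from the text.

```python
import math

def beta0(n_colors, n_flavors):
    """One-loop coefficient 11 N / 3 - 2 n_f / 3 appearing in eq. (6.88)."""
    return 11.0 * n_colors / 3.0 - 2.0 * n_flavors / 3.0

def run_g2(g2_ref, mu_ref, mu, n_colors=3, n_flavors=6):
    """One-loop g_r^2(mu) obtained from g_r^2(mu_ref), following eq. (6.89)."""
    log_ratio = math.log(mu**2 / mu_ref**2)
    return g2_ref / (1.0 + g2_ref * beta0(n_colors, n_flavors) * log_ratio / (4.0 * math.pi) ** 2)

# Illustrative numbers only (assumption): g^2 = 1.5 at mu_ref = 100 GeV.
for scale in (100.0, 1.0e3, 1.0e4):
    print(scale, run_g2(1.5, 100.0, scale))
```

Because $1/g_r^2$ is linear in $\ln\mu^2$, running from $\mu_0$ to $\mu_1$ and then from $\mu_1$ to $\mu_2$ gives the same result as running directly from $\mu_0$ to $\mu_2$, and the printed values decrease monotonically as the scale grows.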
Chapter 7
Renormalization group
In quantum field theory, the renormalization group refers to a set of tools for
investigating the changes of a system when observed at varying distance scales, akin
to varying the magnifying power of a microscope in order to uncover new features
that were not visible at lesser resolution scales. For renormalizable theories, such a
change of scale merely amounts to a change in a few parameters of the theory (masses,
coupling, field normalization), but the use of the renormalization group is not limited
to this class of theories, as we shall discuss in the last section.
where Π(p) is the self-energy and $\Gamma^{(4)}$ the 1-particle irreducible 4-point function.
There is a large amount of freedom in the choice of the renormalization conditions.
Two sets of renormalization conditions may correspond to the same physical theory
provided that the bare Green’s functions, expressed in terms of the bare parameters
of the Lagrangian, are identical. Indeed, the renormalization scale M appears only
when we replace the bare field $\phi_b$ by the renormalized field $\phi_r \equiv \phi_b/\sqrt{Z}$ and the
bare coupling constant $\lambda_b$ by the renormalized coupling constant $\lambda_r$. The bare and
renormalized Green’s functions are related by
$$
G_r^{(n)}(x_1, \cdots, x_n) = Z^{-n/2}\, G_b^{(n)}(x_1, \cdots, x_n)\,. \tag{7.2}
$$
In order to have the same physical theory, we must change Z and λ when varying the
renormalization scale M. With such a variation of the scale M, we can write:
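$$
\frac{dG_r^{(n)}}{dM} = \frac{\partial G_r^{(n)}}{\partial M} + \frac{\partial G_r^{(n)}}{\partial\lambda}\, \frac{\partial\lambda}{\partial M}\,. \tag{7.3}
$$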
On the other hand, we may obtain this derivative from the right hand side of eq. (7.2)
and the fact that bare Green’s functions must remain unchanged:
$$
\frac{dG_r^{(n)}}{dM} = -\frac{n}{2Z}\, \frac{\partial Z}{\partial M}\, G_r^{(n)}\,. \tag{7.4}
$$
Combining the previous two results, we obtain
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + n\gamma \Big]\, G_r^{(n)} = 0\,, \tag{7.5}
$$
where we have defined
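$$
\beta \equiv M\frac{\partial\lambda}{\partial M}\,, \qquad \gamma \equiv \frac{M}{2Z}\, \frac{\partial Z}{\partial M}\,. \tag{7.6}
$$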
$$
G_r^{(n)}(\cdots; M) = U(M, M_0)\; G_r^{(n)}(\cdots; M_0)\,, \tag{7.7}
$$
where the evolution operator U(M, M0) is a Green’s function of the operator between
the square brackets in the left hand side of eq. (7.5). A 1-dimensional group structure
can be attached to this evolution by noting that
In other words, a finite rescaling can be broken down into several smaller rescalings
without affecting the final result.
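This composition property can be illustrated on the flow of the coupling itself. The sketch below integrates $d\lambda/d\ln M = \beta(\lambda)$ with a fourth-order Runge-Kutta scheme and compares a direct evolution with a two-step evolution. The β function used, $\beta = 3\lambda^2/(16\pi^2)$, is an assumed one-loop form chosen only for illustration; it is not taken from this section.

```python
import math

def flow(beta, lam0, m0, m1, steps=2000):
    """Integrate d(lambda)/d(ln M) = beta(lambda) from M = m0 to M = m1 with RK4."""
    h = math.log(m1 / m0) / steps
    lam = lam0
    for _ in range(steps):
        k1 = beta(lam)
        k2 = beta(lam + 0.5 * h * k1)
        k3 = beta(lam + 0.5 * h * k2)
        k4 = beta(lam + h * k3)
        lam += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6.0
    return lam

def beta_example(lam):
    """Hypothetical one-loop beta function b * lambda^2 (illustrative choice of b)."""
    b = 3.0 / (16.0 * math.pi**2)
    return b * lam**2

# Evolve from M0 = 1 to M = 100 directly, and via the intermediate scale M = 10.
direct = flow(beta_example, 0.5, 1.0, 100.0)
two_steps = flow(beta_example, flow(beta_example, 0.5, 1.0, 10.0), 10.0, 100.0)
```

Up to the integration error, `direct` and `two_steps` agree: the evolution between two scales depends only on the endpoints, not on how the interval is subdivided.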
where the first term in the right hand side is the tree-level vertex, the second and
third terms are respectively the 1PI vertex correction and the associated counterterm.
The fifth and sixth terms are the self-energy corrections on the external lines and the
corresponding counterterms. Up to one-loop, this equation can be written as follows:
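$$
\begin{aligned}
G_r^{(4)}(p_1, \cdots, p_4) = \Big( \prod_i \frac{i}{p_i^2} \Big)\, \Big[ & -i\lambda_b\\
& + \Gamma_b^{(4)} - i\,\delta_\lambda\\
& - i\lambda_b \sum_i \frac{1}{p_i^2}\, \big( \Gamma_b^{(2)}(p_i) - p_i^2\, \delta_Z^i \big) \Big]\,.
\end{aligned}
\tag{7.11}
$$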
In this equation, the first line is the tree-level 4-point function, the second line contains
the one-loop 1PI vertex correction and the vertex counterterm (necessary in order to
fulfill the renormalization condition for the vertex at the scale M), and the last line
is the sum of the 1-loop corrections on the external lines (the counterterms δiZ are
determined by the normalization condition of the propagator at the scale M). The
dependence of this renormalized Green’s function on the renormalization scale M
arises from the counterterms δλ and δiZ . By applying the Callan-Symanzik equation
to this Green’s function, we obtain at leading order
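$$
M\frac{\partial}{\partial M}\Big( \delta_\lambda - \lambda \sum_i \delta_Z^i \Big) + \beta + \frac{\lambda}{2} \sum_i M\frac{\partial \delta_Z^i}{\partial M} = 0\,, \tag{7.12}
$$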
where we have replaced the anomalous dimensions γi attached to the external lines
by their expression given by eq. (7.9) in terms of the corresponding counterterms δiZ .
Therefore, we obtain the following formula for the β function:
$$
\beta = M\frac{\partial}{\partial M}\Big( -\delta_\lambda + \frac{\lambda}{2} \sum_i \delta_Z^i \Big)\,. \tag{7.13}
$$
7.1.2 Solution for the 2-point function $G_r^{(2)}$
In a massless theory, we may always parameterize the 2-point function as follows:
$$
G_r^{(2)}(p) = \frac{i}{p^2}\; g(-p^2/M^2)\,, \tag{7.14}
$$
where $g(-p^2/M^2)$ is a function so far arbitrary. Since the M dependence arises
solely from the ratio $-p^2/M^2$, we can rewrite the derivative with respect to M in the
Callan-Symanzik equation in the form of a derivative with respect to p:
$$
\Big[ p\frac{\partial}{\partial p} - \beta\, \frac{\partial}{\partial\lambda} + 2 - 2\gamma \Big]\, G_r^{(2)}(p) = 0\,. \tag{7.15}
$$
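The step from the Callan-Symanzik equation (7.5) with n = 2 to eq. (7.15) can be spelled out as follows; this is a short worked sketch in which the shorthand $u \equiv -p^2/M^2$ is introduced only for the purpose of the derivation.

```latex
\begin{align*}
M\frac{\partial}{\partial M}\, g(u) &= -2u\, g'(u) = -p\frac{\partial}{\partial p}\, g(u)\,,
\qquad u \equiv -\frac{p^2}{M^2}\,,\\
p\frac{\partial}{\partial p}\, G_r^{(2)} &= -2\, G_r^{(2)} + \frac{i}{p^2}\, p\frac{\partial g}{\partial p}
\quad\Longrightarrow\quad
M\frac{\partial}{\partial M}\, G_r^{(2)} = -\Big( p\frac{\partial}{\partial p} + 2 \Big)\, G_r^{(2)}\,,\\
0 &= \Big[ M\frac{\partial}{\partial M} + \beta\frac{\partial}{\partial\lambda} + 2\gamma \Big]\, G_r^{(2)}
= -\Big[ p\frac{\partial}{\partial p} - \beta\frac{\partial}{\partial\lambda} + 2 - 2\gamma \Big]\, G_r^{(2)}\,.
\end{align*}
```

The extra term "+2" thus comes from the explicit factor $1/p^2$ in the parameterization (7.14).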
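In order to solve this equation, let us introduce a function $\bar\lambda(p, \lambda)$ defined by:
$$
\frac{d\bar\lambda(p, \lambda)}{d\ln(p/M)} = \beta(\bar\lambda)\,, \qquad \bar\lambda(M, \lambda) = \lambda\,. \tag{7.16}
$$
In other words, $\bar\lambda$ is the running coupling constant that takes the value λ at the
momentum scale M. We can then write the solution of the Callan-Symanzik equation
in the following form:
$$
G_r^{(2)}(p) = \frac{i}{p^2}\; \mathcal{G}(\bar\lambda(p, \lambda))\; \exp\Big( 2 \int_M^p \frac{dp'}{p'}\; \gamma(\bar\lambda(p', \lambda)) \Big)\,, \tag{7.17}
$$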
where $\mathcal{G}(\bar\lambda(p, \lambda))$ is an arbitrary function that cannot be determined from the
renormalization group equations¹. This function must be determined order by order from
perturbative calculations. In the case of the 2-point function, we have $\mathcal{G}(\bar\lambda(p, \lambda)) = 1 + O(\lambda)$.
The exponential in eq. (7.17) is the cumulative field renormalization between the
scales M and p. In particular, for a constant anomalous dimension, this factor is
$(p/M)^{2\gamma}$, and we see that it alters the power law dependence of the propagator with
respect to momentum, changing a power −2 into −2 + 2γ.
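The statement about a constant anomalous dimension can be verified by evaluating the exponential of eq. (7.17) numerically. In this sketch the value of γ and the scales are arbitrary illustrative choices, and the integral is computed with a midpoint rule in the variable $t = \ln p'$.

```python
import math

def field_factor(gamma_of_p, m_scale, p, steps=20000):
    """exp(2 * integral_M^p dp'/p' gamma(p')), using a midpoint rule in t = ln p'."""
    t0, t1 = math.log(m_scale), math.log(p)
    h = (t1 - t0) / steps
    integral = sum(gamma_of_p(math.exp(t0 + (k + 0.5) * h)) for k in range(steps)) * h
    return math.exp(2.0 * integral)

gamma_const = 0.05  # illustrative constant anomalous dimension (assumption)
numeric = field_factor(lambda p: gamma_const, 1.0, 50.0)
closed_form = 50.0 ** (2.0 * gamma_const)  # (p/M)^(2 gamma) with p/M = 50
```

The same function accepts a scale-dependent `gamma_of_p`, which is how a running anomalous dimension $\gamma(\bar\lambda(p'))$ would enter once the flow of the coupling is known.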
the same space-time point. Similarly to the case of elementary operators, we must
introduce a renormalization factor $Z_O$, determined order by order in perturbation theory
in order to fulfill a certain renormalization condition at the scale M. The renormalized
operator $O_r$ is related to the bare operator $O_b$ by the relationship $O_r = O_b/Z_O$. Let us
consider now a renormalized correlation function involving a composite operator O
and n elementary fields:
$$
G_r^{(n;1)}(x_1, \cdots, x_n; y) \equiv \big\langle \phi(x_1) \cdots \phi(x_n)\, O(y) \big\rangle\,. \tag{7.18}
$$
By requesting that the bare correlation function remains unchanged upon changes
of the renormalization scale M, we obtain the following equation satisfied by the
renormalized correlation function
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + n\gamma + \gamma_O \Big]\, G_r^{(n;1)} = 0\,, \tag{7.20}
$$
where we have defined the anomalous dimension of the composite operator O as
follows
$$
\gamma_O \equiv \frac{M}{Z_O}\, \frac{\partial Z_O}{\partial M}\,. \tag{7.21}
$$
where δO is the counterterm that one must adjust in order to satisfy the renormalization
condition of the operator O at the scale M.
$$
J^\mu \equiv \bar\psi\, \gamma^\mu\, \psi\,. \tag{7.23}
$$
where the Oi ’s are arbitrary local operators, not necessarily renormalizable in four
dimensions. The Callan-Symanzik equation for a correlator containing n elementary
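fields φ and an arbitrary number of these new interaction terms reads:
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + n\gamma + \sum_i \gamma_i\, c_i \frac{\partial}{\partial c_i} \Big]\, G_r^{(n)} = 0\,. \tag{7.26}
$$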
In this equation, $\gamma_i$ is the anomalous dimension of the operator $O_i$ and the operator
$c_i\, \partial/\partial c_i$ counts the number of occurrences of $O_i$ inside the function $G_r^{(n)}$. If $d_i$ is the
dimension of the operator $O_i$ (in mass units), it is convenient to define a dimensionless
coupling constant $\rho_i$ by the following relation,
$$
c_i \equiv \rho_i\, M^{4-d_i}\,. \tag{7.27}
$$
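Thanks to this definition, the previous Callan-Symanzik equation becomes:
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + n\gamma + \sum_i \beta_i\, \frac{\partial}{\partial\rho_i} \Big]\, G_r^{(n)} = 0\,, \tag{7.28}
$$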
where we denote $\beta_i \equiv \rho_i\, (\gamma_i + d_i - 4)$. With these notations, we see that the
additional couplings $\rho_i$ play exactly the same role as the original coupling λ. We can
therefore mimic the explicit solution found in the case of the two-point function in
section 7.1.2. Let us first introduce running couplings $\bar\lambda$, $\bar\rho_i$, as solutions of the
following differential equations
$$
\begin{aligned}
\frac{d\bar\lambda(p, \lambda)}{d\ln(p/M)} &= \beta(\bar\lambda, \bar\rho_i)\,, & \bar\lambda(M, \lambda) &= \lambda\,,\\
\frac{d\bar\rho_i(p, \rho_i)}{d\ln(p/M)} &= \beta_i(\bar\lambda, \bar\rho_i)\,, & \bar\rho_i(M, \rho_i) &= \rho_i\,.
\end{aligned}
\tag{7.29}
$$
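To get a feeling for these flow equations, one can solve the equation for a single coupling $\rho_i$ in the simplified situation where $\gamma_i$ is treated as a constant. This is an assumption made for the sketch: in general $\gamma_i$ depends on the running couplings. With $\beta_i = \rho_i(\gamma_i + d_i - 4)$ the solution is then a pure power law; the function name is illustrative.

```python
def rho_bar(rho_at_M, dim, p_over_M, gamma=0.0):
    """Solution of d(rho)/d ln(p/M) = rho * (gamma + dim - 4) for a constant gamma."""
    return rho_at_M * p_over_M ** (gamma + dim - 4)

# A dimension-6 operator and a dimension-4 operator, both with rho = 1 at the scale M,
# followed down to the lower momentum scale p = M / 10.
suppressed = rho_bar(1.0, 6, 0.1)
marginal = rho_bar(1.0, 4, 0.1)
```

In this approximation, the coupling of an operator of dimension $d_i > 4$ decreases as $(p/M)^{d_i - 4}$ when the momentum scale is lowered, while a dimension-4 coupling is unaffected at this order.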
In the weak coupling limit, the functions βi are given at lowest order by
7.3.1 Introduction
The operator product expansion (OPE) is a tool that allows one to study the
renormalization flow at the level of the operators themselves, instead of encapsulating them
inside a correlator (although the derivation still requires that we consider a correlator).
The intuitive idea is that a non-local product of operators may be approximated by a
local composite operator when the separations between the original operators go to
zero, possibly with a numerical prefactor that depends on the separation between the
operators in the original product. However, since limits of operators are difficult to
handle, it is convenient to consider a weaker form of limit, in which the product of
operators under consideration is encapsulated into a correlation function of the form
$$
G_{12}^{(n)}(x; y_1, \cdots, y_n) \equiv \big\langle A_1(x)\, A_2(0)\, \phi(y_1) \cdots \phi(y_n) \big\rangle\,, \tag{7.32}
$$
where A1 and A2 are local operators, and φ an elementary field. Let us consider a
limit where the coordinates yi are fixed, while x → 0. We can already note that, since
the product of operators at the same point is ill-defined in general, we may expect
divergences in this limit.
It turns out that the behaviour of $G_{12}^{(n)}$ when x → 0 is entirely determined by the
operators A1 and A2 themselves, in a way that does not depend on the other fields
φ(yi ) (provided they are kept at a finite distance from the points 0 and x). In order
to determine this behaviour, Wilson proposed to expand the product A1 (x)A2 (0) as
a sum of composite local operators, with x dependent coefficients:
$$
A_1(x)\, A_2(0) = \sum_i C^i_{12}(x)\; O_i(0)\,, \tag{7.33}
$$
where the $O_i$ are a basis of composite local operators that have the same quantum
numbers as the product $A_1 A_2$. All the x dependence is carried by the Wilson
coefficients $C^i_{12}(x)$. This decomposition can then be used in any correlation function
where the product $A_1(x) A_2(0)$ appears. For instance, the correlation $G_{12}^{(n)}$ introduced
at the beginning of this section would read
$$
G_{12}^{(n)}(x; y_1, \cdots, y_n) = \sum_i C^i_{12}(x)\; G_i^{(n)}(y_1, \cdots, y_n)\,, \tag{7.34}
$$
where we denote
$$
G_i^{(n)}(y_1, \cdots, y_n) \equiv \big\langle O_i(0)\, \phi(y_1) \cdots \phi(y_n) \big\rangle\,. \tag{7.35}
$$
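where $\gamma$, $\gamma_{A_1}$ and $\gamma_{A_2}$ are the anomalous dimensions of the operators φ, $A_1$ and $A_2$,
respectively. Concerning the correlation functions $G_i^{(n)}$ that enter in the right hand
side of eq. (7.34), we have the following equations:
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + n\gamma + \gamma_i \Big]\, G_i^{(n)} = 0\,, \tag{7.37}
$$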
where $\gamma_i$ is the anomalous dimension of $O_i$. The left and right hand sides
of eq. (7.34) are consistent provided that the coefficients $C^i_{12}$ obey the following
equation:
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + \gamma_{A_1} + \gamma_{A_2} - \gamma_i \Big]\, C^i_{12} = 0\,. \tag{7.38}
$$
2 In the rest of this chapter, we do not write explicitly the subscript r to indicate the renormalized
quantities, in order to simplify the notations. From the context, it is always clear when a quantity is
renormalized.
This equation confirms a posteriori the fact that the coefficients $C^i_{12}$ must depend on
the renormalization scale M. Moreover, we see that this dependence is controlled only by
the anomalous dimensions of the operators $A_1$, $A_2$ and $O_i$, and not by the specific
correlation function $G_{12}^{(n)}$ that was used in the derivation (in particular, eq. (7.38) does
not depend on the number n of fields φ, nor on their anomalous dimension). It is this
property that renders the operator product expansion universal.
$$
C^i_{12}(x; M) \equiv \frac{1}{|x|^{D_1+D_2-d_i}}\; \widetilde{C}^i_{12}(M|x|)\,, \tag{7.39}
$$
where $\widetilde{C}^i_{12}(M|x|)$ is a dimensionless function of the sole variable M|x|. One can
determine this function similarly to the case of the 2-point function considered in
section 7.1.2, by introducing the running coupling $\bar\lambda(1/|x|)$. We obtain the following
structure for the coefficient $C^i_{12}$:
$$
C^i_{12}(x; M) = \frac{\mathcal{C}^i_{12}(\bar\lambda(1/|x|))}{|x|^{D_1+D_2-d_i}}\;
\exp\Big( \int_{1/|x|}^{M} \frac{dp'}{p'}\; \big( \gamma_i(\bar\lambda(p')) - \gamma_{A_1}(\bar\lambda(p')) - \gamma_{A_2}(\bar\lambda(p')) \big) \Big)\,,
\tag{7.40}
$$
where $\mathcal{C}^i_{12}$ is a function of the running coupling that can be obtained by a matching
to perturbative calculations. We see that the leading short distance behaviour is
controlled by the prefactor $|x|^{d_i-D_1-D_2}$, which becomes singular if $d_i < D_1 + D_2$.
Moreover, the contribution of the operators $O_i$ whose dimension obeys $d_i > D_1 + D_2$
goes to zero when x → 0. One does not need to consider such operators in the OPE
when studying the short distance limit.
In asymptotically free theories where the coupling goes to zero at short distance,
such as QCD, we may carry a bit further the determination of the Wilson coefficients.
Indeed, at the first order of perturbation theory, the anomalous dimensions are
proportional to $g^2$, and we may write the anomalous dimension of any operator O as
follows:
$$
\gamma_O \equiv -a_O\, \frac{g^2}{(4\pi)^2}\,, \tag{7.41}
$$
where $a_O$ is a numerical constant (the minus sign is conventional). Therefore, we
have
$$
\gamma_i - \gamma_{A_1} - \gamma_{A_2} = \frac{\alpha_s}{4\pi}\, \big( a_{A_1} + a_{A_2} - a_i \big)\,, \tag{7.42}
$$
with $\alpha_s \equiv g^2/4\pi$. At one loop, the running coupling $\alpha_s$ is given by:
$$
\frac{\alpha_s(Q^2)}{4\pi} = \frac{1}{\beta_0\, \ln\dfrac{Q^2}{\Lambda^2_{\rm QCD}}}\,, \tag{7.43}
$$
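where $\beta_0$ is the first Taylor coefficient of the QCD β function. From this, we get
$$
C^i_{12}(x; M) = \frac{\mathcal{C}^i_{12}(g(1/|x|))}{|x|^{D_1+D_2-d_i}}\;
\left[ \frac{\ln\big(1/(|x|^2 \Lambda^2_{\rm QCD})\big)}{\ln\big(M^2/\Lambda^2_{\rm QCD}\big)} \right]^{\frac{a_i - a_{A_1} - a_{A_2}}{2\beta_0}}\,. \tag{7.44}
$$
We see that, besides the trivial power law prefactor in $|x|^{d_i-D_1-D_2}$, there are
corrections in the form of powers of logarithms that may be large when x → 0. When
$d_i = D_1 + D_2$, these logarithms are in fact the main source of |x| dependence.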
It may happen that several of the operators $O_i$ that enter in the OPE basis for the
product $A_1(x) A_2(0)$ mix under the evolution of the scale M. This means that the
anomalous dimensions $\gamma_i$ are in fact a matrix $\gamma_{ij}$ (when there is no mixing, this
matrix is diagonal and the $\gamma_i$’s that we have used so far are its diagonal elements) and
the Callan-Symanzik equations for the correlators $G_i^{(n)}$ are coupled:
$$
\sum_j \Big[ \delta_{ij}\, \Big( M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + n\gamma \Big) + \gamma_{ij} \Big]\, G_j^{(n)} = 0\,. \tag{7.45}
$$
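The equation for $G_{12}^{(n)}$ is unchanged, and we obtain the following equation for the
Wilson coefficients
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} + \gamma_{A_1} + \gamma_{A_2} \Big]\, C^j_{12} - \sum_i \gamma_{ij}\, C^i_{12} = 0\,. \tag{7.46}
$$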
Note that when the operators $A_1$ and $A_2$ are conserved currents, their anomalous
dimensions are zero, and this equation simplifies into
$$
\Big[ M\frac{\partial}{\partial M} + \beta\, \frac{\partial}{\partial\lambda} \Big]\, C^j_{12} - \sum_i \gamma_{ij}\, C^i_{12} = 0\,. \tag{7.47}
$$
This situation turns out to be quite frequent in applications of the OPE.
In order to illustrate the use of the operator product expansion on a concrete case, let
us consider the weak interactions between quarks and leptons. In the standard model,
the interactions between charged currents take the following form:
$$
\mathcal{L}_I = \frac{g^2}{2}\; J_L^\mu(0)\; D_{\mu\nu}(0, x)\; J_L^{\nu\dagger}(x) + \text{h.c.}\,, \tag{7.48}
$$
where $J_L^\mu$ is the left handed charged current (containing a leptonic term and a term
due to quarks) and $D_{\mu\nu}(0, x)$ is the propagator of the W ± boson between the points
0 and x.
At low energy, we may neglect the momentum carried by the W ± boson propagator
compared to the W ± mass. In this approximation, the propagator becomes
momentum independent, and its Fourier transform is proportional to δ(x). We may
then replace the non-local interaction term of eq. (7.48) by a 4-fermion (local) contact
interaction, which is nothing but the interaction term of Fermi’s theory. The prefactor
of this interaction term, $g^2/2M_W^2$, is usually denoted $4G_F/\sqrt{2}$ where $G_F$ is Fermi’s
constant:
$$
\mathcal{L}_{\rm int} \approx \frac{4 G_F}{\sqrt{2}}\; J_L^\mu(0)\; J^\dagger_{L\mu}(0) + \text{h.c.}\,. \tag{7.49}
$$
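The identification $g^2/2M_W^2 = 4G_F/\sqrt{2}$ can be turned into a quick numerical estimate of Fermi's constant. The input values below ($g \approx 0.65$, $M_W \approx 80.4$ GeV) are approximate figures supplied for the sketch, not numbers quoted in the text.

```python
import math

def fermi_constant(g, m_w):
    """G_F obtained by solving g^2 / (2 M_W^2) = 4 G_F / sqrt(2) for G_F."""
    return math.sqrt(2.0) * g**2 / (8.0 * m_w**2)

# Approximate inputs (assumptions): weak coupling g ~ 0.65, W mass ~ 80.4 GeV.
g_f = fermi_constant(0.65, 80.4)  # in GeV^-2
```

With these inputs the result is of the order of $10^{-5}\ {\rm GeV}^{-2}$: the smallness of $G_F$ in natural units reflects the large value of $M_W$ rather than a small coupling g.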
Thanks to the operator product expansion, one may study in greater detail the limit
from the electroweak theory to Fermi’s theory, i.e. the process by which one replaces
the non-local product of two currents by one or more local interaction terms. This
example will also illustrate how this decomposition in local operators depends on the
energy scale of the processes under consideration, by including the strong interaction
corrections at one loop.
Let us discuss first two trivial cases regarding the effect of QCD corrections
at one loop. Firstly, purely leptonic weak interactions are not affected by strong
interactions at this order since leptons do not couple directly to gluons (but QCD
corrections do exist at two loops and beyond). The other simple case is that of
semi-leptonic weak interactions, involving a leptonic current and a current made of
quarks. Indeed, the leptonic current is not renormalized by strong interactions. The
quark current, conserved at leading order, is also not affected by strong interactions
since its anomalous dimension is zero. Finally, a gluon cannot connect the lepton
and the quark currents. Thus, semi-leptonic weak interactions are not affected by
QCD corrections at one loop. The only non-trivial case, to which we will devote
the rest of this section, is that of weak interactions between quark currents, i.e. the
non-leptonic weak interactions. As an example, let us consider the QCD corrections
to the weak decay of the strange quark, which in Fermi’s theory comes from the
following coupling: $(\bar d_L \gamma^\mu u_L)\,(\bar u_L \gamma_\mu s_L)$.
$$
A_1^\mu \equiv \bar d_L\, \gamma^\mu\, u_L\,, \qquad A_2^\mu \equiv \bar u_L\, \gamma^\mu\, s_L\,. \tag{7.50}
$$
When going from the standard model to Fermi’s theory, the non-local dependence
of the W ± propagator is captured by the Wilson coefficients $C^i_{12}(x)$. Therefore,
the typical separation x is $x \sim M_W^{-1}$ (since the mass $M_W$ is the only dimensionful
parameter in the propagator). On the other hand, the scale M characteristic of Kaon
decays is of the order of the mass of a Kaon, around 500 MeV. The simplest operators
on which we may expand the product $A_1(x) A_2(0)$ are the following:
$$
\begin{aligned}
O_1 &\equiv (\bar d_L\, \gamma^\mu\, u_L)\,(\bar u_L\, \gamma_\mu\, s_L)\,,\\
O_2 &\equiv (\bar d_L\, \gamma^\mu\, s_L)\,(\bar u_L\, \gamma_\mu\, u_L)\,,
\end{aligned}
\tag{7.51}
$$
where in the second one two quark operators of different flavours have been
interchanged. Note that the mass dimension of the operators $A_1$ and $A_2$ is 3, while that
of $O_1$ and $O_2$ is 6. Therefore, we have $d_{A_1} + d_{A_2} - d_i = 0$, which means that the
x dependence of the Wilson coefficients comes entirely from the logarithms in the
expression (7.44). The more complicated operators that may enter in this expansion
all have a larger mass dimension, so that $d_{A_1} + d_{A_2} - d_i < 0$. Thanks to the
prefactor in eq. (7.44), the corresponding Wilson coefficients are very small since
$M|x| \sim M/M_W \ll 1$. Thus, one can restrict the OPE of $A_1(x) A_2(0)$ to the sole
operators $O_1$ and $O_2$ when applied to the physics of Kaon decays.
$\gamma_2$, for the operators $A_1$, $A_2$, $O_1$ and $O_2$. Since $A_1$ and $A_2$ are conserved at the first
order, their anomalous dimension is zero:
$$
\gamma_{A_1} = \gamma_{A_2} = 0\,. \tag{7.52}
$$
In order to obtain the anomalous dimensions of the operators $O_1$ and $O_2$, let us
introduce the following graphical representation for these operators:
[Eq. (7.53): diagrams representing O1 and O2 as four-quark vertices, with external lines labeled u, d, u, s; not reproduced here.]
This representation renders explicit the fact that these operators are products of two
currents. Thanks to eq. (7.22), the anomalous dimension of these operators is obtained
by calculating the vertex counterterm and the counterterms associated to the external
lines. All the order-$g^2$ strong interaction corrections are listed in figure 7.1 in
the case of $O_1$. The contributions to $\gamma_1$ of the first three diagrams on the first line
cancel, because their sum gives the anomalous dimension of a conserved (at first
order) current. The same conclusion holds for the remaining three graphs of the first
line. Thus, we need only consider the diagrams of the second line. In Feynman
gauge, the expression of the first diagram of the second line is given by:
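$$
\begin{aligned}
\text{(diagram)} = (-ig)^2 \int \frac{d^D k}{(2\pi)^D}\; \frac{-i}{k^2}\;
& \Big[ \bar d_L\, \gamma^\mu\, \frac{i \slashed{k}}{(k+p)^2}\, t^a_f\, \gamma^\lambda\, u_L \Big]\\
\times\, & \Big[ \bar u_L\, \gamma_\lambda\, t^a_f\, \frac{i \slashed{k}}{(k-q)^2}\, \gamma_\mu\, s_L \Big]\,,
\end{aligned}
\tag{7.54}
$$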
where p and q are the (incoming) momenta carried by the quark lines to which the
gluon is attached. The $t^a_f$ are the generators of the fundamental representation of the
su(3) algebra, to which the quarks belong. In the numerator, some terms in $\slashed{p}$ and $\slashed{q}$ have
been dropped because they do not contribute to the ultraviolet divergence of the graph.
The integral over k can be rewritten as follows³:
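$$
\begin{aligned}
\int \frac{d^D k}{(2\pi)^D}\; \frac{k^\nu k^{\nu'}}{k^2\, (k+p)^2\, (k-q)^2}
&= \frac{g^{\nu\nu'}}{d} \int \frac{d^D k}{(2\pi)^D}\; \frac{1}{(k+p)^2\, (k-q)^2}\\
&= \frac{g^{\nu\nu'}}{d} \int_0^1 dx \int \frac{d^D k}{(2\pi)^D}\; \frac{1}{(k^2 + \Delta)^2}\\
&= i\, \frac{g^{\nu\nu'}}{d} \int_0^1 dx\; \frac{\Gamma(2 - \frac{D}{2})}{(4\pi)^{D/2}}\; \frac{1}{\Delta^{2-D/2}}\,,
\end{aligned}
\tag{7.55}
$$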
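$$
\text{(diagram)} = \frac{g^2}{4}\; \frac{\Gamma(2 - \frac{D}{2})}{(4\pi)^2}\; \frac{1}{M^{4-D}}\;
\Big[ \bar d_L\, \gamma^\mu \gamma^\nu\, t^a_f\, \gamma^\lambda\, u_L \Big]\, \Big[ \bar u_L\, \gamma_\lambda\, t^a_f\, \gamma_\nu \gamma_\mu\, s_L \Big]\,. \tag{7.56}
$$
The contribution of this graph to the counterterm for the normalization of $O_1$ is given
by the opposite of this result.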
In order to simplify the combination of spinors, Dirac and colour matrices that
appear in the result of eq. (7.56), it is useful to use the chiral representation (also
known as Weyl’s representation) since only the left handed component of the spinors
enter in this expression. In this representation, the Dirac matrices are given by
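$$
\gamma^\mu = \begin{pmatrix} 0 & \sigma^\mu \\ \bar\sigma^\mu & 0 \end{pmatrix}\,, \qquad
\gamma^5 = \begin{pmatrix} -1 & 0 \\ 0 & 1 \end{pmatrix}\,, \tag{7.57}
$$
with $\sigma^\mu \equiv (1, \boldsymbol{\sigma})$ and $\bar\sigma^\mu \equiv (1, -\boldsymbol{\sigma})$ where $\boldsymbol{\sigma}$ is a vector made of the three Pauli
matrices. In this representation, the left handed projector $P_L \equiv (1 - \gamma^5)/2$ and the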
3 The first equality disregards some terms that are ultraviolet finite.
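so that any 4-component spinor can be viewed as two 2-component spinors, one of
which is right handed and the other one left handed:
$$
\psi = \begin{pmatrix} \psi_L \\ \psi_R \end{pmatrix}\,. \tag{7.59}
$$
$$
\bar d_L\, \gamma^\mu \gamma^\nu \gamma^\lambda\, t^a_f\, u_L = d_L^\dagger\, \bar\sigma^\mu \sigma^\nu \bar\sigma^\lambda\, t^a_f\, u_L\,. \tag{7.60}
$$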
This equation contains a small abuse of notation, since it contains the 4-component
spinors $(\psi_L, 0)$ in the left hand side, while the right hand side contains only the
2-component left handed spinors $\psi_L$.
In order to reduce the combination of spinors that appear in eq. (7.56), we need
to simplify the products $(\sigma^\mu)_{\alpha\beta} (\sigma_\mu)_{\gamma\delta}$ and $(\bar\sigma^\mu)_{\alpha\beta} (\bar\sigma_\mu)_{\gamma\delta}$ as well as $(t^a_f)_{ij} (t^a_f)_{kl}$.
In both cases, this can be done by using the Fierz identity for the generators of the
fundamental representation of the su(n) algebra, introduced in section 4.1.6. Let
us recall this identity here:
$$
(t^a_f)_{ij}\, (t^a_f)_{kl} = \frac{1}{2}\, \Big[ \delta_{il}\, \delta_{jk} - \frac{1}{n}\, \delta_{ij}\, \delta_{kl} \Big]\,. \tag{7.61}
$$
For the contraction of colour matrices $t^a_f$, we can apply it directly with n = 3:
$$
(t^a_f)_{ij}\, (t^a_f)_{kl} = \frac{1}{2}\, \Big[ \delta_{il}\, \delta_{jk} - \frac{1}{3}\, \delta_{ij}\, \delta_{kl} \Big]\,. \tag{7.62}
$$
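The su(3) Fierz identity can be verified by brute force. The sketch below assumes the standard Gell-Mann matrices $\lambda^a$ as an explicit basis, with $t^a_f = \lambda^a/2$; this explicit basis is an assumption of the example (the identity itself is basis independent), and the helper names are illustrative.

```python
import math
from itertools import product

def gell_mann():
    """The eight Gell-Mann matrices as nested lists of complex numbers."""
    def mat(entries):
        m = [[0j] * 3 for _ in range(3)]
        for (i, j), v in entries.items():
            m[i][j] = complex(v)
        return m
    s = 1.0 / math.sqrt(3.0)
    return [
        mat({(0, 1): 1, (1, 0): 1}),
        mat({(0, 1): -1j, (1, 0): 1j}),
        mat({(0, 0): 1, (1, 1): -1}),
        mat({(0, 2): 1, (2, 0): 1}),
        mat({(0, 2): -1j, (2, 0): 1j}),
        mat({(1, 2): 1, (2, 1): 1}),
        mat({(1, 2): -1j, (2, 1): 1j}),
        mat({(0, 0): s, (1, 1): s, (2, 2): -2 * s}),
    ]

def fierz_deviation(generators, n):
    """Largest violation of sum_a t^a_ij t^a_kl = (1/2)(d_il d_jk - d_ij d_kl / n)."""
    worst = 0.0
    for i, j, k, l in product(range(n), repeat=4):
        lhs = sum(t[i][j] * t[k][l] for t in generators)
        rhs = 0.5 * ((i == l) * (j == k) - (i == j) * (k == l) / n)
        worst = max(worst, abs(lhs - rhs))
    return worst

# Fundamental generators t^a = lambda^a / 2.
t_f = [[[x / 2 for x in row] for row in lam] for lam in gell_mann()]
deviation = fierz_deviation(t_f, 3)
```

A deviation compatible with zero (up to rounding) confirms eq. (7.62) component by component for all 81 index combinations.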
For the contraction of the $\sigma^\mu$ or the $\bar\sigma^\mu$, let us recall that the Pauli matrices $\sigma^i$ are
related to the su(2) fundamental generators $\tau^i$ by
$$
\sigma^i = 2\, \tau^i\,. \tag{7.63}
$$
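Using this relation and the Fierz identity for the fundamental representation of su(2),
we obtain:
$$
\begin{aligned}
(\sigma^\mu)_{\alpha\beta}\, (\sigma_\mu)_{\gamma\delta} = (\bar\sigma^\mu)_{\alpha\beta}\, (\bar\sigma_\mu)_{\gamma\delta}
&= \delta_{\alpha\beta}\, \delta_{\gamma\delta} - 4\, (\tau^i)_{\alpha\beta}\, (\tau^i)_{\gamma\delta}\\
&= \delta_{\alpha\beta}\, \delta_{\gamma\delta} - 2\, \Big[ \delta_{\alpha\delta}\, \delta_{\beta\gamma} - \frac{1}{2}\, \delta_{\alpha\beta}\, \delta_{\gamma\delta} \Big]\\
&= 2\, \big[ \delta_{\alpha\beta}\, \delta_{\gamma\delta} - \delta_{\alpha\delta}\, \delta_{\beta\gamma} \big]\,.
\end{aligned}
\tag{7.64}
$$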
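Eq. (7.64) involves only 2×2 matrices, so it can be checked exhaustively over the 16 index combinations. The sketch below assumes the metric signature (+, −, −, −) for lowering the index µ; the names are illustrative.

```python
from itertools import product

SIGMA = [
    [[1, 0], [0, 1]],        # sigma^0 = identity
    [[0, 1], [1, 0]],        # Pauli matrix sigma^1
    [[0, -1j], [1j, 0]],     # Pauli matrix sigma^2
    [[1, 0], [0, -1]],       # Pauli matrix sigma^3
]
METRIC = [1, -1, -1, -1]     # diagonal of g_{mu nu} (assumed signature)

def contraction(a, b, c, d, bar=False):
    """(sigma^mu)_{ab} (sigma_mu)_{cd}, or the same with sigma-bar if bar=True."""
    total = 0
    for mu in range(4):
        sign = -1 if (bar and mu > 0) else 1   # sigma-bar^mu = (1, -sigma)
        total += METRIC[mu] * (sign * SIGMA[mu][a][b]) * (sign * SIGMA[mu][c][d])
    return total

def check_identity(bar=False):
    """True if the contraction equals 2 (d_ab d_cd - d_ad d_bc) for all indices."""
    return all(
        abs(contraction(a, b, c, d, bar) - 2 * ((a == b) * (c == d) - (a == d) * (b == c))) < 1e-12
        for a, b, c, d in product(range(2), repeat=4)
    )

sigma_ok = check_identity(bar=False)
sigma_bar_ok = check_identity(bar=True)
```

The barred and unbarred contractions coincide because the sign flip of the spatial components appears twice in each term.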
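$$
\begin{aligned}
\text{(diagram)} = (-ig)^2 \int \frac{d^D k}{(2\pi)^D}\; \frac{-i}{k^2}\;
& \Big[ \bar d_L\, \gamma^\mu\, \frac{i \slashed{k}}{(k+p)^2}\, t^a_f\, \gamma^\lambda\, u_L \Big]\\
\times\, & \Big[ \bar u_L\, \gamma_\mu\, \frac{-i \slashed{k}}{(k-r)^2}\, t^a_f\, \gamma_\lambda\, s_L \Big]\,,
\end{aligned}
\tag{7.66}
$$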
where r is the momentum that flows into the diagram by the line carrying the s quark.
The integration over k is similar to the previous case, and leads to
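$$
\text{(diagram)} = -\frac{g^2}{4}\; \frac{\Gamma(2 - \frac{D}{2})}{(4\pi)^2}\; \frac{1}{M^{4-D}}\;
\Big[ \bar d_L\, \gamma^\mu \gamma^\nu\, t^a_f\, \gamma^\lambda\, u_L \Big]\, \Big[ \bar u_L\, \gamma_\mu \gamma_\nu\, t^a_f\, \gamma_\lambda\, s_L \Big]\,.
\tag{7.67}
$$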
Likewise, we can simplify the Dirac and colour matrices by using Fierz identities:
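$$
\Big[ \bar d_L\, \gamma^\mu \gamma^\nu\, t^a_f\, \gamma^\lambda\, u_L \Big]\, \Big[ \bar u_L\, \gamma_\mu \gamma_\nu\, t^a_f\, \gamma_\lambda\, s_L \Big]
= 8\, (\bar u_L\, \gamma^\mu\, u_L)\,(\bar d_L\, \gamma_\mu\, s_L) - \frac{8}{3}\, (\bar d_L\, \gamma^\mu\, u_L)\,(\bar u_L\, \gamma_\mu\, s_L)\,, \tag{7.68}
$$
which is again a linear combination of $O_1$ and $O_2$. The last diagram gives the same
result.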
By combining the four contributions, we obtain the following form for the operator
O1 , renormalized at the scale M, in terms of the bare operators:
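$$
\delta_{11} \equiv \frac{g^2}{(4\pi)^2}\; \frac{\Gamma(2 - \frac{D}{2})}{M^{4-D}}\,, \qquad
\delta_{12} \equiv -3\, \frac{g^2}{(4\pi)^2}\; \frac{\Gamma(2 - \frac{D}{2})}{M^{4-D}}\,. \tag{7.70}
$$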
By calculating in the same way the one-loop corrections to the operator O2 , we obtain
the counterterms δ22 and δ21 , that are equal to
Because of the mixing, the anomalous dimensions for the operators O1,2 form a
non-diagonal matrix
$$
\gamma_{ij} = M\frac{\partial \delta_{ij}}{\partial M} = \frac{g^2}{(4\pi)^2} \begin{pmatrix} -2 & 6 \\ 6 & -2 \end{pmatrix}\,. \tag{7.72}
$$
In order to solve the coupled Callan-Symanzik equations (7.47), we must find a basis
of operators in which the matrix of anomalous dimensions becomes diagonal. This is
achieved by choosing5 :
O_{1/2} \equiv \frac{1}{2}\left[O_1 - O_2\right] ,
O_{3/2} \equiv \frac{1}{2}\left[O_1 + O_2\right] .   (7.73)
\gamma_{1/2} = -8\, \frac{g^2}{(4\pi)^2} ,\qquad \gamma_{3/2} = 4\, \frac{g^2}{(4\pi)^2} .   (7.74)
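One can check directly that the combinations of eq. (7.73) diagonalize the matrix of eq. (7.72), with the eigenvalues of eq. (7.74). In this small sketch the matrix is expressed in units of g^2/(4π)^2, and the helper names are illustrative.

```python
# Anomalous dimension matrix of eq. (7.72), in units of g^2 / (4 pi)^2.
GAMMA = [[-2.0, 6.0], [6.0, -2.0]]

def apply(matrix, vec):
    """Matrix-vector product for small dense matrices."""
    return [sum(row[k] * vec[k] for k in range(len(vec))) for row in matrix]

def eigenvalue_along(matrix, vec):
    """Return lambda such that matrix.vec = lambda vec, or fail if vec is not an eigenvector."""
    image = apply(matrix, vec)
    ratios = [image[k] / vec[k] for k in range(len(vec)) if vec[k] != 0]
    assert all(abs(r - ratios[0]) < 1e-12 for r in ratios), "not an eigenvector"
    return ratios[0]

# Components of O_{1/2} = (O_1 - O_2)/2 and O_{3/2} = (O_1 + O_2)/2 on (O_1, O_2).
GAMMA_HALF = eigenvalue_along(GAMMA, [0.5, -0.5])
GAMMA_THREE_HALVES = eigenvalue_along(GAMMA, [0.5, 0.5])
```

Because the matrix is symmetric, it does not matter here whether the operators are identified with its left or right eigenvectors.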
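Using the equation (7.44) (the functions C^i_{12} are equal to 1 at the first order of
perturbation theory) at a distance scale x ≈ M_W^{-1}, we obtain the following values for
the Wilson coefficients:
C^{1/2}_{12}(M_W^{-1}; M) = \left[\frac{\ln(M_W^2/\Lambda_{QCD}^2)}{\ln(M^2/\Lambda_{QCD}^2)}\right]^{4/\beta_0} ,
C^{3/2}_{12}(M_W^{-1}; M) = \left[\frac{\ln(M_W^2/\Lambda_{QCD}^2)}{\ln(M^2/\Lambda_{QCD}^2)}\right]^{-2/\beta_0} .   (7.75)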
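To get a feeling for the size of this effect, one may evaluate the two coefficients numerically. The sketch below assumes that both coefficients are powers (4/β_0 and −2/β_0) of the same ratio of logarithms, and that β_0 = 11 − 2N_f/3 with N_f = 5; the values M_W = 80.4 GeV, M = 1 GeV and Λ_QCD = 0.2 GeV are illustrative inputs that are not taken from the text.

```python
import math

def beta0(n_flavours):
    """One-loop coefficient, in the assumed normalization beta_0 = 11 - 2 N_f / 3."""
    return 11.0 - 2.0 * n_flavours / 3.0

def log_ratio(m_w, m, lambda_qcd):
    """ln(M_W^2 / Lambda^2) / ln(M^2 / Lambda^2)."""
    return math.log(m_w ** 2 / lambda_qcd ** 2) / math.log(m ** 2 / lambda_qcd ** 2)

def wilson_coefficients(m_w=80.4, m=1.0, lambda_qcd=0.2, n_flavours=5):
    """Return (C^{1/2}, C^{3/2}) for the assumed inputs (all masses in GeV)."""
    ratio = log_ratio(m_w, m, lambda_qcd)
    b0 = beta0(n_flavours)
    return ratio ** (4.0 / b0), ratio ** (-2.0 / b0)

C_HALF, C_THREE_HALVES = wilson_coefficients()
```

With these exponents, the product C^{1/2} (C^{3/2})^2 equals 1 for any choice of the scales, which provides a simple consistency check. For M below M_W, the isospin 1/2 coefficient is enhanced and the isospin 3/2 one is suppressed.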
5 The subscripts 1/2 and 3/2 are related to the isospin variation in the s quark decay mediated by these
operators.
The general concepts of renormalization that we aim at introducing in this section can
be first exposed by considering the simple example of a system of spins on a lattice,
the simplest of which is the Ising model in two dimensions, which is exactly solvable
for interactions among nearest neighbors. This model is known to have a disordered
phase at high temperature, a ferromagnetic order at low temperature (where spins
align with an external magnetic field, no matter how small), and a second order phase
transition at a critical temperature T∗ . At the second order transition, the correlation
length of the system becomes infinite, despite the fact that the interactions are short
ranged. Roughly speaking, a measure of the complexity of the study of a discrete
physical system (at least if one attempts to do it from the theory that describes the
interactions among the microscopic degrees of freedom) is the number of elementary
degrees of freedom per correlation length. By this account, second order phase
transitions are among the hardest problems to analyze.
6 In this problem, N_f = 5 flavours of quarks should be taken into account in the running of the strong
coupling constant, in order to include all the quarks up to the mass of the W^± bosons.
7 The measured imbalance between the isospin variations 1/2 and 3/2 is even larger, but a quantitative
Figure 7.2: Kadanoff’s block-spin renormalization. Top: the spins are grouped into 3 × 3 blocks.
Middle: each block of 9 spins is replaced by a single spin determined by the rule of majority.
Bottom: the lattice is scaled down (new spins come into the picture, that were previously outside
of the represented area).
Sr ≡ R S0 . (7.76)
However, the real power of this idea comes by iterating the renormalization group
steps R until there are only a few of the coarse-grained spins in a macroscopic area
of the system. Under such a sequence of renormalization steps, the actions are
sequentially transformed as follows:
The behaviour of the mapping R^n for large n contains all the information we may
need about the macroscopic properties of the system. In particular, a critical point,
where the system has an infinite correlation length and is self-similar, corresponds to
a fixed point of this transformation, i.e. to an action S∗ that satisfies
S∗ = R S∗ . (7.78)
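The iteration of a renormalization step towards a fixed point can be made concrete in a case where R is known in closed form. As an illustration that is not taken from the text, we assume here the one-dimensional Ising chain with decimation of every other spin, for which the nearest-neighbor coupling K = J/T is mapped to K' = (1/2) ln cosh(2K).

```python
import math

def decimate(coupling):
    """One decimation step of the 1D Ising chain (assumed example): K -> K'."""
    return 0.5 * math.log(math.cosh(2.0 * coupling))

def flow(coupling, steps):
    """Return the list [K_0, K_1, ..., K_steps] obtained by iterating the map."""
    trajectory = [coupling]
    for _ in range(steps):
        coupling = decimate(coupling)
        trajectory.append(coupling)
    return trajectory

TRAJECTORY = flow(1.0, 6)
```

In this toy model, K = 0 is a fixed point of the map and any finite initial coupling flows to it, which reflects the absence of a finite temperature phase transition in one dimension. The two-dimensional model discussed in the text has in addition a non-trivial fixed point, but its transformation R is not known in such a simple closed form.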
short distance scale. Thus, τ = 0 corresponds to the bare action at short distance, and
τ = +∞ corresponds to macroscopic distances, and the discrete steps of eq. (7.77)
are replaced by an equation of the form
\partial_\tau S_\tau = H\, S_\tau ,   (7.79)
One may view a given action S as a point in an abstract space, where each axis
corresponds to the coupling constant in front of a given operator. For instance, in the
case of a lattice spin system, there would be an axis for the strength of the interactions
among nearest neighbors, an axis for the strength of the interactions among sites
whose distance is √2 lattice units, and so on. In a scalar quantum field theory, these
could be the couplings for the operators φ□φ, φ^2, φ^4, φ^6, ... A renormalization
group transformation such as (7.76) defines a mapping of the points in this theory
space, either discrete or continuous depending on the system. We have illustrated this
in the continuous case in the figure 7.3, where the thick gray line shows how a bare
action S0 at short distance flows as the distance scale ℓ increases, leading to a theory
that may have very different couplings at macroscopic scales. Note that only three
out of many (possibly infinitely many for a continuous system) dimensions are shown
in the figure.
S ≡ S∗ + ∆S , H S∗ = 0 ,
H S = L ∆S + · · · , (7.80)
L On = λn On , (7.81)
where λn is the corresponding eigenvalue. In the vicinity of the fixed point, we thus
have
S \approx S_* + \sum_n c_n\, e^{\lambda_n \tau}\, O_n ,   (7.82)
Figure 7.3: Renormalization group flow in theory space (the arrows go from UV to IR scales).
The black dot is a critical fixed point S∗. The gray surface is the critical surface, i.e. the
universality class made of all the theories that flow into the critical point. The light colored
line, flowing away from the critical point, corresponds to the direction of a relevant operator.
The thick gray line illustrates the flow from a generic initial action S0.
where the cn are coefficients determined by initial conditions. This expression leads
to the following classification of operators8 :
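The role played by the sign of λ_n can be illustrated with a few lines of code. This sketch assumes the usual terminology (relevant for λ_n > 0, irrelevant for λ_n < 0, marginal for λ_n = 0) for real eigenvalues, with τ increasing towards the infrared; the function names are ours.

```python
import math

def classify(eigenvalue, tolerance=1e-12):
    """Name of the operator according to the sign of its (real) eigenvalue."""
    if eigenvalue > tolerance:
        return "relevant"
    if eigenvalue < -tolerance:
        return "irrelevant"
    return "marginal"

def deviation(coefficient, eigenvalue, tau):
    """Coefficient c_n exp(lambda_n tau) of O_n in the linearized flow."""
    return coefficient * math.exp(eigenvalue * tau)

# Same initial coefficient, followed over ten units of RG "time".
GROWTH = {lam: deviation(1e-3, lam, 10.0) for lam in (-1.0, 0.0, 1.0)}
```

A relevant direction grows exponentially along the flow and drives the action away from S∗, an irrelevant one dies out, and a marginal one is not decided by the linear analysis.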
The previous discussion, based on a linear analysis near the critical point, may be
extended globally as follows. One defines the critical surface as the domain of theory
space which is attracted into the critical point as the length scale goes to infinity.
All the bare actions that lie in this domain (the shaded surface in the figure 7.3)
describe systems that have the same long distance behaviour. Despite the fact that
these systems may correspond to completely different microscopic degrees of freedom
and interactions, they are described by the same action S∗ at large distances. For
this reason, this domain is also called the universality class of the critical point. The
relevant operators correspond to the directions of theory space that are “orthogonal”
to the critical surface. The term relevant follows from the fact that the coupling of
these operators must be fine-tuned in order to be on the critical surface: in other
words, the relevant couplings matter for making the system critical. A remarkable
aspect of phase transitions is that the number of these relevant operators is small9 ,
despite the fact that the microscopic interactions may require a very large number of
distinct couplings. Heuristically, this follows from a dimensional argument: since
the action is dimensionless, the coupling constants of higher dimensional operators
must have a negative mass dimension, and therefore they scale as inverse powers of
8 This discussion does not exhaust all the possibilities. Firstly, in a theory space with two or more
dimensions, eigenvalues can be complex valued, corresponding to RG trajectories that spiral around the
fixed point (spiraling inwards if the real part is negative and outwards if it is positive). Another possibility
is limit cycles (i.e. closed RG trajectories), that play a role for instance in the Efimov effect (a scaling
law in the binding energies of 3-boson bound states when the 2-body interaction is too weak to have a
two-body bound state).
9 In the case of the 2-dimensional Ising model, the only parameters that need to be adjusted in order to
reach the critical point are the temperature (T_*^{-1} ≈ 0.44) and the external field (equal to zero).
the ultraviolet cutoff. Thus, these operators are irrelevant. Only operators of low
dimensionality can be relevant, and there is usually a (small) finite number of them10 .
Let us now consider the domain that originates from the fixed point (the light
colored line in the figure 7.3), sometimes called the ultraviolet critical surface. This
is the domain spanned by the renormalization group flow if one starts from an
infinitesimal region around the fixed point. Any theory that lies on the UV critical
surface is renormalizable, since it evolves into the fixed point at short distance: this
indeed means that one may safely send the ultraviolet cutoff to infinity in such a
theory (this corresponds to moving in the direction opposite to the arrows in the figure
7.3). Note also that theories on the UV critical surface transform into one another
under the renormalization flow, but the couplings of the various relevant operators
depend on the scale. The following situations may occur:
• It may also happen that around a Gaussian fixed point, the only relevant op-
erators are quadratic in the fields, like mass and kinetic terms. In this case,
there is no interacting renormalizable action, and the theory is said to suffer
from triviality. There is nowadays strong evidence that, in a pure real scalar
field theory, the operator φ^4 is not relevant in four space-time dimensions (it is
relevant in three dimensions or less) and therefore such a field theory is trivial
because only the non-interacting theory makes sense.
• When the fixed point is a non-trivial interacting fixed point instead of a Gaussian
one, the theories on the UV critical surface are also renormalizable, but their
high energy behaviour cannot be studied by perturbative means. This situation
is called asymptotic safety11 .
inelastic scattering. There, peculiarities of the kinematics lead to an infinite number of relevant operators.
11 The concept of asymptotic safety was introduced by Weinberg, as a logical possibility for a renormal-
close to but not exactly on the critical surface, the theory firstly approaches the critical
point upon increasing the length scale, but instead of reaching it, it departs from it
on even larger scales to follow one of the repulsive directions. In such a system, the
correlation length may be large but not infinite as it would be at the critical point (the
turning point between the approach of the critical point and the subsequent departure
from it happens roughly when the RG scale equals the correlation length).
The block-spin renormalization procedure that we have discussed in the section 7.5.1
can be extended to the case of a continuous system such as a quantum field theory.
Moreover, while our discussion has been so far qualitative, we shall now derive an
explicit RG flow equation for the quantum effective action, the solution of which
would provide the full quantum content from tree level contributions only.
Reminders about the quantum effective action : Let us first recall some basic
results about the quantum effective action Γ [φ], taken from the section 2.6. It is
related to the generating functional W[j] of connected Feynman graphs by
i\,\Gamma[\phi] = W[j_\phi] - i \int d^4x\; j_\phi(x)\,\phi(x) .   (7.83)
\frac{\delta\Gamma[\phi]}{\delta\phi(x)} + j_\phi(x) = 0 .   (7.84)
or equivalently in terms of W by
\phi(x) = \left.\frac{\delta W[j]}{i\,\delta j(x)}\right|_{j=j_\phi} .   (7.85)
In other words, jφ (x) is the external source such that the expectation value of the
field is φ(x). By combining the path integral representation of W,
e^{W[j]} = \int D\phi(x)\; \exp\left[ iS[\phi(x)] + i\int d^4x\; j(x)\,\phi(x) \right] ,   (7.86)
with eqs. (7.83) and (7.84) we obtain the following functional equation satisfied by
the effective action Γ :
e^{i\Gamma[\varphi]} = \int D\phi(x)\; \exp\left[ iS[\phi+\varphi] - i\int d^4x\; \frac{\delta\Gamma[\varphi]}{\delta\varphi(x)}\,\phi(x) \right] .   (7.87)
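e^{W_\kappa[j]} \equiv \exp\left[ i\,\Delta S_\kappa\!\left[\frac{\delta}{i\delta j}\right] \right] Z[j]
  = \int D\phi(x)\; \exp\left\{ i\left( S[\phi] + \Delta S_\kappa[\phi] \right) + i\int j\phi \right\} ,   (7.88)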
where Z[j] is the usual generating functional for time ordered correlation functions
and ∆Sκ is defined in terms of the Fourier transform of the fields as follows:
\Delta S_\kappa[\phi] \equiv \frac{1}{2} \int \frac{d^4p}{(2\pi)^4}\; \tilde\phi(-p)\, R_\kappa(p)\, \tilde\phi(p) .   (7.89)
which means that the cutoff plays no role in this limit and we recover the full quantum
theory. This is the limit we aim at reaching at the end of the RG flow. In contrast, it
should become large when κ → ∞:
This property ensures that when κ is large, the right hand side of eq. (7.88) is
dominated by the saddle point, so that the corresponding effective action equals the
classical action.
G_\kappa(x,y) \equiv \frac{\delta^2 W_\kappa[j]}{i\delta j(x)\, i\delta j(y)} ,   (7.94)
and ⟨φ⟩_κ is the corresponding 1-point function:
\langle\phi(x)\rangle_\kappa \equiv \frac{\delta W_\kappa[j]}{i\delta j(x)} .   (7.95)
Scale dependent effective action : Let us now alter the definition (7.83) in order
to make it depend on the scale κ, by writing
\Gamma_\kappa[\phi] + \Delta S_\kappa[\phi] = -i\, W_\kappa[j_\phi] - \int d^4x\; j_\phi(x)\,\phi(x) .   (7.96)
The left hand side is written as Γκ + ∆Sκ in order not to include in the definition of
the effective action the unphysical regulator ∆Sκ . Like in the original definition, the
field φ and the current jφ are related by
\phi(x) = \left.\frac{\delta W_\kappa[j]}{i\delta j(x)}\right|_{j=j_\phi} .   (7.97)
j_\phi(x) + \frac{\delta\Gamma_\kappa[\phi]}{\delta\phi(x)} + \left[\tilde{R}_\kappa\, \phi\right](x) = 0 .   (7.98)
Differentiating eq. (7.97) with respect to j(y) and eq. (7.98) with respect to φ(y), and
multiplying the results, we obtain the following identity:
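i\,\delta(x-y) = \int d^4z\; \underbrace{\frac{\delta^2 W_\kappa[j]}{i\delta j(y)\, i\delta j(z)}}_{G_\kappa(y,z)} \left[ \underbrace{\frac{\delta^2 \Gamma_\kappa[\phi_j]}{\delta\phi_j(z)\,\delta\phi_j(x)}}_{\Gamma_{\kappa,2}(z,x)} + R_\kappa(z,x) \right] ,   (7.99)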
Flow equation for Γκ : Now, we can differentiate eq. (7.96) with respect to the
scale:
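\partial_\tau \Gamma_\kappa[\phi] = -\partial_\tau \Delta S_\kappa[\phi] - i\,\partial_\tau W_\kappa[j_\phi] - \int d^4x\; \phi(x)\, \partial_\tau j_\phi(x)
  = -\partial_\tau \Delta S_\kappa[\phi] - i \left[\partial_\tau W_\kappa[j]\right]_{j=j_\phi}
    - i \int d^4x\; \frac{\delta W_\kappa[j_\phi]}{\delta j_\phi(x)}\, \partial_\tau j_\phi(x) - \int d^4x\; \phi(x)\, \partial_\tau j_\phi(x)
  = \frac{1}{2} \int \frac{d^4p}{(2\pi)^4}\; \partial_\tau R_\kappa(p)\; \tilde{G}_\kappa(p) .   (7.100)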
In the second line, we have made explicit the fact that Wκ [jφ ] contains both an
intrinsic scale dependence and an implicit one from the κ dependence of its argument
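j_φ. Using eq. (7.99), this can be put into the following form:
\partial_\tau \Gamma_\kappa = \frac{i}{2}\, \mathrm{Tr} \left\{ (\partial_\tau R_\kappa) \left[ \frac{\delta^2 \Gamma_\kappa[\phi]}{\delta\phi\,\delta\phi} + R_\kappa \right]^{-1} \right\} ,   (7.101)
that depends only on Γ_κ (the integral over the momentum p has been written compactly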
in the form of a trace). Let us make a few remarks concerning this equation:
• This equation is a functional differential equation, that does not involve any
functional integral, unlike eq. (7.87). Nevertheless, it cannot be solved exactly
in general, and various truncation schemes have been devised in order to obtain
physical results.
• The term Rκ in the denominator provides an infrared regularization (by adding
a kind of mass term to the inverse propagator).
• The factor ∂τ Rκ is peaked around momentum modes of order κ. Thus, the
right hand side is rather localized in momentum space, in contrast with the
equation (7.87) that includes all the momentum scales at once.
• The choice of the regularizing function Rκ is not unique, provided that it fulfills
the conditions (7.90-7.92). Consequently, the renormalization group trajectories
depend somehow on this choice (this may be viewed as a dependence on the
renormalization scheme). However, the fixed points of the renormalization
group flow do not depend on this choice.
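To visualize the qualitative behaviour expected from the regulator, one may look at an explicit example. The sketch below assumes Euclidean momenta and the exponential form R_κ(p) = p^2 / (exp(p^2/κ^2) − 1), a common choice in the literature that is not taken from the text; it only illustrates the properties described above in words (a mass-like term of order κ^2 for soft modes, no effect on hard modes or when κ → 0).

```python
import math

def regulator(p2, kappa):
    """Assumed exponential regulator p^2 / (exp(p^2/kappa^2) - 1), for Euclidean p^2 >= 0."""
    y = p2 / kappa ** 2
    if y == 0.0:
        return kappa ** 2          # limit p^2 -> 0: behaves like a mass term kappa^2
    if y > 300.0:
        return 0.0                 # exponentially suppressed, avoids overflow
    return p2 / math.expm1(y)

def scale_derivative(p2, kappa):
    """kappa dR/dkappa for the same regulator, computed analytically."""
    y = p2 / kappa ** 2
    if y == 0.0:
        return 2.0 * kappa ** 2
    if y > 300.0:
        return 0.0
    return 2.0 * kappa ** 2 * y ** 2 * math.exp(y) / math.expm1(y) ** 2

SOFT_MODE = regulator(1e-8, 1.0)     # p << kappa
HARD_MODE = regulator(100.0, 1.0)    # p >> kappa
NO_CUTOFF = regulator(1.0, 0.01)     # kappa -> 0 at fixed p
```

With this choice the scale derivative is of order κ^2 for p below κ and is exponentially small above, so that, once combined with the momentum integration measure, the right hand side of the flow equation is dominated by modes of order κ. The relation between κ ∂/∂κ and ∂_τ depends on the definition of τ and is left unspecified here.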
Chapter 8
Until now, we have discussed various quantum field theories (the electroweak theory
and quantum chromodynamics) that are believed to provide a unified description of
all particle physics up to the scale of electroweak symmetry breaking, i.e. roughly
Λ_EW ∼ 200 GeV. However, it is hard to imagine that there are no new
physical phenomena (new particles, new interactions) at higher energy scales (so
far out of reach of experimental searches). An interesting question is therefore to
understand why the Standard Model is such a good description of physics below the
electroweak scale, despite the fact that it does not contain any of the physics at higher
scales. In other words, despite the fact that there is distinct physics on scales that
span many orders of magnitude, why can “low energy” phenomena be described by
ignoring most of the higher scales? The same question could be asked in other areas:
for instance, why can chemistry (i.e. phenomena of atomic bonding in molecules)
get away without any of the complications of quantum electrodynamics? The general
question is that of the separation between various physical scales.
In the context of quantum field theory, such a low energy description is called
an effective theory. The basic idea is that most of the details of an underlying more
fundamental (i.e. valid at higher energy) description are not important at lower
energies, except for a small number of parameters. As we shall see in this chapter,
effective field theories may occur in several situations:
• Top-down : the quantum field theory which is valid at higher energy is known,
but it is unnecessarily complicated to describe phenomena at lower energy
scales. A typical example is that of a theory that contains particles that are
much heavier than the energy scale of interest (e.g., the top quark in quantum
chromodynamics, while one is interested in interactions at the GeV scale). In
this case, the effective theory “integrates out” the higher mass particles in order
to obtain a simpler theory.
In the top-down approach, where the fundamental underlying theory is known, the
goal of obtaining an effective description for low energy phenomena could in principle
be achieved by the renormalization group. In particular, the functional renormalization
group introduced in the section 7.5.3 allows one to evolve from an ultraviolet classical
action towards a low energy quantum effective action, by progressively integrating
out layers of lower and lower momentum. There is nothing wrong with this approach,
but one has to keep in mind that the effective action obtained in this way is usually
extremely complicated and cumbersome to use in practical applications (in particular,
it could have infinitely many effective interactions, all of which are in general non-
local). In a sense, the quantum effective action that results from the RG evolution
is much more complex than the original ultraviolet action, and the gain in terms of
simplicity is rather dubious. In contrast, the concept of effective theory that we
are aiming at in this chapter is a field theory in which the ultraviolet physics is
encapsulated into a finite number of local operators, with coupling constants that may
depend on the energy scale and on the properties of the degrees of freedom that have
been integrated out.
SΛ [φS ] is the action of the low energy effective theory. Using the operator product
expansion, it may be written as a sum of local operators, possibly infinitely many of
them:
S_\Lambda[\phi_S] \equiv \int d^dx\; \sum_n \lambda_n\, O_n .   (8.4)
Combined with the corresponding coupling constant, the contribution of this operator
would be of order
\lambda_n \int d^dx\; O_n \sim g_n \left(\frac{\Lambda}{E}\right)^{d-d_n} .   (8.7)
This estimate is the basis of the following classification of the operators that may
enter in the action of the effective theory:
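The scaling of eq. (8.7) can be illustrated numerically. The following sketch simply evaluates g_n (Λ/E)^(d − d_n) for a few operator dimensions, with g_n = 1 in d = 4 and the illustrative values E = 1 GeV, Λ = 100 GeV, which are our assumptions.

```python
def operator_weight(dim_op, energy, cutoff, dim_space=4, g_n=1.0):
    """Estimate g_n (Lambda / E)^(d - d_n) for an operator of mass dimension d_n."""
    return g_n * (cutoff / energy) ** (dim_space - dim_op)

# Operators of mass dimension 2, 4 and 6 probed at E = 1 GeV with Lambda = 100 GeV.
WEIGHTS = {dim_op: operator_weight(dim_op, energy=1.0, cutoff=100.0)
           for dim_op in (2, 4, 6)}
```

An operator with d_n > d is suppressed by powers of E/Λ at low energy, an operator with d_n = d is insensitive to the ratio of scales, and an operator with d_n < d is enhanced.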
• Supersymmetry may also forbid certain types of mass terms (if unbroken, the
mass must be strictly zero, and if broken, the mass will settle to a value close
to the scale of supersymmetry breaking).
By that account, the Standard Model (without any supersymmetric extension) is not
natural, since it does not contain any mechanism to prevent the mass of the Higgs
scalar boson from being at a cutoff scale (possibly much higher than the electroweak scale)
where the Standard Model is superseded by a more fundamental theory.
differences matter for the dynamics of the field theory, but the absolute value of the energy enters in the
energy-momentum tensor that acts as a source in Einstein’s equations.
bosons (photon and gluons). Thus, this low energy truncation has no mechanism
for weak decays. Nevertheless, one may write an effective coupling involving a
proton, a neutron (here, we prefer to use hadrons, that are the states encountered in
actual experimental situations), an electron and the corresponding neutrino. The most
general local operator combining these four fields may be written as
\frac{g_{12}}{\Lambda^2}\; \left(\bar\psi_p\, \Gamma_1\, \psi_n\right) \left(\bar\psi_e\, \Gamma_2\, \psi_\nu\right) ,   (8.9)
where g_12 is a dimensionless constant, Λ is a dimensionful scale, and Γ_{1,2} are matrices
chosen in the following set
\Gamma_{1,2} \in \left\{ 1,\; \gamma^5,\; \gamma^\mu,\; \gamma^\mu\gamma^5,\; \sigma^{\mu\nu} \equiv \tfrac{i}{4}\,[\gamma^\mu,\gamma^\nu] \right\} .   (8.10)
Note that σ^{µν}γ^5 is not linearly independent from these matrices, since σ^{µν}γ^5 ∝
ε^{µνρσ} σ_{ρσ}, and therefore need not be included in this list. Thus, the most general
Lorentz invariant Lagrangian involving these four fields reads
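\mathcal{L}_{\rm eff} = \underbrace{ \left(\bar\psi_p \gamma^\mu \psi_n\right) \left(\bar\psi_e \gamma_\mu (C_V + C_V' \gamma^5)\psi_\nu\right) + \left(\bar\psi_p \gamma^\mu\gamma^5 \psi_n\right) \left(\bar\psi_e \gamma_\mu\gamma^5 (C_A + C_A' \gamma^5)\psi_\nu\right) }_{\text{vector, axial}}
  + \underbrace{ \left(\bar\psi_p \psi_n\right) \left(\bar\psi_e (C_S + C_S' \gamma^5)\psi_\nu\right) + \left(\bar\psi_p \gamma^5 \psi_n\right) \left(\bar\psi_e \gamma^5 (C_P + C_P' \gamma^5)\psi_\nu\right) }_{\text{scalar, pseudo-scalar}}
  + \underbrace{ \left(\bar\psi_p \sigma^{\mu\nu} \psi_n\right) \left(\bar\psi_e \sigma_{\mu\nu} (C_T + C_T' \gamma^5)\psi_\nu\right) }_{\text{tensor}} .   (8.11)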
Note that the presence of certain terms violates some discrete symmetries. For instance,
the primed terms C′_{V,S,P,T} all violate parity, and T-invariance requires that the ratio
C′_i/C_i be real for all i ∈ {V, A, S, P, T}. On the other hand, by confronting this
effective Lagrangian with the existing data on weak decays, we learn that
The first of these results is an indication of the energy scale at which the Fermi theory
breaks down and should be replaced by a more accurate microscopic description of
weak decays, and the second one implies that this underlying theory is chiral. The
fact that C′_{V,A} ∼ C_{V,A} is a sign of parity violation in weak interactions. Finally, the
last property tells us that this microscopic interaction is not mediated by a scalar
or a tensor with a mass less than ∼ 2 TeV. All this information may be used in
constraining the possible form of the theory that describes weak interactions at higher
energies.
Let us now consider the opposite exercise: namely, start from the Lagrangian of the
Standard Model and obtain the low energy effective theory of weak interactions by
a matching procedure. We know that the W ± bosons responsible for weak decays
couple to left-handed fermions arranged in SU(2) doublets:
\begin{pmatrix} \nu_e \\ e \end{pmatrix}_L ,\quad \begin{pmatrix} u \\ d \end{pmatrix}_L ,   (8.13)
where we have written only the relevant doublets for the decay n → p e \bar{ν}_e. In
addition, we have to keep in mind that the mass eigenstates are misaligned with
the weak interaction eigenstates in the quark sector. Thus, the vertex Wud contains
a factor Vud from the CKM matrix. With these ingredients, the tree level decay
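amplitude d → u e \bar{ν}_e reads:
A = \frac{g^2}{8}\, V_{ud}\, \frac{i}{k^2 - M_W^2}\; \left[\bar{u}\, \gamma^\mu (1-\gamma^5)\, d\right] \left[\bar{e}\, \gamma_\mu (1-\gamma^5)\, \nu_e\right] ,   (8.14)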
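\mathcal{L}_{\rm eff} = \frac{G_F}{\sqrt{2}}\, V_{ud}\, \left[\bar\psi_u \gamma^\mu (1-\gamma^5)\psi_d\right] \left[\bar\psi_e \gamma_\mu (1-\gamma^5)\psi_\nu\right] \quad\text{with}\quad \frac{G_F}{\sqrt{2}} \equiv \frac{g^2}{8 M_W^2} .   (8.15)
In order to obtain from this the physical decay amplitude n → p e \bar{ν}_e, we need the
matrix element
\langle p |\; \bar\psi_u \gamma^\mu (1-\gamma^5)\psi_d \;| n \rangle   (8.16)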
with initial and final nucleons instead of quarks. In the low momentum limit, it may
be related to a similar matrix element with the spinors of the proton and neutron by
where gV,A are two constants that may be viewed as the zero momentum limit of
some form factors. Then, by comparing the decay amplitudes obtained from the low
energy effective theory guessed on the basis of phenomenological considerations, and
the one obtained by starting from the electroweak theory, we obtain
C_V = -C_V' = g_V\, \frac{g^2}{8 M_W^2}\, V_{ud} = \frac{1}{\Lambda^2} ,
C_A = -C_A' = -g_A\, \frac{g^2}{8 M_W^2}\, V_{ud} ,
C_{S,P,T} = C_{S,P,T}' = 0 .   (8.18)
In this top-down approach, we see that the parity violation inferred from experimental
evidence is in fact maximal in the electroweak theory, and that the scalar and tensor
contributions are exactly zero. Note also that the scale Λ that we introduced by hand
in the low energy effective theory does not coincide exactly with the mass of the
heavy particle which is integrated out (in the present case, the W boson), but has
the same order of magnitude. Finally, even though we performed here the matching
at tree level, it is in principle possible to correct the coefficients of the low energy
effective theories by electroweak and QCD loop corrections.
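The tree level matching can be turned into numbers. The sketch below evaluates G_F from G_F/√2 = g^2/(8 M_W^2) and the scale Λ defined by 1/Λ^2 = g_V V_ud g^2/(8 M_W^2); the inputs g ≈ 0.653, M_W ≈ 80.38 GeV, V_ud ≈ 0.974 and g_V = 1 are approximate values that we assume for the purpose of illustration.

```python
import math

# Approximate inputs (assumed here, not quoted in the text).
G_WEAK = 0.653     # SU(2) gauge coupling g
M_W = 80.38        # W boson mass in GeV
V_UD = 0.974       # CKM matrix element
G_V = 1.0          # vector form factor at zero momentum

def fermi_constant(g, m_w):
    """G_F in GeV^-2, from G_F / sqrt(2) = g^2 / (8 M_W^2)."""
    return math.sqrt(2.0) * g ** 2 / (8.0 * m_w ** 2)

def effective_scale(g, m_w, v_ud, g_v):
    """Scale Lambda in GeV, from 1 / Lambda^2 = g_V V_ud g^2 / (8 M_W^2)."""
    return 1.0 / math.sqrt(g_v * v_ud * g ** 2 / (8.0 * m_w ** 2))

G_FERMI = fermi_constant(G_WEAK, M_W)
LAMBDA_EFF = effective_scale(G_WEAK, M_W, V_UD, G_V)
```

With these inputs, G_F comes out close to the measured value of about 1.17 × 10^-5 GeV^-2, and Λ is a few times M_W, in line with the remark that the two scales agree only in order of magnitude.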
and a natural endeavor is to construct the terms L(1,2,··· ) , made of operators with
mass dimension greater than four. By power counting, these operators must be
suppressed by coupling constants that are inversely proportional to powers of some
high energy scale Λ at which corrections to the Standard Model become important. In
the construction of these corrections, one usually abides by the following constraints:
2 One exception is the fact that neutrinos have masses, which does not have a very compelling explanation
in the Standard Model – we shall return to this issue in the next subsection.
• The SU(3) × SU(2) × U(1) gauge symmetry of the Standard Model remains
a symmetry of the higher order corrections (the idea being that whatever is
the more fundamental theory that underlies the Standard Model, it is more
symmetric, not less),
• The corrections are built with the degrees of freedom of the Standard Model,
• The vacuum expectation value of the Higgs is not modified by the corrections.
• Operators that allow processes that were forbidden in the Standard Model.
In this case, what is needed are more sensitive experiments, able to detect
extremely rare events.
The right handed neutrinos are singlets under SU(3) and SU(2) and have zero
electrical charge, which means that they do not feel any of the interactions of the
Standard Model. As a consequence, all the neutrinos detected in experiments (via their
weak interactions with the matter of the detector) are left handed neutrinos, implying
that there is no direct evidence for the existence of right handed neutrinos. For this
reason, right handed neutrinos are usually not considered as a part of the Standard
Model.
The observation of neutrino oscillations, i.e. the fact that the flavour of a neutrino
can change as it propagates, implies that there are non-zero mass differences between
neutrinos3 . Therefore, at most one of the neutrinos can be massless, and at least two
of them must be massive.
Neutrino masses from the Higgs mechanism : Since the electroweak theory is
chiral (right handed leptons are SU(2) singlets, while the left handed ones belong to
SU(2) doublets), a naive Dirac mass term of the form m_D \bar{ψ}_L ψ_R is not invariant
under SU(2). However, we may construct such a Dirac mass in the same way as for
the other leptons, by starting from a Yukawa coupling involving the Higgs boson:
\lambda\; \bar\psi_{L,i\alpha}\; \epsilon_{ij}\; \Phi^*_j\; \psi_{R,\alpha} ,   (8.20)
eigenstate) of definite momentum. If mass eigenstates are misaligned with the weak interaction eigenstates,
then this neutrino may project on several mass eigenstates. Since the time evolution of the phase of a
wavefunction depends on the mass of the particle, these mass eigenstates evolve slightly differently in time
(unless all the neutrino masses are identical). At the detection time, this leads to a flavour decomposition
which is different from the one at the time of production. Thus, the original electron anti-neutrino will be a
mixture of electron, muon and tau anti-neutrinos. Conversely, the observation of this change of flavour
implies mass differences in the neutrino sector.
4 Whether this type of term is “beyond the Standard Model” is to a large extent a matter of definition.
Before the observation of neutrino oscillations, the Standard Model was most often defined without right
handed neutrinos, and therefore massless neutrinos. But it would have been equally acceptable to include
right handed neutrinos from the start, with Yukawa couplings so small that their masses were too small to
detect.
neutrinos would be that they do not feel any of the gauge interactions of the Standard
Model. For this reason, they are sometimes called sterile neutrinos. The main
drawback of this solution is that it requires an even larger range of values of the
Yukawa couplings, with no natural explanation.
\frac{c\, v^2}{\Lambda}\; \nu_L^t\, C\, \nu_L ,   (8.22)
which corresponds to a Majorana mass m_M = c v^2 Λ^{-1}. The appeal of this mechanism
is that a small mass of the neutrinos is naturally explained by a high scale Λ for
the new physics. For instance, a neutrino mass of the order of 1 eV or below corresponds
to Λ ≳ 10^13 GeV. As we have already mentioned, the operator in eq. (8.21)
does not conserve lepton number, since it is not invariant under the following global
transformation
For this reason, this alternative mechanism is clearly beyond the Standard Model.
However, as long as gauge symmetries are preserved, the violation of lepton number is
not considered particularly dramatic. In a sense, one may view the lepton number conservation
that exists in the Standard Model as accidental, being a consequence of the fact that
only dimension-four operators are included.
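The order of magnitude Λ ≳ 10^13 GeV quoted above follows from a one-line estimate, m_M = c v^2/Λ. In the sketch below, the Higgs vacuum expectation value v ≈ 246 GeV and the coefficient c = 1 are assumptions made for the sake of the estimate.

```python
GEV_PER_EV = 1e-9

def seesaw_scale(m_nu_ev, v_gev=246.0, c=1.0):
    """Scale Lambda (in GeV) such that c v^2 / Lambda equals a neutrino mass given in eV."""
    return c * v_gev ** 2 / (m_nu_ev * GEV_PER_EV)

LAMBDA_1EV = seesaw_scale(1.0)      # neutrino mass of 1 eV
LAMBDA_01EV = seesaw_scale(0.1)     # a lighter neutrino requires a higher scale
```

Since Λ is inversely proportional to the neutrino mass, any mass below 1 eV pushes the scale of new physics above the value obtained for 1 eV.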
Weinberg operator from the low energy limit of another QFT : In the spirit
of the bottom-up construction of an effective theory, the operator of eq. (8.21) can
5 ψ^t_{L,iα} ε_{ij} Φ_j and Φ^t_k ε_{kl} ψ_{L,lβ} are both SU(2) invariant (but not Lorentz invariant), and the
combination ψ^t_{L,iα} C_{αβ} ψ_{L,lβ} is Lorentz invariant. This combination is SU(3) invariant only for the leptons
(not for the quarks).
be obtained by exploring all the possibilities for dimension 5 operators built with
the degrees of freedom of the Standard Model and some symmetry requirements.
However, this operator can also be obtained in the low energy limit of a renormalizable
quantum field theory. Consider an extension of the field content of the Standard Model,
where we add a right handed neutrino νR with a very large Majorana mass MR (much
heavier than the electroweak scale), that also couples to the SU(2) doublet containing
the left handed neutrino and to the Higgs field via a Yukawa coupling,
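\mathcal{L} = \mathcal{L}_{SM} + \mathcal{L}_{\nu_R} ,
\mathcal{L}_{\nu_R} \equiv i\, \bar\nu_R\, \slashed{\partial}\, \nu_R - y\, \bar\psi_L\, \epsilon\, \Phi^*\, \nu_R - y^*\, \bar\nu_R\, \Phi^t\, \epsilon^\dagger\, \psi_L
  + \frac{1}{2} \left( M_R\, \nu_R^t\, C\, \nu_R + M_R^*\, \nu_R^{t*}\, C\, \nu_R^* \right) .   (8.24)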
With two instances of the Yukawa coupling and a propagator of the heavy Majorana
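[Figure: see-saw diagrams. The exchange of a heavy ν_R of momentum p between two Φ ψ_L vertices reduces, for p ≪ M_R, to a local ΦΦψ_Lψ_L vertex, which becomes a ν_L ν_L mass term for Φ = v.]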
matrix C.
7 More precisely, it corresponds to the Type-I see-saw mechanism. Type-II and Type-III see-saw
mechanisms exist, that differ in the nature of the heavy particle that connects the ΦΦψtL ψL fields in the
original four point function.
The equation of motion that follows from the zeroth order Lagrangian is
\left(\Box + m^2\right)\phi + \frac{\lambda}{6}\,\phi^3 = 0 .   (8.27)
Naively, it is tempting to replace the last term of the effective Lagrangian, φ^3 □φ, by
a sum of terms in φ^4 and φ^6. However, it is not totally clear that this is legitimate
when this interaction term is inserted in a more complicated graph. A more robust
justification goes as follows. Consider a new scalar field ψ related to φ by
\phi = \psi + \lambda_2\, \Lambda^{-2}\, \psi^3 .   (8.28)
Note that both terms in the right hand side have mass dimension 1 and transform
as Lorentz scalars. Rewriting the terms of the above Lagrangian in terms of ψ, we
obtain
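\frac{1}{2}\,\partial_\mu\phi\,\partial^\mu\phi = \frac{1}{2}\,\partial_\mu\psi\,\partial^\mu\psi - \lambda_2\Lambda^{-2}\,\psi^3\,\Box\psi + O(\Lambda^{-4}) ,
m^2\phi^2 = m^2\psi^2 + 2\lambda_2\Lambda^{-2}\, m^2\psi^4 + O(\Lambda^{-4}) ,
\frac{\lambda}{4!}\,\phi^4 = \frac{\lambda}{4!}\,\psi^4 + 4\lambda_2\Lambda^{-2}\, \frac{\lambda}{4!}\,\psi^6 + O(\Lambda^{-4}) ,   (8.29)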
and finally
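\mathcal{L} = \frac{1}{2}\,\partial_\mu\psi\,\partial^\mu\psi - \frac{1}{2}\, m^2\psi^2 - \frac{\lambda'}{4!}\,\psi^4 + \frac{1}{\Lambda^2}\,\lambda_1'\,\psi^6 + O(\Lambda^{-4}) ,   (8.30)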
where λ ′ , λ1′ are new coupling constants for the quartic and sextic terms. In the spirit
of an effective field theory, we do not care about the terms of order Λ−4 since they
come with operators of dimension 8, that we are not considering here. Thus, by the
change of variable of eq. (8.28), we can eliminate the term that seemed redundant in
the Lagrangian. More generally, any term of the form
\Lambda^{-2}\, f(\phi)\, \underbrace{\left[ \Box\phi + m^2\phi + \frac{\lambda}{6}\,\phi^3 \right]}_{\text{l.h.s. of the EOM}} ,   (8.31)
where f(φ) is any local function of the fields of mass dimension 3 (e.g., φ^3, m^2 φ,
□φ), can be removed from the effective Lagrangian by the following field redefinition
For a transformation of the type (8.28), the determinant depends on the field since we
have
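\frac{\delta\phi(x)}{\delta\psi(y)} = \Lambda^{-2}\, \delta(x-y) \left[ \Lambda^2 + 3\lambda_2\, \psi^2(x) \right] ,   (8.34)
and therefore this determinant should not be disregarded. Like in the Faddeev-Popov
quantization procedure, we may express it as a path integral over fictitious Grassmann
fields χ, \bar{χ}, by writing:
\det\left[\frac{\delta\phi(x)}{\delta\psi(y)}\right] = \int D\chi(x)\, D\bar\chi(x)\; \exp\left\{ i \int d^4x\; \bar\chi(x) \left[ \Lambda^2 + 3\lambda_2\, \psi^2(x) \right] \chi(x) \right\} .   (8.35)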
In the case of our simple example, the kinetic term of this ghost field is a bit peculiar
since it does not contain any derivatives. However, it exhibits a feature which is
completely generic, namely the fact that its mass is of order Λ. Since the ghosts can
only appear in closed loops, their contribution is suppressed by inverse powers of
Λ. In other words, the determinant depends on the field ψ, but this dependence is of
higher order in $\Lambda^{-2}$ and will not affect our effective theory.
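As a sanity check of the expansions in eq. (8.29) and of the Jacobian in eq. (8.34), one can treat $\psi$ as an ordinary number and compare both sides numerically. The sketch below is only illustrative: the function names and the numerical values chosen for `psi`, `lam2` and `Lambda` are assumptions made for this example and do not come from the text.

```python
# Zero-dimensional check of the field redefinition phi = psi + lam2 * psi**3 / Lambda**2.
# psi is treated as a plain number; all numerical values are arbitrary choices.

def phi_of_psi(psi, lam2, Lambda):
    """Field redefinition of eq. (8.28)."""
    return psi + lam2 * psi**3 / Lambda**2

def jacobian(psi, lam2, Lambda):
    """d(phi)/d(psi) = Lambda**-2 * (Lambda**2 + 3*lam2*psi**2), cf. eq. (8.34)."""
    return (Lambda**2 + 3.0 * lam2 * psi**2) / Lambda**2

def remainders(psi, lam2, Lambda):
    """Exact powers of phi minus their truncations at order Lambda**-2."""
    a = lam2 / Lambda**2
    phi = phi_of_psi(psi, lam2, Lambda)
    r2 = phi**2 - (psi**2 + 2.0 * a * psi**4)
    r4 = phi**4 - (psi**4 + 4.0 * a * psi**6)
    return r2, r4

if __name__ == "__main__":
    # If the truncations are correct, the remainders scale like Lambda**-4,
    # so the rescaled values printed here should be nearly independent of Lambda.
    for Lambda in (10.0, 100.0):
        r2, r4 = remainders(psi=1.3, lam2=0.7, Lambda=Lambda)
        print(Lambda, r2 * Lambda**4, r4 * Lambda**4)
```

The rescaled remainders stay of order one as `Lambda` grows, which is the numerical counterpart of the $O(\Lambda^{-4})$ symbols in eq. (8.29).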
At this point, we have shown that the only possible effect of the term we have removed
from the effective Lagrangian is to modify the operators inside a time-ordered product
of fields (in the form of extra terms that will appear on the external legs of the
corresponding Feynman graphs). However, the physical quantities are not the above
correlation functions themselves, but the on-shell transition amplitudes obtained with
the LSZ reduction formulas, i.e. the residue of the 1-particle poles in the Fourier
transform of eq. (8.37). For instance, in a 4-point function contributing to a 2 → 2
scattering amplitude, one would have a graph such as the following
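[Figure: graph contributing to the 4-point function, with three external points labelled ψ and one labelled ψ3.]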
where one of the operators in the T-product is a ψ3 (in this diagrammatic representa-
tion, we have not yet amputated the external propagators.) We can readily see that one
point of this function is not terminated by a propagator, and therefore does not exhibit
a 1-particle pole. Thus, such a graph does not contribute to the on-shell transition
amplitude when inserted in the LSZ reduction formula. Although we have used a very
simple example to illustrate the chain of arguments leading to this result, it is in fact
completely general: if a term of the effective Lagrangian can be rewritten as a linear
combination of other operators thanks to the leading order equation of motion, then
this term can be ignored in the effective theory without changing the S-matrix.
span a wide range of momentum scales), we may expect that some simplifications are
possible if one is interested in processes in which some of these scales are irrelevant.
Several effective theories have been developed in order to simplify the treatment of
strong interactions in some special kinematical situations, and we shall discuss two of
them in this section.
Main ideas : There are six flavours of quarks in Nature, u, d, s, c, b, and t. The
u, d and s quarks are light in comparison to other QCD scales (in particular the
confinement scale ΛQCD ), while the c, b and t are considered heavy. Besides the well
known nucleons (proton, neutron) and light mesons (pions, rho), that are made of u
and d valence quarks, some hadrons contain heavy quarks (c and b only, since the t
quark decays before a bound state can form). An obvious source of simplification
in the presence of heavy quarks is asymptotic freedom, thanks to which the strong
coupling constant at the scale mQ is not very large and thus the strong interactions
are more like electromagnetic interactions. In particular, hadrons made of a heavy
quark-antiquark pair $Q\bar{Q}$ have a size of order $(\alpha_s m_Q)^{-1}$. When this size is
much smaller than $\Lambda_{\rm QCD}^{-1}$, these bound states are quite similar to a hydrogen atom.
However, hadrons mixing heavy and light quarks are not as simple, because their
size is of order $\Lambda_{\rm QCD}^{-1}$ and the typical momentum transfer between the light and heavy
quarks is of order $\Lambda_{\rm QCD}$. Thus, in these heavy-light hadrons, one may view the heavy
quark as surrounded by a non-perturbative cloud of light quarks and gluons. Such
systems are characterized by two different scales:
For a heavy quark, one has λQ ≪ Rh . Thus, in a certain sense, the heavy quark
may be viewed as a point-like object inside a much larger hadron. Loosely speaking,
the quantum numbers of the heavy quark (flavour, spin) are confined in a volume of
order of its Compton wavelength $\lambda_Q$, but the accompanying cloud of light quarks and
gluons can only resolve distances as small as $\Lambda_{\rm QCD}^{-1}$. Therefore, the light degrees of
freedom are totally insensitive to the heavy quark quantum numbers, and they only
feel its colour field. Moreover, for a heavy-light hadron, the rest frame of the hadron
is almost equivalent to the rest frame of the heavy quark. In this frame, the colour field
of the heavy quark is the Coulomb electrical field produced by a static colour charge,
that does not depend on the heavy quark mass. Thus, we expect that the configuration
of the light constituents is independent of mQ when mQ → ∞. These observations
constitute what is called heavy quark symmetry, that we shall derive more formally
later in this section. Note that, unlike chiral symmetry for massless quarks, it is
not a symmetry of the QCD Lagrangian, but rather an approximate symmetry that
arises in special kinematical conditions (namely, when a heavy quark interacts only
with light degrees of freedom via soft exchanges). Heavy quark symmetry provides
relationships between bound states that differ only in the flavour and/or the spin8 of
the heavy quark, for instance the B, D, B∗ , D∗ mesons, or the Λb , Λc baryons. Heavy
quark effective theory exploits this separation of scales in a systematic way in order
to calculate the dependence of various physical quantities on the mass of the heavy
quark, by an expansion in powers of $m_Q^{-1}$.
Spinor decomposition : Let us assume that there is a large gap between the con-
finement scale ΛQCD and the heavy quark mass mQ , and introduce an intermediate
scale Λ such that ΛQCD ≪ Λ ≪ mQ . Our goal is to construct an effective theory
which is equivalent to QCD at long distance, i.e. for momenta below Λ (but may
differ from QCD above Λ). Heavy quark effective theory is somewhat special in that
we do not completely integrate out the heavy quarks (since one of its applications is
to describe bound states that contain heavy quarks), but we rather integrate out only a
part of the heavy quark degrees of freedom. This is done by writing the momentum
of a heavy quark as follows:
$$p^\mu \equiv m_Q\, v^\mu + q^\mu , \tag{8.38}$$
$$P_\pm \equiv \frac{1 \pm \not{v}}{2} , \tag{8.39}$$
8 This is analogous to the fact that isotopes have almost identical chemistry, since the cloud of electrons
surrounding the nucleus is almost independent of its mass (in a first approximation, it depends only on its
electrical charge). Likewise, the independence with respect to the spin of the heavy quark is analogous to
the near degeneracy of the hyperfine levels in atomic physics.
or conversely
$$\psi(x) = e^{-i\, m_Q\, v\cdot x}\,\Big[ q_v(x) + Q_v(x) \Big] . \tag{8.41}$$
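The path integration over the heavy field $Q_v$ is Gaussian and can be performed
analytically, giving
$$Z[\bar\eta,\eta] = \int \mathcal{D}\bar{q}_v\,\mathcal{D}q_v\; \Delta_v[A]\; e^{\,i\int d^4x\,\left(\mathcal{L}_{\rm eff} + \bar\eta\, q_v + \bar{q}_v\,\eta\right)} , \tag{8.44}$$
and where $\Delta_v[A]$ is the functional determinant produced by the Gaussian integral:
$$\Delta_v[A] \equiv \Big[\det\big(2 m_Q + i\, v\cdot D\big)\Big]^{1/2} . \tag{8.46}$$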
Note that if one chooses the strict axial gauge v · A = 0, then this determinant is
constant and may be disregarded.
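The operators $P_\pm$ of eq. (8.39) are used as projectors, which relies on $\not{v}^2 = v^2 = 1$. The following sketch checks the projector properties numerically. It assumes the Dirac representation of the gamma matrices, the metric signature (+,-,-,-) and a velocity along the z axis, $v = (\cosh\eta, 0, 0, \sinh\eta)$; these choices and all names in the code are made for illustration only.

```python
import math

# Gamma matrices in the Dirac representation (an assumed, conventional choice).
GAMMA0 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]
GAMMA3 = [[0, 0, 1, 0], [0, 0, 0, -1], [-1, 0, 0, 0], [0, 1, 0, 0]]
IDENTITY = [[1 if i == j else 0 for j in range(4)] for i in range(4)]

def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def combine(ca, a, cb, b):
    """Linear combination ca*a + cb*b of two 4x4 matrices."""
    return [[ca * a[i][j] + cb * b[i][j] for j in range(4)] for i in range(4)]

def max_abs(a):
    """Largest absolute value among the entries of a matrix."""
    return max(abs(x) for row in a for x in row)

def projectors(eta):
    """Return (P_plus, P_minus) for v = (cosh eta, 0, 0, sinh eta)."""
    v0, v3 = math.cosh(eta), math.sinh(eta)
    # v-slash = v_mu gamma^mu = v^0 gamma^0 - v^3 gamma^3 with metric (+,-,-,-)
    v_slash = combine(v0, GAMMA0, -v3, GAMMA3)
    p_plus = combine(0.5, IDENTITY, 0.5, v_slash)
    p_minus = combine(0.5, IDENTITY, -0.5, v_slash)
    return p_plus, p_minus

if __name__ == "__main__":
    p_plus, p_minus = projectors(0.8)
    print(max_abs(combine(1, matmul(p_plus, p_plus), -1, p_plus)))  # P+^2 - P+
    print(max_abs(matmul(p_plus, p_minus)))                         # P+ P-
```

Both printed numbers vanish up to rounding errors; for $\eta = 0$ (rest frame of the heavy quark) $P_+$ reduces to the projector on the two upper components in this representation.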
$$v\cdot D \sim \Lambda_{\rm QCD} \ll m_Q , \tag{8.47}$$
L∞
where $F_{ij}$ is the QCD field strength tensor, and $M_{ij} \equiv \tfrac{i}{4}\,[\gamma_i,\gamma_j]$ are the generators of
the Poincaré algebra in the spin 1/2 representation (Latin indices i, j run only over the
spatial components transverse to the velocity). The first term $\mathcal{L}_\infty$ is the only one that
survives in the limit of infinite quark mass. The terms of order $m_Q^{-1}$ can be interpreted
respectively as the contribution of the transverse motion to the kinetic energy and the
interaction between the spin of the quark and the chromo-magnetic field.
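Heavy quark symmetry : The leading term in the effective Lagrangian, $\mathcal{L}_\infty$, corresponds
to the following Feynman rules:
$$\text{[heavy quark propagator, momentum } p\text{, colour indices } i, j\text{]} = \frac{i\,\delta_{ij}\,P_+}{v\cdot p + i0^+} ,
\qquad
\text{[heavy quark-gluon vertex, gluon indices } a, \mu\text{]} = -i\,g\,v^\mu\,\big(t^a_r\big)_{ij} .$$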
Since there are no Dirac matrices in the expression of the vertex, the interactions with
gluons do not alter the spin of the heavy quark at order $m_Q^0$. More formally, since
$\mathcal{L}_\infty$ does not contain Dirac matrices, it is invariant under
$$q_v \to e^{i\,\theta^i S^i}\, q_v , \quad\text{with}\quad S^i \equiv \frac12 \begin{pmatrix} \sigma^i & 0 \\ 0 & \sigma^i \end{pmatrix} . \tag{8.50}$$
(The $\sigma^i$ are the Pauli matrices.) Since we have $[S^i, S^j] = i\,\epsilon^{ijk} S^k$, this corresponds
to an SU(2) invariance of $\mathcal{L}_\infty$. Moreover, since $\mathcal{L}_\infty$ is independent of $m_Q$, all heavy
quarks play the same role. With $N_f$ flavours of heavy quarks, the leading effective
Lagrangian
$$\mathcal{L}_\infty = \sum_{f=1}^{N_f} \bar{q}_v^{\,f}\; i\, v\cdot D\; q_v^{\,f} \tag{8.51}$$
has an $SU(2N_f)$ symmetry, that constitutes the spin-flavour heavy quark symmetry.
These symmetries are broken by the corrections in $m_Q^{-1}$, since they depend explicitly
on the mass and contain Dirac matrices.
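The commutation relation $[S^i, S^j] = i\,\epsilon^{ijk} S^k$ quoted after eq. (8.50) can be verified by brute force. The sketch below builds the 4x4 block-diagonal generators from the standard Pauli matrices and checks the algebra; the helper names are hypothetical and chosen only for this illustration.

```python
# Standard Pauli matrices sigma^1, sigma^2, sigma^3.
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def spin_generator(i):
    """S^i = (1/2) diag(sigma^i, sigma^i) as a 4x4 matrix, cf. eq. (8.50)."""
    s = PAULI[i]
    gen = [[0j] * 4 for _ in range(4)]
    for a in range(2):
        for b in range(2):
            gen[a][b] = 0.5 * s[a][b]
            gen[a + 2][b + 2] = 0.5 * s[a][b]
    return gen

def matmul(a, b):
    """Product of two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def commutator(a, b):
    """Matrix commutator [a, b] = a b - b a."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(4)] for i in range(4)]

def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

def algebra_deviation():
    """Largest entry of [S^i, S^j] - i eps^{ijk} S^k over all i, j."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            lhs = commutator(spin_generator(i), spin_generator(j))
            for r in range(4):
                for c in range(4):
                    rhs = sum(1j * levi_civita(i, j, k) * spin_generator(k)[r][c]
                              for k in range(3))
                    worst = max(worst, abs(lhs[r][c] - rhs))
    return worst

if __name__ == "__main__":
    print(algebra_deviation())
```

A vanishing deviation confirms that the three $S^i$ close into an su(2) algebra, which is the statement used in the text to identify the spin symmetry of $\mathcal{L}_\infty$.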
these constituents are in fact quantum fluctuations, but their long lifetime (compared
to the interaction time) allows one to treat them as on-shell. Moreover, these parton
distributions must vary with the resolution scale (in space and time) with which the
proton is probed, since a smaller resolution scale will resolve more partons in the
measurement.
Figure 8.3: Cartoon of the fluctuations inside a nucleon. The shaded strip indicates
the time resolution of some external probe. Top: slow nucleon. Bottom: boosted
nucleon. All the internal time scales are dilated by a Lorentz factor, and new virtual
fluctuations become accessible to the probe.
$$x^+ \equiv \frac{x^0 + x^3}{\sqrt{2}} , \qquad x^- \equiv \frac{x^0 - x^3}{\sqrt{2}} . \tag{8.52}$$
(The remaining two coordinates are the transverse coordinates x⊥ .) Similar definitions
can be introduced for 4-momenta. These coordinates have the virtue of transforming
very simply under boosts in the z direction, since x± just undergo a rescaling:
$$x^+ \to e^{\omega}\, x^+ , \qquad x^- \to e^{-\omega}\, x^- , \qquad x_\perp \to x_\perp . \tag{8.53}$$
To sort the constituents by their longitudinal momentum, the most convenient
variable is rapidity, defined as $y \equiv \tfrac12 \ln(p^+/p^-)$, since it is shifted by an additive
constant under a boost in the z direction. By definition, $y = 0$ (i.e. $p_z = 0$)
corresponds to objects with no longitudinal momentum in the observer’s frame.
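The rescaling of eq. (8.53) and the additive shift of the rapidity can be illustrated with a few lines of code. This is a minimal sketch in which a boost of parameter `omega` along z is written in terms of hyperbolic functions; the function names and the sample momentum are assumptions made for the example, and the momentum is taken time-like or light-like with $p^0 > |p^3|$ so that the logarithm is defined.

```python
import math

def light_cone(p0, p3):
    """Light-cone components (p^+, p^-) as defined in eq. (8.52)."""
    return (p0 + p3) / math.sqrt(2), (p0 - p3) / math.sqrt(2)

def boost_z(p0, p3, omega):
    """Boost along the z axis with boost parameter omega."""
    c, s = math.cosh(omega), math.sinh(omega)
    return c * p0 + s * p3, s * p0 + c * p3

def rapidity(p0, p3):
    """y = (1/2) ln(p^+ / p^-); requires p0 > |p3|."""
    p_plus, p_minus = light_cone(p0, p3)
    return 0.5 * math.log(p_plus / p_minus)

if __name__ == "__main__":
    p0, p3, omega = 5.0, 3.0, 0.7
    q0, q3 = boost_z(p0, p3, omega)
    print(light_cone(q0, q3)[0] / light_cone(p0, p3)[0], math.exp(omega))
    print(rapidity(q0, q3) - rapidity(p0, p3), omega)
```

Each printed pair should agree: the plus component is multiplied by $e^{\omega}$ and the rapidity is shifted by $\omega$.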
Quantum fluctuations with a large positive rapidity appear to the observer as nearly
on-shell constituents. At the largest rapidities (corresponding to the total pz of the
hadron), there are few constituents, mostly the valence quarks. Because of their large
longitudinal momentum, the dynamics of these constituents is considerably slowed
down by time dilation, and therefore they appear static to the observer. The only
relevant information about these fast partons is the colour current they carry. This
current is longitudinal, and because these constituents are static, it does not depend
on the light-cone variable$^9$ $x^+$ and takes the following form:
$$J^\mu_a(x) \equiv \delta^{\mu +}\, \rho_a(x^-, x_\perp) , \tag{8.54}$$
where the function $\rho_a$ is the spatial distribution of colour charge. For a high energy
hadron, Lorentz contraction implies that the $x^-$ dependence of this function is sharply
peaked around $x^- \approx 0$. On the other hand, the $x_\perp$ dependence reflects the distribution
of the constituents of the hadron in the plane transverse to the collision axis. Since
this depends on the particular spatial arrangement of the constituents at the time of
the collision, the function $\rho_a(x^-, x_\perp)$ is not known and may be considered as a
random variable with a probability distribution $W[\rho]$. When one repeats many similar
collisions, the expectation value of an observable is obtained by a functional average
$$\langle \mathcal{O} \rangle = \int \mathcal{D}\rho\; W[\rho]\; \mathcal{O}[\rho] , \tag{8.55}$$
where $\mathcal{O}[\rho]$ is the value of this observable calculated with an arbitrary instance of the
distribution $\rho_a$.
In contrast, the constituents that lie at small rapidity in the observer’s frame have
a time evolution that cannot be neglected. These modes are thus described according
9 The evolution in $x^+$ is generated by the component $P^-$ of the momentum. However, for massless
on-shell modes, we have $P^- = P_\perp^2/(2P^+) \to 0$ for the fast moving modes in the z direction.
Figure 8.4: Degrees of freedom in the Colour Glass Condensate effective description
of a high energy hadron (labels in the figure: fields, sources).
to the original Yang-Mills action, as illustrated in figure 8.4. Moreover, due to
the hierarchy between the longitudinal momenta of the modes described as a colour
current and those described as regular gauge fields, the coupling between them may
be approximated as eikonal, i.e. by a term of the form $J_\mu A^\mu$, and therefore the action
of the effective theory reads
$$S = \int d^4x \left( -\frac14\, F^a_{\mu\nu} F^{\mu\nu a} + J^\mu_a A^a_\mu \right) . \tag{8.56}$$
This effective theory is called the Colour Glass Condensate.
Power counting in the saturation regime : The power counting for the graphs that
appear in this effective theory is a bit peculiar in the saturation regime. Indeed, this
situation corresponds to a gluon occupation number of order $g^{-2}$, which is achieved
with a colour current of order $g^{-1}$. The order of a connected graph $G$ with $n_E$ external
gluons, $n_L$ loops and $n_J$ insertions of the colour current is given by
$$G \sim g^{-2}\; g^{n_E}\; g^{2 n_L}\; (gJ)^{n_J} , \tag{8.57}$$
where $J$ denotes the typical magnitude of the current. Thus, in the saturated regime
where $J \sim g^{-1}$, the magnitude of connected graphs does not depend on $n_J$, which
means that all observables depend non-perturbatively on the colour current. In contrast,
the loop expansion still corresponds to an expansion in powers of $g^2$. Observables at
tree-level are given by an infinite sum of tree diagrams (corresponding to an arbitrary
$$A^\mu \sim g^{-1} , \tag{8.59}$$
which leads to several technical complications. Some of these issues are discussed
in chapter 15. Higher order contributions correspond to loops evaluated in the
presence of this classical field as a background.
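The power counting of eq. (8.57) is simple enough to be encoded in a one-line function. The sketch below returns the power of $g$ carried by a connected graph, assuming that the source scales as $J \sim g^{s}$; the function name and the parameter `source_exponent` (standing for $s$) are hypothetical and introduced only for this illustration.

```python
def coupling_exponent(n_external, n_loops, n_sources, source_exponent=-1):
    """Power of g in g^-2 * g^nE * g^(2 nL) * (g J)^nJ when J ~ g^source_exponent.

    source_exponent = -1 corresponds to the saturated regime J ~ 1/g,
    source_exponent = 0 to a source that does not scale with the coupling.
    """
    return -2 + n_external + 2 * n_loops + n_sources * (1 + source_exponent)

if __name__ == "__main__":
    # Saturated regime: the order does not depend on the number of sources.
    print([coupling_exponent(1, 0, n) for n in (1, 5, 50)])
    # Each additional loop costs a factor g^2.
    print(coupling_exponent(1, 1, 5) - coupling_exponent(1, 0, 5))
    # Source of order one: every extra insertion costs a power of g.
    print([coupling_exponent(1, 0, n, source_exponent=0) for n in (1, 2, 3)])
```

With one external gluon at tree level, the exponent is -1 for any number of source insertions in the saturated regime, which is the scaling quoted in eq. (8.59).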
$$\delta\mathcal{O}_{\rm NLO}[\rho] = y_{\rm cut}\; H\; \mathcal{O}_{\rm LO}[\rho] + \text{terms that do not depend on } y_{\rm cut} , \tag{8.60}$$
10 Inclusive observables are measurements for which one sums over all the possible final states without
excluding any of them. For instance, the average particle multiplicity in the final state is an inclusive
observable, while the probability of producing exactly 3 particles is not.
where $H$ is a universal (i.e. the same for all inclusive observables) operator containing
second order derivatives with respect to $\rho_a$. An important property of this operator is
that it is self-adjoint:
$$\int [\mathcal{D}\rho]\; A[\rho]\, \big( H\, B[\rho] \big) = \int [\mathcal{D}\rho]\; \big( H\, A[\rho] \big)\, B[\rho] . \tag{8.61}$$
However, the cutoff is not a physical parameter, since it was just introduced by
hand in order to separate the two types of degrees of freedom, and therefore it should
not appear in physical quantities. The way out of this situation is to realize that by
changing the value of the cutoff, one is also modifying which modes are described
by the colour current Jµ . Consequently, the distribution W[ρ] should in fact depend
on ycut . Using eqs. (8.60) and (8.61), we see immediately that the cutoff dependence
coming from the loop correction to observables can be canceled if we also change
More precisely, this substitution cancels the linear dependence on $y_{\rm cut}$. A more
rigorous procedure is to apply it to an infinitesimal variation $\delta y_{\rm cut}$ of the cutoff, for
which the quadratic terms are truly negligible. By doing so, the change of eq. (8.62)
becomes a differential equation
$$\frac{\partial W[\rho]}{\partial y_{\rm cut}} = -\, H\, W[\rho] , \tag{8.63}$$
that controls how the probability distribution $W[\rho]$ changes as one varies the cutoff
(this equation is called the JIMWLK equation).
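The cancellation can be made explicit in two lines. The following sketch assumes that the leading order observable $\mathcal{O}_{\rm LO}[\rho]$ has no cutoff dependence of its own, and drops the product of $\partial W/\partial y_{\rm cut}$ with the loop correction, which is of higher order. It uses only eqs. (8.55), (8.60), (8.61) and (8.63):

```latex
\begin{aligned}
\frac{\partial}{\partial y_{\rm cut}} \langle \mathcal{O} \rangle
&= \int [\mathcal{D}\rho]\, \Big\{ \frac{\partial W[\rho]}{\partial y_{\rm cut}}\, \mathcal{O}_{\rm LO}[\rho]
   + W[\rho]\, \big( H\, \mathcal{O}_{\rm LO}[\rho] \big) \Big\} \\
&= \int [\mathcal{D}\rho]\, \Big\{ -\big( H\, W[\rho] \big) + \big( H\, W[\rho] \big) \Big\}\, \mathcal{O}_{\rm LO}[\rho] = 0 .
\end{aligned}
```

The first term of the second line comes from eq. (8.63), and the second one from moving $H$ onto $W[\rho]$ with the self-adjointness property (8.61).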
bosons of the (approximate for small but non-zero quark masses) chiral symmetry SU(2) × SU(2) that
exists in the light quark (u and d) sector of quantum chromodynamics. This symmetry is spontaneously
broken to a residual SU(2) symmetry in the vacuum of QCD, leading to the appearance of three nearly
massless scalar particles.
assumed to be invariant under the global action of a Lie group G. The metric of
d-dimensional spacetime is chosen to be Minkowskian (but this discussion is equally
applicable to Euclidean space). In addition, the potential $V(\phi)$ has non-trivial minima
at some $\phi \neq 0$. Due to the G-invariance of the action, the non-trivial minima cannot
be unique. Given a certain minimum $\phi_c$, all the field configurations that may be
reached from $\phi_c$ by the action of G are also minima. If we assume that there are no
accidental (i.e. not caused by the symmetry of the action) degeneracies of the minima,
the set of all minima can therefore be written as
$$\mathcal{M}_0 \equiv \big\{ g\,\phi_c \;\big|\; g \in G \big\} . \tag{8.65}$$
If H is the subgroup of G that leaves $\phi_c$ invariant (sometimes called the stabilizer of
$\phi_c$), then $\mathcal{M}_0$ is also the coset G/H.
Thus, the field $\phi$ given by eq. (8.66) is not really a function of the full group G, but
depends only on elements of the coset G/H. Let us now split the generators $t^a$ of the
Lie algebra g into those (for $a > n$) that correspond to h, and the complement (for
$1 \le a \le n$). From the definition of H as the stabilizer of $\phi_c$, we have
$$1 \le a \le n :\quad t^a_{ij}\,\phi_{cj} \neq 0 , \qquad\qquad a > n :\quad t^a_{ij}\,\phi_{cj} = 0 . \tag{8.68}$$
Thus, the matrix R(g) can be written as
$$R(\theta) = \exp\left( i \sum_{a=1}^{n} \theta^a\, t^a \right) . \tag{8.69}$$
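The value of the potential does not change under the action of G on $\phi_c$, and we are
free to choose the value of its minimum to be $V(\phi_c) = 0$. Thus, the action becomes
$$\begin{aligned}
S &= \frac12 \int d^dx\; \phi_{ci}\, \big( \partial_\mu R^{-1}_{ik}(\theta) \big) \big( \partial^\mu R_{kj}(\theta) \big)\, \phi_{cj} \\
  &= -\frac12 \int d^dx\; \phi_{ci}\, \big[ A_\mu(\theta) A^\mu(\theta) \big]_{ij}\, \phi_{cj} ,
\end{aligned} \tag{8.70}$$
where in the second expression we have introduced $A_\mu \equiv R^{-1} \partial_\mu R$ (an element of
the algebra).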
Eq. (8.70) gives the action in terms of the “coordinates” $\theta^a$ on the coset G/H,
corresponding to a certain choice of the generators $t^a$. However, it is interesting to
express the action in terms of a completely arbitrary system of coordinates on G/H,
that we may denote $\vartheta^m$. Since eq. (8.70) has only two derivatives $\partial_\mu \cdots \partial^\mu$, the same
must be true of its expression in any system of coordinates. On the other hand, it may
contain terms of arbitrarily high degree in $\vartheta$. Thus, the most general action is of the
form
$$S = \frac12 \int d^dx\; g_{mn}(\vartheta)\, \partial_\mu \vartheta^m\, \partial^\mu \vartheta^n , \tag{8.71}$$
where the coefficients $g_{mn}(\vartheta)$ can be related to $R(\theta)$ as follows:
$$g_{mn}(\vartheta) \equiv -4\, \phi_{ci}\, t^a_{ik}\, t^b_{kj}\, \phi_{cj}\;
{\rm tr}\Big( t^a R^{-1} \frac{\partial R}{\partial \vartheta^m} \Big)\,
{\rm tr}\Big( t^b R^{-1} \frac{\partial R}{\partial \vartheta^n} \Big) . \tag{8.72}$$
They form a metric tensor on G/H, if the coset is viewed as a Riemannian manifold.
Indeed, if we use a different system of coordinates $\varpi^p$ on G/H, $g_{mn}(\vartheta)$ would be
replaced by
$$\begin{aligned}
g_{pq}(\varpi) &\equiv -4\, \phi_{ci}\, t^a_{ik}\, t^b_{kj}\, \phi_{cj}\;
{\rm tr}\Big( t^a R^{-1} \frac{\partial R}{\partial \varpi^p} \Big)\,
{\rm tr}\Big( t^b R^{-1} \frac{\partial R}{\partial \varpi^q} \Big) \\
&= g_{mn}(\vartheta)\, \frac{\partial \vartheta^m}{\partial \varpi^p}\, \frac{\partial \vartheta^n}{\partial \varpi^q} ,
\end{aligned} \tag{8.73}$$
which is indeed the expected transformation law of a metric tensor under a change
of coordinates. The field theory described by the action (8.71) is called a non-linear
sigma model. Note that the derivative $\partial_\mu \vartheta^m$ of the coordinate $\vartheta^m$ is a vector that
lives on the tangent space to the manifold G/H at the point $\vartheta$. Therefore, the action
(8.71), in which the tensor $g_{mn}$ is contracted with two vectors, is a scalar, invariant
under changes of coordinates on the manifold.
The Taylor expansion of the metric in powers of the field ϑ determines which
couplings exist in the classical action. Interestingly, even though the kinetic term of
the original action was quadratic in the fields, we now have a term with two derivatives
and possibly arbitrarily high orders in the field. Loosely speaking, this is due to the
fact that spontaneous symmetry breaking has restricted the fields from a space $\mathbb{R}^n$ in
which the symmetry G was linearly realized, down to a curved manifold in which it is
realized non-linearly. In addition, it is worth stressing that the final action is uniquely
determined from eq. (8.69), but may take various explicit forms depending on the
choice of coordinates ϑm on G/H. In other words, the non-linear sigma model has
an intrinsic geometrical meaning, that does not depend on the system of coordinates
one uses.
Path integral quantization : The quantization of the non-linear sigma model can
be achieved via path integration. The action is quadratic in derivatives of the field, but
with the unusual feature that these derivatives are multiplied by a function of the field.
In order to ascertain the consequence of this property, it is necessary to start from the
Hamilton formulation of the path integral, and to perform explicitly the integral over
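the conjugate momenta. For a Lagrangian density
$$\mathcal{L} = \frac12\, g_{mn}(\vartheta)\, \partial_\mu \vartheta^m\, \partial^\mu \vartheta^n , \tag{8.74}$$
the conjugate momenta read
$$\pi_m \equiv \frac{\partial \mathcal{L}}{\partial\, \partial_0 \vartheta^m} = g_{mn}(\vartheta)\, \partial_0 \vartheta^n , \tag{8.75}$$
and the Hamiltonian is given by
$$\mathcal{H} = \pi_m\, \partial_0 \vartheta^m - \mathcal{L} = \frac12\, g^{mn}(\vartheta)\, \pi_m \pi_n + \frac12\, g_{mn}(\vartheta)\, \nabla \vartheta^m \cdot \nabla \vartheta^n , \tag{8.76}$$
where $g^{mn}$ is the inverse of the metric tensor, $g^{mn} g_{np} = \delta^m{}_p$. The Hamiltonian is
quadratic in the momenta, but since the coefficient in front of $\pi_m \pi_n$ depends on the
field, the determinant produced in the Gaussian integration over the momenta cannot
be disregarded. After this integral has been performed, the generating functional is
given by the following formula
$$Z[j_m] = \int \Big[ \sqrt{g(x)}\, \prod_m \mathcal{D}\vartheta^m(x) \Big]\; \exp\Big\{ \frac{i}{\hbar} \int d^dx\; \big( \mathcal{L}(\vartheta) + j_m \vartheta^m \big) \Big\} , \tag{8.77}$$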
where we denote $g(x) \equiv \det\big( g_{mn}(\vartheta(x)) \big)$. Interestingly, the field dependence of
$g_{mn}(\vartheta)$ alters the path integral in a rather natural way: the measure $\prod_m \mathcal{D}\vartheta^m$ is
replaced by $\sqrt{g}\, \prod_m \mathcal{D}\vartheta^m$, which is invariant under changes of coordinates on the
manifold G/H.
Figure 8.7: Perturbative expansion in the non-linear sigma model: only field configurations
near $\phi_c$ on the manifold G/H are explored.
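The invariance of the measure follows from the transformation law (8.73) of the metric. As a brief sketch, written at a fixed spacetime point $x$ and for an orientation-independent change of coordinates $\vartheta \to \varpi$ (the notation $\det(\partial\vartheta/\partial\varpi)$ for the Jacobian is introduced here for convenience):

```latex
\begin{aligned}
g_{pq}(\varpi) &= g_{mn}(\vartheta)\, \frac{\partial \vartheta^m}{\partial \varpi^p}\, \frac{\partial \vartheta^n}{\partial \varpi^q}
\quad\Longrightarrow\quad
\sqrt{\det g_{pq}(\varpi)} = \Big| \det \frac{\partial \vartheta}{\partial \varpi} \Big|\, \sqrt{\det g_{mn}(\vartheta)} , \\
\prod_m d\vartheta^m &= \Big| \det \frac{\partial \vartheta}{\partial \varpi} \Big|\, \prod_p d\varpi^p
\quad\Longrightarrow\quad
\sqrt{\det g_{mn}(\vartheta)}\, \prod_m d\vartheta^m = \sqrt{\det g_{pq}(\varpi)}\, \prod_p d\varpi^p .
\end{aligned}
```

The Jacobian of the change of variables is thus exactly compensated by the transformation of $\sqrt{g}$, point by point in spacetime.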
Note that in eq. (8.77), we have introduced an explicit $\hbar$, that will be useful later
to keep track of the number of loops. The perturbative expansion in the non-linear
sigma model corresponds to an expansion in powers of $\hbar$. From the path integral, we
can infer that the typical field amplitudes scale as
$$\vartheta \sim \sqrt{\hbar} , \tag{8.78}$$
which means that the perturbative expansion is also an expansion around $\vartheta = 0$ (i.e.
around $\phi = \phi_c$). For such small fields, the effects of the curvature of the manifold are
perturbative, and we can expand the metric tensor in powers of the field (an explicit
choice of coordinates must be made for this). The bare propagator of the $\vartheta$ fields is
given by
$$G_{mn}(p) = \frac{i\, \delta_{mn}}{p^2 + i0^+} . \tag{8.79}$$
Renormalization : Dimensional analysis tells us that the field $\vartheta$ has the dimension
$$\vartheta \sim (\text{mass})^{(d-2)/2} \tag{8.80}$$
(in a system of units where $\hbar = 1$). From this, we see that there are three cases
regarding the ultraviolet power counting in the non-linear sigma model:
• d < 2 : the Taylor coefficients of the metric tensor all have a positive mass
dimension, and are therefore super-renormalizable.
• d = 2 : the Taylor coefficients are dimensionless, and the theory is renormalizable.
• d > 2 : the Taylor coefficients have a negative mass dimension and are all
non-renormalizable by power counting.
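The three cases above follow from a one-line dimensional count: if the leading term of the metric is dimensionless, the coefficient of $\vartheta^k$ in its Taylor expansion must have mass dimension $-k(d-2)/2$ for the action to be dimensionless. The sketch below encodes this count; the function names and the classification labels are hypothetical and only serve the illustration.

```python
from fractions import Fraction

def field_dimension(d):
    """Mass dimension of the field in d spacetime dimensions, cf. eq. (8.80)."""
    return Fraction(d - 2, 2)

def coefficient_dimension(d, k):
    """Mass dimension of the coefficient of theta^k in the Taylor expansion of the metric."""
    return -k * field_dimension(d)

def classify(d):
    """Power-counting class of the couplings, based on the sign of their mass dimension."""
    dim = coefficient_dimension(d, 1)
    if dim > 0:
        return "super-renormalizable"
    if dim == 0:
        return "renormalizable"
    return "non-renormalizable"

if __name__ == "__main__":
    for d in (1, 2, 3, 4):
        print(d, coefficient_dimension(d, 2), classify(d))
```

Only the sign of the dimension matters for the classification, so checking the linear coefficient ($k = 1$) is enough: all higher coefficients have a dimension of the same sign.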
The most interesting situation is therefore the two-dimensional case. It differs some-
what from the renormalization of the quantum field theories we have encountered
until now, since the action contains an infinite series of terms (of increasing degree
in ϑ), and an important question is whether the action (8.71) conserves its structure
under renormalization.
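Recall that the fields $\vartheta^m$ transform under a non-linear representation of the group
G. Thus, their variation under an infinitesimal transformation of parameters $\epsilon^a$ may
be written as
$$\delta\vartheta^m \equiv \epsilon^a\, T_a^m(\vartheta) , \tag{8.81}$$
where the $T_a^m(\vartheta)$ are smooth functions of the fields. Under the same transformation,
the variation of the action reads
$$\delta S = \epsilon^a \int d^2x\; T_a^m(\vartheta)\, \frac{\delta S}{\delta \vartheta^m(x)} , \tag{8.82}$$
and the invariance of the action under G thus requires that
$$T_a^p\, \frac{\partial g_{mn}}{\partial \vartheta^p} + g_{pn}\, \frac{\partial T_a^p}{\partial \vartheta^m} + g_{pm}\, \frac{\partial T_a^p}{\partial \vartheta^n} = 0 . \tag{8.83}$$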
In other words, the possible forms of the metric tensor are constrained by the symmetry
G. Indeed, the coset G/H is a homogeneous space$^{12}$, i.e. a manifold that possesses
additional symmetries that reduce the dimension of the space of allowed metrics.
More precisely, a homogeneous space is such that given any pair of points $\vartheta$ and
$\vartheta'$ on the manifold, there is an isometry (i.e. a distance preserving transformation)
that maps $\vartheta$ to $\vartheta'$. If in addition the space is isotropic, then it is said to be maximally
symmetric$^{13}$. In an N-dimensional maximally symmetric space, there is a particularly
simple relationship between the metric and curvature tensors:
$$\begin{aligned}
R_{mn} &= \frac{R}{N}\, g_{mn} \qquad (R \equiv R^m{}_m) , \\
R_{mnpq} &= \frac{R}{N(N-1)}\, \big( g_{mp}\, g_{nq} - g_{mq}\, g_{np} \big) .
\end{aligned} \tag{8.84}$$
12 Thanks to their connections to Lie algebras, a systematic classification of homogeneous spaces is
possible.
13 A maximally symmetric manifold of dimension N has N(N + 1)/2 distinct isometries. In Euclidean
space, this corresponds to N translations and N(N − 1)/2 rotations, but this maximal number of isometries
is the same in N-dimensional manifolds with curvature.
(These two identities imply that the scalar curvature R is constant over the entire
manifold for a dimension N > 2.)
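The two relations of eq. (8.84) are consistent with each other, as a direct contraction shows. The short check below assumes that the Ricci tensor is obtained by contracting the first and third indices of the Riemann tensor, $R_{nq} \equiv g^{mp} R_{mnpq}$ (other index conventions differ by signs):

```latex
\begin{aligned}
g^{mp} R_{mnpq}
&= \frac{R}{N(N-1)}\, \big( g^{mp} g_{mp}\, g_{nq} - g^{mp} g_{mq}\, g_{np} \big)
 = \frac{R}{N(N-1)}\, \big( N\, g_{nq} - g_{nq} \big)
 = \frac{R}{N}\, g_{nq} , \\
g^{nq} R_{nq} &= \frac{R}{N}\, g^{nq} g_{nq} = R .
\end{aligned}
```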
A possible strategy for studying the renormalization of the sigma model is to
introduce an analogue of the BRST transformation of non-Abelian gauge theories,
and the associated Slavnov-Taylor identities obeyed by the quantum effective action.
These identities, combined with dimensional and symmetry arguments that restrict
the terms that may arise in the renormalized action, are sufficient to show that the
renormalized action is structurally identical to eq. (8.71), with a group-invariant
metric tensor that obeys a renormalized version of eq. (8.83).
where $\sigma$ has one component and $\xi$ has $n-1$ components. Assuming the parameters
of the potential are adjusted so that the sphere $S^{n-1}$ of minima has radius $|\phi| = 1$,
we must impose the constraint $\sigma^2 + \xi^2 = 1$, which means that $\sigma$ may be viewed as a
dependent field that depends non-linearly on $\xi$. Usually, these coordinates are chosen
in such a way that the symmetry-breaking vacuum is $\phi_c = (\sigma = 1, \xi = 0)$. In the
vicinity of $\phi_c$, $\sigma$ is the “radial” massive field, while the $\xi^i$ are the “angular” variables
corresponding to the massless Nambu-Goldstone bosons.
Then, we may split the generators of the o(n) algebra into those of the stabilizer
o(n − 1) and the complementary set of generators:
• The generators of o(n−1) act linearly on $\xi$. More precisely, they leave $\sigma^2 + \xi^2$
invariant by leaving both $\sigma$ and $\xi^2$ unchanged (thus simply rotating the n − 1
components of $\xi$).
• In contrast, the generators of the complementary set preserve $\sigma^2 + \xi^2$, but mix
$\sigma$ and $\xi$ as follows:
$$\sigma \to \sigma - \epsilon^i\, \xi^i , \qquad \xi^i \to \xi^i + \epsilon^i \sqrt{1 - \xi^2} , \tag{8.86}$$
Figure 8.8: Illustration of the $(\sigma, \xi)$ coordinates for an O(3) model (axes: $\sigma$, $\xi^1$, $\xi^2$).
The dark circle corresponds to the transformations that preserve $\sigma$ and act linearly on $\xi$
(as an O(2) rotation). The light colored circles are the transformations that mix $\sigma$ and $\xi$
(and transform the latter non-linearly).
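The transformation (8.86) preserves the constraint $\sigma^2 + \xi^2 = 1$ only to first order in the infinitesimal parameters $\epsilon^i$, as expected for the action of a generator. The following sketch makes this visible numerically; the function names and the sample values of $\xi$ and $\epsilon$ are assumptions chosen for the example.

```python
import math

def broken_transformation(xi, eps):
    """Apply eq. (8.86) to a point of the sphere parametrized by xi (with sigma > 0)."""
    sigma = math.sqrt(1.0 - sum(x * x for x in xi))
    new_sigma = sigma - sum(e * x for e, x in zip(eps, xi))
    new_xi = [x + e * sigma for x, e in zip(xi, eps)]
    return new_sigma, new_xi

def constraint_violation(xi, eps):
    """Value of sigma^2 + xi^2 - 1 after the transformation."""
    sigma, new_xi = broken_transformation(xi, eps)
    return sigma * sigma + sum(x * x for x in new_xi) - 1.0

if __name__ == "__main__":
    xi = [0.3, 0.4]
    for scale in (1e-2, 1e-3, 1e-4):
        eps = [scale, 2.0 * scale]
        print(scale, constraint_violation(xi, eps))
```

Dividing $\epsilon$ by 10 divides the violation by 100: the leftover is quadratic in $\epsilon$, so the constraint is preserved at linear order.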
We have derived the non-linear sigma model as the effective action that describes the
dynamics of the massless Nambu-Goldstone bosons after a spontaneous breaking of
symmetry. In this case, the fields of the non-linear sigma model live on a manifold
which is also a homogeneous space thanks to the symmetries of the original problem.
These symmetries severely constrain the possible forms of the metric, and play an
important role in constraining the form of the loop corrections.
However, it is possible to consider an action of the form (8.71) for fields ϑm living
on a generic smooth Riemannian manifold that does not possess any special symmetry.
The power counting argument made earlier is unchanged, and we expect that this
more general kind of sigma model is also renormalizable in 2 dimensions. For these
generalized models, it has been shown that the dependence of the metric tensor (i.e.
the function that defines all the couplings of the model) on the renormalization scale
$\mu$ is governed by the following Callan-Symanzik equation:
$$\mu\, \frac{\partial g^{mn}}{\partial \mu} = -\frac{1}{2\pi}\, R^{mn} - \frac{1}{8\pi^2}\, R^{mpqr} R^{n}{}_{pqr} + \text{higher orders} . \tag{8.89}$$
Note that if we apply this equation in the case of a maximally symmetric space, for
which the curvature tensors have simple expressions in terms of the metric tensor, it
reduces to
$$\mu\, \frac{\partial g^{mn}}{\partial \mu} = -\frac{R}{2\pi N}\, g^{mn} \left[ 1 + \frac{R}{2\pi N(N-1)} + \cdots \right] . \tag{8.90}$$
Thus, in this special case, the metric is rescaled but retains its form under changes
of scale (because it is constrained by the isometries of the manifold). On a generic
manifold, the scale evolution governed by eq. (8.89) explores a much broader space
of metrics. Generally speaking, the renormalization flow tends to expand the regions
of negative curvature and to shrink those of positive curvature.
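For completeness, here is a sketch of how eq. (8.90) follows from eq. (8.89) when the maximally symmetric forms (8.84) of the curvature tensors are inserted, with indices raised and lowered by the metric (the intermediate steps are a reconstruction, not taken from the text):

```latex
\begin{aligned}
R^{mn} &= \frac{R}{N}\, g^{mn} , \\
R^{mpqr} R^{n}{}_{pqr}
&= \Big[ \frac{R}{N(N-1)} \Big]^2
   \big( g^{mq} g^{pr} - g^{mr} g^{pq} \big) \big( \delta^{n}{}_{q}\, g_{pr} - \delta^{n}{}_{r}\, g_{pq} \big)
 = \frac{2 R^2}{N^2 (N-1)}\, g^{mn} , \\
\mu\, \frac{\partial g^{mn}}{\partial \mu}
&= -\frac{R}{2\pi N}\, g^{mn} - \frac{1}{8\pi^2}\, \frac{2R^2}{N^2(N-1)}\, g^{mn} + \cdots
 = -\frac{R}{2\pi N}\, g^{mn} \Big[ 1 + \frac{R}{2\pi N(N-1)} + \cdots \Big] .
\end{aligned}
```

In the second line, the four terms of the product contribute $N g^{mn}$, $-g^{mn}$, $-g^{mn}$ and $N g^{mn}$ respectively, hence the factor $2(N-1)$.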
Figure 8.9: Left to right : successive stages of the Ricci flow on a 2-dimensional
manifold.
There is an interesting analogy between the renormalization group eq. (8.89) and
the Ricci flow,
14 In 2 dimensions, connected manifolds are known to fall into three geometrical classes: flat, spherical
or hyperbolic, depending on their curvature. More precisely, any such 2-dimensional manifold can be
endowed with a metric that has a constant scalar curvature, either null, positive or negative. Thurston
geometrization conjecture proposed a similar –but much more complicated– classification of 3-dimensional
manifolds. In particular, this conjecture contains as a special case Poincaré’s conjecture, stating that every
closed simply connected 3-dimensional manifold is homeomorphic to a 3-sphere. The geometrization
conjecture was proved in 2003 by Perelman, with techniques in which the Ricci flow plays a central role.
Chapter 9
Quantum anomalies
Noether’s theorem states that for each continuous symmetry of a classical Lagrangian,
there exists a corresponding conserved current. By construction, this conservation law
holds at tree level, and a very important question is whether it is preserved by quantum
corrections in higher orders of the theory. Quantum anomalies are situations where
a classical symmetry is violated by quantum effects. We have already encountered
anomalies in section 3.5, where we saw that the fermionic functional measure is
not invariant under chiral transformations of massless fermions, which had interesting
connections with the index of the Dirac operator (its zero modes in the presence of an
external field).
When such an anomaly arises in a global symmetry like chiral symmetry, its
effect is just to introduce a corrective term into the conservation equation of the
corresponding current (which may have some physical consequences, however).
But when it affects a local gauge symmetry, its effects are devastating, since the
renormalizability and unitarity of gauge theories rely on the validity to all orders of
the gauge symmetry. In general, gauge theories with an anomalous gauge symmetry
do not make sense, and it is therefore of utmost importance to check that no such
gauge anomaly is present in theories of phenomenological relevance.
$$J^\mu \equiv -i e\, \bar\Psi \gamma^\mu \Psi , \qquad \partial_\mu J^\mu = 0 . \tag{9.3}$$
(In the following, this current will be called a vector current.) Being a gauge symmetry,
this invariance is crucial for the unitarity of the theory, since it ensures that longitudinal
photons do not contribute as initial or final states of physical amplitudes.
Because the fermions are massless, this theory has another symmetry. In order to
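see it, let us introduce$^1$ a matrix $\gamma^5$,
$$\gamma^5 = \frac12\, \epsilon_{\mu\nu}\, \gamma^\mu \gamma^\nu = \gamma^0 \gamma^1 , \tag{9.4}$$
where $\epsilon_{\mu\nu}$ is the 2-dimensional completely antisymmetric tensor, normalized by
$\epsilon_{01} = +1$. Using $\gamma^5$, one may decompose $\Psi$ into its left- and right-handed components:
$$\Psi = \Psi_R + \Psi_L , \qquad \Psi_R \equiv \frac{1 + \gamma^5}{2}\, \Psi , \qquad \Psi_L \equiv \frac{1 - \gamma^5}{2}\, \Psi , \tag{9.5}$$
and the fermionic part of the Lagrangian can be rewritten as
$$i\, \bar\Psi \not{D}\, \Psi = i\, \Psi_R^\dagger \gamma^0 \not{D}\, \Psi_R + i\, \Psi_L^\dagger \gamma^0 \not{D}\, \Psi_L . \tag{9.6}$$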
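The absence of left-right mixing in eq. (9.6) can be checked with explicit 2x2 matrices. The sketch below assumes one particular representation of the two-dimensional Dirac algebra, $\gamma^0 = \sigma^1$ and $\gamma^1 = -i\sigma^2$ (real matrices with $(\gamma^0)^2 = 1$ and $(\gamma^1)^2 = -1$); this choice and the helper names are made for illustration, and any equivalent representation would do.

```python
# Assumed representation of the 2-dimensional gamma matrices.
GAMMA0 = [[0, 1], [1, 0]]    # sigma^1
GAMMA1 = [[0, -1], [1, 0]]   # -i sigma^2

def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

# gamma^5 = gamma^0 gamma^1, cf. eq. (9.4); diagonal in this representation.
GAMMA5 = matmul(GAMMA0, GAMMA1)

# Chiral projectors of eq. (9.5).
P_RIGHT = [[(int(i == j) + GAMMA5[i][j]) / 2 for j in range(2)] for i in range(2)]
P_LEFT = [[(int(i == j) - GAMMA5[i][j]) / 2 for j in range(2)] for i in range(2)]

def left_right_mixing(m):
    """Return P_L m P_R, the block that couples Psi_L^dagger to Psi_R."""
    return matmul(matmul(P_LEFT, m), P_RIGHT)

if __name__ == "__main__":
    print(GAMMA5)
    # Kinetic structures gamma^0 gamma^mu: no left-right mixing.
    print(left_right_mixing(matmul(GAMMA0, GAMMA0)))
    print(left_right_mixing(matmul(GAMMA0, GAMMA1)))
    # Mass structure gamma^0: mixes left and right components.
    print(left_right_mixing(GAMMA0))
```

The matrices $\gamma^0\gamma^\mu$ that enter $\bar\Psi \gamma^\mu \Psi = \Psi^\dagger \gamma^0 \gamma^\mu \Psi$ have a vanishing left-right block, whereas $\gamma^0$ alone (the structure of a mass term) does not.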
In other words, the kinetic term does not mix the left and right components (this
would not be true with a mass term). As a consequence, the Lagrangian is invariant if
we multiply the left and right components by independent phases,
Note that this is a global invariance, unlike the gauge symmetry discussed previously.
Equivalently, the massless Dirac Lagrangian is invariant under the following global
transformation,
\[
  \Psi \to e^{i\theta\gamma^5}\,\Psi \; , \tag{9.8}
\]
which amounts to multiplying the left and right components by conjugate phases
(because of the γ5 in the exponential). Since this is a continuous symmetry, Noether’s
theorem also applies here and tells us that the axial current is conserved:
\[
  J_5^\mu \equiv -ie\,\overline{\Psi}\gamma^5\gamma^\mu\Psi \; , \qquad \partial_\mu J_5^\mu = 0 \; . \tag{9.9}
\]
1 It is possible to define γ5 in any even space-time dimension D = 2r, as follows:
\[
  \gamma^5 \equiv \frac{i^{r-1}}{(2r)!}\,\epsilon_{\mu_1\mu_2\cdots\mu_{2r}}\,\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2r}} \; .
\]
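The chiral decomposition above can be checked numerically. The sketch below assumes one particular 2×2 representation of the two-dimensional Dirac matrices (γ0 = σ1, γ1 = iσ2), which the text does not specify, and the helper names are illustrative. It checks that γ5 = γ0γ1 anticommutes with both Dirac matrices, that the kinetic structure does not connect the two chiralities, and that a mass term would.

```python
# Sketch: 2x2 Dirac matrices in D = 2. The representation is an assumption.
def mul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def lin(ca, a, cb, b):
    """Linear combination ca*a + cb*b."""
    n = len(a)
    return [[ca * a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]

def is_zero(a):
    return all(abs(x) < 1e-12 for row in a for x in row)

one = [[1, 0], [0, 1]]
g0 = [[0, 1], [1, 0]]      # gamma^0, squares to +1
g1 = [[0, 1], [-1, 0]]     # gamma^1, squares to -1
g5 = mul(g0, g1)           # gamma^5 = gamma^0 gamma^1, as in eq. (9.4)

P_R = lin(0.5, one, 0.5, g5)    # (1 + gamma^5)/2
P_L = lin(0.5, one, -0.5, g5)   # (1 - gamma^5)/2

# gamma^5 anticommutes with gamma^0 and gamma^1.
anticommutators = [lin(1, mul(g5, g), 1, mul(g, g5)) for g in (g0, g1)]

# Kinetic structure Psi_L^dagger gamma^0 gamma^mu Psi_R vanishes for mu = 0, 1.
kinetic_mixing = [mul(P_L, mul(mul(g0, g), P_R)) for g in (g0, g1)]

# Mass structure Psi_L^dagger gamma^0 Psi_R does not vanish.
mass_mixing = mul(P_L, mul(g0, P_R))

print(all(is_zero(m) for m in anticommutators))
print(all(is_zero(m) for m in kinetic_mixing))
print(is_zero(mass_mixing))
```

In this representation the projectors are real and diagonal, so they are their own Hermitian conjugates, which is why `P_L` can stand in for the conjugated projector.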
Figure 9.1: Left: 1-loop contribution to the vector current in a background gauge
potential (the wavy line terminated by a cross represents the background field).
Right: 1-loop contribution to the axial current.
The conservation laws (9.3) and (9.9) have been obtained with Noether’s theorem,
from the fact that the classical Lagrangian possesses certain continuous symmetries.
Let us now study how the vector and axial currents are modified at 1-loop. Here, we
consider a fixed configuration of the gauge potential Aµ (x), that acts as a background
external field (this also means that the photon kinetic term plays no role in this
discussion). The lowest order 1-loop graphs that contribute to these currents are
shown in the figure 9.1. The expectation values of the currents resulting from these
graphs can be written as
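\[
  \tilde J^\mu(q) = \Pi^{\mu\nu}(q)\,\tilde A_\nu(q) \; , \qquad \tilde J_5^\mu(q) = \Pi_5^{\mu\nu}(q)\,\tilde A_\nu(q) \; , \tag{9.10}
\]
(the tilde denotes the Fourier transform of the external field) where the self-energies
$\Pi^{\mu\nu}$ and $\Pi_5^{\mu\nu}$ are given by
\[
\begin{aligned}
  i\,\Pi^{\mu\nu}(q) &\equiv e^2 \int \frac{d^Dk}{(2\pi)^D}\; \frac{\mathrm{tr}\big(\gamma^\mu\,\slashed{k}\,\gamma^\nu\,(\slashed{k}+\slashed{q})\big)}{(k^2+i0^+)((k+q)^2+i0^+)} \; , \\
  i\,\Pi_5^{\mu\nu}(q) &\equiv e^2 \int \frac{d^Dk}{(2\pi)^D}\; \frac{\mathrm{tr}\big(\gamma^5\gamma^\mu\,\slashed{k}\,\gamma^\nu\,(\slashed{k}+\slashed{q})\big)}{(k^2+i0^+)((k+q)^2+i0^+)} \; .
\end{aligned} \tag{9.11}
\]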
(The only difference between them is the γ5 inside the trace, that comes from the
definition of the axial current). In order to secure the subsequent manipulations, let
us assume that some regularization has been performed on the momentum integrals,
without specifying it for now. The denominators can be arranged into a single factor
by using Feynman’s parameterization,
\[
  \frac{1}{(k^2+i0^+)((k+q)^2+i0^+)} = \int_0^1 dx\; \frac{1}{(l^2+\Delta(x))^2} \; , \tag{9.12}
\]
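\[
  \Pi^{\mu\nu}(q) = A(q^2)\, g^{\mu\nu} - B(q^2)\, \frac{q^\mu q^\nu}{q^2} \; , \tag{9.13}
\]
where the coefficients A(q²) and B(q²) are given by the following integrals:
\[
\begin{aligned}
  A(q^2) &\equiv -iDe^2 \int \frac{d^Dl}{(2\pi)^D} \int_0^1 dx\; \frac{\Delta(x) + \left(\frac{2}{D}-1\right) l^2}{(l^2+\Delta(x))^2} \; , \\
  B(q^2) &\equiv -iDe^2 \int \frac{d^Dl}{(2\pi)^D} \int_0^1 dx\; \frac{2\,\Delta(x)}{(l^2+\Delta(x))^2} \; .
\end{aligned} \tag{9.14}
\]
\[
  B(q^2) \underset{D=2}{=} \frac{e^2}{\pi} \; , \tag{9.15}
\]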
while the first integral is ambiguous. Indeed, the term in l² in the numerator leads
to an ultraviolet divergence, but it is multiplied by the factor 2/D − 1 that vanishes
precisely when D = 2. If we use a cutoff as ultraviolet regulator, this term would
vanish and we would have A = B/2, which would violate the conservation of the
vector current at one-loop. In dimensional regularization, in contrast, the factor
2/D − 1 compensates a pole in 1/(D − 2) that comes from evaluating the integral in D
dimensions, leaving a finite but non-zero result. In fact, in dimensional regularization
we obtain A = B, and the conservation of the vector current holds at one-loop. No
matter which regularization procedure we adopt, it must give A = B for vector current
conservation, i.e. for preserving gauge symmetry at 1-loop.
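The compensation between the factor 2/D − 1 and the pole at D = 2 can be illustrated with the standard Euclidean one-loop formulas of dimensional regularization. In the sketch below, M² denotes the mass-like term that appears after Wick rotation; this identification, the function names, and the omission of the common factor (M²)^{D/2−1}/(4π)^{D/2} are assumptions made for the illustration. Only the relative size of the two terms in the numerator of A matters here.

```python
import math

def l2_term(D):
    """(2/D - 1) times the Gamma-function factor of the Euclidean integral
    of l^2 / (l^2 + M^2)^2, namely (D/2) * Gamma(1 - D/2)."""
    return (2.0 / D - 1.0) * (D / 2.0) * math.gamma(1.0 - D / 2.0)

def delta_term(D):
    """Gamma-function factor of the Euclidean integral of
    M^2 / (l^2 + M^2)^2, namely Gamma(2 - D/2)."""
    return math.gamma(2.0 - D / 2.0)

# Approach D = 2 from below: Gamma(1 - D/2) blows up, but the product stays finite.
for eps in (1e-1, 1e-2, 1e-3):
    D = 2.0 - eps
    print(D, l2_term(D), delta_term(D))
```

The two columns agree for every D because (1 − D/2) Γ(1 − D/2) = Γ(2 − D/2), so the l² term contributes as much as the ∆ term. This is consistent with A = B in dimensional regularization, whereas dropping the l² term (as a cutoff at D = 2 would) leaves A = B/2.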
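and
\[
\begin{aligned}
  \mathrm{tr}\big(\gamma^5\gamma^\mu\slashed{A}\,\gamma^\nu\slashed{B}\big)
  &= A^\nu\,\mathrm{tr}\big(\gamma^5\gamma^\mu\slashed{B}\big) + B^\nu\,\mathrm{tr}\big(\gamma^5\gamma^\mu\slashed{A}\big) - (A\cdot B)\,\mathrm{tr}\big(\gamma^5\gamma^\mu\gamma^\nu\big) \\
  &= -D\,\epsilon^{\mu\sigma}\Big[ B_\sigma A^\nu + A_\sigma B^\nu - (A\cdot B)\, g_\sigma{}^\nu \Big] \; .
\end{aligned} \tag{9.17}
\]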
where A and B are the same coefficients as in eq. (9.13). Therefore, the divergence of
the axial current is given by
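\[
  q_\mu \tilde J_5^\mu(q) = -A(q^2)\, \epsilon^{\mu\nu}\, q_\mu \tilde A_\nu(q) \; . \tag{9.19}
\]
If we have adopted a regularization that preserves gauge symmetry, i.e. such that
A = B, this divergence is non-zero and reads
\[
  q_\mu \tilde J_5^\mu(q) = -\frac{e^2}{\pi}\, \epsilon^{\mu\nu}\, q_\mu \tilde A_\nu(q) \; , \tag{9.20}
\]
or, going back to coordinate space:
\[
  \partial_\mu J_5^\mu(x) = -\frac{e^2}{\pi}\, \epsilon^{\mu\nu}\, \partial_\mu A_\nu(x) = -\frac{e^2}{2\pi}\, \epsilon^{\mu\nu} F_{\mu\nu}(x) \; . \tag{9.21}
\]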
The non-conservation of the axial current at one loop is the unavoidable conclusion
in any regularization scheme that preserves the conservation of the vector current.
Moreover, since when this is the case A becomes equal to the ultraviolet finite
coefficient B, it does not suffer from any scheme dependence, and the above result
may thus be viewed as a scheme-free result. The result (9.21) is known as an axial
anomaly. A somewhat milder conclusion of this 2-dimensional exercise is that it is not
possible to preserve both vector and axial current conservation at one-loop. We could
in principle adopt a regularization scheme that conserves the axial current, which
requires A = 0. But the price to pay would be the loss of gauge invariance at 1-loop.
Since gauge invariance is deemed more fundamental (in particular, it ensures the
unitarity of the theory), this route is generally not considered further.
Note that ultraviolet divergences are necessary (see footnote 2) for the existence of this anomaly.
Indeed, at the classical level, the Lagrangian density is invariant under the global
transformation:
\[
  \Psi \to e^{i\theta\gamma^5}\,\Psi \; , \qquad \Psi^\dagger \to \Psi^\dagger\, e^{-i\theta\gamma^5} \; . \tag{9.22}
\]
The Feynman graphs that contribute to the expectation value of the axial current in a
background electromagnetic field have an equal number of Ψ’s and Ψ† ’s (this state-
ment is true to all orders of perturbation theory). Since the axial symmetry is global,
when we apply the above axial transformation to a graph, all the factors exp(±iθγ5 )
should naively cancel, leaving a result that does not depend on θ. This conclusion
would indeed be correct if all the integrals were finite, but may be invalidated by the
subtraction procedure necessary to obtain finite results in the presence of divergences.
In the explicit example that we have studied, the ultraviolet regularizations that are
consistent with gauge symmetry all spoil axial symmetry.
2 In a certain sense, the axial anomaly is also an infrared effect since it exists only for massless fermions
(for massive fermions, there is no axial symmetry to begin with). Moreover, as we have already seen when
discussing the Atiyah-Singer index theorem, the axial anomaly is related to the zero modes of the Dirac
operator in a background field.
Beyond one loop, a graph contributing to the expectation value of the axial
current may contain subgraphs that are ultraviolet divergent. However, since QED
is renormalizable, these sub-divergences will all have been made finite thanks to
counterterms calculated in the previous orders of the perturbative expansion. Thus, we
need only to study the intrinsic ultraviolet divergence of the graph under consideration,
an indicator of which is given by its superficial degree of divergence. For the sake of
definiteness, let us assume that the graph G has nψ fermion propagators, nγ photon
propagators, nV photon-fermion-fermion vertices, nA insertions of the external
electromagnetic field and nL loops (plus one extra vertex where the axial current is
attached). These quantities are not independent, but obey the following identities:
\[
\begin{aligned}
  2 n_\gamma &= n_V \; , \\
  2 n_\psi &= 2 + 2\,(n_V + n_A) \; , \\
  n_L &= n_\psi + n_\gamma - n_A - n_V \; .
\end{aligned} \tag{9.23}
\]
Using these relations, the superficial degree of divergence of the graph reads:
\[
  \omega(G) = 2\, n_L - n_\psi - 2\, n_\gamma = 2 - n_\psi \; .
\]
The simplest graph that contributes to the axial current, shown in the figure 9.1, has
nψ = 2 and therefore has a logarithmic ultraviolet divergence. More complicated
graphs, either with more insertions of the external field or with more than one loop,
all have nψ > 2 and are therefore convergent after all their sub-divergences have
been subtracted. This argument, although it lacks some rigor, indicates that the
axial anomaly does not receive any correction beyond the one-loop result, and that
eq. (9.21) is therefore an exact result. An alternate justification of this property is
based on the derivation of the axial anomaly from the fermionic path integral, which
gives the determinant of the Dirac operator in the background field. Indeed, as we
have seen in the section 2.5, functional determinants correspond to 1-loop diagrams.
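The power counting used in this argument can be sketched in a few lines. The code below assumes standard power counting in D = 2 (two powers of momentum per loop, one inverse power per fermion propagator, two per photon propagator); the function names are illustrative and not taken from the text.

```python
def is_consistent(n_psi, n_gamma, n_A, n_V):
    """Check the topological identities of eq. (9.23)."""
    return 2 * n_gamma == n_V and 2 * n_psi == 2 + 2 * (n_V + n_A)

def loops(n_psi, n_gamma, n_A, n_V):
    """Number of loops, third identity of eq. (9.23)."""
    return n_psi + n_gamma - n_A - n_V

def superficial_degree(n_psi, n_gamma, n_A, n_V, dim=2):
    """Assumed power counting: dim per loop, -1 per fermion line, -2 per photon line."""
    return dim * loops(n_psi, n_gamma, n_A, n_V) - n_psi - 2 * n_gamma

# (n_psi, n_gamma, n_A, n_V) for a few graphs with one axial current insertion.
graphs = {
    "one loop, one field insertion": (2, 0, 1, 0),
    "one loop, two field insertions": (3, 0, 2, 0),
    "two loops, one internal photon": (4, 1, 1, 2),
}
for name, counts in graphs.items():
    print(name, is_consistent(*counts), superficial_degree(*counts))
```

For every graph that satisfies the identities, the degree reduces to 2 − n_ψ, so only the graph with n_ψ = 2 is logarithmically divergent.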
In order to evaluate the traces of γ5 with an even number of Dirac matrices, let us
firstly recall the general formula for a trace of an even number of Dirac matrices:
\[
  \mathrm{tr}\big(\gamma^{\mu_1}\cdots\gamma^{\mu_{2n}}\big) = D \sum_{\text{pairings } P} \mathrm{sign}(P) \prod_{s\in P} g^{\mu_{s_1}\mu_{s_2}} \; , \tag{9.27}
\]
where a pairing P is a set of pairs P = {(s1 s2), (s1′ s2′), ···} made of the integers
in [1, 2n]. The signature of P, denoted sign (P), is the signature of the permutation
that reorders the sequence s1 s2 s1′ s2′ ··· into 1234···. Since the Minkowski metric
tensor gµν is diagonal, each Lorentz index carried by one of the Dirac matrices must
coincide with the Lorentz index of another matrix in order to obtain a non vanishing
result. Hence, we have
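\[
  \mathrm{tr}\,\gamma^5 = i\,\mathrm{tr}\big(\gamma^0\gamma^1\gamma^2\gamma^3\big) = 0 \; . \tag{9.28}
\]
The same is true if the γ5 is accompanied by only two ordinary Dirac matrices,
\[
  \mathrm{tr}\big(\gamma^5\gamma^\mu\gamma^\nu\big) = i\,\mathrm{tr}\big(\gamma^0\gamma^1\gamma^2\gamma^3\gamma^\mu\gamma^\nu\big) = 0 \; , \tag{9.29}
\]
and the simplest non-zero trace is $\mathrm{tr}(\gamma^5\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma)$. By the previous argument, each
of the indices µνρσ must match one of the indices 0123 hidden in γ5 = i γ0 γ1 γ2 γ3.
Therefore, µνρσ must be a permutation of 0123. Since the four Dirac matrices are all
distinct, they all anticommute, and the result is completely antisymmetric in µνρσ,
so that we have
\[
  \mathrm{tr}\big(\gamma^5\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma\big) = A\, \epsilon^{\mu\nu\rho\sigma} \; . \tag{9.30}
\]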
In order to calculate the prefactor, we just need to evaluate the trace for a particular
assignment of the indices, for instance µνρσ = 3210,
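\[
  A\, \underbrace{\epsilon^{3210}}_{+1}
  = \mathrm{tr}\big(\gamma^5\gamma^3\gamma^2\gamma^1\gamma^0\big)
  = i\,\mathrm{tr}\Big( \underbrace{\gamma^0 \underbrace{\gamma^1 \underbrace{\gamma^2 \underbrace{\gamma^3\gamma^3}_{-1} \gamma^2}_{+1} \gamma^1}_{-1} \gamma^0}_{-1} \Big)
  = -4\,i \; . \tag{9.31}
\]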
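These trace identities lend themselves to a brute-force check. The sketch below assumes the Dirac representation of the 4×4 matrices and the normalization ε^{0123} = +1, which is what ε^{3210} = +1 implies; neither the representation nor the helper names come from the text. It evaluates the trace of γ5 with four Dirac matrices for all 256 index assignments and compares it with −4i ε^{µνρσ}.

```python
from itertools import product

def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

def spatial_gamma(sigma):
    """gamma^i = [[0, sigma_i], [-sigma_i, 0]] (Dirac representation, assumed)."""
    top = [[0, 0] + list(row) for row in sigma]
    bottom = [[-x for x in row] + [0, 0] for row in sigma]
    return top + bottom

pauli = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
g = [[[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]]
g += [spatial_gamma(s) for s in pauli]

g0123 = mul(mul(g[0], g[1]), mul(g[2], g[3]))
g5 = [[1j * x for x in row] for row in g0123]   # gamma^5 = i g0 g1 g2 g3

def epsilon(*idx):
    """Levi-Civita symbol with epsilon(0, 1, 2, 3) = +1."""
    if len(set(idx)) < 4:
        return 0
    p, sign = list(idx), 1
    for i in range(4):
        while p[i] != i:          # sort by transpositions, tracking the sign
            j = p[i]
            p[i], p[j] = p[j], p[i]
            sign = -sign
    return sign

def trace5(*idx):
    """tr(gamma^5 gamma^{idx[0]} gamma^{idx[1]} ...)."""
    m = g5
    for mu in idx:
        m = mul(m, g[mu])
    return trace(m)

max_dev = max(abs(trace5(*idx) + 4j * epsilon(*idx))
              for idx in product(range(4), repeat=4))
print(trace5(3, 2, 1, 0), max_dev)
```

The same `trace5` helper can be called with zero or two indices to reproduce the vanishing traces (9.28) and (9.29).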
Order 1 in the external field : Let us now turn to the calculation of the expectation
value of the axial current in four dimensions. The simplest graph to consider is again
the graph on the right of the figure 9.1. Its contribution to axial current is
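\[
  \tilde J_5^\mu(q) = \Pi_5^{\mu\nu}(q)\,\tilde A_\nu(q) \; , \tag{9.33}
\]
with
\[
\begin{aligned}
  i\,\Pi_5^{\mu\nu}(q) &\equiv e^2 \int \frac{d^Dk}{(2\pi)^D}\; \frac{\mathrm{tr}\big(\gamma^5\gamma^\mu\,\slashed{k}\,\gamma^\nu\,(\slashed{k}+\slashed{q})\big)}{(k^2+i0^+)((k+q)^2+i0^+)} \\
  &= e^2 \int \frac{d^Dl}{(2\pi)^D} \int_0^1 dx\; \frac{\mathrm{tr}\big(\gamma^5\gamma^\mu\,(\slashed{l}-x\slashed{q})\,\gamma^\nu\,(\slashed{l}+(1-x)\slashed{q})\big)}{(l^2+\Delta(x))^2} \; ,
\end{aligned} \tag{9.34}
\]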
where we have introduced the Feynman parameterization in the second line, and the
new integration variable l ≡ k + xq. The trace that appears in the numerator is
proportional to
\[
  \epsilon^{\mu\alpha\nu\beta}\,(l - xq)_\alpha\,(l + (1-x)q)_\beta \;\propto\; \epsilon^{\mu\alpha\nu\beta}\, l_\alpha q_\beta \; , \tag{9.35}
\]
and is therefore odd in the momentum l. Therefore, the momentum integral vanishes,
and this graph does not contribute to the axial current.
Order 2 in the external field : At second order in the external field, we encounter
the graph of the figure 9.2. Its contribution to the expectation value of the axial current
reads
\[
  \tilde J_5^\mu(q) = \frac{1}{2!} \int \frac{d^4k_1\, d^4k_2}{(2\pi)^8}\; (2\pi)^4\, \delta(q+k_1+k_2)\; \Gamma_5^{\mu\nu\rho}(q,k_1,k_2)\; \tilde A_\nu(k_1)\,\tilde A_\rho(k_2) \; , \tag{9.36}
\]
The two terms correspond to the two ways of attaching the fields with momenta k1
and k2 to the external photon lines. For a reason that will become clear later, we have
taken the freedom to introduce independent shifts a and b of the integration variables
in the two terms. Such shifts would of course have no effect on convergent integrals,
since they just correspond to a linear change of variable. However, we are here in the
presence of linearly divergent integrals, and these shifts have a nontrivial interplay
with the ultraviolet regularization. Note that since {γ5, γα} = 0, we may move the
γ5 just before the matrices γν or γρ without changing the integrand, as if the axial
current were attached at the other vertices of the triangle (where the momenta k1 or
k2 enter, respectively).
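The sensitivity of a linearly divergent integral to a shift of the integration variable can be illustrated with a one-dimensional toy model, which is not the triangle integral itself. In the sketch below, the integrands and function names are illustrative assumptions: x/√(1+x²) tends to ±1 at infinity, so its integral diverges linearly, while 1/(1+x²) is integrable.

```python
import math

def shifted_difference(F, a, L):
    """Integral over [-L, L] of f(x + a) - f(x), given an antiderivative F of f."""
    return (F(L + a) - F(-L + a)) - (F(L) - F(-L))

def F_divergent(x):
    """Antiderivative of f(x) = x / sqrt(1 + x^2)."""
    return math.sqrt(1.0 + x * x)

def F_convergent(x):
    """Antiderivative of f(x) = 1 / (1 + x^2)."""
    return math.atan(x)

a = 0.7
for L in (1e2, 1e4, 1e6):
    print(L, shifted_difference(F_divergent, a, L),
          shifted_difference(F_convergent, a, L))
```

As the cutoff L grows, the shifted convergent integral returns zero, while the linearly divergent one tends to a [f(+∞) − f(−∞)] = 2a: the shift leaves behind a finite boundary term, which is the one-dimensional analogue of the surface terms controlled by a and b.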
Next, in order to test the conservation of the axial current, we contract this
amplitude with qµ , that we may rewrite as follows:
\[
\begin{aligned}
  q_\mu &= -(k_1 + k_2)_\mu \\
  &= (k + a - k_2)_\mu - (k + a + k_1)_\mu \\
  &= (k + b - k_1)_\mu - (k + b + k_2)_\mu \; .
\end{aligned} \tag{9.38}
\]
This leads to
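\[
\begin{aligned}
  q_\mu \Gamma_5^{\mu\nu\rho}(q,k_1,k_2) = 4e^3 \int \frac{d^Dk}{(2\pi)^D}\; \epsilon^{\alpha\nu\beta\rho}\,
  \bigg\{ & \frac{(k_1)_\alpha (k+a)_\beta}{((k+a)^2+i0^+)((k+a+k_1)^2+i0^+)} \\
  & + \frac{(k_2)_\alpha (k+a)_\beta}{((k+a)^2+i0^+)((k+a-k_2)^2+i0^+)} \\
  & - \frac{(k_1)_\alpha (k+b)_\beta}{((k+b)^2+i0^+)((k+b-k_1)^2+i0^+)} \\
  & - \frac{(k_2)_\alpha (k+b)_\beta}{((k+b)^2+i0^+)((k+b+k_2)^2+i0^+)} \bigg\} \; .
\end{aligned} \tag{9.39}
\]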
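and
\[
\begin{aligned}
  (k_2)_\rho \Gamma_5^{\mu\nu\rho}(q,k_1,k_2) = -4e^3 \int \frac{d^Dk}{(2\pi)^D}\; \epsilon^{\alpha\mu\beta\nu}\,
  \bigg\{ & \frac{(k+a+k_1)_\alpha (k+a-k_2)_\beta}{((k+a+k_1)^2+i0^+)((k+a-k_2)^2+i0^+)} \\
  & - \frac{(k+a+k_1)_\alpha (k+a)_\beta}{((k+a+k_1)^2+i0^+)((k+a)^2+i0^+)} \\
  & + \frac{(k+b)_\alpha (k+b-k_1)_\beta}{((k+b)^2+i0^+)((k+b-k_1)^2+i0^+)} \\
  & - \frac{(k+b+k_2)_\alpha (k+b-k_1)_\beta}{((k+b+k_2)^2+i0^+)((k+b-k_1)^2+i0^+)} \bigg\} \; .
\end{aligned} \tag{9.41}
\]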
It turns out that the choice a = b = 0 does not conserve the vector currents: their
divergences are non-vanishing. Consider for instance $(k_1)_\nu \Gamma_5^{\mu\nu\rho}$. With a = b = 0
and a regularization that preserves Lorentz invariance as well as reflection symmetry
k → −k, we have:
\[
\begin{aligned}
  (k_1)_\nu \Gamma_5^{\mu\nu\rho}(q,k_1,k_2) &= -8e^3 \int \frac{d^Dk}{(2\pi)^D}\; \epsilon^{\alpha\mu\beta\rho}\, \frac{(k+k_2)_\alpha (k-k_1)_\beta}{((k+k_2)^2+i0^+)((k-k_1)^2+i0^+)} \\
  &\propto \epsilon^{\alpha\mu\beta\rho}\,(k_2)_\alpha (k_1)_\beta \neq 0 \; .
\end{aligned} \tag{9.42}
\]
A systematic search indicates that the only choice of a and b that gives a null result
for both eqs. (9.40) and (9.41) is
a = −b = k2 − k1 . (9.43)
Since the conservation of the vector current is necessary in order to preserve gauge
symmetry, and since the latter is a requirement for unitarity, we must adopt this choice.
Returning to eq. (9.39) for the axial current with these values of a and b, we obtain:
\[
  q_\mu \Gamma_5^{\mu\nu\rho}(q,k_1,k_2) = 16e^3 \int \frac{d^Dk}{(2\pi)^D}\; \epsilon^{\alpha\nu\beta\rho}\, \frac{(k_1)_\alpha (k+k_2-k_1)_\beta}{\big((k+k_2)^2+i0^+\big)\big((k+k_2-k_1)^2+i0^+\big)} \; . \tag{9.44}
\]
Let us define
\[
  F^{\nu\rho}(k) \equiv \epsilon^{\alpha\nu\beta\rho}\, \frac{(k_1)_\alpha (k-k_1)_\beta}{\big(k^2+i0^+\big)\big((k-k_1)^2+i0^+\big)} \; , \tag{9.45}
\]
and note that
\[
  \int \frac{d^Dk}{(2\pi)^D}\; F^{\nu\rho}(k) = 0 \; . \tag{9.46}
\]
(because with a Lorentz invariant regularization the result can only depend on the
vector k1, which would unavoidably give zero when contracted with the two free
slots of the $\epsilon^{\alpha\nu\beta\rho}$.) Therefore, we can write
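\[
\begin{aligned}
  q_\mu \Gamma_5^{\mu\nu\rho}(q,k_1,k_2) &= 16e^3 \int \frac{d^Dk}{(2\pi)^D}\, \Big[ F^{\nu\rho}(k+k_2) - F^{\nu\rho}(k) \Big] \\
  &= 16e^3 \int \frac{d^Dk}{(2\pi)^D}\, \Big[ k_2^\sigma\, \frac{\partial F^{\nu\rho}(k)}{\partial k^\sigma} + \frac{k_2^\sigma k_2^\tau}{2}\, \frac{\partial^2 F^{\nu\rho}(k)}{\partial k^\sigma \partial k^\tau} + \cdots \Big] \; .
\end{aligned} \tag{9.47}
\]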
Since the integrand now contains only derivatives, we can use Stokes’s theorem
in order to rewrite the divergence of the axial current as a surface integral on the
boundary at infinity of momentum space. If we view this boundary as the limit
$k_* \to \infty$ of a sphere of radius $k_*$, the “area” of this boundary grows like $k_*^3$ in
D = 4. On the other hand, the function $F^{\nu\rho}(k)$ behaves as $k^{-3}$, and each subsequent
derivative decreases faster by one additional power of $k^{-1}$. Therefore, the result is
given in full by the first term of the expansion:
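\[
\begin{aligned}
  q_\mu \Gamma_5^{\mu\nu\rho}(q,k_1,k_2)
  &= 16e^3 \int \frac{d^Dk}{(2\pi)^D}\; k_2^\sigma\, \frac{\partial F^{\nu\rho}(k)}{\partial k^\sigma} \\
  &= \frac{16ie^3}{(2\pi)^4}\, \epsilon^{\alpha\nu\beta\rho}\,(k_1)_\alpha (k_2)^\sigma\,
     \underbrace{\lim_{k_*\to\infty} \int_{S_3(k_*)} d^3S\; \frac{k_\sigma}{k}\,\frac{k_\beta}{k^4}}_{\frac{\pi^2}{2}\, g_{\sigma\beta}} \\
  &= -i\,\frac{e^3}{2\pi^2}\, \epsilon^{\nu\rho\alpha\beta}\,(k_1)_\alpha (k_2)_\beta \; ,
\end{aligned} \tag{9.48}
\]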
In the second line, S3 (k∗ ) is the 3-sphere of radius k∗ (i.e. the boundary of a 4-ball of
radius k∗ ), kσ /k is the unit vector normal to the sphere, and the factor i arises when
going to Euclidean momentum space. Note that we have anticipated the limit k → ∞
in order to simplify the function Fνρ (k). Therefore, the contribution of the triangle
graph to the divergence of the axial current reads
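\[
  q_\mu \tilde J_5^\mu(q) = -i\,\frac{e^3}{4\pi^2}\, \epsilon^{\nu\rho\alpha\beta} \int \frac{d^4k_1\, d^4k_2}{(2\pi)^4}\; \delta(q+k_1+k_2)\, (k_1)_\alpha (k_2)_\beta\, \tilde A_\nu(k_1)\,\tilde A_\rho(k_2) \; , \tag{9.49}
\]
or in coordinate space
\[
  \partial_\mu J_5^\mu(x) = -\frac{e^3}{16\pi^2}\, \epsilon^{\alpha\nu\beta\rho}\, F_{\alpha\nu}(x)\, F_{\beta\rho}(x) \; . \tag{9.50}
\]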
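The chain of numerical prefactors from eq. (9.48) to eq. (9.50) can be retraced with a short script. This is a bookkeeping sketch only: signs and factors of i are ignored, the variable names are illustrative, and the test matrix `dA` is an arbitrary stand-in for the derivatives ∂_α A_ν at one point. It uses the area 2π² of the unit 3-sphere, the angular average of n_σ n_β (one quarter of the identity in four dimensions), the 1/2! of eq. (9.36), and the identity ε F F = 4 ε ∂A ∂A.

```python
import math
from itertools import permutations

def sphere_area(n):
    """Area of the unit sphere in n Euclidean dimensions: 2 pi^(n/2) / Gamma(n/2)."""
    return 2.0 * math.pi ** (n / 2.0) / math.gamma(n / 2.0)

surface_factor = sphere_area(4) / 4.0                      # coefficient of g_{sigma beta}
coeff_948 = 16.0 / (2.0 * math.pi) ** 4 * surface_factor   # |coefficient| in eq. (9.48)
coeff_949 = coeff_948 / 2.0                                # includes the 1/2! of eq. (9.36)
coeff_950 = coeff_949 / 4.0                                # eps F F = 4 eps dA dA

def perm_sign(p):
    p, sign = list(p), 1
    for i in range(len(p)):
        while p[i] != i:          # sort by transpositions, tracking the sign
            j = p[i]
            p[i], p[j] = p[j], p[i]
            sign = -sign
    return sign

def eps_contract(X, Y):
    """epsilon^{a n b r} X[a][n] Y[b][r], summed over all indices."""
    return sum(perm_sign(p) * X[p[0]][p[1]] * Y[p[2]][p[3]]
               for p in permutations(range(4)))

dA = [[1, 2, 0, 3], [0, 1, 4, 1], [2, 0, 1, 5], [1, 3, 0, 2]]   # arbitrary test values
F = [[dA[a][n] - dA[n][a] for n in range(4)] for a in range(4)]

print(surface_factor, coeff_948, coeff_949, coeff_950)
print(eps_contract(F, F), 4 * eps_contract(dA, dA))
```

The three coefficients come out as 1/(2π²), 1/(4π²) and 1/(16π²), matching the magnitudes quoted in eqs. (9.48), (9.49) and (9.50).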
This is the main result of this section, namely the existence of an anomalous divergence
of the axial current in the presence of a background electromagnetic field. In the course
of the calculation, we have seen that depending on the labeling of the integration
momentum, we can make the anomaly appear in any of the three external currents.
In the situation considered here, with one axial current corresponding to a global
symmetry, and two vector currents stemming from a local gauge symmetry, we must
enforce the conservation of the vector currents and therefore assign in full the anomaly
to the axial one. But the same calculation would arise in the context of a chiral gauge
theory (where the left and right handed fermions belong to different representations of
the gauge group). In this case, the natural choice would be to regularize the triangle
so that the symmetry among the three currents is preserved, and the anomaly would
then be equally shared by the three currents.
Corrections : Let us now discuss potential corrections to the result (9.50). Firstly,
we should examine one-loop graphs with more than two photons in addition to the
insertion of the axial current. A simple dimensional argument can exclude that such
graphs contribute to the divergence of the axial current. Indeed, $\partial_\mu J_5^\mu$ has mass
dimension 4. In an abelian gauge theory, each external photon must appear in the
right hand side in the form of the field strength Fµν , that has mass dimension 2. A
term with n photons would thus have mass dimension 2n, and require a prefactor
of mass dimension 4 − 2n to be a valid contribution to the divergence of the axial
current. But since the fermions we are considering are massless and the coupling
constant is dimensionless in four dimensions, there is no dimensionful parameter in
the theory for making up such a prefactor.
Let us now consider higher loop corrections. From the calculation that led to
eq. (9.50), the anomaly results from the integration over the momentum that runs in
the fermion loop, provided that the integrand has mass dimension 4 or higher. Note
that some of the higher order corrections just renormalize the objects that appear in
the right hand side of eq. (9.50), such as the photon field strength and the coupling
constant, without changing the structure of the anomaly (including the numerical
prefactor). Quite generally, however, adding an internal photon line requires adding
more fermion propagators in the main loop, which reduces its degree of ultraviolet
divergence. Of course, the integration over the momentum of this internal photon
may itself be ultraviolet divergent, but it can be regularized in a way that does not
interfere with axial symmetry and thus does not contribute to the anomaly.
9.2 Generalizations
9.2.1 Axial anomaly in a non-abelian background
In the previous section, we have discussed axial anomalies in an abelian gauge theory.
However, a similar anomaly arises in the presence of a non-abelian background gauge
field. Let us assume that the fermions are in a representation of the gauge algebra
where the generators are ta . The calculation of the triangle graph proceeds almost
in the same way as in the abelian case, except for the Lie algebra generators, and
eq. (9.50) becomes
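\[
  \partial_\mu J_5^\mu(x) = -\frac{e^3}{4\pi^2}\, \mathrm{tr}\big(t^a t^b\big)\, \epsilon^{\alpha\nu\beta\rho}\, \partial_\alpha A^a_\nu(x)\, \partial_\beta A^b_\rho(x) \; . \tag{9.51}
\]
This is not gauge invariant, but it is easy to guess what should be the right hand side
to restore gauge invariance:
\[
  \partial_\mu J_5^\mu(x) = -\frac{e^3}{16\pi^2}\, \mathrm{tr}\big(t^a t^b\big)\, \epsilon^{\alpha\nu\beta\rho}\, F^a_{\alpha\nu}(x)\, F^b_{\beta\rho}(x) \; . \tag{9.52}
\]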
The same dimensional argument that we have used in the abelian case also applies
here: there cannot be contributions to the anomaly of degree higher than two in
the field strength. Note that when expanded in terms of the gauge potential $A^a_\mu$,
eq. (9.52) contains terms of degree 3 and 4, that exist only in a non-abelian background.
Diagrammatically, they correspond to contributions coming from the following two
diagrams:
(But the direct extraction of the anomaly contained in these graphs would be very
cumbersome, due to the numerous terms arising from permutations of the external
gauge fields.)
spinor is (∂µ − Γµ ) Ψ and the generally covariant Dirac equation for a massless
fermion reads
\[
  i\,\gamma^\mu \big(\partial_\mu - \Gamma_\mu\big)\, \Psi = 0 \; . \tag{9.55}
\]
In order to construct a Lagrangian that transforms as a scalar, we need a matrix Γ such
that ψ† Γψ is a real scalar. This is the case if the following conditions are satisfied
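\[
\begin{aligned}
  & \Gamma = \Gamma^\dagger \; , \\
  & \Gamma\,\gamma^\mu = \gamma^{\mu\dagger}\,\Gamma \; , \\
  & \nabla_\mu \Gamma = \partial_\mu \Gamma + \Gamma_\mu^\dagger\,\Gamma + \Gamma\,\Gamma_\mu \; .
\end{aligned} \tag{9.56}
\]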
(g is the determinant of the metric tensor.) The vector current and its conservation
law generalize into
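\[
  J^\mu \equiv \overline{\Psi}\gamma^\mu\Psi \; , \qquad \nabla_\mu J^\mu = 0 \; . \tag{9.58}
\]
\[
  J_5^\mu \equiv \overline{\Psi}\gamma^5\gamma^\mu\Psi \; , \qquad \nabla_\mu J_5^\mu = 0 \; . \tag{9.59}
\]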
where in this section we use the notation $\eta_{\alpha\beta}$ for the Minkowski metric tensor. This
is equivalent to introducing at each point x a local Minkowski frame with coordinates
$y^\alpha$. Note that $e^\alpha{}_\mu$ transforms as a vector under diffeomorphisms (a coordinate vector)
with respect to the index µ, and as an ordinary 4-vector under Lorentz transformations
(called a tetrad vector in this context) with respect to the index α. The indices
α, β, ··· are raised and lowered with the Minkowski metric tensor, while the indices
µ, ν, ··· are raised and lowered with the curved space metric $g_{\mu\nu}(x)$. Since in the
right hand side of eq. (9.60) the indices α and β are contracted with the Lorentz
tensor $\eta_{\alpha\beta}$, the result is a scalar under Lorentz transformations, but a rank-2 tensor
under diffeomorphisms. The Dirac matrices in curved spacetime ($\gamma^\mu(x)$) can then be
related to those in flat spacetime ($\gamma^\alpha$) by
and a spin connection $\Gamma_\mu$ that satisfies eq. (9.54) (and reduces to zero in flat spacetime)
is given by
\[
  \Gamma_\mu(x) = -\frac{1}{4}\, \gamma_\alpha \gamma_\beta\; e^{\alpha\rho}(x)\, \nabla_\mu e^\beta{}_\rho(x) \; , \tag{9.62}
\]
with $\nabla_\mu e^\beta{}_\rho = \partial_\mu e^\beta{}_\rho - \Gamma^\nu_{\mu\rho}\, e^\beta{}_\nu$ (since $e^\beta{}_\rho$ is a coordinate vector with respect to the
index ρ). A matrix Γ that fulfills eqs. (9.56) is the flat spacetime γ0, and the matrix
γ5 is still given in terms of the flat spacetime Dirac matrices by γ5 = i γ0 γ1 γ2 γ3.
3 In this section, we denote $\eta_{\alpha\beta} \equiv \mathrm{diag}\,(1, -1, -1, -1)$ the flat spacetime Minkowski metric, in order
to distinguish it from $g_{\mu\nu}$.
fermions and gauge fields involve the left or right projectors $P_{R,L} \equiv (1 \pm \gamma^5)/2$, and
the generators of the Lie algebra that appear in these vertices are $t^a_{R,L}$, respectively
(the left and right generators would be equal in a theory where the two fermion
chiralities belong to the same representation).
The triangle diagram that gave the axial anomaly in four dimensions is replaced
by a graph with three external gauge bosons, with chiral couplings to the fermion
loop. When the fermion in the loop is massless, the left and right chiralities do not
mix, and the multiple occurrences of the projectors simplify into a single one, thanks
to
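\[
\begin{aligned}
  & P_R P_L = 0 \; , \qquad P_L^2 = P_L \; , \qquad P_R^2 = P_R \; , \\
  & \big[ P_{R,L}\, ,\, \gamma^\mu\gamma^\nu \big] = 0 \; , \\
  & \mathrm{tr}\big( P_R \gamma^\mu \slashed{p}_1 P_R \gamma^\nu \slashed{p}_2 P_R \gamma^\rho \slashed{p}_3 \big) = \mathrm{tr}\big( P_R \gamma^\mu \slashed{p}_1 \gamma^\nu \slashed{p}_2 \gamma^\rho \slashed{p}_3 \big) \; , \\
  & \mathrm{tr}\big( P_L \gamma^\mu \slashed{p}_1 P_L \gamma^\nu \slashed{p}_2 P_L \gamma^\rho \slashed{p}_3 \big) = \mathrm{tr}\big( P_L \gamma^\mu \slashed{p}_1 \gamma^\nu \slashed{p}_2 \gamma^\rho \slashed{p}_3 \big) \; .
\end{aligned} \tag{9.64}
\]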
The γ5 contained in the projectors PR,L may lead to an anomaly, with a relative sign
between the right and left chiralities. The calculation is almost identical to the case
of a global axial symmetry, except that now we should choose the shifts a and b so
that the resulting 3-point function is symmetric in the external fields, since they play
identical roles. But this choice does not eliminate the anomaly; it just distributes
it evenly among the three external currents, leading to an anomaly proportional to
$\mathrm{tr}\,(t^a \{t^b, t^c\})$. When there are both right and left fermions in the loop, the anomaly
is proportional to
\[
  d^{abc} \equiv \mathrm{tr}\big( t^a_R \{ t^b_R , t^c_R \} \big) - \mathrm{tr}\big( t^a_L \{ t^b_L , t^c_L \} \big) \; . \tag{9.65}
\]
Obviously, this is zero in a vector theory, where the right and left fermions couple in
the same way to the gauge bosons.
Table 9.1: Weak isospin, hypercharge and electrical charge of the fermions of the
Standard Model.
Y of the left and right handed fermions are different. After spontaneous symmetry
breaking via the Higgs mechanism, the fields $B^3_\mu$ (third component of SU(2)) and
$A_\mu$ (U(1)) mix to give the Z boson and the photon fields. The electrical charges of
the fermions are then given by $Q = T_3 + Y/2$ (since the electrical charges are the same
for left and right fermions, the resulting U(1)em of electromagnetism is a non-chiral
gauge interaction).
The simplest case of anomaly cancellation is the 3-gluon triangle, which is not
anomalous because the strong interaction vertex is a vector coupling:
[Diagram: triangle with three su(3) gluons, colour indices a, b, c.]
R − L cancellation (see eq. (9.65)).
For the triangle involving three SU(2) bosons, the anomaly cancels thanks to a
peculiar identity obeyed by the su(2) generators:
[Diagram: triangle with three su(2) bosons, indices i, j, k.]
\[
  \mathrm{tr}_{su(2)}\big( t^i \{ t^j , t^k \} \big) = 0 \; .
\]
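The su(2) identity can be verified directly in the fundamental representation. The sketch below assumes the generators t^i = σ^i/2 built from the Pauli matrices; the helper names are illustrative.

```python
def mul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

pauli = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
t = [[[x / 2 for x in row] for row in s] for s in pauli]   # t^i = sigma^i / 2 (assumed)

def d_symbol(i, j, k):
    """tr(t^i {t^j, t^k}) in the fundamental representation of su(2)."""
    anticommutator = add(mul(t[j], t[k]), mul(t[k], t[j]))
    return trace(mul(t[i], anticommutator))

max_abs = max(abs(d_symbol(i, j, k))
              for i in range(3) for j in range(3) for k in range(3))
print(max_abs)
```

The result vanishes for all 27 index combinations because the anticommutator of two Pauli matrices is proportional to the identity, and the generators themselves are traceless.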
In triangles that have a single SU(3) or a single SU(2) boson, the anomaly cancels
because the corresponding generators are traceless:
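[Diagrams: triangles with a single su(3) boson (index a) and either two su(2) bosons
(indices i, j) or two u(1) bosons.]
\[
  \mathrm{tr}_{su(3)}\big( t^a \big) = 0 \; ,
\]
[Diagrams: triangles with a single su(2) boson (index i) and either two su(3) bosons
(indices a, b) or two u(1) bosons.]
\[
  \mathrm{tr}_{su(2)}\big( t^i \big) = 0 \; .
\]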
In triangles with a single U(1) boson and a pair of SU(2) or SU(3) bosons, the
anomaly cancels thanks to the specific linear combination of weak hypercharges one
gets by summing over all the allowed fermions in the loop:
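[Diagram: triangle with one u(1) boson and two su(3) bosons, indices a, b.]
\[
  \sum_{\text{quarks}} y = 2\left(-\tfrac{1}{3}\right) + \tfrac{4}{3} + \left(-\tfrac{2}{3}\right) = 0 \; ,
\]
[Diagram: triangle with one u(1) boson and two su(2) bosons, indices i, j.]
\[
  \sum_{\text{left handed fermions}} y = 3\left(-\tfrac{1}{3}\right) + 1 = 0 \; .
\]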
(In the first of these cancellations, there is a factor of 2 in the first term to account for
the fact that the left handed quarks form SU(2) doublets, and in the second equality
the first term has a factor 3 because the quarks can have three colours.) Note also that
loops with left handed fermions should be counted with a minus sign, according to
eq. (9.65). Finally, the triangle with three U(1) bosons has no anomaly, thanks to the
fact that the sum of the cubes of the weak hypercharges over all fermions is zero:
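[Diagram: triangle with three u(1) bosons.]
\[
  \sum y^3 = 6\left(-\tfrac{1}{3}\right)^3 + 3\left(\tfrac{4}{3}\right)^3 + 3\left(-\tfrac{2}{3}\right)^3 + 2 + (-2)^3 = 0 \; .
\]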
(Again, the numerical prefactors count the number of SU(3) and SU(2) states for
each fermion.) Interestingly, gravitational anomalies also cancel in the standard
model. Indeed, an anomaly may potentially exist in the triangle with a U(1) boson
and two gravitons. But this anomaly would be proportional to the sum of the weak
hypercharges of all fermions, which turns out to be zero:
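[Diagram: triangle with one u(1) boson and two gravitons G.]
\[
  \sum y = 6\left(-\tfrac{1}{3}\right) + 3\left(\tfrac{4}{3}\right) + 3\left(-\tfrac{2}{3}\right) + 2 + (-2) = 0 \; .
\]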
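The hypercharge sums quoted above can be reproduced with exact rational arithmetic. The sketch below assumes the hypercharge assignments for one generation that are implied by these sums and by Q = T3 + Y/2 (the table itself is not reproduced in this passage), together with the convention that left handed loops are counted with a minus sign; the field labels and function names are illustrative.

```python
from fractions import Fraction as Fr

# (label, hypercharge y, chirality, number of colours, SU(2) multiplicity); assumed values
FERMIONS = [
    ("Q_L", Fr(1, 3), "L", 3, 2),
    ("u_R", Fr(4, 3), "R", 3, 1),
    ("d_R", Fr(-2, 3), "R", 3, 1),
    ("L_L", Fr(-1), "L", 1, 2),
    ("e_R", Fr(-2), "R", 1, 1),
]

def signed(y, chirality, power=1):
    """Right handed loops count with +, left handed ones with -, cf. eq. (9.65)."""
    return y ** power if chirality == "R" else -(y ** power)

# u(1)-su(3)-su(3): sum over quarks, weighted by the SU(2) multiplicity.
sum_su3 = sum(signed(y, c) * n2 for _, y, c, n3, n2 in FERMIONS if n3 == 3)

# u(1)-su(2)-su(2): sum over left handed doublets, weighted by the number of colours.
sum_su2 = sum(signed(y, c) * n3 for _, y, c, n3, n2 in FERMIONS if n2 == 2)

# u(1)^3: sum of cubes over all states.
sum_cubes = sum(signed(y, c, 3) * n3 * n2 for _, y, c, n3, n2 in FERMIONS)

# u(1)-graviton-graviton: plain sum over all states.
sum_grav = sum(signed(y, c) * n3 * n2 for _, y, c, n3, n2 in FERMIONS)

print(sum_su3, sum_su2, sum_cubes, sum_grav)
```

All four sums vanish exactly. Changing any single hypercharge in the list breaks at least one of them, which illustrates how tightly these conditions constrain the assignments.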
One can see the crucial role played by the weak hypercharges assigned to the various
fermions of the Standard Model in these cancellations. Conversely, one may try to
In the subsection 9.2.1, where we have derived the axial anomaly in a non-abelian
background field, we first obtained a partial answer with only the terms quadratic
in the external field, and then we used gauge symmetry in order to reconstruct the
missing terms (of order 3 and 4 in the external field). However, how to promote
such a partial result into the full expression of the anomaly is not always so obvious,
for instance in the case of chiral gauge theories where the gauge symmetry itself
is anomalous (in this case, we cannot invoke gauge invariance to restore the full
answer). The Wess-Zumino consistency conditions are a set of equations satisfied by
the anomaly function, that are powerful enough to allow reconstructing the anomaly
from the knowledge of its lowest order in the gauge fields.
Even in the case where the anomalous symmetry is global, it is convenient to
couple a (fictitious in that case) gauge field Aµ to the corresponding current Jµ whose
conservation is violated by the anomaly. By doing this, we promote the symmetry
to a local gauge invariance (violated by the anomaly), and we may return to a global
symmetry by letting the gauge coupling go to zero. Let us denote Γ [A] the effective
action for the gauge field (i.e. the effective action in which the fermions are included
only in the form of loop corrections). In the absence of anomaly, Γ [A] would be
invariant under gauge transformations of the field Aµ ,
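\[
\begin{aligned}
  0 \underset{\text{no anomaly}}{=} \delta_\theta \Gamma[A]
  &= \int d^4x\; \big(D^{\rm adj}_\mu\big)_{ab}\, \theta_b(x)\; \frac{\delta \Gamma[A]}{\delta A^a_\mu(x)} \\
  &= -\int d^4x\; \theta_b(x)\, \underbrace{\big(D^{\rm adj}_\mu\big)_{ba}\, \frac{\delta}{\delta A^a_\mu(x)}}_{i\, T_b(x)}\; \Gamma[A] \; .
\end{aligned} \tag{9.66}
\]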
where the function Ga [x; A] encodes the anomaly. This function is closely related
to the non-zero right hand side of the anomalous conservation law for the current
associated to the symmetry, since the effective action and the current are related by
\[
  J^\mu_a(x) + \frac{\delta \Gamma[A]}{\delta A^a_\mu(x)} = 0 \; , \tag{9.68}
\]
which implies
\[
  \big(D^{\rm adj}_\mu\big)_{ba}\, J^\mu_a(x) = -G_b[x; A] \; . \tag{9.69}
\]
Since the anomaly is local, Gb [x; A] should be a local (at the point x) polynomial in
the gauge field and its derivatives. One may then check that the operators Ta (x) obey
the following commutation relation,
\[
  \big[ T_a(x) , T_b(y) \big] = i\, g\, f^{abc}\, \delta(x-y)\, T_c(x) \; , \tag{9.70}
\]
where the fabc are the structure constants of the gauge group. From this, we deduce
the following identity
called the Wess-Zumino consistency conditions. Since this identity is linear in the
anomaly function Ga , it cannot constrain its overall normalization (for this, it is
usually necessary to compute the triangle diagram). However, this equation is strong
enough to fully constrain its dependence on the gauge field from the term of lowest
order in A.
The consistency condition can be recast into a more convenient form that involves
BRST symmetry. Let us introduce a ghost field $\chi_a$, and recall that the BRST transfor-
mation reads:
\[
  Q_{\rm BRST}\, A^a_\mu(x) = \big(D^{\rm adj}_\mu\big)_{ab}\, \chi_b(x) \; , \qquad
  Q_{\rm BRST}\, \chi_a(x) = -\frac{g}{2}\, f^{abc}\, \chi_b(x)\, \chi_c(x) \; . \tag{9.72}
\]
Then, let us encapsulate the anomaly function into the following local functional of
ghost number +1:
\[
  G[A,\chi] \equiv \int d^4x\; \chi_a(x)\, G_a[x; A] \; . \tag{9.73}
\]
We obtain:
\[
\begin{aligned}
  Q_{\rm BRST}\, G[A,\chi] &= i \int d^4x\, d^4y\; \chi_a(x)\chi_b(y)\, T_b(y)\, G_a[x;A]
     - \frac{g}{2} \int d^4x\; f^{abc}\, \chi_a(x)\chi_b(x)\, G_c[x;A] \\
  &= \frac{i}{2} \int d^4x\, d^4y\; \chi_a(x)\chi_b(y)\,
     \underbrace{\Big\{ T_b(y)\, G_a[x;A] - T_a(x)\, G_b[y;A] + i\, g\, \delta(x-y)\, f^{abc}\, G_c[x;A] \Big\}}_{=0} \; .
\end{aligned} \tag{9.74}
\]
where h[A] does not depend on the ghost field (indeed, QBRST increases the ghost
number by one unit, and G[A, χ] must have ghost number unity). But since h[A] is
a local functional of the gauge field, it may be subtracted from the action to cancel
the anomaly. Thus, genuine anomalies are given by local functionals G[A, χ] of ghost
number +1 that satisfy the consistency condition (9.75), modulo a term obtained
by acting with QBRST on a functional of A only. Note that if we write G[A, χ] as the
integral of a local density,
\[
  G[A,\chi] \equiv \int d^4x\; \mathcal{G}(x) \; , \tag{9.77}
\]
In order to determine how the Wess-Zumino equation constrains G(x), the language of
differential forms introduced in the section 4.5.3 is very handy, as a way to encapsulate
both Lorentz and group indices in compact objects. The 1-forms dxµ anticommute
among themselves under the exterior product ∧. In addition, they also anticommute
with the ghost field and the BRST generator QBRST . The volume element weighted by
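the fully antisymmetric tensor $\epsilon^{\mu\nu\rho\sigma}$ can therefore be written as
\[
  d^4x\; \epsilon^{\mu\nu\rho\sigma} = dx^\mu \wedge dx^\nu \wedge dx^\rho \wedge dx^\sigma \; . \tag{9.79}
\]
Then, given a vector $V_\mu$ and the corresponding 1-form
\[
  V \equiv V_\mu\, dx^\mu \; , \tag{9.80}
\]
we may write in a compact manner
\[
  \int d^4x\; \epsilon^{\mu\nu\rho\sigma}\, V_\mu V_\nu V_\rho V_\sigma = \int V \wedge V \wedge V \wedge V \; . \tag{9.81}
\]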
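\[
\begin{aligned}
  Q_{\rm BRST}\, A &= -d\chi + A \wedge \chi + \chi \wedge A \; , \\
  Q_{\rm BRST}\, \chi &= \chi \wedge \chi \; .
\end{aligned} \tag{9.84}
\]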
On dimensional grounds, the anomaly function G[A, χ] may contain the following
terms:
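\[
\begin{aligned}
  G[A,\chi] = -iC \int d^4x\; \epsilon^{\mu\nu\rho\sigma}\, \chi_a\; \mathrm{tr}\Big( t^a \Big[ & (\partial_\mu A_\nu)(\partial_\rho A_\sigma) \\
  & + i a_1\, (\partial_\mu A_\nu) A_\rho A_\sigma + i a_2\, A_\mu (\partial_\nu A_\rho) A_\sigma + i a_3\, A_\mu A_\nu (\partial_\rho A_\sigma) \\
  & - b\, A_\mu A_\nu A_\rho A_\sigma \Big] \Big) \; .
\end{aligned} \tag{9.85}
\]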
The term on the first line comes from the triangle diagram, whose explicit calculation
gives the overall coefficient C. The terms of the second and third lines come from the
square and pentagon diagrams, respectively. Alternatively, they can be obtained from
the consistency conditions. Firstly, the previous equation may be rewritten as a sum
of forms:
Z
G[A, χ] = γ tr χ ∧ (dA) ∧ (dA))
where γ, α1,2,3 , β are constants related to C, a1,2,3 , b. Consider first the BRST
transform of the last term,
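\[
\begin{aligned}
  Q_{\rm BRST}\, \mathrm{tr}\big( \chi \wedge A \wedge A \wedge A \wedge A \big) = {} & \mathrm{tr}\big( \chi \wedge \chi \wedge A \wedge A \wedge A \wedge A \big) \\
  & + \text{terms in } \chi \wedge (d\chi) \wedge A \wedge A \wedge A \; .
\end{aligned} \tag{9.87}
\]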
By evaluating similarly the BRST transforms of the other terms, one can check that
when α1 = −α2 = α3 = −1/2 the BRST transform of the anomaly functional is the
integral of an exact form and therefore vanishes:
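\[
  Q_{\rm BRST}\, G[A,\chi] = \gamma \int_{\mathbb{R}^4} dF = \gamma \int_{\partial\mathbb{R}^4} F = 0 \; . \tag{9.89}
\]
This is in fact the only possibility. Introducing the field strength 2-form,
\[
  F \equiv dA - A \wedge A = \frac{ig}{2}\, t^a F^a_{\mu\nu}\, dx^\mu\, dx^\nu \; , \tag{9.90}
\]
the anomaly functional for these values of the coefficients can then be rewritten as
\[
  G[A,\chi] = \gamma \int \mathrm{tr}\Big( \chi \wedge d\Big[ A \wedge F + \frac{1}{2}\, A \wedge A \wedge A \Big] \Big) \; . \tag{9.91}
\]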
Therefore, except for the prefactor γ whose determination requires calculating the
triangle diagram, the consistency relations completely determine the dependence of
the anomaly function on the gauge field.
As shown by ’t Hooft, one way this may happen is to have in the underlying
fundamental theory a global chiral symmetry with generators T a , such that the
anomaly function tr (T a {T b , T c }) is non-zero. In the low energy sector of the spectrum
of this theory, there must be spin 1/2 massless bound states, on which this chiral
symmetry acts with generators $\mathbb{T}^a$, and whose anomaly coefficients are identical to
the high energy ones:
$$\mathrm{tr}\,\big( \mathbb{T}^a \{ \mathbb{T}^b , \mathbb{T}^c \} \big) = \mathrm{tr}\,\big( T^a \{ T^b , T^c \} \big) . \qquad (9.92)$$
The proof of this assertion goes as follows. Let us first couple a fictitious weakly
coupled gauge boson to the generators T a . We also introduce additional fictitious
massless fermions coupled only to the fictitious gauge boson, but not to the strongly
interacting gauge bosons responsible for the confinement, tuned so that their contribu-
tion exactly cancels the anomaly:
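$$\Big[ \mathrm{tr}\,\big( T^a \{ T^b , T^c \} \big) \Big]_{\text{physical high energy fermions}} + \Big[ \mathrm{tr}\,\big( T^a \{ T^b , T^c \} \big) \Big]_{\text{fictitious fermions}} = 0 . \qquad (9.93)$$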
Let us now examine the low energy part of the spectrum of this theory, i.e. at energies
much lower than the strong scale Λ. Since they are not coupled in any way to the
strong sector, this low energy spectrum contains the fictitious gauge bosons and
massless fermions, unmodified compared to what we have introduced at high energy.
In addition, this spectrum contains the bound states made of the trapped fermions and
strongly interacting gauge bosons. For consistency, this low energy description must
also be anomaly-free, which means that the bound states must transform under the
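chiral symmetry with generators $\mathbb{T}^a$, such that
$$\Big[ \mathrm{tr}\,\big( \mathbb{T}^a \{ \mathbb{T}^b , \mathbb{T}^c \} \big) \Big]_{\text{physical bound states}} + \Big[ \mathrm{tr}\,\big( T^a \{ T^b , T^c \} \big) \Big]_{\text{fictitious fermions}} = 0 . \qquad (9.94)$$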
The crucial point in this argument is that the contribution of the fictitious fermions is
the same in the equations (9.93) and (9.94), because these fermions are not coupled
to the strongly interacting sector. Eqs. (9.93) and (9.94) immediately give (9.92). In
other words, the anomalies of the trapped elementary fermions must be mimicked by
those of the massless spin 1/2 bound states they are confined into.
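$$S[\phi] \equiv \int d^4x\, \Big[ \frac{g_{\mu\nu}}{2}\, \big( \partial^\mu_x \phi(x) \big) \big( \partial^\nu_x \phi(x) \big) - \frac{\lambda}{4!}\, \phi^4(x) \Big] . \qquad (9.95)$$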
A scaling transformation amounts to multiplying all length scales by some factor
$$x^\mu \to y^\mu \equiv e^{\vartheta} x^\mu . \qquad (9.96)$$
In this transformation, a field φ(x) of dimension $(\text{mass})^{d_\phi}$ and its derivative
transform as:
$$d^4x \to d^4y = e^{4\vartheta}\, d^4x . \qquad (9.98)$$
where we have used the fact that the mass dimension of φ is dφ = 1 in four spacetime
dimensions. The action defined in eq. (9.95) is thus invariant under scale transfor-
mations. The same conclusion holds for any classical action that does not contain
any dimensionful parameter, provided the appropriate dimension dφ is used for each
field. This is for instance the case of pure Yang-Mills theory in four dimensions, or
quantum chromodynamics in which we neglect the quark masses.
Since the transformation (9.97) is continuous, Noether’s theorem implies that there is
a corresponding conserved current. On the one hand, the infinitesimal variation of the
field is
On the other hand, the scale transformation (9.96) directly applied to the integrand of
the action gives a variation
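$$\begin{aligned}
\delta \big( d^4x\, \mathcal{L}(x) \big) &= -\vartheta\, d^4x\, \big[ 4 + x^\mu \partial_\mu \big] \mathcal{L}(x) + O(\vartheta^2) \\
&= -\vartheta\, d^4x\, \partial_\mu \big( x^\mu \mathcal{L}(x) \big) + O(\vartheta^2) . \qquad (9.101)
\end{aligned}$$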
It is important to include the measure in this calculation, since it is not invariant under
scale transformations. The variation of the measure gives the 4 in the first line, which
is crucial for obtaining a total derivative in the second line. Then, from the derivation
of Noether’s theorem, we conclude that
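$$\partial_\mu \underbrace{\Big[ \frac{\partial \mathcal{L}}{\partial (\partial_\mu \phi)}\, \big( d_\phi + x^\nu \partial_\nu \big) \phi - x^\mu \mathcal{L} \Big]}_{D^\mu} = 0 . \qquad (9.102)$$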
The vector Dµ is called the dilatation current. In the case of the scalar field theory
used earlier as an example, the explicit form of Dµ is
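$$D^\mu = x_\nu \underbrace{\big[ (\partial^\mu \phi)(\partial^\nu \phi) - g^{\mu\nu} \mathcal{L} \big]}_{\Theta^{\mu\nu}} + \underbrace{\phi\, (\partial^\mu \phi)}_{\frac{1}{2} \partial^\mu \phi^2} . \qquad (9.103)$$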
The final zero follows from the classical equation of motion of the field.
The first two terms are identical to the original D0 of eq. (9.103), and the third term
is a total spatial derivative. Therefore, when we integrate this charge density over all
space, the new definition of the dilatation current gives the same conserved charge as
eq. (9.103).
$$x^\mu \to x^\mu + \xi^\mu(x) . \qquad (9.110)$$
where ∇µ is the covariant derivative. Let us recall for later use an important identity
$$\int d^4x\, \sqrt{-g}\; A\, \nabla_\mu B = - \int d^4x\, \sqrt{-g}\; \big( \nabla_\mu A \big)\, B , \qquad (9.112)$$
In the first line, the second term vanishes when the field φ is a solution of the classical
equation of motion. For this to be true for an arbitrary variation ξν (x), we must have
$$\nabla_\mu T^{\mu\nu} = 0 , \quad \text{with} \quad T^{\mu\nu} \equiv \frac{2}{\sqrt{-g}}\, \frac{\delta S}{\delta g_{\mu\nu}} . \qquad (9.114)$$
4 Note that, although xµ is not a vector, the infinitesimal variation ξµ (x) is a vector, tangent to the
coordinate manifold at the point x. Therefore, it makes sense to act on it with a covariant derivative.
By construction, this tensor is symmetric and (covariantly) conserved, and the nature
of the coordinate transformation (9.111) makes it clear that it is related to translation
invariance5 . In order to obtain the flat space energy-momentum tensor, one should set
gµν to the Minkowski metric tensor after evaluating the derivative.
Moreover, if we apply a scale transformation to the coordinates,
xµ → eϑ xµ , (9.115)
the metric tensor is simply rescaled:
gµν → e−2ϑ gµν . (9.116)
If the classical action does not contain any dimensionful parameter, it is
invariant under this rescaling, and we can write
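$$0 = \delta S = \int d^4x\, \Big( e^{-2\vartheta} g_{\mu\nu}(x)\, \frac{\delta S}{\delta g_{\mu\nu}(x)} + \underbrace{\delta\phi(x)\, \frac{\delta S}{\delta \phi(x)}}_{=0} \Big) . \qquad (9.117)$$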
This equation implies that the derivative of the action with respect to the metric, and
therefore the energy-momentum tensor T µν , is traceless.
In order to illustrate this method, let us consider Yang-Mills theory, whose action
coupled to gravity reads
$$S = -\frac{1}{4} \int d^4x\, \sqrt{-g}\; g_{\mu\rho}\, g_{\nu\sigma}\, F^{\mu\nu}_a F^{\rho\sigma}_a . \qquad (9.118)$$
In order to calculate the derivative of this action with respect to the metric, we need
the variation of $\sqrt{-g}$, which can be obtained as follows:
Hence,
$$\frac{\partial \sqrt{-g}}{\partial g_{\mu\nu}} = \sqrt{-g}\; \frac{g^{\mu\nu}}{2} , \qquad (9.120)$$
and we obtain the following expression for the energy-momentum tensor:
gµν a αβ a
T µν = Fµα a Fα ν a − F F , (9.121)
4 αβ
whose trace is obviously zero.
5 It is important to note that the derivation implicitly assumes that the parameters in the action, such as
$$\mu\, \frac{\partial g}{\partial \mu} = \beta(g) . \qquad (9.123)$$
Moreover, even if the classical scale invariance is broken by the renormalization
group flow, it should be recovered at the fixed points of the RG flow. For instance, a
quantum field theory is scale invariant at critical points.
In Yang-Mills theory, we can derive the form of this trace in the following (non-
rigorous) manner. Let us start from the Yang-Mills action, written in terms of rescaled
fields, so that the coupling appears in the form of a prefactor g−2 :
$$S = \int d^4x\, \sqrt{-g}\, \Big[ -\frac{1}{4\, g_b^2}\, F^a_{\mu\nu} F^{\mu\nu\, a} \Big] , \qquad (9.124)$$
where $g_b$ is the bare coupling constant. When this theory is regularized by a gauge
invariant cutoff µ (e.g., a lattice regularization), the bare coupling becomes cutoff
dependent in order for the renormalized quantities to have a proper ultraviolet limit.
Then, consider again the scaling transformation defined in eqs. (9.115) and (9.116).
With a scale dependent coupling, the physics is invariant provided we also change the
scale at which the coupling is evaluated
$$g_b(\mu) \to g_b(e^{-\vartheta} \mu) . \qquad (9.125)$$
Then, by writing explicitly the two sources of ϑ dependence in the variation of the
action, we get
Z h 1 β(gb ) a µν a i
0 = δS = −ϑ d4 x 2 T µ µ − F F . (9.127)
gµν =ηµν gb 2 g3b µν
Therefore, we obtain the following form of the anomalous divergence of the dilatation
current:
$$\partial_\mu D^\mu = T^\mu{}_\mu = \frac{\beta(g)}{2\, g}\, F^a_{\mu\nu} F^{\mu\nu\, a} . \qquad (9.128)$$
This derivation is only heuristic, but a more rigorous treatment using properly renor-
malized operators would lead to the same result. This anomaly can also be derived in
perturbation theory, from the loop corrections to the dilatation current,
(The dotted line terminated by the dark blob denotes the vertex between two gluons
and the dilatation current.) Note that, thanks to asymptotic freedom, Yang-Mills
theory becomes increasingly scale invariant as the energy scale increases. Finally,
when one adds quarks in order to obtain QCD, the right hand side of the previous
equation also contains terms in m ψ̄ψ, due to the explicit breaking of scale invariance
by the masses of the quarks (this breaking is already present in the classical theory,
and is therefore not a quantum anomaly).
Chapter 10
Localized field configurations
All the applications of quantum field theory we have encountered so far amount to
studying situations that may be viewed as small perturbations above the vacuum state;
i.e. interactions involving states that contain only a few particles. Besides the fact that
these situations are actually encountered in scattering experiments, their importance
stems from the stability of the vacuum, that makes it a natural state to expand around.
In this chapter, we will study other field configurations, classically stable, that may
also be sensible substrates for expansions that differ from the standard perturbative
expansion that we have studied until now. However, under normal circumstances, a
localized “blob” of fields is not stable: it will usually decay into a field which is zero
everywhere. As we shall see, the stability of the field configurations considered in
this chapter is due to topological obstructions that prevent a smooth transformation
between the field configuration of interest and the null field that corresponds to the
vacuum. These field configurations can be classified according to their space-time
structure:
• Event-like : localized both in time and space (e.g., instantons). These may be
viewed as local extrema of the 4-dimensional action, and therefore may give a
(non-perturbative) contribution to path integrals.
[Figure: plot of the potential as a function of φ.]
In order to simplify the discussion, let us consider field configurations that depend
only on x, and are independent of time, as well as of the transverse coordinates y, z.
We seek field configurations that obey the classical field equation of motion,
$$-\partial_x^2 \phi + V'(\phi) = 0 , \qquad (10.3)$$
and have a finite energy (per unit of transverse area),
$$\frac{dE}{dy\, dz} = \int_{-\infty}^{+\infty} dx\, \Big[ \frac{1}{2} \big( \partial_x \phi(x) \big)^2 + V(\phi(x)) \Big] < \infty . \qquad (10.4)$$
1 This is for 4-dimensional spacetime. In D-dimensional spacetime, domain walls have dimension
D − 2.
This energy density is the sum of two positive definite terms (since we have adjusted
the potential so that its minima are V(±φ∗ ) = 0). For the integral over x to converge
when x → ±∞, it is necessary that φ(x) becomes constant when |x| → ∞, and that
this constant be +φ∗ or −φ∗ . There are therefore four possibilities for the values of
the field at x = ±∞:
The first two of these possibilities do not lead to stable field configurations of positive
energy, because they can be continuously deformed (while holding the asymptotic
values unchanged) into the constant fields φ(x) = +φ∗ , or φ(x) = −φ∗ , respectively,
that have zero energy. Physically, this means that if one creates a field configuration
with these boundary values, it will decay into a constant field (i.e., the regions where
the field was excited to values different from ±φ∗ will dilute away to |x| = ∞).
The interesting cases are encountered when the field takes values corresponding to
opposite minima at x = −∞ and x = +∞. If one holds the asymptotic values of the
field fixed, then it is not possible to deform continuously such a field configuration into
one that would have zero energy. Thus, there must be stable field configurations of
positive energy with these boundary values. A very handy trick, due to Bogomol’nyi,
is to rewrite the energy density as follows:
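$$\frac{dE}{dy\, dz} = \frac{1}{2} \int_{-\infty}^{+\infty} dx\, \Big[ \partial_x \phi(x) \pm \sqrt{2\, V(\phi(x))} \Big]^2 \mp \int_{\phi(-\infty)}^{\phi(+\infty)} d\phi\, \sqrt{2\, V(\phi)} . \qquad (10.6)$$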
In the cases i, ii, the second term vanishes, and the energy density is allowed to be
zero, by having a constant field equal to ±φ∗ . Let us consider now the case iii. In
this case, it is convenient to choose the minus sign in the first term, so that
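$$\frac{dE}{dy\, dz} = \frac{1}{2} \int_{-\infty}^{+\infty} dx\, \Big[ \partial_x \phi(x) - \sqrt{2\, V(\phi(x))} \Big]^2 + \underbrace{\int_{-\phi_*}^{+\phi_*} d\phi\, \sqrt{2\, V(\phi)}}_{>0} . \qquad (10.7)$$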
The second term is now strictly positive, and does not depend on the details of φ(x)
(except its boundary values). Since the first term is the integral of a square, this
implies that there is no field configuration of zero energy with this boundary condition.
The minimal energy density possible with this boundary condition is
$$\frac{dE}{dy\, dz} \bigg|_{\rm min} = \int_{-\phi_*}^{+\phi_*} d\phi\, \sqrt{2\, V(\phi)} , \qquad (10.8)$$
$$\partial_x^2 \phi = \frac{(\partial_x \phi)\, V'(\phi)}{\sqrt{2\, V(\phi)}} = V'(\phi) , \qquad (10.10)$$
which is nothing but the classical equation of motion (10.3). Solutions of this equation
with prescribed boundary values ±φ∗ at x = ±∞ interpolate between the two ground
states of the potential of the figure 10.1. The ground state φ = +φ∗ is realized at
x → +∞, while the other ground state is realized at x → −∞. Since these two vacua
correspond to two different ways to spontaneously break the φ → −φ symmetry,
there must exist an interface between the two phases, called a domain wall. From
eq. (10.9), we may write
$$x(\phi) = x_0 + \int_0^{\phi} \frac{d\xi}{\sqrt{2\, V(\xi)}} , \qquad (10.11)$$
In the middle of this process, the field in this region will be φ = 0, at which
V(φ) = V0 > 0, a configuration that has an infinite energy density. Thus, the domain
wall solution is stable, except for shifts of x0 (since the energy density is independent
of x0 ): the domain wall may move along the x axis, but cannot disappear.
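As a concrete illustration of the Bogomol’nyi bound, the sketch below assumes the quartic potential V(φ) = (λ/4)(φ² − φ∗²)², which is a choice made here for illustration and not necessarily the potential plotted in the figure. For this choice, ∂x φ = √(2V) is solved by φ(x) = φ∗ tanh(√(λ/2) φ∗ (x − x0)), and the script checks numerically that the energy per unit area of this profile saturates eq. (10.8) and does not depend on x0. All names and numerical values are hypothetical.

```python
import math

LAM = 1.0        # assumed quartic coupling (illustrative value)
PHI_STAR = 1.0   # assumed position of the minima, V(+-PHI_STAR) = 0


def potential(phi):
    """Assumed double-well potential V = (lam/4) (phi^2 - phi_*^2)^2."""
    return 0.25 * LAM * (phi * phi - PHI_STAR * PHI_STAR) ** 2


def kink(x, x0=0.0):
    """Profile that solves d(phi)/dx = sqrt(2 V(phi)) for this potential."""
    k = math.sqrt(LAM / 2.0) * PHI_STAR
    return PHI_STAR * math.tanh(k * (x - x0))


def trapezoid(f, a, b, n):
    """Composite trapezoidal rule with n sub-intervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h


def energy_per_area(x0=0.0, half_width=30.0, n=60000):
    """Energy functional of eq. (10.4) evaluated on the kink profile."""
    eps = 1.0e-5

    def density(x):
        # central finite difference for the spatial derivative
        dphi = (kink(x + eps, x0) - kink(x - eps, x0)) / (2.0 * eps)
        return 0.5 * dphi * dphi + potential(kink(x, x0))

    return trapezoid(density, -half_width, half_width, n)


def bogomolnyi_bound(n=20000):
    """Right hand side of eq. (10.8): integral of sqrt(2 V) between the minima."""
    return trapezoid(lambda p: math.sqrt(2.0 * potential(p)),
                     -PHI_STAR, PHI_STAR, n)


if __name__ == "__main__":
    print("kink energy        :", energy_per_area())
    print("shifted kink energy:", energy_per_area(x0=3.0))
    print("Bogomol'nyi bound  :", bogomolnyi_bound())
```

For this assumed potential the bound can also be evaluated in closed form, (4/3) √(λ/2) φ∗³, which provides an independent check of the two numerical integrals.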
Let us finish with a note on the y, z dependence that has been neglected so far.
Reintroducing the transverse dependence adds the term $\frac{1}{2} \big( (\partial_y \phi)^2 + (\partial_z \phi)^2 \big)$ to
the integrand of the energy density in eq. (10.4). This term is positive, or zero for
fields that do not depend on y and z. Therefore, the minimum of energy density is
reached for domain walls that are invariant by translation in the transverse directions.
Domain walls that are not translation invariant are not stable, but will relax to this
y, z-invariant configuration. Physically, one may view the term $\frac{1}{2} \big( (\partial_y \phi)^2 + (\partial_z \phi)^2 \big)$
as a surface tension energy, and the energetically favored configurations are those for
which the interface has the lowest curvature.
10.2 Skyrmions
Skyrmions are field configurations that arise in models resulting from a spontaneous
symmetry breaking, such as a non-linear sigma model. Consider for instance the
following action,
$$S[\xi] = \int d^Dx\; \frac{1}{2} \sum_{a,b} g_{ab}(\xi)\, \partial_i \xi^a\, \partial_i \xi^b + \cdots , \qquad (10.12)$$
where the fields ξa are the Nambu-Goldstone bosons of a broken symmetry from the
symmetry group G down to H. The matrix gab (ξ) is positive definite, and in general
field dependent. The dots represent terms with higher derivatives, that we have not
written explicitly. In such a model, the Nambu-Goldstone fields ξa may be viewed as
elements of the coset G/H.
In order to have a finite action, the derivatives of the fields should decrease faster
than $|x|^{-D/2}$ at large distance,
which means that the field $\xi^a(x)$ should go to a constant, with a remainder that
decreases faster than $|x|^{1-D/2}$.
The constant value of ξa at infinity can be chosen to be some fixed predefined
element of G/H. Thus, we may view the field ξa (x) as a mapping
ξa : SD ↦ G/H , (10.14)
Figure 10.3: Stereographic projection that maps the plane $\mathbb{R}^2$ to the sphere S2 .
All the points at infinity in the plane are identified, and mapped to the north
pole of the sphere.
$$\xi^a_R(x) \equiv \xi^a(x/R) . \qquad (10.15)$$
The action becomes $S[\xi_R] = R^{D-2}\, S[\xi]$. In D > 2 dimensions, we may make it
decrease continuously to zero, despite the fact that $\xi^a$ and $\xi^a_R$ have the same topology.
Such a solution may be stabilized by adding a term with higher derivatives, such as
$$V[\xi] \equiv \int d^Dx\; h_{abcd}(\xi)\, \partial_i \xi^a\, \partial_i \xi^b\, \partial_j \xi^c\, \partial_j \xi^d . \qquad (10.16)$$
Under the same rescaling, we now have $V[\xi_R] = R^{D-4}\, V[\xi]$. In D = 3 spatial
dimensions, the term with second derivatives decreases to zero when R → 0, while
the above quartic term increases to +∞. Their sum therefore exhibits an extremum at
some finite scale R∗ . Although we obtain in this way non-trivial stable solutions, there
is a priori no reason to limit ourselves to terms with four derivatives, and therefore the
predictive power of such a model is limited by the many possible choices for these
higher order terms.
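The scaling argument can be made concrete with a few lines of code. The sketch below takes two hypothetical positive numbers, S2 and V4, for the values of the two-derivative and four-derivative terms on some reference configuration, and minimizes R^(D−2) S2 + R^(D−4) V4 over the scale R. For 2 < D < 4 the stationary point is R∗² = (4 − D) V4 / ((D − 2) S2), and the numerical search is compared with this formula. The function names and the search interval are illustrative choices.

```python
import math


def rescaled_energy(R, S2, V4, D=3):
    """R^(D-2) * S2 + R^(D-4) * V4 for the rescaled field xi_R(x) = xi(x/R)."""
    return R ** (D - 2) * S2 + R ** (D - 4) * V4


def analytic_scale(S2, V4, D=3):
    """Stationary point of rescaled_energy, valid for 2 < D < 4."""
    if not 2 < D < 4:
        raise ValueError("a finite stabilizing scale requires 2 < D < 4")
    return math.sqrt((4 - D) * V4 / ((D - 2) * S2))


def numeric_scale(S2, V4, D=3, lo=1e-3, hi=1e3, iterations=200):
    """Golden-section search for the minimum over R in [lo, hi]."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    for _ in range(iterations):
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if rescaled_energy(c, S2, V4, D) < rescaled_energy(d, S2, V4, D):
            b = d
        else:
            a = c
    return 0.5 * (a + b)


if __name__ == "__main__":
    S2, V4 = 2.0, 8.0   # hypothetical values of the two terms at R = 1
    print("analytic R*:", analytic_scale(S2, V4))
    print("numeric  R*:", numeric_scale(S2, V4))
```

In D = 3 the formula reduces to R∗ = √(V4/S2), the scale at which the two terms contribute equally to the sum.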
10.3 Monopoles
Magnetic monopoles are not forbidden in quantum electrodynamics, but their exis-
tence would automatically lead to the quantization of electrical charge, as first noted
by Dirac. Let us reproduce here this argument. Consider the radial magnetic field of a
would-be monopole:
$$\mathbf{B} = g\, \frac{\hat{\mathbf{x}}}{|\mathbf{x}|^2} . \qquad (10.17)$$
$$\mathbf{A}(\mathbf{x}) = g\, \frac{1 - \cos\theta}{|\mathbf{x}|\, \sin\theta}\; \mathbf{e}_\phi , \qquad (10.18)$$
where θ is the polar angle, φ the azimuthal angle, and eφ is the unit vector tangent to
the circle of constant |x| and θ. This vector potential is not defined on the semi-axis
θ = π (i.e. the semi-axis of negative z). One may argue that on this semi-axis, we
have in addition to the monopole field a singular Bz whose magnetic flux precisely
cancels the magnetic flux of the monopole, so that the total flux on any closed surface
containing the origin is zero, as illustrated in the figure 10.4. Thus, in this solution, the
magnetic flux Φm ≡ 4π g of the monopole is “brought from infinity” by an infinitely
thin “solenoid”. Even if it is infinitely thin, such a solenoid may in principle be
detected by looking for interferences between the wavefunctions of charged particles
that have propagated left and right of the solenoid (this corresponds to the Aharonov-
Bohm effect). For a particle of electrical charge e, the corresponding phase shift is
eΦm = 4πeg. Dirac pointed out that this interference is absent when the phase shift
is a multiple of 2π, i.e. when the electric and magnetic charges are related by
$$g\, e = \frac{n}{2} , \quad n \in \mathbb{Z} . \qquad (10.19)$$
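A short numerical check of the flux bookkeeping behind this argument: the sketch below integrates the vector potential of eq. (10.18) along a circle of constant |x| and θ, and compares the result with the flux of the radial field of eq. (10.17) through the spherical cap bounded by that circle, 2πg (1 − cos θ). As θ → π the circulation tends to 4πg, the flux carried by the solenoid. The function names and the values of e and g are illustrative.

```python
import math


def vector_potential(x, y, z, g):
    """Cartesian components of A = g (1 - cos theta) / (|x| sin theta) e_phi."""
    r = math.sqrt(x * x + y * y + z * z)
    rho = math.hypot(x, y)              # equals |x| sin(theta)
    cos_theta = z / r
    magnitude = g * (1.0 - cos_theta) / rho
    # e_phi = (-sin(phi), cos(phi), 0) = (-y/rho, x/rho, 0)
    return (-magnitude * y / rho, magnitude * x / rho, 0.0)


def circulation(theta, g, r=1.0, n=2000):
    """Line integral of A along the circle of polar angle theta (midpoint rule)."""
    dphi = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        phi = (i + 0.5) * dphi
        x = r * math.sin(theta) * math.cos(phi)
        y = r * math.sin(theta) * math.sin(phi)
        z = r * math.cos(theta)
        ax, ay, _ = vector_potential(x, y, z, g)
        # tangent vector dx/dphi along the circle
        tx = -r * math.sin(theta) * math.sin(phi)
        ty = r * math.sin(theta) * math.cos(phi)
        total += (ax * tx + ay * ty) * dphi
    return total


def cap_flux(theta, g):
    """Flux of B = g x_hat / |x|^2 through the cap with polar angle below theta."""
    return g * 2.0 * math.pi * (1.0 - math.cos(theta))


def solenoid_is_invisible(e, g, tol=1e-9):
    """True when the phase 4 pi e g is a multiple of 2 pi, i.e. 2 e g is an integer."""
    n = 2.0 * e * g
    return abs(n - round(n)) < tol


if __name__ == "__main__":
    g = 0.7
    for theta in (0.5, 1.5, 3.0):
        print(theta, circulation(theta, g), cap_flux(theta, g))
```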
Thus, electrodynamics can perfectly accommodate genuine magnetic monopoles,
provided this condition is satisfied, since the annoying solenoid that comes with
the above vector potential is totally undetectable. In particular, this implies that
all electrical charges should be multiples of some elementary quantum of electrical
charge if monopoles exist. Note that in quantum electrodynamics, while the electric
and magnetic charges must be related by eq. (10.19), there is no constraint a priori on
the mass of monopoles and it should be regarded as a free parameter.
Figure 10.4: Left: notations for the polar coordinates local frame used in
eq. (10.18). Right: magnetic field lines of the Dirac monopole, corresponding
to the vector potential of eq. (10.18).
Let us mention briefly an alternative argument, that does not involve discussing
the detectability of Dirac’s solenoid. Instead of the vector potential of eq. (10.18),
one could instead have chosen
$$\mathbf{A}'(\mathbf{x}) = -g\, \frac{1 + \cos\theta}{|\mathbf{x}|\, \sin\theta}\; \mathbf{e}_\phi , \qquad (10.20)$$
(A − A′ ) · dx = 2g dφ , (10.21)
where the integration path γ[0, φ] is the portion of the equator that extends between
the azimuthal angles 0 and φ. After a complete revolution, we have
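$$\begin{aligned}
\Omega(2\pi) &= \Omega(0)\, \exp\Big( -i e \oint_{\rm Equator} \big( \mathbf{A} - \mathbf{A}' \big) \cdot d\mathbf{x} \Big) \\
&= \Omega(0)\, \exp\Big( -i e\, \underbrace{\big( \Phi_U + \Phi_L \big)}_{\text{flux} \,=\, 4\pi g} \Big) = \Omega(0)\; e^{-4\pi i\, e g} . \qquad (10.26)
\end{aligned}$$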
To obtain the first equality on the second line, we use Stokes’s theorem to rewrite the
contour integrals of A and A ′ as surface integrals of the corresponding magnetic field.
Therefore, we obtain the magnetic fluxes through the upper and lower hemispheres,
respectively, whose sum is the total flux 4πg of the monopole. Requiring the
single-valuedness of Ω leads to Dirac’s condition on eg.
candidate for a field theory of electroweak interactions, until the neutral vector boson Z0 was discovered.
Here, we use it as a didactical example of a theory with classical solutions that are magnetic monopoles.
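$$\begin{aligned}
V(\Phi) &\equiv \frac{\lambda}{8} \big( \Phi^a \Phi^a - v^2 \big)^2 , \\
D_\mu \Phi^a &= \partial_\mu \Phi^a - e\, \epsilon_{abc}\, A^b_\mu \Phi^c , \\
F^a_{\mu\nu} &= \partial_\mu A^a_\nu - \partial_\nu A^a_\mu - e\, \epsilon_{abc}\, A^b_\mu A^c_\nu , \qquad (10.28)
\end{aligned}$$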
where we have written explicitly the structure constants of the su(2) algebra. In order
to study static classical solutions, it is simpler to consider the minima of the energy:
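$$E \equiv \int d^3x\, \Big[ \frac{1}{2} \Big( E^a_i E^a_i + B^a_i B^a_i + \big( D_i \Phi^a \big) \big( D_i \Phi^a \big) \Big) + V(\Phi) \Big] , \qquad (10.29)$$
where $E^a_i \equiv F^a_{0i}$ is the (non-abelian) electrical field and $B^a_i \equiv \frac{1}{2} \epsilon_{ijk} F^a_{jk}$ is the
magnetic field.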
It is possible to choose a gauge (called the unitary gauge) in which the scalar field
triplet takes the form
$$\Phi^a = \big( 0, 0, v + \varphi \big) . \qquad (10.30)$$
In this equation, we have anticipated spontaneous symmetry breaking, that will give
to the scalar field a vacuum expectation value v, and we have made a specific choice
about the orientation of the vacuum in SU(2). The field ϕ is thus the quantum
fluctuation of the scalar about its expectation value. In this process, the fields $A^{1,2}_\mu$
will become massive (with a mass $M_W = e\, v$), as well as the scalar field (with mass
$M_H = \sqrt{\lambda}\, v$), while the field $A^3_\mu$ remains massless (it corresponds to a residual
unbroken U(1) symmetry). The classical vacuum of this theory corresponds to
$$\varphi = 0 , \quad A^a_\mu = 0 . \qquad (10.31)$$
Now, we seek stable classical field configurations that are local minima of the
energy, but are not equivalent to the vacuum in the entire space. To prove the
existence of such fields, it is sufficient to exhibit a field configuration of non-zero
energy that cannot be continuously deformed into the null fields of eq. (10.31) (up to
a gauge transformation). In order to have a finite energy, the scalar field Φa should
reach a minimum of the potential V(Φ) at large distance |x| → ∞ (we have shifted
the potential so that its minimum is zero), but it may approach different minima
depending on the direction $\hat{x}$ in space. The allowed asymptotic behaviours of $\Phi^a$
define a mapping from the sphere S2 (the orientations $\hat{x}$, for three spatial dimensions)
hedgehog configuration of
eq. (10.32). Each needle indicates
the internal orientation of Φa at
the corresponding point on the
sphere.
$$\Phi^a(\hat{x}) \equiv v\, \hat{x}_a , \qquad (10.32)$$
sometimes called a “hedgehog field” because the direction of internal space pointed
to by the scalar field is locked to the spatial direction, as shown in the figure 10.5.
Any smooth classical field Φa that obeys this boundary condition at infinite spatial
distance must vanish at some point in the interior of the sphere. Therefore, it cannot
simply be a gauge transform of the constant field Φa = v δ3a (the expectation value
of the scalar field in the vacuum). Once again, the classes of fields that can be
continuously deformed into one another are given by a homotopy group, in this case
the group π2 (M0 ) where M0 is the manifold of the minima of the scalar potential.
For the SU(2) group, M0 is topologically equivalent to the 2-sphere S2 , and the
equivalence classes of the mappings S2 ↦ S2 are indexed by the integers, since
π2 (S2 ) = ℤ. The hedgehog field of eq. (10.32) has topological number +1, while
the vacuum has topological number 0.
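The integer that labels these classes can be computed explicitly as the degree of the map from the spatial directions to the unit vector n = Φ/|Φ|, for which a standard expression is (1/4π) ∫ dθ dφ n · (∂θ n × ∂φ n). The sketch below evaluates it numerically for the hedgehog of eq. (10.32), for the vacuum orientation, and for a hypothetical map that wraps the azimuthal angle twice. The function names and the grid sizes are illustrative choices.

```python
import math


def hedgehog(theta, phi):
    """Unit vector of eq. (10.32): internal direction locked to the spatial one."""
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))


def vacuum(theta, phi):
    """Constant orientation along the third internal direction."""
    return (0.0, 0.0, 1.0)


def double_hedgehog(theta, phi):
    """Hypothetical map that winds twice around the internal sphere."""
    return hedgehog(theta, 2.0 * phi)


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def winding_number(n_hat, n_theta=200, n_phi=200, eps=1.0e-5):
    """Degree of a map S2 -> S2, by a midpoint rule on the (theta, phi) grid."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            n = n_hat(theta, phi)
            # central finite differences for the two tangent vectors
            up, dn = n_hat(theta + eps, phi), n_hat(theta - eps, phi)
            dt = tuple((a - b) / (2.0 * eps) for a, b in zip(up, dn))
            up, dn = n_hat(theta, phi + eps), n_hat(theta, phi - eps)
            dp = tuple((a - b) / (2.0 * eps) for a, b in zip(up, dn))
            c = cross(dt, dp)
            total += (n[0] * c[0] + n[1] * c[1] + n[2] * c[2]) * d_theta * d_phi
    return total / (4.0 * math.pi)


if __name__ == "__main__":
    for name, field in (("hedgehog", hedgehog), ("vacuum", vacuum),
                        ("double hedgehog", double_hedgehog)):
        print(name, winding_number(field))
```

The result is only approximately an integer because of the discretization, and it is meant to be rounded to the nearest integer.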
4 Here, we see that it is crucial that the scalar potential has non-trivial minima. If Φa ≡ 0 was the
only minimum, it would not be possible to construct solutions of finite energy that are not topologically
equivalent to the vacuum.
Thus, Ω transforms the usual scalar vacuum into the hedgehog configuration at
infinity. Note that (10.33) is not a valid gauge transformation over the entire space
because it is not well defined at the origin.
The choice of eq. (10.32) for the asymptotic behaviour of the scalar field was
motivated by the requirement that the potential V(Φ) gives a finite contribution to the
energy. The term in $\big( D_i \Phi^a \big)^2$ should also give a finite contribution. However, note
that
$$\partial_i \Phi^a(\hat{x}) = \frac{v}{|x|}\, \big( \delta_{ia} - \hat{x}_i \hat{x}_a \big) \qquad (10.35)$$
|x|
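is not square integrable. We must therefore adjust the asymptotic behaviour of the
gauge potential in order to cancel this term in the covariant derivative, by requiring
that
$$\epsilon_{abc}\, A^b_i\, \hat{x}^c \underset{|x| \to \infty}{=} \frac{\delta_{ia} - \hat{x}_i \hat{x}_a}{e\, |x|} , \qquad (10.36)$$
which is satisfied if
$$A^b_i \underset{|x| \to \infty}{=} \frac{\epsilon_{ibd}\, \hat{x}_d}{e\, |x|} + \text{term in } \hat{x}_b . \qquad (10.37)$$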
The corresponding field strength and magnetic field are given by
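$$\begin{aligned}
F^a_{ij} &\underset{|x| \to \infty}{=} \frac{1}{e\, |x|^2} \Big[ 2\, \epsilon_{ija} + 2 \big( \epsilon_{iad}\, \hat{x}_j - \epsilon_{jad}\, \hat{x}_i \big) \hat{x}^d - \epsilon_{ijd}\, \hat{x}_a \hat{x}_d \Big] , \\
B^a_i &\underset{|x| \to \infty}{=} \frac{\hat{x}_i\, \hat{x}_a}{e\, |x|^2} . \qquad (10.38)
\end{aligned}$$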
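These index manipulations are easy to get wrong, so here is a small numerical cross-check. It evaluates the leading term of eq. (10.37) at a randomly chosen direction, verifies that it satisfies eq. (10.36), and contracts the field strength of eq. (10.38) with ½ ǫijk to recover the radial magnetic field. The helper names, the random seed and the values of e and |x| are arbitrary choices made for this illustration.

```python
import math
import random

IDX = range(3)


def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


def delta(i, j):
    return 1.0 if i == j else 0.0


def random_direction(seed=0):
    """Unit vector drawn from an isotropic distribution."""
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in IDX]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def gauge_potential(xh, e, r):
    """Leading term of eq. (10.37): A[b][i] = eps_{ibd} xh_d / (e r)."""
    return [[sum(eps(i, b, d) * xh[d] for d in IDX) / (e * r) for i in IDX]
            for b in IDX]


def field_strength(xh, e, r):
    """First line of eq. (10.38), stored as F[a][i][j]."""
    def f(a, i, j):
        s = 2.0 * eps(i, j, a)
        for d in IDX:
            s += 2.0 * (eps(i, a, d) * xh[j] - eps(j, a, d) * xh[i]) * xh[d]
            s -= eps(i, j, d) * xh[a] * xh[d]
        return s / (e * r * r)
    return [[[f(a, i, j) for j in IDX] for i in IDX] for a in IDX]


def magnetic_field(xh, e, r):
    """B[a][i] = (1/2) eps_{ijk} F^a_{jk}."""
    F = field_strength(xh, e, r)
    return [[0.5 * sum(eps(i, j, k) * F[a][j][k] for j in IDX for k in IDX)
             for i in IDX] for a in IDX]


if __name__ == "__main__":
    xh, e, r = random_direction(), 0.5, 3.0
    B = magnetic_field(xh, e, r)
    for a in IDX:
        print([round(B[a][i] * e * r * r, 6) for i in IDX],
              [round(xh[i] * xh[a], 6) for i in IDX])
```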
Therefore, at large distance (these considerations do not give the precise form of the
fields at finite distance) there is a purely radial magnetic field that vanishes like $|x|^{-2}$,
i.e. according to Coulomb’s law, thus suggesting that a magnetic monopole is present
at the origin. For a more robust interpretation, we should apply a gauge transformation
that maps the asymptotic hedgehog scalar field into the usual scalar vacuum, aligned
with the third colour direction. Thanks to eq. (10.34), we see that in this process
the magnetic field of eq. (10.38), proportional to $\hat{x}_a$, will become proportional to
$\delta_{3a}$. But the third colour direction precisely corresponds to the gauge potential that
remains massless in the spontaneous symmetry breaking SU(2) → U(1). Therefore,
eq. (10.38) is indeed the magnetic field of a U(1) magnetic monopole. Its flux through
a sphere surrounding the origin is
$$\Phi_m = \frac{4\pi}{e} , \qquad (10.39)$$
equivalent to that of a magnetic charge $g \equiv e^{-1}$ at the origin.
5 When an SU(2) transformation in the fundamental representation is written as
$\Omega \equiv u_0 + 2i\, u_a t^a_f$ ,
Until now, we have only discussed the implications of requiring a finite energy on
the asymptotic form of the scalar field and of the gauge potentials. In order to obtain
their values at finite distance, one may make the following ansatz:
$$\Phi^a(x) = v\, \hat{x}_a\, f(|x|) , \quad A^a_i(x) = \epsilon_{iab}\, \frac{\hat{x}_b}{e\, |x|}\, g(|x|) , \qquad (10.40)$$
where f, g are two functions that can be determined from the classical equations of
motion. From this solution over the entire space, one sees that the monopole is an
extended object made of two parts:
Given these fields, the total energy of the field configuration can be identified with
the mass (in contrast with Dirac’s point-like monopole in quantum electrodynamics,
whose mass is not constrained) of the monopole (since it is static). It takes the form
$$M_m = \frac{4\pi}{e^2}\, M_W\, C(\lambda/e^2) , \qquad (10.41)$$
where $C(\lambda/e^2)$ is a slowly varying function of the ratio of coupling constants, of
order unity. Note that the core and the halo contribute comparable amounts to this
mass. Interestingly, the size $M_W^{-1}$ of this monopole is much larger (by a factor
$\alpha^{-1} = 4\pi/e^2$) than its Compton wavelength $M_m^{-1}$. Therefore, when α ≪ 1, the
monopole receives very small quantum corrections and is essentially a classical object.
We have argued earlier that the topologically non-trivial configurations of the
scalar field that lead to a finite energy can be classified according to the homotopy
group π2 (S2 ). Since this group is the group ℤ of the integers, there are monopole
solutions with any magnetic charge multiple of $e^{-1}$ (the solution we have constructed
explicitly above has topological number 1), i.e.
$$g\, e = n , \quad n \in \mathbb{Z} . \qquad (10.42)$$
Therefore, in this field theoretical monopole solution, the electrical charge would also
be naturally quantized. At first sight, eqs. (10.42) and (10.19) appear to differ by a
factor 1/2. Note however, that in the SU(2) model we are considering in this section,
it is possible to introduce matter fields in the fundamental representation6 that carry
a U(1) electrical charge ±e/2 (this is the smallest possible electrical charge in this
model). Thus, if rewritten in terms of this minimal electrical charge, the monopole
quantization condition (10.42) is in fact identical to Dirac’s condition. Although the
Georgi-Glashow model studied in this section is no longer considered phenomenologically
relevant, theories that unify the strong and electroweak interactions into a
single compact Lie group (such as SU(5) for instance) do have magnetic monopoles.
$$M_0 = \big\{ \Phi \;\big|\; \Phi = \Omega\, \Phi_0 \,;\; \Omega \in G \big\} . \qquad (10.43)$$
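6 If Ψ is a doublet that lives in this representation, the covariant derivative acting on it reads:
$$D_\mu \Psi = \partial_\mu \Psi - i e\, A^a_\mu t^a_f\, \Psi = \partial_\mu \Psi - i\, \frac{e}{2}\, A^3_\mu \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \Psi .$$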
(Here, we are assuming that there are no accidental degeneracies among the minima,
i.e. no minima Φ0 and Φ0′ that are not related by a gauge transformation.) The
manifold defined in eq. (10.43) is in fact the coset G/H,
M0 = G/H . (10.44)
• The relevant gauge fields belong to h all the way down to zero radius. In this
case, the magnetic charge is independent of the radius of the sphere at all R,
which means that the monopole is a point-like singularity at the origin, like the
original Dirac monopole.
• There exists a short-distance core in which the gauge fields live in an algebra
which is larger than h (possibly the algebra g before symmetry breaking). Inside
this core, the above argument is no longer valid, and the magnetic charge inside
the sphere may vary continuously with the radius. In this case, the monopole is
an extended object whose size is the radius of the core (its magnetic charge is
spread out in the core).
7 Therefore, it must be conserved by time evolution. Indeed, time evolution is continuous, and the only
Figure 10.7: Decomposition of the sphere into two hemispheres with gauge
potentials A and A ′ . (Figure labels: A ∈ h, Ω(φ) ∈ H, A ′ ∈ h.)
For a simply connected Lie group G (e.g., all the SU(N)), the first homotopy group is
trivial, π1 (G) = {0}, and we have
10.4 Instantons
Until now, all the extended field configurations we have encountered were time
independent. After integration over time, their action is infinite, and therefore they do
not contribute to path integrals. In this section, we will discuss field configurations of
finite action, called instantons, that are localized both in space and in time. Consider
a Yang-Mills theory in D-dimensional Euclidean space, whose action reads
$$S[A] \equiv \frac{1}{4} \int d^Dx\; F^{ij}_a(x)\, F^{ij}_a(x) . \qquad (10.47)$$
(We use latin indices i, j, k, · · · for Lorentz indices in Euclidean space.) Instantons
are non-trivial (i.e. not pure gauges in the entire spacetime) gauge field configurations
that realize local minima of this action.
$$A^i_a(x)\, t^a = \underbrace{\frac{i}{g}\, \Omega^\dagger(\hat{x})\, \partial^i \Omega(\hat{x})}_{|x|^{-1}} + \underbrace{a^i(x)}_{\ll |x|^{-1}} . \qquad (10.49)$$
The field strength associated to such a field decreases faster than $|x|^{-2}$, and therefore
the corresponding action is finite in D = 4 dimensions. There is in fact a scaling
argument showing that instanton solutions can only exist in four dimensions. Given
an instanton field configuration Ai (x) and a scaling factor R, let us define
$$A^i_R(x) \equiv \frac{1}{R}\, A^i(x/R) . \qquad (10.50)$$
Since classical Yang-Mills theory is scale invariant, the field AiR is also an extremum
of the action (i.e. a solution of the classical Yang-Mills equations) if Ai is. The action
of this rescaled field is given by
$$S[A_R] = R^{D-4}\, S[A] . \qquad (10.51)$$
[Figure labels: Instanton; Pure gauge, F ij = 0; S3 ]
8 The only exception to this reasoning occurs if S[A] = 0. But this happens only in the trivial situation
By choosing appropriately the sign, this can be rearranged into a lower bound for the
action:
$$S[A] \geq \frac{1}{8}\, \epsilon_{ijkl} \int d^4x\; F^a_{ij} F^a_{kl} , \qquad (10.54)$$
known as Bogomol’nyi’s inequality. Interestingly, we recognize in the right hand side
an integral identical to the one that enters in the θ-term of Yang-Mills theories (see
the section 4.5) or in the anomaly function (see the section 3.5 and the chapter 9).
Ω : S3 ↦ G , (10.56)
with a fixed value $\Omega(\hat{x}_0) = 1$. These functions can be grouped into topological classes,
such that mappings belonging to the same class can be continuously deformed into
one another. The set of these classes can be endowed with a group structure, called
the third homotopy group of G and denoted π3 (G) (for any SU(N) group with N ≥ 2,
we have π3 (G) = ℤ). Note that the asymptotic forms of the fields $A^i$ and $A^i_R$ are
identical, implying that these two instantons belong to the same topological class.
Since their actions are identical in four dimensions, this scaling provides a continuous
family of instantons that belong to the same topological class and have the same
action. This is in fact more general: we will show later that the action of an instanton
depends only on the topological class of the instanton, and therefore can only vary by
discrete amounts.
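ε_{ijkl} F^a_{ij} F^a_{kl} is a total derivative,
\[ \frac{1}{2}\, \epsilon_{ijkl} F^a_{ij} F^a_{kl} = \partial_i \underbrace{\Big[ \epsilon_{ijkl} \Big( A^a_j F^a_{kl} - \frac{g}{3}\, f^{abc} A^a_j A^b_k A^c_l \Big) \Big]}_{K_i} . \tag{10.57} \]
(This property was derived in the section 4.5.) The vector K_i can also be written as a
trace of objects belonging to the fundamental representation:
\[ K_i = 2\, \epsilon_{ijkl}\; \mathrm{tr} \Big( A_j F_{kl} + \frac{2ig}{3}\, A_j A_k A_l \Big) . \tag{10.58} \]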
Since the integrand in the right hand side of eq. (10.54) is a total derivative, one
may use Stokes’s theorem in order to rewrite the integral as a 3-dimensional integral
extended to a spherical hypersurface SR of radius R → ∞:
\[ S_{\rm min}[A] \equiv \frac{1}{8}\, \epsilon_{ijkl} \int d^4x\; F^a_{ij} F^a_{kl} = \lim_{R\to\infty} \frac{1}{4} \int_{S_R} d^3S_i\; K_i . \tag{10.59} \]
Thus, the minimum of the action depends only on the behaviour of the gauge field
at large distance (this does not mean that the action does not depend on details of
the gauge field in the interior, but more simply that the gauge fields that realize the
minima are fully determined in the bulk by their asymptotic behaviour). From the
earlier discussion of the asymptotic behaviour of instanton solutions, we know that
Therefore, in the current K_i, the term A_j F_{kl} is negligible compared to the term
A_j A_k A_l at large distance, and we can also write
\[ \begin{aligned}
S_{\rm min}[A] &= \lim_{R\to\infty} \frac{ig}{3} \int_{S_R} d^3S_i\; \epsilon_{ijkl}\, \mathrm{tr}\big( A_j A_k A_l \big) \\
&= \lim_{R\to\infty} \frac{1}{3 g^2} \int_{S_R} d^3S_i\; \epsilon_{ijkl}\, \mathrm{tr}\big( \Omega^\dagger (\partial_j \Omega)\, \Omega^\dagger (\partial_k \Omega)\, \Omega^\dagger (\partial_l \Omega) \big) ,
\end{aligned} \tag{10.61} \]
where Ω(x̂) is the group element that defines the asymptotic pure gauge behaviour
of the gauge potential in the direction x̂. In this expression, each derivative brings
a factor R^{-1}, while the domain of integration scales as R^3. The result is therefore
independent of the radius of the sphere and we can ignore the limit R → ∞.
On this sphere, let us choose a system of coordinates made of three variables
(θ1 , θ2 , θ3 ), such that the volume element in SR is dθ1 dθ2 dθ3 . To rewrite the
previous integral more explicitly in terms of these variables, it is convenient to
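\[ d^3S_i = \hat{x}_i\, d\theta_1 d\theta_2 d\theta_3 = \frac{\partial\theta_0}{\partial x_i}\, d\theta_1 d\theta_2 d\theta_3 = d^4x\; \delta(\theta_0 - R)\, \frac{\partial\theta_0}{\partial x_i} \tag{10.62} \]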
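where we have rewritten the derivatives with respect to x_i in terms of derivatives with
respect to θ_a (the implicit sums on a, b, c run over the indices 1, 2, 3 only, because
the group element Ω depends only on the orientation x̂). Finally, we may use:
\[ \epsilon_{ijkl}\, \frac{\partial\theta_0}{\partial x_i} \frac{\partial\theta_a}{\partial x_j} \frac{\partial\theta_b}{\partial x_k} \frac{\partial\theta_c}{\partial x_l} = \det\!\left( \frac{\partial(\theta_0 \theta_1 \theta_2 \theta_3)}{\partial(x_1 x_2 x_3 x_4)} \right) \underbrace{\epsilon_{0abc}}_{=\,\epsilon_{abc}} . \tag{10.64} \]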
The determinant is nothing but the Jacobian of the coordinate transformation {xi } →
{θa }. Therefore, we obtain
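\[ S_{\rm min}[A] = \frac{1}{3 g^2} \int d\theta_1 d\theta_2 d\theta_3\; \epsilon_{abc}\, \mathrm{tr}\Big( \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_a}\, \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_b}\, \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_c} \Big) . \tag{10.65} \]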
The Cartan-Maurer form F[Ω] is an integral that generalizes the one encountered
earlier:
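\[ F[\Omega] \equiv \int d\theta_1 \cdots d\theta_d\; \epsilon_{i_1 \cdots i_d}\, \mathrm{tr}\Big( \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_1}} \cdots \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_d}} \Big) , \tag{10.67} \]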
which is identical to eq. (10.67), except for the fact that it is expressed in terms of
the new coordinates θ′i . This proves that F[Ω] is independent of the choice of the
coordinate system on S, and is a property of the manifold S itself.
Change under a small variation of Ω : Let us now study the change of F[Ω]
when we vary the mapping Ω by δΩ. Thanks to the cyclicity of the trace, the
variation of each factor Ω† ∂Ω/∂θi gives the same contribution to the variation of
F[Ω]. Therefore, it is sufficient to consider one of these variations, and to multiply its
contribution by the number of factors, d:
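\[ \delta F[\Omega] = d \int d\theta_1 \cdots d\theta_d\; \epsilon_{i_1 \cdots i_d}\, \mathrm{tr}\Big( \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_1}} \cdots \delta\Big[ \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_d}} \Big] \Big) . \tag{10.71} \]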
The variation of the last factor inside the trace can be written as
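\[ \begin{aligned}
\delta\Big[ \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_d}} \Big]
&= -\Omega^\dagger(\theta)\, \delta\Omega(\theta)\, \underbrace{\Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_d}}}_{-\frac{\partial\Omega^\dagger(\theta)}{\partial\theta_{i_d}} \Omega(\theta)} + \Omega^\dagger(\theta) \frac{\partial\, \delta\Omega(\theta)}{\partial\theta_{i_d}} \\
&= \Omega^\dagger(\theta)\, \frac{\partial \big( \delta\Omega(\theta)\, \Omega^\dagger(\theta) \big)}{\partial\theta_{i_d}}\, \Omega(\theta) .
\end{aligned} \tag{10.72} \]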
9 This is the case in the study of instantons, since in this case the manifold S is the 3-sphere S3 .
All the terms containing a factor ∂²Ω/∂θ_{i_d}∂θ_{i_a} vanish because the second derivative
is symmetric under the exchange of the indices i_d and i_a, while the prefactor
ε_{i_1···i_d} is antisymmetric. The remaining terms are those where the derivative with
respect to θ_{i_d} acts on one of the factors Ω†. There are d − 1 such terms, which after
some reorganization can be written as
\[ \delta F[\Omega] = -d \int d\theta_1 \cdots d\theta_d\; \underbrace{\sum_{\sigma\ \text{cyclic perm. of}\ 2 \cdots d} \epsilon_{i_1 i_{\sigma(2)} \cdots i_{\sigma(d)}}}_{0}\; \mathrm{tr}\Big( \frac{\partial\Omega^\dagger}{\partial\theta_{i_1}} \frac{\partial\Omega}{\partial\theta_{i_2}} \cdots \frac{\partial\Omega^\dagger}{\partial\theta_{i_d}}\, \delta\Omega \Big) . \tag{10.74} \]
ε_{i_1···i_d} changes sign under a one-step cyclic permutation of its last d − 1 indices.
Therefore, the d − 1 terms in the sum exactly cancel since d − 1 is even, and we have
\[ \delta F[\Omega] = 0 . \tag{10.75} \]
Therefore, F[Ω] is invariant under small changes of Ω, which implies that F[Ω] can
only vary by discrete jumps. In particular, when S is the d-sphere Sd , F[Ω] depends
only on the homotopy class of Ω. These classes form a group πd(M). Moreover,
F[Ω] provides a representation of πd(M): if [Ω] denotes the homotopy class to which
Ω belongs, we have
Case of a Lie group target manifold : Let us now specialize to the case where the
target manifold M is a d-dimensional Lie group H, and exploit its group structure
in order to obtain simpler expressions. In this case, the θa ’s can also be used as
coordinates on H. Consider two elements Ω1 and Ω2 of H, represented respectively
by the coordinates θa and φa . Their product Ω2 Ω1 is an element of H of coordinates
ψ(θ, φ) (the group multiplication determines how ψ depends on θ and φ). Since we
have shown that the choice of coordinates on S is irrelevant, we may choose them in
such a way that the function Ω(θ) is a representation of the group H, i.e.
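\[ \begin{aligned}
&\epsilon_{i_1 \cdots i_d}\, \mathrm{tr}\Big( \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_1}} \cdots \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_{i_d}} \Big) \\
&\qquad = \det\!\left( \frac{\partial(\psi)}{\partial(\theta)} \right) \epsilon_{j_1 \cdots j_d}\, \mathrm{tr}\Big( \Omega^\dagger(\psi) \frac{\partial\Omega(\psi)}{\partial\psi_{j_1}} \cdots \Omega^\dagger(\psi) \frac{\partial\Omega(\psi)}{\partial\psi_{j_d}} \Big) ,
\end{aligned} \tag{10.80} \]
where ψ can be any fixed reference point in the group. On the right-hand side, the
integration variable θ now appears only inside the determinant.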
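The Lie group H being a smooth manifold, it can be endowed with a metric tensor
γ_{ij}(θ), that transforms as follows under a change of coordinates
\[ \gamma_{ij}(\psi) = \gamma_{kl}(\theta)\, \frac{\partial\theta_k}{\partial\psi_i} \frac{\partial\theta_l}{\partial\psi_j} . \tag{10.81} \]
Given a mapping Ω(θ) between coordinates and group elements, a possible choice
for the metric is given by10
\[ \gamma_{ij}(\theta) = -\frac{1}{2}\, \mathrm{tr}\Big( \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_i}\, \Omega^\dagger(\theta) \frac{\partial\Omega(\theta)}{\partial\theta_j} \Big) . \tag{10.82} \]
Moreover, for any such metric γ_{ij}(θ), we have:
\[ \det\!\left( \frac{\partial(\psi)}{\partial(\theta)} \right) = \sqrt{\frac{\det \gamma(\theta)}{\det \gamma(\psi)}} . \tag{10.83} \]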
10 In the algebra of a compact Lie group, the Killing form K(X, Y) ≡ tr(ad_X ad_Y) is a negative definite
inner product, from which one can define a distance on the group manifold in the vicinity of the origin
(see the section 4.2.4). Eq. (10.82) extends this definition globally to the entire group, in a way which is
invariant under left and right group action.
\[ F[\Omega] = \epsilon_{j_1 \cdots j_d}\, \mathrm{tr}\Big( \Omega^\dagger(\psi) \frac{\partial\Omega(\psi)}{\partial\psi_{j_1}} \cdots \Omega^\dagger(\psi) \frac{\partial\Omega(\psi)}{\partial\psi_{j_d}} \Big) \times \frac{1}{\sqrt{\det \gamma(\psi)}} \int d^d\theta\, \sqrt{\det \gamma(\theta)} , \tag{10.84} \]
in which all the terms that do not depend on θ have been factored out in front of the
integral. In fact, d^dθ √(det γ(θ)) is an invariant measure on the Lie group, and the
integral is therefore the volume of the group. In other words, the previous formula
exploits the group invariance in order to rewrite the Cartan-Maurer invariant as the
product of the integrand evaluated at a fixed point by the volume of the group. Since
ψ is arbitrary in this expression, we may choose the value ψ0 that corresponds to the
group identity. Furthermore, group elements in the vicinity of the identity may be
written as
\[ \Omega(\psi) \underset{\psi \to \psi_0}{\approx} 1 + 2i\, (\psi - \psi_0)_a\, t^a , \tag{10.85} \]
where the t^a's are the generators of the Lie algebra h. Then, the derivatives read
simply
\[ \left. \frac{\partial\Omega(\psi)}{\partial\psi_a} \right|_{\psi_0} = 2i\, t^a . \tag{10.86} \]
with t^{1,2,3} the generators of the su(2) algebra (for the fundamental representation,
the Pauli matrices divided by 2) and θ1² + θ2² + θ3² + θ4² = 1. The following identities
hold:
(We denote θ² ≡ θ1² + θ2² + θ3².) In the evaluation of eq. (10.82), we need traces of
products of up to four t^a matrices. In the fundamental representation, they can all be
obtained from
\[ \mathrm{tr}\,(t^i) = 0 , \qquad t^i t^j = \frac{i}{2}\, \epsilon_{ijk}\, t^k + \frac{1}{4}\, \delta_{ij} , \tag{10.90} \]
which leads to
\[ \begin{aligned}
\mathrm{tr}\,(t^i t^j) &= \frac{1}{2}\, \delta_{ij} , \\
\mathrm{tr}\,(t^i t^j t^k) &= \frac{i}{4}\, \epsilon_{ijk} , \\
\mathrm{tr}\,(t^i t^j t^k t^l) &= \frac{1}{8}\, \big( \delta_{ij}\delta_{kl} + \delta_{il}\delta_{jk} - \delta_{ik}\delta_{jl} \big) .
\end{aligned} \tag{10.91} \]
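The trace identities of eq. (10.91) are easy to check by brute force. The following Python sketch (not part of the original derivation) builds the generators as Pauli matrices divided by 2 and compares each trace with the closed form above; the helper names `trace_of_product`, `delta`, `epsilon` and `max_deviation` are illustrative choices.

```python
# Generators t_i = sigma_i / 2 of su(2) in the fundamental representation.
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
T = [[[entry / 2 for entry in row] for row in sigma] for sigma in PAULI]


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[r][k] * b[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]


def trace_of_product(*indices):
    """tr(t_i t_j ...) for generator indices in {0, 1, 2}."""
    prod = [[1, 0], [0, 1]]
    for i in indices:
        prod = matmul(prod, T[i])
    return prod[0][0] + prod[1][1]


def delta(i, j):
    return 1 if i == j else 0


def epsilon(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) / 2


def max_deviation():
    """Largest violation of the identities (10.91) over all index choices."""
    worst = 0.0
    idx = range(3)
    for i in idx:
        worst = max(worst, abs(trace_of_product(i)))
        for j in idx:
            worst = max(worst, abs(trace_of_product(i, j) - delta(i, j) / 2))
            for k in idx:
                worst = max(worst, abs(trace_of_product(i, j, k)
                                       - 0.25j * epsilon(i, j, k)))
                for l in idx:
                    expected = (delta(i, j) * delta(k, l)
                                + delta(i, l) * delta(j, k)
                                - delta(i, k) * delta(j, l)) / 8
                    worst = max(worst,
                                abs(trace_of_product(i, j, k, l) - expected))
    return worst


if __name__ == "__main__":
    print("largest deviation from eq. (10.91):", max_deviation())
```

The deviation should vanish up to floating-point rounding, since all matrix entries are exact binary fractions.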
Combining the above results, we obtain the following expression for the Cartan-Maurer
invariant of the homotopy class of Ω in π3(SU(2))
\[ F[\Omega] = 2\, (2i)^3\, \epsilon_{abc}\, \mathrm{tr}\,(t^a t^b t^c) \int \frac{d^3\theta}{\sqrt{1 - \theta^2}} . \tag{10.94} \]
The factor 2 comes from the fact that there are two allowed values of θ4 for each
θ_{1,2,3}. Finally, we have
\[ F[\Omega] = 96\pi \int_0^1 \frac{d\theta\; \theta^2}{\sqrt{1 - \theta^2}} = 24\pi^2 . \tag{10.95} \]
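As a cross-check of eqs. (10.94) and (10.95), the short Python sketch below evaluates the radial integral numerically and reassembles the prefactors. It assumes the substitution θ = sin φ (which removes the integrable endpoint singularity) and uses tr(t^a t^b t^c) = (i/4) ε_abc from eq. (10.91); the function names are illustrative.

```python
import math


def radial_integral(steps=1000):
    """Midpoint rule for I = int_0^1 theta^2 / sqrt(1 - theta^2) dtheta.

    With theta = sin(phi) the integral becomes int_0^{pi/2} sin^2(phi) dphi,
    which is smooth on the whole interval.
    """
    h = (math.pi / 2) / steps
    return h * sum(math.sin((k + 0.5) * h) ** 2 for k in range(steps))


def cartan_maurer_su2(steps=1000):
    """Numerical value of F[Omega] for the winding-number-one map."""
    # 2 (2i)^3 eps_abc tr(t_a t_b t_c), with tr(t_a t_b t_c) = (i/4) eps_abc
    # and eps_abc eps_abc = 6.
    prefactor = 2 * (2j) ** 3 * (0.25j * 6)
    solid_angle = 4 * math.pi  # angular part of d^3 theta
    return (prefactor * solid_angle * radial_integral(steps)).real


if __name__ == "__main__":
    value = cartan_maurer_su2()
    print("F[Omega]      =", value)
    print("24 pi^2       =", 24 * math.pi ** 2)
    print("F / 3 (g = 1) =", value / 3, " vs 8 pi^2 =", 8 * math.pi ** 2)
```

Dividing by 3g², as in eq. (10.65), turns this number into the action 8π²/g² of the unit-index instanton.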
In fact, the mapping of eq. (10.88) wraps only once around SU(2), and the above result
therefore corresponds to the topological index +1. Since 24π² is non-zero, there are
other classes of Ω's whose Cartan-Maurer invariants are the integer multiples of this
result, and the third homotopy group is π3(SU(2)) = ℤ. Note also that this result
extends to any Lie group that contains an SU(2) subgroup.
In a gauge theory whose gauge group contains an SU(2) subgroup, the mapping of
eq. (10.88) can be used to construct the asymptotic form of an instanton of topological
index +1,
\[ A^i(x) \underset{|x| \to \infty}{=} \frac{i}{g}\, \Omega^\dagger(\hat{x})\, \partial^i \Omega(\hat{x}) , \tag{10.96} \]
with Ω(x̂) ≡ x̂_4 + 2i x̂_i t^i. One may then prove that the self-dual field configuration
in the bulk that has this large distance behaviour is given by
\[ A^i(x) = \frac{i}{g}\, \frac{r^2}{r^2 + R^2}\, \Omega^\dagger(\hat{x})\, \partial^i \Omega(\hat{x}) , \tag{10.97} \]
with an arbitrary radius R. From the result (10.95) of the previous subsection, we find
that the minimum of the action that corresponds to this solution is:
\[ S_{\rm min}[A] = \frac{8\pi^2}{g^2} . \tag{10.98} \]
Up to translations, dilatations or gauge transformations, this is the only field con-
figuration that gives this action. The field strength corresponding to eq. (10.97) is
localized in Euclidean spacetime, with a size of order R. One may also superimpose
several such solutions. Provided that their centers are separated by distances much
larger than R, this sum is also a solution of the classical equations of motion, and its
action is a multiple of 8π2 /g2 .
spacetime. However, it is approximately an integer when the size of the domain is much larger than the
instanton size.
Consider an instanton solution A^µ_{n,α}(x), that provides a local minimum of the
Euclidean action, where the subscript n is the topological index of the instanton, and α
collectively denotes all the other parameters that characterize the instanton (its center,
its size, its orientation in colour space). The expectation value of an observable reads
\[ \langle O \rangle = Z^{-1} \int [DA]\; e^{-S[A]}\, O(A) = Z^{-1} \int [Da]\; e^{-S[A_{n,\alpha} + a]}\, O(A_{n,\alpha} + a) , \tag{10.105} \]
It is important to note that the action has flat directions in the space of field con-
figurations, that correspond to changing the parameters of the instanton inside its
topological class. For instance, changing the center coordinates of the instanton does
not modify the value of its action. Along these directions, the second derivative of the
action vanishes. This means that the matrix of second-order coefficients
G^{-1}_{nα,mβ}(x, y) has a number of vanishing eigenvalues, corresponding to these
flat directions.
If we expand the action only to quadratic order in aµ , which amounts to a one-
loop approximation in the background of the instanton, a typical contribution to the
expectation value of eq. (10.105) is a product of dressed propagators Gnα,mβ (x, y)
connecting pairwise the gauge fields contained in the observable O and a determinant:
\[ \langle O \rangle = Z^{-1}\, e^{-8\pi^2 |n| / g^2}\, \big( \det G \big)^{1/2} \prod G + \cdots . \tag{10.107} \]
Our goal here is simply to extract the dependence of such an expectation value on the
topological index n. Besides the obvious exponential prefactor, a dependence on n
hides in the determinant. Let us rewrite it as a product over the spectrum of G^{-1},
\[ \big( \det G \big)^{1/2} = \prod_s \lambda_s^{-1/2} , \tag{10.108} \]
where the λ_s are the eigenvalues of G^{-1}. If we rescale the gauge fields by a power
of the coupling g, g A → A, the only dependence on g in the Yang-Mills action is a
prefactor g^{-2}, and all the eigenvalues λ_s are also proportional to g^{-2}. Moreover, as
explained above, we should remove the zero modes from this product, since they do
not give a quadratic term in eq. (10.106). If we are interested only in the powers of g,
we may write
\[ \big( \det G \big)^{1/2} \sim \prod_{\text{all modes}} g \prod_{\text{zero modes}} g^{-1} . \tag{10.109} \]
The first factor, that involves a (continuous) infinity of modes, is not well defined but
it does not depend on the details of the instanton background. In contrast, the second
factor brings one factor of g−1 for each collective coordinate of the instanton. For an
instanton of topological number n = 1, these collective coordinates are:
Because of the exponential factor that contains the inverse coupling, all the Taylor
coefficients of this function are vanishing at g = 0. Thus, such a contribution never
shows up in perturbation theory.
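The statement about vanishing Taylor coefficients can be illustrated numerically. The Python sketch below assumes a generic instanton weight of the form g^(-k) exp(-8π²/g²), with k standing for the number of collective coordinates (the value used in the loop is an arbitrary illustration, not a number taken from the text), and works with its logarithm to avoid floating-point underflow.

```python
import math


def log_instanton_weight(g, k):
    """Logarithm of g**(-k) * exp(-8 pi^2 / g^2).

    The exponential suppression wins over any inverse power of g as g -> 0,
    which is why every Taylor coefficient at g = 0 vanishes.
    """
    return -8 * math.pi ** 2 / g ** 2 - k * math.log(g)


if __name__ == "__main__":
    k = 50  # illustrative (deliberately large) power of 1/g
    for g in (1.0, 0.5, 0.2, 0.1):
        print(f"g = {g:4.1f}   log(weight) = {log_instanton_weight(g, k):12.2f}")
```

Even for a large inverse power of g, the logarithm keeps decreasing without bound as g is lowered, so the weight itself goes to zero faster than any power of g.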
12 This counting is more involved for an SU(3) instanton. In this case, there are 7 collective coordinates
• Even at tree level, the number of distinct graphs contributing to a given ampli-
tude increases very rapidly with the number of external lines, as shown in the
table 11.1 for amplitudes with external gluons only.
• The Feynman rules are sufficiently general to compute amplitudes with arbitrary
external momenta (not necessarily on-shell) and polarizations (not necessarily
physical), although this is not useful for amplitudes that will be used in cross-
sections. One would hope for a leaner formalism, that only calculates what is
strictly necessary for physical quantities.
The situation becomes even worse with loop diagrams. Another situation with an
even higher degree of complexity, even at tree-level, is that of gravity. It would be
desirable to be able to calculate tree-level amplitudes with gravitons, since they enter
for instance in the study of the scattering of gravitational waves by a distribution of
masses. But because the graviton has spin 2, the corresponding Feynman rules are
considerably more complicated (especially the self-couplings of the graviton) than
those of Yang-Mills theory.
It turns out that physical on-shell amplitudes in gauge theories are considerably
simpler than one may expect from the Feynman rules and the intermediate steps of
their calculation by the usual perturbation theory, and a legitimate query is whether
there is a more direct route to reach these compact answers. The goal of this chapter
is to give a glimpse (in particular, our discussion will be restricted to tree-level
amplitudes, but a significant part of the many recent developments deal with loop
corrections) of some of the recent developments that led to powerful new methods for
calculating amplitudes. A recurring theme of these methods is to avoid as much as
possible references to the Lagrangian, which may be viewed as the main source of
the complications in standard perturbation theory (for instance, the gauge invariance
of the Lagrangian is the reason why non-physical gluon polarizations appear in the
Feynman rules). Instead, these methods try to gather as much information as possible
on amplitudes based on symmetries and kinematics.
\[ i\, f^{abc} = 2\, \mathrm{tr}\,(t^a_f t^b_f t^c_f) - 2\, \mathrm{tr}\,(t^b_f t^a_f t^c_f) , \tag{11.2} \]
The black dots indicate the fundamental representation generators t^a_f. Note that the
“loops” in this representation are not actual fermion loops, they are just a graphical
cue indicating how the indices carried by the t^a_f's are contracted in the traces. We
may also apply this trick to the 4-gluon vertex, which from the point of view of its
colour structure (but not as far as its momentum dependence is concerned) is equivalent
to a sum of three terms with two 3-gluon vertices,
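[Diagrammatic equation (11.4): the colour structure of the 4-gluon vertex with colour indices a, b, c, d, written as a sum of three terms built from two 3-gluon vertices.]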
Since the gluon propagators are diagonal in colour (i.e. proportional to a δ_{ab}),
the t^a_f that are attached to the endpoints of the internal gluon propagators have their
colour indices contracted and summed over. The result of this contraction is given by
the following su(N) Fierz identity:
\[ (t^a_f)_{ij}\, (t^a_f)_{kl} = \frac{1}{2}\, \delta_{il}\, \delta_{jk} - \frac{1}{2N}\, \delta_{ij}\, \delta_{kl} . \tag{11.5} \]
Thus, it seems that these contractions produce 2^n terms for n internal gluon propagators,
but this can in fact be simplified tremendously by noticing that the second term
of the Fierz identity corresponds to the exchange of a colourless object¹, that does
not couple to gluons. All these terms in 1/N must therefore cancel in purely gluonic
amplitudes (this is not true anymore if quarks are involved, either as external lines or
via loop corrections).
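For N = 2 the Fierz identity can be verified by direct enumeration. The Python sketch below is an illustration only: it takes the su(2) generators to be the Pauli matrices divided by 2, and it assumes that the extra u(2) generator is the identity divided by √(2N), normalized like the others. With that extra generator included, the 1/(2N) term disappears, which is the statement made about the U(1) factor.

```python
from itertools import product

N = 2
PAULI = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
SU2 = [[[entry / 2 for entry in row] for row in sigma] for sigma in PAULI]
# Assumed normalization of the U(1) generator: identity / sqrt(2N), so that
# tr(t0 t0) = 1/2 like the other generators.
U1 = [[0.5, 0.0], [0.0, 0.5]]
U2 = SU2 + [U1]


def delta(a, b):
    return 1 if a == b else 0


def contraction(generators, i, j, k, l):
    """sum_a (t^a)_{ij} (t^a)_{kl} over the given list of generators."""
    return sum(t[i][j] * t[k][l] for t in generators)


def fierz_deviation(generators, with_1_over_n_term):
    """Largest mismatch between the contraction and the Fierz right-hand side."""
    worst = 0.0
    for i, j, k, l in product(range(N), repeat=4):
        expected = 0.5 * delta(i, l) * delta(j, k)
        if with_1_over_n_term:
            expected -= delta(i, j) * delta(k, l) / (2 * N)
        worst = max(worst, abs(contraction(generators, i, j, k, l) - expected))
    return worst


if __name__ == "__main__":
    print("su(2), with 1/(2N) term   :", fierz_deviation(SU2, True))
    print("u(2),  without 1/(2N) term:", fierz_deviation(U2, False))
```

Both deviations should be zero; dropping the 1/(2N) term for su(2) alone, or keeping it for u(2), gives a mismatch of 1/4.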
1 A more rigorous justification is to note that SU(N) × U(1) = U(N), where U(N) is the group of the
N × N unitary matrices. For the fundamental generators of the u(N) algebra, the Fierz identity is
\[ (t^a_f)_{ij}\, (t^a_f)_{kl} \Big|_{u(N)} = \frac{1}{2}\, \delta_{il}\, \delta_{jk} . \]
The U(N) gauge theory differs from the SU(N) one by the extra U(1), and the comparison of their Fierz
identities indicates that the term in 1/2N in eq. (11.5) is due to this U(1) factor. Being Abelian, this extra
factor corresponds to a photon-like mode that does not couple to gluons.
2 This is equivalent to considering permutations that have the fixed point σ(1) = 1, i.e. permutations that
only reshuffle the set {2 · · · n}. For n external gluons, there are (n − 1)! independent colour structures. The
basis provided by these traces is over-complete, and there exist linear relationships among the tree-level
partial amplitudes, known as the Kleiss-Kuijf relations. These relations reduce the number of partial
amplitudes from (n − 1)! to (n − 2)!. Additional relationships known as the Bern-Carrasco-Johansson
relations further reduce this number to (n − 3)!.
[Diagrammatic equation: = + + … (11.6)]
where the prefactor 2 combines the factors 2 from eq. (11.2) and the factors 1/2 from
the first term of the Fierz identity (11.5). The object A_n(σ(1) · · · σ(n)) is called a
colour-ordered partial amplitude. By construction, it depends only on the momenta
and polarizations of the external gluons, but not on their colours since they have
already been factored out in the trace. Therefore, the partial amplitudes are gauge
invariant. From eq. (11.7), the squared amplitude summed over all colours can be
written as
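\[ \sum_{\text{colours}} \big| M_n(1 \cdots n) \big|^2 = 4 \sum_{\sigma, \rho \in S_n / \mathbb{Z}_n}\ \sum_{\text{colours}} \mathrm{tr}\,\big( t_f^{a_{\sigma(1)}} \cdots t_f^{a_{\sigma(n)}} \big)\; \mathrm{tr}^*\big( t_f^{a_{\rho(1)}} \cdots t_f^{a_{\rho(n)}} \big) \]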
The sum over colours of the product of two traces that appears in the first line can be
performed using the su(N) Fierz identity (11.5). For instance
which can then be expressed as a function of N by repeated use of the Fierz identity.
At this point, we have isolated the colour dependence of the amplitude, from its
momentum and polarization dependences that are factorized into the partial ampli-
tudes. Of course, calculating the latter is still not easy, but the task is significantly
reduced for two reasons:
Figure 11.1: Diagrams contributing to the 4-point, 5-point and 6-point colour
ordered amplitudes in Yang-Mills theory. The external points are labeled 1 to
n = 4, 5, 6 in the counterclockwise direction. The solid lines represent gluons.
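Gluon propagator (momentum p):
\[ \frac{-i\, g^{\mu\nu}}{p^2 + i0^+} + \frac{i}{p^2 + i0^+} \Big( 1 - \frac{1}{\xi} \Big) \frac{p^\mu p^\nu}{p^2} \]
3-gluon vertex (momenta k, p, q attached to the indices µ, ν, ρ respectively):
\[ g\, \Big[ g^{\mu\nu} (k - p)^\rho + g^{\nu\rho} (p - q)^\mu + g^{\rho\mu} (q - k)^\nu \Big] \]
4-gluon vertex (indices µ, ν, ρ, σ):
\[ -i\, g^2\, \big( 2\, g^{\mu\rho} g^{\nu\sigma} - g^{\mu\sigma} g^{\nu\rho} - g^{\mu\nu} g^{\rho\sigma} \big) \]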
In the case of the 4-gluon vertex, we have included only the terms that cor-
respond to the cyclic ordering µνρσ (note that it is invariant under cyclic
permutations, i.e. the Feynman rule is the same for the vertices νρσµ, ρσµν
and σµνρ). We can already see a considerable simplification of the Feynman
rules, since all the colour factors have disappeared, and the Lorentz structure of
the 4-gluon vertex is also much simpler than in the original Feynman rules.
But even after having isolated the colour structure, the remaining colour-ordered
amplitudes are still complicated. As an illustration of the colour-ordered Feynman
rules, let us consider the partial amplitude A4 (1, 2, 3, 4) that contributes to one of the
colour structures in the gg → gg amplitude. Because of colour ordering, only three
graphs contribute to this partial amplitude:
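A4(1, 2, 3, 4) = [graph with a gluon exchanged between the pairs (1, 2) and (3, 4)] + [graph with a gluon exchanged between the pairs (2, 3) and (4, 1)] + [4-gluon vertex graph] . (11.10)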
For definiteness, let us assume that the external momenta p1 · · · p4 are defined as
incoming, and denote ǫ1 · · · ǫ4 the four polarization vectors. Using the rules listed in
the figure (11.2), we obtain:
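\[ \begin{aligned}
A_4(1, 2, 3, 4) =\;
& \frac{-i\, g^2}{(p_1 + p_2)^2} \Big[ (2p_2 + p_1) \cdot \epsilon_1\; \epsilon_2^\lambda - (2p_1 + p_2) \cdot \epsilon_2\; \epsilon_1^\lambda + \epsilon_1 \cdot \epsilon_2\; (p_1 - p_2)^\lambda \Big] \\
& \qquad \times \Big[ (p_3 + 2p_4) \cdot \epsilon_3\; \epsilon_{4\lambda} - (2p_3 + p_4) \cdot \epsilon_4\; \epsilon_{3\lambda} + \epsilon_3 \cdot \epsilon_4\; (p_3 - p_4)_\lambda \Big] \\
& + \frac{-i\, g^2}{(p_2 + p_3)^2} \Big[ (p_2 + 2p_3) \cdot \epsilon_2\; \epsilon_3^\lambda - (2p_2 + p_3) \cdot \epsilon_3\; \epsilon_2^\lambda + \epsilon_2 \cdot \epsilon_3\; (p_2 - p_3)^\lambda \Big] \\
& \qquad \times \Big[ (2p_1 + p_4) \cdot \epsilon_4\; \epsilon_{1\lambda} - (p_1 + 2p_4) \cdot \epsilon_1\; \epsilon_{4\lambda} + \epsilon_1 \cdot \epsilon_4\; (p_4 - p_1)_\lambda \Big] \\
& - i\, g^2 \Big[ 2 (\epsilon_1 \cdot \epsilon_3)(\epsilon_2 \cdot \epsilon_4) - (\epsilon_1 \cdot \epsilon_4)(\epsilon_2 \cdot \epsilon_3) - (\epsilon_1 \cdot \epsilon_2)(\epsilon_3 \cdot \epsilon_4) \Big] .
\end{aligned} \tag{11.11} \]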
Although this is considerably simpler than the full 4-gluon amplitude, it remains quite
difficult to extract physical results from such an expression.
11.3.1 Motivation
Part of the complexity of eq. (11.11) lies in the fact that this formula still contains a
large amount of redundant and unnecessary information, since each polarization may
be shifted by a 4-vector proportional to the momentum of the corresponding external
gluon, thanks to gauge invariance. For instance, the transformation
\[ \epsilon_1^\mu \to \epsilon_1^\mu + \kappa\, p_1^\mu , \tag{11.12} \]
leaves the amplitude unchanged. However, it is not clear how to optimally choose the
polarization vectors in order to simplify an expression such as eq. (11.11). In other
words, the question is how to represent the spin degrees of freedom of the external
particles in order to make the amplitude as simple as possible. In the traditional
approach to the calculation of amplitudes, one usually refrains from introducing any
explicit form for the polarization vectors. Instead, one first squares the amplitude
written in terms of generic polarization vectors, such as eq. (11.11), and then the sum
over the polarizations of the external gluons is performed by using
\[ \sum_{\text{physical pol.}} \epsilon^{\mu *}(p)\, \epsilon^\nu(p) = -g^{\mu\nu} + \frac{p^\mu n^\nu}{p \cdot n} + \frac{n^\mu p^\nu}{p \cdot n} , \tag{11.13} \]
where n^µ is some arbitrary light-like vector. Note that this is the formula for summing
over all physical polarizations, which is necessary when calculating unpolarized
cross-sections. For cross-sections involving polarized particles, one would perform only a
partial sum, which leads to a different projector in the right hand side. If the amplitude
is a sum of N_t terms, then this process generates 3 N_t² terms in the squared amplitude
summed over polarizations. In contrast, the spinor-helicity method that we shall
present below aims at obtaining the amplitude with explicit polarization vectors, for a
given assignment of the helicities {h_1 = ±, · · · , h_n = ±} of the external gluons, in
the form of an expression made of N_t terms that can be easily evaluated (numerically
at least). The sum of these N_t terms is done first, and then squared, which is an O(1)
computational task (simply squaring a complex number). Thus, the total cost scales
as 2^n N_t in this approach. Since N_t grows very quickly with n, this is usually better.
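The polarization sum of eq. (11.13) can be checked numerically for a specific kinematic configuration. The Python sketch below is only an illustration under stated assumptions: the gluon momentum is taken along the z axis, the metric is diag(1, −1, −1, −1), and the two physical polarizations are built from circular polarization vectors shifted by a multiple of p so that they are orthogonal to the auxiliary light-like vector n. All function names are hypothetical.

```python
import math

METRIC = (1.0, -1.0, -1.0, -1.0)  # diagonal of g^{mu nu} (assumed convention)


def mdot(a, b):
    """Minkowski product a.b (bilinear, no complex conjugation)."""
    return sum(g * x * y for g, x, y in zip(METRIC, a, b))


def physical_polarizations(p, n):
    """Two polarization vectors with eps.p = eps.n = 0, for p along +z."""
    s = 1 / math.sqrt(2)
    circular = [(0, s, 1j * s, 0), (0, s, -1j * s, 0)]
    pols = []
    for eps in circular:
        shift = mdot(eps, n) / mdot(p, n)
        pols.append(tuple(e - shift * pm for e, pm in zip(eps, p)))
    return pols


def polarization_sum(p, n):
    """Left-hand side of eq. (11.13): sum over eps^{mu *} eps^{nu}."""
    pols = physical_polarizations(p, n)
    return [[sum(e[m].conjugate() * e[v] for e in pols) for v in range(4)]
            for m in range(4)]


def projector(p, n):
    """Right-hand side of eq. (11.13)."""
    pn = mdot(p, n)
    return [[-(METRIC[m] if m == v else 0.0)
             + (p[m] * n[v] + n[m] * p[v]) / pn
             for v in range(4)] for m in range(4)]


def max_difference(p, n):
    lhs, rhs = polarization_sum(p, n), projector(p, n)
    return max(abs(lhs[m][v] - rhs[m][v]) for m in range(4) for v in range(4))


if __name__ == "__main__":
    p = (2.0, 0.0, 0.0, 2.0)   # light-like momentum along z
    n = (3.0, 1.0, 2.0, 2.0)   # light-like auxiliary vector: 9 = 1 + 4 + 4
    print("largest mismatch:", max_difference(p, n))
```

Changing n changes the individual polarization vectors only by multiples of p, which is the gauge freedom of eq. (11.12), while both sides of eq. (11.13) continue to agree.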
In the previous section, we have seen how the adjoint colour degrees of freedom may
be represented in terms of the smaller fundamental representation. Likewise, we will
now represent the Lorentz structure associated to spin-1 particles in terms of spin-1/2
variables. From a mathematical standpoint, this representation exploits the fact that
\[ \sigma^\mu \equiv (1, \sigma^i) , \tag{11.14} \]
where σ^{1,2,3} are the usual Pauli matrices. In terms of these matrices, a 4-vector p^µ
can be mapped into
\[ p^\mu \to P \equiv p_\mu \sigma^\mu = \begin{pmatrix} p_0 + p_3 & p_1 - i p_2 \\ p_1 + i p_2 & p_0 - p_3 \end{pmatrix} . \tag{11.15} \]
(In the second equality, we have used the explicit representation of the Pauli matrices.)
For amplitudes involving only external gluons, the momentum p^µ has a vanishing
invariant norm, p_µ p^µ = 0, which translates into
Thus, the massless on-shell condition is equivalent to the determinant of the matrix
P being zero. For a 2 × 2 matrix, a null determinant means that the matrix can be
factorized as the direct product of two vectors:
Pab = λa ξb , (11.17)
For a real valued 4-vector, λa and ξa are mutual complex conjugates. However, when
we later analytically continue the external momenta in the complex plane, this will no
longer be the case. To make the notations more compact, it is customary to introduce
the following notations:
p = λa , p = ξa , (11.19)
It is also convenient to define spinors with raised indices, related to the previous ones
as follows,
λa ≡ ǫab λb = p , ξa ≡ ǫab ξb = p , (11.21)
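\[ \bar{P} \equiv p_\mu \bar\sigma^\mu , \tag{11.23} \]
with σ̄^µ ≡ (1, −σ^i). In the Weyl representation, where the Dirac matrices read
\[ \gamma^\mu = \begin{pmatrix} 0 & \sigma^\mu \\ \bar\sigma^\mu & 0 \end{pmatrix} , \tag{11.24} \]
we thus have
\[ \not{p} \equiv p_\mu \gamma^\mu = \begin{pmatrix} 0 & P \\ \bar{P} & 0 \end{pmatrix} . \tag{11.25} \]
The fact that we are dealing with on-shell momenta is already built in the factorized
representation of eq. (11.17). Amplitudes depend on kinematical invariants such as
(p + q)², for which it is straightforward to check that⁴
\[ (p + q)^2 = 2\, p \cdot q = \langle pq \rangle\, [pq] , \tag{11.26} \]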
where the brackets are defined by contracting upper and lower spinor indices, as in
a
pq ≡ p a
q . (11.27)
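The factorization of a massless momentum into spinors, and the relation between brackets and kinematical invariants, can be made concrete with a few lines of Python. This is a sketch under explicit assumptions: the momenta are real with p0 + p3 > 0, the four numbers (p0, p1, p2, p3) are the ones entering the matrix of eq. (11.15), the second spinor is taken to be the complex conjugate of the first, and the bracket is the antisymmetric contraction of two spinors. Sign and phase conventions may differ from those of the text, so only the convention-independent statement |⟨pq⟩|² = 2 p·q is tested.

```python
import math


def spinor(p):
    """Spinor lam such that P_ab = lam_a * conj(lam_b), for p0 + p3 > 0."""
    p0, p1, p2, p3 = p
    root = math.sqrt(p0 + p3)
    return (complex(root), complex(p1, p2) / root)


def momentum_matrix(p):
    """2x2 matrix of eq. (11.15) built from the four numbers (p0, p1, p2, p3)."""
    p0, p1, p2, p3 = p
    return [[p0 + p3, complex(p1, -p2)],
            [complex(p1, p2), p0 - p3]]


def outer(lam):
    """Rank-one matrix lam_a * conj(lam_b)."""
    return [[lam[a] * lam[b].conjugate() for b in range(2)] for a in range(2)]


def bracket(lam, mu):
    """Antisymmetric contraction of two 2-component spinors."""
    return lam[0] * mu[1] - lam[1] * mu[0]


def minkowski_dot(p, q):
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]


if __name__ == "__main__":
    p = (3.0, 1.0, 2.0, 2.0)   # 9 = 1 + 4 + 4, light-like
    q = (5.0, 3.0, 0.0, 4.0)   # 25 = 9 + 16, light-like
    print("P            :", momentum_matrix(p))
    print("lam lam^dag  :", outer(spinor(p)))
    print("|<pq>|^2     :", abs(bracket(spinor(p), spinor(q))) ** 2)
    print("2 p.q        :", 2 * minkowski_dot(p, q))
```

For real momenta the square bracket is, up to convention, the complex conjugate of the angle bracket, so |⟨pq⟩|² plays the role of ⟨pq⟩[pq] in eq. (11.26).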
It is useful to work out the form of momentum conservation in the spinor formalism.
For an amplitude with external momenta {p_i}, chosen to be all incoming, let us
denote |i⟩, |i], · · · the corresponding spinors. For any arbitrary on-shell momenta p
and q, we may then write
\[ 0 = \sum_i \langle p | P_i | q ] = \sum_i \langle pi \rangle\, [iq] . \tag{11.29} \]
3 We may use ǫac ǫbd δdc = δab and ǫac ǫbd σidc = −σiab .
4 For real momenta, angle and square brackets are complex conjugates, and (p + q)2 is a real quantity.
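Another interesting identity follows from the fact that three 2-component spinors
cannot be linearly independent. Thus, given |p⟩, |q⟩ and |r⟩, we must have a
relationship of the form:
\[ | r \rangle = \alpha\, | p \rangle + \beta\, | q \rangle . \tag{11.30} \]
Contracting this equation with ⟨p| and ⟨q| gives the explicit expression of the
coefficients α and β:
\[ \alpha = \frac{\langle qr \rangle}{\langle qp \rangle} , \qquad \beta = \frac{\langle pr \rangle}{\langle pq \rangle} . \tag{11.31} \]
This leads to
\[ | p \rangle \langle qr \rangle + | q \rangle \langle rp \rangle + | r \rangle \langle pq \rangle = 0 , \tag{11.32} \]
known as the Schouten identity. A similar identity holds with square brackets:
\[ | p ] [ qr ] + | q ] [ rp ] + | r ] [ pq ] = 0 . \tag{11.33} \]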
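Since the Schouten identity only relies on the antisymmetry of the bracket and on the spinors having two components, it can be tested on arbitrary complex spinors. The Python sketch below assumes the bracket is the antisymmetric contraction a_1 b_2 − a_2 b_1 (the overall sign convention does not affect the identity); the function names are illustrative.

```python
def bracket(a, b):
    """Antisymmetric contraction of two 2-component spinors."""
    return a[0] * b[1] - a[1] * b[0]


def decompose(p, q, r):
    """Coefficients (alpha, beta) of eq. (11.31), with r = alpha p + beta q."""
    return bracket(q, r) / bracket(q, p), bracket(p, r) / bracket(p, q)


def schouten_residual(p, q, r):
    """Components of |p><qr> + |q><rp> + |r><pq>, expected to vanish."""
    return tuple(p[c] * bracket(q, r) + q[c] * bracket(r, p)
                 + r[c] * bracket(p, q) for c in range(2))


if __name__ == "__main__":
    p = (1 + 2j, 0.5 - 1j)
    q = (-0.3 + 0.7j, 2 + 0j)
    r = (1.5 + 0j, 0.25 - 1j)
    alpha, beta = decompose(p, q, r)
    print("alpha, beta       :", alpha, beta)
    print("Schouten residual :", schouten_residual(p, q, r))
```

The same function applies unchanged to square spinors, which is the content of eq. (11.33).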
At this point, we have a representation in terms of spinors for the on-shell momenta
that appear on the external legs of amplitudes. We also need a similar representation
for the polarization vectors. The polarization vectors for a gluon of momentum p
with positive and negative helicities may be represented as follows:
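\[ \epsilon_+^\mu(p; q) \equiv -\frac{\langle q | \sigma^\mu | p ]}{\sqrt{2}\, \langle qp \rangle} , \qquad \epsilon_-^\mu(p; q) \equiv -\frac{\langle p | \sigma^\mu | q ]}{\sqrt{2}\, [ pq ]} , \tag{11.34} \]
\[ \begin{aligned}
p \cdot \epsilon_\pm(p; q) &= q \cdot \epsilon_\pm(p; q) = 0 , \\
k \cdot \epsilon_+(p; q) &= -\frac{\langle qk \rangle\, [ kp ]}{\sqrt{2}\, \langle qp \rangle} , \\
k \cdot \epsilon_-(p; q) &= -\frac{\langle pk \rangle\, [ kq ]}{\sqrt{2}\, [ pq ]} .
\end{aligned} \tag{11.37} \]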
Let us now discuss the very important case of 3-particle amplitudes in the massless
case, since they will appear later as the building blocks of more complicated am-
plitudes. Such an amplitude depends on three on-shell momenta p1,2,3 such that
p1 + p2 + p3 = 0. This implies that
\[ \langle 12 \rangle\, [12] = 2\, p_1 \cdot p_2 = (p_1 + p_2)^2 = p_3^2 = 0 . \tag{11.38} \]
Therefore, either ⟨12⟩ = 0 or [12] = 0. Let us assume that ⟨12⟩ ≠ 0. We also have:
\[ \langle 12 \rangle\, [23] = \langle 1 | P_2 | 3 ] = -\langle 1 | P_1 + P_3 | 3 ] = -\underbrace{\langle 11 \rangle}_{0} [13] - \langle 13 \rangle \underbrace{[33]}_{0} = 0 , \tag{11.39} \]
which implies that [23] = 0. Likewise, [13] = 0. Therefore, all the square brackets
are zero if ⟨12⟩ ≠ 0. Conversely, all the angle brackets would be zero if instead we
had assumed that [12] ≠ 0. From this discussion, we conclude that massless on-shell
3-point amplitudes may depend either on square brackets or on angle brackets, but not
on a mixture of both. Recall now that, for real momenta, angle and square brackets
are related by complex conjugation. Thus, 3-point amplitudes can only exist for
complex momenta. This is of course a trivial consequence of kinematics: momentum
conservation p1 + p2 + p3 = 0 is impossible for three real-valued light-like momenta,
except on a measure-zero subset of exceptional configurations.
Let us now be more explicit and calculate the 3-point amplitudes in Yang-Mills
theory. For generic polarization vectors ǫ1,2,3 , the second Feynman rule of the figure
(11.2) leads to
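\[ A_3(123) = 2g\, \Big[ (\epsilon_1 \cdot \epsilon_2)(p_1 \cdot \epsilon_3) + (\epsilon_2 \cdot \epsilon_3)(p_2 \cdot \epsilon_1) + (\epsilon_3 \cdot \epsilon_1)(p_3 \cdot \epsilon_2) \Big] , \tag{11.40} \]
where we have used p_i · ε_i = 0 to cancel several terms. Consider first the helicities
− − +. Using eqs. (11.35) and (11.37), we obtain
\[ \begin{aligned}
A_3(1^- 2^- 3^+) = -\frac{\sqrt{2}\, g}{[q_1 1]\, [q_2 2]\, \langle q_3 3 \rangle}
\times \Big( & \langle 12 \rangle [q_1 q_2] \langle q_3 1 \rangle [13] + \langle 2 q_3 \rangle [q_2 3] \langle 12 \rangle [2 q_1] \\
& + \langle q_3 1 \rangle [3 q_1] \langle 23 \rangle [3 q_2] \Big) .
\end{aligned} \tag{11.41} \]
Each of the three terms contains in the numerator an angle bracket between the
external momenta (respectively ⟨12⟩, ⟨12⟩ and ⟨23⟩). Therefore, for this amplitude
to be non-zero, we must adopt the choice of spinor representation where it is the
square brackets that are zero. With this choice, the first term vanishes since it contains
[13]:
\[ A_3(1^- 2^- 3^+) = -\sqrt{2}\, g\; \frac{\langle 2 q_3 \rangle [q_2 3] \langle 12 \rangle [2 q_1] + \langle q_3 1 \rangle [3 q_1] \langle 23 \rangle [3 q_2]}{[q_1 1]\, [q_2 2]\, \langle q_3 3 \rangle} . \tag{11.42} \]
Using momentum conservation (11.29) in the form of
\[ \underbrace{\langle 11 \rangle}_{0} [1 q_1] + \langle 12 \rangle [2 q_1] + \langle 13 \rangle [3 q_1] = 0 , \tag{11.43} \]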
(The + + + and − − − amplitudes are zero in Yang-Mills theory, as argued in the next
subsection.) Eqs. (11.46) and (11.47) are both much simpler than the Feynman rule
for the 3-gluon vertex. This is the simplest illustration of an assertion we made at the
beginning of this section, namely that on-shell amplitudes with physical polarizations
are much simpler than one may expect from the traditional perturbative expansion. In
the case of the 3-gluon amplitude, we may think that the simplicity comes from the
fact that it receives contributions from a single diagram. However, this is not true. As
a teaser for the next section, let us give the answers for some 4-gluon and 5-gluon
amplitudes in the spinor-helicity formalism:
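\[ \begin{aligned}
A_4(1^- 2^- 3^+ 4^+) &= i\, (\sqrt{2}\, g)^2\, \frac{\langle 12 \rangle^3}{\langle 23 \rangle \langle 34 \rangle \langle 41 \rangle} , \\
A_5(1^- 2^- 3^+ 4^+ 5^+) &= i^2\, (\sqrt{2}\, g)^3\, \frac{\langle 12 \rangle^3}{\langle 23 \rangle \langle 34 \rangle \langle 45 \rangle \langle 51 \rangle} ,
\end{aligned} \tag{11.48} \]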
that appear to generalize trivially eq. (11.46) although they result from the sum of 3
and 10 Feynman graphs (see the figure 11.1), respectively. In this section, we have
followed a pedestrian approach that consists in starting from the usual Feynman rules,
and translating all their building blocks in the spinor-helicity language. However,
the simplicity of the results provides an important hint: there must be a better way
to obtain them, that bypasses the traditional Feynman rules and provides the answer
much more directly.
It turns out that massless on-shell 3-point amplitudes are almost completely con-
strained by a scaling argument, except for an overall prefactor. Thus, the Lagrangian
is in a sense not necessary for specifying their form (it only plays a marginal role
in setting their normalization). From eqs. (11.20) and (11.22), it is clear that the
representation of massless on-shell 4-momenta as bi-spinors is invariant under the
following rescaling:
\[ | p \rangle \to \lambda\, | p \rangle , \qquad | p ] \to \lambda^{-1}\, | p ] , \tag{11.49} \]
known as little group scaling. The terminology follows from the fact that there is
a one-parameter SO(2) subgroup (the rotations in the plane transverse to p) of the
Lorentz group that leaves invariant the vector p^µ. Such a residual symmetry that
leaves a vector invariant is called the little group. In the spinor formulation, this
residual symmetry precisely corresponds to the transformation of eq. (11.49).
Under little group scaling of |p⟩ and |p], the polarization vectors of eq. (11.34)
scale as follows:
\[ \epsilon_+^\mu(p; q) \to \lambda^{-2}\, \epsilon_+^\mu(p; q) , \qquad \epsilon_-^\mu(p; q) \to \lambda^{2}\, \epsilon_-^\mu(p; q) , \tag{11.50} \]
i.e. a scaling by a factor λ−2h for a helicity h. Note that the polarization vectors are
invariant under little group scaling of the auxiliary vector q. In an amplitude, the
internal ingredients (propagators and vertices) are not affected by little group scaling.
Therefore, if we apply the little group scaling λi to an external momentum i of an
amplitude, its expression in terms of square and angle spinors must transform as
$$A_n(1\cdots i^{h_i}\cdots n) \to \lambda_i^{-2h_i}\,A_n(1\cdots i^{h_i}\cdots n)\;, \tag{11.51}$$
where hi is the helicity of the external line i (we do not need to specify the helicities
of the other external lines).
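It turns out that the structure of all 3-point amplitudes$^6$ is completely fixed by this
property. Let us start from the following generic expression
$$A_3(1^{h_1} 2^{h_2} 3^{h_3}) = C\,\langle 12\rangle^{\alpha}\,\langle 23\rangle^{\beta}\,\langle 31\rangle^{\gamma}\;. \tag{11.52}$$
Applying eq. (11.51) to each of the three external lines gives the constraints
$$-2h_1 = \alpha+\gamma\;,\qquad -2h_2 = \alpha+\beta\;,\qquad -2h_3 = \beta+\gamma\;, \tag{11.53}$$
whose solution is
$$\alpha = h_3-h_1-h_2\;,\qquad \beta = h_1-h_2-h_3\;,\qquad \gamma = h_2-h_3-h_1\;, \tag{11.54}$$
so that
$$A_3(1^{h_1} 2^{h_2} 3^{h_3}) = C\,\langle 12\rangle^{h_3-h_1-h_2}\,\langle 23\rangle^{h_1-h_2-h_3}\,\langle 31\rangle^{h_2-h_3-h_1}\;. \tag{11.55}$$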
square and angle brackets, and because the number of constraints provided by the helicities of the external
lines is not sufficient to fix the unknown exponents.
Note that instead of eq. (11.52), we could have chosen an ansatz that involves the
square brackets,
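$$A_3(1^{h_1} 2^{h_2} 3^{h_3}) = C\,[12]^{\alpha'}\,[23]^{\beta'}\,[31]^{\gamma'}\;. \tag{11.56}$$
(This is the only alternative, since we are not allowed to mix square and angle brackets
in a 3-point amplitude for massless particles.) Little group scaling would now lead to
$$\alpha' = h_1+h_2-h_3\;,\qquad \beta' = h_2+h_3-h_1\;,\qquad \gamma' = h_3+h_1-h_2\;, \tag{11.57}$$
and consequently
$$A_3(1^{h_1} 2^{h_2} 3^{h_3}) = C\,[12]^{-h_3+h_1+h_2}\,[23]^{-h_1+h_2+h_3}\,[31]^{-h_2+h_3+h_1}\;. \tag{11.58}$$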
The expected dimension of the amplitude is sufficient to choose between eqs. (11.55)
and (11.58). Indeed, both angle and square brackets have mass dimension 1, while the
3-gluon amplitude should have dimension 1 in 4-dimensional Yang-Mills theory (for
which the coupling constant is dimensionless). Since all the kinematical dependence
is carried by the brackets, the prefactor C can only be made of coupling constants
and numerical factors, and must therefore be dimensionless in Yang-Mills theory.
Consider first the − − + amplitude: eq. (11.55) gives a mass dimension +1, while
eq. (11.58) gives a mass dimension −1. Therefore, the − − + amplitude must be
expressed by eq. (11.55) in terms of angle brackets. The same argument tells us that
the + + − amplitude must be given by eq. (11.58), in terms of square brackets.
Let us consider now the − − − amplitude, for which the little group scaling tells
us that
$$A_3(1^- 2^- 3^-) = C\,\langle 12\rangle\,\langle 23\rangle\,\langle 31\rangle\;. \tag{11.59}$$
Therefore, the prefactor C should have mass dimension −2, which cannot be con-
structed from the dimensionless coupling constant of Yang-Mills theory, unless C = 0
(the same conclusion holds if we try to construct this amplitude with square brackets).
Likewise, we conclude that the + + + amplitude is zero as well.
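The bookkeeping behind this little group argument is easy to automate. The following is a minimal sketch, not part of the original derivation: the function names are hypothetical, and the code only encodes the exponents of eq. (11.54) together with the statement that every bracket carries mass dimension 1. It prints the exponents and the mass dimension of the angle-bracket and square-bracket forms for the four helicity assignments discussed above.

```python
from fractions import Fraction

def three_point_exponents(h1, h2, h3):
    """Exponents (alpha, beta, gamma) of <12>, <23>, <31>, as in eq. (11.54)."""
    h1, h2, h3 = (Fraction(h) for h in (h1, h2, h3))
    alpha = h3 - h1 - h2
    beta = h1 - h2 - h3
    gamma = h2 - h3 - h1
    # Little group constraints of eq. (11.53)
    assert alpha + gamma == -2 * h1
    assert alpha + beta == -2 * h2
    assert beta + gamma == -2 * h3
    return alpha, beta, gamma

def angle_mass_dimension(h1, h2, h3):
    """Mass dimension of the angle-bracket form (each bracket has dimension 1).

    The square-bracket form has the opposite exponents, hence the opposite dimension.
    """
    return sum(three_point_exponents(h1, h2, h3))

for helicities in [(-1, -1, +1), (+1, +1, -1), (-1, -1, -1), (+1, +1, +1)]:
    dim = angle_mass_dimension(*helicities)
    print(helicities, "angle form:", dim, "square form:", -dim)
```

Only the forms of dimension +1 are compatible with a dimensionless prefactor C, which singles out the angle-bracket form for − − + and the square-bracket form for + + −, and leaves no option for − − − and + + +.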
n + 2 nI = 3 n3 + 4 n4 ,
nI = n3 + n4 − 1 . (11.60)
The second equation is the statement that this graph has no loops. From these equations
we get the following identities:
n = n3 + 2 n4 + 2 , n3 − 2 n I = 4 − n . (11.61)
Firstly, we see that the mass dimension of the n-point amplitude is 4 − n. Moreover,
the amplitude An does not carry any Lorentz index. Therefore, in the numerator all
the Lorentz indices $\mu_i$ and $\nu_j$ must be contracted pairwise. These contractions lead to
three types of factors:
$$\epsilon_i\cdot\epsilon_{i'}\;,\qquad \epsilon_i\cdot L_j\;,\qquad L_j\cdot L_{j'}\;. \tag{11.63}$$
$$\epsilon_+(i;q_i)\cdot\epsilon_+(i';q_{i'}) \propto \langle q_i\,q_{i'}\rangle\;. \tag{11.64}$$
$$A_n(1^+ 2^+\cdots n^+) = 0\;. \tag{11.65}$$
By the same reasoning, we conclude that the all-minus amplitude is also zero. We can
see here the power that stems from the freedom of choosing the auxiliary vectors qi ;
for generic qi ’s, this amplitude would still be zero (since it does not depend on the
qi ’s), but this zero would result from intricate cancellations among the many graphs
that contribute to An . Instead, with a smart choice of the auxiliary vectors, we can
make this cancellation happen graph by graph.
7 We assume for simplicity Feynman gauge, in which the numerator of the gluon propagator does not
depend on momentum.
Maximally Helicity Violating amplitudes : Let us flip one more helicity, e.g. with
the assignment 1− 2− 3+ · · · n+ . This time, a useful choice of auxiliary vectors is
q1 = q2 = pn , q3 = q4 = · · · = qn = p1 . (11.68)
With this choice, all the contractions of polarization vectors are zero, except:
$$\epsilon_-(2;q_2)\cdot\epsilon_+(i;q_i) \neq 0 \quad\text{for } i = 3,\cdots,n-1\;. \tag{11.69}$$
Thus, this time, we need to contract the remaining n − 2 polarization vectors with
the n3 momenta from the 3-gluon vertices, which is possible (provided that n4 = 0,
which means that diagrams containing 4-gluon vertices do not contribute to the
− − + · · · + amplitude for our choice of auxiliary vectors). Therefore, this assignment
of helicities gives a non-zero amplitude:
$$A_n(1^- 2^- 3^+\cdots n^+) \neq 0\;. \tag{11.70}$$
These amplitudes, called the Maximally Helicity Violating (MHV) amplitudes, are
the simplest non-zero amplitudes. As we shall see later, they are given at tree level by
very compact formulas in terms of square and angular brackets (note that up to n = 5
external lines, all the non-zero amplitudes are MHV amplitudes). Generically, the
complexity of amplitudes increases with the number of − helicities, culminating with
amplitudes that have comparable numbers of − and + helicities (increasing further
the number of − helicities then reduces the complexity).
legs. This problem remains true even after one has factorized the colour factors, even
if it is somewhat mitigated by the fact that the number of cyclic-ordered graphs grows
at a slower pace.
This issue could be avoided if there was a way to break down a tree amplitude into
smaller pieces (themselves tree amplitudes) that have a smaller number of external
legs. It turns out that an amplitude naturally factorizes into two sub-amplitudes
when one of its internal propagators goes on-shell. The physical reason of such a
factorization is that on-shell momenta correspond to infinitely long-lived particles.
Thus, the two sub-amplitudes on each side of this on-shell propagator do not talk
to one another. The other advantage of this situation is that the two sub-amplitudes
would themselves be on-shell, and therefore we may use for them spinor-helicity
formulas that could have been previously obtained for amplitudes with fewer external
legs. If this were possible, we would thus obtain a recursive relationship (in the
number of external legs) for on-shell amplitudes.
where z is a complex variable that controls the deformation. The singularities of tree
Feynman graphs come from the zeroes of the denominators of its internal propagators,
which give poles in z. Our goal will be to choose this deformation in such a way that
the total momentum remains conserved, and the deformed external momenta are still
on-shell. With such a choice, we will be able to reuse the on-shell formulas obtained
for smaller amplitudes.
Let us consider the ratio An (· · · ; z)/z. Besides the poles coming from the internal
propagators, the ratio also has a simple pole at z = 0. Let us assume that An (· · · ; z)
vanishes when |z| → ∞, so that the integral of An (· · · ; z)/z on a contour at infinity
in the complex plane vanishes. Then, we may write
$$0 = \oint_\gamma \frac{dz}{2\pi i}\,\frac{A_n(\cdots;z)}{z} = A_n(\cdots;0) + \sum_{z_i\in\{\text{poles of }A_n\}} \mathrm{Res}\,\frac{A_n(\cdots;z)}{z}\bigg|_{z_i}\;. \tag{11.72}$$
The first term, An (· · · ; z = 0), is nothing but the amplitude we aim at calculating.
This formula therefore expresses it in terms of the residues of An (· · · ; z)/z at the
• The sum of the shifted incoming momenta should remain zero. Therefore, we
must shift at least two momenta (and the simplest is to shift only two).
• The shifted momenta should stay on-shell at all z.
• The amplitude evaluated at the shifted momenta should go to zero as |z| → ∞.
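The role of the third requirement in eq. (11.72) can be illustrated on a toy example. The sketch below is only an illustration: `toy_amplitude` is a made-up rational function standing in for $A_n(\cdots;z)$, with two simple poles and a $1/z^2$ falloff, and is not an actual amplitude. It checks numerically that the contour integral at large radius vanishes, and that the value at $z=0$ is recovered as minus the sum of the residues of $A(z)/z$ at the poles of $A$.

```python
import cmath
import math

def toy_amplitude(z):
    """Toy stand-in for A_n(...; z): simple poles at z = 1 and z = -2, decays like 1/z^2."""
    return 1.0 / ((z - 1.0) * (z + 2.0))

def residue_at_simple_pole(pole, other_pole):
    """Residue of toy_amplitude(z)/z at one of the two simple poles of toy_amplitude."""
    return 1.0 / (pole * (pole - other_pole))

def contour_integral(radius, n_points=2000):
    """(1/2 pi i) times the integral of toy_amplitude(z)/z over a circle of given radius."""
    total = 0.0
    for n in range(n_points):
        z = radius * cmath.exp(2j * math.pi * n / n_points)
        # dz / (2 pi i z) = dphi / (2 pi): the integral is the mean of the amplitude
        total += toy_amplitude(z)
    return total / n_points

residue_sum = residue_at_simple_pole(1.0, -2.0) + residue_at_simple_pole(-2.0, 1.0)
print("contour integral at large radius:", abs(contour_integral(50.0)))
print("A(0) from residues:", -residue_sum, "direct:", toy_amplitude(0.0))
```

If the toy function did not decay at infinity, the contour integral would leave a boundary term and the reconstruction from residues would fail.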
$$p_i \to \hat{p}_i = p_i(z) \equiv p_i + z\,k\;,\qquad p_j \to \hat{p}_j = p_j(z) \equiv p_j - z\,k\;, \tag{11.73}$$
where we denote with a hat the shifted momenta. All the momenta $p_k$ for $k\neq i,j$ are
left unmodified. The on-shell conditions for $\hat{p}_{i,j}$ are satisfied provided that
$$k^2 = 0\;,\qquad p_i\cdot k = 0\;,\qquad p_j\cdot k = 0\;. \tag{11.74}$$
It turns out that these equations have two solutions (up to an arbitrary prefactor),
provided we allow complex momenta. In the spinor notation, the first condition is
automatically satisfied if the bi-spinor $K$ can be factorized as in eq. (11.17), while the second and
third conditions become
$$\langle ik\rangle\,[ik] = 0\;,\qquad \langle jk\rangle\,[jk] = 0\;. \tag{11.75}$$
This explains why we need a complex momentum $k^\mu$. Indeed, for a real $k^\mu$, $|k\rangle$ and
$|k]$ are related by complex conjugation, and the above conditions reduce to $\langle ik\rangle =
\langle jk\rangle = 0$. With two-component spinors, this implies $|k\rangle\propto|i\rangle$ and $|k\rangle\propto|j\rangle$, which
is in general impossible. By allowing a complex momentum $k^\mu$, we let $|k\rangle$ and $|k]$
be independent, which allows us to solve the above conditions by having for instance:
$$|k\rangle = |i\rangle\;,\qquad |k] = |j]\;. \tag{11.76}$$
(The other independent solution consists in exchanging the roles of i and j.) The
bi-spinors corresponding to the shifted momenta are
$$\hat{P}_i = |i]\langle i| + z\,|j]\langle i| = \big(|i] + z\,|j]\big)\langle i|\;,\qquad \hat{P}_j = |j]\langle j| - z\,|j]\langle i| = |j]\big(\langle j| - z\,\langle i|\big)\;, \tag{11.77}$$
i.e.
$$|\hat\imath\rangle = |i\rangle\;,\qquad |\hat\jmath\rangle = |j\rangle - z\,|i\rangle\;,$$
$$|\hat\imath] = |i] + z\,|j]\;,\qquad |\hat\jmath] = |j]\;. \tag{11.78}$$

Figure 11.3: Propagators affected by the momentum shift (shown in dark) in a tree amplitude. The lighter colored lines do not depend on z. The propagators on the external lines are not actually part of the expression of the amplitude.
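The two properties required of the shift, conservation of the total momentum and on-shellness of the shifted momenta, can be checked numerically. The following sketch assumes the bi-spinor convention $P=|p]\langle p|$ stored as a 2×2 matrix; the helper names and the numerical spinors are arbitrary illustrative choices, not taken from the text. A vanishing determinant of the bi-spinor is equivalent to $p^2=0$.

```python
def outer(sq, ang):
    """Bi-spinor |p]<p| as a 2x2 matrix built from a square and an angle spinor."""
    return [[sq[a] * ang[b] for b in range(2)] for a in range(2)]

def add(m1, m2):
    """Entry-wise sum of two 2x2 matrices."""
    return [[m1[a][b] + m2[a][b] for b in range(2)] for a in range(2)]

def det(m):
    """Determinant of a 2x2 matrix; it vanishes for an on-shell massless bi-spinor."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def bcfw_shift(ang_i, sq_i, ang_j, sq_j, z):
    """Shift of eq. (11.78): |i] -> |i] + z|j] and |j> -> |j> - z|i>."""
    sq_i_hat = tuple(sq_i[a] + z * sq_j[a] for a in range(2))
    ang_j_hat = tuple(ang_j[a] - z * ang_i[a] for a in range(2))
    return ang_i, sq_i_hat, ang_j_hat, sq_j

# Arbitrary illustrative spinors (complex momenta) and shift parameter
ang_i, sq_i = (1.0, 2.0), (0.5, -1.0)
ang_j, sq_j = (-0.3, 0.7), (2.0, 1.5)
z = 0.8 - 0.6j

before = add(outer(sq_i, ang_i), outer(sq_j, ang_j))
ai, si, aj, sj = bcfw_shift(ang_i, sq_i, ang_j, sq_j, z)
after = add(outer(si, ai), outer(sj, aj))

max_change = max(abs(after[a][b] - before[a][b]) for a in range(2) for b in range(2))
print("change in p_i + p_j:", max_change)
print("shifted p_i^2, p_j^2:", abs(det(outer(si, ai))), abs(det(outer(sj, aj))))
```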
global behaviour at most ∼ z at large z (obtained when all these vertices are 3-gluon
vertices, that scale as z). For the assignment {hi = −, hj = +} of polarizations, we
thus find an overall behaviour8 in z−1 , valid graph by graph.
For other combinations of polarizations on the lines i, j, this diagrammatic ar-
gument suggests that they do not go to zero. However, the actual behaviour for
{hi = −, hj = −} and {hi = +, hj = +} is better than the one suggested by this
graph by graph estimate. Firstly, note that this problem is reminiscent of the eikonal
approximation, in which a hard on-shell particle punches through a background of
much softer particles that very mildly disturb its motion. This can be studied by
splitting the gauge field $A^\mu$ into a hard component $a^\mu$ that describes the gluons along
the string with shifted momenta and a soft background $\mathcal{A}^\mu$ (describing the unshifted
gluons attached to the hard ones),
$$A^\mu \equiv \mathcal{A}^\mu + a^\mu\;. \tag{11.80}$$
$$\mathcal{L}_{\rm YM} = \cdots - \frac{1}{4}\,\mathrm{tr}\Big(\big(D_\mu a_\nu - D_\nu a_\mu\big)\big(D^\mu a^\nu - D^\nu a^\mu\big)\Big) + \frac{ig}{2}\,\mathrm{tr}\Big(\big[a_\mu,a_\nu\big]F^{\mu\nu}\Big) + \cdots\;, \tag{11.81}$$
where the covariant derivative $D_\mu$ is constructed with the background field. When
splitting the gauge potential as in eq. (11.80), one may fix independently the gauge
for the background and for the fluctuation aµ . For the latter, a convenient choice is
the background field gauge,
After adding the gauge fixing term, the quadratic part of the Lagrangian becomes
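$$\mathcal{L}_{\rm YM+GF} = \cdots - \frac{1}{4}\,\mathrm{tr}\Big(D_\mu a_\alpha\,D^\mu a^\alpha\Big) + \frac{ig}{2}\,\mathrm{tr}\Big(\big[a_\mu,a_\nu\big]F^{\mu\nu}\Big) + \cdots \tag{11.83}$$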
In this equation, the first term possesses an extended Lorentz symmetry, since it is
invariant under independent Lorentz transformations of the fluctuations and of the
background, while the second term is only invariant under simultaneous transformations
of $\mathcal{A}^\mu$ and $a^\mu$.
8 When the shifted amplitude decreases faster than $z^{-1}$, one may obtain a more compact expression
by integrating $A_n(\cdots;z)(1-z/z_*)/z$, where $z_*$ is one of the poles of $A_n$. There is no boundary term
thanks to the faster decrease of $A_n$, and the additional subtraction removes the contribution from the pole
$z_*$, leading to an expression with one less term.
Let us denote $M_{\alpha\beta}[\mathcal{A}]$ the propagator of the fluctuation $a^\mu$, amputated of its
final lines. This propagator contains 3-gluon couplings to the background field, that
come from the first term of eq. (11.83), and 4-gluon couplings to the background
field coming from the second term. With only 3-gluon couplings, we have $M_{\alpha\beta}\sim z$
(because there is one more vertex than propagators), and each 4-gluon vertex removes
one power of $z$. Given the Lorentz structure of eq. (11.83), we may write
$$M_{\alpha\beta} = \big(c_1 z + c_0 + c_{-1} z^{-1} + \cdots\big)\,g_{\alpha\beta} + A_{\alpha\beta} + z^{-1} B_{\alpha\beta} + \cdots \tag{11.84}$$
In this formula, the first term comes entirely from the first term of eq. (11.83), whose
extended Lorentz symmetry leads to the factor gαβ . All the coefficients in this
expansion are functionals of the soft field. The term Aαβ , that comes from a single
insertion of the 4-gluon vertex, is antisymmetric. The subsequent terms correspond to
2 or more insertions of the 4-gluon vertex. These terms have no definite symmetry,
but they are not needed in the discussion. The amputated 2-point function Mαβ also
obeys the following on-shell Ward identities:
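$$\hat{p}_i^\alpha\,M_{\alpha\beta}\,\epsilon^\beta_{h_j}(\hat\jmath) = 0\;,\qquad \epsilon^\alpha_{h_i}(\hat\imath)\,M_{\alpha\beta}\,\hat{p}_j^\beta = 0\;, \tag{11.85}$$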
with shifted on-shell momenta and polarization vectors. Note that, unlike in an
Abelian gauge theory, it is necessary to contract one side of the function with a
physical polarization vector for the identity to hold.
The shifted amplitude An is obtained by keeping n − 2 powers of the background
field in Mαβ , and by contracting with the appropriate polarization vectors:
$$A_n \sim \epsilon^\alpha_{h_i}(\hat\imath;q)\,M_{\alpha\beta}\,\epsilon^\beta_{h_j}(\hat\jmath;q')\;. \tag{11.86}$$
The first term vanishes because k is on-shell, and the second one thanks to the
antisymmetry of Aαβ . Next, consider the case {hi = −, hj = −}, for which we
obtain
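$$\begin{aligned}
A_{n;--} &\sim k^\alpha\,M_{\alpha\beta}\,\epsilon_-^\beta(\hat\jmath;q')\\
&= -z^{-1}\,p_i^\alpha\,M_{\alpha\beta}\,\epsilon_-^\beta(\hat\jmath;q')\\
&\sim z^{-1}\,p_i\cdot(k^*-z\,p_i)\,(c_1 z+\cdots) + z^{-1}\,p_i^\alpha\,(k^{*\beta}-z\,p_i^\beta)\,A_{\alpha\beta} + O(z^{-1})\\
&\sim O(z^{-1})\;.
\end{aligned} \tag{11.89}$$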
The second line is obtained by using the Ward identity, and in the third line all terms
that could be larger than $z^{-1}$ vanish due to $p_i^2 = p_i\cdot k^* = 0$ and thanks to the
antisymmetry of $A_{\alpha\beta}$. The case $\{h_i=+,h_j=+\}$ is very similar and leads to
$$\begin{aligned}
A_{n;++} &\sim \epsilon_+^\alpha(\hat\imath;q)\,M_{\alpha\beta}\,k^\beta\\
&= z^{-1}\,\epsilon_+^\alpha(\hat\imath;q)\,M_{\alpha\beta}\,p_j^\beta\\
&\sim z^{-1}\,(k^*+z\,p_j)\cdot p_j\,(c_1 z+\cdots) + z^{-1}\,(k^{*\alpha}+z\,p_j^\alpha)\,p_j^\beta\,A_{\alpha\beta} + O(z^{-1})\\
&\sim O(z^{-1})\;.
\end{aligned} \tag{11.90}$$
Finally, in the last case, $\{h_i=+,h_j=-\}$, we obtain $A_{n;+-}\sim O(z^3)$, and therefore
we cannot use such a shift in eq. (11.72).
As explained earlier, the poles zi come from the vanishing denominators of the
internal propagators, i.e. one of the dark colored propagators in the figure 11.4. Let
us denote KI the momentum (before the shift) carried by the propagator producing
the pole, with the convention that it is oriented in the same direction as p1 . The shift
changes this momentum into
$$K_I \to \hat{K}_I \equiv K_I + z\,k\;, \tag{11.92}$$
and the condition that the denominator of the propagator vanishes after the shift is
$$0 = \hat{K}_I^2 = K_I^2 + 2\,z_I\,K_I\cdot k\;,\qquad\text{i.e.}\qquad z_I = -\frac{K_I^2}{2\,K_I\cdot k}\;. \tag{11.93}$$
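As a quick numerical check of eq. (11.93), the sketch below evaluates $z_I$ for an arbitrary off-shell momentum $K_I$ and a complex null vector $k$, and verifies that the shifted momentum is on-shell. The numerical vectors and the helper names are illustrative assumptions only; the metric signature is (+,−,−,−) and no complex conjugation enters the scalar product.

```python
def mdot(p, q):
    """Minkowski product with signature (+,-,-,-), without complex conjugation."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def bcfw_pole(K, k):
    """Value z_I of eq. (11.93) at which (K + z k)^2 = 0, for a null shift vector k."""
    return -mdot(K, K) / (2 * mdot(K, k))

K = (3.0, 1.0, 2.0, 0.5)      # arbitrary off-shell momentum, K^2 = 3.75
k = (0.0, 1.0, 1j, 0.0)       # complex null vector: k^2 = -1 - (1j)^2 = 0

z_I = bcfw_pole(K, k)
K_hat = tuple(Ki + z_I * ki for Ki, ki in zip(K, k))

print("k^2 =", mdot(k, k))
print("z_I =", z_I)
print("shifted K^2 =", abs(mdot(K_hat, K_hat)))
```

Because $k^2=0$, the shifted invariant is linear in $z$, which is why each propagator contributes a single simple pole.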
The singular propagator divides the amplitude into left and right sub-amplitudes, so
that we may write:
$$A_n(\hat1\,2\cdots(n-1)\,\hat{n};z) \equiv \sum_{h=\pm} A_L(\hat1\,2\cdots -\!\hat{K}_I^{+h};z)\;\frac{i}{\hat{K}_I^2}\;A_R(\hat{K}_I^{-h}\cdots(n-1)\,\hat{n};z)\;, \tag{11.94}$$
with a sum over the helicity $h$ of the intermediate gluon$^{10}$. From this expression, the
residue at the pole $z_I$ of $A_n(\cdots;z)/z$ takes the form
$$\mathrm{Res}\,\frac{A_n(\cdots;z)}{z}\bigg|_{z_I} = -\sum_{h=\pm} A_L(\hat1\,2\cdots -\!\hat{K}_I^{+h};z_I)\;\frac{i}{K_I^2}\;A_R(\hat{K}_I^{-h}\cdots(n-1)\,\hat{n};z_I)\;. \tag{11.95}$$
Both AL and AR have strictly less than n external lines, which means that the formula
is recursive: it expresses an amplitude in terms of smaller amplitudes, eventually
breaking it down to 3-point amplitudes. Moreover, the crucial point here is that,
when evaluated at the value $z_I$ that gives $\hat{K}_I^2 = 0$, the left and right sub-amplitudes
have only on-shell (but complex) external momenta. Therefore, this recursion never
requires off-shell amplitudes, which is of utmost importance for keeping out of the
calculation unnecessarily complicated kinematics and unphysical degrees of freedom.
Since each internal z-dependent propagator can be singular for some z, eq. (11.91)
contains one term for each such propagator. There are at most $n-3$ terms in this sum,
corresponding to the partitions of $[2,n-1] = [2,l]\cup[l+1,n-1]$ with $2\le l\le n-2$.
10 Both $A_L$ and $A_R$ are defined with all gluons incoming. This is why one has argument $-\hat{K}_I$ and the
other one $+\hat{K}_I$. For the same reason, the helicity is $+h$ on one side and $-h$ on the other side.
$$A_n(1^- 2^- 3^+\cdots n^+) = A_{n-1}(\hat1^- 2^- 3^+\cdots(n-2)^+ -\!\hat{K}_I^+;z_I)\;\frac{i}{K_I^2}\;A_3(\hat{K}_I^-\,(n-1)^+\,\hat{n}^+;z_I)\;, \tag{11.96}$$
where the momentum carried by the singular propagator is (before the shift)
KI = −(pn−1 + pn ) . (11.97)
In the right hand side of eq. (11.96), the factor on the right is an already known 3-point
amplitude, and the factor on the left is an MHV amplitude with n − 1 external legs.
11 This assignment of helicities, with the negative helicities carried by adjacent lines, is the simplest.
MHV amplitudes with non-adjacent negative helicities are also given by the Parke-Taylor formula, but the
proof is a bit more complicated in this case.
Four-point MHV amplitude : Let us now calculate the first few iterations of
this recursion, in order to guess a formula for the MHV amplitude that will be
our hypothesis for an inductive proof. Firstly, consider the − − ++ 4-point MHV
amplitude. In this case, the BCFW recursion formula gives
$$A_4(1^- 2^- 3^+ 4^+) = A_3(\hat1^- 2^- -\!\hat{K}_I^+;z_I)\;\frac{i}{K_I^2}\;A_3(\hat{K}_I^-\,3^+\,\hat4^+;z_I)\;, \tag{11.98}$$
and both amplitudes in the right hand side are known. This gives:
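$$A_4(1^- 2^- 3^+ 4^+) = 2\,i\,g^2\;\frac{\langle\hat1 2\rangle^3}{\langle 2\hat{K}_I\rangle\langle\hat{K}_I\hat1\rangle}\;\frac{1}{\langle 12\rangle[12]}\;\frac{[3\hat4]^3}{[\hat4\hat{K}_I][\hat{K}_I 3]}\;. \tag{11.99}$$
we obtain
$$\begin{aligned}
|\hat{K}_I\rangle[\hat{K}_I| &= |1\rangle[1| + |2\rangle[2| + z_I\,|1\rangle[4|\;,\\
\langle 2\hat{K}_I\rangle[\hat{K}_I 4] &= \langle 21\rangle[14]\;,\\
\langle 1\hat{K}_I\rangle[\hat{K}_I 3] &= \langle 12\rangle[23]\;,
\end{aligned} \tag{11.101}$$
which leads to
$$A_4(1^- 2^- 3^+ 4^+) = 2\,i\,g^2\;\frac{[34]^3}{[41][12][23]}\;. \tag{11.102}$$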
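This formula, that depends only on square brackets, can also be expressed in terms
of angle brackets. Let us multiply the numerator and denominator by $\langle 12\rangle^3$. Then,
momentum conservation leads to
$$\begin{aligned}
\langle 41\rangle[12] &= -\langle 43\rangle[32]\;,\\
\langle 12\rangle[23] &= -\langle 14\rangle[43]\;,\\
\langle 12\rangle[12] &= (p_1+p_2)^2 = (p_3+p_4)^2 = \langle 34\rangle[34]\;,
\end{aligned} \tag{11.103}$$
which turns eq. (11.102) into the first of eqs. (11.48).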
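The equivalence between the square-bracket form (11.102) and the angle-bracket form of eq. (11.48) can be verified numerically on momentum-conserving kinematics. The sketch below is an illustration under stated assumptions: both brackets are implemented as the antisymmetric product $a_0b_1-a_1b_0$, momentum conservation is imposed as $\sum_k|k\rangle[k|=0$ on complex spinors, and all function names and numerical inputs are hypothetical. Only the kinematic factors are compared, since the prefactor $2ig^2$ is common to both forms.

```python
def br(a, b):
    """Antisymmetric spinor product a0*b1 - a1*b0, used for both <ab> and [ab]."""
    return a[0] * b[1] - a[1] * b[0]

def four_point_kinematics(lam, lt1, lt2):
    """Complete two square spinors into four, so that sum_k |k>[k| = 0."""
    l1, l2, l3, l4 = lam
    lt3 = tuple(-(br(l4, l1) * lt1[a] + br(l4, l2) * lt2[a]) / br(l4, l3) for a in range(2))
    lt4 = tuple(-(br(l3, l1) * lt1[a] + br(l3, l2) * lt2[a]) / br(l3, l4) for a in range(2))
    return [lt1, lt2, lt3, lt4]

def momentum_sum(lam, lt):
    """Largest entry of sum_k |k>[k|; it vanishes when momentum is conserved."""
    return max(abs(sum(l[a] * t[b] for l, t in zip(lam, lt)))
               for a in range(2) for b in range(2))

def parke_taylor_factor(sp):
    """<12>^3 / (<23><34>...<n1>) for a list of angle spinors."""
    n = len(sp)
    den = 1
    for i in range(1, n):
        den *= br(sp[i], sp[(i + 1) % n])
    return br(sp[0], sp[1]) ** 3 / den

def square_form_four_point(lt):
    """[34]^3 / ([41][12][23]), the kinematic factor of eq. (11.102)."""
    t1, t2, t3, t4 = lt
    return br(t3, t4) ** 3 / (br(t4, t1) * br(t1, t2) * br(t2, t3))

# Arbitrary illustrative angle spinors and two of the four square spinors
lam = [(1.0, 0.5), (0.3, 1.2), (-0.7, 0.4 + 0.2j), (1.1, -0.9)]
lt = four_point_kinematics(lam, (0.2, 1.0), (1.5, -0.4))

angle_form = parke_taylor_factor(lam)
square_form = square_form_four_point(lt)
print("momentum conservation:", momentum_sum(lam, lt))
print("angle form:", angle_form, "square form:", square_form)
```

The same `parke_taylor_factor` function evaluates the kinematic part of the 5-point result when given five angle spinors.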
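$$\begin{aligned}
A_5(1^- 2^- 3^+ 4^+ 5^+) &= A_4(\hat1^- 2^- 3^+ -\!\hat{K}_I^+;z_I)\;\frac{i}{K_I^2}\;A_3(\hat{K}_I^-\,4^+\,\hat5^+;z_I)\\
&= (\sqrt{2}\,g)^3\,i^2\;\frac{\langle\hat1 2\rangle^3}{\langle 23\rangle\langle 3\hat{K}_I\rangle\langle\hat{K}_I 1\rangle}\;\frac{1}{\langle 45\rangle[45]}\;\frac{[4\hat5]^3}{[\hat5\hat{K}_I][\hat{K}_I 4]}\;,
\end{aligned} \tag{11.105}$$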
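where we have chosen to express $K_I^2$ as $(p_4+p_5)^2 = \langle 45\rangle[45]$. This time, we use
$$\begin{aligned}
&|\hat1\rangle = |1\rangle\;,\quad |\hat1] = |1] + z\,|5]\;,\quad |\hat5\rangle = |5\rangle - z\,|1\rangle\;,\quad |\hat5] = |5]\;,\\
&|\hat{K}_I\rangle[\hat{K}_I| = -|4\rangle[4| - |5\rangle[5| + z_I\,|1\rangle[5|\;,\\
&\langle 3\hat{K}_I\rangle[\hat{K}_I\hat5] = -\langle 34\rangle[45]\;,\\
&\langle\hat1\hat{K}_I\rangle[\hat{K}_I 4] = -\langle 51\rangle[45]\;,
\end{aligned} \tag{11.106}$$
which gives
$$A_5(1^- 2^- 3^+ 4^+ 5^+) = (\sqrt{2}\,g)^3\,i^2\;\frac{\langle 12\rangle^3}{\langle 23\rangle\langle 34\rangle\langle 45\rangle\langle 51\rangle}\;. \tag{11.107}$$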
Parke-Taylor formula : The previous results for 3, 4 and 5-point MHV amplitudes
lead us to conjecture the following general formula:
$$A_n(1^- 2^- 3^+\cdots n^+) = (\sqrt{2}\,g)^{n-2}\,i^{n-3}\;\frac{\langle 12\rangle^3}{\langle 23\rangle\langle 34\rangle\cdots\langle(n-1)n\rangle\langle n1\rangle}\;, \tag{11.108}$$
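known as the Parke-Taylor formula. Let us assume the formula to be true for all
$p<n$, and consider now the case of the n-point MHV amplitude. The BCFW
recursion formula (11.96) gives
$$\begin{aligned}
A_n(1^- 2^- 3^+\cdots n^+) &= A_{n-1}(\hat1^- 2^- 3^+\cdots(n-2)^+ -\!\hat{K}_I^+;z_I)\\
&\quad\times\frac{i}{K_I^2}\;A_3(\hat{K}_I^-\,(n-1)^+\,\hat{n}^+;z_I)\\
&= (\sqrt{2}\,g)^{n-2}\,i^{n-3}\;\frac{\langle\hat1 2\rangle^3}{\langle 23\rangle\cdots\langle(n-2)\hat{K}_I\rangle\langle\hat{K}_I 1\rangle}\\
&\quad\times\frac{1}{\langle(n-1)n\rangle[(n-1)n]}\;\frac{[(n-1)\hat{n}]^3}{[\hat{n}\hat{K}_I][\hat{K}_I(n-1)]}\;,
\end{aligned} \tag{11.109}$$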
where we have used our induction hypothesis for the (n − 1)-point MHV amplitude
that appears in the left sub-amplitude. The spinor manipulations that are necessary to
simplify this expression are the same as in the case of the 5-point amplitude, and lead
to:
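$$\begin{aligned}
\langle(n-2)\hat{K}_I\rangle[\hat{K}_I\hat{n}] &= -\langle(n-2)(n-1)\rangle[(n-1)n]\;,\\
\langle\hat1\hat{K}_I\rangle[\hat{K}_I(n-1)] &= -\langle n1\rangle[(n-1)n]\;,
\end{aligned} \tag{11.110}$$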
thanks to which we obtain eq. (11.108) for n points. Up to 5 points, all amplitudes
are MHV (or anti-MHV, i.e. + + − − −). Beyond 5 points, there exist non-MHV
amplitudes, that are not given by the Parke-Taylor formula. However, multiple MHV
amplitudes can be sewn together in order to construct the non-MHV ones, with
a set of rules known as the Cachazo-Svrcek-Witten (CSW) rules, derived in the
section 11.6. Such an expansion is much more efficient than the textbook perturbation
theory, because it is in terms of on-shell gauge-invariant building blocks (the MHV
amplitudes) that already encapsulate a lot of the underlying complexity.
where $g_{\mu\nu}$ is the metric tensor, $R$ is the Ricci curvature and $\kappa$ is a coupling constant
related to Newton's constant by $\kappa^2 = 32\pi\,G_N$. In this action, we have also added
the minimal coupling to a gauge field and to a scalar field, in order to investigate
gravitational interactions with light and matter. The rules for the propagators and
vertices involving gravitons are obtained by expanding the metric around flat space:
($\eta_{\mu\nu}$ is the flat space Minkowski metric.) Let us make a remark on dimensions:
Newton's constant has mass dimension −2, $\kappa$ has mass dimension −1, the Ricci
curvature has mass dimension 2, and $h_{\mu\nu}$ has mass dimension +1 (like the scalar $\phi$
and the photon $A_\mu$). The expansion in powers of $h_{\mu\nu}$ leads to an infinite series of
terms (because the Ricci tensor contains the inverse $g^{\mu\nu}$ of the metric tensor, and also
because of the expansion of the square root $\sqrt{-g}$). Schematically, the expansion of
the Hilbert-Einstein action starts with the following terms:
$$S_{\rm HE} \sim \int d^4x\,\Big\{ h\,\partial^2 h + \kappa\,h^2\partial^2 h + \kappa^2\,h^3\partial^2 h + \cdots + \kappa\,h\,\phi\,\partial^2\phi + \kappa\,h\,F^2 + \cdots \Big\}\;. \tag{11.113}$$
This sketch only indicates the number of powers of h and the number of derivatives
contained in each term, but of course the actual structure of these terms is much more
complicated. For instance, the vertex describing the coupling φφh between two
scalars and a graviton reads:
$$\Gamma^{\mu\nu}(p_1,p_2) = -\frac{i\kappa}{2}\Big[p_1^\mu p_2^\nu + p_1^\nu p_2^\mu - \eta^{\mu\nu}\big(p_1\cdot p_2 - m^2\big)\Big]\;, \tag{11.114}$$
where $p_{1,2}$ are the momenta carried by the two scalar lines (since the graviton has
spin 2, the graviton attached to this vertex carries two Lorentz indices). But the
γγh coupling is far more complicated, and the hhh tri-graviton vertex is even more
complex, leading to extremely cumbersome perturbative calculations if performed
within the traditional approach.
12 At tree-level, these amplitudes are completely prescribed by the equivalence principle and general
relativity, and their calculation does not require to have a consistent theory of gravitational quantum
fluctuations.
It turns out that tree amplitudes in Einstein gravity have a simple form in the
spinor-helicity formalism, very much like their Yang-Mills analogue. The goal of
this section is to illustrate with two examples the use of the spinor-helicity formalism,
combined with the BCFW recursion, in order to calculate some amplitudes that have
a relevance in gravitational physics: (1) gravitational bending of light by a mass,
and (2) scattering of a gravitational wave by a mass. In both examples, the mass
acting as a source of gravitational field is taken to be a scalar particle. In the approach
based on conventional Feynman perturbation theory, these processes are given by the
diagrams shown in the figure 11.6. In particular, the second example (bending of a
[Figure 11.6: Feynman diagrams contributing to $A_{\phi\gamma\to\phi\gamma}$ and $A_{\phi h\to\phi h}$.]
$$\epsilon^{\mu\nu}_{2h}(p;q) = \epsilon^\mu_h(p;q)\,\epsilon^\nu_h(p;q)\;. \tag{11.115}$$
For 3-point amplitudes that involve only massless particles (photons and gravitons),
little group scaling is sufficient to constrain completely their form. We obtain:
In order to obtain the zeroes of the first line, and to choose between square and angle
brackets for the non-zero results, we use the fact that the 3-point amplitude must have
mass dimension +1, with a prefactor made up only of numerical constants and one
power of $\kappa$ (that has mass dimension −1). The value of the prefactor is obtained
by inspecting the term of order $\kappa$ in the expansion of $\sqrt{-g}\,F^2$. For the 3-graviton
amplitudes, little group scaling leads to
Interestingly, the kinematical part of the non-zero 3-graviton amplitudes is simply the
square13 of that of the 3-gluon amplitudes with like-sign helicities (see eqs. (11.46)
and (11.47)), despite a considerably more complicated Feynman rule for the 3-graviton
vertex. This is yet another illustration of the fact that traditional Feynman rules carry
a lot of unnecessary information that disappears in on-shell amplitudes with physical
polarizations.
For the φφh amplitude, we cannot rely on little group scaling because the scalar
field is massive. Instead, we simply contract eq. (11.114) with the polarization vector
(11.115) of the graviton, and take the external momenta on mass-shell. For a graviton
of helicity +2, we have
$$A_{\phi\phi h}(1^0 2^0 3^{+2}) = -i\kappa\,\big(p_1\cdot\epsilon_+(p_3;q)\big)\big(p_2\cdot\epsilon_+(p_3;q)\big) = -\frac{i\kappa}{2}\,\frac{\langle q|P_1|p_3]\,\langle q|P_2|p_3]}{\langle q\,p_3\rangle^2}\;, \tag{11.118}$$
13 This property of 3-point purely gravitational amplitudes has a generalization for n-point amplitudes,
known as the Kawai-Lewellen-Tye (KLT) relations. These relations have also been interpreted as a form of
colour-kinematics duality by Bern, Carrasco and Johansson.
Note that,
$$\begin{aligned}
\langle p_3|P_2|q] &= -\langle p_3|P_1|q] - \underbrace{\langle p_3|P_3|q]}_{0} = -\langle p_3|P_1|q]\;,\\
\langle q|P_2|p_3] &= -\langle q|P_1|p_3] - \underbrace{\langle q|P_3|p_3]}_{0} = -\langle q|P_1|p_3]\;,
\end{aligned} \tag{11.120}$$
Note that since $p_1^2 = m^2 \neq 0$, the bi-spinor $P_1$ does not admit a factorized form, and
this cannot be simplified further.
Shifted momenta : Consider now the amplitude $A_{\gamma\gamma\phi\phi}(1^+ 2^- 3^0 4^0)$, and apply
the shift to the lines 2 and 3, as illustrated in the figure 11.7 $^{14}$:
$$\hat{p}_2 \equiv p_2 + z\,k\;,\qquad \hat{p}_3 \equiv p_3 - z\,k\;,\qquad k^2 = 0\;,\qquad k\cdot p_2 = k\cdot p_3 = 0\;. \tag{11.122}$$
$$|k\rangle = |2\rangle \tag{11.123}$$
However, since $p_3^2 = m^2$, the bi-spinor $P_3$ that represents the momentum $p_3$ cannot
be factorized. Instead, we may write
$$0 = 2\,k\cdot p_3 = -\langle k|P_3|k]\;, \tag{11.124}$$
14 Note that the factorization with one scalar and one photon on each side of the singular propagator is
not allowed: indeed, the intermediate propagator would need to carry a scalar, and we would have two
φφγ sub-amplitudes, that are zero per our assumption that the scalar field is not electrically charged.
Note that the second one is not factorizable, because the line 3 carries a massive
particle.
Scattering amplitude : With this choice of shifts, the BCFW recursion formula
can be written as follows
$$A_{\gamma\gamma\phi\phi}(1^+ 2^- 3^0 4^0) = \frac{i}{K_I^2}\sum_{h=\pm2} A_{\gamma\gamma h}(1^+\,\hat2^- -\!\hat{K}_I^{+h};z_I)\;A_{h\phi\phi}(\hat{K}_I^{-h}\,\hat3^0\,4^0;z_I)\;, \tag{11.127}$$
where the shifted momenta in the 3-point amplitudes are evaluated at the $z_I$ for which
the shifted momentum $\hat{K}_I$ of the intermediate graviton is on-shell. The condition for
the intermediate momentum to be on-shell reads
$$0 = \hat{K}_I^2 = (p_1+\hat{p}_2)^2 = 2\,p_1\cdot\hat{p}_2 = \langle 12\rangle\,\underbrace{\big([12] + z_I\,[1|P_3|2\rangle\big)}_{[1\hat2]=0}\;, \tag{11.128}$$
15 We use $\langle k|P_3|k] = \langle 2|P_3P_3|2\rangle = m^2\langle 22\rangle = 0$. When $p_3$ is massless, $P_3$ factorizes as $P_3 =
|3]\langle 3|$, and this solution becomes $|k] = |3]\langle 32\rangle$. Up to a rescaling, this is the solution we have previously
used in the massless case.
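whose solution is $z_I = -[12]/[1|P_3|2\rangle$. Plugging in the results for the 3-point
amplitudes and summing explicitly over the two helicities of the intermediate graviton,
the γγφφ amplitude can be written as
$$A_{\gamma\gamma\phi\phi}(1^+ 2^- 3^0 4^0) = \frac{\kappa^2}{4}\,\frac{1}{\langle 12\rangle[12]}\left\{\frac{[\hat{K}_I 1]^4}{[1\hat2]^2}\,\frac{\langle\hat{K}_I|P_4|q]^2}{[q\hat{K}_I]^2} + \frac{\langle\hat{K}_I\hat2\rangle^4}{\langle 1\hat2\rangle^2}\,\frac{\langle q|P_4|\hat{K}_I]^2}{\langle q\hat{K}_I\rangle^2}\right\}\;. \tag{11.129}$$
In this expression, the first term vanishes because
$$\frac{[\hat{K}_I 1]^4}{[1\hat2]^2} = \frac{[\hat{K}_I 1]^4\,\langle\hat{K}_I 1\rangle^4}{[1\hat2]^2\,\langle\hat{K}_I 1\rangle^4} = \frac{(2\,p_1\cdot\hat{p}_2)^4}{[1\hat2]^2\,\langle\hat{K}_I 1\rangle^4} = \frac{\langle 1\hat2\rangle^4\,[1\hat2]^2}{\langle\hat{K}_I 1\rangle^4} = 0\;. \tag{11.130}$$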
which gives the following extremely compact form for the amplitude:
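$$A_{\gamma\gamma\phi\phi}(1^+ 2^- 3^0 4^0) = \frac{\kappa^2}{4}\,\frac{\langle 2|P_4|1]^2}{\langle 12\rangle[12]}\;. \tag{11.133}$$
Cross section and deflection angle : Using $\langle 2|P_4|1]^\dagger = \langle 1|P_4|2]$, the modulus
square of the amplitude is
$$\Big|A_{\gamma\gamma\phi\phi}(1^+ 2^- 3^0 4^0)\Big|^2 = \frac{\kappa^4}{16}\,\frac{\langle 2|P_4|1]^2\,\langle 1|P_4|2]^2}{\langle 12\rangle^2\,[12]^2}\;. \tag{11.134}$$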
Note that
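$$\begin{aligned}
\langle 2|P_4|1]\,\langle 1|P_4|2] &= \langle 2|_a\,P_4^{ab}\,|1]_b\,\langle 1|_c\,P_4^{cd}\,|2]_d = P_4^{ab}\,|1]_b\langle 1|_c\,P_4^{cd}\,|2]_d\langle 2|_a\\
&= \mathrm{tr}\big(P_4P_1P_4P_2\big) = p_{4\mu}\,p_{1\nu}\,p_{4\rho}\,p_{2\sigma}\,\mathrm{tr}\big(\sigma^\mu\sigma^\nu\sigma^\rho\sigma^\sigma\big)\\
&= 2\,p_{4\mu}\,p_{1\nu}\,p_{4\rho}\,p_{2\sigma}\,\big(\eta^{\mu\nu}\eta^{\rho\sigma} - \eta^{\mu\rho}\eta^{\nu\sigma} + \eta^{\mu\sigma}\eta^{\nu\rho} - i\,\epsilon^{\mu\nu\rho\sigma}\big)\\
&= 2\,\big(2\,(p_1\cdot p_4)(p_2\cdot p_4) - p_4^2\,(p_1\cdot p_2)\big)\\
&= s_{13}\,s_{14} - m^4\;,
\end{aligned} \tag{11.135}$$
where we have introduced the Lorentz invariants $s_{ij}\equiv(p_i+p_j)^2$ and used $s_{24}=s_{13}$
and $s_{12}+s_{13}+s_{14} = 2\,m^2$ (both follow from momentum conservation). Therefore,
the squared amplitude reads
$$\Big|A_{\gamma\gamma\phi\phi}(1^+ 2^- 3^0 4^0)\Big|^2 = \frac{\kappa^4}{16}\,\frac{(s_{13}\,s_{14}-m^4)^2}{s_{12}^2}\;. \tag{11.136}$$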
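The last step of eq. (11.135) only relies on momentum conservation and on the mass-shell conditions, so it can be checked on explicit kinematics. The sketch below is an illustration under stated assumptions: it uses the centre-of-mass frame with signature (+,−,−,−), takes lines 1 and 4 as physically incoming and lines 2 and 3 as physically outgoing (with all momenta counted as incoming), and the function names and numerical values are hypothetical.

```python
import math

def mdot(p, q):
    """Minkowski product with signature (+,-,-,-)."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def add(p, q):
    """Component-wise sum of two 4-vectors."""
    return tuple(a + b for a, b in zip(p, q))

def all_incoming_momenta(m, omega, theta):
    """Photon 1 and scalar 4 incoming, photon 2 and scalar 3 outgoing (sign-flipped)."""
    energy = math.sqrt(m * m + omega * omega)
    p1 = (omega, 0.0, 0.0, omega)
    p4 = (energy, 0.0, 0.0, -omega)
    p2 = (-omega, -omega * math.sin(theta), 0.0, -omega * math.cos(theta))
    p3 = (-energy, omega * math.sin(theta), 0.0, omega * math.cos(theta))
    return p1, p2, p3, p4

m, omega, theta = 1.0, 0.3, 0.7          # arbitrary illustrative values
p1, p2, p3, p4 = all_incoming_momenta(m, omega, theta)

s12 = mdot(add(p1, p2), add(p1, p2))
s13 = mdot(add(p1, p3), add(p1, p3))
s14 = mdot(add(p1, p4), add(p1, p4))

trace_form = 2.0 * (2.0 * mdot(p1, p4) * mdot(p2, p4) - mdot(p4, p4) * mdot(p1, p2))
invariant_form = s13 * s14 - m ** 4
print("s12 + s13 + s14 - 2 m^2 =", s12 + s13 + s14 - 2.0 * m * m)
print("trace form:", trace_form, "invariant form:", invariant_form)
```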
The differential cross-section with respect to the solid angle of the outgoing photon is
given by
$$\frac{d\sigma}{d\Omega} = \frac{1}{64\pi^2\,s_{14}}\,\Big|A_{\gamma\gamma\phi\phi}(1^+ 2^- 3^0 4^0)\Big|^2\;. \tag{11.137}$$
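Let us now consider the limit of long wavelength photons, namely $\omega = |\boldsymbol{p}_{1,2}| \ll m$.
In this limit, the Lorentz invariants that appear in the cross-section simplify into$^{16}$
$$\begin{aligned}
s_{12} &\approx 4\,\omega^2\sin^2\frac{\theta}{2}\;,\\
s_{13} &\approx m^2 - 2\,m\,\omega - 4\,\omega^2\sin^2\frac{\theta}{2}\;,\\
s_{14} &\approx m^2 + 2\,m\,\omega\;,
\end{aligned} \tag{11.138}$$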
where ω is the photon energy and θ its deflection angle in the center of mass frame
(which is also the frame of the massive scalar particle in this limit). For large enough
impact parameters, the deflection angle is small, $\theta\ll1$. Thus we obtain in this limit
$$\frac{d\sigma}{d\Omega} \approx \frac{16\,G_N^2\,m^2}{\theta^4}\;. \tag{11.139}$$
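The approach to the limit (11.139) can be checked numerically from eqs. (11.136) and (11.137) with $\kappa^2=32\pi G_N$. The sketch below is only an illustration: the function names are hypothetical, and instead of the approximations (11.138) it uses exact centre-of-mass invariants in signature (+,−,−,−), an assumption of this example. In that convention $s_{12}$ is negative, which is harmless here because only $s_{12}^2$ enters the cross-section.

```python
import math

def cross_section(m, omega, theta, newton_g):
    """d(sigma)/d(Omega) from eqs. (11.136)-(11.137), with exact CM-frame invariants."""
    kappa2 = 32.0 * math.pi * newton_g                  # kappa^2 = 32 pi G_N
    energy = math.sqrt(m * m + omega * omega)
    s14 = (energy + omega) ** 2                         # total CM energy squared
    s12 = -2.0 * omega * omega * (1.0 - math.cos(theta))
    s13 = 2.0 * m * m - s12 - s14                       # from s12 + s13 + s14 = 2 m^2
    amp2 = kappa2 ** 2 * (s13 * s14 - m ** 4) ** 2 / (16.0 * s12 ** 2)
    return amp2 / (64.0 * math.pi ** 2 * s14)

def small_angle_limit(m, theta, newton_g):
    """Right hand side of eq. (11.139)."""
    return 16.0 * newton_g ** 2 * m ** 2 / theta ** 4

# Illustrative values in units where G_N = m = 1
ratio = cross_section(1.0, 1e-3, 1e-2, 1.0) / small_angle_limit(1.0, 1e-2, 1.0)
print("exact / limit =", ratio)
```

The ratio differs from 1 by corrections of order ω/m and θ², which disappear in the limit considered in the text.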
16 If we are only interested in the limit of small energy ω and small deflection angle θ, then the somewhat
complicated calculation of the numerator done in eq. (11.135) can be avoided. Indeed, in this limit, the
massive scalar is at rest and $P_4^{ab}\approx m\,\delta_{ab}$. Moreover, the 3-momenta of the incoming and outgoing
photons are nearly parallel to the z axis. This implies that $|1\rangle_a \approx |1]_a \approx -|2\rangle_a \approx -|2]_a \approx \big(\sqrt{2\omega},\,0\big)^T$,
and $\langle 2|P_4|1] \approx \langle 1|P_4|2] \approx -2\,m\,\omega$.
In order to determine the deflection angle as a function of the impact parameter b,
consider a flux F of photons along the z direction, with the massive scalar at rest at
the origin. Out of this flux, consider specifically the incoming photons in a ring of
radius b and width db. The number of photons flowing per unit time through this
ring is
2π b F db . (11.140)
All these photons are scattered in the range of polar angles [θ(b) + dθ, θ(b)] (note
that dθ is negative for db > 0, because the deflection angle decreases at larger b),
which corresponds to a solid angle:
dΩ = −2π sin θ(b) dθ . (11.141)
By definition, the number of scattering events is the flux times the cross-section, i.e.
$$2\pi\,b\,F\,db = F\,\frac{d\sigma}{d\Omega}\,d\Omega\;, \tag{11.142}$$
that can be integrated for small angles into
$$\theta(b) = \frac{4\,G_N\,m}{b}\;. \tag{11.143}$$
(The integration constant is chosen so that the deflection vanishes when b → ∞.)
This is indeed the standard formula from general relativity, that can be derived by
considering geodesics in the Schwarzschild metric.
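As an order-of-magnitude illustration of eq. (11.143), the sketch below evaluates the deflection of a light ray grazing the Sun. The numerical inputs are assumptions of this example, not values from the text: $G_N M_\odot/c^2\approx1476.6$ m and a solar radius of about $6.957\times10^8$ m, with the factor $c^2$ restored since the text uses natural units. A second helper checks that $\theta(b)$ indeed balances eq. (11.142) with the small-angle cross-section (11.139).

```python
import math

def deflection_angle(gm_over_c2, b):
    """theta(b) = 4 G_N m / b, eq. (11.143), with G_N m / c^2 and b in the same length unit."""
    return 4.0 * gm_over_c2 / b

def ring_balance(gm_over_c2, b):
    """Ratio of the two sides of eq. (11.142) at small angles, using eq. (11.139).

    Left side: b db.  Right side: -(16 (G m)^2 / theta^4) * theta * dtheta,
    with dtheta/db = -theta / b for theta = 4 G m / b.
    """
    theta = deflection_angle(gm_over_c2, b)
    dtheta_db = -theta / b
    rhs = -(16.0 * gm_over_c2 ** 2 / theta ** 4) * theta * dtheta_db
    return rhs / b

GM_SUN_OVER_C2_M = 1476.6    # metres (assumed illustrative input)
R_SUN_M = 6.957e8            # metres (assumed illustrative input)

theta_sun = deflection_angle(GM_SUN_OVER_C2_M, R_SUN_M)
arcsec = math.degrees(theta_sun) * 3600.0
print("deflection at the solar limb: %.3e rad = %.2f arcsec" % (theta_sun, arcsec))
print("ring balance (should be 1):", ring_balance(GM_SUN_OVER_C2_M, R_SUN_M))
```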
Let us now study the scattering amplitude between a scalar and a graviton, whose low
energy limit will provide us information about the scattering of a long wavelength
gravitational wave by a mass. A priori, each of the two gravitons may have a helicity
±2, but the cases {+2, +2} and {−2, −2} correspond to a helicity flip of the graviton,
which is suppressed at low frequency. Therefore, let us consider the amplitude
Ahhφφ (1−2 2+2 30 40 ). When writing the BCFW recursion for this amplitude, the
simplest shift is one that affects the lines 1 and 2, more specifically:
$$|\hat2] = |2]\;,\qquad |\hat2\rangle = |2\rangle - z\,|1\rangle\;,$$
$$|\hat1] = |1] + z\,|2]\;,\qquad |\hat1\rangle = |1\rangle\;, \tag{11.144}$$
Figure 11.8: BCFW shift for the calculation of the hhφφ amplitude.

Because the polarization vectors of the gravitons are squares of the spin-1 ones, this
shift can be proven to lead to a vanishing amplitude when $|z|\to\infty$ simply by power
counting. With the shift (11.144), the intermediate propagator carries a scalar, and
therefore it has only the $h=0$ helicity. The BCFW recursion formula contains two
terms,
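$$\begin{aligned}
A_{hh\phi\phi}(1^{-2} 2^{+2} 3^0 4^0) &= A_{h\phi\phi}(\hat1^{-2}\,4^0\,\hat{K}_{23}^0)\;\frac{i}{K_{23}^2-m^2}\;A_{h\phi\phi}(\hat2^{+2}\,3^0 -\!\hat{K}_{23}^0)\\
&\quad + A_{h\phi\phi}(\hat1^{-2}\,3^0\,\hat{K}_{24}^0)\;\frac{i}{K_{24}^2-m^2}\;A_{h\phi\phi}(\hat2^{+2}\,4^0 -\!\hat{K}_{24}^0)\;,
\end{aligned} \tag{11.145}$$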
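that differ by a permutation of the external scalars. In the above equation, we have
made explicit the intermediate momentum, $\hat{K}_{23}\equiv\hat{p}_2+p_3$ in the first term and
$\hat{K}_{24}\equiv\hat{p}_2+p_4$ in the second one. The explicit forms of the first and second terms are
$$\begin{aligned}
i\,\frac{A_{h\phi\phi}(\hat1^{-2}\,4^0\,\hat{K}_{23}^0)\,A_{h\phi\phi}(\hat2^{+2}\,3^0 -\!\hat{K}_{23}^0)}{K_{23}^2-m^2} &= \frac{-i\kappa^2}{4\,(K_{23}^2-m^2)}\,\frac{\langle\hat1|P_4|q]^2\,\langle q'|P_3|\hat2]^2}{[\hat1 q]^2\,\langle q'\hat2\rangle^2}\;,\\
i\,\frac{A_{h\phi\phi}(\hat1^{-2}\,3^0\,\hat{K}_{24}^0)\,A_{h\phi\phi}(\hat2^{+2}\,4^0 -\!\hat{K}_{24}^0)}{K_{24}^2-m^2} &= \frac{-i\kappa^2}{4\,(K_{24}^2-m^2)}\,\frac{\langle\hat1|P_3|q]^2\,\langle q'|P_4|\hat2]^2}{[\hat1 q]^2\,\langle q'\hat2\rangle^2}\;.
\end{aligned} \tag{11.146}$$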
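$$\begin{aligned}
A_{hh\phi\phi}(1^{-2} 2^{+2} 3^0 4^0) &= -i\,\frac{\kappa^2}{4}\,\frac{\langle 1|P_3|2]^4}{\langle 12\rangle^2\,[12]^2}\left[\frac{1}{K_{23}^2-m^2} + \frac{1}{K_{24}^2-m^2}\right]\\
&= i\,\frac{\kappa^2}{16}\,\frac{\langle 1|P_3|2]^4}{\langle 12\rangle\,[12]}\,\frac{1}{(p_2\cdot p_3)(p_2\cdot p_4)}\;.
\end{aligned} \tag{11.147}$$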
11. M ODERN TOOLS FOR TREE LEVEL AMPLITUDES 395
In the limit of a graviton of small energy (i.e. a gravitational wave of long wavelength)
and small deflection angle (i.e. at large impact parameter), the second factor in the
right hand side becomes equal to 1, and we have
2 2
Ahhφφ (1−2 2+2 30 40 ) ≈
ω≪m
Aγγφφ (1− 2+ 30 40 ) . (11.149)
θ≪1
This implies that in this limit the bending of a gravitational wave by a mass is the
same as the bending of a light ray (but there are some differences beyond this limit).
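As a cross-check of the second equality in eq. (11.147), the sum of the two scalar propagators can be worked out explicitly. The sketch below assumes massless gravitons ($p_1^2=p_2^2=0$), on-shell scalars ($p_3^2=p_4^2=m^2$), all momenta incoming, unshifted momenta $K_{23}=p_2+p_3$ and $K_{24}=p_2+p_4$ in the denominators, and the convention $\langle 12\rangle[12]=(p_1+p_2)^2$ used later in this chapter:

```latex
\begin{align*}
K_{23}^2 - m^2 &= 2\,p_2\cdot p_3\,, \qquad K_{24}^2 - m^2 = 2\,p_2\cdot p_4\,,\\
\frac{1}{K_{23}^2-m^2}+\frac{1}{K_{24}^2-m^2}
 &= \frac{p_2\cdot(p_3+p_4)}{2\,(p_2\cdot p_3)(p_2\cdot p_4)}
  = -\frac{p_1\cdot p_2}{2\,(p_2\cdot p_3)(p_2\cdot p_4)}
  = -\frac{\langle 12\rangle[12]}{4\,(p_2\cdot p_3)(p_2\cdot p_4)}\,,
\end{align*}
```

where momentum conservation, $p_3+p_4=-(p_1+p_2)$, was used in the second step. Multiplying by the prefactor $-i\kappa^2/4$ and by $\langle 1|P_3|2]^4/(\langle 12\rangle^2[12]^2)$ turns the coefficient into $+i\kappa^2/16$ and removes one power of $\langle 12\rangle[12]$ from the denominator.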
Our proof of the Parke-Taylor formula, based on BCFW recursion, is not faithful
to the actual chronology, since the formula was conjectured in 1986 and a proof
was found in 1988 using an off-shell recursion derived by Berends and Giele, well
before on-shell recursion. The Cachazo-Svrcek-Witten (CSW) rules, which also predate BCFW
recursion, provide a way to construct the tree-level non-MHV amplitudes by an
expansion in which the MHV ones play the role of vertices, as in the following
diagram:
[Diagram: two MHV vertices joined by one internal line.]
where the two “vertices” are + + −− 4-point MHV amplitudes, sewn together in
order to make a contribution to a non-MHV 6-point amplitude. In this section, we
will use ideas inspired by the derivation of on-shell recursion in order to establish
these rules.
Firstly, for energy-momentum conservation to hold in the vertices of such a
diagram, the intermediate propagator linking the two vertices must generically carry
an off-shell momentum. This means that in such a construction, we need first to
which gives
$$\langle P| = \frac{[\eta|P}{[\eta|P]}\,. \tag{11.153}$$
Note that this identity contains the angle spinor $\langle P|$ in the numerator and the square
spinor $|P]$ in the denominator, consistent with the fact that rescaling one with λ and
the other with λ⁻¹ gives an equally valid representation. For this reason we may
simply ignore the denominator and define
$$\langle P| \equiv [\eta|P\,. \tag{11.154}$$
In the following, we adopt this formula as the definition of the angle spinor associated
with the off-shell momentum $p^\mu$. Note that when $p^\mu$ is on-shell, then p = P, and the
angle spinor $\langle P|$ defined in eq. (11.154) is indeed proportional to the usual $\langle p|$.
Figure 11.9: Propagators affected by the shift of eq. (11.155) in a generic amplitude (the three negative-helicity external lines are labelled i⁻, j⁻ and k⁻).
while the corresponding angle spinors are left unchanged17 . The shifted external
momenta are defined as direct products of these shifted square spinors and the original
angle spinors, and are therefore still on-shell. Note also that
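$$|\hat\imath]\langle i| + |\hat\jmath]\langle j| + |\hat k]\langle k| = \underbrace{|i]\langle i| + |j]\langle j| + |k]\langle k|}_{0} \;+\; z\,|\eta]\,\underbrace{\Big(\langle jk\rangle\langle i| + \langle ki\rangle\langle j| + \langle ij\rangle\langle k|\Big)}_{0}\,. \tag{11.156}$$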
The first zero is due to momentum conservation in the unshifted amplitude, and
the second one follows from Schouten identity. Thus, the above shift preserves
momentum conservation.
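The Schouten identity invoked here can be checked numerically. The following is a minimal sketch, assuming that the spinor product is the antisymmetric combination ⟨ab⟩ = a₁b₂ − a₂b₁ of two-component spinors (the overall sign convention does not affect the identity); the helper names are illustrative only.

```python
import random

def angle(a, b):
    """Antisymmetric spinor product <ab> = a_1 b_2 - a_2 b_1 (assumed convention)."""
    return a[0] * b[1] - a[1] * b[0]

def schouten_combination(i, j, k):
    """Components of <jk> i + <ki> j + <ij> k, expected to vanish identically."""
    return tuple(
        angle(j, k) * i[c] + angle(k, i) * j[c] + angle(i, j) * k[c]
        for c in range(2)
    )

def random_spinor(rng):
    """A random complex two-component spinor."""
    return tuple(complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(2))

rng = random.Random(0)
i, j, k = (random_spinor(rng) for _ in range(3))
residual = max(abs(c) for c in schouten_combination(i, j, k))
print(residual)  # vanishes up to floating-point rounding
```

The cancellation holds for any three two-component objects because an antisymmetric bilinear form in two dimensions cannot support three linearly independent vectors.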
The propagators affected by the shift of eq. (11.155) form three lines starting at the
three external points of negative helicity, that meet somewhere inside the graph. Since
the shift modifies only the square spinors, the polarization vectors of negative helicity
scale as $z^{-1}$. Moreover, from figure 11.9, we see that there are p + 1 vertices and
p propagators along the affected lines. Even in the worst case where all these vertices
are 3-gluon vertices that scale like z, the overall scaling of the graph is bounded by
$z^{-3+(p+1)-p} \sim z^{-2}$, and therefore it goes to zero as |z| → ∞. Then, we proceed
as in the derivation of BCFW’s recursion formula, by integrating $A_n(\cdots;z)/z$ over
z on a circle of infinite radius. The behaviour of the deformed amplitude at large z
17 Recall that the usual BCFW shift acts only on a pair i, j of external lines, by shifting the angle spinor
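$$A_n(\cdots) = -\sum_{z_*\in\{\text{poles of }A_n\}} \mathrm{Res}\left.\frac{A_n(\cdots;z)}{z}\right|_{z_*} = \sum_{\substack{I,\,h=\pm\\ i\in A_L}} A_L(\cdots\,{-\hat K_I^{-h}};z_I)\;\frac{i}{K_I^2}\;A_R(\cdots\,\hat K_I^{h};z_I)\,, \tag{11.157}$$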
where we have only indicated the relevant helicity assignments (all the thin lines carry
positive helicities). Thus, as far as the helicities are concerned, the vertices in these
graphs are MHV amplitudes with exactly two negative indices.
Note that in eq. (11.158) the shift has no influence on the external lines because the
MHV amplitudes with two negative helicities depend only on the angle spinors. The
MHV vertices in this equation also depend on the angle spinor $\langle\hat K_I|$ that corresponds
to the on-shell (because we evaluate the amplitude at the value of z for which the
intermediate propagator is singular) shifted momentum. Consider for instance the
first diagram, obtained when the singular propagator is on the line that stems from the
external line i. In this case, we have
$$\hat K_I = |\hat K_I]\langle\hat K_I| = K_I + z_*\,\langle jk\rangle\,|\eta]\langle i|\,, \tag{11.159}$$
where $z_*$ is the value of z at the pole. By contracting this relation with $[\eta|$, we obtain
$$[\eta\,\hat K_I]\,\langle\hat K_I| = [\eta|K_I\,. \tag{11.160}$$
Thus, the angle spinor $\langle\hat K_I|$ in the MHV vertices resulting from the theorem of
residues is proportional to the off-shell extension $[\eta|K_I$ that we have proposed in the
previous subsection. Finally, note that the factors $[\eta\,\hat K_I]$ all cancel, because the line
of momentum $K_I$ has opposite helicities on either side of the propagator that links the
two MHV amplitudes¹⁸. Therefore, in the MHV diagrams of eq. (11.158), we do not
need to find the poles $z_*$ and we may directly evaluate the MHV amplitudes that play
the role of vertices with the off-shell angle spinor $[\eta|K_I$.
Let us summarize here the CSW rules for calculating amplitudes with exactly
three negative helicities:
• Start from the three skeleton diagrams of eq. (11.158), and interpret the vertices
as MHV amplitudes with one off-shell external leg. Note that the actual number
of MHV graphs depends on the number of positive helicity external lines, since
they may be attached to either of the two MHV vertices (provided we do not
change the cyclic ordering of the external lines, and that all the vertices obtained
in this way have at least three lines).
• The intermediate propagator is simply a scalar propagator i/K2I , with the value
of KI determined by momentum conservation at the vertices.
[Diagrams: two examples of MHV graphs with external lines 1 to 6 and an internal line I.]
18 Recall that under a rescaling by a factor λ of an angle spinor, the MHV amplitudes with exactly two
negative helicities scale by λ² if the scaling affects an external line of negative helicity, and by λ⁻² if it
affects an external line of positive helicity.
11.6.3 Examples
Four-point − − −+ amplitude: let us first use the CSW rules to evaluate the
$1^-2^-3^-4^+$ amplitude. Of course, we know beforehand that the result should be zero,
so this is no more than a trivial illustration of the rules. In this case, the CSW rules
give the following graphs:
$$A_4(1^-2^-3^-4^+) = [\text{sum of three MHV diagrams with external lines 1, 2, 3, 4}]\,, \tag{11.161}$$
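but the last one is trivially zero because one of the vertices (−−) does not exist. In
the first graph, the intermediate momentum (oriented from left to right) is $K_I = p_1 + p_4 = -(p_2 + p_3)$, and its angle spinor $\langle K_I| \equiv [\eta|K_I$ obeys
$$\langle K_I 1\rangle = -[\eta 4]\langle 14\rangle\,,\quad \langle K_I 2\rangle = [\eta 3]\langle 23\rangle\,,\quad \langle K_I 3\rangle = -[\eta 2]\langle 23\rangle\,,\quad \langle K_I 4\rangle = [\eta 1]\langle 14\rangle\,, \tag{11.162}$$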
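$$A_{4,1}(1^-2^-3^-4^+) = 2ig^2\,\frac{\langle 1K_I\rangle^3}{\langle K_I4\rangle\langle 41\rangle}\;\frac{1}{K_I^2}\;\frac{\langle 23\rangle^3}{\langle 3K_I\rangle\langle K_I2\rangle} = -2ig^2\,\frac{[\eta 4]^3}{[\eta 1][\eta 2][\eta 3]}\;\frac{\langle 14\rangle}{[23]}\,. \tag{11.163}$$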
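$$A_{4,2}(1^-2^-3^-4^+) = 2ig^2\,\frac{\langle 12\rangle^3}{\langle 2L_I\rangle\langle L_I1\rangle}\;\frac{1}{L_I^2}\;\frac{\langle L_I3\rangle^3}{\langle 34\rangle\langle 4L_I\rangle} = 2ig^2\,\frac{[\eta 4]^3}{[\eta 1][\eta 2][\eta 3]}\;\frac{\langle 34\rangle}{[12]}\,. \tag{11.165}$$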
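That the two non-zero graphs of eqs. (11.163) and (11.165) cancel can be seen from momentum conservation. The short derivation below is a sketch that assumes all momenta incoming, the matrix representation $\sum_k |k]\langle k| = 0$ of momentum conservation, and antisymmetric spinor products; contracting with $[2|$ on the left and $|4\rangle$ on the right leaves only the terms k = 1 and k = 3:

```latex
\begin{align*}
0 &= \sum_{k=1}^{4} [2k]\langle k4\rangle = [21]\langle 14\rangle + [23]\langle 34\rangle
\quad\Longrightarrow\quad \frac{\langle 14\rangle}{[23]} = \frac{\langle 34\rangle}{[12]}\,,\\
A_{4,1}+A_{4,2} &= \frac{2ig^2\,[\eta 4]^3}{[\eta 1][\eta 2][\eta 3]}
\left(-\frac{\langle 14\rangle}{[23]} + \frac{\langle 34\rangle}{[12]}\right) = 0\,.
\end{align*}
```

The dependence on the auxiliary spinor $|\eta]$ is the same in both graphs, so it drops out of the sum together with the kinematic factor.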
where the shaded ellipses indicate that the lines (4, 5) (in the left graph) or (5, 6) (in
the right graph) can either be attached to the left or right MHV vertex, provided the
cyclic order is not modified. Each term in eq. (11.167) therefore corresponds to three
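MHV graphs, i.e. a total of six¹⁹. For this amplitude, the CSW rules give:
$$A_6([123]^-[456]^+) = -4ig^4\left[\sum_{i=3}^{5}\frac{\langle 1K_i\rangle^3}{\langle K_i\,i{+}1\rangle\langle i{+}1\,i{+}2\rangle\cdots\langle 61\rangle}\;\frac{1}{K_i^2}\;\frac{\langle 23\rangle^3}{\langle K_i2\rangle\langle 34\rangle\cdots\langle iK_i\rangle} + \sum_{j=4}^{6}\frac{\langle 12\rangle^3}{\langle 2L_j\rangle\langle L_j\,j{+}1\rangle\cdots\langle 61\rangle}\;\frac{1}{L_j^2}\;\frac{\langle L_j3\rangle^3}{\langle 34\rangle\cdots\langle j{-}1\,j\rangle\langle jL_j\rangle}\right], \tag{11.168}$$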
Moreover, each of these colour-ordered graphs is considerably more complicated than the MHV diagrams.
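thanks to which many factors become identical in the two terms of eq. (11.168). With
this choice, the terms i = 4, 5 in the first line and j = 4, 5 in the second line combine
in the following compact expression:
$$A_{6,1} = -\frac{4ig^4}{\langle 34\rangle\langle 45\rangle\langle 56\rangle\langle 61\rangle}\sum_{i=4,5}\frac{\langle i\,i{+}1\rangle}{\langle K_i\,i\rangle\langle K_i\,i{+}1\rangle\langle K_i2\rangle}\left[\frac{\langle 23\rangle^3\langle K_i1\rangle^3}{K_i^2} + \frac{\langle 12\rangle^3\langle K_i3\rangle^3}{(K_i+p_2)^2}\right]. \tag{11.170}$$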
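The terms i = 3 in the first line and j = 6 in the second line of eq. (11.168) must be
handled more carefully. Indeed, they both contain a denominator that vanishes when
$|\eta] \equiv |2]$, due to a factor $[\eta 2]$. In order to calculate these terms, we must leave $|\eta]$
unspecified, but such that
$$[\eta 2] \ll [\eta j] \quad \text{for } j \neq 2\,, \tag{11.171}$$
and expand in powers of $[\eta 2]$. After simplifications involving the Schouten identity,
the sum of these two terms is found to be finite when $|\eta] \to |2]$ and equal to
$$A_{6,2} = \frac{4ig^4}{\langle 34\rangle\langle 45\rangle\langle 56\rangle\langle 61\rangle}\;\langle 13\rangle^2\left[\frac{s_{13}+2(s_{12}+s_{32})}{[12][32]} + \frac{\langle 12\rangle\langle 36\rangle}{[12]\langle 16\rangle} + \frac{\langle 23\rangle\langle 14\rangle}{[23]\langle 34\rangle}\right], \tag{11.172}$$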
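where we denote $s_{ij} \equiv \langle ij\rangle[ij] = (p_i + p_j)^2$. Combining all the contributions, the
6-point amplitude reads
$$A_6([123]^-[456]^+) = \frac{4ig^4}{\langle 34\rangle\langle 45\rangle\langle 56\rangle\langle 61\rangle}\Bigg\{-\sum_{i=4,5}\frac{\langle i\,i{+}1\rangle}{\langle K_i\,i\rangle\langle K_i\,i{+}1\rangle\langle K_i2\rangle}\left[\frac{\langle 23\rangle^3\langle K_i1\rangle^3}{K_i^2} + \frac{\langle 12\rangle^3\langle K_i3\rangle^3}{(K_i+p_2)^2}\right] + \langle 13\rangle^2\left[\frac{s_{13}+2(s_{12}+s_{32})}{[12][32]} + \frac{\langle 12\rangle\langle 36\rangle}{[12]\langle 16\rangle} + \frac{\langle 23\rangle\langle 14\rangle}{[23]\langle 34\rangle}\right]\Bigg\}\,. \tag{11.173}$$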
This is the simplest of the 6-point amplitudes with three negative helicities, because
the legs with negative helicities are adjacent. The non-adjacent cases lead to more
complex expressions, but nevertheless considerably simpler than what one would get
from traditional perturbation theory.
where the ri are coefficients chosen in such a way that momentum conservation is
preserved at any z. Namely, they must satisfy
$$\sum_{i\in N} r_i\,\langle i| = 0\,. \tag{11.175}$$
(We further assume that partial sums are not zero, so that the internal propagators
connected to the external negative helicities all carry a z-dependent momentum.)
When |z| → ∞, the shifted amplitude goes to zero like $|z|^{1-N}$ (graph by graph),
which ensures that there is no boundary term when we integrate over z on the circle
at infinity. The propagators that contribute poles to this integral are among those
represented in dark in the figure 11.10 (we may assume that they become singular
at distinct values of z, so that all the poles are simple). When we assign helicities to
these singular propagators, two cases may arise:
In these two cases, the theorem of residues divides the amplitude into left and right
on-shell sub-amplitudes that have at most N − 1 negative helicities, for which we
may use the CSW rules assumed to be valid by the induction hypothesis.
After the left and right sub-amplitudes have been replaced by using the CSW
rules proven for < N negative helicities, all the poles produce terms that correspond
to the same topology of MHV graph, but whose expressions differ because the value
of z∗ is different for each pole. How this sum of contributions produces the product
of denominators one would obtain by applying directly the CSW rules for amplitudes
with N negative helicities requires some clarification. Let us consider a graph with nI
internal shifted propagators. The application of the theorem of residues to the shifted
amplitude divided by z produces nI terms, whose sum corresponds to the following
combination of denominators:
$$D_{\rm poles} = \sum_{I=1}^{n_I}\frac{1}{K_I^2}\prod_{J\neq I}\frac{1}{\hat K_{J,I}^2}\,. \tag{11.176}$$
Each term in this sum corresponds to the vanishing of one of the shifted internal
propagators. The factor $K_I^2$ comes from the residue of the pole $z_I$ for which $\hat K_I^2 = 0$,
and $\hat K_{J,I}^2$ denotes the value taken by $\hat K_J^2$ at this $z_I$.
From eq. (11.174), we can write the SL(2, ℂ) matrices that represent the shifted
internal momenta as follows,
$$\hat K_I = K_I + z\,|\eta]\langle I|\,, \tag{11.177}$$
The last equality20 shows that the nI terms produced by the theorem of residues
combine into an expression which is nothing but the product of unshifted denominators
that would appear in the CSW rules. This completes the proof of the general CSW
diagrammatic rules:
20 This may be proven by integrating over a circle at infinity the following function
$$f(z) \equiv z^{-1}\prod_{I=1}^{n_I}\frac{1}{K_I^2 - z\,[\eta|K_I|I\rangle}\,.$$
• Draw all the diagrams with the required assignment of external helicities, and
such that all vertices have exactly two negative helicities. With N external
negative helicities, these graphs all have N − 1 vertices. For instance, the
$[1234]^-[5\cdots n]^+$ amplitude receives contributions from the following three
classes of MHV diagrams:
[Diagrams: three classes of MHV graphs, each with three MHV vertices, for the external lines 1⁻, 2⁻, 3⁻, 4⁻, 5⁺, ..., n⁺.]
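The way the pole contributions of eq. (11.176) recombine into a product of unshifted propagators can be checked numerically. The sketch below assumes, as in footnote 20, that each shifted denominator is linear in z, written here as K_I² − z a_I, where the hypothetical coefficients `a[I]` stand for the spinor contraction that multiplies z; the numerical values are arbitrary test inputs, not physical momenta.

```python
def d_poles(K2, a):
    """Sum over poles z_I = K_I^2 / a_I of 1/K_I^2 * prod_{J != I} 1/Khat^2_{J,I},
    with Khat^2_{J,I} = K_J^2 - z_I * a_J (shifted denominators assumed linear in z)."""
    total = 0.0
    for I in range(len(K2)):
        z_I = K2[I] / a[I]  # value of z at which the I-th shifted propagator vanishes
        term = 1.0 / K2[I]
        for J in range(len(K2)):
            if J != I:
                term /= K2[J] - z_I * a[J]
        total += term
    return total

def d_unshifted(K2):
    """Product of the unshifted denominators, prod_I 1/K_I^2."""
    product = 1.0
    for value in K2:
        product /= value
    return product

K2 = [1.0, 2.5, -0.7, 4.2]   # arbitrary unshifted K_I^2
a = [0.3, -1.1, 0.8, 2.0]    # arbitrary slopes, giving distinct poles
lhs = d_poles(K2, a)
rhs = d_unshifted(K2)
print(lhs, rhs)
```

The equality is the statement that the residues of f(z) at z = 0 and at the poles z_I sum to zero, since f(z) decays faster than 1/z at infinity.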
Worldline formalism
In the previous chapter, we have presented the spinor-helicity language, in which the
building blocks of scattering amplitudes are expressed in terms of 2-component
spinors. As we have seen, when combined with techniques such as on-shell recursion,
this leads to great simplifications in the evaluation of on-shell tree amplitudes with
physical polarizations. To a large extent, this simplification stems from the fact that the
calculation of amplitudes based on these methods bypasses the usual representation
in terms of Feynman diagrams.
In fact, the spinor-helicity method is not the only one that relegates Feynman
diagrams to a secondary role. Another approach, which we shall discuss in
this chapter, is the worldline formalism. The name comes from the fact that in this
approach, Feynman graphs are replaced by a representation in terms of a path integral
over a function $z^\mu(\tau)$ (plus additional auxiliary variables in the case of fields with
internal degrees of freedom, such as spin or colour), that defines a line embedded in
spacetime. This function can be viewed as a parameterization of the whole history
of a point-like particle. Historically, this method was first derived by starting from a
string theory and by taking the limit of infinite string tension. Subsequently, it was
rederived in a more mundane manner, in a first quantized framework. This is the point
of view that we shall adopt in this chapter.
Let us assume that we wish to obtain the tree-level propagator G(x, y) in a background
field ϕ. Up to an irrelevant factor i, this propagator is the inverse of the operator
$\Box + m^2 + V''(\varphi)$,
$$\big[\Box_x + m^2 + V''(\varphi(x))\big]\,G(x,y) = -i\,\delta(x-y)\,. \tag{12.2}$$
This equation must be supplemented by boundary conditions that depend on the type
of propagator one wishes to obtain (time-ordered, retarded, etc...). Formally, we may
write
$$\big[\Box + m^2 + V''(\varphi)\big]^{-1} = \int_0^\infty dT\; e^{-T(\Box+m^2+V''(\varphi))}\,. \tag{12.3}$$
(The integrand in this formula is sometimes called a heat kernel, by analogy with the
propagator of a heat equation.) However, for this integral to make sense in the limit
T → ∞, it is necessary that the eigenvalues of the operator $\Box + m^2 + V''(\varphi)$ be
positive. The high lying eigenvalues of this operator do not depend on the background
field ϕ(x) (assuming that it is smooth enough), and are of the form
$$-g^{\mu\nu}k_\mu k_\nu + m^2\,. \tag{12.4}$$
In order to be positive for any momentum $k_\mu$, it is therefore necessary that the metric
be Euclidean, with only minus signs. For this reason, we restrict our discussion to a
Euclidean field theory from now on, so that we may write $-g^{\mu\nu}k_\mu k_\nu = k_i k_i$.
The propagator G(x, y) is obtained by evaluating eq. (12.3) between states of definite
position:
$$G(x,y) = -i\int_0^\infty dT\;\big\langle y\big|\,e^{-T(\Box+m^2+V''(\varphi))}\,\big|x\big\rangle\,. \tag{12.5}$$
Such a matrix element is quite common in ordinary quantum mechanics, and its
representation as a path integral is well known. For a non-relativistic Hamiltonian of
the form
$$H \equiv \frac{P^2}{2M} + V(Q)\,, \tag{12.6}$$
we have
$$\big\langle y\big|\,e^{-i(t_1-t_0)H}\,\big|x\big\rangle = \int_{\substack{q(t_0)=x\\ q(t_1)=y}} Dq(t)\;\exp\left(i\int_{t_0}^{t_1}dt\,\Big(\frac{M}{2}\,\dot q^2(t) - V(q(t))\Big)\right). \tag{12.7}$$
Eq. (12.5) can be similarly expressed as a path integral, if we use the following
dictionary
$$i(t_1-t_0) \to T\,,\qquad it \to \tau\,,\qquad M \to \frac12\,,\qquad V(Q) \to m^2 + V''(\varphi(x))\,. \tag{12.8}$$
This leads to
$$G(x,y) = -i\int_0^\infty dT\int_{\substack{z(0)=x\\ z(T)=y}} Dz(\tau)\;\exp\left(-\int_0^T d\tau\,\Big(\frac14\,\dot z^2(\tau) + m^2 + V''(\varphi(z(\tau)))\Big)\right), \tag{12.9}$$
where the dot denotes a derivative with respect to τ. For simplicity, we denote the
integration variable z(τ) instead of $z^\mu(\tau)$, although it takes values in a d-dimensional
spacetime. This expression is known as the worldline representation of the tree
propagator in a background field. Very much like in ordinary quantum mechanics,
the function $z^\mu(\tau)$ explores all the paths that start at x and end at y. Note also that
the formula contains an integral over the “duration” (we use quotes here because τ is
not a physical time) of this evolution.
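The effect of the dictionary (12.8) on the exponent of eq. (12.7) can be made explicit. The following worked step is a sketch that sets $t_0 = 0$ and treats the substitution $it \to \tau$ as a formal change of variable, with $M = 1/2$ and with $V$ standing for $m^2 + V''(\varphi)$:

```latex
\begin{align*}
t = -i\tau \;&\Rightarrow\; dt = -i\,d\tau\,,\qquad \frac{dq}{dt} = i\,\frac{dz}{d\tau}\,,\\
i\int_{t_0}^{t_1}\! dt\,\Big(\frac{M}{2}\,\dot q^2 - V\Big)
&= i\,(-i)\int_0^T\! d\tau\,\Big(\frac{1}{4}\,(i\dot z)^2 - V\Big)
= -\int_0^T\! d\tau\,\Big(\frac{1}{4}\,\dot z^2 + V\Big)\,.
\end{align*}
```

The oscillating phase of the quantum mechanical path integral thus becomes the damped exponential of eq. (12.9), which is what makes the integral over T well defined for a positive operator.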
which highlights the connection that exists between the propagator associated to an
elliptic differential operator and diffusion (i.e. Brownian motion). By comparing
eqs. (12.9) (without external field) and (12.11), one can obtain the following formula
for the absolute normalization of the integral over closed loops in d dimensions
$$\int_{z(0)=z(T)} Dz(\tau)\;\exp\left(-\int_0^T d\tau\,\frac{\dot z^2}{4}\right) = \frac{1}{(4\pi T)^{d/2}} \underset{d=4}{=} \frac{1}{(4\pi T)^2}\,. \tag{12.13}$$
The next step is to note that such a Gaussian distribution may be written as the
convolution of two similar distributions defined on half the interval:
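$$\big\langle y\big|\,e^{-T(\Box+m^2)}\,\big|x\big\rangle = e^{-Tm^2}\int\frac{d^dz_1\,d^dz_2\cdots d^dz_{n-1}}{(4\pi\epsilon)^{nd/2}}\;\exp\left(-\frac{\epsilon}{4}\sum_{i=1}^{n}\frac{(z_i-z_{i-1})^2}{\epsilon^2}\right), \tag{12.15}$$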
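The convolution property that underlies this discretization can be checked numerically in one dimension. The sketch below assumes the free kernel exp(−x²/(4T))/√(4πT) that appears as each factor of eq. (12.15), and approximates the intermediate integral by a plain Riemann sum; the function names and the test values are illustrative only.

```python
import math

def heat_kernel(x, T):
    """One-dimensional free kernel exp(-x^2 / (4 T)) / sqrt(4 pi T)."""
    return math.exp(-x * x / (4.0 * T)) / math.sqrt(4.0 * math.pi * T)

def convolve(x, y, T1, T2, half_width=40.0, n=20001):
    """Riemann-sum approximation of  int dz  K_{T1}(y - z) K_{T2}(z - x)."""
    h = 2.0 * half_width / (n - 1)
    total = 0.0
    for k in range(n):
        z = -half_width + k * h
        total += heat_kernel(y - z, T1) * heat_kernel(z - x, T2)
    return total * h

x, y, T = 0.3, -1.2, 2.0
two_half_steps = convolve(x, y, T / 2, T / 2)  # two kernels on half the interval
one_full_step = heat_kernel(y - x, T)          # one kernel on the full interval
print(two_half_steps, one_full_step)
```

Iterating this splitting n − 1 times produces the n Gaussian factors and the n − 1 intermediate integrations of eq. (12.15).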
Taking into account the term V ′′ (ϕ) due to a background field poses no difficulty
if one breaks the interval [0, T ] into many small intervals. Indeed, even though
V ′′ (ϕ(x)) does not commute with $\Box_x$, the Baker-Campbell-Hausdorff formula
indicates that the exponential of their sum is equal to the product of their respective
exponentials, up to terms of higher order in ε = T/n, that do not matter in the limit
n → ∞.
Therefore, we obtain a path integral representation similar to eq. (12.9), but with a
path that starts and ends at the same point:
$$\Gamma[\varphi] = \text{const} + \frac12\int_0^\infty\frac{dT}{T}\int_{z(0)=z(T)} Dz(\tau)\;\exp\left(-\int_0^T d\tau\,\Big(\frac14\,\dot z^2(\tau) + m^2 + V''(\varphi(z(\tau)))\Big)\right). \tag{12.21}$$
(We have not written explicitly the term coming from the denominator $\Box + m^2$; it is
contained in the unspecified additive constant.) In this case, the worldlines are closed,
and therefore form loops in spacetime.
On dimensional grounds, one sees that the typical diameter of the loops¹ z(τ) that
appear in eq. (12.21) is
$$\Delta z \sim \sqrt{T}\,. \tag{12.23}$$
In contrast, the perimeter of these loops scales as T. These scaling laws are consistent
with a Brownian motion of duration T.
Figure 12.1: Typical worldloop that contributes in eq. (12.21). While its length
scales as T, its extent in spacetime only grows like $T^{1/2}$.
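These two scaling laws can be illustrated with a small Monte Carlo experiment. The sketch below is an illustration under stated assumptions, not a computation from the text: it samples one coordinate of a closed path as a discretized Brownian bridge whose steps have variance 2ε (the variance implied by the weight exp(−(Δz)²/(4ε))), keeps the step ε fixed, and compares the loops of duration T = 1 and T = 4. The function names, the step size and the sample count are arbitrary choices.

```python
import math
import random

def sample_loop(T, eps, rng):
    """One coordinate of a closed path z(0) = z(T) = 0, built as a Brownian bridge
    from a random walk with steps of variance 2 * eps."""
    n = round(T / eps)
    sigma = math.sqrt(2.0 * eps)
    walk = [0.0]
    for _ in range(n):
        walk.append(walk[-1] + rng.gauss(0.0, sigma))
    # Subtract the linear drift so that the path returns to its starting point.
    return [walk[k] - (k / n) * walk[n] for k in range(n + 1)]

def loop_statistics(T, eps=0.01, samples=2000, seed=1):
    """Return (rms displacement at mid-time, mean length) over sampled loops."""
    rng = random.Random(seed)
    sum_mid_squared = 0.0
    sum_length = 0.0
    for _ in range(samples):
        z = sample_loop(T, eps, rng)
        sum_mid_squared += z[len(z) // 2] ** 2
        sum_length += sum(abs(z[k + 1] - z[k]) for k in range(len(z) - 1))
    return math.sqrt(sum_mid_squared / samples), sum_length / samples

extent_1, length_1 = loop_statistics(1.0)
extent_4, length_4 = loop_statistics(4.0)
print("extent ratio:", extent_4 / extent_1)  # expected close to sqrt(4) = 2
print("length ratio:", length_4 / length_1)  # expected close to 4
```

At fixed step size the length grows linearly with the number of steps, while the typical excursion away from the base point only grows like the square root of the duration.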
Thus, the gauge transformation modifies this term by the addition of a total derivative
with respect to τ, whose integral is
$$\int_0^T d\tau\;\partial_\tau\chi(z(\tau)) = \chi(z(T)) - \chi(z(0))\,. \tag{12.30}$$
In the calculation of the one-loop effective action, the trajectories z(τ) have equal
initial and final points, and this shift is therefore zero. Thus, the expression (12.26)
of the one-loop effective action is explicitly gauge invariant. If we were considering
instead the scalar propagator G(x, y), this term would be
$$\int_0^T d\tau\;\partial_\tau\chi(z(\tau)) = \chi(y) - \chi(x)\,, \tag{12.31}$$
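which leads to
$$\Gamma[A] \equiv \frac12\,\mathrm{Tr}\,\ln\left[\frac{\slashed{D}^2 + m^2}{\slashed{\partial}^2 + m^2}\right]. \tag{12.35}$$
$$\begin{aligned}
\slashed{D}^2 &= D_\mu D_\nu\,\gamma^\mu\gamma^\nu\\
&= D_\mu D_\nu\left(\tfrac12\{\gamma^\mu,\gamma^\nu\} + \tfrac12[\gamma^\mu,\gamma^\nu]\right)\\
&= -D^2 + \tfrac14\,[D_\mu,D_\nu]\,[\gamma^\mu,\gamma^\nu]\\
&= -D^2 - e\,F_{\mu\nu}M^{\mu\nu}\,,
\end{aligned} \tag{12.36}$$
where $M^{\mu\nu} \equiv \frac{i}{4}[\gamma^\mu,\gamma^\nu]$. (We have assumed a Euclidean metric tensor with only
minus signs in the 3rd and 4th lines.) This gives the following representation of the
one-loop effective action:
$$\Gamma[A] = \text{const} - \frac12\int_0^\infty\frac{dT}{T}\;\mathrm{tr}\;e^{-T\left(m^2 - D^2 - e\,F_{\mu\nu}M^{\mu\nu}\right)}\,. \tag{12.37}$$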
The term $m^2 - D^2$, identical to the operator encountered in the case of scalar QED,
is now supplemented by a potential
$$U(x) \equiv -e\,F_{\mu\nu}(x)\,M^{\mu\nu}\,. \tag{12.38}$$
However, because U(x) still contains non-commuting Dirac matrices, the worldline
representation of the exponential is now more complicated, and the overall trace
applies both to the spacetime dependence and to the Dirac indices. A first possibility
is to reproduce the method used in the previous sections, where we introduce a path
integral over classical trajectories $z^\mu(\tau)$. When doing this, the Dirac matrix structure
inside the exponential is not altered, and is handled by a path ordering:
$$\big\langle x\big|\,e^{-T\left(m^2 - D^2 - e\,F_{\mu\nu}M^{\mu\nu}\right)}\,\big|x\big\rangle = \int_{\substack{z(0)=x\\ z(T)=x}} Dz(\tau)\;\mathrm{P}\,\exp\left(-\int_0^T d\tau\,\Big(\frac14\,\dot z^2(\tau) + ie\,\dot z(\tau)\cdot A(z(\tau)) + m^2 + U(z(\tau))\Big)\right). \tag{12.39}$$
But it is in fact possible to remove the path ordering by introducing some auxiliary
variables. In the procedure that leads to eq. (12.39), one breaks the interval [0, T ]
into infinitesimal sub-intervals and one inserts a complete sum of states between each
factor. When the evolution operator to be evaluated contains extra internal degrees
of freedom (in the present case, the spin degree of freedom encoded in the Dirac
matrices), the intermediate states inserted in the expression must contain information
about this internal structure, for the matrix elements produced in the process to be
c-numbers.
$$c^-_{1,2}\,\big|0\big\rangle = 0\,, \tag{12.43}$$
As with bosonic coherent states, they are eigenstates of the annihilation operators,
$$c_i^-\,\big|\xi\big\rangle = \xi_i\,\big|\xi\big\rangle\,,\qquad \big\langle\bar\xi\big|\,c_i^+ = \bar\xi_i\,\big\langle\bar\xi\big|\,. \tag{12.47}$$
$$\big\langle\bar\xi\big|\zeta\big\rangle = e^{\bar\xi\zeta}\,, \tag{12.48}$$
and one may construct the identity operator as a superposition of projectors on these
coherent states:
$$1 = \int\underbrace{d\bar\xi_1\,d\xi_1\,d\bar\xi_2\,d\xi_2}_{\equiv\; d\bar\xi\,d\xi}\;e^{-\bar\xi\xi}\;\big|\xi\big\rangle\big\langle\bar\xi\big|\,. \tag{12.49}$$
and the trace over the Dirac indices of an operator A may be written as
$$\mathrm{tr}\,A = \int d\bar\xi\,d\xi\;e^{-\bar\xi\xi}\;\big\langle{-\bar\xi}\big|\,A\,\big|\xi\big\rangle\,. \tag{12.51}$$
(One may easily check that this gives 4 when A is the identity.) Note that in the
calculation of this trace, the coherent states that appear on the left and on the right
are defined with opposite Grassmann variables ξ and −ξ. This is a standard property
of fermionic traces, whose path integral representation must obey an anti-periodic
boundary condition in time.
This formalism can be used to transform the Dirac structure in eq. (12.39) into
a Grassmann path integral. To achieve this, we follow the standard procedure of
breaking the interval [0, T ] into N small sub-intervals, and we insert a unit operator
given by eq. (12.49) at the boundaries of the sub-intervals. This produces matrix
elements of the form
$$\big\langle\bar\xi_{i+1}\big|\,e^{-\epsilon\,e\,F_{\mu\nu}(z(\tau_i))\,M^{\mu\nu}}\,\big|\xi_i\big\rangle\,, \tag{12.52}$$
(ε ≡ T/N) that may be evaluated by replacing the Dirac matrices in $M^{\mu\nu}$ by their
expression in terms of the operators $c^\pm_{1,2}$ and by using the properties of fermionic
coherent states. This leads to the following worldline representation for the one-loop
effective action in QED, with a spin 1/2 field in the loop:
$$\Gamma[A] = \text{const} - \frac12\int_0^\infty\frac{dT}{T}\int_{\substack{z(0)=z(T)\\ \psi(0)=-\psi(T)}} Dz(\tau)\,D\psi(\tau)\;\exp\left(-\int_0^T d\tau\,\Big(\frac14\,\dot z^2 + ie\,\dot z\cdot A(z) + m^2 + \frac12\,\psi_\mu\dot\psi_\mu - ie\,\psi_\mu F_{\mu\nu}(z)\,\psi_\nu\Big)\right), \tag{12.53}$$
where $\psi_\mu$ is a collection of four Grassmann variables that combine the $\xi_{1,2}$, $\bar\xi_{1,2}$ at
each intermediate time. In this formula, the ordering that was necessary to handle
the non-commutative nature of the Dirac matrices has now been replaced by a path
integral over fermionic internal degrees of freedom.
from the Dirac sea can move to the band of free particles by a tunneling process, that
does not require any energy. Standard results of quantum mechanics indicate that the
tunneling probability should behave as $\exp(-\text{const}\times m^2/(eE))$. This expression is
non-analytic in the coupling constant e, making it impossible to obtain in a standard
perturbative expansion.
Although the Schwinger mechanism was computed a long time ago by resummed
perturbation theory, the worldline formalism provides a straightforward way to cal-
culate it and offers very interesting new insights about the space-time development
of the particle production process. Let us consider the case of scalar QED in order
to illustrate this in a simpler setting. The probability of pair production may be
inferred from the vacuum-to-vacuum transition amplitude, that can be written as an
exponential,
$i\,V$ being the sum of all the connected vacuum diagrams. The possibility of particle
production is intimately related to the imaginary part of V, since the total probability
of producing particles reads
$$P_{\rm prod} = 1 - \big|\big\langle 0_{\rm out}\big|0_{\rm in}\big\rangle\big|^2 = 1 - e^{-2\,\mathrm{Im}\,V}\,. \tag{12.55}$$
In scalar QED, the graphs made of one scalar loop embedded in a background
electromagnetic field lead to the following contribution to V,
$$V_{1\,\rm loop} = \ln\det\big(g^{\mu\nu}D_\mu D_\nu + m^2\big)\,, \tag{12.56}$$
where $D_\mu$ is the covariant derivative in the background field. The metric should
be Euclidean in order to apply the worldline formalism, i.e. $g^{\mu\nu} = -\delta^{\mu\nu}$. At
one-loop, the sum of the connected vacuum graphs in the presence of a background
electromagnetic field is given by
$$V_{1\,\rm loop} = \int_0^\infty\frac{dT}{T}\,e^{-m^2T}\int_{z(0)=z(T)} Dz(\tau)\;\exp\left(-\int_0^T d\tau\,\Big(\frac14\,\dot z^2(\tau) + ie\,\dot z(\tau)\cdot A(z(\tau))\Big)\right). \tag{12.57}$$
This formula involves a double integration: a path integral over all the worldlines
z(τ), i.e. closed paths in Euclidean space-time parameterized by the fictitious time
τ ∈ [0, T ], and an ordinary integral over the length T of these paths. The sum over
all the worldlines can be viewed as a materialization of the quantum fluctuations in
space-time, and the prefactor $\exp(-m^2T)$ suppresses the very long worldlines that
explore regions of space-time that are much larger than the Compton wavelength of
the particles.
In eq. (12.57), the path integral can be factored into an integral over the barycenter
Z of the worldline and the position ζ(τ) about this barycenter,
$$z(\tau) \equiv Z + \zeta(\tau)\,,\qquad \int_0^T d\tau\;\zeta(\tau) = 0\,. \tag{12.58}$$
After this separation, all the information about the background field contained in
eq. (12.57) comes via a Wilson line,
$$W_Z[\zeta] \equiv \exp\left(-ie\int_0^T d\tau\;\dot\zeta(\tau)\cdot A(Z+\zeta(\tau))\right), \tag{12.59}$$
This path average is dominated by an ensemble of loops localized around the barycenter
Z, and $\langle W_Z\rangle_T$ encapsulates the local properties of the quantum field theory in
the vicinity of Z (roughly up to a distance of order $T^{1/2}$). In terms of this averaged
Wilson loop, the 1-loop Euclidean connected vacuum amplitude reads
$$V_{1\,\rm loop} = \frac{1}{(4\pi)^2}\int d^4Z\int_0^\infty\frac{dT}{T^3}\;e^{-m^2T}\,\langle W_Z\rangle_T\,. \tag{12.61}$$
(In this formula, the prefactor and power of T in the measure assume 4 spacetime
dimensions.) The imaginary part of $V_{1\,\rm loop}$ comes from the existence of poles in
$\langle W_Z\rangle_T$ at real values of the fictitious time T. In terms of these poles, the imaginary
part can be written as
$$\mathrm{Im}\,(V_{1\,\rm loop}) = \frac{\pi}{(4\pi)^2}\int d^4Z\sum_{\text{poles }T_n}\frac{e^{-m^2T_n}}{T_n^3}\;\mathrm{Re}\,\mathrm{Res}\big(\langle W_Z\rangle_T,\,T_n\big)\,. \tag{12.62}$$
Let us now be more specific and consider a static and uniform electric field E.
Since one can choose a gauge potential which is linear in the coordinates Z, ζ, the
path integral that gives the average Wilson loop is Gaussian and can therefore be
performed in closed form, leading to
$$\langle W_Z\rangle_T = \frac{eET}{\sin(eET)}\,. \tag{12.63}$$
(Note that it does not depend on the barycenter Z since the field is constant.) This
quantity has an infinite series of simple poles along the positive real axis, located at
$T_n = n\pi/(eE)$ (n = 1, 2, 3, · · · ), that give the following expression for the imaginary
part:
$$\mathrm{Im}\,(V_{1\,\rm loop}) = \frac{V_4}{16\pi^3}\,(eE)^2\sum_{n=1}^{\infty}\frac{(-1)^{n-1}}{n^2}\;e^{-n\pi m^2/(eE)}\,. \tag{12.64}$$
In this formula, $V_4$ is the volume in space-time over which the integration over the
barycenter Z is carried out. After exponentiation, this formula gives the vacuum
survival probability $P_0 = \exp(-2\,\mathrm{Im}\,V)$. A more detailed study would reveal that
the term of index n comes from Bose-Einstein correlations among n produced pairs,
while the first pole $T_1$ only contains information about the uncorrelated part of the
spectrum. Given the origin of these terms in the present derivation, as coming from
poles $T_n$ that are more distant from T = 0, we see that increasingly intricate (the index
n is the number of correlated particles) quantum correlations come from worldlines
that explore larger and larger portions of space-time. This supports the intuitive image
that quantum fluctuations and correlations are encoded in the fact that the worldlines
explore an extended region around the base point Z.
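The series of eq. (12.64) is easy to evaluate numerically, which makes the relative weight of the successive poles explicit. The following sketch computes Im V per unit space-time volume in units where the scalar mass is m = 1; the function names and the truncation `n_max` are choices made for this illustration.

```python
import math

def im_v_density(eE, m, n_max=200):
    """Im(V_1loop) / V_4 from eq. (12.64), with the sum truncated at n_max terms."""
    total = 0.0
    for n in range(1, n_max + 1):
        total += (-1) ** (n - 1) / n ** 2 * math.exp(-n * math.pi * m * m / eE)
    return eE ** 2 / (16.0 * math.pi ** 3) * total

def leading_term(eE, m):
    """Contribution of the first pole T_1 = pi / (eE) alone."""
    return eE ** 2 / (16.0 * math.pi ** 3) * math.exp(-math.pi * m * m / eE)

for eE in (0.1, 0.5, 1.0, 5.0):
    full = im_v_density(eE, 1.0)
    print(eE, full, full / leading_term(eE, 1.0))
```

For eE much smaller than m² the first pole dominates and the result is exponentially small, with no expansion in powers of e. The poles with n > 1 only become comparable when eE is of order m² or larger.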
The worldline formalism can also be used in order to derive expressions for one-loop
amplitudes. The main difference, compared to the calculation of the Schwinger
mechanism, is that in the case of amplitudes the momenta carried by the lines attached
to the loop are fixed instead of integrated over. The expected result is therefore a
function of N momenta (or coordinates), rather than just a number.
As in the previous section, we first split z(τ) into the barycenter and a deviation
about it:
$$z(\tau) \equiv Z + \zeta(\tau)\,. \tag{12.66}$$
In the case of amplitudes, the integration over Z will simply produce the delta function
of overall energy-momentum conservation. Using the T-periodicity of the paths over
which we integrate, the term in $\dot\zeta^2$ inside the exponential can be integrated by parts,
$$-\frac14\int_0^T d\tau\;\dot\zeta^2 = \frac14\int_0^T d\tau\;\zeta\,\ddot\zeta\,, \tag{12.67}$$
and the path integral on ζ(τ) involves the inverse G(τ, τ ′) of the operator $\frac12\partial_\tau^2$,
defined by
$$\partial_\tau^2\,G(\tau,\tau') = 2\,\delta(\tau-\tau')\,. \tag{12.68}$$
This inverse exists thanks to the fact that we have removed the barycenter from z(τ),
which amounts to removing the zero mode from ζ(τ). Indeed, a general T-periodic
function can be written as
$$\zeta(\tau) = \sum_{n\in\mathbb{Z}}\zeta_n\;e^{2i\pi n\frac{\tau}{T}}\,, \tag{12.69}$$
Using this formula, we can check that the propagator G(τ, τ ′) is given by
$$G(\tau,\tau') = 2T\sum_{n\in\mathbb{Z}^*}\frac{1}{(2i\pi n)^2}\;e^{2i\pi n\frac{(\tau-\tau')}{T}}\,. \tag{12.71}$$
Note that this function is even in τ − τ ′ and T-periodic. Integrating² eq. (12.70) twice
from 0 to τ − τ ′, we obtain
$$G(\tau,\tau') = |\tau-\tau'| - \frac{(\tau-\tau')^2}{T} - \frac{T}{6}\,. \tag{12.72}$$
2 We adopt a symmetric convention for handling the delta function δ(τ), which amounts to
$\int_0^\tau d\tau'\,\delta(\tau') = \frac12\,\theta(\tau)$.
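The agreement between the mode sum (12.71) and the closed form (12.72) can be verified numerically. In the sketch below the sum over non-zero integers is folded into a cosine series, using (2iπn)² = −4π²n², and truncated at `n_max` terms; the function names and the test values are illustrative.

```python
import math

def g_series(u, T, n_max=20000):
    """Truncated mode sum of eq. (12.71) as a function of u = tau - tau':
    2T * sum_{n != 0} exp(2 i pi n u / T) / (2 i pi n)^2
      = -(T / pi^2) * sum_{n >= 1} cos(2 pi n u / T) / n^2."""
    s = sum(math.cos(2.0 * math.pi * n * u / T) / n ** 2 for n in range(1, n_max + 1))
    return -T / math.pi ** 2 * s

def g_closed(u, T):
    """Closed form of eq. (12.72), valid for |u| <= T."""
    return abs(u) - u * u / T - T / 6.0

T = 3.0
for u in (0.0, 0.4, 1.5, -2.2):
    print(u, g_series(u, T), g_closed(u, T))
```

The truncation error of the series is of order T/(π² n_max), so the two columns agree to about five decimal places with the default setting. At u = 0 both expressions reduce to −T/6, the constant that later drops out by momentum conservation.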
From the quantum effective action, one-particle irreducible amplitudes are ob-
tained by differentiating with respect to the field, as many times as there are external
legs, and by setting ϕ ≡ 0 afterwards. Thus, the N-point function is given by
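$$\Gamma_N(x_1,\cdots,x_N) = \frac{(-\lambda)^N}{2}\int_0^\infty\frac{dT}{T}\,e^{-m^2T}\int_{z(0)=z(T)} Dz(\tau)\;\prod_{i=1}^{N}\int_0^T d\tau_i\;\delta(z(\tau_i)-x_i)\;\exp\left(-\int_0^T d\tau\,\frac14\,\dot z^2\right). \tag{12.73}$$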
In this formula, the path integral is over all closed paths that pass at all the coordinates
x1 , · · · , xN , in any order, which provides a rather intuitive picture of the worldline
representation of the amplitude. Let us now Fourier transform this expression in order
to obtain the amplitude in momentum space. The Fourier integrals over the xi are
trivial thanks to the N delta functions, and we obtain
$$\Gamma_N(p_1,\cdots,p_N) = \frac{(-\lambda)^N}{2}\int_0^\infty\frac{dT}{T}\,e^{-m^2T}\int d^dZ\int_{\zeta(0)=\zeta(T)} D\zeta(\tau)\;\prod_{i=1}^{N}\int_0^T d\tau_i\;e^{ip_i\cdot(Z+\zeta(\tau_i))}\;\exp\left(\int_0^T d\tau\,\frac14\,\zeta\ddot\zeta\right), \tag{12.74}$$
where we also have separated the barycenter coordinate Z from the deviation ζ and
integrated by parts the term in $\dot\zeta^2$. The integral over Z produces a delta function of
the sum of the momenta, and the path integral over ζ is Gaussian, leading to
$$\Gamma_N(p_1,\cdots,p_N) = \frac{(-\lambda)^N}{2^{1+d/2}}\,(2\pi)^{d/2}\,\delta\Big(\sum_i p_i\Big)\int_0^\infty\frac{dT}{T^{1+d/2}}\;e^{-m^2T}\int_0^T\prod_{i=1}^{N}d\tau_i\;\exp\left(\frac12\sum_{i,j}G(\tau_i,\tau_j)\,(p_i\cdot p_j)\right). \tag{12.75}$$
This is the worldline expression of a one-loop N-point scalar amplitude. One may
make a number of remarks about this formula:
• The integral on T may be divergent at small T, because of the factor $1/T^{1+d/2}$.
However, the integrals of the second line roughly behave as $T^N$ (since there
are N integrals over an interval of size T, with an integrand of order one).
Thus, the overall behaviour of the T integral is $dT\,T^{N-1-d/2}$. This integral is
convergent if N − 1 − d/2 > −1, i.e. N > d/2. In four spacetime dimensions,
this is N > 2, in agreement with conventional power counting that indicates
that all one-loop functions with N ≥ 3 are finite in the φ³ scalar theory.
• The constant term −T/6 in the propagator of eq. (12.72) does not contribute in
eq. (12.75). Indeed, its contribution inside the exponential is
$$-\frac{T}{12}\sum_{i,j}p_i\cdot p_j = -\frac{T}{12}\Big(\sum_i p_i\Big)\cdot\Big(\sum_j p_j\Big) = 0\,, \tag{12.76}$$
As a slightly more complicated example of application, let us now derive the expres-
sion of the one-loop N-photon amplitude in scalar QED. The starting point is the
one-loop quantum effective action in an Abelian background gauge field,
\[
\Gamma[A] = \mathrm{const} + \int_0^\infty \frac{dT}{T}\, e^{-m^2 T} \int_{z(0)=z(T)} Dz(\tau)\;
e^{-\int_0^T d\tau\, \left(\frac{1}{4}\dot z^2 + i e\, \dot z\cdot A(z)\right)} .
\tag{12.77}
\]
3 Note that there are only N − 1 independent Feynman parameters, since their sum is constrained to be
one, but because of the periodicity and translation invariance of the propagators G(τi , τj ), it is possible to
choose one of the τi ’s to be equal to zero, hence only N − 1 of them are truly independent.
Differentiating N times with respect to Aµ (x) and setting the background field to
zero afterwards, we obtain
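\[
\Gamma_N^{\mu_1\cdots\mu_N}(x_1,\cdots,x_N) = (-ie)^N \int_0^\infty \frac{dT}{T}\, e^{-m^2 T} \int_{z(0)=z(T)} Dz(\tau)\; e^{-\int_0^T d\tau\, \frac{1}{4}\dot z^2}
\int_0^T \prod_{i=1}^N d\tau_i\; \delta(z(\tau_i)-x_i)\; \dot z^{\mu_i}(\tau_i) .
\tag{12.78}
\]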
Next, we Fourier transform this expression and contract a polarization vector to each
external Lorentz index, and we isolate the integral over the barycenter Z,
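\[
\Gamma_N(p_1\epsilon_1,\cdots,p_N\epsilon_N) = (-ie)^N\, (2\pi)^d\, \delta\Big(\sum_i p_i\Big) \int_0^\infty \frac{dT}{T}\, e^{-m^2 T} \int_{\zeta(0)=\zeta(T)} D\zeta(\tau)\;
e^{\int_0^T d\tau\, \frac{1}{4}\zeta\ddot\zeta} \int_0^T \prod_{i=1}^N d\tau_i\; e^{i p_i\cdot\zeta(\tau_i)}\; \dot\zeta(\tau_i)\cdot\epsilon_i .
\tag{12.79}
\]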
The path integral is still Gaussian, but the factors $\dot\zeta(\tau_i)\cdot\epsilon_i$ complicate it significantly
compared to the $\phi^3$ theory. In particular, the answer will now contain derivatives of
the propagator
\[
\dot G(\tau,\tau') \equiv \partial_\tau G(\tau,\tau') = \mathrm{sign}(\tau-\tau') - \frac{2\,(\tau-\tau')}{T} ,
\qquad
\ddot G(\tau,\tau') \equiv \partial_\tau^2 G(\tau,\tau') = 2\,\delta(\tau-\tau') - \frac{2}{T} .
\tag{12.80}
\]
Note that the first derivative of the propagator with respect to the second time is the
opposite of the $\dot G$ defined above, since the propagator G is even. Note again that the
term −T/6 in this propagator will not contribute, thanks to momentum conservation.
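As a sanity check of eq. (12.80), the following sketch compares the closed-form first derivative with a finite difference. It assumes the propagator $G(\tau,\tau') = |\tau-\tau'| - (\tau-\tau')^2/T - T/6$, a form that is not restated in this passage but is consistent with eqs. (12.80) and (12.92); the function names are ours and purely illustrative.

```python
def G(tau, taup, T):
    """Assumed worldline propagator on a circle of circumference T."""
    u = tau - taup
    return abs(u) - u * u / T - T / 6.0


def G_dot(tau, taup, T):
    """Closed form of eq. (12.80): derivative with respect to the first time."""
    u = tau - taup
    sign = (u > 0) - (u < 0)
    return sign - 2.0 * u / T


def numerical_G_dot(tau, taup, T, h=1e-6):
    """Centered finite difference in the first time (needs |tau - taup| > h)."""
    return (G(tau + h, taup, T) - G(tau - h, taup, T)) / (2.0 * h)


def numerical_G_dot_second_time(tau, taup, T, h=1e-6):
    """Centered finite difference in the second time (needs |tau - taup| > h)."""
    return (G(tau, taup + h, T) - G(tau, taup - h, T)) / (2.0 * h)
```

Away from coincident times, the two finite differences should reproduce $\dot G$ and $-\dot G$ respectively, which is the sign flip mentioned above.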
A convenient trick to perform this integral is to write
\[
\prod_i \dot\zeta(\tau_i)\cdot\epsilon_i = \exp\Big(\sum_i \dot\zeta(\tau_i)\cdot\epsilon_i\Big)\Big|_{\text{multi-linear}} ,
\tag{12.81}
\]
where the subscript “multi-linear” means that we keep only the term in $\epsilon_1\epsilon_2\cdots\epsilon_N$
in the Taylor expansion of the exponential. This leads to
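\[
\Gamma_N(p_1\epsilon_1,\cdots,p_N\epsilon_N) = \frac{(-ie)^N}{2^{d/2}}\,(2\pi)^{d/2}\,\delta\Big(\sum_i p_i\Big) \int_0^\infty \frac{dT}{T^{1+d/2}}\, e^{-m^2 T}
\int_0^T \prod_{i=1}^N d\tau_i\; \exp\Big\{\sum_{i,j}\Big[\frac{1}{2}\,G(\tau_i,\tau_j)\,(p_i\cdot p_j)
+ i\,\dot G(\tau_i,\tau_j)\,(p_i\cdot\epsilon_j) + \frac{1}{2}\,\ddot G(\tau_i,\tau_j)\,(\epsilon_i\cdot\epsilon_j)\Big]\Big\}\Big|_{\text{multi-linear}} .
\tag{12.82}
\]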
The expansion of the exponential and extraction of the term that contains each
polarization vector exactly once leads to an expression of the form
\[
\exp\big\{\cdots\big\}\Big|_{\text{multi-linear}} = P_N(\dot G,\ddot G)\; \exp\Big(\frac{1}{2}\sum_{i,j} G(\tau_i,\tau_j)\,(p_i\cdot p_j)\Big) ,
\tag{12.83}
\]
Now, we have a second path integral, that involves the anti-periodic Grassmann
variables ψµ . This additional integral is also Gaussian, and its result can be expressed
in terms of the inverse of the operator $\frac{1}{2}\partial_\tau$ over the space of anti-periodic functions⁴,
whose expression reads
\[
S(\tau,\tau') = 2\sum_{n\in\mathbb{Z}} \frac{1}{2i\pi(n+\frac{1}{2})}\; e^{2i\pi(n+\frac{1}{2})\frac{\tau-\tau'}{T}} = \mathrm{sign}(\tau-\tau') .
\tag{12.86}
\]
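The Fourier representation in eq. (12.86) can be checked numerically. The sketch below sums the series symmetrically, pairing n with −n−1 (a choice made here so that the partial sums are real up to rounding); the truncation `n_max` is an arbitrary illustrative parameter and convergence is slow, of order 1/n_max away from τ = τ′.

```python
import cmath
import math


def S_partial_sum(tau, taup, T, n_max):
    """Symmetric partial sum (n = -n_max-1, ..., n_max) of the series in eq. (12.86).

    Returns a complex number whose imaginary part is zero up to rounding.
    """
    total = 0.0 + 0.0j
    for n in range(-n_max - 1, n_max + 1):
        w = 2j * math.pi * (n + 0.5)
        total += cmath.exp(w * (tau - taup) / T) / w
    return 2.0 * total
```

For 0 < τ − τ′ < T the partial sums approach +1, and for −T < τ − τ′ < 0 they approach −1, as expected for an anti-periodic sign function.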
One can then in principle follow the same sequence of steps as in the scalar QED
case, to obtain an expression of the one-loop N-photon amplitude in spinor QED in
terms of the propagators G, Ġ, G̈ and S. In fact, it was shown by Bern and Kosower
that this expression can be obtained from the corresponding scalar QED amplitude
by a simple substitution. Starting from the final scalar QED expression in terms of
the polynomial $Q_N(\dot G)$ (see eq. (12.84)), one should arrange each term of this
polynomial as a product of cycles of the form
\[
[1,2,3,\cdots c]_G \equiv \dot G(\tau_1,\tau_2)\,\dot G(\tau_2,\tau_3)\cdots \dot G(\tau_{c-1},\tau_c) .
\tag{12.87}
\]
Then, the Bern-Kosower rule states that in order to obtain the analogous spinor QED
amplitude, one should perform the following substitution on each such cycle:
\[
[1,2,3,\cdots c]_G \;\to\; -2\,\Big([1,2,3,\cdots c]_G - [1,2,3,\cdots c]_S\Big) ,
\tag{12.88}
\]
where $[\cdots]_S$ is the same cyclic product made of the propagator S defined above
instead of $\dot G$.
Figure 12.3: Feynman graphs contributing to the one-loop photon polarization tensor in scalar QED.
\[
G(\tau_1,\tau_2) = T\,\vartheta_1(1-\vartheta_1) , \qquad
\dot G(\tau_1,\tau_2) = 1 - 2\,\vartheta_1 .
\tag{12.92}
\]
(We have already dropped the constant term in −T/6 from the propagator, since it
does not contribute to amplitudes thanks to momentum conservation.) At this point,
the polarization tensor reads
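\[
\Pi^{\mu\nu}_{\text{scalar}}(p) = -\frac{e^2}{(4\pi)^{d/2}}\,\big(g^{\mu\nu}p^2 - p^\mu p^\nu\big) \int_0^1 d\vartheta_1\, (1-2\vartheta_1)^2
\int_0^\infty \frac{dT}{T}\; T^{2-d/2}\; e^{-T\,(m^2+p^2\vartheta_1(1-\vartheta_1))}
\]
\[
\phantom{\Pi^{\mu\nu}_{\text{scalar}}(p)} = -\frac{e^2}{(4\pi)^{d/2}}\,\big(g^{\mu\nu}p^2 - p^\mu p^\nu\big) \int_0^1 d\vartheta_1\, (1-2\vartheta_1)^2\;
\Gamma\big(2-\tfrac{d}{2}\big)\,\big(m^2+p^2\vartheta_1(1-\vartheta_1)\big)^{d/2-2} .
\tag{12.93}
\]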
One may check that this expression is identical to the one we would have obtained
from the two Feynman diagrams of the figure 12.3, after introducing Feynman
parameters and performing the integration over the loop momentum.
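The only non-trivial step between the two lines of eq. (12.93) is the proper-time integral $\int_0^\infty \frac{dT}{T}\,T^{2-d/2}e^{-T\Delta} = \Gamma(2-\frac{d}{2})\,\Delta^{d/2-2}$, with $\Delta \equiv m^2+p^2\vartheta_1(1-\vartheta_1)$. The sketch below checks it numerically for d < 4, where the integral converges without regularization. The change of variable T = e^y, the integration window and the number of steps are our own illustrative choices.

```python
import math


def proper_time_integral(d, delta, y_min=-60.0, y_max=8.0, steps=20000):
    """Trapezoidal estimate of int_0^inf dT/T T^(2-d/2) exp(-T*delta), using T = exp(y)."""
    s = 2.0 - d / 2.0
    h = (y_max - y_min) / steps
    total = 0.0
    for k in range(steps + 1):
        y = y_min + k * h
        f = math.exp(s * y - delta * math.exp(y))
        total += f if 0 < k < steps else 0.5 * f
    return total * h


def closed_form(d, delta):
    """Right hand side: Gamma(2 - d/2) * delta^(d/2 - 2)."""
    s = 2.0 - d / 2.0
    return math.gamma(s) * delta ** (-s)
```

For instance, `proper_time_integral(3, 0.7)` and `closed_form(3, 0.7)` should agree to high accuracy; at d = 4 the Gamma function has a pole, which is the usual ultraviolet divergence.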
Spinor QED : Let us now consider the same quantity in QED with a spin 1/2
fermion. Eq. (12.90) is replaced by
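\[
\Pi^{\mu\nu}_{\text{spin 1/2}}(p) = \frac{e^2}{2} \int_0^\infty \frac{dT}{T}\, e^{-m^2 T}
\int_{\substack{\zeta(0)=\zeta(T)\\ \psi(0)=-\psi(T)}} D\zeta(\tau)\,D\psi(\tau)\;
e^{\int_0^T d\tau\, \left(\frac{1}{4}\zeta\ddot\zeta - \frac{1}{2}\psi\cdot\dot\psi\right)}
\]
\[
\times \int_0^T d\tau_1\, d\tau_2\; e^{ip\cdot\zeta(\tau_1)}\, e^{-ip\cdot\zeta(\tau_2)}\;
\Big(\dot\zeta^\mu(\tau_1) + 2i\,\psi^\mu(\tau_1)\,(\psi(\tau_1)\cdot p)\Big)\,
\Big(\dot\zeta^\nu(\tau_2) - 2i\,\psi^\nu(\tau_2)\,(\psi(\tau_2)\cdot p)\Big) .
\tag{12.94}
\]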
The path integral for the term in ζ̇µ ζ̇ν is the same as in eq. (12.91), but we should
now multiply the result by5
Z
RT 1 .
and this result should be multiplied by $(4\pi T)^{-d/2}$ to account for the integration over
the variable ζ. Thus, we see here an example of the Bern-Kosower substitution rule:
the spin 1/2 loop can be obtained from the scalar loop, by replacing $\dot G^2(\tau_1,\tau_2)$ by
$\dot G^2(\tau_1,\tau_2) - S^2(\tau_1,\tau_2)$ and by multiplying by an overall factor −2 (this comes from
a −1/2 due to the different prefactors in the scalar and spin 1/2 one-loop effective
actions, times the factor 4 from eq. (12.95)). In terms of the variables $\vartheta_{1,2}$ and after
setting $\vartheta_2 = 0$, we have simply
\[
S(\tau_1,\tau_2) = 1 ,
\tag{12.97}
\]
\[
(1-2\vartheta_1)^2 - 1 = -4\,\vartheta_1(1-\vartheta_1) .
\tag{12.98}
\]
5 This formula may be obtained by ζ function regularization. If we denote $A \equiv -\partial_\tau$ (restricted to the
subspace of anti-periodic functions) and $\lambda_n = -2i\pi(n+\frac{1}{2})$ its eigenvalues, the ζ function of this operator
is $\zeta_A(s) \equiv \sum_{n\in\mathbb{Z}} \lambda_n^{-s}$. Since there are four variables $\psi^\mu$, the value of the path integral is $\det^2 A = \exp(-2\zeta_A'(0))$.
On the other hand, we have $\zeta_A(s) = (i\pi)^{-s}\,(1+e^{i\pi s})\,(1-2^{-s})\,\zeta(s)$, where ζ(s) is
Riemann’s zeta function. This function can be expanded at small s, giving: $\zeta_A(s) = -\ln(2)\, s + O(s^2)$.
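\[
\Pi^{\mu\nu}_{\text{spin 1/2}}(p) = \frac{8\,e^2}{(4\pi)^{d/2}}\,\big(g^{\mu\nu}p^2 - p^\mu p^\nu\big) \int_0^1 d\vartheta_1\; \vartheta_1(1-\vartheta_1)\;
\Gamma\big(2-\tfrac{d}{2}\big)\,\big(m^2+p^2\vartheta_1(1-\vartheta_1)\big)^{d/2-2} ,
\tag{12.99}
\]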
that agrees with the expression obtained from Feynman graphs (only the first topology
in the figure 12.3, with the scalar loop replaced by a spinor loop, contributes in this
case). Remarkably, all the Dirac algebra usually involved in the calculation of fermion
loops is completely avoided in the worldline formalism, since it is encapsulated into
the Grassmann functional integration over ψµ .
Chapter 13
Lattice field theory
We have seen earlier that the running coupling in an SU(N) non-Abelian gauge
theory decreases at large energy (provided the number of quark flavours is less
than 11N/2). The counterpart of asymptotic freedom is that the coupling increases
towards lower energies, precluding the use of perturbation theory to study phenomena
in this regime. Among such phenomena is colour confinement, i.e. the fact
that coloured states cannot exist as asymptotic states. Instead, the quarks and gluons
arrange themselves into colour-neutral bound states, which can be mesons (e.g. pions,
kaons) made of a quark and an antiquark, or baryons (e.g. protons, neutrons) made of
three quarks¹. A legitimate question would be to determine the mass spectrum of the
asymptotic states of QCD from its Lagrangian.
Since the perturbative expansion is not applicable for this type of problem, one
would like to be able to attack it via some non-perturbative approach. By non-
perturbative, we mean a method by which observables would directly be obtained to all
orders in the coupling constant, without any expansion. One such method, known as
lattice field theory, consists in discretizing space-time in order to evaluate numerically
the path integral. The continuous space-time is replaced by a discrete grid of points,
the simplest arrangement being a hyper-cubic lattice such as the one shown in the
figure 13.1. The distance between nearest neighbor sites is called the lattice spacing,
and usually denoted a. The lattice spacing, being the smallest distance that exists in
this setup, therefore provides a natural ultraviolet regularization. Indeed, on a lattice
of spacing a, the largest conjugate momentum is of order $a^{-1}$. Moreover, one usually
uses periodic boundary conditions; if the lattice has N spacings in all directions,
then we have $\phi(x+N\hat\mu) = \phi(x)$ for bosonic fields and $\phi(x+N\hat\mu) = -\phi(x)$ for
fermionic fields ($\hat\mu$ is the displacement vector by one lattice spacing in the direction µ
of spacetime).
1 More exotic bound states made of four (tetraquarks) or five (pentaquarks) have also been speculated,
but the experimental evidence for these states is so far not fully conclusive. Likewise, there may exist
bound states without valence quarks, the glueballs.
A natural choice is to replace the integral over space-time by a discrete sum over the
sites of the lattice, weighted by the volume $a^4$ of the elementary cells of the lattice,
\[
a^4 \sum_{x\in\text{lattice}} \;\underset{a\to 0}{\longrightarrow}\; \int d^4x .
\tag{13.2}
\]
Then we replace the continuous function φ(x) by a discrete set of real numbers that
live on the lattice nodes. For simplicity, we keep denoting φ(x) the value of the field
on the lattice site x. The discretization of the mass and interaction terms is trivial, but
the discretization of the derivatives that appear in the D’Alembertian operator is not
unique. Using only two nearest neighbors, one may define forward or backward finite
differences,
\[
\nabla^\mu_F f(x) \equiv \frac{f(x+\hat\mu) - f(x)}{a} , \qquad
\nabla^\mu_B f(x) \equiv \frac{f(x) - f(x-\hat\mu)}{a} ,
\tag{13.3}
\]
that both go to the continuum derivative in the limit a → 0. However, unlike the
continuous derivative, $\nabla^\mu_F$ and $\nabla^\mu_B$ are not anti-adjoint. Instead, assuming periodic
boundary conditions, we have
\[
\sum_{x\in\text{lattice}} f(x)\, \nabla^\mu_F g(x) = -\sum_{x\in\text{lattice}} \big(\nabla^\mu_B f(x)\big)\, g(x) .
\tag{13.4}
\]
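The discrete integration by parts of eq. (13.4) is easy to verify on a small periodic lattice. The following sketch does so in one dimension with random real test functions; the lattice size, the spacing and the seed are arbitrary illustrative values.

```python
import random


def forward_diff(f, a):
    """Forward finite difference of eq. (13.3) on a periodic 1D lattice."""
    n = len(f)
    return [(f[(x + 1) % n] - f[x]) / a for x in range(n)]


def backward_diff(f, a):
    """Backward finite difference of eq. (13.3); f[-1] wraps around periodically."""
    n = len(f)
    return [(f[x] - f[x - 1]) / a for x in range(n)]


def summation_by_parts(n_sites=12, a=0.5, seed=0):
    """Return both sides of eq. (13.4) for two random functions f and g."""
    rng = random.Random(seed)
    f = [rng.uniform(-1.0, 1.0) for _ in range(n_sites)]
    g = [rng.uniform(-1.0, 1.0) for _ in range(n_sites)]
    lhs = sum(fx * dg for fx, dg in zip(f, forward_diff(g, a)))
    rhs = -sum(df * gx for df, gx in zip(backward_diff(f, a), g))
    return lhs, rhs
```

The two returned numbers should coincide up to rounding, which shows that the adjoint of the forward difference is minus the backward difference rather than minus itself.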
Let us make a few remarks concerning the errors introduced by the discretization.
Firstly, the continuous spacetime symmetries (translation and rotation invariance) of
the underlying theory are now reduced to the subgroup of the discrete symmetries of
a cubic lattice. They are recovered in the limit a → 0. Another source of discrepancy
between the continuum and discrete theories is the dispersion relation that relates the
energy and momentum of an on-shell particle. In the continuum theory, this relation
is of course
\[
E^2 = p^2 + m^2 ,
\tag{13.7}
\]
where $-p^2$ is an eigenvalue of the Laplacian. In order to find its counterpart with the
above discretization, we must determine the spectrum of the finite difference operator
$\nabla^\mu_B\nabla^\mu_F$. On a lattice with N sites and periodic boundary conditions, its eigenfunctions
are given by
\[
\phi_k(x) \equiv e^{2i\pi\frac{kx}{Na}} \qquad \text{with } k\in\mathbb{Z} ,\quad -\frac{N}{2} \le k \le \frac{N}{2} .
\tag{13.8}
\]
Figure 13.2: Discrepancy between the continuous (solid curve) and discrete (points) dispersion relations, on a one-dimensional lattice with N = 40. (Plot of E versus k, for −20 ≤ k ≤ 20.)
As long as |k| ≪ N, this agrees quite well with the continuum dispersion relation, but
the agreement is not good for larger values of |k|. This discrepancy is illustrated in
figure 13.2. This mismatch does not improve by increasing the number of lattice
points: only the center of the Brillouin zone has a dispersion relation that agrees
with the continuum one. In order to mitigate this problem, one should choose the
parameters of the lattice in such a way that the physically relevant scales correspond
to values of k for which the distortion of the dispersion curve is small.
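The distortion of the dispersion curve can be made explicit with a few lines of code. Acting with $\nabla_B\nabla_F$ of eq. (13.3) on the plane wave of eq. (13.8) gives the eigenvalue $-(4/a^2)\sin^2(\pi k/N)$; this closed form is worked out here from those two equations rather than quoted from the text, and the function names are illustrative. The sketch compares the resulting lattice energy with the continuum one.

```python
import cmath
import math


def lattice_energy(k, n_sites, a=1.0, m=0.0):
    """Energy from the eigenvalue -(4/a^2) sin^2(pi k / N) of the second difference."""
    p_hat = (2.0 / a) * math.sin(math.pi * k / n_sites)
    return math.sqrt(m * m + p_hat * p_hat)


def continuum_energy(k, n_sites, a=1.0, m=0.0):
    """Continuum relation of eq. (13.7) with momentum p = 2 pi k / (N a)."""
    p = 2.0 * math.pi * k / (n_sites * a)
    return math.sqrt(m * m + p * p)


def second_difference_eigenvalue(k, n_sites, a=1.0):
    """Apply (phi(x+1) - 2 phi(x) + phi(x-1)) / a^2 to the plane wave and read off the eigenvalue."""
    phi = [cmath.exp(2j * math.pi * k * x / n_sites) for x in range(n_sites)]
    second_diff = (phi[1] - 2.0 * phi[0] + phi[-1]) / (a * a)
    return (second_diff / phi[0]).real
```

With N = 40, the two energies agree at the per-mille level for k = 1, while at the edge of the Brillouin zone (k = 20) the lattice energy is 2/a instead of π/a for a massless particle.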
Non-Abelian gauge theories pose an additional difficulty: since the local gauge
invariance plays a central role in their properties, any attempt at discretizing gauge
fields should preserve this symmetry. It turns out that there exists a discretization of
the Yang-Mills action that goes to the continuum action in the limit where a → 0, and
has an exact gauge invariance. The main ingredient in this construction is eq. (4.174),
that relates the Wilson loop along a small square,
to the squared field strength. These elementary lattice Wilson loops are called
plaquettes. In the fundamental representation of su(N), we have
\[
\mathrm{tr}\,[\square]_{x;\mu\nu} = N - \frac{g^2a^4}{4}\, F_a^{\mu\nu}(x)\,F^a_{\mu\nu}(x) + O(a^6) .
\tag{13.12}
\]
Note that, although the first two terms in the right hand side are real valued, the
remainder (terms of order $a^6$ and beyond) may be complex. Therefore, it is convenient
to take the real part of the trace of the Wilson loop in order to construct a real valued
discrete action. By summing this equation over all the lattice points x and all the pairs
of distinct directions (µ, ν), we obtain
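\[
a^4 \sum_{x\in\text{lattice}} -\frac{1}{4}\, F_a^{\mu\nu}(x)\,F^a_{\mu\nu}(x)
= \underbrace{\frac{N}{g^2} \sum_{x\in\text{lattice}} \sum_{(\mu,\nu)} \Big( N^{-1}\, \mathrm{tr}\,\mathrm{Re}\,[\square]_{x;\mu\nu} - 1 \Big)}_{\text{Wilson action, denoted } \frac{1}{g^2} S_W[U]} + O(a^2) .
\tag{13.13}
\]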
Note that the error term of order $a^6$ becomes a term of order $a^2$ after summation over
the lattice sites, since the number of sites grows like $a^{-4}$ if the volume is held fixed.
Thus, the sum of the traces of the Wilson loops over all the elementary plaquettes
of the lattice provides a discretization of the Yang-Mills action. In this discrete
formulation, the natural variables are not the gauge potentials Aµ (x) themselves, but
the Wilson lines Uµ (x) that live on the edges of the lattice, called link variables. In
this notation, x is the starting point and µ the direction of the Wilson line, as illustrated
in the left panel of figure 13.3. The Wilson line oriented in the $-\hat\mu$ direction, i.e.
Figure 13.3: Left: link variable. Right: plaquette on an elementary square of the lattice.
At this stage, the discrete analogue of the path integral that gives the expectation
value of a gauge invariant operator reads,
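\[
\langle O\rangle = \int \prod_{x,\mu} dU_\mu(x)\; O[U]\; \exp\Big( i\, \frac{N}{g^2} \sum_{x} \sum_{(\mu,\nu)} \Big( N^{-1}\, \mathrm{tr}\,\mathrm{Re}\,[\square]_{x;\mu\nu} - 1 \Big) \Big) .
\tag{13.15}
\]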
Since there exists a left- and right-invariant2 group measure dUµ (x), the left hand
side of this formula is gauge invariant. Moreover, it goes to the expectation value of
the continuum theory in the limit of zero lattice spacing.
The exponential under the integral is now real-valued, and thus positive definite.
Note that numerical quadratures such as Simpson’s rule, are not practical for this
problem, given the huge number of dimensions of the integral to be evaluated. For
instance, for the 8-dimensional Lie group SU(3), in 4 space-time dimensions, on a
lattice with $N^4$ points, this dimension is $8 \times 4 \times N^4$. For N = 32, the path integral
is thus transformed into a $2^{25}$-dimensional ($2^{25} \approx 3\times 10^7$) ordinary integral. Instead,
one views the exponential of the Wilson action as a probability distribution (up to a
normalization constant) for the link variables, that may be sampled by a Monte-Carlo
algorithm (e.g. the Metropolis-Hastings algorithm) in order to estimate the integral.
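To make the sampling idea concrete, here is a minimal Metropolis sketch. It is deliberately not the SU(3) theory in four dimensions discussed in the text: we assume a compact U(1) gauge group on a small two-dimensional periodic lattice, with link variables $U_\mu(x) = e^{i\theta_\mu(x)}$ and Euclidean weight $\exp(\beta\sum_P \cos\theta_P)$, where $\theta_P$ is the oriented sum of the link angles around a plaquette. The parameter names (`beta`, `step`, `n_therm`, `n_meas`) and their default values are illustrative choices.

```python
import math
import random


def plaquette_angle(theta, L, x, y):
    """Oriented sum of the four link angles around the plaquette with corner (x, y)."""
    xp, yp = (x + 1) % L, (y + 1) % L
    return theta[0][x][y] + theta[1][xp][y] - theta[0][x][yp] - theta[1][x][y]


def local_action(theta, L, mu, x, y, beta):
    """Minus the log-weight of the two plaquettes that contain the link (mu, x, y)."""
    if mu == 0:
        corners = [(x, y), (x, (y - 1) % L)]
    else:
        corners = [(x, y), ((x - 1) % L, y)]
    return -beta * sum(math.cos(plaquette_angle(theta, L, cx, cy)) for cx, cy in corners)


def metropolis_sweep(theta, L, beta, rng, step=1.0):
    """Propose a random shift of every link angle once, with Metropolis accept/reject."""
    for mu in range(2):
        for x in range(L):
            for y in range(L):
                old = theta[mu][x][y]
                s_old = local_action(theta, L, mu, x, y, beta)
                theta[mu][x][y] = old + rng.uniform(-step, step)
                s_new = local_action(theta, L, mu, x, y, beta)
                if rng.random() >= math.exp(min(0.0, s_old - s_new)):
                    theta[mu][x][y] = old  # reject the proposal


def average_plaquette(beta, L=6, n_therm=200, n_meas=400, seed=1):
    """Monte-Carlo estimate of the average of cos(theta_P) over plaquettes and sweeps."""
    rng = random.Random(seed)
    theta = [[[0.0] * L for _ in range(L)] for _ in range(2)]
    total = 0.0
    for sweep in range(n_therm + n_meas):
        metropolis_sweep(theta, L, beta, rng)
        if sweep >= n_therm:
            total += sum(math.cos(plaquette_angle(theta, L, x, y))
                         for x in range(L) for y in range(L)) / (L * L)
    return total / n_meas
```

Only the two plaquettes touching the updated link enter the accept/reject step, which is what makes local updates cheap. Production lattice QCD codes use far more elaborate algorithms, so this should be read as an illustration of the principle only.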
In this approach, as long as one is evaluating the expectation value of gauge
invariant observables, it is not necessary to fix the gauge in lattice QCD calculations.
2 This means that:
\[
\int dU\, f[U] = \int dU\, f[\Omega U] = \int dU\, f[U\Omega] .
\]
Such a measure, known as the Haar measure, exists for compact Lie groups, like SU(N).
13.2 Fermions
In the discretization, we assign a spinor ψ(x) to each site of the lattice. Under a gauge
transformation Ω(x), these spinors transform in the same way as in the continuous
theory,
The main difficulty in defining a discrete covariant derivative that transforms appropriately
under a gauge transformation is that ψ(x) and $\psi(x\pm\hat\mu)$ transform differently
when Ω(x) depends on space-time. This problem can be remedied by using a link
variable between the point x and its neighbors. As with the ordinary derivatives,
one may define forward and backward discrete derivatives,
\[
D^\mu_F \psi(x) \equiv \frac{U^\dagger_\mu(x)\,\psi(x+\hat\mu) - \psi(x)}{a} , \qquad
D^\mu_B \psi(x) \equiv \frac{\psi(x) - U_\mu(x-\hat\mu)\,\psi(x-\hat\mu)}{a} ,
\tag{13.19}
\]
that both transform like a spinor at the point x, and therefore are valid discretizations
of a covariant derivative. However, neither of these two operators is anti-adjoint, and
therefore they would not give a Hermitean Lagrangian density. This may be achieved
by using instead $\frac{1}{2}(D^\mu_F + D^\mu_B)$, which corresponds to a symmetric forward-backward
difference
\[
\frac{1}{2}\big(D^\mu_F + D^\mu_B\big)\psi(x) = \frac{U^\dagger_\mu(x)\,\psi(x+\hat\mu) - U_\mu(x-\hat\mu)\,\psi(x-\hat\mu)}{2a} .
\tag{13.20}
\]
Figure 13.4: Discrepancy between the continuous (solid curve) and discrete (points) dispersion curves for fermions, on a one-dimensional lattice with N = 40. (Plot of E versus k, for −20 ≤ k ≤ 20.)
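The price to pay for the symmetric difference can be seen in the free case. Setting all link variables to the identity (an assumption made here to keep the example minimal), the operator of eq. (13.20) acts on a plane wave $e^{2i\pi kx/N}$ by multiplication with $i\sin(2\pi k/N)/a$. The sketch below checks this and shows that the eigenvalue vanishes not only at k = 0 but also at the edge of the Brillouin zone; the helper names are illustrative.

```python
import cmath
import math


def plane_wave(k, n_sites):
    """Plane wave exp(2 i pi k x / N) on a periodic 1D lattice (x in units of a)."""
    return [cmath.exp(2j * math.pi * k * x / n_sites) for x in range(n_sites)]


def symmetric_difference(phi, a):
    """Free-field version of eq. (13.20): (phi(x+1) - phi(x-1)) / (2a), periodic."""
    n = len(phi)
    return [(phi[(x + 1) % n] - phi[x - 1]) / (2.0 * a) for x in range(n)]


def naive_fermion_momentum(k, n_sites, a=1.0):
    """Eigenvalue of -i times the symmetric difference on the plane wave of index k."""
    return math.sin(2.0 * math.pi * k / n_sites) / a
```

For N = 40, `naive_fermion_momentum(20, 40)` is zero up to rounding: a mode with the largest lattice momentum looks like a second massless excitation, which is the origin of the fermion doublers discussed below.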
Figure 13.5: ... fermionic case, on a one-dimensional lattice with N = 40, after inclusion of the Wilson term. (Plot of E versus k, for −20 ≤ k ≤ 20.)
the two covariant derivatives. Therefore, the Wilson term –like an ordinary mass
term– breaks explicitly the chiral symmetry of the Dirac Lagrangian in the case
of massless fermions. The fermion doublers are in fact intimately related to chiral
symmetry. Without the Wilson term, lattice QCD with massless quarks has an exact
chiral symmetry unbroken by the lattice regularization, and therefore there cannot be
a chiral anomaly. In fact, this absence of anomaly is precisely due to a cancellation
of anomalies among the multiple copies (the doublers) of the fermion modes. This
argument is completely general and not specific to the Wilson term: any mechanism
that lifts the degeneracy among the doublers will spoil the anomaly cancellation and
thus break chiral symmetry. For this reason, the study of phenomena related to chiral
symmetry is always delicate in lattice QCD.
In eq. (13.26), the Dirac determinant provides closed quark loops, while the
propagator S(x1 , x2 ) connects the external points of the operator under consideration.
This observation, illustrated in the figure 13.6, clarifies the meaning of the quenched
approximation, in which the determinant of the Dirac operator is replaced by 1. This
approximation, motivated primarily by the computational difficulty of evaluating the
Dirac determinant, was widely used in lattice QCD computations until advances in
algorithms and computer hardware made it unnecessary. Note that, although quark
loops are not included in the quenched approximation, gluon loops are present to their
full extent. In contrast, lattice QCD calculations that include the Dirac determinant,
and thus the effect of quark loops, are said to use dynamical fermions.
Figure 13.6: Illustration of the two types of quark contributions. In dark: quark propagators (i.e. inverse of the Dirac operator) that connect the ψ’s and $\bar\psi$’s in the operator being evaluated. In lighter color: quark loops coming from the determinant of the Dirac operator.
In the first equality, we have inserted a complete basis of eigenstates of the QCD
Hamiltonian, and the second equality follows from the fact that Ψn is an eigenstate
of rest energy Mn (there is no factor i inside the exponential because of the Euclidean
time used in lattice QCD). The sum in the last equality receives non-zero contributions
from all the states Ψn that possess the quantum numbers carried by the operator O.
However, taking the limit T → ∞ selects the one among these eigenstates that has
the smallest mass. This observation can be turned into a method to determine hadron
masses in lattice QCD:
1. Choose an operator O that has the quantum numbers of the hadron h of interest.
The choice of the operator is not crucial, as long as the overlap $\langle h|O|0\rangle$ is not
zero. However, eq. (13.27) suggests that a better result, i.e. less noisy with
limited statistics, may be obtained by trying to maximize this overlap.
3. Fit the large T tail of this expectation value. The slope of the exponential gives
the mass of the lightest hadron that possesses these quantum numbers.
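The last step can be illustrated on synthetic data. The sketch below builds a correlator as a sum of decaying exponentials, which is the general Euclidean form described above, and extracts an "effective mass" from the ratio of two consecutive time slices. The masses and weights are invented for the illustration, and the log-ratio estimator is one common choice rather than the specific fitting procedure of the text.

```python
import math


def correlator(t, masses, weights):
    """Synthetic Euclidean correlator: sum over states of weight * exp(-mass * t)."""
    return sum(w * math.exp(-m * t) for m, w in zip(masses, weights))


def effective_mass(t, masses, weights):
    """Log-ratio of consecutive time slices; tends to the lightest mass at large t."""
    return math.log(correlator(t, masses, weights) / correlator(t + 1, masses, weights))
```

With positive weights, the effective mass approaches the lightest mass from above as t grows, because the heavier states die out faster. On real lattice data, the statistical noise grows at large t, so the fit window is a compromise between excited-state contamination and noise.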
is simply a choice of normalization of the invariant group measure. From the unitarity
of the group elements, one then obtains³
\[
\int dU\; U_{ij}\, U^\dagger_{kl} = \frac{1}{N}\, \delta_{jk}\,\delta_{il} .
\tag{13.30}
\]
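Eq. (13.30) can be checked by Monte-Carlo integration for SU(2). The sketch below assumes the standard parametrization of SU(2) matrices by unit quaternions, for which the normalized Haar measure is the uniform measure on the three-sphere; the sampling method (normalized Gaussian four-vectors), the sample size and the seed are illustrative choices.

```python
import math
import random


def random_su2(rng):
    """Haar-distributed SU(2) matrix built from a uniformly distributed unit quaternion."""
    a = [rng.gauss(0.0, 1.0) for _ in range(4)]
    norm = math.sqrt(sum(c * c for c in a))
    a0, a1, a2, a3 = (c / norm for c in a)
    return [[complex(a0, a3), complex(a2, a1)],
            [complex(-a2, a1), complex(a0, -a3)]]


def haar_average_UUdag(n_samples=20000, seed=0):
    """Estimate the integral of U_ij (U^dagger)_kl for all index choices (i, j, k, l)."""
    rng = random.Random(seed)
    keys = [(i, j, k, l) for i in range(2) for j in range(2)
            for k in range(2) for l in range(2)]
    sums = {key: 0.0 + 0.0j for key in keys}
    for _ in range(n_samples):
        U = random_su2(rng)
        for (i, j, k, l) in keys:
            # (U^dagger)_kl is the complex conjugate of U_lk
            sums[(i, j, k, l)] += U[i][j] * U[l][k].conjugate()
    return {key: val / n_samples for key, val in sums.items()}
```

Within statistical errors, the entries with i = l and j = k come out close to 1/2 and all the others close to zero, as eq. (13.30) predicts for N = 2.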
In these integrals, the link variables on different edges of the lattice are independent
variables, and there is a separate integral for each of them. This is completely general:
integrals of the form (13.28) are non-zero only if the integrand contains an equal
number of U’s and $U^\dagger$’s, i.e. for n = m. Therefore, each link variable U that appears
in such a group integral must be matched by a corresponding $U^\dagger$. For instance, the
group integral of the Wilson loop defined on an elementary plaquette is zero,
\[
\int \prod_{x,\mu} dU_\mu(x)\; \underbrace{\mathrm{tr}\,\Big( U^\dagger_\nu(x)\, U^\dagger_\mu(x+\hat\nu)\, U_\nu(x+\hat\mu)\, U_\mu(x) \Big)}_{\mathrm{tr}\,[\square]_{x;\mu\nu}} = 0 ,
\tag{13.31}
\]
because the four link variables live on four distinct edges of the lattice. In contrast,
the integral of the trace of a plaquette times the trace of the conjugate plaquette is
non-zero:
\[
\int \prod_{x,\mu} dU_\mu(x)\; \mathrm{tr}\,[\square]_{x;\mu\nu}\; \mathrm{tr}\,[\square]^\dagger_{x;\mu\nu} = 1 .
\tag{13.32}
\]
Using these results, we can calculate to order β the expectation value of the trace of a
plaquette:
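\[
\big\langle \mathrm{tr}\,[\square]_{x;\mu\nu} \big\rangle \equiv
\frac{\int \prod_{x,\mu} dU_\mu(x)\; \mathrm{tr}\,[\square]_{x;\mu\nu}\; \exp\Big( \beta N \sum_{y;\rho\sigma} \big( N^{-1}\, \mathrm{tr}\,\mathrm{Re}\,[\square]_{y;\rho\sigma} - 1 \big) \Big)}
{\int \prod_{x,\mu} dU_\mu(x)\; \exp\Big( \beta N \sum_{y;\rho\sigma} \big( N^{-1}\, \mathrm{tr}\,\mathrm{Re}\,[\square]_{y;\rho\sigma} - 1 \big) \Big)}
= \frac{\beta}{2} + O(\beta^2) .
\tag{13.33}
\]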
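The leading behaviour β/2 can be checked in the simplest possible setting. As a toy model (an assumption made here, not the SU(N) case of the text), take a U(1) theory with a single plaquette variable $e^{i\theta}$ and weight $\exp(\beta(\cos\theta - 1))$, and integrate over θ on a uniform grid. The grid size is an illustrative parameter.

```python
import math


def plaquette_expectation_u1(beta, n_grid=2000):
    """Average of exp(i theta) with weight exp(beta (cos(theta) - 1)), for one U(1) plaquette.

    The imaginary part vanishes by symmetry, so only cos(theta) is accumulated.
    """
    num = 0.0
    den = 0.0
    for k in range(n_grid):
        theta = 2.0 * math.pi * k / n_grid
        w = math.exp(beta * (math.cos(theta) - 1.0))
        num += math.cos(theta) * w
        den += w
    return num / den
```

At small β the result approaches β/2, with corrections that are of higher order in β, in line with the structure of eq. (13.33).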
Consider now the trace of a more general Wilson loop along a path γ (planar, to
simplify the discussion). Each U and U† in the Wilson loop must be compensated
by a link variable coming from the β expansion of the exponential of the Wilson
action. The lowest order term in β corresponds to a minimal tiling of the Wilson
Let us consider now a rectangular loop, with an extent R in the spatial direction 1 and
an extent T in the Euclidean time direction 4. The previous result indicates that the
expectation value of the trace of the corresponding Wilson loop has the following
form,
$\theta_0 + 2i\,\theta_a t^a_f$ with $\theta_0^2 + \theta_1^2 + \theta_2^2 + \theta_3^2 = 1$, and the invariant group measure normalized according to
eq. (13.29) is $dU = d\theta_1 d\theta_2 d\theta_3/(\pi^2\sqrt{1-\theta^2})$ (with $\theta_0 = \sqrt{1-\theta^2}$). By using this measure and the
Fierz identity satisfied by the generators $t^a_f$, an explicit calculation leads easily to eq. (13.30).
Figure 13.9: Rectangular Wilson loop in the $A^4 \equiv 0$ gauge.
where $W_{[0,R]}$ is a (spatial) Wilson line going from (t, R) to (t, 0). Consider now
the vacuum expectation value $\langle 0|\,O^\dagger_{q\bar q}(0)\,O_{q\bar q}(T)\,|0\rangle$. In this expectation value, the
fermionic path integral produces two quark propagators that connect the ψ’s to the
$\bar\psi$’s. However, in the limit of infinite quark mass, the quarks are static and their
propagator is just a Wilson line in the temporal direction, that reduces to the identity
in the $A^4 = 0$ gauge (represented by the dotted lines in figure 13.9). Thus, we
have
By inserting a complete basis of eigenstates of the Hamiltonian in the right hand side
of eq. (13.37) and by taking the limit T → ∞, we find a result dominated by the
quark-antiquark state of lowest energy E0 ,
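\[
\lim_{T\to\infty} \langle 0|\,O^\dagger_{q\bar q}(0)\,O_{q\bar q}(T)\,|0\rangle = \Big| \langle 0|\,O^\dagger_{q\bar q}(0)\,|\Psi_0\rangle \Big|^2\, e^{-E_0 T} .
\tag{13.38}
\]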
Moreover, in the limit of large mass, the energy E0 of this state is dominated by the
potential energy V(R) between the quark and the antiquark (the quark and antiquark
are non-relativistic, and their kinetic energy behaves as P2 /2M → 0),
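\[
\lim_{M,T\to\infty} \langle 0|\,O^\dagger_{q\bar q}(0)\,O_{q\bar q}(T)\,|0\rangle = \Big| \langle 0|\,O^\dagger_{q\bar q}(0)\,|\Psi_0\rangle \Big|^2\, e^{-V(R)\, T} .
\tag{13.39}
\]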
By comparing this result with that of the strong coupling expansion, eq. (13.35), we
conclude that
\[
V(R) = \sigma\, R .
\tag{13.40}
\]
This linear potential indicates that the force between the quark and antiquark is constant
at large distance, in sharp contrast with a Coulomb potential in electrodynamics.
This is a consequence of the colour confinement property of QCD.
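In practice, the potential is read off from the time dependence of rectangular Wilson loops. The sketch below illustrates the extraction on a synthetic loop that obeys an area law with an additional perimeter term; the functional form, the values of `sigma` and `c`, and the lattice units are assumptions made for the illustration, not lattice data.

```python
import math


def synthetic_wilson_loop(R, T, sigma=0.2, c=0.1):
    """Model expectation value: exp(-sigma * area - c * half-perimeter), in lattice units."""
    return math.exp(-sigma * R * T - c * (R + T))


def potential(R, T, loop=synthetic_wilson_loop):
    """V(R) from the decay rate in T: -log( W(R, T+1) / W(R, T) )."""
    return -math.log(loop(R, T + 1) / loop(R, T))


def string_tension(R, T, loop=synthetic_wilson_loop):
    """Slope of the potential: V(R+1) - V(R)."""
    return potential(R + 1, T, loop) - potential(R, T, loop)
```

For this model the potential is σR plus a constant coming from the perimeter term, so the finite difference in R isolates the string tension σ.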
where $A^\mu_\Omega$ is the gauge transform of the field configuration $A^\mu$ we aim at bringing to
Landau gauge. Indeed, if we apply an infinitesimal gauge transformation to the field
$A^\mu_\Omega$, the corresponding variation of $F_{\text{Landau}}[A,\Omega]$ is
\[
\delta F_{\text{Landau}}[A,\Omega] = -\int d^4x\; \big(D^{ab}_{\Omega\mu}\,\theta_b\big)\, A^{\mu a}_\Omega
= -\int d^4x\; \big(\partial_\mu\theta_a - g f^{cab} A^c_{\Omega\mu}\,\theta_b\big)\, A^{\mu a}_\Omega
= \int d^4x\; \theta_a\, \partial_\mu A^{\mu a}_\Omega .
\tag{13.42}
\]
Therefore, if $A^\mu_\Omega$ realizes an extremum of the functional, then this variation must be
zero for all possible $\theta_a(x)$, which means that $A^\mu_\Omega$ obeys the Landau gauge condition.
The discrete analogue of the functional defined in eq. (13.41) reads
\[
F_{\text{Landau}}[U,\Omega] \equiv -2\,a^2 \sum_x \sum_\mu \mathrm{Re}\;\mathrm{tr}\,\Big( \Omega(x)\, U_\mu(x)\, \Omega^\dagger(x+\hat\mu) \Big) .
\tag{13.43}
\]
Finding extrema of such a functional is a rather straightforward task, for instance with
the steepest descent algorithm.
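A minimal sketch of such a steepest descent is given below. For simplicity it assumes a U(1) gauge group on a two-dimensional periodic lattice, with $U_\mu(x) = e^{i\theta_\mu(x)}$ and $\Omega(x) = e^{i\alpha(x)}$, so that the functional of eq. (13.43) reduces, up to a positive constant, to $-\sum_{x,\mu}\cos(\theta_\mu(x) + \alpha(x) - \alpha(x+\hat\mu))$. Its gradient with respect to α(x) is a lattice divergence, which plays the role of $\partial_\mu A^\mu_\Omega$. The step size, iteration count and random initial links are illustrative choices.

```python
import math
import random


def random_links(L, seed=0, spread=0.5):
    """Random link angles theta[mu][x][y] drawn uniformly in [-spread, spread]."""
    rng = random.Random(seed)
    return [[[rng.uniform(-spread, spread) for _ in range(L)] for _ in range(L)]
            for _ in range(2)]


def transformed_angle(theta, alpha, L, mu, x, y):
    """Angle of Omega(x) U_mu(x) Omega^dagger(x + mu_hat) for U(1)."""
    xn, yn = ((x + 1) % L, y) if mu == 0 else (x, (y + 1) % L)
    return theta[mu][x][y] + alpha[x][y] - alpha[xn][yn]


def divergence(theta, alpha, L, x, y):
    """Gradient of the functional with respect to alpha(x): lattice divergence of sin(angle)."""
    return (math.sin(transformed_angle(theta, alpha, L, 0, x, y))
            - math.sin(transformed_angle(theta, alpha, L, 0, (x - 1) % L, y))
            + math.sin(transformed_angle(theta, alpha, L, 1, x, y))
            - math.sin(transformed_angle(theta, alpha, L, 1, x, (y - 1) % L)))


def max_divergence(theta, alpha, L):
    """Largest violation of the lattice Landau gauge condition."""
    return max(abs(divergence(theta, alpha, L, x, y)) for x in range(L) for y in range(L))


def fix_landau_gauge(theta, L, step=0.1, n_iter=1000):
    """Steepest descent on alpha, starting from the trivial gauge transformation."""
    alpha = [[0.0] * L for _ in range(L)]
    for _ in range(n_iter):
        grad = [[divergence(theta, alpha, L, x, y) for y in range(L)] for x in range(L)]
        for x in range(L):
            for y in range(L):
                alpha[x][y] -= step * grad[x][y]
    return alpha
```

The descent stops at an extremum of the functional, where the lattice divergence vanishes at every site. Which extremum is reached depends on the starting point, which is precisely where the Gribov ambiguity discussed next enters.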
Figure 13.10: Gauge orbit that intersects multiple times the gauge fixing manifold $G(A^\mu) = 0$.
Due to the existence of Gribov copies, the gauge fixed field configuration is not
defined uniquely by the gauge condition, which implies that this functional has more
than one extremum, corresponding to the various solutions of $\partial_\mu A^\mu_\Omega = 0$ along the
same gauge orbit (see figure 13.10). A natural criterion to decide which extrema
to take into account is to try and reproduce the perturbative Faddeev-Popov procedure.
Let us recall here the starting point, which amounts to inserting under the path integral
the left hand side of the following equation
\[
\int D\Omega(x)\; \delta\big[G^a(A^\mu_\Omega)\big]\; \det\Big(\frac{\delta G^a}{\delta\Omega}\Big)\Big|_{G^a(A^\mu_\Omega)=0} = 1 ,
\tag{13.44}
\]
where $G^a(A) = 0$ is the gauge fixing condition. However, two conditions must be
met for this integral to be really equal to one:
However, it was shown by Gribov that the uniqueness condition is generically not satisfied:
the gauge condition has multiple solutions, called Gribov copies. When these
conditions are not satisfied, the inserted factor is not one, but instead
\[
Z_{FP} = \sum_{\text{zeroes } i} \det\Big(\frac{\delta G^a}{\delta\Omega}\Big)_i\; \Big|\det\Big(\frac{\delta G^a}{\delta\Omega}\Big)_i\Big|^{-1}
= \sum_{\text{zeroes } i} \underbrace{\mathrm{sign}\,\det\Big(\frac{\delta G^a}{\delta\Omega}\Big)_i}_{\equiv\, \mathrm{sign}(i)} .
\tag{13.45}
\]
Thus, one may try to mimic the perturbative Faddeev-Popov procedure by the following
definition of a gauge fixed operator on the lattice
\[
O[A]\Big|_{\text{Landau}} \equiv \frac{\sum_{\text{extrema } i} \mathrm{sign}(i)\; O[A_{\Omega_i}]}{\sum_{\text{extrema } i} \mathrm{sign}(i)} ,
\tag{13.46}
\]
where the denominator follows from the requirement that gauge invariant operators
should remain unaffected by the gauge fixing. However, it was shown by Neuberger
that this definition is flawed, because the distribution of Gribov copies is such that
both the numerator and the denominator are exactly zero. In order to see this, consider
a gauge invariant observable O[U], and let us try to mimic closely the continuum
BRST quantization, by introducing ghosts and antighosts, the BRST variation B of
the antighost ($B \equiv Q_{BRST}\,\bar\chi$, $Q_{BRST}\,B = 0$) and a gauge fixing parameter ξ. By doing
so, the expectation value of the observable would read
\[
\langle O\rangle = Z^{-1} \int D(U,\chi,\bar\chi,B)\; O[U]\; e^{-\frac{1}{g^2} S_W[U] - \frac{1}{2\xi}\sum_x BB + Q_{BRST}\sum_x \bar\chi G} ,
\qquad
Z \equiv \int D(U,\chi,\bar\chi,B)\; e^{-\frac{1}{g^2} S_W[U] - \frac{1}{2\xi}\sum_x BB + Q_{BRST}\sum_x \bar\chi G} .
\tag{13.47}
\]
(Here, we are assuming a completely generic gauge fixing function G(U).) Note
that with a compact gauge group and a finite lattice, all the integrals involved in this
formula are finite. Consider now the following quantity:
\[
F_O(t) \equiv \int D(U,\chi,\bar\chi,B)\; O[U]\; e^{-\frac{1}{g^2} S_W[U] - \frac{1}{2\xi}\sum_x BB + t\, Q_{BRST}\sum_x \bar\chi G} .
\tag{13.48}
\]
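The numerator in the gauge fixed definition of $\langle O\rangle$ is nothing but $F_O(1)$. The
derivative of this function is given by
\[
\frac{dF_O}{dt} = \int D(U,\chi,\bar\chi,B)\; \Big[ Q_{BRST} \sum_x \bar\chi G \Big]\;
\underbrace{O[U]\; e^{-\frac{1}{g^2} S_W[U] - \frac{1}{2\xi}\sum_x BB + t\, Q_{BRST}\sum_x \bar\chi G}}_{\text{BRST invariant}}
\]
\[
\phantom{\frac{dF_O}{dt}} = \int D(U,\chi,\bar\chi,B)\; Q_{BRST} \Big[ \Big(\sum_x \bar\chi G\Big)\; O[U]\;
e^{-\frac{1}{g^2} S_W[U] - \frac{1}{2\xi}\sum_x BB + t\, Q_{BRST}\sum_x \bar\chi G} \Big] = 0 .
\tag{13.49}
\]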
In the last equality, we have used the fact that the integral of a total BRST variation is
zero. Thus, we have
\[
F_O(1) = F_O(0) = \int D(U,\chi,\bar\chi,B)\; O[U]\; e^{-\frac{1}{g^2} S_W[U] - \frac{1}{2\xi}\sum_x BB} = 0 .
\tag{13.50}
\]
This time, the zero follows from the fact that the integrand does not depend on χ or $\bar\chi$,
hence the integrals over the ghost and antighost are equal to zero. The same reasoning
applies to the denominator Z in the gauge-fixed definition of $\langle O\rangle$, hence we have an
undefined ratio⁴:
\[
\langle O\rangle_{\text{gauge fixed}} = \frac{0}{0} .
\tag{13.51}
\]
If we interpret this result in the light of eq. (13.46), we see that these zeroes result
from an even number of Gribov copies with alternating signs for the determinant of
the Faddeev-Popov operator. One may view this issue as a fundamental obstruction for
a non-perturbative definition of gauge fixing by the Faddeev-Popov procedure. Because
of this problem, the practical lattice definition of the Landau gauge fixing is simply to
pick one of the extrema of the functional (13.43), without any special selection rule.
One should be aware of this procedure when comparing with perturbative results,
since it is a priori not guaranteed that the solutions of the gauge condition used in the
perturbative and in the non-perturbative calculations are the same.
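The cancellation of signs in eq. (13.45) has a simple one-dimensional analogue. For a single compact gauge parameter ω on a circle, any smooth periodic gauge fixing function G(ω) with simple zeroes crosses zero upwards as many times as downwards, so the sum of sign(G′) over its zeroes vanishes. The sketch below checks this on an arbitrary example; the function `example_gauge_function` and the grid size are invented for the illustration and have no special meaning.

```python
import math


def example_gauge_function(omega):
    """An arbitrary smooth 2*pi-periodic function with simple zeroes."""
    return math.sin(3.0 * omega) + 0.4 * math.cos(omega) + 0.1


def signed_zero_count(g, n_grid=10000):
    """Return (number of zeroes, sum of the signs of g' at the zeroes) over one period."""
    count = 0
    signed = 0
    prev = g(0.0)
    for k in range(1, n_grid + 1):
        cur = g(2.0 * math.pi * k / n_grid)
        if prev < 0.0 <= cur:      # upward crossing, g' > 0
            count += 1
            signed += 1
        elif prev >= 0.0 > cur:    # downward crossing, g' < 0
            count += 1
            signed -= 1
        prev = cur
    return count, signed
```

The number of "copies" is even and their signed sum is zero, which is the toy version of the 0/0 encountered above.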
4 The same conclusion holds if the operator O is not gauge invariant, but simply BRST invariant.
In chapter 12, we have presented the worldline representation for a quantum field
theory in a continuous spacetime. However, a similar representation is also possible
for propagators in a field theory defined on a discrete spacetime. For the sake of
simplicity, let us first consider a free scalar field theory, defined on a cubic lattice
instead of a continuous spacetime. As we have seen earlier in this chapter, the second
derivatives $\partial_\mu\partial_\mu$ that appear in the inverse propagator are replaced by centered finite
differences, and with a Euclidean metric we have
\[
(\square + m^2)\, \phi(x) = m^2 \phi(x) + \sum_\mu \frac{2\,\phi(x) - \phi(x+\hat\mu) - \phi(x-\hat\mu)}{a^2} .
\tag{13.52}
\]
\[
(\square + m^2)^{-1}_{\text{lattice}} = \frac{1}{m^2 + 2d\,a^{-2}} \sum_{n=0}^\infty \Big( \frac{❉}{2d + m^2a^2} \Big)^n ,
\tag{13.54}
\]
(In d dimensions, there are 2d terms in the sum of the right hand side.) The operator
$❉/(2d + m^2a^2)$ realizes one hop from a lattice site to one of its nearest neighbors,
with a probability $(2d + m^2a^2)^{-1}$ for a jump in any given direction. Raised to the
Figure 13.11: Worldlines on a cubic lattice. The points x and x′ are materialized by the two little balls, and we have represented three different paths on the lattice connecting these two points.
The propagator evaluated between the sites x and x ′ is proportional to the total
probability to connect these two sites by a sequence of jumps, regardless of its length
(because of the sum on n):
\[
(\square + m^2)^{-1}_{xx'}\Big|_{\text{lattice}} = \frac{1}{m^2 + 2d\,a^{-2}} \sum_{n=0}^\infty \frac{1}{(2d + m^2a^2)^n} \sum_{\gamma\in P_n(x,x')} 1 ,
\tag{13.56}
\]
where $P_n(x,x')$ is the set of all paths of length n drawn on the edges of the lattice
that connect x to x′ (see figure 13.11). Therefore, the second sum merely counts
the number of such paths. This number has an upper bound of $(2d)^n$, which implies
trivially the convergence of the sum on n in the massive case.
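The path-counting formula of eq. (13.56) can be tested directly in one dimension (d = 1). In the sketch below, repeated nearest-neighbour hops on a periodic lattice generate the number of paths of each length, the series is summed with the weights of eq. (13.56), and the result is checked by applying the lattice operator of eq. (13.52) to it. The lattice size, the value of $m^2a^2$ and the truncation `n_terms` are illustrative choices.

```python
def hop(v):
    """One nearest-neighbour hop on a periodic 1D lattice: v(x+1) + v(x-1)."""
    n = len(v)
    return [v[(x + 1) % n] + v[x - 1] for x in range(n)]


def propagator_from_paths(n_sites, source, m2a2, a=1.0, n_terms=400):
    """Sum over path lengths of (number of paths) / (2d + m^2 a^2)^n, for d = 1."""
    d = 1
    ratio = 1.0 / (2 * d + m2a2)
    walk = [0.0] * n_sites
    walk[source] = 1.0          # the single path of length zero
    total = [0.0] * n_sites
    for _ in range(n_terms):
        total = [t + w for t, w in zip(total, walk)]
        walk = [ratio * h for h in hop(walk)]   # weighted path counts of the next length
    prefactor = a * a / (2 * d + m2a2)          # equals 1 / (m^2 + 2 d / a^2)
    return [prefactor * t for t in total]


def apply_inverse_propagator(phi, m2a2, a=1.0):
    """Lattice operator of eq. (13.52) in d = 1, acting on phi."""
    n = len(phi)
    return [((m2a2 + 2.0) * phi[x] - phi[(x + 1) % n] - phi[x - 1]) / (a * a)
            for x in range(n)]
```

Applying `apply_inverse_propagator` to the output of `propagator_from_paths` should give one at the source site and zero elsewhere, up to the truncation error of the series, which decreases like $(2d/(2d+m^2a^2))^n$.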
Let us now consider a complex scalar field in a background Abelian gauge field.
The D’Alembertian is replaced by the square of the covariant derivative. In order to
maintain an exact gauge symmetry in the discrete lattice formulation, the gauge field
is represented by link variables Uµ (x) defined on each edge of the lattice, and the
\[
D^\mu_F \phi(x) \equiv \frac{U^\dagger_\mu(x)\,\phi(x+\hat\mu) - \phi(x)}{a} , \qquad
D^\mu_B \phi(x) \equiv \frac{\phi(x) - U_\mu(x-\hat\mu)\,\phi(x-\hat\mu)}{a} .
\tag{13.57}
\]
In order to evaluate the scalar propagator in this gauge background, we can reproduce
the derivation of the previous subsection, and the only change will arise in the
definition of the operator ❉, whose action now reads
\[
❉\, \phi(x) \equiv \sum_\mu \Big( U^\dagger_\mu(x)\,\phi(x+\hat\mu) + U_\mu(x-\hat\mu)\,\phi(x-\hat\mu) \Big) .
\tag{13.58}
\]
Note that the right hand side transforms like a scalar field at the point x under a gauge
transformation. The consequence of this modification is that the jumps performed by
the operator ❉ are now weighted by U(1) phases corresponding to the link variables
that appear in the right hand side of eq. (13.58). The lattice worldline representation
of the dressed scalar propagator is
\[
(D^2 + m^2)^{-1}_{xx'}\Big|_{\text{lattice}} = \frac{1}{m^2 + 2d\,a^{-2}} \sum_{n=0}^\infty \frac{1}{(2d + m^2a^2)^n} \sum_{\gamma\in P_n(x,x')} W_\gamma[A] ,
\tag{13.59}
\]
where Wγ [A] is the product of all the phases collected along the path γ, which
is nothing but the Wilson line defined on this path. This expression transforms as
expected for a dressed scalar propagator, thanks to the properties of Wilson lines.
A representation similar to eq. (13.59) is also possible for the one-loop quantum
effective action in the gauge background, that can be obtained by first noting that
$$
\ln(A) = \ln(1 - (1 - A)) = -\sum_{n=1}^{\infty} \frac{1}{n}\,(1 - A)^n . \tag{13.60}
$$
Note that now the paths γ involved in the sum are the closed paths starting and ending
at the point x, which is why only even values of the length are allowed.
In two dimensions, the representation of eq. (13.59) may be used in order to study the
properties of charged particles on an atomic lattice, under the influence of an external
electromagnetic field. In particular, when this field is purely magnetic and transverse
to the plane of the lattice (i.e. the field strength F12 is non-zero), this model is related
to the quantum Hall effect.
This relationship may also be exploited in order to derive explicit formulas for the
moments of the distribution of areas of random closed loops on a cubic lattice. For
this application, it is interesting to consider a 2-dimensional anisotropic lattice, with
lattice spacings a1,2 in the two directions. On this lattice, consider the propagator of
a massless scalar at equal points in the presence of a transverse magnetic field. It is
straightforward to generalize the previous derivation to the anisotropic case, and one
obtains the following expression for the propagator
$$
G(x, y) = \frac{a^2}{4} \sum_{n_{1,2}=0}^{\infty} \Big(\frac{h_1}{4}\Big)^{n_1} \Big(\frac{h_2}{4}\Big)^{n_2} \sum_{\gamma\in P_{n_1,n_2}(x,y)} W_\gamma[A] , \tag{13.62}
$$
where $P_{n_1,n_2}(x, y)$ is the set of paths drawn on the edges of the lattice, that connect x
to y, and contain $n_1$ jumps in the first direction and $n_2$ jumps in the second direction.
In this formula, we have also defined
$$
\frac{2}{a^2} \equiv \frac{1}{a_1^2} + \frac{1}{a_2^2} , \qquad h_{1,2} \equiv \frac{a^2}{a_{1,2}^2} . \tag{13.63}
$$
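$$
G(0, 0) = \frac{a^2}{4} \sum_{n_{1,2}=0}^{\infty} \Big(\frac{h_1}{4}\Big)^{2n_1} \Big(\frac{h_2}{4}\Big)^{2n_2} \sum_{l=0}^{\infty} \frac{(i\Phi)^{2l}}{(2l)!} \sum_{\gamma\in P_{2n_1,2n_2}(0,0)} \big({\rm Area}(\gamma)\big)^{2l} . \tag{13.65}
$$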
(Because the area is algebraic, the odd moments are all zero.) In this formula, we have
made explicit the fact that a closed path must have an even number of hops in each
direction. On the other hand, the propagator can be determined perturbatively order
where $P_{2l}$ is a symmetric polynomial in $n_1, n_2$ of degree 2l. Note that the combinatorial
factor $(2(n_1+n_2))!/(n_1!^2\, n_2!^2)$ is the number of loops in $P_{2n_1,2n_2}(0, 0)$.
This expansion also provides a semi-explicit form of the polynomial $P_{2l}$, and the
evaluation of the first two terms gives,
$$
P_2(n_1, n_2) = \frac{n_1 n_2}{3} , \qquad
P_4(n_1, n_2) = \frac{n_1 n_2 \big(7 n_1 n_2 - (n_1 + n_2)\big)}{15} , \tag{13.68}
$$
$$
P_{2l}(n_1, 0) = P_{2l}(0, n_2) = 0 , \qquad P_{2l}(1, 1) = \frac{1}{3} . \tag{13.69}
$$
The first one is a consequence of the fact that if n1 or n2 is zero, then all the closed
paths one can construct have a vanishing area. The second one follows from the fact
that for n1 = n2 = 1, all the closed paths have area −1, 0 or +1, and therefore
contribute equally to all the even moments.
By summing the above results over all $n_1 + n_2 = n$, we obtain the moments of
the algebraic area of closed loops of length 2n, with no restriction on the respective
number of hops in each direction. Quite generally, these moments can be written
as a prefactor $(2n)!^2/n!^4$ (which is the number of closed loops of length 2n on a
2-dimensional lattice) multiplied by a rational fraction in n of degree 2l. For instance,
5 The product of the link variables along a closed loop does not depend on the choice of the gauge, as
can be seen from the following gauge transformation formulas for the link variables
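$$
\sum_{\gamma\in P_{2n}(0,0)} \big({\rm Area}(\gamma)\big)^2 = \frac{(2n)!^2}{n!^4}\, \frac{n^2 (n - 1)}{6(2n - 1)} ,
$$
$$
\sum_{\gamma\in P_{2n}(0,0)} \big({\rm Area}(\gamma)\big)^4 = \frac{(2n)!^2}{n!^4}\, \frac{n^3 (n-1)(7n^2 - 18n + 13)}{60(2n - 1)(2n - 3)} . \tag{13.70}
$$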
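The closed formulas (13.70) can be compared with a brute-force enumeration for small n. The sketch below is an illustration, not the derivation used in the text: it lists all $4^{2n}$ walks of length 2n on the square lattice, keeps the closed ones, and computes their algebraic area with the shoelace formula. The function names `brute_force_moments` and `formula_moments` are hypothetical helpers introduced for this check.

```python
from fractions import Fraction
from itertools import product
from math import factorial

STEPS = ((1, 0), (-1, 0), (0, 1), (0, -1))


def brute_force_moments(n):
    """Return (number of closed loops, sum of Area^2, sum of Area^4) for length 2n."""
    count = second = fourth = 0
    for path in product(STEPS, repeat=2 * n):
        x = y = twice_area = 0
        for dx, dy in path:
            twice_area += x * dy - y * dx    # shoelace increment for one lattice step
            x, y = x + dx, y + dy
        if x == 0 and y == 0:                # keep only the closed paths
            area = twice_area // 2           # twice_area is even for a closed lattice loop
            count += 1
            second += area ** 2
            fourth += area ** 4
    return count, second, fourth


def formula_moments(n):
    """Right-hand sides of eq. (13.70), evaluated with exact rational arithmetic."""
    count = Fraction(factorial(2 * n) ** 2, factorial(n) ** 4)
    second = count * Fraction(n ** 2 * (n - 1), 6 * (2 * n - 1))
    fourth = count * Fraction(n ** 3 * (n - 1) * (7 * n ** 2 - 18 * n + 13),
                              60 * (2 * n - 1) * (2 * n - 3))
    return count, second, fourth


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, brute_force_moments(n), formula_moments(n))
```

The enumeration grows like $16^n$, so it is only practical for the first few values of n. Its purpose is to cross-check the rational prefactors, not to replace the generating-function argument.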
Chapter 14
condensed matter physics). Thus, its energy and other conserved quantities are not
fixed. Instead, they fluctuate due to exchanges with the surroundings, that play the
role of a thermal reservoir. The appropriate statistical ensemble for describing this
situation is the (grand) canonical ensemble, in which the system is described by the
following density operator
$$ \rho \equiv \exp\big(-\beta H\big) , \tag{14.1} $$
$$ H\,|n\rangle = E_n\,|n\rangle . \tag{14.3} $$
From this representation, it is easy to see that the zero temperature limit selects the
state of lowest energy, i.e. the ground state of the Hamiltonian. Assuming that this
state is non-degenerate, this corresponds to a vacuum expectation value:
$$ \lim_{T\to 0} {\rm Tr}\,\big(\rho\, O\big) = \langle 0|\, O\, |0\rangle . \tag{14.5} $$
provided by eq. (14.4), and a similar formula for the denominator, which would fall
back to the perturbative rules we already know (since the temperature and chemical
potential appear only in the form of numerical prefactors). Note however a peculiarity
of the matrix elements that appear in eq. (14.4): $\langle n|$ and $|n\rangle$ are identical states
since they come from a trace (they are both in-states, since ρ defines the initial state
of the system). This is a bit different from the transition amplitudes that enter in
scattering cross-sections, where the matrix elements are evaluated between an in-state
and an out-state. The perturbative rules to compute these in-in expectation values are
provided by the Schwinger-Keldysh formalism introduced in the section 1.16.5.
A difficulty with this naive approach is that the number of states that contribute
significantly to the sum in eq. (14.4) is large at high temperature, especially when
the temperature is large compared to the masses of the fields (and even more so with
massless particles like photons). In fact, it is possible to encapsulate the sum over the
eigenstates $|n\rangle$ and the canonical weight of these states $\exp(-\beta E_n)$ directly into the
Schwinger-Keldysh rules, by a modest modification of its propagators.
To mimic closely the derivation of the Feynman rules at zero temperature, let us
consider an observable made of the time-ordered product of elementary fields:
where ti is the time at which the system is prepared in equilibrium, and U is the time
evolution operator defined by:
$$ U(t_2, t_1) \equiv T \exp\Big( i \int_{t_1}^{t_2} dx^0\, d^3x\; \mathcal{L}_I(\phi_{\rm in}(x)) \Big) , \tag{14.8} $$
with LI the interaction term in the Lagrangian. Thanks to eq. (14.7), we remove all
the interactions from the field, and relegate them into the evolution operator where
they can easily be Taylor expanded.
In the canonical ensemble at non-zero temperature, there is another source of
dependence on the interactions, hidden in the Hamiltonian inside the density operator.
Indeed, for the system to be in statistical equilibrium, the canonical density operator
should be defined with the same Hamiltonian as the one that drives the time evolution,
460 F. G ELIS – A S TROLL T HROUGH Q UANTUM F IELDS
i.e. a Hamiltonian that also contains the interactions of the system2 . If we decompose
the full Hamiltonian as H ≡ H0 + HI , we have
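$$
e^{-\beta H} = e^{-\beta H_0}\; \underbrace{T \exp\Big( i \int_{-t_i}^{-t_i - i\beta} dx^0\, d^3x\; \mathcal{L}_I(\phi_{\rm in}(x)) \Big)}_{U(-\infty - i\beta,\, -\infty)} . \tag{14.9}
$$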
(This formula in fact does not depend on ti ). It can be proven by noticing that right
and left hand sides are equal for β = 0, and by checking that their derivatives with
respect to β are also equal (for this, we use the fact that the derivative of the time
evolution with respect to its final time is known).
From the previous formulas, we can write
where the symbol P indicates a path ordering, and where the time integration contour
is C = [ti , +∞] ∪ [+∞, ti ] ∪ [ti , ti − iβ]:
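[Figure: the contour C in the complex time plane. It runs from $t_i$ to $+\infty$ along the real axis, comes back from $+\infty$ to $t_i$, and then goes down vertically from $t_i$ to $t_i - i\beta$.] (14.11)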
In this contour, ti is the time at which the system is prepared in thermal equilibrium.
As we shall see shortly, all observables are independent of this time, which physically
means that a system in equilibrium has no memory of when it was put in equilibrium.
Note also that in eq. (14.10), the times $x_1^0, \cdots, x_n^0$ are on the upper branch of the
contour (but this constraint will be relaxed shortly).
The time contour (14.11) is very similar to the contour of the figure 1.4, with the
addition of a vertical part that captures the interactions hidden in the density operator.
Since we had to extend the real time axis into the contour C, it is natural to extend
2 An alternative point of view is to decide that ρ is the density operator of the system at x0 = −∞.
There, we may turn off adiabatically the interactions, and therefore use only the free Hamiltonian inside ρ.
In this section, we derive the formalism for an initial equilibrium state specified at a finite time x0 = ti .
14. Q UANTUM FIELD THEORY AT FINITE TEMPERATURE 461
also the observable of eq. (14.6) to allow the field operators to be located anywhere
on C, with a path ordering instead of a time ordering,
O ≡ P φ(x1 ) · · · φ(xn ) . (14.12)
The expectation values of these operators can be encapsulated in the following
generating functional,
$$
Z[j] \equiv \frac{{\rm Tr}\,\Big( \rho\; P \exp i \int_C d^4x\; j(x)\phi(x) \Big)}{{\rm Tr}\,(\rho)} , \tag{14.13}
$$
where the fictitious source j(x) also lives on the contour C. In order to bring this
generating functional to a useful form, we can follow very closely the derivation of
the section 1.6.2, by first pulling out a factor that contains the interactions, and by
rearranging the ordering of the free factor with two successive applications of the
Baker-Campbell-Hausdorff formula. This leads to
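$$
Z[j] = \exp\Big( i \int_C d^4x\; \mathcal{L}_I\Big(\frac{\delta}{i\,\delta j(x)}\Big) \Big)\; \exp\Big( -\frac{1}{2} \int_C d^4x\, d^4y\; j(x)\, j(y)\, G^0(x, y) \Big) , \tag{14.14}
$$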
where the free propagator G0 (x, y), defined on the contour C, is given by
$$
G^0(x, y) \equiv \frac{{\rm Tr}\,\Big( e^{-\beta H_0}\; P\, \phi_{\rm in}(x)\phi_{\rm in}(y) \Big)}{{\rm Tr}\,\big( e^{-\beta H_0} \big)} . \tag{14.15}
$$
factor, that would be canceled since all expectation values are normalized by the factor 1/Tr (ρ).
$$ n_B(E) \equiv \frac{1}{e^{\beta E} - 1} . \tag{14.19} $$
This leads to the following formula for the free propagator:
$$
G^0(x, y) = \int \frac{d^3p}{(2\pi)^3\, 2E_p} \Big[ \big( \theta_c(x^0 - y^0) + n_B(E_p) \big)\, e^{-ip\cdot(x-y)} + \big( \theta_c(y^0 - x^0) + n_B(E_p) \big)\, e^{+ip\cdot(x-y)} \Big] , \tag{14.20}
$$
where θc generalizes the step function to the contour C (i.e. θc (x0 − y0 ) is non-zero
if x0 is posterior to y0 according to the contour ordering). This expression of the
propagator generalizes to a non-zero temperature the formula (1.119) (the Bose-
Einstein distribution goes to zero when T → 0). Let us postpone a bit the calculation
of the propagator in momentum space. For now, we just note the following rules for
the perturbative expansion in coordinate space:
1. Draw all the graphs (with vertices corresponding to the interactions of the
theory under consideration) that connect the n points of the observable. Graphs
containing disconnected subgraphs should be ignored. Each graph should be
weighted by its symmetry factor.
3. Each vertex brings a factor −iλ. The space-time coordinate of this vertex is
integrated out, but the time integration runs over the contour C.
Thus, the only differences with the zero temperature Feynman rules are the explicit
form of the free propagator, and the fact the time integrations are over the contour C
instead of the real axis.
The canonical density operator exp(−βH) can be viewed as an evolution operator for
an imaginary time shift, which implies the following formal identity
that contains a field whose time argument is the initial time ti (the other fields it
contains need not be specified in this discussion). Since ti is the “smallest” time on
the contour C, the field operator that carries it is placed to the rightmost position by
the path ordering. Thus, we have
$$ G(t_i, \cdots) = {\rm Tr}\,\Big( e^{-\beta H}\; P\big[ \cdots \big]\, \phi(t_i, \boldsymbol{x}) \Big) , \tag{14.23} $$
where the path ordering now applies only to the remaining (unwritten) fields. Using
the cyclic invariance of the trace and eq. (14.21), we then get
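$$
\begin{aligned}
G(t_i, \cdots) &= {\rm Tr}\,\Big( e^{-\beta H}\, \phi(t_i - i\beta, \boldsymbol{x})\; P\big[ \cdots \big] \Big) \\
&= {\rm Tr}\,\Big( e^{-\beta H}\; P\big[ \phi(t_i - i\beta, \boldsymbol{x}) \cdots \big] \Big) \\
&= G(t_i - i\beta, \cdots) ,
\end{aligned} \tag{14.24}
$$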
where in the second line we have used the fact that ti − iβ is the “latest” time on
the contour C in order to put back the operator carrying it inside the path ordering.
This equality is one of the forms of the Kubo-Martin-Schwinger (KMS) symmetry:
all bosonic path-ordered correlators take identical values at the two endpoints of the
contour C. Note that, although we have singled out the first field in the correlator, this
identity applies equally to all the fields.
The KMS symmetry is very closely tied to the fact that the system is in thermal
equilibrium, since it is satisfied only when the density operator is the canonical
equilibrium one. One of its consequences is that all the equilibrium correlation
functions are independent of the initial time ti . In order to prove this assertion, let us
first note that the free propagator satisfies the KMS symmetry, and does not contain ti
explicitly. A generic Feynman graph leads to time integrations that have the following
structure:
$$
G(x_1, \cdots, x_n) = \int_C dy_1^0 \cdots dy_p^0\;\; F(y_1^0, \cdots, y_p^0 \,|\, x_1, \cdots, x_n) . \tag{14.25}
$$
(We assume that the integrals over the positions at every vertex have already been
performed.) Since the free propagator does not depend on ti , the derivative of the
integral with respect to ti comes only from the endpoints of the integration contour,
and we can write
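$$
\begin{aligned}
\frac{\partial G(x_1, \cdots, x_n)}{\partial t_i} &= \sum_{i=1}^{p} \int_C \prod_{j\neq i} dy_j^0\; \Big[ F(\cdots, y_i^0 = t_i, \cdots \,|\, x_1, \cdots, x_n) \\
&\qquad\qquad\qquad\quad - F(\cdots, y_i^0 = t_i - i\beta, \cdots \,|\, x_1, \cdots, x_n) \Big] \\
&= 0 .
\end{aligned} \tag{14.26}
$$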
The vanishing result follows from the fact that the bracket in the integrand is zero,
since it is built from objects that obey the KMS symmetry. The independence with
Note that if φ is a real field, then q = −q∗ . Therefore, in order to have a non-zero
real valued charge, the field should be complex.
When there are additional conserved quantities such as Q, their conservation
constrains in a similar fashion how they may be exchanged with the heat bath. The
canonical equilibrium ensemble must be generalized into the grand canonical ensemble,
in which the density operator of the subsystem is given by
$$ \rho \equiv \exp\Big( -\beta \big( H + \mu Q \big) \Big) , \tag{14.28} $$
where q is the charge carried by the field on which the identity applies. Thus, the
values of correlation functions at the endpoints are equal up to a twist factor that
depends on the chemical potential.
The simplest field that can carry a non trivial charge is a complex scalar field. In
the interaction picture, it can be decomposed as follows on a basis of creation and
annihilation operators:
$$
\phi_{\rm in}(x) = \int \frac{d^3p}{(2\pi)^3\, 2E_p} \Big[ a_{p,{\rm in}}\, e^{-ip\cdot x} + b^\dagger_{p,{\rm in}}\, e^{+ip\cdot x} \Big] .
$$
(This field requires two sets {ap,in , bp,in } of such operators, because it describes
a particle which is distinct from its anti-particle.) With this field, it is possible to
construct a theory that has a global U(1) symmetry, corresponding to the conservation
of the following charge
$$
Q \equiv \int \frac{d^3p}{(2\pi)^3\, 2E_p} \Big[ b^\dagger_{p,{\rm in}} b_{p,{\rm in}} - a^\dagger_{p,{\rm in}} a_{p,{\rm in}} \Big] . \tag{14.30}
$$
and finally obtain the free propagator for a complex scalar carrying the charge q:
$$
G^0(x, y) = \int \frac{d^3p}{(2\pi)^3\, 2E_p} \Big[ \big( \theta_c(x^0 - y^0) + n_B(E_p - \mu q) \big)\, e^{-ip\cdot(x-y)} + \big( \theta_c(y^0 - x^0) + n_B(E_p + \mu q) \big)\, e^{+ip\cdot(x-y)} \Big] . \tag{14.32}
$$
14.2.7 Fermions
Consider now spin 1/2 fermions, whose interaction picture representation reads
$$
\psi_{\rm in}(x) = \sum_{s=\pm} \int \frac{d^3p}{(2\pi)^3\, 2E_p} \Big[ a^\dagger_{sp,{\rm in}}\, v_s(p)\, e^{+ip\cdot x} + b_{sp,{\rm in}}\, u_s(p)\, e^{-ip\cdot x} \Big] , \tag{14.33}
$$
where the creation and annihilation operators obey canonical anticommutation relations
(see eqs. (1.215)). Because they are anticommuting fields, a minus sign appears
in the derivation of the KMS identity:
$$ n_F(E) \equiv \frac{1}{e^{\beta E} + 1} , \tag{14.36} $$
and the free propagator reads
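$$
S^0(x, y) = \int \frac{d^3p}{(2\pi)^3\, 2E_p} \Big[ (\slashed{p}_+ + m) \big( \theta_c(x^0 - y^0) - n_F(E_p - \mu q) \big)\, e^{-ip\cdot(x-y)} + (\slashed{p}_- + m) \big( \theta_c(y^0 - x^0) - n_F(E_p + \mu q) \big)\, e^{+ip\cdot(x-y)} \Big] , \tag{14.37}
$$
with the notation $\slashed{p}_\pm \equiv \pm E_p \gamma^0 - \boldsymbol{p}\cdot\boldsymbol{\gamma}$.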
In perturbation theory, the logarithm of Z is obtained as the sum of all the connected
vacuum graphs at finite temperature. For instance, for a scalar field, its perturbative
expansion starts with the following diagrams:
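$$
\begin{aligned}
\text{Energy}: &\quad E = -\frac{\partial \ln(Z)}{\partial\beta} , \\
\text{Entropy}: &\quad S = \beta E + \ln(Z) , \\
\text{Free energy}: &\quad F = E - TS = -\frac{1}{\beta} \ln(Z) .
\end{aligned} \tag{14.39}
$$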
These quantities encode the bulk properties of the system, such as its equation of state
or the existence of phase transitions.
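As a small illustration of these relations, consider a single bosonic mode of energy ω, for which $\ln Z = -\beta\omega/2 - \ln(1 - e^{-\beta\omega})$. This toy model and the helper names `log_z` and `thermodynamics` are assumptions made for the example and are not taken from the text. The sketch obtains the energy as minus the β-derivative of ln Z by a finite difference, and checks that it equals $\omega(1/2 + n_B(\omega))$ and that $F = E - TS$.

```python
import math


def log_z(beta, omega):
    """ln Z of one bosonic mode of energy omega (zero-point energy included)."""
    return -0.5 * beta * omega - math.log1p(-math.exp(-beta * omega))


def thermodynamics(beta, omega, h=1e-5):
    """Energy, entropy and free energy derived from ln Z."""
    # Central finite difference for E = -d ln(Z) / d beta.
    energy = -(log_z(beta + h, omega) - log_z(beta - h, omega)) / (2.0 * h)
    entropy = beta * energy + log_z(beta, omega)
    free_energy = -log_z(beta, omega) / beta
    return energy, entropy, free_energy


if __name__ == "__main__":
    beta, omega = 2.0, 1.5
    energy, entropy, free_energy = thermodynamics(beta, omega)
    n_bose = 1.0 / math.expm1(beta * omega)
    print("E from ln Z      :", energy)
    print("omega (1/2 + n_B):", omega * (0.5 + n_bose))
    print("F - (E - T S)    :", free_energy - (energy - entropy / beta))
```

In an interacting field theory ln Z is not known in closed form, and the same derivatives would be applied order by order to its perturbative expansion.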
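$$
\omega \frac{dN_\gamma}{dt\, d^3\boldsymbol{x}\, d^3\boldsymbol{p}} \propto \int_{\substack{\text{unobserved} \\ \text{particles}}} \Big|\, [\text{gray blob}] \,\Big|^2\; n(\omega_1) \cdots n(\omega_n)\, \big(1 \pm n(\omega'_1)\big) \cdots \big(1 \pm n(\omega'_p)\big) , \tag{14.40}
$$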
where the integration is over the invariant phase-space of the unobserved incoming
and outgoing particles, weighted by the appropriate occupation factor (nB or nF for a
particle in the initial state, and 1 + nB or 1 − nF for a particle in the final state). In
this formula, the gray blob should be calculated with the finite-T Feynman rules.
The previous approach becomes rapidly cumbersome as the number of initial and
final state particles increases. The bookkeeping may be simplified by using a finite-T
generalization of the formula that relates the decay rate of a particle to the imaginary
part of its self-energy:
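$$
\omega \frac{dN_\gamma}{dt\, d^3\boldsymbol{x}\, d^3\boldsymbol{p}} \propto \frac{1}{e^{\omega/T} - 1}\; \underbrace{{\rm Im}\, \Pi^\mu{}_\mu(\omega, \boldsymbol{p})}_{\text{photon self-energy}} . \tag{14.41}
$$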
is related to the transport of momentum), etc. Note that in their simplest version,
these quantities do not depend on frequency (in fact, they are the zero frequency
limit of a 2-point function), and therefore they describe the response of the system to
an infinitely slow perturbation. They can be generalized into frequency dependent
quantities that also contain information about the response to a dynamical disturbance.
The standard approach for evaluating transport coefficients is to use the Green-
Kubo formula, that relates the transport coefficient to the 2-point correlation function
of a current J that couples to the quantity of interest (electrical charge, momentum,
etc):
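$$
\text{transport coefficient} \sim \lim_{\omega\to 0} \frac{1}{\omega}\, {\rm Im} \int_0^{+\infty} dt\, d^3\boldsymbol{x}\;\; e^{-i\omega t}\, \Big\langle \big[ J(t, \boldsymbol{x}), J(0, \boldsymbol{0}) \big] \Big\rangle . \tag{14.42}
$$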
The physical meaning of this formula is that the system is perturbed at the origin by a
current J, and one measures the linear response by evaluating the same current at a
generic point (t, x). The transport coefficient is proportional to the Fourier transform
of this correlation function at zero energy and momentum. Note that this formula
contains the commutator of the two currents, since one wants the two points to be
causally connected.
The perturbative rules that we have derived so far are expressed in coordinate space,
which is usually not very appropriate for explicit calculations. The standard way of
turning them into a set of rules in momentum space is to Fourier transform all the
propagators, and to rely on the fact that the Fourier transform of a convolution product
is the ordinary product of the Fourier transforms, i.e. symbolically
$$ {\rm FT}\big( F * G \big) = {\rm FT}\big( F \big) \times {\rm FT}\big( G \big) . \tag{14.43} $$
However, the main difficulty in doing this at finite temperature is that the time
integration in the “convolution product” involves an integration over the complex-
shaped contour C, which makes it unclear whether we may use the above identity.
Two main solutions to this problem have been devised. The first one is the
imaginary time formalism, also known as the Matsubara formalism, that we have
already presented superficially in the section 2.8.2. The main motivation of this
formulation is that the quantities that describe the thermodynamics of a system in
thermal equilibrium are time independent. Therefore, one may exploit the freedom to
deform the contour C in order to simplify it, as shown in the following figure:
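[Figure: the contour C, which runs from $t_i$ along the real axis and back, and then down to $t_i - i\beta$, is deformed into a vertical segment from 0 to $-i\beta$.]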
It is customary to denote x0 = −iτ, so that the variable τ is real and spans the range
[0, β) (the point τ = β should be removed – indeed, because of the KMS symmetry, it
is redundant with the point τ = 0). The imaginary time formalism corresponds to the
Feynman rules derived earlier, specialized to this purely imaginary time contour. Note
that one could in principle use this formalism in order to calculate time-dependent
quantities. One would first obtain them as a function of imaginary times τ1 , τ2 , · · ·
and their dependence upon real times x01 , x02 , · · · may then be obtained by an analytic
continuation.
From the KMS symmetry, we see that the propagator, and more generally the
integrand of any Feynman diagram, is periodic (for bosons) in the variable τ with
period β. Therefore, one can go to Fourier space by decomposing the time dependence
in the form of a Fourier series and by doing an ordinary Fourier transform in space :
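$$
G^0(\tau_x, \boldsymbol{x}, \tau_y, \boldsymbol{y}) \equiv T \sum_{n=-\infty}^{+\infty} \int \frac{d^3\boldsymbol{p}}{(2\pi)^3}\; e^{i\omega_n(\tau_x - \tau_y)}\, e^{-i\boldsymbol{p}\cdot(\boldsymbol{x} - \boldsymbol{y})}\; G^0(\omega_n, \boldsymbol{p}) , \tag{14.44}
$$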
with $\omega_n \equiv 2\pi n T$. These discrete frequencies are called Matsubara frequencies.
Note that for fermions, the propagator is antiperiodic with period β, and the discrete
frequencies that appear in the Fourier series are $\omega_n = 2\pi(n + \tfrac{1}{2})T$. Moreover, if the
line carries a conserved charge q, the Matsubara frequencies are shifted by $-i\mu q$, i.e.
$\omega_n \to \omega_n - i\mu q$ (µ is the chemical potential associated with this conservation law).
In the case of scalar fields, an explicit calculation gives the following free bosonic
propagator in Fourier space,
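$$
G^0(\omega_n, \boldsymbol{p}) = \frac{1}{\omega_n^2 + \boldsymbol{p}^2 + m^2} \equiv \frac{1}{P^2 + m^2} . \tag{14.45}
$$
(For the sake of brevity, we denote $P^2 \equiv \omega_n^2 + \boldsymbol{p}^2$.) Note that, up to a factor −i,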
this propagator is the usual free zero temperature Feynman propagator in which
one has substituted p0 → iωn . Let us list here the Feynman rules for perturbative
calculations in this formalism:
• Propagators :
$$ G^0(\omega_n, \boldsymbol{p}) = \frac{1}{P^2 + m^2} , $$
• Vertices : each vertex brings a factor λ. Moreover, the sum of the ωn ’s and of
the p’s that enter into each vertex are zero,
• Loops :
$$
T \sum_{n\in\mathbb{Z}} \int \frac{d^3\boldsymbol{p}}{(2\pi)^3} \equiv \sum\!\!\!\!\!\!\!\int_{P} .
$$
(The right-hand side of this equation is a frequently used compact notation for
the combination of discrete sums and integrals that appear in the Matsubara
formalism. This notation includes a factor T that makes its dimension equal to
four in four space-time dimensions.)
As an illustration of the use of this formalism, let us give two examples of vacuum
graphs:
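$$
[\text{two-loop vacuum graph with one quartic vertex}] = \frac{\lambda}{8}\, \sum\!\!\!\!\!\!\!\int_{P} \sum\!\!\!\!\!\!\!\int_{Q} \frac{1}{(P^2 + m^2)(Q^2 + m^2)} ,
$$
$$
[\text{two-loop vacuum graph with two cubic vertices}] = \frac{g^2}{6}\, \sum\!\!\!\!\!\!\!\int_{P} \sum\!\!\!\!\!\!\!\int_{Q} \frac{1}{(P^2 + m^2)(Q^2 + m^2)((P+Q)^2 + m^2)} . \tag{14.46}
$$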
The Fourier space version of the Matsubara formalism is structurally very similar
to the zero temperature Feynman rules, which makes it quite appealing. There is
one caveat however: the continuous integrations over energies are now replaced by
discrete sums, which are considerably harder to calculate. Let us expose here two
general methods for evaluating these sums. The first one is based on the following
representation of the propagator of eq. (14.45):
$$
G^0(\omega_n, \boldsymbol{p}) = \frac{1}{2E_p} \int_0^\beta d\tau\; e^{-i\omega_n\tau}\, \Big[ (1 + n_B(E_p))\, e^{-E_p\tau} + n_B(E_p)\, e^{E_p\tau} \Big] , \tag{14.47}
$$
where the integrand in the right hand side is a mixed representation that depends on
the momentum p and the imaginary time τ. By replacing each propagator of a given
graph by this formula, the discrete sums can be easily performed since they are all of
the form
$$ \sum_{n\in\mathbb{Z}} e^{i\omega_n\tau} = \beta \sum_{n\in\mathbb{Z}} \delta(\tau - n\beta) . \tag{14.48} $$
(The left hand side is obviously periodic in τ with period β, which is ensured in the
right hand side by the sum over infinitely many shifted copies of the delta function.)
At this point, one has to integrate over the τ’s that have been introduced when
replacing the propagators by (14.47), but these integrals are straightforward since
the dependence on these times is in the form of delta functions and exponentials.
Moreover, only a finite number of the delta functions that appear in the right hand
side of eq. (14.48) actually contribute, due to the constraint that each τ must be in
the range [0, β). As an illustration, consider the evaluation of the 1-loop tadpole in a
scalar theory with quartic coupling:
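$$
\begin{aligned}
[\text{1-loop tadpole}] &= \frac{\lambda}{2}\, \sum\!\!\!\!\!\!\!\int_{P} \frac{1}{P^2 + m^2} \\
&= \frac{\lambda}{2} \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_p} \int_0^\beta d\tau\; \sum_n \delta(\tau - n\beta)\, \Big[ (1 + n_B(E_p))\, e^{-E_p\tau} + n_B(E_p)\, e^{E_p\tau} \Big] \\
&= \frac{\lambda}{2} \int \frac{d^3\boldsymbol{p}}{(2\pi)^3\, 2E_p}\, \Big[ 1 + 2\, n_B(E_p) \Big] \\
&= \lambda\, \Big[ \frac{\Lambda^2}{16\pi^2} + \frac{T^2}{24} + \cdots \Big] ,
\end{aligned} \tag{14.49}
$$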
where Λ is an ultraviolet cutoff that restricts the integration range |p| ≤ Λ (the final
formula assumes that Λ ≫ T , and we have not written the terms that depend on the
mass). The first term is the usual zero temperature ultraviolet divergence, while the
term coming from the Bose-Einstein distribution exists only at non-zero temperature.
This second term is ultraviolet finite, thanks to the exponential suppression of the
Bose-Einstein distribution at large energy. We can already note on this example that
the ultraviolet divergences are identical to the zero temperature ones. This is a general
property: if the action has already been renormalized at zero temperature, there are no
additional ultraviolet divergences at finite temperature. This is quite clear on physical
grounds: being at finite temperature means that one has a dense medium in which the
average inter-particle distance is T −1 . However, in the ultraviolet limit, one probes
distance scales that are much smaller than the inverse temperature, for which the
effects of the surrounding medium are irrelevant.
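The steps that lead to eq. (14.49) can be cross-checked numerically. The following sketch makes no claim to be the method of the text. It verifies three statements for arbitrarily chosen test values of the energy and temperature: the mixed representation (14.47) reproduces $1/(\omega_n^2 + E^2)$, the Matsubara sum $T\sum_n 1/(\omega_n^2 + E^2)$ equals $(1 + 2n_B(E))/(2E)$, and the thermal part of the massless tadpole gives the coefficient $T^2/24$. All function names are hypothetical helpers.

```python
import numpy as np


def n_bose(energy, T):
    """Bose-Einstein distribution 1 / (exp(E/T) - 1)."""
    return 1.0 / np.expm1(energy / T)


def mixed_propagator(tau, energy, T):
    """Integrand of eq. (14.47) without the phase, for 0 <= tau <= 1/T."""
    n = n_bose(energy, T)
    return ((1.0 + n) * np.exp(-energy * tau) + n * np.exp(energy * tau)) / (2.0 * energy)


def fourier_coefficient(n, energy, T, n_grid=20001):
    """Trapezoidal estimate of the tau integral in eq. (14.47)."""
    tau = np.linspace(0.0, 1.0 / T, n_grid)
    values = np.exp(-1j * 2.0 * np.pi * n * T * tau) * mixed_propagator(tau, energy, T)
    step = tau[1] - tau[0]
    return step * (values.sum() - 0.5 * (values[0] + values[-1]))


def frequency_sum(energy, T, n_max):
    """T times the sum over |n| <= n_max of 1 / (omega_n^2 + E^2)."""
    omega = 2.0 * np.pi * T * np.arange(-n_max, n_max + 1)
    return T * np.sum(1.0 / (omega ** 2 + energy ** 2))


def thermal_tadpole_coefficient(T, p_max_over_T=60.0, n_grid=200000):
    """Midpoint estimate of (1/(4 pi^2)) int_0^inf dp p n_B(p), massless case."""
    step = p_max_over_T * T / n_grid
    p = (np.arange(n_grid) + 0.5) * step      # midpoints avoid the point p = 0
    return step * np.sum(p * n_bose(p, T)) / (4.0 * np.pi ** 2)


if __name__ == "__main__":
    energy, T = 1.3, 0.7
    omega_2 = 2.0 * np.pi * 2 * T
    print(fourier_coefficient(2, energy, T).real, 1.0 / (omega_2 ** 2 + energy ** 2))
    print(frequency_sum(energy, T, 200000), (1.0 + 2.0 * n_bose(energy, T)) / (2.0 * energy))
    print(thermal_tadpole_coefficient(T), T ** 2 / 24.0)
```

The direct frequency sum converges only like 1/n_max, which illustrates why the two analytic methods discussed in this section are preferable to a brute-force summation.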
An alternate method for evaluating the sums over the discrete Matsubara frequencies
is to note that the function
$$ P(z) \equiv \frac{\beta}{e^{\beta z} - 1} \tag{14.50} $$
has simple poles of residue 1 at all the z = iωn . Therefore, we can write
$$ \sum_{n\in\mathbb{Z}} f(i\omega_n) = \oint_\gamma \frac{dz}{2i\pi}\; P(z)\, f(z) , \tag{14.51} $$
contour γ as shown in the middle of the figure 14.1. For this transformation to hold
as is, with no extra term, the function f(z) should not have any pole on the imaginary
axis, which is usually the case. Finally, a second deformation brings the contour along
the real axis. If the function f(z) has poles, the new contour should wrap around these
poles, which gives an additional contribution. Thus, after these transformations, the discrete
sum over the Matsubara frequencies has been rewritten as a continuous integral along
the real axis (and the weight P(z) becomes an ordinary Bose-Einstein distribution),
plus some isolated contributions coming from poles of the summand.
statistical distribution nB (Ep ) by nB (|p0 |) in the equations (14.52). See the discussion after eq. (14.60).
need for the vertical part of the time contour. Let us call + and − respectively the
upper and lower horizontal branches of the contour. We may then break down the free
propagator $G^0(x, y)$ into four propagators $G^0_{++}$, $G^0_{--}$, $G^0_{+-}$ and $G^0_{-+}$ depending on
where x, y are located, and Fourier transform each of them separately. For a scalar
field, this gives:
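$$
\begin{aligned}
G^0_{++}(p) &= \frac{i}{p^2 - m^2 + i\epsilon} + 2\pi\, n_B(E_p)\, \delta(p^2 - m^2) , \\
G^0_{+-}(p) &= 2\pi\, \big( \theta(-p^0) + n_B(E_p) \big)\, \delta(p^2 - m^2) , \\
G^0_{--}(p) &= \big[ G^0_{++}(p) \big]^* , \qquad G^0_{-+}(p) = G^0_{+-}(-p) .
\end{aligned} \tag{14.52}
$$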
Note that these propagators are very closely related to those of the Schwinger-Keldysh
formalism at zero temperature (see eqs. (1.367)), since we have
$$
\text{for } \epsilon, \epsilon' = \pm , \qquad G^0_{\epsilon\epsilon'}(p) = \Big[ G^0_{\epsilon\epsilon'}(p) \Big]_{T=0} + 2\pi\, n_B(E_p)\, \delta(p^2 - m^2) . \tag{14.53}
$$
The rules for the vertices and loops are identical to those of the Schwinger-Keldysh
formalism at zero temperature, namely:
• One must assign types + and − to the vertices of a diagram in all the possible
ways,
• Each vertex of type + brings a factor −iλ and each type − vertex a factor +iλ,
• A vertex of type ǫ and a vertex of type ǫ ′ are connected by the free propagator
G0ǫǫ ′ ,
• Each loop momentum must be integrated with the measure d4 p/(2π)4 .
Finally, let us note for later use that the four propagators of eqs. (14.52) can be
related to the zero temperature Feynman propagator and its complex conjugate by the
following formula:
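$$
\begin{pmatrix} G^0_{++} & G^0_{+-} \\ G^0_{-+} & G^0_{--} \end{pmatrix}
= U \begin{pmatrix} G^0_F & 0 \\ 0 & G^{0\,*}_F \end{pmatrix} U \tag{14.55}
$$
with
$$
U(p) \equiv \begin{pmatrix}
\sqrt{1 + n_B} & \dfrac{\theta(-p^0) + n_B}{\sqrt{1 + n_B}} \\[2mm]
\dfrac{\theta(+p^0) + n_B}{\sqrt{1 + n_B}} & \sqrt{1 + n_B}
\end{pmatrix} \tag{14.56}
$$
and
$$ G^0_F(p) \equiv \frac{i}{p^2 - m^2 + i\epsilon} . \tag{14.57} $$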
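The factorization (14.55) is a purely algebraic statement that can be tested numerically at a fixed momentum. In the sketch below, the delta function is regularized by keeping ε finite, so that $G^0_F + G^{0*}_F = 2\epsilon/((p^2 - m^2)^2 + \epsilon^2)$ plays the role of $2\pi\delta(p^2 - m^2)$. This regularization, the use of $|p^0|$ as the argument of $n_B$, and the function names are assumptions made for the illustration.

```python
import numpy as np

SIGMA3 = np.diag([1.0, -1.0])


def theta(x):
    """Step function."""
    return 1.0 if x > 0 else 0.0


def rotation_matrix(p0, n_b):
    """Matrix U(p) of eq. (14.56)."""
    root = np.sqrt(1.0 + n_b)
    return np.array([[root, (theta(-p0) + n_b) / root],
                     [(theta(p0) + n_b) / root, root]])


def feynman_propagator(p0, p_abs, m, eps):
    """Zero temperature Feynman propagator of eq. (14.57), with finite eps."""
    return 1j / (p0 ** 2 - p_abs ** 2 - m ** 2 + 1j * eps)


def schwinger_keldysh_matrix(p0, p_abs, m, T, eps=1e-3):
    """Right-hand side of eq. (14.55), with n_B evaluated at |p0| (assumption)."""
    n_b = 1.0 / np.expm1(abs(p0) / T)
    g_f = feynman_propagator(p0, p_abs, m, eps)
    u = rotation_matrix(p0, n_b)
    return u @ np.diag([g_f, np.conj(g_f)]) @ u, g_f, n_b


if __name__ == "__main__":
    p0, p_abs, m, T = 0.8, 0.5, 0.3, 1.0
    g, g_f, n_b = schwinger_keldysh_matrix(p0, p_abs, m, T)
    delta = g_f + np.conj(g_f)               # regularized 2 pi delta(p^2 - m^2)
    u = rotation_matrix(p0, n_b)
    print("G++ :", g[0, 0], "expected", g_f + n_b * delta)
    print("G+- :", g[0, 1], "expected", (theta(-p0) + n_b) * delta)
    print("U sigma3 U = sigma3 :", np.allclose(u @ SIGMA3 @ u, SIGMA3))
```

The identity $U\sigma_3 U = \sigma_3$ checked in the last line holds for any value of $n_B$, independently of the propagators placed between the two factors of U.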
the 2 × 2 matrix of Schwinger-Keldysh propagators, without and with the mass, and
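$$
\mathbb{D}_0 \equiv \begin{pmatrix} G^0_F & 0 \\ 0 & G^{0\,*}_F \end{pmatrix}_{m=0} , \qquad
\mathbb{D}_m \equiv \begin{pmatrix} G^0_F & 0 \\ 0 & G^{0\,*}_F \end{pmatrix}_{m} \tag{14.59}
$$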
the corresponding diagonal matrices made of the Feynman propagator and its complex
conjugate. The massive propagators obtained by explicitly summing the mass term
are given by
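$$
\begin{aligned}
\mathbb{G}_m &= \sum_{n=0}^{\infty} \mathbb{G}_0 \Big[ -i m^2\, \sigma_3\, \mathbb{G}_0 \Big]^n \\
&= U\, \Big\{ \sum_{n=0}^{\infty} \mathbb{D}_0 \Big[ -i m^2\, U\sigma_3 U\, \mathbb{D}_0 \Big]^n \Big\}\, U \\
&= U\, \Big\{ \sum_{n=0}^{\infty} \mathbb{D}_0 \Big[ -i m^2\, \sigma_3\, \mathbb{D}_0 \Big]^n \Big\}\, U = U\, \mathbb{D}_m\, U .
\end{aligned} \tag{14.60}
$$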
In the first line, the third Pauli matrix σ3 provides the necessary signs for the vertices
of types + and − in the Schwinger-Keldysh formalism. The third line uses the fact
that $U\sigma_3 U = \sigma_3$. In the final result, only the matrix $\mathbb{D}$ is affected by the mass,
while the matrix U has remained unchanged. If we use the on-shell energy $E_p$ as the
argument of $n_B$ in the matrix U, then this argument is $|\boldsymbol{p}|$ since we started from a
massless propagator. With this choice, the final result would be inconsistent, since the
poles of the massive propagator are at $p_0 = \pm\sqrt{\boldsymbol{p}^2 + m^2}$ (since the matrix $\mathbb{D}_m$ in
the middle now contains the mass), but the statistical information contained in the
U’s is still massless. In contrast, using |p0 | as the argument of nB ensures that the
energy inside nB follows the poles of the propagator, and correctly picks the change
due to the mass. We also see that the (incorrect) prescription nB (Ep ) is equivalent to
neglecting the vertical path of the contour, since it amounts to keeping the interactions
(here, the mass term, treated as an interaction) in the time evolution of the system but
not in the density operator.
Change of basis : All the objects that appear in the Schwinger-Keldysh formalism
carry indices that take the values + or −. Variants of this formalism may be obtained
by performing linear combinations of these two indices, akin to a change of basis.
For any n-point function G{ǫi } in the ± basis, we may define
$$
G_{\{X_i\}}(k_1, \cdots, k_n) \equiv \sum_{\{\epsilon_i = \pm\}} G_{\{\epsilon_i\}}(k_1, \cdots, k_n)\; \prod_{i=1}^{n} U_{X_i \epsilon_i}(k_i) , \tag{14.61}
$$
where U is an invertible “rotation” matrix. The new indices Xi also take two values,
that we may denote 1 and 2. For consistency, the vertex functions obtained by
amputating Feynman graphs of their external lines must be related by
$$
\Gamma^{\{X_i\}}(k_1, \cdots, k_n) \equiv \sum_{\{\epsilon_i = \pm\}} \Gamma^{\{\epsilon_i\}}(k_1, \cdots, k_n)\; \prod_{i=1}^{n} V^{X_i \epsilon_i}(k_i) , \tag{14.62}
$$
In particular, this formula gives the expression of the vertices in the new formalism.
For instance, in a φ4 scalar theory, we have
$$
-i\lambda^{ABCD}(k_1, \cdots, k_4) = -i\lambda \sum_{\epsilon = \pm} \epsilon\; V^{A\epsilon}(k_1)\, V^{B\epsilon}(k_2)\, V^{C\epsilon}(k_3)\, V^{D\epsilon}(k_4) . \tag{14.64}
$$
Note that the new vertices may be momentum dependent if the rotation matrix is.
Moreover, there could be up to $2^4$ non-zero vertices, while there are only two in the
original Schwinger-Keldysh formalism (but we will see shortly that these rotations
may reduce the number of non-zero entries for the propagators, which is sometimes
an advantage). The n-point functions in the new basis may be obtained directly in
perturbation theory, in terms of Feynman diagrams made of the bare propagators and
vertices of the new basis.
(For fermionic lines, we must replace nB by −nF , and shift the argument by −qµ
if the line carries a conserved charge.) This formalism, compared to the original
Schwinger-Keldysh one, has a number of advantages:
• Thanks to eq. (14.69), the Bose-Einstein (or Fermi-Dirac in the case of fermions)
functions are conveniently factorized in each Feynman graph,
• In this formalism, the two identities (14.54) satisfied by n-point functions take
a particularly simple form,
$$ \Gamma^{A\cdots A} = \Gamma^{R\cdots R} = 0 , \tag{14.70} $$
that has two nested tadpoles. Let us assume that the uppermost tadpole has already
been combined with the corresponding 1-loop ultraviolet counterterm, so that only
the finite part remains, and denote µ2 the finite remainder. From eq. (14.49), its
expression is given by
$$ \mu^2 \equiv \frac{\lambda T^2}{24} . \tag{14.72} $$
(This is the exact result for the temperature dependent part in a massless theory.) With
this shorthand, we have
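$$
\begin{aligned}
[\text{diagram: two nested tadpoles}]
&= \frac{\lambda\mu^2}{2} \sum\!\!\!\!\!\!\!\int_{P} \frac{1}{(P^2)^2} \\
&= \frac{\lambda\mu^2}{2} \int \frac{d^3p}{(2\pi)^3}\;
\underbrace{\frac{n_B(p)\,(1+n_B(p))}{4p^2}\left[\frac{2}{T}+\frac{e^{\beta p}-e^{-\beta p}}{p}\right]}_{\approx\; T/p^4 \ \ (p\ll T)} \\
&= \frac{\lambda\mu^2\, T}{4\pi^2\,\Lambda_{\rm IR}} + \text{infrared finite terms} \; ,
\end{aligned}
\tag{14.73}
$$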
where in the last line we have introduced an infrared cutoff ΛIR in order to prevent
a divergence at the lower end of the integration range. A similar calculation would
indicate an even worse infrared singularity in the following 3-loop graph:
$$
[\text{3-loop diagram}] \sim \lambda T \mu\; \frac{\mu^3}{\Lambda_{\rm IR}^3} + \text{infrared finite terms} \; , \tag{14.74}
$$
and more generally for n insertions of the base tadpole on the main loop,
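$$
[\text{diagram with } n \text{ insertions}] \sim \lambda T \mu \left(\frac{\mu}{\Lambda_{\rm IR}}\right)^{2n-1} + \text{infrared finite terms} \; . \tag{14.75}
$$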
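$$
[\text{tadpole}] + [\text{1 insertion}] + [\text{2 insertions}] + \cdots = [\text{tadpole with thick propagator}]
= \frac{\lambda}{2} \int \frac{d^3p}{(2\pi)^3}\; \frac{1 + 2\, n_B\big(\sqrt{p^2+\mu^2}\big)}{2\sqrt{p^2+\mu^2}} \; . \tag{14.76}
$$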
(The thicker propagator indicates a massive scalar with squared mass µ².) The procedure
used here, which consists in summing an infinite subset of (individually divergent)
perturbative contributions, is a simple form of resummation. We can readily see that
it leads to an infrared finite sum, since now the quantity µ² plays the role of a cutoff
at small momentum.
Let us now estimate the contribution of the infrared sector to this integral. At
weak coupling, we have µ ∼ √λ T ≪ T. Therefore, for momenta p ∼ µ, we have
$$
\lambda \int dp\; p^2\; \frac{1 + 2\, n_B\big(\sqrt{p^2+\mu^2}\big)}{\sqrt{p^2+\mu^2}}
\sim \lambda \int dp\; \frac{p^2\, T}{p^2+\mu^2}
\sim \lambda T \mu \sim \lambda^{3/2}\, T^2 \; . \tag{14.77}
$$
This contribution comes in addition to the ultraviolet divergence λΛ² and the
contribution λT² that are both contained in the first diagram of the resummed series (these
terms come from momenta of order T or above). We observe here an unexpected
feature: the appearance of half powers of the coupling constant λ. On the surface, this
is quite surprising since the power counting indicates that one power of λ should come
with each loop. This oddity is in fact a consequence of the infrared behaviour of the
Bose-Einstein distribution, in T/E, combined with the fact that the µ introduced in the
resummation is of order λ^{1/2}. Although the loop expansion generates a series which
is analytic in λ, this property may be broken if some parameters in the integrands
depend on λ^{1/2}.
as much as possible of the large contributions coming from loop corrections to the
propagator. The 1-loop contribution in λT² is an obvious candidate for including in
µ², since for momenta p² ≲ λT² this is indeed a large correction to the denominator
of the propagator. At small coupling λ ≪ 1, this is the dominant one. However, when
the coupling increases, the propagator may receive additional large corrections from
higher order loop corrections, and an improved resummation scheme could include
these additional corrections.
A further improvement, sometimes considered in applications, is to leave µ²
free and to use some reasonable condition to choose an “optimal” value. This condition
may be, for instance, the minimization of the 1-loop correction, which in a sense
would indicate that the resummation has shifted most of this loop contribution into
the free propagator. One may then try to achieve
$$
0 = [\text{tadpole with thick propagator}] + \text{counterterms}
= \frac{\lambda}{2} \int^{\Lambda} \frac{d^3p}{(2\pi)^3}\; \frac{1 + 2\, n_B\big(\sqrt{p^2+\mu^2}\big)}{2\sqrt{p^2+\mu^2}}
- \frac{\lambda \Lambda^2}{16\pi^2} - \mu^2 \; , \tag{14.79}
$$
where the two subtractions are respectively the ultraviolet counterterm and the finite
counterterm necessary in order not to overcount the mass µ². This equation, which
provides an implicit definition of the mass µ², is called a gap equation⁵. Because this
equation is non-linear in µ², its solution contains all orders in λ, but at small λ it is
dominated by the 1-loop result µ² = λT²/24.
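To make the structure of such a gap equation more concrete, here is a minimal numerical sketch in Python. It rests on a simplifying assumption that is ours and not part of the derivation above: only the temperature dependent part of the tadpole in eq. (14.79) is kept, i.e. µ² = (λ/4π²) ∫ dp p² n_B(E)/E with E = √(p²+µ²), and the zero temperature piece is dropped altogether. The function names, the momentum cutoff at 40 T and the fixed-point iteration are illustrative choices.

```python
import math


def thermal_tadpole(mu2, lam, T, p_max_over_T=40.0, n_steps=20000):
    """Thermal part of the resummed tadpole,
    (lam / 4 pi^2) * int_0^p_max dp p^2 n_B(E) / E,  E = sqrt(p^2 + mu2),
    evaluated with the composite Simpson rule (n_steps must be even)."""
    p_max = p_max_over_T * T
    h = p_max / n_steps

    def integrand(p):
        E = math.sqrt(p * p + mu2)
        # n_B(E) = 1 / (exp(E/T) - 1); expm1 is accurate at small E/T
        return p * p / (E * math.expm1(E / T))

    total = integrand(0.0) + integrand(p_max)
    for i in range(1, n_steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return lam / (4.0 * math.pi ** 2) * total * h / 3.0


def solve_gap_equation(lam, T, tol=1e-12, max_iter=200):
    """Fixed-point iteration mu2 -> thermal_tadpole(mu2), started from
    the 1-loop value lam * T^2 / 24."""
    mu2 = lam * T * T / 24.0
    for _ in range(max_iter):
        new_mu2 = thermal_tadpole(mu2, lam, T)
        if abs(new_mu2 - mu2) < tol * new_mu2:
            return new_mu2
        mu2 = new_mu2
    return mu2


if __name__ == "__main__":
    T = 1.0
    for lam in (0.01, 0.1, 1.0):
        mu2 = solve_gap_equation(lam, T)
        print(lam, mu2, mu2 / (lam * T * T / 24.0))
```

In this truncated version, the ratio of the solution to λT²/24 approaches one at small λ and deviates from it as λ grows, which is the qualitative behaviour described above.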
We show an application of this method to the calculation of the free energy F in
figure 14.2. In this figure, the results obtained at 1-loop and 2-loops in screened
perturbation theory are compared to the first two orders (λ and λ^{3/2}) of the ordinary
perturbative expansion. Firstly, we can see that the latter is quite unstable except at
low coupling: the two successive orders differ substantially, and even the sign of the
correction due to the interactions flips. In contrast, screened perturbation theory leads
to a remarkably stable result, with very small changes when going from 1-loop to
2-loops. To a large extent, this success is due to the non-trivial coupling dependence
of the mass µ², acquired by solving the gap equation (14.79) (screened perturbation
theory with only the 1-loop mass would be better than strict perturbation theory, but
would encounter some difficulties at large coupling).
particle, generating a “gap” in the spectrum, and thus requiring a non-zero energy to create such a particle.
[Figure: […] increasing temperature. Thick dark curve: potential […] non-trivial minima at
low temperature, leading to spontaneous symmetry breaking. Thick light curve: potential at
the critical temperature. Horizontal axis: φ.]
temperature. Let us consider for instance a scalar theory whose potential at zero
temperature is
$$
V(\phi) = -\frac{m_0^2}{2}\,\phi^2 + \frac{\lambda}{4!}\,\phi^4 \; . \tag{14.80}
$$
Because of the sign of the mass term, this potential has two degenerate minima. The
true vacuum of this theory is at a non-zero value of φ, and the discrete symmetry
φ → −φ is thus spontaneously broken. When the temperature increases, the thermal
fluctuations generate a positive correction to the square of the mass, proportional to
λT². Eventually m² becomes positive, i.e. the potential has a unique minimum at
φ = 0, and the symmetry is restored. The critical temperature, which separates the low
temperature broken phase and the high temperature symmetric phase, is the point at
which m² = 0.
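As a small worked example, the condition m² = 0 can be turned into an estimate of the critical temperature. The sketch below assumes that the thermal correction to the squared mass is the leading-order value λT²/24 of eq. (14.72), so that m²(T) = −m0² + λT²/24; this identification, and the function names, are assumptions made for illustration only.

```python
import math


def thermal_mass_squared(m0_sq, lam, T):
    """Curvature of the potential at phi = 0, assuming the leading-order
    thermal correction lam * T^2 / 24 (an assumption of this sketch)."""
    return -m0_sq + lam * T ** 2 / 24.0


def critical_temperature(m0_sq, lam):
    """Temperature at which thermal_mass_squared vanishes."""
    return math.sqrt(24.0 * m0_sq / lam)


def zero_temperature_minima(m0_sq, lam):
    """Minima of V = -m0^2 phi^2 / 2 + lam phi^4 / 4!  (from V'(phi) = 0)."""
    phi0 = math.sqrt(6.0 * m0_sq / lam)
    return (-phi0, phi0)


if __name__ == "__main__":
    m0_sq, lam = 1.0, 0.1
    Tc = critical_temperature(m0_sq, lam)
    print("T_c =", Tc)
    print("minima at T = 0:", zero_temperature_minima(m0_sq, lam))
    print("m^2 below / above T_c:",
          thermal_mass_squared(m0_sq, lam, 0.5 * Tc),
          thermal_mass_squared(m0_sq, lam, 2.0 * Tc))
```

With these assumptions T_c = m0 √(24/λ), which is parametrically large compared to m0 at weak coupling.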
Photon hard thermal loop : A simple example of hard thermal loop is that of a
photon in QED, for which the only graph is shown on the right of figure 14.4. In the
In the vacuum, this relation is sufficient to fully constrain the tensorial structure of
Π^{µν}, up to an overall function of K². In the presence of a surrounding thermal bath,
the situation is more complicated: besides the metric tensor g^{µν} and the 4-momentum
K^µ, this tensor may also contain the 4-velocity U^µ of the thermal bath (with respect
to the observer). Let us first introduce
$$
V^\mu \equiv K^2\, U^\mu - (K\cdot U)\, K^\mu \; . \tag{14.83}
$$
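Then, one may check that the Ward-Takahashi identity is satisfied by two symmetric
tensors,
$$
P_T^{\mu\nu} \equiv g^{\mu\nu} - \frac{K^\mu K^\nu}{K^2} - \frac{V^\mu V^\nu}{V^2} \; , \qquad
P_L^{\mu\nu} \equiv \frac{V^\mu V^\nu}{V^2} \; . \tag{14.84}
$$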
which means that they are mutually orthogonal projectors (the values of their traces
indicate that P_T^{µν} encodes two degrees of freedom, while P_L^{µν} contains only one).
Moreover, in the rest frame of the thermal bath, we have U^µ = δ^{µ0}, and the first of
these tensors reads
$$
P_T^{00} = P_T^{i0} = P_T^{0i} = 0 \; , \qquad P_T^{ij} = \delta^{ij} - \frac{k^i k^j}{k^2} \; . \tag{14.86}
$$
Therefore, P_T^{µν} is a projector orthogonal to the 3-momentum k.
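The algebraic properties quoted above are easy to check numerically. The following sketch builds the two tensors of eq. (14.84) for an arbitrary momentum in the rest frame of the bath and verifies transversality, the projector property, mutual orthogonality and the traces. It assumes the metric signature (+,−,−,−); the helper names and the sample momentum are illustrative.

```python
# Diagonal of the metric, assumed here to be g = diag(+1, -1, -1, -1)
G = [1.0, -1.0, -1.0, -1.0]


def dot(a, b):
    """Minkowski product a.b of two 4-vectors with upper indices."""
    return sum(G[m] * a[m] * b[m] for m in range(4))


def projectors(K, U):
    """Return (P_T, P_L) with upper indices, following eqs. (14.83)-(14.84)."""
    K2, KU = dot(K, K), dot(K, U)
    V = [K2 * U[m] - KU * K[m] for m in range(4)]
    V2 = dot(V, V)
    PL = [[V[m] * V[n] / V2 for n in range(4)] for m in range(4)]
    PT = [[(G[m] if m == n else 0.0) - K[m] * K[n] / K2 - PL[m][n]
           for n in range(4)] for m in range(4)]
    return PT, PL


def contract(A, B):
    """(A.B)^{mu nu} = A^{mu rho} g_{rho sigma} B^{sigma nu}."""
    return [[sum(A[m][r] * G[r] * B[r][n] for r in range(4))
             for n in range(4)] for m in range(4)]


def trace(A):
    """g_{mu nu} A^{mu nu}."""
    return sum(G[m] * A[m][m] for m in range(4))


def max_abs_diff(A, B):
    return max(abs(A[m][n] - B[m][n]) for m in range(4) for n in range(4))


if __name__ == "__main__":
    K = [1.3, 0.2, -0.5, 0.7]      # arbitrary off-shell momentum
    U = [1.0, 0.0, 0.0, 0.0]       # rest frame of the thermal bath
    PT, PL = projectors(K, U)
    print("trace P_T, P_L:", trace(PT), trace(PL))
    print("P_T.P_T - P_T :", max_abs_diff(contract(PT, PT), PT))
    print("P_T.P_L       :", max_abs_diff(contract(PT, PL), [[0.0] * 4] * 4))
```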
In terms of these projectors, the most general photon polarization tensor is of the
form:
Note that in the presence of a heat bath, the functions Π_{T,L}(K) may depend on the
four components of K^µ separately (in the vacuum, the corresponding function would
depend only on the Lorentz invariant K²). This complication is due to the fact that
the thermal bath imposes a preferred frame that breaks Lorentz invariance. If the
photon self-energy is resummed on the propagator, one obtains the following dressed
propagator in a generic covariant gauge:
$$
-D^{\mu\nu}(K) = \frac{P_T^{\mu\nu}}{K^2 + \Pi_T(K)} + \frac{P_L^{\mu\nu}}{K^2 + \Pi_L(K)} + \xi\, \frac{K^\mu K^\nu}{K^2} \; , \tag{14.88}
$$
thanks to the orthogonality properties of these projectors (the gauge dependent term
in the propagator is not affected by the resummation). The two functions Π_{T,L}(K)
may be obtained from Π^µ_µ and Π^{00} by using
$$
\Pi^\mu{}_\mu = 2\, \Pi_T + \Pi_L \; , \qquad \Pi^{00} = -\frac{k^2}{K^2}\, \Pi_L \; . \tag{14.89}
$$
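The fully traced polarization tensor, Π^µ_µ, is the easiest to evaluate:
$$
\begin{aligned}
\Pi^\mu{}_\mu(K) &= e^2 \sum\!\!\!\!\!\!\!\int_{P} \frac{\mathrm{tr}\,\big(\gamma_\mu (\slashed{P} - \slashed{K}) \gamma^\mu \slashed{P}\big)}{P^2 (P-K)^2} \\
&= 4 e^2 \sum\!\!\!\!\!\!\!\int_{P} \left[ \frac{K^2}{P^2 (P-K)^2} - \frac{2}{P^2} \right] \; .
\end{aligned}
\tag{14.90}
$$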
The hard thermal loop approximation consists in assuming that the external momentum
K is much smaller than the temperature, which controls the typical loop momentum.
In this approximation, we have
$$
\Pi^\mu{}_\mu(K) \underset{\rm HTL}{=} -8 e^2 \underbrace{\sum\!\!\!\!\!\!\!\int_{P} \frac{1}{P^2}}_{-\,T^2/24} = \frac{e^2 T^2}{3} \; . \tag{14.91}
$$
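The sum-integral in this expression has a very simple tadpole structure, but note that
the Matsubara frequencies are the fermionic ones, hence the result −T²/24 for its
thermal contribution (instead of T²/12 in the bosonic case). The 00 component is a
bit more complicated,
$$
\begin{aligned}
\Pi^{00}(K) &= e^2 \sum\!\!\!\!\!\!\!\int_{P} \frac{\mathrm{tr}\,\big(\gamma^0 (\slashed{P} - \slashed{K}) \gamma^0 \slashed{P}\big)}{P^2 (P-K)^2} \\
&\underset{\rm HTL}{=} e^2 \sum\!\!\!\!\!\!\!\int_{P} \left[ \frac{8 P_0 (P_0 - K_0)}{P^2 (P-K)^2} - \frac{4}{P^2} \right] \\
&\underset{\rm HTL}{=} \frac{e^2 T^2}{3} \left[ 1 - \frac{k_0}{2k} \ln\!\left( \frac{k_0 + k}{k_0 - k} \right) \right] \; .
\end{aligned}
\tag{14.92}
$$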
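In the second line, we have dropped a non-HTL term in K²/(P²(P − K)²), and in the
last line we have analytically continued the discrete Matsubara frequency to a real
energy, K0 → ik0. Therefore, the transverse and longitudinal self-energies of the
photon in the HTL approximation read
$$
\begin{aligned}
\Pi_T(K) &= \frac{e^2 T^2}{6}\, \frac{k_0}{k} \left[ \frac{k_0}{k} + \frac{1}{2} \left( 1 - \frac{k_0^2}{k^2} \right) \ln\!\left( \frac{k_0 + k}{k_0 - k} \right) \right] \; , \\
\Pi_L(K) &= \frac{e^2 T^2}{3} \left( 1 - \frac{k_0^2}{k^2} \right) \left[ 1 - \frac{k_0}{2k} \ln\!\left( \frac{k_0 + k}{k_0 - k} \right) \right] \; .
\end{aligned}
\tag{14.93}
$$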
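A quick numerical cross-check of these two functions may be useful. The sketch below evaluates the real parts of Π_T and Π_L from eq. (14.93) and verifies three properties that follow from the formulas of this section: the trace relation 2Π_T + Π_L = e²T²/3 of eqs. (14.89) and (14.91), the static limit Π_L(0, k) = e²T²/3, and the fact that both functions tend to e²T²/9 when k → 0 at fixed k0. Taking the logarithm of the absolute value of its argument (which discards the imaginary part present for space-like momenta) is a choice made for this illustration, and the function names are hypothetical.

```python
import math


def _log_term(k0, k):
    """Real part of ln((k0 + k) / (k0 - k)); requires k0 != k."""
    return math.log(abs((k0 + k) / (k0 - k)))


def re_pi_T(k0, k, e, T):
    """Real part of the transverse HTL self-energy, eq. (14.93)."""
    x = k0 / k
    return (e * T) ** 2 / 6.0 * x * (x + 0.5 * (1.0 - x * x) * _log_term(k0, k))


def re_pi_L(k0, k, e, T):
    """Real part of the longitudinal HTL self-energy, eq. (14.93)."""
    x = k0 / k
    return (e * T) ** 2 / 3.0 * (1.0 - x * x) * (1.0 - 0.5 * x * _log_term(k0, k))


if __name__ == "__main__":
    e, T = 0.3, 1.0
    for k0, k in [(2.0, 1.0), (1.0, 0.3), (0.4, 1.0)]:
        trace = 2.0 * re_pi_T(k0, k, e, T) + re_pi_L(k0, k, e, T)
        print(k0, k, trace, (e * T) ** 2 / 3.0)
    print("static limit:", re_pi_L(0.0, 1.0, e, T))
    print("long wavelength:", re_pi_T(1.0, 1e-2, e, T), re_pi_L(1.0, 1e-2, e, T),
          (e * T) ** 2 / 9.0)
```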
Electron hard thermal loop : A similar approximation can be used for fermions
in QED. Due to the breaking of Lorentz invariance caused by the thermal bath, the
self-energy may be decomposed as
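$$
\begin{aligned}
\mathrm{tr}\,\big(\slashed{K}\, \slashed{\Sigma}(K)\big) &= 4\, (K_0\, \alpha + k\, \beta) \underset{\rm HTL}{=} \frac{e^2 T^2}{2} \; , \\
\mathrm{tr}\,\big(\gamma^0\, \slashed{\Sigma}(K)\big) &= 4\, \alpha \underset{\rm HTL}{=} \frac{e^2 T^2}{4k}\, \ln\!\left( \frac{k_0 + k}{k_0 - k} \right) \; .
\end{aligned}
\tag{14.95}
$$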
Moreover, the HTL approximation leads to a fermion self-energy that does not depend
on the gauge chosen for the photon propagator. After summation of this self-energy
to all orders, the fermion propagator becomes
$$
S(K) = \frac{\gamma^0 + \hat{k}\cdot\gamma}{2\,(k_0 - k - \Sigma_+)} + \frac{\gamma^0 - \hat{k}\cdot\gamma}{2\,(k_0 + k + \Sigma_-)} \; , \tag{14.96}
$$
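[Figure: dispersion curves of the HTL modes. One panel shows the photon modes (T) and (L),
with axes p/mγ and p0/mγ; the other shows the fermion modes (+) and (−), with axes p/mf
and p0/mf.]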
an imaginary part only when the argument of the logarithm is negative), which
implies that the shifted poles remain on the real axis. In other words, in the HTL
approximation, the gauge boson excitations remain infinitely long-lived.
In the case of fermions, there are also two distinct modes, denoted (+) and (−),
that merge at zero momentum and k0 = mf, with mf² ≡ e²T²/8. The + mode is the analogue
of the zero temperature fermion, modified by the surrounding thermal bath (the residue
of this pole goes to one when k ≫ T). In contrast, the − mode is a purely collective
mode (the corresponding residue vanishes exponentially at low temperature). As for
bosons, these fermionic modes have an infinite lifetime in the HTL approximation.
Debye screening : The Hard Thermal Loop correction to the gauge boson propagator
also encodes interesting phenomena in the space-like region. In particular, by
taking the zero frequency limit of the photon self-energy, and then its zero momentum
limit (in this order), one can determine how the Coulomb potential of a static electric
charge is modified at long distance. Simply recall that the Coulomb potential is given
by the Fourier transform of the longitudinal⁷ term in the propagator,
$$
A^0(r) \sim \int \frac{d^3k}{(2\pi)^3}\; \frac{e^{i k\cdot r}}{k^2 + \Pi_L(0, k)} \; . \tag{14.97}
$$
At large distance, we need the small k behaviour of Π_L(0, k), which is given by
$$
\lim_{k\to 0} \Pi_L(k_0 = 0, k) = \underbrace{\frac{e^2 T^2}{3}}_{m_D^2} \; . \tag{14.98}
$$
7 The transverse projector does not couple to the electromagnetic current of a static charge, e.g. an
(The mass m_D is called the Debye mass.) The Fourier transform then gives the
following Coulomb potential at long distance,
$$
A^0(r) \sim \frac{e^{-m_D r}}{r} \; , \tag{14.99}
$$
which is exponentially attenuated compared to the vacuum Coulomb potential of a
point-like charge. The inverse of the Debye mass characterizes the typical distance
beyond which this screening is sizeable. Physically, this phenomenon is due to the
fact that the test charge polarizes the charged medium surrounding it, by attracting in
its vicinity charges of the opposite sign. Because of this, a distant observer sees an
effective charge which is much smaller than the bare charge visible at short distance
(see figure 14.7).
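The orders of magnitude involved can be illustrated with a few lines of Python. The sketch below computes the Debye mass from eq. (14.98), compares the screened potential of eq. (14.99) with the unscreened 1/r behaviour, and checks by finite differences that u(r) = r A⁰(r) obeys u'' = m_D² u away from the origin, which is the radial form of the static equation with a mass term. The overall normalization of the potential is left out (as in the ∼ of the text), and the function names and numerical values are illustrative assumptions.

```python
import math


def debye_mass(e, T):
    """m_D = e T / sqrt(3), from eq. (14.98)."""
    return e * T / math.sqrt(3.0)


def screened_potential(r, m_D):
    """exp(-m_D r) / r, up to an overall constant that is not tracked here."""
    return math.exp(-m_D * r) / r


def radial_residual(r, m_D, h=1e-3):
    """Finite-difference estimate of u'' - m_D^2 u for u(r) = r * A0(r)."""
    def u(s):
        return s * screened_potential(s, m_D)
    second = (u(r + h) - 2.0 * u(r) + u(r - h)) / (h * h)
    return second - m_D ** 2 * u(r)


if __name__ == "__main__":
    e, T = 0.3, 1.0
    m_D = debye_mass(e, T)
    print("Debye mass:", m_D, " screening length:", 1.0 / m_D)
    for r in (0.1 / m_D, 1.0 / m_D, 5.0 / m_D):
        # ratio of the screened potential to the vacuum 1/r potential
        print(r, screened_potential(r, m_D) * r)
    print("residual of u'' = m_D^2 u:", radial_residual(2.0, m_D))
```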
$$
\frac{i}{k_0^2 - k^2 - \Pi_{T,L}(k_0, k) + i k_0 0^+}
= \int_{-\infty}^{+\infty} \frac{d\omega}{2\pi}\; \omega\, \rho_{T,L}(\omega, k)\; \frac{i}{k_0^2 - \omega^2 + i k_0 0^+} \; . \tag{14.101}
$$
From eq. (14.101), we may derive other useful integrals that contain the spectral
functions ρ_{T,L}. The starting point is to take the imaginary part of eq. (14.101), by
denoting ω ≡ kx and k0 ≡ ky, which gives the following identity
$$
\int_{-\infty}^{+\infty} \frac{dx}{2\pi}\; x\, \rho_{T,L}(kx, k)\; \mathrm{P}\left[ \frac{1}{y^2 - x^2} \right]
$$
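Various interesting integrals can then be obtained by taking special values of y. With
y = 0, we obtain
$$
\int_{-\infty}^{+\infty} \frac{dx}{2\pi}\; \frac{\rho_T(kx, k)}{x} = \frac{1}{k^2} \; , \qquad
\int_{-\infty}^{+\infty} \frac{dx}{2\pi}\; \frac{\rho_L(kx, k)}{x} = \frac{1}{k^2 + m_D^2} \; , \tag{14.104}
$$
while y = +∞ leads to
$$
\int_{-\infty}^{+\infty} \frac{dx}{2\pi}\; x\, \rho_{T,L}(kx, k) = \frac{1}{k^2} \; . \tag{14.105}
$$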
Let us also mention another exact integral involving the HTL photon self-energies,
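$$
\int_0^1 \frac{dx}{x}\; \frac{2\, \mathrm{Im}\, \Pi(x)}{\big(z + \mathrm{Re}\, \Pi(x)\big)^2 + \big(\mathrm{Im}\, \Pi(x)\big)^2}
= \pi \left[ \frac{1}{z + \mathrm{Re}\, \Pi(\infty)} - \frac{1}{z + \mathrm{Re}\, \Pi(0)} \right] \; , \tag{14.106}
$$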
where Π(x) is any of ΠT,L (kx, k) (which does not depend on k since the bosonic
HTL self-energies depend only on the ratio k0 /k). The values at x = ∞ and x = 0
of these self-energies that appear in the right-hand side are easily determined from
eqs. (14.93). This integral, where the value of x is bounded by one, appears in
the scattering cross-section of a hard particle on a particle of the thermal bath, by
exchange of a soft photon (the momentum of this photon is space-like, hence |x| ≤ 1).
Relevant physical scales : When discussing the physics of a weakly coupled system
of particles at high temperature T (much larger than the masses), it is useful to have
in mind the following hierarchy of length scales:
• ℓ = (gT)⁻¹. This is the typical distance over which a particle “feels” modifications
of its dispersion relation. Besides the appearance of a thermal gap in the
spectrum of gauge bosons and matter fields, the HTL self-energies also encode
Debye screening and Landau damping.
• ℓ = (g²T)⁻¹. This is the mean distance between scatterings with a soft colour
exchange. These are forward scatterings, since the momentum transfer (of
order gT, the scale of the infrared cutoff provided by the dressing of the gluon
propagator) is much smaller than the momentum of the incoming particles
(typically T). A crude way to obtain this scale is by estimating the corresponding
scattering rate:
$$
\Gamma_{\substack{\rm soft \\ \rm collisions}} = \Big|\, [\text{diagram: exchange of a transverse momentum } p_\perp] \,\Big|^2
\sim g^4 T^3 \int_{gT \lesssim p_\perp} \frac{d^2 p_\perp}{p_\perp^4} \sim g^2 T \; , \tag{14.107}
$$
• ℓ = (g⁴T)⁻¹. This is the mean distance between scatterings with a momentum
transfer of order T, i.e. those that scatter particles at large angles. Estimating
this scale is done as above, but with a lower limit of order T for the momentum
transfer:
$$
\Gamma_{\substack{\rm hard \\ \rm collisions}} = \Big|\, [\text{diagram: exchange of a transverse momentum } p_\perp] \,\Big|^2
\sim g^4 T^3 \int_{T \lesssim p_\perp} \frac{d^2 p_\perp}{p_\perp^4} \sim g^4 T \; . \tag{14.108}
$$
This scale is usually called the mean free path. This is the relevant scale for
all transport phenomena that require significant momentum exchanges, for
instance the viscosity. Beyond this scale is the realm of collective effects such
as sound waves (on these scales, it is more appropriate to describe the system
as a fluid rather than in terms of elementary field excitations).
[Figure: the three length scales 1/gT, 1/g²T and 1/g⁴T.]
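The hierarchy listed above, and the two rate estimates (14.107) and (14.108), can be reproduced with a short script. In this sketch the transverse momentum integral is done exactly, ∫ d²p⊥/p⊥⁴ over p⊥ > p_min being equal to π/p_min², and all other numerical prefactors are ignored; the function names and the dictionary keys are illustrative.

```python
import math


def length_scales(g, T):
    """The three length scales discussed in the text, up to factors of order one."""
    return {
        "dispersion": 1.0 / (g * T),
        "soft": 1.0 / (g ** 2 * T),
        "hard": 1.0 / (g ** 4 * T),   # mean free path
    }


def rate_estimate(g, T, p_min):
    """g^4 T^3 * int_{p_perp > p_min} d^2p_perp / p_perp^4 = g^4 T^3 * pi / p_min^2."""
    return g ** 4 * T ** 3 * math.pi / p_min ** 2


if __name__ == "__main__":
    g, T = 0.1, 1.0
    print(length_scales(g, T))
    print("soft rate / (g^2 T):", rate_estimate(g, T, g * T) / (g ** 2 * T))
    print("hard rate / (g^4 T):", rate_estimate(g, T, T) / (g ** 4 * T))
```

Both ratios printed at the end are pure numbers of order one, which is all that the parametric estimates of the text claim.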
focusing on gauge bosons. Let us first recall that a mode is perturbative if its kinetic
energy dominates its interaction energy. For a mode of momentum k, the kinetic
energy of a gauge field can be estimated as
$$
K \sim (\partial A)^2 \sim k^2 \langle A^2 \rangle \; . \tag{14.109}
$$
For the interaction energy, we have
$$
I \sim g^2 \langle A^4 \rangle \sim g^2 \langle A^2 \rangle^2 \; . \tag{14.110}
$$
(The second part of the equation is of course not exact, but it gives the correct order
of magnitude.) Thus, a mode of momentum k is perturbative if k² ≫ g²⟨A²⟩. When
discussing the order of magnitude of ⟨A²⟩, it is useful to distinguish the contribution
of the various momentum scales by defining
$$
\langle A^2 \rangle_{\kappa^*} \sim \int^{\kappa^*} \frac{d^3p}{E_p}\; n_B(E_p) \; ,
$$
the contribution of all the thermal modes up to the scale κ*. From these considerations,
we can now distinguish three types of modes:
• Hard modes : k ∼ T. For these modes, we have ⟨A²⟩_T ∼ T², and K ≫ I.
They are therefore fully perturbative.
• Soft modes : k ∼ gT. For these modes, k² ∼ g²⟨A²⟩_T, which implies that
the soft modes interact strongly with the hard modes. However, we also have
⟨A²⟩_{gT} ∼ gT², so that k² ≫ g²⟨A²⟩_{gT}. Thus, the soft modes interact
perturbatively among themselves. Consequently, it is possible to describe
perturbatively the soft modes, provided one has performed first a resummation
of the contribution of the hard modes. Screened perturbation theory is a
realization of this idea.
• Ultrasoft modes : k ∼ g²T. For these modes, we have ⟨A²⟩_{g²T} ∼ g²T², so
that k² ∼ g²⟨A²⟩_{g²T}. Therefore, the ultrasoft modes interact non-perturbatively
among themselves, and there is no way to treat them in a perturbative
approach. A non-perturbative approach, such as lattice field theory, is necessary
for this.
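The classification above can be checked numerically. The sketch below evaluates the fluctuation integral for massless modes (E_p = p), dropping the angular factor so that ⟨A²⟩_κ is represented by ∫₀^κ dp p n_B(p), and then forms the ratio g²⟨A²⟩/k² that decides whether a mode of momentum k is perturbative. The massless dispersion relation, the dropped prefactor and the function names are assumptions of this illustration.

```python
import math


def a2_up_to(kappa, T, n_steps=2000):
    """Midpoint-rule estimate of int_0^kappa dp p n_B(p), used as a stand-in
    for <A^2>_kappa (massless modes, angular factor dropped)."""
    h = kappa / n_steps
    total = 0.0
    for i in range(n_steps):
        p = (i + 0.5) * h          # midpoints avoid the point p = 0
        total += p / math.expm1(p / T)
    return total * h


def coupling_ratio(k, kappa, g, T):
    """g^2 <A^2>_kappa / k^2: interaction of a mode k with modes up to kappa.
    A value much smaller than one means the interaction is perturbative."""
    return g * g * a2_up_to(kappa, T) / (k * k)


if __name__ == "__main__":
    g, T = 0.01, 1.0
    print("hard, self-interaction     :", coupling_ratio(T, T, g, T))
    print("soft, with hard modes      :", coupling_ratio(g * T, T, g, T))
    print("soft, self-interaction     :", coupling_ratio(g * T, g * T, g, T))
    print("ultrasoft, self-interaction:", coupling_ratio(g * g * T, g * g * T, g, T))
```

With g = 0.01 the four ratios come out of order g², 1, g and 1 respectively, in line with the three bullet points.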
$$
[\text{propagator with one insertion of } \Sigma] = \sum_{\epsilon,\epsilon'=\pm} G^0_{+\epsilon}(p)\; \Sigma_{\epsilon\epsilon'}(p)\; G^0_{\epsilon'+}(p) \; . \tag{14.112}
$$
Since the derivative of a distribution is well-defined, this indicates that certain products
(or combinations of products) of delta functions and principal values are well defined,
but not all of them (for instance, the product δ²(z) makes no sense).
Returning now to eq. (14.112) and expanding the propagators, we see that it
contains terms that are ill-defined:
$$
[\text{propagator with one insertion of } \Sigma] = \Big[ \text{well defined distributions} \Big]
+ \pi^2\, \delta^2(p^2 - m^2)\, \Big[ (1 + f(p))\, \Sigma_{+-} - f(p)\, \Sigma_{-+} \Big] \; , \tag{14.115}
$$
where we have used the first of eqs. (14.54) in order to simplify the combination of
self-energies that appear in the square bracket. Note that the square bracket vanishes
in equilibrium thanks to the KMS symmetry. We are thus facing a very peculiar
pathology, that exists only out-of-equilibrium.
We may learn a bit more about this issue by formally resumming the self-energy
Σ on the propagator. Let us introduce the following notations:
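$$
\mathbb{G}^0 \equiv \begin{pmatrix} G^0_{++} & G^0_{+-} \\ G^0_{-+} & G^0_{--} \end{pmatrix} , \quad
\mathbb{D} \equiv \begin{pmatrix} G^0_F & 0 \\ 0 & G^{0*}_F \end{pmatrix} , \quad
\mathbb{S} \equiv \begin{pmatrix} \Sigma_{++} & \Sigma_{+-} \\ \Sigma_{-+} & \Sigma_{--} \end{pmatrix} , \tag{14.116}
$$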
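where U is the matrix defined in eq. (14.56), but with f(p) instead of the Bose-Einstein
distribution, and where we have used the following notations
$$
\begin{aligned}
G_F(p) &\equiv \frac{i}{p^2 - m^2 - \Sigma_F + i\epsilon} \; , \\
\Sigma_F &\equiv \Sigma_{++} + \Sigma_{+-} \; , \\
\widetilde{\Sigma} &\equiv \frac{1}{1 + f(p)} \Big[ (1 + f(p))\, \Sigma_{+-} - f(p)\, \Sigma_{-+} \Big] \; .
\end{aligned}
\tag{14.119}
$$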
Note that the Feynman propagator and its complex conjugate have mirror poles on
each side of the real energy axis. If the self-energy ΣF has no imaginary part, then
these poles “pinch” the real axis and lead to a singularity (this is in fact a pathology
of the same nature as the product δ² in eq. (14.115)). By performing explicitly the
multiplication with the matrix U, we obtain the resummed propagator in the following
form:
$$
\begin{aligned}
G_{\epsilon\epsilon'}(p) = \Big[ G^0_{\epsilon\epsilon'}(p) \Big]_{T=0} &+ 2\pi\, f(p)\, \delta(p^2 - m^2) \\
&+ \Big[ (1 + f(p))\, \Sigma_{+-} - f(p)\, \Sigma_{-+} \Big]\, G_F(p)\, G_F^*(p) \; .
\end{aligned}
\tag{14.120}
$$
Since it does not depend on the indices ǫǫ ′ , the pathological term (on the second line)
appears on the same footing as the second term, that contains the distribution f(p).
Thus, the lesson of this calculation is that one may consider hiding this pathology into
a redefinition of the distribution f(p). However, the naive formalism that we have
tried to use so far is not adequate for doing this consistently, and must be amended in
a number of ways:
• The initial time ti should not be taken to −∞, as is done when using the
Schwinger-Keldysh formalism in momentum space. Indeed, this is the time at
which the system was prepared in an out-of-equilibrium state. If it were equal
to −∞, the system would have had an infinite amount of time for relaxing to
equilibrium at the finite time where a measurement is performed. Note that
observables will in general depend on the initial time ti , in contrast with what
happens in equilibrium.
The Kadanoff-Baym equations, that we shall derive now, may be viewed as a kind
of quantum kinetic equations. These equations are exact, but contain a self-energy
that must be truncated to a manageable number of diagrams in order to be usable in
practical applications. In the next subsection, we will show how the traditional kinetic
equations can be derived from the Kadanoff-Baym equations.
The starting point is the Dyson-Schwinger equation, written in coordinate space,
where G0 is the free propagator and G is the resummed one. Note that the time
integrations run over the Schwinger-Keldysh contour C. Here, we have written the
equation in two ways, depending on whether the self-energy is inserted on the right or
on the left of the bare propagator (in the end, the resulting propagator G is the same in
both cases). Next, we apply the operator □_x + m² on the first equation and □_y + m²
on the second equation. This eliminates the bare propagators, and we obtain:
$$
\begin{aligned}
(\square_x + m^2)\, G(x, y) &= -i \delta_c(x - y) - \int_C d^4v\; \Sigma(x, v)\, G(v, y) \; , \\
(\square_y + m^2)\, G(x, y) &= -i \delta_c(x - y) - \int_C d^4v\; G(x, v)\, \Sigma(v, y) \; ,
\end{aligned}
\tag{14.122}
$$
transforms do not share with the Fourier transform their properties with respect to
convolution. Given two 2-point functions F and G, let us define:
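$$
H(x, y) \equiv \int d^4z\; F(x, z)\, G(z, y) \; . \tag{14.124}
$$
$$
H(X, p) = F(X, p)\; \exp\left[ \frac{i}{2} \Big( \overleftarrow{\partial}_X \overrightarrow{\partial}_p - \overrightarrow{\partial}_X \overleftarrow{\partial}_p \Big) \right]\, G(X, p) \; , \tag{14.125}
$$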
where the arrows indicate on which side the corresponding derivative acts. The right
hand side of this formula reduces to the ordinary product of the transforms when there
is no X dependence, i.e. when the functions F and G are translation invariant. The
first correction to the translation invariant case is proportional to the Poisson bracket
of F and G,
$$
H(X, p) = F(X, p)\, G(X, p) + \frac{i}{2}\, \big\{ F(X, p), G(X, p) \big\} + \cdots \; . \tag{14.126}
$$
The derivatives with respect to x and y that appear in the Kadanoff-Baym equations
can be written in terms of derivatives with respect to X and s :
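$$
\begin{aligned}
\partial_x &= \tfrac{1}{2}\, \partial_X + \partial_s \; , &\qquad \partial_y &= \tfrac{1}{2}\, \partial_X - \partial_s \; , \\
\square_x &= \tfrac{1}{4}\, \square_X + \partial_X \cdot \partial_s + \square_s \; , &\qquad \square_y &= \tfrac{1}{4}\, \square_X - \partial_X \cdot \partial_s + \square_s \; .
\end{aligned}
\tag{14.127}
$$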
difference, and breaking it down into its ++, −−, +− and −+ components,
one obtains
where ρ(X, p) ≡ G−+ (X, p) − G+− (X, p). This would be exact for non-
interacting, infinitely long-lived, particles. In the presence of interactions, the
approximation is justified when the time between two collisions of a particle is
large compared to its wavelength.
Using eqs. (14.129) and (14.130), we obtain an equation for f(X, p), which is
nothing but a Boltzmann equation:
$$
\big[ \partial_t + v_p \cdot \nabla_x \big]\, f(X, p) = \underbrace{\frac{i}{2E_p} \Big[ (1 + f(X, p))\, \Sigma_{+-} - f(X, p)\, \Sigma_{-+} \Big]}_{\mathcal{C}_p[f; X]} \; , \tag{14.131}
$$
where v_p ≡ p/E_p is the velocity vector for particles of momentum p. Note that the
Boltzmann equation is spatially local since all the objects it contains are evaluated
at the coordinate X, but its right hand side is non-local in momentum. The right
hand side, C_p[f; X], is called the collision term. The combination ∂_t + v_p · ∇_x that
appears in the left hand side is called the transport derivative. It is zero on any function
whose t and x dependence arise only in the combination x − v_p t (this is the case
for a distribution of non-interacting particles, that move at the constant velocity v_p
prescribed by their momentum).
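The statement about the transport derivative is simple to verify numerically. The sketch below works in one spatial dimension, takes an arbitrary Gaussian profile (a choice made purely for illustration), and evaluates ∂_t f + v ∂_x f by central finite differences, once for a free-streaming function of x − v t and once for a static profile that does not stream.

```python
import math


def velocity(p, m):
    """v_p = p / E_p for a particle of momentum p and mass m (one dimension)."""
    return p / math.sqrt(p * p + m * m)


def profile(u):
    """Arbitrary smooth profile, chosen only for illustration."""
    return math.exp(-u * u)


def free_streaming(t, x, v):
    """Depends on t and x only through the combination x - v t."""
    return profile(x - v * t)


def static(t, x, v):
    """Time-independent profile, not of the free-streaming form."""
    return profile(x)


def transport_derivative(f, t, x, v, h=1e-5):
    """Central finite-difference estimate of (d/dt + v d/dx) f."""
    df_dt = (f(t + h, x, v) - f(t - h, x, v)) / (2.0 * h)
    df_dx = (f(t, x + h, v) - f(t, x - h, v)) / (2.0 * h)
    return df_dt + v * df_dx


if __name__ == "__main__":
    v = velocity(0.75, 1.0)            # equals 0.6
    print("free streaming:", transport_derivative(free_streaming, 0.7, 0.3, v))
    print("static profile:", transport_derivative(static, 0.7, 0.3, v))
```

The first number vanishes up to discretization and rounding errors, while the second does not.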
$$
\Sigma = [\text{diagram}] \; . \tag{14.132}
$$
Using the Feynman rules of the Schwinger-Keldysh formalism, this diagram leads to
the following collision term
$$
\begin{aligned}
\mathcal{C}_p[f; X] = \frac{\lambda^2}{4E_p} \int & \frac{d^3p_1}{(2\pi)^3 2E_1}\, \frac{d^3p_2}{(2\pi)^3 2E_2}\, \frac{d^3p_3}{(2\pi)^3 2E_3}\; (2\pi)^4\, \delta(p + p_3 - p_1 - p_2) \\
& \times \Big[ f(X, p_1)\, f(X, p_2)\, (1 + f(X, p_3))\, (1 + f(X, p)) \\
& \qquad - f(X, p_3)\, f(X, p)\, (1 + f(X, p_1))\, (1 + f(X, p_2)) \Big] \; .
\end{aligned}
\tag{14.133}
$$
This expression describes the rate of change of the particle distribution, under the
effect of 2-body elastic collisions. It is the difference between a production rate
(coming from the term in which the particle of momentum p is produced, and thus
weighted by a factor 1 + f(X, p)) and a destruction rate (from the term in which the
particle of momentum p is destroyed, and has a weight f(X, p)).
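A useful sanity check on the gain-minus-loss structure is that it vanishes for a Bose-Einstein distribution whenever the energies of the 2 → 2 process are conserved, so that the equilibrium distribution is a fixed point of the Boltzmann equation. The sketch below tests this for one set of energies; the numerical values and function names are arbitrary illustrative choices.

```python
import math


def bose_einstein(E, T):
    """Equilibrium occupation number 1 / (exp(E/T) - 1)."""
    return 1.0 / math.expm1(E / T)


def gain_minus_loss(f, f1, f2, f3):
    """Statistical factor of the collision term for p1 + p2 <-> p3 + p:
    f1 f2 (1 + f3)(1 + f) - f3 f (1 + f1)(1 + f2)."""
    return f1 * f2 * (1.0 + f3) * (1.0 + f) - f3 * f * (1.0 + f1) * (1.0 + f2)


if __name__ == "__main__":
    T = 0.8
    E1, E2, E3 = 1.3, 0.9, 0.5
    E = E1 + E2 - E3                     # energy conservation
    occupations = [bose_einstein(x, T) for x in (E, E1, E2, E3)]
    print("equilibrium    :", gain_minus_loss(*occupations))
    # An out-of-equilibrium example: the mode p is under-populated
    print("non-equilibrium:", gain_minus_loss(0.1, 2.0, 2.0, 0.1))
```

In the second example the bracket is positive, meaning that collisions tend to fill the under-populated mode.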
To close this section, let us mention an additional term that arises when the self-
energy contains a local part, i.e. a term proportional to a delta function in space-time:
When such a local term is present, the difference of the two Kadanoff-Baym equations
contains Φ(y)G(x, y) − Φ(x)G(x, y), whose Wigner transform at lowest order in
the gradient approximation is
$$
i\, \partial_X \Phi(X) \cdot \partial_p G(X, p) \; . \tag{14.135}
$$
In the new term (underlined), one may interpret ∂X Φ as a mean force field acting on
the particles. Under the action of this force, the particles accelerate which implies a
change of their momentum. The left hand side of the above equation thus describes
the change of the distribution of particles under the effect of this mean field, in the
absence of any collisions (that are described by the right hand side).
Chapter 15
Strong fields and semi-classical methods
15.1 Introduction
Until now, all our discussion of quantum field theory has been centered on an
expansion about the vacuum, i.e. on situations involving a system with few particles.
This is also a regime in which the fields are in a certain sense¹ small. The connection
between the field amplitude and the density of particles in a state may be grasped
by writing the LSZ reduction formula that gives the expectation value of the number
operator for a system whose initial state is |Φ_in⟩. By mimicking the derivation of
section 1.4, one easily obtains
$$
\begin{aligned}
\langle \Phi_{\rm in} |\, a^\dagger_{p,{\rm out}}\, a_{p,{\rm out}}\, | \Phi_{\rm in} \rangle &= \frac{1}{Z} \int d^4x\, d^4y\; e^{i p\cdot(x - y)}\, (\square_x + m^2)(\square_y + m^2)\; \langle \Phi_{\rm in} |\, \phi(x) \phi(y)\, | \Phi_{\rm in} \rangle \; , \\
\langle \Phi_{\rm in} |\, \phi(x) \phi(y)\, | \Phi_{\rm in} \rangle &= \int \big[ D\phi_\pm(z) \big]\; \phi_-(x)\, \phi_+(y)\; e^{i (S[\phi_+] - S[\phi_-])} \; ,
\end{aligned}
\tag{15.1}
$$
where in the second line we have sketched the path integral representation of the
matrix element that appears in the reduction formula. Note that, since there is no
time ordering in this matrix element, the Schwinger-Keldysh formalism must be used
here. This formula is only a sketch, because the boundary conditions of the path
integral at the initial time should be specified in order to properly account for the
initial state |Φ_in⟩. However, what we want to illustrate with these formulas is the
direct relationship between large particle occupation numbers (the left hand side of
the first equation), and large fields in a path integral. Moreover, in the path integral,
the magnitude of the fields is controlled by the boundary conditions (this is the only
thing that depends on the initial state of the system in the right hand side of the second
equation).
¹ When we talk of small or large fields, we are referring to the magnitude of the c-number field in a path
integral (it does not make sense to apply these qualifiers to the field operator itself).
There is an implicit assumption of weak fields in the perturbative machinery that
we have studied so far, which is best viewed in the path integral formalism. For
instance, in the second of eqs. (15.1), the perturbative expansion amounts to writing
S = S0 + Sint, and expanding the exponentials in powers of Sint. In a scalar field theory
with a quartic coupling, the interaction part of the action reads
$$
S_{\rm int}[\phi] = -\frac{\lambda}{4!} \int d^4x\; \phi^4(x) \; , \tag{15.2}
$$
while the free action (that we keep inside the exponential) is given by
$$
S_0[\phi] = \frac{1}{2} \int d^4x\; \Big[ (\partial_\mu \phi)(\partial^\mu \phi) - m^2 \phi^2 \Big] \; . \tag{15.3}
$$
The common justification of the perturbative expansion is that, when the coupling
constant λ is small, we have Sint ≪ S0 . However, since S0 [φ] is quadratic in the field
while Sint [φ] contains higher powers of φ, this inequality may not be true if the field
is large, even at weak coupling. In order to make this statement more precise, we
must account for the fact that the field has mass dimension 1. Let us denote by Q
the typical momentum scale in the problem under consideration (for simplicity we
assume that there is only one), and then we write
$$
\phi(x) \sim \vartheta\, Q \; , \tag{15.4}
$$
where ϑ is a dimensionless number that encodes the order of magnitude of the field.
Naive dimensional analysis tells us that
$$
(\partial_\mu \phi)(\partial^\mu \phi) \sim \vartheta^2 Q^4 \; , \qquad \lambda \phi^4 \sim \lambda\, \vartheta^4 Q^4 \; . \tag{15.5}
$$
For the interaction term to be small compared to the kinetic term, we must have
$$
\lambda\, \vartheta^2 \ll 1 \; , \tag{15.6}
$$
which is slightly different from the usual criterion of small λ, since this condition
depends on the field magnitude via ϑ. The purpose of this chapter is to explore
situations of weak coupling (i.e. λ ≪ 1) where the inequality (15.6) is not satisfied
because of strong fields. We call this the strong field regime of quantum field theory.
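The power counting of eqs. (15.4) to (15.6) fits in a few lines of code. The sketch below computes the ratio of the interaction term to the kinetic term, in which the scale Q drops out, and flags the regime; the numerical margin used to decide what "much smaller than one" means is an arbitrary assumption of this illustration, as are the function names.

```python
def interaction_to_kinetic_ratio(lam, theta):
    """(lam * theta^4 * Q^4) / (theta^2 * Q^4) = lam * theta^2, see eq. (15.5)."""
    return lam * theta ** 2


def is_weak_field(lam, theta, margin=0.1):
    """True when lam * theta^2 is below `margin` (a hypothetical threshold
    standing in for the condition lam * theta^2 << 1 of eq. (15.6))."""
    return interaction_to_kinetic_ratio(lam, theta) < margin


if __name__ == "__main__":
    lam = 0.01                               # weak coupling
    for theta in (1.0, 3.0, 10.0, 30.0):
        print(theta, interaction_to_kinetic_ratio(lam, theta),
              is_weak_field(lam, theta))
```

For a fixed small coupling, the field amplitude at which the weak field condition fails scales like ϑ ∼ λ^{-1/2}.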
We will discuss two main situations where strong fields may occur:
• The initial state is a highly occupied state, such as a coherent state.
• The initial state is the ground state, but the system is driven by a strong external
source.
As we shall see, since the coupling constant is assumed to be small, there is neverthe-
less a loop expansion, but each loop order (including the tree level approximation) is
non-perturbative in a sense that we will clarify in the rest of the chapter.
The first equation tells us that |χ_in⟩ is an eigenstate of annihilation operators, which
is another definition of coherent states, and the second one provides the value of the
normalization constant. The occupation number in the initial state is closely related
to the function χ(k). Indeed, we have
In other words, the number of particles in the mode of momentum p is the squared
modulus of the function χ(p). A large χ thus corresponds to a highly occupied initial
state (conversely, χ(p) ≡ 0 corresponds to the vacuum).
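These two properties can be illustrated with a single harmonic oscillator mode, which serves here as a simplified stand-in for one momentum mode of the field: the relativistic normalization of the field modes is not tracked, and the Fock space is truncated at a finite occupation number. Both simplifications, and the function names, are assumptions of this sketch.

```python
import math


def coherent_amplitudes(chi, n_max=60):
    """Fock-space amplitudes c_n = exp(-|chi|^2 / 2) chi^n / sqrt(n!)
    of a single-mode coherent state, for n = 0 .. n_max."""
    norm = math.exp(-abs(chi) ** 2 / 2.0)
    amps = []
    term = complex(1.0)
    for n in range(n_max + 1):
        amps.append(norm * term)
        term = term * chi / math.sqrt(n + 1)   # builds chi^(n+1) / sqrt((n+1)!)
    return amps


def annihilate(amps):
    """Action of the annihilation operator: (a c)_n = sqrt(n + 1) c_{n+1}."""
    return [math.sqrt(n + 1) * amps[n + 1] for n in range(len(amps) - 1)]


def mean_occupation(amps):
    """Expectation value of the number operator, sum_n n |c_n|^2."""
    return sum(n * abs(c) ** 2 for n, c in enumerate(amps))


if __name__ == "__main__":
    chi = 1.5 + 0.5j
    amps = coherent_amplitudes(chi)
    lowered = annihilate(amps)
    print("max |a c - chi c|:", max(abs(lowered[n] - chi * amps[n])
                                    for n in range(len(lowered))))
    print("<N> =", mean_occupation(amps), " |chi|^2 =", abs(chi) ** 2)
```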
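Consider now the generating functional for the extension of the Schwinger-Keldysh
formalism in this coherent state,
$$
\begin{aligned}
Z_\chi[j] &\equiv \langle \chi_{\rm in} |\; \mathrm{P} \exp i \int_C d^4x\; j(x)\, \phi(x)\; | \chi_{\rm in} \rangle \\
&= \langle \chi_{\rm in} |\; \mathrm{P} \exp i \int_C d^4x\; \Big[ \mathcal{L}_{\rm int}(\phi_{\rm in}(x)) + j(x)\, \phi_{\rm in}(x) \Big]\; | \chi_{\rm in} \rangle \; ,
\end{aligned}
\tag{15.11}
$$
where j(x) is a fictitious source that lives on the closed-time contour C introduced in
figure 1.4. As usual, the first step is to factor out the interactions as follows:
$$
Z_\chi[j] = \exp\left[ i \int_C d^4x\; \mathcal{L}_{\rm int}\Big( \frac{\delta}{i \delta j(x)} \Big) \right]\;
\underbrace{\langle \chi_{\rm in} |\; \mathrm{P} \exp i \int_C d^4x\; j(x)\, \phi_{\rm in}(x)\; | \chi_{\rm in} \rangle}_{Z_\chi^0[j]} \; . \tag{15.12}
$$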
where θ_c(x⁰ − y⁰) generalizes the step function to the ordered contour C. Note
that the factor on the second line is a commuting number and thus can be removed
from the expectation value. A second application of the Baker-Campbell-Hausdorff
formula allows us to normal-order the first factor. Decomposing the in-field as follows,
$$
\phi_{\rm in}(x) \equiv \underbrace{\int \frac{d^3k}{(2\pi)^3 2E_k}\; a_{k,{\rm in}}\; e^{-i k\cdot x}}_{\phi_{\rm in}^{(-)}(x)}
+ \underbrace{\int \frac{d^3k}{(2\pi)^3 2E_k}\; a^\dagger_{k,{\rm in}}\; e^{+i k\cdot x}}_{\phi_{\rm in}^{(+)}(x)} \; , \tag{15.14}
$$
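The factor of the first line can be evaluated by using the fact that the coherent state is
an eigenstate of annihilation operators:
$$
\begin{aligned}
& \langle \chi_{\rm in} |\; \exp\Big[ i \int_C d^4x\; j(x)\, \phi_{\rm in}^{(+)}(x) \Big]\; \exp\Big[ i \int_C d^4y\; j(y)\, \phi_{\rm in}^{(-)}(y) \Big]\; | \chi_{\rm in} \rangle \\
& \qquad = \exp\Big[ i \int_C d^4x\; j(x)\; \underbrace{\int \frac{d^3k}{(2\pi)^3 2E_k}\; \Big( \chi(k)\, e^{-i k\cdot x} + \chi^*(k)\, e^{+i k\cdot x} \Big)}_{\Phi_\chi(x)} \Big] \; .
\end{aligned}
\tag{15.16}
$$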
We denote by Φ_χ(x) the field obtained by substituting the creation and annihilation
operators of the in-field by χ*(k) and χ(k) respectively. Note that this is no longer
an operator, but a (real valued) c-number field. Moreover, because it is a linear
superposition of plane waves, this field is a free field:
$$
(\square_x + m^2)\, \Phi_\chi(x) = 0 \; . \tag{15.17}
$$
The second and third factors of eq. (15.15) are commuting numbers, provided we
do not attempt to disassemble the commutators. Using the decomposition of the in-field
in terms of creation and annihilation operators, and the canonical commutation
relation of the latter, we obtain
$$
\begin{aligned}
& \theta_c(x^0 - y^0)\, \big[ \phi_{\rm in}(x), \phi_{\rm in}(y) \big] - \big[ \phi_{\rm in}^{(+)}(x), \phi_{\rm in}^{(-)}(y) \big] \\
& \qquad = \underbrace{\theta_c(x^0 - y^0) \int \frac{d^3k}{(2\pi)^3 2E_k}\; e^{-i k\cdot(x - y)}
+ \theta_c(y^0 - x^0) \int \frac{d^3k}{(2\pi)^3 2E_k}\; e^{+i k\cdot(x - y)}}_{G^0_c(x, y)} \; ,
\end{aligned}
\tag{15.18}
$$
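which is nothing but the usual bare path-ordered propagator G⁰_c(x, y). Collecting
all the factors, the generating functional for path-ordered Green’s functions in the
Schwinger-Keldysh formalism with an initial coherent state reads
$$
\begin{aligned}
Z_\chi[j] = & \exp\left[ i \int_C d^4x\; \mathcal{L}_{\rm int}\Big( \frac{\delta}{i \delta j(x)} \Big) \right]\;
\underline{\exp\left[ i \int_C d^4x\; j(x)\, \Phi_\chi(x) \right]} \\
& \times \exp\left[ -\frac{1}{2} \int_C d^4x\, d^4y\; j(x)\, j(y)\; G^0_c(x, y) \right] \; .
\end{aligned}
\tag{15.19}
$$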
It differs from the corresponding functional with the perturbative vacuum² as initial
state only by the second factor, that we have underlined. This generating functional is
² The vacuum initial state corresponds to the function χ(k) ≡ 0, i.e. to Φ_χ(x) = 0.
The first factor has the effect of shifting the fields by Φχ (x). The simplest way to see
this is to write
φ ≡ Φχ + ζ . (15.21)
where the second factor in the right hand side is the generating functional for correla-
tors of ζ. Comparing with eq. (15.20), we see that the generating functional for ζ is
identical to the vacuum one, except that the argument φ of the interaction Lagrangian
is replaced by Φχ + ζ:
appearance of these new vertices that involve a background field, the Feynman rules
are the same as in the Schwinger-Keldysh formalism for a vacuum initial state, with
+ and − vertices, and bare propagators G0++ , G0+− , G0−+ and G0−− to connect them.
In summary, replacing the vacuum initial state by a coherent state amounts to extending
the usual Schwinger-Keldysh formalism with a background field Φχ .
As in eq. (15.4), let us assume for the purpose of power counting that
Φχ ∼ ϑQ , (15.25)
3 In this transformation, we use the functional analogue of
F(∂x ) eαx G(x) = eαx F(α + ∂x ) G(x) .
Figure 15.1: Vertices that appear in the perturbative expansion for the calculation
of expectation values with a coherent initial state. The circled cross denotes the
field Φχ .
The first factor is nothing but the usual order in λ of a connected graph with nE
external lines and nL loops. The second factor counts the number of insertions
(3n1 + 2n2 + n3 ) of the background field Φχ . Interestingly, it involves only the
combination √λ ϑ, that appears also in the inequality (15.6) that delineates the strong
field regime. From eq. (15.27), we can draw the following conclusions:
• When λϑ² ≪ 1, i.e. in the weak field regime, we can make a double perturbative
expansion in λ and in ϑ (i.e. in the occupation of the initial coherent
state). Leading order results correspond to tree diagrams with zero (or the mini-
mal number necessary for the observable under consideration to be non-zero)
insertions of the background field.
• When λϑ² ≳ 1, i.e. in the strong field regime, the expansion in powers of λ
is still possible (and is organized by the number of loops in the graphs). But
the expansion in powers of the background field becomes illegitimate, and
one should instead treat Φχ to all orders. As we shall see now, this leads to
important modifications in the calculation of observables in the strong field
regime.
Note that for a system prepared in a coherent initial state, it is the function χ(k)
defining the coherent state that determines whether we are in the weak or strong field
regime.
In order to illustrate the changes to the perturbative expansion in the strong field
regime, let us consider a very simple observable, the expectation value of the field
operator,
Φ(x) = [sum of all tree diagrams with root x and leaves Φχ ; diagrams not reproduced]   (tree level) . (15.29)
In fact, at tree level, Φ(x) is the sum of all the tree diagrams (weighted by the
appropriate symmetry factor) whose root is the point x and whose leaves are the
coherent field Φχ . This infinite set of trees can be generated recursively by the
following integral representation:
$$
\Phi(x) = \Phi_\chi(x) + i\int d^4y\;
\underbrace{\Big[G^0_{++}(x,y) - G^0_{+-}(x,y)\Big]}_{G^0_R(x,y)}\;
\Big[-\underbrace{\frac{\lambda}{6}\,\Phi^3(y)}_{U'(\Phi(y))}\Big] .
\tag{15.30}
$$
Interestingly, after one has summed over the + and − indices carried by the vertices,
the propagators G0++ and G0+− of the Schwinger-Keldysh diagrammatic rules always
appear via their difference, which is nothing but the bare retarded propagator:
$$ G^0_{++}(x,y) - G^0_{+-}(x,y) = G^0_{-+}(x,y) - G^0_{--}(x,y) = G^0_R(x,y) . \tag{15.31} $$
In other words, at tree level, the field expectation value obeys the classical field
equation of motion, with the boundary value Φχ (x) at the initial time. The non-
linearity of this equation of motion is crucial in the strong field regime, and all the
terms of the series (15.29) have the same magnitude when λϑ² ∼ 1. Nevertheless,
the representation of this series as the solution of the classical field equation of
motion with a retarded boundary condition is very useful, since it turns the problem
of summing an infinite series of Feynman graphs into the much simpler (at least
numerically) problem of solving a partial differential equation.
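As a rough numerical illustration of this statement, the sketch below integrates the classical equation of motion for a φ⁴ potential, U(Φ) = λΦ⁴/4!, on a periodic lattice in 1+1 dimensions with a leapfrog scheme. The reduction to one spatial dimension, the lattice parameters, the single plane wave chosen as initial coherent field and all function names are assumptions made for this example only; they do not come from the text.

```python
import math

def force(phi, m, lam, dx):
    """d^2 Phi/dt^2 = Laplacian(Phi) - m^2 Phi - (lam/6) Phi^3 on a periodic lattice."""
    n = len(phi)
    return [
        (phi[(i + 1) % n] - 2.0 * phi[i] + phi[i - 1]) / dx**2
        - m * m * phi[i]
        - lam / 6.0 * phi[i] ** 3
        for i in range(n)
    ]

def evolve(phi, pi, m, lam, dx, dt, n_steps):
    """Velocity-Verlet (leapfrog) stepping of the field phi and its time derivative pi."""
    phi, pi = list(phi), list(pi)
    acc = force(phi, m, lam, dx)
    for _ in range(n_steps):
        pi = [p + 0.5 * dt * a for p, a in zip(pi, acc)]
        phi = [f + dt * p for f, p in zip(phi, pi)]
        acc = force(phi, m, lam, dx)
        pi = [p + 0.5 * dt * a for p, a in zip(pi, acc)]
    return phi, pi

def energy(phi, pi, m, lam, dx):
    """Lattice energy; it should be conserved up to O(dt^2) discretization errors."""
    n = len(phi)
    total = 0.0
    for i in range(n):
        grad = (phi[(i + 1) % n] - phi[i]) / dx
        total += dx * (0.5 * pi[i] ** 2 + 0.5 * grad**2
                       + 0.5 * m * m * phi[i] ** 2 + lam / 24.0 * phi[i] ** 4)
    return total

# Hypothetical parameters: lam * amp^2 ~ 1, so the non-linear term is not small.
n_sites, m, lam, amp = 64, 1.0, 1.0, 1.0
dx, dt = 2.0 * math.pi / n_sites, 0.02
k = 1.0
e_k = math.sqrt(k * k + m * m)
xs = [i * dx for i in range(n_sites)]

# Initial data taken from the free plane wave amp * cos(e_k t - k x) at t = 0.
phi_init = [amp * math.cos(k * x) for x in xs]
pi_init = [amp * e_k * math.sin(k * x) for x in xs]

e_start = energy(phi_init, pi_init, m, lam, dx)
phi_end, pi_end = evolve(phi_init, pi_init, m, lam, dx, dt, 200)
e_end = energy(phi_end, pi_end, m, lam, dx)
print("relative energy drift:", abs(e_end - e_start) / e_start)
```

The energy drift is only a sanity check on the integrator. The point of the sketch is that a single forward time evolution from the initial data plays the role of the whole tree-level series.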
This result for the expectation value of φ(x) generalizes to the expectation value
of any observable built from the field operator: at tree level, its expectation value is
obtained by replacing the operator φ(x) by the c-number classical field Φ(x) inside
the observable:
$$ \Big\langle \chi_{\rm in}\Big|\, O\big(\phi(x)\big)\, \Big|\chi_{\rm in}\Big\rangle_{\rm tree\ level} = O\big(\Phi(x)\big) . \tag{15.34} $$
We will defer the study of loop corrections to these expectation values until section
15.4, because this discussion will be common with another strong field situation that
we shall discuss first, namely the case of quantum field theories coupled to a strong
external source.
Although we consider here the example of a φ4 interaction term, we will often write
the equations for a generic potential U(φ), and sometimes diagrammatic illustrations
will be given for a cubic interaction for simplicity. These more general interaction
terms will be defined as λ−1+n/2 Qn−4 φn , where Q is an object of mass dimension
1. The Feynman rules for this theory are the usual ones, with the addition of a special
rule for the external current J. In momentum space, a source J attached to the end
of a propagator of momentum p contributes a factor iJ̃(p) (where J̃ is the Fourier
transform of J).
The source J(x) is a given function of space-time, fixed once for all. As we
shall see shortly, the strong field regime corresponds to large sources J ∼ λ^(−1/2);
we will call this situation the strong source (dense) regime. In contrast, the situation where
the external source J is small is called the weak source (dilute) regime. Consider a simply
connected diagram (see figure 15.2), with nE external legs, nI internal lines, nL
independent loops, nJ sources, and n3 cubic vertices, n4 quartic vertices, etc.
These parameters are not all independent. First, the number of propagator endpoints
should match the available sites to which they can be attached. This leads to a first
identity,
A second identity expresses the number of independent loops in terms of the other
parameters,
nL = nI − (n3 + n4 + n5 + · · · ) − nJ + 1 . (15.37)
This formula is very similar to eq. (15.27). First, it does not depend on the number
of vertices and on the number of internal lines; only the number of external legs, the
number of loops and the number of sources appear in the result. The strong source
regime is the regime where it is not legitimate to expand in powers of J because the
factor √λ J is not small. In this case, the order of a diagram does not depend on its
number of sources, and an infinite number of diagrams –with fixed nE and nL but
arbitrary nJ – contribute at each order.
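The counting described above can be checked mechanically. The sketch below assumes that the first identity (whose equation is not reproduced in the text above) is the usual matching of propagator endpoints, nE + 2 nI = nJ + 3 n3 + 4 n4 + ..., and that a k-point vertex carries the coupling λ^(k/2−1) as stated earlier. Under these assumptions the power of λ carried by the vertices equals nL − 1 + nE/2 + nJ/2, i.e. a diagram scales like λ^(nL−1+nE/2) (√λ J)^nJ. The function names are hypothetical.

```python
from fractions import Fraction

def count_diagram(vertex_counts, n_sources, n_external):
    """vertex_counts maps k -> number of k-point vertices of a connected diagram.

    Returns (n_internal, n_loops, coupling_power), where coupling_power is the
    power of lambda coming from the vertices alone (sources not included).
    """
    # Assumed endpoint identity: n_E + 2 n_I = n_J + sum_k k n_k.
    sites = sum(k * n for k, n in vertex_counts.items()) + n_sources
    twice_internal = sites - n_external
    if twice_internal < 0 or twice_internal % 2:
        raise ValueError("no diagram with these numbers of vertices, sources and legs")
    n_internal = twice_internal // 2
    n_vertices = sum(vertex_counts.values())
    n_loops = n_internal - n_vertices - n_sources + 1  # eq. (15.37)
    coupling_power = sum((Fraction(k, 2) - 1) * n for k, n in vertex_counts.items())
    return n_internal, n_loops, coupling_power

def closed_form_power(n_loops, n_external, n_sources):
    """Power of lambda expressed only through n_L, n_E and n_J."""
    return n_loops - 1 + Fraction(n_external, 2) + Fraction(n_sources, 2)

# Two tree diagrams for the one-point function in a quartic theory:
# one vertex with three sources, and two vertices with five sources.
small_tree = count_diagram({4: 1}, n_sources=3, n_external=1)
large_tree = count_diagram({4: 2}, n_sources=5, n_external=1)
print(small_tree, large_tree)
```

With J ∼ λ^(−1/2), both trees in the usage lines are of the same overall order λ^(−1/2), which is the statement that diagrams with arbitrary nJ contribute at each order.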
have seen in the previous sections, this can be achieved by the presence of strong
external sources, or by starting from a highly occupied coherent state. In both cases,
the calculation of expectation values is done with the Schwinger-Keldysh formalism.
Note that since the field operators in the observable are taken at equal times, they
commute and the result does not depend on the + or − assignments for those fields.
But it is crucial to sum over all the ± indices in the internal vertices of the graphs.
At leading order in λ, its expectation value is obtained by simply replacing the
field operator φ by the solution Φ of the classical equations of motion,
$$ \big\langle O(\phi)\big\rangle_{\rm LO} = O(\Phi) , \tag{15.39} $$
with
$$ (\Box_x + m^2)\,\Phi + U'(\Phi) = J , \qquad \lim_{x^0\to t_i}\Phi(x) = \Phi_\chi(x) . \tag{15.40} $$
(We have combined in a single description the two situations, with an external source
J and starting from a non-trivial coherent state χin .) Note that it is the internal
sums over the ± indices of the Schwinger-Keldysh formalism that lead to retarded
boundary conditions, by virtue of eq. (15.31).
Let us start with δΦ± . The propagators in the diagram on the left of the figure
15.3 are the Schwinger-Keldysh propagators in the presence of a background field Φ,
i.e. the propagators $G_{\epsilon\epsilon'}$. For a generic interaction potential, we can write δΦ± (x)
as follows:
$$
\delta\Phi_\epsilon(x) = -\frac{i}{2}\int d^4z \sum_{\epsilon'=\pm} \epsilon'\,
G_{\epsilon\epsilon'}(x,z)\; U'''(\Phi(z))\; G_{\epsilon'\epsilon'}(z,z) .
\tag{15.43}
$$
In this formula, the 1/2 is a symmetry factor, the factor ε′ in the integrand takes
into account the fact that vertices of type − have an opposite sign in the Schwinger-
Keldysh formalism, and the factor −i U′′′ (Φ(z)) is the general form of the 3-particle
vertex in the presence of an external field (for an arbitrary interaction potential U).
Thus, we have reduced the calculation to that of the 2-point functions G±± . These
four propagators are defined recursively by the following equations :
$$
G_{\epsilon\epsilon'}(x,y) = G^0_{\epsilon\epsilon'}(x,y)
- i\sum_{\eta=\pm}\eta\int d^4z\; G^0_{\epsilon\eta}(x,z)\; U''(\Phi(z))\; G_{\eta\epsilon'}(z,y) .
\tag{15.44}
$$
Here, −i U′′ (Φ(z)) is the general form for the insertion of a background field on a
propagator in a theory with potential U(Φ). From these equations, we obtain the
following equations :
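$$
\begin{aligned}
\big[\Box_x + m^2 + U''(\Phi(x))\big]\,G_{+-}(x,y) &= \big[\Box_y + m^2 + U''(\Phi(y))\big]\,G_{+-}(x,y) = 0 ,\\
\big[\Box_x + m^2 + U''(\Phi(x))\big]\,G_{-+}(x,y) &= \big[\Box_y + m^2 + U''(\Phi(y))\big]\,G_{-+}(x,y) = 0 .
\end{aligned}
\tag{15.45}
$$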
The above conditions determine G+− and G−+ uniquely. In order to find these
propagators, let us recall the following representation of their bare counterparts :
$$
\begin{aligned}
G^0_{+-}(x,y) &= \int \frac{d^3p}{(2\pi)^3 2E_p}\; a_{-p}(x)\,a_{+p}(y) ,\\
G^0_{-+}(x,y) &= \int \frac{d^3p}{(2\pi)^3 2E_p}\; a_{+p}(x)\,a_{-p}(y) ,
\end{aligned}
\tag{15.47}
$$
where
with
$$ \big[\Box_x + m^2 + U''(\Phi(x))\big]\,a_{\pm p}(x) = 0 , \qquad \lim_{x^0\to-\infty} a_{\pm p}(x) = e^{\mp ip\cdot x} . \tag{15.50} $$
By construction, these expressions of G+− and G−+ obey the appropriate equations
of motion, and go to the correct limit in the remote past. The functions a±p (x) are
sometimes called mode functions. They provide a complete basis for the linear space
of solutions of the equation (15.50), i.e. the space of linearized perturbations to the
classical solution of the field equation of motion.
Relationship between LO and NLO : At this point, we have all the building
blocks in order to obtain the single inclusive spectrum at NLO. One can go further
and obtain a formal relationship between the LO and NLO inclusive spectra. A key
observation for this is that the functions ak that appear in the dressed propagators
G±∓ can be obtained from the classical field Φ as follows:
In words, the operator $\mathbb{T}_{\pm k}$ in eq. (15.51) differentiates the classical field Φ with
respect to its initial condition Φini , and replaces it by the initial condition of a±k .
Since a±k is a linear perturbation to Φ, this indeed gives the correct result.
Thus, the propagator G+− (x, y) that enters at NLO can be written as
$$
G_{+-}(x,y) = \int \frac{d^3k}{(2\pi)^3 2E_k}\; \big[\mathbb{T}_{-k}\,\Phi(x)\big]\,\big[\mathbb{T}_{+k}\,\Phi(y)\big] .
\tag{15.53}
$$
In the rest of our NLO calculation, we only need this propagator for a space-like
separation between x and y, which implies that G+− (x, y) = G−+ (x, y). In this case,
we can symmetrize the expression of the propagator as follows:
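$$
G_{+-}(x,y) = \frac{1}{2}\int \frac{d^3k}{(2\pi)^3 2E_k}\;
\Big\{ \big[\mathbb{T}_{-k}\,\Phi(x)\big]\,\big[\mathbb{T}_{+k}\,\Phi(y)\big]
+ \big[\mathbb{T}_{+k}\,\Phi(x)\big]\,\big[\mathbb{T}_{-k}\,\Phi(y)\big] \Big\} .
\tag{15.54}
$$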
As we shall see now, a similar expression can be obtained for δΦ± . Let us
start from eq. (15.43). Since the propagators G++ and G−− are equal when the two
endpoints are evaluated at equal times, we have
$$
\delta\Phi_\epsilon(x) = -\frac{i}{2}\int \frac{d^3k}{(2\pi)^3 2E_k}\int d^4z\;
\underbrace{\big[G_{\epsilon+}(x,z) - G_{\epsilon-}(x,z)\big]}_{G_R(x,z)}\;
U'''(\Phi(z))\; a_{-k}(z)\,a_{+k}(z) ,
\tag{15.55}
$$
(with G0R the bare retarded propagator), one may prove that
$$
\delta\Phi_\epsilon(x) = \frac{1}{2}\int \frac{d^3k}{(2\pi)^3 2E_k}\; \mathbb{T}_{+k}\,\mathbb{T}_{-k}\;\Phi(x) .
\tag{15.57}
$$
By inserting this expression, as well as eq. (15.54), in eq. (15.41), we can write the
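NLO expectation value as follows,
$$
\big\langle O\big\rangle_{\rm NLO} = \Big[\frac{1}{2}\int \frac{d^3k}{(2\pi)^3 2E_k}\; \mathbb{T}_{+k}\,\mathbb{T}_{-k}\Big]\; \big\langle O\big\rangle_{\rm LO} .
\tag{15.58}
$$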
This central result is illustrated in the figure 15.4. Some remarks should be made
about this formula:
Figure 15.4: Illustration of eq. (15.58) (diagram not reproduced; its two parts are labelled
"classical" and "quantum"). The open squares represent the operator
$\mathbb{T}_{k}(u)\,\mathbb{T}_{-k}(v)$. Their action is to remove two instances of the initial classical field
(the open circles), and to connect them with the light colored link to form a loop.
i. In this formula, the LO observable that appears in the right hand side must be
considered as a functional of the initial classical field.
ii. The LO and NLO observables cannot be obtained in closed analytical form, be-
cause they contain the classical field Φ – retarded solution of a non-linear partial
differential equation that cannot be solved analytically in general. Nevertheless,
eq. (15.58) is an exact relationship between the two.
Why is the NLO “nearly classical”? : In a sense, eq. (15.58) indicates that ob-
servables at NLO in the strong field regime are almost classical, since they can be
obtained from the LO result (that depends only on the classical field Φ) by acting with
the operators $\mathbb{T}_{\pm k}$ (i.e. derivatives with respect to the initial value of the classical
field). If one had kept track of the powers of ħ, the ħ that comes at NLO would just
be an overall prefactor (the prefactor 1/2 in eq. (15.58) would become ħ/2), but all
the rest of the formula would not contain any ħ.
This is in fact not specific to the strong field regime nor to quantum field theory, but
is a general property of quantum mechanics. To see this, consider a generic quantum
system of Hamiltonian H and density operator ρt . The latter evolves according to the
Liouville-von Neumann equation:
$$ i\hbar\,\frac{\partial\rho_t}{\partial t} = \big[H,\rho_t\big] . \tag{15.59} $$
The next step is to introduce the Wigner transforms of the density operator:
$$ W_t(x,p) \equiv \int ds\; e^{ip\cdot s}\; \Big\langle x+\frac{s}{2}\Big|\,\rho_t\,\Big|x-\frac{s}{2}\Big\rangle . \tag{15.60} $$
to
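$$
\begin{aligned}
\frac{\partial W_t}{\partial t}
&= H(x,p)\;\frac{2}{i\hbar}\,\sin\Big(\frac{i\hbar}{2}\big(\overleftarrow{\partial}_p\,\overrightarrow{\partial}_x - \overleftarrow{\partial}_x\,\overrightarrow{\partial}_p\big)\Big)\; W_t(x,p) && (15.61)\\
&= \underbrace{\big\{H, W_t\big\}}_{\text{Poisson bracket}} + O(\hbar^2) && (15.62)
\end{aligned}
$$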
The first line is an exact equation, known as the Moyal-Groenewold equation. In the
second line, we have performed an expansion in powers of ħ, and one can readily
see that the order zero in ħ is nothing but the classical Liouville equation (it thus
describes a system whose time evolution is classical). The first quantum correction
to the time evolution arises only at the order ħ². Therefore, at the order ħ (i.e. NLO
in the language of quantum field theory), the time evolution of the system remains
purely classical. This does not mean that there are no quantum corrections of order
ħ, but that these corrections can only come from the initial state of the system (in
particular, from the fact that a quantum system cannot have well defined x and p at
the same time, and the Wigner distribution Wt (x, p) must have a width of order ħ
at least). The effect of the operator $\mathbb{T}_{+k}\,\mathbb{T}_{-k}$ that acts on the LO in eq. (15.58) is
precisely to restore this quantum width of the initial state.
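A toy version of this statement can be written for a single harmonic oscillator. The sketch below samples initial conditions from the Wigner distribution of the ground state, taken here to be the Gaussian with variances ħ/(2mω) in x and ħmω/2 in p (a standard result that is assumed, not derived in the text), evolves every sample with the classical equations of motion, and averages x² over the samples. The sample size, the seed and the function names are choices made for this example.

```python
import math
import random

def sample_ground_state(n_samples, hbar, m, omega, rng):
    """Draw (x, p) pairs from the Gaussian Wigner distribution of the ground state."""
    sigma_x = math.sqrt(hbar / (2.0 * m * omega))
    sigma_p = math.sqrt(hbar * m * omega / 2.0)
    return [(rng.gauss(0.0, sigma_x), rng.gauss(0.0, sigma_p)) for _ in range(n_samples)]

def evolve_classically(x, p, t, m, omega):
    """Exact classical trajectory of a harmonic oscillator started at (x, p)."""
    c, s = math.cos(omega * t), math.sin(omega * t)
    return x * c + p / (m * omega) * s, p * c - m * omega * x * s

hbar, m, omega, t = 1.0, 1.0, 2.0, 0.7
rng = random.Random(0)
samples = sample_ground_state(20000, hbar, m, omega, rng)

evolved = [evolve_classically(x, p, t, m, omega) for x, p in samples]
mean_x2 = sum(x * x for x, _ in evolved) / len(evolved)

# A strictly classical initial condition x = p = 0 would give <x^2> = 0 at all times.
expected_x2 = hbar / (2.0 * m * omega)
print(mean_x2, expected_x2)
```

The time evolution here is purely classical, and the order ħ result for ⟨x²⟩ comes entirely from the width of the initial distribution. For a quadratic Hamiltonian the higher orders in eq. (15.61) vanish, so this toy case does not probe the O(ħ²) corrections to the evolution.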
Eq. (15.51), that formally relates a small field perturbation to the background field on
top of which it propagates, plays a crucial role in discussing many questions related
to strong fields. A standard proof of this formula relies on Green’s formulas, that we
shall discuss in this section.
4 W_t is not a bona fide probability distribution, because it is not positive definite in general. But the
regions of phase-space where it is negative are small, typically of order ħ. After being integrated either over
x or over p, it becomes a genuine probability distribution for the expectation values of p or x, respectively.
(The superscript 0 is a reminder of the fact that this is a free Green’s function, that
does not depend on the interaction potential U(Φ).) Note that G0R (x, y) obeys the
same equation if acted upon with $\Box_y + m^2$ instead. From the equations obeyed by Φ
and by G0R , we obtain
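$$
\begin{aligned}
G^0_R(x,y)\,\big(\overrightarrow{\Box}_y + m^2\big)\,\Phi(y) &= G^0_R(x,y)\,\big[j(y) - U'(\Phi(y))\big] ,\\
G^0_R(x,y)\,\big(\overleftarrow{\Box}_y + m^2\big)\,\Phi(y) &= -i\,\delta(x-y)\,\Phi(y) ,
\end{aligned}
\tag{15.65}
$$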
where the arrows on the d’Alembertian operators indicate on which side they act. By
integrating these equations over y above the initial surface t = 0, and by subtracting
them, we get the following relation
$$
\Phi(x) = i\int_{y^0>0} d^4y\; G^0_R(x,y)\,\Big[\big(\overleftarrow{\Box}_y - \overrightarrow{\Box}_y\big)\,\Phi(y) + j(y) - U'(\Phi(y))\Big] .
\tag{15.66}
$$
The last step is to show that the term that involves the difference between the two
d’Alembertian operators is in fact a boundary term that depends only on the initial
conditions. Note first the following identity,
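$$ A\,\big(\overleftarrow{\Box} - \overrightarrow{\Box}\big)\,B = \partial_\mu\Big[A\,\big(\overleftarrow{\partial}^\mu - \overrightarrow{\partial}^\mu\big)\,B\Big] , \tag{15.67} $$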
where the leftmost ∂µ acts on everything on its right. In other words, the left hand side
is a total derivative, and its integral over d4 y can be rewritten as a surface integral
thanks to Stokes’ theorem. The integration domain defined by y0 > 0 has three
boundaries:
5 This equation is the classical equation of motion in the scalar field theory of Lagrangian
$$ \mathcal{L} \equiv \frac{1}{2}(\partial_\mu\phi)(\partial^\mu\phi) - \frac{1}{2}m^2\phi^2 - U(\phi) + j\phi . $$
i. y0 = +∞ : this boundary at infinite time does not contribute, since the retarded
propagator obeys G0R (x, y) = 0 if y0 > x0 .
ii. y0 = 0 : this boundary gives a non zero contribution, that depends only on the
initial conditions for the field Φ.
iii. Boundary at spatial infinity : this boundary does not contribute if we assume
that the field vanishes when |x| → ∞, or for a finite volume with periodic
boundary conditions in the spatial directions.
Therefore, we obtain
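$$
\begin{aligned}
\Phi(x) = {}& i\int_{y^0>0} d^4y\; G^0_R(x,y)\,\big[j(y) - U'(\Phi(y))\big] \\
& + i\int_{y^0=0} d^3y\; G^0_R(x,y)\,\big(\overrightarrow{\partial}_{y^0} - \overleftarrow{\partial}_{y^0}\big)\,\Phi(y) .
\end{aligned}
\tag{15.68}
$$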
In this Green’s formula, the first term in the right hand side provides the dependence
on the source j, and on the interactions, while the second term tells us how Φ(x)
depends on the initial values of Φ(x) and of its first time derivative.
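A quick way to gain confidence in eq. (15.68) is to test it in a 0+1 dimensional analogue, where the field depends on time only. With the normalization implied by eq. (15.65), the retarded propagator of this toy model is i G⁰_R(t, s) = θ(t − s) sin(m(t − s))/m, and the boundary term reduces to cos(mt) Φ(0) + sin(mt) Φ'(0)/m. The sketch below solves the non-linear equation with a Runge-Kutta integrator and then evaluates the right hand side of the Green's formula on that solution. The reduction to 0+1 dimensions, the quartic potential, the source and the function names are assumptions of this example.

```python
import math

def solve_ode(phi0, dphi0, m, lam, source, t_max, n_steps):
    """RK4 solution of Phi'' + m^2 Phi + (lam/6) Phi^3 = source(t) on a uniform grid."""
    h = t_max / n_steps

    def acc(t, phi):
        return source(t) - m * m * phi - lam / 6.0 * phi**3

    ts, phis = [0.0], [phi0]
    t, phi, v = 0.0, phi0, dphi0
    for step in range(n_steps):
        k1x, k1v = v, acc(t, phi)
        k2x, k2v = v + 0.5 * h * k1v, acc(t + 0.5 * h, phi + 0.5 * h * k1x)
        k3x, k3v = v + 0.5 * h * k2v, acc(t + 0.5 * h, phi + 0.5 * h * k2x)
        k4x, k4v = v + h * k3v, acc(t + h, phi + h * k3x)
        phi += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        t = (step + 1) * h
        ts.append(t)
        phis.append(phi)
    return ts, phis

def green_formula_rhs(ts, phis, phi0, dphi0, m, lam, source):
    """Right hand side of the 0+1 dimensional Green's formula at the final time."""
    t_end, h = ts[-1], ts[1] - ts[0]
    integrand = [
        math.sin(m * (t_end - s)) / m * (source(s) - lam / 6.0 * p**3)
        for s, p in zip(ts, phis)
    ]
    # Trapezoidal rule for the bulk term (source and interactions).
    bulk = h * (sum(integrand) - 0.5 * (integrand[0] + integrand[-1]))
    # Boundary term: dependence on the initial value and initial time derivative.
    boundary = math.cos(m * t_end) * phi0 + math.sin(m * t_end) / m * dphi0
    return bulk + boundary

def source(t):
    return 0.5 * math.sin(0.7 * t)

ts, phis = solve_ode(1.0, 0.5, 1.0, 1.0, source, 5.0, 2000)
lhs = phis[-1]
rhs = green_formula_rhs(ts, phis, 1.0, 0.5, 1.0, 1.0, source)
print(lhs, rhs)
```

Note that the right hand side uses the solution itself at intermediate times, which is the sense in which the formula is implicit for a non-zero potential.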
Except in the trivial case where the potential U(Φ) is zero, eq. (15.68) does not
provide an explicit result for Φ(x), since the right hand side depends on Φ(y) at
points above the initial surface. Despite this limitation, this is a very useful tool in
order to perform formal manipulations involving retarded solutions of eq. (15.63). To
end this section, let us mention a diagrammatic interpretation of eq. (15.68), illustrated
in figure 15.5. One can expand the right hand side of eq. (15.68) in powers of the
interactions. The starting point is the zeroth order approximation, obtained by setting
Extension to a generic initial surface : In eq. (15.68), the initial conditions for
the field Φ have been set on the surface of constant time y0 = 0. However, there are
many situations in which this initial data is known on a different initial surface. Let
us consider a generic surface Σ, on which the field Φ and its derivatives are known.
As before, we wish to obtain a formula that expresses Φ(x) at some point x above Σ
in terms of these initial conditions on Σ.
Most of the derivation is identical to the case of a constant time initial surface,
with all the integrals over the domain y0 > 0 replaced by integrals over the domain Ω
located above Σ. The only significant change occurs when we apply Stokes’ theorem
in order to transform the 4-dimensional integral of a total derivative into an integral
over the boundary of Ω. Like in the previous case, the boundaries at infinite time, and
at infinity in the spatial directions do not contribute, and we have only a contribution
from the surface Σ. Stokes’ theorem can then be written as
$$ \int_\Omega d^4y\; \partial_\mu F^\mu(y) = -\int_\Sigma d^3S_y\; n_\mu F^\mu(y) , \tag{15.69} $$
where d3 Sy is the measure on the surface Σ, and nµ is a 4-vector normal to the surface
Σ at the point y, pointing above the surface Σ. In the important case where the initial
surface is invariant by translation in the transverse directions, the proper normalization
for nµ and d3 Sy can be obtained as follows. Parameterize an arbitrary displacement
dyµ on the surface Σ about the point y as dyµ = (βdy3 , dy1 , dy2 , dy3 ), where β
is the local slope of the surface Σ in the (y3 , y0 ) plane. Then, we have:
$$
n_\mu\, dy^\mu = 0 , \qquad
n_\mu n^\mu = 1 , \quad n^0 > 0 , \qquad
d^3S_y = \sqrt{1-\beta^2}\; dy^1\, dy^2\, dy^3 .
\tag{15.70}
$$
The second and third conditions make sense only if β < 1. This
implies that the surface Σ must be locally space-like. Physically, this means that a
signal emitted from a point of the surface Σ cannot reach the surface again in the
future. The relations (15.70) are illustrated in figure 15.6. Note that the orthogonality
defined by nµ dyµ = 0 does not correspond to the Euclidean concept of orthogonality.
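For a surface of local slope β, the first two conditions of (15.70) can be solved explicitly. With the metric signature (+, −, −, −) used for ds² = dt² − dx², one finds n^µ = (1, 0, 0, β)/√(1 − β²). The short sketch below checks this solution numerically; the function names and the test values are hypothetical.

```python
import math

def minkowski_dot(a, b):
    """Scalar product a_mu b^mu with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def normal_vector(beta):
    """Contravariant components n^mu of the unit normal pointing above the surface."""
    if abs(beta) >= 1.0:
        raise ValueError("the surface must be locally space-like: |beta| < 1")
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return (gamma, 0.0, 0.0, gamma * beta)

def surface_displacement(beta, dy1, dy2, dy3):
    """Displacement dy^mu along the surface, as parameterized in the text."""
    return (beta * dy3, dy1, dy2, dy3)

beta = 0.6
n = normal_vector(beta)
dy = surface_displacement(beta, 0.3, -1.2, 0.8)
print(minkowski_dot(n, dy), minkowski_dot(n, n), n[0])
```

The vector n is orthogonal to dy in the Minkowski sense even though the two are clearly not perpendicular in the Euclidean sense, which is the remark made at the end of the paragraph above.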
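[Figure 15.6 (diagram not reproduced): the domain Ω above the surface Σ, with the normal vector nµ and a displacement dyµ on Σ at the point y.]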
Thanks to eq. (15.69), it is possible to write the Green’s formula for an arbitrary
initial surface Σ as
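$$
\begin{aligned}
\Phi(x) = {}& i\int_\Omega d^4y\; G^0_R(x,y)\,\big[j(y) - U'(\Phi(y))\big] \\
& + i\int_\Sigma d^3S_y\; G^0_R(x,y)\,\big(n\cdot\overrightarrow{\partial}_y - n\cdot\overleftarrow{\partial}_y\big)\,\Phi(y) .
\end{aligned}
\tag{15.71}
$$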
For an arbitrary surface Σ, the second term in the right hand side of this formula tells
us explicitly what information about Φ we must provide on the initial surface in order
to determine it uniquely above the surface: at every point y ∈ Σ, one must specify
the values of the field Φ(y) and of its normal derivative n · ∂y Φ(y).
Treating the term U′′ (Φ(x))a(x) as an interaction, we can easily derive a Green’s
formula that expresses the field fluctuation a(x) in terms of its initial conditions on a
surface Σ,
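$$
\begin{aligned}
a(x) = {}& i\int_\Omega d^4y\; G^0_R(x,y)\,\big[-U''(\Phi(y))\,a(y)\big] \\
& + i\int_\Sigma d^3S_y\; G^0_R(x,y)\,\big(n\cdot\overrightarrow{\partial}_y - n\cdot\overleftarrow{\partial}_y\big)\,a(y) .
\end{aligned}
\tag{15.73}
$$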
Eq. (15.73) is illustrated in the figure 15.7. Every diagram contributing to a(x) has
exactly one instance of the initial value of a(y) (represented by an open square in
the figure) on the initial surface. Indeed, it is easy to see from eq. (15.73) that a(x)
depends linearly on its value a(y) on the initial surface. This is a consequence of the
fact that the equation of motion for a small fluctuation is a linear equation.
By comparing the figures 15.5 and 15.7, one sees that they differ only by the fact
that one instance of the field Φ(y) has been replaced by the small fluctuation a(y)
on the initial surface. Therefore, we expect a linear relationship between a(x) and
Φ(x), of the form
$$ a(x) = \mathbb{T}_a\,\Phi(x) , \tag{15.74} $$
where $\mathbb{T}_a$ is a linear operator that substitutes one power of Φ(y) by a(y) on Σ (i.e.
an operator that involves first derivatives with respect to the initial conditions on Σ).
It is easy to prove this relation by using eqs. (15.71) and (15.73). In order to do so
and at the same time determine the form of the operator $\mathbb{T}_a$, let us apply $\mathbb{T}_a$ to the
Green’s formula that gives Φ(x). We get6
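$$
\begin{aligned}
\mathbb{T}_a\,\Phi(x) = {}& i\int_\Omega d^4y\; G^0_R(x,y)\,\big[-U''(\Phi(y))\;\mathbb{T}_a\,\Phi(y)\big] \\
& + i\,\mathbb{T}_a\int_\Sigma d^3S_y\; G^0_R(x,y)\,\big(n\cdot\overrightarrow{\partial}_y - n\cdot\overleftarrow{\partial}_y\big)\,\Phi(y) .
\end{aligned}
\tag{15.75}
$$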
6 Since $\mathbb{T}_a$ acts only on the initial fields on Σ, we have $\mathbb{T}_a\, j = 0$.
If the boundary term in this formula can be made identical to the boundary term in the
Green’s formula for a(x), then this equation will be identical to the Green’s formula
for a(x) and we will have proven the announced relationship between a(x) and Φ(x).
This is the case if the operator $\mathbb{T}_a$ is chosen as
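$$
\mathbb{T}_a \equiv \int_\Sigma d^3S_y\; \Big[ a(y)\,\frac{\delta}{\delta\Phi(y)} + \big(n\cdot\partial a(y)\big)\,\frac{\delta}{\delta\big(n\cdot\partial\Phi(y)\big)} \Big] ,
\tag{15.76}
$$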
which is nothing but the operator that substitutes a(y) to Φ(y) on the initial surface
Σ, as announced (this definition generalizes eq. (15.52) to a generic initial surface and
to generic initial conditions for the perturbation).
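The content of this operator can be tested numerically in a 0+1 dimensional toy model, where the initial data reduce to two numbers (Φ(0), Φ'(0)) and the operator becomes a(0) ∂/∂Φ(0) + a'(0) ∂/∂Φ'(0). The sketch below compares the solution of the linearized equation a'' + m² a + U''(Φ) a = 0 with a finite difference of the classical solution with respect to its initial conditions. The quartic potential, the parameter values, the step ε and all names are assumptions of this example.

```python
M, LAM = 1.0, 2.0  # hypothetical mass and quartic coupling

def rk4_step(rhs, y, h):
    """One RK4 step for an autonomous first-order system y' = rhs(y)."""
    k1 = rhs(y)
    k2 = rhs([a + 0.5 * h * b for a, b in zip(y, k1)])
    k3 = rhs([a + 0.5 * h * b for a, b in zip(y, k2)])
    k4 = rhs([a + h * b for a, b in zip(y, k3)])
    return [a + h * (p + 2 * q + 2 * r + s) / 6.0
            for a, p, q, r, s in zip(y, k1, k2, k3, k4)]

def solve(y0, rhs, t_max, n_steps):
    y, h = list(y0), t_max / n_steps
    for _ in range(n_steps):
        y = rk4_step(rhs, y, h)
    return y

def background_rhs(y):
    """Classical field: Phi'' = -m^2 Phi - U'(Phi), with U' = (lam/6) Phi^3."""
    phi, dphi = y
    return [dphi, -M * M * phi - LAM / 6.0 * phi**3]

def joint_rhs(y):
    """Classical field together with a linearized perturbation a on top of it."""
    phi, dphi, a, da = y
    return [dphi, -M * M * phi - LAM / 6.0 * phi**3,
            da, -(M * M + LAM / 2.0 * phi**2) * a]

phi0, dphi0 = 1.5, 0.0   # initial data of the classical field
a0, da0 = 0.3, -0.2      # initial data of the perturbation
t_max, n_steps, eps = 4.0, 4000, 1e-4

a_linear = solve([phi0, dphi0, a0, da0], joint_rhs, t_max, n_steps)[2]

# Central finite difference of Phi with respect to a shift of its initial data.
phi_plus = solve([phi0 + eps * a0, dphi0 + eps * da0], background_rhs, t_max, n_steps)[0]
phi_minus = solve([phi0 - eps * a0, dphi0 - eps * da0], background_rhs, t_max, n_steps)[0]
a_from_shift = (phi_plus - phi_minus) / (2.0 * eps)
print(a_linear, a_from_shift)
```

The two numbers agree up to the finite difference error, which is the toy version of the statement that the perturbation is obtained by differentiating the classical field with respect to its initial condition.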
Note that $\mathbb{T}_a$ is an operator that performs an infinitesimal translation (by an
amount a(y)) to the initial condition of the classical field. By exponentiation, it may
be promoted into an operator that performs a finite shift of the initial condition. In
particular, if we denote by Φ[Φ0 ] the classical field whose initial value on Σ is Φ0 ,
then we have
(For simplicity we take the same source j(x) for the two fields, but this limitation
is easily circumvented if necessary.) Since the Schwinger-Keldysh propagator G0++
is also a Green’s function of the operator $\Box_x + m^2$, we can reproduce the previous
derivation of Green’s formula, which leads to7
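$$
\begin{aligned}
\Phi_+(x) = {}& i\int d^4y\; G^0_{++}(x,y)\,\big[j(y) - U'(\Phi_+(y))\big] \\
& + i\int d^3y\; \Big[ G^0_{++}(x,y)\,\big(\overleftarrow{\partial}_{y^0} - \overrightarrow{\partial}_{y^0}\big)\,\Phi_+(y) \Big]_{y^0=-\infty}^{y^0=+\infty} ,
\end{aligned}
\tag{15.79}
$$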
where we used the notation $[f(y^0)]_{y^0=a}^{y^0=b} \equiv f(b) - f(a)$. The only difference with
the Green’s formula derived with retarded propagators is the boundary term: since
7 Here also, the prefactors i follow from our convention for the propagators of the Schwinger-Keldysh
G0++ (x, y) does not vanish when y0 > x0 , there is also a non-zero contribution from
the boundary at y0 = +∞. Then, by using the fact that $(\Box_y + m^2)\,G^0_{+-}(x,y) = 0$,
we obtain in a similar way :
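$$
\begin{aligned}
0 = {}& i\int d^4y\; G^0_{+-}(x,y)\,\big[j(y) - U'(\Phi_-(y))\big] \\
& + i\int d^3y\; \Big[ G^0_{+-}(x,y)\,\big(\overleftarrow{\partial}_{y^0} - \overrightarrow{\partial}_{y^0}\big)\,\Phi_-(y) \Big]_{y^0=-\infty}^{y^0=+\infty} .
\end{aligned}
\tag{15.80}
$$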
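$$
\begin{aligned}
\Phi_+(x) = {}& i\int d^4y\; \Big\{ G^0_{++}(x,y)\,\big[j(y) - U'(\Phi_+(y))\big] - G^0_{+-}(x,y)\,\big[j(y) - U'(\Phi_-(y))\big] \Big\} \\
& - i\int d^3y\; \Big[ G^0_{++}(x,y)\,\overleftrightarrow{\partial}_{y^0}\,\Phi_+(y) - G^0_{+-}(x,y)\,\overleftrightarrow{\partial}_{y^0}\,\Phi_-(y) \Big]_{y^0=-\infty}^{y^0=+\infty} ,
\end{aligned}
\tag{15.81}
$$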
where $A\,\overleftrightarrow{\partial}_{y^0}\,B \equiv A\,(\overrightarrow{\partial}_{y^0} - \overleftarrow{\partial}_{y^0})\,B$. Similarly, we obtain for Φ− (x) :
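$$
\begin{aligned}
\Phi_-(x) = {}& i\int d^4y\; \Big\{ G^0_{-+}(x,y)\,\big[j(y) - U'(\Phi_+(y))\big] - G^0_{--}(x,y)\,\big[j(y) - U'(\Phi_-(y))\big] \Big\} \\
& - i\int d^3y\; \Big[ G^0_{-+}(x,y)\,\overleftrightarrow{\partial}_{y^0}\,\Phi_+(y) - G^0_{--}(x,y)\,\overleftrightarrow{\partial}_{y^0}\,\Phi_-(y) \Big]_{y^0=-\infty}^{y^0=+\infty} .
\end{aligned}
\tag{15.82}
$$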
At this point, these formulas are rather formal, and it is not clear why we have
gone through the trouble of subtracting the quantity given by eq. (15.80), since it is
identically zero. This will become transparent in the next section, where we show
that these formulas enable one to sum series of tree diagrams encountered in the
Schwinger-Keldysh formalism.
Note also that the only property of the propagators G0−+ and G0+− that we have
used in this derivation is the fact that they are annihilated by the operator $\Box_y$. Therefore,
the equations (15.81) and (15.82) remain valid if we replace these propagators
by any other pair of propagators sharing the same property. For instance, one can
replace the propagators G0+− and G0−+ of eqs. (1.367) by the following objects
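$$
\begin{aligned}
G^0_{+-}(x,y) &= \int \frac{d^4p}{(2\pi)^4}\; e^{-ip\cdot(x-y)}\; u(p)\; 2\pi\,\theta(-p^0)\,\delta(p^2) ,\\
G^0_{-+}(x,y) &= \int \frac{d^4p}{(2\pi)^4}\; e^{-ip\cdot(x-y)}\; v(p)\; 2\pi\,\theta(+p^0)\,\delta(p^2) ,
\end{aligned}
\tag{15.83}
$$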
where u(p) and v(p) are some arbitrary functions of the momentum p, without
altering any of the formulas in this section. We will make use of this freedom in the
next section.
Many problems involving strong fields require that one sums infinite series of tree
diagrams. These sums of diagrams can in general be expressed in terms of solutions
of the classical equations of motion. However, in order to determine them uniquely,
one must know the boundary conditions obeyed by these classical solutions. The
strategy in order to obtain them is to write the sum of tree diagrams as a recursive
integral equation. Then, by comparing this integral equation with a Green’s formula
such as eq. (15.68), one can read off the boundary conditions easily.
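The recursive structure can be made concrete in a 0+1 dimensional toy model with retarded propagators and vanishing initial data, where the integral equation reads Φ(t) = ∫₀ᵗ ds sin(m(t − s))/m [j(s) − U'(Φ(s))]. Iterating this equation from Φ = 0 generates, at the n-th step, the trees of depth at most n, and the iterates can be compared with a direct solution of the classical equation of motion. The toy model, the quartic potential, the constant source, the grid and the function names are assumptions of this sketch.

```python
import math

def iterate_tree_sum(source, m, lam, t_max, n_grid, n_iter):
    """Fixed-point iteration of Phi = int_0^t sin(m(t-s))/m [j - (lam/6) Phi^3]."""
    h = t_max / n_grid
    ts = [i * h for i in range(n_grid + 1)]
    phi = [0.0] * (n_grid + 1)  # zeroth iterate: no tree at all
    for _ in range(n_iter):
        bracket = [source(s) - lam / 6.0 * p**3 for s, p in zip(ts, phi)]
        new_phi = [0.0]
        for i in range(1, n_grid + 1):
            vals = [math.sin(m * (ts[i] - ts[k])) / m * bracket[k] for k in range(i + 1)]
            # Trapezoidal rule on [0, t_i].
            new_phi.append(h * (sum(vals) - 0.5 * (vals[0] + vals[-1])))
        phi = new_phi
    return ts, phi

def solve_directly(source, m, lam, t_max, n_steps):
    """RK4 solution of Phi'' + m^2 Phi + (lam/6) Phi^3 = j with Phi(0) = Phi'(0) = 0."""
    h = t_max / n_steps

    def acc(t, phi):
        return source(t) - m * m * phi - lam / 6.0 * phi**3

    t, phi, v = 0.0, 0.0, 0.0
    for step in range(n_steps):
        k1x, k1v = v, acc(t, phi)
        k2x, k2v = v + 0.5 * h * k1v, acc(t + 0.5 * h, phi + 0.5 * h * k1x)
        k3x, k3v = v + 0.5 * h * k2v, acc(t + 0.5 * h, phi + 0.5 * h * k2x)
        k4x, k4v = v + h * k3v, acc(t + h, phi + h * k3x)
        phi += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        t = (step + 1) * h
    return phi

def constant_source(t):
    return 1.0

m, lam, t_max = 1.0, 0.5, 2.0
_, first_iterate = iterate_tree_sum(constant_source, m, lam, t_max, 200, 1)
_, tree_sum = iterate_tree_sum(constant_source, m, lam, t_max, 200, 10)
exact = solve_directly(constant_source, m, lam, t_max, 2000)
print(first_iterate[-1], tree_sum[-1], exact)
```

The first iterate is the free response to the source, and the later iterates add the non-linear corrections. The limit of the iteration agrees with the retarded solution that vanishes at the initial time, which is how the boundary condition is read off in this simple case.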
Sum of retarded trees : Let us illustrate this first in the simplest case, where one
must sum all the tree diagrams built with retarded propagators, and whose leaves are
a source j(x). Let us call Φ(x) the sum of all such tree diagrams. Given the recursive
structure of such trees, one can write immediately :
$$ \Phi(x) = i\int d^4y\; G^0_R(x,y)\,\big[j(y) - U'(\Phi(y))\big] , \tag{15.84} $$
In addition to summing over all the possible trees, we sum over all the combinations of
8 In the Feynman rules the integration at each vertex is extended to the full space-time $\mathbb{R}^4$.
± indices at every internal vertex. Firstly, the sum of these trees can be written in the
form of two coupled integral equations (there are now two fields Φ± (x) depending
on the index carried by the root of the tree) :
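$$
\begin{aligned}
\Phi_+(x) = {}& i\int d^4y\; G^0_{++}(x,y)\,\big[j(y) - U'(\Phi_+(y))\big]
- i\int d^4y\; G^0_{+-}(x,y)\,\big[j(y) - U'(\Phi_-(y))\big] ,\\
\Phi_-(x) = {}& i\int d^4y\; G^0_{-+}(x,y)\,\big[j(y) - U'(\Phi_+(y))\big]
- i\int d^4y\; G^0_{--}(x,y)\,\big[j(y) - U'(\Phi_-(y))\big] .
\end{aligned}
\tag{15.86}
$$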
At this point, we recognize that the right hand side of these equations is identical to
the first term in the right hand side of eqs. (15.81) and (15.82). From this observation,
we conclude that Φ+ (x) and Φ− (x) are solutions of the classical equation of motion,
We have now coupled boundary conditions for the fields Φ+ and Φ− , that involve the
value of the fields both at y0 = −∞ and at y0 = +∞. In addition, these boundary
conditions are non-local in coordinate space, since they involve integrals over d3 y on
the surfaces y0 = ±∞. However, they can be simplified considerably if one uses the
following Fourier representations for the propagators
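$$
\begin{aligned}
G^0_{++}(x,y) &= \int \frac{d^3p}{(2\pi)^3 2E_p}\; \Big[\theta(x^0-y^0)\,e^{-ip\cdot(x-y)} + \theta(y^0-x^0)\,e^{+ip\cdot(x-y)}\Big] ,\\
G^0_{--}(x,y) &= \int \frac{d^3p}{(2\pi)^3 2E_p}\; \Big[\theta(x^0-y^0)\,e^{+ip\cdot(x-y)} + \theta(y^0-x^0)\,e^{-ip\cdot(x-y)}\Big] ,\\
G^0_{+-}(x,y) &= \int \frac{d^3p}{(2\pi)^3 2E_p}\; u(p)\; e^{+ip\cdot(x-y)} ,\\
G^0_{-+}(x,y) &= \int \frac{d^3p}{(2\pi)^3 2E_p}\; v(p)\; e^{-ip\cdot(x-y)} .
\end{aligned}
\tag{15.89}
$$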
The superscripts (±) on the Fourier coefficients serve to distinguish the positive and
negative frequency modes. Note that because the fields Φ± (y) are not free fields,
these Fourier coefficients are time dependent. In practice, one may assume that the
interactions are switched off at y0 = ±∞, so that the coefficients $f^{(\pm)}_\epsilon(y^0,p)$ tend to
constants when y0 → ±∞. However, these limiting values are different at y0 = +∞
and at y0 = −∞, and we must keep the y0 argument to distinguish them. Using the
identity
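$$ \int d^3y\; e^{i\epsilon p\cdot(x-y)}\; \overleftrightarrow{\partial}_{y^0}\; e^{i\epsilon' p'\cdot y} = i\,\delta_{\epsilon\epsilon'}\; e^{i\epsilon p\cdot x}\; (2\pi)^3\, 2E_p\; \delta(p-p') , \tag{15.91} $$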
$$
\begin{aligned}
f^{(+)}_+(-\infty,p) &= f^{(-)}_-(-\infty,p) = 0 ,\\
f^{(-)}_+(+\infty,p) &= u(p)\; f^{(-)}_-(+\infty,p) ,\\
f^{(+)}_-(+\infty,p) &= v(p)\; f^{(+)}_+(+\infty,p) .
\end{aligned}
\tag{15.92}
$$
The boundary conditions have a very compact expression in terms of the Fourier
coefficients of the fields Φ± . At y0 = −∞, Φ+ (y) has no positive energy modes
and Φ− (y) has no negative energy modes. At y0 = +∞, the negative energy modes
of Φ+ (y) and Φ− (y) are proportional (with a proportionality relation that involves
the function u(p)). A similar relation, that involves the function v(p), holds between
their positive energy modes at y0 = +∞. Eqs. (15.92), together with the equations
of motion (15.87), determine uniquely the fields Φ± (x) and therefore provide the
solution to our original problem of summing tree diagrams in the Schwinger-Keldysh
formalism. One should however keep in mind that this solution is somewhat formal,
because it is in general extremely difficult to solve a non-linear field equation of
motion with boundary conditions specified both at y0 = −∞ and y0 = +∞.
Let us also mention that these boundary conditions become considerably simpler
in the case where u(p) = v(p) ≡ 1. Indeed, from the second and third of eqs. (15.92),
we see that the fields Φ± (y) have identical Fourier coefficients at y0 = +∞. There-
fore, the two fields must be equal in the limit y0 → +∞. Then, by solving their
equation of motion backwards in time, one sees trivially that they are equal at all
times (since they obey identical equations of motion),
To summarize, when u(p) = v(p) ≡ 1, the two fields Φ± (x) are equal to the
retarded field that vanishes when x0 → −∞. This result could in fact have been
obtained by a much more elementary argument. Indeed, when u(p) = v(p) ≡ 1,
the summation over the ± indices at the vertices of tree diagrams always leads to the
following combinations of propagators,
In other words, summing over these indices amounts to replacing all the propagators
in a given tree by retarded propagators, and one is thus led to the problem discussed
in section 15.5.4.
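$$ \mathbb{G}_{\alpha\beta} \equiv \sum_{\epsilon,\epsilon'=\pm} \Omega_{\alpha\epsilon}\,\Omega_{\beta\epsilon'}\; G_{\epsilon\epsilon'} . \tag{15.97} $$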
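(The same rotation is applied to the free propagators.) There is not a unique choice
of the matrix $\Omega_{\alpha\epsilon}$ that gives a zero component in $\mathbb{G}_{\alpha\beta}$, but the following choice is
convenient:
$$ \Omega_{\alpha\epsilon} \equiv \begin{pmatrix} 1 & -1 \\ 1/2 & 1/2 \end{pmatrix} . \tag{15.98} $$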
(The subscripts R, A and S stand respectively for retarded, advanced and symmetric.)
After having performed this rotation, eq. (15.44) is transformed into
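$$\mathbb{G}_{\alpha\beta}(x,y) = \mathbb{G}^0_{\alpha\beta}(x,y) - i \sum_{\delta,\gamma}\int d^4z\; \mathbb{G}^0_{\alpha\delta}(x,z)\, U''(\Phi(z))\, \sigma_{\delta\gamma}\, \mathbb{G}_{\gamma\beta}(z,y) , \tag{15.101}$$
where we denote
$$\sigma \equiv \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} . \tag{15.102}$$
In order to make the notations more compact, let us introduce the following shorthand,
$$\big[\mathbb{A}\circ\mathbb{B}\big]_{\alpha\beta}(x,y) \equiv -i \sum_{\delta,\gamma}\int d^4z\; \mathbb{A}_{\alpha\delta}(x,z)\, U''(\Phi(z))\, \sigma_{\delta\gamma}\, \mathbb{B}_{\gamma\beta}(z,y) . \tag{15.103}$$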
$$\mathbb{G} = \mathbb{G}^0 + \mathbb{G}^0\circ\mathbb{G} , \tag{15.104}$$
One has ◆² = 0, which greatly simplifies the calculation of the n-th power of $\mathbb{G}^0\sigma$.
From this observation, it is easy to obtain
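$$\big[\mathbb{G}^0\big]^{\circ(n+1)} = \begin{pmatrix} 0 & \big[G^0_A\big]^{\star(n+1)} \\ \big[G^0_R\big]^{\star(n+1)} & \sum_{i=0}^{n} \big[G^0_R\big]^{\star i}\star G^0_S\star \big[G^0_A\big]^{\star(n-i)} \end{pmatrix} , \tag{15.107}$$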
(and an obvious definition for the ⋆-exponentiation.) The summation of the off-
diagonal components of eq. (15.107) is trivial since these terms do not mix. Moreover,
the resummed GS propagator has a simple expression in terms of the resummed
retarded and advanced propagators. These results can be summarized by
$$G_R = \sum_{n=1}^{\infty}\big[G^0_R\big]^{\star n} , \qquad G_A = \sum_{n=1}^{\infty}\big[G^0_A\big]^{\star n} , \qquad G_S = G_R\,(G^0_R)^{-1}\,G^0_S\,(G^0_A)^{-1}\,G_A . \tag{15.109}$$
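The expression of $G_S$ can be recovered by summing the lower-right component of eq. (15.107) over $n$. The following is a sketch, writing $[G\star]^i$ for $i$ factors of $G$, each followed by a $\star$-product (so that the $i=0$ term is the identity). This shorthand is an assumption introduced for the illustration, not notation from the text.

```latex
\begin{align*}
G_S &= \sum_{n=0}^{\infty}\sum_{i=0}^{n}
        \big[G^0_R\star\big]^{i}\; G^0_S\; \big[\star G^0_A\big]^{n-i}
     = \Big(\sum_{i=0}^{\infty}\big[G^0_R\star\big]^{i}\Big)\, G^0_S\,
       \Big(\sum_{j=0}^{\infty}\big[\star G^0_A\big]^{j}\Big) , \\
G_R &= \Big(\sum_{i=0}^{\infty}\big[G^0_R\star\big]^{i}\Big)\, G^0_R
   \;\;\Longrightarrow\;\;
   \sum_{i=0}^{\infty}\big[G^0_R\star\big]^{i} = G_R\,(G^0_R)^{-1} , \\
G_A &= G^0_A\,\Big(\sum_{j=0}^{\infty}\big[\star G^0_A\big]^{j}\Big)
   \;\;\Longrightarrow\;\;
   \sum_{j=0}^{\infty}\big[\star G^0_A\big]^{j} = (G^0_A)^{-1}\,G_A .
\end{align*}
```

Inserting the last two lines into the first one gives $G_S = G_R\,(G^0_R)^{-1}\,G^0_S\,(G^0_A)^{-1}\,G_A$.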
At this stage, we know all the components of the resummed propagator in the rotated
basis. In order to obtain them in the original basis, we just have to invert the rotation
of eq. (15.97), which gives
It is easy to check that these equations are equivalent to eqs. (15.49).
Note that for a real potential U(Φ), a±k (x) are mutual complex conjugates. Any
solution of the equation of motion for small fluctuations can be written as
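$$a(x) = \int \frac{d^3\mathbf{k}}{(2\pi)^3\, 2E_{\mathbf{k}}}\Big[\alpha^{\mathbf{k}}_{+}\,a_{+\mathbf{k}}(x) + \alpha^{\mathbf{k}}_{-}\,a_{-\mathbf{k}}(x)\Big] , \tag{15.113}$$
where the $\alpha^{\mathbf{k}}_{\pm}$ are constant coefficients that depend on the boundary conditions (the
boundary conditions in general lead to a set of linear equations for the coefficients).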
where the dot denotes a time derivative and σ2 is the second Pauli matrix. Thanks
to the fact that the background potential U′′ (Φ(x)) is real, one can construct from
a1 and a2 an inner product which is an invariant of the time evolution of the two
perturbations. This quantity is reminiscent of the Wronskian for two solutions of a
second order ordinary differential equation, and it is defined as follows
$$\big(a_1\big|a_2\big) \equiv i\int d^3\mathbf{x}\,\Big[\dot a_1^*(x)\,a_2(x) - a_1^*(x)\,\dot a_2(x)\Big] . \tag{15.115}$$
Although $(a_1|a_2)$ could in principle depend on time (since one integrates only over
space in its definition), it is immediate to verify that
$$\frac{\partial}{\partial x^0}\big(a_1\big|a_2\big) = 0 . \tag{15.116}$$
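The time independence can be made explicit in a few lines. The sketch below assumes that the equation of motion for small fluctuations has the form $\ddot a = (\nabla^2 - W)\,a$, where $W(x)$ denotes the background-dependent term built from $m^2$ and $U''(\Phi(x))$. The symbol $W$ is introduced here for the illustration only, and the fluctuations are assumed to vanish at spatial infinity.

```latex
% Assumption: \ddot a = (\nabla^2 - W)\,a, with W(x) real when U''(\Phi(x)) is real.
\begin{align*}
\frac{\partial}{\partial x^0}\big(a_1\big|a_2\big)
 &= i\int d^3\mathbf{x}\,\big[\ddot a_1^*\,a_2 - a_1^*\,\ddot a_2\big]
   \qquad\text{(the $\dot a_1^*\,\dot a_2$ terms cancel)}\\
 &= i\int d^3\mathbf{x}\,\big[(\nabla^2 a_1^*)\,a_2 - a_1^*\,\nabla^2 a_2\big]
   - i\int d^3\mathbf{x}\,\big(W^*-W\big)\,a_1^*\,a_2 \\
 &= 0 .
\end{align*}
```

The first integral vanishes after integrating by parts twice, and the second one vanishes because $W$ is real, which is where the reality of the background potential enters.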
Since it is a constant in time, one can compute this inner product from the value of the
field fluctuations in the remote past. This is particularly handy when the fluctuations
under consideration are specified by retarded boundary conditions, as is the case for
a±k (x). One finds
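$$\begin{aligned}
\big(a_{+\mathbf{k}}\big|a_{+\mathbf{l}}\big) &= (2\pi)^3\,2E_{\mathbf{k}}\,\delta(\mathbf{k}-\mathbf{l}) ,\\
\big(a_{-\mathbf{k}}\big|a_{-\mathbf{l}}\big) &= -(2\pi)^3\,2E_{\mathbf{k}}\,\delta(\mathbf{k}-\mathbf{l}) ,\\
\big(a_{+\mathbf{k}}\big|a_{-\mathbf{l}}\big) &= \big(a_{-\mathbf{k}}\big|a_{+\mathbf{l}}\big) = 0 .
\end{aligned}\tag{15.117}$$
Consider now a generic solution a(x) of eq. (15.111). Since the $a_{\pm\mathbf{k}}$ form a basis of
the linear space of solutions, one can write a(x) as a linear superposition
$$\big|a\big) = \int \frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\Big[\alpha^{\mathbf{k}}_{+}\,\big|a_{+\mathbf{k}}\big) + \alpha^{\mathbf{k}}_{-}\,\big|a_{-\mathbf{k}}\big)\Big] , \tag{15.118}$$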
By inserting these relations back into eq. (15.118), and by using the fact that it is
valid for any small fluctuation a(x) solution of eq. (15.111), we obtain the following
identity
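$$\int \frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\Big[\big|a_{+\mathbf{k}}\big)\big(a_{+\mathbf{k}}\big| - \big|a_{-\mathbf{k}}\big)\big(a_{-\mathbf{k}}\big|\Big] = 1 . \tag{15.120}$$
This identity is valid at all times over the space of solutions of eq. (15.111). It is a
manifestation of the fact that, when the background field is real, the time evolution
preserves the completeness of the set of states $a_{\pm\mathbf{k}}$.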
where the subscript c indicates that we retain only the connected part of the correlator.
From the generic power counting arguments developed in the previous sections,
these connected correlators are all of order λ−1 in the strong field regime. It is
also important to realize that the connected part of these correlators is subleading
compared to their fully disconnected part, since
$$\qquad +\;\cdots\;+\;\underbrace{\big\langle \mathcal{O}(x_1)\cdots\mathcal{O}(x_n)\big\rangle_c}_{\lambda^{-1}} \tag{15.122}$$
We see in this formula that, in the strong field regime, the fully connected part of an
n-point correlator is suppressed by $\lambda^{n-1}$ compared to the trivial disconnected term.
Thus, even at tree level, the correlated part of an n-point function is not a leading order
quantity, but arises only at order n − 1 in the expansion in powers of λ.
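One can encapsulate all the correlation functions (15.121) into a generating
functional defined as follows⁹:
$$F[z(\mathbf{x})] \equiv \big\langle 0_{\rm in}\big|\,\exp\int_{t_f} d^3\mathbf{x}\; z(\mathbf{x})\,\mathcal{O}(\phi(x))\,\big|0_{\rm in}\big\rangle , \tag{15.123}$$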
where the argument of the field in O is x ≡ (tf , x). From this generating functional,
the correlation functions are obtained by differentiating with respect to the z(xi ) and
by setting z ≡ 0 afterwards. In order to remove the uncorrelated part of the n-point
9 This is easily generalized to the case where the initial state is a coherent state instead of the vacuum.
$$C_{\{1\cdots n\}} = \left.\frac{\delta^n \ln F}{\delta z(\mathbf{x}_1)\cdots\delta z(\mathbf{x}_n)}\right|_{z\equiv 0} \tag{15.124}$$
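As an illustration of why the logarithm removes the uncorrelated part, the case $n=2$ can be worked out explicitly. The sketch uses the normalization $F[z\equiv 0]=1$ of the generating functional, and writes $\langle\cdots\rangle$ as a shorthand (introduced here) for the expectation value in the state $|0_{\rm in}\rangle$.

```latex
\begin{align*}
\frac{\delta^2\ln F}{\delta z(\mathbf{x}_1)\,\delta z(\mathbf{x}_2)}
 &= \frac{1}{F}\,\frac{\delta^2 F}{\delta z(\mathbf{x}_1)\,\delta z(\mathbf{x}_2)}
  - \frac{1}{F^2}\,\frac{\delta F}{\delta z(\mathbf{x}_1)}\,
                   \frac{\delta F}{\delta z(\mathbf{x}_2)} , \\
C_{\{12\}}
 &= \big\langle \mathcal{O}(x_1)\,\mathcal{O}(x_2)\big\rangle
  - \big\langle \mathcal{O}(x_1)\big\rangle\,\big\langle \mathcal{O}(x_2)\big\rangle .
\end{align*}
```

The second line is the first one evaluated at $z\equiv 0$: the product of 1-point functions is subtracted, and only the correlated part of the 2-point function survives.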
The observable O(φ(x)) is made of the field in the Heisenberg picture, φ(x), that can
be related to the field φin (x) of the interaction picture as follows:
We can therefore rewrite the generating functional solely in terms of the interaction
picture field φin ,
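$$F[z(\mathbf{x})] = \big\langle 0_{\rm in}\big|\, P \exp \int d^3\mathbf{x}\,\Big\{ i\int dx^0\,\Big(\mathcal{L}_{\rm int}(\phi^+_{\rm in}(x)) - \mathcal{L}_{\rm int}(\phi^-_{\rm in}(x))\Big)\;\cdots$$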
Figure 15.8: Diagrammatic rules for the extended Schwinger-Keldysh formalism that gives the generating functional. The Feynman rules shown here for the self-interactions correspond to a $\lambda\phi^4/4!$ interaction term. In this illustration, we have assumed that the observable is quartic in the field when drawing the corresponding vertex (proportional to z(x)).
[Diagram labels: propagators $G^0_{++}$, $G^0_{--}$, $G^0_{+-}$, $G^0_{-+}$; vertices −iλ (type +) and +iλ (type −); sources +iJ(x) (type +) and −iJ(x) (type −); observable vertex z(x).]
the vacuum initial state, we recall that the propagators have the following explicit
expressions:
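$$\begin{aligned}
G^0_{-+}(x,y) &= \int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\; e^{-ik\cdot(x-y)} ,\\
G^0_{+-}(x,y) &= \int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\; e^{+ik\cdot(x-y)} ,\\
G^0_{++}(x,y) &= \theta(x^0-y^0)\,G^0_{-+}(x,y) + \theta(y^0-x^0)\,G^0_{+-}(x,y) ,\\
G^0_{--}(x,y) &= \theta(x^0-y^0)\,G^0_{+-}(x,y) + \theta(y^0-x^0)\,G^0_{-+}(x,y) .
\end{aligned}\tag{15.128}$$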
Note that when we set z ≡ 0, these diagrammatic rules fall back to the pure
Schwinger-Keldysh formalism, for which all the connected vacuum-to-vacuum graphs are
zero. This implies that
F[z ≡ 0] = 1 , (15.129)
The half-sum φ2 in a sense captures the classical content (plus some quantum correc-
tions), while the difference φ1 is purely quantum (because it represents the different
histories of the fields in the amplitude and in the complex conjugated amplitude). To
see how the Feynman rules are modified in terms of these new fields, let us start from
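$$\phi_\alpha = \sum_{\epsilon=\pm}\Omega_{\alpha\epsilon}\,\phi_\epsilon \qquad (\alpha = 1,2) , \tag{15.131}$$
with $\Omega_{\alpha\epsilon}$ the matrix defined in eq. (15.98). The new propagators after this rotation
have been calculated earlier,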
Note that G021 is the bare retarded propagator, while G012 is the bare advanced propa-
gator. The vertices in the new formalism (here written for a quartic interaction) are
given by
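$$\Lambda_{\alpha\beta\gamma\delta} \equiv -i\lambda\,\Big[\Omega^{-1}_{+\alpha}\,\Omega^{-1}_{+\beta}\,\Omega^{-1}_{+\gamma}\,\Omega^{-1}_{+\delta} - \Omega^{-1}_{-\alpha}\,\Omega^{-1}_{-\beta}\,\Omega^{-1}_{-\gamma}\,\Omega^{-1}_{-\delta}\Big] , \tag{15.133}$$
where
$$\Omega^{-1}_{\epsilon\alpha} = \begin{pmatrix} 1/2 & 1 \\ -1/2 & 1 \end{pmatrix} \qquad \big[\,\Omega_{\alpha\epsilon}\,\Omega^{-1}_{\epsilon\beta} = \delta_{\alpha\beta}\,\big] . \tag{15.134}$$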
(The vertices not listed explicitly here are obtained by permutations.) Finally, the
rules for an external source in the retarded-advanced basis are :
J1 = J , J2 = 0 . (15.136)
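The vertices and source rules in the rotated basis follow mechanically from eqs. (15.133) and (15.134). The sketch below tabulates them with exact rational arithmetic; the function name and the dictionary layout are assumptions made for the illustration. It returns the coefficient $c$ such that the vertex equals $-i\lambda\,c$, and applying the same combination to a single index reproduces the source rule (the coefficient multiplying $J$).

```python
from fractions import Fraction

# Inverse rotation of eq. (15.134): first key is epsilon = '+' or '-',
# second key is the rotated index alpha = 1 or 2.
OMEGA_INV = {
    ('+', 1): Fraction(1, 2), ('+', 2): Fraction(1),
    ('-', 1): Fraction(-1, 2), ('-', 2): Fraction(1),
}

def rotated_coefficient(indices):
    """Product of Omega^{-1}_{+a} over the indices minus the same product with '-'.

    For four indices this is c in Lambda_{abcd} = -i * lambda * c (eq. 15.133);
    for a single index it is the coefficient of J in the rotated source J_a.
    """
    plus = Fraction(1)
    minus = Fraction(1)
    for alpha in indices:
        plus *= OMEGA_INV[('+', alpha)]
        minus *= OMEGA_INV[('-', alpha)]
    return plus - minus
```

With this function, `rotated_coefficient((1, 2, 2, 2))` is 1 and `rotated_coefficient((1, 1, 1, 2))` is 1/4, while the combinations with an even number of indices equal to 1 vanish. For a single index one gets 1 for α = 1 and 0 for α = 2, in agreement with eq. (15.136).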
Finally, note that the observable depends only on the field φ2 , i.e. O = O(φ2 ).
Indeed, the fields φ+ and φ− represent the field in the amplitude and in the conjugated
amplitude. Their difference should vanish when a measurement is performed.
weighting this vertex by z(x) and integrating over x). Furthermore, by considering
the logarithm of the generating functional rather than F itself, we have only diagrams
that are connected to the point x, as shown in this representation:
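$$\frac{\delta \ln F}{\delta z(\mathbf{x})} = \big[\text{diagram: gray blob attached to the observable vertex at } x\big] , \tag{15.137}$$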
where the gray blob is a sum of graphs constructed with the Feynman rules of the
figure 15.8, or their analogue in the retarded-advanced formulation. Therefore, these
graphs still depend implicitly on z. Note that this blob does not have to be connected.
Tree level expression : Without further specifying the content of the blob, the
equation (15.137) is valid to all orders, both in z and in g. At lowest order in g (tree
level), a considerable simplification happens because the blob must be a product of
disconnected subgraphs, one for each line attached to the vertex O(φ(x)):
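$$\left.\frac{\delta \ln F}{\delta z(\mathbf{x})}\right|_{\rm tree} = \big[\text{diagram: one light-coloured blob on each line attached to the vertex at } x\big] , \tag{15.138}$$
where now each of the light-coloured blobs is a connected tree 1-point diagram. In the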
retarded-advanced formalism, there are two of these 1-point functions, that we will
denote φ1 and φ2 . At tree level, they can be defined recursively by the following pair
of coupled integral equations:
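$$\begin{aligned}
\phi_1(x) &= i\int_\Omega d^4y\; G^0_{12}(x,y)\,\frac{\partial\mathcal{L}_{\rm int}(\phi_1,\phi_2)}{\partial\phi_2(y)}
 + \int_{t_f} d^3\mathbf{y}\; G^0_{12}(x,y)\, z(\mathbf{y})\,\mathcal{O}'(\phi_2(y)) ,\\
\phi_2(x) &= i\int_\Omega d^4y\,\left\{ G^0_{21}(x,y)\,\frac{\partial\mathcal{L}_{\rm int}(\phi_1,\phi_2)}{\partial\phi_1(y)} + G^0_{22}(x,y)\,\frac{\partial\mathcal{L}_{\rm int}(\phi_1,\phi_2)}{\partial\phi_2(y)}\right\}
 + \int_{t_f} d^3\mathbf{y}\; G^0_{22}(x,y)\, z(\mathbf{y})\,\mathcal{O}'(\phi_2(y)) .
\end{aligned}\tag{15.139}$$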
In these equations, O ′ is the derivative of the observable with respect to the field, Ω is
the space-time domain comprised between the initial and final times, and we denote
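$$\mathcal{L}_{\rm int}(\phi_1,\phi_2) \equiv \mathcal{L}_{\rm int}\big(\phi_2+\tfrac12\phi_1\big) - \mathcal{L}_{\rm int}\big(\phi_2-\tfrac12\phi_1\big) . \tag{15.140}$$
For an interaction Lagrangian $-\frac{\lambda}{4!}\phi^4 + J\phi$, this difference reads
$$\mathcal{L}_{\rm int}(\phi_1,\phi_2) = -\frac{\lambda}{6}\,\phi_2^3\,\phi_1 - \frac{\lambda}{4!}\,\phi_1^3\,\phi_2 + J\phi_1 . \tag{15.141}$$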
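The algebra behind this expression is a short binomial expansion, written out here for the interaction Lagrangian $-\frac{\lambda}{4!}\phi^4+J\phi$ quoted above. Only the odd powers of $\phi_1$ survive in the difference.

```latex
\begin{align*}
\big(\phi_2+\tfrac12\phi_1\big)^4-\big(\phi_2-\tfrac12\phi_1\big)^4
 &= 2\Big[4\,\phi_2^3\,\big(\tfrac12\phi_1\big)+4\,\phi_2\,\big(\tfrac12\phi_1\big)^3\Big]
  = 4\,\phi_2^3\,\phi_1+\phi_2\,\phi_1^3 ,\\
-\frac{\lambda}{4!}\Big[4\,\phi_2^3\,\phi_1+\phi_2\,\phi_1^3\Big]
 &= -\frac{\lambda}{6}\,\phi_2^3\,\phi_1-\frac{\lambda}{4!}\,\phi_1^3\,\phi_2 ,\\
J\big(\phi_2+\tfrac12\phi_1\big)-J\big(\phi_2-\tfrac12\phi_1\big) &= J\,\phi_1 .
\end{align*}
```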
i.e. simply the observable O evaluated on the field φ2 (x) (but this field depends on z
to all orders, via the boundary terms in eqs. (15.139)).
Classical equations of motion : Using the fact that $G^0_{12}$ and $G^0_{21}$ are Green’s
functions of $\Box + m^2$, respectively obeying the following identities
The second term of the right hand side is a total derivative thanks to
$$A\,\overset{\leftrightarrow}{\Box}\,B = \partial_\mu\Big[A\,\overset{\leftrightarrow}{\partial^\mu}\,B\Big] . \tag{15.149}$$
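This identity can be checked in one line. The sketch assumes the usual convention that a double arrow denotes the derivative acting on the right minus the derivative acting on the left.

```latex
\begin{align*}
A\,\overset{\leftrightarrow}{\Box}\,B &\equiv A\,(\Box B)-(\Box A)\,B ,\\
\partial_\mu\big[A\,\partial^\mu B-(\partial^\mu A)\,B\big]
 &= (\partial_\mu A)(\partial^\mu B)+A\,\Box B-(\Box A)\,B-(\partial^\mu A)(\partial_\mu B)
  = A\,\Box B-(\Box A)\,B .
\end{align*}
```

The two terms with one derivative on each factor cancel, which leaves exactly the combination on the left-hand side of eq. (15.149).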
Therefore, this term can be rewritten as a surface integral extended to the boundary of
the domain Ω. With reasonable assumptions on the spatial localization of the source
J(x) that drives the field, we may disregard the contribution from the boundary at
spatial infinity. The remaining boundaries are at the initial time ti and final time tf ,
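$$\phi_1(x) = i\int_\Omega d^4y\; G^0_{12}(x,y)\,\frac{\partial\mathcal{L}_{\rm int}(\phi_1,\phi_2)}{\partial\phi_2(y)}
 - i\int d^3\mathbf{y}\,\Big[ G^0_{12}(x,y)\,\overset{\leftrightarrow}{\partial}_{y^0}\,\phi_1(y)\Big]_{y^0=t_f} . \tag{15.150}$$
Note that the boundary term vanishes at the initial time $t_i$, because $G^0_{12}$ is the advanced
propagator (it vanishes when $y^0 < x^0$). Likewise, we obtain the following equation for $\phi_2$: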
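$$\begin{aligned}
\phi_2(x) = {}& i\int_\Omega d^4y\,\left\{ G^0_{21}(x,y)\,\frac{\partial\mathcal{L}_{\rm int}(\phi_1,\phi_2)}{\partial\phi_1(y)} + G^0_{22}(x,y)\,\frac{\partial\mathcal{L}_{\rm int}(\phi_1,\phi_2)}{\partial\phi_2(y)}\right\}\\
& - i\int d^3\mathbf{y}\,\Big[ G^0_{21}(x,y)\,\overset{\leftrightarrow}{\partial}_{y^0}\,\phi_2(y) + G^0_{22}(x,y)\,\overset{\leftrightarrow}{\partial}_{y^0}\,\phi_1(y)\Big]_{t_i}^{t_f} .
\end{aligned}\tag{15.151}$$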
The boundary conditions at ti and tf are obtained by comparing eqs. (15.139) and
(15.150-15.151). At the final time tf , the boundary condition is
From the explicit form of the propagators $G^0_{+-}$ and $G^0_{-+}$ (see eqs. (15.128)), we see
that, at the initial time, the combination $\phi_2+\tfrac12\phi_1$ has no positive frequency
components, and the combination $\phi_2-\tfrac12\phi_1$ has no negative frequency components. An
equivalent way to state this boundary condition is in terms of the Fourier coefficients
of the fields $\phi_{1,2}$. Let us decompose them at the time $t_i$ as follows,
$$\phi_{1,2}(t_i,\mathbf{x}) \equiv \int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\,\Big\{ \tilde\phi^{(+)}_{1,2}(\mathbf{k})\,e^{-ik\cdot x} + \tilde\phi^{(-)}_{1,2}(\mathbf{k})\,e^{+ik\cdot x}\Big\} . \tag{15.155}$$
In terms of the coefficients introduced in this decomposition, the boundary conditions
at the initial time read:
$$\tilde\phi^{(+)}_{2}(\mathbf{k}) = -\frac12\,\tilde\phi^{(+)}_{1}(\mathbf{k}) , \qquad \tilde\phi^{(-)}_{2}(\mathbf{k}) = \frac12\,\tilde\phi^{(-)}_{1}(\mathbf{k}) . \tag{15.156}$$
with a boundary condition at the initial time that depends on the coherent state
in which the system is initialized (Φini ≡ 0 when the initial state is the vacuum).
However, this expansion becomes increasingly cumbersome beyond this simple result.
Instead of pursuing this very complicated expansion in powers of z, we present an
approximation that allows for an all-orders solution of eqs. (15.145), (15.152) and
(15.156). Here, we give only a very sketchy motivation for this approximation, and a
lengthier discussion of its validity will be provided later in this section (after we have
derived expressions for the fields φ1 and φ2 ).
Let us first recall that the fields φ+ and φ− represent, respectively, the space-time
evolution of the field in amplitudes and in conjugate amplitudes. The fact that they are
distinct leads to interferences when squaring amplitudes, a quantum effect controlled
by ħ. Consequently, we may expect the difference φ1 ≡ φ+ − φ− to be small
compared to φ± themselves, i.e.
$$\phi_1 \ll \phi_2 . \tag{15.158}$$
In this situation, that we will call the quasi-classical approximation, we can approxi-
mate the equations of motion (15.145) by keeping only the lowest order in φ1 . This
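amounts to keeping only the terms linear in $\phi_1$ in eq. (15.140) (in the case of a $\phi^4$
theory, it means dropping the $\phi_1^3\,\phi_2$ term in eq. (15.141)). In this approximation, they
read
$$\begin{aligned}
\Big[\Box + m^2 - \mathcal{L}''_{\rm int}(\phi_2)\Big]\,\phi_1 &= 0 ,\\
(\Box + m^2)\,\phi_2 - \mathcal{L}'_{\rm int}(\phi_2) &= 0 ,
\end{aligned}\tag{15.159}$$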
while the boundary conditions are still given by (15.152) and (15.154). The problem
one must now solve is illustrated in figure 15.9. The field $\phi_1$ obeys a linear
equation of motion (dressed by the field $\phi_2$, although this aspect is not visible in
the figure), with an advanced boundary condition that depends on $\phi_2$. In parallel,
the field $\phi_2$ obeys the classical field equation of motion, with a retarded boundary
condition that depends on $\phi_1$. As we shall show, this tightly constrained problem
admits a formal solution, valid to all orders in the function z, in the form of an implicit
functional equation for the first derivative of ln F[z].
[Figure 15.9: the fields $\phi_2$ and $\phi_1$, with the conditions $\tilde\phi_2^{(+)} = -\tilde\phi_1^{(+)}/2$ and $\tilde\phi_2^{(-)} = +\tilde\phi_1^{(-)}/2$ imposed at the initial time $t_i$.]
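For the $\phi^4$ example used throughout, the quasi-classical equations of motion can be written out explicitly. This short worked case assumes the interaction Lagrangian $\mathcal{L}_{\rm int}(\phi) = -\frac{\lambda}{4!}\phi^4 + J\phi$ of eq. (15.141).

```latex
\begin{align*}
\mathcal{L}_{\rm int}'(\phi) &= -\frac{\lambda}{6}\,\phi^3 + J ,
 & \mathcal{L}_{\rm int}''(\phi) &= -\frac{\lambda}{2}\,\phi^2 ,\\
\Big[\Box+m^2+\frac{\lambda}{2}\,\phi_2^2\Big]\,\phi_1 &= 0 ,
 & (\Box+m^2)\,\phi_2+\frac{\lambda}{6}\,\phi_2^3 &= J .
\end{align*}
```

The field $\phi_2$ thus obeys the classical equation of motion driven by the source $J$, while $\phi_1$ propagates linearly on top of it, with a mass term shifted by $\lambda\phi_2^2/2$.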
Formal solution : In order to solve the equation of motion for φ1 , let us introduce
mode functions a±k (x), defined as follows
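$$\begin{aligned}
\Big[\Box_x + m^2 - \mathcal{L}''_{\rm int}(\phi_2(x))\Big]\,a_{\pm\mathbf{k}}(x) &= 0 ,\\
\lim_{x^0\to t_i} a_{\pm\mathbf{k}}(x) &= e^{\mp ik\cdot x} .
\end{aligned}\tag{15.160}$$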
In other words, they form a basis of the linear space of solutions of the equation
obeyed by φ1 , and therefore we may express φ1 as a linear superposition of the mode
functions. At this point, we use a slightly more explicit form of eq. (15.120),
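$$\int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\left\{\begin{pmatrix} a_{+\mathbf{k}}(x)\,\dot a_{-\mathbf{k}}(y) & a_{-\mathbf{k}}(x)\,a_{+\mathbf{k}}(y)\\ \dot a_{+\mathbf{k}}(x)\,\dot a_{-\mathbf{k}}(y) & \dot a_{-\mathbf{k}}(x)\,a_{+\mathbf{k}}(y)\end{pmatrix} - (+\mathbf{k}\leftrightarrow -\mathbf{k})\right\}_{x^0=y^0}
 = i\,\delta(\mathbf{x}-\mathbf{y})\begin{pmatrix} 1 & 0\\ 0 & 1\end{pmatrix} , \tag{15.161}$$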
Thanks to these identities, it is easy to check that the field φ1 that obeys the required
equation of motion and boundary conditions is given by
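$$\phi_1(x) = \int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\int d^3\mathbf{u}\;\Big\{ a_{-\mathbf{k}}(x)\,a_{+\mathbf{k}}(t_f,\mathbf{u}) - a_{+\mathbf{k}}(x)\,a_{-\mathbf{k}}(t_f,\mathbf{u})\Big\}\; z(\mathbf{u})\,\mathcal{O}'(\phi_2(t_f,\mathbf{u})) . \tag{15.162}$$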
The above equation formally defines φ1 (x) in the bulk, x ∈ Ω, in terms of the field
φ2 at the final time. Besides the explicit factor z(u), the right hand side contains also
an implicit z dependence (to all orders in z) in the field φ2 (tf , u) and in the mode
functions a±k (since they evolve on top of the background φ2 ).
Then, using the boundary condition at the initial time, we obtain the following
expression for the field φ2 at ti ,
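$$\phi_2(t_i,\mathbf{y}) = \frac12\int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\int d^3\mathbf{u}\;\Big\{ e^{+ik\cdot y}\,a_{+\mathbf{k}}(t_f,\mathbf{u}) + e^{-ik\cdot y}\,a_{-\mathbf{k}}(t_f,\mathbf{u})\Big\}\; z(\mathbf{u})\,\mathcal{O}'(\phi_2(t_f,\mathbf{u})) . \tag{15.163}$$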
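The step from eq. (15.162) to eq. (15.163) can be spelled out as follows. The sketch uses the initial behaviour $a_{\pm\mathbf{k}}(x)\to e^{\mp ik\cdot x}$ of the mode functions, the decomposition (15.155) and the boundary conditions (15.156). The abbreviation $w(\mathbf{u})$ is introduced here only to shorten the formulas.

```latex
% Shorthand (assumption of this sketch): w(u) \equiv z(u)\,\mathcal{O}'(\phi_2(t_f,u)).
\begin{align*}
\phi_1(t_i,\mathbf{y})
 &= \int\frac{d^3\mathbf{k}}{(2\pi)^3 2E_{\mathbf{k}}}\int d^3\mathbf{u}\,
    \Big\{e^{+ik\cdot y}\,a_{+\mathbf{k}}(t_f,\mathbf{u})
        - e^{-ik\cdot y}\,a_{-\mathbf{k}}(t_f,\mathbf{u})\Big\}\,w(\mathbf{u}) ,\\
\tilde\phi_1^{(-)}(\mathbf{k}) &= +\int d^3\mathbf{u}\; a_{+\mathbf{k}}(t_f,\mathbf{u})\,w(\mathbf{u}) ,
 \qquad
\tilde\phi_1^{(+)}(\mathbf{k}) = -\int d^3\mathbf{u}\; a_{-\mathbf{k}}(t_f,\mathbf{u})\,w(\mathbf{u}) ,\\
\tilde\phi_2^{(-)}(\mathbf{k}) &= +\tfrac12\,\tilde\phi_1^{(-)}(\mathbf{k})
 = \tfrac12\int d^3\mathbf{u}\; a_{+\mathbf{k}}(t_f,\mathbf{u})\,w(\mathbf{u}) ,
 \qquad
\tilde\phi_2^{(+)}(\mathbf{k}) = -\tfrac12\,\tilde\phi_1^{(+)}(\mathbf{k})
 = \tfrac12\int d^3\mathbf{u}\; a_{-\mathbf{k}}(t_f,\mathbf{u})\,w(\mathbf{u}) .
\end{align*}
```

Inserting these coefficients of $\phi_2$ back into the decomposition (15.155) reproduces eq. (15.163): the relative sign between the two terms flips with respect to $\phi_1$, and an overall factor $\tfrac12$ appears.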
Eq. (15.163) can be expressed in a more convenient way with eq. (15.51). In terms of the
operators $\mathbb{T}_{\pm\mathbf{k}}$, we may rewrite $\phi_2$ at the initial time as follows:
$$\phi_2(t_i,\mathbf{y}) = \frac12\int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\int d^3\mathbf{u}\; z(\mathbf{u})\,\mathcal{O}(\phi_2(t_f,\mathbf{u}))\times\Big[\overset{\leftarrow}{\mathbb{T}}_{+\mathbf{k}}\,e^{+ik\cdot y} + \overset{\leftarrow}{\mathbb{T}}_{-\mathbf{k}}\,e^{-ik\cdot y}\Big] , \tag{15.164}$$
where the arrows indicate on which side the $\mathbb{T}_{\pm\mathbf{k}}$ operators act. This expression gives
the initial condition for the second of eqs. (15.159), in the form of a linear superposition
of plane waves exp(±ik · y). The next step is to note that the field $\phi_2(x)$ that satisfies
this equation of motion, and has the initial condition $\phi_2(t_i,\mathbf{y})$ is formally given by
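$$\phi_2(x) = \exp\left\{\int d^3\mathbf{y}\,\Big[\phi_2(t_i,\mathbf{y})\,\frac{\delta}{\delta\Phi_{\rm ini}(t_i,\mathbf{y})} + \big(\partial_0\phi_2(t_i,\mathbf{y})\big)\,\frac{\delta}{\delta\big(\partial_0\Phi_{\rm ini}(t_i,\mathbf{y})\big)}\Big]\right\}\,\Phi(x) . \tag{15.165}$$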
This formula follows from the fact that the derivative with respect to the initial field is
the generator for shifts of the initial condition of Φ; its exponential is therefore the
corresponding translation operator. The same formula applies also to any function of
the field, e.g. O(φ2 ). Substituting φ2 (ti , y) by eq. (15.164) inside the exponential,
this leads to
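$$\begin{aligned}
\mathcal{O}\big(\phi_2(x)\big) &= \exp\left\{\frac12\int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\int d^3\mathbf{u}\; z(\mathbf{u})\,\mathcal{O}(\phi_2(t_f,\mathbf{u}))\times\Big[\overset{\leftarrow}{\mathbb{T}}_{+\mathbf{k}}\,\overset{\rightarrow}{\mathbb{T}}_{-\mathbf{k}} + \overset{\leftarrow}{\mathbb{T}}_{-\mathbf{k}}\,\overset{\rightarrow}{\mathbb{T}}_{+\mathbf{k}}\Big]\right\}\,\mathcal{O}\big(\Phi(x)\big)\Big|_{\Phi_{\rm ini}\equiv 0}\\
&= \exp\left\{\int d^3\mathbf{u}\; z(\mathbf{u})\,\mathcal{O}(\phi_2(t_f,\mathbf{u}))\,\otimes\right\}\,\mathcal{O}\big(\Phi(x)\big)\Big|_{\Phi_{\rm ini}\equiv 0} ,
\end{aligned}\tag{15.166}$$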
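The property underlying eqs. (15.165) and (15.166), namely that the exponential of a derivative acts as a translation, can be illustrated in the simplest setting of an ordinary function of one variable: $\exp(a\,d/dx)\,f(x) = f(x+a)$. The sketch below checks this for a polynomial, for which the exponential series terminates. It is a one-dimensional analogue only, not the functional operator of the text, and all names are hypothetical.

```python
from fractions import Fraction
from math import factorial

def derivative(coeffs):
    """Derivative of a polynomial given by its coefficients [c0, c1, c2, ...]."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    """Value of the polynomial at the point x."""
    return sum(c * x**k for k, c in enumerate(coeffs))

def exp_shift(coeffs, a, x):
    """Apply exp(a d/dx) to the polynomial and evaluate the result at x."""
    total = Fraction(0)
    current = list(coeffs)
    n = 0
    while current:  # the derivatives of a polynomial eventually vanish
        total += Fraction(a)**n / factorial(n) * evaluate(current, x)
        current = derivative(current)
        n += 1
    return total
```

For instance, with $f(x) = 1 - 2x + 3x^3$, $a = 1/2$ and $x = 2$, `exp_shift` returns the same value as evaluating $f$ at $x + a = 5/2$. In eq. (15.165), the role of $a\,d/dx$ is played by the functional derivatives with respect to the initial field and its time derivative, weighted by the desired shift of the initial condition.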
the first derivative of ln F, we see that it obeys the following recursive formula
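$$D[\mathbf{x}_1;z] = \exp\left\{\int d^3\mathbf{u}\; z(\mathbf{u})\,D[\mathbf{u};z]\,\otimes\right\}\,\mathcal{O}\big(\Phi(t_f,\mathbf{x}_1)\big)\Big|_{\Phi_{\rm ini}\equiv 0} . \tag{15.169}$$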
First of all, a comparison between eqs. (15.162) and (15.170) indicates that φ1 and
φ2 have the same order in the coupling constant g, since they are made of the same
building blocks (the only difference is the sign between the two terms of the integrand,
and an irrelevant overall factor $\tfrac12$).
However, a hierarchy between φ1 and φ2 arises dynamically when the classical
solutions of the field equation of motion (15.157) are unstable. Such instabilities are
fairly generic in several quantum field theories; in particular the scalar field theory
with a φ4 coupling that we are using as example is known to have a parametric
resonance. Since the mode functions a±k are linearized perturbations on top of the
classical field φ2 , an instability of the classical solution φ2 is equivalent to the fact
that some of the mode functions grow exponentially with time, as exp(µ(x0 − ti ))
(where µ is the Lyapunov exponent). Thus, since eq. (15.170) is bilinear in the mode
functions, we expect that
$$\phi_2(x) \underset{\rm lin}{\sim} e^{\mu(x^0+t_f-2t_i)} . \tag{15.171}$$
Estimating the magnitude of φ1 requires more care. Indeed, from eqs. (15.161),
antisymmetric combinations of the mode functions at equal times remain of order 1
even if individual mode functions grow exponentially with time. Thus, at the final
time, we have
$$\phi_1(t_f,\mathbf{x}) \sim 1 \qquad\text{and}\qquad \frac{\phi_2(t_f,\mathbf{x})}{\phi_1(t_f,\mathbf{x})} \sim e^{2\mu(t_f-t_i)} \gg 1 , \tag{15.172}$$
In order to estimate the ratio φ2 /φ1 at intermediate times, one may use the
following reasoning. The antisymmetric combination of mode functions that enters in
eq. (15.162) is the advanced propagator GA in the background φ2 . This advanced
propagator may also be expressed in terms of a different set of mode functions b±k
defined to be plane waves at the final time tf ,
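$$\begin{aligned}
\Big[\Box_x + m^2 - \mathcal{L}''_{\rm int}(\phi_2(x))\Big]\,b_{\pm\mathbf{k}}(x) &= 0 ,\\
\lim_{x^0\to t_f} b_{\pm\mathbf{k}}(x) &= e^{\mp ik\cdot x} .
\end{aligned}\tag{15.173}$$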
In the presence of instabilities, these backward evolving mode functions grow when x0
decreases away from tf , as exp(µ(tf − x0 )) (in this sketchy argument, the Lyapunov
exponent µ is assumed here to be the same for the forward and backward mode
functions). This implies
$$\phi_1(x) \sim e^{\mu(t_f-x^0)} , \tag{15.175}$$
and the following magnitude for the ratio $\phi_2/\phi_1$ at intermediate times
$$\frac{\phi_2(x)}{\phi_1(x)} \sim \frac{e^{\mu(x^0+t_f-2t_i)}}{e^{\mu(t_f-x^0)}} \sim e^{2\mu(x^0-t_i)} . \tag{15.176}$$
Thus, with instabilities and non-zero Lyapunov exponents, the quasi-classical approx-
imation is generically satisfied thanks to the exponential growth of perturbations over
the background.
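[diagram: a link joining A and B] $\equiv A\otimes B$,
in terms of which the functional equation obeyed by $D[\mathbf{x}_1;z]$ reads:
$$D[\mathbf{x}_1;z] = \exp\left(\int d^3\mathbf{u}\; z(\mathbf{u})\,D[\mathbf{u};z]\;[\text{link}]\right)\,[\text{node } 1]\,\Big|_{\Phi_{\rm ini}\equiv 0} .$$
[Eq. (15.180): diagrammatic expansion of $D[\mathbf{x}_1;z]$ as a sum of trees rooted at the node 1; the terms shown include the factors 1/2! and 1/3!.]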
These examples generalize to all orders in z: the functional D[x1 ; z] can be represented
as the sum of all the rooted trees (the root being the node carrying the fixed point x1 )
weighted by the corresponding symmetry factor 1/S(T ):
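$$\frac{\delta\ln F[z]}{\delta z(\mathbf{x}_1)} = D[\mathbf{x}_1;z] = \sum_{\text{rooted trees } T}\frac{1}{S(T)}\;\big[\text{tree } T \text{ with root } 1\big] . \tag{15.181}$$
$$\frac{\delta^n\ln F[z]}{\delta z(\mathbf{x}_1)\cdots\delta z(\mathbf{x}_n)} = C_{\{1\cdots n\}} = \sum_{\text{trees}}\;\big[\text{tree with the } n \text{ labeled nodes } 1, 2, \ldots, n\big] . \tag{15.182}$$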
The number of trees contributing to this sum is equal to $n^{n-2}$ (Cayley’s formula).
The equation (15.182) tells us that, at tree level in the quasi-classical regime, all the
n-point correlation functions are entirely determined by the functional dependence of
the solution of the classical field equation of motion with respect to its initial condition.
Moreover, this formula provides a way to construct explicitly the correlation functions
in terms of functional derivatives with respect to the initial field.
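The counting of trees quoted above (Cayley's formula, $n^{n-2}$ trees on $n$ labeled nodes) can be verified by brute force for small $n$. The sketch below enumerates all sets of $n-1$ links between $n$ labeled points and keeps those without a closed loop; the function name is hypothetical, and the enumeration is only practical for small $n$.

```python
from itertools import combinations

def count_labeled_trees(n):
    """Count the trees on n labeled nodes by brute-force enumeration."""
    nodes = range(n)
    all_edges = list(combinations(nodes, 2))
    count = 0
    for edges in combinations(all_edges, n - 1):
        parent = list(nodes)  # union-find forest, one component per node

        def find(i):
            # follow parent pointers to the representative of the component
            while parent[i] != i:
                parent[i] = parent[parent[i]]
                i = parent[i]
            return i

        acyclic = True
        for u, v in edges:
            ru, rv = find(u), find(v)
            if ru == rv:  # this link would close a loop
                acyclic = False
                break
            parent[ru] = rv
        if acyclic:  # n-1 links without loops on n nodes form a tree
            count += 1
    return count
```

For $n = 3$ this gives the 3 trees that contribute to the 3-point correlation, and for $n = 4$ it gives 16, in agreement with $n^{n-2}$.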
In the quasi-classical approximation, the final state correlations are entirely due to
quantum fluctuations in the initial state, that are encoded in the function G022 (x, y). If
the initial state is the vacuum, it reads
$$G^0_{22}(x,y) = \int\frac{d^3\mathbf{k}}{(2\pi)^3\,2E_{\mathbf{k}}}\; e^{ik\cdot(x-y)} . \tag{15.183}$$
The support of this function is dominated by distances |x − y| smaller than the
Compton wavelength $m^{-1}$. Thus, in the tree representation of eq. (15.182), a link
between the points $x_i$ and $x_j$ is nonzero provided that the past light-cones of the vertices
$x_i$ and $x_j$ overlap at the initial time (or at least approach each other within distances
$\lesssim m^{-1}$), as illustrated in figure 15.10. A more thorough analysis would indicate
that eq. (15.182) is exact at tree level for the 1-point and 2-point correlations, but is
incomplete (even at tree level) beyond 2 points. The corrections to this formula are
nevertheless suppressed if the condition $\phi_1 \ll \phi_2$ holds.
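Figure 15.10: Causal structure of the 3-point correlation function in the quasi-classical regime. [Diagram: the points 1, 2, 3 at the final time $t_f$, with their past light-cones extending down to the initial time $t_i$.]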