Introduction To QCD: 1 2 3 4 5 6 PYTHIA Homepage 7
Introduction To QCD: 1 2 3 4 5 6 PYTHIA Homepage 7
P. Z. Skands
Abstract
These lectures were originally given at TASI 2012 and are directed at a level suitable
for graduate students in High Energy Physics. They are intended to give an introduction to
the theory and phenomenology of quantum chromodynamics (QCD), focusing on collider
physics applications. The aim is to bring the reader to a level where informed decisions can
be made concerning different approaches and their uncertainties. The material is divided
into five main areas: 1) fundamentals, 2) fixed-order perturbative QCD, 3) Monte Carlo
event generators and parton showers, 4) Matching at Leading and Next-to-Leading Order,
and 5) Soft QCD physics.
Lectures presented at TASI 2012: Theoretical Advanced Study Institute in Elementary Particle Physics
Searching for New Physics at Small and Large Scales. University of Colorado, Boulder, CO, June 4 29, 2012.
Based on lectures first given at ESHEP 2010 (Raseborg, Finland) and updated for AEPSHEP 2014 (Puri, India).
[email protected]
1
P. Skands Introduction to QCD
Contents
1 Introduction 3
1.1 A First Hint of Colour . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 The Lagrangian of QCD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.3 Colour Factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 The Strong Coupling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.5 Colour States . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2 Hard Processes 17
2.1 Factorization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.2 Parton Densities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.3 Fixed-Order QCD . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
2.4 The Subtraction Idea . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.5 Infrared Safety . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
References 65
Index 77
2
P. Skands Introduction to QCD
1 Introduction
When probed at very short wavelengths, QCD is essentially a theory of free partons quarks
and gluons which only scatter off one another through relatively small quantum corrections,
that can be systematically calculated. At longer wavelengths, of order the size of the proton
1fm = 1015 m, however, we see strongly bound towers of hadron resonances emerge, with
string-like potentials building up if we try to separate their partonic constituents. Due to our
inability to perform analytic calculations in strongly coupled field theories, QCD is therefore
still only partially solved. Nonetheless, all its features, across all distance scales, are believed
to be encoded in a single one-line formula of alluring simplicity; the Lagrangian of QCD.
The consequence for collider physics is that some parts of QCD can be calculated in terms
of the fundamental parameters of the Lagrangian, whereas others must be expressed through
models or functions whose effective parameters are not a priori calculable but which can be
constrained by fits to data. However, even in the absence of a perturbative expansion, there
are still several strong theorems which hold, and which can be used to give relations between
seemingly different processes. (This is, e.g., the reason it makes sense to constrain parton
distribution functions in ep collisions and then re-use the same ones for pp collisions.) Thus,
in the chapters dealing with phenomenological models we shall emphasize that the loss of a
factorized perturbative expansion is not equivalent to a total loss of predictivity.
An alternative approach would be to give up on calculating QCD and use leptons instead.
Formally, this amounts to summing inclusively over strong-interaction phenomena, when such
are present. While such a strategy might succeed in replacing what we do know about QCD
by unity, however, even the most adamant chromophobe must acknowledge a few basic
facts of collider physics for the next decade(s): 1) At the LHC, the initial states are hadrons,
and hence, at the very least, well-understood and precise parton distribution functions (PDFs)
will be required; 2) high precision will mandate calculations to higher orders in perturbation
theory, which in turn will involve more QCD; 3) the requirement of lepton isolation makes
the very definition of a lepton depend implicitly on QCD, and 4) the rate of jets that are
misreconstructed as leptons in the experiment depends explicitly on it. Finally, 5) though many
new-physics signals do give observable signals in the lepton sector, this is far from guaranteed,
nor is it exclusive when it occurs. It would therefore be unwise not to attempt to solve QCD
to the best of our ability, the better to prepare ourselves for both the largest possible discovery
reach and the highest attainable subsequent precision.
Moreover, QCD is the richest gauge theory we have so far encountered. Its emergent
phenomena, unitarity properties, colour structure, non-perturbative dynamics, quantum vs.
classical limits, interplay between scale-invariant and scale-dependent properties, and its wide
range of phenomenological applications, are still very much topics of active investigation,
about which we continue to learn.
In addition, or perhaps as a consequence, the field of QCD is currently experiencing some-
thing of a revolution. On the perturbative side, new methods to compute scattering am-
plitudes with very high particle multiplicities are being developed, together with advanced
techniques for combining such amplitudes with all-orders resummation frameworks. On the
non-perturbative side, the wealth of data on soft-physics processes from the LHC is forcing us
to reconsider the reliability of the standard fragmentation models, and heavy-ion collisions are
providing new insights into the collective behavior of hadronic matter. The study of cosmic
3
P. Skands Introduction to QCD
[...] It is concluded that the apparently anomalous features of the scattering can be
interpreted to be an indication of a resonant meson-nucleon interaction corresponding to
a nucleon isobar with spin 32 , isotopic spin 23 , and with an excitation energy of 277 MeV.
Figure 1: The title and part of the abstract of the 1951 paper [8] (published in 1952) in which
the existence of the ++ baryon was deduced, based on data from Sachs and Steinberger at
Columbia [9] and from Anderson, Fermi, Nagle, et al. at Chicago [10]. Further studies at
Chicago were quickly performed in [11, 12]. See also the memoir by Nagle [13].
rays impinging on the Earths atmosphere challenges our ability to extrapolate fragmentation
models from collider energy scales to the region of ultra-high energy cosmic rays. And finally,
dark-matter annihilation processes in space may produce hadrons, whose spectra are sensitive
to the modeling of fragmentation.
In the following, we shall focus on QCD for mainstream collider physics. This includes the
basics of SU(3), colour factors, the running of s , factorization, hard processes, infrared safety,
parton showers and matching, event generators, hadronization, and the so-called underlying
event. While not covering everything, hopefully these topics can also serve at least as stepping
stones to more specialized issues that have been left out, such as twistor-inspired techniques,
heavy flavours, polarization, or forward physics, or to topics more tangential to other fields,
such as axions, lattice QCD, or heavy-ion physics.
4
P. Skands Introduction to QCD
(adjoint) colour index a (i.e., a [1, . . . , 8]), and D is the covariant derivative in QCD,
with gs the strong coupling (related to s by gs2 = 4s ; we return to the strong coupling
in more detail below), Aa the gluon field with colour index a, and taij proportional to the
2
A complex N N matrix has 2N 2 degrees of freedom, on which unitarity provides N 2 constraints.
3
The dimension of the adjoint, or vector, representation is equal to the number of generators, N 2 1 = 8 for
SU(3), while the dimension of the fundamental representation is the degree of the group, N = 3 for SU(3).
5
P. Skands Introduction to QCD
A1
1
0 1
0 0
0 1 0 1 0 1
0 0 i 0 0 0 0 0 0 3
5 6 7
=@ 0 0 0 A, = @ 0 0 1 A, = @ 0 0 i A , 8 = @ 0 13 0 A ,
B C
2
i 0 0 0 1 0 0 i 0 0 0 3
. (5)
These generators are just the SU(3) analogs of the Pauli matrices in SU(2). By convention, the
constant of proportionality is normally taken to be
1
taij = aij . (6)
2
This choice in turn determines the normalization of the coupling gs , via equation (4), and
fixes the values of the SU(3) Casimirs and structure constants, to which we return below.
An example of the colour flow for a quark-gluon interaction in colour space is given in
figure 2. Normally, of course, we sum over all the colour indices, so this example merely gives
a pictorial representation of what one particular (non-zero) term in the colour sum looks like.
6
P. Skands Introduction to QCD
Figure 3: Illustration of the three crossings of the interaction of a lepton current (black) with a
quark current (red) via an intermediate photon or Z boson, with corresponding colour factors.
A very simple example of a colour factor is given by the decay process Z q q. This
vertex contains a simple ij in colour space; the outgoing quark and antiquark must have
identical (anti-)colours. Squaring the corresponding matrix element and summing over final-
state colours yields a colour factor of
X
e+ e Z q q :
|M |2 ij ji = Tr{} = NC = 3 , (7)
colours
where the colour factor now expresses a suppression which can be interpreted as due to the fact
that only quarks of matching colours are able to collide and produce a Z boson. The chance
that a quark and an antiquark picked at random from the colliding hadrons have matching
colours is 1/NC .
Similarly, `q `q via t-channel photon exchange (usually called Deep Inelastic Scattering
DIS with deep referring to a large virtuality of the exchanged photon), constitutes yet
another crossing of the same basic process, see figure 3. The colour factor in this case comes
out as unity.
To illustrate what happens when we insert (and sum over) quark-gluon vertices, such as the
one depicted in figure 2, we take the process Z 3 jets. The colour factor for this process can
be computed as follows, with the accompanying illustration showing a corresponding diagram
7
P. Skands Introduction to QCD
a b
Tr{ta tb } = TR ab a, b [1, . . . , 8]
P a a a [1, . . . , 8] i j k
a tij tjk = CF ik
i, j, k [1, . . . , 3] a
P acd f bcd
a b
c,d f = CA ab a, b, c, d [1, . . . , 8]
j i
taij tak` = TR jk i` 1
NC ij k` i, j, k, ` [1, . . . , 3] 1
NC (Fierz)
k
Table 1: Trace relations for t matrices (convention-independent). More relations can be found
in [2, Section 1.2] and in [1, Appendix A.3].
Z qg q : qk qk
X
|M |2 ij tajk (ta`k i` ) qj tajk tak q
colours a
= Tr{ta ta } ij a
gjk gk i (9)
1 qi qi
= Tr{} = 4 ,
2
where the last Tr{} = 8, since the trace runs over the 8-dimensional adjoint indices.
The tedious task of taking traces over t matrices can be greatly alleviated by use of the
relations given in Table 1. In the standard normalization convention for the SU(3) generators,
equation (6), the Casimirs of SU(3) appearing in Table 1 are5
1 4
TR = CF = CA = NC = 3 . (10)
2 3
In addition, the gluon self-coupling on the third line in Table 1 involves factors of f abc . These
are called the structure constants of QCD and they enter via the non-Abelian term in the gluon
field strength tensor appearing in equation (3),
a
F = Aa Aa + gs f abc Ab Ac . (11)
5
See, e.g., [1, Appendix A.3] for how to obtain the Casimirs in other normalization conventions.
8
P. Skands Introduction to QCD
A4 (k2 )
gs f 246 [(k3 k2 ) g
+(k2 k1 ) g
+(k1 k3 ) g ]
A6 (k1 ) A2 (k3 )
Figure 4: Illustration of a ggg vertex in QCD, before summing/averaging over colours: interac-
tion between gluons in the states 2 , 4 , and 6 is represented by the structure constant f 246 .
9
P. Skands Introduction to QCD
(s ) = s2 (b0 + b1 s + b2 s2 + . . .) , (19)
6
Strictly speaking, conformal symmetry is more restrictive than just scale invariance, but examples of scale-
invariant field theories that are not conformal are rare.
7
Originally, the observed approximate agreement with this was used as a powerful argument for pointlike
substructure in hadrons; since measurements at different energies are sensitive to different resolution scales,
independence of the absolute energy scale is indicative of the absence of other fundamental scales in the problem
and hence of pointlike constituents.
10
P. Skands Introduction to QCD
with relations including the O(s2 ) terms available, e.g., in [2]. Relations between scales
not involving MZ2 can obviously be obtained by just replacing MZ2 by some other scale Q02
everywhere in equation (22). A comparison of running at one- and two-loop order, in both
cases starting from s (MZ ) = 0.12, is given in figure 5. As is evident from the figure, the
2-loop running is somewhat faster than the 1-loop one.
As an application, let us prove that the logarithmic running of the coupling implies that
an intrinsically multi-scale problem can be converted to a single-scale one, up to corrections
suppressed by two powers of s , by taking the geometric mean of the scales involved. This
follows from expanding an arbitrary product of individual s factors around an arbitrary scale
, using equation (22),
n
Y 2
2
s (1 )s (2 ) s (n ) = s () 1 + b0 s ln + O(s )
i=1
2i
2n
= sn () 1 + b0 s ln + O(s2 ) , (23)
1 22 2n
2
whereby the specific single-scale choice n = 1 2 n (the geometric mean) can be seen to
push the difference between the two sides of the equation one order higher than would be the
case for any other combination of scales8 .
The appearance of the number of flavours, nf , in b0 implies that the slope of the running
depends on the number of contributing flavours. Since full QCD is best approximated by
8
In a fixed-order calculation, the individual scales i , would correspond, e.g., to the n hardest scales appearing
in an infrared safe sequential clustering algorithm applied to the given momentum configuration.
11
P. Skands Introduction to QCD
0.6
s (Q)
s (Q)
0.5
2-loop
1-loop
0.4
0.3
0.2
VINCIARO OT
0.1
0
0 0.5 1 1.5 2
log (Q/GeV)
10
Figure 5: Illustration of the running of s at 1- (open circles) and 2-loop order (filled circles),
starting from the same value of s (MZ ) = 0.12.
nf = 3 below the charm threshold, by nf = 4 and 5 from there to the b and t thresholds,
respectively, and then by nf = 6 at scales higher than mt , it is therefore important to be aware
that the running changes slope across quark flavour thresholds. Likewise, it would change
across the threshold for any coloured new-physics particles that might exist, with a magnitude
depending on the particles colour and spin quantum numbers.
The negative overall sign of equation (19), combined with the fact that b0 > 0 (for nf
16), leads to the famous result9 that the QCD coupling effectively decreases with energy, called
asymptotic freedom, for the discovery of which the Nobel prize in physics was awarded to
D. Gross, H. Politzer, and F. Wilczek in 2004. An extract of the prize announcement runs as
follows:
What this years Laureates discovered was something that, at first sight, seemed
completely contradictory. The interpretation of their mathematical result was
that the closer the quarks are to each other, the weaker is the colour charge.
When the quarks are really close to each other, the force is so weak that they
behave almost as free particlesa . This phenomenon is called asymptotic free-
dom. The converse is true when the quarks move apart: the force becomes
stronger when the distance increasesb .
a
More correctly, it is the coupling rather than the force which becomes weak as the distance
decreases. The 1/r2 Coulomb singularity of the force is only dampened, not removed, by the
diminishing coupling.
b
More correctly, it is the potential which grows, linearly, while the force becomes constant.
9
Perhaps the highest pinnacle of fame for equation (19) was reached when the sign of it featured in an episode
of the TV series Big Bang Theory.
12
P. Skands Introduction to QCD
0.5
April 2012
0.1
QCD s ( Z) = 0.1184 0.0007
1 10 100
Q [GeV]
Figure 6: Illustration of the running of s in a theoretical calculation (band) and in physical
processes at different characteristic scales, from [23, 25]. The little kinks at Q = mc and
Q = mb are caused by discontinuities in the running across the flavour thresholds.
Among the consequences of asymptotic freedom is that perturbation theory becomes better
behaved at higher absolute energies, due to the effectively decreasing coupling. Perturbative
calculations for our 900-GeV Z 0 boson from before should therefore be slightly faster con-
verging than equivalent calculations for the 90-GeV one. Furthermore, since the running of
s explicitly breaks Bjrken scaling, we also expect to see small changes in jet shapes and in
jet production ratios as we vary the energy. For instance, since high-p jets start out with
a smaller effective coupling, their intrinsic shape (irrespective of boost effects) is somewhat
narrower than for low-p jets, an issue which can be important for jet calibration. Our current
understanding of the running of the QCD coupling is summarized by the plot in figure 6, taken
from a recent comprehensive review by S. Bethke [23, 25].
As a final remark on asymptotic freedom, note that the decreasing value of the strong
coupling with energy must eventually cause it to become comparable to the electromagnetic
and weak ones, at some energy scale. Beyond that point, which may lie at energies of order
1015 1017 GeV (though it may be lower if as yet undiscovered particles generate large correc-
tions to the running), we do not know what the further evolution of the combined theory will
actually look like, or whether it will continue to exhibit asymptotic freedom.
Now consider what happens when we run the coupling in the other direction, towards
smaller energies. Taken at face value, the numerical value of the coupling diverges rapidly at
scales below 1 GeV, as illustrated by the curves disappearing off the left-hand edge of the plot
in figure 6. To make this divergence explicit, one can rewrite equation (22) in the following
form,
1
s (Q2 ) = 2 , (24)
b0 ln Q
2
13
P. Skands Introduction to QCD
where
200 MeV (25)
specifies the energy scale at which the perturbative coupling would nominally become infinite,
called the Landau pole. (Note, however, that this only parametrizes the purely perturbative
result, which is not reliable at strong coupling, so equation (24) should not be taken to imply
that the physical behavior of full QCD should exhibit a divergence for Q .)
Finally, one should be aware that there is a multitude of different ways of defining both
and s (MZ ). At the very least, the numerical value one obtains depends both on the renor-
malization scheme used (with the dimensional-regularization-based modified minimal sub-
traction scheme, MS, being the most common one) and on the perturbative order of the
calculations used to extract them. As a rule of thumb, fits to experimental data typically yield
smaller values for s (MZ ) the higher the order of the calculation used to extract it (see, e.g.,
[26, 27, 25, 23]), with s (MZ )|LO > s (MZ )|NLO >
s (MZ )|NNLO . Further, since the number
of flavours changes the slope of the running, the location of the Landau pole for fixed s (MZ )
depends explicitly on the number of flavours used in the running. Thus each value of nf is
associated with its own value of , with the following matching relations across thresholds
guaranteeing continuity of the coupling at one loop,
2 2
5 21 mt 23
nf = 5 6 : 6 = 5 5 = 6 , (26)
mt 6
2 2
4 23 mb 25
nf = 4 5 : 5 = 4 4 = 5 , (27)
mb 5
2 2
3 25 mc 27
nf = 3 4 : 4 = 3 3 = 4 . (28)
mc 4
It is sometimes stated that QCD only has a single free parameter, the strong coupling.
However, even in the perturbative region, the beta function depends explicitly on the number
of quark flavours, as we have seen, and thereby also on the quark masses. Furthermore, in
the non-perturbative region around or below QCD , the value of the perturbative coupling, as
obtained, e.g., from equation (24), gives little or no insight into the behavior of the full theory.
Instead, universal functions (such as parton densities, form factors, fragmentation functions,
etc), effective theories (such as the Operator Product Expansion, Chiral Perturbation Theory,
or Heavy Quark Effective Theory), or phenomenological models (such as Regge Theory or the
String and Cluster Hadronization Models) must be used, which in turn depend on additional
non-perturbative parameters whose relation to, e.g., s (MZ ), is not a priori known.
For some of these questions, such as hadron masses, lattice QCD can furnish important
additional insight, but for multi-scale and/or time-evolution problems, the applicability of
lattice methods is still severely restricted; the lattice formulation of QCD requires a Wick
rotation to Euclidean space. The time-coordinate can then be treated on an equal footing with
the other dimensions, but intrinsically Minkowskian problems, such as the time evolution of a
system, are inaccessible. The limited size of current lattices also severely constrain the scale
hierarchies that it is possible to fit between the lattice spacing and the lattice size.
14
P. Skands Introduction to QCD
The members of the sextet transform into (linear combinations of) each other under SU(3)
transformations, and similarly for the members of the antitriplet, hence neither of these can
be reduced further. The breakdown into irreducible SU(3) multiplets is therefore
33=63. (31)
Thus, an uncorrelated pair of quarks has a 1/3 chance to add to an overall anti-triplet state
(corresponding to coherent superpositions like red + green = antiblue10 ); otherwise it is in
an overall sextet state.
Note that the emphasis on the quark-(anti)quark pair being uncorrelated is important;
production processes that correlate the produced partons, like Z q q or g q q, will project
10
In the context of hadronization models, this coherent superposition of two quarks in an overall antitriplet state
is sometimes called a diquark (at low mqq ) or a string junction (at high mqq ), see section 5.1; it corresponds
to the antisymmatric red + green = antiblue combination needed to create a baryon wavefunction.
15
P. Skands Introduction to QCD
out specific components (here the singlet and octet, respectively). Note also that, if the quark
and (anti)quark are on opposite sides of the universe (i.e., living in two different hadrons),
the QCD dynamics will not care what overall colour state they are in, so for the formation of
multi-partonic states in QCD, obviously the spatial part of the wave functions (causality at the
very least) will also play a role. Here, we are considering only the colour part of the wave
functions. Some additional examples are
8 8 = 27 10 10 8 8 1 , (32)
3 8 = 15 6 3 , (33)
3 6 = 10 8 , (34)
3 3 3 = (6 3) 3 = 10 8 8 1 . (35)
Physically, the 27 in the first line corresponds to a completely incoherent addition of the colour
charges of two gluons; the decuplets are slightly more coherent (with a lower total colour
charge), the octets yet more, and the singlet corresponds to the combination of two gluons that
have precisely equal and opposite colour charges, so that their total charge is zero. Further
extensions and generalizations of these combination rules can be obtained, e.g., using the
method of Young tableaux [28, 29].
16
P. Skands Introduction to QCD
p1 p3
High
transverse-
momentum
interac2on
p2 p4
Figure 7: Left: Rutherford scattering of quarks in QCD, exemplifying the type of process that
dominates the short-distance interaction cross section at hadron colliders. Right: an example
of what such a reaction looks like in a detector, in this case the ATLAS experiment.
2 Hard Processes
Our main tool for solving QCD at high energy scales, Q QCD , is perturbative quantum
field theory, the starting point for which is Matrix Elements (MEs) which can be calculated
systematically at fixed orders (FO) in the strong coupling s . At least at lowest order (LO), the
procedure is standard textbook material [1] and it has also by now been highly automated, by
the advent of tools like M ADGRAPH [30], C ALCHEP [31] / C OMPHEP [32], and several others
[33, 34, 35, 36, 37, 38, 39]. Here, we require only that the reader has a basic familiarity with
the methods involved from graduate-level particle physics courses based, e.g., on [1, 3]. Our
main concern are the uses to which these calculations are put, their limitations, and ways to
improve on the results obtained with them.
For illustration, take one of the most commonly occurring processes in hadron collisions:
Rutherford scattering of two quarks via a t-channel gluon exchange figure 7 which at
leading order has the differential cross section
d 4 s2 + u 2
qq 0 qq 0 : = 2 s2 , (36)
dt s 9 t2
with the 2 2 Mandelstam variables (hatted to emphasize that they refer to a partonic
2 2 scattering rather than the full pp jets process)
s = (p1 + p2 )2 , (37)
(1 cos )
t = (p3 p1 )2 =
s , (38)
2
(1 + cos )
= (p4 p1 )2 =
u s . (39)
2
Reality, however, is more complicated; the picture on the right-hand pane of figure 7 shows
a real dijet event, as recorded by the ATLAS experiment. The complications to be addressed
when going from left to right in figure 7 are: firstly, additional jets, a.k.a. real-emission cor-
rections, which can significantly change the topology of the final state, potentially shifting jets
17
P. Skands
Parton Distribution Functions Introduction to QCD
u
p d
g
u
2.1 Factorization
In high-energy scattering problems involving hadrons in the initial state, we immediately face
the complication that hadrons are composite, with a time-dependent structure illustrated in
figure 8; there are partons within clouds of further partons, constantly being emitted and ab-
sorbed. Thus, before we can use perturbatively calculated partonic scattering matrix elements,
we must first address the partonic structure of the colliding hadron(s).
For the hadron to remain intact, the fluctuations inside it must involve momentum transfers
smaller than the confinement scale. Indeed, high-virtuality fluctuations are suppressed by
powers of
s 2
, (40)
|k|2
with the confinement scale ( 200 MeV, see section 1.4) and |k| the virtuality of the fluctu-
ation. Thus, most fluctuations occur over timescales 1/.
A hard perturbative probe, on the other hand, such as the exchanged photon in DIS (fig-
ure 3), interacts over a much shorter timescale 1/Q 1/, during which the partonic fluctu-
ations in the struck hadron appear almost frozen. The hard probe effectively takes an instan-
taneous snapshot of the hadron structure, at a characteristic resolution given by 1/Q.
This is formalized by the factorization theorem [41] (see also the TASI lectures by George
Sterman [42]), which expresses the independence of long-wavelength (soft) structure on the
nature of the hard (short-distance) process. Originally formulated for DIS, factorization allows
us to write the cross section for lepton-hadron scattering as a convolution of a non-perturbative
but universal (i.e., process-independent) parton density function (PDF) and a perturbatively
calculable partonic scattering cross section. Denoting the fraction of the hadron momentum
18
P. Skands Introduction to QCD
carried by parton i by xi ,
p~i = xi p~h , (41)
we may write the lepton-hadron cross section on factorized form (see, e.g., [43, 3]),
XZ 1 Z
`if (xi , f , 2F )
d
2
`h = dxi df fi/h (xi , F ) , (42)
0 dxi df
i
with i an index running over all possible parton types11 in the incoming hadron and f enu-
merating all possible (partonic) final states, with Lorentz-invariant phase space, f .
The parton density functions (PDFs), fi/h , parametrize the distribution of partons inside the
target hadron. They are not a priori calculable and must be constrained by fits to data. This is
discussed in section 2.2.
The partonic cross section, d , knows nothing of the target hadron apart from the fact that
it contained the struck parton. It is calculable within perturbation theory, as will be discussed
in section 2.3.
The dividing line between the two is drawn at an arbitrary (user-defined) scale F , called
the factorization scale. There is some arbitrariness involved in choosing a value for F . Some
heuristic arguments to guide in the choice of factorization scale are the following. On the
long-distance side, the PDFs include a (re)summation of fluctuations inside fluctuations up
to virtualities of order F . It would therefore not make much sense to take F significantly
larger than the scales characterizing resolved particles on the short-distance side of the calcu-
lation (i.e., the particles appearing explicitly in f ); otherwise the PDFs would be including
sums over fluctuations that happen on timescales shorter than those probed by the physical
process. Similarly, F should also not be taken much lower than the scale(s) appearing in the
hard process. For matrix elements characterized by a single well-defined scale, such as the
Q2 scale in DIS or the invariant-mass scale s in Drell-Yan production (q q Z/ `+ ` ),
such arguments essentially fix the preferred scale choice to F = Q or F = s, respectively,
which may then be varied by a factor of 2 (or larger) around the nominal value in order to
estimate uncertainties. For multi-scale problems, however, such as pp Z/W + n jets, there
are several a priori equally good choices available, from the lowest to the highest QCD scales
that can be constructed from the final-state momenta, usually with several dissenting groups
of theorists arguing over which particular choice is best. Suggesting that one might simply
measure the scale would not really be an improvement, as the factorization scale is fundamen-
tally unphysical and therefore unobservable (similarly to gauge or convention choices). One
plausible strategy is to look at higher-order (NLO or NNLO) calculations, in which correction
terms appear that cancel the dependence on the scale choice, stabilizing the final result. From
such comparisons, a most stable initial scale choice can in principle be determined, which
then furnishes a reasonable starting point, but we emphasize that the question is intrinsically
ambiguous, and no golden recipe is likely to magically give all the right answers. The best we
can do is to vary the value of F not only by an overall factor, but also by exploring different
possible forms for its functional dependence on the momenta appearing in f . A complemen-
tary useful discussion of the pros and cons of different factorization scale choices can be found
in the TASI lectures by Tilman Plehn [44].
11
Typically, only quarks and gluons are included in this sum, but also photons and even leptons can in principle
be included. Similarly, parton density functions are normally used to describe hadrons, but can also be defined,
e.g., to describe the cloud of virtual photons (and fermion pairs) surrounding an electron.
19
P. Skands Introduction to QCD
Secondly, and more technically, at NLO and beyond one also has to settle on a factorization
scheme in which to do the calculations. For all practical purposes, students focusing on LHC
physics are only likely to encounter one such scheme, the modified minimal subtraction (MS)
one already mentioned in the discussion of the definition of the strong coupling in Section 1.4.
At the level of these lectures, we shall therefore not elaborate further on this choice here.
We note that factorization can also be applied multiple times, to break up a complicated
calculation into simpler pieces that can be treated as approximately independent. This will be
very useful when dealing with successive emissions in a parton shower, section 3.2, or when
factoring off decays of long-lived particles from a hard production process, section 3.4.
We round off the discussion of factorization by mentioning a few caveats the reader should
be aware of. (See [42] for a more technical treatment.)
Firstly, the proof only applies to the first term in an operator product expansion in twist
= mass dimension - spin. Since operators with higher mass dimensions are suppressed by the
hard scale to some power, this leading twist approximation becomes exact in the limit Q ,
while at finite Q it neglects corrections of order
[ln(Q2 /2 )]m<2n
Higher Twist : (n = 2 for DIS) . (43)
Q2n
In section 5, we shall discuss some corrections that go beyond this approximation, in the
context of multiple parton-parton interactions.
Secondly, the proof only really applies to inclusive cross sections in DIS [41] and in Drell-
Yan [45]. For all other hadron-initiated processes, factorization is an ansatz. For a general
hadron-hadron process, we write the assumed factorizable cross section as:
XZ 1 Z 1 XZ dijf
dh1 h2 = dxi dxj df fi/h1 (xi , 2F ) fj/h2 (xj , 2F ) . (44)
0 0 dxi dxj df
i,j f
Note that, if d
is divergent (as, e.g., Rutherford scattering is) then the integral over df
must be regulated, e.g. by imposing some explicit minimal transverse-momentum cut and/or
other phase-space restrictions.
20
P. Skands Introduction to QCD
2 8
xf(x,Q2)
xf(x,Q2)
1.8 7
1.6
6
1.4
5
1.2
1 4
0.8 3
0.6
2
0.4
1
0.2
0 -4 0 -4 -3 -2 -1
-3 -2 -1
10 10 10 10 10 10 10 10 x
x
Figure 9: Illustration of the change of the u (black), u (red, dashed), c (green, dotted), and
g (blue, dot-dashed) distributions, from Q = F = 2 GeV (left) to Q = F = 100 GeV
(right). Note that a factor 0.1 has been applied to the gluon distribution. Plots made using the
HEPDATA online tool [54].
In the context of PDF fits (constraining the form of the PDF functions by fitting cross sections
to experimental data, e.g., from DIS [49, 50], Drell-Yan [51, 52], and pp jets [53]), the
reference scale 0 is usually taken to be relatively low, of order one or a few GeV.
The behavior of the PDFs as we evolve F from a low scale, 2 GeV, to a high one, 100 GeV,
is illustrated in figure 9, for the MSTW14 2008 LO15 PDF set [55]. At low Q = F = 2 GeV
(left), the proton structure is dominated by a few hard quarks (a valence bump is clearly
visible around x 0.2), while at higher scales Q = 100 GeV (right) we predominantly resolve
fluctuations within fluctuations, yielding increasingly large gluon- and sea-quark distributions
with rather small x values, while the valence quarks play a progressively smaller role.
We note that different collaborations, like CTEQ, MSTW, NNPDF, etc., use different ans atze
for the form of f (x, 20 ). They may also include different data in the fits, and/or treat or weight
the data differently. Thus, results from different groups may not always be mutually compati-
ble. An example is given in figure 10, which shows the difference between the CTEQ6L1 gluon
PDF [56] (red dashed) and the MSTW 2008 LO one [55], normalized to MSTW (which would
thus be a flat line at zero), at F = 10 GeV. The y axis shows the relative difference between
the sets, in per cent. Also shown are the 90% CL contours computed from the uncertainty vari-
ations included in the MSTW 2008 LO set (black). Using only the MSTW uncertainty band,
one would arrive at an estimated 5% uncertainty over most of the x range, while including
the CTEQ6L1 set would increase that to 10%. At NLO, this discrepancy is reduced, but not
removed. A significant effort is currently being undertaken within the PDF community to agree
on common, and more comprehensive, ways of defining PDF uncertainty bands [53, 57]. This
14
MSTW: Martin-Stirling-Thorne-Watt.
15
The LO means that the fit was performed using LO matrix elements in the cross section formulae.
21
P. Skands Introduction to QCD
Figure 10: Illustration of the difference between the MSTW 2008 and CTEQ6 LO gluon PDFs
at F = 10 GeV. All curves are normalized to the central MSTW 2008 prediction. The black
solid lines show the 90% CL MSTW variations, while the dashed red line shows the CTEQ6L1
distribution.
is complicated due to the different ways of defining f (x, 20 ) and due to the experimental data
sets not always being fully compatible with one another. For the time being, it is recommended
to try at least sets from two different groups, for a comprehensive uncertainty estimate.
Occasionally, the words structure functions and parton densities are used interchangeably.
However, there is an important distinction between the two, which we find often in (quantum)
physics: the former is a physical observable used to parametrize the DIS cross sections (see
e.g. [3]), while the latter is a fundamental quantity extracted from it. In particular, since the
parton densities are not, themselves, physically observable, they can only be defined within
a specific factorization scheme, order by order in perturbation theory. The only exception
is at leading order, at which they have the simple physical interpretation of parton number
densities. When going to higher orders, we tend to keep the simple intuitive picture from
LO in mind, but one should be aware that the fundamental relationship between PDFs and
measured quantities is now more complicated (due to the interplay between the PDFs and the
real and virtual corrections to the LO cross section), and that the parton densities no longer
have a clear probabilistic interpretation starting from NLO.
The reader should also be aware that there is some ambiguity whether NLO PDFs should
be used for LO calculations. In principle, the higher-order PDFs are better constrained and
the difference between, e.g., an NLO and an LO set should formally be beyond LO precision,
so that one might be tempted to simply use the highest-order available PDFs for any calcula-
tion. However, higher-order terms can sometimes be absorbed, at least partially, into effective
lower-order coefficients. In the context of PDFs, the fit parameters of lower-order PDFs will
attempt to compensate for missing higher-order contributions in the matrix elements. To the
extent those higher-order contributions are universal, this is both desirable and self-consistent.
This leads to some typical qualitative differences between LO and NLO PDFs, illustrated in
figure 11: NLO PDFs tend to be smaller at low x and slightly larger at high x, than LO ones.
Thus, it is quite possible that using an NLO PDF in conjunction with LO matrix elements can
give a worse agreement with data than LO PDFs do.
22
P. Skands Introduction to QCD
xf(x,Q2)
2
10
10
-1
10
-2
10 -4 -3 -2 -1
10 10 10 10 x
Figure 11: Illustration of the change between PDF fits using LO and NLO matrix elements:
the g distribution at LO (black) and NLO (red, dashed), and the u distribution at LO (green,
dotted) and NLO (blue, dot-dashed), for the MSTW 2008 PDF sets [55], at Q = F = 10 GeV.
Plots made using the HEPDATA online tool [54].
Finally, another oft-raised question concerns which PDF sets to use for the parton-shower
evolution in Monte Carlo generators. Importantly, the equations driving the initial-state show-
ers in Monte Carlo models are only sensitive to ratios of PDFs [58]. Since the shower evolution
typically only has leading-logarithmic (LL) precision, it should be theoretically consistent to
use any (LO or better) PDF set to drive the evolution. However, similarly to above, there will
be subleading differences between different choices, and one is justified in worrying about
the level of physical effects that could be generated. Unfortunately, there is currently no way
to ensure 100% self-consistency. Since PDF fits are not done with MC codes, but instead
use analytical resummation models (see, e.g., the TASI lectures by Sterman [42]), which are
not identical to their MC counterparts, the PDF fits are essentially tuned to a slightly dif-
ferent resummation than that incorporated in a given MC model. In practice, not much is
known about the size and impact of this ambiguity [59]. Known differences include: the size
of phase space (purely collinear massless PDF evolution vs. the finite-transverse-momentum
massive MC phase space), the treatment of momentum conservation and recoil effects, ad-
ditional higher-order effects explicitly or implicitly included in the MC evolution, choice of
renormalization scheme and scale, and, for those MC algorithms that do not rely on collinear
(DGLAP, see [3]) splitting kernels (e.g., the various kinds of dipole evolution algorithms, see
[60]), differences in the effective factorization scheme.
As a baseline, we recommend simply using whatever PDF set the given MC model was
originally tuned with, since this should de facto (by fitting the available data) reabsorb as
much of the inconsistency as possible. Furthermore, it should be emphasized that underlying-
event and minimum-bias models based on multi-parton interactions (see section 5.2) usually
make the explicit assumption that the PDFs can be interpreted as physical number densities
even down to very low Q and x, a property which is generally only true for LO PDFs. It must
23
P. Skands Introduction to QCD
where, for compactness, we have suppressed all PDF and luminosity normalization factors.
(`)
MF +k is the amplitude for producing F in association with k additional final-state partons,
legs, and with ` additional loops. The sums start at k = 0 and ` = 0, corresponding to
the Leading Order for producing F , while higher terms represent real and virtual corrections,
respectively.
The purpose of the function is to project out hypersurfaces of constant value of O in the
full dF +k phase space, with O(F +k ) a function that defines O evaluated on each specific
momentum configuration, F +k . (Without the function, the formula would give the total
integrated cross section, instead of the cross section differentially in O.)
We recover the various fixed-order truncations of perturbative QCD (pQCD) by limiting the
nested sums in equation (45) to include only specific values of k + `. Thus,
For k 1, we are not considering inclusive F production; instead, we are considering the
process F + k jets. If we simply integrate over all momenta, as implied by the integration over
dF +k in equation (45), we would be including configurations in which one or more of the
k partons are collinear or soft. Such configurations are infrared divergent in QCD and hence
must be regulated. Since we talk about collinear and soft divergences (the origins of which will
be discussed in more detail in sections 2.4 and 3.2), cuts on angles and energies and/or cuts
on combinations, like transverse momenta, can be used to cut away the problematic regions of
phase space.
Recall, however, that pQCD is approximately scale invariant. This implies that a regu-
larization cut on a dimensionful quantity, like energy or transverse momentum, should be
formulated as a ratio of scales, rather than as an absolute number. For example, a jet with
p = 50 GeV would be considered hard and well-separated if produced in association with
an ordinary Z boson (with hard scale MZ = 91.2 GeV), while the same jet would be consid-
ered soft if produced in association with a 900-GeV Z 0 boson (see [17, 18] for more explicit
examples).
24
P. Skands Introduction to QCD
F @ LO F + 2 @ LO
LO for F + 2
(2) (2) for F + 1
` (loops) 2 0 1 ... 2 (2)
0
(2)
1 ... for F + 0
` (loops)
(1) (1) (1)
1 0 1 2 ... Max Born, 1 (1)
0
(1)
1
(1)
2 ...
1882-1970
Nobel 1954
(0) (0) (0) (0)
0 0 1 2 3 ... 0 (0)
0
(0)
1
(0)
2 3
(0)
...
0 1 2 3 ... 0 1 2 3 ...
k (legs) k (legs)
Figure 12: Coefficients of the perturbative series covered by LO calculations. Left: F pro-
duction at lowest order. Right: F + 2 jets at LO, with the half-shaded box illustrating the
restriction to the region of phase space with exactly 2 resolved jets. The total power of s for
each coefficient is n = k + `. (Photo of Max Born from nobelprize.org).
The essence of the point is that, if the regularization scale is taken too low, logarithmic
enhancements of the type 2
n m2n QF
s ln (46)
Q2k
will generate progressively larger corrections, order by order, which will spoil any fixed-order
truncation of the perturbative series. Here, QF is the hard scale associated with the process
under consideration, while Qk is the scale associated with an additional parton, k.
A good rule of thumb is that if k+1 k (at whatever order you are calculating), then the
perturbative series is converging too slowly for a fixed-order truncation of it to be reliable. For
fixed-order perturbation theory to be applicable, you must place your cuts on the hard process
such that k+1 k . In the discussion of parton showers in Section 3.2, we shall see how the
region of applicability of perturbation theory can be extended.
The virtual amplitudes, for ` 1, are divergent for any point in phase space. However,
as encapsulated by the famous KLN theorem [61, 62], unitarity (which essentially expresses
probability conservation) puts a powerful constraint on the IR divergences16 , forcing them to
cancel exactly against those coming from the unresolved real emissions that we had to cut out
above, order by order, making the complete answer for fixed k + ` = n finite17 Nonetheless,
since this cancellation happens between contributions that formally live in different phase
spaces, a main aspect of loop-level higher-order calculations is how to arrange for this cancel-
lation in practice, either analytically or numerically, with many different methods currently on
the market. We shall discuss the idea behind subtraction approaches in section 2.4.
A convenient way of illustrating the terms of the perturbative series that a given matrix-
element-based calculation includes is given in figure 12. In the left-hand pane, the shaded
box corresponds to the lowest-order Born-level matrix element squared. This coefficient
is non-singular and hence can be integrated over all of phase space, which we illustrate by
letting the shaded area fill all of the relevant box. A different kind of leading-order calculation
16
The loop integrals also exhibit UV divergences, but these are dealt with by renormalization.
17
Formally, the KLN theorem states that the sum over degenerate quantum states is finite. In context of fixed-
order perturbation theory, this is exemplified by states with infinitely collinear and/or soft radiation being degen-
erate with the corresponding states with loop corrections; they cannot be told apart by any physical observable.
25
P. Skands Introduction to QCD
` (loops)
(1) (1) (1)
1 0 1 2 ... 1 (1)
0
(1)
1
(1)
2 ...
(0) (0) (0) (0)
0 0 1 2 3 ... 0 (0)
0
(0)
1
(0)
2 3
(0)
...
0 1 2 3 ... 0 1 2 3 ...
k (legs) k (legs)
Figure 13: Coefficients of the perturbative series covered by NLO calculations. Left: F produc-
tion at NLO. Right: F + 1 jet at NLO, with half-shaded boxes illustrating the restriction to the
region of phase space with exactly 1 resolved jet. The total power of s for each coefficient is
n = k + `.
is illustrated in the right-hand pane of figure 12, where the shaded box corresponds to the
lowest-order matrix element squared for F + 2 jets. This coefficient diverges in the part of
phase space where one or both of the jets are unresolved (i.e., soft or collinear), and hence
integrations can only cover the hard part of phase space, which we reflect by only shading the
upper half of the relevant box.
Figure 13 illustrates the inclusion of NLO virtual corrections. To prevent confusion, first a
(1)
point on notation: by 0 , we intend
Z
(1) (1) (0)
0 = d0 2Re[M0 M0 ] , (47)
which is of order s relative to the Born level. Compare, e.g., with the expansion of equa-
(1)
tion (45) to order k + ` = 1. In particular, 0 should not be confused with the integral over
the 1-loop matrix element squared (which would be of relative order s2 and hence forms part
(2)
of the NNLO coefficient 0 ). Returning to figure 13, the unitary cancellations between real
and virtual singularities imply that we can now extend the integration of the real correction in
the left-hand pane over all of phase space, while retaining a finite total cross section,
Z Z Z
NLO (0) 2 (0) 2 (1) (0)
0 = d0 |M0 | + d1 |M1 | + d0 2Re[M0 M0 ]
(48)
(0) (0) (1)
= 0 + 1 + 0 ,
(0)
with 0 the finite Born-level cross section, and the positive divergence caused by integrating
the second term over all of phase space is canceled by a negative one coming from the inte-
gration over loop momenta in the third term. One method for arranging the cancellation of
singularities subtraction is discussed in section 2.4.
However, if our starting point for the NLO calculation is a process which already has a
non-zero number of hard jets, we must continue to impose that at least that number of jets
26
P. Skands Introduction to QCD
F @ NNLO
NNLO for F + 0
(2) (2) NLO for F + 1
2 0 1 ... LO for F + 2
` (loops)
(1) (1) (1)
1 0 1 2 ...
Figure 14: Coefficients of the perturbative series covered by an NNLO calculation. The total
power of s for each coefficient is n = k + `. Green shading represents the full perturbative
coefficient at the respective k and `.
with the second and third terms having had their common (but opposite-sign) singularities
canceled out and some explicitly finite quantities remaining.
27
P. Skands Introduction to QCD
The first step towards this goal is to classify all IR singularities that could appear in the
amplitudes. We know that the IR limits are universal, so they can be classified using a set of
process-independent functions that only has to be worked out once and for all. A widely used
such set of functions are the Catani-Seymour (CS) dipole ones [63, 64], a method which by
now has even been partially automated [65, 66]. Here, we shall instead use a formalism based
on antennae [67, 68, 69]. The distinction between the two is basically that one antenna is
made up of two dipole ends, hence the antenna formalism tends to generate somewhat fewer
terms. At NLO, however, there is no fundamental incompatibility the antennae we use here
can always be partitioned into two dipole ends, if so desired. (Note: only the antenna method
has been successfully generalized to NNLO [70, 71]. Other NNLO techniques, not covered
here, are sector decomposition, see [72, 73], and the generic formalism for hadroproduction of
colorless states presented in [74].)
At NLO, the idea with subtraction is thus to rewrite the NLO cross section by adding and
subtracting a simple function, dS , that encapsulates all the IR limits,
Z
NLO Born (0)
= + dF +1 |MF +1 |2 dSNLO
| {z }
Finite by Universality
Z Z
(1) (0)
+ dF 2Re[MF MF ] + dF +1 dSNLO . (52)
| {z }
Finite by KLN
The task now is to construct a suitable form for dS . A main requirement is that it should be
sufficiently simple that the integral in the last term can be done analytically, in dimensional
regularization, so that the IR poles it generates can be canceled against those from the loop
term.
To build a set of universal terms that parametrize the IR singularities of any amplitude, we
start from the observation that gauge theory amplitudes factorize in the soft limit, as follows:
!
2 2m 2
j 0 2s ik 2m
|MF +1 (. . . , i, j, k, . . .)|2 gs2 NC 2 i 2 k |MF (. . . , i, k, . . .)|2 ,(53)
g
where parton j is a soft gluon, partons i, j, and k form a chain of color-space index contractions
(we say they are color-connected), gs is the strong coupling, and the terms in parenthesis are
called the soft eikonal factor. We here show it including mass corrections, which appear if i
and k have non-zero rest masses, with the invariants sab then defined as
sab 2pa pb = (pa + pb )2 m2a m2b . (54)
The color factor, NC , is valid for the leading-color contribution, regardless of whether the
i and k partons are quarks or gluons. At subleading color, an additional soft-eikonal factor
identical to the one above but with a color factor proportional to 1/NC arises for each q q
pair combination. This, e.g., modifies the effective color factor for q q qg q from NC to
NC (1 1/NC ) = 2CF , in agreement with the color factor for quarks being CF rather than CA .
Similarly, amplitudes also factorize in the collinear limit (partons i and j parallel, so
sij 0), in which the eikonal factor above is replaced by the famous Dokshitzer-Gribov-
Lipatov-Altarelli-Parisi (DGLAP) splitting kernels [47, 46, 48], which were already mentioned
28
P. Skands Introduction to QCD
in section 2.2, in the context of PDF evolution. They are also the basis of conventional parton-
shower models, to which we return in section 3.2.
Essentially, what antenna functions, CS dipoles, and the like, all do, is to combine the soft
(eikonal) and collinear (Altarelli-Parisi) limits into one universal set of functions that achieve
the correct limiting behavior for both soft and collinear radiation. To give an explicit example,
the antenna function for gluon emission from a color-connected q q pair can be derived from
the matrix elements squared for the process Z 0 q q qg q [75],
2s
M(Z 0 qi gj qk )2
2 ik 1 sjk sij
= gs 2CF + + , (55)
|M(Z 0 qI qK )|2 sij sjk sIK sij sjk
| {z } | {z }
eikonal collinear
where we have neglected mass corrections (see [76, 77] for massive expressions) and we
recognize the universal eikonal soft factor from equation (53) in the first term. The two
additional terms are less singular, and are required to obtain the correct collinear (Altarelli-
Parisi) limits as sij 0 or sjk 0.
However, since the singularity structure is universal, we could equally well have used the
process H 0 q q qg q to derive the antenna function. Our antenna function would then
have come out as [77],
2s
M(H 0 qi gj qk )2
2 ik 1 sjk sij 2
2 = gs 2C F + + + , (56)
0
|M(H qI qK )| s s
ij jk s IK sij s jk s IK
| {z } | {z } |{z}
eikonal collinear finite
where the additional term 2/sIK is non-singular (finite) over all of phase space. Thus, we
here see an explicit example that the singularities are process-independent while the non-
singular terms are process-dependent. Since we add and subtract the same term in equa-
tion (52), the final answer does not depend on the choice of finite terms. We say that they
correspond to different subtraction schemes. One standard antenna subtraction scheme, which
uses the antenna function defined in equation (55) rather than the one in equation (56), is
the Gehrmann-Gehrmann-de Ridder-Glover (GGG) one, given in [69].
If there is more than one color antenna in the Born-level process, the form of dS is
obtained as a sum over terms, each of which captures one specific soft limit and either all or
half of a collinear one, depending on the specific scheme and the type of parton,
X
dS = AIKijk |MF (. . . , I, K, . . .)|2 , (57)
j
with the sum running over all singular 3 2 clusterings of the (F + 1)-parton state to F
partons. An analysis of the different ways of partitioning the collinear singularity of gluons
among neighboring antenna is beyond the scope of this introduction, but useful discussions
can be found in [78, 79, 80].
29
P. Skands Introduction to QCD
low scale (i.e., any non-UV limit), without regard to whether it is collinear or soft.
The property of infrared safety defines a special class of observables which have minimal
sensitivity to long-distance physics, and which can be consistently computed in pQCD. An
observable is infrared safe if:
1. (Safety against soft radiation): Adding any number of infinitely soft particles should not
change the value of the observable.
2. (Safety against collinear radiation): Splitting an existing particle up into two comoving
particles, with arbitrary fractions z and 1 z, respectively, of the original momentum,
should not change the value of the observable.
If both of these conditions are satisfied, any long-distance non-perturbative corrections will be
suppressed by the ratio of the long-distance scale to the short-distance one to some (observable-
dependent) power, typically
Q2IR
IR Safe Observables: IR corrections (58)
Q2UV
where QUV denotes a generic hard scale in the problem, and QIR QCD O(1 GeV).
Due to this power suppression, IR safe observables are not so sensitive to our lack of ability
to solve the strongly coupled IR physics, unless of course we go to processes for which the
relevant hard scale, QUV , is small (such as minimum-bias, soft jets, or small-scale jet substruc-
ture). Even when a high scale is present, however, as in resonance decays, jet fragmentation,
or underlying-event-type studies, infrared safety only guarantees us that infrared corrections
are small, not that they are zero. Thus, ultimately, we run into a precision barrier even for IR
safe observables, which only a reliable understanding of the long-distance physics itself can
address.
To constrain models of long-distance physics, one needs infrared sensitive observables.
Hence it is not always the case that infrared safe observables are preferable the purpose
decides the tool. Instead of the suppressed corrections above, the perturbative prediction for
such observables contains logarithms of the type already encountered in equation (46),
2
n m2n QUV
IR Sensitive Observables: IR Corrections s log , (59)
Q2IR
which grow increasingly large as QIR /QUV 0. As an example, consider such a fundamental
quantity as particle multiplicities (= number of particles); in the absence of nontrivial infrared
effects, the number of partons tends logarithmically to infinity as the IR cutoff is lowered.
Similarly, the distinction between a charged and a neutral pion only occurs in the very last
phase of hadronization, and hence observables that only include charged tracks, for instance,
are always IR sensitive18 .
Two important categories of infrared safe observables that are widely used are event shapes
and jet algorithms. Jet algorithms are perhaps nowhere as pedagogically described as in the
2009 ESHEP lectures by Salam [4, Chapter 5]. Event shapes in the context of hadron colliders
18
This remains true in principle even if the tracks are clustered into jets, although the energy clustered in this
way does provide a lower bound on QUV in the given event, since charged + neutral > charged-only.
30
P. Skands Introduction to QCD
have not yet been as widely explored, but the basic phenomenology is introduced also by
Salam and collaborators in [81], with first measurements reported by CMS and ATLAS [82, 83]
and a proposal to use them also for the characterization of soft-QCD (minimum-bias) events
put forth in [84].
Let us here merely emphasize that the real reason to prefer infrared safe jet algorithms
over unsafe ones is not that they necessarily give very different or better answers in the ex-
periment experiments are infrared safe by definition, and the difference between infrared
safe and unsafe algorithms may not even be visible when running the algorithm on experi-
mental data but that it is only possible to compute perturbative QCD predictions for the
infrared safe ones. Any measurement performed with an infrared unsafe algorithm can only
be compared to calculations that include a detailed hadronization model. This both limits the
number of calculations that can be compared to and also adds an a priori unknown sensitivity
to the details of the hadronization description, details which one would rather investigate and
constrain separately, in the framework of more dedicated fragmentation studies.
For LHC phenomenology, the preferred IR safe algorithm for jet reconstruction is currently
the anti-kT one [85], with size parameter R varying between 0.4 and 0.7, though larger sizes
can be motivated in certain contexts, e.g., to look for highly energetic jets and/or the boosted
decay products of high-mass objects [86, 22]. This algorithm generates circular-looking jets,
so subtracting off the energy believed to be associated with the underlying event (UE, see
section 5.2) is particularly simple.
For jet substructure, typically either the kT or Cambridge/Aachen algorithms are used,
see e.g. [86, 22]. The clustering measures used in these algorithms more closely mimic the
singularity structure of QCD bremsstrahlung and they are therefore particularly well suited
to unravel a tree of QCD branchings [4], such as a parton shower generates. The Cam-
bridge/Aachen algorithm may also be used to characterize the underlying event, see [87].
31
P. Skands Introduction to QCD
32
P. Skands Introduction to QCD
Table 2: Relative uncertainty after n evaluations, in 1 and d dimensions, for two traditional
numerical integration methods and stochastic Monte Carlo. The last column shows the number
of function evaluations that are required per point, in d dimensions.
33
P. Skands Introduction to QCD
Figure 15: Left: The casino in Monaco. Right: extract from [7] concerning the nature of
Monte Carlo techniques.
However, due to the infrared singularities in perturbative QCD, and due to the presence of
short-lived resonances, the functions to be integrated, |MF +k |2 , can be highly non-uniform,
especially for large k. This implies that we will have to be clever in the way we sample
phase space if we want the integration to converge in any reasonable amount of time sim-
ple algorithms like R AMBO quickly become inefficient for k greater than a few. To address
this bottleneck, the simplest step up from R AMBO is to introduce generic (i.e., automated)
importance-sampling methods, such as offered by the V EGAS algorithm [95, 96]. This is still
the dominant basic technique, although most modern codes do employ several additional
refinements, such as several different copies of V EGAS running in parallel (multi-channel in-
tegration), to further optimize the sampling. Alternatively, a few algorithms incorporate the
singularity structure of QCD explicitly in their phase-space sampling, either by directly gen-
erating momenta distributed according to the leading-order QCD singularities, in a sort of
QCD-preweighted analog of R AMBO, called S ARGE [97], or by using all-orders Markovian
parton showers to generate them (V INCIA [79, 80]).
The price of using random numbers is that we must generalize our notion of convergence.
In calculus, we say that a sequence {A} converges to B if an n exists for which the difference
|Ai>n B| < for any > 0. In random-number-based techniques, we cannot completely rule
out the possibility of very pathological sequences of dice rolls leading to large deviations
from the true goal, hence we are restricted to say that {A} converges to B if an n exists for
which the probability for |Ai>n B| < , for any > 0, is greater than P , for any P [0, 1] [7].
This risk, that convergence is only given with a certain probability, is the reason why Monte
Carlo techniques were named after the famous casino in Monaco, illustrated in figure 15.
34
P. Skands Introduction to QCD
Secondly, however, in order to be allowed to truncate the perturbative series, we had to require
k+1 k , i.e., the corrections at successive orders must become successively smaller, which
due to the enhancements from soft/collinear singular (conformal) dynamics effectively
restricted us to consider only the phase-space region in which all jets are hard and well-
separated, equivalent to requiring all Qi /Qj 1. In this section, we shall see how to lift this
restriction, extending the applicability of perturbation theory into regions that include scale
hierarchies, Qi Qj QCD , such as occur for soft jets, jet substructure, etc.
In fact, the simultaneous restriction to all resolved scales being larger than QCD and
no large hierarchies is extremely severe, if taken at face value. Since we collide and observe
hadrons ( low scales) while simultaneously wishing to study short-distance physics processes
( high scales), it would appear trivial to conclude that fixed-order pQCD is not applicable to
collider physics at all. So why do we still use it?
The answer lies in the fact that we actually never truly perform a fixed-order calculation in
QCD. Let us repeat the factorized formula for the cross section, equation (44), now inserting
also a function, D, to represent the fragmentation of the final-state partons into observable
hadrons,
d XZ 1 XZ d
ijf
= dxi dxj df fi/h1 (xi , 2F )fj/h2 (xj , 2F ) O, 2F ) , (61)
Df (O
dO 0 d O
i,j f
with O denoting the observable evaluated on the partonic final state, and O the observable
evaluated on the hadronic final state, after fragmentation. Although the partonic cross sec-
tion, dijf , does represent a fixed-order calculation, the parton densities, fi/h1 and fj/h2 ,
include so-called resummations of perturbative corrections to all orders from the initial scale
of order the mass of the proton, up to the factorization scale, F (see section 2.2 and/or the
TASI lectures by Sterman [42]). Note that the oft-stated mantra that the PDFs are purely
non-perturbative functions is therefore misleading. True, they are defined as essentially non-
perturbative functions at some very low scale, 0 a few GeV, but, if F is taken large, they
necessarily incorporate a significant amount of perturbative physics as well. On the fixed-
order side, all we have left to ensure in dijf is then that there are no large hierarchies
remaining between F and the QCD scales appearing in f . Likewise, in the final state, the
fragmentation functions, Df , include infinite-order resummations of perturbative corrections
all the way from F down to some low scale, with similar caveats concerning mantras about
their non-perturbative nature as for the PDFs.
The infinite-order resummations that are included in objects such as the PDFs and FFs in
equation (61) (and in their parton-shower equivalents) rely on some very simple and powerful
properties of gauge field theories that were already touched on in section 2. In particular, we
saw in section 2.4 that we can represent all the infrared (IR) limits of any NLO amplitude with
a set of simple universal functions, based solely on knowing which partons are color-connected
(i.e., have color-space index contractions) with one another.
The diagrams in figure 16 show the basic origin of the universal IR singularities of gauge
theory amplitudes. On the left is shown a diagram (squared) in which an emission with small
sij interferes with itself. In the collinear limit, sij 0, the propagator of the parent parton,
35
P. Skands Introduction to QCD
i i
j
I I
j
K K
k k
Figure 16: Diagrams (squared) giving rise to collinear (left) and soft (right) singularities.
I, goes on shell; the singularity of the associated propagator factor is the origin of the 1/sij
collinear singularities. On the right is shown the interference between a diagram with emission
from parton I and one with emission from parton K. The resulting term has propagator
singularities when both partons I and K go on shell, which can happen simultaneously if
parton j is soft. This generates the 2sik /(sij sjk ) soft singularity, also called the soft eikonal
factor or the dipole factor.
We now understand the fundamental origin of the IR singularities, why they are universal,
and why amplitudes factorize in the soft and collinear limits the singularities are simply
generated by intermediate parton propagators going on shell, which is independent of the
nature of the hard process, and hence can be factorized from it.
Thus, for each pair of (massless) color-connected partons I and K in F , the squared am-
plitude for F + 1 gluon, |MF +1 |2 , will include a factor
2 2sik
|MF +1 | = gs NC2
+ collinear terms |MF |2 , (62)
sij sjk
| {z }
Antenna Function
where gs2 = 4s is the strong coupling, i and k represent partons I and K after the branching
(i.e., they include possible recoil effects) and sij is the invariant between parton i and the
emitted parton, j.
The branching phase space of a color dipole (i.e., a pair of partons connected by a color-
index contraction) is illustrated in figure 17. Expressed in the branching invariants, sij and sjk ,
the phase space has a characteristic triangular shape, imposed by the relation s = sij +sjk +sik
(assuming massless partons). Sketchings of the post-branching parton momenta have been
inserted in various places in the figure, for illustration. The soft singularity is located at the
origin of the plot and the collinear regions lie along the axes.
The collinear terms for a q q qg q antenna are unambiguous and are given in section 2.4.
Since gluons are in the adjoint representation, they carry both a color and an anticolor index
(one corresponding to the rows and the other to the columns of the Gell-Mann matrices),
and there is therefore some ambiguity concerning how to partition collinear radiation among
the two antennae they participate in. This is discussed in more detail in [79]. Differences
are subleading, however, and for our purposes here we shall consider gluon antenna ends as
radiating just like quark ones. The difference between quark and gluon radiation then arise
mainly because gluons participate in two antennae, while quarks only participate in one. This
36
P. Skands Introduction to QCD
sjk
P HASE S PACE FOR 2 3
KINEMATICS INCLUDING (E, P ) CONS
Original Dipole-Antenna:
I K
Collinear with K
Figure 17: Illustration of the branching phase space for q q qg q, with the original dipole-
antenna oriented horizontally, the two parents sharing the transverse component of recoil, and
the azimuthal angle (representing rotations of the emitted parton around the dipole axis)
chosen such that the gluon is radiated upwards. From [79].
37
P. Skands Introduction to QCD
yjk
0.4 0.8 0.4 0.6 0.8 0.4 0.4
0.6
0.2 0.5
0.2 0.2 0.4 0.2 0.2
0.4 0.75
0.2 0.25 0.5
0.0 0.0 0.75
0.0 0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
yij yij yar = sar sarb = 1-xb yar = sar sarb = 1-xb
Figure 18: A selection of parton-shower evolution variables, represented as contours over the
dipole phase space. Note: the right-most variable corresponds to evolution of only one of the
parents, the one with no collinear singularity along the bottom of the plot.
non-invariant formulations of quantum field theory: while they can be practical to work with
at intermediate stages of a calculation, one should be careful with any physical conclusions
that rely explicitly on them.
We shall therefore here restrict ourselves to a Lorentz-invariant formalism based directly
on equation (62), pioneered by the dipole formulation of QCD cascades [75]. The collinear
limit is then replaced by a more general single-pole limit in which a single parton-parton in-
variant vanishes (as, for instance, when a pair of partons become collinear), while the soft limit
is replaced by one in which two (or more) invariants involving the same parton vanish simul-
taneously (as, for instance by that parton becoming soft in a frame defined by two or more
hard partons). This avoids frame-dependent ambiguities from entering into the language, at
the price of a slight reinterpretation of what is meant by collinear and soft.
In the generator landscape, angular ordering is used by the H ERWIG [99] and H ERWIG++ [100]
programs, and an angular veto is imposed on the virtuality-ordered evolution in P YTHIA 6 [101].
Variants of the dipole approach is used by the A RIADNE [102], S HERPA [103, 104], and V IN -
CIA [105] programs, while the p -ordered showers in P YTHIA 6 and 8 represent a hybrid,
combining collinear splitting kernels with dipole kinematics [106]. Phase-space contours of
equal value of some of these choices are illustrated in figure 18. During the shower evolution,
each model effectively sweeps over phase space in the order implied by these contours. E.g.,
a p -ordered dipole shower (leftmost plot in figure 18) will treat a hard-collinear branching
as occurring earlier than a soft one, while a mass-ordered dipole shower (second plot) will
tend to do the opposite. This affects the tower of virtual corrections generated by each shower
model via the so-called Sudakov factor, discussed below. Experimental tests of the subleading
aspects of shower models can therefore quite important, see e.g. [107] for a recent example.
Independently of rewritings and philosophy, the real power of equation (62) lies in the fact
that it is universal. Thus, for any process F , we can apply equation (62) in order to get an
approximation for dF +1 . We may then, for instance, take our newly obtained expression for
F + 1 as our arbitrary process and crank equation (62) again, to obtain an approximation for
dF +2 , and so forth. What we have here is therefore a very simple recursion relation that can
be used to generate approximations to leading-order cross sections with arbitrary numbers of
additional legs. The quality of this approximation is governed by how many terms besides the
leading one shown in equation (53) are included in the game. Including all possible terms,
the most general form for the cross section at F + n jets, restricted to the phase-space region
38
P. Skands Introduction to QCD
above some infrared cutoff scale IR , has the following algebraic structure,
(0)
F +n = sn ln2n + ln2n1 + ln2n2 + . . . + ln + F (65)
where we use the notation ln without an argument to denote generic functions of transcen-
dentality (the logarithmic function to the power being a typical example of a function
with transcendentality appearing in cross section expressions, but also dilogarithms and
higher logarithmic functions20 of transcendentality > 1 should be implicitly understood to be-
long to our notation ln ). The last term, F, represents a rational function of transcendentality
0. We shall also use the nomenclature singular and finite for the ln and F terms, respectively,
a terminology which reflects their respective behavior in the limit IR 0.
The simplest approximation one can build on equation (65), dropping all but the leading
2n
ln term in the parenthesis, is thus the leading-transcendentality approximation. This approx-
imation is better known as the DLA (double logarithmic approximation), since it generates the
correct coefficient for terms which have two powers of logarithms for each power of s , while
terms of lower transcendentalities are not guaranteed to have the correct coefficients. In so-
called LL (leading-logarithmic) parton shower algorithms, one generally expects to reproduce
the correct coefficients for the ln2n and ln2n1 terms. In addition, several formally subleading
improvements are normally also introduced in such algorithms (such as explicit momentum
conservation, gluon polarization and other spin-correlation effects [108, 109, 110], higher-
order coherence effects [98], renormalization scale choices [111], finite-width effects [112],
etc), as a means to improve the agreement with some of the more subleading coefficients as
well, if not in every phase-space point then at least on average. Though LL showers do not
magically acquire NLL (next-to-leading-log) precision from such procedures, one therefore still
expects a significantly better average performance from them than from corresponding strict
LL analytical resummations. A side effect of this is that it is often possible to tune shower
algorithms to give better-than-nominal agreement with experimental distributions, by adjust-
ing the parameters controlling the treatment of subleading effects. One should remember,
however, that there is a limit to how much can be accomplished in this way at some point,
agreement with one process will only come at the price of disagreement with another, and at
this point further tuning would be meaningless.
Applying such an iterative process on a Born-level cross section, one obtains the description
of the full perturbative series illustrated in figure 19. The yellow (lighter) shades used here for
k 1 indicate that the coefficient obtained is not the exact one, but rather an approximation to
it that only gets its leading singularities right. However, since this is still only an approximation
to infinite-order tree-level cross sections (we have not yet included any virtual corrections), we
cannot yet integrate this approximation over all of phase space, as illustrated by the yellow
boxes being only half filled in figure 19; otherwise, the summed total cross section would still
be infinite. This particular approximation would therefore still appear to be very useless indeed
on one hand, it is only guaranteed to get the singular terms right, but on the other, it does
not actually allow us to integrate over the singular region. In order to obtain a truly all-orders
calculation, the constraint of unitarity must also be explicitly imposed, which furnishes an
approximation to all-orders loop corrections as well. Let us therefore emphasize that figure 19
20
Note: due to the theorems that allow us, for instance, to rewrite dilogarithms in different ways with logarithmic
and lower spillover terms, the coefficients at each are only well-defined up to reparametrization ambiguities
involving the terms with transcendentality greater than .
39
P. Skands Introduction to QCD
F @ LOLL(non-unitary)
(2) (2)
2 0 1 ...
` (loops)
(1) (1) (1)
1 0 1 2 ...
is included for pedagogical purposes only; all resummation calculations, whether analytical
or parton-shower based, include virtual corrections as well and consequently yield finite total
cross sections, as will now be described.
Order-by-order unitarity, such as used in the KLN theorem, implies that the singularities caused
by integration over unresolved radiation in the tree-level matrix elements must be canceled,
order by order, by equal but opposite-sign singularities in the virtual corrections at the same
order. That is, from equation (62), we immediately know that the 1-loop correction to dF
must contain a term,
Z ds ds 2s
(0) (1) 2 (0) 2 ij jk ik
2Re[MF MF ] gs NC MF + less singular terms , (66)
16 2 sijk sij sjk
that cancels the divergence coming from equation (62) itself. Further, since this is universally
true, we may apply equation (66) again to get an approximation to the corrections generated
by equation (62) at the next order and so on. By adding such terms explicitly, order by order,
we may now bootstrap our way around the entire perturbative series, using equation (62) to
move horizontally and equation (66) to move along diagonals of constant n = k + `. Since
real-virtual cancellations are now explicitly restored, we may finally extend the integrations
over all of phase space, resulting in the picture shown in figure 20.
The picture shown in figure 20, not the one in figure 19, corresponds to what is actually
done in resummation calculations, both of the analytic and parton-shower types21 . Physically,
there is a significant and intuitive meaning to the imposition of unitarity, as follows.
Take a jet algorithm, with some measure of jet resolution, Q, and apply it to an arbitrary
sample of events, say dijets. At a very crude resolution scale, corresponding to a high value
21
In the way these calculations are formulated in practice, they in fact rely on one additional property, called
exponentiation, that allows us to move along straight vertical lines in the loops-and-legs diagrams. However, since
the two different directions furnished by equations (62) and (66) are already sufficient to move freely in the full
2D coefficient space, we shall use exponentiation without extensively justifying it here.
40
P. Skands Introduction to QCD
F @ LOLL(unitary)
(2) (2)
2 0 1 ...
` (loops)
(1) (1) (1)
1 0 1 2 ...
for Q, you find that everything is clustered back to a dijet configuration, and the 2-jet cross
section is equal to the total inclusive cross section,
At finer resolutions, decreasing Q, you see that some events that were previously classified as
2-jet events contain additional, lower-scale jets, that you can now resolve, and hence those
events now migrate to the 3-jet bin, while the total inclusive cross section of course remains
unchanged,
tot = F ;incl = F ;excl (Q) + F +1;incl (Q) , (68)
where incl and excl stands for inclusive and exclusive cross sections22 , respectively, and the
Q-dependence in the two terms on the right-hand side must cancel so that the total inclusive
cross section is independent of Q. Later, some 3-jet events now migrate further, to 4 and higher
jets, while still more 2-jet events migrate into the 3-jet bin, etc. For arbitrary n and Q, we have
n1
X
F +n;incl (Q) = F ;incl F +m;excl (Q) . (69)
m=0
This equation expresses the trivial fact that the cross section for n or more jets can be computed
as the total inclusive cross section for F minus a sum over the cross sections for F + exactly
m jets including all m < n. On the theoretical side, it is these negative terms which must
be included in the calculation, for each order n = k + `, to restore unitarity. Physically, they
express that, at a given scale Q, each event will be classified as having either 0, 1, 2, or
whatever jets. Or, equivalently, for each event we gain in the 3-jet bin as Q is lowered, we
must loose one event in the 2-jet one; the negative contribution to the 2-jet bin is exactly
minus the integral of the positive contribution to the 3-jet one, and so on. We may perceive
of this detailed balance as an evolution of the event structure with Q, for each event, which is
effectively what is done in parton-shower algorithms, to which we shall return in Section 3.3.
22 P
F inclusive = F plus anything. F exclusive = F and only F . Thus, F ;incl = k=0 F +k;excl
41
P. Skands Introduction to QCD
Formally, this operator the evolution operator will be responsible for generating all
(real and virtual) higher-order corrections to the Born-level expression. The measurement
function appearing explicitly in equation (70) is now implicit in S.
Algorithmically, parton showers cast S as an iterative Markov (i.e., history-independent)
chain, with an evolution parameter, QE , that formally represents the factorization scale of
the event, below which all structure is summed over inclusively. Depending on the partic-
ular choice of shower algorithm, QE may be defined as a parton virtuality (virtuality-order
showers), as a transverse-momentum scale (p -ordered showers), or as a combination of en-
ergies times angles (angular ordering). Regardless of the specific form of QE , the evolution
parameter will go towards zero as the Markov chain develops, and the event structure will
become more and more exclusively resolved. A transition from a perturbative evolution to a
non-perturbative one can also be inserted, when the evolution reaches an appropriate scale,
typically around 1 GeV. This scale, called the hadronization scale, thus represents the lowest
perturbative scale that can appear in the calculations, with all perturbative corrections below
it summed over inclusively.
Working out the precise form that S must have in order to give the correct expansions
discussed in section 3.2 takes a bit of algebra, and is beyond the scope we aim to cover in these
lectures. Heuristically, the procedure is as follows. We noted that the singularity structure of
QCD is universal and that at least its first few terms are known to us. We also saw that we
could iterate that singularity structure, using universality and unitarity, thereby bootstrapping
our way around the entire perturbative series. This was illustrated by figure 20 in section 3.2.
Skipping intermediate steps, the form of the all-orders pure-shower Markov chain, for the
42
P. Skands Introduction to QCD
S(F , Q1 , Q2 , O) = (F , Q1 , Q2 ) (O O(F ))
| {z }
F + 0 exclusive above Q2
X Z QE1 drF +1
+ Sr (F +1 ) (F , Q1 , QF +1 ) S(F +1 , QF +1 , Q2 , O) ,
r QE2 dF
| {z }
F + 1 inclusive above Q2
(72)
with the so-called Sudakov factor,
" #
X Z Q1 drF +1
(F , Q1 , Q2 ) = exp Sr (F +1 ) , (73)
r Q2 dF
defining the probability that there is no evolution (i.e., no emissions) between the scales Q1 and
Q2 , according to the radiation functions Sr to which we shall return below. The term on the first
line of equation (72) thus represents all events that did not evolve as the resolution scale was
lowered from Q1 to Q2 , while the second line contains a sum and phase-space integral over
those events that did evolve including the insertion of S(F +1 ) representing the possible
further evolution of the event and completing the iterative definition of the Markov chain.
The factor drF +1 /dF defines the chosen phase space factorization. Our favorite is the
so-called dipole-antenna factorization, whose principal virtue is that it is the simplest Lorentz
invariant factorization which is simultaneously exact over all of phase space while only involv-
ing on-shell momenta. For completeness, its form is
drF +1 dr3 d 1
= = dsa1 ds1b , (74)
dF d2 2 16 2 sr
which involves just one color-anticolor pair for each r, with invariant mass squared sr =
(pa + p1 + pb )2 . Other choices, such as purely collinear ones (only exact in the collinear
limit or involving explicitly off-shell momenta), more global ones involving all partons in the
event (more complicated, in our opinion), or less global ones with a single parton playing
the dominant role as emitter, are also possible, again depending on the specific algorithm
considered.
The radiation functions Sr obviously play a crucial role in these equations, driving the
emission probabilities. For example, if Sr 0, then exp(0) = 1 and all events stay in the
top line of equation (72). Thus, in regions of phase space where Sr is small, there is little or
no evolution. Conversely, for Sr , we have 0, implying that all events evolve. One
possible choice for the radiation functions Sr was implicit in equation (62), in which we took
them to include only the leading (double) singularities, with r representing color-anticolor
pairs. In general, the shower may exponentiate the entire set of universal singular terms, or
only a subset of them (for example, the terms leading in the number of colors NC ), which is
why we here let the explicit form of Sr be unspecified. Suffice it to say that in traditional parton
showers, Sr would simply be the DGLAP splitting kernels (see, e.g., [3]), while they would be
so-called dipole or antenna radiation functions in the various dipole-based approaches to QCD
(see, e.g., [75, 63, 69, 104, 79, 80]).
43
P. Skands Introduction to QCD
The procedure for how to technically construct a shower algorithm of this kind, using
random numbers to generate scales distributed according to equation (72), is described more
fully in [79], using a notation that closely parallels the one used here. The procedure is also
described at a more technical level in the review [5], though using a slightly different notation.
Finally, a pedagogical introduction to Monte Carlo methods in general can be found in [7].
44
P. Skands Introduction to QCD
+ =
` (loops)
` (loops)
` (loops)
(1) (1) (1) (1) (1) (1) (1) (1) (1)
1 0 1 2 ... 1 0 1 2 ... 1 0 1 2 ...
(0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0)
0 0 1 2 3 ... 0 0 1 2 3 ... 0 0 1 2 3 ...
0 1 2 3 ... 0 1 2 3 ... 0 1 2 3 ...
k (legs) k (legs) k (legs)
Figure 21: The double-counting problem caused by naively adding cross sections involving
matrix elements with different numbers of legs.
4.1 Slicing
The most commonly encountered matching type is currently based on separating (slicing)
phase space into two regions, one of which is supposed to be mainly described by hard matrix
elements and the other of which is supposed to be described by the shower. This type of ap-
proach was first used in H ERWIG [117], to include matrix-element corrections for one emission
beyond the basic hard process [118, 119]. This is illustrated in figure 22. The method has
45
P. Skands Introduction to QCD
F @ LOLL-Soft (H ERWIG Shower) F+1 @ LOLL (H ERWIG Corrections) F @ LO1 LL (H ERWIG Matched)
(2) (2) (2) (2) (2) (2)
2 0 1 ... 2 0 1 ... 2 0 1 ...
+ =
` (loops)
` (loops)
` (loops)
(1) (1) (1) (1) (1) (1) (1) (1) (1)
1 0 1 2 ... 1 0 1 2 ... 1 0 1 2 ...
(0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0)
0 0 1 2 3 ... 0 0 1 2 3 ... 0 0 1 2 3 ...
0 1 2 3 ... 0 1 2 3 ... 0 1 2 3 ...
k (legs) k (legs) k (legs)
Figure 22: H ERWIGs original matching scheme [118, 119], in which the dead zone of the
H ERWIG shower was used as an effective matching scale for one emission beyond a basic
hard process.
since been generalized by several independent groups to include arbitrary numbers of addi-
tional legs, the most well-known of these being the CKKW [120], CKKW-L [121, 122], and
MLM [123, 124] approaches.
Effectively, the shower approximation is set to zero above some scale, either due to the
presence of explicit dead zones in the shower, as in H ERWIG, or by vetoing any emissions
above a certain matching scale, as in the (L)-CKKW and MLM approaches. The empty part of
phase space can then be filled by separate events generated according to higher-multiplicity
tree-level matrix elements (MEs). In the (L)-CKKW and MLM schemes, this process can be
iterated to include arbitrary numbers of additional hard legs (the practical limit being around
3 or 4, due to computational complexity).
In order to match smoothly with the shower calculation, the higher-multiplicity matrix ele-
ments must be associated with Sudakov form factors (representing the virtual corrections that
would have been generated if a shower had produced the same phase-space configuration),
and their s factors must be chosen so that, at least at the matching scale, they become identi-
cal to the choices made on the shower side [125]. The CKKW and MLM approaches do this by
constructing fake parton-shower histories for the higher-multiplicity matrix elements. By ap-
plying a sequential jet clustering algorithm, a tree-like branching structure can be created that
at least has the same dominant structure as that of a parton shower. Given the fake shower
tree, s factors can be chosen for each vertex with argument s (p ) and Sudakov factors can
be computed for each internal line in the tree. In the CKKW method, these Sudakov factors
are estimated analytically, while the MLM and CKKW-L methods compute them numerically,
from the actual shower evolution.
Thus, the matched result is identical to the matrix element (ME) in the region above the
matching scale, modulo higher-order (Sudakov and s ) corrections. We may sketch this as
ME corrections
z }| { z }| {
Matched (above matching scale) = Exact (1 + O(s )) , (76)
where the shower-corrections include the approximate Sudakov factors and s reweighting
factors applied to the matrix elements in order to obtain a smooth transition to the shower-
dominated region.
Below the matching scale, the small difference between the matrix elements and the
shower approximation can be dropped (since their leading singularities are identical and this
46
P. Skands Introduction to QCD
F @ LOLL-Soft (excl) F+1 @ LOLL-Soft (excl) F+2 @ LOLL (incl) F @ LO2 LL (MLM & (L)-CKKW)
(2) (2) (2) (2)
2 0 ... 2 0 ... 2 0 ... 2 0 ...
+ + =
` (loops)
` (loops)
` (loops)
` (loops)
(1) (1) (1) (1) (1) (1) (1) (1)
1 0 1 ... 1 0 1 ... 1 0 1 ... 1 0 1 ...
(0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0)
0 0 1 2 0 0 1 2 0 0 1 2 0 0 1 2
0 1 2 0 1 2 0 1 2 0 1 2
k (legs) k (legs) k (legs) k (legs)
Figure 23: Slicing, with up to two additional emissions beyond the basic process. The showers
off F and F + 1 are set to zero above a specific matching scale. (The number of coefficients
shown was reduced a bit in these plots to make them fit in one row.)
region by definition includes no hard jets), yielding the pure shower answer in that region,
shower correction
z }| { z }| {
Matched (below matching scale) = Approximate + (Exact Approximate)
= Approximate + non-singular
Approximate . (77)
47
P. Skands Introduction to QCD
+ =
` (loops)
` (loops)
` (loops)
(1) (1) (1) (1) (1) (1) (1) (1) (1)
1 0 1 2 ... 1 0 1 2 ... 1 0 1 2 ...
(0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0)
0 0 1 2 3 ... 0 0 1 2 3 ... 0 0 1 2 3 ...
0 1 2 3 ... 0 1 2 3 ... 0 1 2 3 ...
k (legs) k (legs) k (legs)
Figure 24: M C @ NLO. In the middle pane, cyan boxes denote non-singular correction terms,
while the egg-colored ones denote showers off such corrections, which cannot lead to double-
counting at the LL level.
such that a separate unweighting step (often with quite low efficiency) is needed before an
unweighted sample can be produced.
4.2 Subtraction
Another way of matching two calculations is by subtracting one from the other and correcting
by the difference, schematically
shower correction
z }| { z }| {
Matched = Approximate + (Exact Approximate) . (78)
This looks very much like the structure of a subtraction-based NLO fixed-order calculation,
section 2.4, in which the shower approximation here plays the role of subtraction terms, and
indeed this is what is used in strategies like M C @ NLO [127, 128, 129], illustrated in figure 24.
In this type of approach, however, negative-weight events will generally occur, for instance in
phase-space points where the approximation is larger than the exact answer.
Negative weights are not in principle an insurmountable problem. Histograms can be filled
with each event counted according to its weight, as usual. However, negative weights do affect
efficiency. Imagine a worst-case scenario in which 1000 positive-weight events have been
generated, along with 999 negative-weight ones (assuming each event weight has the same
absolute value, the closest one can get to an unweighted sample in the presence of negative
weights). The statistical precision of the MC answer would be equivalent to one event, for
2000 generated, i.e., a big loss in convergence rate.
In practice, generators like MC@NLO only produce around 10% or less events with nega-
tive weights, so the convergence rate should not be severely affected for ordinary applications.
Nevertheless, the problem of negative weights motivated the development of the so-called
P OWHEG approach [130], illustrated in figure 25, which is constructed specifically to prevent
negative-weight events from occurring and simultaneously to be more independent of which
parton-shower algorithm it is used with. In the P OWHEG method, one effectively modifies the
real-emission probability for the first emission to agree with the F + 1 matrix element (this is
covered under unitarity, below). One is then left with a purely virtual correction, which will
typically be positive, at least for processes for which the NLO cross section is larger than the
LO one.
48
P. Skands Introduction to QCD
+ =
` (loops)
` (loops)
` (loops)
(1) (1) (1) (1) (1) (1) (1) (1) (1)
1 0 1 2 ... 1 0 1 2 ... 1 0 1 2 ...
(0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0)
0 0 1 2 3 ... 0 0 1 2 3 ... 0 0 1 2 3 ...
0 1 2 3 ... 0 1 2 3 ... 0 1 2 3 ...
k (legs) k (legs) k (legs)
Figure 25: P OWHEG. In the middle pane, cyan boxes denote non-singular correction terms,
while the egg-colored ones denote showers off such corrections, which cannot lead to double-
counting at the LL level.
+ =
` (loops)
` (loops)
` (loops)
(1) (1) (1) (1) (1) (1) (1) (1) (1)
1 0 1 2 ... 1 0 1 2 ... 1 0 1 2 ...
(0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0) (0)
0 0 1 2 3 ... 0 0 1 2 3 ... 0 0 1 2 3 ...
0 1 2 3 ... 0 1 2 3 ... 0 1 2 3 ...
k (legs) k (legs) k (legs)
Figure 26: M ENLOPS. Note that each of the P OWHEG and CKKW samples are composed of
separate sub-samples, as illustrated in figures 23 and 25.
The advantage of these methods is obviously that NLO corrections to the Born level can
be systematically incorporated. However, a systematic way of extending this strategy beyond
the first additional emission is not available, save for combining them with a slicing-based
strategy for the additional legs, as in M ENLOPS [131], illustrated in figure 26. These issues
are, however, no more severe than in ordinary fixed-order NLO approaches, and hence they
are not viewed as disadvantages if the point of reference is an NLO computation.
4.3 Unitarity
The oldest, and in my view most attractive, approach [101, 132] consists of working out the
shower approximation to a given fixed order, and correcting the shower splitting functions at
that order by a multiplicative factor given by the ratio of the matrix element to the shower
approximation, phase-space point by phase-space point. We may sketch this as
correction
shower z }| {
z }| { Exact
Matched = Approximate . (79)
Approximate
When these correction factors are inserted back into the shower evolution, they guarantee
that the shower evolution off n 1 partons correctly reproduces the n-parton matrix elements,
without the need to generate a separate n-parton sample. That is, the shower approximation
is essentially used as a pre-weighted (stratified) all-orders phase-space generator, on which a
more exact answer can subsequently be imprinted order by order in perturbation theory. Since
49
P. Skands Introduction to QCD
` (loops)
` (loops)
(1) (1) (1) (1) (1) (1)
1 0 1 2 ... 1 0 1 2 ...
Figure 27: P YTHIA (left) and V INCIA (right). Unitarity-based. Only one event sample is pro-
duced by each of these methods, and hence no sub-components are shown.
the shower is already optimized for exactly the kind of singular structures that occur in QCD,
very fast computational speeds can therefore be obtained with this method [80].
In the original approach [101, 132], used by P YTHIA [133, 134], this was only worked
out for one additional emission beyond the basic hard process. In P OWHEG [130, 135], it
was extended to include also virtual corrections to the Born-level matrix element. Finally, in
V INCIA, it has been extended to include arbitrary numbers of emissions at tree level [79, 80]
and one emission at loop level [136], though that method has so far only been applied to
final-state showers.
An illustration of the perturbative coefficients that can be included in each of these ap-
proaches is given in figure 27, as usual with green (darker shaded) boxes representing exact
coefficients and yellow (light shaded) boxes representing logarithmic approximations.
Finally, two more properties unique to this method deserve mention. Firstly, since the
corrections modify the actual shower evolution kernels, the corrections are automatically re-
summed in the Sudakov exponential, which should improve the logarithmic precision once
k 2 is included, and secondly, since the shower is unitary, an initially unweighted sam-
ple of (n 1)-parton configurations remains unweighted, with no need for a separate event-
unweighting or event-rejection step.
50
P. Skands Introduction to QCD
1. Map the partonic system onto a continuum of high-mass hadronic states (called strings
or clusters).
2. Iteratively map strings/clusters onto discrete set of primary hadrons (via string breaks /
cluster splittings / cluster decays).
The physics governing this mapping is non-perturbative. However, we do have some knowl-
edge of the properties that such a solution must have. For instance, Poincare invariance,
unitarity, and causality are all concepts that apply beyond perturbation theory. In addition,
lattice QCD provides us a means of making explicit quantitative studies in a genuinely non-
perturbative setting (albeit only of certain questions).
An important result in quenched lattice QCD23 is that the potential of the colour-dipole
field between a charge and an anticharge appears to grow linearly with the separation of the
charges, at distances greater than about 0.5 femtometers; this behavior is illustrated by the
plot shown in figure 28, from [138]. (Note that the axes are scaled by units of the string
tension 420 MeV. Additional labels corresponding to 1 GeV and 2 GeV are also provided,
on the y axis, and to 1 fm and 2 fm, on the x axis.) This is known as linear confinement, and
it forms the starting point for the string model of hadronization, discussed below in section 5.1.
Alternatively, a property of perturbative QCD called preconfinement [139] is the basis of the
cluster model of hadronization, described in [5, 23].
In the generator landscape, P YTHIA uses string fragmentation, while H ERWIG and S HERPA
use cluster fragmentation. It should be emphasized that the so-called parton level that can
be obtained by switching off hadronization in an MC generator, is not a universal concept,
since each model defines the hadronization scale differently. E.g., the hadronization scale
can be defined by a cutoff in invariant mass, transverse momentum, or some other quantity,
with different tunes using different values for the cutoff. Note that the so-called parton level
that can be obtained by switching off hadronization in an MC generator, is not a universal
concept, since each model defines the hadronization scale differently, with different tunes
23
Quenched QCD implies no dynamical quarks, i.e., no g q q splittings allowed.
51
P. Skands Introduction to QCD
46 STATIC QUARK-ANTIQUARK POTENTIAL: SCALING. . . 2641
Scaling plot
2GeV-
1 GeV
2
B = 6.0, L=32 ~
I
~
~
B = 6.2, L=24
I B = 6.4, L-24
-2 k, B = 6.4, L=32 I
A
2'
t
80
5.1 String Model
5.6 5.8 6.2 6.4
Starting from early concepts developed by Artru and Mennessier [141], several hadronization
models based on strings were proposed in the late 1970ies and early 80ies. Of these, the
FIG. 5. The on-axis string tension [in units of the quantity c = &E /(a AL ) ] as a function of P. Our results are combined with pre-
most
vious widely used
values obtained by today is the so-called
the MTc collaboration LundMoriarty,
[10] and Barkai, model, and implemented
Rebbi [11]. in the P YTHIA code. We
24
Note, however, that while a hard scale is not required to be present, it is not explicitly required to be absent
either. Thus, both diffractive, minimum-bias, pile-up and underlying-event processes will have tails towards high-
p physics as well. For example, even tt pair production can be viewed as a tail of minimum-bias interactions, and
there is a tail of diffractive processes in which hard dijets can be produced diffractively (see, e.g., [140]).
52
P. Skands Introduction to QCD
1 fm
Figure 29: Illustration of the transition between a Coulomb potential at short distances to the
string-like one of equation (80) at large q q separations.
shall therefore concentrate on that particular model here, though many of the overall concepts
would be shared by any string-inspired method. (A more extended discussion can be found in
the very complete and pedagogical review of the Lund model by Andersson [6].)
Consider the production of a q q pair from vacuum, for instance in the process e+ e
/Z q q hadrons. As the quarks move apart, linear confinement implies that a potential
V (R) = R (80)
is asymptotically reached for large distances, R. At short distances, there is a Coulomb term
proportional to 1/R as well, cf. figure 28, but this is neglected in the Lund model. Such a
potential describes a string with tension (energy per unit length) , with the value [138]
which, for comparison with the world of macroscopic objects, would be sufficient to lift a
16-ton truck [142].
The string can be thought of as parameterizing the position of the axis of a cylindrically
symmetric flux tube, illustrated in figure 29. Such simple q q strings form the starting point
for the string model. More complicated multi-parton topologies are treated by representing
gluons as transverse kinks, e.g., q g q. The space-time evolution is then slightly more
involved [6], and modifications to the fragmentation model to handle stepping across gluon
corners have to be included, but the main point is that there are no separate free parameters
for gluon jets. Differences with respect to quark fragmentation arise simply because quarks
are only connected to a single string piece, while gluons have one on either side, increasing
the energy loss per unit (invariant) time from a gluon to the string by a factor of 2 relative
to quarks, which can be compared to the ratio of colour Casimirs CA /CF = 2.25. Another
appealing feature of the model is that low-energy gluons are absorbed smoothly into the string,
without leading to modifications. This improves the stability of the model with respect to
variations of the infrared behavior of the parton shower.
As the partonic string endpoints move apart, their kinetic energy is gradually converted
to potential energy, stored in the growing string spanned between them. In the quenched
approximation, in which g q q splittings are not allowed, this process would continue until
the endpoint quarks have lost all their momentum, at which point they would reverse direction
and be accelerated by the now shrinking string.
In the real world, quark-antiquark fluctuations inside the string field can make the tran-
sition to become real particles by absorbing energy from the string, thereby screening the
original endpoint charges from each other and breaking the string into two separate colour-
singlet pieces, (q q) (q q0 ) + (q 0 q), illustrated in figure 30 a. This process then continues
53
P. Skands String Break
Introduction to QCD
leftover string,
further breaks
time
q
space
a) b)
Figure 30: a) Illustration of string breaking by quark pair creation in the string field. b)
Illustration of the algorithmic choice to process the fragmentation from the outside-in, splitting
off a single on-shell hadron in each step.
26
until only ordinary hadrons remain. (We will give more details on the individual string breaks
below.)
Since the string breaks are causally disconnected (as can easily be realized from space-
time diagrams like the one in figure 30, see also [6]), they do not have to be considered in any
specific time-ordered sequence. In the Lund model, the string breaks are instead generated
starting with the leading (outermost) hadrons, containing the endpoint quarks, and iterating
inwards towards the center of the string, alternating randomly between fragmentation off the
left- and right-hand sides, respectively, figure 30 b. One can thereby split off a single well-
defined hadron in each step, with a mass that, for unstable hadrons, is selected according to a
Breit-Wigner distribution.
The details of the individual string breaks are not known from first principles. The Lund
model invokes the idea of quantum mechanical tunneling, which leads to a Gaussian suppres-
sion of the transverse momenta and masses imparted to the produced quarks,
! !
2 p 2
m q q
Prob(m2q , p2q ) exp exp , (82)
where mq is the mass of the produced quark and p is the transverse momentum imparted to
it by the breakup process (with the q having the opposite p ).
Due to the factorization of the p and m dependence implied by equation (82), the p spectrum
of produced quarks in this model is independent of the quark flavour, with a universal average
value of
2
pq = 2 = / (240 MeV)2 . (83)
Bear in mind that transverse is here defined with respect to the string axis. Thus, the p in a
frame where the string is moving is modified by a Lorentz boost factor. Also bear in mind that
2 is here a purely non-perturbative parameter. In a Monte Carlo model with a fixed shower
cutoff Qhad , the effective amount of non-perturbative p may be larger than this, due to
effects of additional unresolved soft-gluon radiation below Qhad . In principle, the magnitude
of this additional component should scale with the cutoff, but in practice it is up to the user
to enforce this by retuning (see section 5.4) the effective parameter when changing the
hadronization scale. Since hadrons receive p contributions from two breakups, one on either
54
P. Skands Introduction to QCD
The mass suppression implied by equation (82) is less straightforward to interpret. Since
quark masses are notoriously difficult to define for light quarks, the value of the strangeness
suppression must effectively be extracted from experimental measurements, e.g., of the K/
ratio, with a resulting suppression of roughly s/u s/d 0.2 0.3. Inserting even compara-
tively low values for the charm quark mass in equation (82), however, one obtains a relative
suppression of charm of the order of 1011 . Heavy quarks can therefore safely be considered
to be produced only in the perturbative stages and not by the soft fragmentation.
Baryon production can be incorporated in the same basic picture [143], by allowing string
breaks to occur also by the production of pairs of so-called diquarks, loosely bound states of
two quarks in an overall 3 representation (e.g., red + blue antigreen, cf. the rules for
colour combinations in section 1.2). Again, the relative rate of diquark-to-quark production is
not known a priori and must be extracted from experimental measurements, e.g., of the p/
ratio. More advanced scenarios for baryon production have also been proposed, in particular
the so-called popcorn model [144, 145], which is normally used in addition to the diquark
picture and then acts to decrease the correlations among neighboring baryon-antibaryon pairs
by allowing mesons to be formed inbetween them. Within the P YTHIA framework, a fragmen-
tation model including explicit string junctions [146] has so far only been applied to baryon-
number-violating new-physics processes and to the description of beam remnants (and then
acts to increase baryon stopping [147]).
This brings us to the next step of the algorithm: assignment of the produced quarks within
hadron multiplets. Using a nonrelativistic classification of spin states, the fragmenting q ( q)
may combine with the q0 (q 0 ) from a newly created breakup to produce either a vector or a
pseudoscalar meson, or, if diquarks are involved, either a spin-1/2 or spin-3/2 baryon. Unfor-
tunately, the string model is entirely unpredictive in this respect, and this is therefore the sector
that contains the largest amount of free parameters. From spin counting alone, one would ex-
pect the ratio V /S of vectors to pseudoscalars to be 3, but this is modified by the V S mass
splittings, which implies a phase-space suppression of vector production, with corresponding
suppression parameters to be extracted from data.
Especially for the light flavours, the substantial difference in phase space caused by the
V S mass splittings implies a rather large suppression of vector production. Thus, for D /D,
the effective ratio is already reduced to about 1.0 2.0, while for K /K and /, extracted
values range from 0.3 1.0. (Recall, as always, that these are production ratios of primary
hadrons, hence feed-down from secondary decays of heavier hadrons complicates the extrac-
tion of these parameters from experimental data, in particular for the lighter hadron species.)
The production of higher meson resonances is assumed to be low in a string framework25 .
For diquarks, separate parameters control the relative rates of spin-1 diquarks vs. spin-0 ones
and, likewise, have to extracted from data, with resulting values of order (qq)1 /(qq)0 0.075
0.15.
With p2 and m2 now fixed, the final step is to select the fraction, z, of the fragmenting
endpoint quarks longitudinal momentum that is carried by the created hadron. In this re-
25
The four L = 1 multiplets are implemented in P YTHIA, but are disabled by default, largely because several
states are poorly known and thus may result in a worse overall description when included.
55
P. Skands Introduction to QCD
0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
b = 1, m = 1 GeV a = 0.5, m = 1 GeV
Figure 31: Normalized Lund symmetric fragmentation function, for fixed m = 1 GeV. Left:
variation of the a parameter, from 0.1 (blue) to 0.9 (red), with fixed b = 1 GeV2 . Right:
variation of the b parameter, from 0.5 (red) to 2 (blue) GeV2 , with fixed a = 0.5.
spect, the string picture is substantially more predictive than for the flavour selection. Firstly,
the requirement that the fragmentation be independent of the sequence in which breakups are
considered (causality) imposes a left-right symmetry on the possible form of the fragmenta-
tion function, f (z), with the solution
1 a b (m2h + p2h )
f (z) (1 z) exp , (85)
z z
which is known as the Lund symmetric fragmentation function (normalized to unit integral).
The a and b parameters, illustrated in figure 31, are the only free parameters of the fragmen-
tation function, though a may in principle be flavour-dependent. Note that the explicit mass
dependence in f (z) implies a harder fragmentation function for heavier hadrons (in the rest
frame of the string).
For massive endpoints (e.g., c and b quarks), which do not move along straight lightcone
sections, the exponential suppression with string area leads to modifications of the form [148],
2
f (z) f (z)/z b mQ , with mQ the heavy-quark mass. Strictly speaking, this is the only fragmen-
tation function that is consistent with causality in the string model, though a few alternative
forms are typically provided as well.
As a by-product, the probability distribution in invariant time of q 0 q0 breakup vertices,
or equivalently = ( )2 , is also obtained, with dP/d a exp(b) implying an area law
for the colour flux [149], and the average breakup time lying along a hyperbola of constant
invariant time 0 1023 s [6].
We may also ask, e.g., how many units of rapidity does the particle production from a
string span? Measuring pz along the string direction and defining rapidity by
1 E + pz
y = ln , (86)
2 E pz
the absolute highest rapidity that can be reached, by a pion traveling exactly along the string
direction and taking all of the endpoint quarks energy, is ymax = ln(2Eq /m ). I.e., the rapidity
56
P. Skands Introduction to QCD
dd
s
s
...
Figure 32: Iterative selection of flavours and momenta in the Lund string-fragmentation
model.
region covered by a fragmenting string scales logarithmically with the energy, and since the
density of hadrons produced per unit rapidity is roughly constant (modulo endpoint effects),
the average number of hadrons produced by string fragmentation likewise scales logarithmi-
cally with energy.
The iterative selection of flavours, p , and z values is illustrated in figure 32. A parton
produced in a hard process at some high scale QUV emerges from the parton shower, at the
hadronization scale QIR , with 3-momentum p~ = (~ p0 , p+ ), where the + on the third com-
ponent denotes light-cone momentum, p = E pz . Next, an adjacent dd pair from the
vacuum is created, with relative transverse momenta p1 . The fragmenting quark combines
with the d from the breakup to form a + , which carries off a fraction z1 of the total lightcone
momentum p+ . The next hadron carries off a fraction z2 of the remaining momentum, etc.
In principle, higher multi-gap diffractive components may be defined as well, the most im-
portant one being central diffraction: hh h + gap + X + gap + h, see the discussion of
diffraction in section 5.2.1 below.
Some important differences exist between theoretical and experimental terminology[152].
In the experimental setting, diffraction is defined by an observable rapidity gap, with |y|gap >
57
P. Skands Introduction to QCD
3 typically giving clean diffractive samples. In the MC context, however, each diffractive pro-
cess type produces a whole spectrum of gaps, with small ones suppressed but not excluded.
Likewise, events of non-diffractive origin may produce accidental rapidity gaps, now with
large ones suppressed (exponentially) but not excluded, and in the transition region there
could even be quantum mechanical interference between the two. Due to this unphysical
model dependence of theoretical definitions of diffraction, we strongly advise to phrase mea-
surements first and foremost in terms of physical observables, and only seek to connect with
theory models as a second, separate, step.
The distinction between elastic and inelastic events is, however, unambiguous (modulo
pp pp events); the final state either contains just two protons, or not. The total hadron-
hadron cross section can therefore be written as a sum of these two physically distinguishable
components,
tot (s) = el (s) + inel (s) , (87)
where s = (pA + pB )2 is the beam-beam center-of-mass energy squared.
Another potentially confusing term is minimum bias (MB). This originates from the
experimental requirement of a minimal energy or number of hits in a given (experiment-
dependent) instrumented region near the beam, used to determine whether there was any
non-trivial activity in the event, or not. This represents the smallest possible trigger bias
that the corresponding experiment is capable of. There is thus no universal definition of min-
bias; each experiment has its own. We give a brief discussion of minimum bias in section 5.2.2
below.
Finally, in events containing a hard parton-parton interaction, the underlying event (UE)
can be roughly conceived of as the difference between QCD with and without including the
remnants of the original beam hadrons. Without such beam remnants, only the hard inter-
action itself, and its associated parton showers and hadronization, would contribute to the
observed particle production. In reality, after the partons that participate in the hard inter-
action have been taken out, the remnants still contain whatever is left of the incoming beam
hadrons, including also a partonic substructure, which leads to the possibility of multiple par-
ton interactions (MPI). We discuss MPI-based models of the UE in section 5.3 below. Other
useful reviews of MPI-based MC models can be found in [5, 137]. Analytical models are
mostly formulated only for double parton scattering, see e.g., [153, 154, 155, 156].
As mentioned above, if the beam particles A and/or B are not elementary, the inelastic final
states may be divided into diffractive and non-diffractive topologies. This is a qualitative
classification, usually based on whether the final state looks like the decay of an excitation
of the beam particles (diffractive26 ), or not (non-diffractive), or upon the presence of a large
rapidity gap somewhere in the final state which would separate such excitations.
Given that an event has been labeled as diffractive, either within the context of a theoreti-
cal model, or by a final-state observable, we may distinguish between three different classes of
26
An example of a process that would be labeled as diffractive would be if one the protons is excited to a +
which then decays back to p+ + 0 , without anything else happening in the event. In general, a whole tower of
possible diffractive excitations are available, which in the continuum limit can be described by a mass spectrum
falling roughly as dM 2 /M 2 .
58
P. Skands Introduction to QCD
59
P. Skands Introduction to QCD
as the so-called underlying event (a.k.a. the jet pedestal effect). Keep in mind, however, that
especially at low multiplicities, and when gaps are present, the contributions from diffractive
processes should not be ignored.
Due to the simple fact that hadrons are composite, multi-parton interactions (several dis-
tinct parton-parton interactions in one and the same hadron-hadron collision) will always be
there but how many, and how much additional energy and tracks do they deposit in a
given measurement region? The first detailed Monte Carlo model for perturbative MPI was
proposed by Sj ostrand in [157], and with some variation this still forms the basis for most
modern implementations [5].
The first crucial observation is that the t-channel propagators appearing in perturbative
QCD 2 2 scattering almost go on shell at low p , causing the differential cross sections to
become very large, behaving roughly as
dt dp2
d22 . (89)
t2 p4
At LHC energies, this parton-parton cross section becomes larger than the total hadron-hadron
cross section at p scales of order 4 5 GeV. This is illustrated in figure 33, in which we com-
pare the integrated QCD parton-parton cross section (with two different s and PDF choices)
to the total inelastic cross section measured by TOTEM [150], for pp collisions at s = 8 TeV.
In the context of MPI models, this is interpreted straightforwardly to mean that each hadron-
hadron collision contains several few-GeV parton-parton collisions.
In the limit that all the partonic interactions are independent and equivalent, one would
simply have a Poisson distribution in the number of MPI, with average
22 (pmin )
hni(pmin ) = , (90)
tot
with pmin a lower cutoff scale which we shall return to below, and tot a measure of the
inelastic hadron-hadron cross section. This simple reinterpretation in fact expresses unitarity;
instead of the total interaction cross section diverging as pmin 0 (which would violate
unitarity), we have restated the problem so that it is now the number of MPI per collision that
diverges, with the total cross section remaining finite.
Two important ingredients remain to fully regulate the remaining divergence. Firstly, the
interactions cannot use up more momentum than is available in the parent hadron. This
suppresses the large-n tail of the estimate above. In P YTHIA-based models, the MPI are or-
dered in p [157, 106], and the parton densities for each successive interaction are explicitly
constructed so that the sum of x fractions can never be greater than unity. In the H ERWIG
models [159, 160], instead the uncorrelated estimate of hni above is used as an initial guess,
but the generation of actual MPI is stopped once the energy-momentum conservation limit is
reached.
The second ingredient invoked to suppress the number of interactions, at low p and
x, is colour screening; if the wavelength 1/p of an exchanged coloured parton becomes
larger than a typical colour-anticolour separation distance, it will only see an average colour
charge that vanishes in the limit p 0, hence leading to suppressed interactions. This
provides an infrared cutoff for MPI similar to that provided by the hadronization scale for
parton showers. A first estimate of the colour-screening cutoff would be the proton size,
60
P. Skands Introduction to QCD
8 TeV pp
104
102
10
VINCIAROOT
1
Pythia 8.183
10-1
1.5
Ratio
0.5
00 5 10 15 20
p
Tmin
Figure 33: Proton-proton collisions at 8 TeV. LO QCD parton-parton cross section (integrated
above pT min , for two different s and PDF choices) compared to the total inelastic hadron-
hadron cross section. Towards the right of the plot, we see, as expected, that hard dijet events
is only a tiny fraction of the total cross section. The fact that the curves cross at a scale of
order 5 GeV is interpreted to mean that this is a characteristic scale relevant for MPI. [158].
0 5 10 15 20
pmin ~/rp 0.3 GeV QCD , but empirically this appears to be far too low. In current
models, one replaces the proton radius rp in the above formula by a typical colour screening
distance, i.e., an average size of a region within which the net compensation of a given
colour charge occurs. This number is not known from first principles, though it may be related
to saturation [161]. In current MPI models, it is perceived of simply as an effective cutoff
parameter, to be determined from data.
Note that the partonic cross sections depend upon the PDF set used, and therefore the
optimal value to use for the cutoff will also depend on this choice [162]. Note also that the
cutoff does not have to be energy-independent. Higher energies imply that parton densities
can be probed at smaller x values, where the number of partons rapidly increases. Partons
then become closer packed and the colour-screening distance d decreases. The uncertainty
on the scaling of the cutoff is a major concern when extrapolating between different collider
energies [163, 162, 164].
We now turn to the origin of the observational fact that hard jets appear to sit on top
of a higher pedestal of underlying activity than events with no hard jets. That is, the so-
called underlying event (UE) is much more active, with larger fluctuations, than the average
min-bias event. This is interpreted as a consequence of impact-parameter-dependence: in pe-
ripheral collisions, only a small fraction of events contain any high-p activity, whereas central
collisions are more likely to contain at least one hard scattering; a high-p triggered sample
61
P. Skands Introduction to QCD
will therefore be biased towards small impact parameters, b, with a large number of MPI (and
associated increased activity). The ability of a model to describe the shape of the pedestal
(e.g. to describe both MB and UE distributions simultaneously) therefore depends upon its
modeling of the b-dependence, and correspondingly the impact-parameter shape constitutes
another main tuning parameter. A detailed discussion of impact-parameter dependent models
goes beyond the scope of these lectures; see [157, 147, 165].
For hard processes at the LHC atp13 TeV, the transverse energy, ET , in the UE is expected
to be about 3.3 GeV per unit R = 2 + 2 area [164] (for a reference case of 100-GeV
dijets, including both charged and neutral particles, with no cut on p ), though with large
event-to-event fluctuations of order 2 GeV [166]. Thus, for example, the total ET originating
from the UE, in a cone with radius 0.4 can be estimated to be ET UE (R = 0.4) 1.6 1 GeV,
while the ET in a cone with radius 1.0 would be ET UE (R = 1.0) 10 6 GeV. Note that the
magnetic field in realistic detectors will deflect some fraction of the soft charged component
of the underlying event into helix trajectories that will hence not contribute to the energy
deposition in the calorimeters.
5.4 Tuning
A main virtue of general-purpose Monte Carlo event generators is their ability to provide a
complete and fully differential picture of collider final states, down to the level of individual
particles. As has been emphasized in these lectures, the achievable accuracy depends both
on the inclusiveness of the chosen observable and on the sophistication of the simulation it-
self. An important driver for the latter is obviously the development of improved theoretical
models, e.g., by including matching to higher-order matrix elements, more accurate resum-
mations, or better non-perturbative models, as discussed in the previous sections; but it also
depends crucially on the available constraints on the remaining free parameters of the model.
Using existing data (or more precise calculations) to constrain these is referred to as generator
tuning.
Keep in mind that generators attempt to deliver a global description of the data; a tune
is no good if it fits one distribution perfectly, but not any others. It is therefore crucial to
study the simultaneous degree of agreement or disagreement over many, mutually comple-
mentary, distributions. A useful online resource for making such comparisons can be found
at the MCPLOTS web site[167] (which relies on computing power donated by volunteers, via
the LHC@home project [168]). The analyses come from the comprehensive R IVET analysis
toolkit [90], which can also be run stand-alone to make your own MC tests and comparisons.
Although MC models may appear to have a bewildering number of independently ad-
justable parameters, it is worth noting that most of these only control relatively small (exclu-
sive) details of the event generation. The majority of the (inclusive) physics is determined
by only a few, very important ones, such as the value of the strong coupling, in the per-
turbative domain, and the form of the fragmentation function for massless partons, in the
non-perturbative one.
Armed with a good understanding of the underlying model, an expert would therefore
normally take a highly factorized approach to constraining the parameters, first constraining
the perturbative ones (using IR safe observables and/or more precise theory calculations) and
thereafter the non-perturbative ones, each ordered in a measure of their relative significance
62
P. Skands Introduction to QCD
to the overall modeling. This allows one to concentrate on just a few parameters and a few
carefully chosen distributions at a time, reducing the full parameter space to manageable-
sized chunks. Still, each step will often involve more than one single parameter, and non-
factorizable correlations may still necessitate additional iterations from the beginning before
a fully satisfactory set of parameters is obtained.
Recent years have seen the emergence of automated tools that attempt to reduce the
amount of both computer and manpower required for this task, for instance by making full
generator runs only for a limited set of parameter points, and then interpolating between
these to obtain approximations to what the true generator result would have been for any
intermediate parameter point, as, e.g., in P ROFESSOR [169]. Automating the human expert
input is more difficult. Currently, this is addressed by a combination of input solicited from
the generator authors (e.g., which parameters and ranges to consider, which observables con-
stitute a complete set, etc) and the elaborate construction of non-trivial weighting functions
that determine how much weight is assigned to each individual bin in each distribution. The
field is still burgeoning, and future sophistications are to be expected. Nevertheless, at this
point the overall quality of the tunes obtained with automated methods appear to at least be
competitive with the manual ones.
However, though we have very good LHC tunes for essentially all the general-purpose gen-
erators by now, there are two important aspects which have so far been neglected, and which
it is becoming increasingly urgent to address. The first is that a central tune is not really worth
much, unless you know what the uncertainty on it is. A few individual proposals for systematic
tuning variations have been made [163, 170], but so far there is no general approach for es-
tablishing MC uncertainties by tune variations. The second issue is that virtually all generator
tuning is done at the pure LL shower level, and not much is known about what happens to
the tuning when matrix-element matching is subsequently included.
Finally, rather than performing one global tune to all the data, as is usually done, a more
systematic check on the validity of the underlying physics model could be obtained by instead
performing several independent optimizations of the model parameters for a range of different
phase-space windows and/or collider environments. In regions in which consistent parameter
sets are obtained (with reasonable 2 values), the underlying model can be considered as
interpolating well, i.e., it is universal. If not, a breakdown in the ability of the model to span
different physical regimes has been identified, and can be addressed, with the nature of the
deviations giving clues as to the nature of the breakdown. With the advent of automated tools,
such systematic studies are now becoming feasible, with a first example given in [162].
We round off by giving a sketch of a reasonably complete tuning procedure, without going
into details about the parameters that control each of these sectors in individual Monte Carlo
models (a recent detailed discussion in the context of PYTHIA 8 can be found in [158]):
1) Keep in mind that inabilities of models to describe data is a vital part of the feedback cy-
cle between theory and experiment. Also keep in mind that perturbation theory at (N)LO+LL
is doing very well if it gets within 10% of a given IR safe measurement. An agreement of 5%
should be considered the absolute sanity limit, beyond which it does not make any sense to
tune further. For some quantities, e.g., ones for which the underlying modeling is known to be
poor, an order-of-magnitude agreement or worse may have to be accepted.
2) Final-State Radiation and Hadronization: mainly using LEP and other e+ e collider
data. On the IR safe side, there are event shapes and jet observables. On the IR sensitive
63
P. Skands Introduction to QCD
side, multiplicities and particle spectra. Pay attention to the high-z tail of the fragmentation,
where a single hadron carries a large fraction of an entire jets momentum (most likely to give
fakes). Depending on the focus of the tuning, attention should also be paid to identified-
particle rates and ratios (perhaps with a nod to hadron-collider measurements), and to frag-
mentation in events containing heavy quarks and/or gluon jets. Usually, more weight is given
to those particles that are most copiously produced. The scaling properties of IR safe vs.
IR sensitive contributions can be tested by comparing data at several different e+ e collider
energies.
3) Initial-State Radiation, and Primordial28 kT : the main constraining distribution
is the dilepton p distribution in Drell-Yan events in hadron-hadron collisions. Ideally, one
would like to use several different Q2 values, and/or complementary processes, like p (dijet)
or p (tt). For any observables containing explicit jets, be aware that the UE can produce
small horizontal shifts in jet p distributions, which may in turn result in larger-than-expected
vertical changes if the distributions are falling sharply. Also note that the ISR evolution is
sensitive to the choice of PDFs.
4) Initial-Final Connections: (radiation from colour lines connecting the initial and final
states): DIS and jet broadening in hadron collisions. This is one of the most poorly controlled
parts of most MC models, highly sensitive to the treatment of coherence (see e.g. [171] for
illustrations). Keep in mind that it is not directly constrained by pure final-state observables,
such as LEP fragmentation, nor by pure initial-state observables, such as the Drell-Yan p
spectrum, which is why we list it as a separate item here. The modeling of this aspect can have
important effects on specific observables, a recent example being the tt forward-backward
asymmetry at the Tevatron [172].
5) Underlying Event: Good constraints on the overall level of the underlying event can be
obtained by counting the summed transverse energy (more IR safe) and/or particle multiplici-
ties and average transverse momenta (more IR sensitive) in regions transverse to a hard trigger
jet (more IR safe) or particle (more IR sensitive), see e.g. [173]. Constraints on the fluctua-
tions of the underlying event are also important, and can be obtained, e.g., by comparing to
measurements of the RMS of such distributions [166]. Again, note that the UE is sensitive to
the choice of PDFs [162].
6) Colour (Re-)Connections and other Final-State Interactions: By Final-State Inter-
actions, we intend a broad spectrum of possible collective effects that may be included to a
greater or lesser extent in various models. These effects include: Bose-Einstein correlations
(see, e.g., [174]), rescattering (see, e.g., [175]), colour reconnections / string interactions
(see, e.g., [176, 177, 178]), hydrodynamics (see, e.g., [179]), etc. As a rule, these effects
are soft and/or non-perturbative and hence should not modify hard IR safe observables ap-
preciably. They can, however, have drastic effects on IR sensitive ones, such as particle multi-
plicities, momentum distributions, and correlations, wherefore useful constraints are typically
furnished by measurements of spectra and correlations as functions of quantities believed to
serve as indicators of the strength of these phenomena (such as event multiplicity), and/or
by collective-flow-type measurements. Finally, if the model includes a universal description of
underlying event and soft-inclusive QCD, as many MPI-based models do, then minimum-bias
data can also be used as a control sample, though one must then be careful either to address
28
Primordial kT : an additional soft p component that is injected on top of the p generated by the initial-state
shower itself, see [5, Section 7.1].
64
P. Skands Introduction to QCD
Acknowledgments
Thanks to C. Brust, M. Cacciari, Y. Dokshitzer, L. Hartgring, A. Kronfeld, A. Larkoski, J. Lopez-
Villarejo, C. Murphy, P. Nason, P. Pigard, S. Prestel, G. Salam, and T. Sj
ostrand whose valuable
comments and sharing of insight contributed to these lectures. In addition, I have used ma-
terial from my 2010 ESHEP lectures [137] and 2014 AEPSHEP lectures, and from the ESHEP
lectures by Mangano [180], by Salam [4, 181], by Sj ostrand [40], and by Stirling [182], as
well as the recent review on Monte Carlo event generators by the MCnet collaboration [5].
References
[1] M. E. Peskin and D. V. Schroeder, An Introduction to quantum field theory.
Addison-Wesley, 1995.
[2] R. Ellis, W. Stirling, and B. Webber, QCD and collider physics. Camb. Monogr. Part.
Phys. Nucl. Phys. Cosmol., 1996.
[4] G. P. Salam, Elements of QCD for hadron colliders, 1011.5131. In Proceedings of the
17th European school on high-energy physics (ESHEP), Bautzen, Germany, 14 - 27
June, 2009, arXiv:1012.4643.
[6] B. Andersson, The Lund model. Camb. Monogr. Part. Phys. Nucl. Phys. Cosmol., 1997.
[7] F. James, Monte Carlo theory and practice, Rept. Prog. Phys. 43 (1980) 1145.
[9] C. Chedester, P. Isaacs, A. Sachs, and J. Steinberger, Total cross sections of pi-mesons
on protons and several other nuclei, Phys.Rev. 82 (1951) 958959.
65
P. Skands Introduction to QCD
[10] E. Fermi, H. Anderson, A. Lundby, D. Nagle, and G. Yodh, Ordinary and Exchange
Scattering of Negative Pions by Hydrogen, Phys.Rev. 85 (1952) 935936.
[11] H. Anderson, E. Fermi, E. Long, and D. Nagle, Total Cross-sections of Positive Pions in
Hydrogen, Phys.Rev. 85 (1952) 936.
[12] H. Anderson, E. Fermi, D. Nagle, and G. Yodh, Deuterium Total Cross Sections for
Positive and Negative Pions, Phys.Rev. 86 (1952) 413413.
[13] D. E. Nagle, The Delta: The First Pion Nucleon Resonance: Its Discovery and Applications.
1984. LALP84-27, DE84 017107, based on a lecture given at the University of
Chicago Symposium in honor of H.L. Anderson, May 11, 1982.
[14] O. Greenberg, Spin and Unitary Spin Independence in a Paraquark Model of Baryons
and Mesons, Phys.Rev.Lett. 13 (1964) 598602.
[15] M. Han and Y. Nambu, Three Triplet Model with Double SU(3) Symmetry, Phys.Rev.
139 (1965) B1006B1010.
[16] S. Drell and T.-M. Yan, Massive Lepton Pair Production in Hadron-Hadron Collisions
at High-Energies, Phys.Rev.Lett. 25 (1970) 316320.
[17] T. Plehn, D. Rainwater, and P. Z. Skands, Squark and gluino production with jets,
Phys.Lett. B645 (2007) 217221, hep-ph/0510144.
[18] J. Alwall, S. de Visscher, and F. Maltoni, QCD radiation in the production of heavy
colored particles at the LHC, JHEP 0902 (2009) 017, 0810.5350.
[19] A. Papaefstathiou and B. Webber, Effects of QCD radiation on inclusive variables for
determining the scale of new physics at hadron colliders, JHEP 06 (2009) 069,
0903.2013.
[20] D. Krohn, L. Randall, and L.-T. Wang, On the Feasibility and Utility of ISR Tagging,
1101.0810.
[21] C. K. Vermilion, Jet Substructure at the Large Hadron Collider: Harder, Better, Faster,
Stronger, 1101.1335.
[23] Particle Data Group Collaboration, J. Beringer et al., Review of particle physics,
Phys. Rev. D86 (2012) 010001.
[26] S. Bethke, The 2009 World Average of alpha(s), Eur.Phys.J. C64 (2009) 689703,
0908.1135.
66
P. Skands Introduction to QCD
[28] A. Young, Quantitative substitutional analysis I, Proc. London Math. Soc. 33 (1901)
97.
[29] B. Sagan, The ubiquitous young tableaux. in Invariant Theory and Tableaux,
ed. D. Stanton, Springer Verlag, 1990, p.262.
http://www.mth.msu.edu/users/sagan/Papers/Old/uyt.pdf.
[31] A. Pukhov, Calchep 2.3: MSSM, structure functions, event generation, 1, and
generation of matrix elements for other packages, hep-ph/0412191.
[32] CompHEP Collaboration, E. Boos et al., CompHEP 4.4: Automatic computations from
Lagrangians to events, Nucl.Instrum.Meth. A534 (2004) 250259, hep-ph/0403113.
[34] F. Krauss, R. Kuhn, and G. Soff, AMEGIC++ 1.0: A Matrix element generator in
C++, JHEP 0202 (2002) 044, hep-ph/0109036.
[35] M. Moretti, T. Ohl, and J. Reuter, OMega: An Optimizing matrix element generator,
hep-ph/0102195.
[38] M. B
ahr, S. Gieseke, M. Gigg, D. Grellscheid, K. Hamilton, et al., Herwig++ Physics
and Manual, Eur.Phys.J. C58 (2008) 639707, 0803.0883.
[39] T. Gleisberg and S. Hoeche, Comix, a new matrix element generator, JHEP 0812
(2008) 039, 0808.3674.
[40] T. Sj
ostrand, Monte Carlo Generators, hep-ph/0611247.
[41] J. C. Collins and D. E. Soper, Parton Distribution and Decay Functions, Nucl.Phys.
B194 (1982) 445.
[43] CTEQ Collaboration, R. Brock et al., Handbook of perturbative QCD: Version 1.1,
Rev.Mod.Phys. 67 (1995) 157248.
67
P. Skands Introduction to QCD
[46] G. Altarelli and G. Parisi, Asymptotic freedom in parton language, Nucl. Phys. B126
(1977) 298318.
[47] V. N. Gribov and L. N. Lipatov, Deep inelastic e-p scattering in perturbation theory,
Sov. J. Nucl. Phys. 15 (1972) 438.
[48] Y. L. Dokshitzer, Calculation of the structure functions for deep inelastic scattering
and e+ e annihilation by perturbation theory in quantum chromodynamics, Sov.
Phys. JETP 46 (1977) 641653.
[50] A. Cooper-Sarkar, What did HERA teach us about the structure of the proton?,
1206.0894.
[51] S. Alekhin, K. Melnikov, and F. Petriello, Fixed target Drell-Yan data and NNLO QCD
fits of parton distribution functions, Phys.Rev. D74 (2006) 054033, hep-ph/0606237.
[52] E. de Oliveira, A. Martin, and M. Ryskin, Drell-Yan as a probe of small x partons at the
LHC, 1205.6108.
[53] S. Alekhin, S. Alioli, R. D. Ball, V. Bertone, J. Blumlein, et al., The PDF4LHC Working
Group Interim Report, 1101.0536.
[54] A. Buckley and M. Whalley, HepData reloaded: reinventing the HEP data archive,
1006.0517. http://hepdata.cedar.ac.uk.
[55] A. Martin, W. Stirling, R. Thorne, and G. Watt, Parton distributions for the LHC,
Eur.Phys.J. C63 (2009) 189285, 0901.0002.
[57] G. Watt and R. Thorne, Study of Monte Carlo approach to experimental uncertainty
propagation with MSTW 2008 PDFs, JHEP 1208 (2012) 052, 1205.4024.
[58] M. Bengtsson, T. Sjostrand, and M. van Zijl, Initial State Radiation Effects on W and
Jet Production, Z. Phys. C32 (1986) 67.
[60] NLO Multileg Working Group Collaboration, Z. Bern et al., The NLO multileg
working group: Summary report, 0803.0494.
68
P. Skands Introduction to QCD
[62] T. D. Lee and M. Nauenberg, Degenerate Systems and Mass Singularities, Phys. Rev.
133 (1964) B1549B1562.
[63] S. Catani and M. H. Seymour, A general algorithm for calculating jet cross sections in
NLO QCD, Nucl. Phys. B485 (1997) 291419, hep-ph/9605323.
[64] S. Catani and M. H. Seymour, The Dipole Formalism for the Calculation of QCD Jet
Cross Sections at Next-to-Leading Order, Phys. Lett. B378 (1996) 287301,
hep-ph/9602277.
[65] Z. Nagy, Next-to-leading order calculation of three jet observables in hadron hadron
collision, Phys.Rev. D68 (2003) 094002, hep-ph/0307268.
[74] S. Catani and M. Grazzini, An NNLO subtraction formalism in hadron collisions and
its application to Higgs boson production at the LHC, Phys.Rev.Lett. 98 (2007)
222002, hep-ph/0703012.
[75] G. Gustafson and U. Pettersson, Dipole formulation of QCD cascades, Nucl. Phys.
B306 (1988) 746.
[76] A. Gehrmann-De Ridder and M. Ritzmann, NLO Antenna Subtraction with Massive
Fermions, JHEP 0907 (2009) 041, 0904.3297.
69
P. Skands Introduction to QCD
[85] M. Cacciari, G. P. Salam, and G. Soyez, The Anti-k(t) jet clustering algorithm, JHEP
0804 (2008) 063, 0802.1189.
[88] E. Boos et al., Generic user process interface for event generators, hep-ph/0109068.
[89] J. Alwall et al., A standard format for Les Houches Event Files, Comput. Phys.
Commun. 176 (2007) 300304, hep-ph/0609017.
[92] M. Dobbs and J. B. Hansen, The HepMC C++ Monte Carlo event record for High
Energy Physics, Comput. Phys. Commun. 134 (2001) 4146.
[93] T. Sj
ostrand, Monte Carlo Tools, 0911.5286.
70
P. Skands Introduction to QCD
[97] P. D. Draggiotis, A. van Hameren, and R. Kleiss, SARGE: An Algorithm for generating
QCD antennas, Phys.Lett. B483 (2000) 124130, hep-ph/0004047.
[98] G. Marchesini and B. Webber, Simulation of QCD Jets Including Soft Gluon
Interference, Nucl.Phys. B238 (1984) 1.
[99] G. Marchesini and B. R. Webber, Monte Carlo Simulation of General Hard Processes
with Coherent QCD Radiation, Nucl. Phys. B310 (1988) 461.
[100] S. Gieseke, P. Stephens, and B. Webber, New formalism for QCD parton showers,
JHEP 12 (2003) 045, hep-ph/0310083.
[102] L. L
onnblad, Ariadne version 4: A program for simulation of QCD cascades
implementing the colour dipole model, Comput. Phys. Commun. 71 (1992) 1531.
[103] Z. Nagy and D. E. Soper, Matching parton showers to NLO computations, JHEP 10
(2005) 024, hep-ph/0503053.
[106] T. Sj
ostrand and P. Z. Skands, Transverse-momentum-ordered showers and
interleaved multiple interactions, Eur. Phys. J. C39 (2005) 129154,
hep-ph/0408302.
[109] I. Knowles, Spin Correlations in Parton - Parton Scattering, Nucl.Phys. B310 (1988)
571.
[110] P. Richardson, Spin correlations in Monte Carlo simulations, JHEP 11 (2001) 029,
hep-ph/0110108.
[112] M. A. Gigg and P. Richardson, Simulation of Finite Width Effects in Physics Beyond
the Standard Model, 0805.3037.
71
P. Skands Introduction to QCD
[113] M. H. Seymour, The Higgs boson line shape and perturbative unitarity, Phys.Lett.
B354 (1995) 409414, hep-ph/9505211.
[114] T. Stelzer and S. Willenbrock, Spin correlation in top quark production at hadron
colliders, Phys.Lett. B374 (1996) 169172, hep-ph/9512292.
[115] S. J. Parke and Y. Shadmi, Spin correlations in top quark pair production at e+ e
colliders, Phys.Lett. B387 (1996) 199206, hep-ph/9606419.
[117] G. Corcella et al., HERWIG 6: an event generator for hadron emission reactions with
interfering gluons (including supersymmetric processes), JHEP 01 (2001) 010,
hep-ph/0011363.
[120] S. Catani, F. Krauss, R. Kuhn, and B. R. Webber, QCD matrix elements + parton
showers, JHEP 11 (2001) 063, hep-ph/0109231.
[121] L. L
onnblad, Correcting the colour-dipole cascade model with fixed order matrix
elements, JHEP 05 (2002) 046, hep-ph/0112284.
[124] S. Mrenna and P. Richardson, Matching matrix elements and parton showers with
H ERWIG and P YTHIA, JHEP 05 (2004) 040, hep-ph/0312274.
[127] S. Frixione and B. R. Webber, Matching NLO QCD computations and parton shower
simulations, JHEP 06 (2002) 029, hep-ph/0204244.
[128] S. Frixione, P. Nason, and B. R. Webber, Matching NLO QCD and parton showers in
heavy flavour production, JHEP 08 (2003) 007, hep-ph/0305252.
72
P. Skands Introduction to QCD
[129] S. Frixione and B. R. Webber, The MC@NLO 3.4 Event Generator, 0812.0770.
[130] S. Frixione, P. Nason, and C. Oleari, Matching NLO QCD computations with parton
shower simulations: the POWHEG method, JHEP 11 (2007) 070, 0709.2092.
[131] K. Hamilton and P. Nason, Improving NLO-parton shower matched simulations with
higher order matrix elements, JHEP 06 (2010) 039, 1004.1764.
ostrand, S. Mrenna, and P. Skands, PYTHIA 6.4 physics and manual, JHEP 05
[133] T. Sj
(2006) 026, hep-ph/0603175.
[134] T. Sj
ostrand, S. Ask, J. R. Christiansen, R. Corke, N. Desai, et al., An Introduction to
PYTHIA 8.2, 1410.3012.
[135] S. Alioli, P. Nason, C. Oleari, and E. Re, A general framework for implementing NLO
calculations in shower Monte Carlo programs: the POWHEG BOX, JHEP 06 (2010)
043, 1002.2581.
[136] L. Hartgring, E. Laenen, and P. Skands, Antenna Showers with One-Loop Matrix
Elements, JHEP 1310 (2013) 127, 1303.4974.
[137] P. Z. Skands, QCD for Collider Physics, 1104.2863. In 18th European school on
high-energy physics (ESHEP), Raseborg, Finland, 20 June - 3 July, 2010,
arXiv:1202.1629.
[138] G. Bali and K. Schilling, Static quark - anti-quark potential: Scaling behavior and
finite size effects in SU(3) lattice gauge theory, Phys.Rev. D46 (1992) 26362646.
[141] X. Artru and G. Mennessier, String model and multiproduction, Nucl. Phys. B70
(1974) 93115.
[142] M. Travis, T. E. Ford, F. Laine, E. Presley, J. Cash, et al., Sixteen Tons. Originally
recorded for Folk Songs of the Hills, Capitol Records, 1946. Capitol T-891.
[145] P. Eden and G. Gustafson, Baryon production in the string fragmentation picture,
Z.Phys. C75 (1997) 4149, hep-ph/9606454.
[146] T. Sj
ostrand and P. Z. Skands, Baryon number violation and string topologies, Nucl.
Phys. B659 (2003) 243, hep-ph/0212264.
73
P. Skands Introduction to QCD
[147] T. Sj
ostrand and P. Z. Skands, Multiple interactions and the structure of beam
remnants, JHEP 03 (2004) 053, hep-ph/0402078.
[148] M. G. Bowler, e+ e production of heavy quarks in the string model, Z. Phys. C11
(1981) 169.
[151] A. Donnachie and P. Landshoff, pp and pp total cross sections and elastic scattering,
Phys.Lett. B727 (2013) 500505, 1309.1292.
[152] V. Khoze, F. Krauss, A. Martin, M. Ryskin, and K. Zapp, Diffraction and correlations at
the LHC: Definitions and observables, Eur.Phys.J. C69 (2010) 8593, 1005.4839.
[153] B. Blok, Y. Dokshitzer, L. Frankfurt, and M. Strikman, The Four jet production at LHC
and Tevatron in QCD, Phys.Rev. D83 (2011) 071501, 1009.2714.
[155] J. R. Gaunt and W. J. Stirling, Single and Double Perturbative Splitting Diagrams in
Double Parton Scattering, 1202.3056.
[157] T. Sj
ostrand and M. van Zijl, A multiple-interaction model for the event structure in
hadron collisions, Phys. Rev. D36 (1987) 2019.
[158] P. Skands, S. Carrazza, and J. Rojo, Tuning PYTHIA 8.1: the Monash 2013 Tune,
1404.5630.
[160] M. B
ahr, J. M. Butterworth, S. Gieseke, and M. H. Seymour, Soft interactions in
Herwig++, 0905.4671.
[161] M. Ryskin, A. Martin, and V. Khoze, High-energy strong interactions: from hard to
soft, Eur.Phys.J. C71 (2011) 1617, 1102.2844.
[162] H. Schulz and P. Skands, Energy Scaling of Minimum-Bias Tunes, Eur.Phys.J. C71
(2011) 1644, 1103.3649.
[163] P. Z. Skands, Tuning Monte Carlo Generators: The Perugia Tunes, Phys.Rev. D82
(2010) 074018, 1005.3457.
74
P. Skands Introduction to QCD
[169] A. Buckley, H. Hoeth, H. Lacker, H. Schulz, and J. E. von Seggern, Systematic event
generator tuning for the LHC, Eur. Phys. J. C65 (2010) 331357, 0907.2973.
[171] M. Ritzmann, D. Kosower, and P. Skands, Antenna Showers with Hadronic Initial
States, Phys.Lett. B718 (2013) 13451350, 1210.6345.
[172] P. Skands, B. Webber, and J. Winter, QCD Coherence and the Top Quark Asymmetry,
JHEP 1207 (2012) 151, 1205.1466.
[173] R. Field, Min-Bias and the Underlying Event at the LHC, Acta Phys.Polon. B42 (2011)
26312656, 1110.5530.
[174] L. L
onnblad and T. Sj
ostrand, Modelling Bose-Einstein correlations at LEP 2, Eur.
Phys. J. C2 (1998) 165180, hep-ph/9711460.
[176] J. Rathsman, A generalised area law for hadronic string reinteractions, Phys. Lett.
B452 (1999) 364371, hep-ph/9812423.
[177] P. Z. Skands and D. Wicke, Non-perturbative QCD effects and the top mass at the
Tevatron, Eur.Phys.J. C52 (2007) 133140, hep-ph/0703081.
75
P. Skands Introduction to QCD
[180] N. Ellis, R. Fleischer, et al., Proceedings of the 2008 European School of High-Energy
Physics (ESHEP), Herbeumont-sur-Semois, Belgium, 8 - 21 June 2008,.
CERN-2009-002.
[182] N. Ellis, R. Fleischer, et al., 2007 European School of High-Energy Physics, Trest,
Czech Republic, 19 August - 1 September 2007,. CERN-2008-007.
76
Index
Adjoint representation, 5 Diquarks, 15, 55
s , 5, 1014 DIS, 7, 18, 20, 64
beta function, 10 Double counting, 45
QCD , 13, 14 Drell-Yan, 7, 20
Landau Pole, see QCD
Running coupling, 1014 Eightfold way, 4
Altarelli-Parisi, see DGLAP Eikonal, 28, 29, 36
Angular ordering, 32, 37, 42 Elastic scattering, 57
Antenna functions, see Antennae Event evolution, see Evolution
Antennae, 28, 29, 36, 43 Event generators, see Monte Carlo
Anti-kT algorithm, see Jets Event shapes, 30
Area law, 56 Evolution, 4042
Asymptotic freedom, 12, 13 Exclusive cross sections, 40, 41, 45
77
P. Skands Introduction to QCD
78
P. Skands Introduction to QCD
TR , 8
Traces in SU(3), see SU(3)
Transcendentality, 39
Transverse-momentum-ordering, see p -ordering
Tuning, see Monte Carlo
Tunneling, 54
Twist, 20
Uncertainties
Factorization, 19, 20
Monte Carlo statistics, 33
PDF sets, 21
Underlying event, 31, 51, 57
Unitarity, 25, 27, 40
Young tableaux, 16
79