Susy
Susy
Supersymmetry
Kevin Zhou
[email protected]
These notes cover supersymmetry, closely following the Part III and MMathPhys Supersymmetry
courses as lectured in 2017/2018 and 2018/2019, respectively. The primary sources were:
• Aitchison, Supersymmetry in Particle Physics. A friendly introductory book that covers the
basics with a minimum of formalism; for instance, the Wess–Zumino model is introduced
without using superfields. Also gives an extensive treatment of subtleties in two-component
spinor notation. The last half covers the phenomenology of the MSSM.
• Wess and Bagger, Supersymmetry and Supergravity. An incredibly terse book that serves as
a useful reference. Most modern sources follow the conventions set here. Many pages consist
entirely of equations, with no words.
We will use the conventions of Quevedo’s lecture notes. As such, the metric is mostly negative,
and a few other signs are flipped with respect to Wess and Bagger’s conventions. The most recent
version is here; please report any errors found to [email protected].
2 Contents
Contents
1 Introduction 3
1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2 The Poincare Group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.3 Spinors in Four Dimensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.4 Supersymmetric Quantum Mechanics . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.5 Spinors in Various Dimensions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4 Supersymmetric Lagrangians 41
4.1 N = 1 Supersymmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
4.2 The Wess–Zumino Model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
4.3 Non-Renormalization Theorems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
4.4 Extended Supersymmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
5 The MSSM 51
5.1 SUSY Breaking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
5.2 Particles and Interactions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
3 1. Introduction
1 Introduction
1.1 Motivation
We begin with a review of the Standard Model and its problems.
xµ → x′µ (xν )
and include Poincare transformations in special relativity, and more generally, general coordinate
transformations in general relativity.
Φa (x) → M ab Φb (x).
• Symmetries constrain the interactions between fields. For example, most quantum field theories
of vector bosons are non-renormalizable, but gauge theories are renormalizable.
• The SM has Poincare symmetry and a gauge SU (3)C ×SU (2)L ×U (1)Y symmetry, spontaneously
broken to SU (3)C × U (1)A .
(Λ/Mp )4 ∼ 10−120 .
A related issue is how the 20 free parameters of the SM are determined. Finally, the SM does
not account for dark matter.
• In the 1960’s, much progress was made by classifying hadrons into multiplets, and there were
attempts to enlarge the symmetry groups by including spacetime symmetries.
• The Coleman–Mandula theorem (1967) states that spacetime and internal symmetries cannot
be combined nontrivially in a relativistic theory with nontrivial scattering, a mass gap, and
finitely many particles. More precisely, the symmetry group of the S-matrix must be a direct
product of the Poincare group and an internal symmetry group. (Conformal field theories
(CFTs) evade this theorem because they don’t have a mass gap, allowing the larger spacetime
symmetry group SO(2, d) in d spacetime dimensions.)
4 1. Introduction
• In 1971, Gelfand and Likhtman extended the Poincare algebra by adding generators that trans-
formed like spinors and satisfied anticommutation relations, thus inventing SUSY; this evaded
the Coleman–Mandula theorem because the symmetry was described by a Lie superalgebra
rather than a Lie algebra. Note that the spin-statistics theorem ensures that in all dimensions,
SUSY generators must be spinors.
• Simultaneously, Ramond, Neveu, and Schwarz found that string theory extended with fermions
was a two-dimensional supersymmetric theory on the worldsheet, inventing superstring theory.
The string worldsheet also has conformal symmetry, making it a superconformal field theory
(SCFT).
• In the 1970’s, neutrinos were thought to be massless. In 1973, Volkov and Akulov proposed
that neutrinos were Goldstone fermions, called Goldstinos, due to the spontaneous breaking of
SUSY.
• In 1974, Wess and Zumino wrote down the first example of an interacting four-dimensional quan-
tum field theory with linearly realized SUSY. Simultaneously, Salam and Strathdee invented
the tools of superfields and superspace, coining the term ‘supersymmetry’.
• In 1975, Haag, Lopuszanski, and Sohnius generalized the Coleman–Mandula theorem to essen-
tially state that the most general symmetry possible was a direct product of the super Poincare
group and internal symmetries.
• Making Poincare symmetry local yields general coordinate transformations and hence general
relativity. In 1976, Friedman, van Niewenhuizen, and Ferrara, and Deser and Zumino made
SUSY local, yielding supergravity. The superpartner of the graviton was the spin 3/2 gravitino.
• From 1977 to the 1980’s, SUSY phenomenology was developed. It was demonstrated that SUSY
could solve the hierarchy problem in a natural way, though this is less relevant today.
• Simultaneously, in 1977 Gliozzi, Scherk, and Olive demonstrated how to remove the tachyon
from the Ramond-Neveu-Schwarz model, and conjectured the resulting theory had spacetime
supersymmetry. From 1981 to 1984, Green and Schwarz proved this conjecture, discovering
an anomaly cancellation mechanism for superstring theory in d = 10 and starting the first
superstring revolution.
• In 1991, LEP performed precision tests of the SM. It was found that gauge coupling unification
did not occur for the SM, presenting problems for GUTs, but would happen for the MSSM as
long as superpartners had masses in the range 100 GeV to 10 TeV.
• In 1996, Strominger and Vafa counted the microstates of a black hole in superstring theory to
confirm the Bekenstein-Hawking formula S = A/4.
• In 1998, the AdS/CFT duality was proposed by Maldacena, showing that certain CFTs in d
dimensions are dual to quantum gravity theories in AdS space in d + 1 dimensions. The best
studied instances of the AdS/CFT duality involve SCFTs which are dual to superstring theories
in AdS, making supersymmetry a useful tool for studying quantum gravity.
5 1. Introduction
• The issue is not that µ is small, but that quantum effects give large corrections to µ. This
doesn’t happen for gauge boson masses, which are held at zero by gauge symmetry, or for spinor
masses, because chiral symmetry is restored when the mass vanishes. Then δm ∼ m log Λ,
which is reasonably small even when Λ is the Planck scale.
so if Λ is the Planck scale, µ2 must be fine-tuned to get an acceptable observed value of µ2phys ,
the coefficient of ϕ† ϕ in the 1PI effective action. Thus to avoid fine tuning there must be new
physics around the TeV scale.
• Another, more radical solution is to set the Planck scale to the TeV scale; this is consistent if
there are large extra dimensions. We’ll put these ideas aside and focus on SUSY.
• The one-loop contribution to δµ2 above can also be canceled by fermion contributions. Consider
a fermion with Yukawa coupling gf to the Higgs. Then
Z Λ
2 2 1
δµ ∼ −gf d4 k tr ∼ −4gf2 Λ2
(k/ − mf )2
where the minus sign comes from the fermion loop. Then the quadratic divergence cancels if,
for every boson, there is a fermion whose coupling to the Higgs is related; this is guaranteed by
SUSY.
δµ2 ∼ λ(MH
2
− m2f ) log Λ
where we dropped all numerical factors, which depends quadratically on the particle masses.
This is completely generic; we would even get a contribution of m2 from a particle of mass m
that didn’t couple directly to the Higgs, by a multi-loop diagram. Hence the parameter µ2 is
quadratically sensitive to any scale associated with new physics.
6 1. Introduction
• Thus to avoid fine tuning, new physics should arise at the TeV scale. Moreover, if that physics
is SUSY, the superpartner masses should generally be around the TeV scale. This is especially
important in SUSY GUTs, where there are many very heavy particles.
• In the MSSM, applying naturalness to the coefficients shows that the Higgs should be no heavier
than 140 GeV. By contrast, in the SM there is no constraint, unless we count perturbative
unitarity, which bounds the Higgs mass by a few hundred GeV.
Note. A cartoon explanation of the Coleman–Mandula theorem. Essentially, the theorem states
that conserved charges from internal symmetries can’t have Lorentz indices; the only such charges
are momentum Pµ and angular momentum Mµν which arise from spacetime symmetries. Suppose
we had another such charge Qµν . By Lorentz invariance,
Now consider a two-particle state. We suppose that Qµν values are additive, conserved, and act on
only one particle at a time. Then
p(1) + p(2) = p(3) + p(4) , p(1) (1) (2) (2) (3) (3) (4) (4)
µ pν + pµ pν = pµ pν + pµ pν .
However, these conditions are so restrictive that there are no nontrivial solutions! We can only have
forward or backwards scattering.
Note. A preview of the SUSY algebra. We will have a spinorial generator Qa with a = 1, 2,
which relates bosons and fermions; then the above argument fails at the first step, since we cannot
superpose the two. The Qa satisfy anticommutation relations among themselves, and commute
with H. Then we have
[{Qa , Qb }, H] = 0.
Now, {Qa , Qb } should be a “spin one’ object’, so in a relativistic field theory it should be a four-vector.
The only conserved four-vector is Pµ , so
{Qa , Qb } ∼ Pµ .
Hence supersymmetry transformations inevitably relate internal and spacetime symmetries. They
function as a kind of
√ “square root” of translations, and hence take us from ordinary space to
superspace like how −1 takes us from the real line to the complex plane. Note that the spin 1/2
generator Qa is the only exception to the Coleman–Mandula theorem, i.e. the Haag–Lopuszanski–
Sohnius rules out spin 3/2 generators Qµa , and so on.
• The Poincare group corresponds to the basic symmetries of special relativity. It acts on spacetime
coordinates by
xµ → x′µ = Λµν xµ + aµ
7 1. Introduction
We will focus on the proper orthochronous Lorentz group SO(1, 3)↑ , while the full Lorentz
group is O(3, 1) = {1, ΛP , ΛT , ΛP T } × SO(1, 3)↑ . Below we’ll just write SO(1, 3) for SO(1, 3)↑ .
• Infinitesimally, we have
Λµν = δνµ + ω µν , aµ = ϵµ
where ωµν = −ωνµ . If the Poincare group is represented by U (Λ, a) on a Hilbert space, then
infinitesimally we define
i
U (1 + ω, ϵ) = 1 − ωµν M µν + iϵµ P µ .
2
A useful explicit expression is
Note that we use the same notation for the abstract Poincare algebra elements and their
representations on a Hilbert space, since we will use the latter constantly. By the definition of
a representation, the commutator on the latter is the bracket on the former.
• We now find the Poincare algebra. Since translations commute in the Hilbert space, we have
[Pµ , Pν ] = 0.
[M µν , P σ ] = i(P µ η νσ − P ν η µσ )
[M µν , M ρσ ] = i(M µσ η νρ + M νρ η µσ − M µρ η νσ − M νσ η µρ ).
Hence we conclude so(3, 1) ∼ = su(2) ⊕ su(2) as complex Lie algebras. Note that J = A + B,
and under parity J → J and K → K, so A and B are interchanged. This leads to the usual
classification of representations of the Lorentz group.
However, if we use mostly positive signature, we instead must define σ µ = (−1, σ).
• This map is well-defined and surjective since the only constraint on the x̃ transformations is
Note. The topology of SL(2, C). To see it, use the polar decomposition
N = eH U
• We are using a redundant notation: the ψ and χ don’t matter, but dotted indices are associated
with bars. This is useful because we can then write expressions unambiguously without indices.
• For the matrices N , dotted indices always accompany a conjugate, so they’re redundant as we
always write the conjugate explicitly. We simply assign indices to N so that the indices match
up properly; note that the first index is always down.
• The invariant tensors in SL(2, C) are delta functions δβα and δβ̇α̇ and the Levi–Civita symbols
where the minus sign ensures ϵαβ ϵβγ = δγα . They are invariant because
ϵαβ → Nα ρ Nβ σ ϵρσ = ϵαβ det N = ϵαβ
with similar proofs for the others. Then the Levi–Civita can be used to invert matrices,
ϵσδ Nδ β ϵβα = (N −1 )ασ
• The Levi–Civitas can be used to raise or lower indices. This is a bit tricky because ϵαβ is not
symmetric; by convention we always contract the second index. We define
which obeys a similar transformation law; note that if we didn’t ‘swap the indices’ then the
matrix σ 2 would have the wrong sign.
• There are some useful identities for σ and σ. They form a Clifford algebra, as
(σ µ σ ν + σ ν σ µ )βα = 2η µν δαβ .
We may think of σαµα̇ as a set of Clebsch–Gordan coefficients for the identity (1/2, 0) × (0, 1/2) =
(1/2, 1/2). The completeness of both bases is expressed by
• Just as the Dirac spinor is built from the Clifford algebra of gamma matrices, we have
i i
(σ µν )βα = (σ µ σ ν − σ ν σ µ )βα , (σ µν )β̇α̇ = (σ µ σ ν − σ ν σ µ )α̇β̇ .
4 4
Then the matrices σ µν , and the matrices σ µν , satisfy the Lorentz algebra,
[σ µν , σ λρ ] = i(η µρ σ νλ + η νλ σ µρ − η µλ σ νρ − η νρ σ µλ ).
• The (σ µν )βα can also be used to project out the (1, 0) representation in the product
That is, Vµ Wν (σ µν )βα transforms in the (1, 0). On the other hand, we also know that (1, 0) is the
symmetric product of two (1/2, 0)’s, which implies (σ µν )βα is symmetric in α and β. Similarly,
(σ µν )β̇α̇ projects out (0, 1).
• One can show that the left-handed and right-handed spinors transform as
β
i µν α̇ β̇ i µν
ψα → exp − ωµν σ ψβ , χ → χ exp − ωµν σ .
2 α 2
In terms of the usual classification of Lorentz irreps we can show
1 i
ψα :(A, B) = (1/2, 0), Ji = σi , Ki = − σi ,
2 2
1 i
χα̇ :(A, B) = (0, 1/2), Ji = σi , Ki = σi .
2 2
11 1. Introduction
• The contraction of Weyl spinors requires an ordering convention because the Levi–Civita is
antisymmetric. As motivated below, we define
α̇
χψ ≡ χα ψα = −χα ψ α , χψ ≡ χα̇ ψ = −χα̇ ψ α̇ .
That is, indices contract ↘ for undotted indices and ↗ for dotted indices.
• In particular, we have
ψψ = ψ α ψα = ϵαβ ψβ ψα = ψ2 ψ1 − ψ1 ψ2 .
This appears to vanish classically, but since spinors are inherently anticommuting we choose to
represent them as Grassmann numbers classically. Then
1
ψψ = 2ψ2 ψ1 , ψα ψβ = ϵαβ (ψψ).
2
This also implies that contraction is symmetric, χψ = ψχ and χψ = ψχ, and
1 1
(θχ)(θξ) = − (θθ)(χξ), (θχ)(θξ) = − (θθ)(χξ)
2 2
• One can conjugate a representation by just conjugating the vectors. That is, we define
α̇
ψ α̇ = ψα† , ψ = ψ α†
where the dagger simply stands for complex conjugation. Complex conjugation is defined to
reverse the order of Grassmann numbers, (θ1 θ2 )∗ = θ2∗ θ1∗ , which implies
where we used ((σ µ )αβ̇ )∗ = ((σ µ )αβ̇ )T = (σ µ )β α̇ since the σ µ are Hermitian.
• Two-component spinor notation can be used to deal with tensor products of Lorentz represen-
tations. For example, we have the Fierz identity
1
ψα χα̇ = (ψσµ χ)σαµα̇ , (1/2, 0) × (0, 1/2) = (1/2, 1/2)
2
showing that a left-handed and right-handed spinor yield a vector ψσµ χ.
12 1. Introduction
θσ µ χ = −χσ µ θ, θσ µ σ ν χ = χσ ν σ µ θ
• A Dirac spinor Ψ is the direct sum of two Weyl spinors ψ and χ of opposite chirality,
ψα
Ψ= .
χα̇
Here the left-handed component is on top and the right-handed component is on the bottom.
Then charge conjugation simply exchanges χ and ψ. Majorana spinors have ψ = χ and hence
are mapped to themselves under charge conjugation.
A = A0 ⊕ A1 ,
whose two components are called the bosonic and fermionic subalgebras respectively, with a
bilinear multiplication operator so that
a0 a′0 ∈ A0 , a0 a1 ∈ A1 , a1 a′1 ∈ A0
[QI , E] = 0
which indicates that all the states in an irrep of the supersymmetry algebra (a supermultiplet)
are degenerate.
{QI , QJ } = 2Eδ IJ + Z IJ
where the central charge Z IJ is real and symmetric. Central charges are not elements of the
original Lie algebra, and are taken to commute with all elements of it. They arise naturally
when we allow for projective representations, as we encountered for the Galilean group in the
notes on Group Theory.
14 1. Introduction
One simple N = 1 supersymmetric model is a theory with D free 1D bosons X µ (t) and D free 1D
fermions ψ µ (t), all of vanishing mass.
• The Lagrangian is
1
L = Ẋµ Ẋ µ + iψµ ψ̇ µ
2
where the fermions ψ µ are Grassmann-valued. If the indices above are raised and lowered with
the Minkowski metric, then this system represents a relativistic spinning massless particle in
flat D-dimensional spacetime.
• The conjugate momenta are
ΠµX = Ẋ µ , Πµψ = iψ µ .
In canonical quantization, we hence have
[X µ , Ẋ ν ] = iη µν , {ψ µ , ψ ν } = η µν .
The latter result tells us the spin degrees of freedom match those of a Dirac spinor.
• The Lagrangian has a 1D N = 1 supersymmetry, acting on the fields as
δX = 2iϵψ, δψ = −ϵẊ
where ϵ is a Grassmann parameter. Under this transformation the Lagrangian changes by a
total derivative,
d
δL = iϵ ψµ Ẋ µ .
dt
Applying Noether’s theorem, we find the conserved charge and supersymmetry generator
Q = ψµ Ẋ µ .
• We can also confirm this relation at the field level, defining δ = ϵQ. Since we have
δ 2 X = −2iϵ2 Ẋ, δ 2 ψ = −2iϵ2 ψ̇
then we have {Q, Q} = 2E when acting on the fields.
• Upon quantization, the state space is a tensor product of spin and spatial degrees of freedom.
The operators ψ µ serve as spin raising and lowering operators, with the total dimensionality of
this space matching the degrees of freedom of a single Dirac spinor particle.
• The operators X µ and Ẋ µ act on L2 (RD−1 ). Their equations of motion have solutions
X µ (t) = X µ (0) + Ẋ µ (0)t, Ẋ µ (t) = Ẋ µ (0).
Algebraically, they behave like the usual operators in nonrelativistic quantum mechanics. That
is, while the fields X µ are formally massless, they represent the position of a nonrelativistic
particle with unit mass.
15 1. Introduction
• Like other symmetries, we may find the irreducible representations of the symmetry algebra;
the Hilbert space will then be built out of these “supermultiplets”.
√ Since [Q, H] = 0, we may
restrict to states of energy E. Defining the rescaling b = Q/ 2E, the N = 1 SUSY algebra is
{b, b} = 1, b = b† .
The irreducible representations are all one-dimensional, so this gives us little information.
Qi Qi
ai = √ , a†i = √
2 E 2 E
then we have n independent fermionic QHOs, and hence a supermultiplet with 2n states.
We will see much of this again in more detail for d = 4, though there will be additional complications,
such as spinor indices.
• We focus on so(1, d − 1), where the signature is mostly positive. We take the generators
Mµν = −Mνν to satisfy the algebra
[Mµν , Mρσ ] = i(ηµσ Mνρ + ηνρ Mµσ − ηµρ Mνσ − ηνσ Mµρ ).
{γ µ , γ ν } = 2η µν .
The Dirac spinor representation of so(1, d − 1) is the one with representation matrices
i
Mµν = [γµ , γν ]
4
where the γµ have the minimum possible dimension while still representing the Clifford algebra
faithfully. Note that various factors of i may differ depending on the source.
16 1. Introduction
• To see the dimension of the Dirac spinor representation, let d = 2(k + 1) and define
1 1
γ 0± = (±γ 0 + γ 1 ), γ a± = (γ 2a ± iγ 2a+1 ), a = 1, . . . k.
2 2
This defines a set of n + 1 independent fermionic QHOs,
{γ a+ , γ b− } = δ ab , {γ a+ , γ b+ } = {γ a− , γ b− } = 0.
Assuming the Clifford algebra is represented faithfully and irreducibly, this gives 2k+1 states.
• Given a Clifford algebra in d = 2n dimensions, we can automatically construct a Clifford algebra
in d = 2n + 1 dimensions by adding
γ 2n+1 = (−i)n+1 γ 0 γ 1 . . . γ 2n−1 .
Note that this means the index µ in γ µ takes values {0, 1, . . . , 2n − 1, 2n + 1}, which is unfortu-
nately conventional.
• Given a Clifford algebra γ µ in d = 2n dimensions, there are various ways to construct a Clifford
algebra Γµ in d = 2n + 2 dimensions by tensor product. One way to do this is
Γµ = γ µ ⊗ σ 1 , Γ2n+1 = γ 2n+1 ⊗ σ 1 , Γ2n+2 = 1 ⊗ σ 2 .
• When d is even, the Dirac spinor representation is reducible. Conceptually, this is because the
generators Mµν are all built from an even number of γ matrices, so they preserve the parity of
the number of fermionic QHO excitations. Concretely, we can extract the irreps, called Weyl
spinors, using the projection operators (1 ± γ 2n+1 )/2. In all dimensions, we call these two Weyl
spinors left-chiral and right-chiral respectively; they correspond to the two special roots at the
end of the Dynkin diagram for so(2n). Going up to d = 2n + 1 combines these irreps into one.
• So far, all representations have been complex, in the mathematician’s sense. However, the Dirac
spinor is not complex, in the physicist’s sense; it is either real or pseudoreal. In the case it is
real (in the physicist’s sense), there is a chance we can extract a real representation (in the
mathematician’s sense) from it. Such a spinor is called a Majorana spinor, and it turns out to
be possible if d ≡ 0, 1, 2, 3, 4 (mod 8).
• The Weyl spinors may be self-conjugate, or conjugate to each other, depending on the dimension.
In the case where they are self-conjugate, there is a chance we can extract a real representation
from them, yielding a Majorana–Weyl spinor. This turns out to be possible if d ≡ 2 (mod 8).
• The facts above are summarized in the below table.
d dim γ Majorana Weyl Majorana–Weyl min. dim.
2 2 yes self yes 1
3 2 yes 2
4 4 yes complex 4
5 4 8
6 8 self 8
7 8 16
8 16 yes complex 16
9 16 yes 16
10 32 yes self yes 16
11 32 yes 32
12 64 yes complex 64
17 1. Introduction
The columns are the total spacetime dimension, the dimension of the (complex) gamma matrices,
the presence of Majorana, Weyl, and Majorana–Weyl spinors, and the real dimension of the
smallest possible representation.
This gives a two-dimensional Dirac spinor which decomposes into one-dimensional “left-moving”
and “right-moving” Weyl spinors. To move to d = 3 we may add
3 3 1 0
γ =σ =
0 −1
which also may be used to project out the left-movers and right-movers. Of course, the cases d = 4
and hence d = 5 are familiar, though we won’t get the usual chiral representation if we use our
inductive scheme above.
18 2. SUSY Algebra and Representations
• The SUSY algebra is a graded Lie algebra. The operators in such an algebra obey
Below we won’t use the [·, ·]± notation, but instead will make (anti)commutators explicit.
Note that the signs do nothing unless exactly two of the operators are fermionic.
• For the SUSY algebra, the generators are the Poincare generators P µ and M µν and the spinor
A A †
generators QA
α and Qα̇ = (Qα ) where A = 1, . . . , N . For N = 1, we have simple SUSY,
while for N > 1 we have extended SUSY. Here we focus on simple SUSY, which is the most
phenomenologically relevant.
• The logic for the first is like that for [P σ , M µν ]. Since Qα is a spinor, it transforms as
β β
′ i µν i µν
Qα = exp − ωµν σ Qβ ≈ 1 − ωµν σ Qβ .
2 α 2 α
On the other hand, for a representation of the super-Poincare algebra on a Hilbert space,
i i i
Q′α = U † Qα U ≈ 1 + ωµν M µν Qα 1 − ωµν M µν = Qα − ωµν [M µν , Qα ].
2 2 2
Comparing the two, we conclude
[Qα , M µν ] = (σ µν )αβ Qβ
by index structure and linearity on the right-hand side. The adjoint of this equation is
α̇
[Q , P µ ] = c∗ (σ µ )α̇β Qβ
• Next, consider {Qα , Qβ }. This transforms in the Lorentz representation (1/2, 0) × (1/2, 0) =
(1, 0) + (0, 0), but the (0, 0) piece vanishes because the Qα are anticommuting. Then
for an arbitrary constant k, where σ carries the appropriate SL(2, C) indices and Mµν is the only
thing that can absorb its Lorentz indices. By the Jacobi identity, the left-hand side commutes
with P µ but the right-hand side does not unless k = 0, so we must have
{Qα , Qβ } = 0.
• Finally, for {Qα , Qβ̇ } we have (1/2, 0) × (0, 1/2) = (1/2, 1/2), so we must have (why not the
Pauli–Lubanski vector?)
{Qα , Qβ̇ } = t(σ µ )αβ̇ Pµ .
There is no way to fix t. If we set t = 0, the algebra is trivial since the spinor and Poincare
parts are completely independent. Then by convention we set t = 2 for
Remarkably, this means that QQ is a translation! That is, if we start with a bosonic/fermionic
state and act with QQ, we get back a translated bosonic/fermionic state.
• Finally, let Ti generate an internal symmetry. Then usually we must have [Qα , Ti ] = 0. The
exception is the U (1) automorphism of the supersymmetry algebra called R symmetry,
A specific SUSY theory may or may not have this R-symmetry, depending on the Lagrangian;
furthermore R-symmetry may be anomalous. As we’ll see later, postulating an R-symmetry is
useful for constraining the MSSM.
20 2. SUSY Algebra and Representations
• Recall that for su(2), we have a Casimir operator J 2 which labels the irreps; states in the irreps
are labeled by Jz .
The first two simply say that Wµ is a translationally-invariant vector, while the third indicates
it does not form a closed algebra. In verifying these results it is useful to use the identity
a1 a
ϵa1 ...ap cp+1 ...cn ϵb1 ...bp cp+1 ...cn = −p!(n − p)!δ[b1
. . . δbpp] .
C1 = P µ P µ , C2 = W µ Wµ .
• Next, we find the irreps using the little group. To use this method, we fix a reference momentum
pµ and look at the subalgebra that preserves the momentum; in the Poincare group the only
such operators are the Wµ , which take the form
1
Wµ = ϵµνρσ pν M ρσ .
2
of the Euclidean group in two dimensions, E2 , which has infinite-dimensional irreps which are
not seen in nature. Concentrating on the finite-dimensional representations, the translations
must act trivially, leaving SO(2). The irreps are labeled by the helicity λ where W µ = λP µ ,
and projective representations allow half-integer λ.
21 2. SUSY Algebra and Representations
• Since the SUSY generators commute with P µ , C1 remains a Casimir operator, so all particles
in a SUSY multiplet have the same mass. Now, for N = 1 SUSY, we have
which means that C2 is no longer a Casimir operator. This is as expected, as we can have
particles of different spin inside a SUSY multiplet.
C̃2 = Cµν C µν .
• Next, we claim that in any SUSY multiplet the number nB of bosonic states equals the number
nF of fermionic states. Consider the fermion number operator (−)F , defined by
by the anticommutation relation and the cyclic property of the trace. On the other hand,
where in the last step we restricted to states with momentum pµ . This can only hold if
X X X X
0 = tr(−)F = ⟨B|(−)F |B⟩ + ⟨F |(−)F |F ⟩ = ⟨B|B⟩ − ⟨F |F ⟩ = nB − nF .
B F B F
• There is an exception to this reasoning. If the supersymmetry is not broken, then the vacuum
states have pµ = 0, as we’ll see below. Then the trace tr(−)F evaluated over the entire Hilbert
space may be nonzero; it is called the Witten index. It is important because it is preserved
under certain deformations of the theory.
We now construct the massless SUSY multiplets. These are the most relevant phenomenologically
as almost all particles in the SM are ‘really’ massless, only acquiring mass from the Higgs.
• We take the reference momentum to be pµ = (E, 0, 0, E) and consider states in this irrep
with the reference momentum |pµ , λ⟩, where λ stands for all quantum numbers. The Casimirs
C1 = P µ Pµ and C̃2 = Cµν C µν are both zero. We already know that the Poincare generators
don’t give any new states, so we focus on the spinors.
22 2. SUSY Algebra and Representations
Therefore, we have
⟨pµ , λ|{Q2 , Q2̇ }|pµ , λ⟩ = 0
which can only hold if Q2 |pµ , λ⟩ = 0.
Q1 Q
a= √ , a† = √1̇ , {a, a† } = 1, {a, a} = {a† , a† } = 0
2 E 2 E
which are the commutation relations for a fermionic harmonic oscillator.
• Thus, we see that a† raises the J 3 eigenvalue by 1/2. Since the particle is moving in the −z
direction, it lowers the helicity λ by 1/2.
• We let |Ω⟩ = |pµ , λ⟩ be the state of highest helicity. Then we get just one other state,
As before, CPT flips λ, so we get irreps where the states have helicities {±λ, ±(λ − 1/2)}.
In general, the λ = 1/2 components are called gauginos. The SM matter fields can’t be gauginos,
because both particles in a vector multiplet transform the same way under SU (3)c × SU (2)L ×
U (1)Y , and vector particles must be created by gauge fields, which transform in the adjoint.
C1 = P µ Pµ = m2 , C̃2 = 2m4 Y i Yi
Q1,2 Q1̇,2̇
a1,2 = √ , a†1,2 = √ , {ap , a†q } = δpq
2m 2m
which means that, starting from a vacuum state, we can build 4 states instead of 2.
|Ω⟩ = |m, j = y, pµ , j3 ⟩.
We can get all j3 values by Lorentz transformations, so there are 2y + 1 vacuum states.
• Since the SUSY generators carry spin 1/2, they act on spin j = y states to yield states of spin
j = y ± 1/2. Using the same relations as above, we find the SUSY generators change j3 by
1 1
[a†1 , J 3 ] = − a†1 , [a†2 , J 3 ] = − a†2
2 2
so that a†1 raises J3 as in the massless case and a†2 lowers it. Then it can be shown that
α̇ 3 α̇ β̇
[J 2 , Q ] = Q − (σi )α̇β̇ Q Ji , [J3 , a†1 a†2 ] = [J 2 , a†1 a†2 ] = 0.
4
The last identity states that acting with both a†1 and a†2 does not change J 2 .
and
a†2 |j = y, j3 ⟩ = k3 |j = y + 1/2, j3 − 1/2⟩ + k4 |j = y − 1/2, j3 − 1/2⟩
where we suppress m and pµ and the ki are Clebsch–Gordan coefficients.
24 2. SUSY Algebra and Representations
• Finally, we have spin j states of the form |Ω′ ⟩ = a†2 a†1 |Ω⟩. These are not proportional to |Ω⟩,
since the ai annihilate |Ω⟩ but not |Ω′ ⟩. Thus we have
which gives nF = nB = 2(2y + 1) as expected. The total number of physical states in the
multiplet is 4(2y + 1), i.e. 4 times the 2y + 1 vacuum states.
• The case y = 0 is slightly different. In this case we have y ⊗ 1/2 = 1/2, i.e. we have one particle
of spin 1/2 and two particles of spin 0. Again we have 4(2y + 1) = 4 states, but only three
SO(3) irreps.
β̇ α̇ ˙
PbQa Pb−1 = ηP (σ 0 )αβ̇ Q , PbQ Pb−1 = ηP∗ (σ 0 )αβ Qβ
where ηP is a phase factor, and we are using the identity matrices σ 0 and σ 0 just to make the
indices match up. As a result,
• Heuristically, parity exchanges Q and Q. Note that the ai annihilate |Ω⟩ and the a†i annihilate
|Ω′ ⟩. Then parity exchanges the highest and lowest states. The states with definite parity are
• The SUSY algebra remains the same, but the anticommutation relations between SUSY gen-
erators are modified. Before, we only showed that {QA B
α , Qβ } could not depend on Pµ or Mµν .
(In fact, in general dimension, the anticommutator can also depend on Pµ .) However, for N > 1
it is consistent to include a central charge,
{QA µ A
α , Qβ̇B } = 2(σ )αβ̇ Pµ δB , {QA B
α , Qβ } = ϵαβ Z
AB
.
The central charges Z AB are bosonic and antisymmetric; they commute with all of the generators
and with each other. Thus they form an abelian invariant subalgebra of internal symmetries.
25 2. SUSY Algebra and Representations
generalizing the U (1) symmetry found earlier. That is, the QA transform in the fundamental
A
and the Q in the antifundamental.
• The full R-symmetry need not be realized, depending on the theory. For example, the maximal
R-symmetry in N = 4 is U (4), but the actually realized symmetry in N = 4 SYM is SU (4).
λ = 0, 2 × λ = 1/2, λ = 1.
We could also keep track of the R-symmetry; in this case the λ = 1/2 transform in the 2 of
U (2) while the others transform in the 1.
λ = −1/2, 2 × λ = 0, λ = 1/2
• It’s tempting to conclude this multiplet could be its own CPT conjugate (as long as, e.g. there
are no additional complex internal quantum numbers). However, it turns out that the λ = 0
sector must be a real representation of the R-symmetry group, while in this case it is pseudoreal.
(Central charges make little difference here, as they only break the R-symmetry group from
U (2) to SU (2).) Hence the hyper multiplet contains two copies of the helicities above, while
only one copy is a “half hyper multiplet”.
This is the only N = 4 multiplet where |λ| ≤ 1. Restricting to N = 2, we get two N = 2 vector
multiplets and hypermultiplets. Restricting to N = 1, we get two N = 1 vector multiplets and
six N = 1 chiral multiplets. Since the whole N = 4 multiplet is symmetric under λ → −λ, and
the 6 of U (4) is real, it could be its own CPT conjugate, in which case the submultiplets pair
up under CPT.
• Finally, for N = 8 and λ0 = −2, we have the ‘maximum multiplet’ or ‘gravity multiplet’
Since the 28 of U (8) is real, this multiplet again could be its own CPT conjugate. The general
rule is that this holds when the maximum helicity is N /4, so we’ve exhausted the realistic
self-CPT conjugate possibilities above.
• Note that renormalizable field theories must have |λ| ≤ 1, since otherwise the propagator does
not fall off fast enough, so we require N ≤ 4 for renormalizability. This doesn’t mean such
theories are physically irrelevant, as gravity isn’t renormalizable either.
• Generally, a massless particle with |λ| ≥ 1 must couple to a conserved current, i.e. a conserved
vector for λ = ±1 (as in electromagnetism) and a conserved tensor for λ = ±2 (as in gravity).
This is necessary to remove the growth in the propagators, which would otherwise violate
perturbative unitarity.
• There aren’t conserved tensors of higher rank by the Coleman–Mandula theorem and its gener-
alizations. Thus, |λ| > 2 is forbidden, and we can only have one particle with λ = 2 because
all such particles must act like gravitons. Then N = 8 is the maximum realistic number of
supersymmetries.
• Theories with massless particles of helicity |λ| > 2 are called higher-spin theories and are rather
exotic. To be realized in quantum field theory, they must either be free, or contain an infinite
tower of particles of essentially every spin.
• In light of the above, N = 4 is the ‘nicest’ for gauge theory and N = 8 is the ‘nicest’ for gravity,
explaining why N = 4 SYM and N = 8 SUGRA are so well studied.
• It can be argued generally that these particles must be gauge bosons; as a result, they must
transform in the adjoint representation, which is in general a real representation. (For matrix
Lie groups, this is easy to see, since it is essentially the fundamental times the antifundamental.)
Since internal symmetries commute with SUSY transformations, the λ = ±1/2 particles must
also transform in the adjoint.
• If the multiplet contains both λ = ±1/2, then these particles transform in the same representa-
tion, so we cannot get a chiral theory; this accounts for the N = 2 hypermultiplet.
• On the other hand, if the multiplet contains only, say, λ = 1/2, then by CPT there must be
another multiplet with λ = −1/2 which transforms in the conjugate representation. Since the
adjoint is real, the λ = −1/2 particle also transforms in the adjoint, and we again don’t have a
chiral theory.
• Helicity λ = 3/2 is also somewhat exotic. It turns out that it must couple to the supersymmetry
current (i.e. the current whose charge corresponds to the SUSY generators) and be associated
with a field with gauge symmetry, where the gauge symmetry is local supersymmetry. This
necessitates local super-Poincare gauge symmetry, i.e. supergravity, so a theory with a λ = 3/2
particle (called a gravitino) must also have a λ = 2 particle.
We first consider the case where all the central charges vanish. Then we have 2N pairs of
fermionic creation and annihilation operators,
A
QA † Q
aA
α =√α , aA
α̇ = √ α̇
2m 2m
which yields a much larger multiplet containing 22N states for each vacuum state, for a total
of (2y + 1)22N . As before, each of the raising operators changes the spin by 1/2, while the
† A†
combination aA1̇
a2̇ does not change the spin.
• For example, for N = 2 with a spin 0 vacuum, at each level of raising we have
which gives 16 total states, with 5 spin 0 particles, 4 spin 1/2 particles, and 1 spin 1 particle.
There are 8 fermionic states and 8 bosonic states, as expected. However, note that the number
of fermionic Poincare irreps does not match the number of bosonic Poincare irreps.
• In the case Z AB =
̸ 0, the size of the multiplets depends on the central charges. It is simplest
to begin with N = 2, where the central charge has one degree of freedom. Then we may take
A 1 0 A A B
{Qα , Qβ̇B } = 2m δB , {QA B
1 , Q2 } = 2Zϵ
AB
, {Q1̇ , Q2̇ } = 2ZϵAB
0 1
for i = 1, 2.
α1 = α2 = ei arg Z
Since the left-hand sides are positive definite, we have the BPS bound
|Z| < m.
• If the BPS bound is not saturated, then upon a rescaling, we have four independent fermionic
QHOs and get 16 degrees of freedom as before. However, Z = m, then bi must be realized
trivially on the multiplet, so we only get 4 degrees of freedom.
• More generally, for even N we can diagonalize Z AB to 2 × 2 blocks of the above form, with
values Z1 through ZN /2 . There is a BPS bound for each individual block, 2m ≥ Zi .
• If none of these bounds are saturated, we get a “long multiplet” of 22N states. If k of them
are, we get a “short multiplet” of 22(N −k) states, by the same logic as the N = 2 case. If all of
them are, we get an “ultra-short multiplet” of 2N states.
• Historically, BPS bounds and states were first found for soliton/monopole solutions of the Yang–
Mills equations. The BPS states are stable because they are the lightest charged particles.
• Extremal black holes are also BPS states in extended supergravity theories. They are stable,
as they are the endpoints of Hawking radiation. In string theory, some D branes are BPS.
• BPS states are important for understanding strong/weak coupling dualities, because they are
distinguished by short multiplets, and multiplets can’t change size as the coupling continuously
changes from weak to strong.
Note. We can see how the Higgs mechanism would work in a supersymmetric field theory by
looking at the structure of the multiplets. Without supersymmetry, a helicity ±1 Poincare irrep
“eats” a helicity 0 irrep to gain mass, forming a spin 1 irrep. Similarly, for N = 1 a vector multiplet
eats a chiral multiplet. Accounting for their CPT conjugates as well, this forms the y = 1/2 massive
multiplet. For N = 2 a vector multiplet again eats a chiral multiplet. Accounting for their CPT
conjugates, this forms the y = 0 massive multiplet.
29 2. SUSY Algebra and Representations
• In general, the SUSY generators QI (for I = 1, . . . , N ) are taken to transform in the minimal
spinor representation of so(1, d − 1). We define N to be the ratio of the number of real
supercharges NQ to the real dimension of the minimal spinor representation. However, it turns
out that generally the important variable is NQ , not N .
Note that in d = 2, 6, 10 there are two distinct “minimal” representations, which are not
conjugate; hence N really must be described by two independent numbers. Similarly, in d = 4
(and others) the Weyl and Majorana spinors have the same real dimension, so one may take
the supercharges to be either; the two are equivalent as real representations.
• As for d = 4, Nmax
SUSY is the largest amount of supersymmetry where we can have massless
SUGRA is the same for helicity |λ| > 2.
particles without requiring helicity |λ| > 1, while Nmax
(typo in 11 row for number of degrees of freedom? shouldn’t things be called
Majorana?)
• Looking at higher dimensions, there is no rigid supersymmetry beyond d = 10, and N = (1, 0)
SYM in d = 10 is closely related to N = 4 SYM in d = 4.
• Furthermore, there is no supergravity beyond d = 11. The N = (1, 1) and N = (2, 0) SUGRA
theories in d = 10 are low-energy limits of type IIa and IIb superstring theory, respectively.
The N = 1 supergravity theory in d = 11 is thought to be the low-energy limit of M -theory.
Example. SUSY in d = 2. The Lorentz group is SO(1, 1) ∼ = R, so all representations are one-
dimensional and we may classify them by their SO(1, 1) charge, which we call the spin. In the case
N = (1, 1), there are right-moving and left-moving real supercharges Q± with spin ±1/2. The boost
generator M transforms in the trivial representation of SO(1, 1), while the translation operators
P µ decompose into spin ±1. The algebra contains
[P 0 , M ] = P 1 , [P 1 , M ] = P 0 , [Q± , M ] = ±Q±
30 2. SUSY Algebra and Representations
with all other commutators vanishing. As for the anticommutation relations between the SUSY
charges, representation theory fixes
{Q+ , Q+ } = P+ , {Q− , Q− } = P− , P± = P 0 ± P 1 .
Next, since {Q+ , Q− } carries spin 0, it could contain M or a central charge. By the same argument
as in d = 4, it can’t contain M , so
{Q+ , Q− } = Z.
For N = (2, 0), the result is similar, but there would be no room to include central charges.
Example. SUSY in d = 3. In this case, the SUSY generators are two-component Majorana spinors.
(An easy way to see this is to note that so(2, 1) ∼
= sl(2, R), and the 2 of sl(2, R) is manifestly real.)
We take the gamma matrices to be
(γ µ )αβ = (−iσ 2 , σ 1 , σ 3 ).
γ µ γ ν = η µν − ϵµνρ γρ
where ϵ012 = 1 and we use mostly positive signature. Then the spinor representation matrices are
i
Mµν = − ϵµνρ γ ρ
2
which are pure imaginary as required, which implies
i
[Mµν , QIα ] = ϵµνρ (γ ρ QI )α .
2
Since 2 × 2 = 1 + 3, the general anticommutator of the SUSY charges has the form
µ
{QIα , QJβ } = c0 γαβ Pµ δ IJ + ϵαβ Z IJ
µ
for an antisymmetric central charge Z IJ , as γαβ is symmetric in α and β. (finish this)
31 3. Superspace and Superfields
• For example, we could have introduced the four fields of the Dirac spinor as separate objects,
but then Lorentz invariance would have strongly constrained the couplings. It’s much more
convenient to work with the Dirac spinor as one object.
• Similarly, in supersymmetry, we work with superfields Φ(X) which transform under a definite
representation of the super-Poincare group. We will see this requires X to be in ‘superspace’,
Minkowski space with extra Grassmann dimensions.
• Also note that, in contrast to the particle case, symmetries for the fields should ideally hold
off-shell. That is, the Lagrangian must be SUSY-invariant independent of the equations of
motion. We will find below that we must introduce auxiliary fields to achieve this.
– For G = U (1), the elements are g = eiα with α ∈ [0, 2π], so MG = U (1).
– For G = SU (2), the elements are
α β
g= , |α|2 + |β|2 = 1
−β ∗ α∗
which implies MG = S 3 .
– For G = SL(2, C), we’ve already seen MG = R3 × S 3 .
– Consider G/H = SU (2)/U (1) ∼ = SO(3)/SO(2). The U (1) factor can be taken to be
diag(eiγ , e−iγ ), which can be used to make the parameter α above real. Then MG/H = S 2 .
– More generally, MSO(n+1)/SO(n) = S n .
– We have Poincare/Lorentz = {ω µν , aµ }/{ω µν } = {aµ } = Minkowski.
where the θα and θα̇ transform like Weyl spinors and hence must be Grassmann numbers by
spin-statistics. Note that this implies
β̇ β̇
{Qα , Qα̇ } = 2(σ µ )αα̇ Pµ , [θα Qα , θ Qβ̇ ] = 2θα (σ µ )αβ̇ θ Pµ .
Superspace is ordinary space augmented with the Grassmann dimensions θα , θα̇ .
We now review properties of Grassmann numbers.
• For a single Grassmann number θ, an arbitrary function f (θ) can be expanded as
f (θ) = f0 + f1 θ
and define df /dθ = f1 . Integrals are defined as
Z Z
dθ = 0, dθ θ = 1
which implies that the ‘Dirac delta’ is δ(θ) = θ. Note that the integral is equal to the derivative.
• Now consider spinors of Grassmann numbers θα , θα̇ . Their squares as defined, as earlier, by
α̇
θθ = θα θα , θθ = θα̇ θ
which gives the identities
1 α̇ β̇ 1
θα θβ = − ϵαβ θθ, θ θ = ϵα̇β̇ θθ.
2 2
• Derivatives are defined by
β̇
∂θβ β̇ ∂θ
∂α θ ≡ α = δαβ ,
β
∂ α̇ θ ≡ α̇
= δα̇β̇ .
∂θ ∂θ
However, note that by raising and lowering indices, this implies that
α̇
∂ α θβ = −δβα , ∂ θβ̇ = −δβ̇α̇ .
In index-free notation we thus have
(ψ∂)(θχ) = ψχ, (ψ∂)(θχ) = −ψχ.
Similarly, we define Z
2 1 α̇ β̇
d θ = dθ dθ ϵα̇β̇ , d2 θ θθ = 1.
4
• Integration can again be related to differentiation,
Z Z
2 1 αβ 1
d θ = ϵ ∂α ∂β , d2 θ = − ϵα̇β̇ ∂ α̇ ∂ β̇ .
4 4
33 3. Superspace and Superfields
• Consider a scalar field φ(xµ ). It is an element of the function space F which is a representation
of the Poincare group. Let P µ be the representation of P µ on F. Then
Mµν = −i(xµ ∂ν − xν ∂µ ).
If φ transformed in a nontrivial Lorentz representation, this expression would have extra terms,
as the Lorentz transformation would act on the field indices.
• More generally, Lorentz transformations are defined as vector fields under spacetime, and the
changes of fields under these transformations are given by Lie derivatives. In the case of a
scalar, this is just the vector field acting on the scalar, as we see above.
φ → exp(−iaµ P µ )φ exp(iaµ P µ ).
Comparing our two expressions, at first order in aµ , the change in φ under translation is
δφ = i[φ, aµ P µ ] = iaµ Pµ φ = aµ ∂µ φ.
Next, we turn to the scalar superfield. More complicated superfields which are not scalar-valued
can also appear, but the scalar superfield and restrictions of it will suffice for our purposes.
The scalar superfield contains fields that are not Lorentz scalars, such as ψ. The spinor fields
are Grassmann; these Grassmann variables are independent of the superspace variables θ, and
come in via the path integral measure.
• It’s clear that the terms up to second order are the most general possible. We could write more
third-order and fourth-order terms using σ µ , but they would be redundant with our existing
terms by Fierz identities.
• Since S is a scalar superfield, we know how Poincare transformations act on it, so we focus on
‘supertranslations’,
The second step is, at this point, a reasonable ansatz that we will verify holds.
34 3. Superspace and Superfields
We can then verify that Qα and Qα̇ satisfy the supersymmetry algebra,
{Qα , Qα̇ } = 2(σ µ )αα̇ Pµ , {Qα , Qβ } = 0.
Note that this is sometimes phrased in terms of the variations δϵ = iϵQ, δϵ = iϵQ, in which case
[δϵ , δϵ ] = −2(ϵσ µ ϵ)Pµ .
• Note that the ‘extra’ terms cancel out in the supertranslation operators,
α̇
θQ + θQ = −iθα ∂α − iθ ∂ α̇
which retroactively justifies our ansatz. By comparing our two expressions to first order in ϵ,
δS = i[S, ϵQ + ϵQ] = i(ϵQ + ϵQ)S, i(ϵQ + ϵQ) = ϵ∂ − i(ϵσ µ θ)∂µ − ϵ∂ + i(θσ µ ϵ)∂µ .
But this is equal to i(ϵQ + ϵQ)S1 S2 , because Q and Q obey the Leibniz rule.
because ∂α and ϵQ + ϵQ do not commute. This makes sense because ∂α S has terms only up to
linear order in θ, while S has terms up to quadratic order.
as above. Here, Q is not a vector field; it is instead the Noether charge associated with
supertranslation under Q. Since the charge is integrated over superspace, ∂α Q = 0.
which satisfy
{Dα , Qβ } = {Dα , Qβ̇ } = {Dα̇ , Qβ } = {Dα̇ , Qβ̇ } = 0.
To verify these results, we need to use the fact that ϵ is anticommuting. Note that these
covariant derivatives have nothing to do with gauge fields. The covariant derivatives are very
similar to Q and Q, but not quite the same.
The last result shows that if S is a superfield, so is Dα S. The first result means that the
connection ∇ = (∂µ , Dα , Dα̇ ) on flat superspace has nontrivial torsion.
Note. Our derivations above are somewhat heuristic. On a deeper level, given a Lie group G with
a subgroup H, there is an induced action of G on G/H by left multiplication; this is the action of
the supersymmetry generators we found heuristically above. The SUSY covariant derivatives above
are defined by the action of G on G/H by right multiplication, which is why they anticommute
with the Qα . The SUSY covariant derivatives (along with ∂µ ) define a connection on superspace,
and the fact that {Dα , Dβ } =
̸ 0 indicates the connection has nontrivial torsion.
36 3. Superspace and Superfields
• Suppose only the “body” φ is nonzero. Then under a supertranslation we pick up ψ and χ
terms unless ∂µ φ = 0, so we need φ to be constant, which is not interesting. We might also
attempt to set only ψ nonzero, but then we pick up φ terms. Instead, we build our constraints
using covariant derivatives.
Dα̇ Φ = 0, Dα Φ = 0.
Intuitively, these only depend on θ and θ, respectively, so they contain only left-chiral and
right-chiral spinor fields, respectively. As we’ll see below, these generate the particles in a chiral
multiplet.
• The vector or real superfield satisfies V = V † , where the dagger is a complex conjugate at the
classical level. It is the SUSY analogue of a real vector gauge field. These generate the particles
in a vector multiplet.
• The linear superfield L is a vector superfield satisfying DDL = 0. Since it contains the constraint
∂µ V µ = 0, it is the SUSY analogue of a conserved current.
Note. What about superspace beyond N = 1 SUSY? Though such formalisms do exist, they are
generally not useful. The problem is that for NQ real supercharges, we require NQ Grassmann
directions and hence 2NQ = 24N components for a general superfield. This is much larger than
even a long multiplet, which has only 22N states. The constraints required to eliminate this many
degrees of freedom become quite complicated, and in some cases cannot be imposed consistently.
For general dimension, a rule of thumb is that a superspace formalism is only useful for NQ ≤ 4.
where we suppressed the position argument y of the fields on the right. Here, φ and F are
complex scalar fields, and ψ is a left-chiral Weyl spinor.
37 3. Superspace and Superfields
• The field F is auxiliary, and hence determined by the others on-shell. Then on-shell there are
2 spin 0 degrees of freedom and 2 spin 1/2 degrees of freedom, obeying nB = nF .
• Note that off-shell, there are 4 bosonic degrees of freedom, since φ and F are complex, and
4 fermionic degrees of freedom. This is convenient, because it means SUSY is manifest even
off-shell, and is essentially the reason we need the F field.
√ i 1
Φ(x, θ, θ) = φ + 2θψ + (θθ)F + i(θσ µ θ)∂µ φ − √ (θθ)∂µ ψσ µ θ − (θθ)(θθ)∂µ ∂ µ φ
2 4
where we suppressed x-dependence. Note that this is not an approximation; higher-order terms
in the Taylor expansion are just all identically zero.
In particular, note that δF is a total derivative, just as δD was for a general superfield. It is
straightforward to check manually that this satisfies the supersymmetry algebra. Later on, the
transformation rules for the anti-chiral multiplet will also be useful; they are
√ √ √ √
δφ = 2 ϵψ, δψ = i 2 σ µ ϵ∂µ φ + 2 ϵF , δF = i 2 ϵσ µ ∂µ ψ.
• Note that the product of chiral superfields is also a chiral superfield, with
√ √
Φ1 Φ2 = (φ1 + 2θψ1 + θθF1 )(φ2 + 2θψ2 + θθF2 )
giving
φ′ = φ1 φ2 , ψ ′ = ψ1 φ2 + ψ2 φ1 , F ′ = F1 φ2 + F2 φ1 − ψ1 ψ2 .
This is one of the key benefits of superfields; we may easily take products of them.
• More generally, any holomorphic function f (Φ) is also chiral, but Φ = Φ† is antichiral. The
fields Φ† Φ and Φ† + Φ are real, but neither chiral nor antichiral.
• We can further constrain chiral superfields. For example, let X be a nilpotent chiral superfield,
so X 2 = 0 and Dα̇ X = 0. Renaming some of the fields, we have
√
X(y, θ) = x + 2θψx + θθFx
The final term vanishes if x = ψx2 /2Fx , and this makes the first two terms automatically vanish
as well, because they are proportional to ψx4 and ψx3 , and ψx is a two-component spinor. We see
the scalar field is a ‘composite’ of the fermion. However, it is only well-defined if Fx is nonzero,
which we will see indicates SUSY breaking.
38 3. Superspace and Superfields
• In the absence of a mass term, the chiral superfield corresponds to the particles of an N = 1
chiral multiplet. With a mass, it corresponds to the particles of an N = 1 massive multiplet
with superspin y = 0.
i i
V (x, θ, θ) = C + iθχ − iθχ + θθ(M + iN ) − θθ(M − iN ) + θσ µ θVµ
2 2
i i 1 1
+ iθθ θ −iλ + σ µ ∂µ χ − iθθ θ iλ − σ µ ∂µ χ + (θθ)(θθ) D − ∂µ ∂ µ C
2 2 2 2
where we have shifted some fields with respect to their definitions in the general superfield for
convenience. Off shell, there are eight bosonic components, as C, M , N , D, and V µ are all real,
and eight fermionic components, from the complex χ and λ.
• Just as in the non-supersymmetric case, we want to impose a gauge symmetry to get rid of the
unwanted degrees of freedom in the vector. If Λ is a chiral superfield, then i(Λ − Λ† ) is a vector
superfield with components
√ 1
C = i(φ − φ† ), χ= 2 ψ, (M + iN ) = F, Vµ = −∂µ (φ + φ† ), λ = D = 0.
2
We may define a generalized gauge transformation on vector superfields by
i
V → V − (Λ − Λ† ).
2
This generalizes the ordinary notion of a gauge transformation as it acts on Vµ by
Vµ → Vµ + ∂µ Re(φ) ≡ Vµ − ∂µ α.
• We may use this gauge freedom to remove some of the vector superfield components. In
Wess–Zumino gauge, which we use exclusively, we set C = χ = M = N = 0, giving
1
VWZ (x, θ, θ) = (θσ µ θ)Vµ + (θθ)(θλ) + (θθ)(θλ) + (θθ)(θθ)D.
2
This leaves only the usual gauge freedom of Vµ . We hence have a vector field which yields gauge
bosons, spinor fields which yield their superpartners, and another auxiliary field D.
• Given the gauge symmetry above, the vector superfield corresponds to the massless particles
of an N = 1 vector multiplet. In the supersymmetric analogue of the Higgs effect, a chiral
superfield couples to the vector superfield, and in terms of the particles, the vector multiplet
‘eats’ the chiral multiplet to become a massive y = 1/2 multiplet.
• Again, the number of degrees of freedom balance off-shell, since the gauge field Vµ has 3 bosonic
degrees of freedom and the real scalar D has 1, so an auxiliary field is again required.
• Recall that a complex scalar field φ and a U (1) gauge field Vµ have the gauge symmetry
φ → eiqα φ, Vµ → Vµ + ∂µ α
where α is a real-valued field that specifies the gauge transformation. Starting with a free φ
field, we may minimally couple it to the gauge field by a covariant derivative,
The kinetic term for the gauge field is written using the gauge invariant field strength
1
Fµν = ∂µ Vν − ∂ν Vµ , L ⊃ Fµν F µν .
4
• Similarly, in supersymmetry, we let a chiral and vector superfield have the gauge symmetry
i
Φ → eiqΛ Φ, V → V − (Λ − Λ† )
2
where Λ(x) is a chiral superfield that specifies the gauge transformation; this is necessary to
ensure that Φ remains a chiral superfield upon gauge transforming it. Note the term
Φ† exp(2qV )Φ
• To get an explicit expression, it’s most useful to work in terms of y rather than x, giving
β̇
Wα (y, θ) = λα + θα D + (σ µν θ)α Fµν − i(θθ)(σ µ )αβ̇ ∂µ λ .
Since W is invariant, the fields λ, D, and Fµν are all separately gauge invariant. Note that
since we have a spinor-valued superfield, the coefficient of θβ is a bispinor, which naturally
decomposes into a scalar (the field D) and a self-dual two-form (the field Fµν ).
• This result can also be generalized to non-abelian gauge fields, in which case we pick up extra
terms from the structure constants, and our gauge invariant fields become gauge covariant.
41 4. Supersymmetric Lagrangians
4 Supersymmetric Lagrangians
4.1 N = 1 Supersymmetry
Now we write supersymmetric Lagrangians, beginning with the simplest case of a chiral superfield.
Therefore, the Lagrangian is supersymmetric if it is built from the D terms of superfields and
the F terms of chiral superfields. This is not surprising, since the terms in Qα and Qα̇ that
multiply by Grassmann numbers come with factors of ∂µ .
• Note that for the Lagrangian to be real, the D terms come from real superfields and the F
terms of chiral superfields are paired with the F terms of antichiral superfields. In particular,
for a theory with a single chiral superfield Φ, the most general possibility is
Here the Kahler potential K is a real function of Φ and Φ† , the conjugate Φ† is an anti-chiral
superfield, and the superpotential W is a holomorphic function of Φ, and hence a chiral superfield.
Here K|D just means the coefficient of (θθ)(θθ) in K.
Note that not every term is integrated over all of superspace. This is perfectly acceptable and
also occurs in string theory, where objects may be confined to branes.
K ⊃ (θθ)(θθ)KD , W ⊃ (θθ)WF
[K] ≤ 2, [W ] ≤ 3.
Therefore, the Kahler potential is at most quadratic and the superpotential is at most cubic.
However, the terms Φ + Φ† and ΦΦ + h.c. do not have D terms, so the most general Kahler
potential is K = Φ† Φ, where we rescaled to remove the coefficient.
• For one chiral superfield, the general Lagrangian is known as the Wess–Zumino model,
Evaluating the Kahler potential term is straightforward and yields the kinetic terms along with
F F ∗ . The superpotential provides interactions; to evaluate it, we perform a Taylor expansion
in the Grassmann variables,
∂W 1 ∂2W ∂W ∂W
W (Φ) = W (φ) + (Φ − φ) + (Φ − φ)2 , ≡
∂φ 2 ∂φ2 ∂φ ∂Φ Φ=φ
where the linear term contributes θθF and the quadratic term contributes (θψ)(θψ). Then
1 ∂2W
∂W
LWZ = ∂ µ φ∗ ∂µ φ − iψσ µ ∂µ ψ + F F ∗ + F + h.c. − ψψ + h.c. .
∂φ 2 ∂φ2
Historically this was the first nontrivial four-dimensional supersymmetric model, and it was
originally written without the benefit of superspace and superfields.
∂W ∂W ∗
F∗ + = 0, F+ = 0.
∂φ ∂φ∗
Substituting this back in,
2
∂W
LF = − ≡ −VF (φ).
∂φ
That is, these terms simply yield a positive semi-definite scalar potential for φ.
• We can eliminate the auxiliary field F from the Wess–Zumino model by plugging in its equations
of motion. The cost of doing this is that the Lagrangian becomes SUSY invariant only on-shell;
thus we prefer to keep it explicit if we’re not doing practical calculations.
43 4. Supersymmetric Lagrangians
∂2K
Kij = ≡ ∂i ∂j K
∂φi ∂φj∗
where a bar denotes a conjugated field. Here, Kij may be regarded as a metric in a space with
coordinates φi which is a complex manifold, specifically a Kahler manifold because the metric
can be derived by differentiating a potential. For a renormalizable theory this is rather trivial,
as Kij is constant, but if we allow the Kahler potential to be arbitrary (e.g. viewing the theory
as an effective theory) we can get more complicated manifolds.
where a general Kahler potential produces extra fermion interaction terms. We say the Kahler
potential is canonical if Kij is diagonal and constant, which can be achieved by a U (n) field
redefinition in the renormalizable case.
Kij F j∗ + ∂i W = 0, Kij F i + ∂j W ∗ = 0.
VF = Kij F i F j∗ = K ij ∂i W ∂j W ∗
where the Kahler metric with raised indices is the inverse of the original Kahler metric.
Note. Keeping track of the R-symmetry. The U (1) R-symmetry in such theories takes the form
Φi → eiri α Φi
i
where ri = R[Φi ] is the U (1)R charge of the field Φi , and R[Φ ] = −ri . By definition, we have
This implies that the components of a chiral superfield have different R-charges,
The Kahler potential is invariant under U (1)R provided that it does not mix fields of different
R-charges. The superpotential is only invariant if R[W ] = 2, and hence it generically breaks the
R-symmetry. We also note that the R-symmetry may be ambiguous, as we may combine it with
flavor symmetries which rotate the Φi individually.
Next, we turn to the vector superfield Lagrangian, i.e. the theory of “super QED”.
• We can deduce QED by demanding a local U (1) symmetry for a complex scalar field φ,
parametrized by a scalar field α. To get super QED we replace the scalar fields with chi-
ral superfields, with the gauge transformation
Φ → exp(iqΛ)Φ.
• A new feature of super QED is a new gauge-invariant term, the Fayet–Iliopoulos (FI) term
1
LFI = ξV |D = ξD
2
where ξ is a constant. The FI term only appears for an abelian gauge theory, because the gauge
field is not charged under U (1). More generally the gauge field would be charged, which would
make D charged and the FI term not gauge invariant.
• Note that for the superpotential W (Φ) to be gauge invariant, it must contain terms like Φ1 · · · Φn
where the charges of the terms in the product add to zero. If there is only one charged Φ field,
the superpotential must vanish.
45 4. Supersymmetric Lagrangians
• Next, we write out the first term explicitly. Taking Wess–Zumino gauge and Taylor expanding,
exp(2qV ) = 1 + 2qV + 2q 2 V 2
We could make this manifestly gauge invariant by grouping terms into covariant derivatives.
More generally, we could take τ complex, yielding an F F̃ term. However, this term is a total
derivative, so it makes no difference perturbatively.
∂W 2 1 ξ
2
2
V (φ) = VF (φ) + VD (φ) = + + q|φ| ≥ 0.
∂φ 2 2
We see that the FI term could be responsible for spontaneous symmetry breaking.
We will not consider the non-abelian case, where there are many complications.
46 4. Supersymmetric Lagrangians
• The results easily generalize for more fields. Assuming the gauge group remains U (1), we don’t
get more vector superfields, but we can have multiple chiral superfields Φi . We find
1 1 ξ
LD = qDKij φi φj∗ + D2 + ξD, D = − − qKij φi φj∗
2 2 2
so the total scalar potential is
2
1 1 ξ
i i
V (φ ) = Kij F F j∗
+ D2 = K ij ∂i W ∂j W ∗ + + qKij φi φj∗ .
2 2 2
• To get the supersymmetric analogue of ordinary QED, with a single Dirac fermion, we require
two chiral superfields Φ+ and Φ− with opposite charges, corresponding to the electron and
positron. The Dirac mass term comes from the superpotential W = mΦ+ Φ− . A single massive
chiral superfield instead corresponds to a Majorana fermion, which cannot be charged.
• We take the superpotential to have the standard form given above. To establish notation, the
Lagrangian has kinetic terms
m
L0 = −∂µ ϕ∂ µ ϕ − m2 ϕϕ − iψσ µ ∂µ ψ − (ψψ + ψψ)
2
where we are using (− + ++) signature, so the propagators are: where the momentum p flows
to the right.
For the scalar, this leads to cubic and quartic vertices of −imλ and −iλ2 respectively, as well
as the Yukawa interactions:
• We could have also treated the fermion mass as an interaction term, in which case only ⟨ψψ⟩
and ⟨ψψ⟩ would be nonzero, and we would have an interaction term −im which reversed the
chirality.
• At one loop, the total scalar tadpole vanishes, as the two diagrams cancel: In fact, this cancel-
lation holds at all orders in perturbation theory, and it implies that vevs are not renormalized.
• Next, consider the scalar self-energy at zero external momentum. This directly determines the
coefficient of the mass term ϕϕ in the effective action, and three diagrams contribute: The total
contribution is
2
tr(−σ µ σ ν )qµ qν
−i −i
Z Z Z
2 21 2
(−imλ) d̄q − (−iλ) d̄q + (−iλ) d̄q 2 = 0.
q 2 + m2 2 (q 2 + m2 )2 q + m2
Hence the mass term isn’t renormalized either, and this again holds to all orders in perturbation
theory. One can check the same holds for all of the vertices.
47 4. Supersymmetric Lagrangians
• We could also formulate the perturbation theory with the auxiliary field,
1 1
L0 = −∂µ ϕ∂ µ ϕ − iψσ µ ∂µ ψ + F F + m(F ϕ + F ϕ − ψψ − ψψ)
2 2
which has a simpler set of interactions,
2
Lint = λ(F ϕ2 + F ϕ − ψψϕ − ψψϕ).
• To compute the scalar propagators, write the kinetic terms as (ϕ, F )M (ϕ, F )T and invert M ,
to find
−i ip2 im
⟨ϕϕ⟩ = 2 2
, ⟨F F ⟩ = 2 2
, ⟨ϕF ⟩ = ⟨ϕF ⟩ = 2 .
p +m p +m p + m2
In particular, there is a nontrivial, though unusual propagator for F .
• Upon computation of the effective action, we find no effects at one loop except for the same
field strength renormalization factor
ZΦ ≡ Zϕ = Zψ = ZF .
This continues at all orders in perturbation theory. The only effect is the anomalous dimension
γ = d log ZΦ /d log µ. If we account for renormalization effects by working in terms of the
1/2
renormalized fields ΦR = ZΦ Φ, and γ is given, then we can compute the RG flow of the
renormalized couplings exactly.
• In 1977, Grisaru, Siegel, and Rocek showed using “supergraphs” that, except for one-loop
corrections to f , quantum corrections only come in the form
Z Z
d x d4 θ . . . .
4
Then W and ξ are not renormalized in perturbation theory at all, while K is.
• In 1993, Seiberg used symmetry and holomorphicity arguments to establish this result nonper-
turbatively in a simple and elegant way; we will follow this proof here. The intuition is that
since the superpotential is holomorphic, it is determined by its singularities and its asymptotics;
this allows us to pin it down in general using its weak coupling limit.
X = (x, ψx , Fx ), Y = (y, ψy , Fy )
Here we note that the integrand of the d2 θ integral is holomorphic. These spurion fields have
no dynamics; they are just a way to rewrite numerical coupling constants in the action to make
symmetries manifest. We might think of them as being very heavy fields with fixed vevs.
48 4. Supersymmetric Lagrangians
Φi V X Y θ θ Wα
0 0 0 2 1 −1 1
where we note that if θ → eiα θ, then dθ → e−iα dθ because dθ θ = 1, and the charge for Wα
R
can be found from its definition in terms of covariant derivatives, using ∂θ → e−iα ∂θ .
X → X + ir, r∈R
and hence the shift contributes a total derivative, which does not affect perturbation theory.
This symmetry is also called a Peccei–Quinn symmetry, making X an ‘axion-like field’.
• Now consider the Wilsonian action SΛ attained by integrating out all degrees of freedom above
Λ. We must have
Z Z
SΛ = d4 x d4 θ J(Φ, eV , X, Y, D, . . .) + ξ(X, Y )V
Z Z
+ d4 x d2 θ H(Φ, X, Y, W α ) + h.c.
Moreover, we must still have invariance under shifts in X. Hence the only term involving X
that is allowed takes the form XW α Wα , so
Now, in the limit Y → 0, we must have h(Φ) = W (Φ), because any higher order corrections to
h(Φ) would be higher order in Y and hence negligible. Thus we must have h(Φ) = W (Φ) for
all Y , so the superpotential is not renormalized!
• We claim the gauge kinetic function is only renormalized at one loop. Since the gauge kinetic
term appears as XW α Wα , the gauge field propagator is proportional to 1/x and the three-point
gauge vertex is proportional to x. (don’t understand) Then the number of powers of x at
L loops is 1 − L, so αX is the tree-level contribution and g(Φ) is the one-loop contribution.
In practice, this means that one can compute divergent higher-loop corrections for the gauge
kinetic function, but they all miraculously cancel.
49 4. Supersymmetric Lagrangians
• We cannot constrain the Kahler potential nearly as much, because it is not holomorphic. How-
ever, the FI term must be a constant to maintain gauge invariance under
V → V + i(Λ − Λ† ).
P
Moreover, the contributions correcting ξ are proportional to i qi where the qi are the U (1)
charges. This vanishes if the gravitational anomaly vanishes, so ξ is not renormalized.
Note. The above derivation is a bit complicated; it’s easier to see the key ideas in the Wess–Zumino
model. Take the superpotential at scale µ0 to be
m 2 λ 3 m̃ λ
Wµ0 = Φ + Φ ≡ µ 0 + Φ3
2 3 2 3
where we defined the dimensionless coupling m̃. The free theory at W = 0 has a U (1) rotation
symmetry for Φ and a U (1)R symmetry, with charges
Then the most general allowed for the effective superpotential at scale µ is
λΦ µ
Wµ = µm̃Φ2 f ,
µm̃ µ0
where f is holomorphic in its first argument. Since the limit λ → 0 is regular, we can expand it in
a Taylor series,
∞
X λn
Wµ = cn (µ/µ0 ) Φn+2 .
(µm̃)n−1
n=0
Now we also need a regular m̃ → 0 limit, which means terms with n > 1 are disallowed. Then
But now we can take the weak coupling limit: we know that in the limit λ → 0 no nontrivial
renormalization occurs at all, so c0 = 1/2 and c1 = 1/3. Then
m̃(µ) 2 λ 3
Wµ = µ Φ + Φ
2 3
which shows that the superpotential is not renormalized.
• The simplest case in N = 2 is the vector multiplet, whose particles are created by a massless
chiral superfield Φ and massless vector superfield V , just as in super QED.
• It can be shown that F(Φ) only receives one-loop corrections in perturbation theory. It also
receives nonperturbative corrections which can be written in terms of an “instanton expansion”
2
P
k ak exp(−kc/g ) found by Seiberg and Witten in 1994.
• There are other combinations of fields that produce N = 2 multiplets, but they are much more
complicated.
• The AdS/CFT correspondence relates a gravitational theory in AdS space to a conformal field
theory without gravity in one fewer dimension. The prime example of this correspondence is
between AdS in five dimensions and N = 4 super Yang–Mills in four dimensions.
51 5. The MSSM
5 The MSSM
5.1 SUSY Breaking
We now review the basics of supersymmetry breaking, since it must occur in our universe.
αa (T a )ij (φvac )j ̸= 0.
• For example, for a complex field φ = ρeiθ with a U (1) internal symmetry,
δφ = iαφ, δρ = 0, δθ = α.
• In particular, consider the vacuum state |Ω⟩. If the vacuum is SUSY invariant, Qα |Ω⟩ = 0, then
the left-hand side vanishes and E = 0. If SUSY is broken, then the vacuum energy is positive.
• This might seem a bit strange, because we are used to the vacuum energy being indefinite in
quantum field theory; for example, we change it by normal ordering the Hamiltonian. The
difference is that the SUSY commutation relations involve the Hamiltonian itself, H ∼ |Q|2 .
Hence requiring a SUSY algebra at the quantum level constrains the operator ordering.
• As before, SUSY breaking is caused by fields acquiring vevs that are not SUSY invariant. Recall
that in the case of a chiral superfield Φ,
√ √ √ √
δφ = 2ϵψ, δψ = 2ϵF + i 2σ µ ϵ∂µ φ, δF = i 2ϵσ µ ∂µ ψ.
The field ψ cannot have a vev, as this would violate Lorentz invariance. Similarly, we must
have ⟨∂µ φ⟩ = 0. Then the transformation reduces to
√
δ⟨φ⟩ = δ⟨F ⟩ = 0, δ⟨ψ⟩ = 2ϵ⟨F ⟩.
VF = Kij F i F j∗
we see that SUSY is broken if ⟨VF ⟩ > 0, corresponding to a positive vacuum energy at the
classical level, just as at the quantum level. Equivalently, a SUSY-invariant vacuum is one
where ∂W/∂φ = 0.
Note. Reformulating the above in mathematical language. As we’ve seen above, the scalar fields
are a map from spacetime to a Kahler manifold; we can think of the φi as local coordinates on
this manifold. The vacuum equations ∂W/∂φi = 0 are a set of holomorphic algebraic equations
and hence define a subvariety of this manifold. If the subvariety is trivial, SUSY is necessarily
spontaneously broken. If there are a continuum of solutions, we say there is a vacuum moduli space.
Example. The Wess–Zumino model. We take the canonical Kahler potential and superpotential
m 2 λ 3
W = Φ + Φ .
2 3
The condition for a vacuum solution is
∂W
= mφ + λφ2 = 0
∂φ
which gives the two discrete supersymmetric vacua
m
φ = 0, φ=− .
λ
Example. Consider a theory with three chiral superfields, the canonical Kahler potential, and
W = Φ1 Φ2 Φ3 .
The vacuum equations are ϕ1 ϕ2 = ϕ1 ϕ3 = ϕ2 ϕ3 = 0, which defines a subvariety; note that the
moduli space is not a manifold in this case.
The only nonzero F field is thus ⟨F1 ⟩ ̸= 0. For simplicity, we take ⟨φ1 ⟩ = 0. The fermion mass
terms are then
2 0 0 0
∂ W
ψ i ψ j = 0 0 M ψ i ψ j .
∂φi ∂φj
0 M 0
Hence we have two fermions with mass M and one massless fermion ψ1 , which is the Goldstino
corresponding to the nonzero vev of F1 . The quadratic terms in the scalar potential expanded about
the vev are
VF (φi ) ⊃ −m2 g 2 (φ23 + φ∗3 2 ) + M 2 |φ3 |2 + M 2 |φ2 |2 .
Hence the φ1 field is massless, since it corresponds to a flat direction in the scalar potential, and
the φ2 has mass M . For the φ3 , expand φ3 = a + bi to find
m2a = M 2 − 2g 2 m2 , m2b = M 2 + 2g 2 m2 .
We define the supertrace as the trace with an extra minus sign for bosons,
X
STr(M 2 ) ≡ (−1)2j+1 (2j + 1)m2j = 0.
j
Note. We can show that the supertrace vanishes at tree level for arbitrarily many chiral superfields.
First note that the fermion mass matrix is
Hence we have
tr MB2 = 2∂∂V = 2Kii ∂ i ∂ i Kjj ⟨∂ j W ⟩⟨∂ j W ∗ ⟩ = 2⟨∂ i ∂ j W ⟩Kii Kjj ⟨∂ i ∂ j W ∗ ⟩ = 2 tr MF† MF .
Since each fermionic field contains two degrees of freedom, Str(M 2 ) = 0 as desired.
Note. We have shown that W is not renormalized to all orders in perturbation theory; hence if
SUSY is unbroken at tree level, it is unbroken in perturbation theory. Moreover, if SUSY is broken
at tree level, the supertrace of M 2 vanishes, implying that the superpartners cannot be too much
heavier. Since this appears to be experimentally ruled out, SUSY must be broken nonperturbatively.
54 5. The MSSM
Example. For a vector superfield V = (λ, Aµ , D) in Wess–Zumino gauge, we must have ⟨λ⟩ =
⟨Aµ ⟩ = 0 by Lorentz invariance. However, D can acquire a vev, and since
δλ ∝ ϵD
we see that SUSY can be broken when D acquires a vev, where λ is the Goldstino. This is called
D-term SUSY breaking, in contrast with F -term SUSY breaking above. Since the contribution to
the scalar potential is proportional to ⟨D⟩2 , SUSY is broken if ⟨VD ⟩ > 0.
In the very simplest case, we consider a single chiral superfield with U (1) charge q and trivial
superpotential, where q > 0 and ξ ≥ 0. The scalar potential is
2
1 ξ 2 ξ
V (φ) = + q|φ| , ⟨φ⟩ = 0, ⟨D⟩ = −
2 2 2
which means that SUSY is broken when ξ > 0. Since ⟨φ⟩ = 0, the U (1) symmetry is unbroken, so
the λ and Vµ remain massless. Since the superpotential is trivial, the ψ remains massless. Finally,
using the scalar potential, we see
ξ
V (φ) ⊃ q|φ|2 , m2φ = qξ/2.
2
On the other hand, if q > 0 and ξ < 0, then we have
ξ
|⟨φ⟩|2 = − , ⟨D⟩ = 0
2q
which indicates that SUSY is not broken, but the U (1) symmetry is. Then the λ and Vµ fields
acquire mass by the ordinary Higgs effect by interacting with the vev of φ.
Note. In the case of D-term breaking, the supertrace sum rule is slightly modified; it turns out
to be proportional to the sum of all U (1) charges. However, this quantity must vanish to ensure
anomaly cancellation.
Finally, we briefly discuss SUSY breaking in supergravity.
• In supergravity, there is a new auxiliary field Fg , which can break SUSY by acquiring a vev.
Specifically, the F -term is
This is important because it allows ⟨V ⟩ = 0 even after SUSY breaking, which avoids an
unacceptably large cosmological constant. However, this does not solve the cosmological constant
problem, because ⟨V ⟩ is generically large and negative.
• There are “no-scale” supergravity models where the Kahler potential and superpotential are
chosen so that ⟨V ⟩ = 0, but these are not regarded as a solution of the cosmological problem
either, because there is no reason the form of the potentials should be preserved by quantum
corrections.
55 5. The MSSM
• In the process of SUSY breaking, the gravitino field, which is the gauge field of N = 1 su-
pergravity, “eats” the goldstino and gains mass. This is called the super Higgs effect, and
should not be confused with the supersymmetric extension of the ordinary Higgs effect, where
a massless vector superfield eats a chiral superfield to gain mass.
• The MSSM has N = 1 SUSY with gauge group SU (3)C × SU (2)L × U (1)Y . The matter fields
are the same as in the SM, with spinor fields promoted to chiral superfields. Note that some
conjugations are necessary, since chiral superfields only contain left-chiral spinors.
• The Higgs acquires a superpartner, the Higgsino. Since the Higgsino contributes to the U (1)Y
anomaly, a second Higgs field with opposite hypercharge is required to cancel it. We have
where the H1 field is not present in the SM. Hence the MSSM is a two Higgs doublet model.
Another reason a second Higgs is required is that in the SM, we must use the Higgs conjugate
field for some of the Yukawa terms, but we can’t do that here since the superpotential is
holomorphic.
• The gauge bosons correspond to vector superfields, giving gluons and gluinos, W bosons and
winos, and B bosons and binos,
The neutral winos, binos, and Higgsinos mix to form Majorana fermions called neutralinos, the
lightest of which could serve as a dark matter candidate; typically this candidate is mostly bino.
The charged winos, binos, and Higgsinos form charginos.
• As in super QED, we have interactions by the chiral superfield kinetic term. However, the FI
term must be zero, as otherwise the scalar potential for squarks and sleptons would yield a
vacuum breaking U (1)A and SU (3)C symmetry.
• Rescaling the gauge fields, the gauge kinetic terms fa = τa have Re τa = 4π/ga2 , specifying the
gauge couplings.
56 5. The MSSM
where we have suppressed generation indices; properly every coefficient is a matrix in generation
space. The first three terms yield standard Higgs Yukawa couplings to matter, while the fourth
is a mass term for the two Higgs fields.
• The last four terms break either U (1)B or U (1)L . They are not allowed phenomenologically,
because if the parameter values were natural, then protons would decay in seconds.
where s is the spin. This has the additional benefit that the lightest superpartner (LSP) is
stable, and hence can serve as a candidate for cold weakly interacting dark matter. In collider
experiments, one can search for LSP pair production by ‘missing energy’.
• Note that it would have been completely equivalent to define R to be (−1)3(B−L) , because all
interaction terms are Lorentz scalars, so the spins of the fields involve must sum to an integer.
Our definition of R is just slightly nicer. R-parity can be realized by having a U (1)B−L gauge
symmetry spontaneously broken at high energies in an appropriate way.
Note. The imposition of R-parity is a useful general model building tool, outside of SUSY. In a
generic extension of the Standard Model with an R-parity-like symmetry (which we call T -parity),
every vertex has an even number of new particles, and these must be connected up to yield a
contribution to a Standard Model operator, so the leading contributions are at loop level. This
gives one more room to avoid bounds, and reduces contributions to the Higgs mass. Theories with
T -parity also tend to have a dark matter candidate, namely the lightest T -odd particle (LTOP).
• As shown above, naive SUSY breaking wouldn’t work, because STr(M 2 ) vanishes, and the
superpartners would be too light. Instead, we introduce a hidden sector which breaks SUSY.
The hidden sector may obey the sum rule, but it isn’t ruled out because it doesn’t interact
directly with the MSSM fields; instead it interacts through a messenger sector. Typically, the
gauge group is enlarged by another factor G, under which all MSSM fields are singlets.
• One possible SUSY breaking mechanism is gaugino condensation. Here an asymptotically free
gauge coupling g becomes large at some energy scale M . If we start with a cutoff Λ, then
M = Λ exp(g −2 (Λ)/β)
• Next, we need to specify the messenger sector. For example, the mediating field could simply
be the graviton. Then couplings are suppressed by Mpl , so by dimensional analysis
M2
∆m =
Mpl
where ∆m describes the size of the mass splittings in the MSSM. Setting ∆m ∼ 1 TeV and
Mpl ∼ 1018 GeV gives M ∼ 1011 GeV. This scenario requires a gravitino, which acquires a mass
∆m by the super Higgs mechanism.
• Another situation is gauge mediation. Here the messenger fields are charged under both G and
the SM gauge group, and the SUSY breaking is transmitted by loops. Then
M
∆m ∼
16π 2
which means M must also be around the TeV scale. Then the gravitino mass is on the order of
M 2 /Mpl ∼ eV so it is the LSP.
• To work phenomenologically, we integrate out the messenger sector and hidden sector to yield
a Lagrangian for the MSSM with SUSY breaking terms. Generically, we get all possible “soft
SUSY breaking terms”, i.e. renormalizable terms that do not reintroduce the hierarchy problem
(quadratic sensitivity to Λ2 ), such as mass terms for superpartners and additional interactions.
This is the source of the many (> 100) parameters in the MSSM.
• Almost all of the MSSM parameter space is ruled out, because SUSY particles could heavily
mix in general, and this mixing would be transferred to quarks by loops, causing flavor changing
neutral currents.
• Specific high scale models provide relations between the parameters. For example, the con-
strained MSSM (CMSSM), which may arise from string theory has only three free parameters.
• Extra structure is needed to account for neutrino masses. One may also add an additional
singlet Higgs (and its superpartner), which resolves some theoretical tensions. The result is the
next-to-minimal extension of the SM, the NMSSM.
• We may split the hierarchy problem into two parts: why Mew ≪ Mpl at tree level, and why this
is stable under quantum corrections. These are qualitatively distinct applications of naturalness,
and both are challenging.
• Note that SUSY introduces new scalar particles, but they don’t create new hierarchy problems
because they are superpartners of fermions, which are naturally light.
• As argued before, SUSY cancels the quadratic divergences in the Higgs self-energy. We retain
logarithmic divergences of the form ∆m log(M/∆m), which may naturally give a small result
as long as ∆m is around the TeV scale, motivating low-scale SUSY. An independent argument
for TeV scale SUSY is gauge coupling unification. A third argument comes from the WIMP
miracle, which is that TeV scale SUSY can account for dark matter by the LSP.
58 5. The MSSM
• A more sophisticated way to understand these cancellations comes from the non-renormalization
theorems above. The Higgs mass term comes from the superpotential, and we have shown it is
not renormalized.
• In principle, SUSY could also solve the cosmological constant problem. However, it is broken at
far too high a scale; we would have MΛ ∼ ∆m, while in reality MΛ ≪ ∆m. At present, there
is no satisfactory solution to this problem.
Note. Gauge coupling unification only makes sense in the context of grand unification. The
coupling constants for non-abelian gauge theories are normalized by normalizing the generators T a
so that, e.g. tr(T a T b ) = δ ab /2. However, there’s no canonical way to normalize the U (1) coupling;
by different choices of this normalization one can make gauge coupling unification happen for any
theory. The only way to resolve this ambiguity is to determine how U (1) is embedded in the GUT
group, which fixes its normalization.
Note. The technical naturalness of the smallness of the fermion masses can be seen nicely by
spurions. Let the mass parameter be m. Then the Lagrangian maintains chiral symmetry if m is
charged under it, which implies corrections to it must take the form δm = mf (|m|2 ). In general,
whenever a symmetry is restored when a parameter vanishes, it can be maintained for nonzero
values of that parameter by promoting it to a spurion charged under that symmetry.