Pub - The Physics of The Standard Model and Beyond PDF
Pub - The Physics of The Standard Model and Beyond PDF
Standard Model
and
T. Morii
Kobe University, Japan
C. S. Lim
Kobe University, Japan
S. N. Mukherjee
Banaras Hindu University, India
^P World Scientific
NEW JERSEY • LONDON • SINGAPORE • BEIJING- SHANGHAI • H O N G K O N G • TAIPEI • CHENNAI
Published by
World Scientific Publishing Co. Pte. Ltd.
5 Toh Tuck Link, Singapore 596224
USA office: Suite 202, 1060 Main Street, River Edge, NJ 07661
UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE
For photocopying of material in this volume, please pay a copying fee through the Copyright
Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to
photocopy is not required from the publisher.
ISBN 981-02-4571-8
Great advances in particle physics have been attained in the last several decades. In
the development of the new generation of accelerators and detectors, the quantity
and quality of the data on various interaction processes have been vastly increased
and our knowledge of particle physics became surprisingly fruitful. Moreover, many
interesting and new ideas were proposed and developed successfully in quantum field
theories for describing strong, electromagnetic and weak interactions. Those new
ideas were concentrated into a simple and beautiful theory known as the standard
model in the late 1960's which was formulated in the framework of non-Abelian
gauge theory and was extremely successful in describing a wide range of existing
phenomena of elementary particles. However, it is also believed that the model is not
the ultimate theory; it has many arbitrary parameters which cannot be predicted by
the theory and fundamental problems, such as the hierarchy problem, which should
be solved in some physics beyond the standard model, "New Physics". Thus it is a
general consensus that to search for the evidences for physics beyond the standard
model is the most urgent issue in the particle physics of the 21st century.
The aim of the present textbook is to give a unified description of the struc-
tures and interactions of elementary particles, by discussing the underlying theo-
ries, namely the standard model of elementary particles and physics beyond the
standard model. Especially, concerning the description of the physics beyond the
standard model, we will select the hot topics of our current interest, including the
issues inspired by various experiments as much as possible.
This book has emerged out of the introductory lectures we gave in the graduate
course or series of lectures given in various places over several years. Therefore,
this book is written primarily for graduate students and can be used by those who
wish to major in theoretical and experimental high energy physics. Throughout this
textbook, it is assumed that the readers have already gone through a basic course
in quantum field theory and particle physics. We hope that this book is also useful
for the researchers in the field of particle and nuclear physics.
There are two main features of this textbook. First, we have aimed at a concise
description but at the same time we have paid attention so that the basic concepts
vii
vm Preface
are clearly mentioned. For such a purpose each section is guided by pedagogical
arguments. Second, we also attempt to provide excitement in particle physics,
focusing on the important experimental observations (old and new) and a variety
of nice ideas for their interpretation.
The organization of the book is as follows. In Chapter 1, we give a brief in-
troduction of the standard model for the elementary particles to describe their
structures and interactions. This is followed by Chapter 2 with a description of
Fermi theory of weak interactions and its limitation. Preparation for the under-
standing of electroweak gauge theory is done in Chapter 3, which covers symmetry,
symmetry breaking and the gauge theory. Glashow-Weinberg-Salam (GWS) the-
ory of electroweak interactions is covered in Chapter 4. In Chapter 5, quantum
chromodynamics (QCD), i.e. the theory of strong interactions, is discussed briefly.
After these chapters, we will come to the description of physics beyond the standard
model. In Chapter 6, we discuss a current hot topic, neutrino physics, i.e. neutrino
masses and neutrino ocsillations. Then, in Chapter 7, we deal with supersymmetric
theories as the typical example of physics beyond the standard model. We further
attempt to discuss other representative topics in the physics beyond the standard
model; precision tests of electroweak radiative corrections and new physics search
through these analyses in Chapter 8 and flavor physics and CP violation, which
are also sensitive to the presence of new physics, in Chapter 9. Each chapter is
accompanied by a few problems, hoping that solving them will be of some help in
the understanding of the main text. The topics we chose depend on our preference.
However, we believe that those topics must be of general interest and instructive
not only for students who are going to enter the field of theoretical and experimen-
tal particle physics but also even for researchers in high energy physics. Finally,
appendices for some basics are given for the reader's convenience.
During this project, we have learned a lot from many excellent books and re-
views; some of them (Bjorken and Drell, 1964; Abers and Lee, 1973; Jauneau, 1977;
Quigg, 1983; Cheng and Li, 1984; Halzen and Martin, 1984; Rudaz, 1986; Aitchi-
son and Hey, 1989; Wess and Bagger, 1992; Nagashima, 1999) were good guides in
preparing this book.
Unfortunately, because of limited space and time, we could not help neglecting
some of the important topics, such as the full contents of QCD, discussion on the
grand unified theories (GUT), detailed phenomenological analyses of supersymmet-
ric theories, and so on. Fortunately, concerning these topics, there already exist
many excellent textbooks or reviews and we refer the reader to some of them: for
example, for comprehensive discussion of QCD, the textbooks by Yndurain (Yn-
durain, 1992) and by Muta (Muta, 1998), for discussion of GUT, the book by Ross
(Ross, 1985) and a review by Langacker (Langacker, 1982), for more details on su-
persymmtry including its phenomenology, the books by Weinberg (Weinberg, 2000)
and by Ross (Ross, 1985), and so on. Furthermore, in this textbook the references
are far from complete but rather limited only to the ones directly related to the
Preface IX
Preface vii
Chapter 1 INTRODUCTION 1
1.1 Elementary Particles in the Standard Model 1
1.2 Interactions Among Fundamental Particles 4
Chapter 3 S Y M M E T R I E S A N D T H E G A U G E THEORIES 33
3.1 Global Symmetries and Noether's Theorem 33
3.2 Local Gauge Symmetries and Gauge Fields 36
3.2.1 Quantum Electrodynamics — £/(l) model 37
3.2.2 Yang-Mills Gauge Theory — SU(2) model 40
3.3 Spontaneous Symmetry Breaking and Goldstone Bosons 44
3.4 Higgs Mechanism 51
xi
xii Contents
Bibliography 291
Index 295
Chapter 1
INTRODUCTION
"From the earliest times, man's dream has been to comprehend the complexity of
nature in terms of as few unifying concepts as possible"*
Stars which are twinkling in a night sky have brought us various dreams from
ancient times. People have been attracted in their mysterious appearance and have
looked for the physical law of the cosmos; how was the cosmos born?, what are the
most fundamental building blocks of matter constructing the cosmos?, how do those
fundamental particles interact with each other or one another? and so on. There
have been many questions to be answered. Particle physics is the most powerful
tool for investigating these fundamental questions.
In this introductory chapter, we make a simple explanation on fundamental
particles in the standard model and on the characteristic properties of their inter-
actions.
Elementary particles are the most fundamental building blocks of matter. Entity of
elementary particles has been changed in the long course of development of physics.
By the end of the last century, people have found that matter is made of molecules
and/or atoms, atoms are made of nuclei and electrons, nuclei are made of nucleons,
i.e. protons and neutrons, and finally nucleons are made of quarks. Elementary
particles of our own time are quarks (together with leptons such as electron). In
the development of big accelerators in the last 50 years, many (more than 300)
particles called hadrons were discovered in addition to protons, neutrons and pions.
There are two kinds of hadrons; baryons (proton, neutron etc.) with the baryon
number B = 1 and mesons (pions, kaons, etc.) with B = 0.
The quark model proposed by Gell-Mann and independently by Zweig in 1964
•Address by Abdus Salam at UNESCO Celebration of the Centenary of Einstein's birth, Paris
May 1979.
1
2 INTRODUCTION
(Gell-Mann, 1964; Zweig, 1964) classifies all existing hadrons surprisingly well based
on the internal symmetry of SU(3) for hadrons composed of relatively light three
quarks, and successfully explains static properties of those particles (see Appendix
E). In the quark model, baryons are composed of three quarks (qqq) as p = (uud),
n = (udd), A = (uds), etc. and mesons are composed of a quark q and an anti-quark
q as 7r+ = (ud), n~ — (ud), K+ = (us), K~ = (su), etc. Nowadays six different
quarks u, d, s, c, b, t are known to exist and thus, it is said that quarks possess 6
degrees of freedom called "flavor". A quark flavor can change into another quark
flavor through weak interactions mediated by charged weak bosons W±, which are
predicted to appear in the electroweak standard model as discussed later in Chapter
4. In addition to flavor, quarks have another degree of freedom called "color" as
briefly discussed in Chapter 5. The interaction between quarks due to the color
"charge", which is nothing but the strong interaction, is mediated by gluons and is
described by quantum chromodynamics(QCD). QCD is the gauge theory with color
SU(3) symmetry. While the flavor symmetry is broken by the difference of quark
masses, largely for heavy quarks, the color symmetry is an exact symmetry.
Another kind of elementary particles called leptons (electron e, muon LI, tau
r and their corresponding neutrinos ve, v^, vT) exist in Nature. Leptons are free
from the strong interaction and have no color degrees of freedom, i.e. leptons are
colorless. Among them, neutrinos possess only weak interactions, while e, fx and r
have the both of weak and electromagnetic interactions. Productions and decays
of leptons are successfully described by the electroweak standard model, i.e. the
SU(2)L x U(1)Y gauge theory of electroweak interactions, discussed in chapter 4.
The elementary particles in the standard model are as follows;
{
photon 7,
weak (gauge) bosons W^,Z°, (1.3)
gluons g,
Higgs bosons H. (1.4)
Quarks and leptons are fundamental building blocks of matter. All of them are
fermions and have spin | , whose quantum numbers are summarized in Table 1.1
and Table 1.2. It is interesting to note that both quarks and leptons are paired into
three doublets, the members of each doublet participating in the charged current
weak interaction processes together. The repetition of the doublets is stated as there
are three generations of quarks and leptons. The corresponding particles in different
Elementary particles in the standard model 3
Q /s S c B T mass
u +* +^ 0 0 0 0 1.5 ~ 5 MeV
1
d -* 2
0 0 0 0 3 ~ 9 MeV
s —3 0 -1 0 0 0 60 ~ 170 MeV
c 0 0 +1 0 0 1.47 ~ 1.83 GeV
b -i 0 0 0 -1 0 4.6 ~ 5.1 GeV
t +1 0 0 0 0 +1 174.3 ± 3.2 ± 4.0 GeV
Table 1.1 Quarks (Q: electric charge, 73: 3rd component of isospin, 5: strangeness, C: charm-
ness, B: bottomness, T: topness. These quantum numbers change their signs for anti-quarks.)
Q Le w LT mass
e -1 +1 0 0 2* 0.511 MeV
Ve 0 +1 0 0 <3eV
M -1 0 +1 0 =* 105.66 MeV
Vy. 0 0 +1 0 < 0.19 MeV
T -1 0 0 +1 =* 1777.0 MeV
vT 0 0 0 +1 < 18.2 MeV
Table 1.2 Leptons (Q: electric charge, L e : electron number, L M : muon number, L r : tau number.
These quantum numbers change their signs for anti-leptons.)
generations, for instance u, c, t have exactly the same quantum numbers. The only
property to distinguish different generations is the difference of the masses of quarks
and leptons, depending on the generation. The electric charges of upper and lower
components of quark doublets are + | and — | , respectively, while those of lepton
doublets are 0 and — 1, respectively. At the first sight, such charge "quantization"
seems to be miraculous. The quantization, however, is known to be well-suited to
the anomaly-free condition for the standard model as explained in Chapter 4.
Gauge bosons having spin 1 are mediators of interactions between quarks or
leptons. Interaction strength depends on which gauge bosons propagate between
quarks or leptons. Electromagnetic, weak and strong interactions are mediated by
photons 7, weak bosons W±, Z° and gluons g, respectively. A photon does not
couple to itself, while gluons and weak bosons have self-couplings, i.e. can couple to
themselves. This is due to the fact that the electromagnetic interaction is described
by the Abelian gauge theory, while the strong and weak interactions are by the
non-Abelian gauge theories. Among those gauge bosons shown in (1.3), a photon
7 and gluons g are massless and hence their interaction ranges are infinite, though
actually the argument on the strong interaction does not go so straightly because of
the non-Abelian nature of color interactions and quarks are confined inside hadrons
with the range ~ O(10~ 15 m). On the other hand, the weak bosons W±, Z° are
4 INTRODUCTION
massive and their interactions are very short-range, such as —-— ~ C(10 _ 1 8 m).
The Higgs boson with spin 0 is introduced for the Higgs mechanism to work,
which is operative in the theories with spontaneous symmetry breaking of local
gauge symmetries as will be discussed in Chapter 3. In the Higgs mechanism a
larger symmetry is spontaneously broken into a smaller symmetry through the vac-
uum expectation value of the Higgs field and accordingly (a part of) gauge bosons
become massive. The masses of not only gauge bosons but also all quarks and
leptons are originated from the spontaneous symmetry breaking in the standard
model, though neutrino masses are a little bit controversial. The real understand-
ing of the mechanism of the spontaneous breakdown and the Higgs mechanism is
still extremely challenging problem to be solved in field theories. Obviously, the dis-
covery of the Higgs boson in experiment will provide a crucial hint to the problem.
However, so far there is no evidence of the production of the predicted Higgs boson
in high energy reactions. To uncover the Higgs boson is the most important issue
to finally establish the standard model or even to search for the physics beyond the
standard model, "New Physics". This hopefully can be performed in the early time
of this century.
The theoretical description of weak interaction processes was first worked out by
Fermi in 1933 for the /3-decay of nuclei (Fermi, 1933). The following two decades saw
the refinement of the explicit form of the current-current interaction first suggested
by Fermi. The discovery of parity violation by Wu et al. in 1957 (following the
theoretical work of Lee and Yang) led Feynman and Gell-Mann and also Marshak
and Sudarshan to the vector minus axial vector V — A structure of the weak current
in 1958 (Feynman and Gell-Mann, 1958; Marshak and Sudarshan, 1958). In 1961,
Gell-Mann and Neeman independently introduced the 517(3) symmetry of strong
interaction (Gell-Mann and Neeman, 1964) and in 1963, Cabibbo (Cabibbo, 1963)
used the hypothesis that the weak currents of hadrons have definite SU(3) transfor-
mation properties to describe the decay of strange particles in the current-current
coupling scheme.
In this Chapter, after touching on the parity violation in weak interaction pro-
cesses, we describe the development of weak interactions following the above histor-
ical chronology. We also discuss some important subjects in the Fermi theory, i.e.
the lepton current universality, pion decays and Cabibbo currents. Unfortunately,
the Fermi theory is not complete. It has serious fundamental difnculies, i.e. unitar-
ity violation and non-renormalizability. These difficulties will be discussed in the
subsequent section. Finally, we will give some discussions on the intermediate weak
boson model, which is an alternative model of the Fermi theory, as the preliminary
discussion of the Standard Model.
The weak interaction among elementary particles was first discovered in the /?-decay
of nuclei. Basically it is realized as a decay of neutron n in the nucleus into a proton
p, an electron e~ and an anti-neutrino 9,
n -> p + e~ + v. (2.1)
7
8 WEAK INTERACTION
The life of a free neutron due to the /?-decay is about 900sec which is extremely
longer than that of other particles induced by strong interaction like r(p° ->
7r+7r-) ~ 10~24sec or electromagnetic interaction like r(7r° -> 27) ~ 10~ 18 sec.
Since the nuclear /^-decays share common nature with other weak processes, they
are classified into weak interaction processes, though the neurton life-time is much
larger than that of typical weak decays like r(A° -> pK~) ~ 10~ 10 sec.
y*
mirror
60
Co
Fig. 2.1
interaction processes, the effect should appear even in the /?-decay of nucleus. They
proposed to observe the correlation between the spin J of 60 Co and the momentum
pe of elecron produced in the following /3-decay,
Fig. 2.2
nucleus is described by
e j H # = e[upVp]#, (2.3)
where e is the electric charge of proton and up and A^ are the Dirac spinor of
proton and the photon field, respectively. The factor j ^ = u P 7 M u p is called an
electromagetic current of proton. To describe the /?-decay process n —> p + e~ +i>,
Fermi replaced A>* and the electromagnetic current jjf by a 4-vector weak lepton
current jfv_^e\ — u e 7 M u„ made of e~ and v fields and the weak nucleon current
j(p-->p) _ Up^^-Un of p and n fields, respectively, as
where G is the weak coupling constant being much smaller than the electromagnetic
coupling constant e. (Actually GE2 <C e2 for moderate energies E.) That is, the /?-
decay interaction is described by the product of two weak currents; nucleon current
j)j n-> and lepton current ffv^e\, in which 4 fermions p, n, e~ and v couple at
the same space-time point. There is no propagator connecting these two currents
and this is apparently different from electromagnetic processes where, for instance,
proton current and electron current are mediated by a photon with its propagator.
In addition to a big difference in magnitude of G and e, it is remarkable that the
weak nucleon and lepton currents are charged,* while the electromagnetic currents
are neutral.
Taking account of the parity violation in weak interaction, Fermi's original idea
for the /3-decay n —> p + e~ +9 can be generalized by writing the weak interaction
Hamiltonian for this process as (see, for example, Jauneau, 1977)
G
Hw =
^2~5~ [^P0^] [i'eOiil + CJ7 5 )VV] + h.C, (2.5)
where Cj can be taken to be ± 1 since the parity violation is maximum and actually
"Long later, in 1973 the weak neutral current was also discovered with neutrino elastic scattering
v + e~ —> v + e~ and deep inelastic scattering v + N —> v + X and P + N —>P + X a s predicted
by the Glashow-Weinberg-Salam(GWS) theory.
The Fermi Theory of Weak Interaction 11
Os = 1, Scalar(S')
Ov = In, Vector (V)
Assuming the time reflection invariance in the /?-decay, we can take Gi to be real.
Then, the remaining task is to determine experimentally which type of interaction
among (2.6) works in the /3-decay.
. , . ., / ff 0 \ . _ / 0 3 \ _ -. ^.
a 4 x 4 matrix a' = I _ , one can write a = _ as a = 75CT = a 75,
\ 0 aJ \ a 0 J
where 3 is the Pauli matrix. Now using 7! = 1, one can obtain the following
equations from (2.8)
3' • p 3' • p
i> = 75^, ihi> = ip, (2.9)
p p
t Recently, due to observation of neutrino oscillation, it has been strongly suggested that neutrinos
are massive. But in those days neutrinos were considered to be massless and here we treat
massless neutrinos. In any case, neutrino mass has to be extremely small even if it is massive
and it is a good approximation to consider the massless neutrinos.
12 WEAK INTERACTION
Then, by taking addition and subtraction of two equations in (2.9) and introducing
the helicity operator defined by h = (<? • p)/2p, we obtain from (2.9)
Comparing (2.10) with (2.11) we see that helicity h is the same as chirality 75 for
massless fermions. This is true even in the 75-diagonal representaion. (see Appendix
A.5.)
Now let us consider a free electron with nonzero mass m. We are interested
in the effect of the projection operators PR,L on the Dirac spinor of the electron,
which is the solution of (2.7)
(2.12)
an
where iV is the normalization factor and x = ( n ) d I ) represent that the
electron spin is parallel and anti-parallel to the z-axis, respectively. With (2.12),
we obtain
1+
^>), fc^f/'-^V (2,3)
Now let the electron momentum p be along the z-axis, for simplicity. Then, we can
calculate the expectation value of helicity h^ of electron as
•* Though according to the definition of helicity operator, the face value of helicity eigenvalue for
fermions with spin | i s / i = ; o r / i = - 1 , the two times value of it is conventinally used and we
follow this convention in this textbook.
The Fermi Theory of Weak Interaction 13
where the sign +(-) corresponds to IJ)R(L)- As § equals to v (in the natural unit
c = 1), with v being the velocity of electron, P^ approaches to ± 1 for relativistic
electron with v ~ 1. In other word, PR(L) picks up approximately the h^ = +1(—1)
state for a high energy electron(see also Appendix A.5).
= •IpeOi — - ( 1 + CiJ5)lp„
= &0<(1=F*)^V, (2-15)
where we used the relation 7^75 = —757^. The sign in the right-hand side of (2.15)
shows that — is for i = V,A and + for i = S, T, P. This equation is very important
in order to determine the type of Oj. If neutrino helicity is measured to be —1, one
can say that the interaction type must be V or A or a combination of them, and Cj
becomes — l(c» = + 1 is not allowed). On the contrary, if it is + 1 , the interaction
type should be S, T, P or a combination of them, and Cj become +l(cf = —1 is not
allowed). Therefore, the next step is to know neutrino helicity which is necessary
to determine the type of weak interactions.
In this process, the produced neutrino cannot be detected because it is neutral and
has no electromagnetic interaction.
Now, let us consider a photon which runs only in the direction parallel to
152
Sm(0 + ) in the final state. If we take the direction of running neutrino as +z-axis,
we see that only two cases of spin alignment are allowed in this process:
152 152
Eu((T) + e" -> Sm(0+) + 7 + 1 /
(i) 0 -\ 0 - 1 +\ (2.17)
(ii) 0 + | 0 + 1 -|
2.1.2.5 V — A interaction
Experimentally, Gy was determined from the decay 1 6 O(0 + ) -» 1 4 N(0 + ) + e + + v.
In this process, the spin of initial and final nucleons does not change and the spin
sum of e + and v is 0. Thus, the process is Fermi transition and hence only Gy in
(2.18) can contribute to this decay and then from this decay width one can obtain
the value of Gy; when we define Gp = Gy/y/2, we obtained
where mp is the mass of proton being the order of 1 GeV. Using this value, GA can be
obtained, in principle, from the /?-decay of neutron, where the Fermi transition and
the Gamow-Teller transition coexist. There was a long history of this measurement
and now we have gA = \GA/GV\ — 1.26. Unfortunately, the relative sign of Gy
and GA, cannot be determined from unpolarized experiments. To determine it, one
must study the polarized experiment.
Now, let us consider the /?-decay of neutron whose spin is polarized to the +z-
axis and let electron and anti-neutrino momenta be given by pe and pp, respectively.
Since the mass difference between neutron and proton is tiny, the produced proton's
momentum is very small and thus, it is a good approximation to treat the nucleon
current nonrelativistically. First consider the first term of (2.18), i.e. the Fermi
transition term. Using the nucleon wave function which is of the same form as (2.12)
with almost equal mass mp w mn « m for proton and neutron, we see that 'ipp'yoipn
is of 0(1), while 'ippjipn is of O(^) in the nonrelativistic approximation. Thus,
we can neglect the contribution of 'ipp^ipn and the remaining term is only "tpp^o^Pn
which can be written as xPXn in the nonrelativistic approximation using the Pauli
spinor Xp(n) °f proton(neutron). It has no spin flip operator a and therefore, in
this case only two patterns of the spin and momentum arrangement for particles
16 WEAK INTERACTION
it it H + a- it it + ij. + it
+ I
n P e-
(a) (b)
it —• a- + it + it
n p
(c)
Fig. 2.3 Spin arrangement in the neutron /?-decay. Long arrows show the momenta of e~ and Pe
participating in this /3-decay can be allowed as shown in Fig. 2.3(a) and (b). Next,
let us consider the second term of (2.18), i.e. the Gamow-Teller transition term.
This time the same wave function of nucleons leads to that 1/^775V"n is of 0(1), while
V'P7o75'0n i s of 0(%)- Thus, we can neglect V>p7o75Vv»- Only ^,775 VVi contributes
to this transition. ^775^/>n results in x\<?Xn in the nonrelativistic approximation.
ForCT3term in xl^Xn, the spin and momentum arrangement of particles are same
as shown in Fig. 2.3(a) and (b). However, o\ and 02 terms contribute to spin flip
and thus we have the spin and momentum arrangement as shown in Fig. 2.3(c).
After all, in the nonrelativistic approximation we have the following interaction
Hamiltonian for the /?-decay of neutron.
Hw - Gv [xlXn] [vtlv]
+ GA [xl^Xn] [vtffriv] + h.c, (2.20)
where ^ 2 L t/'i/ i n (2-18) is replaced by r\v being the two-component neutrino wave
function with helicity h^ — — 1 and r]e is also the two-component Weyl spinor
of electron with helicity h^ = — 1 (see (A.53) for the notation of two-component
Weyl spinors). Now, let us calculate the matrix element for the process presented
in Fig. 2.3(c) where the neutron spin flips. The spin flip is originated from the
following terms in (2.20),
where a± — (a\ ± icr 2 )/2 and actually only the 2nd term of (2.21) can contributes
to the spin arrangement presented in Fig. 2.3(c). We can parametrize this matrix
element as
M ( c ) = 2GAF, (2.22)
The Fermi Theory of Weak Interaction 17
where F is a factor containing all dynamics of this decay process except for the
coupling constant GA-
Next, let us consider the case of Fig. 2.3(a). This time, the 1st term of (2.20)
and the 173 term in the 2nd term of (2.20) contribute. In Fig. 2.3(a), the electron
spin being in parallel to the neutron spin is along the +z-axis. Hence 03 works as
+ 1 in the matrix element and we can write the matrix element for this process as
For the case of Fig. 2.3(b), since the electron spin is along the —z-axis, we obtain
the following matrix element,
To the contrary, the probability for an electron being emitted in anti-parallel to the
neutron spin, which is given by the sum of Fig. 2.3(a) and (c), can be calculated as
If Gv « GA, then P(S t Pe t ) w 0> which means that an electron cannot be emitted
in parallel to the neutron spin. If Gv ~ -GA, then P(a \ pe f) « -P(<? t Pe I),
which means that the number of electrons being emitted in parallel and anti-parallel
to the neutron spin is almost same. The latter case was observed in experiment
and then it was finally established that the weak interaction Hamiltonian for the
/?-decay was described by the V — A current-current interactions as summarized as
follow;
process so far.§ Existence of the particular decay mode [i~ —> e~ + i>e + v^ and
non-existence of the decay mode fi" —> e~ + 7 led that in Nature there are additive
conserved lepton numbers: the electron number Le (Le = +1 for e~~, ve and Le = — 1
for e + , 9e) and the muon number LM (LM = +1 for [i~, i/M and LM = - 1 for / J + ,
PM). e~ interacts with ue alone and ^~ interacts with v^ alone. Later, the third
lepton members, the tau r and its neutrino vT (tau neutrino) were also dicsovered
with additive conserved tau number LT (LT = + 1 for r~, uT and LT = — 1 for T+,
vT). Now we have three families of leptons as
The universality of weak interactions is expressed by writing the total weak current
of leptons as a sum of an electron, a muon and a tau current with equal weight as,
where the wave function of each particle is written by the symbol representing the
particle itself and V is a 3 x 3 diagonal matrix given as follows;
(2.31)
where u and t; denote the spinors for particles and anti-particles, respectively. The
weak coupling constant GM can be determined from measured value of the life-time
of muon. Since the muon decay involves no complication due to hadronic currents,
we can accurately determine the value of GM.
§Only upper bound of the branching ratio for this decay mode, being extremely small, is known:
Br(n -» e + 7) < 1.2 x 1 C T n .
The Fermi Theory of Weak Interaction 19
dR (2 )4j(4)(
=m ( ^ (2^X - ^ -» - ^ - **•>
X 2
- ^ ^ ^ ( ^ - ^ - ^ J ^ - p e - A ^ ) ) - (2.34)
(2TT)5 2 £ e 2w„.
In (2.34), 0 is the step function, 6(x) = 1 for x > 0 and = 0 for x < 0. \M\2
denotes the probability of spin-averaged for initial particles and spin-summed for
final particles,
spin
1 G2
= 2~2~ ^ N M 7 P ( 1 ~ Tto)u(ft»)u(P/*)7<r(l - 7te)«(*^)]
spin
= -fTr[^7p(l-75)(^-^)7„(l-75)]
x Tr^7"(l-75)^7'(l-75)]
= 64G2(Av.-pM)(^-pe), (2-35)
where the electron mass was neglected because it is very small as me < m M /200. In
the rest frame of muon p = (mM, 0,0,0), we have {kUt -p^ik,,^ -pe) = \u)Vi,m^rri^ —
2mliuje) and thus we can write the decay rate as
where 6 is an angle between produced electron and anti-neutrino. Using the relation,
dzped3kVe = 4-!rE2dEe •2-noj2tdiJjVedcos6', and integrating over cos# using 6 function,
we obtain
G2
dT = -ArdEedujUcuVc{ml - 2m M w,J. (2.37)
Prom the condition — 1 < cos# < 1, the energy u>„e and Ee are constarined as
m M /2 — Ee < OJVC < m M /2 and 0 < Ee < m M /2, respectively. Then, the energy
20 WEAK INTERACTION
dT _ G\mlEl /, 4Ee
(2.38)
dEe 47r3 V 3 m,,
The total muon decay rate is obtained by integrating by Ee
where the maximum of electron energy is E™ax = m M /2. Using TM = 2.2 x 10 6 sec
and mM = 105 MeV, we get
Comparing (2.40) with (2.19), we see that GM for muon decays is almost equal to Gp
for nuclear /3-decays and thus the weak coupling constant looks universal. Therefore,
the origin of the weak interaction seems to be same. In fact, by taking account of
radiative corrections (Roos and Sirlin 1971; Beg and Sirlin, 1982), we see that the
values of GM agrees with the one of G$ within only about 2%. However, it is very
important to notice this small difference between couplings in lepton currents and
hadron currents, which led Cabibbo to the splended idea of the so-called Cabbibo
mixing currents discussed later.
Before coming to the discussion of Cabibbo currents, we just consider the decay
of the third lepton r. The decay width of r is similarly calculated as in the muon
case discussed above. This can be done by just replacing mM and GM by mT and
G r , respectively. Then, using the relation
GM a mM 5Br(T~ ^e-9evT)
T T M } { (
- G T mT> B r V - > e - P e I V ) ' '
and the experimental data of mT = 1776.99 MeV, mM = 105.658357MeV, rM =
2.19703 xl0- 6 sec, r r = 2.906 xl0- 1 3 sec, BT(T~ ->• e~vevr) = 0.1784 and Br{n~ ->
e~~veVn) = 100%, we can obtain Gr/Gf,. = 1.001. The result confirms again the
universality of the weak interaction.
In summary, the near equality of coupling constant involved in the weak leptonic
decay of fi, r leptons and in the /3-decay, i.e. GM « GT w G/3 shows that there is
a universal weak interaction constant G in all weak processes. In other words, the
value of weak charge is universal and therefore, it is customary to represent this
universal coupling constant G by Gp called the Fermi constant.
For these decays, there are two remarkable experimental findings; (l)the observed
li~ in the process (2.42) was right-handed. This observation results in the conclusion
that the v^ is also right-handed, since in the rest frame of ir~~ being spinless, \T
and Up run into opposite direction with equal momentum and because of angular
momentum conservation, two decay particles must be right-handed. This result is
cosisitent with the (1 — 75) factor in the lepton current. (2)The observed ratio of
branching ratios of (2.43) to (2.42) is extremely small, i.e.
r ,r- -> e- + v )
( e w x lo_4
r(7r- -> n~ + i/M)
This is completely contradictory to the prediction of the phase space effect. The
origin of the strong suppression must be dynamical. Actually, this is due to the
V — A structure of the lepton current. In fact, since ve is right-handed, the emitted
electron must be also right-handed because of angular momentum conservation. But
this is the "wrong" helicity state for light electron, because a fast moving electron
should be left-handed as described in 2.1.2. This is not the case for a muon; a muon
emitted in the pion decay is rather heavy and has the right-handed component, i.e.
the "right" helicity state.
Let us go to a quantitative discussion. Assuming the V — A interaction, the
matrix element for these decays ir~(q) —>• £~(p) + vt(k) where I — fj, or e and
momenta of TT~, £~, Di are given in parentheses, is written by
M=
71 < 0|J*~|7r~(<?) > ^ f r ) ^ 1 - -*)""«(*)' ( 2 - 45 )
where < 0| J* \ir~(q) > contains all strong interaction effects, being called the pion
decay constant /„•, expressing the effect of the pion decay into vacuum, and must
be vector or axialvector so that this factor can contract with the lepton current to
make M be Lorentz scalar. Since a ir~ is spinless, available 4-vector is q alone.
Then we can write
<0\Jf\7r-(q)>=qiifn. (2.46)
Substituting (2.46) into (2.45) and using q = p+ k, we can write the amplitude as
Q
M = - ^ f e +W-[^(P)7 M (1-75K,(A;)],
Q
= -7^m,ifvui[p){\--f5)vvt{k), (2.47)
22 WEAK INTERACTION
where in going from the 1st line to the 2nd line, we used the Dirac equation for a
lepton £ with mass me and the one for a neutrino vt with zero mass. Then, using
the standard technique taken in the calculation of the muon decay width described
in the previous section, we obtain the result
Y{,-^e-+Ve) _fmeYfml-ml\2^128xl0_^ ( J M g )
r(7i— ->• fi- + i/p) V m M/ Vmf-mM
which fantasitcally agrees with the experimental result of (2.44). Furthermore, from
the observed life-time of TT - , T = 2.6033 x 10 _8 sec, we can estimate the value of the
decay constant f„ as
U ~ 0.92m,,. (2.50)
jM=aJ^+bj£\ (2.51)
To find the value of the Cabibbo angle 0C, we compare the decay width of n~ ->
fi~ + Vy, and K~ -> \iT + v^. The decay width for TT~ ->• \i~ + v^ is given by (2.48)
in which G is replaced by cos6cG by taking account of the Cabibbo mixing effect
presented in (2.53). The decay width for K~ -> fi~ + v^. is also given by replacing
mn, f„ and G by TTIK, JK and sin# c G, respectively. Then, we have
m
T(K~ ->/i- + v») singg / J ^ C1 ~ ^ )
Assuming the 51/(3) symmetry for meson decay constants of fx = fn and using
the experimental result(« 1.335) for the lhs of (2.54) and the values of mM, mn,
TTIK presented in the particle data table (Particle Data Group, 2002), we obtain
t a n 0 c ~ 0.275, (2.55)
which leads to sin# c « 0.26. More detailed analyses were carried out with the
results, sin0 c = 0.220 ± 0.002 from K~ ->• TT° + e~ + 9e (Braun et al., 1975) and
sin^ c = 0.231 ± 0.003 from A ->• p + e~ + ve (Bourquin et al., 1983). Because of
the small values of 9C, those decays whose amplitudes are proportional to cos# c ,
are known as Cabibbo-favored decays, while those with amplitude proportional to
sin0 c are Cabibbo-suppressed decays.
Let us now understand Cabibbo's proposal at the quark level. The /3 decay
process n —> pe~ue at a quark level is d —> ue~ve (Fig.2-4 (a)): one of the d quarks
in the neutron (ddu) transforms into a u quark with remaining d and u quarks as
spectators. In contrast, in the decay process A0 —> pe~ue in which A0 has quark
content uds, the strange quark in A0 transforms into a u quark (Fig.2-4 (b)). Again
this involves charge changing weak currents but in this case there is also a change of
strangeness at the baryon vertex and hadronic current is therefore called strangeness
changing or AS = 1 weak current. The quark current has the same V — A structure
Gcos6,
AS = 1
(b)
Fig. 2.4
be transformed into their partners in the same doublet; e.g. e «-» ve for
and (j, f> i/,, for )• Similarly, we can group the quarks into families like
But for quark transitions, they are observed not only within a
family but, to a lesser degree, from one family to another via Cabibbo mixing.
Therefore, for charged currents of quarks, the "partner" of the flavor eigenstate u
is not just the flavor eigenstate d but a linear combination of d and s, which is
called d'. Similarly, the "partner" of the c quark is another linear combination of
d and s which is orthogonal to d' and is called s'. The coefficients of these linear
combinations can be written by using a single mixing parameter 6C as
cos 0C sin 6C
U with U= (2.58)
— sin6r cos6r
The Fermi Theory of Weak Interaction 25
Fig. 2.5
phenomena of the observed charged current weak interactions. The Fermi theory of
its simple and elegant form could not only explain many experimental data of decays
but also matches well with the physical principle such as universality, algebraic
properties embodied in Cabibbo currents, and so on.
However, it is now also well known that the Fermi theory cannot be a complete
theory, even if it works well as phenomenology. When we apply it to scattering
processes, the lowest order approximation violates the unitarity bound. Further-
more, unlike the quantum electrodynamics for electromagnetic interactions, the
Fermi interaction is not a renormalizable theory and thus we cannot manage those
higher-order contributions to yield a finite outcome.
from which, by neglecting the electron mass me and following the standard technique
for calculating cross sections, we can obtain the differential cross section as
dQ 64?r2s 4TT2 V ;
where s = (k+p)2 — 4E2 = 4fc2 (E(k) is the CMS energy (momentum) of the ve+e~
system). Integrating over the solid angle fi, we obtain the total cross section for
this process
a = ^£Ls. (2.64)
7T
The Fermi Theory of Weak Interaction 27
Fig. 2.6
On the other hand, as is well known, every scattering cross section can be decom-
posed into partial waves. For spinless particles, we have in general
1 oo
da 2 -^(U+VftPticosO) (2.65)
dQ, = \f(0)\ 2ik
1=0
where fi is the partial wave amplitude for an orbital angular momentum I. For
the pointlike Fermi interaction, only /o contributes to ve — e~ scattering (2.61) and
there is no angular dependence (with P0(cos9) = 1). Then, we expect to have
da 1
AE2 I/of-
(2.66)
dn
However, since unitarity requires \fi\ < 1 for every partial wave, we obtain the
upper bound for the diffrential cros section as
da_ _ 1 _
(2.67)
dn ~ AE2'
Then, the total cross section is also bounded similarly as
da da -K
—raSJ = ATT—T < —2r . (2.68)
' • / dCl dQ ~ E
Therefore, the prediction of (2.64), being in the lowest order of the Fermi interaction,
violates unitarity bound at some high energy
Fig. 2.7
2.1.8 Non-renormalizability
Another serious difficulty in the Fermi theory is that it is not a renormalizable
theory, which is very different from the case of quantum electrodynamics(QED),
being renormalizable. As is well known, we have divergent integrals originating
from higher-order diagrams even in QED as well. However, in QED we can remove
those divergences by renormalizing the mass and charge into the observed physical
values. On the other hand, in the case of the Fermi interaction, if we try to follow
the renormalization procedure analogously as in QED and go to the next higher
order diagram, we will encounter a more serious divergence, which needs further
renormalization constants. Then, when we consider all higher order diagrams, we
need an infinite set of renormalization constants. That is, the theory cannot be
renormalized. (see, for example, Aitchison and Hey, 1989)
By taking the reaction ue + e - -» ve + e~, let us see what is going on. The
lowest order diagram for this reaction is given in Fig. 2.5, which contributes to
the amplitude with the order of GF- The 2nd order correction comes from Fig.
2.6. There we have two fermion propagators, each one behaves as 1/k for large
internal fermion momentum fc, and thus due to these internal fermion propagators
we have a loop integral contribution to the amplitude with GF f^r, which is di-
vergent. Similarly, the 3rd order diagram (Fig. 2.7) contributes to the amplitude
with G3F(f^r)2, which is more seriously divergent. After all, we have a series of
expansion with GF f^r> where the higher order terms yield more and more severe
divergence. This is because GF has a dimension of [mass] - 2 ; each term has a new
GF in the expansion series, which must be compensated by a divergent factor / ^ r
with [mass]2 dimension in each term to keep the dimension of the amplidude.
One of the interesting ideas to attempt to take away those difficulties is the in-
termediate weak boson exchange model. The weak coupling constant G(= GF)
appearing, for instance, in (2.27), (2.32) and (2.62) has a dimension of [mass] -2
(see (2.40)). This fact suggest us that we can develop further an analogy between
the weak interaction and the electromagnetic interaction by postulating that the
Intermediate Weak Boson Model
Fig. 2.8
1-75 9 _ 1-75
Af = - >1P—IT- u
u (2.70)
q2~Mw LV2 U e 7 f f U
W 2 "
instead of (2.32), where g is the dimensionless coupling constant and q represents
the momentum of an intermediate W boson with mass Mw- This form is quite
analogous to the electron-proton scattering amplitude mediated by a massless pho-
ton
We see that a photon propagator between two electromagnetic currents with charge
e is replaced by a W-boson propagator between two charged currents with coupling
constant A=. Since the Fermi interaction works as a contact interaction at zero
range, the W boson must be very heavy, unlike a massless photon. Furthermore,
at low energy where q2 is much smaller than M ^ , the Fermi interaction works very
well and thus (2.70) should reduce to (2.32) with
GF
(2.72)
v/2 8M£/
From (2.72), the fact that the weak interaction is weak with small GF is considered
to be not because the coupling g is small, but because Mw is large with g w e. This
suggests that the electromagnetic and weak interaction might be formulated into a
unified theory. In fact, later this idea was beautifully realized in the non-Abelian
Gauge theory called Glashow-Weinberg-Salam(GWS) theory, which is the one of
main subjects of this textbook and will be discussed in detail in the next Chapters.
But before coming into the GWS theory right now, it is instructive to learn a few
things about the intermediate weak boson model. One is related to the problem
30 WEAK INTERACTION
W+
Fig. 2.9
of unitarity bound. As discussed above, the Fermi interaction, i.e. the contact
interaction leads to violation of unitarity bound. Then, can the intermediate weak
boson model cure this disease? The answer is NO. For example, let us consider the
process, in the lowest order, (Fig. 2.9)
which is predicted to exist in the intermediate weak boson model. In (2.73), the
momentum of each particle is given in parentheses, and A and A' denote the polar-
ization of W+ and W~, respectively. The invariant amplitude for this process is
given by
W> 2 (p-ky-ml 1 7
2 'MP),
(2.74)
where e^x\k) and e'A \k') are the polarization vector of W+ and W~, respectively.
From this amplitude, one can calculate the differential cross section at high energy
as
da U
F „;„2 QT?2
(2.75)
dtt 8^Sin 9E
'
where E and 6 are the energy and the scattering angle of the incoming u^ in the
CMS, respectively. Integration over the solid angle results in the total cross section
a = I2n" (2.76)
where s = AE2. The cross section increases with s and hence violates the unitarity
bound at some high energy. Therefore, the disease of unitarity violation cannot be
cured even in the intermediate weak boson model.
Another difficulty, i.e. the non-renormalizability of the Fermi interaction also
cannot be solved by the intermediate weak boson model. To see this apparently, let
us consider an example of scattering presented in Fig. 2.10. In this figure, (a) is the
Intermediate Weak Boson Model 31
(a) (b)
Fig. 2.10
two photon exchange correction to the QED process, e + + e~ —> e+ + e~, while (b)
is the two W boson exchange correction to the weak process, ve + e~ —» ve + e~.
For the photon exchange diagram (a), since the photon propagator behaves as 1/k2
for large internal momentum k, we have a loop integral J^r for this amplitude,
which is finite. On the other hand, for the W boson exchange diagram (b), since
the W boson is massive and its propagator has a longitudinal component
1 i fZ^Kv.
(2.77)
Problems
2.1 Show that the coupling constant d in (2.5) must be real if the interaction
Hamiltoniam (2.5) for the /?-decay is time-reflection invariant.
32 WEAK INTERACTION
2.3 In the nonrelativistic approximation, one can see that the Fermi transition for
/?-decay processes occurs for O, = V and A, and the Gamow-Teller transition does
for Oi = A and T, while Oi — P can contribute to the Fermi transition only in the
order of ^ and thus can be neglected. Show these results using the nonrelativistic
expression for the Dirac spinor of nucleons.
One of the most fundamental principles in particle physics is that interactions among
fundamental particles are described by symmetry principles. The invariance of the
Lagrangian under certain symmetry transformations leads to a set of conservation
laws. In addition to space-time symmetries such as Lorentz invariance, parity invari-
ance, time reversal invariance etc., the internal symmetries such as isospin, flavor,
color etc. have been also known. The development of our understanding about
the fundamental interactions is based, to a large extent, on our understanding of
such underlying symmetries of Nature. It is remarkable to know that there is a
connection between exact symmetries and conservation laws and to see that the
requirement of local gauge invariance can serve as a dynamical principle to guide
the construction of interacting field theories.
In this Chapter, we first discuss the relation of the global symmetry to the
Noether's theorem, which connects the invariance of the Lagrangian under a con-
tinuous symmetry transformation to a conservation law and the conserved quantum
numbers. Then the local gauge theory will be developed with an example of quan-
tum electrodymanics(QED), being an Abelian gauge theory with U(l) symmetry.
A non-Abelian Yang-Mills gauge theory is also discussed with an example of SU(2)
symmetry. Furthermore, an idea and an important result of the spontaneous sym-
metry breaking(SSB) and the Goldstone theorem will be discussed. Finally, we will
discuss the Higgs mechanism, which is realized in the mechanism of SSB with local
gauge symmetries.
In the field theory, fields are used to describe the fundamental particles. Dynamics
among those particles are given by the Lagrangian density* C((p(x),d,j.(f>(x)) which
depends on the field <f>{x) and its derivative d^^x). Fields are arranged so as to
realize various symmetries mentioned above. The Lagrangian formalism provides
"The Lagrangian density is often called, simply, Lagrangian. We follow the same usage.
33
34 SYMMETRIES AND THE GAUGE THEORIES
We define the "momentum" fields which are canonically conjugate to field variables
(f>a(x) as
na(x) = , (3.3)
0{Oo<Pa)
and assume finally the canonical commutation relations to quantize the system
- a>(mk>H')- (36)
-
by using the Euler-Lagrange equation (3.1). When the transformation parameters
are independent of the space-time point x, it is called the "global" transformation.
As an example, we introduce a transformation under a special unitary group G =
SU(n) as
6<j>a = -ieXbMx) {4>a -> 4>'a = e~i(>iTL<Pb), (3-7)
using x-independent infinitesimal parameters 01 and a n n x n matrices 2", where
a,b = 1,2, • • •, n and i = 1,2, • • •, n 2 — 1. Here the summation over over i and also
tFor simplicity, we consider the case of boson fields for the moment. For the case of fermion fields,
we need no change in the following discussion if commutation relations (3.4) and (3.5) below are
replaced by anti-commutation relations.
Global Symmetries and Noether's Theorem 35
over b is implied. Tl is called generators of the SU(n) group. Then, when we define
the current
ffl5-^* <3 8)
'
(3.6) can be written as
SC = 6idll3^(x). (3.9)
Thus, if the transformation (j)a —• </>a + 6^>a leaves C invariant, i.e. 8C = 0, we have
the current conservation law,
d^(x) = 0. (3.10)
and by assuming for jl(x) to vanish at spatial infinity, we are led to, by using the
Gauss's theorem, the charge conservation law,
? = "• <3-12'
where the "charge" is defined by
One can see that a symmetry of the Lagrangian under a field transformation implies
conservation laws; this is called the Noether's theorem (Noether, 1918) and the
current jlfi(x) is called Noether currents.
In quantum theory, we have the Heisenberg equation
dJ^x)=i[P^r{x)], (3.14)
- ^ = i[H,Q'} = 0, (3.15)
with H = Po, which is the Hamiltonian. Therefore, it turns out that Ql commute
with Hamiltonian H and the states with Q' (i — 1,2, • • •, n2 — 1) degenerate with the
same energy so as to make a multiplet. Using the canonical commutation relations,
one can easily show that the charges Ql satisfy the following relation, which is the
Lie algebra,
since the generators Tl of SU(n) always satisfy the same Lie algebra,
[T*,T']=ifijkTk, (3.17)
where /f^ is the structure constant of the Lie group and totally antisymmetric with
respect to i, j and k. That is, the charges Q' satisfy the same Lie algebra as the
generators Tl of the group SU(n) do. Furthermore, for an infinitesimal # ' ( < 1),
one can derive the following relation, using canonical commutation relations of (3.4)
and (3.5),
with
are classified into two ways, (1) manifest or unbroken and (2) hidden or sponta-
neously broken. The electromagnetic interaction is unbroken local U(l) symmetric,
the weak interaction is spontaneously broken local SU(2) x U(l) symmetric and
the strong interaction is unbroken local SU(3) symmetric. The non-Abelian gauge
theories of weak and strong interaction is the generalization of quantum electrody-
namics(QED), which is the Abelian local C/(l) gauge theory.
where j M (x) = •0(x)7MV(2;) is the vector current carried by the fermion. Hence, the
Lagrangian is not invariant under the local transformation. Only if d^9(x) = 0, i.e.
6(x) is independent of x, C is invariant. Then, if we make the following replacement
in (3.22)
where e is the electromagnetic charge of the fermion which is negative for an electron.
Thus, if we define a new Lagrangian by replacing 9M in (3.22) by £>M of (3.27)
then this C becomes invariant under (3.25) if at the same time we make the replace-
ment
which precisely cancels the additional unwanted term in (3.26). -DM defined above is
called a "covariant derivative", a terminology borrowed from the general relativity.
From the above observation, one can prescribe the procedure for getting the
gauge invariant Lagrangian in a little different way; starting from an original La-
grangian (3.22) which possesses a global symmetry and making the replacement
(3.27), we can get a new Lagrangian
The requirement (3.32) leads to (3.30), where 6{x) is the ^-dependent infinitesimal
phase parameter. Then, electromagnetic dynamics is made invariant by introducing
a spin 1 vector (gauge) boson field A^ called the photon through the covariant
derivative, which is called the "minimal coupling". It is very important to know that
in the gauge invariant theories the interaction between gauge bosons and particles
(fermions and/or bosons) is uniquely determined only through the minimal coupling.
The Lagrangian which is invariant under local U(l) gauge transformation is,
therefore, given by (3.31). However, this is not the complete Lagrangian for de-
scribing the whole system. We have to add the kinetic energy term and mass term
Local Gauge Symmetries and Gauge Fields 39
of the electromagnetic field Afi(x), which should be also gauge invariant. As for the
mass term, the local gauge invariance necessarily leads to massless photon because
2
the mass term ^A^A1* violates the local gauge invariance, unless m = 0. It is
well known that the kinetic energy term of the electromagnetic field is given by
— \FllvF,iV, where the electromagnetic field-strength tensor FM„ is defined by
F^ = d ^ - dvA„. (3.33)
The coefficient - 1 is necessary for the requirement that the Euler-Lagrange equa-
tion is just the Maxwell's equation. The kinetic energy term is obviously gauge
invariant because SF^ = 0 under the transformation (3.30) as easily checked by
directly introducing (3.30) into (3.33). It is also seen in a little different way; the
covariant derivative satisfies
From (3.32), it follows that ([£>„,£>„] V)' = e-ie(-x\[D^,Dv]ip) and thus F'^ip' =
e-HMF^i/j = F^', that is, F^ = FM„.
Therefore, the complete gauge invariant Lagrangian for the system of an electron
and a photon takes the following form,
The field theory with CQBD is called quantum electrodynamics(QED). The gen-
eralization of classical electrodynamics to describe quantum effects had culminated
in the development of QED as the theory of interaction of photons and electrons.
In QED, the electromagnetic interactions are mediated by massless photons.
Photons are quanta of the electromagnetic fields, being massless with no charge and
with spin 1, and they do not interact amongst themselves. Theory of gauge fields
represents a class of theories which share and generalize characteristic properties
of the Maxwell's theory of electromagnetic fields. There are two characteristics of
Maxwell's theory: (1) Forces created by gauge fields, i.e. photons, are long-ranged
and obey inverse square of distance as realized as Coulomb force. (2) Force is
proportional to the quantum number of its source and that quantum number, i.e.
the charge, is conserved.
Typical physical processes due to the electromagnetic interactions are shown
in Fig. 3.1: (a) a fermion emitting or absorbing a photon, (b) annihilation of a
fermion-antifermion pair to produce a photon, (c) a fermion scattered by another
fermion in a two step process in which a photon emitted by the first fermion is
absorbed by the second.
The principle of gauge invariance has come to be recognized as most powerful
guiding light for our understanding of not only QED but also probably all interac-
tions.
40 SYMMETRIES AND THE GAUGE THEORIES
f r
(c)
Fig. 3.1
where the summation over i is implied, g is the coupling constant associated with
the group SU(2). T* (i = 1,2,3) are three 2 x 2 Pauli matrices and 6l (i = 1,2,3)
are three real parameters corresponding to three generators T1/2, (i = 1,2,3) of the
Local Gauge Symmetries and Gauge Fields 41
group SU(2).
Tk
= ieijkY ( 3 - 38 )
where the summation over k is implied, eijk, being the totally antisymmetric Levi-
Civita symbol with £123 = 1, is the structure constant of the group SU(2). Obvi-
ously these three generators do not commute, i.e. SU(2) is non-Abelian, hence the
gauge theory in question is called non-Abelian. ip is both a spinor and an isospinor.
Thus, instead of a single fermion field, we start with a doublet Dirac field
^ = ( tl ) ' ^ = (^ ^
$a = rl>t'f. (a =1,2)
Originally, Yang and Mills chose " 0 = 1 ) doublet where p and n denote the
proton and neutron, respectively. However, one can also choose ip = I 1 doublet
as well where u and d denote the up-quark and down-quark, respectively. Here, the
coupling strength g is a real constant to be determined from experiment (analogous
to e in QED). We insist that the Lagrangian is invariant under transformation (3.36)
and accordingly we encounter a number of additional complications, due to the fact
that there are now three orthogonal symmetry motions which do not commute with
one another. To construct a Lagrangian that is invariant under (3.36), we must
again define a covariant derivative that transforms in a simple way. In analogy of
QED, it is natural to introduce three independent gauge field potentials acting in
different directions, namely, A^(x) for i = 1,2,3 and require that operation on ip is
only through the covariant derivative
which leads to
D ; = UDpU-1. (3.41)
= U(dpil> - igA^ip)
(3.43)
U = l-i6 + O(02),
U'1 = l + i6 + 0(62), (3.44)
4=4+ey*#4--0M0i, (3.46)
or
w t n e
where F M „ = J2i=i ^Fftv = f 'Fpv ^ g a u g e field-strength tensor F£„ defined
by
(3.50)
or
f'^^uf^u-1, (3.52)
Local Gauge Symmetries and Gauge Fields 43
Note that F^„ transforms nontrivially unlike the case of QED where F^v is invariant
because of no structure constant in an Abelian group. In analogy of QED, we
construct the kinematic term for the gauge field A1^ as
where the 2nd equality is due to the relation Tr(Y-y) = \&x* • One can easily see
that this term is invariant under the gauge transformation as shown in the following;
8{FlvF^v) = 2SF^Fi>"/
= 2ey i f e ^F ( J„F""'= 0, (3.55)
where the last equality is due to the totally anti-symmetric property of Sijk • There-
fore, we can adopt (3.54) as the kinetic term for the gauge field A1.
In summary, we can write down the gauge invariant Lagrangian for a fermion
with mass m in the SU(2) symmetric world as
C = CF + CG, (3.56)
CF = $(x)(i'fDll-m)il>(x),
CG = - ^ ( F ^ - F ^ ^ - ^ F ^ ,
where dp and CG are represent the Lagrangian for the fermion and the kinetic
term of gauge fields, respectively. The covariant derivative is defined by (3.39) with
3 gauge fields A1 (i = 1,2,3). Transformation property of the gauge field A* is
given by (3.46) or (3.47). The gauge field tensor F^v is defined by (3.49) and its
transformation rule under 5(7(2) gauge transformation is given by (3.52). Change
under an infinitesimal transformation (3.44) is given by (3.53). In (3.56), the mass
term of gauge fields does not appear again because it violates the gauge symmetry.
Some comments are in order for (3.56). (1) Contrary to QED, the kinetic term
of gauge fields JCQ contains 3 gauge boson interactions with the same coupling
constant g as the one of the gauge field to fermions. It contains also a 4 gauge
boson interaction with the coupling g2 as follows;
CY = GY${x)ip(x)4>(x) (3.58)
in which the scalar field <j>(x) is introduced for Cy to be invariant under the gauge
transformation. There is no principle to determine the form of the Yukawa term
except for the requirement of gauge invariance. Gy is also totally unconstrained;
there is no other way to determine it from experiments.
Unfortunately, this theory (3.56) is not useful for weak interaction because it
gives identical coupling to right- and left-handed fermions and leads to the parity
conservation. To preserve gauge invariance, it is essential to have massless gauge
bosons A^, which should give weak interactions of infinite range like electromagnetic
case, being contrary to the real physics. Only if the gauge symmetry is broken by the
inclusion of mass term, it becomes possible to achieve agreement with experiment.
Examples are seen, for example, for the case of an approximate symmetry such as
the isospin SU{2) or flavor SU(3) symmetry and so on. This type of symmetry
breaking is useful when the symmetry breaking term is small and the perturbative
treatment is meaningful. In this case, C recovers the exact symmetry for vanishing
^breaking- However, this case is rather artificial because there is no fundamental
principle to determine the exact form of CbreaHng- (2) Another way is the one called
a hidden or a spontaneous symmetry breaking(SSB), where the Lagrangian remains
symmetric under certain group transformation while the physical vacuum is made
non-invariant. It is well-known that there are many examples of the SSB both in
classical and quantum physics such as bent rods under a strong force, infinite fero-
magnets, crystal lattices, superconductors, etc. Here we are interested in the field
theory model of the SSB.
• Goldstone theorem
Spontaneous Symmetry Breaking and Goldstone Bosons 45
The system is equivalent to the one described by the following Lagrangian composed
of 2 real fields 931 and v?2 which are related to (j> and </>* as $ = (<pi + i<p2)/'v/2 and
4>* - (Vi - i<P2)/V2,
c = -d^xd^tpi
-d^d^i + -8^28^2
-i - v(<pl + ft). (3.6i)
(3.61) has the 0(2) symmetry, that is, the Lagrangian of (3.61) is invariant under
the following 0(2) transformation,
/>* W t f W c o s * - s i n 0 W ^
Smd C0S
V ¥>2 / V ¥>2 / V ° J \ <P2 J
and one can see that the 17(1) symmetry is equivalent to the 0(2) symmetry. The
potential V{(j)*4>) = V{<p\ + <p\) must have the following properties for the theory
to be meaningful; V is (1) at most the 4th order of the fields in order to ensure
renormalizability of the theory and (2) bounded below for change of (4>*(j>)1/2 = |^|,
so that the theory has a stable ground state. A typical example of V is given in
(3.60) with A > 0.
In quantum field theories, particle excitations of a field are defined as quantized
fluctuations of the field about its lowest energy state, i.e. the vacuum state. The
constant value of the field corresponding to the lowest energy state is called the
vacuum expectation value (VEV), i.e. (0|(/>|0) = <f>o- To find the particle spectra,
we expand the potential about its minimum corresponding to the lowest energy
state as
+
O 5Z ( a,„ a,„ ) (<Pa. ~ <P0a)(<Pb ~ <P0b) + • • • , (3-63)
vm
+ V2 *"¥>2
Fig. 3.2
Then, we are led to the unique vacuum </>oi = <Po2 = 0. The mass matrix becomes
diagonal in this case
m2 0
™-ab = (3.66)
0 m2
which means that <$\ and </?2 have the same mass m as already seen from (3.64).
(2) Another one is the case where the vacuum is not unique and called "Nambu-
Goldstone phase". This case is realized, for example, for continuously or infinitely
degenerated vacuum states with <p0i ^ 0 and/or ip02 ^ 0, as shown in Fig. 3.2(b),
for the potential with m 2 = -fi2 (fi2 > 0) and A > 0,
¥&+¥&=«2 = ^ or (0V)o = W 2
= y = ^- (3-69)
In other words, all points on a circle with radius v = yfJJP/X in the (</>i,y2) plane
correspond to the minimum of V, that is, the vacuum state is no longer unique but
is 0(2) symmetric. One can choose any point as the physical vacuum. From (3.67),
we obtain
m = (3 71)
«{ 0 oj- -
Therefore, we find that <p[ = (fi — v corresponds to a massive particle with mass
m 2 = 2Au2, while ip'2 = tpz is massless. ip2 is called a "Goldstone boson". Actually
using these new fields, we can rewrite the Lagrangian (3.61) as
In literature, another and more useful parametrization of the field is often used.
Introducing the two real fields p(x) and 0(x), we can write the complex field fax)
in (3.60) as
fax) = ^ p ( i ) e t f ( * ) / « , (3.73)
dli4>=^=e»<"(dllp+X-pdlt0), (3.74)
£=\{9,pf+2^w)2 - vtfy ( 3 - 75 )
To find the mass of the particle corresponding to the excitation of the would-be
radial field p{x), we expand it as p(x) = v + r)(x) and obtain
with
It is noted that there is no quadratic term of 0 in this Lagrangian. Prom this La-
grangian, one can easily find that we have a massive 77 field with mass m,, = \J2p?
and a massless Goldstone boson 0.
2 complex fields, fa = {ip\ + i^ 2 )/%/2 and fa = (</>3 + itpi)/y/2. Then, the global
SU(2) invariant Lagrangian is given as
£ = d^d^-Vitffa, (3.78)
V = -pVtf + W W . (3-79)
with p? > 0 as before. Now, by using 4 real fields H(x) and £*(a;) (i = 1,2,3), we
write 0 as
* = 7f"'™/2'(»+V))' (3 80)
'
Spontaneous Symmetry Breaking and Goldstone Bosons 49
1 0
dM<£ = —eiT't'/2v (3.81)
v2 aM# + ;TW v+H
{(^)^T^(JE)}-V^^- <3'82>
which leads to
using the relation rV-' = 6** +ieijkTk. Prom this Lagrangian, one can see that the 3
fields £* (i = 1,2,3) have no mass terms. They are massless Goldstone bosons, while
H is massive with mass ma = v 2 / ? - The original SU(2) symmetry disappeared.
It is remarkable to note that the number of Goldstone bosons is equal to the number
of generators breaking the symmetry of the vacuum state. In an example of SU(2)
model, the number of Goldstone bosons are three, £l (i = 1,2,3), corresponding to
the 3 generators, r* {i = 1,2,3).
This is generalized for cases with larger symmetries. Now, let us consider an
SU(n) symmetric world with n component fields,
f2
(3.84)
\ fn j
6(f) = - z ^ T V , (3.86)
where the summation over i is implied. Tl are the matrices with nxn components.
Suppose that the vacuum is not unique but is symmetrically degenerate. Then,
50 SYMMETRIES AND THE GAUGE THEORIES
¥>2
<i>o = (3.87)
with
82V \
and mlab
h = (3.89)
\8<P*)o ~ \dtpadw)0
The potential should be invariant under SU(n), i.e. SV = ^2a J^-<fy>a = 0 which,
by using (3.86), results in
dV
T 6 = 0 for
E^r > ' * = l,2,---,iV. (3.90)
d<Pa
a,b
d2v dV
E dipcdifa 7 > 6 + E d>fia
a.b
T l
—0 (3.91)
In the previous section, we discussed the spontaneous symmetry breaking under the
global symmetry, i.e. the Goldstone theorem. Here we go further by extending the
global symmetry to the local gauge symmetry.
First, let us consider an example of the scalar electrodynamics for a scalar com-
plex field 4> = ipi + i<f2, which is local U(\) invariant. The Lagrangian is given
by
with /i 2 > 0, where FM„ = d^A,, — <9„AM is the field-strength tensor for photon
fields AM and D^cf) is the covariant derivative D^cj) = (9M — ieA^cp as before. The
Lagrangian (3.93) is invariant under the following f/(l) gauge transformation,
where r](x) and 6(x) are real fields. We can define a new set of fields by taking a
particular gauge transformation with a(x) = 6(x)/v, which is called the "unitary
gauge",
and
Here we can easily show Fflv(B) — Fllv{A) by substituting (3.99) into (3.101), that
is, the field-strength tensor FMI/ is gauge invariant as it should be. Substituting
(3.98) ~ (3.101) into (3.93), one can rewrite the Lagrangian as follows;
We can see that this Lagrangian describes a massive vector boson B with mass
TUB = ev and a massive scalar 77 with mass mn = ^2fi2. 77 is called "Higgs boson".
Here we have no Goldstone boson which has gone out of the Lagrangian (3.102).
In summary, by extending the symmetry of the Lagrangian from the global to
local one, we found that the massless Goldstone boson 6 disappeared and a massive
gauge vector boson B and a massive scalar boson 77 called Higgs boson came out.
This is called the Higgs mechanism. (Higgs, 1964, 1966; Englert and Brout, 1964;
Guralnik, Hagen and Kibble, 1965; Kibble 1967) The Goldstone boson 6 was eaten
up by the gauge boson B and became the longitudinal component of it. It is re-
markable to note that in the Higgs mechanism the degree of freedom is conserved,
that is to say, starting from 2 real scalar fields ( y i , ^ ) or (r],9) plus 2 polarization
states of massless photons A^, we finally got one real massive scalar field 77 and one
massive vector boson B^ with 3 polarization degrees of freedom
Let us next consider the non-Abelian SU(2) model by generalizing the C/(l)
x
model. We have again the complex doublet field (f> = I I. Then, the gauge
invariant Lagrangian with SU(2) symmetry is given by
with
Then, by taking the unitary gauge as before, we define the new fields as
with
U(x) = e-iTi(t^/2v, (3.110)
where the summation over i is implied. This transformation leads to
D (3 nl)
»+ - v>»+y=^ -"T^TS (t,+°H(X) ) - -
Flv{A)Fi,lv{A) -> F^(B)F^(B) = F^(A)Fi"l/(A), (3.112)
with
F}V(B) = dMB* - duBi + geijkBlBt. (3.113)
Then the Lagrangian becomes
£ = ^Hd^H-ti2H2-\F^(B)FiliV(B) + ^-BifiBi'i
v
9 D i ltl
n i u T r m . . , TT\ \..TT3 3
+ ?-BlB H(2v + H)~ XvH - ^-H*
ui
H - —. (3.116)
4 ~1
A triplet of massive vector fields B^ (i = 1,2,3) with mass TUB — \gv and a single
massive scalar, i.e. Higgs boson H with mass mjj = y/2fj,2 appeared. We found here
again that the Goldstone bosons f' (i = 1,2,3) were eaten by the gauge bosons Bl
(i = 1,2,3) to make their longitudinal components. This is the Higgs mechanism in
the non-Abelian 517(2) gauge theory. It is again interesting to note that the number
of degrees of freedom is conserved in the Higgs mechanism, i.e. the 3 Goldstone
bosons £* (i = 1,2,3) become the longitudinal components of the respective gauge
fields which lead to appearance of 3 massive vector bosons Bl (i = 1,2,3). The
54 SYMMETRIES AND THE GAUGE THEORIES
discovery of the spontaneous symmetry breaking (SSB) and the Higgs mechanism in
the non-Abelian gauge theories made a great breakthrough toward the unification
of electromagnetic and weak interactions, which will be discussed in detail in the
next Chapter.
Problems
3.1 Show that when the generators Tl satisfy the Lie algebra (3.17),
[Ti,T']=ifiJkTk,
the "charge" Ql also satisfy the same Lie algebra (3.16),
[Qi,Qi] = ifiJkQk,
where Ql is defined by (3.13).
3.2 Using the canonical commutation relations of (3.4) and (3.5), prove the relation
(3.18).
3.3 In the SSB of the global 0(2) symmetric theory described in 3.3, discuss what
happens if we take the vacuum to be (y>oi = 7k, <A)2 = 775) instead of (<p0i = v,
(p02 = 0), and show that both cases are physically equivalent.
4> =
\ <Pn I
After the spontaneous breaking of this 0(n) symmetry by taking m 2 — —fx2 (fi2 >
0), show that one can make the vacuum to be invariant under 0(n — 1) transfor-
mation, that is, 0(n) breaks into 0(n — 1). Furthermore, show that the number of
Goldstone bosons is equal to the number of broken generators of the original 0(n)
group.
Chapter 4
T H E STANDARD MODEL OF
ELECTROWEAK INTERACTIONS
Let us start with the discussion on the one family of leptons, an electron e and
its neutrino ve. ve is considered to be massless in this model, while e is massive
with a small mass, me ~ 0.5MeV. Both of them have spin 1/2. In this section, we
assign these particles to the appropriate representations of SU(2)L X U(1)Y gauge
symmetry, the gauge symmetry of the GWS model.
As we have seen in chapter 2, in the weak processes such as
only left-handed leptons and right-handed anti-leptons take part in and the decay
55
56 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
= ^(^(i-ls^o;), (4.4)
Jn(x)* = J»(x)~ = eL(a;)7MJ/eL
= 2e(x)'yll(l-j5)ve(x). (4.5)
This suggests that BL and veL could be arranged to simply make a doublet associated
with SU(2) group. Let us introduce a lepton doublet composed of the left-handed
components of fermions,
Then, by using this doublet and 2 x 2 matrices in the so-called weak isospin space
_ r i - H r W o n
(4.7)
2 ~ ^ o o y'
Tl-iT2 ( 0 0 ^
T
(4.8)
- 2 " V 1 0 ) '
one can rewrite the charged currents (4.4) and (4.5) as follows;
J+ = L^T+L, (4.9)
J
H = LJ^T-L. (4.10)
(Note that in this Chapter T1 are used to denote Pauli matrices, in order to make
clear that they stand for the weak isospin, not the ordinary spin.) These forms
suggest that the weak currents make an SU(2) group by introducing an additional
neutral current,
Then, we have 2 charged and 1 neutral currents, Jjf and J3, which couple to
the weak bosons W^ and A3^, respectively, just as the electromagnetic current
J^m(x) couples to photon Afl(x), as we will see later. Existence of J ± and J3
suggests that we have weak isotopic triplet currents of SU(2), belonging to the
adjoint representation,
r = fr0{x)d3x, (4.13)
with e'jk being the totally antisymmetric Levi-Civita tensor with e 123 = 1. In
(4.12) the subscript L of SU(2)L means that the weak isospin currents are composed
of only the left-handed Weyl fermions. In all of these currents, the right-handed
component of electron eR does not appear. It has no interactions with any other
particles and thus it should be a singlet under SU(2)L transformation, i.e.
j;mW=HW. (4-17)
where Q is the charge for the fermion ip. For the case of an electron, the eigenvalue
of Q is Q = —1. From (4.17), we can define the generator of U(l)em for an electron,
58 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
"e,*Vr 0 (*,+*) -1
ez,,A*L, TL -l (I _ I ) -1
V2' 2'
efi,Mfl> T
R -l 0 -2
Table 4.1 Quantum number assignment of lepton families in the GWS model.
(The reader should not be confused by this notation, though the same Q as in (4.17)
is used here.) However, since this generator Q does not commute with Tl denned
by (4.13), [/(l)em and SU(2)i cannot be simultaneous symmetries of the model.
Then, we look for a new U(l) symmetry, called £/(l)y, so that Q is given by a
linear combination of the generator T3 of the SU(2)L group and the generator of
this new U(1)Y group. In doing so, it is interesting to note the relation
which shows that each element in an SU(2)L doublet (ve e)lL has the same eigenvalue
— | and the eigenvalue of en is — 1. Furthermore, Q — T3 commutes with Tl{i =
1,2,3), i.e. [Q -T3, T'] = 0, that is, Q — T3 and Tl can be simultaneous symmetries
of the model. Thus, it is reasonable to define this new generator of the U(1)Y group
as Q - T3 or
Q = T3 + ^-. (4.20)
The eigenvalue of Y is called weak hypercharge; the name of this quantum number
is originated from the fact that (4.20) is of the same form as Nakano-Nishijima-
Gell-Mann relation (Nakano and Nishijima 1953; Gell-Mann 1953), established in
1950s in successfully describing the hadron classification, in which Y was called
hypercharge. These quantum numbers for an electron e and its neutrino ve are
summarized in Table 4.1. (In this table, the quantum numbers of muon and tau
families are also given,)
In summary, the GWS model is the SU(2)L X U(1)Y gauge theory and we have,
as matter fields, a left-handed doublet L and a right-handed singlet R of SU(2)
group which are represented as
L = R = eR. (4.21)
SU(2)L x U(l)y Invariant Lagrangian 59
The model starts with the Lagrangian constructed with L and R, which is invariant
under the direct product of SU(2)L and U(1)Y groups:
where a1 (i = 1,2,3) and j3 are group parameters for weak isospin and weak hyper-
charge operators, respectively. Since we are considering the local gauge invariant
Lagrangian, a1 and /? are z-dependent.
CF = Li^id^-igyA^+'-g'B^L
+ Ri^id^ + ig'B^R, (4.23)
where Al^(i = 1,2,3) and B^ are gauge boson fields associated with SU(2)L and
U(1)Y, respectively, g and g' are the gauge coupling constants corresponding to
SU(2)L and I/(l)y, respectively. Here the explicit forms of the covariant derivatives
for L and R come out from the general form
with
K» = d^Ai-dvAl+geijkAiAt, (4.26)
£„„ = dy.Bv-dvB„, (4.27)
where F ^ ( i = 1,2,3) and B^ are field strength tensors of gauge fields correspond-
ing to SU{2)L and U(l)y, respectively. As is well known, the mass terms of these
gauge bosons do not appear because of the local gauge invariance.
60 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
In order to make fermions and gauge bosons massive and make the real world,
we need spontaneous breakdown of gauge invariance, i.e. the Higgs mechanism,
described in the previous chapter. Since we are living in the U(l)em symmetric
world with massless photon, we need to have the following breaking,
To realize this symmetry breaking, we introduce the scalar fields, called Higgs
bosons, which give rise to the Higgs mechanism. Since we start with 4 gauge
bosons(3 associated with SU(2)L and 1 with U(l)y) and finally want to have 1
massless photon associated with U(l)em, we need scalars with at least 4 degrees of
freedom. The simplest example of such scalars, which is called the minimal model,
is an SU(2) doublet of 2 complex scalar fields whose weak hypercharge is Y$ = + 1 ,
<A = ( £o ) . (4-29)
where (p+ and tp° are positively charged and neutral complex scalar fields, respec-
tively. The Lagrangian for these scalars is given by
£s = (D^(D'i<f>)-V(^cl>), (4-30)
with
D^ = (0„ - igT- • 1 M - %
-g'B^)<j>, (4.31)
where the explicit form of the covariant derivative is due to Y$ = + 1 . The potential
term V((j>^<j>) being gauge invariant is given by
where m 2 and A are real constant parameters. A should be positive to ensure the
stable vacuum. Further higher power terms of </>V are not allowed in order for the
theory to be renormarizable.
We can also add the coupling terms between fermions and scalars, called Yukawa
interaction terms, which are SU(2)L X U(1)Y gauge invariant and are to provide
the electron mass after the spontaneous symmetry breakdown,
where Ge is called Yukawa coupling constant and cannot be determined within the
GWS model itself. One can easily check the SU(2)L X U{1)Y invariance of Cy using
the values of hypercharge of L, R and <>/ defined above.
A full set of SU{2)L X U(1)Y gauge invariant Lagrangian of the GWS model is,
thus, given by the sum of pieces presented above,
£ = £F + £G + £S + £Y. (4.34)
Spontaneous Breaking of SU(2)i x t/(l)y- Symmetry 61
Strictly speaking, the gauge fixing term with gauge parameter £ (of "i?f gauge")
and the associated term for Faddeev-Popov ghosts also should be included, in the
quantization procedure of the theory. (See, for example, the books by Peskin and
Schroeder (Peskin and Schroeder, 1995) or Pokorski (Pokorski, 2000).)
As shown in Fig. 3.2(b), the potential V($((>) of (4.32) with positive A and negative
m 2 = —/i2(/x2 > 0) has a minimum at the value of <j> determined by
2 I 2~
f+ \ _ „W-«72v ( °
Here the original 2 complex scalar fields <p+ and <p° in (4.29) are replaced by 4 real
fields, &(i = 1,2,3) and H, where & are so-called Goldstone bosons being absorbed
into the longitudinal components of W± and Z° bosons as described later and H is
a Higgs boson. (4.36) leads to zero vacuum expectation values for all of these fields
Here we can rewrite the Lagrangian in the "unitary gauge", where 3 Goldstone
bosons & disappear by being 'eaten up' by gauge bosons, W± and Z°, and thus,
physical particle spectra and their interactions become apparent. By applying the
unitary SU(2) transformation
[/(£) = e~ir-(/2v^ ( 4 4 2 )
one can come to the real world induced in the unitary gauge. Then, we can define
the new fields in our real world as
A; = UiOA^UiO'1 - l
-(d„U(0)U\0, (4-45)
R' = R, (4.46)
Bl = B». (4.47)
The Lagrangian is invariant under this transformation and one can rewrite each
piece as
£G = -IF^F'^-^B'^B'^, (4.49)
Ca = (D^YiD^y-V^'W), (4.50)
CY = -Ge(L'(f)'R' + R'^L') + h.c. (4.51)
Now let us discuss the physics described by this Lagrangian realized in the
unitary gauge. First we consider the scalar sector. The scalar fields generate masses
of gauge bosons and those of quarks and leptons via the Higgs mechanism. Cs is
explicitly written as
CS = {DI14>)'{D^)'-V{^4>'), (4.52)
with
(D^)' = (d.-igyA^-'-g'B'^'
The first term of (4.52) contains the mass-squared term for weak gauge bosons
which is originated from the quadratic terms of gauge fields as shown in the follow-
ing,
Then the sum of the 1st and 2nd terms of (4.54) can be written as \g2v2W+W~,i.
±
It means that the charged vector bosons W are massive with the mass
Mw = -gv. (4.56)
by an orthogonal transformation
Z ^ \ _ / cos6w -sm9w \ / A*
(4.59)
Ap J \ sin8w cos#jy 7 V B'„
M
where 9\v is called the weak mixing angle or Weinberg angle. The diagonalization
leads to
<?'
tanfliy = — (4.60)
or
sin^w = , •, cos#w = . (4.61)
Prom (4.58), we see that the neutral Z boson becomes massive with the mass
and another neutral boson AM is massless and hence can be identified with the real
photon. Note that in the GWS model the mass of Z° boson is related to the one of
W ± bosons as
MZ = 2*%-. (4.63)
cos 6 w
One can see that the masses of W*1 and Z° are quite large; in fact by using (4.83),
(4.84) and (4.89) below, we can estimate them to be
2
l i p 1 OQ
1 / 2
Mw = ^ = x(^^) —--^-GeV>37GeV, (4.64)
2 2 \J2GF sinflw siatiw
Mz = -¥%- ~ . !fa GeV > 76GeV. (4.65)
cos Uw sin 20w
The values of Mw and Mz are obtained if sinfljy is determined experimentally.
Actually, the value of sin2 6w is obtained in experiment to be around 0.23, leading
to Mw ^ 80GeV and Mz ~ 90GeV.
The potential term (4.32) of scalars becomes, after symmetry breaking,
From (4.66), we see that the mass of the physical Higgs boson H can be identified
with
MH = > / V , (4.67)
whose value cannot be predicted, based on some principle, in the GWS model. After
all, in the unitary gauge the Lagrangian Cs results in (up to a constant term)
Let us consider next the Yukawa interaction term Cy in the unitary gauge
= _^e'e'-%Fe'e'. (4.69)
Charged and Neutral Currents, Comparison with Effective Fermi Theory 65
Since only physical degrees of freedom appear in the unitary gauge, we can identify
e' to be physical electron and thus we suppress the prime ' hereafter. Then, the first
term of this equation corresponds to the mass term of an electron with the mass
me = ^ v . (4.70)
V2
It is interesting to note that the electron mass is proportional to the vacuum expec-
tation value v of the Higgs boson as well as in the case of weak gauge boson masses.
The second term shows an interaction term of an electron to the Higgs boson with
the coupling constant
% =^ . (4.71)
JM = L%^L', (4.73)
Jj = -L'jliL'-2R'lliR'. (4.74)
In (4.72), the 1st and 2nd terms result in the conventional kinetic terms of an
electron and its neutrino
where the prime for electron and neutrino fields is dropped in the right hand side.
66 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
Fig. 4.1
Prom (4.72), one can pick up the charged current interaction which is given by
where
J± = Jl±Ul. (4.77)
Therefore, one can see that the Fermi theory of weak interactions is the low energy
effective theory of the GWS theory. Comparing (4.81) with (4.82), we obtain
% = sk- <«3>
By using
GF = 1.16639 x l ( T 5 G e V - 2 (4.84)
and (4.56), (4.83) and (4.84), one can determine the vacuum expectation value of
<f> as
CNC = gW + lg'JlB'x
JY
= (gsm9wJl+g'cos9w-±-)A>i
JY
+ (gcoS9wjZ-glsm9w^-)Z». (4.86)
The second term vanishes from (4.60). The first term can be identified with the
interaction of an electron to photon and hence we can identify the coupling constant
to be electric charge e, i.e.
or
4 = 4 + 4r (4-9°)
e g g
The current coupled to Z^ becomes
L=
(f) ' Rf = '*> Rf' = &> (493)
Jjf is explicitly written, by using (4.11) and (4.17), as
Jl = 4-sin2ewj;m
= L 7 / , y L - sin2 OwiQfftnf + Q'ff'^f)
Similarly as in (4.81), the lowest order amplitude for the low energy neutral
current processes, e.g. ve + e —t ve + e, shown in Fig. 4.2, is given by
M = - i j v . , , 9 , „ JuJZ,i- (4-102)
2!M|cos20w M
where the factor ^ comes from the fact that j£ J^ M is the square of the identical
current. Thus, the corresponding effective Lagrangian for neutral current processes
can be written as
C^ = -^-U^Jz\ (4.103)
Charged and Neutral Currents, Comparison with Effective Fermi Theory 69
Fig. 4.2
defining QL = 8M a fosi e • Hence, we see that the total effective Lagrangian at low
energy can be described by
£e// =
" 7 § ' 4(J^+J~"+ pJZjZ,i)> (4 104)
-
where
n„N
G M Ml.
^
p
GF M | cos2 6W 1. (4.105)
It should be noted that p = 1 is the inevitable consequence of the fact that the
Higgs scalars belong to the doublet representation of SU(2)L- It is no longer the
case for Higgs scalars with higher dimensional representations of SU(2)L, such as
triplet. To see this explicitly, let us consider the case with higher representation of
Higgs multiplet. Then, we can only replace the Higgs doublet <j> in Cs of (4.52) by
a new Higgs scalars <j>'h(x) of higher multiplet. The gauge boson mass term can be
written as well as in (4.54)
where Th and Y/, are appropriate generators corresponding to SU(2)L and U(l)y,
respectively, and Vh is the vacuum expectation value of this Higgs scalar. \h rep-
resents a similar vector as in (4.43), in which only one nonzero component is 1 and
others are zero. For this multiplet we have again Qh — Th3 + ^-, as before, where
Qh is charge of h and Thz is the 3rd component of 7/,. Now, let us put Qh = 0
for one component of <j)'h{x). For this component, we have ^ = —Thz- Then, the
vacuum expectation value Vh of this field generates the masses of gauge bosons, ex-
cept the photon, in the same way as the case of the doublet Higgs scalars described
before. The resultant mass term is
V
Cmass = ^x\{^2fh-A^fh-A^ + g'\^fB'^ + 2g9'fh-A'^-B'^xh
70 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
r.2,,2 9
= -^iAnAn,T2i+Al2Al2,T22)
9
+ -^f
L
(A*A*»1& + ^ W " - 29-TtsA«B'»
\ 9 9
= ^(A^A^ni + A^A^Ti2) + ^£^-ZlT2h3, (4.107)
where in the second and the third lines, Xh^liXh (* = 1,2,3) has been written
simply as T ^ . Prom this equation, we have
P
(4.108)
Mfcos^ ZhvlTL'
with
where //, is the (magnitude of) weak isospin of h. Therefore, p can take any value,
in principle. When all Higgs multiplets belong to the same representation, as in the
case of the supersymmetric extension of the standard model, i.e. the "Minimal Su-
persymmetric Standard Model" (MSSM), YLiv\ c a n c e l °ut between the numerator
and the denominator of (4.108) and we have
iw+D-na (4110)
HZ
One can easily check p = 1 for Ih = \ and T/j3 = ± | , as in the case of the standard
model or MSSM. In other words, the GWS model with minimal Higgs doublet
necessarily predicts p = 1 at the classical (tree) level, though the p deviates from 1
at the quantum level, as we will see in Chapter 8.
So far we have considered only one lepton family, i.e. electron e and its neutrino ve.
The GWS model can be easily extended for the case with more lepton families. Let
us first consider, for simplicity, the case of 2 lepton families, electron (1st generation)
and muon (2nd generation) families, represented as
CF = CP+CP, (4.112)
Addition of More Leptons 71
where both terms have the same expression as (4.23) by replacing L and R with
those given in (4.111), respectively. The Lagrangian for gauge field and scalar boson
sector remains unchanged. However, as for the Yukawa interactions, new terms are
allowed without contradicting with the requirement of SU{2)i x £/(l)y invariance,
Notice that here we require only the local gauge invariance and do not assume
separately the electron number and muon number conservation to start with. After
the spontaneous symmetry breaking, one can easily obtain the fermion mass term
from Cy just as we did before for an electron family.
iHp.e.
ee '"'efj,
efi
M = (4.115)
\ mMe mMM )
is, in general, complex and not necessarily hermitian and is not diagonalized by use
of one unitary matrix. In general, it can be mathematically shown that an arbitrary
n x n complex matrix M can be diagonalized by a bi-unitary transformation as
where Md is a diagonal matrix and both of U and V are unitary matrices, which
are the unitary matrices to diagonalize the hermitian matrices MM^ and M^M:
W(MM*)U = V t ( M t M ) V = MdM^ Redefining the phase of \x and e fields
appropriately, one can always make the elements of M& be real and positive as
4mo") = -(eLjlL)M( eR
R \+h.c.
6
= -{e'Lp!L)Md( 'f )+h.c. (4.119)
where
L6 = ( Ue ) = ( UeL
)
\ e )h \ cos6Le'L-sm6Ln'L J'
As is easily shown, under the following rotation between the lepton doublets
/ Le \ / L'e \ _ / cos a sina \ / Le \
(4.123)
\ L^ J \ L'p ) \ -sina cos a J \ L„ J '
we have
n • a T , a T ( - s i n 0 L i / e i + cos0z,i/ML \ . .
LJ, = -sm8LLe + cos0z,LM = I , )' (4-126)
Addition of More Leptons 73
Here ve and v^ are degenerate in their masses, as they are both massless, and
therefore we can redefine the observed neutrino fields as
and so we end up with the weak doublets, which are still diagonal in the base of
mass eigenstates. The mass eigenstates are now written as
D (4129)
-{"i)L' ">~{$)L-
and there is no flavor mixing between electron and muon in the weak charged
current interactions. Thus, if we define individual lepton numbers as ne (electron
number) = + 1 for e'~, v'e; ne = - 1 for e'+, v'e and nM(muon number) = + 1 for fj,'~,
v
'\i'i nn = _ 1 f° r M' + J v'tn respectively, then it is naturally derived that both of the
electron number ne and the muon number nM are conserved separately. The GWS
model automatically leads to the conclusion that the decay mode ^ -¥ e^ + 7 is
strictly forbidden. It is remarkable to note that the flavor conservation was not put
in by hand but originated from the massless nature of the neutrinos in the GWS
model. In other words, if neutrinos are massive, one can no longer redefine the
neutrino fields as in (4.127)-(4.128) and the weak eigenstates of neutrinos are given
by a linear combination of the mass eigenstates v\ and 1/2, which leads to neutrino
oscillations, as we will see in Chapter 6. (In the quark sector which will be discussed
later in section 4.6, the mixing angle cannot be rotated away because quarks are
massive with different masses.)
Let us next discuss the Higgs boson couplings to electrons and muons. Prom
(4.113), one can extract them in the unitary gauge as
m m
CHU = -~(eL H) ( - <» ) ( * « ) + h.c. (4.130)
When we go to the mass eigenstate by using (4.120) and (4.121), CHU turns into
e
CHU = -~{e'L ~A)Md ( 'f ) +h.c.
This shows that (i) Higgs couplings do not mix the lepton flavors and (ii)they are
proportional to the masses of leptons, which couple to the Higgs.
In conclusion, there is no lepton flavor number violation in the standard GWS
model; the electron number ne and muon number n^ are strictly conserved. Exten-
sion of the model to tau lepton family (uT, r) is straightforward and the tau number
74 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
( : - ) , - ( ; - ) , - ( ; - ) , - <«->
In the standard GWS model, ne, n^, and nT are separately conserved. Therefore,
the processes such as fi~ -> e~7, fi~ -> e+e~e~ and \i~ + Z —> e _ + Z, etc. are
forbidden at the all orders of perturbation in the GWS model. Therefore, if some
of these processes are discovered, it will clearly signal the presence of some physics
beyond the standard model, such as MSSM.
b
(3rd generation) ( . ) , *fl> R,
The GWS model can be easily extended so that it can incorporate these quark
families, though there are several differences between quarks and leptons; (i) quarks
have three color degrees of freedom, whereas leptons are colorless. However, since
the electroweak interactions are color-blind, one can simply suppress the color index
of quarks in the GWS model, unless otherwise mentioned, (ii) all quarks are massive,
while neutrinos are considered to be massless in the GWS model. Therefore, we
have 2 right-handed singlets such as UR and <1R for each generation, compared to
the case of lepton families where we have only one right-handed singlet such as
eR. Thus, the Lagrangian must contain an additional term for the up-type right-
handed singlets like UR, in order to form their mass term, (iii) charge of quarks are
different from that of leptons and thus, keeping the relation, Q — T3 + y , weak-
hypercharges of quarks are also different from those of leptons. The charge Q, weak
iso-spin (T,T3) and weak-hypercharge Y of these quarks are summarized in Table
4.2.
To write the Lagrangian for quarks in a compact form, we introduce the following
notation
UL,CL,tL T
3 (§,+§) +1
1
d,L,SL,bL 3 V2' 11 +3-
UR,CR,tR ^3 0 +1
dR,sR,bR 1 2
3
0 3
CF = Y^QLih,1{d^-ig1--All-l-glBlt)QLi
3
+ ^tW^-i^Egt/m
3
+ ^2DBii'f(dlt + '\<"
Ti-g'BJDm. (4.135)
i=l
The terms for gauge fields CQ and scalar fields Cs remain unchanged from the case
of leptons. As for the Yukawa coupling, we can write the following SU{2)L X U{1)Y
invariant terms
where 1^ • and T\j are Yukawa couplings which are, in general, unconstrained
complex parameters. 0 whose weak-hypercharge is —1, i.e. Yi = —1, is defined
from 0 as
0*
4> = {TI4>* = (4.137)
-<p-
and we can see that the 1st and 2nd terms of (4.136) yield the mass matrices of the
down-type and up-type quarks, respectively.
Here, all fields in the right-hand side are ones redefined in the unitary gauge
with
M{D) _ „(£>) _^_ M(U) _ r(U) V_ ,. 14o^
where UD,U and VD,U are unitary, and M^D,U^ are diagonal matrices. Thus we have
moved to the base of mass eigenstates:
/ DLI
\ / D'L1 \
DL2 D'L2
f
DL D'L = (4.150)
V I \ /
etc. and the fields with and without prime denote the mass eigenstates and the
weak eigenstates, respectively. Then, we have
<»> = m i V , (4.152)
M.(u)
ia
= m) 'e ' , (4.153)
we can absorb all these phase factors into new quark fields induced by a chiral U(l)
transformation, which keeps physics unchanged.
(D)
D'Li -»• e" /2D',Li' (4.154)
,-la^/2D, (4.155)
D'Ri
U'Li e*°r/'U'u, (4.156)
U'Ri ->*-*>#" fiV'n. (4.157)
In (4.159), the matrix element rrn represents the quark mass of the i-th flavor.
In the above discussion on the GWS model, we have implicitly assumed that
we have only electroweak interaction as the gauge interaction. However, if we take
strong interactions of quarks into account, £( m «) must have another piece induced
by the QCD anomaly of the chiral U{1) current; this is related to the problem called
"strong CP problem". (See, for example, the review by Peccei (Peccei, 1989).)
where the sum of i is taken for all generations. For a while, we let the number of
generations, which we denote by n, be arbitrary, for generality. In terms of mass
eigenstates defined in (4.146)-(4.149), we can write this CQQ as
where D'L etc. are column vectors defined in (4.150), though now they are n-
component vectors. Then, defining a flavor mixing matrix V, which is unitary,
V = U\jUD, (4.163)
we can write
+
+ U'Lnr(VniD'L1 + Vn2D'L2 + --- + VnnD'Ln), (4.165)
the n components of the 1st column (Vn, V21, • • •, V n i)' of the mixing matrix V can
be made real by writing Vn = \Vn\eiai and redefining the up-type quark fields as
U'Li -> e"*< U'Li (i = 1,2, • • •,n). Similarly, we can make the (n — 1) components
of the 1st row(Vi2, V13, • • •, Vin) real in the redefinition of D'Li -¥ e~*a< D'Li (i =
2,3, •••,n). (Note that Vn = |Vn| has already been made real by the redefini-
tion of U'L1.) After this procedure, (In — 1) phases are removed from V without
changing physical consequences. After all, remaining parameters are characterized
by n(n - l ) / 2 angles and n(n + l ) / 2 — (2n - 1) = (n - l)(n - 2)/2 phases.
Here the quark mixing matrix V can be parametrized by only one flavor mixing
angle 6C, called Cabibbo angle. There is no phase (("~ 1 H"~ 2 ) = 0 for n = 2)
and hence we have no CP violation in this case because of the absence of complex
coupling constants. The charged weak current interaction can be written as
d
Ccc = ^=(« c)r^~^V ( s ) W+ + h.c. (4.166)
with
Here and hereafter, the mass eigenstates of quarks are simply denoted by u, d, etc.,
suppressing the prime '.
This means that the left-handed quark doublets are written as
o(1)-
(:),• ° H : ) , -
L <"«>
where
dc \ _ ( cos6c sin# c \ ( d
(4.169)
sc ) L V -sin^c cos^ c ) \ s , L
and the weak charged current processes are due to the transition u «-» dc or c •(-> sc.
• GIM mechanism
In 1960s, only one quark doublet was known from the analysis of the ft decay of
neutron and A which occur via weak charged current transition u «-» dc between
80 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
Q{
? = (l)L = {dcosecls^ec)L < 4 - 17 °)
At that time, the orthogonal combination of CIL and SL, i.e. sc = scos6c — dsm9c,
was left as a SU(2)L singlet. Then, when we apply this to neutral currents, we have
where the final term corresponds to a strangeness changing neutral current. (We can
check that the part of the neutral current, proportional to the electric charge, does
not contain such strangeness changing neutral current.) If this term really exists, we
necessarily predict a rather large decay width of the flavor changing neutral current
transition such as K^ -> fi+fj,~ or K+ -> n+uu, since flavor changing sdZ and dsZ
vertices appear already at the classical (tree) level. But this prediction is completely
in disagreement with experimental results, Br(K^ —• fi+fJ.~) = (7.25 ± 0.16) x
lfT 9 , or Br(K+ -> ir+Dv) = (1.6+Jj) x 1 0 - 1 0 . (See Chapter 9 for some detailed
discussions on these flavor (strangeness) changing neutral current processes.)
In 1970, Glashow, Iliopuolos and Maiani(GIM) (Glashow, Iliopuolos and Maiani,
1970) proposed a new mechanism to solve this problem by introducing the 2nd quark
doublet which contains the 4-th quark, which is now called the charm quark c,
Q = (4 1?2)
™ ( sc ) L = ( S cos0 c - dsm9c ) L ' -
Q(L}7MT^2) =
2^Ll,iCL ~ C ° s 2 e<:^SL + sin2 e
cdu^dL
+ cosdcSmOcidLjfiSL+ sLj^dL)). (4.173)
Summing up (4.171) and (4.173), we find that the neutral current finally becomes
flavor diagonal;
3 3 1
Q(L1»\Q(L+Q(LIJ-^Q(L = ^{uL^uL+cLlllcL-dLllldL-sLlliSL), (4.174)
and we have no flavor changing neutral current at all at the tree level.
Then, how the flavor changing neutral current transitions are possible, even
though it is extremely suppressed? GIM proposed a wonderful idea that it will occur
at the quantum level via, for example, the higher order charged current interaction
shown in Fig. 4.3 (for the K\ -> /u,+ fi~ decay). The contribution of the diagram
Extension to Quarks 81
sin0 c
-* f\AAAA| •-
W+
K° u ..
W~
^ W W V •
COS#c
(a)
COS0C
Y W V W
W+
K° c ,i
w-
— sin 6C
(b)
Fig. 4.3
2 2
(a) to the decay amplitude behave, ignoring the powers of small jjX- a n d
j$f", a s
where a = e2/47r is the fine structure constant. But this contribution, though
suppressed by the factor a, is still too large to account for the decay width of
the "rare process". However, the contribution of the 2nd doublet introduced by
GIM, as shown in the diagram (b), yields a contribution, — g s ' n A / f 2 C Q S c > which is
just opposite to that of (a). Thus the amplitude exactly vanishes at the leading
m2
order of the perturbation in the powers of -rrf-
M
(q = u,c), and therefore the non-
w
m2
vanishing contribution stems from the 1st order of TI§-,
m yielding a suppression factor
w
mc—jn 4
3 x 10~ . This mechanism of suppressing flavor (strangeness) changing
ATST
neutral current is called G I M mechanism. Hence, t h e amplitude behaves as
i/T,n 4- - \ g 4 s i n # c c o s 0 c m2 — m2 _, . . „, , •>-.,.
A(K°L-+ii+v )<x^ -| '-• \ 2 " «G2Fsm6ccosec(m2c-ml). (4. 176)
Mlw MW
Therefore, if mc = mu, K% -> fi+n~ decay is strictly forbidden. Gaillard and Lee
(Gaillard and Lee, 1974) predicted the mass of the introduced c quark as the value
to reproduce, via the above formula, the experimental value of Br(K^ —> fx+fi~),
before the discovery of the c quark at the collider experiments. The predicted
value was quite consistent with the value (mc « 1.5GeV) determined from the
spectroscopy of charmonium. More detailed discussion of these rare processes in the
82 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
There are several pamaretrizations of the CKM matrix V, which are physically
equivalent. Using 3 flavor (generation) mixing angles, 6\, 62, 03, and a CP violating
phase, 5, one example is given a la Kobayashi and Maskawa by
V = R1(e2)R3(61)C(0,0,6)R1(e3), (4.178)
C(0,0,S)= 0 1 0 . (4.180)
V is explicitly written as
S1C3 S1S3
V = I -sic2 C1C2C3 - s2s3ei5 C1C2S3 + s2c3eiS | , (4.181)
-C1S2C3 - c2s3etS -C1S2S3 + c2c3e%s
where the values of the 3 angles and the phase are not predictable within the
framework of GWS model; they must be extracted from experiments. A phase 8
Extension to Quarks 83
cannot be removed by the redefinition of quark fields and leads to the CP violation
of the processes including S. It is noted that the phase S disappears if any of #i,
62, 93 vanishes; If 63 = 0, Ri(93) = 1. Then, the phase S can be absorbed into
which leads to
V = 0 a b , (4.182)
( -S12C23 -
C12C13
Ci2S23Sl3etl513 C12C23 -
S12C13
Sl2S23Sl3elc513
si3e~iSl3
S23C13
\
, (4.183)
S12S23 - Ci2C23Si3e'Sla -C12S23 - Sl2C23Sl3e"513 C23C13 /
where Cij — cos 6ij and Sij = sin8ij (i,j = 1,2,3).
A convenient way to approximate V has been proposed by Wolfenstein (Wolfen-
stein, 1983); it is a parameterization by expanding each element of V in the powers
of the sine of Cabibbo angle, A = S12 — 0.22. In the approximation up to the order
of A3, it is written as
/ l-A2/2 A AX3(p-iri) \
V = -A 1 - A 2 /2 AX2 , (4.184)
\A\3(l-p-irj) -AX2 1 /
where A, p and rj are real parameters of the order one, and r] corresponds to the
CP violating phase. This is often used in phenomenological analysis because this
parameterization shows clearly the hierarchical structure of the CKM matrix.
where
Je™ = lulltU-±Dj„D,
These currents remain flavor diagonal, even in the base of mass eigenstates, because
of the unitarity of Uu, KD, VU and VD- For example, one can easily see
= U'^U'L + U'R^U'R,
= U'-ypU'. (4.187)
Therefore, we have no flavor changing neutral current at the tree level. The kinetic
terms of quarks are also unchanged when we move to the mass eigenstates. Thus,
we have no CP violation solely due to the weak neutral current and electromagnetic
interactions.
What about Higgs boson couplings to quarks? From (4.133), (4.140) and (4.141),
one can extract the Higgs-quark interactions in the unitary gauge as
H -
Cjjqq = (muuu + mddd + msss + rribbb + mccc + mitt), (4.188)
which is also flavor diagonal. Then, the Higgs couplings to quarks in the standard
model with one Higgs doublet do not generate CP violation. It should be noted
that this is not the case in the models with multi-Higgs, such as the models of spon-
taneous CP violation discussed by Lee and also by Weinberg (Lee 1974; Weinberg
1976).
4.7 Anomalies
(VWW^
Fig. 4.4
and the model becomes meaningless at the quantum level (Georgi and Glashow,
1974; Gross and Jackiw, 1972; Bouchiat, Iliopoulos and Meyer, 1972). In general,
anomaly cancellation is not trivial for the models with parity violation, containing
both vector and axial vector currents, in contrast to the case of e.g. QED where
only vector current exists. Thus, it is very interesting to examine if the anomaly
cancellation works for the GWS model which has both vector and axial vector
currents.
Let us consider, in general, the Lagrangian describing an interaction of left-
handed and right-handed fermions with gauge bosons A1^,
where the summation over i is implied. VL and tpR are column vectors composed
of left-handed and right-handed fermions, belonging to some representations of the
gauge group, and T£ and TR are the corresponding generators of the gauge group
G satisfying
for each of TlL and TR, where fak is the structure constant of G. Then, the currents
coupled to gauge bosons A1^ are given by
= ^ 7 M ( 1 - 7 5 ) ^ + ^ ( 1 + 75)7^- (4.191)
Now it is known that the triangle anomaly associated with the triangle diagram
with external gauge bosons A1, Ai, Ak is proportional to
Aijk = Aijk _ Aijk ^ (4192)
where the contributions of the right-handed and left-handed fermions are given by
and do not depend on the fermion masses. The anti-commutator in the above
equation comes from the exchange of A1 and A^ gauge bosons, which are assumed
86 THE STANDARD MODEL OF ELECTROWEAK INTERACTIONS
to couple to the vector currents at the two vertices. Then, the condition for being
anomaly free is
For the vector {TlL = T^) or vectorlike (TlL = [7 -1 Tp£7 with a unitary matrix
U) gauge theories*, the anomaly free condition (4.194) is automatically satisfied.
Typical examples of the vector theories are QED and QCD, both conserving parity
symmetry. Another possibility to satisfy (4.194) is the case
Aijk = Aijk = 0 ( 4 195)
f c
^' =Tr({^,y}y)=0 (4-196)
due to {T1,^} — 26ij. This property essentially comes from the fact that the
representation of SU(2) is "real", and therefore the contributions of a Weyl fermion
ip and (V0C! w ith c standing for the charge conjugation, should be the same, while
their contributions to the anomaly should have opposite signs, as their chiralities
are different.
Then, how about the GWS model? In the GWS model with only one family
of leptons, matter fields are the left-handed doublet ( f e i , e Z)* a n d a right-handed
singlet e^ and the generators for the gauge group are given by
T[ = j , Y ; TR =
¥' (4J97)
=
It is evident that T r M ^ ^ - l V ) 0 a s shown above. Then, for the model to be
anomaly free we must impose
and
(Note that another possible anomaly associated with a diagram with currents ac-
companied by single r% and two weak-hyper charge generators Y simply vanishes,
thanks to the relation Trr 4 = 0 . ) Here for right-handed singlets, Q = ^f- and thus
we have
can be satisfied only for iVc = 3, where 0, —1, § and — | represent the charges
of ue, e, u and d, respectively. (Though the statement that (4.199) is equivalent
to (4.202) was made for the pure leptonic model, it turns out to hold even after
the inclusion of the quarks.) The lepton anomaly can be cancelled by the quark
anomaly only when the color degree of freedom is 3. Therefore, the standard model
with G = SU(3)C x SU(2)L X U(1)Y suggests that leptons and quarks are closely
related to each other. The real unification of quarks and leptons in the multiplets
of gauge group is realized in the "Grand Unified Theories (GUT)" to unify strong
and electroweak interactions, such as SU(5) or 5O(10) gauge theories.
Problems
4.1 Prove the commutation relations [Q — T3, Tl] = 0 for i = 1,2,3, where Q is the
electric charge operator defined by (4.18) and Tl are weak isospin operators denned
by (4.13).
4.2 Show that the Yukawa interaction defined by (4.33) is invariant under SU(2)L X
U(\)Y gauge transformation.
4.3 Show that in the GWS model, the electric charge operator Q is not a broken
generator, though weak isospin operators T% violate the symmetry of the vacuum
88 THE STA NDA RD MODEL OF ELECTR O WE A K INTERA CTIONS
given by (4.36).
4.6 The lowest order decay amplitude of W -> li>i (I = e, /z, r) is given by
M = ^ = f i ( p ) 7 , . ( l - 7sM9)e"(*, A),
where e'x(fc,A) is the polarization vector of the W boson with helicity A. Then,
neglecting the lepton masses, calculate the decay width for this decay which is
given as
£Z y rZ rrfj.
NC ~ a—Ju " i
COS0ty ^
where jjf is given by (4.97).
(l)Neglecting the fermion mass, calculate the decay width of Z° ->• / / in the tree
level, which is given as
v{z
° ~* ff) =2^0K)2 + {CIA)2) '
where C ( , C{ are given in (4.98)~(4.101).
(2) Since the top quark is heavier than Z°, Z° cannot decay into a top quark pair.
Then, assuming sin2 6w — 0.22, estimate the branching ratio of Z° -> e + e~.
Chapter 5
Q U A N T U M CHROMODYNAMICS
89
90 QUANTUM CHROMODYNAMICS
There are several evidences for existence of the color degree of freedom, which is
actually in 3 species. The color was first introduced to solve the difficulty in the
relation of spin and statistics in the baryon spectroscopy. In the quark model,
baryons are made of three spin \ quarks. Without color space, the wavefunction
of a baryon is described by the product of space-, spin- and flavor-wavefunctions.
Then, consider the A + + (1232) with spin | . It is the ground state of the uuu
bound system and hence its wavefunction is totally symmeric under interchange
of any u quark pairs in space-, spin- and flavor-wavefunctions. However, since the
A + + (1232) is a fermion with spin | , its total wavefunction must be antisymmetric
under interchange of any u quark pairs. This difficulty can be solved by introducing
an antisymmetric color space wavefunction, where the color space is a new internal
space.
A more direct evidence for color to be in 3 species comes from the experimental
data on e + e~ annihilations at high energies. Based on the quark parton model, the
ratio
is predicted to be R = Y^,q eq, where eq is the electric charge of the quark q in quark
pairs produced in the e+e~ annihilation. Beyond the s quark pair production
threshold but for not so high energies, only u, d and s quarks contribute to this
Evidence for colors 91
Fig. 5.1
R = el + e\ + e\ = - , (without color)
R = 2. (with 3 colors) (5.2)
Similarly, for higher energies beyond the c quark pair production threshold, R be-
comes
and for more higher energies beyond the b quark pair production, it becomes
11
R = el + e2d + e2s+e2c+e2b (without color)
9'
R = —. (with 3 colors) (5.4)
where Nc is the number of colors and fn ~ 130MeV is the pion decay constant.
Then, we can predict the rate to be
Vj<86 e V
-° _->i -vV\
r(7r° 77) - J ' ( W i t n 0 U t C O l ° r (NC ~ 1)) (5.6)
"^ " " I 7.i
.65 eV. (with 3 colors (Nc = 3))
The measured rate is (7.74 ± 0.55) eV which agrees with iVc = 3.
Further example is the branching ratio of the r lepton, Br = r(r-%o») • ^
T~ lepton decays into kinematically allowed lepton pairs, (e~ De), {fi~ D^), and
quark pairs, (d u), (s u), through the diagram shown in Fig. 5.2. Among these
channels, the contribution of (s u) is very small because it is a Cabibbo-suppressed
92 QUANTUM CHROMODYNAMICS
le-\l»-\l d\l s \
\ v* \ n u j
MM
Fig. 5.2
process and hence can be neglected. Since the coupling is same for all these channels,
we can simply predict the above branching ratio as
Br (without color)
3'
Br — -. (with 3 colors) (5.7)
5
The experimental data is Br = (17.84 ±0.06)% which is again in agreement with 3
colors.
The QCD Lagrangian describing the interaction between quarks q and gluons A1^
(t = l,2,---,8) is given by
C= q(ip-m)q--FJtvFi^ (5.8)
where gs is the strong coupling constant and A* are the 3 x 3 Gell-Mann matrices
presented in Appendix (C.4). The summation over i (i — 1,2, • • •, 8) is implied. F^
are the field strength tensor for gluon fields A^ and given by
0MAt-0„4+0./yfc4Aj, (5.10)
where fak are the structure constant of the color SU(3)C group.
QCD Lagrangian and the strength of color forces 93
Here we are interested in the strength of color interactions between two quarks
and also between a quark and an antiquark. Using the explicit expression of A1
presented in (C.4), the interaction between quarks and gluons given in (5.8) is
written as
A'
M
2
where
represent redefined gluon fields. As can been seen here, the gluons come in 8 different
color combinations, RG, GR, ^(RR - GG), RB, BR, GB, BG, ^(RR + GG-
2BB). This means that the gluons belong to a color SU(3)C octet. For example,
G* changes the color of the quark from G(green) to i?(red). Another combination,
i.e. the color singlet -h=(RR + GG + BB), does not mediate color charge.
Now we are concerned with the interactions between two quarks. First let us
consider the interaction between i?(red) and i?(red) which arises from 2 diagrams
due to G^ and G® exchange as shown in Fig. 5.3(a) and (b). Then, the coupling of
this interaction is obtained from (5.11) as
The interaction between B (green) and B (green) arises only from G® exchange as
shown in Fig. 5.3(c) and the coupling becomes
The interaction between i?(red) and G(green) arises from 3 diagrams shown in Fig.
5.3(d), (e) and (f) and the coupling becomes
2
~±)'<*$)•(-*$)<%)'-U$)'-
.N/6%/2 ^
94 QUANTUM CHROMODYNAMICS
„R
££
^fe) W ^
(b)
2
fe))WOT ^K^fe)
(c) (d)
^
^fe)WMW-^fe)
(e) (f)
Fig. 5.3
As seen in the above examples, all color interactions between two quarks (color-
color interactions) are equal as expected from exact color symmetry. In other words,
two quark interactions are blind to the difference of colors.
What about the interactions between a quark and an antiquark (color-anti-color
interactions) in the color-singlet state, -h=(RR + GG + BB), i.e. between q and q
in a meson? Here, since each color is in equal weight in the color-singlet state of a
meson, it is enough to consider, for example, the case of S(blue) and -B(anti-blue)
whose interaction arises from 3 diagrams shown in Fig. 5.4(a), (b) and (c). (The
situation is same for between .R(red) and i?(red) and also between G(green) and
G(green).) Then, the coupling of this interaction becomes
yfcy/2
(5.16)
Here we should notice that the coupling between antiquark and gluon is in the
opposite sign to the one between quark and gluon because of vector nature of gluons,
QCD Lagrangian and the strength of color forces 95
-^ ($) V o W ^ (^)
*S«T
Fig. 5.5
which is essentially same for the QED process, q —• q + 7 except for the coupling
constant and the color matrix factor, a and b denote the color indices. Squaring
the above matrix element, one can extract the color-dependent factor as
lo> 2 2 39°
(5.18)
96 QUANTUM CHROMODYNAMICS
<Z3> ces> 0
Fig. 5.6
that is to say, the observed charge e or the fine-structure constant, a = f^, is not
a constant but depends on the distance r from the bare charge or the momentum
transfer Q2 in scattering by a test charge. In fact, the value of a(Q2) decreases
with decreasing Q2 (or increasing r) and thus the vacuum polarization leads to the
charge screening as shown in Fig. 5.7.
Running coupling constants
<*(Q2)
a(fi2) - ^
large Q small Q2
(small r) (large r)
Fig. 5.7
Fig. 5.8
iM = [ieu(k[)j>iu(k1)]^^[ieu(k'2)Yu(k2)}, (5.19)
where q = fci — k[ is the vertual photon momentum. However, to get an exact am-
plitude for this scattering, we need to calculate higher order corrections containing
all order of a. The first order correction to the above amplitude comes from the
diagram of Fig. 5.9, in which the photon propagator contains an electron loop, and
according to the standard Feynman rule, it is written as
l
-iM = (-l)[ieu(fci)7'Ju(Jfci)] - 9 UP
98 QUANTUM CHROMODYNAMICS
q-p
Fig. 5.9
d 4 pTr ( . e r ) ^ + ^ ( . e 7 A ) i « - ^ + m)
4
TT)
(2TT> y m' (P-Q)2 m*
-ig\v
[ieu(k'2)Yu(k2)}, (5.20)
where p is the loop momentum of the internal fermion(electron) and m denotes the
electron mass. The factor (—1) is originated from the fact that this diagram con-
tains one fermion loop. It should be noticed that the internal momentum p should
be integrated from 0 to oo and leads to the divergence of the amplitude. (Exact
calculation results in only a logarithmic divergence, though one might expect it
to be quadratic because of the apparent form f£°pdp of the integral in (5.20).)
This divergence can be removed in QED by the renormalization technique as dis-
cussed below. Note that though the amplitude (5.20) is of the 0(a2), this one-loop
correction to the photon propagator is of the 0(a).
Now, one might worry about additional contributions of the same 0(a) origi-
nated from other diagrams depicted in Fig. 5.10, where (a) and (b) are the self-
energy terms of electrons and (c) is the vertex correction. However, to our surprise,
QED resuts in exact cancellation of the sum of the contribution of (a) and (b) and
the one of (c). Actually this cancellation occurs in every order of perturbation.
This is the fundamental property of QED and known as the Ward-Takahashi iden-
tity. Because of this cancellation, it suffices to evaluate only the modification to the
photon propagator.
The addition of (5.20) to (5.19) can be considered as modification of the photon
propagator as
-iguv -jg»v
'-IP*
+
Running coupling constants 99
e e e e
7 / \ 7.
(a) (b)
(c)
Fig. 5.10
(5.21)
g2 V ^2
q2
where
V = -ig^q2i{q2) + (5.23)
with
q2(l-x)
Z
(5.24)
37T Jm2 P IT J0 V m"
The dots in (5.23) contain terms proportional to q^ and vanish when the prop-
agator is coupled to external electron currents. Substituting (5.23) in (5.21), we
obtain
-i9tii> -igyv
-¥ (i-/(s 2 ))- (5.25)
Therefore, we find that in the 0(a) the modification of the photon propagator
can be made just by mutiplying a factor (1 — I{q2))- Explicit derivations of these
100 QUANTUM CHROMODYNAMICS
formulas are beyond this textbook and can be found, for example, in the books
by Bjorken and Drell (Bjorken and Drell, 1964) or Jauch and Rohrlich (Jauch and
Rohrlich,1976). Here, we just comment on some important points of results. I(q2) is
infinite as mentioned above. But the infinity arises only from the first term of (5.24)
which is logarithmically divergent and independent of q2. One can estimate this
integral by introducing the cut-off parameter A for the upper limit of the integral
and then by approaching it to oo. Actual calculation will be done for the two
extremes, i.e. (-q2) » m2 and (-q2) <C m2.
Let us first consider the case of (—q2) » m2. In this case, log ( l - q x£i~ J ^
log(^j-) and then, we can evaluate I(q2) as
Thus, for large (—q2), the photon propagator is changed only by mutiplying a factor
(1 — i^F l ° g ( ^ i ) ) ' The multiplicative factor from loop corrections of all orders in a
can be also evaluated as
1
- £log (-^fi)+ ° ( Q 2 ) = 1
—nj\> (5 27)
-
and thus, the full amplitude becomes
a a (5 29)
°" = / » \ - -
«W2) = ^VTTV- ( 5 - 3 °)
l + f?l°g(&)'
Running coupling constants 101
where a(Q2) and ao represent the renormalized and bare charge/coupling, respec-
tively.
Now, suppose that we can get the result, a(fi2) = ^§7, in experiment at some
value of Q2 = fj,2. Then, using the formula (5.30) for a(p,2), one can obtain
a(/z 2 )
a(Q2) = (5.31)
^loe
Notice that in (5.31) there is no dependence on the cut-off A and «o- &{Q2) depends
only on the finite measurable quanitities. Since a(Q2) depends on Q2, a(Q2) is
called the "running coupling constant". (5.31) shows the charge screening, that is
to say, d{Q2) decreases (increases) with decreasing (increasing) Q2. (See Fig. 5.7)
Here we introduced a parameter (i with the dimension of mass to avoid the infinity of
the amplitude by renormalizing the charge, p is called the renormarization mass(or
scale). Different choice of fi corresponds to different renormalization schemes but
final result does not depend on the choice of fi. This is because the dependence of
the amplitude on A is absorbed into a(fJ.2).
Now, let us move to the oppsite extreme of small (—q2). This limit is relevant
in the Coulomb scattering of an electron by a static nucleus terget with charge Ze.
In the limit (—q2) -¥ 0, we see log(l - q2x(l — x)/m2) ~ -q2x(l - x)/m2 and thus,
(5.24) reduces to
/(s2) = f l o g ( 4 ) + T f 4
z l
(5.32)
3ir \m J 157r m
where we again introduce the cut-off parameter A for upper limit of the first term
of the integral (5.24). Note that the divergent term, i.e. the first term of this result
is of the same form as (5.26), just by replacing m2 by — q2.
Then, to order a 2 , the matrix element for the Coulomb scattering of an electron
by a charge Ze is calculated by replacing the factor [ieU(k'2)j'/u(k2)] in (5.20) by
-if with j " = (p = Ze,j = 0)
[A2 qz
= [i e u(Jfei)y u (*i)]-^ •1 a a
-iM 1 g Hi")
"37 ° U ? 157r m2
2 ,22
—% a (A \ a q
ZTT \m2 J 157rm2 (-iZe). (5.33)
= [ieu(fc|[)7ou(A:i)]-2-
Fig. 5.11
(a) (b)
Fig. 5.12
we can easily check that to 0(a2), (5.35) and (5.34) are equal, en is interptered as
the measured or renormalized charge | £ = a = ^ . Here the infinity of the matrix
element coming from the cut-off A —> oo is again absorbed into the observed charge
en by renormalizing the bare charge/coupling.
constant as(Q2) = | £ from the one in QED discussed above. The basic quark-gluon
interaction is given in Fig. 5.11. Concerning the one-loop corrections, there are two
diagrams as shown in Fig. 5.12(a) and (b). (a) shows the correction from a quark-
loop which contributes equally to every quark flavor because of flavor independence,
(b) shows the gluon-loop correction originated from the self-interaction of gluons.
Apart from the color factor, the contribution from the quark-loop diagram (a) is
Running coupling constants 103
essentially the same as the one in QED; it suffices to replace the factor ° ^ ^ by
"ST f° r e a c n flavor by taking into account of the color factor. The crucial difference
between QCD and QED arises from the gluon-loop diagram (b). This diagram give
rise to another numerical factor, -|^Q: s (/x 2 ), whose sign is opposite to the one from
the quark-loop. Combining the contribution from both the quark-loop (a) with n/
flavors and the gluon-loop (b), one can obtain the QCD running coupling constant
as(Q2) just by replacing the factor ^^- in QED by ^ ^ ( f n / - 11) as
(33-2 n / )log^
In summary, for large values of Q2 much larger than A2QCD, the effective cou-
plings between quarks and gluons becomes small and thus, the strong interaction
can be treated perturbatively. In this region, quarks and gluons behave as free
particles ("asymptotic freedom"). In fact, this is the region of the deep inelastic
scattering scattering where the parton picture works well. On the other hand, for
small Q2 region like Q2 w A"QCD, the quark-gluon coupling becomes large and the
perturbative calculation cannot be justified. Because of the large coupling con-
stant, all quarks and gluons are confined in hadrons ("confinement"). AQCD is
the scale which separates the world of confinement scale(hadrons) and asymptotic
freedom (free quarks and gluons). QCD is the color 5(7(3) gauge theory of strong
interactions and is the fundamental field theory describing the dynamics of quarks
104 QUANTUM CHROMODYNAMICS
as(Q2)
a. ~ 0.2
Q2 = lOOGeV2
large Q small <32
(small r) (large r)
Fig. 5.13
Problems
5.1 T~ can decay into pairs of (e~ ue), (fi~ D^), (du), and (s u); among these decay
modes, the contribution of (s u) is very small because it is a Cabibbo-suppressed
process and can be neglected. Neglecting the masses of all decay particles, estimate
the branching ratio, Br = r(^%aii)T to be jr as given in (5.7).
5.2 In the tree level, the color-dependent coupling factor for the process q -> q + g,
where q and g represent a quark and a gluon, respectively, is given by (5.18). Then,
estimate the factor for g —> q + q.
5.3 Consulting with other textbooks, for example, the one by Bjorken and Drell
(Bjorken and Drell, 1964), derive (5.24).
Chapter 6
N E U T R I N O MASSES AND N E U T R I N O
OSCILLATIONS
When P.A.M. Dirac first invented the spinor to describe the electron, it was a 4-
component complex vector. Fermions described by such 4-component spinors are
called Dirac fermions. Though the electron has a finite mass, it may be worthwhile
considering a massless fermion. In this case a new feature arises: a state with h = 1
(h being the helicity of the particle) and a state with h = — 1 never mix with each
other during the propagation of the fermion. This can be understood intuitively by
considering a inertia frame, observing the fermion co-moving toward the direction of
the fermion's momentum. If the helicity h of the fermion is 1 in that frame, in any
frame the helicity should be the same, just because the massless fermion is moving
with the light velocity and any observer never can pass through it to invert the
direction of momentum and therefore the helicity of the fermion. It is well-known
that for massless fermion, the eigenstates of helicity and chiral fermions with definite
eigenvalues of 75, i.e. Weyl fermions ipn and ipL, are identical. We easily know that
each of chiral fermion forms an irreducible representation of Lorentz transformation
SL(2,C), as [75, £""] = 0 (E"" = £[7",7"] are 6 generators of 517(2, C)). In such
a sense, Weyl fermions are more fundamental fermions than Dirac fermions. Let
us recall that fermions to start with in the Standard Model are Weyl fermions,
essentially because the weak interaction maximally breaks parity symmetry.
The statement above becomes more explicit if we use the chiral representation
of 7 matrices, where 75 is a diagonal matrix (see also Appendix A):
0 <r" \
* = Uo» 0 )>
0 ) M
a* = {-I,Oi), (6.2)
a" = (-1,-*), (6.3)
I 0 \
75 = i-f-f-fr =[ 0
n -r / ; . (6-4)
105
106 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
where / is a 2 x 2 unit matrix and <7j (i = 1,2,3) are Pauli matrices. In the
chiral representation a right- and a left-handed Weyl fermions are described by
2-conponent complex spinors rja and | a :
V* = ( ^ ) , (6.5)
For a massive fermion moving with a speed lower than the speed of light, the
helicity may change depending on the velocity of the inertia frame, observing the
fermion: the chirality is no longer preserved for massive fermions. Accordingly the
fermion mass term connects Weyl fermions with different chiralities:
m(ipRipL+'*pL'4>R)- (6-9)
Thus the fermion suitable to describe the propagation of the massive fermion is the
combination of these two Weyl fermions, i.e. a Dirac fermion I[>D, a full 4-component
complex spinor:
The free Lagrangian for the Weyl fermions is now rewritten in terms of a Dirac
spinor tpjj:
where (r)a)* - rj&, rj"1 = e^rjp, e12 = - e 2 1 = 1. Thus chiral partner of a Weyl
spinor can be the charge conjugation of the Weyl fermion itself. Combining these
chiral partners the following full 4-component spinors are possible to be formed:
In other words, for massless fermions, the propagators are just the sum of those for
independent Weyl fermions.
Once fermions get masses, the type of fermion has real physical meaning. Clearly
Weyl fermion is irrelevant in this case, since the mass term connects the chiral
partners. What determines the type of the full 4-component spinor, i.e. Dirac or
Majorana, is the choice of chiral-partners or in other words, the type of fermion
mass term. In the case of ordinary mass term for Dirac fermion (Dirac type mass
term),
the chiral-partners are two independent Weyl fermions £ a and r)a. We, however,
need not to take independent 2-componet spinors £ and rj as chiral partners and
they can be identical. Namely if we take as chiral partners the charge conjugation
of each Weyl fermion the following mass terms are also possible,
These mass terms clearly violate fermion number (lepton number for, e.g., neutri-
nos) and are called Majorana type mass terms, while we can assign a conserved
fermion number for the case of Dirac mass term. If we adopt the Majorana type
mass terms listed above, the fermions suitable to describe the free Lagrangian should
108 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
the free Lagrangian for the neutrino still can be written in terms of two Majorana
neutrinos, as we now confirm. The Dirac mass term m o in Cm causes a mixing
between vR and VL- We thus have to "diagonalize" the whole mass term. The mass
term can be neatly written by use of matrices as
where the property VRVL — {VL)C{VR)C has been used. Thus, the mass term, as the
whole, can be written in the form of Majorana type masse term, even though there
is Dirac mass term as well. This is based on the observation that vi, and {yR)c
are both left-handed Weyl fermions and have no essential difference, though they
have opposite lepton numbers, L = 1 and L = — 1. Thus every mass term should
be generally written in the form of Majorana mass term, -(V'IL) C V'2L + h.c; in the
case tpiL = 1P2L the mass term is "genuine" Majorana mass term, while in the case
of Dirac mass term ipn ^ ip2L- We also find that the 2 x 2 mass matrix above
Neutrino Masses 109
Mv = TTIL mo
(6.26)
mD mR
are given as
where
2m,D
tan2#„ = (6.31)
ma — mi.
Note that the eigenvalues of Mv obtained by an orthogonal transformation are ms
and —ma, not ma. We, however, have changed the sign, — ma -> m 0 , by putting
the extra factor i for the eigenstate ua, so that the eigenvalues ma and ma get
degenerate for the case of pure Dirac mass, mR = TTIL = 0, and both eigenvalues
become positive for the case of "seesaw" scenario. The mass term now has been
diagonalized:
Thus, including the kinetic term, the free Lagrangian for the neutrino is now written
in terms of two Majorana neutrinos obtained by combining vs and va with their
anti-particles as chiral partners, Ns = vs + (us)c and 7Va = va 4- (^o)c,
Actually for the case of scalar fields what matters is the mass-squared matrix. To
make the correspondence clear, let us take
m m mD L +
MM =( y ° , ^ TR) ) , (6-35)
instead of M„ itself. It has been well-known that when the masses of two real scalar
fields are degenerate, m\ = m2, = m 2 , mi 2 = 0, they can be equivalently described
by a complex scalar <f> = (l/-\/2)(<£i + i<fo),
2
£ct>m = - m (f)*(j>. (6.36)
The same thing happens for fermions; for the case of pure Dirac mass, TUL =
TUR = 0, MvMl has degenerate eigenvalues m|> and two Majorana neutrinos are
equivalently described in terms of Dirac particle,
Ns-iNa IT
VD -j=— = vR + vL (ev = -), (6.37)
The factor 1/2 appearing in the free Lagrangian for Majorana neutrinos (6.32) just
corresponds to the same normalization factor in the free Lagrangian for real scalars.
Hence, just as in the case of scalar fields, such normalized Majorana fields Ns and
TV,, should have the same propagator as the one for a Dirac field. For instance,
for the Majorana fermion ipMi in (6-21) the propagator without chirality flip, and
therefore without lepton number violation, is given (in momentum space) as usual,
(B) Pseudo-Dirac
We next consider what happens if we put small Majorana masses, TUR, TUL -C mo-
The neutrinos are still almost Dirac particle and are called pseudo-Dirac neutrinos
(Wolfenstein, 1981). The mixing angle is still almost maximal, i.e. 8U ~ 7r/4.
One remarkable feature is that in this case there appears a slight difference of mass
eigenvalues m1-m2a~ 2mD(mR + mi). Together with the almost maximal mixing,
this mass-squared difference leads to a neutrino oscillation even if there is only 1
generation (Kobayashi and Lim, 2001). In fact a neutrino oscillation VL —> {VR)C
occurs with almost maximal mixing. The probability for the neutrino produced as
112 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
Piu^iu^^^r^^h (6.42)
M
r- - ^ ~1, (6.43)
~ M%cos2ew
no longer holds at the classical (tree) level. On the other hand there is no good
reason to expect that m,R, being SU(2)L invariant, should be small. Rather, it will
be natural to expect that it is much larger than the VEV of the doublet Higgs,
i.e. than the weak scale M\y, since it is quite possible that the TUR actually comes
from the energy scale of some "new physics" beyond the standard model, which
is expected much higher than Mw- The above argument suggests that mi -C
m c <C m,R. Here, for brevity, let us assume that HT is absent and therefore
Neutrino Masses 113
TTIL — O (mo "C TTIR). In this case the neutrino mass matrix reduces to
m D
Mv={ ° ) . (6.44)
\ mD mR )
One immediately knows that one mass eigenvalue of M„, m8 ~ TUR under mo -C
rriR, and another eigenvalue
this means ma ~ — ^ ^ , with opposite sign, but the extra minus sign can be absorbed
by putting a phase i to the eigenstate va, as is seen in (6.30).) These eigenvalues
may be obtained from (6.27) and (6.28), as well. We thus have obtained a naturally
small (<C TUD) neutrino mass ma by making TUR much larger than mp. The relation
ms • ma ~ m2D suggests the terminology seesaw. Namely, when one eigenstate Ns
gets heavy another eigenstate iV0 is "lifted". We know that under mo <C TUR the
mixing angle 6V is small
6V ~ ^ ^ « l (6.46)
mR
Ns ~ VR + {VR)\ Na~i{vL-(vL)c}. (6.47)
As the heavier Majorana neutrino iVs decouples from lower energy processes, only
Na ~ i{uL — {VL)C} will remain and there will be no neutrino oscillation in one
generation scheme. Thus, as in the case of pure Dirac scenario, to have neutrino
oscillation, flavor or generation mixing is inevitable.
One may wonder, in the absence of the triplet Higgs HT, which operator can
provide the small Majorana mass m Q , which is effectively the Majorana mass for
the left-handed neutrino, as iV0 ~ I{VL — {VL)C}- It turns out that though there
is no renormalizable operator in the original Lagrangian relevant for the Majorana
mass, a higher dimensional (d = 5) "irrelevanth effective operator,
plays the role, where <j> — (<p+, <p0)1 is the Higgs doublet and e is 2x 2 anti-symmetric
matrix (e12 = —e21 = 1) and C is the charge conjugation matrix. The coefficient
c is a dimensionless coupling and if we regard c = f^ (fn'- Yukawa coupling in
the Dirac mass term), M = TUR and replace ip° by the VEV v/\/2, we get a left-
f2 v2 m2
handed Majorana mass, ~ ^ — ~ j ^ 2 - , in accordance with the result obtained by
the diagonalization of the mass matrix. In fact, the diagram shown in Fig. 6.1 is
known to provide (a part of) the effective higher-dimensional operator, when the
momentum is ignored compared with mR in the propagator of the internal line.
NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
/
,+
/
/
-4 (
VR
VL VL
Fig. 6.1
When M goes to infinity the effective operator vanishes. Thus the essence of
seesaw may be understood as the decoupling phenomenon of a heavy particle with
mass M, which is right-handed Majorana neutrino Ns in the case of seesaw mech-
anism. The importance of the argument in terms of the effective operator is that
the M needs not to come from the right-handed Majorana neutrino, but may be
attributed to any heavy 51/(2) singlet field with mass M.
VeL VTL
VeR, VvR,VTR,eR,m,TR- (6-49)
e-L V-L TL
Without loss of generality, the charged leptons, e^, e^ etc., may be understood as
their mass eigenstates, and their partners in the doublets, vei, etc. are states emitted
by weak interaction processes and are called "weak eigenstates". Suppose the mass
matrix for neutrinos in the base of weak eigenstates is diagonal. (For brevity we
here assume neutrinos have only Dirac masses), Then, the generation number or
quantum number of flavor such as electron number Le (Le = 1 for e~, ve, Le = 0 for
others), muon-number LM etc. are strictly conserved. If, on the other hand, the mass
matrix is non-diagonal, the flavor changing processes such as ue —> v^ or /J, —> e'y
will become possible. The phenomenon such as ve -> */M, where a neutrino of some
specific flavor is transformed into a neutrino of another flavor, is called "neutrino
oscillation", since the probability of the transition oscillates as the function of time.
Even if we switch on flavor mixing, provided the neutrino masses are all vanishing
or, more generally, degenerate, the amplitudes of flavor changing processes will
exactly vanish. This can be explicitly checked in the formula of neutrino oscillation
probabilities, as we will see later (see for instance (6.69) for Amjj = 0). This fact
also may be easily understood from a symmetry argument. When neutrino masses
are all degenerate, there appears a global symmetry C/(3) among 3 generations or
Flavor Mixing and Neutrino Oscillation (in the Vacuum) 115
flavors in neutrino sector, and by use of Noether's theorem, we easily find that
there should be 3 conserved additive quantum numbers, which may be interpreted
as Le etc. Or, more intuitively, if neutrino masses are all degenerate there would
be nothing to distinguish neutrino flavors, and we may always perform a unitary
transformation, so that all flavor mixing angles vanish in the new base. Thus to
realize flavor changing processes we need both flavor (or generation) mixing and
non-degenerate neutrino masses. This is in complete similarity with the content of
GIM mechanism in quark sector.
It should be emphasized that now there are increasing evidences for neutrino os-
cillations claimed by the experiments to detect the solar and atmospheric neutrinos,
as will be discussed later in this chapter. The phenomenon of neutrino oscillation
is quite interesting by its own right. In addition, theoretically, it is expected to
play a central role in the search for and the establishment of "new physics", since it
clearly indicates non-vanishing neutrino masses and therefore some physics beyond
the standard model.
A nice comprehensive review on the neutrino masses and neutrino oscillations is
given, e.g., in the recent textbook by Fukugita and Yanagida (Fukugita-Yanagida,
2003).
Though a Dirac fermion can be decomposed into two Majorana fermions with de-
generate masses as we have already seen in the previous section, we do not take
this picture, and the mass matrix is 3 x 3 matrix just as in the quark sector. As
we have learned in the quark sector (see (4.144), (4.145)), the matrix m o can be
diagonalized by a bi-unitary transformation,
/ mi 0 0 \
U^mDV = I 0 m2 0 , (6.51)
\ 0 0 m3 /
where U and V are unitary matrices. The left-handed eigenstates of mass matrix,
i.e. mass-eigenstates, vn, V2L, V3L with Dirac masses, m\, rri2, m.3 are related to
116 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
weak eigenstates as
( VeL
VeL \\I (1 VlL
v1L \
V»L \=u. 1 V2L (6.52)
1
VTL ) V VZL
Accordingly the charged current interaction of left-handed leptons is written in
terms of U
mL m
fV (6.55)
mo rriR J
where 3 x 3 matrices TUL, TUR, mo denote left-handed, right-handed Majorana
mass matrices and Dirac mass matrix, respectively. As we have already seen in the
simplified 1 generation scheme, the mass matrix m„, as the whole, is a symmetric
matrix, m^, = m„ (mL = mL,mR = TUR). TO consider the seesaw scenario, we
assume that mi = 0. The mass matrix with TTIL = 0 is easily achieved in the
standard model, by introducing right-handed neutrinos and their Yukawa couplings
and bare mass terms,
where {mD)a0 = f%* • ^ , 4> = iT2<\>* (see (4.137)) and ^ is the VEV of the
neutral Higgs ip°. To account for the seesaw mechanism we understand that all
matrix elements of TUR are much larger than those of mo- Then, as we have seen in
Flavor Mixing and Neutrino Oscillation (in the Vacuum) 117
the 1 generation scheme, the mutual mixings among uaL and {vaR)c (corresponding
to 0V in (6.29), (6.30)) are small, being suppressed by the ratios of elements of mo
to those of TUR. Just as in the 1 generation scheme, the mass matrix is made into the
form of block-diagonal by a suitable unitary transformation with the small mixing
angles,
/ %I -imjjmjf 1 \ m ( H_ ™>D™~R \
\ mRlm*D I J " \ —im*R 1rriD I J
„ ( m^rofc)-1^ 0 \ , 5?)
y
- V 0 m*R J' •
Thus again eigenstates with masses of the order of TUR, decouple from the low
energy effective theory, and the mass matrix for the remaining neutrinos, which are
approximately uaL, may be regarded as
Again what matters in the neutrino oscillation is not m^i itself, but
this chapter. We have argued that neutrino oscillation is possible, for pure Dirac or
seesaw scenarios, only if there are more than 1 generation of leptons. Strictly speak-
ing even in 1 generation scheme, there exists an "oscillation" of the type "VL -> VR"
with chirality flip. This seems to be able to explain, just with 1 generation, the puz-
zle of e.g. solar neutrino, since the transformed state VR has no weak interaction
(a "sterile" state) and escapes the detection. However, unfortunately this is not
the case. As the neutrino masses are known to be quite small, neutrinos are highly
relativistic under the real situation of experimental detection, i.e. mv -C Ev. Since
for the relativistic particle the chirality is nearly equal to the helicity and therefore
is almost preserved, the probability of the transition UL —• VR is quite small,
The fact that the chirality flip is negligible suggests that in the consideration of
neutrino oscillation the spin degree of freedom is not relevant. (If we take the ef-
fect of background magnetic field into account, which may be suitable for neutrinos
propagating inside the sun or supernovae, the spin-precession of the neutrinos due
to the anomalous magnetic moment may take place without the chiral suppression
factor {mv/Ev)2 (Voloshin et al., 1988; Lim and Marciano, 1988; Akhmedov, 1988),
and this statement does not hold. We, however, ignore here the effect of the mag-
netic field.) Thus the relevant equation of motion for neutrinos is the Klein-Gordon
equation, rather than the Dirac equation. We have already discussed that when
the chirality flip can be ignored there is no essential difference, concerning neutrino
oscillations, in the two cases of pure Dirac and seesaw. Thus we now assume that
neutrinos are pure Dirac particles. The mass eigenstates vn, (i = 1,2,3) obey the
following Klein-Gordon equation
( • + m2) Vi = 0, (6.62)
whose solution with definite momentum and (positive) energy is just an ordinary
plane-wave solution
(6.65)
As we have seen above this expression holds for both of pure Dirac and seesaw
scenarios. This is why we cannot discriminate these two types of neutrino mass
generation just from the data of neutrino oscillations. The above evolution equation
is readily solved to be
exp(-^mI/m+i)
= U (6.66)
\
where a relation exp(— jgmumlt) = exp(—; m
-2^ diag
2
= Uexp(-^m diagt)U* ( m d i a g = diag(mf,mi,m|)) has been used.
Suppose a neutrino is born as va at time 0. The initial condition is then given
as va(0) = 1, others = 0. Under this initial condition, the probability amplitude
m 2
to detect VQ at time t becomes u0(t) = £V Upi e - *"^"' U*{. Thus the probability
for the neutrino born as va at time 0 to be observed as vp at time t is given by a
formula (see Fig. 6.2)
= I£ u
Pi e~ U*ai\ (6.67)
2 _ 2
In the second line of (6.67) Am-j rrv- — m\(Am ! = 0) and an overall phase
factor has been ignored. This formula clearly shows the fact that to get neutrino
oscillation both flavor mixing described by U and the mass (-squared) differences
Am 2 are necessary. In fact, if A m ^ = A m ^ = 0 then P(va -> v$) = 0 for
a 7^ P due to the unitarity (UU^)pa = 0. More intuitively, the neutrino oscillation
120 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
_.m?
» » » > » •
Fig. 6.2
real orthogonal matrix, as there does not appear any CP violating phase for n = 2:
( cos 6 sin 6 \
TT . „_
U= [ . 6.68
v
\ -sin 6 cos 6 J '
In this case, the mass-squared difference A m ^ is unique. Then (6.67) tells us that
the transition probability of the process ve —> v^ reads as
= l-sin220sin2(^|^). (6.70)
and therefore the unitarity for ve, (6.71), and the corresponding relation for v^
imply
production and detection points of the neutrinos. If it is the case, the relevant
quantities are time-averaged (the average over the positions is equivalent to the
time-average) transition or survival probabilities. For instance, in the 2 generation
scheme, we have the time-averaged probabilities denoted by P,
Though the survival probability (6.70) itself can be sufficiently small for 6 ~ 7r/4,
the time-averaged probability has a lower bound,
Then the inequality is easily proven by use of the unitarity condition of the MNS
matrix, £ ] f |£/ tt j| 2 = 1. The square of this unitariry relation yields
which gives
4
Y>-| >-- (6-86)
Resonant Neutrino Matter Oscillation 123
The inequality P{ve -*ve)>\ is the simplest example in the 2 generation scheme.
We further note that the equality in the above relation, i.e. £)i \Uai\A — ^ is real-
ized when the equality in the (6.84) is met: \Uai\2 — \Ua2\2 ~ ••• = \Uan\2, namely
in the case of "maximal mixing".
We have seen in the previous section that the time-averaged survival probability
never gets lower than 1/n (n : the number of generations). Thus, in the 3 genera-
tion scheme it is impossible for the vacuum oscillation (neutrino oscillation in the
vacuum) to account for data which indicates the survival probability lower than
1/3, such as the one of the pioneering solar neutrino experiment by R. Davis and
collaborators. Furthermore to realize the minimum value 1/3, a fine tuning of the
mixing angles is necessary so that \Uai\2 = \Ua2\2 = \Ua3\2 is realized.
Such difficulty may be overcome, once we invoke to a resonant neutrino os-
cillation due to the interaction of neutrinos with matter inside the medium, e.g.
the sun. Before discussing the detail of the "resonant matter oscillation" it will
be useful to note that the matter effect (the interaction of neutrinos with matter)
leads to an additional potential energy V(x) only for ve. The V{x) then causes
the change of the frequency of the matter wave of ve, depending on the coordinate
x, and therefore time t (x ~ t), for a highly relativistic light neutrinos, while the
frequency of the matter wave of another flavor, e.g. v^ in 2 generation scheme,
stays a constant. Thus it is quite possible that at some point, or equivalently at
some time, the frequencies of these two matter waves just coincide, and a resonance
phenomenon occurs. If we start from the matter wave of ue, at the resonance point
almost complete transition of the matter wave from ve to v^ is possible, even for
small mixing angle 8, as far as some suitable condition is met. It is analogous to the
case of two tuning-folks of musical instrument, the length of one of these two folks
being variable. When the lengths of two folks coincide the vibration of one folk
may be resonantly transferred into that of another folk, almost completely. One
important thing is that for the transfer to be efficient the variation of the length
should be slow enough, i.e. an adiabaticity condition should be met. The role of
the mixing angle 6 is played by the air (!), which mediates the sound connecting
the vibrations of the two tuning folks (see Fig. 6.3).
Now let us confirm the above intuitive argument really holds by use of math-
ematical formulae. First let us calculate the potential V(x). Since neutrino os-
cillation is caused by the interference effect (beat) of coherent matter waves with
a fixed direction of momentum ft, the potential V(x) should be attributed to the
elastic forward scattering of neutrinos off electron, proton and neutron inside the
medium, due to the weak interaction. V(x) changes the dispersion relation between
|p| and E of neutrinos, and therefore the index of refraction of neutrinos. Such elas-
124 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
•* • •* •
resonance
point
air
Fig. 6.3
p,n,e
'ei"\ii p,n,e
(a) (b)
Fig. 6.4
tic scattering with protons and neutrons are possible only through neutral current
processes (Fig. 6.4(a)). In the case of neutrino interaction with electron, however,
in addition to the neutral current process, only i/e has a charged current process
(Fig. 6.4(b)). The matter effect due to the neutral current process via Z-exchange,
Vn(x) is universal for all of three neutrinos. This Vn(t) provides an overall phase
factor exp(—if Vn(t')dt') for all neutrinos. The universal phase has no physical
consequence, since neutrino oscillation is caused by superposition of multiple matter
waves with slightly different frequencies, i.e. by the "beat" of matter waves. The re-
maining charged current process only for ve seems to be a scattering process, rather
than providing a potential. It, however, turns out that it can be interpreted as a
process to give the potential. The charged current process, Fig. 6.4(b), provides an
effective 4-Fermi contact interaction Hamiltonian (for neutrino energies E C Mw)
which is practically equivalent to the neutral current process. Let us recall that in
QED, a background electric Coulomb potential V(x) interact with electron as e^e x
V(x). Similarly, the above 4-Fermi interaction may be regarded as an interaction
of the left-handed ve with "Coulomb" type "static" potential Vc{x)
where e7°e is identified with the electron number density Ne(x), and other com-
ponents, ejle (i = l,2,3),e7 M 7 5 e have been neglected as they are expected to be
proportional to the expectation values of velocity or spin of the electron and may
be neglected for static and un-polarized medium of electron. Let us again recall
that under the influence of 4-potential A^ = (V, A) the dispersion relation be-
tween momentum and energy of ve is modified into E — Vc = y(p — A)2+m2 or
E = y (p - A)2 +m2 +VC. Thus concerning the static un-polarized medium, only
uei gets additional potential energy Vc, and corresponding change of the energy.
Thus the time evolution equation, "Schrodinger equation" in the base of (ve, v^, vT),
(6.65) is accordingly modified into
0
°\ / v
a(t) 0 e
0
Amj|
| • C/+ + 0 0
° HW""r
0 0 0/
2E
(6.91)
where the matter effect has been indicated as a(x) = ^/2GpNe(x), instead of Vc,
and the replacement mf —> Am-] has been done ignoring an overall phase associated
with ml/2E.
Though we should treat the realistic 3 generation scheme, such treatment is com-
plicated, generally speaking. As we will discuss in the next section, fortunately there
is an approximate reduction formula, valid under the hierarchy of mass-squared dif-
ferences A m ^ <3C A m ^ , which enables us to reduce the problem in 3 generation
scheme to that in an effective 2 generation scheme. Therefore, we will deal with the
matter oscillation in 2 generation scheme with a set of generic parameters of one
mass-squared difference and one mixing angle (Am2,8), in this section. Such argu-
ment will be useful, even in the realistic 3 generation scheme, since the reduction
formula is valid.
In the simplified 2 generation scheme, by use of (the simplified version of) (6.91)
and (6.68) the time-evolution equation can be explicitly written (after subtracting
a common factor ^ 21 sin 2 fl at the diagonal elements) as
ferential equation can not be solved analytically, unless the iVe (t) has some specific
time dependence. However, in an appropriate circumstance, we may still get a sim-
ple analytic formula for the neutrino oscillation. Namely, as we have seen in the
analogous case of the tuning-folk, as far as the time dependence of a time variant fre-
quency is mild (the exact meaning of this will be defined shortly), almost complete
conversion of one matter wave to the another should be possible. The condition of
the "mild time dependence" is called "adiabaticity". The resonance phenomenon,
which occurs when the two frequencies become identical, should happen in this sys-
tem of two neutrinos when the two diagonal elements of H(t) becomes the same,
namely at the point where the following condition is met,
^fsm26
tan20m = -r-2 — 7= • (6.98)
^£cos26-V2GFNe(t)
Fig. 6.5 shows how the mixing angle inside the matter 6m changes as Ne varies.
We see that the resonance point, satisfying Ne(t) — Ner, is the point where the
conversion of matter wave is most efficient and therefore the mixing 0m is maximal,
i.e.
0m = \- (6.99)
We may also say that the 6m changes most rapidly around the resonance point, and
the "resonance region" can be defined as to be a region where |tan20 m | > 1.
Actually, although we have diagonalized H(t), the evolution equation is still a
coupled differential equation, since the unitary transformation by Um(t) is time-
dependent. That is why we cannot solve the differential equation analytically.
Resonant Neutrino Matter Oscillation 127
Fig. 6.5
Namely, we get a residual term with off-diagonal elements in the evolution equation
in the base of {vmi, vm2)1,
.d E1(t) 0 0
!-— (6.100)
dt 0 E2(t) + 0
Then what is a merit of working in this base? Well, in this base the time evolution
in the adiabatic case, the case where the adiabaticity condition is met, is easily
solved. In fact if the condition
|^m|«|£2-E1| (6.101)
is always satisfied, the residual term in the Hamiltonian can be safely ignored, and
the system is approximately diagonalized:
Bk(t) 0
(6.102)
'dt Vm2 0 E2(t)
The adiabaticity condition will be the most non-trivial at the resonance point, where
the change of 0m is maximal and the difference of energy eigenvalues is minimum,
\E2 — E\\ —
= 4rg-sin20. Thus the adiabaticity condition may be written as
tan20 E
» (6.103)
d logJVc Am 2 sin20'
dx
where 6m has been written in terms of — " 1d^g" ' ',. which should be evaluated at
the resonance point. The physical meaning of this adiabaticity condition is that
the wave length of the "beat" at the resonance point, i.e. ~ 1/|£^2 — E\\ =
E/(Am2sm28), should be much smaller than the width of the resonance region,
i.e. ( A ^ s i n 2 0 ) / ( v / 2 G F M , | r e s ) = t a n 2 0 / ( f L ! o g ^ | r ( ; s ) _
128 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
(i/e|Texp(-t / H(t')dt')\ue)
Jo
e x p ( - i /„' E1(t')dt')
( cos9m(t) sm6m(t) )
exp(-t /„' E2(t')dt')
cos0m(O)
sin0m(O)
cos0m(i)cos0m(O) e x p ( - i / Ex(t')dt')
Jo
P(ye -> i/e) = cos 2 0 m (i)cos 2 0 ro (O) + sin 2 ^ m (i)sin 2 6l m (0). (6.107)
If the matter effect is ignored, i.e. if Ne = 0 and 6m(t) = 9m(0) = 9, this reduces
to (6.78) for the vacuum oscillation.
We now realize that even if the mixing 9 is small, almost complete conversion
of a solar neutrino, starting as ue, into v^ at the solar surface is possible, so called
MSW effect (Mikheyev and Smirnov, 1985; Wolfenstein, 1978). Namely, setting
9m(t) ~ 9, 9m(0) — f and assuming iVe(0) 3> Ner, the above formula is simplified
into
which can be sufficiently small even for small mixing 9, in clear contrast to the case
of (6.78), due to the resonance phenomenon. Such effective adiabatic conversion
mechanism between two states, having level crossing, is visualized in Fig. 6.6.
In Fig. 6.6, the dashed lines indicate the behaviors of the two diagonal elements
of H(t) as the functions of Ne. The solid curves correspond to the two eigenvalues
i?i(£) and E2(t). If ve is produced at a point with iVe(0) 3> Ner, ue state is approx-
imately on the upper solid curve, as the state ve is approximately an eigenstate of
H(0) with dominant energy due to the matter effect. As the matter density and
Resonant Neutrino Matter Oscillation 129
E
i i
s ^ v e
"m2 > ^
-^—'
Am2 * ^ S"
2E v ^T«^ =-^
* "M
^J>^^ "ml
^ ^ 1
1
1
^s^ve
. 1 —• JVe
Ne/
Fig. 6.6
therefore iVe decreases, e.g. as ve traverses from the center toward the surface of
the sun, the state evolves along the upper solid curve, as long as the adiabaticity
is met, and after passing through the resonance (level crossing) point, it will follow
the upper dashed line, to end up as the i/M state.
In the case that the adiabaticity condition is not satisfied, the above formula
should be modified. In this case at the resonance point or at the level crossing,
there should be a "jumping" between vm\ and vm2 states, i.e. between the two
solid curves. Let the probability of the jumping be Pjump- Then the two states
interchange by a probability PjUmp and remain to be the same by a probability
1 — Pjump- Now the survival probability reads as
This is based on the reasonable assumption that the jumping takes place only
near the resonance point. Then it will be a meaningful way of doing to replace
the matter effect by its linear approximation so that they coincide at the resonance
point. Such a replacement makes it possible to use the Landau-Zener formula for a
level crossing (Landau, 1932; Zener, 1932), leading to
7rAm2sin220
• jump = exp(- -), (6.111)
d log
4Ecos20 dx
NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
log Am 2
Fig. 6.7
where the derivative —J* " e should be taken at the resonance point.
In this way, substantial conversion of ve is possible by the MSW effect for solar
neutrinos, as the typical example of the resonant matter oscillation. When the
survival probability P{ve -> ve) is fixed by a experimental data, it will determine a
curve in the 2-dimensional space of the parameters, ( ^ f ,Am 2 ). When P(ue -¥
ue) < 1/2, the curve is known to draw a closed triangle (roughly speaking) in the
log-log plot of the parameters shown in Fig.6.7, which we call MSW triangle.
The three sides of the triangle have their own physical meaning. Namely, the
horizontal size represents the parameters, for which the level crossing starts to
occur, Ne(0) ~ Ner = A 92y/2G
T
^ | ^° SE^ 6> with E being an average neutrino energy. For
F
So far the concrete formulae for the probabilities of neutrino oscillations by use
of the mixing angle and the mass-squared difference have been given only for the
simplified two generation scheme, for illustrative purpose. Needless to say, to make
the analysis realistic, we have to derive formulae in the full three generation scheme,
in terms of two mass-squared differences of neutrinos,
f 1 0 0
0 C23 «23
V 0 -S23 C23
23^13^12,
where Sij and Cij represent sin#,j and cos0jj, respectively. (This parameterization
just corresponds to (4.183) for the KM matrix in the quark sector.) Such formulae,
however, are complicated even for the case of vacuum neutrino oscillations, being
described by the 6 parameters A m ^ , A m ^ , ^ , 0 2 3 , 0 1 3 , 5 . Thus given data on the
neutrino oscillations provide 5-dimensional hyper-surfaces in the parameter space,
which of course cannot be shown graphically. Even if we get the allowed values of the
6 parameters numerically, it may be practically impossible to get some meaningful
physical information on the parameters from them. To be worth, we will not be sure
with the obtained numerical values of the parameters, which mass-squared difference
and mixing angle are really handling the oscillation, while each experimental data
on the neutrino oscillation is usually provided in the form of the allowed region of the
parameters assuming a simplified two generation scheme with generic parameters
(0,Am 2 ).
Fortunately, the hierarchical structure of two mass-squared differences of neu-
trinos, suggested by the data of atmospheric and solar neutrinos experiments,
enables us to reduce the problem of analyzing neutrino oscillations in the full three
generation scheme to that in the effective two generation scheme. Namely, "re-
duction formulae" are known to exist for each type of neutrino oscillations, which
obviously make the analysis of the allowed region of theory parameters quite similar
to those in the simplified two generation scheme, thus making the analysis quite
transparent and physically meaningful. In addition, in this way we can clearly see
which mixing angle and mass-squared difference out of the 6 parameters should
be identified with those appearing in the experimental data. As we will briefly
discuss in the next section, to explain the data on the atmospheric and solar
neutrino oscillation two different scales of mass-squared difference are inevitable:
Am2atm ~ 2 x 10~ 3 (eV2) and A m ^ l o r ~ 7 x 1CT5 (eV2). Thus the identification
should be made, which means the hierarchy (6.115) is a reasonably good assumption.
(In fact numerical calculations show that the reduction formulae given below provide
enough accurate oscillation probabilities.)
Now we will discuss these reduction formulae for each of vacuum and resonant
matter oscillations, successively below.
Amf, _
Amf,
where L denotes the distance between the production and the detection points of
neutrinos. The oscillation of atmospheric neutrinos and terrestrial "long-baseline"
accelerator neutrino oscillation experiments may be classified into this category. It is
worth while noticing that the oscillation length of atmospheric neutrino oscillation,
implied by the zenith-angle dependence of its survival probability, is not the order
of the size of the earth, but is of the order of a few hundred kilometers, roughly the
distance from the Super Kamiokande detector till the top of the atmosphere at the
Zenith-angle = \. Strictly, for the up-going neutrinos in the Super-Kamiokande
detector, i.e. from the opposite side of the earth, the oscillation due to A m ^
may not be negligible, though we do not consider the effect in this textbook. The
wavelength is comparable to the baselines of K2K (KEK to Kamiokande) and other
proposed long baseline experiments.
Thus, ignoring the smaller mass-squared difference, A m ^ = 0, the general
oscillation formula (6.67) reduces into
due to the unitarity J2i UpiUai ~ <W- Several survival and transition oscillation
probabilities, relevant, e.g., for the atmospheric neutrinos, now takes simple concrete
forms in terms of mixing angles
= sin 2 20 23 cos 4 0 13 s i n 2 ( ^ - i )
AE
J 22 , A m | l f ) )
~ sin 2023 sin ( (6.121)
AE
'31
Pfa-H'e) = 4|C/ e3 | 2 |t/ M 3| 2 sin 2 (- t)
AE
sin22013sin2023sin2(^-t)~O, (6.122)
where the approximate formulae are for small #i 3 , reported by e.g. CHOOZ exper-
iment,
sin 2 20 13 < 0.2. (6.123)
These formulae imply that, roughly speaking, ve does not oscillate and is decoupled
from other neutrino species, and only v^ O vT oscillation is possible. This is what
we expect for atmospheric neutrinos. We should also note that the formula for the
Vf! <-> vT oscillation (6.121) is of exactly the same form as the transition probabil-
ity in the simplified two generation scheme, i.e. as (6.69), though the replacement
0 -> ^23, Am 2 ! —> A m | j has been made. Namely, in this case the mixing handling
the oscillation is known to be 0 23 • Thus, the experimental data on the atmospheric
neutrinos should provide constraints on the set of parameters (0 2 3 ,Am 3 1 ), as we
will see in the next section. Concerning the possible CP violation in neutrino os-
cillations, the formulae given above are not suitable, since ignoring Am 2 1 and 0i 3
will erase the CP asymmetries, just as in the quark sector (see (9.57), (9.58)).
so large that the oscillation length due to AmJi is much shorter than the baseline L,
i.e. the distance between the production and the detection points of the neutrinos.
Thus the oscillatory factor due to A m ^ should be time-averaged, i.e.
.Am?, . „ ,
C0S( } (6 124)
~ 2 # * ~* °' '
which means the interference of the matter waves of v3 with those of v\ and v2 can
be ignored. This treatment will be relevant for the experimental situation satisfying,
A 2 ( Am 2 i \ T
{
E (^f^) 100km>- '
\5MeV I
^ 3 1 L = 2 .0xl02. W ^ i Z ( _ £ _ ) » ! . (6.125)
This is exactly the case for the KamLAND experiment, recently started in Japan.
Ignoring the interference of the matter waves of v3 with those of v\ and v2, the
general oscillation formula (6.67) reduces into
where Peff denotes the survival probability in the effective two generation scheme,
which is obtained by the replacing the factor sin20 in the formula for two generation
\u 2 12
scheme by a factor 1_iU ' p . Namely,
For instance, the concrete reduction formula in terms of mixing angle for the ue
survival probability, which we denote by 5, is given as
S = cos 4 6i 1 3-5e//(6ii2,Am^ 1 )+sin 4 (9 1 3,
~ cos 20i3-S e / / (0i2, A r r 4 ) (6.129)
where the survival probability of ve in the effective two generation system, denoted
by Seff is described by a set of parameters (#12, Am^i) and reads as
The approximate formula in the second line of (6.129) is for small 9i3, (6.123). The
formula (6.129) with (6.130) will be also applicable to the survival probability of VI,
relevant for the KamLAND experiment, as well, as long as the small matter effect
Neutrino Oscillation in the Three Generation Scheme 135
due to the neutrino interaction with the earth is negligible. Thus, in this case the
oscillation is known to be handled by the mixing angle #12-
The consequence of the CPT theorem shown in (6.72) (for the negligible matter
effect) implies that the CP asymmetry is identical to the T asymmetry defined by
Namely, A^F = A^g. This can be confirmed explicitly in the formula (6.67).
Namely, the charge conjugation causes the replacement Uai —> U*i for every MNS
matrix elements, which is equivalent to the replacement a <->• /? in (6.67).
Then, a trivial relation A^p = —ATpa tells us
Thus the CP violating effects do not appear in the survival probabilities and appear
only in the transition probabilities. On the other hand, since the neutrino oscilla-
tion we are considering here does not break lepton number, the unitarity, or the
probability conservation read as
which means J2p Aap = °- Together with A%£ - 0 and A°% = -A^ we get a
relation for a = e, A^ = A^f. Similarly, for a = n we get A^J = Aff. We thus
136 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
obtain a remarkable relation as the specific feature of the three generation model,
This is essentially the consequence of the fact that in the three generation model
the CP violating phase is unique, i.e. 8 alone.
Let us now derive the explicit form of AGP. From (6.67) we realize that for an
arbitrary choice of the pair (a,/3),
4
A% = P{»a -+ vp) - P(*a -> ftp) = ~ E MUaUtiUnUZj) •sin(^t).
(6.137)
Write TmiUdU^UpiU'j) = Jap,ij- By definition we readily know Jap,ij — ~Jp,a,ij
and Jap,ji = —Jap,ij- On the other hand, the orthogonality of the MNS matrix
implies £V Jap,ij = 0. We thus know for j = 2, e.g., J a/ 8,i 2 + Ja/3,22 + Ja/3,32 =
Jap,i2 + Ja0,32 = 0, Similarly, for j - 3 we get Jap,\z + Ja/3,23 = 0. By use of
the property Jap,ji = —Jap,ij, we thus conclude Jap,i2 = Ja0,23 = Ja0,3i- Similar
reasoning provides Jeii,ij = J\IT,%J = Jre,ij- In this way we have demonstrated that
actually Jap,ij is unique, in spite of the possible choices of the combinations of
a, /? and i,j. Hence we define J = J e /i,i2, which is the leptonic counterpart of the
Jarlskog parameter (9.58) in the quark sector. In terms of the parameterization in
(6.114),
J = Cl2Sl2C23S23Ci3Si3S<5, , (6.138)
anti-neutrinos are not the same. Thus how to avoid or extract the matter effect is
another challenging issue.
^ p , V2GFNe « ^ , (6.140)
which is the case for solar neutrinos. (Concerning the neutrinos emitted from super-
novae, the matter effect can easily exceed Am^1/(2E) in the region near to the core,
and two-step level crossings become possible. Though there exists an extension of
the reduction formula, applicable for the case, it is out of scope of this textbook.)
Here let us focus on the ne survival probability S. We will now see that, even in
the presence of the matter effect, a similar reduction formula to the one given in
(6.129) holds;
where aeff = cos 2 0 13 a(x) = cos 2 0 13 \J2GFN<,(X) is the "effective" matter effect,
reduced by the factor cos 2 #i3, in the effective two generation scheme (Lim, 1987;
Smirnov, 1992; Shi and Schramm, 1992).
Now let us derive this formula. We start with the time-evolution equation in
the three generation scheme, (6.91). What we attempt is to separate the 3 x
3 Hamiltonian matrix, though it cannot be exactly diagonalized because of the
presence of the matter effect, into those for one and two neutrino systems, i.e.
3 —» 1 + 2, by making some suitable approximation. Namely, we attempt to make
the Hamiltonian of the form of block-diagonal, relying on the hierarchy (6.140). It is
reasonable to expect that one mass eigenstate uz, having the dominant "energyh, i.e.
Am2
the diagonal matrix element 2£x, is decoupled form other two states, orthogonal
to 1/3, namely the linear combinations of v\ and v-i. Thus, in the base of mass-
eigenstates, the Hamiltonian is approximated by H ~ diag(0,0, -^gp-), roughly
speaking. Instead of mass-eigenstates i>i and i/2, however, it will be convenient to
take their linear combination defined by u'e = cosfl^i + sm6i2V2, ^ = — s i n f l ^ i +
cos#i2^2- The reason is, in this way, the 2 x 2 sub-matrix of the Hamiltonian in the
space of v'e and v' has a similar form to that in the two generation scheme, since
only v'e has the matter effect and the term of a(x) appears in the diagonal element
138 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
cos# 12 ^i + sin^12i/2
-sin^i 2 ^i + cos^i2^2 | • (6.142)
"3 / \ VZ
Recall that from (6.52), (6.114) the relation ipw = V ^ V i s V ^ m holds between the
column vectors of weak- and mass- eigenstates, \j)w = {veL,vllL,vTi,)i and ipm =
(viL,v2L,V3L)t- Thus, the relation (VJ3ipw) = Vi3(Vi2ipm) means £ = Vi2ipm =
V/3 (V23 ipw). More explicitly,
/ cos^i3i/e - smBlze~iSvT
( v'e \
^
1/3 /
= ^
V sin6>i3ei,5^e+cos^i3J>r
\
/
, (6-143)
where
sm
£M = COS#23^V ~ ^23^T - -7={Vfi ~ VT),
vT =
sin0 2 3^ + cos023^T - —j=(vn + vT), (6.144)
v2
where the approximate relations are for #23 — f, as suggested by the data on the
atmospheric neutrino oscillation. Actually, as far as the survival probability of ve
is concerned, we may set #23 = 0 without changing the answer, since both of initial
and final states have nothing to do with v^, uT and the matter effect is invariant
under the rotation of these states. As we expected, among v'e and 1/^, only v'e has
the ve component and, therefore, the matter effect.
Now the time-evolution equation in the base of £ is written as
S=\v
dt 12
l A12 W2 + ^
A 13,
' A i 2 5 ? 2 + ac? 3 Ai 2 si 2 c 1 2 0 \ / 0 0asi3ci3'
A12s12c12 A 12 c? 2 0 + 0 0 0 ) ^,
0 0 A13 + a s ? , / \as13C13O 0
(6.145)
Am|, A Am*
where A 1 2 = -fgf 1 ) ^13 — ~ ^ a n d c*j = cos dij, s^ - sin 6{j. The second matrix
in the right-hand-side of the above equation may be treated as a small perturbation
under the hierarchy (6.140), and may be ignored. One may wonder why the small
quantities in the 2 x 2 sub-matrix in the first matrix in the right-hand-side are kept.
This is because at the leading 0-th order of perturbation, i.e. eliminating these
small quantities as well, the Hamiltonian has degenerate eigenvalues 0 in the space
of sub-matrix. Thus the two eigenstates in the sub-space is not fixed at the 0-th
order, and the small perturbation should be kept only in the sub-space. We can
Atmospheric and Solar Neutrino Oscillations 139
read off the factor cos2#i3 a(t) in the sub-matrix, which comes from the relation
v'e — cos#i3i/e - sin^ise - 1 " 5 ^. Namely, in this sub-space the matter effect is reduced
to
dc ( Ai 2 s? 2 + acf3 Ai 2 s 12 ci2 0 \
if ~ A12s12c12 A 12 c? 2 0 £. (6.147)
V 0 0 Al3+as213J
Let us note that the separation 3 —> 1 + 2 has been realized, and the time-evolution
in the subspace of (u'e, u'^)1 is exactly the same as that of the two generation scheme,
except for the replacement of the matter effect a —> aeff.
Since ve state corresponds to (ci3,0, Si3et<5)', the survival probability S is given
as
2
S = l & ^ O O i / J + ^e"^"*''")'!
= cos40i3 • S e / / ( 0 i 2 , A m 2 1 ; a e / / ) + sin 4 0i 3
~ cos2913-Seff(e12,Aml1;aeff), (6.148)
which is nothing but the reduction formula (6.141). The approximate formula in the
third line is for small #13, (6.123). In the derivation we have ignored the interference
of 1/3 with other two states, as the oscillation due to A13 is very rapid: e~ t A l 3 ' —>• 0.
The effective survival probability 5 e //(0i 2 , Am 2 1 ; a e / / ) should be calculated by use
of the 2 x 2 sub-matrix in the Hamiltonian of (6.147). Let us note that if we neglect
the small #13, 5 = Seff and ve = v'e. Namely, in this case the two generation
treatment is exact and the solar neutrinos oscillate into v' = j>M;
and we expect roughly the same amount of v^ and vT in the solar neutrinos. The
error of the reduction formula (6.141) is anticipated to be handled by the relative
strength of the ignored off-diagonal matrix element to the dominant matrix element,
of the order a fin/13 < 10~ 2 , which has been confirmed by numerical calculation.
\ 2B I
Thus the neutrino source with large L and very intense flux should be ideal one to
test the oscillation. We can find such ideal neutrinos sources with astrophysical or
cosmological origin, i.e. atmospheric neutrinos and solar neutrinos.
As we now very briefly discuss, both of these neutrinos have experimental puz-
zles, called atmospheric and solar neutrino problems (anomalies): the observed
event rates of these two-types of neutrinos are significantly smaller than the theo-
retical predictions. We will see below that these reductions of the neutrino event
rates are naturally accounted for by virtue of the atmospheric and solar neutrino
oscillations.
Let us note that the decay rate of n+ —> e + + ue is strongly suppressed compared
with that of n+ -> fi+ + v^ shown above roughly by a factor (me/m^)2 (see (2.49)).
Though the absolute values of vt and v^ fluxes have some uncertainties in their
calculations, in the ratio of these fluxes {yis,-\-vv)j{ytJrve) {v^ denoting the i/M flux,
etc.), such uncertainty is considerably reduced. The above decay chain suggests
that the ratio is roughly 2.
The atmospheric neutrino experiment by Super-Kamiokande collaboration claims
that the "double ratioh of the observed [y^ + PM)/(i/e + ve) to that of the predicted
value remarkably deviates from 1,
R= K + PMWfo+PeU ^Q6 ( 6 m )
\Vn + V^pred/yVe + Ue)pred
sub-GeV multi-GeV
250 76 r
e-like e-like
p < 0.4 GeV/c 200 . p>0.4GeV/c
200 -
150 150 • i
100
-frri j
1UU •\A±
50 50
0
200
H-like
160 p < 0.4 GeV/c
T 7 - r < 7 T 7 |\i 11 k i n v r r
120
80
40
0
"-1 -0.6 -0.2 0.2 0.6 •0.6 -0.2 0.2 0.6 1 -1 -0.6 -0.2 0.2 0.6 1 -0.6 -0.2 0.2 0.6 1
cos© cose cose
Fig. 6.8 The Zenith-angle (0) dependence of ;i-like and e-like neutrino events for the scattered
particles in the energy ranges of sub-GeV and multi-GeV. The hatched regions are expected events
for no neutrino oscillation, while the bold lines indicate the (best-fit) expected events assuming
the neutrino oscillation Vy. —• vT. From the result of Super-Kamiokande collaboration, (Super-
Kamiokande collaboration, 1998).
(6.119), with the data, Super-Kamiokande has derived the allowed parameter re-
gion
To be interesting, the best fit value of sin2 2#23 is 1. Namely the maximal mixing
has been realized in the lepton sector, in clear contrast to the case of quark sector !
To get the restrictive range of A m ^ , the data on the Zenith-angle dependence is
very helpful. Namely, the dependence of the atmospheric neutrino capture rate on
the Zenith-angle 9 at Kamioka, shown in Fig. 6.8, is probably the most convincing
direct evidence of neutrino oscillation.
The figures tell us that the muon-like event rates at cos 9 = —1, i.e. 9 = n,
corresponding to the atmospheric neutrinos from the opposite side of the earth,
indicates a clear deficit, while the event rate at 9 — 0 is quite consistent with the
prediction. This may be attributed to the oscillation of atmospheric neutrinos v^,
which traverse long way toward the Super-Kamiokande detector. The remarkable
decrease of the capture rate at 9 = | shows the wave length of the oscillation
is of the order a few hundred kilometers. This is why the terrestrial test of the
atmospheric neutrino oscillation at the long-baseline experiment, such as K2K, is
possible.
142 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
e_+7Be -> 7
Li + i/e, Ev = 0.86(MeV),
8 8
B -> Be* + e+ + i/e, < Ev >= 7.2{MeV),
: , (6.155)
where three reactions, which mainly contribute to the event rates at the on-going
solar neutrino experiments, have been shown, together with the average energies <
E > (a fixed energy for the second reaction concerning Be) of the emitted neutrinos.
Actually, the second and the third reactions actually belong to different branches
of the chain reactions. The neutrinos emitted by these three processes are called
pp, Be and B neutrinos, respectively. The pp neutrino dominants the solar neutrino
flux, though its average energy is relatively low and its detection is not easy, while
Be and B neutrinos have higher energies and are detectable even in the experiments
with relatively higher threshold energies, such as Super-Kamiokande, Homestake
and SNO experiments, in the mines in Japan, U.S. and Canada, respectively .
The chain reactions are summarized in the following net reaction of the nuclear
fusion
Though of course the nuclear fusion itself is due to the strong interaction, the
presence of the weak interaction causes the emission of the solar neutrinos ue.
The experiment for the detection of solar neutrinos, solar neutrino experiment,
started by the pioneering work by R. Davis and collaborators, at Homestake mine,
the "CI experiment" by use of C2CI4. Now the data on the solar neutrino capture
rates are available from four different types of solar neutrino experiments:
1. CI experiment:
i / e + 3 7 C l -> 37
Ar + e~, obs./exp. = 0.34 ± 0.03,
(mainly) sensitive to B, Be neutrinos (Eth = 0.81MeV)
2. Super-Kamiokande experiment:
v + e~ -» v + e~, obs./exp. = 0.47 ±0.02,
Atmospheric and Solar Neutrino Oscillations 143
10 • 1
sin2(26)
Fig. 6.9 The allowed area in the parameter space of (sin 2 (20i 2 ), A m f 2 ) , where 6\2 and Amf 2 are
simply written as 0 and A m 2 , imposed by the data of solar neutrino experiments. The hatched area
is the allowed area at 95% C.L. obtained by the combined analysis on the neutrino flux observed
in the Ga, CI and Super-Kamiokande experiments. The area of the triangular shape, roughly
speaking, is the exclusion area due to the day/night spectrum analysis by Super-Kamiokande.
From the result of (Super-)Kamiokande collaboration, (Super-Kamiokande collaboration, 2001).
Let us note the SNO neutral current result obs./exp. = 1.01 ± 0.13 in (6.157)
is consistent with the neutrino oscillation (6.149), since the i/M and vT are active
states and contribute to the capture rates of solar neutrinos, just as ve does. Namely
only in the SNO neutral current experiment, the capture rate is expected to be the
same as the prediction of the standard solar model, which is exactly what has been
confirmed by the experiment.
When we discuss the deficit of the solar neutrinos, of course we have to know
the prediction of the standard solar model. There exist, however, a way to confirm
the neutrino oscillation, irrespectively of the solar model, by use of the combined
results of Super-Kamiokande (Super-Kamiokande collaboration, 2001) and SNO
Atmospheric and Solar Neutrino Oscillations 145
charged current reaction (SNO collaboration, 2001)! This should be a direct and
convincing proof of the solar neutrino oscillation. The clue is in the fact that in
the elastic scattering of solar neutrinos off electron, not only ue but also v^ and
uT contribute to the event rate, though the ratio Ra of scattering cross section of
v^ and vT to that of ve is roughly Ra = 1/6 to 1/7, because of the presence of
the additional charged current process Fig. 6.4(b) in the case of the ve scattering.
On the other hand, in the SNO charged current reaction, only ve can contribute to
the process, as the charged current processes of other neutrinos require higher than
available neutrino energies. Thus if the B neutrino flux predicted by the standard
solar model is denoted by <£ggM, the "effective" fluxes at these experiments, denoted
by $gj{ and 0gj^Q (CC) should be given as
where S is the survival probability (6.141). (For illustrative purpose, we have ig-
nored the dependence of S and Ra on the solar neutrino energy.) It is quite impor-
tant to note that the observed value </>gj^ ~ 2.4 x 106 (/cm2/s) is certainly larger
than </>gjsfo(CC) — 1-8 x 106 (/cm2/s). This means S < 1 and that among the
solar neutrinos there surely exist the active neutrinos v^ and vT, which strongly
suggests the solar neutrino oscillation ve —> i/M, uT. (It also implies that the so-
lar neutrino oscillation into a sterile state (states without weak interaction) is not
favored.) Furthermore, equating (6.160) and (6.161) with the observed values, we
get 0 g g M = 5.4 x 106 (/cm2/a) and S = 0.33, for Ra = 1/6. The obtained <I>SSM
is quite consistent with the prediction of Bahcall et al., (5 ± 1) x 106 (/cm2/s).
The most recent remarkable experimental achievement is the confirmation of
the solar neutrino oscillation in quite different circumstance, i.e. the terrestrial
oscillation experiment of ue produced by nuclear reactors in Japan, i.e. KamLAND.
As we have seen in the subsection 6.5.1, the experimental setting is such that the
oscillation is due to the smaller mass-squared difference A m ^ and 9\2, just as
in the solar neutrino oscillation, though the matter effect may be safely ignored,
even though not negligible. Thus the formulae (6.129) and (6.130), similar to the
one in the solar neutrino oscillation (6.141), gives a prediction on the deficit of ve
at the detector. The predicted deficit rate for the MSW-LMA solution, (6.158)
and (6.159), is quite consistent with the observation, which started very recently
(KamLAND collaboration, 2003).
Problems
6.1 Show that the relation (6.17) holds, to verify that a free Weyl fermion and a
free Majorana fermion are equivalent, as far as they are massless.
146 NEUTRINO MASSES AND NEUTRINO OSCILLATIONS
6.3 Show that the coefficient of the (irrelevant) operator (6.48) provides a Majorana
mass for VL when the Higgs doublet 4> develops its vacuum expectation value.
6.5 Verify that the adiabaticity condition (6.101) at the resonance point is equiva-
lent to the condition (6.103).
Supersymmetry (SUSY for short) is a symmetry which connects particles and fields
with different spin-statistics, i.e. bosons and fermions. When a Lagrangian density
is invariant under a properly chosen transformation which connects particles whose
spins differ by 1/2, the theory is said to be supersymmetric. The supersymmetry
is independent of any internal symmetry such as gauge symmetry, and therefore
connects a pair of particles, "superpartners" of each other, with different spins
but the same quantum numbers such as electric charge, weak isospin, color etc.
As we will see later the generator of the supersymmetry transformation is known
to commute with the generator of space-time translation PM (4-momentum), and
therefore with P%. We thus learn that the superpartners have the same mass.
If we wish to apply the supersymmetry to the description of quarks and leptons,
the supersymmetry clearly should be broken spontaneously or explicitly, as we have
not seen any spin = 0 scalar particles with the same charge and the mass as those of
quarks and leptons. Though it is not hard to break supersymmetry spontaneously, it
easily happens that some of superpartners of light quarks or leptons are lighter than
these particles, thus contradicting with the reality. In the Lagrangian, to put by
hand the terms which break supersymmetry explicitly, such as the additional mass-
squared term for the scalar electron, is also possible and an easy way. However,
without any guiding principle, such arbitrarily introduced explicit SUSY breaking
terms easily lead to an unacceptable prediction, such as too large flavor changing
neutral currents. The detailed discussion on the mechanism of SUSY breaking is
beyond the scope of this textbook, though we briefly discuss the mechanism of
spontaneous SUSY breaking in a later section, and we just consider here how large
the SUSY breaking mass scale can be. The scale will be fixed so that the SUSY
works as a solution of the so-called hierarchy problem.
What is the hierarchy problem? Though it has been perfectly explaining all
phenomena (except the recently reported neutrino oscillation), the standard model
of elementary particles is widely believed not to be the final theory; in particular,
147
148 SUPERSYMMETRY
H
y ~ ^
H
* '
--^ H H (
r\ J H
(a) (b)
Fig. 7.1
it has too many free parameters and the gauge couplings are not really unified as
the gauge group is not a simple group. We thus expect that the standard model is
valid up to some energy (mass) scale A (a physical cutoff of the standard model),
and at A the theory is replaced by more fundamental new theory incorporating the
standard model, i.e. "beyond the standard model" or "new physics". The standard
model is thus regarded as the "low" energy effective theory with the cutoff A.
The cutoff A is MQUT if the new physics is Grand Unified Theories (GUT), with
e.g. MQUT ~ 1015GeV for SU(5) GUT, and Mpi ~ 1019GeV (Planck mass) if
the new physics is a unified theory with gravity. These mass scales, MGUT, MPI
are much higher than the weak scale Mw, the typical mass scale of the standard
model. The hierarchy problem is the problem of how to maintain the hierarchy
of these mass scales, i.e. Mw C MGUT,MPI. The problem is the most serious in
the sector of scalar particle such as the Higgs particle, as the Higgs mass-squared
get a huge quantum correction proportional to MGUT or Ml\. This large quantum
correction manifest itself as a "quadratic divergence" A2 if we calculate the quantum
correction to the Higgs mass-squared in the effective theory, i.e. in the standard
model: the problem of "quadratic divergence" (See Fig. 7.1(a)). Thus it is hard to
maintain the hierarchy naturally. Actually, it is possible to adjust the bare mass
of the Higgs, so that the renormalized Higgs mass remain to be the weak scale. It,
however, will require a fine tuning of the bare parameter at the precision roughly
of (MW/MGUT)2 ~ 10 - 2 6 , for instance, which we regard as unnatural.
Supersymmetry had been a subject of the theoretical research for rather long
time, but its application to the particle physics was strongly motivated by the desire
to solve the hierarchy problem. In fact, supersymmetric SU(5) GUT was proposed
by Sakai and Dimopoulos-Georgi (Sakai, 1981; Dimopoulos-Georgi, 1981) based on
the motivation. Let us note that in the supersymmetric theory coupling constants
of a particle and its superpartners are identical, and also that the Feynman rule
provides an additional negative sign for the diagram with a loop of fermion. Thus
the quadratic divergences from the two diagrams Fig. 7.1(a) and (b) are known
to cancel with each other as long as the supersymmetry is exact. In this way, the
Two-component Notation 149
T - U 0)•
ff" = (-/,0-i), (7.2)
d* = (-/,-o-i), (7.3)
150 SUPERSYMMETRY
75= i / y 7 V = ^ ^y ( 7.4)
where I is a 2 x 2 unit matrix and Oi (i = 1,2,3) are Pauli matrices. Then the Weyl
fermions are described by 2-component complex spinors as we expected:
V
</>* = ( ° ) , (7.5)
which clearly means that the chirality is preserved under the Lorentz transforma-
tion. It will be necessary to comment on the raising and lowering of the 2-component
spinor indices a or a and the meaning of the dot. We first note that the right-handed
Weyl spinor, t]a, is transformed by a Lorentz transformation as r)'a = Ma ^rip, where
the 2 x 2 matrix
M = expos'"'). (7-9)
with 6 transformation parameters eM„ (e„M = —e^). It should be noted that the M
is an element of SL(2,C). In fact, since the generators £M„ are traceless detM =
1 and also the 6 real degrees of freedom of eM„ just coincide with the degree of
SL(2,C), 2 x 2 2 — 2. We can easily show that in SL(2,C) an invariant inner-
product of two fundamental representations, i.e. doublets r\a and Xa, is given by
eal3T]aX0 (e a/3 : 2x2 Levi-Civita tensor, e12 = - e 2 1 = 1), by use of the property
detM = 1. Thus the invariant product can be written as
Xa = e Q / V (7.11)
In other words, the Levi-Civita tensor ea® behaves as the metric tensor in the 2-
component spinor space. Hereafter we will use the abbreviation, r\x = i]aX<x =
-VaXa = XaVa = Xn,WX = VaXa = Xn, for convenience.
Noting an identity W'"' = —ea^v*e, we find that the matrix of Lorentz transfor-
mation for a left-handed Weyl spinor should have the following indices,
where the dot of the index denotes the complex conjugation of the quantity. To
be consistent, the left-handed 2-component spinor should have the following upper
dotted index
1pL=[ -a • (7.13)
We may also understand the origin of the dot as the result of charge-conjugation.
Namely, the charge-conjugation interchanges the chirality of Weyl spinors, e.g.
^-^•(IPR)0 = 0, while the charge conjugation always has the action of complex
conjugation. Thus, when the right-handed spinor has an undotted index the left-
handed spinor should have a dotted index. More explicitly, the charge-conjugation
of a Dirac-spinor ipr> = ,4>R + ipL is given as,
(i[>D)c = C^>D=i12xP*D=il
(7.14)
where rj^ = (r/a)* etc. This clearly shows that when £ = 77 the spinor
*M = ( 1% ) , (7-15)
V"
is self-conjugate under the charge-conjugation,
This specific "real" spinor is called Majorana spinor, and the particle described by
the spinor is called Majorana particle.
Once the assignment of the spinor indices of Weyl spinors is fixed, the assignment
of the indices of various 2x2 matrices are determined as,
0
(*")««, ( r ' T V O a , ( ^ r ^etc. (7.17)
The Lagrangians of renormalizable theories have bi-linear terms of spinors. Weyl
spinors ipR and tpi behave as 2 and 2 representations of SL(2,C), or equivalently
( | , 0 ) and (0, | ) representations of the Lorentz group, where \ and 0 are the "spins"
of two 5J7(2)-like groups, contained in the Lorentz group. We then ask how bi-linear
forms of Weyl fermions behave under the Lorentz group. The bi-linear form of two
Weyl spinors with the same chirality decomposes into (1,0) + (1,0) or (0,1) + (1,0)
representations, as is known from the recombination of the spin: | x | = 0 + 1.
More concretely, the singlet (0,0) and the triplet (1,0) representations formed by
the product of two, e.g., right-handed spinors, 771 and 772, are written as
The reader can easily check that {aliv)a^ is symmetric under the exchange of the
spinor indices a and /?, the property necessary for the triplet. This statement,
however, is a little puzzling, as the number of possible combinations of fi and v in
the a11" is 6, instead of 3. Actually, we find that not all of <r'"/ are independent; for
instance a01 and a23 are essentially identical matrices. (In fact, if our space-time
were Euclidean, this tensor satisfies the self-dual condition a^v = |C'" / K A (T K A , while
a^v satisfies the anti-self-dual condition.) Similarly, the product of two left-handed
spinors decomposes into (0,1) + (0,0) by the contractions with (^1/)&a and e d/ j. In
the four component-notation, where the right-handed spinors are written as ipi =
(f?ia,0)' and ip2 = (^2ai0)', the singlet and the triplet discussed above correspond
to the following bi-linear forms, ip\Cip2 a n d iplCE^^- Then what corresponds to
the Lorentz vector V ' I T ^ ^ ? (Note that V'IC'T/IV^ identically vanishes.) This time
the bi-linear form is made by the mixture of the dotted and undotted indices, i.e.
~Via {v*i)0"*rl2cc Namely, ( | , 0 ) x (0, | ) = ( | , | ) behaves as a Lorentz vector with
4 components.
We display in the Problem 7.1 some of formulae in the 2-component notation,
which are frequently used and useful in the discussion below.
space-time translation,
Lorentz transformation,
gauge transformation,
where Q etc. and X etc. are, say, fermionic and bosonic generators. Since SUSY
transformation connects fields whose spins differ by 1/2, the generators of SUSY
SUSY Algebra and SUSY Group 153
naturally carry spin 1/2, and therefore satisfy the anti-commutation relation due
to the Fermi-statistics, just as Q etc. do in (7.19).
Through such rigorous argument, it turns out that the SUSY algebra is given
as
[P»,P»] = 0, (7-23)
where Qa and Qa are complex conjugate, or hermitian conjugate in the sense of
operators, of each other, and are Grassmannian operators, as is seen typically in
(7.21). One may wonder why the anti-commutator in (7.21) should vanish, while
that in (7.20) is non-vanishing. Just as the product of two doublets \ x | decomposes
into a triplet and a singlet 1 + 0 in the calculus of spin, if {Qa, Qp} were non-
vanishing, it generally should have contained two parts C^a^ and C[a,0], i-e. the
parts symmetric and anti-symmetric under the exchange of a and /?, belonging
to (1,0) and (0,0) representations of the Lorentz group, respectively. The anti-
symmetric part, however, trivially vanishes, since {Qa, Qp} is symmetric under
the exchange. Let us note that this situation changes in the case of extended SUSY
with N (N > 1) kinds of SUSY transformations generated by Qla (and Q&ti),
where a "central charge" term of the form eap c ^ l , symmetric under the exchange
(a,i) «-» (/?, j), is allowed to exist. In the case of simple SUSY N = 1, howver,
the only possibility is the symmetric piece C{a<^, proportional to (<r'"/)a^ MM„.
A rigorous argument leads to the conclusion that this term is not allowed to exist,
being consistent with the SUSY algebra [PM,<5a] = 0.
We thus know that the commutator of SUSY transformations provides a space-
time translation, i.e.
Pft,Q and Q are the elements of SUSY algebra and ^,€,1 are the corresponding
transformation parameters. We need representations of SUSY algebra, on which
the transformations of SUSY group should act, so that we can assign our quarks
and leptons and gauge bosons to some of the representations. In the SU(2) gauge
theory, for instance, the representations are concretely expressed as multiplets, such
as doublet, triplet, etc., on which the group elements act in the form of matrices.
Similarly, the representations of SUSY group can be expressed as "SUSY multi-
plets". As the SUSY transformation connects fields whose spins differ by 1/2, it
is expected that each multiplet is made of "super-partners" of each other, such as
a (spin 0) scalar and a (spin 1/2) fermion. SUSY generators Q and Q commute
with p2. Thus all members of a SUSY multiplet on their mass-shell should have a
unique mass, and the SUSY multiplet can be classified by the mass M {p2 = M2).
Let us first consider massless (M = 0) SUSY multiplets. The massless multiplets
we discuss in this book are
where s denotes the spin. A is a complex scalar field and A should be a Majorana
fermion, corresponding to the real vector field VM. The chirality of the Weyl fermion
of the chiral multiplet can be either L or R. We will be able to assign our quarks
and leptons and Higgs to the component fields of the chiral multiplets, together with
their super-partners. Gauge bosons, such as photon, are assigned to V^ together
with their superpartners, such as "photino". The important feature of the SUSY
multiplets is that the (real) degrees of freedom of boson and fermion fields just
coincide. This, however, seems to be not the case in the above multiplets; for
instance, in the chiral multiplet (A, IPL,R), the complex scalar clearly has 2 degrees,
while the Weyl fermion ipL seems to have 4 real degrees of freedom, at the first sight.
This superficial puzzle may be resolved if we realize that the on-shell fermionic state
has just the half degrees of freedom, due to the equation of motion. For instance a
massless Weyl fermion ipR = (r)a, 0)* with 4-momentum pM = (p, 0,0, p) satisfies a
Dirac (Weyl) equation in the momentum space
We have formulated SUSY algebra and its representation, SUSY multiplets. The
next task is to find the rule of SUSY transformation among the component fields
of a SUSY multiplet, which is non-trivial. A powerful technique for such purpose
has been invented, which makes the formulation of SUSY transformation and the
construction of supersymmetric theories, i.e. the theories invariant under the SUSY
transformation, almost automatic.
The formulation is based on the notion of "superfields", which are the fields
defined on "superspace". Before going into the detail, let us recall that the gen-
erator of space-time translation, p M , acts on an ordinary field </>(xM) as a differ-
ential operator pM</>(a;M) = i-g^^x*1). Since (SUSY)2 ~ space-time translation,
roughly speaking, and a Lorentz vector can be expressed as a bi-linear form of a
spinor, eM = ea(a^)aaea as suggested by (7.24), it will be natural to expect that
the generators of SUSY transformations Q and Q act as a sort of translations
in a "fermionic space", with anti-commuting Grassmannian coordinates 9a, 6a
({6a,6p} = {0a,6a} = {8a,6a} = 0). The space with orninary space-time co-
ordinate and the Grassmannian coordinates, (x^, 0a, 6a), is called superspace, and
a field defined on the superspace is called superfield. Actually it turns out that the
156 SUPERSYMMETRY
( Ci 0
0 1 0 ,
-si \
RaWi) =
( Ci
-St a 0
Si
,
0 \
(7.28)
Si 0 ct J \ 0 0 1 /
where Cj = cosOi and Sj = sin#;. In fact,
sinflcos./. \ / 0 \ / 0
sin0sin</> \ = R3(-<t>)R2(-9)R3(a) I 0 = R3(-(j>)R2(-9) I 0 | , (7.29)
cos0 / V1/ V1
where we find that the 50(2) rotation with angle a is irrelevant. Let Q(6', <j)') be a
point on the 5 2 , which is obtained by the action of a 50(3) rotation G(K, A, /?) on
the point P(9,cf>). Then how can we express the transformation of space coordinates
{6, <f>) -> (0',<j>') by use of K, A, /?? We note
sin#'cos</>' \ / 0 \ / sin^cos(/>
sinfl'sin^' = G(9',4>',j) 0 =G{K,\,0) sin^sin./.
cosfl' / \ 1 / V cos6>
a coordinate (a;",0,0). By use of the SUSY algebra (7.20)-(7.23) and the well-
known relation eAeB = eA+B+^[A,B\+--- w e g e t ( s i n c e higher commutators do not
contribute)
G(0,e,e) • G(x",9,6) = G(x^ + iOa^e - ie(7"9,9 + e,e + e). (7.31)
where 0<rMe = 9a(a^)a^e6', etc. Thus we can conclude that the action of the SUSY
transformation G(0, e, e) on a superfield is equivalent to the application of e^eC3+£<3'
with the following differential operators, corresponding to the (i times) SUSY gen-
erators,
SC = ex + ex, (7.38)
SXa = 2e a M + K ) a d e d ( y M + i a M C ) , (7.39)
Stf* = 2^N + ea{a^)a&{Vtl-idtlC), (7.40)
% a
SD = -{^)adl{e d,x - {d^rrn- (7.41)
We learn from the above exercise the following things, which play quite impor-
tant roles when we attempt to construct a supersymmetric theory, i.e. a theory
invariant under the SUSY transformation.
stance, if we assign the Lorentz vector field V^ the mass dimension d = 1, the D field
has d — 2. As the quadratic term of D(x) already has d = 4, we cannot include any
space-time derivative in the free Lagrangian of the D field, thus making the D an
auxiliary field, i.e. a field without dynamical degree of freedom and is describable
in terms of other physical fields, once the equation of motion for the field is used.
As the D field has the highest mass dimension, the infinitesimal deviation of
the D(x) under the SUSY transformation, is inevitably the space-time deriva-
tive of the field of lower mass dimension. In fact 6D in (7.41) can be rewritten
as a total derivative, noting that e and e are space-time independent constant:
SD = dll{%ea(atl)aa\a - %i>a(&^)aa£d'}- This means that if the Lagrangian of a
theory can be regarded as the D-trem of some superfield, the action is supersymmet-
ric, since a total derivative term of the Lagrangian is irrelevant. (If space-time has
some compact dimension, as usually assumed in higher dimensional theories, the
derivative term may not be ignored for non-trivial boundary conditions of fields.) It
is suggestive to note that a similar thing happens when a theory is invariant under
a space-time translation, i.e. also in this case the Lagrangian itself is not invariant
under the translation, but the deviation can be written as a total derivative, thus
making the action invariant.
(c) T h e p r o p e r t y of closure
To find out the multiplication rule of SUSY multiplets, namely a rule to re-
construct a new SUSY multiplet out of the product of two SUSY multiplets, is
a non-trivial task. In the language of the superfield, however, the multiplication
rule is quite simple, i.e. we can just multiply two superfields. If fa and fa are two
superfields, their product fa(x,8,8) = fa(x,8,8)-fa(x,8,8) can be Taylor expanded
again, and therefore behaves as a new superfield ("closure" property of superfields
under the multiplication). Each component field of fa is written as a bi-linear term
of the component fields of fa and fa, which provides us the multiplication rule.
This is essentially due to the fact that the SUSY generator is a linear differential
operator and the chain rule i(eQ + eQ)fa = {i(eQ + eQ)fa} -fa + fa • {i(eQ + eQ)fa}
holds.
By use of these properties, we can easily construct (some part of) the action of
a supersymmetric theory,
S = Jdixf(fa,fa,.--)\D, (7.42)
where f(fa,fa, • • -)\D denotes the D-term (the highest component) of the polyno-
mial function / of superfields, fa, fa, • • •, which is again a superfield. The infinites-
imal deviation of the D-term is a total derivative and the action is SUSY invariant.
Since SUSY transformation is a sort of translation in the superspace, it is naturally
expected that the integral of the Lagrangian over the entire superspace yields a
SUSY invariant action. In fact, we find that the action can be equivalently written
160 S UPERS YMMETR Y
as
Let us note that the integration over Grassmannian variables is equivalent to the
differentiation in terms of these variables. Thus d26 d?8 is equivalent to extracting
the D-term.
D
° = ^ + <KUfld5(.. (7-44)
a
D& = -^y-*0 (*")««» 0M- (7.45)
Let us note that these "SUSY-covariant" derivatives Da and D& are quite similar
to iQa and iQa in (7.33) and (7.34), except the sign in front of the space-time
derivative. This is not an accident. Let us recall that the differential operator Q
and Q were read off from the relation, G(0,e,e) • G{x'i,d^) = G(x^ + ida^e -
iea^VjO + e,8 + e), where we identified the "left multiplication" of G(0,e,e) as
an action of the element of SUSY group. What happens if the multiplication of
Chiral superfield 161
G(0, e, e) is from the right? The only change will be that the sign of the space-time
translation, proportional to e or e, is just opposite to the case of left-multiplication,
i.e. -iOa^e + iea^O. Thus we find that the operators D and D can be regarded
as the differential operators corresponding to the right-multiplication. It is now
trivial that they commute with the SUSY generators, just because the right- and
left-multiplications are mutually independent.
Now, for instance, a right-handed chiral superfield <> / can be defined as
A i <f> = 0. (7.47)
where D = 9M9M and we have used the property that the power series expansion
of e'(e<r''9)di' ends up at {{0ali0)dll}2, because of the anti-commuting property of 8
and 0. In the derivation, some relations listed in Problem 7.1 may be useful. The
component field F has appeared in addition to the expected scalar and fermion
fields A and tp. The field F has mass dimension d = 2, and is anticipated to be an
auxiliary field, just as the D field in a general superfield. Let us note that in the
third line of the right hand side of (7.48), derivative terms have appeared though
there is no derivative of the chiral superfield at the beginning. This is due to the
"non-locality" of the coordinate yM, which has a deviation i(0a^0) from xM, and will
play an important role in the construction of the kinetic term for the component
fields.
In terms of the coordinates ?/M, 8a, the infinitesimal SUSY transformation reads
as
where <9M = d/dy^. It is now easy to derive the transformation property of each
162 SUPERSYMMETRY
component field:
SA = V2eip, (7.50)
Stp = y/Hio^edpA + V2eF, (7.51)
SF = -V^iea^d^ip. (7.52)
We realize that again the deviation of the "highest" component field F can be
written as a total derivative (as e is ^-independent), which becomes important in
the construction of supersymmetric theory of chiral superfields.
In order to see the multiplication rule for chiral superfields, it is suggestive
to recall that multiplication of holomorphic functions f(z) and g(z) gives another
holomorphic function h(z) — f(z)g(z), while the multiplication of holomorphic and
anti-holomorphic functions is no longer holomorphic nor anti-holomorphic, but is a
general complex function. A complete analogy holds in the multiplication rule for
chiral and anti-chiral superfields, i.e.,
Thanks to the multiplication rule, especially the closure property among chiral
or anti-chiral superfields, it is quite easy to form a SUSY invariant monomials of
such superfields,
where F denotes the extraction of the F-term of the chiral superfield obtained by the
multiplication of the chiral superfields <j>\{y,6) etc., which is a total derivative with
respect to yM. This procedure will be used in the formation of the "superpotential"
which is discussed below. In (7.54), we may finally replace yM by the ordinary
space-time coordinate a;M.
So called 4>4 theory is often discussed as a prototype theory of quantum field theory
for the purpose of getting some essential ingredients of the quantum field theory. In
this section, we discuss the supersymmetric extension of the </>4 theory, a supersym-
metric model of chiral multiplet (Ai, ipi) (i = 1,2, • • •), called Wess-Zumino model.
As the new feature of the Wess-Zumino model, we have not only self-interaction of
scalar fields but also Yukawa-type couplings between scalars and fermions. As the
consequence of the SUSY, the coupling constants of such interactions are identical,
and there should be mass degeneracy between the scalars and the fermions, unless
Wess-Zumino Model 163
• The superpotential
The aforementioned kinetic term is a free Lagrangian and does not contain any
interactions. Let us now consider the self interaction of chiral superfields. Just
as the self-interaction of scalar fields Ai are described by scalar potential V(A),
supersymmetric "self-" interaction is generally describable in terms of a polynomial
of chiral superfield W((j>), called superpotential:
^From the closure property under the multiplication, the superpotential itself should
be a chiral superfield. Thus / d4x W((J))\F (after the replacement of j / M -> :rM) should
be SUSY invariant. Let the mass dimension of W{4>) be dw • Then its F-term should
164 SUPERSYMMETRY
9 ( A , A n )
A1 • • • An^Fn + A1---An^Fn^An + ••• = ' ^ Ft. (7.58)
The right hand side means that such term can be compactly expressed in terms of
the first derivative of the monomial, obtained by replacing all chiral superfields by
their scalar components. The next possibility is to take fermions fron two different
places of the monomial and take scalar fields from all remaining chiral superfields:
-A1---W-i1>na-A1---rft_2An-1il>na--- =-\d{AQ^A'.An)^^i- ( 7 - 59 )
As this compact way of writing is easily known to be valid for arbitrary monomials
and because differential operators are linear operators, this procedure is applicable
for an arbitrary superpotential. Hence, the F-term of the superpotential can be
written as
wm =
* -BAT Fi' 2dJM; * *'• (7 60)
-
The whole Lagrangian now reads
C = Ckin-{W{<t>)\F + h.C.}
= (dftADiWAi) - i ^ d ^ i + F*F
fdW(A) „ ld2W(A) , , , . ._„,
The auxiliary field does not possess an independent dynamical freedom by itself,
and is expected to be eliminated from the Lagrangian, if we wish. In fact, by use
of Euler-Lagrange equation for Fi, Fi can be expressed in terms of scalar fields,
as there is no derivative term of the auxiliary field in the Lagrangian. For such
purpose we first write down the terms containing Fi fields in £,
Wess-Zumino Model 165
By use of the Euler-Lagrange equation, the quantities inside the bracket just Dis-
appear:
VIA) - I ^ f . (7.64)
In this way the Lagrangian of the Wess-Zumino model can be rewritten in terms
of only physical fields as
c = (aMA:)(aMi)-i^ff"aMVi
\1fd2W{A)
d2W{A) , , u .
2^8 Aid Aj
,dW{A).2
(7.65)
dAi
The first line at the right hand side is ordinary kinetic term and the second and the
third lines denote Yukawa coupling and scalar potential, respectively.
£ = (dltA*)(&'A)-ili&tdllil>
+ [{^rn - gA) i/>2 + h.c]
- \mA-gA2\2. (7.66)
Thus we know that the complex scalar A has a mass m and the fermion ip has
the same Majorana mass m. Thus the degeneracy of the masses of boson and
fermion is realized, as an important consequence of SUSY. We further note that the
Yukawa coupling Aip2 and the scalar self-interaction (A* A)2 have identical coupling
constant g. This is another important consequence of SUSY. One remark here is
that the scalar self-coupling is proportional to g2, not g as in the Yukawa coupling,
and SUSY seems to be not manifest. One may convince oneself that SUSY is really
maintained, by noting that the self-interaction is originally due to the coupling
between scalar A and auxiliary field F, 2gFA2 + h.c, whose coupling constant is
proportional to g, thus preserving SUSY (see Fig. 7.2).
As was mentioned in the beginning of this Chapter, SUSY has a remarkable
feature to stabilize the scalar mass under the quantum correction, i.e. the problem of
166 SUPERSYMMETRY
A ' \
A -+-4-2ig -2igf-+~- A
Fig. 7.2
quadratic divergence is absent in SUSY theories. Let us now explicitly show that the
cancellation of the quadratic divergence does happen in the Wess-Zumino model.
What we calculate is the Feynman diagrams (a) and (b) shown in Fig. 7.2. In order
to calculate the diagram with V'-exchange, we need a propagator for the fermion ip.
Since we are familiar with the propagator for 4-componet spinor, and as ip has a
Majorana mass term ^mip2 + h.c., let us define a Majorana fermion tpM — ip + {ip)G,
with tp being understood as a 4-vector (if}a,oy, whose free Lagrangian is given by
\ V ' M ( * ^ 7 ' ' — m ) V'M- In terms oiipM, the relevant Yukawa coupling can be written
as —gAip2 + h.c. = gA^M^^-^M + h.c. According to the discussion in Chapter 6,
the propagator of Majorana particle ipM is just the same as the case of ordinary
Dirac fermion, except that not only IPM^PM DU^ a l s o V'MV'M tyPe propagator exist,
due to the fact (II>M)C = V'M (see (6.40) and (6.41)). Paying attention to this
specific feature of the Majorana particle, we may calculate Fig. 7.2(b) according to
the ordinary Feynman rule. We can directly check that there is a cancellation of the
quadratic divergence between the diagrams (a) and (b). Actually, we find that for
m = 0 the sum of these two diagrams identically vanish. We may also check that
when SUSY is broken (spontaneously or explicitly) and only A gets a mass MSUSY ,
denoting the extent of the SUSY breaking, there remains a logarithmic divergence
in the sum of these diagrams, i.e. (for m = 0) oc MgUSY log A. This is why MSUSY
should not exceed 1 (TeV) or so, since otherwise the hierarchy problem will arise
again. Let us note that another type of diagram coming from the contraction of
two identical vertices, e.g. of the type of AIJJM^[L'IPM, does not yield the quadratic
divergence, even for m ^ 0, and therefore has been ignored in the discussion above.
Vector superfield 167
Though the general superfield, discussed in section 7.4, is a complex field in general,
we may get an irreducible representation V of SUSY by imposing a hermiticity
condition
V f = V, (7.67)
l %
V = C + iex-iO~X+ -06(M + iN) - -M{M - iN)
It is now easy to read off the SUSY gauge transformation of each component field
C^C = C + i(A-A*), (7.71)
D ^ D' = D. (7.76)
We realize that, because of the specific way of writing in (7.68), the gauge fermion
A and the auxiliary field D are gauge invariant: (7.75) and (7.76). This property
is desirable, since the gauge fermion, e.g., belongs adjoint representation, Which
is gauge invariant in the Abelian gauge theory. On the other hand, by use of the
transformation property (7.71) - (7.73) , one immediately know that by suitable
choice of the parameters ImA, ip, F, the lower component fields C, x, M, N are
readily gauged away. Thus, without loss of generality, we may assume the following
form of the vector superfield, called Wess-Zumino gauge:
t°=£- V. (7.80)
This is not a correct way of doing, since the differential operators J= etc. do not
commute with SUSY algebra, and some modification is needed:
Wa = -^D~DDaV. (7.81)
Dp Wa = ~D0~DDDaV = 0, (7.82)
where we have used the property D„DD = 0 as the result of {Da, Dp} = 0. Another
important feature of Wa is that it is gauge invariant:
v = -ea^evli(y) + i(e6ex(y)~ee9\(y))
+ ^eeM(D(y)+idflV'i(y)). (7.84)
Da = • ^ + 2i((jnaaOAdfi, (7.85)
Wa = -iXa(y) + [6a0D(y)-l-(a'iana0(dliVv(y)-dvVli(y))}e0
+ ee^aad^iy). (7.87)
As we expected the lowest component is the gauge fermion A and all of A, D and
the field strength F^„ = d^Vv — dvV^ are gauge invariant.
170 S UPERS YMMETR Y
The kinetic term for gauge boson V^ is of ordinary type. The kinetic term for the
gauge fermion A may be understood as that for a Weyl fermion. Since the gauge
fields should be "real", it may be natural to expect that gauge fermion is described
by a Majorana fermion AM = (A a , A a )'. In fact, the kinetic term can be re-written
as | A M ^ A M , up to a total derivative. (A Weyl and a Majorana fermions are iden-
tical, as far as the fermions are massless.) As we expected the "kinetic term" for the
D field does not have any derivative and D should be understood as an auxiliary
field.
When the U(l) transformation is made local, allowing A to depend on yM, the
superpotential clearly remains invariant. The kinetic term for fa, however, is no
longer invariant,
In order to make it gauge invariant we now introduce the vector superfield V to com-
pensate the factor e-2teQi(A~A )5 j , e . the kinetic term is now replaced by "covariant
derivative" term,
4>U2eQiVfa, (7.92)
where V should transform under the local gauge transformation inhomogeneously,
The terms in the second line of (7.94) are the new types of gauge interactions due
to the presence of gaugino A, A and auxiliary field D, whose coupling constants are
identical, i.e. e, because of the SUSY.
As the immediate application of the SUSY U(l) gauge theory discussed above, let
us consider the supersymmetric extention of QED, namely SUSY QED.
The matter field, electron, should be identified with the fermionic components
of two (right-handed) chiral superfields,
W = m<f>-cp+. (7.97)
Combining with the SUSY invariant and gauge invariant kinetic terms for vector
and chiral superfields, we get the full Lagrangian for the SUSY QED
(b) (c)
Fig. 7.3
+ (D^eRr(D^eRH(D^eir(D^el)
- i^a»D^- ie+a'D^ + |FflJ^+ |F L | 2
+ V2ie(eRXeR - /i.e.) - \f2ie(e*j\eR~ - h.c.) - eZ?(|e fl | 2 \eL\2)
+ m ( e ^ e j + /i.e.) - m(eRF£ + e*LFR + h.c). (7.99)
So far spinors for electron and positron have been all written down by use of the
2-component notation. When we calculate Feynman diagrams, however, it will be
more convenient to utilize the familiar 4-component notation, so that we can use
the ordinary form for the spinor propagators. Thus we define 4-component spinors,
Dirac spinor e denoting the electron, and a Majorana spinor AM for the photino:
c
Ra (7.100)
AM = (7.101)
A*
As we clearly see in the Feynman rule shown in Fig. 7.3 the coupling constants of
the photon, photino and the auxiliary fields are the same, i.e. e.
After eliminating the auxiliary fields FRIL and D, the scalar potential gets con-
tributions from both of superpotential and gauge interaction. To see this explicitly,
let us write down the terms including the auxiliary fields in the Lagrangian
FR = meL, FL = meR,
2
D = e(\eR\ - \eL\2), (7.104)
= m 2 (|e f l | 2 + |e L | 2 ) + i e 2 ( | e f l | 2 - | e L | 2 ) 2 . (7.105)
We easily find that the vacuum state of this theory is achieved by vacuum expecta-
tion values (eR) = (e/,) = 0. The corresponding vacuum energy Ev — (0\H|0) also
vanishes, and it indicates that SUSY is not spontaneously broken, as we will see in
the following section. The term m2(\eR\2 + \&L\2) in the potential V thus provides
scalar mass-squared m2. This means the masses of electron and selectron are the
same as the inevitable consequence of SUSY.
The argument for SUSY U{1) gauge theory is easily generalized to non-Abelian
gauge theory including the SUSY extension of the standard model, such as MSSM.
The transformation of a set of chiral superfields, denoted by a column vector <f>,
belonging to a representation of non-Abelian Yang-Mills gauge group, takes a form
A = TaAa, (7.107)
where A a are transformation parameters and the generators Ta satisfy (in a suitable
normalization)
where
It is easy to show that such defined field strength has a desirable transformation
rule,
Thus gauge invariant and SUSY invariant Yang-Mills theory with a gauge in-
variant superpotential W((p) is described by
The generalization to the case with several chiral superfields is quite straightforward.
Let us note that we still can impose a condition of Wess-Zumino gauge for each
of Va. The Feynman rule shown in Fig. 7.3 for SUSY QCD is now modified in a
trivial manner by a prescription,
for the gauge interaction vertices of (V£, X%[, Da), with (e, e) being replaced by the
scalar and spinor components of tj>.
The SUSY Yang-Mills theory of the most physical interest is the SUSY extension
of the standard model. Its minimal version, with minimal number of Higgs chiral
multiplets, is called Minimal Supersymmetric Standard Model (MSSM). In this sec-
tion, we discuss MSSM very briefly, focusing on the basic structure of the model.
For more detailed discussion on the MSSM and its various phenomenological im-
plications, we ask readers to refer to the nice reviews in the literature (Haber and
Kane, 1985; Weinberg, 2000).
Basically the construction of the MSSM is achieved just following the argument
of the previous section. In particular the Lagrangian is given by (7.113), though the
second term, the kinetic term (with covariant derivative) of chiral field <f>, should
be replaced by the sum of the kinetic term of each existing chiral field with distinct
gauge generator V = TaVa corresponding to its reperesentation of the gauge group
SU(3)C x SU(2)L x U(1)Y- The superpotential W of the chiral superfields may
take, in principle, the most general form compatible with gauge symmetry, as in
the standard model. We will argue, however, that some restriction is needed, since
Minimal Super symmetric Standard Model (MSSM) 175
otherwise some serious problems, such as too rapid proton decay is induced. The
restriction is due to a global symmetry called i?-symmetry, as we will see below.
The gauge group of MSSM SU(S)C x SU(2)L x U(1)Y is just the same As that of
the standard model, and the gauge bosons and gauge fermions form vector mmmul-
tiplets, whose interactions are uniquely determined by gayge principle, as is seen in
(7.113). The matter fields, on the other hand, are of our choice. In the stabdard
model, the quarks and leptons to start with are Weyl fermions. Thus it is natural
to assign quarks and leptons as the members of chiral multiples. The remaining
matter fields, Higgs, being complex fields, should belong to a chiral multiplet.
Actually we realize that to construct a realistic theory, one SU{2)L doublet of
Higgs chiral superfields is not enough, and we should add another doublet. The
reason is two-fold.
(1) In the (non-SUSY) standard model the Yukawa couplings or masses of down-
type and up-type quarks are provided by <j> a n d <t> = *cr2</'*, respectively. One may
naively expect that the Yukawa couplings are just replaced by the corresponding
superpotential of the chiral superfields of quarks and the Higgs doublet. Unfortu-
nately, its does not work. This is because, 4> is obtained by the operation of complex
conjugation (charge conjugation), and therefore the chiralities of <f> and <j> are op-
posite, while superpotential should be the polynomial of superfields with the same
chirality. We thus need to introduce two independent Higgs doublets in order to
provide masses to down-type and up-type quarks, say HD and Hu, with the same
representations of the gauge group as those of (f> and </>, both being left-handed chiral
superfields.
(2) Suppose we introduce chiral superfields whose scalar components are the mem-
bers of one Higgs doublet. As the new feature of the SUSY theory, chiral superfields
also contain the fermionic superpartners of the Higgs scalars, i.e. Higgs fermions. As
the Higgs fermions are Weyl fermions, they yield triangle gauge anomalies. For ex-
ample "t/(l)y" anomaly, coming from the triangle diagram where all three vertices
are made of the currents coupled with the U(l)y gauge boson, arises by the presence
of the Higgs fermion. Ther also appear the (SU(2)L)2U(1)Y type anomaly as well.
Such anomalies will be vancelled by the introduction of another Higgs doublet,
just because they have just opposite quantum numbers, with weak-hypercharges
Y = ± 1 . For instance U(1)Y anomaly will vanish as (+1) 3 + ( - 1 ) 3 = 0.
Thus the matter fields of MSSM with 3 generations of quarks and leptons are
all given as left-handed chiral superfields
*° = ( t ) • H" - ( i ) •
where we have used the same letters to denote the chiral superfields as those of the
ordinary particles in the standard model. For instance for generic quark q and Higgs
176 SUPERSYMMETRY
H chiral superfields, their spin s = 0,1/2 components together with the auxiliary
fields Fq, FH are written as
where q and H are the superpartners of ordinary particles, i.e. a squark and a Higgs
fermion.
Next thing to do for the construction of the MSSM is to choose the form of the
superpotential W. To provide the Yukawa couplings necessary to give quark and
lepton masses and the self-interaction of Higgs doublets, we employ the following
superpotential
These terms clearly break either of baryon or lepton number, B or L, and will easily
lead to too rapid B or L violating processes such as proton decay, or too frequent
FCNC processes such as /i —> cy, as the coefficients of these operators are generally
independent of the Yukawa couplings given in (7.117) and can be new sources of
flavor mixings. Thus it is desirable to devise a mechanism to guarantee the absence
of these undesirable operators.
Generally, to ignore terms, or operators, which is compatible with gauge sym-
metry is not allowed for the theory to be renormalizable, since such terms are
gauge invariant and they are generally induced at the quantum level with UV-
divergence and the counterterms are needed anyway. Thus the elimination of the
terms listed up in (7.118) does not seem to be justified. (One may expect that the
"non-renormalization theorem", characteristic to the SUSY theories, may work as
the justification. The SUSY, however, should be broken eventually for the theory
to be realistic.) The elimination is, at least, technically allowed if the elimination
is able to be regarded as the consequence of some global symmetry, independent of
the local gauge symmetry. It is not hard to imagine that the terms incompatible
Minimal Supersymmetric Standard Model (MSSM) 177
with the global symmetry, imposed on the whole Lagrangian, will never appear at
any orders of quantum corrections. Let us note that while the tri-linear term in the
superpotential dQHo yields ordinary Yukawa coupling of ordinary particles dQHo,
undesirable term dQL will result in a tri-linear coupling with one superpartner and
two ordinary particles, dQL. Thus in order to distinguish these two types of terms,
it seems to be natural to devise a global symmetry, which assign different transfor-
mation properties for ordinary particles and their superpartners. We thus consider
i?-symmetry, which is a characteristic symmetry in SUSY theories.
What we consider is some global U(l) symmetry. It is trivial that to distinguish
ordinary particles and their superpartners, the global symmetry should assign dif-
ferent U(l) charges to the different members of the same chiral multiplet. Hence
a global symmetry which assign an overall phase for the all members of a chiral
multiplet is not enough for our purpose. .R-symmetry is the symmetry under the
chiral transformation of the Grassmannian coordinates of superspace,
or
e< (7 120)
I) - ™(*)- -
Accordingly a generic chiral superfield <j> transforms as
where c denotes the overall U(l) charge of the chiral multiplet. The each component
field therfore transforms as
The F-term of the superpotential W\F ~ p W should be invariant under the trans-
formation of .R-symmetry. Thus the transformation property of the superpotential
should be
For instance, for the superpotential W = <$>\4>i$z of some three chiral superfields
to be i?-symmetric, their U(l) charges should satisfy C\ + Ci + c$ = 2. Let us note
that, since the .R-symmetry is a chiral symmetry, other terms in the Lagrangian,
i.e. kinetic terms with covariant derivatives are automatically .R-symmetric, though
the gauge fermions transform non-trivially.
We will assign the U(\) charge c = 1 for all of quark and lepton chiral superfields,
<3, u, d, L, I, and c = 0 for Higgs chiral superfields HQ and H\j. By taking a specific
value ip — ir, we can define a discrete symmetry, ".R-parity". The (overall) parity
178 SUPERSYMMETRY
of quark and lepton superfields is -1, while that of Higgs superfields is 1. Now the
condition reduce, with <p = n, to
It is now easy to see that the all terms in (7.118) do not satisfy the above condition,
while the terms in (7.117) are all consitent with the condition.
For each component field of the chiral multiplets the assigned 1/(1) phases for
<p = 7r, i.e. .R-parity denoted by R, is given by a compact form,
where J is the spin of the field and B,L are baryon and lepton numbers, and
3B + L = 1 for quarks and leptons and 0 for Higgs doublets. We thus easily know
that all of the quarks, leptons and Higgs have R = 1 while their superpartnes all
have R = — 1. In this way we have suceeded to distinguish ordinary particles and
their superpartners. In addition, the absence of the undesirable terms in (7.118)
is justified as the consequence of the -R-parity symmetry. We further note that,
in general, hevier superpartners having odd .R-parity should decay into lighter su-
perpartners and other ordinary particles, but not into a final state composed only
of ordinary particles, which have even .R-parity. Namely the .R-parity invariance
enforces the lightest supersymmetric particle (LSP) to be absolutely stable, thus
being an interesting candidate of (cold) dark matter.
Though we have implicitly assumed that the SUSY is an exact symmetry, actu-
ally the SUSY should be broken spontaneously (see section 7.12) or explicitly (with
"soft" terms, whose coefficients have positive mass dimensions, in order not to spoil
the nice property of the quadratic divergence cancellation), since the superpartners
of ordinary quarks and leptons having the same masses as those of the ordinary
particles have not been discovered. The theory should also possess the spontaneous
gauge symmetry breaking. These two kinds of symmetry breaking cause the mixings
among various particles via their mass (-squared) terms, in various sectors, which
makes the complete analysis of MSSM (Haber and Kane, 1985; Weinberg, 2000)
rather cumbersome.
We finally briefly comment on the specific feature of MSSM in its Higgs sector.
After the elimination of the auxiliary fields FHUID, the quadratic terms of Higgs
bosons in the obtained scalar potential are \H\2(H'DHD + H^Hu)- This in turn
means that the spontaneous gauge symmetry breaking does not happen, as long as
SUSY is preserved. An intriguing mechanism to achieve the negative mass-squared
of Higgs boson, necessary for the symmetry breakdown, has been proposed (Inoue,
et al., 1982), where the negative mass-squared is realized by the SUSY breaking
and the quantum correction due to the loop of a heavy fermion with large Yukawa
coupling, i.e. t quark.
We also note that the (Higgs scalar) 4 term of the scalar potential is uniquely
Some Phenomenological Predictions of SUSY Gauge Theories 179
In this section we will discuss very briefly how we can search for the trace of SUY
gauge theories for elementary particles and/or how we can test the characteristic
phenomenological predictions of SUSY gauge theories.
Very successful prediction of the SUSY version of the standard model, Minimal
Supersymmetric Standard Model (MSSM), is the gauge coupling unification. In
Grand Unified Theories (GUT), which unify all interactions (except gravity) of
elementary particles, three gauge couplings of SU(3)C, SU(2)L, U(1)Y, denoted
by <?3,<?2,<h for suitably normalized gauge generators, should be unified at higher
energies. Since these couplings are those for "strong", "weak" interactions etc.,
such couplings do not seem to be unified at least at low energy processes. Actually
the "asymptotic freedom" of non-Abelian gauge theories, and the "asymptotic non-
free" nature of Abelian gauge theory, briefly discussed in chapter 5, implied by the
renormalization group equations to discribe the energy evolution of gauge couplings,
make the grand unification possible. Since the dependence of the gauge couplings
on the energy is very mild, depending only logarithmically, the mass (energy) scale
of the grand unification MQUT is quite high. In the prototype (non-SUSY) SU(5)
GUT (Georgi and Glashow, 1974) MGUT = O(1015)(GeV). Such tremendously
high mass scale makes the life time of proton decay, typically predicted by GUT,
quite long, assuring the (approximate) stability of atoms.
The simplest SU(5) GUT model, however, seems to be facing the following diffi-
culties, (a) First, it predicts the proton life time TP ~ 3 x 1031 (years), much longer
the age of our universe. Nevertheless, the prediction has been ruled out by the
recent (Super-) Kamiokande result on the decay mode p -¥ ix° + e+. (b) Second,
the precision measurement of the gauge couplings g3, g2, g\ at E ~ 102 (GeV) at
LEP experiments (CERN) has revealed the fact that these couplings do not meet
with each other at an unique value of higher energy. It is quite impressive that
such difficulties can be evaded (though there still may remain some problem) in
SUSY SU(5) GUT (Sakai, 1981; Dimopoulos and Georgi, 1981), whose low-energy
180 SUPERSYMMETRY
effective theory is MSSM. Namely in the SUSY GUT, the evolution of gauge cou-
plings, or the ^-functions in the renormalization group equations are modified by
the presence of the newly introduced superpartners of ordinary particles (mainly
by the gauge fermions). If the SUSY breaking mass scale MSUSY is in a reasonable
range, MSUSY ~ l(TeV), all of such modified three "running" couplings meet at a
unique value of the energy ! Furthermore, the modification of the evolution raises
the unification scale a little bit, MQUT — 1016(GeV). This little change, however,
considerably raises the proton life time Tp (since Tp oc MQUT), thus evading the
lower bound on TP imposed by (Super-)Kamiokande experiment. It may be worth
noting that the most recent result on the lower bound of Tp from Super-Kamiokande
experiment is quite close to the prediction of the SUSY GUT, even though they
still seem to be mutually compatible.
Probably the most direct confirmation of SUSY will be the discovery of the char-
acteristic new heavy particles, i.e. the superpartners, in accelerator experiments,
and extensive efforts have been made in such direction. Such superpartners should
affect the low-energy physics as well, and their contributions may be (indirectly)
tested by use of various low energy observables, whose probabilities have been mea-
sured precisely. In particular the SUSY breaking masses of the order MSUSY of
these superpartners can be arbitrary a priori, as long as they are gauge invariant.
Therefore they may potentially spoil the successful predictions of the (non-SUSY)
standard model, unless there are some guiding principle to control them.
For illustrative purpose, here we will briefly discuss the contributions of super-
partners of quarks and leptons (squarks and sleptons) to the observables, which are
rendered to be small by global symmetries, i.e. (1) Ap and (2) Flavor Changing
Neutral Current (FCNC). These are quantities discussed in detail in chapters 8 and
9, respectively, and are handled by global symmetries, i.e. "custodial" and flavor
(or horizontal) symmetries. We will see in chapters 8 and 9, that because of these
global symmetries, these observables exactly vanish at the tree level. Though the
observables are induced at the quantum (loop) level, they are automatically finite
and are suppressed. Thus the smallness of these quantities are guaranteed without
any tuning of parameters of the theory, the property we call "natural". If SUSY is
exact, i.e. if there is no SUSY breaking, the situation essentially does not change,
and natural suppression of these observables are still operative. Once SUSY is
broken, which is necessary anyway for the models to be realistic, the situation re-
markably change, generally speaking. In particular, as we will argue below, FCNC
may get into trouble, since the SUSY breaking mass terms, being flavor or gen-
eration dependent, break the flavor symmetry. Thus unless there is some guiding
principle to control the mass terms, the natural suppression of FCNC is spoilt, and
it may cause a serious phenomenological problem of the model.
(l)Ap
The ^-parameter is defined in (4.105) and (8.5) as the ratio of charged and
neutral weak gauge boson mass-squared, p = 1 at the tree or classical level because
Some Phenomenological Predictions of SUSY Gauge Theories 181
of the SU(2)v (or SU(2)L) global "custodial" symmetry which remain in the Higgs
sector even after the spontaneous symmetry breakdown (see 8.4). However, the
symmetry is broken in the entire Lagrangian by e.g. the mass splitting between
the members of SU{2)L doublet, such as {t,b)1, the parameter deviates from 1 ,
i.e. Ap = p — 1 ^ 0. The contribution of the heavy top quark to Ap turns out to
2
behave as -rri-. Then, one may naively expect that the superpartners, which are
m
w
supposed to be heavy, ~ l(TeV) or so, may strongly affect Ap and SUSY gauge
theories may contradict with the experimental upper bound on Ap. Fortunately the
SUSY breaking mass-squared given for the doublet of squarks, for instance (iijd)',
does not break the global symmetry. Namely the SU{2)i gauge invariance makes
the SUSY breaking mass-squared term for the squarks MgUSY(\u\2 + \d\2) invariant
under the transformation of the custodial SU(2) symmetry. Thus we expect the
contribution of the squark doublet is under control. In fact we will see in (8.18) that
the contribution is strongly suppressed by the factor {m\ — m2l)2/(MyVMgUSY) (the
mixing between the superpartners of left-handed and right-handed quarks has been
ignored for brevity). This kind of suppression is generally stated as the "decoupling"
phenomena of heavy particles, which we will systematically analyze in section 8.2.
Thus the newly added contributions, characteristic to SUSY gauge models, are not
troublesome concerning Ap, as long as MSUSY is not too small.
(2) F C N C
We will discuss in section 9.2 that to guarantee the natural suppression of FCNC
processes, whose rates are experimentally stringently bound and these processes are
often called rare processes, fermions of the same electric charge and chirality should
belong to the same representation of the gauge group (Glashow-Weinberg's condi-
tion) . Since the SUSY generators commute with those of gauge symmetry, these
two symmetries are mutually independent. Thus the superpartners should belong to
the same representations as those of ordinary quarks and leptons, and the Glashow-
Weinberg's condition is still satisfied in SUSY extension of the gauge theories sat-
isfying the condition. In fact, each of neutral currents coupled to Z weak gauge
boson in quark and squark sectors, for instance, is flavor-diagonal and does not have
FCNC. There appear, however, new type of neutral current in SUSY gauge theories,
namely the (fermionic) current coupled with gauge fermion, which is bi-linear form
of a fermion and its superpartner, of the type shown in Fig. 7.3(b). As we will see
below if SUSY breaking mass-squared for superpartners are flavor-dependent, there
arises a mismatching between the mass (-squared) matrices of fermions and super-
partners, and when we move to their mass-eigenstates the neutral current coupled
to the gauge fermion may have FCNC. This happens already at the SUSY QED
discussed in 7.8 once the model discussed there is generalized so that it includes
(three) generations of leptons. Thus leptonic FCNC process, for instance p, —> ej,
becomes possible via the exchange of the gauge fermion, photino AM , as shown in
Fig. 7.4.
We are now ready to confirm the above statement by a little explicit calculation
182 SUPERSYMMETRY
AM
Fig. 7.4
in the SUSY QED with three generations of leptons. The chiral multiples in (7.96)
is generalized to include three generations, a = 1,2,3, as
W = mpa4>-a(f>+0, (7.127)
where mpa may be regarded as the (/?, a) elements of a mass matrix m. The mass
terms of leptons la and sleptons la are easily obtained as the generalization of the
mass and mass-squared terms for e and e given in (7.102) and (7.105):
where the lepton and slepton states are denoted by column vectors IL<R =
(IIL,R^2L,R,13L,RY and IL,R = ('iL,fl^2L,fl,'3L,fl)'- It is obvious that the mass
matrix for lepton and mass-squared matrices for sleptons are diagonalized simulta-
neously by the same unitary matrices UL and UR:
It is clear that even if we move to the mass-eigenstates lmL,R and lmL,R, the neutral
Some Phenomenological Predictions of SUSY Gauge Theories 183
since WL RUL,R = I (I is the 3 x 3 unit matrix). Namely all neutral currents have
no FCNC at tree level. More intuitively we may say that the superpotential (7.127)
itself can be diagonalized by UhyR without changing the kinetic terms for chiral
multiplets.
The situation changes once we introduce SUSY breaking mass-squared matrices
M\ R, only for sleptons (for brevity, possible mass-squared term to mix II and lR
is ignored),
Suppose the SUSY breaking is caused by some interactions, which are indepen-
dent of flavors, such as gauge interactions. Then the SUSY breaking mass-squared
matrices do not break the flavor symmetry and MRL should be invariant under
the unitary transformations by UL,R- This argument implies that MR L should be
proportional to the unit matrix,
where MSUSY,L and MSUSY,R denote the mass scales of the SUSY breaking. As
far as the SUSY breaking does not break the flavor (or horizontal) symmetry, we
expect that the same argument as that in the case of exact SUSY will holds. In
fact, we readily realize that the slepton mass-squared terms, including the SUSY
breaking terms, are simultaneously diagonalized as the diagonalization of the lepton
mass matrix,
Thus there does not appear FCNC in the neutral current coupled with the photino.
In the MSSM, the SUSY extension of the standard model, there appears the
flavor or generation mixings in the charged currents coupled with the superpartners
of W± as well. But such mixing is described by the same matrix as the KM matrix,
as far as the SUSY breaking mass-squared matrices are proportional to J. We
also learn from (7.135) and (7.136) that the mass-squared differences of squarks or
sleptons in MSSM, which play essential roles in FCNC processes, are the same as
those of quarks or leptons. For instance
m? - m? = m\ - m2u. (7.137)
Thus the rates of FCNC processes are suppressed by the small quark mass-squared
differences and/or small flavor (generation) mixings, just as in the case of the stan-
184 SUPERSYMMETRY
We have seen in the argument of SUSY QED that the masses of electron and se-
lectron should be the same provided the SUSY holds. Experimentally, however, no
selectron with the mass me has been observed. Thus SUSY should not be an exact
symmetry in the nature and should be broken spontaneously or explicitly. Unfortu-
nately, no successful spontaneous SUSY breaking in the sector containing ordinary
quarks and leptons is known because of some phenomenological difficulty, though
the spontaneous breaking is theoretically more appealing. Thus in the MSSM, for
instance, explicit SUSY breaking scalar mass-squared terms, for instance, are added
to the Lagrangian (by hand). We, however, discuss in this subsection the mecha-
nisms of spontaneous SUSY breaking, since it is not only theoretically appealing,
but also is expected to be the origin of the "explicit SUSY breaking". In fact, it
is argued that in the MSSM embedded into a supergravity theory, for instance, the
explicit SUSY breaking terms are brought into the "observed" sector of quarks and
leptons via supergravity interaction from some "Higgs sector", which is decoupled
from the observed sector once supergravity interaction is switched off and where
the SUSY is spontaneously broken.
First let us note that the vacuum energy, i.e. the vacuum expectation value
(VEV) of the Hamiltonian Ev — (0|-W|0) is the order parameter of the spontaneous
SUSY breaking. This is easily seen from the relation in SUSY algebra,
{Qa,<U = - 2 K ) a d J P M . (7.138)
As the each term in the left hand side is semi-positive definite operator, we find
Ev = (0\H\0) > 0. If Ev vanishes we conclude SUSY is not spontaneously broken
as Qa\0) — Qa\0) — 0. Thus to break SUSY spontaneously, EV > 0 is necessary.
Spontaneous SUSY breaking 185
In fact if this is the case, under some SUSY transformation due to Qa or Q&, the
vacuum state is not invariant. Thus
Actually it is quite easy to realize a SUSY model with Ev > 0. Let us imagine a
Wess-Zumino model with a chiral superfield <> / and a simple superpotential W(<f>) =
c 4> (c : constant). The resultant scalar potential is positive, V = | g^ '\2 = |c| 2 >
0, and SUSY is expected to be broken, formally speaking. It, however, does not lead
to any mass splitting between the scalar A and the spinor %p components of <f>. In
fact, W\F = c F (F : auxiliary field), which has no self-interaction, nor mass terms
for A and ip. Thus Ev > 0 is necessary but not sufficient condition for "physical"
SUSY breaking. (This is why the arrow in (7.140) has only one direction.) In the
following we discuss possible mechanisms to get physical SUSY breaking.
In general, scalar potential is the sum of the squared absolute values of auxiliary
fields,
where the auxiliary fields are written in terms of scalar fields Aj as implied by the
equations of motion,
where g is the gauge coupling and Ta are the gauge generators acting on A{. Thus
to achieve spontaneous SUSY breaking with constant scalar vacuum expectation
values, necessary for Lotentz invariance, Ev = (0|V|0) = |(JFi)|2 + \(Da)2 > 0 is
needed ((F?) = (0\F?\0) = a ^ ^ > > ) , etc.). We therefore think of three possibilities
to realize the spontaneous SUSY breaking:
(a) (Fi) i£ 0, irrespectively of (Da)
(b) (Da) ^ 0, irrespectively of (Fi) (7.143)
(c) (Ft) = 0 and (Da) = 0 do not hold simultaneously,
where in the former two cases we do not have to care whether other types of auxil-
iary fields develop vanishing VEV's or not. The spontaneous SUSY breaking of type
(a) and (c) are well-known and were proposed by O'Raifeartaigh (O'Raifeartaigh,
1975) and Fayet-Iliopoulos (Fayet and Iliopoulos, 1974). The third type (b) (Fayet,
1976; Inami, Lim and Sakai, 1983) is less-known, but will be discussed here, as it
is one logical possibility and has a nice feature that SUSY is spontaneously broken
purely due to the gauge interaction.
model with several chiral superfields, without considering their gauge interaction.
As we have pointed out above, (F) ^ 0 itself is easily realized only with 1 chiral
superfield, though it does not lead to any physical SUSY breaking. O'Raifeartaigh
found that to realize physically meaningful SUSY breaking at least three chiral
superfields are necessary. Namely he found that a superpotential W for three chiral
superfields fa (i = 1,2,3),
„ - 2 dA^ = ^ - 0 . (7,45)
2
are satisfied for any VEV's of ^1,2,3. Assume, for simplicity, that M, A, fi are
all real and /i 2 > 2A 2 M 2 , then the minimum of the scalar potential V = X2\Al -
M 2 | 2 + A*2|A3|2 + |2AA!A3 + fiA2\2 is known to be achieved by (Ax) = (A2) =
(A3) = 0. Thus (V) = A 2 M 4 > 0, as expected. Furthermore, we can check
that there appear mass splitting among the masses of scalars and fermions. Since
all VEV's of scalars are 0, the masses of fermions and scalars are easily read off
from the quadratic terms in W and V, respectively: fj.tp2ip3 for the fermion mass,
-2A 2 M 2 Re(A|) + ii2(\A2\2 + \A3\2) for the scalar mass-squared. Namely the Weyl
fermions ip2, if>3 are combined to form a Dirac fermion $ = (V,3iV'2)t with a mass
my = n (7.146)
Thus the masses of fermions and scalars are not identical any more, as the con-
sequence of SUSY breaking. We, however, also note that, even though the mass
splitting does occur, the average mass-squared of fermions and scalars are still
identical. To be more strict, so-called super-trace of mass-squared matrix, i.e.
StrM 2 = ^2j(-l)2J(2J + l)m2j vanishes when SUSY is spontaneously broken.
Here mj is the mass of the field with spin J field, where each of Majorana fermion
or real scalar with mass m contributes to the sum as —2m2 and m2. For instance,
in the O'Raifeartaigh model we can explicitly confirm that StrM 2 = 2 x (0 + (j?) +
(fi2 - 2A 2 M 2 ) + (/i2 + 2A 2 M 2 ) - 2 x (2 x \ + l ) ^ 2 = 0, where the multiplication
factor 2 denotes that the corresponding masses are for complex scalars and a Dirac
fermion.
Spontaneous SUSY breaking 187
|e f l | 2 + | e t | 2 = 0 , e | e f l | 2 - e | e L | 2 + £ = 0, (7.150)
*-eL
(a) (b)
Fig. 7.5
In the former case, though SUSY is broken, the U{\) gauge symmetry is not spon-
taneously broken with vanishing VEV's, while in the latter case both symmetries
are spontaneously broken (see Fig. 7.5).
In the case m 2 — e£ > 0, for instance, the masses of electron and selectrons are
given as
chiral superfields
both belonging to the doublet representation of SU(2). Let us note the assignment
of the U(l) charges Q = 1/2, —1/2 is such that the triangle gauge anomaly disap-
pears. By use of the equations of motion for D auxiliary fields, the auxiliary fields
are written in terms of scalar fields, which we denote by the same letters cf> and </>',
as
It is easy to check that for £ ^ 0 these three conditions are not met simultaneously
By any choice of x, y and z. Thus, we conclude that Ev = (V) = \(Da)2+ \{D)2 > 0
and SUSY is spontaneously broken, irrespectively of the choice of the superpotential.
This argument can be generalized to SU(n) x U(l) SUSY gauge theories. It
turns out that SUSY is spontaneously broken provided the number of pairs of
chiral superfields belonging to n and n representations of SU(n) with opposite
U(l) charges is less than n (Inami, Lim and Sakai, 1983). The SU(2) x [7(1) theory
with a single pair of chiral superfields just discussed above is the simplest example.
Problems
7.1 Prove the following relations concerning the 2-component spinors (anti-commuting
Grassman numbers) 6 and 6 and the 2 x 2 matrices a1* and CTM. In 96, 6a§ etc.,
spinor indices have been suppressed:
7.4 Show that in the simplified Wess-Zumino model with only one chiral superfield
(/>, whose Lagrangian is given in (7.66), there is an exact cancellation of the quadratic
divergence between the diagrams Fig. 7.2(a) and (b). Show also that the another
type of diagrams with the loop of fermionic field, coming form the contraction of
two identical vertices, e.g. AipM^-^ipM, does not yield the quadratic divergence,
even for m / 0 .
The standard model of elementary particles, especially its electroweak sector, was
devised as a renormalizable gauge theory with massive vector bosons such as Z°
and W±. Already at the classical (tree) level, without quantum corrections, the
standard model has its characteristic properties and such properties have been ex-
tensively tested experimentally. We may pick up some typical examples of such
tests: discovery of the predicted neutral current processes, confirmation of the ex-
istence of Z° and W± with their predicted masses, forward-backward asymmetries
AFB in the scattering e+e~ -> ff (f : fi,etc.).
The genuine features of the theory as the renormalizable theory, however, should
be studied by comparing the calculated (finite but small) quantum effects of the
theory on the physical observables, i.e. radiative corrections, with the data obtained
in the precision experiments, such as LEP, CDF, etc.
Let us recall how the renormalization procedure goes, namely how the finite
(calculable) radiative corrections are obtainable in the renormalizable theory. To
avoid unnecessary complication, let us focus on the gauge sector, namely the part
of the Lagrangian, which contains gauge bosons. Then we have 3 bare parameters:
2 gauge couplings g, g' of SU(2)L and U(1)Y and the vacuum expectation value
(VEV) of the neutral Higgs boson, v. What we should do first is to calculate
observables, which have been excellently well-measured experimentally, i.e.
191
192 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
as mt, TUH'-
which at the first sight have UV (ultra-violet) divergences, but actually are finite
"renormalized" quantities. (Or we may say that the bare parameters themselves
are UV divergent.) Equating these 3 functions with the measured values listed
above provides formulae, which enables us to express the bare parameters in terms
of observables Mz etc.: g = (j>(Mz,GF,a;mt,mH,- • •)> etc. Then, we may cal-
culate any other observables in terms of bare parameters, / ( # , <?',u;m t ,m#, • • •),
which are superficially divergent. Once the functions <f>(Mz,---) are substituted
for the bare parameters g etc., they become the functions of well-measured quan-
tities: f(g,g',v;Tnt,mH,---) = f(Mz,GF,a;mt,mH,- • •)• The obtained quantities
/ should be all finite, as long as the theory is renormalizable. These are real pre-
dictions of the renormalizable theory.
The above argument on the quantum effects is easily understood, relying on
(local gauge invariant) operators. As is well-known in 4 space-time dimension, a
renormalizable theory should contain only marginal or relevant operators with mass
dimension d = 4 or d < 4 to start with. After including quantum effects, there also
appear irrelevant operators with d > 4, together with the marginal and relevant
operators. The marginal and relevant operators get ultra-violet (UV) divergent cor-
rections on the coefficients of their operators, such as A2 or In ( £ ) , with A and
p, being momentum cutoff and the renormalization scale. (The momentum cutoff
potentially break local gauge invariance and usually alternative regularization meth-
ods such as dimensional regularization may be used.) The quantum corrections to
g and g' and the corrections to gauge boson mass-squared M§, M^,, keeping the
relation of bare masses,
belong to this category. On the other hand the coefficients of irrelevant operators,
induced by the quantum effects, should be automatically finite; otherwise the UV-
divergences cannot be removed, since there are no irrelevant operators nor countert-
erms in the original Lagrangian. The physical observable Ap = p — 1 or equivalently
Mf cos2# - My/i caused, e.g., by a mass splitting between the members of S U ( 2 ) L
doublet, is a typical example. Apparently, it seems that the radiatively induced Ap
is UV-divergent, as the gauge boson mass-squared seems to be the coefficient of a
d = 2 relevant operator. This apparent contradiction is resolved once we invoke to
the language of (local) gauge invariant operators with Higgs field included. As far
as the theory to start with is gauge invariant, every operators induced or corrected
by quantum or radiative effects should be all gauge invariant. This is, say, a picture
in the "symmetric phase" of the theory. Of course the gauge boson mass-squared
stems from the spontaneous symmetry breaking (SSB), i.e. in the "broken phase"
The Meaning of Precision Test of Electroweak Radiative Corrections 193
of the theory. Thus the mass-squared (difference) relevant for Ap should be under-
stood as the result of the replacement of the (neutral) Higgs field by its VEV in
an irrelevant operator composed of Higgs field, as well as gauge bosons. In fact we
will see in the later section that Ap can be regarded as the coefficient of a d = 6
operator. This is why the radiatively induced Ap is finite.
The precision tests of such finite radiative "corrections" (quantum effects) to
electroweak parameters, such as gauge couplings and gauge boson masses, are quite
important to check the validity of the standard model, since these finite effects are
genuine predictions of the model. For instance such tests are performed by the
precision measurements of various parity violating asymmetries such as forward-
backward asymmetry or LR asymmetry in e+e~ scattering (LEP etc.). The heavy
particles of the SM, like t quark and Higgs, and still unknown heavy particles
predicted by theories beyond the standard model ("New Physics") do not directly
appear in the external lines of Feynman diagrams of lower energy processes. They
appear only or mainly through their contributions to gauge boson self-energies,
i.e. through indirect "oblique corrections". Strictly speaking, e.g. the t quark
may contribute to bbZ vertex, and these extra contributions have to be estimated
depending on each case. It, however, is still true that the oblique corrections exist
universally when we consider the radiative corrections of heavy particles and are
worth general investigations.
As the coefficients of irrelevant operators have negative mass dimensions (d <
0), we may expect that the finite quantum effects due to heavy particles may be
suppressed by the inverse powers of the heavy particle masses ("decoupling" of
heavy particles in low energy processes). But it turns out that this is not always
true, and some radiative corrections are not suppressed by the inverse powers. Such
"non-decoupling" effects in oblique corrections potentially appear only in restricted
number of "oblique parameters", S, T and U, which parameterize electro-weak
radiative corrections. The precision tests of the 3 parameters S, T, U are quite
important not only to get useful information of the heavy particles in the standard
model, m t and mu, but also to test various theories of New Physics. In fact,
(original version of) "technicolor" theory was ruled out as the result of the precision
test of the 5-parameter (Peskin and Takeuchi, 1990).
The purpose of this chapter is to discuss the precision test of the radiative
corrections to the electro-weak parameters in some detail, not only due to the heavy
(comparable or greater than Mw) t quark or Higgs in the standard model, but also
due to the heavy particles predicted by various New Physics. We, however, assume
that the gauge group we take is the same as the standard model, i.e. SU(2)L X
l/(l)y.
194 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
fc2-M2~ M2' K
'
2 2
in lower (\k \ -C M ) energies. At the tree level, this replacement is justified. In
the loop diagrams, however, the replacement is no longer justified, since the 4-
momentum k^ varies as the loop momentum in the integration. Thus whether the
heavy particle contributions are really suppressed by the inverse powers of their
masses or not is a non-trivial question to be addressed.
In gauge theories without SSB, having only parity preserving vector-like (non-
chiral) interactions, such as QED or QCD, so-called "decoupling theorem" holds
(Appelquist and Carrazone, 1975): the contributions of heavy particles with mass
M to physical observables are suppressed by the powers of 1/M. Let us take the case
of QED as an example. Suppose we have a heavy charged lepton E~ with the mass
M, say a partner of ordinary electron belonging to hypothetical heavy generation.
Let us consider its contributions as virtual states in the processes described by
Feynman diagrams with n external photon lines (see Fig. 8.1).
The quantum effects provide effective Lagrangian for external photon fields
where Oi denote local gauge invariant operators with n photon fields A^ and Cj are
2
Fig. 8.1
inverse powers of M, 1/M di ~ 4 (di > n > 4). What all of these are saying is that
the coefficients of irrelevant operators are suppressed by the inverse powers of M.
This kind of dimensional analysis no longer holds for the case of marginal op-
erator, which appears in the case of n = 2, i.e. self-energy diagram. This just
corresponds to the correction to Ffil/F,iV. (We also get operators with higher
derivatives, which we ignore as they are contained in the category of irrelevant
operators.) The coefficient is dimensionless and behave as J0 dt In ( M2 _w 1 _ t N g2 ))
with A, t being momentum cutoff and Feynman parameter. Obviously, this co-
efficient is not suppressed by large M. This correction, however, is not a pre-
diction of the theory as it is UV-divergent. After imposing a renormilization
condition that the correction disappear at —q2 = /z2, which is met by adding a
counterterm made from bare parameters, the correction is rendered to be finite:
Jo dt In (
Mi-t{i-t)qi) ~ ( _ 9 2 - • /^2)> which may be approximated at low energies
H , \q \ < M 2 to be, - £ dt t(l - t) ^f-
2 2
= -\*~$r- « 1. Hence, again the net
effect of the heavy particle is suppressed by an inverse power of M, though it does
contribute to the renormalization of the bare parameters. These are contents of the
"decoupling theorem".
This situation is psychologically good, since when we calculate some physical
quantities just relying on the standard model we do not have to worry about what
really is the theory in higher energies where unknown heavy particles are supposed
to exist: these heavy particles may give some extra contributions but they are
safely small. However, it also says that we cannot have significant information con-
cerning the properties, such as masses, of the heavy particles or concerning New
Physics. Fortunately, in chiral theories with SSB like the standard model, some
"non-decoupling" effects are known to exist. As the above argument of decoupling
theorem is a convincing one, one may wonder what is new in such theories. The
key ingredient, which leads to the non-decoupling, is the fact that now (at least
some of) the coupling constants are proportional to masses of the relevant par-
ticles. More precisely, for instance in the standard model, all massive particles
(including Higgs itself) get their masses only through spontaneous gauge symmetry
196 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
\ / N /
\ H / N /
) ( s /
/ s
/ \
/ \ / X
V V V V
(a) (b)
Fig. 8.2
breaking (SSB), i.e. through their couplings with the Higgs field, which develops
VEV. Thus when some particle is heavy that means its coupling with the Higgs,
such as Yukawa couplings and Higgs self-coupling, are strong. (Hereafter we will
assume that the couplings are not extremely strong, not to spoil the perturbative
expansion.) For instance the Yukawa coupling of the top quark / ( i s proportional to
mt, ft — mt/(v/V2) = T/5i§^7- Such couplings appear in the vertices of Feynman
diagrams and therefore in the numerators of transition amplitudes of the processes
we consider, and the above argument on the decoupling theorem, solely relying
on dimensional analysis, is no longer valid. We thus anticipate that in some suit-
ably chosen processes, heavy particle contributions may be enhanced by (positive)
powers of M.
We will list up some known examples of such non-decoupling effects of heavy
particles in the following.
1 i / . M% , , . , ., o2 . 1 o2 , Mt , , .,
( -) + ] {ipip ? (8 8)
-* 2^MF ^ W ^ ^¥ ^ M M ~- '
2
The first term without q implies we have effectively got a contact interaction en-
hanced as Mfj. Actually this term is exactly cancelled by the original quartic
Decoupling and Non-decoupling 197
interaction, shown in the second diagram Fig. 8.2(b), - gMaH (ip+ip~)2. Thus in
the large MH or equivalently low energy {\q2\ C Mjy) limit, we have only derivative
interaction —5^5- (tp+<p~) D(<p+ip~~). This term is independent of Mjj and indi-
cates a non-decoupling effect of heavy Higgs. We may attach the lines of <p+, <p~ to
the internal line of H as many times as we want to get higher dimensional operators
of the fields tp. Such operators turn out to be all derivative interactions and to be
summarized in a compact form (u/\/2 being the VEV of the Higgs)
This non-linear realization of the Higgs field is the counterpart of non-linear sigma
model in QCD, where <pl should be understood as 3 pion fields -K% (i = 1,2,3), which
are only particles appearing in the (very) low energy regime of QCD. It is now clear
why the contact interaction term disappeared: in the polar coordinate system the
Higgs potential is independent of <p%, as U^U = / (/ : unit matrix).
(b) Ap
We have defined the p-parameter as the ratio of charged gauge boson mass-
squared to that of the neutral gauge boson (with an extra factor cos2 0w), which is
exactly 1 at the tree or classical level. The parameter, however, deviates from 1, i.e.
Ap = p — 1 ^ 0 , once the quantum correction is taken into account. Let us consider
the quantum correction due to the SU(2)L doublet (t, b)1. If there ever exists
extra generation of quarks and leptons, the doublet may be replaced by the doublet
of fourth generation quarks (£', b')* as well. Since W£ = cos6\y ZM -I- sin Owl11
and the photon field 7M never gets mass correction due to the unbroken U(l)em
symmetry, the quantum correction to cos26w M\ is equivalent to the correction
to the operator W^^W^, with 3 denoting the third component of SU(2) adjoint
representation. So at 1-loop level, namely as far as 0(a) corrections (denoted by
the quantities accompanied by 6) are concerned,
Ap 6{ X
- M*W3>- M>W + 6M*W3 -M%,{ 2 SM
n>> (8
-H)
where we have replaced the mass of the charged gauge boson by Mwt 2 :
M ^ + W + W - " = M*,+ w^w?+w^w3 . (8.H) is graphically expressed in Fig. 8.3,
where n a b(0) (a, b = 1 — 3) denote the scalar part of the vacuum polarization tensors
198 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
~ 2 ( n u ( o ) + n 22 (o)) -n 3 3 (o)
Fig. 8.3
of Wa^W^ 2 point functions (after factoring out g^u) at q2 = 0. Let us note that
the scalar parts at 0 momentum correspond to the quantum corrections to gauge
boson mass-squared.
The contributions of the (t, b)1 doublet is readily obtained by modifying the
original calculation by Veltman (Veltman, 1977) for the contribution of heavy lepton
doublet to be
A
P = ,„ • 2 „ T?T (mt + mb o \ In - 4 ) , (8.12)
167rsin2 ^ ^ ml-m2b m\n v ;
where the factor 3 comes from the colors of quarks. One may check that this
expression vanishes in the limit of "degenerate doublet" mt = m^. (The factor
—5^—j In ^ is not singular in this limit, but just yields a derivative of the log-
arithmic function, 1/m2. If we take the limit of mj C mi, as is realized in the
nature, we get
. 3a m?
l07TSin C'lV Mff
which clearly shows the enhancement of the quantum effect of heavy top quark,
proportional to m2, i.e. a non-decoupling phenomenon.
U,C,t
——*—
W+,<p+ SW~'V~
u,c,t
»
Fig. 8.4
where L = ^ p 2 - , Vjs etc. are KM matrix elements and the coefficient function
E(xi,Xj) denotes the contributions of internal up-type quarks with masses m*, m,-
2
(xi = -Tji-). The (pure) t quark contribution reads as,
m
w
<8 16)
**>*-H~\jk- -
which behaves as mf and is another example of the non-decoupling effect of the
heavy fermion. It is worth noticing that for c quark, with xc <C 1, E(xc) ~ —xc,
which just recovers the result of Gaillard-Lee. The calculation (Inami and Lim,
1981) also shows that similar enhancement proportional to m\ appears in the in-
duced FCNC vertex sdZ ("Z-penguin"), but not in sdj vertex (ordinary "penguin"),
where we have only logarithmic enhancement ln -n!j- due to the property of the
CVC. As far as we know the heavy fermion contributions to Ap and these FCNC
processes are only known examples where we have an quadratically growing non-
decoupling effects. Interestingly, it turns out that (at least at 1-loop level) the
contribution of a heavy Higgs to Ap does not grow as M\, but grows only as
ln
m ^M
a --
w
200 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
We have seen that the interesting non-decoupling effects arise in chiral theories
with SSB. Even in such chiral theories, however, heavy particles effects may be
decoupling. It is the case when heavy particle mass is not provided by SSB, i.e.
not through large couplings with Higgs, but by a new large mass scale, independent
of Mw, characterizing some New Physics. Such new mass scales are inevitably
invariant concerning the gauge symmetry of the standard model, since otherwise
it will break the gauge symmetry at much higher energies than Mw, which con-
tradicts with reality. In such a sense, the situation is similar to the case of the
decoupling phenomena we discussed above about a heavy lepton E~, whose large
mass was also gauge invariant and came to the denominator of the coefficients of the
effective operators. We will list up some typical examples of such decoupling effects.
have already seen in 6.2, the small Majorana mass ^f-, with mr>,m,R being a
Dirac mass and the Majorana mass for VR, is induced through a tree level dia-
gram in Fig. 6.1. The fact that the mass is suppressed by 1/TUR can be under-
stood as to indicate the decoupling effect of the intermediate state VR. Let us
note that m f i ( » Mw) is SU{2)L X U(1)Y invariant mass, which may be related
with the new mass scale of, say, left-right symmetric model of electro-weak interac-
tion SU{2)L x SU(2)R x U{\)B-L (Mohapatra and Senjanovic, 1980) (B, L denote
baryon and lepton numbers), where SU(2)R symmetry and therefore parity is spon-
taneously broken at the scale 0(rriR).
Ap = ^ h r - TRT K + m\ - ^ % In % . (8.17)
H
167rsin2 6W Mw y u d
ml-ml m\> y
'
Superficially, the form of this equation is the same as that in (8.12) for (t, b)( doublet
Oblique Corrections and S, T, U Parameters 201
/'
V/QAAA/
/'
Z „ 7 7 „ Z
z _ z
Fig. 8.5
contribution, and the decoupling of super-partners does not seem to be the case.
As the matter of fact, the decoupling holds: Since mu>d <£ MSUSY, we can expand
m2
the formula for Ap in terms of powers of m,/ri"'d to get
SUSY
relevant operator which is quartic in the Higgs field, as we will see later in this
chapter.
As has been already stated in 6.1, heavy particles of the standard model, t quark
and Higgs, and heavy particles predicted by New Physics do not directly appear
in the external lines of Feynman diagrams for the lower energy processes of light
particles. They appear only or at least mainly through their contributions to gauge
boson self-energies, so called "oblique corrections". Up to the order of 0(a2), or up
to the 1-loop order, the oblique corrections to the process / / —> / ' / ' are shown in
Fig. 7.5, where the blob denote the radiative corrections to gauge boson self-energies
due to the heavy particles.
Such oblique corrections are known to be conveniently incorporated by "star
prescription" (Kennedy and Lynn, 1989), namely by replacing the bare quantities
at tree level amplitude e, s 2 (= sin29w), etc. by corresponding "star" quantities
e*(q2), s2(q2), etc.. which incorporate quantum corrections as well and depend on
q2, with q^ being the momentum of intermediate gauge bosons. More explicitly,
202 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
the effective 4-Fermi Lagrangian for the process (in momentum space) is written as
+ ^(H[hL-slQ)f)-^—{j'^%L-slQ']n (8-19)
where Q, Q' and h,^ are electric charges (in the unit of e) and the third components
of weak isospin of fermions / and / ' , respectively.
Let us demonstrate this and find the expression of the star quantities in terms
of the vacuum polarization functions n(<72). We start from writing the radiative
corrections to gauge boson self-energies denoted by blobs in Fig. 8.5, in the form
of induced quadratic terms for gauge bosons in the effective Lagrangian, in terms
of the vacuum polarization functions II(g 2 ):
where we have included the correction to charged gauge boson self-energy Uww, for
later use. We have ignored the part proportional to q^,qv in the vacuum polarization
tensors, since in the scattering amplitude this part provides relatively negligible
terms which are at most proportional to external light fermion masses after the
usage of equations of motion, such as q^f^y^f = 2m/ / 7 s / . At the order of 0(a2)
or at the 1-loop level, the effective 4-Fermi Lagrangian relevant for the scattering
is given as
g '(i-n!, T ) ? S ^ \ f eQ'f'-ff
J%, „ ^ - J V fJ'l^L-s2Q')f
Fig. 8.6
~ — {i-[n77(<z2)-n77(0)]}, (8.22)
Ana
2 _ „2
3^ = 3 cs IL'Z„ (8.23)
dUzz. , c2 - s2
l + ^!^=Mi-n;7-^—-U' Z y } (8.24)
_lg2=M|-n77-—-
M2 = Mlz +
t- U
Ilres
= M2+Uzz(q2)-Uzz(M2)-(q2-M2z)^\q2=M2, (8.25)
where ^ ^ = ^ [1 —II77(0)] has been used. Let us note that generally when the vac-
uum polarization functions U(q2) are Taylor expanded in the powers of q2, only 11(0)
and the first derivative ^\q2=0 are possibly UV-divergent, since they correspond
204 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
to the radiative corrections to gauge boson mass-squared and kinetic terms. This
statement is true for the Taylor expansion around arbitrary point, such as q2 = M§.
Thus n ; 7 ( g 2 ) - n ; 7 ( 0 ) and II r es = V.zz{q2) -UZZ(M2) - (q2 - M 2 ) ^ | , 2 = M | ,
e.g., are finite, since they contain only the terms with 2nd or higher derivatives, and
we have got formulae in which the finiteness of physical quantities, like e* and M*
are manifest . The finiteness of Z* is less trivial (See Problem 8.4). The remaining
formula for s* contains bare quantity s2 and is not manifestly finite. So we attempt
to rewrite s2 by a quantity made from well-measured observables and e* at the Z
pole, sin.20w\z = (TTOGM 5 ") 1 ^ 2 - ^rom t n e relations
~j5-w{1--#Mr]' (8 28)
-
We get a formula for Ow\z
2cV
sm26w\z = s2 + S (s 2 ) = s 2 + 2csS6w = s2 + -r ~ 6(ln(sm26w))
^ ^ < ^+ ^ - S ^ ) . <-)
Thus substituting s2 obtained from the above relation, we finally get
(8.31)
The vacuum polarization functions II 7 7 , Tlz-y, Rzz can be written, factoring
the gauge coupling constants out and indicating the indices of SU(2)L and U(l)em
Oblique Corrections and S, T, U Parameters 205
where Tl22 = IIn due to the unbroken t / ( l ) e m symmetry, and it may also be written
as n "+ n 22 as well. We can understand the form of IIz 7 , Tizz easily if we recall
that neutral current coupled with Z takes a form —^ /7 M (J 3 L — s2Q)f.
As an example we calculate the contributions to these polarization functions
from a pair of quarks (t, b)1, which may be regarded as the quarks of another gen-
eration including 4-th generation as well, if it ever exists. All vacuum polarization
functions are reduced to just 2 independent functions which correspond to the vac-
uum polarization amplitudes induced by two electro-weak currents of quarks with
the same and opposite chiralities, respectively:
ULLim^m^q2) = II f l f l (m 2 ,m 2 ,g 2 )
dtln ( 22 jV2 ., , 1
= ~jBr [ 2
- *" "~2)'[t(1"t)q2 ~ ^M% (8-36)
'M -t{l-t)q2
^LR{m\,ml,q
i\,m\,q') = T\RL{m\,ml,q2)
1
12 f A2 1
dUn{
-~(4TT)2 J0 M*-t(l-t)qz>'2
= -[QtUvv{m2,m2,q2)-QbUvv(mlm2b,q2)}, (8.40)
It is worth noting that the "chiral amplitude" has non-decoupling effect for mt »
mb,
6 A2
2 2 2
ULL(m t,mlq ) ~ —— m \n{-^), (8.43)
2
3 ,A , 1,
nLL(m2,m2,g2) ~ 7 --T^?
7/•r m
?2 m
22 n
t £ (In
( l nv((—
—)
29 +-), (8.44)
(47r) m
which contribute to Ap as
On the other hand, when IILL and ULR are summed up to get TLvv, relevant for
the vector-like theories such as QED or QCD, the quadratic term m 2 are cancelled
out and I I w is proportional to q2:
24 f1 A2
2 2 2 2
nvv{m t,m ,q ) = q {-J-^ jQ t(l-t)ln(m2t_tii_t)q2)dt}. (8.46)
This proportionality to q2 is the result of CVC valid in the vector-like theories, and
is an indication that in such theories the decoupling theorem holds.
The star quantities are functions of q2 and each of them has an infinite number
of observables as the functions of heavy particle masses appearing as the coeffi-
cients of the Taylor expansion in terms of q2. We, however, can argue that pos-
sible non-decoupling radiative corrections, oblique corrections, of heavy particles
are concentrated only in three parameters, called 5 , T, and [/-parameters (Peskin
and Takeuchi, 1990), which therefore make the analysis of New Physics simple and
transparent.
To see this, suppose we have a vacuum polarization function Uij(q2) where the
indices (i, j) take either SU(2) adjoint indices 1,2,3 or Q corresponding to unbroken
U ( l ) e m . At least as far as we retain in the 1-loop level, vertices of the diagrams
of vacuum polarizations are all proportional to dimensionless gauge couplings (at
higher loop levels we may have mass dependent couplings due to Higgs exchanges),
and we may rely on a naive dimensional analysis. Namely when we Taylor expand
it as the power series in q2 or equivalently in - ^ (M: a generic mass of intermediate
heavy particles),
we readily know that the order of magnitudes of Hij(0) and -j4 L |,2 = 0 are 0(M2)
and 0{q2), respectively, and the residual terms are at most of 0(*jfi-), which give
only decoupling effects. Thus as far as our focus is on the possible non-decoupling
effects we may treat only the first two terms of the expansion. Thus, at the first
sight, there seem to exist 8 (= 2 x 4) parameters, corresponding to the 4 choices
of the gauge indices (i,j): (1,1) = (2,2), (3,3), (3,<2), (Q,Q), or equivalently the
Oblique Corrections and S, T, U Parameters 207
choices of gauge bosons in the external lines of the vacuum polarization diagrams
(W+,W~), (Z, Z), (7,7), (Z, 7). Actually, not all of these parameters exist. In
fact, I1QQ(0) = n 3 Q(0) = 0 as the result of CVC of U(l)em: we never have mass
renormalization to the photon as we wish the photon to travel at the speed of light.
We thus have 8 — 2 = 6 parameters remain. We also should aware of the fact that
some of the remaining 6 parameters should be understood as to be used for the
renormaliation of three bare quantities, g,g',v describing the gauge sector. This
is because the first two terms of the Taylor expansion correspond to the radiative
corrections to the operators with mass dimensions d = 2 and d = 4 (in "broken
phase" of the theory). Thus among 6 parameters, three parameters are used for
renormalization, i.e. in the process to fix the bare parameters in terms of a, GF, MZ
etc.. In other words, these are inputs rather than the prediction of the theory. In
this way, the outputs of the theory, i.e. the genuine predictions of the theory, are in
the remaining 6 — 3 = 3 parameters, which are nothing but the S,T, {/-parameters
we are interested in.
As is clear from the above argument, such three parameters S,T,U should be
automatically finite quantities, as they correspond to operators, which do not exist
in the original Lagrangian: they need not to be renormalized. Actually, such finite
combinations of poralization functions have already appeared in the discussion of
star-quantities, i.e. in the attempt to show the finiteness of the star-quantities.
Namely the parameters are defined as follows:
aS = 4e2[n^3(0)-n^(0)], (8.48)
aT [ r i l l ( 0 ) n 3 3 ( 0 ) ] (8 49)
= c^| - ' "
2
aU = 4e [n' u (0)-1133(0)], (8.50)
where we have used a notation IIJ(O) to denote -^-j*-|92=0, which are nearly equal
to Uij(q2)/q2 for the case of IIjj(O) = 0, which were originally denoted as IIJ -, once
we ignore 0{^fi-). We can confirm that S, T, U really do not suffer from the
UV-divergences. The finiteness of S-parameter is guaranteed by the fact that there
is no mixing term of SU(2)L and U(l)y field strengths, Fl^v B^v in the original La-
grangian. T and U should not get UV-divergence since whose existence contradicts
with the "custodial symmetry" of the terms £ L i F^ F^v and ^ - £ 3 = i A^A^
in the original Lagrangian: no couterterm can be prepared for the quantities which
violate the symmetry of the original Lagrangian.
We can easily check that, in the approximation that C( ^ 7 2 ) is negligible, the
star quantities are now describable in terms of 5, T, [/-parameters:
J_ 1_
(8.51)
e2 \-KOL ~~
sl-sm28w\z ~ (8.52)
cr — S J 4
208 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
z
*~1 - 4hs> < 8 - 53)
Ml-Ml ~ 0, (8.54)
where e* and M* do not obtain quantum effects in this approximation, i.e. for
q2 <g. M2, since these quantum effects stem from 2nd or higher derivatives of Ily.
In addition, we can list up a related star-quantity p*(0) which is defined as the ratio
of the 4-Fermi coupling of low energy (q2 = 0) neutral current process to that of
charged current process:
p*(0) - 1 ~ a T. (8.56)
assuming that the number of the technicolor NTC = 4. Comparing with the upper
bound on S obtained from the precision measurements at LEP etc., Peskin and
Takeuchi were able to rule out the 1 "generation" technicolor model (Peskin and
Takeuchi, 1990).
The constraints on S, T, {/-parameters imposed by the precision experiments at
LEP etc. are not only used to get information on the new physics, but also quite
useful to constrain the masses of heavy particles in the standard model, i.e. mt and
TTIH • Especially, since the T-parameter (Ap) is quite sensitive to mf, it was possible
to predict rather precisely the value of mt as somewhere around 175(GeV), before
the direct discovery of the t quark at Fermi Lab.
We have seen that all of S, T, U parameters are all "calculable" quantities, free from
UV-divergence without any need of renormalization. ^From a bit different point of
view this is the consequences of global symmetries, which remain in the gauge sector
of the theory even after the SSB. To see this, let us take a matrix form of the Higgs
fields and introduce a global SU(2)n symmetry.
We start by writing the Yukawa coupling in the following form with the matrix
form $ of the Higgs field (ignoring its VEV for a while),
which just corresponds to the linear-u model discussed in QCD to describe the low
energy effective theory, where H and Gl are replaced by a scalar and pseudo-scalar
bound states of quarks, a and 7rl (i = 1,2,3). The technicolor theory takes this
analogy seriously and regard the Higgs fields as the bound states of hypothetical
fermions with quantum numbers "technicolors", instead of ordinary colors, called
technifermions T. In particular H ~ fvTu + TDTD (for 1 doublet model of techni-
color), though the Arc, the mass scale where the technicolor interaction becomes
strong, is scaled up to 0(lTeV).
If the degeneracy mu = ma is realized, the theory is invariant under a global
210 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
I) - (%)="{%)•&)<%)-«{%
$ -> $ ' = 9L * 9*R.
( 9L, 9R- elements of SU(2) i and SU(2) fl ) (8.62)
Let us note the squared absolute-value of the Higgs doublet is given in this matrix
form as Tr($t $) = H2+Gi2, which is clearly invariant under the SU(2)LxSU(2)R:
Tr($'t $') = Tr($+ $) : H'2 + G'i2 = H2 + Gi2. This means the real fields
H, G% behave as the fundamental representation of 5 0 ( 4 ) , which is equivalent to
SU(2) x SU(2), as is well-known in the group theory. Just as in QCD, after Higgs
develops its VEV v (H —> v + H), corresponding to non-vanishing < uu > = < dd >
in QCD, the chiral symmetry is spontaneously broken, but leaving its vector-like
subgroup SU(2)v generated by gi, = gR = g unbroken:
< 8 6 3 )
' " ( * * ) • '
unless fu = fd- This is intuitively trivial as mu ^ rrid should break the weak isospin
symmetry.
The SU{2)L gauge bosons A1^ (i = 1,2,3) and U(l)y gauge boson B^ transform
under the chiral symmetry SU(2)i x SU(2)R and the custodial symmetry as
where the representation of the U(1)Y gauge boson breaks into the two pieces
(1,3) and (1,1), since the weak-hypercharge may be written as y = I^R + ^y^S
where I$R is the eigenvalue of the 3rd component of "right-handed weak-isospin"
SU(2)R and B, L denote baryon and lepton numbers respectively. (Or we may
rewrite the Nakano-Nishijima-Gell-Mann relation as Q = I^i + I3R + ^ = ^ in
a left-right symmetric form, which really holds in the left-right symmetric gauge
theory SU(2)L X SU(2)R X U(1)B-L, though in the gauge group of the standard
model, we are taking now, the SU(2)R is not gauged.)
Paying attention to the fact that II33 — U'3Q = \TVZY denotes the mixing of
SU(2)L and U(l)y gauge bosons in the kinetic term, and that IIn — II33 = ifrin +
Operator Analysis 211
II22] — II33 we realize the representation of the oblique parameters under the chiral
and custodial symmetries:
S: ( 3 , 3 ) + (3,1) 1+ 3 + 5 (8.67)
T,U: (5,1) 5. (8.68)
where in the representation of S, 1 + 5 comes from (3.3) while 3 comes from (3.1).
Let us recall the simple algebra in SU(2) group theory, 3 x 3 = 1 + 3 + 5, and
note that only 1 and 5 are relevant for us, since vacuum polarization functions
should be symmetric under the exchange of gauge indices. The linear combination
5 [II11 + II22] — II33 can be expressed by a traceless 3 x 3 matrix and should belong
to 5 representation of SU(2), while traceful combination I l n + II22 + II33 behave
as the singlet. It is now easy to understand that T, [/-parameters should vanish in
the limit of custodial symmetry: T, U behave as a non-singlet of SU(2)v, namely
as 5, and should vanish in the limit of exact SU(2)v symmetry, since in the all
orders of quantum corrections, quantities contradicting with the symmetry never
appear. In such sense, the symmetry handling the T, U or Ap is the custodial
symmetry. Strictly speaking, it may be more suitable to say that actually what
handle these parameters is the global SU(2)L rather than the custodial symmetry
(Inami-Lim-Yamada, 1992) simply because these parameters are concerned only
with SU(2)L- (In fact to get 5 representation we need quartic term of quark masses,
or equivalently an operator quartic in the Higgs field, as we will see in the following
subsection, while the quark mass matrix Mq = -j?di&g(fu, fd) has a piece of 3
under SU(2)v and quadratic term of quark mass is enough to get 5, if we invoke
to the custodial symmetry. This becomes clear in the operator analysis extended
in the next section.) Then, what symmetry handles the S-parameter ? This time
obviously custodial symmetry is not the one, since the part (3,3) of S contains a
singlet 1 under the SU(2)y, the constant term ^ in (8.57), which survives even
in the limit of degenerate doublet (mt = m^). We may say that the relevant
symmetry for the parameter S is just the chiral symmetry SU(2)L X SU(2)R, as
the S-parameter behaves as (3,3), therefore not as an invariant under the symmetry
(Inami, Lim and Yamada, 1992). In this way we can readily understand why each of
the technifermion doublet gives additive contribution to S: each of the condensation
< TjyTfy > = < TDTD > break the chiral symmetry spontaneously, thus contributing
to 5.
We have seen T, for instance, is calculable finite quantity, although the gauge bo-
son mass-squared seems to be the coefficient of relevant operators with d = 2,
W+W~ etc. As was discussed in section 8.1, in the renormalizable theory a calcu-
lable quantity should be a coefficient of some irrelevant operator. It is known that
212 PRECISION TEST OF ELECTROWEAK RADIATIVE CORRECTIONS ...
It is easy to see that when the replacement § —> (0, -T=)' is made this operator
causes the mixing between the field strengths of SU(2)L and U(l)y, i.e. F^„ and
It is now obvious why the S, T, [/-parameters are all calculable finite quantities:
they are all described by the coefficients of higher dimensional d > 4 irrelevant
operators, such as those listed above, which do not exist in the original Lagrangian.
Thus the coefficients should be all finite as long as the theory is renormalizable. Now
the difference of decoupling and non-decoupling effects of heavy particles is also easy
to understand. The examples we discussed above tell us that when heavy particles'
contributions to physical observables are decoupling ones, there is a characteristic
feature that heavy particles get their masses through some new mass scales, such as
MSUSY, which are inevitably gauge invariant. In that case, the coefficients of the
irrelevant operators induced by a heavy particle will be suppressed by the inverse
powers of M, with M denoting a generic new large mass scale. For instance the
effective Lagrangian for the 5-parameter behave as
U = eiS^, (8.72)
where the physical Higgs field H has been ignored, since it does not appear in the
processes we are interested in (at 1-loop level). In this non-linear representation we
do not have the problem of getting infinite number of operators for one observable,
since WU = / is a c-number. Analysis based on the non-linear realization shows
that potential non-decoupling effects in gauge boson 2 and 3 point functions are
given by 7 independent operators. The reason why we can treat both of the 2 and
3 point functions on an equal footing is that in non-Abelian gauge theory they are
mutually related. For instance, Os not only has gauge boson 2-point function, but
also 3-point function as well, since Fl has both linear and quadratic terms of gauge
bosons.
Problems
8.1 Calculate the diagrams in Fig.8.3 to get the formula for Ap in (8.12). Also
show that (8.12) vanishes for nit — nn,.
8.2 Show that (8.17) is approximated by (8.18) when m„ d <C MgUSY {m\ —
m
l + MsusY,m2j = ™2d + M2SUSY).
8.5 Verify the formula (8.57) by use of (8.40), (8.41) and (8.48).
8.6 Show that the T-parameter is given as the coefficient of the operator OT in
(8.69).
This page is intentionally left blank
Chapter 9
FLAVOR PHYSICS AND C P
VIOLATION
The gauge sector of the standard model (SM) is theoretically uniquely determined
by gauge principle. The Higgs sector, on the other hand, has many arbitrary pa-
rameters, whose values cannot be theoretically predicted. Especially so far we have
no definite idea about the origin of quark and lepton masses and generation or fla-
vor mixings, i.e. the origin of the Yukawa couplings of Higgs field. Issues related
with quark and lepton flavors, "flavor physics", is therefore very important clue, not
only to the the confirmation of the standard model, but also to the search for some
theories beyond the standard model, "New Physics", where the origin of flavors is
expected to be understood more deeply.
Among such flavor physics, Flavor Changing Neutral Current (FCNC) processes
are of special interest. They do not exist at tree or classical level in the standard
model, and are induced only at loop or quantum level. Therefore the observation of
such FCNC processes will provide us valuable information on the contributions of
all particles as the intermediate states. They are, therefore, very suitable, not only
to get information about the flavor mixings of relatively heavy t quark with lower
generations, but also to search for possible heavy particle effects of New Physics.
Namely the studies into the FCNC processes is important for the progress of particle
physics. The rates of such FCNC processes are known to be suppressed by both of
the higher order of the perturbation theory and the small mass difference or small
flavor mixings. Thus, the FCNC processes are often called rare processes.
In fact, historically the rare processes have played very important role in the
foundation of the particle physics. We may list up some of the most important
events, some of them have already been discussed in 4.6.2: the introduction of
c quark, in order to naturally suppress the rare processes of neutral kaon such as
K° <-> K° mixing, by GIM (Glashow, Iliopoulos and Maiani, 1970), the prediction of
the mass of such predicted c quark by Gaillard and Lee (Gaillard and Lee, 1974), the
introduction of the third generation to implement the observed CP violation in the
neutral kaon system by Kobayashi and Maskawa (Kobayashi and Maskawa, 1973),
215
216 FLAVOR PHYSICS AND CP VIOLATION
which embodies the standard model nowadays, the lower bound on mt imposed by
the data on B° <-> 5 ° mixings, obtained before the direct discovery of t quark, etc.
The CP-violating processes are also known to happen rather rarely in neutral
kaon system. In such sense, the physics of CP violation also provides valuable
information of the heavy t quark and/or heavy unknown particles of New Physics.
We should further note that the Kobayashi-Maskawa (KM) model of 3 generations
was originally devised as the theory to accommodate CP violation. In spite of the
remarkable success of the KM model, such as N„ = 3 (N„: the number of light
neutrinos) at LEP, yet there has been no final conclusive argument on the origin
of CP violation. So, the confirmation of the prediction on CP violating observables
of the model, such as CP asymmetries in B decays, is an urgent necessity to really
establish the standard model. We may even find some evidence of some New Physics
in the course of the investigation.
The FCNC processes and CP violation is, in principle, mutually independent. In
fact, in the model of spontaneous (or soft) CP violation (Lee, 1974; Weinberg, 1976),
the CP violation is attributed to the complex phase of the vacuum expectation value
(VEV) of the Higgs created by the suitably chosen scalar potential, and has no direct
relation with the flavor mixing. On the other hand, in the mechanism of KM, both
of quark mass matrix causing the flavor mixing and CP violation come from the
same Yukawa couplings. Therefore they are mutually very closely related. In fact
in KM model we will see below that the CP violation needs mass differences and
flavor mixing, (of course) in addition to the complex phase in the KM matrix. CP
violation, therefore, is observed in FCNC processes, such as K° -H- K° and B° «-> B°
mixings.
The purpose of this chapter is to discuss the rare processes due to FCNC and
the CP violation. We assume the theory we work on is the standard model, since to
provide the results in SM is quite useful even if we further investigate New Physics.
Furthermore, some of the formulae we derive are readily applicable for the class
of New Physics, where the Yukawa coupling of the Higgs has the same structure
as that of the standard model, e.g. the model with four or more generations,
minimal supersymmetric stadard model. In this chapter we will discuss FCNC rare
processes only in the quark sector. Once neutrinos become massive, the lepton flavor
violation, i.e. FCNC processes in lepton sector, also becomes physically meaningful.
We, however, will not deal with the lepton flavor violation here, since the neutrino
oscillation as the typical example of lepton flavor violation has been extensively
discussed in chapter 6.
As was discussed in 4.6.2, GIM escaped FCNC processes at tree level, by introduc-
ing c quark. What GIM has proposed is that by introducing c quark, the weak
isospin of s quark became the same as that of d quark and even after the uni-
Flavor Symmetry and FCNC Rare Processes 217
tary transformation to the mass eigenstates the neutral current is kept flavor- or
generation-diagonal. We may say that the essence of their idea is to make the
gauge interaction of each sector of fermions with definite electric charge and chiral-
ity universal, i.e. to make it invariant under global SU(ng) transformation, with ng
being the number of the generations (ng = 3 in KM model). If we call the gauge
interaction to connect the elements of doublet "vertical symmetry", this flavor (or
generation) symmetry may be called as "horizontal symmetry". By imposing such
global flavor symmetry, the FCNC, such as sj^d coupled with Z boson or photon are
strictly forbidden at the tree level. We call such flavor conservation in the neutral
currents as "natural flavor conservation". The concept of natural is used to denote
some property which is naturally guaranteed by some symmetry of the theory irre-
spectively of the tuning of the parameters of the theory. From such point of view,
for instance, small Higgs mass in the standard model is not natural (the hierarchy
problem), but it can be naturally small in the SUSY standard model, etc. Glashow
and Weinberg have summarized the conditions for the natural flavor conservation
in neutral currents coupled with gauge or Higgs bosons (Glashow-Weinberg, 1977):
1. Fermions with the same electric charge and chirality should belong to the same
representation of the gauge group.
2. Fermions with definite electric charge should couple with only one Higgs doublet.
The first condition is the condition for the neutral current gauge interactions do
not have FCNC. If the condition is met, a neutral current J^ ' generally has a form
in the base of weak-eigenstates
and this form is invariant under the unitary transformations to the mass eigenstates,
since the transformation is among the fields with the same electric charge and chi-
rality, such as UIL, U2L, •. -, and therefore the unitary matrix for the transformation
takes the form of block-diagonal:
etc. The second condition is necessary for the Yukawa coupling to naturally con-
serve flavors. If, for instance, up-type quarks are allowed to couple with two Higgs
doublets <f>i,<f>2, their neutral Higgs fields generally have the following Yukawa cou-
plings,
w+
\ u,c,t I
£
s — • — - » -^ » -2—» d
Fig. 9.1
the Yukawa coupling of the orthogonal linear combination ,} [v2 <£? — ^i ip®]
V"l+,;2
causes the FCNC at the tree level. Let us note that in SUSY standard model,
because of the chirality (holomorphic property) of the super-potential, even though
we have two Higgs doublets, there exists a selection rule to forbid the simultaneous
Yukawa couplings of the two doublet Higgs.
Since the standard model satisfies the Glashow-Weinberg's condition, the FCNC
is forbidden at the tree level. The flavor symmetry SU(ng), however, is broken by
the quark mass differences in each of up- and down-type quark sectors, i.e. by
mu 5^ m c ^ mt etc. We may still have symmetries of each generation number, i.e.
a sub-symmetry U(l)ng, if there is no generation mixings, but we know that there
exist such mixings and the flavor symmetry is thus completely broken. We thus
expect that at the quantum or loop level of Feynman diagram the FCNC processes
are induced by charged current interactions, which have generation mixings via
KM matrix U. In Fig. 9.1, we have a diagram with W + -exchange, which induces
effective dsZ and ds-y vertices.
The amplitude of Fig. 9.1 is expected to vanish for an exact SU(3) symmetry,
rnu = mc — mt- In fact, we easily find this is the case, since in this case the
orthogonality of the KM-matrix J2% U*dUis = 0 works. Thus, to get FCNC we
need the violation of flavor symmetry due to both of non-degenerate quark masses
at up- (and down-) type quark sector, and flavor mixings. More intuitively we
may say that when the degeneracy mu = mc = mt holds there is no mean to
distinguish the (u,c,t) quarks. Thus we may treat (u',c',t') obtained by arbitrary
unitary transformation of (u, c, t) as mass eigenstates with an equal right. Thus it
is always possible to choose the unitary transformation of the left-handed up-type
quarks, so that it makes the KM-matrix unit matrix, thus making the generation
mixing meaningless. Suppose we have only two generations with a mixing angle 6C.
2 2 2_ 2
Then the above amplitude is proportional to U*dUcs " 1 ^ f i m " = cos#c sin#c m'Milv —
6 x 1 0 - 5 . We learn from this simple exercise that the amplitudes of FCNC processes
are suppressed by small mass-squared differences (the GIM mechanism). If we
include the third generation the relevant mass differences ml — m\ c are not small
Rare Processes in Kaon System 219
compared with M ^ . But, on the other hand, the mixings of t with 1st or 2nd
generations are small compared with sin#c, thus making the amplitude rather small.
Thus, the rates of FCNC processes are tiny and these processes are often called "rare
processes". It is worth noticing that the FCNC processes in quark sector and the
neutrino oscillation in lepton sector are both induced by the mass differences of
different generations and generation mixing. This is not surprising, as the neutrino
oscillation is one of FCNC processes.
As we have seen in the introductory argument in section 9.1, the rare processes in
neutral kaon system of {K°, K°) have played very important roles in the establish-
ment of the standard model. (K°, K°) are the bound states of s and d quarks and
their anti-quarks, K° ~ §75d, K° ~ J75S. There is a freedom of relative phase
of these two states, but here we adopt a convention CP \K°) = —\K°). We may
define a quantum number (a U{1) charge) "strangeness" S carried only by s and s
quarks, S = - 1 for s, S = 1 for s, S = 0 otherwise. The neutral kaons, K°, K°,
thus have S = —1, 1, respectively. In the FCNC processes of kaons, the strangeness
changes, and we may classify the rare processes by the extent of the change of the
strangeness:
(a) | A 5 | = 2: K ° H K°,
(b) | A 5 | = 1: KL -> up, K+ -»• TT+I/P,
where in the decay K+ —• -K+VV, the contributions of neutrinos of all generations
should be summed up, as long as they are light. We discuss these two types of
FCNC processes separately below.
(a) | A 5 | = 2 process
The CPT theorem tells us that the masses and the life time (total decay width)
of particle and its anti-particle should be exactly the same. Thus if only strong
interaction QCD is switched on, K° and K°, being anti-particles of one another, are
two degenerate states without any interaction (as the strong interaction preserves
the strangeness S). The situation is somehow similar to the case of double-well
potential with very high wall separating two degenerate vacuum states |1), |2) in
the quantum mechanics (see Fig. 9.2).
In the case of double-well potential, as the height of the barrier wall gets reduced
the tunneling effects connecting two states |1), |2) starts to operative. Then the
Hamiltonian in the base of these states takes a form
(9 4)
" = ( S %)• -
220 FLAVOR PHYSICS AND CP VIOLATION
V(x)
Fig. 9.2
Even if the tunneling effect Et is small, the energy eigenstates get large modification,
M M12
H = (9.6)
12 M
where M is the common mass implied by the CPT theorem and Mu is due to the
K° «-» K° mixing. For simplicity, here we have ignored absorptive part, denoted by
T, coming from the on-shell intermediate states, which correspond to the fact that
the neutral kaons are unstable states decaying into irn etc. Thus the Hamiltonian is
hermitian, though we will treat non-hermitian Hamiltonian including the absorptive
part later in this section. We further simplify by ignoring a small CP violating
effect in the element M\i due to the CP phase of KM matrix for a while, assuming
M,12 Mi2. Then just as in the case of quantum mechanics, we find two mass
(energy) eigenstates
mi = M - M12, m 2 = M + M i 2 . (9.8)
As we have ignored the possible CP violating effect, these states have definite CP
eigenvalues:
U, C, t u,c,t
S «—5 « T—•— d S «—5 « 1 « d
\w+ V
u,c,t < u,c,t
d —*—£ •*- <
~~*— s •
(a) (b)
u, c, t
s —«- -*— d •— d
<pi 'W
u,c,t
d -*- • —
(c) (d)
Fig. 9.3
Actually when we include the CP violating effect and the absorptive parts, real
eigenstates of the Hamiltonian are Kg and KL states with shorter and longer life
times, which are nearly equal to K\ and K2, respectively. Their mass difference
quark exchange in this box diagram has a term proportional to m\, which comes
from the diagram with </?+ and (p~ exchanges (at least for £ 7^ 0), which has 4
Yukawa couplings yielding a factor (jm ( /Miv) 4 . The factor is then multiplied by
1/wt 2 (for mt > Mw) coming from dimensional analysis, thus producing a net
effect ( p 4 m t ) / M ^ , a non-decoupling effect of heavy t quark. Note that this box
diagram is only possible quark diagram at 1-loop level; the diagrams with 1-loop
induced FCNC Z- or 7 (photon)-vertices, which are relevant for | A 5 | = 1 processes,
such as KL —> up,, changes the strangeness only by 1 unit and do no contribute to
the | A 5 | = 2 process K° *•> K°. In the pioneering calculation by Gaillard-Lee,
an assumption that the masses of intermediate up-type quarks are much smaller
than Mw was made, which is a good approximation in the 2 generation (4 quark)
model, m c <S Mw, which was the standard picture of that time. The calculation,
however, needs to be revised, since we now know that there are 3 generations of
quarks and mt is greater than the weak-scale Mw- The calculation which is valid
222 FLAVOR PHYSICS AND CP VIOLATION
for the arbitrary masses of intermediate up-type quarks was performed (Inami and
Lim, 1981) and the result can be represented in a form of effective 4-Fermi | AS\ = 2
operator (see also (8.14)):
3 1
h (9-12)
4 (a* - 1 ) ^ - 1)
and L = ^-j^, Vis etc. are K-M matrix elements. We have replaced the original
coefficient function E(xi,Xj) by E{xi,Xj) = E(xi,Xj) — E(xu,Xj) — E(xi,xu) +
E(xu,xu), setting xu = -rrf = 0. The additional three terms exactly vanish by
m
w
the orthogonality of K-M matrix J2i V*sVid = 0, £ \ V£ V^ = 0. This is why the
summation of i, j is only over c and t. When both intermediate quarks coincide we
get
3
EM ss E(Xi,«) = - ^ ) In *, - «[\ - f ^ - I j ^ l (9.13)
Though the above result is valid for an arbitrary intermediate quark masses, the
external quark masses md, ms and their 4-momenta have been ignored there, which
is a good approximation. To get M 12 , we need to evaluate the matrix element of
the 4-Fermi effective operator with respect to kaon states,
where /K, mx are decay constant and the mass of if-meson (TTIK — M in (9.6)), and
the B is the "bag-parameter", which parameterizes the ambiguity due to the non-
perturbative QCD effects to form the bound states K° and K°, and the reference
value B = 1 corresponds to the vacuum saturation, (K°\(sjliLd)\0)(0\(s'y,iLd)\Ko).
Thus writing the off-diagonal element of the neutral kaon mass-squared matrix as
6m2 = -(K°\c[Aff=2\K0)
= ~%R \ f2Km2KB [(V*Vid)(V;sVjd) Eix^xj)}, (9.15)
V2 67rsin 9w
the mass-squared matrix reads as
and taking the square-root we obtain the Hamiltonian at the rest frame, with an
approximation 8m2 <C M 2 ,
M
H^['^ Sm'2 J £ ), (9-17)
2M
s 2
which means M 12 ~ j ^ - . We thus obtain
„_ , , Re<5m2
ArriK - 2ReMi 2 —
M
f
- ~%K \ KmKB M(V*V )(V*sVjd))
id E{Xi,Xi)
V2 67rsin 8w
~ - 3 . 8 x 1 0 - 2 m K B M(V*sVid)(V*sVjd)} E{XUXJ). (9.18)
(b) | A 5 | = 1 processes
The typical | A 5 | = 1 processes are KL —> Up- and K+ —> TX+VV. (KL —> 7r°e + e _
is the signature of CP violation and is another interesting process to study. But
the derivation of its decay rate is obtained in a similar manner with the argument
below, and will not be given.)
Now the diagrams with 1-loop induced FCNC Z- or 7 (photon)-vertices also
contribute to the processes, and the quark Feynman diagrams, relevant for the
+
KL —• nfi and K -> TT+UU, are shown in Fig. 9.4, where ui denotes all possible
light neutrinos (I = e,/u,r for the standard model). Note that the 7-exchange
diagram obviously does not contribute to the decay K+ —> TT+UU. The 1-loop
induced FCNC Z- or 7 (photon)-vertices, denoted by blobs in Fig. 9.4, are the sum
of the sub-diagrams shown in Fig. 9.5.
224 FLAVOR PHYSICS AND CP VIOLATION
W+,ip+ W+,<p+
[ W W W r rvww\>—•- vi
H,Vl
t*,Vl
Fig. 9.4
One may wonder why we should include the 1-particle reducible diagrams with
FCNC self-energies such as Fig. 9.5(a), and wonder if such FCNC self-energy should
be eliminated by adding a suitable counterterm for the external quarks d, s to have
physical meaning as the eigenstates of propagation. Such procedure is also pos-
sible, but it turns out that the contributions of all counterterms just cancel with
each other in the FCNC processes (Botella and Lim, 1986), intuitively because the
counterterms for the FCNC quark self energy and for the FCNC vertices of Z or 7
gauge bosons are not independent.
We first display the results for the 1-loop induced effective FCNC Z- and 7-
vertices (Inami and Lim, 1981):
£>sdZ 2
J2 (V*Vid)rz(xi)(s-L^dL) Z\ (9.20)
47TSin #iy COS0W ^
47rsin 2 0 w 2M 2 , £ (v*vid)
i=c,t
s[Fi(xi)(q2^ - q^i)L + F2{xi)atlviqv{msL + mdR)]d A", (9.21)
where the effective sd-y vertex is given in the momentum space (q^ = p£ — p^). The
coefficient functions are given in terms of the interemediate up-type quark masses
x, = ml/Myy (i = u, c, t) as
rl
r*(*i) = [\ In Xi + 7 ( 1 1 , O i (9.22)
4 8x~^llXi+8(xi-l)2
1 1 13 1 1 1
Fi(Xi) = Q{[ ] Xi
12 n - 1 + 12 (Xi - l ) 2
2 (xi - l ) 3
1 1 1 1 1
+ ri
3xi-l 3(xi-l)2 6 ( z i - l ) 3 + 2(x,
—] xMxi - ^H~)}
Rare Processes in Kaon System 225
7 1 13 1 1 1
[ +
3xi-l 12(34-1)2 2{Xi-\f]Xi
1 1 35 1 5 1 1 1
l
6 Xi - 1 12 (a* - l ) 2 6 (Xi - l ) 3 +
2 (a* - l ) 4 J Xi
'
- 27(^,0, (9-23)
rw x ^rr 1 1 3 1 3 1 . 3 xl . ,
*i(*i) = - Q { [ - i - ^ + i I - _ ^ + -I-_^]a;i--I-_TFIna;i}
r1 1 9 1 3 1 . 3 a ; ? . ,,._,.
where the term ln(^-) in F\ (x^ has an infra-red singularity at the limit xu, mu —> 0,
and xu dependence has been kept only there. This term just corresponds to the
contribution of "penguin-diagram" with gluon-exchange (see Fig. 9.6) discussed
below (we may call the logarithmic contribution as the contribution of "electro-
weak penguin" diagram). In Fifi{xi), Q denotes the electric charge of the internal
up-type quarks, i.e. Q = §. We have left the factor Q free, so that we can clearly
distinguish the contributions of the different types of diagrams where the photon is
attached to the internal or external quarks, respectively. This makes, for instance,
the calculation of sdg (g : gluon) vertex, namely the calculation of the gluon-
penguin-diagram, really straightforward (Problem 9.2). The appearance of the £-
dependent term
x
f ,\ 1 ,3 1 1 1 , ,
X
4. r (5£ + l ) z , 2x-jx\
implies that the effective vertices are not gauge invariant, though the Pauli-term
accompanied by F2 is gauge invariant, since only this term contributes to the decay
s —> dj with the on-shell condition q2 = 0. The remaining gauge dependence is
cancelled when the contributions from box diagrams are also summed up, as we will
see below.
Let us note that the sdj vertex has two pieces, the terms with the currents
[s(<Z 2 7 M _ 9 M ^)^ a n d {sGnvd)q", called "charge radius" and "Pauli" terms, respec-
tively. These currents automatically vanish (without the use of equations of motion),
when multiplied by <?M, thus satisfying the current conservation of electromagnetic
current (CVC). Note that in the FCNC vertices the current conservation is not
satisfied by use of the equations of motion, since s and d have different masses, but
satisfied by these specific forms of the currents including the external momentum
q^. Thus the sdj vertex vanish in the limit of gM = 0 , and we need to calculate the
diagram keeping the 0(q2) terms, in contrast to the case of calculations of other
FCNC effective Lagrangians. This proportionality to q2 makes the non-decoupling
effect of heavy t quark rather mild in the sdj vertex, i.e. In (mt/Mw), while in
226 FLAVOR PHYSICS AND CP VIOLATION
w+
ipT
s S, u,c,t ^d d s d
Z,7 z, 7
(a) (b)
W+
w+
S "" ~" N
s s ,y(^d s s '' ^ d s d s
Fig. 9.5
sdZ vertex we have an non-decoupling effect proportional to m2, just as in the box
diagram (see (9.22)-(9.24)). The reason is rather easy to see; in the sdZ vertex
the (p+-exchange gives the Yukawa coupling factor mt/Mw twice, thus giving the
non-decoupling effect proportional to m2, while in the sdj vertex, the additional
q2 factor enforces the remaining piece to have mass-dimension d = — 2, behaving as
1/mf, thus cancelling the m2 from the Yukawa couplings. In this way, when the
FCNC 7 vertex or similarly FCNC gluon vertex is relevant, the contributions from
the lighter u.c quarks become significant. Especially the penguin-diagram shown
in Fig. 9.6 with FCNC gluon vertex is expected to play an important role in the
explanation of "A7 = | rule" in K —> TTTT decays.
Write the effective 4-Fermi Lagrangian relevant for the decays of our interest as
4/f=1 = ^ 2f 47rsin
, . ° L E (v;avid)[C{xi)(sZ'rltdL)(Tii'f^)
v °w it?,t
3
- ££(xi,^)(^7MdL)(I^7'Vii)]) (9.26)
where yj = jjf- with m;;. denoting the charged lepton mass of the j-th generation.
Rare Processes in Kaon System 227
w+
Fig. 9.6
The coefficient functions are the sum of the contributions from the box diagrams
and the Z-exchange diagrams in Fig. 9.4:
We readily know that Cz(xi) = Dz(xi) = Tz(xi). The contributions from the box
diagrams are known to be
d X%
Cn(xi) = lnxj — l(x,Q (9.28)
8 (xt - l ) 2 > Xi — 1
Dn(xi,yj) - fVi
8 Vj - Xi JJi
yj - 1
J.• r r Xi
Xi „fXjX{ **. \0
^2 , *^i
•"% ' •l -i i
x
+ 8\yj - Xi Xi - 1 ' Xi-V \xi\nxi
9 1
-•y(x,£). (9.29)
8 yj - 1 Xi - 1
Combining these two types of contributions we finally get gauge invariant coefficient
functions
1 3 Xi
C(xi) = -( ) lnx, + -xt - (9.30)
4 Xi - 1 ; X' 4 ^ - 1'
^ ^ = -8^^(^ri)ln^
1, • Xj
*fc j l l l j / j
81?/.,- -XiXi - V (xi-1)2
1 3. „ 1 . xt
+ -:Xi--(l +3 -) -. (9.31)
4 8 yj - 1 Xi - 1
Note that the piece with vector current fij^n does not contributes to the KL —>
^/i decay. This is because when gM coming from the hadronic matrix element
(0|si7pdL|.Kz,) = — !(0|s7M75(i|.ft'L) is multiplied to the leptonic current, the CVC
implies qli(p,'y^n) = 0. Thus the photon-exchange diagram in Fig. 9.4 does not
contribute to the process KL —» fiji, as the matter of fact.
228 FLAVOR PHYSICS AND CP VIOLATION
Fig. 9.7
We thus obtain formulae for the ratios of the branching ratios of two processes to
those of reference decay processes, in terms of the coefficient functions given above,
Br(KL -> w2) sd T(KL) a 2 [ R e ( £ i = C | t VSVidCfo))] 2 . .
Br(K+ - £„„) ~ T(K+) Uirsm2ew> V*s > ^ )
Br{K+->it+v9) a 2 ^ | £<=c,t V S ^ O * ^ ) ! 2
Br{K+ _> noe+„e) ^4,mbx29w> 2- (Ks)2 - &•*>)
where the subscript "sd" denotes the short distance (or perturbative) contribution
to the process, without including the two-photon process discussed below, and in
the estimation of KL —> fip, the difference of phase spaces of KL and K+ decays
have been ignored. In the case of KL -> up, there is an additional contribution from
FCNC two photon vertex, as shown in Fig. 9.7, whose presence makes the precise
prediction of the branching ratio difficult, though including the two photon process
the above formula gives a reasonable prediction, compatible with the observed value
Br(KL -» tip,) = (7.25 ± 0.16) x lCT 9 .
On the other hand, the branching ratio Br(K+ —> •K+VV) can be reliably esti-
mated, as the photon process does not contribute to this decay. When the values of
quark masses and generation mixing angles (or KM matrix elements) are substituted
and an approximation yj = 0 is used, i.e.,
the above formula gives Br(K+ ->• -K+VV) of the order of 0(1O - 1 0 ), which seems
to be quite consistent with the recent result of BNL experiment, which has claimed
the observation of the rare decay.
We learn from (9.7) that K° or K° is the mixture of Ki and K2, or more
precisely Ks and KL, with mass difference A m # due to the K° «-» K° mixing.
Hence, if we can prepare an initial state of K° or K°, it's existence probability
will oscillate in time due to the interference effects of matter waves of Ks and KL,
just as the neutrino oscillation discussed in chapter 6. A feature not shared by the
neutrino oscillation is that neutral kaons decay into TTTT etc., when they propagate.
Thus the time evolution Hamiltonian have (effective) imaginary or non-hermitian
part to effectively represent the decay. In order to accommodate the absorptive
flare Processes in Kaon System 229
/ M-fr M12-ir12\
(9 35)
Ur2-§r12 M-fr J' -
where M, V, corresponding to the mass and the decay width of neutral mesons in the
absence of the mixing, are real, while M\2 and Ti2 may have imaginary parts due
to CP violation. Though this Hamiltonian is applicable to both of neutral kaon and
neutral B-meson system (Bd ~ b^d,Bd ~ d^b;Bs ~ b^s,Bs ~ S756), here we
will discuss the neutral B meson oscillation. (Here we discuss only (Bd, Bj) system,
and will simply use the abbreviation B° for Bd, unless otherwise mentioned.) One
reason for this is that in the neutral kaon system the lifetimes of KL and Ks are
too different for the interference to be operative, while in the B meson system the
difference of lifetimes due to Ti2 is very small.
The eigenstates of the Hamiltonian
|Bl) s
vw^{PlB0)~q{*0))' (9-36)
1
\JM{2 - f r; 2
(9.38)
p
y/M12 - f r 12
AA = A2 - Ai = Am - %- A r
(Am = m 2 - m i , Ar = r 2 - r 1 ) , (9.40)
with
l
Am - i Ar = 2 ^ M 1 2 - ^r 1 2 yJMt2 - -T*12. (9.41)
The time evolution of a state ip(t) = c(t) \B°)+c(t) \B°) is governed by a Schrodinger
equation
230 FLAVOR PHYSICS AND CP VIOLATION
where the Hamiltonian H has been given in (9.35). The diagonalization of the
Hamiltonian
g±(t) = |[e-^*±e-^«]
Let |2?(i)) and |B(t)) be the states with the initial conditions |2?(0)) = |B°),
|B(0)) = |B°). As |B°), e.g. corresponds to c(0) = 1,5(0) = 0 we easily find
U,C |
u,c I
b
(a) (b)
Fig. 9.8
dominant contribution. Thus the absorptive part is suppressed compared with the
dispersive part by 0(m\lm\) -C 1. This is the reason why A r is negligible in the B
system. The formulae (9.48), (9.49) remind us of those in neutrino oscillation (with
maximal mixing). One remarkable difference is that in the B° <-> B° oscillation,
the sum of the probabilities decrease as the time goes on, which is the consequence
of the loss of the unitarity due to the non-hermitian Hamiltonian. So, when we
study observables sensitive to the oscillation, such as CP asymmetries in B°, B°
decays, Am x r = ^- = x needs to be sizable, which is satisfied in B°, B\ system,
where x is known to be ~ 0.7. (For the B°, B° system, x is much larger, relatively
enhanced by \Vts/Vtd\2-)
We may define the ratio of the time-integrated transition and survival probabil-
ities
J~ P(B° ^ BQ)dt x-
~ /0°° P(J3° -> B°)dt ~ 2 + x 2 ' {
'
where a formula
f
Jo
e-rtcos(Ami)dt=^^-^ (a; = - p ) , (9.51)
If J2 ^ If I" » # # ! • (9-53)
Unfortunately, this deviation of |*| from 1 is quite small, and to see the CP violating
effect is not plausible. The reason is the following. The key ingredient to understand
the situation is in (9.38). There we see that if the M i 2 and Ti2 have the same phase
factor, | | | = 1, even though £ itself has a large phase factor (, which produces
large CP asymmetries in B decays as we will see later). As we see in Fig. 9.8, the
absorptive part Ti2 is due to the contributions of u, c quarks. In the hypothetical
limit m u = m c , the orthogonality of the K-M matrix V*bVuli + V*bVcd = —V*bVtd
implies that M12, handled by t quark, and Ti2, handles by u,c quarks have the
same phase factor, leading to |£| = 1. We thus learn the deviation of | a | from 1
is strongly suppressed by a factor (m2 - m2u)lm2. The difference m2 — m2u was
compared with m2, since in the limit of Ti2/Mi2 —> 0, again |^| = 1 is realized.
The 3 generation Kobayashi-Maskawa model was proposed to account for the ob-
served CP violation. As we have briefly discussed in the section 9.1, in this model
CP violation is closely related with FCNC processes. In other words, the CP vio-
lation needs the breaking of the global "horizontal" or generation symmetry ST/(3)
(ng = 3), which is also needed in FCNC. To see this, suppose mu = mc. Then,
51/(2) sub-group of SU(S) in the up-type quark sector, i.e. the symmetry between
u and c quarks becomes an exact symmetry. By use of this SU(2) symmetry, it
is always possible to perform a suitable unitary transformation, belonging to the
SU(2), so that Vud vanishes, while we can always make the 1st column and the 1st
row of the KM matrix real by suitable re-phasing of quark fields q —> el<i> q. Then,
it is easy to show (see Problem 9.4) that we can perform a re-phasing such that the
all matrix elements become real. We learn from this example that if there is a mass
degeneracy in either of up- or down-type quark sector, CP violation never happens
(except for the possible strong CP violation, which we ignore).
Thus there arises a natural question: what is the (necessary and sufficient)
condition for the CP violation in the Kobayashi-Maskawa model? In particular we
ask what is the re-phasing invariant quantity which characterizes the CP violation.
The importance of the re-phasing invariance should be stressed here. In (4.181), the
CP violating phase 6 appears in the matrix element of, e.g. Va,, while in (4.183) the
CP Violation in Kobayashi-Maskawa Model 233
U | U
W+ <ww\x(^J d
W w+
G G*
Fig. 9.10
matrix element is real. This is a little puzzling, since if we adopt (4.181) it seems
a process with t -> b transition has CP violating effect, while if we adopt (4.183)
there seems to be no CP violation in the process. Actually the CP violating phase
of (4.181) can be eliminated by a suitable re-phasing of t,b. As the physics should
be the same, irrespectively of the re-phasing, this means it is meaningless to talk
about such re-phasing variant quantity Vtb to judge whether the CP symmetry is
violated or not, and we should seek a re-phasing invariant quantity, which is due
to the interference of different KM matrix elements. Let us now come back to the
condition for the CP violation. We first note that any observables related to quarks
are represented by the cuts of Feynman diagrams with closed fermion loops. In fact
if a transition amplitude A is represented by a diagram G, its complex conjugation
A* is represented by a diagram G* where the directions of the arrows of all fermion
lines are reversed, as the complex conjugation implies the charge conjugation. Thus
\A\2 = AA* is represented by combining G and G*, namely by the cut of closed
fermion loop (see Fig. 9.9 for the example of W+ —> ud decay).
In the language of weak-eigenstates, the origin of the CP violation is attributed
to the complex Yukawa couplings or mass matrices of up- and down-type quarks,
Mu, Md, inserted into the fermion line of the closed loop (see Fig. 9.10).
234 FLAVOR PHYSICS AND CP VIOLATION
Im Tv{HdHuH2dH2u\. (9.55)
It can be shown that there appear no more independent imaginary part (Gronau,
Kfir and Loewy, 1986). On the other hand some arithmetic shows that
Im Tr[HdHuHJH2} ^ 0 <-> det [Hu,Hd] ^ 0. Thus we find the condition for
the CP violation is
det [MuMl MdMd) ± 0. (9.56)
By use of the fact Hu = U^L dia(m£,m;?,mf) UUL etc. {UUL is the unitary matrix
used to diagonalize the up-type quark mass matrix), we can show that the above
condition is equivalent to (see Problem 9.5)
{ml - m\){m\ - m2t){m2c - m2)(m2d - m2s)(m2d - m2b)(m2s - m2b) x J ^ 0, (9.57)
W w+ Im
(a) (b)
Fig. 9.11
vcavc}
Fig. 9.12
vcdv;b (~ A 3 ) vcdv;s (~ A)
(a) (b)
Fig. 9.13
is easily understood; when the triangle is closed the inner angles are either of 0,
j or 7r, which means sin 2fl = 0 for all inner angles. Let us note that now the
re-phasing invariance of CP violating observables are manifest; though the each
side of the unitarity triangle rotates under the re-phasing, the shape of the triangle
is manifestly invariant, so are the inner angles. The shape itself clearly changes
depending on the choice of (a,/3), while we know every CP violating observable is
described by an unique parameter, the Jarlskog parameter J. What happens is,
although we have several different triangles depending on (a, /?), their area S is
known to be unique and just the half of J:
where fly is the angle between ViaV*0 and VjaV^ (fly < n). As long as the area S
is unique, the CP symmetry will get large in the system whose triangle has three
sizes with comparable lengths, as in the case of B meson system (a = b, (3 = d) we
will see later, while if only one of the sizes is very small the inner angles will be near
to 0 or | , and the CP asymmetry will get small, as is the case of K meson system
(a = s,P = d). See Fig. 9.13.
So far our argument has been focused on the CP phase in the KM matrix and the
resultant Jarlskog parameter. We have also seen that for CP to be really broken the
non-degeneracy of quarks masses in both of the up-type and down type sectors is
needed. But, actually we still need an additional condition for the CP violation: the
CP Violation in the Neutral K System 237
presence of the absorptive part in the relevant transition amplitude (the cut in the
Feynman diagram). In fact, in the end of the section 9.3, we have already seen that
T 12 = 0 leads to a vanishing CP violating effect in B meson system. The necessity
of the absorptive part is seen from the following illustrative argument. Suppose the
transition amplitude of a process A gets contributions form two different Feynman
diagrams A = A\ + A2. We will decompose the each amplitude to the part which
possibly contains the CP phase 8 due to the weak interaction, and a phase factor
indicating the possible presence of the absorptive part due to the on-shell property
of the intermediate state: A1<2 = Awl>2 ei5l~2, A = Awl eiSl + Aw2 ei62. The
transition amplitude of the process between the CP-conjugated states is written as
A(cp) = A*, etSl + A^2 e'*2. Let us note the phases 8i>2 do not change their sings,
since they have dynamical origin and have nothing to do with the KM CP phase.
The CP asymmetry proportional to the difference of squared absolute values of A
and A(cp} now reads as
| A | 2 _ ^ ( C P ) |2 = 4 g i n {h _ h ) lm{A*wiAw2y (g.ei)
The CP violation had been observed only in the system of neutral kaon system
(K°, K°), although the B-factory experiments in KEK and SLAC, recently started,
have begun to release the data on the CP asymmetries in the decay processes of
neutral B meson system (B°,B°). In fact, historically the first confirmation of the
CP violation was made by the detection of the long-lived neutral kaon KL —> TTTT.
In this section we focus on the CP violation in the kaon system, as it is still playing
an important and complimentary roles in the study of CP violation.
As has been already mentioned above, the CP asymmetry in the neutral kaon
system turns out to be small. This comes from the fact |Vt«V^| <C l^usKTdl ~
|V-CSVC^| (\VtsVt*d\ = 0(A2X5), \VusV:d\ = 0(A)). The CP asymmetry in the kaon
system is thus expected to be 0(A2Xi) = 0(1O - 3 ), which has been confirmed
experimentally.
Our main concern is the aforementioned KL -> TTTT decay. In the kaon system,
238 FLAVOR PHYSICS AND CP VIOLATION
in clear contrast to the B meson system, the life times of KL and Ks are quite dif-
ferent, as suggested by the subscripts L and S, thus making to isolate the KL state
possible. The decay of the KL state is a clear signature of the CP violation. The
state KL is determined as an eigenstate of the 2 x 2 Hamiltonian of the (K°, K°)
system. If there is no CP violating effect in the Hamiltonian, the eigenstates should
be the eigenstates of CP transformation, i.e. KL = K2 (see (9.7), (9.9)). On
the other hand, the K2 state is odd under the CP transformation, while the final
state of the decay, the 7T7T system, is even under the transformation. This simple
argument shows the decay KL —> W7r never happens, provided the theory is CP
invariant. Therefore, its detection should be a clear signature of the CP violation
in the theory. This argument also suggests that actually such CP violation decay
may have two distinct sources of CP violation, i.e. "indirect CP violation" in the
Hamiltonian of the (K°,K°), and the "direct CP violation" in the dacay amplitude
K-i -> 7T7T. We will discuss these issues successively below.
"Indirect CP violation"
The indirect CP violation in the Hamiltonian of the (K°, K°) is due to the imag-
inary parts of the amplitude of | A 5 | = 2 mixing process K° «-> K°, denoted by Mi 2
and Ti2. As the expected CP asymmetry in the kaon system is very small, (9(10 - 3 ),
we expect ImMi2 <C ReMi2 etc., and the deviation of the ratio of coefficients of two
neutral kaon states qa/pK (see (9.36) - (9.38) from unity is small. Thus, writing
PK/QK — (1 — e)/(l + e), the two eigenstates of the Hamiltonian are re-written as
w
- 75™ l ( 1 + E ) l *° ) + < 1 - € ) l * 0 > 1
- 7mm[m+llKl)]
1
(9 62)
'
\KS) = . [(1 + e)\K°) - (1 - e)\K°)}
1
[\K!)+^K2)]. (9.63)
VW+W)
We learn that non-vanishing e indicates KL, KS are not pure eigenstates of CP, im-
plying CP symmetry is violated in the Hamiltonian. As lm(VcsV*d) = -Im(VtsV*d)
(see (4.183) and (4.184)) and the contributions of the intermediate quarks in the
box daigram behave as E(xc)/E(xt) ~ rnl/rrfi < 1 (see (9.13)), we conclude
Imri2 <S ImMi 2 . The experimental values tell us A r ^ = TL — Ts(— —Ts) =
2 r i 2 ~ -2AmK = -4Mi2, i.e. r i 2 ~ - 2 M i 2 . We thus get a relation from (9.38)
_ e t i ImMi2. ,„„,..
e ~ —= (— ). (9.64)
W+
Fig. 9.14
e-i<t>Ko # o _> ei<t>K0 <ZJL(= I=f\ _> e-m ajc Or, for infinitesimal transformation
v
' ' PK 1+ C ' PK _
\(j>\ <S 1, e —> e — i</>. Thus, though Ree ~ | ( 1 — lyxfl) *s re-phasing invariant, Ime
is not re-phasing invariant.
The re-phasing invariant Ree has been measured by the observation of the
"charge asymmetry"
g = r(KL ->• it-l+vi) - T(KL -» TT+1-UI) ^ \pK\2 - \qK?
L
~ TiKL^Tt-l+vO + TiKL^ir+l-vt) \pK\ + \qK\2
2
where Ai are due to the weak interaction, including possible CP phase, while 5i
denote the absorptive parts coming from the strong (final state) interactions. The
corresponding amplitudes for K° decays are obtained by replacing Ai by —A},
without changing the signs of 5/. The AQ gets a remarkable contribution from the
"penguin" contribution with gluon-exchange (see Fig. 9.6), which is expected to
be responsible for the relative enhancement of the 1 = 0 amplitude compared with
1 = 2 amplitude in the K -»• 7T7T decays, p | ^ n ~ 22, so called AI = | rule, while
the Ai gets the main contribution from the ordinary charged current process shown
in Fig. 9.14, though its CP violating imaginary part may be provided by "electro-
weak penguin" diagram, which is obtained by replacing the gluon by 7 or Z in the
penguin diagram.
The final state TTTT is either ir+n~ or 7r°7r°. Write the corresponding measures of
240 FLAVOR PHYSICS AND CP VIOLATION
CP violation as
A(KL ->• TT+TT~) .
v+- = -rnr z—k = e + e 9 67
-
A(KL -> 7r°7r°) „. ,
in terms of two parameters e and e', which denote the part common for two kinds
of decay and the difference of the amplitudes of the decays, respectively. By use of
a relation
l ? r +
0 = \jI \iric(I = 0)) + y ^ |7T7r(J = 2)) (9.69)
and A(KL ->• irn)/A(Ks -> 7T7T) ~ (eA(Ki ->• 7T7T) + A(J£T2 -> 7r7r))/A(i<r1 -»• 7T7T) =
e + {A{K2 -> TTTT)/A(KI —> 7T7r)), we easily get the following approximate relation
, K ^ . n * , ^ - . . ^ , , ^ . ^ , , (,72)
where we have used the facts A2 is sufficiently smaller than Ao in both in their real
part ( A / = 1/2 rule) and in their imaginary part (electromagnetic penguin contri-
bution is suppressed compared to ordinary penguin contribution by the difference
of gauge couplings). As KL and IT are physical states, the decay amplitudes, and
therefore e, e', should be re-phasing invariant. In fact, neglecting A2,
A*
A(KL^nn) l - f f ^ _ 1, QK A*0
A(Ks-+irit) i + 2^45. ~ 2L p * V {
'
PK AO
As ^ —> e - 2 1 ^ 2iL
, and ilo —• e~%^Ao under the strangeness 1/(1) transformation,
A*
the combination SM--^- is re-phasing invariant. The re-phasing invariance of e' is
trivial.
In the KM model, T" 1 ^ 0 is less important compared with e, and e ~ e. Thus in
terms of the coefficient function E(x) obtained from the box diagram,
is useful to restrict the shape of the unitarity triangle, or on (1 - p)r) (in terms
of Wolfenstein's parameterization) from the factor Im{(V^*Vtd)2}, since we now
know mt ^ 175GeV. The prediction of another CP violating variable e' is less
reliable, since we have two competing terms p " ^ 2 and pjffi- Though |ImA 2 | <£
|ImA 0 |, the AI = 1/2 rule relatively enhances the importance of ImA 2 , which gets
a contribution from the electro-weak penguin diagram. As the coefficient function
of the FCNC sdZ vertex, Tz(jk-) in (9.20) grows roughly in proportion to m% for
larger mt, the non-decoupling contribution of the intermediate heavy t quark tends
to cancel the effect of {"M0 coming from ordinary penguin diagram, thus making the
prediction of e' rather smaller than naively expected, i.e. |^-| = 0(1O - 4 ) to 0(1O - 3 ),
while the present experimental data (Particle Data Group, 2002),
seem to support this expectation. In accordance with our general argument, the CP
violating observables e, e' result from the interference effects of different amplitudes,
e.g. the interference of AQ and A^ in the case of e'. We may summarize this section
by classifying the CP violating observables depending on where they have their
origin, i.e. in the AS = 2 "indirect" CP violation in the mass matrix, or in the
AS = 1 "direct" CP violation in the decay amplitudes or in the interference of these
two types:
• pure AS = 2 CP violation: Re e (6L)
• interference between AS — 2 and AS = 1: Im e
• pure AS — 1 CP violation: e'.
In the KM model, as long as there are three independent unitarity triangles de-
pending on the choice of (a,/?) with a,/3 = d,s,b, there should be CP violating
phenomena, not only in the K system (a = d, /3 = s), but also in the neutral B
meson systems, Bd (a = d,/3 = b) and Bs (a = s,/3 = b). Hence, the observa-
tions of the CP violating events, in the rates predicted by the KM model will be
very useful for the final confirmation of the model, as the model for explaining CP
violation, or by chance we may even be able to find some deviation from the pre-
diction, which then will be an indication of some "new physics". In addition, as
was discussed in section 9.4, we expect large CP asymmetries, of 0(10%), in the
Bd system, stemming from the fact that the three sizes of the unitarity triangle for
(a = d, (3 = b) are all of comparable lengths, 0(AX3). These are main motivations
for currently on-going B-factory experiments at KEK and SLAC. In this section we
mainly discuss the Bd system, and we will simply use the abbreviation B° for Bd,
unless otherwise mentioned.
242 FLAVOR PHYSICS AND CP VIOLATION
than the naive expectation, being suppressed by |^| — 1 ~ ^ (see the argument
below (9.53)). We also note the rates of "direct" CP violation in the pure AB = 1
decay amplitudes cannot be predicted with accuracy: the (partial) decay rate asym-
metries oc T(B° -> / ) — F(B° -> / ) (/, / : distinct final states) need the interfer-
ence among different amplitudes, which have absorptive parts with different phases,
whose estimation is not easy especially for those caused by strong interactions.
d. Thus the remaining candidate to see the sizable CP asymmetries with reli-
able predictions, is the CP asymmetries caused by the interference effect between
AB — 2 and AB = 1 amplitudes, corresponding to Im e in the K system. In fact,
we will see below the CP asymmetry is due to
qMB^Jl
Im{ (977)
pA(B0^f)^
which corresponds to — 21m e in K system (see (9.73)). Such type of CP asymmetries
is known to be of 0(10%), as was first pointed out by Sanda and collaborators
(Bigi and Sanda, 2000). Another nice thing of this type of CP asymmetry is the
possible phase due to strong interaction exactly cancelled out, as long as we choose
a final state / such that the decay amplitude A is dominated by one Feynman
diagram, as in the case of / = ipKs- In this type of CP violation B° o B°
CP Violation in the Neutral B System 243
oscillation obviously plays a central role. In fact, the CP asymmetry is caused by the
interference of two distinct amplitudes, B°(B°) -» f and B°(B°) -> B°(B°) -> / ,
realized by the oscillation. (We may rewrite the CP violating factor Im ( | J g o Z ^ j )
as
u(Bo_»f)|2 Im ( | A(B° ->• / ) A(B° - • /)*).) Thus we need a non-vanishing Am
for the oscillation, and the observed value of x ~ 0.7 is in an ideal range,
e. Although the CP asymmetry of this type is expected to be quite large, this
in turn means the lengths of the three sizes of the relevant unitarity triangle are
tiny, 0(AX3). Namely the branching ratios of the decay processes to observe the
CP asymmetries are rather small, e.g. Br(B ->• ipKs) = 0(1O - 4 ). Thus, even
though the CP asymmetries are sizable, we still need high luminosity for the collision
experiments. Let us note that in the K system, Ks -> TTTT was the main decay mode.
Of course, the large CP asymmetries will help a lot in the statistical significance of
the signal: needed luminosity is proportional to 1/A2, instead of I/A, with A being
a generic CP asymmetry.
We are now ready to discuss the sizable CP asymmetry caused by the B° f+ B°
oscillation. For instance in e + e~ collision experiments, e+e~ -> B°B°, B+B~,
we a priori do not know which one of two neutral mesons is B°, which makes the
observation of CP asymmetry difficult. We start with the idealized situation, where
we know that at time 0 B° or B° was produced, and denote their state at time t
as B°(t) or B°(t) with B°(0) = B° or B°(0) = B°, which are linear combinations
of B° and B°, as shown in (9.46) and (9.47). Let us consider a CP asymmetry,
which is the difference of the probabilities, or event rates, for the state starting as
B° or B° to decay into the final state / or / at time t, divided by the sum of the
probabilities:
= P(fl°(t)->/)-P(fl»(t)->/)
AW j
~ P(B0(t) -+ f) + P{B°(t) -> / ) ' ^
where
In the expression for the time-dependent CP asymmetry A(t), the term accompanied
by cos(Amt) does not go away at t = 0, and is the indication of the direct CP
violation in the AB = 1 decay amplitude, which does not need the help of the
flavor oscillation, while the term accompanied by sin(Ami) and Im ( £ ^ | B ° ~ > ^ )
vanishes at t — 0, and indicates the CP violation due to the interference between
A B = 2 and AB = 1 amplitudes, with large CP asymmetry, we are seeking. As
we have discussed above, we are interested in the process where the decay into /
is dominated by one Feynman diagram. In that case the direct CP violation is
negligible, i.e. T(B° - • / ) ~ T(B° - • / ) . In this way, the formula for the CP
asymmetry greatly simplified into
= T T ^ I m ( M ( f l o _,/))• <9-85)
where the smearing factor j ^ — 0.5 is fortunately not small. Write the time-
independent CP asymmetry factor as a(f) = Im ( £ ^ f ° ^ ^ ) - As the three sizes
of the unitarity triangle are of comparable lengths, and the coefficient function
representing the contributions of itj = u,c,t to AB - 2 box diagram grows roughly
as ~ ml/Mw the Mi 2 , Ti 2 stemming form the box diagram are dominated by the
t quark contributions. Thus 1 ~ J jfc is written in terms of KM matrix elements
CP Violation in the Neutral B System 245
related to t quark, (VtaVt*d)2. Thus for the most promising channel to observe the
CP asymmetry in the B-factory experiments,
a(7TKs) n I m c K f f a ) = - ^ ^ i , (9.86)
V
" td\
where ly'ffii = e i(</,1_,r) was used (see Fig. 9.13(a)). As we expected, the CP
asymmetry is described by the sin of an inner angle of the unitarity triangle in
(b,d) sector. In terms of Wolfenstein's parameters p and 77 of O(l) (see (4.184)),
and the \O,(T;KS)\ can be of order 10%. This large CP asymmetry has already been
observed at B-factories: Belle and BaBar experiments have reported \a(nKs)\ ~
70%.
One may wonder why the expression above is not re-phasing invariant: the phase
of {VtsVt*d)2 changes by the re-phasing. Actually when the factor J g o I ^ K s ) ~
v v" « — i/'tv \2wVv n is taken into account, we realize that the whole ex-
pression is re-phasing invariant, as it should be. Let us note that VcjVcs comes from
the decay b -» ccs due to the ordinary charged current process, while PK/QR —
(V*sVcd)/(VC8V*d) reflects that K° and K° are contained in Ks by the fractions px
and qK- As the matter of fact, however, V^Vcs is almost real and the CP violating
phase appearing in p*K/qK ~ 1 + 2e* is suppressed by |e| = 0(1O - 3 ). That is why
the CP asymmetry is dominated by the phase of \ydyi\-
So far we have assumed that we can prepare a neutral B meson with definite
flavor, i.e. B° or B°, at t = 0. Such identification is crucial when we discuss CP
asymmetry, since the interchange B° +-> B° at t = 0 will change the sign of CP
asymmetry. Unfortunately such identification of flavor is not possible in a realistic
situation of e+e~~ experiment, such as the B-factory experiments Belle and BaBar,
since B° and B° are produced in a pair. In addition the presence of B° -H B°
oscillation makes the identification difficult. One intriguing idea is to tag the flavor
of one of the mesons to be, e.g. B°, by the observation of its decay products
at some time t. As B mesons are produced at the collider experiment through
e+e~ —> T(45) -> B°B°, the relative orbital angular momentum I of two mesons
is 1, i.e. p-wave, whose wave function is antisymmetric under the exchange of two
mesons. Thus the remaining meson at time t should have opposite flavor, e.g. B°,
even though two mesons may have the same flavor at different times as the result
of the oscillation. If we regard the time of the tagging t as a sort of initial time we
are able to repeat the argument extended above for the idealized situation.
Let us illustrate the idea more concretely by explicit calculations. At the initial
time when B mesons are produced via the decay of T, the state of two mesons with
246 FLAVOR PHYSICS AND CP VIOLATION
P(fa,fb\t,t%
<x |-^=[< fa\B°(t) X fb\B°(t') > +c < fb\B°(t') >< fb\B°(t) >]| 2
P(fa,h;t,t%
oc r ( B ° -> / o ) r ( B 0 -* / 6 ) e - r t +
{1 + I m (
' pl(B°^/I) ) s i n ( A m ^ ) } ' (9 90)
-
where a good approximations r ( B ° -» / a ) ~ r(B° ->• / a ) , and r(f?° -> /ft) ~
r ( B ° —> fb) have been used, which are valid for decay processes dominated by tree
level diagrams. The subscript =F correspond to c = =F1. At the time £' of tagging of
the meson with momentum k', another meson is known to have different flavor, and
if we regard i_ as the time t in the ideal situation it is expected that we have a CP
asymmetry with the same time dependence as what we got above for the idealizes
case. In fact, for the case of our main interest c = —1, after the integration over t+
are performed by asymmetric collider where the energies of e + and e~ are different
and therefore the center of mass system is moving. Thus the difference of t and t1 is
measurable by identifying the corresponding decay points. If we denote the number
of events for the two mesons to decay into fa and fb at time t and t' with t > t',
etc. by N(fa,ft)t>t' etc., the time-integrated CP asymmetry A = J — J o-(fa) is
given as
A _ N(fa,fb)t>t' -N(fa,fb)t>t' - N(fa,fb)t<tl + N(fa, fo)t<t' ,~ „„\
[
N(fa, f b ) t > v + N(fa, f b ) t > t , + N(fa, }b)t<v + N(fa, f b ) t < t , " - '
Problems
9.1 Consider the Yukawa couplings of two Higgs doublets <f>i, fc, given in (9.3).
Show that a linear combination of their neutral components
—j===^{vi<p\ -\-v-i^) («i2 = (v?,2)) n a s flavor-diagonal Yukawa coupling (no
FCNC).
9.2 Utilize the effective FCNC 7-vertex given in (9.21) and (9.23), (9.24) to get
the effective FCNC gluon-vertex, i.e. the contribution of "penguin" diagram. The
comment below (9.24), concerning the Q-dependence of the effective 7-vertex may
be helpful.
9.3 Solve the time-evolution equation (9.42) to get the solution (9.44), (9.45).
9.4 Prove that when mu = mc, for instance, all the elements of the KM (Kobayashi-
Maskawa) matrix can be made real, by use of suitable re-phasing of quark fields.
Throughout this book, we use the natural unit h{=. /i/27r) = c = 1, where h is the
Planck constant and c is the light velocity. We follow the convention of Bjorken
and Drell (Bjorken and Drell, 1964).
Space-time coordinates x = (i, x, y, z) = (t, x) are described by the contravariant
4-vector
The repeated indices are summed unless otherwise specified. Then the scalar prod-
uct of space-time points is given as
2 z2
1^ = 9!" Qliv*& •£ — Q XpXy — Z X (A.4)
^ = 0«=(if,-*v) (A.6)
249
250 Notation and Useful Relations
Thus,
^pM = _ 5 ^ M = - g + V2 = - D , (A.7)
where V 2 and • are called the Laplacian and the d'Alembertian operator, respec-
tively.
A.2 7 matrices
7 M 7M = 4, (A.9)
ll = 1 , (A.14)
and
{75,7"} = 0 . (A.15)
By using the Levi-Civita tensor which is totally antisymmetric, one can also write
75 as
i
(A.16)
4!'"'"
where
+1 for even permutations of 0,1,2,3,
pCT
W o - = -e"" = { - 1 for odd permutations of 0,1,2,3, (A.17)
0 otherwise.
The Levi-Civita tensor satisfies the useful relations:
7MaM =i = 7 0 a 0 - 7 • a (A.23)
and similarly
'-(Vo')-'-(^)-*-(S- 0 0- ^
where I is the 2 x 2 unit matrix and a is the 2 x 2 Pauli matrix
0 1 \ / 0 -» \ / 1 0
". = 1 n • "ff. == " n • CTz
" . == n " • (A-2?)
1 0 J' » I i 0 J' I 0 -1
The spin tensor is defined as
(i,j,fc = 1,2,3)
In calculating the transition matrix elements, the following trace theorems are use-
ful:
TrJ = 4, (A.30)
Tr 7 5 = 0, (A.31)
Tr(odd number of 7's) = 0, (A.32)
T±(W») = 4$„„ , (A.33)
Tr(7^7„7p7<r) = 4:{g„ugp<r - g„pgva + g^gvp) , (A.34)
Tt(757M) = 0, (A.35)
TV( 7 5 7M7^) = 0, (A.36)
Tf(7S7/.7„7„) = 0, (A.37)
PL + PR = 1, PI = PL, PR = PR,
PI = PL, PR = PR, PLPR = PRPL =0. (A.46)
Note that
an
IhipR = V>fl d TsV'i = ~i>L • (A-47)
Thus, tpL and ipR are eigenvectors of the chirality operator 75. Define the Dirac
adjoint spinor i/> = ^ 7 ° . Then we have the following important identities:
When we write the 4-component Dirac spinor in terms of 2-component Weyl spinors
X and 77 as
U-V*"*v")(:)-(:)-. <-»
which results in the following simultaneous equation
(-i^-+ia-V)r]-mx = 0. (A.55b)
254 Notation and Useful Relations
Then we have
EX = +V-PX- (A.58a)
Erj = -a-pq , (A.58b)
l
h='-4=
1^1
M-
2 |p|
(A.60)
Then we can write
% = +2X, (A.61a)
2 I n
0 -11 )MU„ )/ == 2'oTsV' , (A.62)
where p = i§r. Therefore, helicity /i is the same as chirality 75 for massless fermions
with positive energy. For negative-energy fermions(which correspond to antiparti-
cles), helicity h has the opposite sign to chirality 75. If a fermion is massive (m ^ 0),
Dirac equation in the ^-diagonal representation: 255
but they are no longer eigenstates of helicity h because of the non-zero mass term
in the Dirac equation.
X + (A 65a)
hx = + 2 ^ ' -
T 1 m , . „_, .
hn = ~2T,~2EX- (A.65b)
By using the eigenstates oih, x' and r/, as defined as hx' = + | x ' and hr)' = —1»/,
respectively, one can write them as
771
X « x' + ^ V , (A.66a)
777
i; « r/--X'. (A.66b)
That is, x and 77 are given by the mixture of both helicity states where contribution
of wrong sign helicity is order of m/E.
This page is intentionally left blank
Appendix B
Cross sections and Feynman rule
The scattering cross section or the transition probability for particle reactions is
described by the S matrix as
S = l+iT (B.l)
where T, being called the transition matrix, denotes the transition between initial
i and final / states.
First, consider a 2-body collision, a(pa) + b(pt) —• c(pc) + d{pd), as an example,
where the momentum of each particle is given in parentheses. The transition matrix
element for this process can be written as
For the time being, we consider the case of spinless particles. Extention to more
general case including particles with spins is straightforward as described later. In
(B.2), all dynamics for this transition are included in M;i which is Lorentz invariant
and called the invariant amplitude. N is a normalization factor which depends on
the convention of the wave function normalization. Here we take the convention
for normalizing 2E particles in a volume V, i.e. N = -k= by defining the free field
wave function as
ftx) = ^ = 6 - * * . (B.3)
The transition probability per unit time and unit volume for this collision is
257
258 Cross sections and Feynman rule
given by
IT/-12
Wfi = LM_, (B.5)
From (B.7), we can obtain the cross section a by multipling it by the number of
available final states and dividing it by the incident flux F. As is well known, the
number of states in which a particle in a volume V has its momentum between
P = {Px,Py,Pz) and p + dp= {px + dpx,py + dpy,pz + dpz) is ^ f . Since we are
normalizing IE particles in V, the number of states per one particle is i2ix\32E •
Then, the number of available final states for the process a + b -> c + d is i2„)32B '
(2TT)*2E • ^he m c m e n t flux F can be calculated as follows; if we consider the process
in the laboratory frame, the number of the incident particle a passing through unit
area per unit time in this collision is ^p^-l^ml) where Vin is the velocity of the
incident particle a, while the number of the target particle b per unit volume is ^ - .
Thus, the incident flux for the collision of a and b is F = \v%n\^h^v^- Then we can
obtain the cross section for this process as follows;
1 ,4,
da(a + b -> c + d) = T ^ v ^ (Pa + Pb - Pc - Pd)
d3pc d3pd
\M(a + b^c + d)\\ (B.8)
2EC 2Ed
Now, the extension of this formula to the n particle production processes, a+b -+
1 + 2H + n, is straightforward,
da(a + b -+ 1 + 2 + • • • + n)
1 1 ,4
One can easily generalize these formulas to processes including particles with spin
by replacing |M/i| 2 in (B.8) and (B.10) by X^ gpin l-^/*l2> which means that the spin
degree of freedom must be summed for the final states and averaged for the initial
states if the spin of initial particles is not polarized. In addition, for spin | fermions
the normalization convention for the wave function,
The decay width of a particle can be calculated similarly; the decay width formula
for a particle with mass M decaying into n particles in its rest frame is given by
dT{M -> 1 + 2 + • • • + n)
= -i
2M I T
(27T)(3"-4) \Mfi\264(Pa +Pb-Pl~P2 Pn)
2M (2TT)( 3 "- 4 ) '
d 3 Pi d 3 p 2 d?pn (B.15)
2E\ 2E2 2En
260 Cross sections and Feynman rule
The invariant amplitude Mfi for an specific process is given by the sum of Feynman
diagrams corresponding to its process. They are usually composed of three kinds
of factors, i.e. external lines, propagators (internal lines) and vertices, which are
described below.
(l)External lines
• spin=0 bosons: For both incoming and outgoing bosons, assign 1.
• spin=l photon: For an incoming photon, assign ej, (k). For an outgoing photon,
assign e£ (&')
(2)Propagators
• Boson with spin=0:
_ i
p2 — m2
_ i _ . ^+m
$—m p2 — m2
(3)Vertices
• spin=0 boson-photon vertex:
Feynman rule
-ieip + p')^
(for charge +e)
2ie2g„,v
s p i n = | fermion-photon vertex:
(l)External lines
- spin— 2- quarks: For an incoming quark or anti-quark, assign u^(p) or v^(p),
respectively. For an outgoing quark or anti-quark, assign u^s \p') or v^s ^(p1), re
spectively. Though quarks have the flavor and color degree of freedom, it is not
written explicitly in these spinors.
(2)Propagators
• Quark with spin=^:
. p" + m
rfi -
m m"
(3)Vertices
• Quark-gluon vertex:
-*s«y7^
• Gluon-gluon vertex:
[i, ki, i
-i9s[fijmfktm(9»\9vp ~ 9p.p9v\)
+fitmfjkm(9nv9\p ~ 9p.\9up)
v,k2,j X,k3,k +fikmfljm{9np9v\ — 9p.v9\p))
(l)External lines
• spin=0 Higgs bosons: For both incoming and outgoing Higgs bosons, assign 1.
• spin=l gauge bosons: For an incoming gauge boson, assign ej, '(k). For an out-
going gauge boson, assign e*f} (&')
(2) Propagators
• Higgs boson with spin=0:
p2_M2
i _ . ft + m
•p — m p2 — m2
V-Ml\9 +
Ml )
(3) Vertices
• Charged current vertex:
._9_ 1-75
V2^ 2
• 9 a 1-75
264 Cross sections and Feynman rule
U,C
qFi
^sin6,c7"~T^
— is for s-u-W vertex
+ is for d-c-W vertex
W,n
—le
sin 6w cos 6w M ('i1-- 7 5 , j 1 + 75
Z°,fi
where
1 2 2
a{ = - - - s i n 2 ^ , a£ = - - s i n 2 0 w , f o r / = u,c,t
W+,ku^ W-,k2,\
7 , Ky, fJ,
^ory+xy- vertex:
Feynman rule 265
W+,kuv. w-,k2,\
+9»v(k3 ~h)\]
l,a.
W+,fi W~,v
7>«.
W+,M W~,v
Z , a.
W+,ii W~sv
W+,a w-,P
26
. 2 a {I9na9v0 ~ 9»&9av ~ 9iiv9ap)
sin Vw
W+,n W~,v
ie
-Mwgv)i
sin# w
2. HZ°Z° vertex:
Z°,kuu Z°,k2,X
^ r
lie
Mz9v\
sin 20w
ie m,f
2 sin 6w Mw
H^ M
3H2e
2Mw cos 9w
ff. M
ie2
9lLV
2 sin 8\v
W+,fi W,v
Feynman rule 267
HHZ°Z° coupling:
HHHH coupling:
H^ JI
^^ ,-''' 3i/x2e2
It'' "^H
This page is intentionally left blank
Appendix C
Basics of the group theory
The Lagrangian is the fundamental object describing the dynamics of the physical
system and is to be constructed so that the interactions are invariant under certain
symmetry transformations. The requirement of invariance under symmetry trans-
formations is a guiding principle for the construction of interacting field theories.
Here we give a little mathematical basics of the symmetry, i.e. the group theory, in
particular, an SU(n) group and its algebra.
A group G is defined as a set of elements {a, b,c,- • •} with the follwing conditions,
Among many examples of groups, the n dimensional orthogonal group (0(n)) and
special unitary group (SU(n)) are interesting in particle physics. 0(n) is the one in
which the length r 2 = x\ + x\ H 1- x\ in the n dimensional real space is invariant
under any rotation in this real space. U{n) is the one in which the length defined
by s 2 = |a;i| 2 + \x%\2 + 1- \xn\2 in the n dimensional complex space is invariant
under any rotation in this complex space and in addition, when the determinant of
the transformation matrix U is 1, i.e. detU = 1, it is called the SU(n) group.
In case of a • b = b • a for any elements a, b in G, the group is called the Abelian
or commutative group, while in case of a • b ^ b • a, it is called the non-Abelian or
non-commutative group. One-dimensional translation, two-dimensional rotation
(0(2)) or phase transformation (U(l)) etc. belong to the Abelian group, while
three-dimensional rotation (0(3)) or special unitary transformation (SU(n) with
269
270 Basics of the group theory
n>2) etc. belong to the non-Abelian group. Specially interesting group in particle
physics is the Lie group in which the group element is the analytic function of the
continuous parameter.
Let us assume that there is a matrix or operator M{a) for any elements a in G.
Then, if M(a) satisfys the conditions, M(a)M(b) — M(ab), M(a~l) — M'1(a) and
M ( l ) = 1, then M(a) is called representation of the group G.
We are interested in the SU(n) group defined in the n dimensional complex space,
which is expressed by an n x n unitary matrix with its determinant being equal to
1
Then, how many parametrs are there in U1 As is well known, an n x n complex ma-
trix has 2n 2 real parameters. But there are n 2 constraints among these parameters
because of the unitarity relation WU = 1 and moreover, there is one additional
constraint due to the condition of detC/ = 1. Therefore, the number of the real
parameters of the matrix U is 2n 2 — (n 2 + 1) = n2 — 1. For example, the unitary
matrix for SU(2) and SU(3) has 3 and 8 parameters, respectively. Using these
parameters 6l (i = 1,2, • • •,n 2 — 1), the nxn unitary matrix U for SU(n) is written
by
where Ll are the nxn matrices called the generators for the SU(n) group. Here
the summation over i is implied. Since U is unitary, L1 are hermitian matrices. In
addition, L% is traceless due to detC/ = 1. Explicit expressions of Ll, for example,
for SU(2) are given by the 2 x 2 Pauli spin matrices ^- (i = 1,2,3) or, equivalently,
2 x 2 isospin matrices y (i = 1,2,3) with
A7 = 0 0 -i , A8 = -== 0 1 0 . (C.4)
As explicitly shown above, the SU{2) group has one diagonal matrix, ^-, and
the SU(3) group has two diagonal matrices, ^- and ^-. The number of diagonal
matrices is equal to the rank of the group. The SU(ri) group has n — 1 diagonal
generators and is a group of rank n — 1.
From the group property of U, we can see that these generators satisfy the
following algebra which is called the Lie algebra
where fijk, being antisymmetric constants for exchange of any pairs of i, j and
k, are called the structure constants of the group. Here the summation over k is
implied. This statement can be proved as follows; let us start with the foil wing
relation for non-commuting operators A and B,
A . eB = eA+B+i[A,B]+ ( C 6 )
e
Now let A and B to be given by A = ialL' = ia-L and B = ifilLl = i/3-L with n 2 - 1
parameters a1 and p*, respectively, where the summation over i is implied. Then,
since U(a) = eA and £/(/?) = eB are the elements of SU(n), U(a) • U(0) = eA • eB
is also the element of SU(n) because of the group property. Hence, eA • eB must
be also written by ec with C = ij • L. Therefore, the commutator [Ll, V\ must be
linearly related to the sum of the generators as described in (C.5). The structure
constants faf. are determined from the associativity law of the group.
The Jacobi identity
[Ti,Ti]=ifijkTk, (C.9)
which is just the same algebra as (C.5). That is to say, the generators being just
equal to the structure constants of the group satisfy the Lie algebra. In other
words, the structure constants themselves generate a representation of the Lie alge-
bra. The representation generated by the structure constants is called the "adjoint
representation". The dimention of the adjoint representation is just the number of
generators, which is n 2 — 1 for SU(n).
272 Basics of the group theory
Let us consider a field <j> composed of n complex fields ipa (a— 1,2, • • •, n) and write
it by a column vector,
f2
<t> = (CIO)
\<Pn J
If <>
/ changes into <p' under the SU(n) transformation as
The representation for those fields ipa is called the "conjugate fundamental repre-
sentation" and denoted by n*. Since the infinitesimal transformation associated
with U*(0) is 17*(0) ~ 1 + ifl'L" = 1 - iO^-L"), the conjugate representation
<p = y?* are generated by — IS*. Therefore, all eigenvectors for generators Ll of
a
diagonal matrices are still eigenvectors of — L%* but of opposite sign. Regarding this
property, the SU(2) group is specially interesting because it has only one diagonal
matrix Ij. Then for SU(2), we can expect that the 2 and 2* representation is
equivalent. In fact, this is the case as shown in the following. Let us look for a
Representation in SU(n) group 273
matrix A so that A<j> transforms, like <j> in (C.ll), as A<j>* -¥ (A(f>*)' = U{A(f>*)
or 4>* -» 0'* = (A-lUA)4>* = £/*<£*. Thus, we must have A~XUA = U*, that
is, A~X^A = -^- (i = 1,2,3) since U ~ 1 - iQ% for SU(2). This can be
possible, when we choose A = ir2 = I J. Actually, we can see that
By taking the trace of (C.17), we find Ti(ipa ® X6) = ¥>aX°- Hence the 1st term of
the right-hand side of (C.17) is the traceless matrix with n 2 - 1 components,
r
« = V-* 6 - i * W , (C18)
n
and denoted by n 2 - 1. The 2nd term is just the trace term having only one
component,
and
Using the unitarity condition U™U^ — S™ where the summation over c is implied,
these transformation law can be written in the matrix form as
and
S->S' = S. (C.23)
I* = At{Li)ta, (i = l , 2 , - - - , n 2 - l ) (C.24)
274 Basics of the group theory
where A1 are the parameters of the SU(n) group. Here the summation over i
is implied. Now, let us consider the transformation law of A1 under the SU(n)
transformation. Under an infinitesimal transformation
n ® n* = n 2 - 1 © 1. (C.30)
Appendix D
C, P and T transformation
275
276 C, P and T transformation
An important example of parity transformation is the one for the weak left-handed
current denned by J^ ' = $aL(z',t)jli.il)bL{x,t), where the left- and right-handed
fermion fields transform under parity inversion as
Here we just list up the transformation rule of fields under time inversion. For
simplicity, we fix the phase to be T\T — 1.
{DA6)
Dxrac field . { ^ t ) ^frpj^^fr-tW,
Vector field : V{x,t) -» V " T ( f ,t') = VM(f, - t ) . (D.17)
One should not be confused at the superscript T to Dirac fields in (D.16); the
superscript T on the left-hand side means the time-reversed field, while the one on
the right-hand side does the transposed field.
Furthermore, the bilinear covariants composed of Dirac fields transform as
In addition to the space-time symmetries discussed above, there are other symme-
tries called internal symmetries. Among many such internal symmetries, we are
here interested in the charge conjugation. This is a discrete transformation of par-
ticles into antiparticles, in which the electric charge and other quantum numbers of
particles are reversed without changing any kinematic attributes.
Here we just list up the transformation rule of fields under charge conjugation.
For simplicity, we fix the phase to be rjc = 1.
Scalar field <f>(x,t) -+<j>c(x',t') = tf(x,t), (D.23)
c
Pseudoscalar field 71(x,t)^rl (x',t')=rf(x,t), (D.24)
f i>(x,t)->i>c(x',t')=C$T(x,t),
Dirac field (D.25)
\ $(3,t) -> H>c{x',t') = -$T(x,t)C-1,
Vector field : V"(2, t) -4 V^ ( f , t') = -V^(x, t), (D.26)
where the charge conjugation operator C in (D.25) is explicitly written by C = Z727°
and satisfies the relations, C'^C = - ^ and C = - C _ 1 = - C * = -CT.
Furthermore, the bilinear covariants composed of Dirac fields transform as
Now, a comment on the transformation of the left- and right-fields under charge
conjugation is useful. Applying the transformation rule (D.25) to the left-handed
Dirac spinor, we can obtain the following result,
T
G
(VL) = ai>i=c (- ^ V O V
2
J*y> 1-75
= c p Ci° P
1 + 75 .0,/,* 1 + 75
C7°V - C-ip1
2
1 + 75 c
i> = (i>c)R = tfg. (D.32)
Similarly we obtain the result, (IPR)C = (II>C)L = V'L J f° r the right-handed spinor.
Furthermore, we can also obtain the similar result even for these adjoint spinors.
278 C, P and T transformation
We summarized them as
D.4 C P transformation
and furthermore, combining (D.ll) and (D.33), similar rules are derived for the left-
and right-handed spinors as
= -^bL{-X,t)^i)aL{-X,t)
where the minus sign in the last two equations originates from the interchange of
fermion fields in taking the transposition.
Let us consider the transformation of the charged current weak interaction under
CP inversion. The Lagrangian is given in (4.164) as
r {jncp = -D'Ll,VTU'L,
+ CP (D 38)
I (J» ) = -U>Lj»V*D<L. -
CP transformation 279
Furthermore, the application of C and P inversions to the vecor field Wjjf yields
w
jt -> W$P± = -W*. (D.39)
Then, we get the following transformation,
j dix{U'LYVD'LW+ + D'L>fVlU'LW-)
In modern particle physics, the symmetry and the constituent of matter having its
symmetry are powerful tools to understand the structure of matter and its physics.
One such example is the proton and neutron which are the constituents of all nuclei
and possess the SU(2) symmetry of isospin. Various properties of nuclei are well
understood by the dynamics of protons and neutrons having the SU(2) symmetry
of isospin. Now, it is known that there are more than 300 hadrons, including the
proton and the neutron. Then, it is natural to expect that there must be new
fundamental constituents at a deeper level in Nature which build those hadrons
systematically based on new underlying symmetries. In 1964, Gell-Mann and Zweig
(Gell-Mann, 1964; Zweig, 1964) proposed such a new constituent model of hadrons
called the quark model, in which the hadrons are beautifully classified with the
SU(3) symmetry. The idea of quarks as the fundamental constituents of hadrons
has been developed with a lot of theoretical and experimental efforts in 1960s and
1970s, and the quark model was established as an important component of the
standard model of modern particle physics. Here we would like to discuss the quark
model.
Before discussing the quark model, let us begin by briefly reviewing the isospin
symmetry. Nuclei are bound states of protons and neutrons and their nuclear force
is issued from pion exchange. Interestingly the nuclear force does not distinguish
the proton and the neutron. The proton mass mp = 938.27200 ± 0.00004 MeV is
very close to the neutron mass m„ = 939.56533 ± 0.00004 MeV and their small
mass difference is considered to be due to the electromagnetic interaction; a proton
has a positive charge, while a neutron is neutral. This situation naturally suggests
that the proton and the neutron are two manifestations of the same particle, called
the nucleon, and one can write it as a doublet 2 = I ) of the fundamental
281
282 The quark model
representation of the SU(2) group, that is, the proton and neutron are represented
in the isospin space as
[l\P]=ieijkIk. (E.2)
where e^ is the totally antisymmetric Levi-Civita symbol with £123 = 1 and the
summation over k is implied. r% have the same expression as the Pauli matrices
a1 (i = 1,2,3) and their explicit expression are presented in (C.3).
For the two-riucleon system, analogously to the spin composition of two spin | ,
one can construct the J = 1 (triplet) and / = 0 (singlet) states of the nucleon-nucleon
system,
r \i = i,i3 = i)=pp,
I = l(triplet) I \I = 1, J 3 = 0) = ^{jm + np), (E.3)
[ \I = l,I3 = -l) = nn,
Now, let us define the charge conjugation operator C which transforms particles
into antiparticles,
(i)-(J ?)(:)•
However, the sign of the eigenvalue of I3 for antiparticles is opposite to the one for
particles and we want to arrange h to the upper component (I 3 = + | ) and p to
the lower component (I 3 = — | ) in the antiparticle doublet, as 2* = I J. Then,
if we introduce a minus sign to n, we can see that it transforms in exactly the same
way as the the nucleon doublet I 1 under the SU{2) isospin transformation, as
shown
( ? ) - ( ; ?)(?)•
(E.9) is certainly the same transformation as (E.6). Namely, 2* and 2 transform
in the same way under the SU(2) transformation, i.e. 2* ~ 2. (Here don't worry
about the nasty minus sign to n.) This can be done only for the SU(2) symmetry
as proved in Appendix C.
Now, we can construct the isospin states of the nucleon-antinucleon system as
follows;
\I = 1J3 = 1) =-pn,
J = 1 (triplet) < \I=l,I3=0) = j-(pp-nn), (E.10)
\I - 1,I3 = - 1 ) =np,
In 1950s, new hadrons were discovered. Those new hadrons were surprisingly long-
lived compared to the strong interaction scale. For example, the A0 and K° are
284 The quark model
easily produced in high energy n p scattering but those particles decay into light
hadrons with very long lifetime,
To explain the fact that while the production of A0 and K° occurs with strong inter-
action scale, the decay of those particles does with weak interaction scale, Nakano
and Nishijima and, independently, Gell-Mann introduced a new additive quantum
number called "strangeness" (S). They assigned S as S = 0 for p, TT~ , S = + 1
for K° and S = — 1 for A0, and considered that while the strong interaction con-
serves the quantum number S, the weak interaction does not. In the production
process (E.12), S is conserved but in the decay processes (E.13), it is not. Soon
later, the idea of Nakano, Nishijima and Gell-Mann was confirmed from observed
properties of many strange particles discovered those days. The conservation of the
strangeness S is similar to the one of the charge Q due to the (7(1) electromag-
netic symmetry. This suggests the existence of a new {7(1) symmetry. Actually
one can introduce the symmetry called the U(l) hypercharge symmetry, where the
new quantum number Y, called hypercharge, is defined by the sum of the baryon
number B and the strangeness S,Y = B + S. Because of this new (7(1) symmetry,
the strong interaction conserves the hypercharge Y and hence, the strangeness S
is also conserved in strong interactions, because the baryon number B is a good
quantum number for the strong interaction. Then, we can see that the following
relation, being called the Nakano-Nishijima-Gell-Mann (NNG) relation (Nakano
and Nishijima, 1953; Gell-Mann, 1953),
Q = I3 + ^, (E.14)
works well for all hadrons discovered those days, where Q and I3 are the charge
and the 3rd component of the isospin of the hadron, respectively.
In 1964, Gell-Mann and Zweig introduced the quarks as physical substances
to realize the relation (E.14). In the quark model, all hadrons are made of a few
quarks; while all baryons are made of 3 quarks q, all mesons are made of a quark q
and an antiquark q, where all quantum numbers of q is opposite to those of q. Since
the quark model should make even strange hadrons like A0 and K°, we need a new
quark, i.e. the s(strange) quark in addition to the u(up) and d(down) quarks which
nicely build the non-strange hadrons like p, n, ir, etc. Thus, in the original quark
model the u, d and s quarks were considered to be the fundamental constituents of
hadrons and to have the SU(3) symmetry. Note that though the 5(7(2) symmetry of
isospin is a rather good symmetry due to almost equal masses of p and n, the 5(7(3)
symmetry is not such a good symmetry because mass differences of strange hadrons
and non-strange hadrons are rather big. Later, the existence of more heavier quarks
c(charm), 6(bottom) and t(top) quarks were also established. Now, we have 6
Quark model and SU(3) symmetry 285
I I3 s B Y Q
1 I 1
u 2 +1 0 3 3 ^3
1 1 1 1 1
d 2 2 0 3 3 3
1 2 1
s 0 0 -1 3 3 3
( u\
Q = d (E.15)
Vs J
Now, let us consider the 517(3) transformation of this triplet,
"Here we consider an idealized world of equal quark masses of u, d and s, though they are, in fact,
different. Therefore, the same mathematics can be applied for the color SU(3), which is an exact
symmetry.
286 The quark model
* / 3
(a) q = 3 (b)g = 3*
we can plot the members of the quark triplet with their quantum numbers of I3
and Y in (I3, Y) space, as shown in Fig. E.l (a). The members of the antiquark
triplet (=3*) are also ploted in Fig. E.l(b). (Fig. E.l is called the weight diagram.)
Introducing I3 and Y into (E.14), we obtain the charge operator,
3 0 0 \
A8
Q = i3 + ^ = 0 1 0 (E.18)
+ 2v/3 ~ 3
1
lo 0 3 /
The diagonal elements of Q are certainly equal to the charge of the u, d and s
quarks given in Table E.l. Then we see that the NNG relation works well even for
quarks.
In the quark model, mesons are composed of qq, while baryons are of qqq. The
fundamental representation q = 3 transforms as (E.16) under U, while the conjugate
representation q = 3* transforms as
b a
9" -» «" = q (U^) b (E.19)
First consider the product qaqb — 3 <g> 3*, which is reducible. When we decom-
pose it as
we see that the 1st term is traceless because of Tv(qaqb) = qaqa and has 8 compo-
nents. The 2nd term is just the trace term and of only one component. The 1st
and 2nd terms do not mix each other and hence they are irreducible. Then we can
say that 3 ® 3* can be resolved into the octet 8 and singlet 1 as
3®3*=8©1. (E.21)
Next, consider the direct product qaqb = 3 <g> 3. It is also reducible. To resolve
this reducible representation into two irreducible representations, let us separate it
into the symmetric and antisymmetric parts as
where the 1st and 2nd terms are symmetric and antisymmetric, respectively, for
interchange of qa and qt- The antisymmetric part has 3 components and identical
to the set of objects ea6cQ69ci where eabc is the totally antisymmetric Levi-Civita
symbol with £123 = 1- Then if we define eabcqbqc = 9 a , w e c a n see that qaqa is
invariant under the transformation of U as follows;
where det U = 1. Since U is unitary, from the transformation rule of (E.16) for qa
and the result of (E.23), we can easily find that qa, i.e. the antisymmetric part of
(E.22), transforms as the conjugate representation 3*. The symmetric part has 6
components and thus, we can obtain the decomposition rule,
3 ® 3 = 3* © 6. (E.24)
3* and 6 do not mix under the 51/(3) transformation and each one cannot be
decomposed any more. They are irreducible.
Finally, what happens when we multiply one more quark state to (E.24)? In this
case, we have two products 3* ® 3 and 6 ® 3. The product 3* ® 3 has 9 components
and is decomposed into the octet 8 and singlet 1 as presented in (E.21). But here
we should be careful about that in 1 all quarks are totally antisymmetric (as can be
seen from qaqa in (E.23)) and in 8 the first 2 quarks are antisymmetric. The product
6 ® 3 has 18 components and is decomposed into the decuplet 10 and octet 8. In
10 all quarks are totally symmetric, while in 8 only first 2 quarks are symmetric.
288 The quark model
Y
1 I
ds 1 US
/-> "\
/ \
/ \
/ 1 \
/ \
' dj/ 3 \
i ^•v \
duL/\^\ 1 I ' O • J3
1u \ ' 1 \ v ' »% / I ' ,
2
» \ ' 2 ' 1
\ \ / '
\ \ '/ 2 ''
^s \
si
\\ \ 3 '
<s~y .
^s~ / /
\r*i-^^ . r~s\
SU -1 sd
(a) qq = 3 X 3*
»js I3
where the first 2 quarks are antisymmetric and symmetric in 8A and 85, respectively.
Likewise in the SU(2) case, we can illustrate the quark triplet (E.15) as • = 3.
Representations of mesons and baryons 289
• Mesons
In the quark model, mesons are made up from a quark q and an antiquark q. Here
we consider an example of pseudoscalar mesons with Jp = 0~.
qq = 3 <g> 3* = 1 © 8. (E.26)
Or using Young tableaux, one can write this as
^ = • 0 0 = §©EB (E.27)
3 <g> 3* = 1©8
The number below the Young tableaux denotes the dimension of each irreducible
representation. Therefore, mesons are represented by a sum of the octet 8 and the
singlet 1. By combining the weight diagrams of q (Fig. E.l(a)) and q (Fig. E.l(b)),
we can show this in a different way as shown in Fig. E.2. The same configuration
is obtained even for vector mesons with Jp = 1~.
• Baryons
Baryons are composed of three quarks qqq and its product of representations are
given in (E.25). The Young tableaux for this configuration is given as
Y
-,
A- A0 A+ A++
(b) Jp = | +
baryon decuplet (10)
291
292 Bibliography
M. J. Murtagh, BNL-52079,C87/02/05
Lim, C. S., Inami, T. and Sakai, N. (1984), Phys. Rev. D29, 1488
Lim, C. S. and Marciano, W. J. (1988), Phys. Rev. D37, 1368
Maki, Z., Nakagawa, M. and Sakata, S. (1962), Progr. Theor. Phys. 28, 870
Marshak, R. E. and Sudarshan, E. C. G. (1958), Phys. Rev. 109, 1860
Mikheyev, S. P. and Smirnov, A. Yu. (1985), Yad. Fiz. 42, 1441 [Sov. J. Nucl. Phys. 42,
913 (1985)]
Mohapatra, R. N. and Senjanovic, G. (1980), Phys. Rev. Lett. 44, 912
Muta, T. (1998), "Foundations of Quantum Chromodynamics-An Introduction to Pertur-
bative Methods in Gauge Theories" (2nd Edition), World Scientific Pub.
Nagashima, Y. (1999), "Standard Theory of Elementary Particles and Their Experimental
Ground", (in Japanese), Asakura Pub, Tokyo
Nakanao, T. and Nishijima, K. (1953), Prog. Theor. Phys. 10, 581
Noether, E. (1918), Nachr. Kgl. Ges. Wiss. Gottingen, p.235
O'Raifeartaigh, L. (1975), Nucl. Phys. B96, 331
Particle Data Group, Hagiwara, K. et al. (2002), Phys. Rev. D66, 010001
Peccei, R. D. (1989), "The Strong CP Problem", pp. 503-552 in "CP Violation", ed. by
Jarlskog, C., World Scientific Pub.
Peskin, M. E. and Schroeder, D. V. (1995), "An Introduction to Quantum Field Theory",
Addison-Wesley Pub.
Peskin, M. and Takeuchi, T. (1990), Phys. Rev. Lett. 65, 964
Pokorski, S. (2000), "Gauge Field Theories", 2nd edition, Cambridge Univ. Press, UK
Quigg, C. (1984), "Gauge Theories of The Strong, Weak, and Electromagnetic Interac-
tions" , The Benjamin/Cummings Pub.
Roos, M. and Sirlin, A. (1971), Nucl. Phys. B29, 296
Ross, G. G. (1985), "Grand Unified Thoeries", Benjamin/Cummings Pub.
Rudaz, S. (1986), Lecture at 1986 CERN School of Physics, "Introduction to Symmetries
and Gauge Theories", Proceedings, CERN-87-02 (1987)
Sakai, N. (1981), Z. Phys. C l l , 153
Salam, A. (1968), "Elementary Particle Theory: Relativistic Group and Analyticity" (No-
bel Symposium No.8), ed. by N. Svartholm, Almqvist and Wiksell, Stockholm,
p.367
Susskind, L. (1979), Phys. Rev. D20, 2619
Shi, X. and Schramm, D. N. (1992), Phys. Lett. B283, 305
Shi, X., Schramm, D. N. and Bahcall, J. N. (1992), Phys. Rev. Lett. 69, 717
Sikivie, P., Susskind, L., Voloshin, M. and Zakharov, V. I. (1980), Nucl. Phys. B173, 189
Smirnov, A. Yu. (1992) in Proceedings of the International Symposium on Neutrino As-
trophysics, Takayama, Japan: Frontiers of Neutrino Physics, ed. by Y. Suzuki and
K. Nakamura
SNO collaboration, (2001), Phys. Rev. Lett., 87, 071301
Strassler M. J. and Peskin, M. E. (1991), Phys. Rev. D 4 3 , 1500
Super-Kamiokande collaboration, (1998), Phys. Rev. Lett. 81, 1562
Super-Kamiokande collaboration, (2001), Phys. Rev. Lett. 86, 5656
Takahashi, Y. (1957), Nuovo Cim. 6, 370
Veltman, M. (1977), Nucl. Phys. B123, 89
Voloshin, M. et al. (1988), J E T P 64, 446
Ward, J. C. (1950), Phys. Rev. 78, 1824
Weiberg, S. (1967), Phys. Rev. Lett. 19, 1264
Weinberg, S. (1976) Phys. Rev. Lett. 37, 657
Weinberg, S. (1979), Phys. Rev. D19, 1277
294 Bibliography
Weinberg, S. (2000), "The Quantum Theory of Fields", Vol. Ill, Cambridge University
Press
Wess, J. and Bagger, J. (1992), "Supersymmetry and Supergravity", Princeton Univ.
Press
Wolfenstein, L. (1978), Phys. Rev. D17, 2369
Wolfenstein, L. (1981), Nucl. Phys. B186, 147
Wolfenstein, L. (1983), Phys. Rev. Lett. 51, 1945
Wu, C. S. et al. (1957), Phys. Rev. 105, 1413
Yanagida, T. (1979), Proceedings of "Workshop on Unified Theory and Baryon Number
in the Universe", ed. by Sawada, O. and Sugamoto, A. KEK, Japan
Yang, C. N. and Mills, R. L. (1954), Phys. Rev. 96, 191
Yndurain, F. J. (1992), "The Theory of Quark and Gluon Interactions", Springer-Verlag
Zener, C. (1932), Proc. Roy. Soc. London, A137, 696
Zweig, G. (1964), CERN Report 8182/TH.401; 8419/TH.412
Index
295
296 Index
tau number, 18
T — 6 puzzle, 8
technicolor theory, 208
time inversion T, 276
transition probability of the process
ve -*• Vp, 121
two-component notation, 149
World Scientific
www.worldscientific.com
4655 he