Introduction To Particle Physics
Introduction To Particle Physics
Axel Maas
2
Contents
1 Particle Physics 1
1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Natural units . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Elementary particles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.4 Fermions and bosons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.5 Particles and anti-particles . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
1.6 Interactions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2 Scattering experiments 8
2.1 Non-relativistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.1.1 Elastic scattering . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.1.2 Cross section . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2.1.3 Luminosity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.2 Relativistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.2.1 Repetition: Relativistic notation . . . . . . . . . . . . . . . . . . . . 11
2.2.2 Inelastic scattering . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.3 Decays . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.4 Cross-sections and particles . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.5 Feynman diagrams . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
i
ii Contents
5 Strong interactions 44
5.1 Nuclei and the nuclear force . . . . . . . . . . . . . . . . . . . . . . . . . . 45
5.2 Mesons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
5.3 Nucleons, isospin, and baryons . . . . . . . . . . . . . . . . . . . . . . . . . 47
5.4 The quark model . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
5.5 Color and gluons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
5.6 Chiral symmetry breaking . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
5.7 The Goldstone theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
5.8 Confinement and asymptotic freedom . . . . . . . . . . . . . . . . . . . . . 57
5.9 Hadronic resonances . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
5.10 Glueballs, hybrids, tetraquarks, and pentaquarks . . . . . . . . . . . . . . . 59
5.11 Flavor symmetry and strangeness . . . . . . . . . . . . . . . . . . . . . . . 61
5.12 Charm, bottom and quarkonia . . . . . . . . . . . . . . . . . . . . . . . . . 63
5.13 Top . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
5.14 Partons and scattering experiments . . . . . . . . . . . . . . . . . . . . . . 67
5.14.1 Fragmentation functions and jets . . . . . . . . . . . . . . . . . . . 67
5.14.2 Parton distribution functions . . . . . . . . . . . . . . . . . . . . . 68
5.14.3 Monte-Carlo generators . . . . . . . . . . . . . . . . . . . . . . . . . 70
5.15 Some algebra and group theory . . . . . . . . . . . . . . . . . . . . . . . . 71
Contents iii
6 Weak interactions 83
6.1 decay and parity violation . . . . . . . . . . . . . . . . . . . . . . . . . . 84
6.2 Neutrinos . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
6.3 Flavor-changing currents . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
6.4 W and Z bosons . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
6.5 The Higgs effect and the Higgs boson . . . . . . . . . . . . . . . . . . . . . 91
6.6 Parity violation and fermion masses . . . . . . . . . . . . . . . . . . . . . . 95
6.7 Weak isospin, hypercharge, and electroweak unification . . . . . . . . . . . 96
6.7.1 Constructing the gauge group . . . . . . . . . . . . . . . . . . . . . 97
6.7.2 Hiding the electroweak symmetry . . . . . . . . . . . . . . . . . . . 99
6.8 CP violation and the CKM matrix . . . . . . . . . . . . . . . . . . . . . . 104
6.9 Neutrino oscillations and the PMNS matrix . . . . . . . . . . . . . . . . . 106
6.10 The weak interactions as a gauge theory . . . . . . . . . . . . . . . . . . . 109
6.11 Anomalies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
6.12 Landau-poles, triviality, and naturalness . . . . . . . . . . . . . . . . . . . 115
6.13 Baryon-number violation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
6.14 The early universe . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
6.14.1 Global symmetry . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
6.14.2 Abelian case . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
6.14.3 The electroweak case . . . . . . . . . . . . . . . . . . . . . . . . . . 125
Particle Physics
1.1 Introduction
Particle physics is the science of the smallest constituents of matter, and how they interact.
In a sense, it evolved out of physical chemistry, where it was first resolved that all chemical
substances, like molecules, are made out of set of chemical elements, of which we know
currently roughly 120. At the turn of the 19th to the 20th century, it was then found that
the atoms themselves were not elementary, but rather had constituents - a single nuclei,
which was characteristic for the element, and a number of indistinguishable electrons. The
latter number was in turn uniquely fixed by the nucleus.
In the early 20th century it was then found that the nuclei themselves are not elemen-
tary, but were made up out of just two types of constituents, the protons and neutrons,
commonly denoted as nucleons. Again, and as will be described later in detail, these nu-
cleons are not elementary, but are made up out of the quarks. At the current time, we
do not know, whether either quarks or electrons do have a substructure, but substantial
effort is invested to find out.
As is visible from this short historical remark, the study of elementary particles has
changed subject many times over the course of time. Today, elementary particle physics is
considered to be the study of the most elementary particles known to date, their interac-
tions, and whether there could be even more elementary constituents. Today, such studies
are inseparable inked to quantum mechanics, as quantum effects dominate the world of
the elementary particles.
It is this field which will be presented in this lecture. After some more general remarks,
the three known fundamental interactions, as well as the known particles will be presented.
Their theoretical description together constitutes the standard model of particle physics.
This theory is well established, and has been experimentally verified in the accessible range
1
2 1.1. Introduction
of parameters. But there are several reasons which prove that it can not be the final theory
of particle physics. These will be briefly summarized at the end of the lecture. Thus, this
lecture can only be taken as a snapshot of our current understanding of particle physics.
Suspiciously absent from these forces is gravity. The reason for this is twofold. On the
one hand, so far nobody has found a reliable way how to quantize gravity, though this
is a very active area of research. This quantum gravity is usually also considered to be
part of particle physics. However, because of the complications involved in formulating it
theoretically, there is not (yet) a standard model of quantum gravity, and just to describe
the more promising candidates in a fair way is a lecture of its own, and rather specula-
tive. The other is that also experimentally no particle has been yet observed which can
be considered as an elementary particle of gravitation, the hypothesized graviton. Thus,
quantum gravity is not yet part of the standard model of particle physics. This is proba-
bly the most stringent evidence for the incompleteness of the standard model of particle
physics.
As the standard model of particle physics in its current form has been theoretically
established at the turn of the 60ties and 70ties of the 20th century, with the final major
theoretical touches added in the early 80ies, there are many excellent textbooks on its
concepts. Especially, there are many theoretical textbooks on its foundations. The present
lecture, however, is a more phenomenological introduction. For the concepts, this lecture is
based on a number of older textbooks, but also some books on quantum-field theory have
been used. However, there are numerous textbooks on the topic, with very different styles.
Giving therefore a reasonable recommendation, especially in a field evolving as quickly as
particle physics, is of little value. At the same time, identifying a suitable textbook to
learn new concepts is a very helpful exercise for latter literature studies. Therefore, no
general recommendations for this lecture will be given.
However, the latest experimental results for the standard model are from the year 2012,
and two quantities still remain to be determined, as will be discussed below. Furthermore,
the formulation of a theory is one problem. Determining all its consequences theoretically
is a much harder problem, which is in many respects still unsolved today. We are far
from the point where we can compute any given quantity in the standard model to any
given accuracy. Furthermore, experimental results are never exact, and have errors. So, in
many cases we know for deviations from the standard model only upper limits, and cannot
exclude something is different. Thus, our knowledge of the standard model is in continuous
motion. The best repository of the current knowledge is from the particle data group,
which is accessible for free at pdg.lbl.gov. The most important discoveries (and information
for this lecture) can also be obtained from my twitter feed, twitter.com/axelmaas.
Chapter 1. Particle Physics 3
Note that a lecture on particle physics without full-fledged quantum-field theory can
necessarily often only give hints to the origin of many phenomena. The main aim is to
make you acquainted with the general structure, with concepts, and with ideas. A full
understanding of the details is necessarily relegated to quantum-field theory, and, in part,
advanced quantum field theory courses. Wherever possible, I will try to give as much
insights from these, without going into the technical details, skipping calculations for
hand-waving arguments. As a result, with the knowledge of this course, you will be able
to follow an experimental or theoretical overview talk at a mixed conference, but it will
not be sufficient to follow a specialist talk in a parallel session, or likely an overview talk
at a purely theoretical conference. Nonetheless, even in the latter cases, many of the ideas
and underlying physics should sound then at least familiar to you. Eventually, the aim of
this lecture is to prepare you for a full quantum-field theory course on particle physics.
which therefore have earned their own name, electron volts (eV), and Fermi (being identical
to a femtometer). Typical energy scales are actually rather 109 eV, a GeV.
Furthermore, in particle physics the system of unit is further modified. Since units
are man-made, they are not necessary, and the only thing really useful is a single scale to
characterize everything. This is achieved in the system of natural units, which is obtained
by setting
c = ~ = kB = 1
leaving only one unit. This is usually selected to be GeV, and thus everything is mea-
sured in GeV. E. g., times and lengths are then both measured in GeV1 . In certain
circumstances, GeV are exchanged for fm, which an be easily converted with the rule
~c = 1 0.1973 GeV fm
like MeV, being 103 GeV, or TeV, being 103 GeV. Also other dimensionful quantities are
then expressed in these units. E. g. Newtons constant is roughly 1.221019 GeV.
field theories, and of most hypothesized ones, implies a deep relation between the statistical
properties of particles, the Lorentz group, and the spin of a particle. It essentially states
that for any Lorentz-symmetric, i. e. special relativistic, quantum theory in more than
two (one space and one time) dimensions every particle obeying Bose-Einstein statistics
has integer spin, and any particle obeying Fermi-Dirac statistics has half-integer spin.
There are no further possibilities for the spin values. Thus, bosons have integer spin, and
fermions have half-integer spin. The electron, e. g., is thus a fermion, with its spin of 1/2.
The photon, with its spin of 1 is therefore a boson.
Note that spin is an intrinsic property, like rest mass, of an elementary particle. It is
not something which can be constructed from any properties, and has to be determined
in experiment, at least inside the standard model. For a bound state, however, the spin
can be computed as a function of the spins of its constituents.
A further important consequence is that the wave-function of a system with n particles
has to be antisymmetric under the exchange of two fermions, while it is symmetric under
the exchange of two bosons. This already illustrates the Pauli-principle. If there would be
two particles of the same quantum numbers (and thus of the same type, since type is a
quantum number) at the the same place, their wave-function must change sign under an
exchange. At the same time, they are identical, and thus numerically it cannot change.
Since the only quantity which at the same time changes and does not change sign is zero,
any such state has to be zero when the particles are at the same place, and thus they
cannot be - this is the Pauli principle. On the contrary, for bosons the wave-function
remains unchanged under an exchange of two identical bosons.
which has a finite life-time. It decays because of the annihilation of the two particles,
decaying into two or three photons, depending on how the spin of the electron and anti-
electron had been aligned with respect to each other.
Since this is a very fundamental prediction of our understanding of particle physics,
great experimental effort is invested to check whether particles and anti-particles really
have the same properties, e. g. the same mass. Particular attention has been devoted to
produce anti-atoms, and measure their spectrum, as this is a very sensitive test of this
theoretical prediction. So far, no deviation has been found. Still, although the production
of anti-matter is tedious, and yields are measured in individual atoms, rather than grams,
these tests remain the most sensitive ones of the foundations of particle physics.
1.6 Interactions
The annihilation of electrons and positrons into photons is an example for an interaction,
in this case an electromagnetic interaction. Writing on a black board or not falling through
the floor are also examples of electromagnetic interactions. The earth orbiting the sun is
an instance of a gravitational interaction. Generically, anything, except the Pauli prin-
ciple, what makes a particle aware of the presence of another particle is classified as an
interaction. This is called a dynamical effect. The propagation of particles, on the other
hand, is just a kinematical effect.
Besides the knowledge of all the elementary particles, it is also necessary to know
the interaction of them with each other to write down a theory describing them. Such
an interaction is not necessarily restricted to connect only two particles. The maximum
number known so far is a four-particle interaction, and any interactions involving more
particles than four can be broken down to those with less particles. This does not mean
that it is not possible to have more; theoretical, this is possible, but there is no experimental
evidence that it is needed so far. Also, there can be interactions which appear at a coarse
scale to be an interaction involving more than four particles, but on a fine scale this is not
the case.
There are two particular properties of interactions, which deserve a special mentioning.
One is that interactions are not totally arbitrary. Rather they only occur between specific
particles. E. g. the aforementioned annihilations only proceeds with one electron, one
positron, and two or more photons. It will not occur with two electrons and two photons,
or one electron, a proton, and two photons. The reason are conservation laws, in this
case the one of electromagnetic charge. Throughout many more conservation laws will be
encountered, and any interaction will strictly respect any exactly conserved quantity.
Chapter 1. Particle Physics 7
1
Actually, this is not an elementary relation like the Heisenberg uncertainty principle, but follows from
identifying particles with wave packets.
Chapter 2
Scattering experiments
The primary tool to understand particle physics today experimentally are scattering ex-
periments, i. e. letting particles collide with each other. The reason has been allude to
before: When two particles collide they interact, and their kinetic energy can be used to
create new particles. The higher the energy, the more massive particles can be created.
Furthermore, because of the DeBroglie relation that wave-length and momentum are re-
lated by = 1/p, only high energies permit to resolve very small structures, and therefore
permit to investigate whether a given particle has a substructure.
Of course, due to the presence of virtual particles, it would in principle be possible
to also investigate everything using low-energy collisions or even measurements of static
properties of a particle. However, as will be seen later, the required precision to resolve new
particles in most cases drops with increasing energy. Therefore, to access new particles in
low-energy collisions requires very precise experiments. For some cases, particular effects
reverse the situation. As a consequence, today both low-energy precision experiments and
high-energy collisions work hand in hand together, both having their own importance.
The formulation of scattering experiments is thus central to particle physics. Almost
all discoveries in particle physics have been made with such experiments. As a consequence
also most modern theoretical tools have been developed and optimized to describe scat-
tering experiments. Thus, at least an elementary understanding of scattering processes is
a necessity in particle physics.
2.1 Non-relativistic
Many of the necessary concepts can already be understood non-relativistically, which offers
a simpler formalism.
8
Chapter 2. Scattering experiments 9
repeated many (many billion) times to obtain a distribution of scattering angles. The
number of particles scattered into a solid angle d = sin dd is then defining this cross
section as
dN
(, ) = .
d
In practice, there are two quantities, which can be derived from the cross section, which
are of particular importance. One is the total cross section
Z
t = d,
(, , x)
x = ,
x
where x can be any kind of variable, including also and themselves, on which the
cross-section can depend. Also higher order differentials could be useful, so-called multiple-
differential cross sections.
The unit of a cross section is an area. A useful unit to measure it in particle physics
is barn, abbreviated b, corresponding to 100 fm2 . Although nowadays usual interesting
cross-sections are of the order of nb to f b.
Predicting such cross sections is one of the most important tasks when one wants to
test a given theory against modern particle physics experiments. However, this is quite
an indirect process. The cross-section is calculated as a function of the parameters of a
theory, which includes also the type and number of elementary particles. This result is
then compared to the experimental result, an by comparison of how the cross-section is
then depending on external parameters (e. g. the angle or the type of particles involved),
the theory is either supported or falsified.
This becomes complicated in practice, if the cross section has contribution not only
from the theory in question, but especially also from other known origins. E. g., when
searching for yet unobserved physics, one has to subtract any contribution of known physics
from the measured cross-section. Especially if the known contribution is the dominant
one, this is difficult, as it is practically impossible to calculate it to arbitrary precision.
It is also not possible to derive it from experiment; there is no possibility to continue
unambiguously an experimental result from one case to another, without performing a
theoretical calculation. Thus, this background reduction of known processes has become
one of the major challenges in contemporary particle physics.
Chapter 2. Scattering experiments 11
2.1.3 Luminosity
The definition of a cross-section yields immediately another figure of merit for a modern
experiment, the luminosity. Technically, it is defined as the number of particles in two
beams, N1 and N2 , which interact in an (effective) area A with frequency f ,
N1 N2 f
L= ,
A
i. e. this is the number of processes per unit time and unit area. In modern particle physics
experiments, the beam is usually bunched into n packages, which act like there would be
several beams, and thus multiply the luminosity by n.
More important is the integrated luminosity,
ZT
L= dtL,
that is how many collisions per unit area occur during an experiment of temporal length
T . From this quantity, the number of events N occurring with a cross-section can be
calculated as
N = L.
Especially, to obtain one event occurring with cross-section during the time of the
experiment, an integrated luminosity of L = 1/ is necessary. This permits to directly
estimate whether an experiment is able to measure a process at all. Typical values for
integrated luminosity at the LHC are 25 fb1 until the end of 2012, and at least 300 fb1
until 2018.
A concept which will become important later on is the so-called partial (or later parton)
luminosities. Assume that the beams consist not out of a single type of projectiles, but
several. Than one can define for each possible pairing an individual partial luminosity.
2.2 Relativistic
2.2.1 Repetition: Relativistic notation
A non-relativistic description is, unfortunately, not sufficient in particle physics. There are
three reasons for this limitation. One is that there are massless particles, which necessarily
move with the speed of light, and therefore require a relativistic description. The second
is that many regularities of particle physics are very obscure when not viewed from a
relativistic perspective. Finally, in the reactions of elementary particles the creation and
12 2.2. Relativistic
x1 = x1
x2 = x1
x3 = (x3 x0 )
x0 = (x0 x3 )
1
= (1 2 ) 2
which leave the length x2 = x20 ~x2 invariant. The quantity x denotes the corresponding
four-vector with components x in contrast to the spatial position vector ~x with compo-
~ is in natural units just the speed.
nents xi . Note that = ||
The scalar product of two four-vectors can be obtained using the metric g = diag(1, 1, 1, 1),
i. e.,
x2 = x x = x g x ,
i. e., the metric lowers and raises indices, which distinguish covariant and contravariant
vectors. Though there are profound geometric differences between both, these have es-
sentially no bearing on particle physics in the context of the standard model of particle
physics.
From this follows the definition of the relativistic momentum and energy,
p = m
E 2 = p~2 + m2 (2.1)
where m is the (rest) mass of the particle, an intrinsic and immutable (at least within
known particle physics theories) property of any given type of particle. The concept of a
relativistic mass, defined as m is actually not necessary nor relevant in particle physics,
since rest mass, and spatial momentum completely characterize the state of a particle.
Note that for ~p = ~0 the relation (2.1) implies E = m, and thus rest mass and energy in
the rest frame are exchangeable concepts. Especially, this relation implies that mass and
energy can be freely converted into each other in special relativity.
An important special case are massless particles, which always move at the speed of
light, i. e. = 1. In that case, E = |~p| precisely. Note that is neither well-defined, nor
needed, when describing massless particles.
Chapter 2. Scattering experiments 13
The relation (2.1) is only true for real particles. The virtual particles stemming from
quantum fluctuations introduced earlier do not fulfill it. Such particles are called off-
shell, since the relation (2.1) defines a surface in momentum space, called the mass-shell.
Particles fulfilling (2.1) are consequently called on-shell.
One further consequence relevant to particle physics is time dilatation, i. e. the time
(difference) observed in a frame moving relative to the rest frame of the particle is longer
than the eigentime in the rest frame as
= . (2.2)
Of course, this also implies that a moving particle experiences in its own rest frame a
shorter time than outside. This will be very relevant for objects with a finite life-time: They
exist the longer the faster they are. This has especially consequences for the experimental
accessibility of massive particles.
Elastic scattering proceeds then in the relativistic case just as in the non-relativistic
case. The only exchange is that the separate conservation of spatial momentum and energy
is replaced by the conservation of four momentum,
p1 + p2 = q1 + q2 .
Still, the identities of the particles are conserved in an elastic scattering, and therefore
their rest mass is not changed. Also, there is still only one scattering angle characterizing
this (also still coplanar) reaction.
However, an interesting new possibility are now inelastic scatterings where particles
are produced or destroyed1
1
In principle, e. g. by breaking up into more than two fragments, there are also non-relativistic inelastic
scattering processes possible, where the total mass is conserved. Though this plays an important role in,
e. g., chemistry, this possibility will be ignored here .
14 2.2. Relativistic
In the center-of-mass frame, where the initial 3-momentum vanishes, it is thus in principle
possible to transform the rest energy and the kinetic energy of the initial particles only
into the rest energy of a number of new particles. This is called an on-resonance produc-
tion. Especially, it is possible to create particles which are heavier than the original ones,
provided the kinetic energy is large enough. This explains why higher and higher energies
are required in particle experiments to obtain heavier and heavier particles.
Of course, this can be generalized to more than two incoming particles. However,
in practice it is almost impossible in an experiment to coordinate three particles such
that they collide simultaneously. Hence, this will not play a role in the following. It
is not irrelevant, though. If the number of collisions becomes much larger than currently
technically achievable, this can happen for an appreciable fraction of the events. Inside the
sun such huge numbers are possible, and then three-body interactions become important
for the thermonuclear processes. In fact, without a certain reaction of three helium nuclei
to a carbon nucleus, the production of heavier elements in the sun would like not occur in
the way it does in nature.
This also yields another important way of how to deconstruct cross sections. Assume a
collision of two particles of type x as the so-called initial state. As a quantum mechanical
process, the reaction will not only lead to a single outcome, a single final state, i. e. only
one set of final particles. If it is possible, given the conservation of four-momentum and any
other constraints, that there are more than one possible outcomes yi , they all will occur.
Their relative frequency is given by the details of both the reaction and the underlying
interactions. These rates ni for n final states yield the partial cross-sections,
ni
2xi = lim Pn .
i ni
ni
Partial cross sections can also be defined in a broader sense. Instead of having precisely
defined particles in the final state it is e. g., possible to define a partial cross-section for n
particles of any type.
Partial cross-sections play an important role in experiment and theory alike. Measuring
a total cross-section is called an inclusive measurement, while identifying some or all
final particles and their properties are called exclusive measurements. At the same time,
theoretical calculations permit to make a judicious choice of a final state in which the
Chapter 2. Scattering experiments 15
background due to known physics is small or vanishing, thereby increasing the signal-to-
noise ratio. Determining partial cross-sections is therefore a very important task.
2.3 Decays
In a sense a very particular type of inelastic cross-section is the decay of a single particle,
i. e. a cross-section with a single initial particle, and two or more final particles. A decay
occurs if a massive particle can, in principle, be converted into two or more lighter particles.
For this to be possible, the rest mass of the original, or parent, particle must be larger
than the sum of the rest masses of the daughter particle, plus possible a certain amount of
energy to satisfy spatial momentum conservation. Furthermore, there may be additional
constraints like that electric charge is conserved.
Since the decay of a particle is a quantum-mechanical process, it does not occur after
a fixed decay time , but occurs at a time with an exponentially decaying probability,
exp( /2), where is the so-called decay width. The value of the decay width is
determined by details of the interaction, but is in most cases the larger the larger the
difference between the rest mass of the initial particle and the total rest mass of the final
particles is. Furthermore, the ratio /m is used to characterize how unstable a particle is.
If this ratio is small, the particle is rather stable, as it can traverse many of its Compton
wave-lengths before decaying. Such particles are also known as resonances. If the ratio is
close to or larger than one, the particle is not really behaving as a real particle, and there
is no final consensus on whether such objects should be regarded as particles or not. Most
particles well accessible in experiments have small widths.
However, quantum mechanically all possible outcomes can always be realized, just
with differing probabilities. Therefore, if it is kinematically, and with respect to all other
constraints, possible for a particle to decay into different final states, this will occur. Each
possible outcome is called a decay channel, and is characterized by a corresponding partial
decay width i , The sum of these widths together can be shown to yield the total decay
width, which determines the life-time of the particle.
For the experimental measurement of the properties of resonances time dilatation (2.2)
plays an important role. Since in experimental particle physics decay products are often
produced at considerable speeds, their life-time in the experiments rest frame is usually
much larger than the life-time in their respective rest-frame. As a consequence, they can
travel over considerable distances away from where the original particle has been produced,
which permits to disentangle the processes of resonance production and decay. This also
permits to much better determine the decay products, which in turn permit to identifying
16 2.4. Cross-sections and particles
Thus, the divergences becomes a peak with a width determined by the decay width of the
particle, which is therefore also accessible to an experimental measurement.
Of course, this is an idealized situation. Higher order processes or processes of a dif-
ferent origin can all contribute to the cross-section. Already a second resonance close by
could distort the cross-section appreciably. Furthermore, quantum mechanically interfer-
ence processes could suppress a peak or, even worse, create a fake peak. This has been
observed, and is therefore not just an academic possibility.
Chapter 2. Scattering experiments 17
Finally, if the process is inelastic, there could be more than just two particles in the
final states. Then the cross-section becomes more involved, and the presence of resonances
is not just a simple peak. There are advanced techniques to deal with such situations, like
the Dalitz analysis, which now lead too far.
One should not that the formula (2.4) is only accurate if /m is small. For resonances
with a large width, the shape can be significantly distorted to the point where no real
peak is observable at all.
can perform. Especially vertices are restricted by conservation laws, like four-momentum
conservation.
However, in practice for a reasonable complicated theory the number of diagrams
grows factorial with the order of the expansion. Thus higher-order calculations require
sophisticated methods to deal with the logistical problems involved.
Despite the power of these expression, it should always be kept in mind that already a
hydrogen atom cannot be described with these methods.
Finally, it has turned out that also beyond perturbation theory a graphical language
can very often be introduced in a mathematical precise way. Hence, Feynman-graph-
like representations are ubiquitous in particle physics, though may mean very different
mathematical objects. Especially, they are not necessarily restricted to a series expansion,
and it is possible to encode also the full content of a theory graphically. Thus, one should
always make sure what entities a given set of graphs corresponds to.
Chapter 3
The first example of a particle physics theory, the first part of the modern standard model
of particle physics, to emerge was quantum electrodynamics, QED. It already contains
many of the pertinent features of the more complex standard model, while at the same
time is much simpler. It is therefore the ideal starting point to get acquainted with a
particle physics theory.
Quantum electrodynamics is describing the existence and interaction of electromag-
netically active particles. As such, it is the quantum generalization of the classical elec-
trodynamics, which already had been implementing special relativity.
19
20 3.1. Electrons and photons
The second role is that of the force carrier. Such particles are exchanged by charge
carriers to transmit the force. The force carriers are thus the quanta of the electromagnetic
field, which transmit electromagnetic interactions classically. Thus, the particles carrying
the electromagnetic force are the quanta of the electromagnetic field, called photons. These
are massless2 bosons of spin one, and therefore so-called vector bosons.
~ and B,
In classical electrodynamics it is the electromagnetic fields E ~ which are associ-
ated with the force. These fields can be derived from the vector potential A,
Bi = ijk j Ak
Ei = i A0 0 Ai .
~ E(~
~ x) = i Ei = 4(~x)
~ B(~
~ x) t E
~ = ijk j Bk (~x)~ei t E~ = 4~j(~x)
~ E(~
~ x) + t B(~
~ x) = ijk j Ek (~x)~ei + t B(~
~ x) =0
~ B(~
~ x) = i Bi =0
~ ~j + t =
i ji + t = 0,
which describe the interaction between the electromagnetic fields and the sources. How-
ever, already in the quantum-mechanical Hamilton operator of a (spin-less) electron
1
H= (ii eAi )2 + eA0 , (3.1)
2m
it is instead the vector potential that couples to the particle. Thus, the photons are
actually the quanta of the vector potential A, instead of the E and B fields3 . Classically,
~ and B
for any given E ~ fields the corresponding vector potential is not uniquely determined.
It is always possible to perform a gauge transformation
A A + , (3.2)
2
The concept of mass in quantum field theory is a non-trivial one. What precisely is meant by the
statement that the photon has zero mass involves a significant amount of technical subtleties, which will
be glossed over here. A full discussion requires a treatment of quantum field theory beyond perturbation
theory.
3
It is actually possible to write down a quantum theory also in terms of the E ~ and B~ fields. However,
such a theory is much more complicated, especially it contains non-local interactions, and is almost in-
tractable analytically. Hence the formulation in terms of the vector potential is used essentially exclusively.
It is called the localized version, since no non-local interactions are present.
Chapter 3. A role model: Quantum electrodynamics 21
with some arbitrary function . This is also true quantum-mechanically, except that also
the wave-function of the electron needs to be modified as
exp(ie). (3.3)
Thus, the quanta describing electrons and photons are not uniquely defined, but can
be altered. It can, however, be shown than any experimentally observable consequence
of the theory does not depend on the the function , and thus on the choice of gauge.
Theories with these features are called gauge theories. All theories, which have been
experimentally supported, which we know today contain at least one gauge field. They
therefore represent the archetype of particle physics theories. QED is just the simplest
example for such a gauge theory. Later on, much more complicated gauge theories will
appear, which eventually form the standard model of particle physics.
Since the gauge symmetry requires that the ordinary derivative in the Hamilton (3.1)
is replaced by the combination of the ordinary derivative, , and the gauge field Ai ,
Di = i eAi , this combination is called the covariant derivative. The name covariant
stems from the fact that the whole operator transforms in such a way under a gauge
transformation, as to make the whole Hamiltonian gauge-invariant4 .
It is worthwhile to mention that the gauge symmetry is essentially what was called a
redundant variable in classical mechanics. In classical mechanics, the usual first step is to
eliminate all redundant variables by enforcing all constraints. This is not done in particle
physics, and the presence of gauge transformations (3.2-3.3) is just a manifestation of this
redundancy. Keeping this redundancy is, in contrast to classical mechanics, technically
much more advantageous than eliminating it. The presence of this gauge degree of freedom
is thus rather a mathematical convenience, than a genuine feature of nature.
As noted before in section 1.5, there are anti-particles to every particle, and especially
the positron as an anti-particle to the electron. It is thus also a spin 1/2 fermion with the
same mass, but opposite electric charge e = 0.3. The photon is uncharged. It is therefore
its own anti-particle, and there is no anti-photon which could be distinguished by any
means from the photon.
(QFT). QFT is technically much more complicated than ordinary quantum mechanics,
and is beyond the scope of this lecture. However, it is very useful to understand and use
the way QFTs are formulated, and this provides a quick way of representing a theory.
Furthermore, significant insight can be gained already by treating the QFT classically,
and thus just like classical electrodynamics, which is a classical field theory.
This should now be exemplified for the case of QED.
The simplest part is just starting with Maxwell theory, i. e. the theory of free electro-
magnetic fields. Since the Hamilton operator is not a Lorentz-invariant it is usually not
the best way to formulate a relativistic theory. A better starting point is the Lagrange
function L. For a field theory, this changes to a Lagrangian density L, or briefly denoted
by Lagrangian. Classical Maxwell theory takes the form
1
L = F F (3.4)
4
F = A A , (3.5)
which is explicitly invariant under the gauge transformation (3.2). The anti-symmetric
tensor F is the field-strength tensor, with components formed by the electromagnetic
fields. The fact that the photon is a spin one boson can be read off from the Lagrangian
since it carries a single Lorentz index, and therefore transforms like a vector, and is de-
scribed by an ordinary field.
Adding fermions, and thus electrons, is subtle. Fermions obey the Pauli principle.
There is no classical object, which does the same. To formulate a field theory for them,
it was necessary to construct mathematical entities which encode this property, so-called
Grassmann numbers. A Grassmann number has the property that it squares to zero, i.
e. 2 = 0, and anti-commutes with any other Grassmann number , = . This looks
rather strange at first,but can be dealt with quite straightforwardly in practice. Here,
however, it is only necessary to remember that one cannot commute fermion fields, which
are fields of Grassmann numbers, at will.
Spin 1/2 fermions, like electrons, are described by four complex Grassmann variables,
which are collected into a column matrix . The anti-particle can then be shown to be
described by the field = + 0 with 0 a four-dimensional matrix, which is unitarily
equivalent to diag(1, 1, 1, 1). The total Lagrangian of QED then becomes
1
L = F F + (i ieA ) m, (3.6)
4
where m is the mass of the electron. The fields representing the electron, the so-called
spinors , transform under gauge-transformations like the quantum-mechanical wave-
functions, and therefore this Lagrangian becomes gauge-invariant. These spinors are four-
dimensional vectors of complex Grassmann numbers. Though they are four-dimensional
Chapter 3. A role model: Quantum electrodynamics 23
they do not transform like vectors under Lorentz transformations, but differently, hence
the name spinors. The details of this will not be relevant in the lecture, and are therefore
skipped.
Furthermore, the matrices i form together with 0 the Dirac matrices , and fulfill
the so-called Clifford algebra
{ , } = 2g .
Note that despite the suggestive notations these matrices are invariant under Lorentz
transformations, but play an important role in showing that this theory is, indeed, Lorentz-
invariant. Nonetheless, the metric can be used to define -matrices with raised indices as
well. The details o this are beyond the present scope. However, it is important to note
that the quantity is a scalar and is a vector under Lorentz transformations. It
is furthermore possible to define the matrix 5 = i0 1 2 3 , which can be used to create
a pseudo scalar 5 and an axial vector 5 .
Besides the presence of photons and (massive) electrons and positrons another property
of the theory can be read off the Lagrangian (3.6). There is a term which involves not
two, but three fields, the interaction term ie A . This is the first example of an
interaction vertex, which specifies an interaction of three particles, an electron, a positron,
and a photon. They interact with the coupling strength e, the electromagnetic coupling.
The remainder factors i ensure that this respects Lorentz invariance, and that it yields
a meaningful quantum theory. Note that there is no term describing interactions with
more than three particles in this Lagrangian. This will change later on. Furthermore, the
relation between positron and electron requires them that a mass term includes both of
them, and there are no separate mass terms for both particle species.
It is a further remarkable feature that the involved fields are functions not only of
space, but also of time. Thus, a field configuration, i. e. any given field, represents a
complete space-time history of a universe containing only QED.
As emphasized before, the Lagrangian is a classical object, i. e. the fields are not
operator-valued. The quantization yields a QFT, a topic treated in a separate lecture.
Here, it suffices to use the language of Lagrangians to specify a theory.
describe an arbitrary number of particles of the corresponding type. Thus, since the
corresponding particles are only excitations of the same field, they cannot be distinguished.
How can such particles then be identified at all? This requires to understand what a
particle in the context of a field theory really means. A single particle is usually identified
with an (almost) Gaussian excitation of the field5 , and therefore exponentially localized.
Two separate particles are therefore two Gaussian excitations, where the two peaks are
separated far compared to the widths of the peaks. Thus, the two particles can be iden-
tified, but since they are just the same excitation type of the field, they are nonetheless
indistinguishable. This can be repeated for as many particles as desired, as long as the
separation is large compared to the widths, creating many indistinguishable particles.
If the two particles come close to each other, the peaks overlap, and it no longer
makes sense to speak of individual particles. Interactions play a role, and particles in
the conventional sense only reemerge when the particles have moved far away from each
other. In a perturbative description of a scattering process, the starting point are two such
Gaussians sufficiently far apart as to ignore any remaining overlap. The same applies to
the final states. That is what is called asymptotic states.
representation.
Such structures will again reappear for the other parts of the standard model, and are
also present in gravity. So far, only the fundamental and adjoint representation play a role
in particle physics, though several proposals exist for theories beyond the standard model
in which particles in different representations would appear. They would then transform
differently under gauge transformations. The precise form is given by the relevant gauge
group, and is mainly an exercise in group theory.
For comparison, gravity can also be thought of as a gauge theory, with as gauge group
the non-Abelian Poincare group SO(3,1)T, where T is the (Abelian) translation group.
This is, however, beyond the scope of this lecture.
are much heavier than the electron. Otherwise their properties are the same as for the
electron: They are fermions with spin 1/2 and negative electric charge. They can be easily
included into the Lagrangian (3.6),
1 X
L = F F + f (i ieA mf )f , (3.7)
4 f
where f now counts the so-called flavor, i. e. type, of the fermions, and can be electron,
muon, and tauon. In QED, this flavor is conserved, i. e. it is not possible to convert the
heavier muon into an electron. Later, additional interactions will be encountered, which
change this. Then neither the muon nor the tauon are stable, and thus are encountered
in nature only as short-lived products of some reaction.
The electron, muon, and tauon are otherwise exact replicas of each other. They form
the first members of three so-called families or generations of elementary particles, which
will have, except for the mass, otherwise identical properties. Their presence, and the
relative sizes of their masses form one of the most challenging mysteries of modern par-
ticle physics, subsumed under the name of flavor physics. However, several interesting
phenomena we observe in nature are only possible, if there are at least three such families,
and some will be encountered later on.
To distinguish the electron, muon, and tauon from other particles to be discussed later,
they are also collectively denoted as leptons.
may be created with arbitrarily large energies. These arbitrarily large energies then induce
divergences in the mathematical description, and all results become either zero or infinite.
This is, of course, at first sight a catastrophe, and for a while it seemed to mark the
end of quantum field theory. It is actually no an artifact of perturbation theory, as one
might suspect; though there are theories where this is the case, this is not so for those
relevant for the standard model of particle physics.
Today, this failure has been understood not as a problem, but rather as a signal of the
theory itself that it cannot work at arbitrarily high energies. It signals its own breakdown,
and the theory is, in fact, only a low-energy approximation, or a low-energy effective
theory, of the true theory. This is then as expected, since gravity is not yet part of the
standard model, but is expected to play an important role at very high energies. Hence
the prediction of failure is consistent with our experimental knowledge on the presence of
gravity.
This insight alone does not cure the problems, however. Fortunately, it turns out that
a certain class of theories exist, so-called renormalizable theories, where this problem can
be cured in exchange for having a finite number of undetermined parameters. In case
of the full QED Lagrangian (3.7) these are four: The strength of the electromagnetic
coupling and the three masses. We exchange the divergences in favor of having these four
numbers as an input to our theory, which we have to measure in an experiment. Once
these parameters are fixed, reliable predictions at not too high energies can be made,
where not too high for the standard model covers an energy range larger than our current
experimental reach, probably much larger.
This exchange, a process known for historical reasons as renormalization, was the final
savior of quantum field theory, making it again a viable theory. It also implies that a
renormalizable theory cannot be a complete description of physics: The parameters of the
theory cannot be predicted from inside the theory, and external input is needed. Whether a
completely parameter-free theory can be formulated, which describes nature, is not known.
7
There are some theories, so-called conformal theories, where this is not the case. None of the theories
realized in nature is of this type, though. This can actually be excluded exactly from experiment.
Chapter 3. A role model: Quantum electrodynamics 29
coupling (). It shows that the running coupling satisfies the differential equation
de e3
= (e) = 0 + O(e5 ), (3.8)
d ln 16 2
where is the so-called -function, and i are its Taylor coefficients. The latter can be
calculated in perturbation theory. Truncating the series at the lowest order, the differential
equation can be integrated, and yields
e(2 )2 (20) 4
(2 ) = = 2) 2 , (3.9)
4 (
0 ln 2
2
1 + 40 0 ln 2
0
where 0 is the energy where the experimental input (0 ) has been obtained. The so
introduced quantity is often called the scale of the theory, and is a characteristic energy
of the described interaction. The value of 0 for QED is -4. This implies that the running
coupling becomes larger with increasing energy, until it eventually diverges at a very large
energy scale.
This divergence is the so-called Landau pole. Similar problems arise also in the other
running quantities. The presence of such a Landau pole can be traced back to the use of
perturbation theory. Before reaching the Landau pole, the running coupling, which is the
expansion parameter, becomes larger than one, and therefore the perturbative expansion
breaks down. Thus, a Landau pole is a signal of the breakdown of perturbation theory.
When this point is reached, it is necessary to resort to non-perturbative methods. Even
for QED such a full non-perturbative treatment is not simple. The results, however, indi-
cate that QED is also breaking down beyond perturbation theory, and the only quantum
version of QED which make sense is the one with e = 0, a so-called trivial theory. This is
the triviality problem. It is assumed that this effect is cured, once QED is embedded into
a larger theory, in this case the standard model of particle physics. This is a recurring
problem, and will be discussed in more details later.
Chapter 4
31
32 4.1. Symmetries, groups, and algebras
ton operator, they lead in general to conserved quantum numbers. These can be either
multiplicatively or additive. This means that a for a composite state the quantum num-
bers of the constituents are either multiplied or added. Discrete symmetries usually lead
to multiplicative ones, while continuous ones to additive ones. Note that not all states
have well-defined quantum numbers, even if a symmetry exists, and that a composite state
can have well-defined quantum numbers even if the constituents do not have.
Furthermore, it is necessary to introduce a few notions of group theory. Later, in
section 5.15, this will be further elaborated. For now, it is sufficient to introduce a few
concepts. Most of these concepts are already familiar from quantum mechanics, especially
from the theory of atomic spectra, and thus angular momenta. However, these concepts
require generalization in the context of particle physics.
The basic property of a symmetry is that there exists some transformation t, which
leaves physics invariant. Especially, such a transformation must leave therefore a state |si
invariant, up to a non-observable phase. Thus, a transformation has to be implemented by
a unitary or anti-unitary operator, T . Especially in cases of continuous symmetries, it is
possible to act on any state repeatedly with different values of the relevant parameter(s),
T (1 ) = T1 , T (2 ) = T2 yielding T2 T1 |si. There is also always a unit element, i. e. one
transformation which leaves every state invariant. It is also furthermore for a unitary
transformation possible to reverse it. Hence, such transformations have a group structure,
and it is said that they are the corresponding symmetry group.
An example has been the group of the gauge transformations from section 3.4. There,
the group was SO(2)U(1). The group elements were the phase rotations exp(i(x)), with
space-time-dependent parameters. It was thus a local symmetry. The unit operator is thus
just the one, with = 0. The inverse one is just exp(i(x)), the complex conjugated
one, as expected for a unitary operator.
Since the application of two gauge transformations commute, this symmetry group is
Abelian. However, a consecutive application of more than one group element need not be
commutative,
T1 T2 |si =
6 T2 T1 |si.
where the i are Hermitian operators. These operator form the so-called generators of
the symmetry group, and form themselves the associated symmetry algebra. Hence, the
gauge fields transform not with the group, but with the algebra instead.
To separate both cases, it is said that the electron states and the gauge fields transform
in different representations of the gauge algebra.
i. e.
L = L( + , + ) L(, ) = K , (4.1)
where the equation of motion (4.2) were used in the derivation and differentiations have
been exchanged. This implies that the Noether current
L
j = K ,
or its corresponding extension to arbitrary sets of fields
L L
j = K i i
i i
is conserved, j = 0. Hence, the associated charge
Z
Q = d3~xj0 ,
where ~n is the unit vector on the surface S, stemming from the application of Gauss law3 .
Thus, the only change in the total charge is by the migration of charge carriers through
the surface in which the charge is enclosed. Hence, if the surface is send to infinity, where
no more charges can come from the outside, and the charge becomes time-independent.
In this way, conserved currents are linked to symmetries of the theory. This will be
encountered throughout the standard model of particle physics. Another useful feature
of symmetries in quantum-field theories is that they entail relations between different
quantities, so-called Ward-Takahashi identities for global symmetries and Slavnov-Taylor
identities for local (gauge) symmetries. This is a rather technical topic in detail, and will
therefore not be addressed further here.
3
Note that it was throughout tacitly assumed that the current and fields vanish sufficiently fast at
infinity to permit the integral manipulations. In quantum-field theory, this is not obviously true anymore.
But appropriate generalizations hold true in most cases, but not always, and care has to be taken. For
the content of this lecture, this can be assumed to be true.
Chapter 4. Invariances and quantum numbers 35
L
= (ie) (i )
L
= (ie ) 0
L
A = ( ) F = 0,
A
where the transformation function has been dropped, as it is arbitrary, and all identities
have to hold anyway for any arbitrary choice. The product symbolizes the adequate
composition rules for the various tensors involved. Note that the term for the gauge field
vanishes because of the antisymmetry of the field-strength tensor. Thus the current is
j = e , (4.3)
where use has been made of the fact that the fermion fields anticommute, and there is
an implicit 0 in the anti-fermion field. As expected, the current is carried only by the
fermion fields, and the photons do not contribute. It thus reproduces the expected form.
Thus electric charge is conserved in QED, as it is in classical physics. This will also
not be altered when passing to the standard model of particle physics. However, there is a
remarkable feature. Since the gauge-transformation of the photon field (3.2) is independent
of the electric charge, it is in principle possible that the three generations, i. e. electron,
muon, and tauon, could have different and arbitrary electric charges. Furthermore, there
is no reason that the proton has the opposite and positive charge as the electron, as it is
not made of of positrons, as discussed below. Still, all experimental results agree to very
good precision with this equality, especially the latter one. In QED, there is absolutely
no reason for this fact. A discussed later, the standard model of particle physics is only a
consistent quantum-field theory if and only if all the electric charges fulfill certain relations.
However, the fact that a theory only works if certain experimental facts are taken into
account is supporting the theory. But is does not dictate that the experimental facts have
to be this way. If a small deviation of these relation would be observed tomorrow, it is
the theory which has the problem. Therefore, the condition that the electric charges are
as they are for the theory to work is not an explanation of why this has to be. It is an
experimental fact which can be described, but not explained inside the standard model.
36 4.4. Implications of space-time symmetries
Though there are several proposals why this could be the case, especially so-called grand-
unified theories. There is not yet any experimental support for any explanation, and it
remains one of the great mysteries of modern particle physics.
4.5 Parity
Another symmetry, closely related to space-time symmetry, is the discrete parity symme-
try. The discrete transformation parity P is essentially a reflection in the mirror, i. e.
4
In two space-time dimensions this is not the case, and the spin can take on any real number, leading
to so-called anyonic particles. They are of no relevance in the standard model, but play a certain role in
solid-state physics.
Chapter 4. Invariances and quantum numbers 37
~x ~x. Time is unaltered. A distinction between space and time appears here, because
of the signature of the Minkowski metric, which makes time and space (in a loose sense)
distinguishable. Note that applying a parity transformation twice yields again the original
system. In fact, a parity transformation inverts the sign of each vector, e. g., coordinate
r or momenta p
P p = p.
Pseudo-vectors or axial vectors, however, do not change sign under parity transformation.
Such vectors are obtained from vectors, e. g., by forming a cross product. Thus the prime
example of an axial vector are (all kind of) angular momenta
P L = P (r p) = P r P p = r p = L.
As a consequence, fields can either change sign or remain invariant under a parity trans-
formation5 . Hence, a quantity can be assigned a positive or a negative parity, +1 and 1.
Generically, when there are only these two possibilities, the quantities are called even or
odd, respectively, in this case under parity transformations.
The fields describing particles have definite transformation properties under parity.
It is necessary to define the parity of some states, to remove some ambiguities regarding
absolute phases. Thus, the absolute parity of a particle is a question of convention. Usually,
the electrons, and thus also muons and tauons, are assigned positive parity, while the
photon has negative parity. Note that the anti-particles have the opposite parity for
fermions, but the same for bosons. The reason is the different statistics.
In QED, parity is conserved. Classically that can be read off the Lagrangian (3.7),
though there are some subtleties involved concerning fermions when performing the trans-
formation. Furthermore, it it is important to note that also differential operators have
definite transformation properties under parity transformation, and this is the same as
the one of the photon field. As a consequence all terms have a definite positive parity, and
the total Lagrangian has even parity. In principle, this could be changed in the quantiza-
tion procedure, but this does not occur for QED. Later, additional interactions of particle
physics will break this symmetry already classically.
physically equivalent to reversing all speeds and momenta, and as a consequence also
angular momenta: Either objects move backward or the time moves backward. Time-
reversal symmetry is anti-unitary, and therefore involves besides a factor of 1 or -1 also a
complex conjugation of the object it acts upon. Similarly to parity, fields can be classified
as being even or odd under time-reversal. However, because time-reversal is anti-unitary,
there is no quantum number associated with it.
As with parity, time reversal symmetry is respected both classically and quantum-
mechanical by QED, and again later on theories will be encountered which do not respect
it.
4.8 CPT
It is now a very deep result that quantum-field theories on Minkowski space-time have
the following property: A combination of charge parity transformation, parity and time
Chapter 4. Invariances and quantum numbers 39
reversal, i. e. the application of the operator CP T (or any other order, as they commute)
will always be a symmetry of the theory6 . Especially, causality is deeply connected with
this property, and one implies the other.
This implies that if one mirrors physics and exchanges particles by anti-particles, and
runs everything backwards there will not be any difference observed in any physical process.
From this, it can also be shown that necessarily all integer spin particles respect Bose-
Einstein symmetry, while all half-integer spin particles will respect Fermi-Dirac statistics.
The connection is made using special relativity: On the one hand it implies that only
certain spins may exist, and at the same time it implies that certain properties under
the exchange of particles have to be respected. Connecting this leads to the aforemen-
tioned classification. Therefore, in the context of particle physics, integer-spin particles
are synonymously called bosons and half-integer spin particles fermions. Since this is a
particular property of three-(or higher) dimensional quantum-field theory endowed with
special relativity on flat Minkowski space-time, which is the arena of the standard-model,
this association is fixed in particle physics. One must be careful, if one moves to a different
field.
As a consequence of this so-called CPT-theorem the behavior of a state or field under
one of the symmetries is fixed once the other two are given. Conventionally therefore
only the quantum numbers under P and C are recorded for a particle, which leads to the
J P C classification, where J is the total, orbital and intrinsic spin, angular momentum of
a particle, and P and C are only denoted as + or . E. g. a photon has J P C = 1 . The
charge parity quantum number of particles with anti-particles of half-integer spin is not
entirely trivial.
has the symmetry. It can be shown that such an explicit breaking can only be performed
for global symmetries. Any attempt to do this for a local symmetry, though classically
possible, cannot be quantized consistently.
The relevance of explicit symmetry breaking comes about in two special cases. One
is that of soft symmetry breaking. In this case the terms in Lb are such that they are
only taking effect in a certain energy regime, but become negligible in another one. The
most common case will be seen to be mass terms, which break symmetries of most particle
physics theories at low energies, but become irrelevant at large energies. Therefore, if
the theory is probed at sufficiently large energies, the symmetry appears to be effectively
restored.
The second possibility is, if is small. In this case the symmetry may not be exact, but
relations due to the symmetry may still hold true approximately also in the full theory.
In such a case it is said that the symmetry is only weakly broken. This concept plays an
important role later for the strong nuclear interaction.
The next possibility is that a classical symmetry is no longer a symmetry of the quan-
tized theory. Again, this only leads to a consistent quantum theory if the affected sym-
metry is a global one. Such an effect is also called an anomaly. In that case the symmetry
is just not present at the quantum level. There are several examples to be encountered
in particle physics. The simplest one is given by massless QED. The classical theory is
invariant under a scale transformation, i. e. when rescaling all dimensionful quantities by
a fixed number. This symmetry, the so-called dilation symmetry, is broken in the quan-
tization. As a consequence, the theory has an intrinsic mass scale. This mass scale is e.
g. the one encountered in the running of the coupling, see section 3.8. If the dilatation
symmetry would be unbroken, the coupling would be independent of energy. Incidentally,
it can be shown that this is deeply connected to the masslessness of the photon.
If a symmetry exists both classically and survives the quantization process, it may still
be broken spontaneously. It can be proven that this cannot occur in usual particle physics
theories for local symmetries, but only for global symmetries. This will manifest itself in
the possible outcomes of the theory. E. g., if a symmetry predicts that two states should be
the same, they will no longer be after spontaneous symmetry breaking. The spontaneous
magnetization of a magnet below th Curie temperature is an often cited example for this
phenomenon. Again, there is some fine-print in a full quantum-field theory7 , but the
simpler picture will suffice for this lecture.
7
E. g. the total magnetization of a magnet is always zero, because there is no preferred direction in
space-time. Spontaneous magnetization manifests itself rather in the relative magnetization of two neigh-
boring spins in the magnet. These are uncorrelated without spontaneous magnetization, but correlated
in the case of spontaneous magnetization.
Chapter 4. Invariances and quantum numbers 41
A more detailed treatment of what symmetry breaking really is will follow in section
5.7.
4.11 Flavor
QED has, however, not only a single fermion species, but three ones, so-called three flavors.
Multiplying only one of them with a phase still is a symmetry of the Lagrangian. Each
of these three symmetries has its own conserved current. Therefore, the total number
electrons, muons, and tauons are conserved in QED separately, it constitutes a flavor
quantum number. This flavor quantum number is, like electric charge or fermion number,
an internal symmetry, and therefore flavor is an internal quantum number. QED is said to
respect flavor symmetry, where the flavors are uniquely identified by their mass. Formally,
this is a U(1)3 symmetry, i. e. a product of three independent U(1) groups.
If the three flavors would be mass-degenerate, this symmetry would be enlarged. Since
all particles have the same mass and the same electric charge, they become indistinguish-
42 4.12. Chiral symmetry
able, and they can also be exchanged. Thus, it is possible to perform rotations in the
internal flavor space. If the leptons would be described by real fields, this would be just
the usual orthogonal group of three-dimensional rotations, O(3). But because they are
complex, this group is enlarged to SU(3). The total flavor symmetry of mass-degenerate
QED is hence SU(3), or, when supplemented by the additional fermion number symmetry,
SU(3)U(1).
tromagnetic interactions, the symmetry is so baldy broken that it has almost no relevance
for QED. This will change later when discussing the strong nuclear interaction.
4.13 Generators
A final remark on symmetries is the notion of generators, as it appears throughout the
literature. It is a technically very useful observation that for a continuous symmetry, which
changes + , the associated Noether charge Q plays a special role. It can be shown
that
[, Q] ,
where additional constants of proportionality may or may not appear, depending on con-
ventions, and the indicates that this may be a commutator or anti-commutator, de-
pending on whether the involved quantities are bosonic or fermionic, respectively. As a
consequence, the charges Q are also referred to as the generators of the symmetry. In this
sense, the electric charge creates the electromagnetic (gauge) symmetry.
Chapter 5
Strong interactions
It was very early on recognized that QED cannot be used to describe atomic nuclei, since
QED can never sustain a bound state of the positive charge carriers which make up the
nuclei, the so-called protons with their mass of roughly 938 MeV, and electrically neutral
particles, the neutrons, with their just about 1.5 MeV smaller mass. Since also gravity was
not an option, there had to be another force at wok, which is called the strong nuclear force.
This became an ever more pressing issue, as it became experimentally clear that neither
protons nor neutrons can be elementary particles, as they have a finite extension, about 1
fm, and behave in scattering experiments like having a sub-structure. To understand the
details was, however, a rather complicated challenge.
The formulation of the strong interactions in its currently final form of quantum chromo
dynamics (QCD) is, in a sense, the latest addition to the standard model. Because of the
phenomenon of confinement, discussed below in section 5.8, it was much longer formulated
than experimentally established. The reason behind is that the strong nuclear force is, as it
names suggest, strong. It is therefore only in very limited, though relevant, circumstances
possible to use perturbation theory to calculate anything in QCD. This made theoretical
progress slow. The mainstay of QCD calculations today are in the form of numerical
simulations, which for many problems in QCD physics has become the method of choice.
Still, since computational power is limited, though such simulations being among the top
ten contender for computation time in the world, it is not yet possible to calculate complex
objects like even a helium nucleus. Still, for many purposes, many of the more fundamental
features of the theory remain an area of active research today. In this context, QCD and
theories which are (slightly) modified versions of QCD also serve as role models for generic
strongly interacting theories in particle physics.
44
Chapter 5. Strong interactions 45
( 2 + m2 ) = 0, (5.1)
strictly speaking, only valid if the energy scales involved are small compared to the mass.
Furthermore, it does not lead to a stable vacuum state, and therefore it can only describe
physics relative to an (undetermined) vacuum. Both sicknesses are only cured when moving
to quantum field theory, though this is beyond the scope of this lecture.
Nonetheless, the Yukawa potential yields the right idea. Instead of having massless
photons as force particles, massive particles must mediate the strong nuclear force. They
were indeed found in the form of the mesons.
5.2 Mesons
While the protons and neutrons are fermions with spin 1/2, the force carrier of the nuclear
force were identified to be actually bosons. The lightest of them are the pions with quantum
numbers J P = 0 , i. e. they are pseudoscalars. They come as a neutral one, 0 , and two
oppositely charged ones, . The range of the nuclear force is about 1 fm, which indicates
that the mass of the force carrier, according to the Yukawa potential (5.2), should have
a mass around 100-200 MeV. Indeed, the pions are found to have masses of 135.0 and
139.6 MeV for the uncharged and charged ones, respectively, and are thus much lighter
than either protons or neutrons. These pions are not stable, but decay either dominantly
electromagnetically into photons for the neutral one or like the neutron for the charged
ones. Their life-time is of the order of 108 seconds and 1017 seconds for the charged and
uncharged ones, respectively. Therefore the charged ones live long enough to be directly
detectable in experiment.
One of the surprises is that the neutral one decays into two photons, as usually pho-
tons are expected to couple only to electromagnetically charged objects. While this can
be thought of as a neutral pion virtually splitting into two charged pions, and then anni-
hilation under emission of photons, this is somewhat awkward. A more elegant resolution
of this will be given in the quark model below in section 5.4.
With these pions it was possible to describe the overall properties of nucleons, especially
long-range properties. At shorter range and for finer details it turned out that a description
only with pions as force carriers was impossible. This was resolved by the introduction,
and also observation, of further mesons. Especially the vector meson with a mass of 770
MeV, spin one, and a very short life-time of roughly 1024 seconds and the vector meson
with a mass of about 780 MeV, but with a 20 times longer life-time than the , play an
important role. This larger number of mesons is also at the core of apparent three-body
forces observed in nuclear interactions, which are, e. g., necessary to describe deuterium
adequately. In fact, many more mesons have been discovered, and some more will appear
Chapter 5. Strong interactions 47
later.
Describing how these various mesons create the strong nuclear force is in detail very
complicated, but it in principle can be systematically performed, e. g. in the form of the
so-called chiral perturbation theory. This will lead too far astray from particle physics
itself, and will therefore not be detailed here. What is, however, remarkable is that out
of nowhere appear several different mesons, all contributing to the nuclear force, and
actually all of them also affected by the nuclear force. Such a diversity of force carriers is
distinctively different from the case of QED, where only the photon appears.
the different states of an isospin multiplet all have different charge, and that half-integer
or integer isospin corresponds to fermions or bosons, the properties of such a quadruplet
should be predictable. In fact, the relation
Q = I3 + S (5.3)
seems to hold so far, where I3 is the charge of the state, and S its spin1 . For the proton
and neutron, the assignments I3 = 1/2 and I3 = 1/2, respectively, yield the correct
result. For the three pions, the I3 assignments of 1, 0, and 1 reproduce the correct
electric charge for , 0 , and + .
However, this fails for the meson, which has spin one, and is an isospin singlet, but
in contrast to the rule (5.3) is uncharged. This can be remedied by replacing the spin S
by B/2, where B is the so-called baryon number,
B
Q = I3 + (5.4)
2
which is one for the nucleons and zero for the mesons. This is so far a phenomenological
identification, but will become quite relevant in section 5.4. Of course, the anti-particle of
the nucleons, the anti-proton and anti-neutron, carry negative baryon number.
According to this rule, it is possible to attempt to construct a quadruplet, having
four states with I3 = 3/2, 1/2, 1/2, 3/2. To get integer charges, it must then have a
baryon number, like the nucleons. These particles should therefore have electric charge
1, 0, +1, and +2, and corresponding anti-particles. These particles have been observed
experimentally, and again the different states have almost the same mass. They are called
, have masses of about 1232 MeV, and are fermions, as are the nucleons. However,
their spin is 3/2. Since both nucleons and baryons carry baryon number, there are called
commonly baryons, to distinguish them from the mesons. In fact, they are not the only
baryons, and many more have been found experimentally.
Together, mesons and baryons are denoted by hadrons, and are identified as those
particles directly affected by the strong nuclear force.
showed that the proton had a finite size of about 1 fm, and Rutherford-like experiments
found that there are scattering centers inside the proton, which appeared point-like. How-
ever, in contrast to the atoms these constituents were not almost isolated in essentially
free space, but very tightly packed. Furthermore, while the neutron is electrically neutral,
it was found to have a magnetic dipole moment, a feature beforehand believed to be only
existing if there is an electrically charged substructure present.
This evidence together suggested that the elementary particle zoo could possibly be
obtained from simpler constituent and put into a scheme like the periodic table of chemical
elements, which originates from just three different particles.
Playing around with quantum numbers showed a number of regular features of the
hadrons. This gave rise to the quark model, where in the beginning two quarks were
needed to explain the regularities observed, the up quark and down quark, abbreviated
by u and d, as well as their anti-particles. Since both the bosonic mesons and fermionic
hadrons must be constructed from them, it requires them to be fermions themselves.
Since all of the hadrons have an extension, none of them can be identified with a single
quark, just like the periodic table does not contain a single proton, neutron or electron.
However, in contrast to the latter no free quarks are observed in nature, a phenomenon to
be discussed below in section 5.8.
The simplest possibility to construct then a hadron would be from two quarks. This
must be a boson, as two times a half-integer spin can only be coupled to an integer spin, and
therefore a meson. Since no free quarks are seen, the nucleons must contain at least three
quarks to get a half-integer spin. These considerations turn out to be correct. However,
they lead to the conclusion that quarks cannot have integer electric charges. This is most
easily seen by looking at the nucleons.
Scattering experiments identified that the nucleons have no uniform sub-structure, but
have a two-one structure, that is two quarks of one type and one quark of the other type.
Since it is found that the down quark is heavier than the up quark, the heavier one, i.
e. the neutron, should have two down quarks. This yields a composition of uud for the
proton and udd for the neutron. The only solution for the observed electric charges of
the proton and the neutron are then an assignment of 2/3 of the (absolute value of the)
electron charge for the up quark, and 1/3 of the (absolute value of the) electron charge
for the down quark. This consistently yields the required positive and neutral proton and
neutron charges. This also explains the magnetic dipole moment of the neutron. At the
same time, the baryon number of quarks must be 1/3 for up quark and down quark. This
implies also that the isospin of the up quark is +1/2 and that of the down quark is 1/2.
The pions are then constructed as a combination of a quark and an anti-quark, ud for
50 5.5. Color and gluons
,
0
= cos |uui + sin dd
the attempt to make the proton and neutron color-neutral simultaneously. It is therefore
necessary to depart from the simple structure of the electromagnetic charge.
As a consequence, it is assumed that there are three different charges, suggestively
called red, green (or sometimes, especially in older literature, yellow), and blue. It is
furthermore assumed that not only a color and the corresponding anti-color is neutral,
but also a set of each of the colors is neutral. Then there are three quarks for each flavor:
red, green, and blue up quarks, and red, green, and blue down quarks, totaling six quarks.
A color-neutral baryon is then containing a quark of each color, e. g. a proton contains
a red and a blue up quark, and a green down quark. In fact, since the total charge of a
proton is zero, it is a mixture of any possible combination of color assignments to each
three quarks, which are consistent with neutrality and the Pauli principle. Similar, the
++ now consists of a red up quark, a green up quark, and a blue up quark.
This construction is rather strange at first sight, but it can formulated in a mathemat-
ically well-defined way. This will be done below in section 5.16, after collecting the other
ingredients of the strong interactions.
One other important ingredient, now that there is a new charge, is what mediates
the force between the charges. In electromagnetism it was the massless photons. It is
therefore reasonable to assume that there is also a mediator of the force between color
charges. These were indeed found, and named gluons. As the photons these are massless2
bosons with spin one. However, they differ from photons in a very important property.
While photons are only mediating electromagnetic force, they are not themselves affected
by it, since they carry no electric charge. But gluons carry color charge. In fact there are
8 different charges3 carried by gluons, and none of these eight are either the quark charges,
nor is their any simple relation to the quark charges. Especially, it is impossible to add a
single quark charge with any combination of the gluon charges to obtain a neutral object.
To achieve this, at least two quarks have to be added to one or more gluons.
Nonetheless, the idea of gluons has been experimentally verified, and they have been
identified as the carrier of the strong interaction, binding quarks into color-neutral hadrons.
The exchange of mesons to bind nucleons into nuclei can be viewed as a high-order effect
of the gluon interaction. This is similar to Van-der-Waals force, though the details are
different, as here not a color dipole moment enters, and the details are not yet fully
resolved. Still, the effect can be traced back to the gluons.
2
Again, the notion of mass is murky here, and the precise details are only becoming clear in a full
non-perturbative treatment of the corresponding quantum field theory. However, for most purposes the
notion of massless is sufficient.
3
One can think of four of them as particles and four as anti-particles, though this is not really how it
works, see section 5.16.
52 5.6. Chiral symmetry breaking
Hence, the combination of quarks, gluons, and colors, can explain the structure of
all known hadrons, similar to the periodic table. Unfortunately, the strong force binding
quarks by gluon exchange is not accessible using perturbation theory, at least when it
comes to describing hadrons. Its treatment is therefore highly non-trivial. Because of the
color, this underlying theory of hadrons is called chromodynamics, it quantum version
quantum chromodynamics, or QCD for brief.
metry. Experiments show that at sufficiently high energies, many times the hadronic
scales, the quarks again behave as if they have only their current masses. Thus, the break-
ing must be spontaneous. This will be confirmed when writing down the field theory of
QCD in section 5.16. Hence, the strong interaction break chiral symmetry spontaneously.
This proceeds in the following way. The gluon interaction create a strong binding of the
quarks, which creates a so-called vacuum condensate huui + hddi 5. Since the condensate
is made from the quarks, any other quark will interact with it. Since the strong force is
attractive, otherwise protons, neutrons, or nuclei would not exist, this will slow down any
movement of a quark. Thus, quarks gain in this process inertial mass6 , and thus effectively
a larger mass. This additional mass is the same for up quarks and down quarks, and
approximately 300 MeV. This explains how the heavier mesons and the baryons gain a
mass of this size. It does not yet explain two other facts: The lightness of the pions nor
why this is no longer the case at high energies. These two questions will be answered in
the next two sections.
Before this, some remarks should be added. It is easy to wonder how the universe
should be filled with such a condensate, but no effect appears to be visible. Here, one
notes how everyday experience can be deceiving. Of course, the effect is visible, since
without it, all nucleons would be very much lighter, and so would be we, since almost all
our mass is due to nucleons. Thus, whenever we feel our mass, we feel the consequences of
this condensate. In fact, this condensate is responsible for essentially 99% of the mass of
everything we can see in the universe, i. e. suns, planets, asteroids, and gas, the so-called
luminous mass of the universe.
where U describes the potential for the field, which can only come from interactions
between the two fields. The simplest (and in known particle physics so far exclusively
appearing) possibility to obtain a symmetry for this theory is that the potential depends
only the product .
To obtain the simplest example for a symmetry, take the potential
2 v 2 2 + 2
U( ) = + 2 ( )2 .
4 2 4v
The pre-factors, i. e. coupling constants, as well as the irrelevant constant term have been
chosen judiciously such that the result will be looking simple. This potential, as well as
the kinetic term, is invariant under the phase rotation exp(i). Therefore, this
theory has a global (U(1)) phase symmetry. It is a bit odd theory, as the quadratic term is
usually associated with a mass. This can be seen when determining the classical equation
of motion, which read
L L 2 2 2 2
0 = = + . (5.6)
2 2v 2
The equation of motion for the field is, up to conjugation, identical. Ignoring the
interaction term, this is just again the Klein-Gordon equation (5.1), which describes the
movement of a free scalar particle, if the sign of the second term would be positive. In
this case, it looks like the particle has an imaginary mass, what would be called a tachyon.
This is, however, not a problem. The sign of the quartic term is positive. Therefore, the
energy is bounded from below classically, and the theory remains stable. It is therefore
just an odd term in the potential energy.
To proceed, an interesting question is what the classical lowest energy state is. Since the
kinetic term is positive, any spatial or temporal variation would increase the total energy.
Hence, the state of lowest energy is necessarily a field 0 constant throughout space and
time, which minimizes the potential (5.6). This constant is found to be 0 = vei , where
is an arbitrary phase, i. e. all values of are a minimum. Thus, the solution manifold is
highly degenerate. This is a consequence of the symmetry: Any change in can be offset
by a symmetry transformation, without changing the physics.
One can proceed by specifying further. However, any choice is physically indistinct,
and therefore arbitrary. For further calculations keeping manifest is awkward, and
therefore in the following the explicit choice = 0 is made. Since the choice is arbitrary,
the symmetry is not violated. However, the symmetry is no longer manifest either. One
therefore speaks of hiding the symmetry, or a hidden symmetry.
To make the situation more transparent, the next step is to shift the field by its
Chapter 5. Strong interactions 55
making now the complex structure manifest with the two real fields and . In this way,
fluctuations of the fields around the classical vacuum can be studied. Inserting this into
the Lagrangian (5.5) yields
2 3 2 2 2 2 2
L = + 2 2 + + + 2 2 + 2 4 + 2 4 . (5.8)
v v 2v 4v 4v
This Lagrangian shows now a number of very interesting features, which are very generic.
The first is that the two fields and behave differently. While there is a mass term,
now with the correct sign, for , giving it a mass of , there is no mass for . Pictorially,
one can think of as excitations which describe fluctuations out of the minimum, while
, which is orthogonal to the direction of the chosen vacuum, moves between the different
minima of the potential Since the vacua all have the same energy, this does not cost any
energy, and therefore the mode is massless. This is a generic feature of such situations,
and is known as Goldstones theorem. In a nutshell, it is the statement that there as many
massless particles as there are directions in which the minima are equivalent8 .
The second is that there are now many different interactions between the fields
and . However their couplings, i. e. their pre-factors, are not all different, but completely
determined by the original parameters. The reason is that the symmetry is just hidden. To
ensure that any symmetry transformation is still valid requires that the various interactions
cannot have arbitrary pre-factors, because otherwise it would no longer be invariant under
the symmetry transformation
where it should be noted that the vacuum solution v is also transformed accordingly. For
that to work out, it is necessary to keep track when changing from (5.5) to (5.8), which
occurrence of v stem from the original coupling constants in (5.5), and which from the
shift (5.7), since only the latter are affected.
If at any point a term is added to the Lagrangian, which violates the symmetry, the
symmetry becomes explicitly broken. The most obvious way is to add a mass term for
the field to the Lagrangian (5.8). Then, the Lagrangian is no longer invariant under the
symmetry transformation (5.9). Of course, this can be translated back into the original
8
The precise formulation is that there are as many massless particles as there are generators of the
symmetry group minus the number of generators of symmetry transformations after all possible remaining
choices have been made, here this is one minus zero, and thus one massless particles.
56 5.7. The Goldstone theorem
Lagrangian (5.5), where it takes the form of an additional quadratic term in the potential
(5.6) of type ()2 . The effect is essentially that the potential is tilted, and the vacuum
state has now a unique solution v, which has no longer an invariance. This gives also
a physical explanation for the mass: Since there are no degenerate vacuum solutions
anymore, any movement increases the energy.
This now illustrates how pions gain their small mass. The breaking of chiral symmetry
would lead to a number of Goldstone bosons. In case of two quark flavors9 , there will
be three Goldstone bosons, which are the three pions. These would be massless, if the
quarks would be massless. However, because of the small current mass of the quarks, the
symmetry is not exact, but rather explicitly broken. This gives the mass to the pions.
That the masses of the pions are still large compared to current masses of the quarks is a
dynamical effect. Approximately, the pion masses scale linearly with the current masses
of the quarks, the so-called Gell-Mann-Oaks-Renner relation, but the pre-factor is large.
Before continuing on, a few words about subtleties and semantics must be said. When
going to the quantum theory, quantum effects do what they always do, and they will mix
all the degenerate vacuum states, and no vacuum will be preferred. Therefore, the vacuum
state will exhibit a perfect symmetry, in contrast to the classical case. But the quantum
system also carries the seed of the classical physics within, as it is metastable. If any
arbitrarily small external perturbation, e. g. an infinitesimal mass for the from some
other physics process, arises, it will immediately have a unique vacuum state, in which
the symmetry is no longer realized. Hence, though strictly speaking the system without
external influence is perfectly symmetric, the presence of this metastability has lead to the
expression that the symmetry is nonetheless spontaneously broken.
In fact, even though the symmetry is exact, a full non-perturbative calculation shows
that the system has both an ordinary massive and a massless excitation, and hence the
most pertinent feature of the Goldstone theorem are realized even with the symmetry
present. Of course, in a perturbative calculation this will not show. Since perturbation
theory only permits very small deviations from the vacuum state, the particles will still
appear to be tachyons, as the relevant Lagrangian is (5.5). To cure this problem, one
can introduce a weak external perturbation to the theory, which prefers a single vacuum,
perform perturbation theory around this vacuum, i. e. using the Lagrangian (5.8) instead,
and remove the external perturbation at the end10 . Then, also in perturbation theory
the system exhibits a massive and a massless particle. Especially because of this trick,
which is extremely useful in the standard model, the exact notion of hidden symmetry is
9
It can be shown that the number of colors does not matter.
10
There are some subtleties associated with non-analyticities in this case, but in most cases they can
be dealt with.
Chapter 5. Strong interactions 57
nowadays very rarely used, and almost always the situation will be denoted by spontaneous
symmetry breaking.
4 s
V = + r, (5.10)
3 r
i. e. it has two components. There is a short-range Coulomb-like interaction, with the
strong fine-structure constant s = gs2/(4) defined by the strong coupling constant, sim-
ilar to the electromagnetic force. But there is a second component, which rises linearly
with distances, with a constant of proportionality . This latter contribution is of purely
non-perturbative origin, which also made it very hard to discover its theoretical origin.
That we do need such a potential for the strong interaction is an experimental fact. Only
that s and k are not unrelated can be understood so far, but why this is necessary is just
an observation.
This now explains why quarks appear to be lighter at high energies and thus shorter
distances. At long distances, the interaction is very strong, which also manifests itself
in chiral symmetry breaking and the condensation of quarks There the quarks are thus
behave as described, and are quite heavy. However, this interaction becomes weaker at
higher energies, i. e. quarks moving through the quark condensate at high enough energies
will no longer be slowed down by it. The quarks will thus behave as if the only have their
11
Note for the following that the notion of potential has no real meaning anymore when particles can
be created or destroyed, and it is necessary to pass to quantum field theory for a more precise description.
But for the present purpose the analogy is sufficiently good as to give a qualitative correct picture, and is
even quantitatively surprisingly good.
58 5.9. Hadronic resonances
current mass. This explains the difference of both masses. This effect that the strong
interaction becomes weaker at higher energies is called asymptotic freedom. Of course, the
remaining force is still as strong as the electromagnetic one, due to the classical Coulomb
potential in (5.10), but this is much weaker than at long distances, and therefore effectively
free. Moreover, quantum corrections reduce this interaction further: The running strong
coupling decreases at high energies, because the leading coefficient of the -function 0
in (3.9) is negative, with a scale of roughly 1 GeV. Hence, at a very high energy, the
electromagnetic force becomes actually stronger than the strong one, above roughly 1015
GeV.
The potential (5.10) also explains why no individual quarks are seen, at least to some
extent. Since the potential energy between two quarks rises indeterminate when increasing
the distance, one needs enormous amounts of energy when trying to separate them, much
more than available for any earthbound experiment for any experimentally resolvable
distance. This energy is stored in the gluonic field between the two quarks. Investigations
have shown that this field is highly collimated between the two quarks, it is like a string
connecting them both. Since the tension of this string is characterized to lowest order
by the parameter in (5.10), this parameter is therefore also called string tension. This
explains12 why colored quarks are not observed individually, but are confined in hadrons,
and the phenomenon is denoted as confinement.
However, in practical cases the energy limits of experiments is not a real concern. Long
before the relevant energy limit is reached, there is so much energy stored in the string
between the two quarks that it becomes energetically favored to convert it into hadrons.
Hence, when trying to separate the two quarks inside, e. g., a meson by pumping energy
into it, at some point it will just split into two mesons. This is called string breaking
or screening, and is a 100% efficient mechanism, i. e. there is no remaining quark. The
reason is that color, like electric charge, is conserved. Hence, overall the system must
remain color-neutral. Thus, there would be at least two colored objects left over, which
would again create immediately a string. This is also the mechanism why gluons are not
observed. When trying to separate gluons from a hadron, again a string is created, which
breaks and yields just additional hadrons.
E. g. the first excited state of the nucleon is the N(1440), with a mass of 1440 MeV,
as the name indicates. In total, about 28, experimentally more or less well established,
resonances have been observed for the nucleons, and likewise for the other hadrons.
An interesting observation, which can be understood on basis of the string picture
discussed in the previous section, is the presence of so-called Regge trajectories. A Regge
trajectory is that the mass squared of the resonances are roughly proportional to its total
angular momentum, i. e. the combination of spin and orbital angular momentum. This
has been experimentally observed before the inception of QCD, and has led to the first
formulation of a string theory as a theory of the hadronic interactions. Due to consistency
problems, and the lack of ability to describe other experimental observations, this approach
of a quantized string has been dropped later in favor of QCD.
All hadronic resonances are unstable, and decay in various ways. Most quickly are those
where the decay proceeds by splitting into hadrons, which usually involves the decay into
a lower resonance and some pions. E. g. the N(1440) decays in about 50-75% into a proton
or neutron and a pion, with a decay time of roughly (200 MeV)1 . Some resonances can
kinematically not decay into two hadrons, and they de-excite by emission of photons or
other particles. This happens on a much longer time scale. Such states are often considered
as excited states in contrast to the resonances. These are under the decay threshold into
purely hadronic final states. The much longer decay time is of course due to the fact
that the strong force is, indeed, the strongest one, and all processes subjected to it occur
very quickly. The only exceptions are those processes close to, but above, the hadronic
decay threshold. There kinematic reasons, the absence of a large phase space, statistically
suppress decays13 .
Th study of the decays of hadrons has been very useful in the understanding of the
strong force. As will be seen later in section 5.12, today decays of hadronic resonances
have also become an important window into the study of other processes.
are not yet any unequivocally observed tetraquarks or pentaquarks. The reason for this
are due to two effects.
The first effect is mixing. E. g. for a tetra-quark, it is almost always possible to
construct a meson with the same quantum numbers, i. e. the same spin, parity, charge-
parity and electric charge. There is also the possibility to construct equally well a dimeson
molecule. One of the most infamous examples is the meson14 . It is a light neutral meson,
with quantum numbers compatible with the states uu, dd, uudd, uuuu, dd
dd,
+ , and
0 0 . Since it is a quantum state, it follows the quantum rules, which in particular
imply that all states with the same quantum numbers mix. It is therefore a superposition
of all these seven states. The question which of these states contribute most is highly
non-trivial. It can, in principle, be experimentally measured or theoretical calculated.
There is no really reliable way of estimating it. The results found so far indicate that the
combination of two pions is most dominant, it is therefore likely a molecule. For most
other states the two-quark components appears to be the dominant one. Similarly, almost
all baryons turn out to be completely dominated by the three-quark state. There are only
few cases left, where there is still some doubt.
The second possibility is to investigate one of those possibilities where the quantum
numbers of the tetraquarks cannot be created using a two-quark system. Such cases are
rare, but they exist. In principle, it would therefore be sufficient to just observe such a
state. Unfortunately, almost all of these states are highly unstable. They are therefore
experimentally hard to observe, and only very few candidates have been found so far,
though recently a few candidates have been identified, to be discussed later in section
5.12.
This problem becomes more complicated due to the gluons. Though it is not possible
to create a color-neutral state from a quark and a gluon, it is possible to combine a quark,
an anti-quark and one or more gluons to obtain a colorless state. It is similarly possible to
combine three quarks and a number of gluons to obtain a colorless state. Such states are
called hybrids. However, the gluons can at most add angular momentum, but no other
charges to the state. Therefore, there is always a state with the same quantum numbers,
but just made from quarks. Since adding a particle or orbital angular momentum to a
state usually increases its mass, these states are unstable against the decay to a state
with just the minimum number of quarks. Though these hybrids are therefore formally
admixtures to any state, it is essentially always a small one, and therefore hybrids are very
hard to identify both experimentally and theoretically.
The last class of states which can come into the mix are bosonic glueballs, which
14
The official name is f0 (500), though the historical name of meson still is commonly used.
Chapter 5. Strong interactions 61
combine only gluons to a colorless objects. The usual counting rules of the quark model
do not apply to them, but as a rough estimate even the simplest state is made out of four
gluons. Such states carry no electric charge, and most of them have the same J P C quantum
numbers as mesons, and therefore mix. However, there are some candidates, particularly
the so-called f(x) mesons, with x around 1500 MeV, which appear to have a large admixture
from glueballs. This is experimentally identified by the possible decays. Since gluons are,
in contrast to quarks, electrically neutral, decays into electrically neutral decay products,
except for photons, should be favored if there is a large glueball contribution in the state.
This has been observed, especially when comparing the partial decay widths of decays
to uncharged particles to the one to photons. However, even at best these state ar only
partially glueballs.
There are some glueball states which have quantum numbers which cannot be formed
by only quarks, at least not in the simple quark model. Unfortunately, all theoretical
estimates place theses states at masses of 2.5-3 GeV, and therefore far above any hadronic
decay threshold. They are therefore highly unstable, and decay quickly. Hence, there is
not yet any experimental evidence for them, though new dedicated searches are ongoing
or are prepared.
then that the kaons, with only 500 MeV mass, are also would-be Goldstone mesons,
obtaining their mass from the with the strange quark enlarged chiral symmetry breaking.
Since the strange quarks current mass is already not too small compared to usual hadronic
scales, the kaons are much closer to their expected mass of about 700 MeV in the quark
model obtained from a construction of a u or a d quark and one of the new s quarks.
Indeed, the s quark has an electric charge of 1/3, just like the d quark. The charged
kaons are therefore the combinations us, us, and the neutral ones ds and sd, explaining
their small mass difference, and their multiplicity. The Goldstone theorem actually pre-
dicts that there should be 8 Goldstone bosons. These are six. The other two are the
and mesons, which are made from ss combinations, and some admixtures from neutral
combinations of u and d quarks. They are therefore even heavier, the having a mass
of 550 MeV. Somewhat peculiar, the mass of the is much higher, about 960 MeV. The
reason is that the also receives mass from another source, the so-called axial anomaly.
The latter will be discussed below in section 6.11.
Besides the broken chiral symmetry, there is also the now enlarged flavor symmetry.
Since it involves up, down, and strange quarks, it is an SU(3) group. Since the quarks have
different masses, the group is not unbroken, but reduced to three counting symmetries,
i. e. U(1)3 . Hence, the individual quark flavors are conserved, but bound states with
differing quark content have differing masses. This conservation of quark flavor by the
strong interaction is also at the origin of the name strangeness. When the kaons were
discovered, the quark model was yet to be established. The kaons, and also baryons,
called hyperons, with a single or more strange quarks included, showed a different decay
patterns than ordinary hadrons, due to the conservation of strangeness. Thus, they did
not fit into the scheme, and were therefore considered strange.
The presence of the strange quark, which has an effective mass of about 400 MeV, and
thus close to the masses of the up and down quarks of 300 MeV, adds many more possible
combinations to the quark model, which all have very similar masses. Thus, there is a very
large number of hadrons with masses between 500 MeV, and roughly 3000 MeV, where the
states become too unstable to be still considered as real particles. In fact, the number of
states N turns out to rise exponentially with mass, N exp(M/TH ). This is a so-called
Hagedorn spectrum, where TH is called the Hagedorn temperature. The reason for the
name is that naively a system with such a spectrum has the property that at a finite
temperature, the Hagedorn temperature TH , an infinite number of particles is created,
and thus the system cannot be heated beyond this point. For the strong interactions,
the Hagedorn temperature is about 160 MeV. Of course, it is in practice possible to go
beyond this temperature. What happens is that at this point the quark substructure can
Chapter 5. Strong interactions 63
no longer be ignored, and this effect limits the number of states growth. This has originally
lead to the idea that at this temperature a phase transition has to occur, which signals
the change from a hadronic system to one were the quark substructure becomes more
important. However, the most reliable results to date rather indicate that it is a cross-
over. This transition plays nonetheless a role in the development of the early universe,
though a rather small one. This will be discussed in detail in section 5.18
The experimental and theoretical study of this transition is an important part of mod-
ern particle physics. Nowadays it is not restricted to the question of large temperatures,
but also small temperature and large densities. This is the situation which is encountered
during supernovas and in the interior of neutron stars and during neutron star mergers.
At theses densities, which become larger than the density of nuclei, nucleons can no longer
exist without starting to overlap with each other. Similarly to the high-temperature case,
such an overlap of nucleons is expected to change the properties of matter dramatically,
as the quark substructure can no longer be ignored. However, much less is known about
this situation, as it is both experimentally and theoretically much more demanding to
investigate. Some hope is provided by the possibility to use gravitational waves to probe
the interior of neutron stars during neutron star mergers, but this will crucially depend
on the developments in the next few years, whether this turns out to be correct. One of
the most interesting questions is, whether strange quarks play a role for neutron stars.
Another aspect were strange quarks play an important role are the quests for tetraquarks
and pentaquarks. Since strangeness is a conserved quantum number of the electromagnetic
and strong interaction, it is possible to construct states which do not have the quantum
numbers of an ordinary state, e. g. a meson with total strangeness of 2, which therefore
must contain two strange quarks, and cannot be a simple quark-anti-quark state, or a
baryon with strangeness -1, due to a single anti-strange quark, which therefore must be a
tetra-quark. Searches for such signatures are intensively pursued.
while the strange quark has a rather similar mass, despite its larger current mass, as the
light quarks, the charm quark has not. Its current mass is 1275 MeV, and thus similar
to its effective mass of roughly 1600 MeV. As a consequence, hadrons involving a charm
quark are much heavier than hadrons containing only the lighter quarks. Furthermore,
though chiral symmetry is even more enlarged, its so badly explicitly broken that would-
be Goldstone bosons involving charm quarks have essentially the same mass as without
the breaking of chiral symmetry. The same is true for the flavor symmetry, and the only
remnant is that charm is again a conserved quantum number in both the electromagnetic
and strong interaction.
This conservation of charm has very interesting consequences. Of course there are
hadrons were only one of the quarks is a charm quark, which are called open charm
hadrons. The best known ones are the D mesons, with masses of about 1870 MeV mass
and having the structure of a single charm quark and either an up or down quark. These
are the lightest particles with a charm quark.
But there are also particles, especially mesons, which consists only of charm quarks. In
the meson case, where the total charm is zero if they consist out of a charm and an anti-
charm quark, these are called hidden charm. The latter states are particularly interesting,
because they show a very interesting mass spectrum. In fact, the lightest cc states have
a mass which is just below threshold for the decay into two hadrons with a charm quark
and an anti-charm quark each, the DD threshold. They can therefore not decay directly.
Of course, the charm and anti-charm quarks can annihilate. But because of how quark
and gluon color charges are arranged, such a process is substantial suppressed in QCD
compared to the decay with a production of an additional quark-anti-quark pair. Hence,
decays occur very slowly. Hence, these hadrons are extremely stable compared to hadrons
made from lighter hadrons, where the pions offer a simpler decay channel. Thus, these
meta-stable state carry the name of charmonia, which have masses of about 3 GeV, but
decay widths of around a few 100 keV.
Because of this fact, the charmonia states turn out to present a very good realization of
the possible states permitted by the potential (5.10). Similarly to the hydrogen atom, this
potential creates states distinguished by a main quantum number and orbital quantum
numbers. The most well-known state is the J/, at about 3097 MeV with a decay width
of 93 keV, which is a state with one unit of angular momentum. However, the ground
state of the system is the c , with a mass of 2984 MeV and a decay width of 320 keV.
That the ground state decays quicker is mainly due to kinematic effects from the angular
momentum. Simply put, the ground state is in an s-wave, and thus the wave-functions of
the two charm quarks have a large overlap. Thus an annihilation into photons is much
Chapter 5. Strong interactions 65
more likely than in the case with angular momentum, where the overlap of the wave
functions is much smaller. Right now about 8 states are known, which are below the DD
threshold, the heaviest the so-called (2S), with a mass of 3690 MeV and a decay width
of 303 keV.
These charmonia have been very instrumental in understanding the potential (5.10),
and thus the strong interactions. The very well-defined spectrum, which provides the
opportunity of a true spectroscopy, including many angular momentum states, permits a
much cleaner study than in case of the light hadrons, where the ubiquitous decays into
pseudo-Goldstone bosons make resonances decay very quickly.
However, not all of the states in this spectrum are easily explained within the framework
of the quark model and the potential (5.10). These are the so-called X, Y, and Z mesons,
with masses above the DD threshold, and some also with open charm. These states do
not fit into the spectroscopic scheme, and especially some may have quantum numbers,
which are not in agreement with a simple quark-anti-quark system. This is still under
experimental and theoretical investigation. However, it already shows that the simple
quark model is not also able to explain all hadrons.
With the charm quark, it may appear that everything is complete and symmetric.
However, nature did not decide to stop at the charm quark, but added another quark:
The bottom (or in older text beauty) quark. It is a heavier copy of the down quark, and
has therefore an electric charge of -1/3. Its mass is about three times that of the charm
quark, with a current mass of 4.2 GeV. It therefore introduces another quark flavor, but like
for the charm quark, this does not play any role for dynamical chiral symmetric breaking,
as the explicit breaking is far too large.
Other than the mass and the electric charge, the bottom quark behaves essentially as
the charm quarks. Especially, there is a rich spectroscopy with open and hidden beauty17 ,
the latter also called bottonium in analogy to charmonium. Similar to the case for the
charm quark, the lightest meson with open beauty is rather heavy, B and B 0 being 5.3
GeV. As a consequence, the bottonium spectrum has a large number of quasi-stable states,
the lightest being the b with a mass of 9.4 GeV and a decay width of roughly 10 MeV, the
playing the role of the J/ with a mass of 9.5 GeV and a width of 54 keV, and then even
15 states to the heaviest b (3P ) with 10.5 GeV observed so far. There are also heavier
states, including bottom versions of the X, Y, and Z mesons, which do not fit easily into
a simple quark model explanation. Thus, an even richer spectroscopy is possible, though
the production of bottonia in so- called beauty farms, requires more resources than for the
charmonia.
17
For the flavor quantum number the term beauty still survives.
66 5.13. Top
Of course, for both bottom quarks and charm quarks there exist also baryons, with
one or more of these quarks, also with both charm and bottom quarks. These are rather
complicated to produce, but have been observed, though baryons with multiple charm or
bottom quarks only very recently. These baryons are not as stable as the mesons, but are
still sufficiently stable that they production and decay take place so far apart from each
other, a few m, that both processes can be experimentally resolved. They play therefore
an important role to identify the production of charm and especially bottom quarks in
high-energy collisions (so called c- and b-tagging).
Together, charmonia and bottonia are usually referred to as quarkonia. Studying theses
states are also interesting for other reasons than to understand QCD. Because the J/ and
are very long-lived, they are very well suited for precision measurements. Furthermore,
as will become evident in section 6.5, it appears plausible that new phenomena will be
influencing heavy quarks stronger than light quarks. Searching therefore for deviations of
the decays of quarkonia, especially bottomonia, from the standard-model predictions has
become an important branch in experimental physics, which is also called flavor physics.
5.13 Top
With the introduction of the bottom quark the situation appears again as unbalanced as
with the introduction of the strange quark. This is indeed true, and the picture extended
by the top (or in old texts truth) quark. This is the last quark, which so far has been
found, though there is no a-priori reason to expect that there may not be further quarks,
and searching for them is a very active field in experimental physics.
The top quark is a heavy version of the up quark, and thus has an electric charge of
2/3. The fact which is really uprising about it, is its mass of 173 GeV. It is therefore forty
times heavier than the bottom quark, and this is the largest jump in masses in the quark
mass hierarchy. It is an interesting side remark, and a triumph of theory, that this mass
has been established within 10 GeV before the direct observation of the top quark, by
measuring other processes sensitive to the top quark very precisely, and then using theory
to make a prediction.
The enormous mass of the top quark makes it the heaviest particle detected so far. Due
to its large mass, it decays much quicker than the lighter quarks, with a decay width of 2
GeV. This is a consequence of effects to be introduced in section 6. Hence, the formation
even of short-lived hadrons with a top quark is almost impossible, and no hadron with a
top quark has so far been directly observed. Whether a quasi-stable toponium is possible
is not clear, but so far there is no experimental evidence for it. But, due to the mass,
Chapter 5. Strong interactions 67
it is also not trivial to produce large numbers of top quarks, and thus the study of top
quarks is rather complicated. Hence, the top quark is so far more of interest for its own
properties, particularly its mass, rather than for its relation to QCD.
collision, the particles start to move apart, energy must be invested into the string, and it
breaks eventually. This produces a spray of hadrons. It is said that the partons hadronize.
In the rest frame of the produced quark or gluon, this hadronization will be isotropic,
if sufficiently many quarks and gluons are present that the color charge is roughly isotrop-
ically distributed. That is a rather good approximation, due to the light gluons, for most
cases. But in high-energy collisions the quarks and gluons are usually produced at high
energies. Hence, in the laboratory frame the produced hadrons will be highly collimated
along the direction of the original quark and gluon. Such a collimated spray of hadrons
originating from a single quark or gluon is called a jet.
This jet, in principle, carries all the information of the original quark and gluon, like
electric charge, momentum, invariant mass, and flavor. In practice, some particles may
still be not emitted very close to the original direction, i. e. not in the jet cone, and there
may be other jets in an event. Thus, such information is not always easy to reconstruct.
Still, by identifying such a jet, especially if it carries very much energy, it is possible to
infer information about the original quark or gluon. E. g. top quarks are identified by the
total invariant mass of the jet.
A further information can be obtained from which hadrons are present in the jet. Since
quarks and gluons have different flavors, masses, spins, and electric charge, they produce
different hadrons. The mathematical description of the probability for a hadron of type
i to emerge from a quark or gluon of type j is called a fragmentation function. Such
fragmentation functions are very hard to calculate, but can be measured experimentally,
to some extent.
18
This is somewhat symbolic, and the indices i, j, and f include many further properties, like spins,
polarizations etc.
Chapter 5. Strong interactions 69
Hence, the actual cross-section gets contribution from all possible parton combinations
inside the projectile hadrons.
The calculation of the parton distribution functions, or PDFs for short, is highly com-
plicated. It is, especially for small values of x a non-perturbative question, and has not yet
been performed satisfactorily. Hence, they are usually parametrized using experimental
input, where theoretical considerations determine the parametrization. The experimental
results are most easily obtained when instead of two hadrons electrons and hadrons are
collided, as has been done at DESY in Hamburg with electrons on protons.
The importance of the PDFs cannot be underestimated, since they vary substantially
with x. Especially, except for the valence quark flavors they drop very quickly with x.
As a consequence, it is rather unlikely to find a parton with large energy fraction inside
a hadron. Since the cross-section for particle production are usually peaked around the
mass of the particle to be produced, see equation (2.4), the effectiveness of producing
new particles drops quicker with the energy than would be expected from just the energy
dependency of the elementary cross section. Hence, the parton luminosities, i. e. the
luminosity folded with the PDFs, is the actual figure of merit for a hadron collider. On
the other hand, this means that increasing the energy for a hadron collider can increase
the effectiveness faster than the naive energy scaling of the elementary cross-section.
PDFs have also a theoretical importance. Because of asymptotic freedom, it is often
possible to obtain perturbatively a rather good approximation to the elementary cross-
section ijf . The involved PDFs are then known experimental input, and the interesting
process can be obtained directly. That is the standard procedure used for most calculations
in high-energy experiments. The assumption is that (5.11) is valid, i. e. that the process
can be factorized into the PDFs and the elementary interaction. Because of asymptotic
freedom, this is a reasonable assumption so far as the fractions x and y yield energy
fractions for the corresponding partons much larger than 1 GeV. Since an integration is
performed over all x and y, however, this will only work if the product of the PDFs and
the cross-section becomes small at small x and y. For many cases of practical relevance,
this is still a good approximation. Otherwise, so-called factorization violations are found,
and this approach breaks down.
Formally, due to the presence of virtual particles, any elementary particle has to be
described in terms of a PDF. E. g., an electron is surrounded by virtual photons. There is
hence a PDF describing the probability to observe a photon with an energy fraction x of
the total electron momentum inside an electron, Pe (x). Of course, also quarks or gluons
can be found in this virtual cloud. However, since the electron is not strongly interacting,
the PDFs for non-partons can be often calculated to a reasonable accuracy perturbatively.
70 5.14. Partons and scattering experiments
Still, since especially at low energies also hadrons can be encountered inside an electron
or a photon, this is not completely possible. This is not an esoteric effect. E. g. in the
interaction of photons with nuclei the PDF P (x) describing the mutability of protons
into mesons plays an important, and experimentally verified, role.
with the anti-symmetric structure constants f abc . Noe that sometimes the factor of 2 is
included in the f abc . This algebra implies already that everything is non-commutative,
hence the name of non-Abelian Lie algebra. These structure constants fulfill the Jacobi
identity
f abe f cde + f ace f dbe + f ade f bce = 0. (5.13)
These base vectors have to be further hermitiean, i. e., a = a . Note that the position
at top or bottom (covariant and contravariant) of the indices is of no relevance for the
Lie algebras to be encountered in the standard model, but can become important in more
general settings: For Lie algebras, the metric is just ab .
Such a Lie algebra can be represented, e. g., by a set of finite-dimensional matrices.
An example is the su(2) algebra with its three generators, which can be chosen to be the
Pauli matrices,
!
0 1
1 =
1 0
!
0 i
2 =
i 0
!
1 0
3 = .
0 1
Furthermore, to each algebra one or more groups can be associated by exponentiation, i.
e.,
a
a = ei , (5.14)
provides base vectors for the associated group, which are by definition unitary and thus
1
a = a . For su(2), these are again the Pauli matrices, generating the group SU(2).
However, the relation 5.14 is not necessarily a unique relation, and there can be more than
one group representation. E. g., for su(2) there are two possible groups related to the
algebra by the relation (5.14), the group SU(2) and the group SU(2)/Z2 SO(3), where
matrices which differ only be a negative unit matrix are identified with each other.
Because of the exponential relation, a generic group element exp(ia a ) with real
numbers a can be expanded for infinitesimal a as 1 + ia a . Thus the algebra de-
scribes infinitesimal transformations in the group. This will play an important role when
introducing gauge transformations for non-Abelian gauge theories.
Chapter 5. Strong interactions 73
f 123 = 1
3
f 458 = f 678 =
2
1
f 147 = f 156 = f 246 = f 257 = f 345 = f 367 = ,
2
and the corresponding ones with permuted indices. There is some arbitrary normalization
possible, and the values here are therefore conventional.
From these, also the generators for the eight-dimensional algebra su(3) can be con-
structed, the so-called Gell-Mann matrices,
0 1 0 0 i 0 1 0 0
1 = 1 0 0 2 = i 0 0 3 = 0 1 0
0 0 0 0 0 0 0 0 0
0 0 1 0 0 i 0 0 0
4 = 0 0 0 5 = 0 0 0 6 = 0 0 1
1 0 0 i 0 0 0 1 0
0 0 0 1 0 0
7 8 1
= 0 0 i = 0 1 0 . (5.15)
3
0 i 0 0 0 2
In general, there are N 2 1 base vectors for su(N), but the dependency for the other
algebras is different. For the sake of simplicity, in the following only the expressions for
su(N) will be given.
Generators, which are diagonal as matrices, and therefore commute with each other,
are said to be in the Cartan sub-algebra or sub-group of the algebra or group, respectively.
74 5.16. Yang-Mills theory and QCD
For su(2), this is only one generator, for su(3) there are two. This also gives the rank of
the group.
These lowest-dimensional realization of the commutation relations is called the fun-
damental representations of the algebra or group. Since the commutation relations are
invariant under unitary transformations, it is possible to select a particular convenient
realization. Note, however, that there may be more than one unitarily inequivalent fun-
damental representation. E. g., for su(3) there are two, called fundamental and anti-
fundamental, while there is only one for su(2).
It also possible to give representations of the algebras with higher-dimensional matrices.
The next simple one is the so-called adjoint representations with the matrices
which are three-dimensional for su(2) and eight-dimensional for su(3). There are cases in
which the fundamental and the adjoint representation coincide. This can be continued to
an infinite number of further representations, which will not be needed here, though they
may appear in the context of beyond-the-standard-model physics.
This completes the required group theory necessary to describe the standard model.
additional color index like i . A gauge transformation acts now as a rotation in this color
space, and is thus a matrix G as
g
i = Gij j = exp i a a j ,
2 ij
where the i are space-time-dependent functions, and g is playing the same role as the
electric charge, and is thus a coupling constant. The corresponding covariant derivative to
form a gauge-invariant Lagrangian has to take this matrix structure into account, yielding
g
Dij = ij i Aa ( a )ij , (5.16)
2
where the Aa are now the eight gluon fields. These have to transform under an infinitesimal
gauge transformation like
1
Aa = Aa + Dab b , (5.17)
g
Dab = gf abc Ac .
ab
(5.18)
This theory has a number of further remarkable features, which an be read off this La-
grangian.
The first is that the interaction between quarks and gluons is given by a vertex very
similar to the QED one, it is just adorned by a Gell-Mann matrix. Since the interaction
potential between quarks (5.10) is very different than the Coulomb potential, this implies
that there must be substantial radiative corrections. In fact, since there is a qualitative
change, these must be of non-perturbative origin.
Second, in QED it is in principle possible that each flavor has its own electric charge.
This is not the case here. Due to the appearance of the second term in the field-strength
tensor (5.19) the coupling constant in the gauge transformation in the gauge field (5.17)
is fixed. But the covariant derivative (5.16) will then only yield the necessary cancella-
tions between the transformations of the gluons and the quarks if the coupling constant
appearing in it is the same as for the gluon fields. Thus, every flavor has to couple to the
gluons with the same coupling constant g. This feature, which arises from the underlying
group structure, is called coupling universality, and has been experimentally verified.
Third, since the field strength tensor (5.19) is not gauge-invariant, so are not the
chromoelectric and chromomagnetic fields, and then neither can be the chromo version
of the Maxwell equation nor the color current. Hence, color and colored fields are not
gauge-invariant, and therefore cannot be observable. Because of confinement, as described
in section 5.8, this was not expected. But this makes confinement a necessity. Still, this
leaves open the question why confinement operators in the way it does, with the potential
(5.10), and not in any other way.
Fourth, the appearance of a quadratic term in the field-strength tensor (5.19) implies
that there are cubic and quartic self-interactions of the gluons with each other. This
coupling strength, due to the underlying group structure, are uniquely fixed, again a
consequence of coupling universality. But this also implies that gluons, in contrast to
photons, are not ignorant of each other. A theory with only gluons in it is therefore not a
trivial theory, but an interacting one. In fact, it turns out to be a strongly interacting one,
exhibiting features like confinement or the formation of glueballs. This reduced version of
QCD is called Yang-Mills theory. It is sometimes also denoted as quenched QCD, in the
sense that any conversion of gluons in quarks (and thus sea quarks) is fully suppressed, i. e.
quenched. It is possible to calculate in this reduced theory also approximately quantities
like hadron masses. They are then found to be roughly of the same size as in (full) QCD,
Chapter 5. Strong interactions 77
indicating that most of the physics in QCD resides in the gluon sector.
Fifth, a careful investigation of the vertices shows that the gluon-self-interaction and
the quark-self-interaction are opposite. While quarks tend to reduce the interaction, i.
e. they screen color charges, gluons tend to increase them, they anti-screen it. The ex-
perimental consequences of this fact actually lead originally to the discovery of the group
structure of QCD, as this behavior is almost exclusive to theories with a non-Abelian Lie
group as gauge group. Because anti-screening dominates this leads to the difference be-
tween QED and QCD, especially the feature of asymptotic freedom discussed in section
5.8. In the terms of section 3.8, the coupling does not become stronger but weaker with
energies.
Especially, it is possible to calculate the function (3.8) also of QCD. To leading
perturbative order only the first Taylor coefficients is relevant, which takes for QCD the
value
11 3
0 = Nc Nf = 2,
3 2
where Nc = 3 is the number of colors and Nf = 6 is the number of flavors. The value is
positive, in contrast to QED. Thus, because of the formula for the running coupling to
leading order (3.9), the interaction strength in QCD diminishes with increasing energy, a
manifestation of asymptotic freedom. The scale turns out to be roughly 1 GeV, and the
size of the coupling at about 2 GeV is 0.3, to be compared with the one of QED being
about 0.0075. QCD is thus indeed much stronger interacting at every-day energies than
QED.
However, at energies of the order of the proton mass, the perturbative coupling diverges,
and a Landau pole arises. In this case, this clearly signals the breakdown of perturbation
theory. In fact, even a value of 0.3 is already shaky for perturbative calculations, and
strong deviations from perturbation theory have been encountered up to 25 GeV. Thus,
only if all involved energy scales are in the range of (a few) 10 GeV or more QCD can be
treated perturbatively.
However, it is possible to treat QCD at low energies also non-perturbatively. There
are various methods available. The most successful one is arguably numerical simulations,
which confirms that the QCD Lagrangian (5.20) describes objects like the proton and
other hadrons. QCD is thus, to the best of our knowledge, the correct theory of hadrons,
and hence of nuclear physics.
To obtain the full Lagrangian of QCD and QED, it is just necessary to add the QED
Lagrangian to the QCD Lagrangian (5.20), and modify the covariant derivative of the
quarks to include the electromagnetic interaction as
where now the electric charge depends on the flavor, and is either 2/3 or 1/3 in units
of the electron charge. This ratio of the electron to quark charge is actually very well
established experimentally, though why it is the case is unclear. However, as will be
discussed in section 6.11 it is necessary for the formulation of the standard model in its
current form.
Before adding further elements to the standard model it is worthwhile to discuss two
examples where QCD (and partly also QED) plays an important role, which are usually
not directly associated with particles physics.
One of them is cosmic radiation. There are many sources in the universe which create
high-energetic particles. These are leptons and photons, but also protons. Electrically
neutral particles can be most easily tracked back to their origin, as their paths are not
changed due to galactic or intergalactic magnetic fields, while protons are affected. These
cosmic particles can have energies many orders of magnitude larger than created by any
experiment. Energies of up to almost 109 TeV have been observed20 . Possible sources may
be active galactic nuclei.
When such a high-energetic particles hits earths atmosphere, it reacts with the atoms
and atomic nuclei. As a consequence, a jet, called in this context a cascade, of hadrons
and electrons, muons, and photons is created, which an be observed with observatories on
earth. To be able to decide whether the original particle was charged or uncharged requires
to understand the evolution of this jet. This is a complicated problem, as it requires to
understand how the various particles interact and are produced. But this is decisively
important, so that it becomes possible to understand whether the direction of the cascade
could have been influenced by magnetic fields, and thus whether its origin is meaningful
or not.
Similar to the Monte Carlo generators of section 5.14.3 for particle collisions, this can be
usually only done by simulations. This is an important task for astrophysical observation,
and our knowledge of the universe.
20
Such very high-energetic particles are only observed extremely rarely, at rates which can be as low as
one per km2 and year, and are thus not useful to test particle physics.
Chapter 5. Strong interactions 79
an experimental setup, temperatures as high as 600-700 MeV with almost zero chemical
potential, despite the 416 nucleons in the original nuclei, can be achieved. Furthermore,
this situation poses no serious problems to numerical simulations. Hence, the knowledge
of this axis is rather good.
It turns out that the physics depends significantly on the masses of the up and down
quarks. Though this is also suspected for the remainder of the phase diagram, it is ev-
ident in this case. If both quarks are very heavy, there is a first-order phase transition
at a temperature of about 250-300 MeV. As the quark masses become lighter, this tem-
perature decreases, and the transition becomes a rapid cross-over at a temperature23 of
about 150-160 MeV, the aforementioned Hagedorn temperature. This is the situation for
up and down quark masses observed in nature, the physical masses. The mass of the
strange quark influences the precise values of the temperatures, but does not provide any
qualitative influence, and the heavier quarks have even less relevance. If the quark masses
are decreased further, the temperature still drops a little bit. More importantly, at some
point the cross-over turns again into a phase transition, this time of second order, where
it stays until zero quark masses.
What happens can be understood already in a simple picture. Temperature is classi-
cally nothing more than the kinetic energy of particles. In a quantum theory, temperature
is just energy, which can also be converted to new particles. This will be exponentially
suppressed with the mass of the created particles. Hence, the lightest particles will be
most copiously produced. In QCD, these are the pions. These particles will have large
kinetic energies, and will rapidly and repeatedly collide. Because of the asymptotic free-
dom of QCD, these scatterings will mainly be dominated by hard partonic scatterings, and
thus almost perturbative. Thus, QCD becomes essentially as it behaves at high-energies.
Especially, this implies that the effects of chiral symmetry breaking become reduced, and
the quarks lose their effective mass, though not their current mass, at the phase transition
or cross-over. In fact, in the limit of zero quark mass, the second order transition becomes
a symmetry transition where chiral symmetry becomes restored. At the same time, since
most collisions are hard and partonic, excited states become very unstable and in most
cases it does not matter anymore that quarks are confined into hadrons. They act effec-
tively as if they would no longer be confined. Thus, one speaks also of a deconfined phase,
and calls the transition a deconfinement transition. However, since it is a cross-over, it is
clear that qualitatively nothing has changed, but quantitatively it is a completely different
situation.
23
There is no unique definition of a cross-over temperature. This is just the temperature were most
changes occur.
82 5.18. The QCD phase diagram
Weak interactions
The last ingredient of the standard model are the weak interactions, which will again be a
gauge interaction, though of quite different character than both the electromagnetic and
the strong interaction. The name weak is also somewhat misleading, as the interaction
is actually stronger than the electromagnetic one, but it is short-range, as is the strong
interaction, though for entirely different reasons.
It is impossible to separate from any description of the weak interactions the recently
established Higgs sector, as without it the weak interactions would be very different in
character. In fact, without the Higgs sector, it would be very similar to the strong interac-
tion. The Higgs effect introduces the only non-gauge interactions into the standard-model.
Depending on personal taste, the interactions of the Higgs are not counted at all as true
interactions, as many believe these are just low-energy effective realizations of a yet un-
known gauge interaction, a single interaction, or, by counting the number of independent
coupling constants, 13 interactions. This will become clearer in the following.
One of of the most interesting facts about the weak interactions is that they live on
a quite different scale than electromagnetism and the strong interactions. While electro-
magnetism acts mainly on the scales of atomic physics, and the strong interactions at the
scale of nuclear and hadronic physics, the typical scale of the weak interactions will be
found to be of the order of 100 GeV. Since this is the highest known scale in particle
physics yet, apart from the gravitational scale, it is widely believed that the study of the
weak interaction will be the gateway to understand whether there is any other physics
between this scale and gravity. This is the main reason it plays such a central role in
modern particle physics.
One of the reasons why the weak interactions, or more commonly denoted as the
electroweak sectors for reasons to become clear later, is rather unwieldy is that it is an
aggregation of several, intertwined phenomena. Though each of them can be studied on
83
84 6.1. decay and parity violation
its own, essentially none of them unfolds its full relevance without the others. This makes
the electroweak sector often at first somewhat confusing. The different elements appearing
will be a new gauge interaction, a mixing between this new interaction and QED, parity
violation, the Higgs effect, and flavor changes.
where p, n, e, and e represent the fields of the involved proton, neutron, electron, and
the (anti)neutrino and are four-component spinors. The characteristic scale for the weak
process was set by the Fermi constant, which is of order 1.14105 GeV2 . This is
not a renormalizable interaction, and as such should be only the low-energy limit of an
underlying renormalizable theory.
Besides the appearance of a new particle, the -decay shows another quite surprising
property: It violates parity. Thus, a -decay is not operating in the same way as it would in
the mirror. Experimentally, this can be observed by embedding the process into a magnetic
Chapter 6. Weak interactions 85
field. A parity-conserving interaction, like electromagnetism, should yield the same result,
irrespective of whether the spins are aligned or anti-aligned with the magnetic field1 , due
to parity invariance. However, experimentally a difference is observed, indicating parity
violation. More precisely, it was observed that a polarized neutron which decays will emit
the electron preferentially in one direction. Therefore, the interaction must couple spin
s and momenta, and would therefore have a contribution proportional to sp. However,
the momenta of the decay products also depend on the invariant mass, p2 , and thus on
a scalar contribution. Since therefore both scalars (scalar products of two vectors or two
axial vectors) and pseudoscalars (products of a vector and an axial vector) appear imply
that the interaction is not having a definite transformation behavior under parity, and is
thus parity violating. In fact, it turned out that it is maximally parity violating.
To give this a more formal version, it is necessary to consider how coupling of fermions
transform under parity. The parity transformation of a spinor is obtained by multiplying
it with 0 , the time-Dirac matrix. So, for spinors the parity transformation is given by
P = 0 .
{5, } = 0.
one of the helicity states would be affected by the weak interactions. The helicity state
for a spinor is projected out as
1 5
.
2
The sign is determined by whether left-handed or right-handed states should be selected.
Experiment finds that only left-handed states are involved, and thus a minus sign is appro-
priate. Furthermore, the weak interactions are found to violate also the charge conjugation
symmetry maximally. Hence, the sign is not reversed for the anti-particle state. Therefore,
the correct four-fermion interaction version of the weak interactions would be (appropri-
ately normalized)
GF 1 5 1 5 1 5 1 5
,
2 2 2 2 2
which therefore exhibits maximal violation of C and P individually, but conserves CP.
Hence, the new interaction is very different from either the strong or electromagnetic
force: There are particles only responding to it, the neutrinos, and it violates both parity
and flavor conservation. Furthermore, it is found to be very short range, much more than
the strong interaction. On the other hand, electrons and neutrinos are both sensitive to
it, but are not confined. Thus, the way how this range limitation is operating has to be
different from the confinement mechanism of QCD.
6.2 Neutrinos
Before discussing the weak interaction itself. It is worthwhile to introduce the neutrinos
first. Some further properties of them will be added later, in section 6.9, when the weak
force has been discussed in some more detail.
So far, only one neutrino has been introduced, the electron-neutrino, a fermion of
spin 1/2. There are actually two more flavors of neutrinos, the muon-neutrino and
the -neutrino , which are again fermions of spin 1/2. As their name suggest, they are
partnered with one lepton each, forming the last member of the particle generations, which
are conventionally assigned to be the sets {u, d, e, e}, {c, s, , }, and {t, b, , }. These
combinations are somewhat arbitrary, and mainly motivated by the masses of the particles.
Another motivation is that flavor is, though violated, still approximately conserved for
leptons and neutrinos. In then turns out that, if electrons carry an electron flavor number
of one and positrons of -1, electro-neutrinos likewise carry an electron flavor number of 1
and anti-electron neutrinos of -1. Hence, e. g. the -decay, conserves this flavor number.
Likewise, there are approximately conserved flavors for muon-like and tauon-like particles.
Chapter 6. Weak interactions 87
As a consequence, neutrinos are usually also included in the term leptons, though not
always.
When it comes to masses, the neutrinos actually are very surprising. To date, it is
only known for sure that their mass is below2 2 eV. A direct measurement of the mass of
the neutrinos is complicated by the fact that they are only interacting through the weak
force, which reduces cross-sections by order of magnitudes compared to the leptons and
the quarks. However, it was possible to measure the mass differences of the neutrinos in a
way to be explained in section 6.9. But so far, it was only possible to measure differences
without knowing which of the possible differences are measured. The two values are about
0.009 eV and 0.05 eV. This implies that at least one of the neutrinos has a mass of 0.05
eV, though it is not clear, which of them. It is furthermore not yet clear, whether the
other two to or just one is much lighter. If the electron neutrino would be the heaviest,
there are very good chances to measure its mass directly within the next decade. Since the
other neutrino species are much harder to produce in larger numbers, this will take much
longer if it is is either of the other ones. It is also, in principle, still possible that one of
the neutrinos could be massless, though there is yet no strong evidence for this possibility.
however, that in FCCC also the masses play a role, and for reasons to be discussed later
massless particles would not participate. Therefore flavor changes for the leptons occur
much more rarely because of the very small neutrino masses, though have been observed
nonetheless.
Over the time other weak effects were found, all characterized by the scale GF . In
particular, after some time the postulated interaction bosons have been observed directly.
There are two charged ones, the W , and a neutral one Z 0 , with masses about 80 and 91
GeV, respectively, and where the superscripts indicate their electric charge. In fact, the
weak interaction turns out to have a very similar structure as QED and QCD in the sense
that it is a gauge theory, where additional gauge bosons appear. The doublet structure
already suggests as the underlying gauge group one with two different charges and three
gauge bosons. It finally turns out that the correct gauge group is SU(2). That the W
carry electric charge indicate a relation to QED, which will be discussed in section 6.7.
However, masses are not permitted for gauge bosons, or else gauge invariance would be
broken. This breaking has severe effects, making the theory non-renormalizable. There-
fore, despite the experimental verification of the existence of the weak gauge bosons and
their masses, this would not improve the theoretical description. This was only achieved
after gauge theories have been combined with the Goldstone theorem described in section
5.7.
To understand that this is the right way, consider a photon A coupled to some external
current j in the Lagrangian
1
L = ( A A )( A A ) j A .
4
It will be this current, and its generalization, which will be instrumental in obtaining a
massive gauge boson and in addition a gauge symmetry, and thus a superficially massive
vector boson. However, since these two concepts are usually contradictory, this will lead
to a hiding of the gauge symmetry, similarly as the global symmetry has been hidden in
section 5.7.
The corresponding equation of motion for the photon is
2 A A = j . (6.3)
( 2 + M 2 )A = 0. (6.4)
To proceed further, it is useful to choose a particular gauge, in this case the Landau-
Lorentz gauge A = 0. Of course, fixing a gauge is a perfectly acceptable way to perform
a calculation, provided the calculation is not violating gauge invariance at any point. Then
the results for gauge-invariant quantities, like the number of physical polarizations, will be
valid. In the present case, doing a calculation without fixing the gauge is at best tedious.
The results can be translated to the general case of an arbitrary gauge, but it should
90 6.4. W and Z bosons
be kept in mind that any relation on gauge-variant quantities, like the gauge field A ,
will only be valid in this particular gauge. However, statements about gauge-invariant
quantities, like, e. g., the field-strength tensor F in this Abelian example, will hold in
any gauge.
After imposing this gauge, the equation of motion (6.3) for the spatial components
becomes
2A~ = ~j. (6.5)
To simplify matters further, restrict to a time-independent situation. Then all time-
derivatives vanish, and the equation (6.4) takes the form
~ = M 2 A.
2A ~ (6.6)
From the equations (6.5) and (6.6) it is clear that the current must satisfy the condition
~
~j = M 2 A
in order that the photon (in this gauge) becomes apparently massive and at the same
time the gauge invariance is not broken. It is left to find some way of producing the
~ is due
appropriate current. Physically, the origin of such a current being proportional to A
to the response of a medium to the acting electromagnetic fields. E. g., this is realized by
the Meissner effect in superconductivity. Therefore, giving a mass to the photon requires
a medium which provides a response such that the photon becomes damped and can
therefore propagate only over a finite distance. In the electroweak case, the role of this
medium will be taken by a condensate of Higgs particles.
This is a screening process as can be seen from the following simple example. Consider
the Maxwell-equation
~ B = ~j = M 2 A.
~
~ = M 2 B.
2B ~
mass has been obtained. The other vital ingredient for a massive vector boson is a third
polarization state. Similarly, also this other degree of freedom will be provided effectively
by the medium, as will be discussed below.
To make the preceding discussion applicable to elementary particle physics, it is useful
to write them down covariantly. For this purpose, select the Landau gauge A = 0.
Then the Maxwell equation takes the form
2 A = j .
The necessary requirement that, at least approximately, the current has a component
proportional to A is then directly obtained by considering the current for a charged,
scalar field, which has the form
j = iq(+ ( + )) 2q 2 A ||2 ,
which can be derived as the Noether current from the corresponding Lagrangian
L = D + D
D = + iqA .
2 A = iq(+ ( + )) 2q 2 A ||2.
This will have precisely the required form, if the modulus of the field, || = v can be
arranged to have a non-zero value in the vacuum. However, this will only be possible, if
its expectation value obeys
h0||0i =
6 0 (6.7)
in the selected gauge. In this case, the photon would acquire an effective mass of (2q 2 v 2 )1/2 .
And by this, the photon is screened out, making the effect short-ranged. How to implement
this will be discussed next.
becomes more complicated, but at the same time also more interesting, than in the case
without gauge fields.
For simplicity, start with an Abelian gauge theory, coupled to a single, complex scalar,
the so-called Abelian Higgs model, before going to the full electroweak and non-Abelian
case. This theory has the Lagrangian
1 1
L = (( + iqA ))+ ( + iqA ) ( A A )( A A )
2 4
1 2 + 1 2 + 2
+ ( ) . (6.8)
2 2 f2
Note that the potential terms are not modified by the presence of the gauge-field. There-
fore, the extrema have still the same form and values as in the previous case, at least
classically. However, it cannot be excluded that the quartic + A A term strongly dis-
torts the potential. This does not appear to be the case in the electroweak interaction,
and this possibility will therefore be ignored.
To make the consequences of the Higgs effect transparent, it is useful to rewrite the
scalar field as3
f
(x) = + (x) exp(i(x)).
2
This is just another reparametrization for the scalar field, compared to and previously.
It is such that at = 0 this field configuration will be a classical minimum of the potential
for any value of the phase . Inserting this parametrization into the Lagrangian (6.8)
yields
2 2 2
q2
1 1 f f f
L = +
+ + qA + + A A +
2 2 2 2 4 2
2 2
4
1 1 f 1 f
( A A )( A A ) + 2 + 2
+
4 2 2 2f 2
This is an interesting structure, where the interaction pattern of the photon with the radial
and angular part are more readily observable.
Now, it is possible to make the deliberate gauge choice
1
A = 2 . (6.9)
q
3
Note that if the space-time manifold is not simply connected and/or contains holes, it becomes
important that is only defined modulo 2. For flat Minkowski (or Euclidean) space, this is of no
importance. However, it can be important, e. g., in finite temperature calculations. It is definitely
important in ordinary quantum mechanics, where, e. g., the Aharanov-Bohm effect and flux quantization
depend on this.
Chapter 6. Weak interactions 93
This is always possible. It is implemented by first going to Landau gauge and then perform
the gauge transformation
1
A A +
q
exp(i).
This gauge choice has two consequences. The first is that it makes the scalar field real
everywhere. Therefore, the possibility of selecting the vacuum expectation value of to
be real is a gauge choice. Any other possibilities, e. g. purely imaginary, would be an
equally well justified gauge choice. This also implies that the actual value of the vacuum
expectation value f of is a gauge-dependent quantity, and will vanish when not fixing a
gauge. The Lagrangian then takes the form4
2
q2
1 f
L = + A A +
2 4 2
2 4
1 2
1 1 f f
( A A )( A A ) + 2 + + (6.10)
4 2 2 2 f2 2
i. e., the second term has now exactly the form of a screening term, and yields an effective
mass qf /4 for the photon field. Furthermore, if could be neglected, the Lagrangian
would be just
q2f 2 1
L= A A ( A A )( A A ),
8 4
i. e., the one of a massive gauge field. Together with the gauge condition for A , which links
the longitudinal part of the gauge boson to the now explicitly absent degree of freedom
, this implies that the field A acts now indeed as a massive spin-1 field. Furthermore,
this Lagrangian is no longer gauge-invariant. This is not a problem, as it was obtained by
a gauge choice. Thus, it is said that the gauge symmetry is hidden. Its consequences are
still manifest, e. g., the mass for the gauge boson is not a free parameter, but given by the
other parameters in the theory. By measuring such relations, it is in principle possible to
determine whether a theory has a hidden symmetry or not.
Colloquial, the hiding of a symmetry is also referred to as the breaking of the symmetry
in analogy with the case of a global symmetry. However, a theorem, Elitzurs theorem,
actually forbids this to be literally true.
The Goldstone theorem of section 5.7, actually guarantees that a mass will be provided
to each gauge boson associated with one of the hidden directions at the classical level. This
4
Upon quantization, additional terms are introduced due to the gauge-fixing. At the classically level
studied here, they can be ignored.
94 6.5. The Higgs effect and the Higgs boson
hidden direction is here the phase, which is massless. This masslessness of the Goldstone
boson is actually guaranteed by the Goldstone theorem. Hence, a massless Goldstone
boson is effectively providing a mass to a gauge boson by becoming its third component
via gauge-rotation, and vanishes by this from the spectrum. It is the tri-linear couplings
which provide the explicit mixing terms delivering the additional degree of freedom for
the gauge boson at the level of the Lagrangian. Hence, the number of degrees of freedom
is preserved in the process: In the beginning there were two scalar and two vector degrees
of freedom, now there is just one scalar degree of freedom, but three vector degrees of
freedom. The gauge-transformation made nothing more than to shift one of the dynamic
degrees of freedom from one field to the other. This was possible due to the fact that both
the scalar and the photon are transforming non-trivially under gauge-transformations.
This can, and will be, generalized below for other gauges. In general, it turns out that
for a covariant gauge there are indeed six degrees of freedom, four of the vector field, and
two from the scalars. Only after calculating a process it will turn out that certain degrees
of freedom cancel out, yielding just a system which appears like having a massive vector
particle and a single scalar.
Note that though the original scalar field was charged as a complex field, the radial
excitation as the remaining degree of freedom is actually no longer charged: The coupling
structure appearing in (6.10) is not the one expected for a charged field.
However, the choice (6.9), which is called the unitary gauge, is extremely intransparent
and cumbersome for most actual calculations. The reason is that the quantization of the
the theory in this gauge requires the introduction of an infinite number of further terms
into the Lagrangian, though with fixed coefficients.
A more convenient possibility, though at the cost of having unphysical degrees of
freedom which only cancel at the end, are t Hooft gauges. To define this gauge once more
the decomposition
= (, f + )
for the scalar field is useful. The Lagrangian then takes the form, up to constant terms,
1 (gf )2 1 1
L = F F + A A + 2 2 + gf A (6.11)
4 2 2 2
1 2 2
+2igA ( + ) 2g(gf )AA g 2 (2 + 2 )A A ( + 2 )2 .
2 f2
There are a number of interesting observations. Again, there is an effective mass for A ,
due to the four-field interaction term. Secondly, only the field has a conventional mass
term, the mass-term for the field has canceled with the two-field-two-condensate term
from the quartic piece of the potential. Finally, there will be terms of type f A . This
Chapter 6. Weak interactions 95
implies that a photon can change into a while moving, with a strength proportional to
the condensate f , and thus mixing between one of the Higgs degrees of freedom and the
photon occurs. Therefore, the photon and this scalar, the would-be Goldstone boson, will
mix. Note finally that though many more interaction terms have appeared, none of them
has any new free parameter, as a consequence of the now hidden symmetry. The only
quantity which looks like a new quantity is the condensate value f , though it is classically
uniquely determined by the shape of the potential. In a quantum calculation, it can also
be determined, but not in perturbation theory, where it remains a fit parameter.
The gauge condition to be used for quantizing this Lagrangian is then given by
A = qf , (6.12)
Lm = m(L R R L ),
However, massless fermions can be accommodated in the theory. Since the observed
quarks and (at least almost) all leptons have a mass, it is therefore necessary to find a
different mechanism which provides the fermions with a mass without spoiling the isoweak
gauge invariance.
A possibility to do so is by invoking the Higgs-effect also for the fermions and not only
for the weak gauge bosons. By adding an interaction
+
k 1 5 k 1 + 5 k 1 5 k 1 + 5
Lh = gf k i ij + ij j + gf k i ij + ij j ,
2 2 2 2
this is possible. The constant matrices and have to be chosen such that the terms
become gauge-invariant. Calculating their precise form is tedious, but straightforward. If,
in this interaction, the Higgs field acquires a vacuum expectation value, = f +quantum
fluctuations, this term becomes an effective mass term for the fermions, and it is triv-
ially gauge-invariant. Alongside with it comes then an interaction of Yukawa-type of the
fermions with the Higgs particle. However, the interaction strength is not a free parameter
of the theory, since the coupling constants are uniquely related to the tree-level mass mf
of the fermions by
2mf
gf = .
f
But the 12 coupling constants for the three generations of quarks and leptons are not
further constrained by the theory, introducing a large number of additional parameters in
the theory. Though phenomenologically successful, this is the reason why many feel this
type of description of the electroweak sector is insufficient. However, even if it would be
incorrect after all, it is an acceptable description at energies accessible so far, and thus
has become part of the standard model.
Right-handed neutrinos: t = 0, t3 = 0, y = 0 (Q = 0)
Right-handed leptons: t = 0, t3 = 0, y = 2 (Q = 1)
W+ : t = 1, t3 = 1, y = 0 (Q = 1)
W : t = 1, t3 = 1, y = 0 (Q = 1)
Z: t = 1, t3 = 0, y = 0 (Q = 0)
: t = 0, t3 = 0, y = 0 (Q = 0)
Gluon: t = 0, t3 = 0, y = 0 (Q = 0)
Higgs: a complex doublet, t = 1/2 with weak hypercharge y = 1. This implies zero
charge for the t3 = 1/2 component, and positive charge for the t3 = 1/2 component
and negative charge for its complex conjugate
This concludes the list of charge assignments for the standard model particles. The Higgs
case will be special, and will be detailed in great length below.
Since at the present time the photon field and the Z boson are not yet readily identified,
it is necessary to keep the gauge boson fields for the SU(2) and U(1) group differently, and
these will be denoted by W and B respectively. The corresponding pure gauge part of the
electroweak Lagrangian will therefore be
1 1
Lg = Ga G
a F F
4 4
F = B B
Ga = Wa Wa + gf abc Wb Wc ,
Chapter 6. Weak interactions 99
where g is the weak isospin gauge coupling. f abc are the structure constants of the weak
isospin gauge group, which is just the SU(2) gauge group.
Coupling matter fields to these gauge fields proceeds using the ordinary covariant
derivative, which takes the form
ig ig y
D = + a Wa + B ,
2 2
where g is the hypercharge coupling constant, which is modified by the empirical factor y.
Note that y is not constrained by the gauge symmetries, and its value is purely determined
by experiment. Thus, why it takes the rational values it has is an unresolved question
to date. However, this fact would come about naturally, if the weak gauge group would
originate from a different gauge group at higher energies, say SU(2)U(1)SU(3), which
is hidden to the extent that all other fields charged under this larger gauge group are
effectively so heavy that they cannot be observed with current experiments.
The matrices a are determined by the representation of SU(2) in which the matter
fields are in. For a doublet, these will be the Pauli-matrices. For the adjoint representation,
these would be given by the structure constants, bca = f abc , and so on. For fermions, of
course, this covariant derivative is contracted with the Dirac matrices . Precisely, to
couple only to the left-handed spinors, it will be contracted with (1 5 )/2 for the Wa
term and with for the kinetic and hypercharge term. By this, the phenomenological
couplings are recovered in the low-energy limit, as the exchange of a massive gauge boson
then becomes proportional to 1/M 2 , thus recovering the Fermi-coupling g 2 /M 2 . How this
mass disappears in the case of the Abelian gauge group will be discussed next.
Lh = Lg + (D )+ D + V (+ )
!
1 1 + i2
= (6.14)
2 3 + i4
100 6.7. Weak isospin, hypercharge, and electroweak unification
where the field is thus in the fundamental representation of the gauge group. Its hy-
percharge will be 1, an assignment which will be necessary below to obtain a massless
photon. The potential V can only depend on the gauge-invariant combination + , and
thus can, to be renormalizable, only contain a mass-term and a quartic self-interaction.
The mass-term must be again of the wrong sign (imaginary mass), such that there exists
a possibility for the field to acquire a (gauge-dependent) vacuum-expectation value.
To work in unitary gauge it is best to rewrite the Higgs field in the form
!
i a a 0
=e 2 .
There are now the three a fields and the field. Performing a gauge transformation such
that the phase becomes exactly canceled, and setting = f + with f constant makes
the situation similar to the one in the Abelian Higgs model. Note that by a global gauge
transformation the component with non-vanishing expectation value can be selected still
at will.
The mass will be made evident by investigating the equation of motion for the gauge
bosons
2 Wa ( Wa ) = ja + jaW
2 B ( B ) = jy ,
where j W contains contributions which involve only the W s, while j contains all remaining
terms. Only the latter will be relevant below, and therfeore for it the subscript will
be dropped. Thus, it is necessary to determine the weak isospin current ja and the
hypercharge current jy . In contrast to the Abelian case this current will now also contain
contributions from the self-interactions of the W gauge bosons. However, at tree-level
these can be ignored.
The contribution of the Higgs field to the currents6 are
ig + a
ja = ( D (D )+ a )
2
ig +
jy = ( D (D )+ ).
2
The appearance of a makes manifest that the current ja is a weak isovector current, while
their absence signifies that the hypercharge current jy is a weak isoscalar current.
6
Of course, all fields with non-zero weak isospin will contribute to this current. But only the Higgs-field
will do so classically.
Chapter 6. Weak interactions 101
where the quantum fluctuations vanish at tree-level will provide the vacuum expectation
value !
0
h0||0i = f .
2
This is then as in the Abelian case, and indeed sufficient to provide a mass for the gauge
bosons.
Ignore for a moment the hypercharge contribution. Then the term linear in Wa on the
right-hand side of (6.15) provides a term
2
2 a gf
M W = Wa ,
2
and thus a mass MW = gf /2 is provided. Thus, the equation of motion for the Wa field
takes the form
( 2 + MW
2
)Wa ( Wa ) = ja (f, , Wa, ...),
where the mass-term has been removed from the current. Thus, formally this is the
equation of motion for a massive gauge boson, interacting with itself and other fields due
to the current ja . As has been seen in the Abelian case, the additional degree of freedom
has been selected by the choice of gauge, and is provided by the phase of the Higgs field.
Next it is necessary to check what happens when including also the B part once
more. Since the t3 = 1/2 component of the Higgs field delivers the mass, this component
should not be electrically charged. According to the relation (6.13) thus the assignment
of the hypercharge y = 1 for the Higgs field is a-posterior justified. In the full current
of the Higgs field, besides the contribution + Wa also the contribution gg y/2B+ a
appears. Since the matrices 1 and 2 are off-diagonal it follows that
!
0
(0, f ) 1,2 = 0.
f
7
Using { a , b } = 2 ab .
102 6.7. Weak isospin, hypercharge, and electroweak unification
Thus only in the third component of the current a contribution due to the vacuum ex-
pectation value of the Higgs field appears. As a consequence, despite the appearance of
the additional B gauge boson the result for the mass (at tree-level) for the off-diagonal
W1,2 bosons is not changed. However, this is not the case for the third component. The
relevant part of the equation of motion reads
2 2 2
2 3 3 g f gf g f
W ( W ) = W3 + B .
2 2 2
Consequently, also the relevant part of the equation of motion for the hypercharge gauge
boson is of similar structure
2
2 gf gf g f 3
B ( B ) = B + W .
2 2 2
Hence, even in the vacuum the equations are coupled, and thus both gauge bosons mix.
These equations can be decoupled by changing variables as
A = W3 sin W + B cos W
Z = W3 cos W B sin W ,
which are the fields given the name of the photon A and the Z boson Z . The mixing
parameter W is the (Glashow-)Weinberg angle W , and is given entirely in terms of the
coupling constants g and g as
g
tan W =
g
g
cos W = p
g2 + g2
g
sin W = p .
g2 + g2
Using the inverse transformations
A cos W + Z sin W
W3 =
cos W sin W
A sin W Z cos W
B =
cos W sin W
it is possible to recast the equations of motion. The equation of motion for the photon
takes the form
2 A ( A ) = sin W j3 .
Here, j3 contains all contributions which are not providing a mass to the gauge bosons.
Thus, the photon is effectively massless. The price paid is that it now has also direct
Chapter 6. Weak interactions 103
interactions with the weak gauge bosons. On the other hand, the equation for the Z
boson takes the form
2 f 2 (g 2 + g 2 )
Z Z = Z + cos W j3 .
4
Thus, the Z boson acquires a mass larger than the W 1,2 boson. In particular the relation
is
MZ cos W = MW .
According to the best measurements this effect is, however, only of order 15%. At the same
time the self-interaction of the Z boson with the other weak bosons is reduced compare
to the one of the original W3 boson.
Of course, this changes also the form of the coupling to the neutral fields for matter. In
particular, the symmetry of SU(2) is no longer manifest, and the W 1,2 cannot be treated
on the same footing as the neutral bosons. E. g., the neutral part of the coupling in the
covariant derivative now takes the form
y g y
D(N ) = + ig sin W A t3 + +i Z t3 cos2 W sin2 W .
2 cos W 2
Using the relation (6.13), it is possible to identify the conventional electric charge as
e = g sin W ,
i. e., the observed electric charge is smaller than the hypercharge. It should be noted that
this also modifies the character of the interaction. While the interaction with the photon
is purely vectorial, and the one with the W bosons remains left-handed (axial-vector),
the interaction with the Z boson is now a mixture of both, and the mixing is parametrized
by the Weinberg angle.
Note that the masslessness of the photon is directly related to the fact that the corre-
sponding component of the Higgs-field has no vacuum expectation value,
y 3
+ h0||0i = 0,
2 2
and thus the vacuum is invariant under a gauge transformation involving a gauge trans-
formation of A ,
h0||0i = h0| exp(i(y/2 + 3 ))|0i .
Thus, the original SU(2)U(1) gauge group is hidden, and only a particular combination
of the subgroup U(1) of SU(2) and the factor U(1) is not hidden, but a manifest gauge
symmetry of the system, and thus this U(1) subgroup is the stability group of the elec-
troweak gauge group SU(2)U(1). It is said, by an abuse of language, that the gauge
104 6.8. CP violation and the CKM matrix
group SU(2)U(1) has been broken down to U(1). Since this gauge symmetry is mani-
fest, the associated gauge boson, the photon A , must be massless. If, instead, one would
calculate without the change of basis, none of the gauge symmetries would be manifest.
However, the mixing of the B and the W3 would ensure that at the end of the calculation
everything would come out as expected from a manifest electromagnetic symmetry.
Also, this analysis is specific to the unitary gauge. In other gauges the situation may be
significantly different formally. Only when determining gauge-invariant observables, like
scattering cross-sections or the masses of gauge invariant bound-states, like positronium,
everything will be the same once more.
gS=0 = g cos C
gS=1 = g sin C
where g is a universal strength parameter for the weak interactions, its coupling constant.
The angle parametrizing the decay is called the Cabibbo angle. A similar relation also
holds in the leptonic sector for the muon quantum number
g=0 = g cos CL
g=1 = g sin CL ,
where, however, sin CL is almost one, while in the quark sector sin C is about 0.22. Cor-
responding observations are also made for other flavors.
This result implies that the mass eigenstates of the matter particles are not at the
same time also weak eigenstates, but they mix. Hence, on top of the P-violating and
C-violating factors of (1 5 )/2, it is necessary to include something into the interaction
which provides this mixing. This can be done by introducing a flavor-dependent unitary
Chapter 6. Weak interactions 105
coupling matrix
(L)
0 cos C 0
g
G = cos C(L) 0
(L)
sin C .
2 (L)
0 sin C 0
This is equivalent to just use a doublet
u d cos C + s sin C ,
e. g., in the quark sector. Such a doublet structure can be associated with (weak) charges
Qu = 1/2 and Qds = 1/2. This is just the weak isospin.
Hence, the flavor (and mass) eigenstates of the quarks are effectively rotated by a
unitary matrix. For two generations, this matrix, the Cabibbo matrix is uniquely given
by !
cos C sin C
VC = ,
sin C cos C
with again the Cabibbo angle C , with a value of about sin C 0.22. For three gener-
ations, there exist no unique parametrization of the mixing matrix, which is called the
Cabibbo-Kobayashi-Maskawa (CKM) matrix. The interaction is thus parametrized as
dL
T +
(uL , cL , tL ) W VCKM sL + h.c.
bL
Vud Vus Vub
u d
VCKM = VL VL = Vcd Vcs Vcb .
s12 s23 ei13 c12 c23 s13 c12 s23 ei13 s12 c23 s13 c23 c13
cij = cos ij
sij = sin ij
106 6.9. Neutrino oscillations and the PMNS matrix
To have only 4 free parameters (12 , 13 , 23 , and 13 ) requires not only unitarity, but
also to exploit some freedom in redefining the absolute phases of the quark fields. Testing
whether this matrix is indeed unitary by measuring the nine components individual is
currently recognized as a rather sensitive test for physics beyond the standard model. The
condition for unitarity can be recast into a form that a sum of three functions of the angles
has to be , forming a triangle. Thus, testing for the unitarity, and thus standard-model
compatibility of CP violations, is often termed measuring the unitarity triangle.
The presence of this matrix also gives rise to the possibility that not only C and P are
violated separately, but that also the compound symmetry CP is violated (and therefore
also T by virtue of the CPT-theorem). That occurs to lowest order in perturbation theory
by a box-diagram exchanging the quark flavor of two quarks by the exchange of two W
bosons.
That such a process violates CP can be seen as follows. The process just described
is equivalent to the oscillation of, e. g., a ds bound state into a sd bound state, i. e., a
neutral kaon K 0 into its anti-particle K 0 . The C and P quantum numbers of both particles
are P= 1, C= 1 and P= 1, C= 1, respectively, and thus CP= 1 and CP= 1. Thus,
any such transition violates CP. Performing the calculation of the corresponding diagram
yields that it is proportional to the quantity
Thus, such a process, and thus CP violation is possible if there is mixing at all (all ij
non-zero) with a non-trivial phase 13 , and the masses of the quarks with fixed charge are
all different. They may be degenerate with ones of different charge, however. Since such
oscillations are experimentally observed, this already implies the existence of a non-trivial
quark-mixing matrix. The value of 13 is hard to determine, but is roughly 1.2.
Within the standard model, there is no explanation of this mixing, however, and thus
these are only parameters.
Under time evolution an initial pure state (1, 0) will therefore acquire a lower component,
if is non-zero. On the other hand, if the composition of a pure state after an elapsed
time is measured, it is possible to obtain the size of . The probability P to find a particle
in the state (0, 1) after a time t is given by
In the standard model, the corresponding expression for the transition probability involves
the mass difference between the two states. To lowest order it is given for the two-flavor
case by 2
2 m L
P = sin sin2 (2),
4E
where m is the mass difference between both states, E is the energy of the original
particle, L is the distance traveled, and is the corresponding Cabbibo angle. If the
probability, the energy and the distance is known for several cases, both the Cabbibo
angle and the mass difference can be obtained. Of course, both states have to have the
same conserved quantum numbers, like electrical charge.
The same calculation can be performed in the more relevant three-flavor case. The
obtained transition probability from a flavor fi to a flavor fj is more lengthy, and given
by
m2kl
X X
2 2
P (fi fj , L, E) = |Vjk | |Vik | + 2 |Vjk Vik Vil Vjl | cos L arg(Vjk Vik Vil Vjl )
2E
k k>l
This sum shows that the process is sensitive to all matrix elements, including the CP-
violating phase, of the CKM matrix, while it cannot determine the sign of the mass
differences.
For the quarks, the first observation of CKM effects were due to decays. Such oscil-
lations have also been found later, and studied to determine the matrix elements more
precisely. E. g., it is possible with a certain probability that a particle oscillates from a
108 6.9. Neutrino oscillations and the PMNS matrix
short-lived state to a long-lived state. This is the case for the so-called K-short KS and
K-long KL kaons, mixed bound states of an (anti-)strange and an (anti-)down quark. This
has been experimentally observed, but the distance L is of laboratory size, about 15 m for
KL and 15 cm for KS , giving a m of about 3.5 1012 MeV. However, in this case the
effect is rather used for a precision determination of the mixing angle, since the mass can
be accurately determined using other means.
In the lepton sector, these oscillations were the original hint for non-zero neutrino
masses. As it is visible, such oscillations only occurs if the involved particles have a mass.
Thus, only with massive neutrinos there was chance to see the effect. But there is no direct
determination of their mass, and the best results so far is an upper limit on the order of 2
eV from the -decay of tritium. Observing these oscillations then indicated the presence
of neutrino masses, and that there is a CKM-like matrix also in the lepton sector, as also
otherwise there would be no mixing - for a unit matrix the above given formula reduces
to the first term only.
With this, it is possible to determine the mass difference of neutrinos. It is found
that |m12 | = 0.0087 eV and |m23 | = 0.049 eV. As only the squares can be determined,
it is so far no possible to establish which neutrino is the heaviest, and if one of them is
massless. Still, the mass difference of 0.05 eV indicates that with an increase in sensi-
tivity by a factor of 40 it can be determined in decay experiments whether the electron
neutrino is the heaviest. If it is, the mass hierarchy is called inverse, as the first flavor,
corresponding to the electron, is lightest. Otherwise, it would be called a normal hierar-
chy. As a side remark, these tiny mass differences imply that the oscillation lengths L are
typically macroscopically, and of the order of several hundred kilometers and more for one
oscillation.
Aside with these mass differences, it was also possible to determine some of the elements
of the CKM matrix in the lepton sector, which is called Pontecorvo-Maki-Nakagawa-
Sakata, or PMNS, matrix. In terms of the parametrization (6.16), the values of the known
three angels are
sin2 12 = 0.31(2)
sin2 23 = 0.4(1)
sin2 13 = 0.025(4)
while the value of 13 was not yet measured, and it is thus not clear whether there is also
CP violation in the lepton sector. However, the values of the other angles imply that with
the sensitivity of current experiments, it should be possible to get a reasonable estimate
of 13 until the mid 2020s.
Chapter 6. Weak interactions 109
The values of the three real angles are quite astonishing. They imply that, irrespective
of the value of 13 , the PMNS matrix is not strongly diagonal dominant. Thus, the lepton
flavors mix much stronger than the corresponding quark flavors, and only the very small
values of the neutrino masses reduce the oscillation probabilities so strongly that the effect
is essentially not seen for electrons, muons, and tauons, and even requires the very weakly
interacting neutrinos, which can move over long distances, to observe it at all.
Again, at the current time there is no understanding for why this is the case, nor why
the quark sector and the lepton sector are so extremely different also in this respect, and
not only for the hierarchy of masses.
with = (+ )+ , i. e., its hermitiean conjugate. The fields + and carry integer
electric charge plus and minus, while the fields and are neutral. Since a non-vanishing
value of f leaves only a U(1) symmetry manifest, and are the would-be Goldstone
bosons. This leads to the properties of the Higgs fields and vector bosons as discussed
previously.
110 6.10. The weak interactions as a gauge theory
where i counts the generations, l are the leptons e, and , the corresponding neutrinos
e , , , u the up-type quarks u, c, t, and d the down-type quarks d, s, b. Correspondingly
exist the right-handed singlet fields liR , iR , uR R
i , and di . Using this basis the Yukawa
interaction part reads
LY = LLi Glr R r L r R r L ur R r L dr R r
ij lj + Li Gij j Qi Gij uj + Qi Gij dj + h.c..
1 1 1 1
Mijl = Glij f Mij = Gij f Miju = Guij f Mijd = Gdij f.
2 2 2 2
It is then possible to transform the fermion fields8 f into eigenstates of these mass-matrices,
and thus mass eigenstates, by a unitary transformation
fif L = Uik
fL L
fk (6.18)
fif R = Uik
fR R
fk ,
for left-handed and right-handed fermions respectively, and f numbers the fermion species
l, , u, and d and i the generation. The fermion masses are therefore
1 X fL f
mf i = Uik Gkm (U f R )+
mi f.
2 km
In this basis the fermions are no longer charge eigenstates of the weak interaction, and
thus the matrices U correspond to the CKM matrices. In fact, in neutral interactions
which are not changing flavors always combinations of type U f L (U f L )+ appear, and thus
they are not affected. For flavor-changing (non-neutral) currents the matrices
V q = U uL (U dL )+ (6.19)
V l = U L (U lL )+
8
The vacuum expectation value f of the Higgs field, carrying no indices, should not be confused with
the fermion fields fif , which carry various indices, f denoting the fermion class.
Chapter 6. Weak interactions 111
remain, providing the flavor mixing. Finally, the electric charge is given by
e= 4 = g sin W = g cos W ,
Putting everything together, the lengthy Lagrangian for the electroweak standard
model emerges:
3
where a sum over fermion species r is understood and IW is the corresponding weak isospin
quantum number.
112 6.10. The weak interactions as a gauge theory
Note that this Lagrangian is invariant under the infinitesimal gauge transformation
A A + A + ie(W+ W + ) (6.21)
cos W
Z Z + Z ie (W+ W + )
sin W
e
W W + i (W (sin W A cos W Z ) (sin W u cos W Z ) )
sin W
e e
+ Z + i (+ + )
2 sin W cos W 2 sin W
e e
(v + )Z + (+ + + )
2 sin W cos W 2 sin W
sin2 W cos2 W Z
A ie
ie + (f + i)
2 cos W sin W 2 sin W
f L f L A sin W f 1 ie
fi fi ie Qi + Qi 2 Z fiL + Vij fjL
cos W 2 sin W 2 sin W
sin W Z
fif R ieQfi A + fif R . (6.22)
cos W
The index for the left-handed fermion fields counts the isospin directions. The in-
finitesimal gauge functions are determined from the underlying weak isospin i and
hypercharge Y gauge transformations by
1 1
= ( i2 )
g
1 1
A = cos W Y
sin W 3
g g
1 1
Z = cos W 3 + sin W Y .
g g
It is now straightforward to upgrade the Lagrangian of the electroweak sector of the
standard model (6.20) to the full Lagrangian of the standard model by adding the one for
the strong interactions
1 rs a rs a
Ls = Ga G a g fi fi G
4
Ga = Ga Ga + g habc Gb Gc ,
where the generators a and the structure constants habc belong to the gauge group of
the strong interactions, the so-called color group SU(3)9 , and g is the corresponding
coupling constants. Ga are the gauge fields of the gluons, and the fermions now have also
an (implicit) vector structure in the strong-space, making them three-dimensional color
vectors or singlets, for quarks and leptons, respectively.
9
Actually SU(3)/Z3 , similar to the case of the weak isospin group.
Chapter 6. Weak interactions 113
Note that the number of Higgs fields is larger than the minimal number required: Only
two (real) degrees of Higgs are necessary to write down a consistent theory of the weak
interactions. However, in this case only one of the weak gauge bosons would become mas-
sive. Hence, experiments requires four degrees of freedom. This introduces an additional
global symmetry between the four Higgs degrees of freedom, similar to the flavor symme-
tries of the fermions. It is called a custodial symmetry, as in absence of other interactions
it would lead to mass-degenerate W and Z bosons. However, QED breaks this symmetry
explicitly, and leads, as has been shown above, to the slight mass difference between W
and Z bosons.
6.11 Anomalies
As noted in section 4.9, it is possible that a symmetry is broken by the quantization.
Such a breaking is called an anomalous breaking, mainly for the reason that this was
not expected to occur in the beginning. Today, it is clear that such a breaking is deeply
linked into the quantization process, and is in most cases also related to the necessity for
renormalization.
It is now possible to consider an anomaly for a global or a local symmetry. Anomalies
for global symmetries do not pose any inherent problems. Indeed, the axial part UA (1)
of the chiral symmetry described in section 4.12 is broken anomalously in the standard
model. This breaking is directly observable consequences. In that particular case, it leads
to a much larger partial decay width of the 0 into two photons, as it would be without
this anomaly. Also the large mass of the meson, as indicated in section 5.11 can be
shown to originate from the same anomaly.
The situation is much more severe for an anomalous breaking of a local symmetry. If
this occurs, this implies that while the classical theory is gauge-invariant, the quantum
version would not be so. Hence, the quantum version would depend on the gauge chosen
classically, and observables would become gauge-dependent. This would not yield a con-
sistent theory, and would therefore be discarded as a theory of nature. Thus, only theories
without local anomalies are considered for particle physics.
Considering theories of the type the standard model, i. e. a number of gauge fields with
fermions and scalars, it turns out that the possibility of gauge anomalies is tied to the
presence of fermions. Theories with only gauge fields and scalars do not develop anomalies.
Furthermore, it can be shown that only theories with parity violations can be affected by
local anomalies. Finally, anomalies can only occur for certain gauge groups10 , and not for
10
In total, the Lie groups U(1), SU(N > 2), Sp(N ), and O(2 < N < 6) are affected.
114 6.11. Anomalies
arbitrary ones, and only for particular assignments of charges to the fermions. However,
this affected combination of charge assignments and gauge groups includes the one of the
standard model. Thus, in principle, the standard model would be anomalous.
Fortunately, there is an escape route left. Though indeed such anomalies occur, it is
possible that if there are more than one anomaly, the consequences of these anomalies
cancel each other. If such an anomaly cancellation occurs, the theory becomes anomaly-
free, and is again well-defined. Given the gauge groups and charge assignments of the
standard model, the condition that all anomalies are canceled and the standard model is
consistent is
X 2 1 Nc
Qf = Ng (0 1) + Nc = Ng 1 + = 0,
f
3 3 3
where the sum is over all fermion flavors, quarks and leptons alike, ng is the number of
generations, Qf is the electric charge of each flavor f , and Nc is the number of colors. As
can be seen, the anomalies indeed cancel, and they do so for each generation individually.
However, the cancellation requires a particular ratio of the quark electric charges and the
lepton electric charges as well as the number of colors. Since the anomalies originate from
the parity-violating interaction, all three forces are involved in guaranteeing the anomaly-
freedom of the standard model.
This fact has led to many speculations. The most favored one is that this indicates an
underlying structure, which provides a common origin to all three force of the standard
model. Such theories are known as grand-unifying theories (GUT). Such theories have
one underlying gauge group, which would be spontaneously broken at high energies, and
therefore appear like three separate forces at low energies. This reasoning is supported by
the fact that the running gauge couplings of all three forces all approach a similar size at
about 1015 GeV, the GUT scale, which suggests a common origin at this energy scale.
A second explanation is that only a theory like the standard model, which such a fine
balance, has all the features necessary to permit sentient life to arise to recognize this
feature. In a universe with different laws of nature, where this kind of balance is not
met nobody would be there to observe it. This is called the anthropomorphic principle.
Though it cannot be dispelled easily, it is not as compelling an argument, as it does not
explain why such a universe should exist at all. This problem is often circumvented by
the requirement that actually many universes with all kinds of laws of nature exist, and
we just happen to be in one where we could exist.
The third possibility is, of course, that all of this is just coincidence, and nature is just
this way.
At any rate, future experiments will decided, which of these three options is realized. In
Chapter 6. Weak interactions 115
fact, experiments have already ruled out the simplest candidates for grand-unified theories,
though many remain.
couplings would be taken, the Higgs mass was would usually end up at masses orders of
magnitudes larger than the W and Z mass. Why this is not the case is called the hierarchy
problem: Why is there no large hierarchy between the electroweak scale and the Higgs
mass scale? This is sometimes rephrased as the naturalness problem: Without any prior
information, it would naively be expected that all parameters in a theory are of the same
order of magnitude. This is not the case in the standard model. Why do the values deviate
so strongly from their natural value?
There is another problem connected with these questions. If only the Higgs sector is
regarded, the theory is called a 4 theory, as it has the same structure as the Lagrangian
(5.5), except that there are four scalar fields. It is found that in such a theory quantum
effects always drive the self-interaction term in the potential to zero, i. e. the quantum
theory is non-interacting, even if the classical theory is interacting. Theories in which
this occurs are called trivial. Such a triviality can be removed, if the theory is equipped
with an intrinsic energy cutoff, which introduces an additional parameter, but restricts the
validity of the theory to below a certain energy level. It is then just a low-energy-effective
theory of some underlying theory.
It is not clear whether this triviality problem persists when the Higgs sector is coupled
to the rest of the standard model. If it does, the necessary cut-off for the standard model
will again depend sensitively on the mass of the Higgs boson. Practically, this is not a
serious issue, as with the present mass of the Higgs of roughly 125 GeV this triviality
cut-off can be easily as large as 1019 GeV, far beyond the current reach of experiments
and observations. Still, it remains as a doubt on how fundamental the standard model,
and especially the Higgs particle, really is, in accordance with the problems introduced by
the necessity of renormalization as described in section 3.7.
early evolution of the universe the quantitative level of the asymmetry between matter
and anti-matter. Thus, the current belief is that so far undiscovered physics is responsible
for the missing (large) amount.
It is nonetheless instructive to understand how baryon number violation comes about
in the standard model. Lepton number violation proceeds in the same way, but is even
more suppressed, due to the much smaller masses.
The basic ingredient is once more a classical field configuration, this time of Yang-Mills
theory. Define the field strength tensors F = a Fa
, with a the Pauli matrices - here the
weak interactions with the gauge group SU(2) is the interesting one. To proceed further,
it is useful to make the formal replacement it t, which can be undone at the end. This
is an analytic continuation from Minkowski space-time to Euclidean space-time, as now
all components of the metric have the same sign.
The Jacobi identity (5.13) together with the non-Abelian version of the homogeneous
Maxwell equation of QED,
F + F + F = 0,
D F + D F + D F = 0,
D F = Dij jk
a a
F .
D F = 0, (6.23)
F = F , (6.24)
as these convert the equation (6.23) into the trivial Bianchi identity. The self-duality
equations (6.24) have the advantage of being only first-order differential equations instead
of second-order differential equations, and are therefore easier to solve.
118 6.13. Baryon-number violation
Furthermore, classical solutions have to have a finite amount of energy, and therefore
their behavior at large distances is constrained. Especially, since the Lagrangian can be
written as the sum of the squares of the electric and magnetic field strength, both these
fields must vanish. This can only occur if the potential becomes at large distances gauge-
equivalent to the vacuum, i. e. it has the form
Aa a = A = ig(x) g 1(x),
where g = g a a is an arbitrary function. Since this is the second part of the gauge
transformation (5.17) in matrix notation, this is just a version of the vacuum A = 0.
Since all choices are gauge-equivalent, any choice will do. One possibility which turns out
to be technically convenient is
x
g(x) =
|x|
= (1, i a ). (6.25)
The simplest extension of this is a multiplication with a function f (x2 ) which becomes 1
at large distances,
x
Aa a = if (x2 )g(x) g 1(x) = 2f (x2 )
x2
1
= ( )
4i
= (1, i a ).
The matrices are called t Hooft symbols. Thus, this ansatz mixes non-trivially the
weak isospin and space-time.
Plugging this in into the self-duality equation (6.24) yields a first-order differential
equation for f (x2 ),
df
x2 f (1 f )) = 0.
dx
The solution to this equation, which can be obtained by separation of variables, is
x2
f (x2 ) = ,
x2 + 2
where is an integration constant. The function indeed goes to one at large distances, as
required. The structure described by this field configuration is now localized in space-time
at the origin, and extended over a range of size . Such a localized event in space and time
is called an instanton. Solving with the equation with the other sign in the self-duality
equation (6.24) yields a similar result, though with some small differences, and is called
an anti-instanton.
Chapter 6. Weak interactions 119
Going back to Minkowski space-time, the field configuration will have a singularity
at x2 = 2 . This is called a sphaleron, and hence a violent event in space-time. Note
that the gauge-field strength does not explicitly appear in the calculation. This result
can therefore not be obtained perturbatively, and the presence of instantons is a non-
perturbative effect.
Seeing now that these indeed create baryon number violation is unfortunately tech-
nically very complicated, so here only the most important steps will be sketched. One
highly non-trivial, but very fundamental, insight needed is that instantons turn out to be
very much connected with the anomalies of section 6.11. In fact, any instanton induces
via the global anomaly interactions between fermions. Especially, it can connect fermions
of different types, as long as all are charged and all are affected by the anomaly. Espe-
cially, this yields that instantons effectively create an interaction which involves, besides
the gauge fields also three quarks and one lepton, and thus permits baryon number viola-
tion. However, this integration still conserves fermion number, and as a consequence the
change in the baryon number must be offset by the same change in lepton number. Still,
this implies that reducing the baryon number by one can be offset by a change in lepton
number by one, which implies that a proton can be converted, with the involvement of
gauge fields, into a positron. Hence, baryon (and lepton number) are not conserved in the
standard model.
The effect is, however, small, and suppressed exponentially by exp(c/W ), where c
is a number, and W is the weak isospin fine-structure constant. Since the latter is small,
the suppression is huge, and the life-time of the proton in the standard model exceeds the
current upper experimental upper limit for the decay in any channel of 2.1 1028 years by
many orders of magnitude. Hence, the baryon number violation in the standard model is
not able to explain the fact that so much more baryons than anti-baryons exist. However,
this is a statement about the current state of the universe, and especially its temperature,
and it may change in earlier times. Especially, it can be shown that the effect becomes
exponentially enhanced with temperature. Still, the temepratures necessary to make this a
sucfficiently effective process have been available for too short times in the early universe.
large energies, which is a quantitative effect as all masses become negligible. However,
there is not necessarily a phase transition associated with the melting of the condensate.
In fact, it is in general not a symmetry restoration, as in case of a global symmetry to
be discussed here as well. This is due to the fact that the symmetry is just hidden, not
broken. As such, there is no local gauge-invariant order parameter associated with it.
Only gauge-dependent order parameters can be local, but in their case the temperature
where the symmetry becomes manifest once more is in general gauge-dependent. Only a
transition which would be indicated by a non-local gauge-invariant order parameter could
in principle mark a true phase transition.
Again, it is quite useful to first study the case of a global symmetry, then of an Abelian
local symmetry before going to the electroweak theory.
1 1 1 1
L = (6f 2 2 ) 2 + (2f 2 2 )2
2 2 2 2
1
2f ( 2 + 2 ) ( 2 + 2 )2 2 f 2 + f 4 . (6.26)
4
In this case the explicit zero-energy contribution is kept for reasons that will become
apparent shortly, but will be essentially the same as when treating non-relativistic Bose-
Einstein condensation. Only terms linear in the fields have been dropped, as they will not
contribute in the following. The situation is similar as before, but now the condensate f
has not been specified by the minimization of the classical potential, but is kept as a free
quantity, which will take its value dynamically.
To investigate the thermodynamic behavior it is useful to analyze the thermodynamic
potential in analogy to the non-relativistic case as
Z
(T, f ) = P (T, f ) = T ln ,
V
where P is the pressure, T the temperature, and V the volume. Z is the partition sum.
For the following purposes, it is sufficient to use the so-called mean-field approximation. In
this case, the interaction terms are first expanded around f , and higher-order terms which
involve more than two fields are neglected. In the Lagrangian (6.26), this has already been
done.
Without going into the technical details, the thermodynamic potential can be evaluated
Chapter 6. Weak interactions 121
directly. It reads
(T, f ) = 2 f 2 + f 4 (6.27)
3
2 2
dp 1 + 2
Z
1 2
+ 3
+ T ln 1 e T + ln 1 e T
(2) 2
p q
1 = 6f + p = m2 + p2
2 2 2
p q
2 = 2f 2 2 + p2 = m2 + p2 .
The frequencies consists of the momenta and the masses of the particles after hiding
the symmetry, which is dependent on the value of the condensate f . There are three
contributions. The first outside the integral is the classical contribution. The second
are the first two terms inside the integral. They are the contributions from quantum
fluctuations. The third term represents thermal fluctuations.
To recover the results from section 5.7, the second term must be neglected and the
zero-temperature limit taken. This yields
(0, f ) = 2 f 2 + f 4 .
As in section 5.7, this potential has a minimum at non-zero f , f = 2 /(2). Inserting this
into the Lagrangian (6.26) makes it equivalent to (5.8), with a massive Higgs boson and a
massless Goldstone boson.
Something new happens at finite temperature. At small temperature, little other
happens than that it is possible to excite Higgs bosons or Goldstone bosons, which then
form a thermal bath of non-interacting bosons, and the total pressure is just the sum of
their respective pressures. However, the value of f will become temperature-dependent:
At each temperature it will take the value which minimizes the thermodynamic potential.
When going to higher temperatures, it is useful to make a high-temperature expansion
for the thermodynamic potential. High temperature requires here T to be larger than the
scale of the zero-temperature case, which is given by the condensate, which is of order
/ . In this case, it is possible to obtain an expansion for . The leading terms up to
order O(1) are given by
2 2 T 2
4 1 2
(T, f ) = f + T f 2 T 4
2
. (6.28)
3 45 12
Note that at very large temperatures only the term 2 T 4 /45 is relevant, which is pre-
cisely the one of a free non-interacting gas of two boson species, a Stefan-Boltzmann-like
behavior.
122 6.14. The early universe
This results exhibits one interesting feature. The term of order f 2 has a temperature-
dependent coefficient, which changes sign at13 Tc2 = 32 /. As a consequence, the shape of
the thermodynamic potential as a function of f changes. Below Tc , it has a minimum away
from zero, as at zero temperature. With increasing temperature, this minimum moves to
smaller and smaller values, and arrives at zero at Tc . Hence, at Tc , the value of f changes
from a non-zero to a zero value, and the symmetry becomes manifest once more. Above
Tc , the minimum stays at zero, and for all higher temperatures the symmetry is manifest.
Replacing f with its temperature-dependent value in (6.28) yields the expressions
2 2
2
T =T 2 2
T <Tc = 12T 45 + 36
T4 = c 2
5
2 2
4 2 2 2 T =T
T >Tc = 4 45 T 4 12T = c
2,
5
which coincide at Tc . Also their first derivatives with respect to the temperature are equal
at Tc
r
2
dT <Tc T T =T 8 + 5 2
= (8 2 T 2 + 10T 2 152) 90 c
=
dT 300
r
dT >Tc T T =Tc 8 2 + 5 2
= (8 2 T 2 + 152) 90 = ,
dT 300
but their second derivatives are not
d2 T <Tc 2 2 T 2 T =Tc (25 + 24 2 ) 2
= 6
(4 + 5) 15
=
dT 2 30
2
d T >Tc 8 2 T 2 +52 T =Tc (5 + 24 2 ) 2
= 30
= .
dT 2 30
Thus, a phase transition of second order occurs at Tc .
As stressed previously repeatedly, it is possible that quantum effects could modify the
pattern considerably or even melt the condensate. It is therefore instructive to investigate
the leading quantum corrections to the previous discussion.
This is also necessary for another reason. If the symmetry becomes manifest once more
at large temperatures, the mass of Higgs-like excitations become tachyonic, indicating a
flaw of the theory. That can be seen directly by reading off the condensate-dependent
masses of the excitations,
m2 = 6f 2 2 = 2 (T Tc ) + (22 T 2 )(Tc T )
T 2
m2 = 2f 2 2 = 2 (T Tc ) (Tc T ).
3
13
Note that strictly speaking using the high-temperature expansion at this temperature is doubtful. For
the purpose here it will be kept since it makes the mechanisms more evident then the rather technical
calculations necessary beyond the high-temperature expansion. The qualitative outcome, however, is not
altered, at least within the first few orders of perturbation theory.
Chapter 6. Weak interactions 123
Furthermore, also the Goldstone theorem is violated, as the mass of the Goldstone boson
is no longer zero14 . Both problems are fixed by quantum corrections, demonstrating the
importance of quantum fluctuations even in the high-temperature phase.
In the expression for the free energy (6.27) the zero-point energy, and thus the quantum
fluctuations have been neglected. Including them yields the result
3
2 2 T 2 (m3 + m3 )T 4 8 2 T 2 e2+ 2
(T, f ) = T 4 + ln
45 12 12 32 2 2
8 2 T 2 e2+1 T 2
2 f 2 1 + + 2 ln
4 2 32
8 2 T 2 e2+1
5
+f 4 1 + + 2 ln .
8 2
Herein is the Euler number. The term is a quantum correction, which can be fixed, e.
g. by measuring Tc or by other experimental input, as can be seen from the new expression
for Tc where f vanishes,
32 24 2 e2+1
2
Tc = 1+ + 2 ln .
4
Here, only the qualitative result is interesting, and its precise numerical value of no im-
portance.
To obtain the corrections for the masses, it is necessary to calculate the corresponding
quantum corrections. Without going into the details, the complete masses to this order
are
T 2 32
2 2 1 2
m = 2 1 (Tc T ) + T (T Tc )
32 3
32
2 2
m = T (T Tc ).
3
These results yield a number of interesting observation. First, since Tc is larger15 than
32 /, the mass of the Higgs is always positive, stabilizing the system. Secondly, in this
case the mass of the Goldstone boson is always zero below the phase transition tempera-
ture, in agreement with the Goldstone theorem. Above the phase transition, the masses of
both particle degenerate, and the symmetry is manifest once more also in the spectrum.
14
In a full quantum treatment, the role of the Goldstone boson could be played at finite temperature
by some composite excitation instead. However, at the mean-field level no such excitations are available,
and thus the Goldstone theorem is violated.
15
It is not obvious that cannot be negative and large, thus making the improved estimate for Tc
smaller than before. However, it turns out not to be the case at this order for any renormalization
prescription.
124 6.14. The early universe
These properties are generic for symmetries hiding by a condensate which thaws with in-
creasing temperature. Also that the mean-field approximation is in general insufficient is
a lesson which should be kept duly in mind. Of course, at the present time much more
sophisticated methods are available to treat this problem, though they are in general very
complicated.
The temperature dependence of the condensate f and the pressure then read, respectively,
2 T2
2
f = 1 2 (Tc T )
Tc
2
4272 T 4 2 T2 2 2 2 4
360
+ 4 1 T 2 + f6 T 4 , T Tc
P = 2T 4
c
2T 2 4
.
427
360
+ 6
, 4
T T c
The corresponding phase transition is at mean-field level thus of second order, as the
second derivative of the pressure exhibits a discontinuity. For a Higgs mass of 100 GeV
the critical temperature is about 200 GeV. For the actual value of the Higgs mass of 126
GeV, it is only slightly higher. Thus the transition temperature is of the same order as
the Higgs condensate, and about three orders of magnitude larger than the corresponding
temperature in QCD.
Note that all problems with consistency of the mean-field approach pertain also to the
full electroweak standard model. Therefore, for a consistent treatment at least leading or-
der corrections have to be included. As previously, they do not change the phase transition
temperature, but may change its order.
126 6.14. The early universe
The relevance of such a temperature is only given in the early universe, or perhaps
during a collapse to a black hole. Here, the more certain case of the early universe will be
treated.
To assess the relevance for the early universe it is necessary to add equations which
describe its development. For the present purpose simplified versions of the Einstein
equations are sufficient. Adding energy conservation gives
2
dR 8G 2
= R (6.29)
dt 3
d(R3 )
= 3P R2 , (6.30)
dR
where G is Newtons constant, is the energy density, R is the scale factor, essentially
given by the Ricci curvature scalar, t the proper time, and P is the pressure. To close the
equations, an equation of state is necessary, which is given as a function of the pressure
by thermodynamic relations as
P
= P + T .
T
Since the sicknesses of the mean-field approximation are not too problematic for this
estimate, it is sufficient to use it for obtaining the corresponding energy density as
( 2 2
1281 2 4 32
360
+ 4T 4 T + 1 T 2 6T ,
4
T Tc
= 2
c
2T 2 4
c .
1281 4
360
T + 6
+ 4
, T Tc
Rewriting equation (6.30) in terms of the temperature yields the ordinary differential
equation
d dR
R +3 = 3P.
dT dT
Imposing as a boundary condition that R should be one at the phase transition yields
(
Tc Tc2 b2
3 T T 2 b2
, T Tc
R = Tc Tc2 +a2
T T 2 +a2
, T Tc
302
a2 =
427 2
a2 Tc2 (1 r)
b2 =
a2 + rTc2
4
r = 2 2 .
6 + 9g4 + 3g4
For a Higgs mass about 100 GeV the characteristic parameters are r = 0.22, a = 6 GeV and
b = 10 GeV. Thus, the dominant behavior is that R behaves like Tc3 /T 3 , up to some small
Chapter 6. Weak interactions 127
modifications close to the phase transition, and thus drops essentially in the electroweak
domain.
An interesting consequence is obtained if R3 is multiplied by the entropy
P
s= T 3.
T
An elementary calculations yields thus that sR3 is constant. Since s is in units inverse
length cubed, this is just the statement that entropy is conserved since R only describes
the expansion of a unit length over time. Hence, the electroweak interactions at mean-
field level conserve entropy. In particular, this is a consequence of the second order phase
transition, which is not permitting latent heat or supercooling. Finally, inserting the
numbers shows that R increases somewhat slower around the phase transition. Thus, the
expansion of the universe slows down during the electroweak phase transition.
Of course, all of this is just an estimate. That it can never be fully correct is seen by
the fact that R diverges at the finite temperature T = b, much above the QCD phase tran-
sition (and nowadays) temperature. This is an artifact of the high-temperature expansion
involved. To obtain the correct behavior down to the QCD phase transition would require
more detailed calculations.
Chapter 7
At the end of 2009 the largest particle physics experiment so far has been started, the
LHC at CERN. With proton-proton collisions at a center-of-mass energy of up to 14 TeV,
possibly upgraded to 33 TeV by the end of the 2020ies, there are two major objectives.
One was to complete the current picture of the standard model of particle physics by
finding the Higgs boson. This has been accomplished, even though its properties remain
still to be established with satisfactory precision, to make sure that it is indeed the Higgs
boson predicted by the standard model.
The second objective is to search for new physics beyond the standard model. For var-
ious reasons it is believed that there will be new phenomena appearing in particle physics
at a scale of 1 TeV, and thus within reach of the LHC. Though this is not guaranteed,
there is motivation for it.
In fact, there are a number of reasons to believe that there exists physics beyond the
standard model. These reasons can be categorized as being from within the standard
model, by the existence of gravity, and by observations which do not fit into the standard
model. Essentially all of the latter category are from astronomy, and there are currently
essentially no observations in terrestrial experiments which are reproducible and do not
satisfactorily agree with the standard model, and none which disagree with any reasonable
statistical accuracy.
Of course, it should always be kept in mind that the standard model has never been
completely solved. Though what has been solved, in particular using perturbation theory,
agrees excellently with measurements, it is a highly non-linear theory. It cannot a-priori
be excluded that some of the reasons to be listed here are actually completely within the
standard model, once it is possible to solve it exactly.
Many of the observations to be listed can be explained easily, but not necessarily, by
new physics at a scale of 1 TeV. However, it cannot be exclude that there is no new
128
Chapter 7. Beyond the standard model 129
phenomena necessary for any of them up to a scale of 1015 GeV, or possibly up to the
Planck scale of 1019 GeV, in which case the energy domain between the standard model
and this scale is known as the great dessert.
model, is an open question. However, since this problem can be postponed to energy
scales as high as 1015 GeV, or possibly even higher, in particular for a Higgs as light as
the observed one, this question is not necessarily of practical relevance.
Finally, when extrapolating the running gauge couplings for a not-to-massive Higgs to
an energy scale of about 1015 GeV, their values almost meet, suggesting that at this scale
a kind of unification would be possible. However, they do not meet exactly, and this is
somewhat puzzling as well. Why should this seem to appear almost, but not perfectly so?
7.2 Gravity
7.2.1 Problems with quantization
One obviously, and suspiciously, lacking element of the standard model is gravity. Up to
now, no fully consistent quantization of gravity has been obtained. Usually the problem is
that a canonical quantized theory of gravity is not perturbatively renormalizable. This is
visible when writing down the Einstein-Hilbert-Lagrangian of gravity, based on the metric
g , LEH ,
1 1
LEH = R + LM (7.1)
2
= g
R = +
1
= g ( g + g g )
2
From this, it can be inferred that the coupling constant involved, or equivalently New-
tons constant, is dimensionful, as is the cosmological constant . Any matter is described
by the matter Lagrangian LM . This also yields the Euler-Lagrange equations of gravity,
the Einstein equations
1
R g R + g = T (7.2)
2
LM
T = LM + 2 (g = ) (7.3)
g
Superficial (perturbative) calculations for (7.1) show that the theory is perturbatively non-
renormalizable. As a consequence, an infinite hierarchy of independent parameters, all to
be fixed by experiment, would be necessary to perform perturbative calculations, spoiling
any predictivity of the theory. In pure gravity, these problems occur at NNLO, for matter
coupled to gravity already at the leading order of radiative corrections. In particular, this
132 7.2. Gravity
implies that the theory is perturbatively not reliable beyond the scale . Though this
may be an artifact of perturbation theory, this has led to developments like supergravity
based on local supersymmetry or loop quantum gravity.
Irrespective of the details, the lack of gravity is an obvious flaw of the standard model.
Along with this lack comes also a riddle. The natural quantum scale of gravity is given
by the Planck scale
1
MP = 1.22 1019 GeV.
GN
This is 17 orders of magnitude larger than the natural scale of the electroweak interactions,
and 19 orders of magnitude larger than the one of QCD. The origin of this mismatch is
yet unsolved.
One of the more popular proposals how to explain this, discussed in section ??, is that
this is only an apparent mismatch: The scales of gravity and the standard model are the
same, but gravity is able to propagate also in additional dimensions not accessible by the
standard model. The mismatch comes from the ratio of the total volumes, the bulk, and
the apparent four dimensional volume, which is thus only a boundary, a so-called brane.
performed to at least obtain a better experimental value, while the theoretical calculations
are continously improved.
The next is that two experiments have in agreement reported results that cannot be
explained with just the three ordinary neutrinos. However, these results are rather indirect,
and at least another independent experiment will be required for confirmation.
All of these observations are currently investigated further. Given the amount of
statistics needed, it may take quite some time for a final answer about the reality of
any of these deviations.
holes, as the necessary density of such objects would turn up in a cloaking of extragalactic
light and of light from globular clusters. This matter is therefore not made out of any
conventional objects, in particular, it is non-baryonic. Furthermore, it is gravitational
but not electromagnetically active. It also shows different fluid dynamics (observed in the
case of colliding galaxies) as ordinary luminous matter. Also, the dark matter cannot be
strongly interacting, as it otherwise would turn up as bound in nuclei.
Thus this matter has to have particular properties. The only particle in the standard
model which could have provided it would have been a massive neutrino. However, though
the neutrinos do have mass, the upper limits on their mass is so low, and the flux of cosmic
neutrinos too small, to make up even a considerable fraction of the dark matter. This can
be seen by a simple estimate. If the neutrinos have mass and would fill the galaxy up to
the maximum possible by Fermi-statistics, their density would be
p3F
n = (7.4)
2
with the Fermi momentum pF in the non-relativistic case given by m v . Since neutrinos
have to be bound gravitationally to the galaxy, their speed is linked via the Virial theorem
to their potential energy
GN Mgalaxy
v2 = , (7.5)
R
with Newtons constant GN and R the radius of the galaxy. Putting in the known numbers,
and using furthermore that the observational results imply that n , the total number of
neutrinos approximated to be inside a sphere of the size of the galaxy, must give a total
mass larger than the one of the galaxy leads to the bound
14 12
0.001c 1 kpc
m > 100eV ,
3v R
yielding even for a neutrino at the speed of light a lower bound for the mass of about 3
eV, which is excluded by direct measurements in tritium decays.
Therefore, a different type of particles is necessary to fill this gap. In fact, many can-
didate theories for physics beyond the standard model offer candidates for such particles.
But none has been detected so far, despite several dedicated experimental searches for
dark matter. These experiments are enormously complicated by the problem of distin-
guishing the signal from background, in particular natural radioactivity and cosmic rays.
The source of this matter stays therefore mysterious.
But not only the existence of dark matter, also its properties are surprising. The
observations are best explained by dark matter which is in thermal equilibrium. But how
this should be achieved if it is really so weakly interacting is unclear.
136 7.4. Astronomical observations
On the other hand, the idea of gravitational bound dark matter is also problematic. In
particular, there is no reason why it should neither form celestial dark bodies, which should
be observable by passing in front of luminous matter by gravitational lensing, or why it
should not be bound partly in the planets of our solar system. Only if its temperature
is so high that binding is prohibited this would be in agreement, but then the question
remains why it is so hot, and what is the origin of the enormous amount of energy stored
in the dark matter.
It should be noted that there are also attempts to explain these observations by a de-
parture of gravity from its classical behavior also at long distances. Though parametriza-
tions exist of such a modification which are compatible with observational data, no clean
explanation or necessity for such a modification in classical general relativity has been
established. This proposal is also challenged by recent observations of colliding galaxies
which show that the center-of-mass of the total matter and the center of luminous matter
move differently, which precludes any simple modification of the laws of gravity, and is
much more in-line with the existence of dark matter.
Another alternative is that the problem of asymptotic infrared safeness of quantum
gravity may be related to the apparent existence of dark matter.
7.4.3 Inflation
A second problem is the apparent smoothness of the universe around us, while having
at the same time small highly non-smooth patches, like galaxies, clusters, super clusters,
walls and voids. In the standard model of cosmological evolution this can only be obtained
by a rapid phase of expansion (by a factor e60 ) of the early universe, at temperatures
much larger than the standard model scale, but much less than the gravity scale. None of
the standard model physics can explain this, nor act as an agitator for it. In particular, it
is also very complicated to find a model which at the same time explains the appearance
of inflation and also its end.
However, the predictions of inflation have been very well confirmed by the investigation
of the cosmic microwave background radiation, including non-trivial features and up to
rather high precision.
background radiation, in particular the position of the quadrupole moment1 , but also that
the large-scale structure in the universe could not have been formed in the observed way
otherwise. For a universe, which is governed by Einsteins equation of general relativity,
this can only occur if there is a certain amount of energy inside it. Even including the
unknown dark matter, the amount of registered mass can provide at best about 30% of
the required amount to be in agreement with this observation. The other contribution,
amounting to about 70%, of what origin it may ever be, is called dark energy.
A second part of the puzzle is that the cosmic expansion is found to be accelerating.
This is found from distant supernova data, which are only consistent if the universe ex-
pands accelerated today. In particular, other explanations are very hard to reconcile with
the data, as it behaves non-monotonous with distance, in contrast to any kind of light-
screening from any known physical process. Furthermore, the large-scale structures of the
universe indicate this expansion, but also that the universe would be too young (about
10.000.000.000 years) for its oldest stars (about 12-13.000.000.000 years) if this would not
be the case. For such a flat universe such an acceleration within the framework of gen-
eral relativity requires a non-zero cosmological constant , which appears in the Einstein
equations (7.2). This constant could also provide the remaining 70% of the mass to close
the universe, and is in fact a vacuum energy. Such a constant is covariantly conserved,
since both T and the first two terms together are independently in general relativity,
and thus indeed constant. However, the known (quantum) effects contributing to such a
constant provide a much too large value for , about 1040 times too large. These include
quantities like the chiral and gluon condensates2 . These are of order GeV, and in addition
would have the wrong sign. What leads to the necessary enormous suppression is unclear.
After the big-bang, the produced very hot and dense matter was formed essentially from
a system of rapidly decaying and recombining particles. When the system cooled down, the
stable bound states remained in this process, leading first to stable nucleons and leptons in
the baryogenesis, and afterwards to stable nuclei and atoms in the nucleosynthesis. Only
over this time matter could have become dominant over antimatter, leading to the stable
universe observed today. But the electroweak effects would not have been strong enough
for the available time to produce the almost perfect asymmetry of matter vs. antimatter
observed today, by a factor of about 1019 . Thus, a further mechanism must exist which
provides matter dominance today.
There is a profound connection to inflation. It can be shown that inflation would
not have been efficient enough, if the number of baryons would have been conserved in
the process. In particular, the almost-baryon-number conserving electroweak interactions
would have permitted only an inflationary growth of e45 instead of e60 .
The possibility that this violation is sufficient to create pockets of matter at least as
large as our horizon, but not on larger scales has been tested, and found to yield only
pockets of matter much smaller than our horizon.
A further obstacle to a standard-model-conform breaking of matter-antimatter symme-
try is the necessity for a first order phase transition. This is required since in a equilibrium
(or almost equilibrium like at a higher-order transition), the equilibration of matter vs.
anti-matter counters the necessary breaking. However, with a Higgs of about 125 GeV
mass, this will not occur within the standard model.