Manuscript M1 15 10 2021
Manuscript M1 15 10 2021
Samuel Wallon
version: 15 October 2021
Contents
I. Relativistic symmetries 1
1. Lorentz group 3
1.1. Space-time interval . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2. Poincaré and Lorentz groups . . . . . . . . . . . . . . . . . . . 6
1.2.1. Characterization of P . . . . . . . . . . . . . . . . . . . 6
1.2.2. Structure of the Lorentz group . . . . . . . . . . . . . . 9
1.2.3. The two types of Lorentz transformations . . . . . . . . 11
1.2.4. Number of parameters . . . . . . . . . . . . . . . . . . . 13
1.3. Covariance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
1.3.1. Contravariant and covariant vectors . . . . . . . . . . . 14
1.3.2. Covariance and contravariance applied to Minkowski space
17
1.3.3. Tensors . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.3.4. A few examples . . . . . . . . . . . . . . . . . . . . . . 22
1
Contents
4. Electrodynamics 51
4.1. Maxwell equations . . . . . . . . . . . . . . . . . . . . . . . . . 51
4.1.1. Local form . . . . . . . . . . . . . . . . . . . . . . . . . 51
4.1.2. Integral form . . . . . . . . . . . . . . . . . . . . . . . . 51
4.2. Covariant formulation of electrodynamics . . . . . . . . . . . . 52
4.2.1. Field strength tensor and its dual . . . . . . . . . . . . . 52
4.2.2. Lorentz transformations . . . . . . . . . . . . . . . . . . 54
4.2.3. Relativistic invariants . . . . . . . . . . . . . . . . . . . 57
4.3. Covariant form of Maxwell’s equations . . . . . . . . . . . . . . 58
4.3.1. Maxwell’s equations . . . . . . . . . . . . . . . . . . . . 58
4.3.2. Four-potential . . . . . . . . . . . . . . . . . . . . . . . 60
4.4. Lagrangian for photons . . . . . . . . . . . . . . . . . . . . . . 63
4.5. Coupling between matter and electromagnetic field . . . . . . . 66
4.5.1. Matter part . . . . . . . . . . . . . . . . . . . . . . . . . 66
4.5.2. Pure photon part . . . . . . . . . . . . . . . . . . . . . . 67
4.5.3. Interaction between matter and photons . . . . . . . . . 67
2
Contents
Appendices 82
A. Elements of group theory 85
A.1. Group . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85
A.1.1. Definitions . . . . . . . . . . . . . . . . . . . . . . . . . 85
A.1.2. Examples of groups . . . . . . . . . . . . . . . . . . . . 86
A.2. Morphisms and subgroups . . . . . . . . . . . . . . . . . . . . . 88
A.2.1. Maps, injections, surjections . . . . . . . . . . . . . . . 88
A.2.2. Morphism . . . . . . . . . . . . . . . . . . . . . . . . . . 89
A.2.3. Subgroup . . . . . . . . . . . . . . . . . . . . . . . . . . 90
3
Part I.
Relativistic symmetries
1
1. Lorentz group
Maxwell equations, which describe electric and magnetic phenomena, are not in-
variant under the Galilean group. Thus, contrarily to what states the galilean
principle, one should be able to reveal an absolute reference system, named
ether. Several experiments led by Michelson and Morley (1881) invalidated
such an hypothesis: speed of light is independent of the direction of propaga-
tion. Time cannot be considered as an absolute variable: simultaneity of two
phenomena in reference frame does not imply simultaneity in another one, in
contrast with classical mechanics.
By definition, the square of the space-time interval between two events is given
by
(∆s)2 = c2 (t2 − t1 )2 − (~x2 − ~x1)2 . (1.1)
Of course in the present case, ~x2 − ~x1 = c(t2 − t1 )~u where ~u is a unitary vector
pointing in the direction of the photon trajectory, and thus (∆s)2 = 0 .
A key assumption, made by Einstein, is the fact that c does not depend on the
inertial frame, therefore (∆s)2 = 0 in every inertial frame in which one can
wish to describe this experiment.
3
1. Lorentz group
Exercise 1.1
Show this (see Landau, Lifshitz, The Classical Theory of Fields, p. 12).
The space of points in space-time with the interval (1.2) is called Minkowski
space1. This is called a pseudo-euclidean geometry, in contradistinction with
usual euclidean geometry.
A point in Minkowski space M is described by its contravariant coordinates
µ
x = (ct, ~x) . This is an example of a quadrivector.
1
Hermann Minkowski introduced this concept of physical space in 1907. Henri Poincaré introduced the same
idea in 1905 from a mathematical point of view, in connexion with the invariance of Maxwell equations
under change of inertial frame.
4
1.1. Space-time interval
ct > 0
(∆s)2 > 0
0
=
s) 2
(∆
(∆s)2 < 0
spatial
components
ct < 0
Exercise 1.2
Show that the interval separating the emission and the detection of a particle
in any process is either time-like or light-like.
or equivalently
5
1. Lorentz group
From the invariance of ds2 under change of inertial frame, we deduce that the
proper time dτ does not depend on the choice of inertial frame: it is a Poincaré
scalar. We will characterize Poincaré transformations in the next section.
The relation (1.9) reveals a key feature. Indeed, suppose that a particle is at
rest in a given inertial frame, named proper frame. This is always possible if
its velocity differs from c, using a pure Lorentz boost along ~v, see next section.
Passing to another arbitrary inertial frame, of non zero velocity with respect to
the proper frame, we will have γ > 1, which implies the famous phenomena of
time dilation: dt > dτ : this means that time runs slower for a particle in any
inertial frame with a non zero velocity with respect to a proper frame.
Throughout the rest of the text, unless otherwise stated, we will use a set of
units in which c = 1 , and thus x0 = t, x1 = x, x2 = y, x3 = z .
1.2.1. Characterization of P
Consider a point x of M, measured in frame K, and x′ the coordinates of the
same point in frame K ′.
The invariance of the infinitesimal interval
implies that
Proof :
6
1.2. Poincaré and Lorentz groups
i.e.
∂x′α ∂x′β µ ν
gαβ dx dx = gµν dxµdxν
∂xµ ∂xν
thus
∂x′α ∂x′β
gαβ = gµν . (1.12)
∂xµ ∂xν
Taking the determinant of each side of this equation, one thus gets
2
∂x′
det g det = det g .
∂x
∂x′
Since g is regular, this implies that det 6= 0 : Poincaré transformations are
∂x
thus invertible.
∂
Acting with on Eq. (1.12), one gets
∂xρ
∂ 2x′α ∂x′β ∂x′α ∂ 2x′β
gαβ + gαβ =0
∂xµ∂xρ ∂xν ∂xµ ∂xν ∂xρ
which symbolically reads
∂ 2x′α ∂x′β
gαβ =0
∂xµ∂xρ ∂xν
∂x′ ∂ 2x′α
and thus (since det 6= 0) : =0
∂x ∂xµ∂xρ
which proves that x′ are linear functions of x .
7
1. Lorentz group
ւ line index
α
Λ β
տ column index
One deduces from Eqs. (1.12) and (1.11) that Λ is a real matrix satisfying
The set L of Lorentz transforms is a group (see Appendix A for a very short
introduction to group theory).
Proof :
• this is obvious from the definition: the product of two such homogeneous
transformations preserving ds2 is an homogeneous transformation which pre-
serves ds2 ; on the hand, the inverse of an homogeneous transformation pre-
serving ds2 is an homogeneous transformation preserving ds2 . And of course
the identity preserves ds2.
Proof :
The first of the two above proofs immediately extends to the case of the inho-
mogeneous Lorentz group (or Poincaré group). The second proof relies on the
group law on P :
8
1.2. Poincaré and Lorentz groups
On should notice that the defining relation (1.16) of L reads, in matrix form,
Λt g Λ = g . (1.18)
Connected component: in a connected component, one can pass from one ele-
ment to another one continuously.
9
1. Lorentz group
first one to the second one. If not, it is multiply connected. In our present
case, doubly connected means that in each of these components, there are two
classes of paths; considering two paths in a given class, one can pass from the
first one to the second one by continuous transformation of the parameters.
L↑+ L↑−
P
Λ00 > 1 Λ00 > 1
det Λ = 1 det Λ = −1
T T
PT
L↓− L↓+
P
Λ00 6 −1 Λ00 6 −1
det Λ = −1 det Λ = 1
• time reversal:
−1
1
T =
1
1
10
1.2. Poincaré and Lorentz groups
Exercise 1.3
Check that the restricted Lorentz group, the complete Lorentz group and the
proper Lorentz group are subgroups of the Lorentz group L .
Compactness:
This is due to the existence of boosts, also called special Lorentz transforms,
which we will examine later, encoded by a rapidity which varies from −∞ to
+∞.
Rotations
The first set of Lorentz transformations is well known: they are the usual
rotations, elements of SO(3), defined as 3-dimensional transformations which
preserve the euclidean scalar product. They are characterized by the angle ψ
of the rotation, and the axis of the rotation, itself defined through a unitary
vector ~n depending on two spherical angles θ, φ. Thus, a rotation is defined by
three real numbers varying in compact sets
using the fact that a rotation around an oriented axis spanned by ~n and of
angle ψ is identical to a rotation of axis −~n and angle 2π − ψ, which allows to
11
1. Lorentz group
restrict ourselves to 0 6 ψ 6 π .
Exercise 1.4
Show that a rotation of axis x1 and angle Ψ in the passive point of view (i.e.
the axes rotates) reads
2′
x = cos ψ x2 + sin ψ x3
(1.21)
x3′ = − sin ψ x2 + cos ψ x3 .
Exercise 1.5
Boosts
Besides, pure Lorentz transformations, also named boosts, affect a single space
direction and time.
Exercise 1.6
Show that restricted Lorentz transforms which leaves invariant x2, x3 reads
0′
x = cosh φ x0 − sinh φ x1
(1.23)
x1′ = − sinh φ x0 + cosh φ x1
where φ ∈] − ∞, ∞[.
Indication: consider a generic 2 × 2 linear transformation mixing x0 and x1,
leaving the space-time interval invariant and preserving the sign of x0, and
solve for the 4 coefficients.
12
1.2. Poincaré and Lorentz groups
in a fixed frame. In the passive point of view, the movement of the origin O′ of
the new frame F ′ with respect to the origin O of the initial frame F is obtained
by setting x1′ = 0, i.e. x1 = tanh φ x0 corresponding to a speed ~v = c tanh φ e~1
of the frame F ′ with respect to the frame F , along the x axis.
Show that a boost in the direction ~n (~n2 = 1), of rapidity φ, i.e. with a speed
~v = c thφ ~n of the frame F ′ with respect to the frame F , reads
0′
x = x0 cosh φ − (~n · ~x) sinh φ
′ (1.30)
~x = ~x − ~n (~x · ~n) − x0 sinh φ + (~n · ~x) cosh φ ~n
Indication: consider how each part of the decomposition ~x = [(~x · ~n)~n] + [~x −
(~x · ~n)~n] transforms under a boost.
13
1. Lorentz group
1.3. Covariance
1.3.1. Contravariant and covariant vectors
In a formal way, a vector belongs to a vector space E (over the field K=R or
C), spanned by a basis. This naturally gives a set of contravariant coordinates
for this vector on this basis. A covector belongs to the dual space E ∗ of linear
forms2 E → K. In the special case of Hilbert spaces, there is a one-to-one
correspondence between E and E ∗, which is extremely convenient in practice.
Let us explain these different steps in detail.
Contravariance
Consider a vector space E (over the field K=R or C), of dimension n. A
′ ′
change of basis from B = {~e1 , · · · ~en } to B ′ = {~e1 , · · · ~en } (conveniently, basis
are denoted as row vectors) is given by
B ′ = BA (1.34)
2
We recall that a linear form is a linear map from a vector space to its field of scalars .
3
If not stated explicitly, we will now use Einstein convention, which states that repeated indexes are summed.
14
1.3. Covariance
which allows the use of a matrix product in the second equality of Eq. (1.36).
In the new basis B ′ , the vector ~v being invariant, one has the two equivalent
expansions
~v = B v[B] = B ′ v[B ′ ] = BA v[BA] (1.38)
and thus
v[BA] = A−1v[B] (1.39)
Such a transformation law is named contravariant, since one has under the
transformation A
B −→ B ′ (1.40)
v[B] ←− v[B ′ ] . (1.41)
Covariance
Consider now the dual space E ∗ , the space of linear forms from E to K.
A natural (thus named canonical) dual basis can be defined as a set of n
linear forms ei , through their action on the chosen basis of E, as
Indeed any coefficient ℓi [B] of this expansion can be easily obtained through
the action of ℓ on a vector basis ~ei :
15
1. Lorentz group
which allows the use of a matrix product in the last equality of Eq. (1.45).
The action of ℓ ∈ E ∗ on an arbitrary vector ~v ∈ E thus reads
ℓ(~v ) = ℓ(~ei)ei (~v) = ℓ(~ei )ei (v j~ej ) = ℓ(~ei )v j ei (~ej ) = ℓ(~ei )v j δji = ℓ(~ei )v i , (1.48)
were we have used the expansion of ~v on the basis B, see Eq. (1.36). Again
using our compact notations (row vectors for forms, column vectors for vectors),
we can write this as
Let us now investigate how the change of basis (1.34) for E translates in the
dual space. We have
Such a transformation law is named covariant, since one has under the trans-
formation A
B −→ B ′ (1.53)
ℓ[B] −→ ℓ[B ′ ] . (1.54)
16
1.3. Covariance
infinite dimension spaces), the Riesz representation theorem ensure that there
is a one to one correspondence between linear forms and vectors:
Coming back to Eq. (1.49), and removing any reference to g, we can now write
Contra-variant components
An arbitrary quadrivector A can be decomposed in such a basis as
A = Aµ eµ . (1.57)
Remark : the eµ are not vectors since they are not invariant when changing the
axis (recall that by definition a vector is an object with d components which is
invariant under changes of axis).
17
1. Lorentz group
Co-variant components
Using the metric g which allows through (1.55) to define a co-vector associated
to the vector A, the co-variant components Aµ of this co-vector reads, according
to Eq. (1.56),
Aµ = A · e µ (1.58)
In general Aµ 6= Aµ . As an example, consider the euclidean plane, see Fig. 1.3,
with a non orthonormal basis.
A2
A2 A
e2
e1 A1 A1
eµ · eν = gµν
with gµν given by (1.3) in the case of Minkowski space. We thus have
Aµ = A · eµ = Aν eν · eµ = gµν Aν
We define g µν as the inverse of gµν : g µν gνρ = δρµ .
In the case of Minkowski space, since gµν is its own inverse, gµν = g µν .
Thus
Aµ = gµν Aν
(1.59)
Aµ = g µν Aν .
This implies that
A · B = Aµ B ν gµν = Aµ Bµ = Aµ B µ . (1.60)
18
1.3. Covariance
Eq. (1.62) means that Λαµ is a tensor of order 2 which transforms as Aα Aµ , see
section 1.3.3. ν β
We thus have Λβν = Λ−1 β = t Λ−1 ν and one can check explicitly that
ν ν
Λβν Λρν = Λ−1 β Λρν = Λρν Λ−1 β = δβρ .
Remark :
19
1. Lorentz group
This is just paste and copy of the relation (1.50) with here A = Λ−1 :
and thus
eµ = Λνµ e′ν and e′µ = Λµν eν
according to the tensor notation introduced for Λ−1. Thus x′µ = x · e′µ =
x · Λµν eν = Λµν xν .
Remark :
The fact that covariant components transform under t Λ−1 , which is the inverse
transpose of the matrix encoding the transformation of contra-variant compo-
nents is due to the invariance of ds2 = dxµdxµ which implies that Λt gΛ = g,
see Eq. (1.18).
Indeed, this also reads t Λ−1 = g Λ g −1 . Starting from v ′ = Λ v for a con-
travariant v, one has
g v ′ = g Λ g −1 g v
|{z} | {z } |{z}
t −1
covariant vector Λ covariant vector
1.3.3. Tensors
Based on the tools introduced above, we can introduce relativistic covariance
in a more general context: physical quantities describing the state of a system
20
1.3. Covariance
at generally speaking not the same for different observers. We can therefore
distinguish:
• scalar quantities, which do not depend on a specific choice of reference
frame. These are defined by scalar functions: given a function φ defined on M,
in a given frame F , the corresponding scalar function in a new frame F ′ is such
that φ′ (x′) = φ(x) in order to leave invariant the associated physical quantity.
• the contravariant components v µ of a vector transform as coordinates (in
the general case of P, translations only affect coordinates) :
′µ
∂x
v ′µ = Λµν v ν = vν .
∂xν
µ ∂A0 ~ ~
µ
∂ Aµ = ∂µ A = +∇·A
∂x0
21
1. Lorentz group
∂2µ
⊔ = ∂µ ∂ = 2 − ∆ .
⊓ (1.70)
∂x0
Summary :
• one can pass from one to another form of a given tensor by raising or
lowering indexes through the metric tensor, as one does for quadri-vectors.
• Lorentz transformation should act by letting its representative matrix act-
ing on the left on a quadri-vector (either on its contravariant or covariant com-
ponents), summing over repeated indexes (one up, one down). For contravariant
components, this is just the matrix Λ itself, and for covariant components, its
transpose inverse.
Proper time
The first trivial example of Lorentz scalar is of course provided by the proper
infinitesimal time dτ of any particle: it is a Lorentz invariant (which equals zero
for a massless particle).
Four-velocity
It is defined has
µ dX µ dt d~x
V = = c , = (γc, γ~v) (1.71)
dτ dτ dτ
which satisfies
V 2 = γ 2(c2 − ~v 2) = c2 .
The above definition makes sense for a massive particle. For a massless particle,
the above discussion fails since dτ = 0 and γ → ∞. Still, in any reference frame,
we have d~x
dt = c~n where ~n is a unit vector pointing in the direction of the motion
of the particle, so that differentiating with respect to t instead of τ leads to
V µ = (c, c ~n) ,
22
1.3. Covariance
Four-momentum
For a massive particle, it is defined as
µ dX µ
µ E
P = mV = m = (γmc, γm~v) = , p~ (1.72)
dτ c
which satisfies
P 2 = m2 c2 .
The above definition makes sense for a massive particle. For a massless particle,
the last equality is still valid, so that for arbitrary mass, we define
E
Pµ = , p~ . (1.73)
c
Note that in any case,
~v ~p
= . (1.74)
c2 E
Four-current
From the usual charge density and the current density, one can define a four-
current as
µ ~
J (x) = cρ(x), j(x) . (1.75)
Introducing the rest charge density ρ0 (i.e. the charge density in a rest frame),
related to the charge density ρ = ρv in a frame moving at velocity ~v with respect
to the proper frame, one easily shows that
J µ (x) = (cρv (x), ρv (x)~v) = ρ0 V µ .
p
with4 ρv = γρ0, with as usual γ = 1/ 1 − v 2 /c2 , which shows that J µ are the
contravariant components of a four-vector (ρ0 is a Lorentz scalar).
Let us consider the special case of the four-current associated to the motion
~
of a charged particle, of charge q, moving along a trajectory X(t). We then
have
~
ρ(ct, ~x) = q δ (3) (~x − X(t)) (1.76)
~
~j(ct, ~x) = q dX δ (3) (~x − X(t))
~ . (1.77)
dt
3
4
Indeed ρ = dd3 xq and d3 xv = γ1 d3 xr when expressing the infinitesimal volume in the boosted frame (“v“) in
terms of the infinitesimal volume in the boosted frame (“r”), thus ρv = γρ0 .
23
1. Lorentz group
as expected.
24
2. The Klein-Gordon equation
In this chapter, our aim is to write a relativistic equation describing a massive
spin 0 particle. Meanwhile, we will see that such a description naturally leads
to states of negative energy, which will turn to be negative probability density
solutions. This will be reinterpreted as antiparticles.
ρ = |ψ|2 (2.5)
has the physical meaning that
d3 P = |ψ|2 d3 V (2.6)
25
2. The Klein-Gordon equation
~2 ∗ ∂ψ
− ψ ∆ψ = i~ ψ ∗ (2.7)
2m ∂t
2
~ ∗ ∂ψ ∗
− ψ∆ψ = −i~ ψ (2.8)
2m ∂t
and by subtraction
~2 ∂ψ
− (ψ ∗ ∆ψ − ψ∆ψ ∗ ) = 2i~ ψ ∗ (2.9)
2m ∂t
so that
∂ρ ∂ψ i~
= 2ψ ∗ = (ψ ∗ ∆ψ − ψ∆ψ ∗ ) (2.10)
∂t ∂t 2m
~ 2,
i.e., since ∆ = ∇
∂ρ i~ ∗ ~ 2 ~ 2 ∗
i~ ~ ∗ ~ ~ ∗
= ψ ∇ ψ − ψ∇ ψ = ∇ · ψ ∇ψ − ψ ∇ψ (2.11)
∂t 2m 2m
which reads
∂ρ ~ ~
+∇·j =0 (2.12)
∂t
with the probability current, also named probability flux density, given by
~j = − i~ ψ ∗ ∇ψ
~ − ψ ∇ψ
~ ∗ . (2.13)
2m
Eq. (2.12) expresses the local conservation of probability. The global point of
view is obtained by integration over a given volume V : the increase of the
probability for particles to be inside V , of boundary S, is given by
ZZZ ZZZ ZZ
∂ 3 ~ · ~j d V = −
3 ~
~j · d2 S
ρd V = − ∇ (2.14)
∂t V V S
where we have used the Green-Ostrogradsky theorem in the last step. Eq. (2.14)
simply states that this increase is equal to the opposite of the flux of particles
out of the volume V , i.e. through S.
26
2.2. The Klein-Gordon equation
ρ = |N |2 , (2.16)
~j = p~ |N |2 = ~v |N |2 . (2.17)
m
Looking for a first order equation, in the spirit of the Schrödinger equation, the
correspondence rule would thus lead to write
2
∂ψ p 2 2 2 ~
i~ = −~ c ∆ + m2 c4 ψ = mc 1 − ∆ + ··· ψ. (2.20)
∂t 2m2c2
This equation is very non-local, and the symmetry between ct and ~x is com-
pletely hidden.
27
2. The Klein-Gordon equation
28
2.3. Klein-Gordon equation and scalar field action
where L(x) is the lagrangian density (usually called lagrangian), which is as-
sumed to depend on a finite number of derivatives of fields4. In practice, in
general L(x) only depends on fields and their first order derivatives, just like
in mechanics, the Lagrange function only depends on generalized coordinates
and their first order time derivative, so that the equations of motion would be
second order equations5 Thus, L(x) has the generic form
The index i labels the various types of fields or, if field are not scalars (this will
not be the case in the present lectures), their components (example: Aµ (x) in
the case of electromagnetism, or Ψ(x), a bispinor field (a collection of 4 complex
numbers) describing a spin 1/2 relativistic particle). Generally speaking, we will
always assume that fields vanish fast enough at infinity, so that boundary terms
can be safely ignored in usual integration by parts. Note that at that stage, the
space-time group of transformation could be classical (galilean) or relativistic
(lorentzian). For simplicity, we will now use covariant relativistic notations.
3
The case of a finite number of degrees of freedom boils down to analytical mechanics. We refer to any lecture
on analytical mechanics, as well as to the famous books of L. Landau, E. Lifchitz, Mechanics, or H. Goldstein,
Classical Mechanics.
4
This is assumed in order to satisfy locality: in order that L(x) would depend on the values of fields at point
y, distinct from x, one should be able to reconstruct the field value φ(y) from φ(x) and its derivatives at
position x, which requires generally speaking an infinite number of derivatives from Taylor exansion...
5
This is due to the Ostrogradsky instability, for theories having equations of motion with more than two time
derivatives (higher-derivative theories). A theorem of Mikhail Ostrogradsky in classical mechanics indeed
states that a non-degenerate Lagrangian dependent on time derivatives higher than the first corresponds to
a Hamiltonian unbounded from below, therefore non physical.
6
In general, Ω is the whole space-time, in which case each x component covers ] − ∞, +∞[, but it is sometime
convenient to keep Ω arbitrary in full generality, in particular in view of the Noether theorem.
29
2. The Klein-Gordon equation
that it should be extremal around the actual values of the fields describing the
physical system. Under the transformation
with the constraint that fields on the boundary ∂Ω of Ω have fixed values, which
means that their variation δφ(x) vanish on ∂Ω, the action varies as
Z
δL(x) δL(x)
δS = d4 x δφi (x) + δ(∂µ φi (x)) , (2.32)
Ω δφi (x) δ∂ µ φi (x)
where we have performed a first order Taylor expansion of L, which is a func-
tional of the fields and their first derivatives. Integrating by parts, we thus
get
Z Z
δL(x) δL(x) δL(x) 3
δS = d4 x − ∂µ δφi (x) + δφi (x) d σµ (2.33)
Ω δφi (x) δ∂µ φi (x) ∂Ω δ∂µφi (x)
30
2.4. Physical content
we have
δL
= −m2 Φ (2.37)
δΦ
and
δL
= ∂ µΦ . (2.38)
δ∂µ Φ
Indeed µ and ν are dummy indexes, and thus one should not forget to differen-
tiate both terms in Eq. (2.36). This finally leads to the equation of motion
µ m2 c2
∂ ∂µ + 2 Φ(x) = 0 (2.39)
~
which is our celebrated Klein-Gordon equation.
1 ∗ ∂ 2Φ ∗~ 2 m2 c2 ∗
iΦ − iΦ ∇ Φ + iΦ Φ = 0 , (2.41)
c2 ∂t2 ~2
while considering the complex conjugate of the Klein-Gordon equation (2.40)
multiplied by iΦ leads to
1 ∂ 2 Φ∗ ~ 2 ∗ m2 c2 ∗
iΦ 2 − iΦ∇ Φ + 2 iΦ Φ = 0 , (2.42)
c2 ∂t ~
so that the subtraction of Eqs. (2.41, 2.42) gives
2 2 ∗
1 ∗ ∂ Φ ∂ Φ ∗~ 2 ~ 2 ∗
i Φ − Φ 2 − i Φ ∇ Φ − Φ∇ Φ = 0 , (2.43)
c2 ∂t2 ∂t
31
2. The Klein-Gordon equation
or equivalently
∂ ∗ ∂Φ ∂Φ∗ ~
h
∗~ ~ ∗
i
i Φ −Φ + ∇ · −i Φ ∇Φ − Φ∇Φ = 0, (2.44)
c∂t c∂t c∂t
i.e.
∂ρ ~ ~
+∇ ·j = 0, (2.45)
∂t
where
∗
i~ ∂Φ ∂Φ
ρ = 2 Φ∗ −Φ , (2.46)
c ∂t ∂t
~ − Φ∇Φ
~j = −i~ Φ∗∇Φ ~ ∗ . (2.47)
Equation (2.45) just reflects the conservation of probability if (2.46) and (2.47)
are to be interpreted as the relativistic probability density and the probability
current respectively.
Introducing the four-current
µ
j = ρc, j ,~ (2.48)
∂µ j µ = 0 . (2.51)
Note that the fact that ρ is the time component of a four-vector is consistent
with the fact that ρ d3 V is the probability that a particle would be in the volume
d3V . Under a boost of Lorentz factor γ, d3 V is Lorentz contracted by a factor
of 1/γ, which is compensated by the fact that ρ is dilated by a factor of γ as any
time component of a four-vector, so that the probability is indeed invariant.
32
2.4. Physical content
Φ = N e−ip·x . (2.54)
j µ = 2pµ|N |2 . (2.55)
This looks very simple and elegant. However, substituting the plane wave solu-
tion (2.54) in the Klein-Gordon equation (2.24), we get the dispersion relation
p
2 2
p = m i.e. E = ± p~ 2 + m2 , (2.56)
33
2. The Klein-Gordon equation
34
2.5. In-depth study: global symmetries and Noether theorem
x′µ = xµ + δxµ ,
(2.62)
φ′ (x′) = φ(x) + δφ(x) .
The action for fields solutions of the equation of motion, which reads, before
the transformation, Z
S= d4x L(φ(x), ∂µ φ(x), x) (2.64)
Ω
thus becomes Z
′
S = d4x L(φ′ (x), ∂µ φ′ (x), x) (2.65)
Ω′
9
Although we only consider scalar fields in the present chapter, the following discussion also applies to arbitrary
fields.
35
2. The Klein-Gordon equation
which gives, after a first order Taylor expansion, neglecting terms of order 2,
Z Z
′ 4 δL δL
S =S+ d x δ̄φ + ∂µ δ̄φ + d3σµ δxµ L(φ, ∂µ φ, x) + · · · .
Ω δφ δ(∂µ φ) δΩ
(2.67)
Integrating by parts, one thus gets
Z Z
′ 4 δL δL 4 δL
S = S+ d x − ∂µ δ̄φ + d x ∂µ δ̄φ (2.68)
Ω δφ δ(∂µ φ) Ω δ(∂µφ)
Z
+ d3 σµ δxµ L(φ, ∂µ φ, x) + · · · .
δΩ
The second term vanishes since fields satisfy the equations of motion. Integrat-
ing the third term, we thus get
Z
δL
S′ = S + d3 σ µ δ̄φ + δxµ L + · · · (2.69)
δΩ δ(∂µ φ)
Z
δL δL
= S+ d3 σ µ δφ − ∂ν φ δxν + δxµ L + · · ·
δΩ δ(∂µ φ) δ(∂µφ)
36
2.5. In-depth study: global symmetries and Noether theorem
∂µ j µ = 0 .
Indeed, Z Z
∂Q
= d x ∂0 j (x) = − d3x ∂i j i (x) = 0
3 0
∂t
for rapidly decreasing field at infinity. More generally, the conserved charge can
defined through the relation
Z
Q= d3σµ j µ (x) , (2.73)
N3
L = ∂µ Φ∗ ∂ µΦ − m2 Φ∗Φ . (2.74)
37
2. The Klein-Gordon equation
and
2 2
m c
∂ µ ∂µ + 2 Φ∗(x) = 0 . (2.76)
~
Obviously, the lagrangian (2.74), and therefore the corresponding action, are
invariant under the so-called global gauge transforms
Φ → eieα Φ (2.77)
Φ∗ → e−ieαΦ∗ (2.78)
where e is an arbitrary parameter, which has the physical meaning of the ele-
mentary electric charge, due to its role in the conserved charge current which
we will now construct.
Let us compute the Noether current associated to this transformation. We
consider the infinitesimal versions of the transformations (2.77-2.78)
δΦ = ieδαΦ (2.79)
δΦ∗ = −ieδαΦ∗ (2.80)
with δx = 0. Therefore, according to Eq. (2.71), and using the fact that
δL
= ∂ µ Φ∗ , (2.81)
δ(∂µΦ)
δL
∗
= ∂ µΦ (2.82)
δ(∂µ Φ )
we get
valid for any δα, which can thus be factorized out, so that the electromagnetic
four-current
38
Part II.
Introduction to Quantum
Electrodynamics
39
3. From nonrelativistic
perturbation theory to Feynman
diagrams
3.1. Time dependent perturbation theory
General framework
We consider a particle which is described by the hamiltonian
H = H0 + V (t) , (3.1)
where H0 is the free hamiltonian which has known spectrum and eigenstates,
and V (t) is an interaction potential, which generally speaking depends on time.
(0)
The eigenstates of the free hamiltonian H0 are denoted as |Ψn i. They thus
satisfy
H0 |Ψ(0) (0)
n i = En |Ψn i (3.2)
hΨ(0) (0)
n |Ψm i = δnm . (3.3)
41
3. From nonrelativistic perturbation theory to Feynman diagrams
For that, we aim at finding its solution through an expansion over the basis
(0)
|Ψn (t)i of solutions of the free Schrödinger’s equation (3.4) governed by the
hamiltonian H0 .
We thus generically write
X X En t
|Ψ(t)i = an (t) |Ψ(0)
n (t)i = an (t) e−i ~ |Ψ(0)
n i. (3.7)
n n
Inserting the expansion (3.7) inside the Schrödinger’s equation (3.6), we ob-
tain
∂ X dan (t) En t
−i ~ (0)
X En t
i~ |Ψ(t)i = i~ e |Ψn i + an (t)Ene−i ~ |Ψ(0)
n i (3.9)
∂t dt
Xn n
X
−i E~nt (0) En t
= an (t)e V (t)|Ψn i + an (t)e−i ~ H0 |Ψ(0)
n i (3.10)
n n
and thus
X dan (t) En t
X En t
i~ e−i ~ |Ψ(0)
n i = an (t)e−i ~ V (t)|Ψ(0)
n i, (3.11)
n
dt n
42
3.1. Time dependent perturbation theory
Perturbation theory
We now assume that V is small, so that a perturbative treatment can be per-
formed. This is nothing more than a sophisticated series expansion in powers
of V . We thus write
(0) (1) (2)
af (t) = af + af + af + ···
order V 0 order V 1 order V 2 (3.16)
43
3. From nonrelativistic perturbation theory to Feynman diagrams
etc.
Transition amplitude
We are mostly interested in transitions between an initial state (t = −T /2) and
a final state (t = +T /2), in which the interaction can be neglected both in the
initial and in the final states. We therefore define the transition amplitude
Z T
i 2
′
X
′ (0)
(Ef −En )t′
Tf i = af (T /2) − af (−T /2) = − dt an (t ) hΨf |V (t)|Ψ(0)
n ie
i ~ ,
~ − T2 n
(3.24)
where we have subtracted the contribution af (−T /2), since we are not inter-
ested in the situation in which nothing occurred.
The potential V (t) is assumed to be a local space-time operator, i.e.
Tf i (3.27)
Z T Z Z
i 2 X (0)
(Ef −En )t
=− dt d3 ~x d3 ~x ′ an (t) hΨf |~xih~x|V (t)|~x ′ ih~x ′ |Ψ(0)
n i an (t) e
i ~
~ − T2 n
Z T Z
i 2
∗(0)
X
=− dt d3 ~x Ψf (x) V (x) Ψ(0)
n (x)an (t) ,
~ − T2 n
44
3.1. Time dependent perturbation theory
Thus,
Z +∞
(1) i Ef −Ei
Tf i = − Vf i dt ei ~ t
= −2πi Vf i δ(Ef − Ei) . (3.32)
~ −∞
where
Z
(0) (0) ∗(0) (0)
Vf i = hΨf |V |Ψi i = d3~x Ψf (~x) V (~x) Ψi (~x) . (3.33)
∆t = ∞ ⇔ ∆E = 0 (3.34)
45
3. From nonrelativistic perturbation theory to Feynman diagrams
Order 2
(2)
From the expression (3.23) obtained for af (t), one gets
2 Z T
(2) i 2 X (0) E −En
i f ~ t′
Tf i = − dt′ hΨf |V (t′ )|Ψ(0) n i e
~ − T2 n
Z t′
(0) i En ~−Ei t′′
× dt′′ hΨ(0)
n |V (t′′
)|Ψ i ie . (3.40)
− T2
46
3.1. Time dependent perturbation theory
We should now take the limit T → ∞. Meanwhile, the interaction should cease
at t = −∞ and t = +∞. For that, we introduce an adiabatic parameter ǫ > 0
to make the t′′ integration meaningful. Physically, it is introduced in order to
circumvent the fact that choosing a time-independent potential is at odd with
the fact that it is supposed to vanish at t = −∞ and t = +∞.
One now has
Z t′
En −Ei −iε ′′
dt′′ ei ~ t
− T2
1 h En−Ei −iε ′ E −E −iε
i
i t i n ~ i (− T2 )
= i~ e ~ −e
Ei − En + iε
1 En −Ei −iε ′
−−−−−→ i~ ei ~ t (3.42)
T →+∞ Ei − En + iε
where we have used the fact that second term in the second line vanishes in the
limit T → +∞, thanks to the damping factor exp(−εT /(2~)).
Thus,
2 X Z +∞
(2) i Vf n Vni Ef −En +En −Ei ′
Tf i = − i~ dt′ ei ~ t
~ n
Ei − En + iε −∞
X 1
= −2πi Vf n Vni δ(Ei − Ef ) , (3.43)
n
E i − E n + iε
which exhibit the structure of the result: the transition amplitude is the product
of two matrix elements of the potential, between which an energy denominator
is to be inserted, describing the propagation of the particle from state i to
state f . The concept of propagation can be made more precise based on the
formalism of Green’s functions, see any advanced quantum mechanics lectures,
e.g. Messiah Chap. XIX.
47
3. From nonrelativistic perturbation theory to Feynman diagrams
•V
fi
⋄ in the first diagram, following this backward flow of arrows, one first faces
the last scattering (at time t2 ) and then the first scattering (at time t1 ).
Therefore, this first diagram involves a single particle at every stage.
⋄ in the second diagram, following this backward flow of arrows, one first
faces the first scattering (at time t1 ) and then the second scattering (at
time t2 ).
As we have seen in Chap. 2, a particle propagating back in time should be
interpreted as an antiparticle propagating forward in time. At time t1 , a
particle-antiparticle pair is produced, so that between the two scatterings
48
3.1. Time dependent perturbation theory
t
f f
t2 Vf n • n Vf n n •
t1 • Vni • Vni
i i
at t1 and then t2 , the state is made of two particles and one antiparticle.
At time t2, the virtual pair annihilates, and the state is made of a single
particle.
49
4. Electrodynamics
We use the Heaviside-Lorentz’s units: Coulomb force is given by
QQ′
F = (4.1)
4πr2
which means that ε0 = 1 and µ0 = 1 . Furthermore, we use a system of units in
which c = 1 . We refer to Jackson’s appendix1 for a detailed discussion on the
various systems of units.
51
4. Electrodynamics
i.e.
Another useful tensor is obtained from F µν , its dual. To define it, one should
introduce the 4-dimensional Levi-Civita tensor ε (see Appendix B for details).
Its is defined as a fully antisymmetric tensor such that its contra-variant com-
ponents are given by2
+1 if {µνρσ} is an even permutation of {0,1,2,3}
µνρσ
ε = −1 if {µνρσ} is an odd permutation of {0,1,2,3} (4.18)
0 otherwise.
1
F̃ µν = εµνρσ Fρσ = −F̃ νµ (4.20)
2
since εµνρσ is completely antisymmetric.
Conversely,
1
εµνρσ F̃ ρσ = εµνρσ ερσλτ Fλτ
2
1
= ερσµν ερσλτ Fλτ
2
1
= (−2)(gµλgντ − gµτ gνλ )Fλτ = −2Fµν (4.21)
2
2
Note that this definition is not universal. Some authors use this definition for the covariant components,
which leads to an opposite tensor with respect to the present definition.
53
4. Electrodynamics
and thus
1
Fµν = − εµνρσ F̃ ρσ . (4.22)
2
In conclusion, we have
1 µνρσ
F̃ µν = ε Fρσ (4.23)
2
1
Fµν = − εµνρσ F̃ ρσ . (4.24)
2
i.e.
~ → −E
B ~
µν
F −−−−−→ F̃ µν . (4.30)
~ →B
E ~
54
4.2. Covariant formulation of electrodynamics
Orthogonal transformations
Le us first consider orthogonal transformations O(3). They correspond to par-
ticular transformations Λ of the form
i.e.
1 0 0 0
0
Λ= . (4.33)
0
0
O
Ot O = OOt = Id (4.34)
so that:
⋄ either det O = +1: SO(3) which are rotations
Let us now consider the way various components of F transform under the
Lorentz transform Λ.
⋄ Electric field:
′
i0
F = Λi j Λ00 F j0 , (4.37)
so that
′
E i = Oi j E j (4.38)
55
4. Electrodynamics
~ transforms as a
as expected for a vector: this equation tells us that E
vector under a rotation O = R ∈ SO(3), and gets reversed, just as ~x,
under parity.
Therefore
F ′ ij = Λi i′ Λj j ′ F i j ,
′ ′
(4.39)
with
′
F ′ ij = −εi′ j ′ k′ B k . (4.40)
We thus have
1 1
B ′ k = − εijk F ′ ij = εijk Oi i′ Ojj ′ εi′ j ′ k′ B k .
′
(4.41)
2 2
Using the fact that
εijk = εijnδnk (4.42)
and
εijk Oi i′ Ojj ′ = εijn Oi i′ Ojj ′ Onp Okp = εi′ j ′ p Okp det O , (4.44)
since
εijnOi i′ Ojj ′ Onp = εi′j ′ p det O , (4.45)
see Appendix B. We finally get
1 1 ′ ′
B ′ k = − εijk F ′ ij = εi′j ′ p εi′j ′ k′ Okp B k det O = det O Okk′ B k , (4.46)
2 2
since εi′ j ′ p εi′ j ′ k′ = 2δpk′ .
~
Due to the presence of the prefactor det O, this equation tells us that B
transforms as a vector under a rotation O = R ∈ SO(3), and remains
invariant under parity. Therefore,
56
4.2. Covariant formulation of electrodynamics
Pure boosts
One can show, see tutorial, that under an arbitrary boost along the direction
~n (~n2 = 1), i.e. with a velocity ~v = β~n,
h i
~ ′ ~ ~ ~
E = (E · ~n)~n + γ E − (E · ~n)~n + γ ~v ∧ B ~, (4.47)
h i
~ ′ = (B
B ~ · ~n)~n + γ B~ − (B
~ · ~n)~n − γ ~v ∧ E
~. (4.48)
Similarly,
1 ′ ′
F̃µν F̃ µν = εµνρσ F ρσ εµνρ σ Fρ′ σ′
4 ′ ′
1 σ ′ ρ′
= (−2) gρ gσ − gρ gσ F ρσ Fρ′ σ′ = −F ρσ Fρσ , (4.52)
ρ σ
4
therefore identical (up to the sign) with the contraction of F with itself,
see the expression (4.51), a fact which is obvious by just applying the
symmetry (4.30) to the expression (4.51).
57
4. Electrodynamics
In conclusion, we have built two relativistic invariants from the two tensors F
and F̃ :
~2−B
Fµν F µν = −F̃µν F̃ µν = −2(E ~ 2) , (4.54)
~ ·B
Fµν F̃ µν = −4E ~. (4.55)
∂µ F µν = j ν , (4.56)
∂µ F̃ µν = 0 . (4.57)
Proof:
⋄ Gauss’s law:
→ ~
−
∂i F i0 = j 0 ⇔ ∇ · E =ρ (4.58)
58
4.3. Covariant form of Maxwell’s equations
with
so that
~ →i ~ n
− ~ → ~ k
−
k
−∂t E − εikn ∇ B = −∂t E + ∇ ∧ B = j k (4.61)
i.e.
−→ ~ ∂E ~
rotB − = ~j . (4.62)
∂t
∂i F̃ i0 = 0 ⇔ ∂i B i = 0 ⇔ ~ = 0.
divB (4.63)
⋄ Maxwell-Faraday’s equation:
with
so that
−→~ B ~
rotE + = 0. (4.66)
∂t
Compatibility condition:
∂ν ∂µ F µν = 0 = ∂ν j ν , (4.67)
∂ν j ν = 0 . (4.68)
59
4. Electrodynamics
4.3.2. Four-potential
From F µν to Aµ
Several remarks are in order:
~ and B,
⋄ Maxwell’s equations, expressed in terms of E ~ are not covariant.
⋄ They only involve first order time derivative of the dynamical variables
~ and B.
E ~ Thus, their conjugated momenta are not independent of the
dynamical variable3.
One should rather look for second order equations.
Let us introduce the four-potential Aµ such that
F µν = ∂ µAν − ∂ ν Aµ . (4.69)
Theorem (Poincaré):
→ 0 ∂A
− ~
F i0 i 0
= ∂ A −∂ A 0 i
⇔ ~
E = − ∇A − (4.70)
∂t
and
F ij = ∂ iAj − ∂ j Ai (4.71)
i.e.
1 1 → ~ k
−
ij i j j i i j
− εijk F = − εijk ∂ A − ∂ A = −εijk ∂ A = ∇ ∧ A (4.72)
2 2
and thus
→ 0 ∂A
− ~
~
E = − ∇A − (4.73)
∂t
→
−
~ = ∇ ∧A~.
B (4.74)
3
Consider for example the trivial case L = φ̇φ, an action in which φ̇ does not appear quadratically.
60
4.3. Covariant form of Maxwell’s equations
Consistency check:
Gauge invariance
The relation (4.69) does not provide a unique 4-potential Aµ .
Indeed, the gauge transformation
Aµ (x) → Aµ(x) + ∂ µ φ , (4.76)
where φ is an arbitrary function, leaves F µν invariant.
Assuming the space-time domain to be contractible, Maxwell’s equations are
equivalent to
∂µ F µν = Aν − ∂ ν (∂µ Aµ ) = j µ . (4.77)
The equation
Aν − ∂ ν (∂µ Aµ ) = j µ (4.78)
is gauge invariant, since under the gauge transformation (4.76), we have
Aν − ∂ ν (∂µ Aµ ) → Aν + ∂ ν φ − ∂ ν (∂µ Aµ ) − ∂ ν φ = Aν − ∂ ν (∂µ Aµ ) .
(4.79)
Let us write the various components of Maxwell’s equations (4.78) explicitly:
⋄ Time component:
0 ∂ ∂A0 ~ = 0,
A − + divA (4.80)
∂t ∂t
i.e.
∂ ~ = ρ.
−∆A0 − divA (4.81)
∂t
⋄ Space components:
0
−
→
~+∇ ∂A ~ = ~j .
A + divA (4.82)
∂t
61
4. Electrodynamics
Usual gauges
There is an infinite set of possible gauge choices. The most popular one are:
⋄ Landau gauge
It relies on the covariant gauge-fixing condition
∂µ Aµ = 0 (4.83)
which thus implies that
Aν = j ν . (4.84)
⋄ Coulomb gauge
It relies on the non-covariant gauge-fixing condition
~=0
divA (4.85)
which implies that the scalar potential A0 satisfies
−∆A0 = ρ . (4.86)
The solution of this second-order differential equation is given by
Z
1 ρ(t, ~y)
A0 (t, ~x) = d3 y (4.87)
4π |~y − ~x|
since
1 ′
−∆ = 4πδ(~
r − ~
r ). (4.88)
|~r − ~r ′ |
In this gauge, A0 is therefore an instantaneous potential. The vector
~ is then, from Eq. (4.82), solution of the equation
potential A
Z 3 ′ ′
→
−
~ = ~j − ∇ d x ∂ρ(t, ~
x ) 1
A . (4.89)
4π ∂t |~x − ~x ′ |
Note the consistency with the fact that the current should be conserved:
Z 3 ′ ′
~ = div~j − ∆ d x ∂ρ(t, ~
x ) 1
div A (4.90)
4π ∂t |~x − ~x ′ |
Z ′
′ ∂ρ(t, ~x )
= div~j + d3x′ δ(~x − ~x ) (4.91)
∂t
∂ρ(t, ~x)
= div~j + = 0, (4.92)
∂t
which is therefore consistent with our gauge choice divA ~ = 0.
62
4.4. Lagrangian for photons
⋄ Temporal gauge
In this gauge, the scalar potential is chosen to vanish
A0 = 0 . (4.93)
⋄ Axial gauge
In this gauge, specified by a given space-like direction, e.g. z,
A3 = 0 . (4.94)
⋄ Light-cone gauge
This gauge, of particular use in high-energy particle physics, is specified
by choosing a light-cone vector n with n2 = 0, such that
A· n = 0. (4.95)
A· n = 0. (4.96)
Clearly, n2 > 0 is the temporal gauge, n2 < 0 is the pure axial gauge, and
n2 = 0 is the light-cone gauge.
63
4. Electrodynamics
Let us study the gauge invariance of the equation of motion for j = 0 : under
the gauge transformation (4.76), we get
and using the fact that the second term in the r.h.s. is a total derivative, which
thus does not contribute to the action, it can be simply omitted. From the fact
that b + d = −c, we get
L = −c (∂µ Aν ∂ν Aµ − ∂µ Aν ∂ µ Aν ) + e Aµ j µ . (4.102)
64
4.4. Lagrangian for photons
in L. Now, by inspection
c
Fµν F µν (4.108)
2
provides
c 2
× 2 × F 0i F0i = c ∂ 0 Ai∂0Ai = −c ∂0 Ai (4.109)
2
so that c = − 21 .
In conclusion,
One should note that the Lagrangian density (4.110) is not gauge invariant if
j(x) is an external current. Indeed under a gauge transformation,
Aµ → Aµ + ∂ µ φ
L(x) −−−−−−−→ L(x) − jµ ∂ µ φ . (4.112)
Still, since
jµ ∂ µφ = ∂ µ (φjµ ) − φ ∂ µ jµ (4.113)
in which, in the r.h.s, the first term is a total derivative and the second one van-
ishes because jµ is a conserved current. Therefore, both terms can be omitted
from the action I.
65
4. Electrodynamics
66
4.5. Coupling between matter and electromagnetic field
One can add a potential term to L, of the form V (Φ∗Φ), without spoiling the
gauge invariance, so that we denote
67
4. Electrodynamics
problem: indeed, this is valid for jµ being external, non dynamical, but what if
j µ is promoted to be dynamical, being itself constructed from dynamical fields?
But this new term in the Lagrangian, which now involves derivatives of the
fields Φ and Φ∗, will modify the Noether current itself, so that this additional
term should be itself modified, changing the current, etc. My gosh! Do we enter
an endless loop?
Let us show that one can find a minimal and consistent solution to this
problem. For that purpose, we write
- Since the dynamics of Aµ (its time derivative, and by covariance any of its
derivatives) is inside Lem , only Lem contributes to the l.h.s. of Eq. (4.134).
68
4.5. Coupling between matter and electromagnetic field
We now make a minimal assumption: we look for a solution with L′int indepen-
dent of ∂µ Φ and ∂µ Φ∗.
Thus,
∂Lint
ρ
= ieAρ Φ∗, (4.141)
∂(∂ Φ)
∂Lint
= −ieAρ Φ (4.142)
∂(∂ ρΦ∗)
and
Jρ = jρ + ie [ieAρ Φ∗ Φ − (−ie)AρΦΦ∗]
= jρ − 2e2Aρ Φ∗Φ . (4.143)
69
4. Electrodynamics
∂µ F µν = J µ (4.147)
read4
∂V
(∂µ + ieAµ ) (∂ µ + ieAµ ) + m2 Φ∗ = − . (4.150)
∂Φ
and
∂V
(∂µ − ieAµ ) (∂ µ − ieAµ ) + m2 Φ = − ∗ . (4.151)
∂Φ
4
Beware on the fact that in these two equations, the differential operators ∂µ and ∂ µ act on any structure
which is on their right (e.g. Aµ , Φ).
70
4.5. Coupling between matter and electromagnetic field
71
4. Electrodynamics
Thus,
Clearly, while −m2 Φ∗Φ − V (Φ∗Φ) is gauge invariant, this invariance is broken
by the term (∂µΦ(x))∗∂ µ Φ(x).
The way to restore this gauge invariance is to introduce a new field Aµ , which
carries a µ index like ∂µ (it is thus a spin one field, as shown by the study of
the representations of the Lorentz group). These two are combined to built up
the covariant derivative
Dµ = ∂µ − ieAµ .
Dµ Φ(x) → [∂µ Φ(x) + ie∂µ α(x) Φ(x) − ieAµ (x) Φ(x) − ie∂µ α(x) Φ(x)] eieα(x)
= [∂µ Φ(x) − ieAµ (x) Φ(x)] eieα(x)
= [Dµ Φ(x)] eieα(x) (4.165)
and similarly
72
4.5. Coupling between matter and electromagnetic field
the kinetic part of the Lagrangian becomes gauge invariant. Adding this term
to the mass and potential terms for the fields Φ and Φ∗, and to the pure QED
Lagrangian, one therefore gets the full QED gauge invariant Lagrangian which
we obtained earlier, see Eq. (4.156).
This construction can be extended to other groups. This is the essence of
the Yang-Mills construction. This is THE way to construct a dynamical field
theory which couples matter and gauge fields.
For example, passing from U (1) to U (1) × SU (2) gauge invariance led to
the construction of the electroweak theory, with quarks and leptons (electron,
muon, tau and their associated neutrinos) as matter fields, carrying charges
under this group, and γ, W ±, Z 0 as gauge fields.
Similarly, having a gauge theory based on the (color) group SU (3) leads to the
(quantum) chromodynamics (QCD), the modern theory of strong interaction,
with quarks as matter fields, and gluons as gauge fields.
This leads to the Standard Model, based on the gauge group U (1) × SU (2) ×
SU (3).
One should note that the gauge field dynamics itself is governed by a term of
the type (4.131). In general, as it is the case for the Standard Model, the gauge
group is non-abelian. The consequence is that the gauge fields themselves carry
a charge: by construction, gauge fields live in the adjoint representation of the
group. For an abelian group, this representation is trivial, and indeed we know
that the photon as no charge, and therefore does not couple to itself. But for
a non-abelian group, the gauge fields acquire a charge, so that they can couple
to each other. This is the case of W ± which carry an electric charge, of Z 0 , W ±
which carry a weak isospin, of the gluons which carry a color charge. This is
technically hidden in the field strength F µν which is not anymore linear in the
gauge field.
We refer to lectures on group theory for more details.
73
5. From Lagrangians to
cross-sections
In this chapter, we will show how to compute a physical observable for a given
scattering process, relying on the Lagrangian built in the previous chapter. This
will be limited to the case of scalar matter, in order to avoid any complication
related to spin.
L = T − U, (5.1)
we can identify
U = −Lint . (5.2)
From now on, we only consider the dominant e1 term, neglecting the e2 term,
so that the Lint as the simple structure of a current jµ interacting with the field
Aµ :
75
5. From Lagrangians to cross-sections
Let us consider now the matrix element jµf i = hf |jµ |ii of this current:
The normalization constants Ni and Nf in Eqs. (5.7) and (5.8) will be specified
later. We thus have
Thus, since
76
5.1. Perturbation theory
(1)
jµ
pA pC
e− e −
γ ↓q
pB µ− µ− pD
(2)
jµ
Figure 5.1.: e− µ− scattering.
we get
1 (2)
Aµ = − j (5.14)
q2 µ
where q = pD − pB .
At lowest order, we get, after integration over x,
Z
1
Tf i = −i jµ(1) − 2 j (2) µ (x) d4x (5.15)
q
= iNA NB NC ND (2π)4δ (4) (pA + pB − pC − pD ) M (5.16)
where
g µν
µ
iM = [ie(pA + pC ) ] −i 2 [ie(pB + pD )ν ] (5.17)
q
(5.18)
vertex photon vertex (5.19)
propagator (5.20)
77
5. From Lagrangians to cross-sections
5.2. Cross-sections
5.2.1. normalization of a free particle wave-function
The wave-function of a particle of momentum p reads
We have shown in Chap. 2 that for such a plane wave, ρ = 2E|N |2, see
Eq. (2.55).
One should note the consistency and the Lorentz invariance under boosts.
Indeed, ρ d3 x is a Lorentz invariant, since
Boost d3 x
d3 x −→ (5.22)
γ
Boost
2E −→ γ2E . (5.23)
i.e.
1
N=√ . (5.25)
V
78
5.2. Cross-sections
The reason why one divides Wf i by the initial flux, which is the density of
incoming states is that one wants to normalize the result independently of any
particular experimental setup (like the density of the target, of the beam).
V d3 p
ρV (p) = states . (5.29)
(2π)3
This is a pure quantum mechanical effect. Consider first a particle in a one-
dimensional box of length L. The boundary conditions implies, since the wave-
function is of the form
Φ ∼ eipx x , (5.30)
ρV (p) V d3 p
= , (5.32)
2E (2π)32E
where we have normalized the wave-function such that there are 2E particles
in the volume V.
For outgoing particles C and D scattered into d3pC and d3 pD , we thus have:
V d3 pC V d3 pD
number of final states = . (5.33)
(2π)32EC (2π)32ED
79
5. From Lagrangians to cross-sections
~vA
Thus, the number of beam particles passing through a unit area per unit of
time is
2EA
|~vA | . (5.35)
V
The number of target particles per unit volume is
2EB
. (5.36)
V
We thus finally define the initial flux as
2EA 2EB
|~vA | . (5.37)
V V
5.2.5. Cross-section
Putting all the factors together, we obtain
V2 1 2 4 (4) V d3 pC V d3 pD
dσ = |M| (2π) δ (p A + p B − p C − p D )
|~vA |2EA 2EB V 4 (2π)32EC (2π)32ED
(5.38)
so that the V dependence cancels, as expected. One can thus write the differ-
ential cross-section as
|M|2
dσ = d(P S) (5.39)
F
where the phase-space is given by
4 (4) V d3 pC V d3 pD
d(P S) = (2π) δ (pA + pB − pC − pD ) (5.40)
(2π)32EC (2π)32ED
80
5.2. Cross-sections
Exercise 5.1
Exercise 5.2
show that
F = 2K = 4p∗i W ∗ (5.46)
where
Exercise 5.3
81
5. From Lagrangians to cross-sections
1 p∗f 2
d(P S) = 2 d Ω. (5.49)
4π 4W ∗
Finally, the differential cross-section can be written as
dσ 1 p∗f
2
= 2 ∗ |M|2 , (5.50)
d Ω c.m.s 64π s pi
s = (pA + pB )2 = W ∗2 . (5.51)
82
Appendices
83
A. Elements of group theory
A.1. Group
A.1.1. Definitions
Définition 1.1 : Group
A group is a pair (G, ·) made of a set G and an operation acting on this set,
which associates two a pair of elements a and b of G an element a · b.
This law should satisfy four axioms:
Associativity: ∀a, b, c ∈ G, (a · b) · c = a · (b · c)
Depending on the context, the group law · can be denoted using various symbols:
85
A. Elements of group theory
⋄ If the cardinal is finite, G is said to be a finite group (!) and its order is
denoted as |G|.
⋄ When the group has an infinite order, the group can be discrete (topolog-
ically), or continuous.
⋄ G = (S(E), ◦) :
On a set E, the set S(E) of bijections from E to E, equipped with the
composition law ◦ for maps is a group.
⋄ Symmetric group Sn :
In the special case where E = {1, · · · , n}, one denotes Sn as the set S(E),
which is called the symmetric group of n elements. Its order is n!, and it
is non abelian for n > 3, made of permutations of the various n elements
{1, · · · , n} .
86
A.1. Group
87
A. Elements of group theory
S t gS = g .
f (E) = {f (x)/x ∈ E} .
f (A) = {f (x)/x ∈ A} .
88
A.2. Morphisms and subgroups
∀y ∈ F, ∃x ∈ E /y = f (x) .
∀y ∈ F, ∃! x ∈ E /y = f (x) .
x = f −1(y) ⇔ y = f (x) .
Be aware that conventionally, the same notation f −1 is used for both the
inverse bijection and for the inverse image of a set.
A.2.2. Morphism
Définition 1.12 : Group morphism
Let (G, ⋆) et (G′ , ·) be two groups. A morphism (or homomorphism) from G to
G′ is a map f : G → G′ which satisfies
In other words, it means that the map f preserves the group law on G and G′ .
Définition 1.13 : Group isomorphism
89
A. Elements of group theory
⋄ Besides, as we have seen above for the general case of a function, a mor-
phism f is surjective if and only if Im f = G′ .
A.2.3. Subgroup
90
A.2. Morphisms and subgroups
Examples :
Propositions 1.20 :
More generally:
Examples :
91
B. Levi-Civita symbols and tensors
B.1. Levi-Civita symbol in an euclidean space
B.1.1. 2d
In two dimensions, one denotes
since
B.1.2. 3d
In three dimensions, one denotes
93
B. Levi-Civita symbols and tensors
Indeed, there are 6 terms, as can be seen from the fact in 3d, i, j and k, as well
as ℓ, m and n can take the 3 different values 1, 2, 3: thus i should be equal to
ℓ, m or n, which forces j to be one of the two remaining indexes among ℓ, m
or n, so that k is the last one, i.e. 3 × 2 × 1 = 6 choices. The first choice i = ℓ,
j = m, k = n, i.e δiℓδjm δkn, gives, without sum, ε2ijk = +1, while the 5 other
terms are simply obtained by permutation of indexes, accounting for the sign
provided by the product of the two ε.
since
Finally,
1
det A = εi1 ···in a1i1 · · · anin = εi ···i εj ···j ai j · · · ain jn (B.13)
n! 1 n 1 n 1 1
94
B.1. Levi-Civita symbol in an euclidean space
and
which can be understood as the way the Levi-Civita symbol transforms under
an arbitrary linear transformation.
Rn → Rn (B.15)
x 7→ O x with xi = Oi j xj (B.16)
O Ot = Ot O = 1 . (B.17)
Taking the determinant of both sides, one sees that det O = ±1 . The case
det O = 1 defines the subgroup SO(n), named special orthogonal group, also
named rotation group, by extension of the group of rotations in n = 2 dimension
or n = 3 dimension.
Inserting this inside the general transformation (B.14) of the Levi-Civita
symbol, we see that1
Oi1 j1 · · · Oin jn εi1 ···in = (det O) εj1 ···jn , with det O = ±1. (B.18)
1
We are working here in the euclidean space Rn , with a trivial metric given by the identity matrix, therefore
the up or down position of indexes is arbitrary.
95
B. Levi-Civita symbols and tensors
Using as usual the tensor metric gµν and its inverse, one can descend or raise
any index.
Since det g = −1 , one has
One should be careful with the fact that the normalization of εµνρσ is conven-
tional. In some textbook/articles, it is defined with an opposite sign. We use
here the convention of Jackson, Itzykson-Zuber and Peskin-Schroeder.
′ ′ ′ ′
εµνρσ εµν ρ σ = − det g αα α = ν, ρ, σ row
α′ = ν ′, ρ′ , σ ′ column (B.22)
′ ′
µνρσ ρ′ σ ′ ρρ σσ ρσ ′ ρ′ σ
ε εµν = −2 g g − g g . (B.23)
′ ′
εµνρσ εµνρ σ = −6g σσ . (B.24)
96
B.2. Levi-Civita tensor in 4d Minkowski space
Thus:
In conclusion,
97