Lecture Notes 12
Lecture Notes 12
Spin 1
Lecture notes 12
All these effects involve the behaviour of a particle with spin and/or orbital
angular momentum in a magnetic field.
In section 12.1 in these notes, we begin by considering the magnetic moment due
to the orbital motion of a particle, both classically and quantum-mechanically.
We then review the experiment of Stern and Gerlach, which clearly reveals the
intrinsic angular momentum (spin) of the electron. We also give a survey of the
spins of other particles.
In section 12.2 we establish a formalism for spin 12 , based on the general discussion
of angular momenta in Lecture notes 11.
µ = I A n̂ ≡ IA,
where I is the current and n̂ is the unit vector perpendicular to the loop plane.
The same effect is observed when a magnetic needle is placed in a megnetic field.
Another example: The figure shows a particle with mass m and charge q (< 0) which is
kept moving in a classical circular orbit by a central field V (r). This constitutes a current
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 3
loop. With the radius r, velocity v and revolution frequency ν = v/(2πr) the current is
I = qν. This results in a magnetic moment
v
|µ| = I A = q πr2 = 12 qrv = 21 q |r × v|.
2πr
Thus the magnetic moment due to the motion of the charge is proportional to the orbital
angular momentum L = m r × v :
q
µL = L. (T12.5)
2m
The ratio q/2m between these two quantities is known as the gyromagnetic ratio.
Let us add that for this kind of classical motion the angular momentum L and the
magnetic moment µL are not constants of motion when B differs from zeo. According to
Newtons 2. law and equation (T12.1) we have that
dL q
= τ = µL × B = − B × L ≡ ω L × L. (T12.6)
dt 2m
From this equation it follows that L (and hence µL ) precess. The precession frequency,
q
ωL ≡ − B, (T12.7)
2m
is known as the Larmor frquency.
With these classical considerations in mind, let us see what quantum mechanics has to say
about these things.
for a particle with mass m and charge q. As an example we may consider the “doughnut”
state
ψ211 = −(64πa50 )−1/2 r e−r/2a0 sin θ eiφ
for the hydrogen atom. This is an eigenstate of L b with eigenvalue h̄. According to (T12.8)
z
it is then also an eigenstate of
−e b
(µb L )z = Lz ,
2me
with eigenvalue −eh̄/(2me ). (In this example we neglect the difference between the electron
mass and the reduced mass.)
It is instructive to cosider this example more closely. The probability distribution
resembles a doughnut and is of course time independent, as is the case for all
stationary states,
1
ρ211 = r2 er/a0 sin2 θ.
64πa50
Inside this doughnut there is a probability current. This can be calculated from
the formula for the probability-current density:
" #
h̄ ∂ 1 ∂ 1 ∂
j = <e ψ ∗ ∇ψ ; ∇ = êr + êθ + êφ .
im ∂r r ∂θ r sin θ ∂φ
Here we observe that the components of the gradient in the r- and θ directions
give imaginary contributions to the expression inside the brackets, and hence
no contribution to j. Thus the probability in the doughnut is flowing in the φ
direction. We find that
h̄/me h̄ 1
j = êφ 5
r e−r/a0 sin θ = êφ ρ211 · .
64πa0 me r sin θ
Defining a velocity by the relation j = ρ211 v, we can calculate the local velocity
of the probability current:
|j| h̄ 1 a0
v= = = αc .
ρ211 me r sin θ r sin θ
Here, r sin θ — the distance from the z axis — is of course typically of the order of
the Bohr radius a0 , so that the velocity is of the order of αc, as could be expected.
Note, however, that this velocity is variable; the probability distribution is not
rotating around the z axis as a “solid” doughnut.
The probability ρd3 r in the volume element d3 r corresponds to a momentum
contribution ρd3 r · mv = mjd3 r and an electric-current contribution −ejd3 r.
Since h p i is real, the former also follows from
" #
Z
h̄ Z
h̄ Z
hpi = ψ ∗ ∇ψ d3 r = m <e ψ ∗ ∇ψ d3 r = mj d3 r.
i im
The expectation values of Lz and (µL )z can be calculated in a similar manner,
and the results are as expected: Because (r × êφ )z = r sin θ, we have
!
Z
h̄ Z
h Lz i211 = ψ∗
211 r× ∇ ψ211 d3 r = (r × me j)z d3 r
i zZ
Z
3
= rme j sin θ d r = h̄ρ211 d3 r = h̄,
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 5
and
Z
3 −eh̄ Z −eh̄
h (µL )z i211 = 1
2
[r × (−ej)]z d r = ρ211 d3 r = , q.e.d.
2me 2me
This example illustrates the connection between the orbital angular momentum
and the magnetic moment of a moving particle.
As illustrated by this example, the operator relation (T12.8) implies that the magnetic
moment µL due to the orbital motion is quantized in the same way as the orbital angular
momentum. This means that the size and one of the components can have sharp values
simultaneously. For an electron (with q = −e) this implies that the size can take the values
e q
|µL | = |L| = µB l(l + 1) ; l = 0, 1, 2, · · · , (T12.9)
2me
while for example the z component can take the values
e
(µL )z = − Lz = −mµB ; m = 0, ±1, ±2, · · · , ±l. (T12.10)
2me
Here the quantity
eh̄
µB ≡ (1 Bohr magneton) (T12.11)
2me
is the natural unit for the magnetic moment for the elctron, just as h̄ is the natural unit for
angular momenta. [1 µB = 5.788 · 10−5 eV/T(esla).]
We note also that the quantization of Lz and (µL )z corresponds to a so-called space
quantization, that is, quantized values of the angle between the vector µL and the z axis.
Note also that because L and µL are not classical constants of motion in a magnetic
field, they also are not quantum-mechanical constants of motion. With this statement
we mean that the expectation values h L i and h µL i are not constant. These vector-valued
expectation values will in fact precess in the same manner as the classical quantities.
Let us try to measure the magnetic moment of an atom by sending a beam of these atoms
(collimated by a slit) through an inhomogeneous magnetic field in order to measure the
deflection. If the beam is positioned such that it passes through a vertical field, it should be
deflected vertically (in the z direction). We can show this by assuming an inhomogeneous
field given by
B = axêx − (B0 + az)êz ,
where a = −∂Bz /∂z is a suitable constant. You can easily check that this field is divergence
free (∇·B = 0), as is the case for all magnetic fields. The force becomes
Here we must now remember that µ precesses rapidly around the B field, that is, about
the z axis. The x-component µx therefore averages to zero and gives no deflection in the
x direction. Thus we get a deflection in the positive or negative z direction, depending on
the force Fz = −aµz . The measurement of the deflection therefore is a measurement of µz .
(Note that µz is constant during the precession.)
In 1921 (before quantum mechanics) Stern and Gerlach expected that the directions of
the magnetic moments |µ| of the atoms entering the magnetic field should be randomly
distributed, corresponding to a continuous variation of µz between −|µ| and +|µ|. They
were hoping to measure the maximal deflection up and down (corresponding to α = 0 and
α = π), making it possible to calculate |µ|, using the velocity of the atoms, the atomic mass,
the length of the magnet, and the parameter a.
After 1925, quantum mechanics has taught us that a measurement of µz must give one
of its eigenvalues (according to the measurement postulate). Since the direction of µ and
hence µz are quantized according to (T12.10), we must therefore expect to observe a quan-
tized deflection. If the total angular momentum of the atom is “integral” (given by an
integral angular-momentum quantum number l), we should then expect to find 2l + 1 dis-
crete deflections, that is, an odd number of pictures of the slit on the screen in the figure
above.
The experiment, however, showed something else: Stern and Gerlach (1922) used a gas
of silver atoms (Z = 47) heated in a furnace. The result was two distinct pictures of the slit
on the screen.
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 7
A similar experiment with hydrogen atoms, conducted by Phipps and Taylor in 1927, also
resulted in two pictures of the slit.
Experiments show that the intrinsic magnetic moment connected with the spin is
−e
µS = ge S. (T12.13)
2me
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 8
Here the factors in front of S is the product of the gyromagnetic ratio we found for the
orbital motion, and ge which is a dimensionless factor. Very accurate measurements show
that this factor is
ge = 2 × 1.001159652187 (±4) (gyromagnetic factor of the electron), (T12.14)
where the uncertainty (±4) is in the last digit.
To sum up, we may state that the two discrete deflections observed in the SG-experiment
are due to the fact that there are only two possible values of µz , Sz and Fz :
eh̄
ms = + 21 : Sz = + 12 h̄, (µS )z = − 21 ge = − 12 ge µB , Fz = −aµz = 21 ge aµB ,
2me
ms = − 12 : Sz = − 21 h̄, (µS )z = + 21 ge µB , Fz = −aµz = − 21 ge aµB .
With a > 0 we see that the upper beam emerging from the SG magnet corresponds to
measuring Sz = + 12 h̄ (spin up), while the lower beam corresponds to Sz = − 21 h̄ (spin
down).1
We note that the gyromagnetic factor gp = 5.59 for the proton deviates quite much from the
factor ge ≈ 2 for the electron. This has to do with the fact that the proton is a composite
particle, consisting of two u-quarks with charge q = 2e/3 and one d-quark with q = −e/3.
The neutron is also a composite particle, consisting of one u-quark and two d-quarks, and
then it is perhaps not so surprising that it has a non-zero magnetic moment,
e
µn = −3.83 Sp (gn = −3.83), (T12.17)
2mp
are all three-quark systems. Because the quarks have spin 21 , it follows from the rules for the
“addition of angular momenta” (which we shall treat later) that the baryons can only have
“half-integral” spins (1/2, 3/2, etc). The other group consists of the mesons (π, K, ρ, ω, · · ·),
which are quark-antiquark systems, and which therefore can only have integral spins (e.g. 0
for the π mesons and 1 for the ρ mesons).
1
12.2 Formalism for spin 2
(Hemmer 8.3, Griffiths 4.4, B&J 6.7–8)
1
12.2.a “Ladder” of ket vectors for spin 2
Even if we do not quite “understand” what the spin is, we have a perfectly applicable theo-
retical model in the abstract ket-vector formalism for angular momenta which was developed
in Lecture notes 11, in the sense that we can apply it to processes involving the spin and
accurately predict the outcome of experiments.
The starting point is (T11.38) and (T11.39), which for the spin (with J
b = S,
b and with
1
j = s = 2 ) take the form
b 2 | 1 , mi = 3 2
S 2 4
h̄ | 12 , mi,
and
| 12 , − 21 i ≡ |− 21 i ≡ |−i ≡ |↓i (spin down).
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 11
Note that we may choose the labels as we wish. Thus we may use e.g. only m or only the sign
of m; without stating explicitly that s = 21 . These two vectors form a basis of orthogonal
and normalized vectors; they are the only eigenvectors of the Hermitian operator Sbz . Thus
the state space of this half-integral spin is a two-dimensional vector space, and is spanned
by a ladder with only two rungs:
The ladder operators Sb+ and Sb− take us up and down in the ladder: According to (T11.52)
we have (with j = s = 12 )
q
Sb± |mi = h̄ ( 12 ∓ m)( 32 ± m) |m ± 1i, m = ± 12 ,
or explicitly:
Sb+ |+ 21 i = 0 ; Sb+ |− 12 i = h̄ |+ 12 i ;
The last two formulae are particular for spin 21 and are of course due to the fact that we
have a ladder with only two rungs.
Note that as long as we consider only the spin, that is, do not take into account the other
degrees of freedom for the particle, the two vectors |+ 12 i and |− 12 i are a complete set; the
spin space is a two-dimensional sub-space of the Hilbert space. The completeness relation
of this set is (cf (T10.29))
X
|mihm| = |+ 12 ih+ 21 | + |− 12 ih− 21 | = 11. (T12.23)
1
m=±
2
Applying this unit operator we may expand an arbitrary vector |χi in the two-dimensional
spin space:
X X
|χi = |mihm|χi = hm|χi |mi = h+ 12 |χi |+ 12 i + h− 12 |χi |− 21 i
m m
≡ a+ |+ 12 i + a− |− 21 i.
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 12
a∗+ h+ 21 | + a∗− h− 12 |
hχ|χi ≡ hχ| · |χi = · = |a+ |2 + |a− |2 = 1.
a+ |+ 12 i + a− |− 12 i
(T12.24)
1
Note that the coefficient (or projection) a+ = h+ 2 |χi is the probability amplitude of
measuring spin up (Sz = 12 h̄) and leaving the spin in the state |+ 21 i. In the same manner,
a− = h− 12 |χi is the amplitude of “measuring spin down”.
which we denote simply by χ, and which is called a spinor. The basis vector|+ 12 i is repre-
sented by 3 ! !
h+ 12 |+ 12 i 1
= ≡ χ+ (T12.26)
h− 12 |+ 12 i 0
and the spin-down vector |− 21 i is represented by
! !
h+ 12 |− 12 i 0
= ≡ χ− . (T12.27)
h− 12 |− 12 i 1
1
These are known as the Pauli spinors, after Pauli, who invented the theory for spin 2
in
1925. We note that these spinors are orthonormal,
χ†0 χ = δ0 .
The adjoint of the ket vector |χi = a+ |+ 21 i + a− |− 12 i, which is (|χi)† = hχ|, is rep-
resented by the adjoint matrix (transpose and complex conjugate), that is, by the row
matrix
χ† = (a∗+ a∗− ).
The normalization condition may then be written as
!
a+
χ† χ = (a∗+ a∗− ) = |a+ |2 + |a− |2 = 1,
a−
3
Note: In the literature it is not uncommon to use an equality sign between the vector |χi and the column
matrix or spinor χ:
a
|χi = χ ≡ ( + ).
a−
The first equality sign should be read as “represented by”.
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 13
hm0 |S
b 2 |mi = 3 h̄2 δ 0
4 mm and hm0 |Sbz |mi = h̄m δm0 m ,
or
! !
3 1 0 3 1 0
S = h̄2
2
≡ h̄2 11 and Sz = 1
h̄ ≡ 12 h̄σz .
4 0 1 4 2 0 −1
(T12.28)
!
++ +−
Note that the matrix elements are “numbered” acording to the scheme , and
−+ −−
that 11 here stands for the unit matrix. Note also that the diagonal elements are identical
to the eigenvalues, and hence to the possible results for measurements of S2 and Sz ; a
measurement of Sz can only give + 21 h̄ or − 12 h̄, no matter which state the spin is prior to the
measurement.
To find the matrices representing the remaining operators we take (T12.21) as the starting
point. Projecting these relations on |+ 12 i and |− 12 i we get for example
In this manner we find that the ladder operators Sb+ = Sbx + iSby and Sb− = Sbx − iSby are
represented by the matrices
! !
1 0 2 1 0 0
S+ = 2
h̄ and S− = 2
h̄ . (T12.29)
0 0 2 0
By adding and subtracting these we find at last the matrices for Sbx and Sby :
! !
1 0 1 1 1 0 −i
Sx = 2
h̄ ≡ 2
h̄σx and Sy = 2
h̄ ≡ 12 h̄σy . (T12.30)
1 0 i 0
Here we observe that the matrices Sx and Sy become non-diagonal, which was of course to be
expected, since Sbx , Sby and Sbz do not commute, and since we have chosen the eigenvectors
of Sbz as our basis. To gain some experience with these matrices you should now check the
eigenvalue equations
Sz χ± = ± 12 h̄ χ± , (T12.31)
which correspond to (T12.20), and also the relations
S+ χ+ = 0, S+ χ− = h̄ χ+ , etc,
A small exercise: Check that these matrices satisfy [Sx , Sy ] = ih̄Sz , that is,
the angular-momentum algebra.
Another exercise: Show that the matrix squares Sx2 , Sy2 and Sz2 are equal.
What does this tell us about the possible eigenvalues of Sx and Sy , and hence
about possible measurement results?
Solution: If you square the matrices (T12.30), you will find that
h̄2
Sx2 = Sy2 = ·11 = Sz2 . (T12.32)
4
This implies that the eigenvalues of Sx and Sy are ± 21 h̄, just as for Sz . Thus the possible
measured values for the x- and y-components of S are the same as for Sz . This should of
course be expected from symmetry considerations; we are free to choose the orientation of
our coordinate system, and then there can be no difference between the possible measured
values for Sx and Sz .
! ! !
0 1 0 −i 1 0
σx = , σy = and σz = , (T12.33)
1 0 i 0 0 −1
which are called the Pauli matrices. The matrix representation of the spin operator may
then be written as
S = 21 h̄σ; σ ≡ êx σx + êy σy + êz σz .
The eigenvalue equation (T12.20) takes the form
σz χ+ = 1 · χ+ , σz χ− = −1 · χ− . (T12.34)
For future use we include some rules of calculation for the Pauli matrices. The angular-
momentum algebra [Sx , Sy ] = ih̄Sz takes the form
(If an index occurs more than twice in a term we have made a mistake).
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 15
σx σy + σy σx = 0, etc.
See the relations (T12.22) which imply that Sx and Sy satisfy the equation
From this we can derive the following relation which holds for arbitrary vectors a and b:
that is,
Note that the relations (T12.38) are special cases of this formula, and that it corresponds to
h̄2 h̄2
!
2 1 0
(S·n̂) = = ·11. (T12.43)
4 0 1 4
This formula is a generalization of (T12.32), and had to be expected considering the sym-
metry argument above; no matter which component S·n̂ of the spin we choose to measure,
the result must be 21 h̄ or − 12 h̄.
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 16
5
h Sz i becomes equal to 12 h̄ because χ+ is an eigenspinor with sharp Sz = 12 h̄. That h Sx i and h Sy i
become equal to zero is not strange. It would be strange if h S i were to point in another direction than ẑ
after the preparation of the state χ+ .
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 17
These states are illustrated in the figure, which is equivalent to that used for the orbital
angular momentum
q in Lecture notes 5. This picture is of limited value. It gives the correct
|S| (= h̄ 3/4 ) and Sz = ± 21 h̄, and also Sx2 + Sy2 = S2 − Sz2 = 12 h̄2 . The picture may
also (barely) remind us that the expectation values of Sx and Sy are equal to zero for both
the two states χ± ,
h Sx i = h Sy i = 0.
The drawback of this illustration is that it can mislead us to believe that the direction of S is
an observable, which it is not, because the componentsq are not compatible; they can not have
sharp values simultaneously. The angle arccos( 2 / 34 ) = 54.7◦ or 125.3◦ between S and the
1
The solution is that the eigenvalues are ± 12 h̄ (the same as for Sz ) and that the eigenstates
with spin directions x̂ and −x̂ are respectively
! !
1 1 1 1
χx̂ = √ and χ−x̂ =√ . (T12.46)
2 1 2 −1
with the same eigenvalues. In these spinors we have chosen the phases in such a way that
the upper components are real and positive.
is rotated instead. The “up” direction of the device (the original z direction) in the new
coordinate system then becomes a unit vector
n̂ ≡ x̂ sin θ cos φ + ŷ sin θ sin φ + ẑ cos θ, (T12.48)
characterized by the angles θ and φ :
In this coordinate system, the spin directions in the upper/lower beam are h σ i = ±n̂; the
spin directions are physical and measurable and remain fixed together with the device.
Viewed from the new coordinate system we are now measuring the spin component S·n̂.
The two possible measured values are of course unchanged, and so are the physical states, but
the matrix representations of these states are changed, because the new coordinate system
corresponds to a change of basis; the Pauli spinors
! !
1 0
χ+ = and χ− =
0 1
now correspond to spin up and spin down related to the new ẑ direction.
These matrix representations, which correspond to spin “up” and spin “down” in the n̂
direction, are determined by the eigenvalue equations
S·n̂ χ±n̂ = ± 12 h̄ χ±n̂ ⇐⇒ σ·n̂ χ±n̂ = ± χ±n̂ . (T12.49)
!
a
It is sufficient to solve the equation σ·n̂ χn̂ = χn̂ . We set χn̂ = . This spinor
b
satisfies the eigenvalue equation
!
a
(σ·n̂ − 11) χn̂ = (σx nx + σy ny + σz nz − 11)
b
! !
cos θ − 1 sin θ e−iφ a
= = 0. (T12.50)
sin θ eiφ − cos θ − 1 b
As you can easily check, both the upper and the lower components of this equation are
satisfied when
(cos θ − 1)a + sin θ e−iφ b = 0,
that is, for
b 1 − cos θ iφ sin 12 θ eiφ
= e = . (T12.51)
a sin θ cos 21 θ
This complex ratio between the lower and upper components is in fact all we need to know
about the spinor. Admittedly, the two complex numbers a and b contain four real parame-
ters, but the normalization condition |a|2 + |b|2 = 1 reduces the number to three, and one
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 19
parameter may be allowed to be free, in the form of a common phase factor which is of no
physical significance. Explicitly we have from the normalization condition
2
1 b sin2 12 θ 1
=1+ =1+ 1 = ,
|a|2 a 2
cos 2 θ cos2 12 θ
that is,
a = eiα cos 21 θ,
where the phase α can be chosen freely. Hence
b
b = a· = eiα sin 12 θ eiφ ,
a
so that
!
iα cos 12 θ
χn̂ = e , (T12.52)
sin 12 θ eiφ
A small exercise: Show that the other state χ−n̂ (with spin down compared to
the direction n̂) may be written as
! !
iα sin 12 θ iα0 − sin 12 θ e−iφ
χ−n̂ = e =e . (T12.53)
− cos 12 θ eiφ cos 21 θ
There are several ways to do this: You may (i) change the sign of the eigenvalue
in (T12.49) and (T12.50) and proceed as above, or (ii) use that χ−n̂ must be
orthogonal to χn̂ . A third method is (iii) to notice that this state has spin up
in the direction n̂0 = −n̂, which corresponds to the angles θ0 = π − θ and
φ0 = φ + π.
h σx i = χ† χ = <e(2a∗ b),
h σz i = |a|2 − |b|2 ,
b) Show that the length of the real vector h σ i is equal to 1. [Hint: [<e(a∗ b)]2 +
[=m(a∗ b)]2 = |a∗ b|2 = |a|2 |b|2 .]
d) Check that h σ i = n̂ for the state χn̂ , by inserting a and b from (T12.52).
E± = ± 21 h̄ω.
TFY4250/FY2045 Lecture notes 12 - Magnetic moments. Spin 21
The state χ−ẑ has h µ i parallell to the B field, and therefore is the ground state, with
the energy E− = − 21 h̄ω. The first (and only) excited state has spin up and therefore h µ i
antiparallell to the B field.
Each of these energy eigenstates corresponds to a stationary state,
χ± (t) = e−iE± t/h̄ χ±ẑ = e∓iωt/2 χ±ẑ . (T12.57)
In these stationary states nothing “happens”, so h µ i stays parallell or antiparallell to B the
whole time.
On the other hand, if we consider the superposition
! !
a0 e−iωt/2 a
χ(t) = a0 χ+ (t) + b0 χ− (t) = ≡ , (T12.58)
b0 eiωt/2 b
then this is a non-stationary state where things “happen”. If we assume for simplicity that
a∗0 b0 is real, we have
a∗ b = a∗0 b0 eiωt .
Inserting into (T12.55) we then find that
A small challenge: Use the formula ((4.19) in Hemmer) for the time develop-
ment of expectation values,
d iD c b E
hF i = [H, F ] ,
dt h̄
to derive the formulae (T12.60).