Lecture-Notes: Book Reference
Lecture-Notes: Book Reference
Analytical Mechanics
with animations in MAPLE
Rouben Rostamian
Department of Mathematics and Statistics
UMBC
rostamian@umbc.edu
December 2, 2018
ii
Contents
Preface vii
5 Lagrangian mechanics 25
5.1 Newtonian mechanics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
5.2 Holonomic constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
5.3 Generalized coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
5.4 Virtual displacements, virtual work, and generalized force . . . . . . . 30
5.5 External versus reaction forces . . . . . . . . . . . . . . . . . . . . . . . . . 32
5.6 The equations of motion for a holonomic system . . . . . . . . . . . . . 33
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
6 Constraint reactions 37
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
7 Angular velocity 43
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
iii
iv Contents
12 Quaternions 91
12.1 The quaternion algebra . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
12.2 The geometry of the quaternions . . . . . . . . . . . . . . . . . . . . . . . 92
12.2.1 The reflection operator . . . . . . . . . . . . . . . . . . . . . 92
12.2.2 The rotation operator . . . . . . . . . . . . . . . . . . . . . . 93
12.3 Angular velocity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
12.4 A differential equation for the quaternion rotation . . . . . . . . . . . . 97
12.5 Unbalanced ball rolling on a horizontal plane . . . . . . . . . . . . . . . 98
12.5.1 The no-slip condition . . . . . . . . . . . . . . . . . . . . . . 98
12.5.2 The Gibbs function and the equations of motion . . . . 99
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
vi Contents
Preface
Unless otherwise specified, by “solving a problem” I mean performing all the steps
laid out below:
1 P
4. Compute the kinetic energy T = 2 i mi kvi k2 , the potential energy V , and the
Lagrangian L = T − V .
5. Form the equations of motion (a system of second order differential equations
(DEs)) in the unknowns q1 (t ), q2 (t ), . . .:
d ∂L ∂L
= , j = 1, 2, . . . .
d t ∂ q̇ j ∂ qj
If done by hand, this step would be the most labor-intensive part of the calculations.
The calculations can get unbearably complex and can easily lead to formulas that
fill more than one page. Fortunately we can relegate the tedious computations to
MAPLE.1
6. Solve the system of DEs. Except for a few special cases, such system are generally
not solvable in terms of elementary function. One solves them numerically with
the help of specialized software such as MAPLE (or MATHEMATICA).
The software replaces the continues time variable t by a closely spaced “time ticks”
t0 , t1 , t2 , . . . which span the time interval of interest, say [0, T ], and then it ap-
plies some rather sophisticated numerical algorithms to evaluate the unknowns
q1 (t ), q2 (t ), . . . at those time ticks. The result may be presented as:
(a) a table of numbers; but that’s not very illuminating, so it’s rarely done that
way;
1
Nowadays M APLE and M ATHEMATICA are the two dominant Computer Algebra Systems. If you are famil-
iar with M ATHEMATICA, you should be able to translate the M APLE commands in this book into the equivalent
M ATHEMATICA commands.
vii
viii Preface
(b) as a set of plots of q j versus t . This is the most common way. Both MAPLE
(and MATHEMATICA) can do this easily; or
(c) as a computer animation, which is the most “user friendly” choice but which
takes some work—and a certain amount of know-how—to produce. I will
show you how to do this in MAPLE.
Chapter 1
An introduction through
examples
This chapter introduces some of the basic ideas involved in the Lagrangian formulation
of dynamics through examples. You will need to take some of the statements and formu-
las for granted since they won’t be formally introduced until several chapters later. The
objective here is to acquire some “gut feeling” for the subject which can help to motivate
some of the abstract concepts that come later.
1
2 Chapter 1. An introduction through examples
eϕ i x
er
j
ϕ
ϕ
ℓ
−τe r
y
mg w = mgj
Figure 1.1: On the left is a depiction of the physical shape of the pendulum. On the
right we see the mathematical machinery devised to analyze the pendulum’s
motion. The unit vectors i and j are attached to the fixed Cartesian coordi-
nates system and are stationary; the unit vectors e r and eϕ move with the
pendulum. The weight of the bob is w = m g j.
bob’s initial velocity is small. Then the pendulum will oscillate back and forth about the
stable configuration, similar to what we see in a grandfather clock. If the initial velocity
is slightly larger, the pendulum will undergo wider oscillations. If, however, the initial
velocity is sufficiently large, the pendulum will not oscillate at all. It will swing about
pivot, reach the unstable equilibrium position at the top and go past it, fall down from the
other side, and return to its initial position, having made a complete 360 degree rotation
about the pivot. At this point the pendulum finds itself in the same condition that it had
at the initial time, therefore it will repeat what it did the first time around. In the absence
of energy dissipating factors, the rotations about the pivot will continue indefinitely.
To make a mathematical model of the pendulum, we introduce the Cartesian coordi-
nates xy with the origin at the pendulum’s pivot, and the y axis pointing down. We also
introduce the stationary unit vectors i and j along the x and y axes, and the moving unit
vectors e r along the pendulum’s rod and eϕ which is perpendicular to it, as shown in Fig-
ure 1.1. It it evident that the vectors e r and eϕ may be expressed as linear combinations
of the vectors i and j:
Furthermore, let us observe that their time derivatives are related through
ė r = iϕ̇ cos ϕ − j ϕ̇ sin ϕ = −ϕ̇eϕ , ėϕ = iϕ̇ sin ϕ + j ϕ̇ cos ϕ = ϕ̇e r . (1.1)
The bob’s position vector r(t ) relative to the origin is r = ℓe r , where ℓ is the length of
the rod, and therefore the bob’s velocity v = ṙ and acceleration a = v̇ may be computed
easily with the help of (1.1):
We see that the bob’s acceleration has a component along eϕ and another along e r .
Newton’s law of motion asserts that ma = F , where F is the resultant of all forces
acting on the bob. Referring to Figure 1.1 we see that the forces acting on the bob consist
1.2. The simple pendulum à la Euler 3
of weight w and the tension −τe r along the rod,3 where τ generally varies with time and
is unknown. It follows that
m(−ℓϕ̈eϕ − ℓϕ̇ 2 e r ) = w − τe r .
The weight, however, is w = m g j, where m is the mass of the bob and g is the acceler-
ation due to gravity. We replace w with its decomposition w = m g j = (m g cos ϕ)e r +
(m g sin ϕ)eϕ in the equation of motion, and collect the coefficients of e r and and eϕ , and
arrive at
mℓϕ̈ + m g sin ϕ eϕ + mℓϕ̇ 2 + m g cos ϕ − τ e r = 0.
Since e r and eϕ are orthogonal, hence linearly independent, each of the expressions in
the square brackets is zero. We conclude that
mℓϕ̈ + m g sin ϕ = 0, mℓϕ̇ 2 + m g cos ϕ − τ = 0. (1.2)
The first equation is a second orderdifferential equation in the unknown ϕ. It has a unique
solution for any initial condition ϕ(0), ϕ̇(0) , although the solution is not expressible in
terms of elementary functions. In practice, one solves the equation through a numerical
approximation algorithm on a computer. Once the solution ϕ(t ) is obtained, it may be
substituted in the second equation to evaluate the tension τ(t ) in the rod, should it be of
interest.
1
• A the kinetic energy T of a point mass m moving with velocity v is T = 2 mkvk2 .
1
In the case of the pendulum this is T = 2 mℓ2 ϕ̇ 2 .
• The potential energy V of a point mass m in a constant gravitational field equals
m g h where g is the acceleration due to gravity, and h is its height above an arbitrar-
ily selected reference point. In the case of the pendulum, the elevation of the bob rel-
ative to the lowest point in its path is h = ℓ(1−cos ϕ), therefore V = m g ℓ(1−cos ϕ).
• The Lagrangian L of a mechanical system is the difference between its kinetic and
potential energies, that is, L = T − V . In the case of the pendulum we have:
1
L(ϕ, ϕ̇) = mℓ2 ϕ̇ 2 − m g ℓ(1 − cos ϕ). (1.3)
2
As the notation above indicates, we are viewing the Lagrangian L as a function two
variables ϕ and ϕ̇. It should be emphasized that ϕ and ϕ̇ are considered independent
variables here.4
The Lagrangian completely characterizes a mechanical system. It incorporates the
system’s parameters, geometry, and physics, all in one neat bundle. Beyond this point the
analysis of the system’s motion is pure calculus—or analysis, as Lagrange called it in his
Mécanique Analytique—with no need to refer to the system’s components and geometry.
According to Lagrange’s theory which we will later study in detail, the equation of
motion of a mechanical system whose Lagrangian depends on two variables ϕ and ϕ̇, is
given by
d ∂L ∂L
= . (1.4)
d t ∂ ϕ̇ ∂ϕ
In the case of pendulum we have:
∂L ∂L
= mℓ2 ϕ̇, = −m g ℓ sin ϕ,
∂ ϕ̇ ∂ϕ
or equivalently,
g
ϕ̈ +
sin ϕ = 0, (1.5)
l
which agrees with the first equation in (1.2). The second of those equations may be ob-
tained through the Lagrangian approach as well, but we will not get into that right now.
i x
ℓ1 j r1 r2
ϕ1 ϕ1
ℓ2
y
ϕ2 ϕ 2 ℓ2
m1 g m1 g j
m2 g m2 g j
Figure 1.2: On the left is a depiction of the physical shape of the double pendulum. On
the right we see the pendulum’s mathematical model given by the position
vectors r1 and r2 of the two bobs.
In the previous section’s simple pendulum, the Lagrangian L(ϕ, ϕ̇) was a function
two variables. In the present case, the Lagrangian L(ϕ1 , ϕ2 , ϕ˙1 , ϕ̇2 ) is a function of four
variables. In general, if a mechanical system’s geometric configuration is specified through
n variables q1 , . . . , qn , then its Lagrangian is a function of 2n variables q1 , . . . , qn , q̇1 , . . . , q̇n .
The equivalent of the single equation of motion (1.4) now is a system of n equations, called
the mechanical system’s Euler–Lagrange equations:
d ∂L ∂L
= , i = 1, . . . , n.
d t ∂ q̇i ∂ qi
The variable q1 , . . . , qn are called the system’s generalized coordinates.
Applied to the case of double pendulum, the Euler–Lagrange equations lead to
d ∂L ∂L d ∂L ∂L
= , = .
d t ∂ ϕ̇1 ∂ ϕ1 d t ∂ ϕ̇2 ∂ ϕ2
To evaluate these explicitly, we begin by computing
∂L
= (m1 + m2 )ℓ21 ϕ̇1 + m2 ℓ1 ℓ2 ϕ˙2 cos(ϕ2 − ϕ1 ),
∂ ϕ̇1
∂L
= m2 ℓ22 ϕ̇2 + m2 ℓ1 ℓ2 ϕ̇1 cos(ϕ2 − ϕ1 ),
∂ ϕ̇2
∂L
= m2 ℓ1 ℓ2 ϕ̇1 ϕ̇2 sin(ϕ2 − ϕ1 ) − (m1 + m2 ) g sin ϕ1 ,
∂ ϕ1
∂L
= −m2 ℓ1 ℓ2 ϕ̇1 ϕ̇2 sin(ϕ2 − ϕ1 ) − m2 g ℓ2 sin ϕ2 .
∂ ϕ2
We conclude that the differential equations of motion are
·
(m1 + m2 )ℓ21 ϕ̇1 + m2 ℓ1 ℓ2 ϕ˙2 cos(ϕ2 − ϕ1 )
= m2 ℓ1 ℓ2 ϕ̇1 ϕ̇2 sin(ϕ2 − ϕ1 ) − (m1 + m2 ) g sin ϕ1 ,
·
m2 ℓ22 ϕ̇2 + m2 ℓ1 ℓ2 ϕ˙1 cos(ϕ2 − ϕ1 )
= −m2 ℓ1 ℓ2 ϕ̇1 ϕ̇2 sin(ϕ2 − ϕ1 ) − m2 g ℓ2 sin ϕ2 .
Exercises
1.1. Pendulum with a mobile pivot. Figure 1.3 shows a pendulum whose pivot is
allowed to move horizontally without friction. The pivot has mass m1 while the
bob has mass m2 . Find the equations of motion of the pendulum.
1.2. A spherical pendulum. The motion of the simple pendulum of length ℓ intro-
duced in this chapter was confined to a single vertical plane, and therefore the
pendulum’s bob moved along a circular arc of radius ℓ. If off-plane motions are
permitted, then the bob will move on a sphere of radius ℓ centered at the pivot. In
that setting the pendulum is called a spherical pendulum; see Figure 1.4.
Derive the equations of motion of the spherical pendulum.
1.3. Bead on a spinning hoop. A circular wire hoop of radius R spins about a vertical
diameter at a constant angular velocity Ω. A bead of mass m can slide without fric-
tion along the hoop. The hoop’s radius that connects to the bead makes an angle
Exercises 7
x
m1
x
ϕ ℓ
m2
i
j
θ x
y k
ϕ ℓ
of ϕ(t ) with respect to the vertical; see Figure 1.5. Find the differential equation
satisfied by ϕ.
1.4. A governor mechanism. Figure 1.6 is a schematic drawing of a (simplified) Watt
governor which was invented for the automatic control of the speed of steam en-
gines. Our version consists of four massless rigid links of length ℓ each, hinged at
their ends to form a rhombus. The vertex O remains motionless, while the sleeve
at vertex S can slide on the device’s vertical shaft, thereby change the rhombus’s
shape. Two balls of mass m1 each are attached to the vertices A and B. The sleeve’s
mass is m2 . The entire assembly rotates at a constant angular speed Ω about the
vertical shaft. Find the differential equation satisfied by the angle ϕ marked on the
diagram.
1.5. Two masses on a string. A particle P of mass m1 lies on a smooth horizontal table
and is attached to a long, inextensible string which passes through a smooth hole
O in the table and hangs down. The other end of the string carries a particle Q of
mass m2 ; see the illustration in Figure 1.7.
The particle P is positioned at the point (a, 0, 0) in the xy z coordinates shown, and
given a horizontal initial velocity perpendicular to the x axis. Find the differential
equations of motion.
Hint: Let ρ(t ) and ϕ(t ) be P ’s position at time t in polar coordinates as seen in
Figure 1.7. The equations of motions constitute a system of differential in ρ(t ) and
ϕ(t ) .
8 Chapter 1. An introduction through examples
R
ϕ
mg
ℓ ℓ
ϕ
A Ω B
m1 m1
ℓ ℓ
m2 S
y P
m1
ρ(t ) ϕ(t ) x
Q m2
Figure 1.7: The point P slides on the table. The point Q moves vertically (Exercise 1.5).
Chapter 2
Work is the product of force and its displacement. To be precise, the infinitesimal work
dW performed in displacing a force F by an infinitesimal distance d r is dW = F · d r.
If the point of the application of the force moves along a path C in space, then the work
performed along the path is the line integral
Z
W= F · d r. (2.1)
C
If you are repositioning a massive desk in an office, for example, then work measures the
amount of effort exerted by you in performing the task.
Expanding upon the moving of the desk scenario, suppose that you intend to move
the desk from a point A to a point B. It should be obvious that the amount of work
performed will vary, depending on the path along which you move the desk between A
and B. Chances are that the shortest (straight line) path will require lesser effort than a
long path that winds around the office.
There are many interesting and important situations where, unlike the moving of the
desk example, the work performed in goings from a point A to a point B is independent of
the path taken between A and B. The most elementary example is the raising or lowering
of a weight. To see how it works, set up a Cartesian coordinate system in space so that
the x and y axes are horizontal, and the z axis points up. Let ra = (xa , ya , za ) and r b =
(x b , y b , z b ) be the position vectors5 of the starting and ending points A and B, and let
r = 〈x, y, z〉 be the position vector of a generic point along a path C (A, B) with endpoints
A and B. Suppose that we move an object of mass m along that path. The force of the
object’s weight is F = 〈0, 0, −m g 〉, where g is the acceleration of gravity. The work
performed along the path is
Z
W= F · dr
C (A,B)
Z Z
= 〈0, 0, −m g 〉 · 〈d x, d y, d z〉 = −m g d z = −m g (z b − za ).
C (A,B) C (A,B)
We see that the work in moving the weight from A to B is expressed in terms the z co-
ordinates of the endpoints, thus it is the same on all conceivable paths that go from A
to B.
5
A position vector of a point P (x, y, ) is the vector r = 〈x, y, z〉 that extends from the origin to the point P .
9
10 Chapter 2. Work and potential energy
To generalize, consider a (possibly position dependent) force field F (r) with the prop-
erty that the work performed in going from a given point A to an arbitrary point r in space
is independent of the path from A to r. Let us write V (r) for the negative of the value of
that integral, that is, Z
V (r) = − F (r ′ ) · d r ′ . (2.2)
C (A,r)
The function V defined this way is called the potential function, or simply the potential, of
the the vector field F . Equivalently, the vector field F is said to be derived from a potential.
In (2.2) I have written r ′ for the dummy variable of integration in order to distinguish it
from the position vector r which designates the path’s endpoint. The minus sign does
not have a deep significance; it’s convenient to build it into the definition since it leads to
more pleasing forms of general statements, such as the one on conservation of energy.
Theorem 2.1. Consider a continuous vector field F defined in an open and connected do-
main D in the n-dimensional space, and suppose that F possesses a potential function V as
in (2.2). Then V is differentiable and F (r) = −∇V (r).
Proof. By definition, the gradient ∇V of a function V at a point r is the vector with the
property that for any unit vector e, the directional derivative of V in the direction of e is
given by ∇V (r) · e. That is,
V (r + he) − V (r)
∇V (r) · e = lim .
h→0 h
To simplify the discussion, let us write P and Q for the points in space corresponding
to the position vectors r and r + he, as illustrated in Figure 2.1. Pick a path C (A, r) to
evaluate V at P , then extend that path as a straight line segment to Q to evaluate V at Q.
Then the difference V (Q) − V (P ) amounts to an integration along the straight segment
P Q:
Zh
V (r + he) − V (r) = V (Q) − V (P ) = − F (r + ξ e) · e d ξ ,
0
Then it follows that
Z h
V (r + he) − V (r) 1
=− F (r + ξ e) · e d ξ = −F (r + ξˆe) · e
h h 0
for some ξˆ ∈ (0, h), the latter assertion being a consequence of the Mean Value Theorem
for integrals; see e.g., Stewart [13].
As h goes to zero, so does ξˆ because ξˆ ∈ (0, h). It follows that
V (r + he) − V (r)
∇V (r) · e = lim = −F (r) · e.
h→0 h
Since this holds for every e, it follows that ∇V (r) = −F (r).
Remark 2.1. Let is point out the roles that the theorem’s technical assumptions play in
the proof:
• The continuity of the vector field F enters at the point where the Mean Value The-
orem is applied. That theorem is not true without continuity.
11
P Q
e
)
r
A,
C(
he
r
r+
Figure 2.1: If V (P ) is evaluated by integration along the path C (A, r), then V (Q) may
be evaluated by integrating along that same path, and then continuing along
the straight line segment P Q of length h in the direction e.
• The assumption that the domain D is connected is needed to ensure that a path may
be established between any pair of points in D. That’s what enabled us to sketch
the curve C (A, r) that connects the points A and P in Figure 2.1.
• The assumption that the domain D is open means that a ball of positive radius may
be placed around any point P ∈ D so that the ball lies entirely within D. It’s that
property which ensures that moving away from P by a small distance h, as we did
in Figure 2.1, we land safely on a point Q which lies within D.
Remark 2.2. Adding a constant to the potential function V does not affect the equality
F (r) = ∇V (r). Thus, a vector field’s potential, if it has one, is defined modulo an additive
constant.
Example 2.2. Earlier in the this section we observed that the force field corresponding to
an object’s weight is F = 〈0, 0, −m g 〉. We see that F (r) = −∇V (r) where V (x, y, z) =
m g z. We will use m g z as the potential of a weight throughout these notes. Note the
effect of the minus sign in (2.2); in its absence the potential of a weight would have been
−m g z.
Example 2.3. In the previous example we assumed that the acceleration of gravity g is a
constant. That’s a good assumption if the changes in height during the motion are small
relative to the radius of the Earth. In general, the gravitational force that a point mass M
exerts on a point mass m drops as the inverse square of the distance. Specifically, Newton’s
law of gravitation says
GM m r
F (r) = − . (2.3)
krk 2 krk
where r is m’s position vector relative to M , and G is the universal gravitational constant.
The inverse square law is manifested through the krk2 term that appears in the denomi-
nator inside the parentheses. The factor r/krk is a unit vector that points from M to m.
It is possible to show (see Exercise ??) that F is derived from a potential.
Theorem 2.4. Suppose the force field F is derived from a potential V . Then the work per-
12 Chapter 2. Work and potential energy
formed in moving the force along any path from a point ra to r b is given by
W = V (ra ) − V (r b ). (2.4)
therefore Z Z rb
W= F · dr = −dV = V (ra ) − V (r b ).
C ra
Exercises
2.1. Verify that the gravitational potential field F (r) in (2.3) is derived from the poten-
tial
GM m
V (r) = .
krk
Chapter 3
A single particle in a
conservative force field
Remark 3.1. The kinetic and potential energies don’t remain constant during the mo-
tion; it’s their sum that does. Therefore the reduction of one is accompanied by the in-
crease of the other. It helps to think of this as a conversion of one form of energy to
the other. The myriad of motion phenomena encountered in our daily experiences are
manifestations of such interplay between the kinetic and potential energies.
Remark 3.2. The conservation of the total energy E is a consequence of the assumption
that the force field F is derived from a potential. This should explain the alternative name,
13
14 Chapter 3. A single particle in a conservative force field
a conservative force field, which is commonly used to refer to a force field derived from a
potential.
Remark 3.3. Had we chosen against putting the minus sign in the definition (2.2),
the principle of conservation of energy would have stated that the difference between the
kinetic and potential energies remains constant, which is not as appealing as saying that
their sum remains a constant.
Expect for the most trivial cases, the integral on the left is impossible to evaluate in
terms of elementary functions. It is possible, however, to obtain quite an adequate “feel”
for the solution, without performing any integration at all, through a phase plane analysis
of the equation.
To explain the idea, consider the potential function V whose graph is shown in Fig-
ure 3.1(a). Regard the solution x(t ) of the differential equation (3.3) as the abscissa of a
point P that moves along the horizontal axis in that figure according
to the equation’s
dynamics. Then the point Q with coordinates x(t ),V (x(t ) slides on the graph of V
accordingly. The length of the line segment P Q equals the potential energy V (x). We ex-
tend that segment upward to a point R so the the length of QR equals the kinetic energy
1
2
m ẋ 2 . Since the sum of the kinetic and potential energies remains a constant E during
the motion, the locus of the point R is the horizontal line V = E, as marked on the figure.
The point Q cannot rise above the horizontal line V = E during the motion because
1
the nonnegative length of the line segment QR (which equals 2 m ẋ 2 ) prevents it. Conse-
quently, the motion of Q is confined to the graph’s red-colored arc. We refer to that arc as
a potential well corresponding to the energy E and we think of Q as a point that has fallen
into the well and is unable to get out.
At the edges of the potential well the potential energy equals E and the kinetic en-
ergy, and therefore the velocity ẋ, are zero. In the interior, where the potential energy
3.2. The scalar case 15
V (x)
1
2
m ẋ 2
R
(a) E
V (x)
Q
P x
ẋ
(b)
x
drops below E, the kinetic energy, and therefore the velocity squared, ẋ 2 , are positive.
We conclude that as we traverse the potential well from left to right, the velocity begins
at zero, increases gradually (in absolute value) to a maximum, then drops back to zero at
the rightmost end. The sign of the velocity may be positive or negative since the only
information we are getting from Figure 3.1(a) is on the square of the velocity.
This observation leads to the red oval in the diagram shown in Figure 3.1(b). The
horizontal axis in that figure is the same as the x in Figure 3.1(a). The vertical axis is
the velocity ẋ. Observe that at the leftmost and rightmost points of the oval, which
correspond to the extremes of the potential well, the velocity is zero, and in between in
rises to a maximum (or falls to a minimum), as we expect. The oval is symmetric with
respect to the x axis because a given ẋ 2 yields two velocities ±|ẋ|.
The red oval constructed in the previous discourse depends on the choice of the energy
level E. It should be clear that lowering E slightly will shrink the oval, and raising E
slightly, will expand it. The black curves in Figure 3.1(b) are the result of selecting various
values of E.
Figure 3.1(b) is called the phase diagram or phase portrait of the differential equation (3.3).
The curves in it are called orbits. An alternative to the geometric construction of the or-
bits carried out above, we may equally well view them as implicitly defined curves in the
x–ẋ plane through the equation (3.4). Varying the parameter E produces the family of all
orbits, some of which are shown in Figure 3.1(b).
Yet another way of viewing the orbits is as level curves of the the surface defined by
the function
1
E(x, ẋ) = m ẋ 2 + V (x).
2
Two views of the surface E(x, ẋ) corresponding to the potential V (x) of Figure 3.1(a) are
16 Chapter 3. A single particle in a conservative force field
Figure 3.2: Two views of the energy surface E(x, ẋ) corresponding to the potential V (x)
of Figure 3.1(a). The graph on the right has been turned upside down to
bring out some of the details which are hidden in the conventional view on
the left.
3.3 Stability
We have seen that the dynamics of the equation (3.3) dictate that a hypothetical particle
trapped in an energy well cannot escape. The closer the energy level E is to the bottom
of the well, the lesser room there is for the particle to maneuver.6 In the extreme case
when the particle’s energy matches that of the well’s bottom, the particle cannot move
at all. We express this by saying that the bottom of the well is a stable equilibrium point
of the differential equation (3.3). If the energy level is increases just slightly, the particle
will move along an oval around the equilibrium point. Referring to the construction of
the phase portrait from the potential V , it should be evident that a local minimum of
V corresponds to a stable equilibrium. Similarly, a local maximum of V corresponds to a
saddle on the energy surface (see Figure 3.2) therefore a local maximum of V corresponds
to an unstable equilibrium.
V (ϕ) = 1 − cos ϕ
2
1
ϕ
ϕ −2π −π 0 π 2π
ℓ ϕ̇
ϕ
mg
Figure 3.3: The pendulum is shown on the left. The graph of the potential function
V (ϕ) = g /ℓ(1 − cos ϕ) (with g /ℓ = 1) is shown at top right. The corre-
sponding phase portrait is shown at bottom right. The function V has a
period of 2π, therefore it would have sufficed to limit the plots to the range
−π ≤ ϕ ≤ π. Outside of that range, things repeat by periodicity.
vious section applies.7 The graph of V (ϕ) and the resulting phase portrait, constructed
according to the previous section’s guidelines, are shown in Figure 3.3 on the right.
The configuration of a pendulum is completely specified by the angle ϕ at any instant.
The configuration corresponding to ϕ + 2kπ is exactly the same as that of ϕ for any
integer k. In other words, the pendulum’s configuration is determined by ϕ mod 2π. In
particular, the left and right edges of the phase portrait in Figure 3.3 correspond to the
same configuration. This is best visualized by wrapping the phase portrait into a cylinder
and gluing the left and right edges together. This is illustrated in Figure 3.4.
Exercises
3.1. Analyze the stability of the spinning hoop of Exercise 1.3. Show that the lower
equilibrium is stable if Ω is small, and unstable if it is large. Find the value of Ω
where the transition takes place.
3.2. Plot representative phase portraits for the two cases of the problem above.
7
As noted earlier, the potential function is defined within an additive arbitrary constant, therefore the “1”
in 1 − cos ϕ is immaterial; its inclusion makes V (0) = 0 which is nice but of no special consequence.
18 Chapter 3. A single particle in a conservative force field
Figure 3.4: Two views of the pendulum’s phase portrait as wrapped into a cylinder to
emphasize that the pendulum’s configuration depends on ϕ mod 2π.
Chapter 4
g aω 2
ϕ̈ + sin ϕ + sin ϕ cos ωt = 0, (4.1)
ℓ ℓ
where ϕ is the angle of the rod relative to the vertical, as shown.
If the amplitude a of the pivot’s oscillation is small, and if ω is not too large, then
we expect the system to behave similar to an ordinary pendulum, albeit with somewhat
jittery oscillations. In particular, the lower equilibrium ϕ = 0 would be stable and the
upper equilibrium ϕ = π would be unstable. A graph of the the solution ϕ(t ) of the
pendulum’s equation of motion with smallish a and ω is shown on Figure 4.1.
It is the purpose of this chapter to show that as ω is increases beyond a critical thresh-
old, the pendulum’s behavior changes drastically. Specifically, the lower equilibrium be-
comes unstable and the upper equilibrium becomes stable. Thus, the pendulum turns
around by 180 degrees, points upward, and oscillates about the ϕ = π position! Figure 4.2
shows the solution of the pendulum’s equation of motion for a relatively fast ω. Note
that the oscillation now take place about the upper equilibrium ϕ = π. The pendulum is
standing upright, pointing up!
19
20 Chapter 4. The Kapitsa pendulum
ϕ δ
ℓ t
y
π
mg −6
Figure 4.1: The pivot oscillates vertically according to a cos ωt about the pendulum’s
nominal pivot. In the schematic diagram on the left the pivot’s displacement
is exaggerated; we assume that a/ℓ is very small in our computations. When
the pendulum’s arm makes a “nominal” angle ψ with the vertical, the angle
actually oscillates rapidly in the range ψ ± δ. The graph on the right is that
of the angle ϕ(t ) obtained by solving the differential equation (4.1) with
parameters ℓ = 1, g = 1, a = 0.05, and ω = 20, and initial conditions ϕ(0) =
5π/6, ϕ̇(0) = 0. The oscillation about the lower equilibrium position (ϕ =
0) is stable since ω2 < 2g ℓ/a.
5π
4
ϕ(t )
π
3π
4
π
2
π
4
Figure 4.2: The solution of the differential equation (4.1) with parameters ℓ = 1, g = 1,
a = 0.05, and ω = 40, and initial conditions ϕ(0) = π/6, ϕ̇(0) = 0. The
oscillation about the upper equilibrium position (ϕ = π) is stable since ω2 >
2g ℓ/a. The figure on the right is an enlarged copy of a portion of the graph
on the left. We see that ϕ(t ) consists of high-frequency, small-amplitude
oscillations riding on a slowly oscillating function.
We are going to need δ’s second derivative soon, so let’s calculate it right now. We
4.2. Averaging out the fast oscillations 21
have:
a
δ̇ ≈ ψ̇ cos ψ cos ωt − ω sin ψ sin ωt ,
ℓ
a
δ̈ ≈ ψ̈ cos ψ cos ωt − ψ̇2 sin ψ cos ωt − 2ω ψ̇ cos ψ sin ωt − ω 2 sin ψ cos ωt .
ℓ
We are interested in high frequency oscillations of the pivot, that is, ω ≫ 1. Therefore,
the term with ω 2 in the expression above dominates the rest. We conclude that
aω 2
δ̈ ≈ − sin ψ cos ωt .
ℓ
Now we go to the differential equation (4.1) and replace ϕ by ψ + δ, and replace sin ϕ
with its Taylor series approximation
We get:
g aω 2
ψ̈ + δ̈ + sin ψ + δ cos ψ + sin ψ + δ cos ψ cos ωt = 0.
ℓ ℓ
We multiply out everything and replace δ̈ with the expression obtained above, and arrive
at
aω 2 g g aω 2 aω 2
ψ̈− sin ψ cos ωt + sin ψ+ δ cos ψ+ sin ψ cos ωt + δ cos ψ cos ωt = 0.
ℓ ℓ ℓ ℓ ℓ
The second and fifth terms cancel, leaving us with
g g aω 2
ψ̈ + sin ψ + δ cos ψ + δ cos ψ sin ωt = 0.
ℓ ℓ ℓ
Then we substitute for δ from (4.2):
g ag a2 ω2
ψ̈ + sin ψ + sin ψ cos ψ cos ωt + sin ψ cos ψ cos2 ωt = 0. (4.3)
ℓ ℓ2 ℓ2
In the graphs of ϕ(t ) in figures 4.1 and 4.2 we see that the period 2π/ω of the pivot’s
oscillations is much smaller than the oscillations of the pendulum itself. Consequently,
within one such time period, the value of ψ and its derivatives are essentially constants.
On the basis of this observations, we average (4.3) over one 2π/ω period, where we regard
ψ as constant. Since
Z 2π/ω Z 2π/ω
1 1 1
cos ωt d t = 0, cos2 ωt d t = ,
2π/ω 0 2π/ω 0 2
we get
g a2ω2
ψ̈ + sin ψ + sin ψ cos ψ = 0,
ℓ 2ℓ2
or the equivalent
gh a2ω2 i
ψ̈ + 1+ cos ψ sin ψ = 0. (4.4)
ℓ 2ℓg
22 Chapter 4. The Kapitsa pendulum
In comparison with the equation (3.5) of the motion of an ordinary unforced pendu-
lum, the Kapitsa pendulum sees an “effective acceleration of gravity” given by
h a2ω2 i
g 1+ cos ψ .
2ℓg
If the pendulum’s motion is in the 0 < ψ < π/2 regime, the quantity in the square brackets
is greater than 1, therefore vibrating the support is tantamount to increasing the acceler-
ation of gravity.8 If, however, the pendulum’s motion is in the π/2 < ψ < π regime,
then the effective acceleration of gravity may become negative if the coefficient of cos ψ
is sufficiently large. The latter will happen if ω is sufficiently large. That’s tantamount to
reversing the direction of gravity, which sort of explains why the pendulum turns upright.
g a2ω2
V ′ (ψ) = sin ψ + sin ψ cos ψ, (4.5)
ℓ 2ℓ2
whence
g a2ω2 2
V (ψ) = (1 − cos ψ) + sin ψ (4.6)
ℓ 4ℓ2
The analysis presented in Chapter 3 is based entirely on the shape of V ’s graph. Therefore
we proceed to analyze the shape.
The equation’s equilibria are the roots of the equation V ′ (ψ) = 0. Upon factorizing
the equation as
h g a2ω2 i
+ cos ψ sin ψ
ℓ 2ℓ2
we see that the roots are the solutions of
2g l
sin ψ = 0 and cos ψ = − .
a2ω2
The first equation yields ψ = 0 and ψ = π as roots. (It suffices to look for roots in the
0 ≤ ψ ≤ π range.) The second equation yields a root ψ̂ given by
2g l
ψ̂ = cos−1 − (4.7)
a2 ω2
if and only if ω 2 > 2g ℓ/a 2 . (If that ratio equals to 1 then the root is π, which duplicates
what we have already found.)
Table 4.1 lists the critical points of the function V (ψ), along with the values of V , V ′ ,
and V ′′ at those points. We see that:
• V ′′ (0) > 0 regardless of the parameter values, therefore the hanging-down equilib-
rium, ψ = 0, is always stable;
8
Don’t take this literally; the acceleration of gravity in (3.5) is a constant while the effective acceleration of
gravity in (4.4) depends on cos ψ, therefore is not a constant.
Exercises 23
ψ 0 ψ̂ π
2g
V (ψ) 0 V (ψ̂)
ℓ
V ′ (ψ) 0 0 0
g h a2ω2 i a2 ω2 2 g h a2 ω2 i
V ′′ (ψ) +1 − sin ψ −1
l 2g ℓ 2ℓ2 l 2g ℓ
Table 4.1: The analysis of the critical points of the function V (ψ) defined in (4.6). The
critical point ψ̂, defined in (4.7) exists if and only if ω2 > 2g ℓ/a 2 .
Figure 4.3: Two representative graphs of the function V (ψ) in (4.6) with the parameters
ℓ = 1, g = 1, a = 0.05. On the left we have taken ω = 20, which leads to
2g l /(a 2 ω2 ) = 2 > 1. This corresponds to a stable equilibrium at ψ = 0
and an unstable equilibrium at ψ = π. On the left we have taken ω = 40,
which leads to 2g l /(a 2 ω2 ) = 1/2 < 1. This corresponds to stable equilibria
at ψ = 0 and ψ = π, and an unstable equilibrium at ψ = 2π/3.
Figure 4.3 shows the graphs of V (ψ) for two representative cases. The graphs are
plotted over the range [−2π, 2π] to give a clear sense of their nature; only the range [0, π]
is of true relevance to us.
Exercises
4.1. Derive the equation of motion (4.1) of Kapitsa’s pendulum.
24 Chapter 4. The Kapitsa pendulum
Lagrangian mechanics
The following obvious trick flattens the doubly-indexed variables into singly-index
quantities and results in a significant algebraic simplification. We introduce the vectors
x = 〈 r1,1 , r1,2 , r1,3 , r2,1 , r2,2 , r2,3 , . . . rN ,1 , rN ,2 , rN ,3 〉, (5.3a)
| {z }| {z } | {z }
r1 r2 rN
m = 〈 m1 , m1 , m1 , m2 , m2 , m2 , . . . , mN , mN , mN 〉, (5.3c)
| {z } | {z } | {z }
m1 m2 mN
25
26 Chapter 5. Lagrangian mechanics
f21 f21
ℓ12 f23
ℓ m2 g f12
m2 g
f12 ℓ23
f32
f13
m1 g ℓ31 f31
m1 g m3 g
(a) A idealized dumbbell (b) a rigid triangle
Figure 5.1: This idealized dumbbell on the left consists of two points masses connected
through a massless rigid rod of length ℓ. In free flight, the force exerted on
m1 is the resultant of the weight vector m1 g and the push/pull of the rod
f12 . The rigid triangle on the right consists of three point masses connected
through a massless rigid rods.
Definition 5.2. The 3N -dimensional space introduced above is called the mechanical system’s
Cartesian configuration space. In analogy with Newton’s equations of motion, the vectors
x, f , m defined in (5.3) are called the position, the force, and the mass of the single abstract
“particle” moving in the configuration space.
m ẍi = fi , i = 1, 2, . . . , 3N ,
involving only a single m. In contrast, (5.4)’s fictitious “particle” exhibits different masses
along different coordinate directions.
curve the system’s orbit in the configuration space. When there is no risk of confusion, we
will simply call it the orbit.
If there are no impediments in placing the particles independently in arbitrary po-
sitions in space, then the orbit of the system of N particle may reach any point in the
configuration space—all is needed is the application of an appropriate force to get there.
If, however, the relative movements of the points are constrained, as in the dumbbell of
Figure 5.1(a), only a subset of the configuration space may be reached.
Example 5.3. Let r1 = 〈r1,1 , r1,2 , r1,3 〉 and r2 = 〈r2,1 , r2,2 , r2,3 〉. be the position vectors of
the dumbbell of Figure 5.1(a). Then, according to (5.3a) the position vector x ∈ R6 is
given by
x = 〈r1,1 , r1,2 , r1,3 , r2,1 , r2,2 , r2,3 〉.
The constraint of the fixed length ℓ of the connecting rod is expressed as kr1 − r2 k = ℓ,
or more explicitly, as
that is,
(x1 − x4 )2 + (x2 − x5 )2 + (x3 − x6 )2 − ℓ2 = 0. (5.5)
This defines a 5-dimensional “surface”—a manifold is the technical term—embedded in R6 .
The point orbit cannot roam arbitrarily in R6 ; it is constrained to stay on that manifold.
Example 5.4. Figure 5.1(b) shows three point masses connected with three massless rigid
rods, and thus forming a rigid triangle. The position vectors ri , i = 1, 2, 3, of the masses
are constrained through the three constraint equations
These confine the triangle’s orbit in the configuration space to a 6-dimensional manifold
embedded in R9 .
Example 5.5. Reconsider the previous example with a added twist. Suppose that the
triangle’s rods are equipped with remote-controlled motors with may be activated to vary
the rods’ lengths as desired during the flight. The previous example’s constraint equations
take the form
where ℓ12 (t ), ℓ23 (t ), and ℓ31 (t ) are given. The manifold M in this case is a 6-dimensional
manifold embedded in R9 whose shape changes with time.
Example 5.6. Recall the bead on the rotating hoop of Exercise 1.3 on page 6. With the
obvious choice of the xy z coordinates, the position vector of the bead is
r = 〈R sin ϕ cos Ωt , R sin ϕ sin Ωt , R cos ϕ〉.
The manifold M in this case is a the spinning hoop itself. Geometrically it is a one-
dimensional spinning object (a circle) embedded in R3 . It is given by the pair of equations
x sin Ωt = y cos Ωt ,
x 2 + y 2 + z 2 = a2 .
The first equation is that of plane that contains the z axis and spins about it with an
angular velocity of Ω. The second equation is that of a sphere of radius a centered at the
origin. The intersection of the two objects is the spinning hoop.
In general, a system of N particles subject to M constraint equations of the form
ϕi (x, t ) = 0, i = 1, 2, . . . , M , (5.6)
3N
where ϕi : R × R → R, i = 1, 2, . . . , M . These define a (3N − M )-dimensional manifold
M embedded in R3N . The system’s possible orbits are confined to lie in that manifold.
Generally M may move/deform with time, as it was the case in Examples 5.5 and 5.6.
However, if the equations (5.6) are independent of time, as it was the case in Examples 5.3
and 5.4, then M remains unchanged during the motion. That corresponds to a set of
constraints of the form
ϕi (x) = 0, i = 1, 2, . . . , M , (5.6’)
Constraints of type (5.6) are not the most general. Some very interesting mechanical
systems impose constraints on the velocity, ẋ, as in ϕi (x, ẋ, t ) = 0. The rolling of a coin
on the floor, for instance, has a constraint that depends on ẋ, therefore (5.6) is inadequate
for that purpose.
Definition 5.7. Constraints of the type (5.6) are called holonomic. All other types of con-
straints are called nonholonomic.
Definition 5.8. A mechanical system whose only constraints are of the holonomic type is
called a holonomic system.
We will begin our study of Lagrangian dynamics with holonomic systems. Nonholo-
nomic constraints will be brought up in the later chapters.
Remark 5.2. You may be interested to know that holonomic constraints of type (5.6)
are called rheonomic while those of type (5.6’) are called scleronomic. I prefer to call them
with the more user-friendly terms “time-dependent” and “time-independent” instead.
Admittedly that definition is rather vague, but its meaning is clarified further down:
§132. When from the differential equations of a material system an equal
number of finite equations between the coordinates of the system can be de-
duced, the system is holonomous.
By “finite equations” he means algebraic, as opposed to differential, equations.
Remark 5.4. A familiar example curvilinear coordinates, albeit not directly related to
mechanics, is the system of addressing locations on the surface of the Earth through their
longitude λ and latitude ϕ. In this context, M is the Earth’s surface, and λ and ϕ are the
coordinates q1 and q2 .
xi = xi (q, t ), i = 1, 2, . . . , 3N . (5.7)
The t in this equations accounts for the possible motion/deformation of the manifold re-
lated to time-dependent constraints (5.6). In the case of time-independent constraints (5.6’),
M is independent of time, and (5.7) reduces to
xi = xi (q), i = 1, 2, . . . , 3N . (5.7’)
[The last term will be absent in the case of (5.7’).] Let us observe that although the po-
sition xi is a function of q and t only, the velocity ẋi is a function of q, q̇, and t . Let’s
record this here for future reference:
Theorem 5.9. Let xi and ẋi be as in (5.7) and (5.9). Then for any i = 1, 2, . . . , 3N and
j = 1, 2, . . . , M , we have:
∂ ẋi (q, q̇, t ) ∂ xi (q, t )
= . (5.10)
∂ q˙j ∂ qj
∂ ẋi (q, q̇, t ) d ∂ xi (q, t )
= (5.11)
∂ qj dt ∂ qj
Proof. The assertion (5.10) is an immediate consequence of (5.8). As to (5.11), it’s a matter
of differentiating (5.8) with respect to q j and then exchanging the differentiation order in
the resulting second order partial derivatives:
Letting
3N
X ∂ xi
Qj = fi , j = 1, 2, . . . , n, (5.13)
i =1 ∂ qj
5.4. Virtual displacements, virtual work, and generalized force 31
x̂
virtual displacements at x̂
the manifold M
Figure 5.2: The system’s orbit lies on a manifold M determined by the holonomic con-
straints. Virtual displacements at x̂ are tangent to the manifold.
δW = f · δx = Q · δq.
The vector Q is called the generalized force at x̂. The component Q j is called the compo-
nent of the generalized force along the generalized coordinate q j .
Example 5.10. Consider the simple pendulum of Figure 1.1. The position vector r =
〈ℓ sin ϕ, ℓ cos ϕ〉, therefore the vector x (see (5.3a)) is
x = 〈x1 , x2 〉
= 〈ℓ sin ϕ, ℓ cos ϕ〉,
and the constraint is x12 + x22 −ℓ2 , therefore the configuration manifold M coincides with
the circle swept by the pendulum’s bob, embedded in the configuration space R2 . The
angle ϕ plays the role of the generalized coordinate in this case; any value of ϕ identifies
a point on M . Let us write ϕ and Qϕ instead of q1 and Q1 for clarity. The force vector is
〈0, m g 〉. We compute the generalized force by applying (5.13):
∂ x1 ∂x
Q ϕ = f1 + f2 2
∂ϕ ∂ϕ
= 0 × (ℓ cos ϕ) + m g × (−ℓ sin ϕ) = −m g ℓ sin ϕ.
Observe that Qϕ turns out to be equal to the moment of the weight vector f about the
pendulum’s pivot.
Example 5.11. Consider the double-pendulum of Figure 1.2. The position vectors of its
two masses are given by
then perform work during the motion. But such a behavior is uncharacteristic of a passive
constraint surface, so we disallow it.
Since a virtual displacement δx is tangent to the constraint manifold (see Figure 5.2),
the orthogonality of the reaction force f ′ to M is expressed naturally as f ′ · δx = 0 for
all virtual displacements δx, or in expanded form
3N
X
f i ′ δ xi = 0 for all virtual displacements δx.
i =1
therefore
n
3N
′∂ xi
X X
fi δq j = 0 for all virtual displacements δq,
j =1 i =1 ∂ qj
To simplify the left-hand side, we begin with a preliminary preparation. The kinetic
energy of the system is
3N
X 1
T̃ (ẋ) = mk ẋk2 .
k=1
2
I have written T̃ rather than the usual T for a reason which will become obvious shortly.
Now observe that for any i
3N
∂ T̃ (ẋ) ∂ X 1
= m ẋ 2 = mi ẋi ,
∂ ẋi ∂ ẋi k=1 2 k k
34 Chapter 5. Lagrangian mechanics
therefore
d ∂ T̃ (ẋ)
= mi ẍi .
d t ∂ ẋi
Then, the left-hand side of (5.16) may be calculated as
3N 3N
X ∂ xi X d ∂ T̃ (ẋ) ∂ xi
mi ẍi =
i =1 ∂ qj i =1 d t ∂ ẋi ∂ qj
3N
3N
X d ∂ T̃ (ẋ) ∂ xi X ∂ T̃ (ẋ) d ∂ xi
= − ,
i =1 d t ∂ ẋi ∂ q j i =1 ∂ ẋi d t ∂ qj
where in the last step we have used the differentiation formula u ′ v = (uv)′ − uv ′ . Now
apply (5.10) to the first summation on the right-hand side, and apply (5.11) to the second
summation, to get
3N 3N
3N
X ∂ xi X d ∂ T̃ (ẋ) ∂ ẋi X ∂ T̃ (ẋ) ∂ ẋi
mi ẍi = −
i =1 ∂ qj i =1 d t ∂ ẋi ∂ q̇ j i =1 ∂ ẋi ∂ q j
3N 3N
d X ∂ T̃ (ẋ) ∂ ẋi X ∂ T̃ (ẋ) ∂ ẋi
= − .
d t i =1 ∂ ẋi ∂ q̇ j i =1 ∂ ẋi ∂ q j
Let us recall that the Cartesian velocity components ẋ and the generalized velocity
components q̇ are related through (5.8). Therefore the kinetic energy, which we have
taken to be a function of ẋ, may equally well be expressed as a function of q, q̇, and t .
We write the latter as T to distinguish it from the former T̃ :
Consequently
3N
X ∂ xi d ∂ T (q, q̇, t ) ∂ T (q, q̇, t )
mi ẍi = − ,
i =1 ∂ qj dt ∂ q̇ j ∂ qj
and therefore (5.16) takes the form
d ∂ T (q, q̇, t ) ∂ T (q, q̇, t )
− = Qj , j = 1, 2, . . . , n. (5.17)
dt ∂ q̇ j ∂ qj
a
ϕ
a m
O
x
Figure 5.3: The massless hoop rolls against a horizontal line while remaining in a ver-
tical plane. A point of mass m is affixed to the hoop. The angle ϕ or the
distance x may be used as generalized coordinates in the configuration space
(Exercises 5.2 and 5.3).
and observe that since V does not depend on q, we have ∂ L/∂ q̇ j = ∂ T /∂ q̇, therefore
the equations of motion collapse to
d ∂ L(q, q̇, t ) ∂ L(q, q̇, t )
− = 0, j = 1, 2, . . . , n. (5.18)
dt ∂ q̇ j ∂ qj
Exercises
5.1. Find the gneralized forces Qϕ and Qθ in the spherical pendulum of Figure 1.4
(page 7).
5.2. A massless hoop of radius a rolls without slipping on a horizontal line, while remi-
aning in a vertical plane. A particle of mass m is firmly attached to the hoop, as
seen in Figure 5.3. Use the angle ϕ of the mass’s radius relative to the vertical as
generalized coordinate. Find the generalized force Qϕ .
5.3. In the previous problem use the distance x travelled by the contact point (see the
figure) as generalized coordinate. Find the generalized force Q x .
5.4. Suppose the external forces fi in (5.14) are derived from a potential, that is, there ex-
ists a scalar-valued function V˜ (x, t ) such that fi = −∂ Ṽ (x, t )/∂ xi for i = 1, 2, . . . , 3N .
Let V (q, t ) = Ṽ (x, t ) be the representation of the potential as a function of gen-
eralized coordinates. Show that the generalized forces Q j are derived from the
potential V , that is, Q j = ∂ V (q, t )/∂ q j , for j = 1, 2, . . . , n.
36 Chapter 5. Lagrangian mechanics
Chapter 6
Constraint reactions
where each ai j and ai t can be a given function of q and t . For convenience, we write these
set of constraints in the compact form
n
X
a l k (q, t ) d qk + a l t (q, t ) d t = 0, l = 1, 2, . . . , m, (6.2)
k=1
37
38 Chapter 6. Constraint reactions
of motion:
m
d ∂ L(q, q̇, t ) ∂ L(q, q̇, t ) X
− = ak j λ k , j = 1, 2, . . . , n. (6.4)
dt ∂ q̇ j ∂ qj k=1
where we have made use of (1.1). The kinetic and potential energies are
1
T = m(ρ̇2 + ρ2 ϕ̇ 2 ), V = −m g ρ cos ϕ,
2
which leads to the lagrangian
1
L = T − V = m(ρ̇2 + ρ2 ϕ̇ 2 ) + m g ρ cos ϕ.
2
We calculate
∂L ∂L ∂L ∂L
= m ρ̇, = mρϕ̇ 2 + m g cos ϕ = mρ2 ϕ̇, = −m g ρ sin ϕ.
∂ ρ̇ ∂ρ ∂ ϕ̇ ∂ϕ
The constraint of inextensibility ρ = ℓ implies that d ρ = 0, which we write as (1)d ρ +
(0)d ϕ + (0)d t = 0 to conform to the genral template (6.1). It follows that a11 = 1, a12 = 0,
a1t = 0, therefore the equations (6.4) take the form
d ∂ L ∂ L
− = a11 λ1
d t ∂ ρ̇ ∂ρ
d ∂ L ∂ L
− = a12 λ1 ,
d t ∂ ϕ̇ ∂ϕ
39
that is
or in expanded form:
m ρ̈ − (mρϕ̇ 2 + m g cos ϕ) = λ1 ,
mρ2 ϕ̈ + 2mρρ̇ϕ̇ + m g ρ sin ϕ = 0.
−mℓϕ̇ 2 − m g cos ϕ = λ1 ,
mℓ2 ϕ̈ + m g ℓ sin ϕ = 0.
The second equation is the usual equation of motion of a simple pendulum. In principle,
we may plug its solution, ϕ(t ), into the first equation to find λ1 , but we don’t do it that
way. Instead, we compute the generalized reaction forces Q r′ and Qϕ′ from (6.5):
Then, we apply (6.6) to translate these into the physical components of the forces. Toward
that end, let us observe that r = ρe r , therefore
∂r ∂r ∂e
= er , = ρ r = ρeϕ ,
∂ρ ∂ϕ ∂ϕ
∂r ∂r
Q r′ = f ′ · = f ′ · er , Qϕ′ = f ′ · = f ′ · ρeϕ ,
∂ρ ∂ϕ
whence
f ′ · e r = −(mℓϕ̇ 2 + m g cos ϕ) f ′ · eϕ = 0.
This tells us that the constraint reaction force f ′ lies in the direction of the pendulum’s
rod, and its magnitude equals the sum of the centrifugal force mℓϕ̇ 2 and the compoment
m g cos ϕ of the bob’s weight along the rod.
Example 6.2. Consider a the rolling hoop of Exercise (2) of Chapter 5. Find the force at
contact point between the hoop and the plane.
Referring to Figure 5.3, we use the hoop’s rotation angle ϕ and the horizontal trans-
lation x of its center as a generalized coordinates. The no-slip condition imposes the con-
straint x = aϕ. We will use ϕ and x as overdetermined generalized coordinates subject to
x = aϕ, that is d x − a d ϕ = 0, thefore accoring to the template (6.1), a11 = 1, a12 = −a.
40 Chapter 6. Constraint reactions
The position vector of the mass m is r = 〈x +a sin ϕ, a+a cos ϕ〉, therefore the velocity
is v = ṙ = 〈ẋ + a ϕ̇ cos ϕ, −a ϕ̇ sin ϕ〉. It follows that kvk2 = ẋ 2 + 2a ẋ ϕ̇ cos ϕ + a 2 ϕ̇ 2 ,
therefore the kinetic and potential enerigies are
1
T = m(ẋ 2 + 2a ẋ ϕ̇ cos ϕ + a 2 ϕ̇ 2 ), V = m ga(1 + cos ϕ).
2
Therefore
1
L = m(ẋ 2 + 2a ẋ ϕ̇ cos ϕ + a 2 ϕ̇ 2 ) − m ga(1 + cos ϕ).
2
and
∂L ∂L
= m(ẋ + a ϕ̇ cos ϕ), = 0,
∂ ẋ ∂x
∂L ∂L
= m(a ẋ cos ϕ + a 2 ϕ̇) = m(−a ẋ ϕ̇ sin ϕ + ga sin ϕ).
∂ ϕ̇ ∂ϕ
To eliminate λ1 , divide the second equation through by a and add the result to the
first equation. We get
g
2ma(1 + cos ϕ)ϕ̈ − ma + ϕ̇ 2 sin ϕ = 0.
a
This is the hoop’s equation of motion. To compute the constraint force, solve this for ϕ̈:
g
a a + ϕ̇ 2 sin ϕ
ϕ̈ =
2a(1 + cos ϕ)
1 g
λ1 = ma ϕ̇ 2 − sin ϕ. (6.8)
2 a
Now we compute the generalized forces Q x′ and Qϕ′ from (6.5):
Then, we apply (6.6) to translate these into the physical components of the forces. Toward
that end, let us observe that
∂r ∂r
= 〈1, 0〉, = 〈a cos ϕ, −a sin ϕ〉,
∂x ∂ϕ
Exercises 41
f′ f′ f′
a sin ϕ
ϕ m
a(1 + cos ϕ)
p
f′
∂r ∂r
Q x′ = f · = fx Qϕ′ = f · = a f x cos ϕ − a fy sin ϕ.
∂x ∂ϕ
It follows that
f x = λ1 , a f x cos ϕ − a fy sin ϕ = −aλ1 .
We solve this as a system of two equations in the two unknowns f x and fy :
1 + cos ϕ
f x = λ1 , fy = λ1 .
sin ϕ
Exercises
6.1. The hoop of Example 6.2 rolls, without slipping, down an incline which makes an
angle α with respect to the horizontal. See Figure 6.2. Use the angle ϕ as the gen-
eralized coordinate. Derive the equation of motion. Hint: (a) Express the vectors
i′ and j ′ in terms of i, j, and the angle α; (b) Express the position vector r of the
mass m in terms of the basis vectors i′ and j ′ ; (c) Apply the results of (a) and (b) to
express r in terms of i and j.
6.2. In the the previous exercise, find the contact force between the hoop and incline.
42 Chapter 6. Constraint reactions
α
j′ ϕ
r
m
j i′
α
i
Figure 6.2: Hoop rolling on an incline. The hoop’s initial position is shown in gray
(Exercise 6.1).
Chapter 7
Angular velocity
When you hurl a rock in the air, or toss a Frisbee, the object spins in general as it moves.
Associated with the motion is a vector ω(t ), called the object’s angular velocity vector,
(or just angular velocity for short,) whose direction and magnitude at any instant of time
t indicate orientation and the rate of rotation. It is the goal of this section to make the
definition of the angular velocity precise.
Toward that end, let {b1 , b2 , b3 } be an orthonormal set of vectors which is firmly af-
fixed to the spinning object, therefore moves with it. The choice of the letter b is this
notation is to remind us that we are dealing with a body coordinate system.9
In elementary linear algebra we learn that the component of an arbitrary vector r in
the direction of the unit vector b j is r · b j . Therefore the vector r may be expressed in the
P
basis {b1 , b2 , b3 } as r = 3j =1 (r · b j )b j .10 Therefore, the rate of change, ḃi , of the vector
bi in the basis {b1 , b2 , b3 } may be expressed as
3
X
ḃi = (ḃi · b j )b j .
j =1
Let ai j = ḃi · b j be the matrix of the coefficients. We claim that the matrix, let’s call it A,
¨
1 if i = j ,
is skew-symmetric. Indeed, we have bi · b j = Upon differentiating this we
0 if i 6= j .
get ḃi · b j + ḃ j · bi = 0, whence ai j + a j i = 0, which proves A is skew-symmetric.
The general form of a 3 × 3 skew-symmetric matrix is
0 ω3 −ω2
A = −ω3 0 ω1 . (7.1)
ω2 −ω1 0
9
As the body rotates, the vectors {b j }3j =1 rotate with it, therefore they vary with time from the point of view
3
of a stationary observer. On occasion we write b j (t ) j =1 to make explicit the dependence of these vectors on
the time t .
10
See equation (8.7) on page 49 for a simple proof.
43
44 Chapter 7. Angular velocity
Therefore
ḃ1 = ω3 b2 − ω2 b3 , (7.2a)
ḃ2 = ω1 b3 − ω3 b1 , (7.2b)
ḃ3 = ω2 b1 − ω1 b2 . (7.2c)
The coefficients ω1 , ω2 , ω3 measure the rate of change of the triad {b1 , b2 , b3 }, and
consequently, the rate of change of orientation of the body to which the triad is attached.
The angular velocity vector, defined as
ω = ω1 b1 + ω2 b2 + ω3 b3 (7.3)
encapsulates that rate of change.
Remark 7.2. We leave it as an exercise to show that differentiating (7.3) and applying the
equations (7.2), resutls in
ω = ω̇1 b1 + ω̇2 b2 + ω̇3 b3 . (7.4)
Remark 7.3. On the one hand, in (7.1) we have a23 = ω1 . On the other hand, we have
the ai j = ḃi ·b j by definition. It follows that ω1 = ḃ2 ·b3 . Similar expressions are obtained
for ω2 and ω3 . Let’s make a record of this:
ω1 = ḃ2 · b3 , ω2 = ḃ3 · b1 , ω3 = ḃ1 · b2 . (7.5)
−→
Remark 7.5. Consider points O and P in the body, and let r = OP . Then r = α1 b1 +
α2 b2 + α3 b3 , where α1 , α2 , α3 are constants. The velocity v of P relative to O is given by
v = ṙ. Let us observe that
v = ṙ = α1 ḃ1 + α2 ḃ2 + α3 ḃ3
= α1 (ω × b1 ) + α2 (ω × b2 ) + α3 (ω × b3 ) = ω × (α1 b1 + α2 b2 + α3 b3 ) = ω × r.
The formula
v = ω×r (7.7)
is used throughout the rest of this book.
Exercises 45
Exercises
7.1. Derive (7.4).
46 Chapter 7. Angular velocity
Chapter 8
47
48 Chapter 8. The moment of inertia tensor
y
x+y x
o
o −0.7x 1.2x
x
Figure 8.1: On the left: The sum x + y of two vectors x and y is formed through
the parallelogram rule. On the right: Multiplication by a number stretch-
es/shrinks/flips a vector. The vector 1.2x is drawn in a slightly displaced
position relative to the origin o for the sake of visualization.
x×y
y
n
θ y
o
x θ
o
x
Figure 8.2: On the left: The dot product of the vectors x and y is the number x · y =
kxk kyk cos θ. On the right: The cross
product of the vectors x and y is
the vector x × y = kxk kyk sin θ n, where n is a unit vector which is
perpendicular to the plane of the vectors x and y, and is oriented according
to the right-hand rule.
where 0 ≤ θ ≤ π is the angle between the vectors, and n is a unit vector which is perpen-
dicular to the plane of the vectors x and y, and is oriented according to the right-hand
rule. The latter means that if you align your right hand’s thumb with n, then a rotation
by an angle θ in the direction pointed at by your fingers will take the vector x to y. It
follows that the cross product is anticommutative, which means that x × y = −y × x.
x = c 1 e1 + c 2 e2 + c 3 e3 .
Dot-multiplying through by e1 and taking into account the orthonormality of basis vec-
tors, we get x · e1 = c1 . Repeating with e2 and e3 we conclude that ci = x · ei , i = 1, 2, 3.
Thus we have established the identity
For each i, the coefficient x · ei is called the component of the vector x along ei .
Remark 8.1. Let us state emphatically that the components x · ei are not properties of
the vector x at all! If you replace one basis with another which is rotated in an arbitrary
manner relative to the first, then the components of x will be different in general, while x
has not been touched. Even if you remove the basis altogether, the vector x will happily
continue to exist. Having said all that, it is sometimes useful to work with 〈c1 , c2 , c3 〉 ∈ R3
as sort of an “avatar” of x ∈ V , as long you remain cognizant of what it is.
Remark 8.2. Applying the defintion of the diatic product (8.4), the identity (8.7) may
be written in the equivalent form
x = (e1 ⊗ e1 )x + (e2 ⊗ e2 )x + (e3 ⊗ e3 )x,
which has at least two implications. First, upon factoring the x on the right-hand side,
we see that
I = e1 ⊗ e1 + e2 ⊗ e2 + e3 ⊗ e3 .
Second, ei ⊗ ei acting on any vector x produces the projection of x in the direction ei .
We noted earlier that the set of tensors on V , equipped with the operations defined
in (8.2) and (8.3), is a vector space of its own. The following lemma shows how to con-
struct a basis for that vector space.
Lemma 8.2. Let {e1 , e2 , e3 } be an orthonormal basis in V . Then {ei ⊗ e j }3i , j =1 is a basis for
the space of tensors on V .
Proof. We will show that any tensor L is a linear combination of the nine dyadic products
{ei ⊗ e j }3i , j =1 . Toward that end, pick an arbitrary x ∈ V ,
3
X
x= (x · ei )ei ,
i =1
it follows that
3
X 3
X
y · ej = (x · ei )(Lei · ej ) = (Lei · ej )(x · ei ),
i =1 i =1
and consequently
3
X
y= (y · e j )e j
j =1
3 X
X 3
= (Lei · ej )(x · ei )e j
j =1 i =1
3 X
X 3
= (Lei · ej )(ei ⊗ e j )x,
j =1 i =1
8.1. A brief introduction to tensor algebra 51
where we have made use of the definition (8.4). Since y = Lx, this tells us that
3 X
X 3
Lx = (Lei · ej )(ei ⊗ e j )x for all x ∈ V ,
i =1 j =1
therefore
3 X
X 3
L= (Lei · ej )(ei ⊗ e j ), (8.8)
i =1 j =1
which shows that L is a linear combination of the nine dyadic products {ei ⊗ e j }3i , j =1 . In
other words, the set of vectors {ei ⊗ e j }3i , j =1 spans the set of all tensors. To show that
the set is a basis, it remains to show that it is linearly independent. You will do that in
Exercise 8.2.
Remark 8.3. The coefficients in (8.8) are called the components of L in the basis {ei ⊗
e j }3i , j =1 . Letting ℓi j = Lei · ej , we may write (8.8) in the abbreviated form
3 X
X 3
L= ℓi j ei ⊗ e j .
i =1 j =1
At times it is useful to identify the tensor L with the 3 × 3 matrix with components ℓi j
but the caveats of Remark 8.1 apply equally well here. The components ℓi j are artifacts
of the choice of the basis vectors. The tensor is an intrinsic property of the system and
will continue to exist even when the basis vectors are obliterated.
Remark 8.4. In view of the one-to-one correspondence between 3 × 3 matrices and ten-
sors on V noted above, every property or theorem in matrix algebra finds a counterpart
in tensor analysis. For instance, a nonzero vector x is called and eigenvector of the tensor
L if Lx = λx for some λ ∈ R.11 The coefficient λ is the eigenvalue corresponding to that
eigenvector.
vi
mi
ω ri
Remark 8.5. According to (8.8), a tensor is a linear combination of the nine basis ele-
ments {ei ⊗ e j }3i , j =1 in general. In contrast, (8.9) presents L as a linear combination of
only three special elements {g j ⊗ g j }3j =1 constructed from L’s eigenvectors.
the the tensor I . The change, however, obeys a simple translation rule which we will
develop in the next section. For now let us observe another aspect of (8.10). The fact that
the system under consideration consists of N rigidly connected point masses is hardly
of particular importance. A general rigid solid may be approximated by a union a large
number of tiny parts, as one does in the theory of integration, and then pass to the limit
as the number of the parts goes to infinity, and the sizes of the individual parts go to zero,
while maintaining a fix mass for the aggregate.
To be specific, let B be the solid object, d m be the differential mass of the “part” of B
indicated by the position vector r relative to some origin o. Then the obvious extension
of (8.10) takes the following form for the solid’s moment of inertia:
Z
I= krk2 I − r ⊗ r d m. (8.12)
B
If ρ(r) is the density of the body at the position r, then d m = ρ dV , where V is the
volume element, and the formula above takes the form
Z
I= ρ(r) krk2 I − r ⊗ r dV . (8.13)
B
where r and r are the position vectors of a generic point p ∈ B relative to o and o′ , as
′
Theorem 8.4. Let c ∈ B the the body’s center of mass, and let us write rc′ for the position
vector of c relative to o′ . Furthermore, let τ = o′ − o. Then we have:
h i
Io = Io′ + m 2rc′ · τ + kτ k2 I − rc′ ⊗ τ − τ ⊗ rc′ − τ ⊗ τ , (8.14)
B c
p rc
rc′
r′
r
τ
o o′
Figure 8.4: The moment of inertia tensor of a rigid solid B depends on the choice of
the origin. The origins o′ and o′ in this illustration are related through
o′ − o = τ . The solid’s center of mass is c.
The first intergral on the right-hand side equals Io′ . The second integral may be simplified
by noting that Z Z
1
m= d m, rc′ = r ′ d m,
B m B
R
and consequently B r ′ d m = mrc′ ,
Corollary 8.5. Let c ∈ B be the body’s center of mass as before, and let Ic be the moment
of inertia tensor relative to c. Then the moment of inertia tensor Io of B relative to any
origin o is given by
Io = Ic + m krc k2 I − rc ⊗ rc . (8.15)
Proof. Apply (8.14) with o′ set to c. Then rc′ = 0, and the formula reduces to
Io = Ic + m kτ k2 I − τ ⊗ τ .
Then the observation that τ = o′ − o = c − o = rc completes the proof.
Remark 8.6. The moment of intertia tensor of a rigid system of N point masses is given
in (8.10). In particualr, the moment of intertia tensor of a single point of mass m at a
position r relative to an origin o is
m krk2 I − r ⊗ r .
Therefore the translation formula (8.15) may be interpreted as saying that the moment of
interia of a body relative of any point o equals the moment of inertia relative to its center
of mass, plus the moment of inertia relative to o of a fictitious point of mass m situated
at the center of mass.
Remark 8.7. The moment of inertia tensor Ic of a ridig body relative to its center of
mass is an intrinsic property of the body, just like its total mass or its center of mass are.
8.4. The principal moments of inertia 55
The total mass is a scalar, the center of mass is expressed through its position vector, and
the moment of inertia Ic is a tensor.
Exercises
8.1. Show that an orthonormal set of vectors is linearly independent.
8.2. Complete the proof of Lemma 8.2 by showing that the set {ei ⊗ e j }3i , j =1 is linearly
P P
independent. Hint: It suffices to show that 3i =1 3j =1 ci j (ei ⊗e j ) = 0 implies that
every ci j is zero. Begin by applying each side of that equality to ek .
8.3. Show that if the tensor L is symmetric, then the matrix ℓi j of its components (see
Remark 8.3) is symmetric, that is ℓi j = ℓ j i for i, j = 1, 2, 3.
8.4. Use (8.17) in conjunction with the translation formula (8.15) to calculate the mo-
ment of inertia tensor of a slender rod relative to one of its endpoints.
8.5. Look up the principal moments of inertia of a circular hoop in Wikipedia relative
to the hoop’s center. Then apply the translation formula (8.15) to calculate the
moment of inertia tensor of the hoop relative to a point on its rim.
56 Chapter 8. The moment of inertia tensor
Chapter 9
We assume that an external force f d m acts on the part P . The resultant of forces
applied to the body, and their moments about the center of mass, are
Z Z
F≡ f d m, M≡ r × f d m. (9.2)
B B
1
The contribution of the part P to the overall Gibbs function is 2 kv̇k2 d m − f · v̇ d m,
and therefore the overall Gibbs function is
Z Z
1
G= kv̇k2 d m − f · v̇ d m. (9.3)
B 2 B
Now let’s analyze each of the nine integrals on the right-hand side.
57
58 Chapter 9. Rigid body dynamics through the Gibbs-Appell formulation
1. The integrand kr̈c k2 in the first integral is independent of the position vector r, and
therefore the integral evaluates to
Z
1 1
kr̈c k2 d m = mkr̈c k2 .
2 B 2
2. The integrand kω̇ × rk2 of the second integral may be expanded through the iden-
tity (8.5) into the form kω̇ × rk2 = ω̇ · krk2 I − r ⊗ r ω̇. The integral of parenthe-
sized expression is the moment of inertia tensor, I , and therefore
Z
1 1
kω̇ × rk2 d m = ω̇ · I ω̇.
2 B 2
(ω̇ × r) · (ω × ṙ) = (ω̇ × r) · ω × (ω × r)
= (ω̇ × r) · (ω · r)ω − kωk2 r = (ω̇ × r) · (ω · r)ω
= (ω · r) (ω̇ × r) · ω = (ω · r) (ω × ω̇) · r
= (ω̇ × ω) · − (ω · r)r = (ω̇ × ω) · − (r ⊗ r)ω
= (ω̇ × ω) · krk2 ω − (r ⊗ r)ω
= (ω̇ × ω) · krk2 I − (r ⊗ r) ω.
9.2. Example: Euler’s equations of motion 59
Consequently,
Z
(ω̇ × r) · (ω × ṙ) d m = (ω̇ × ω) · I ω = (ω × I ω) · ω̇.
B
7. In the seventh integral, the factor r̈c is independent of the position vector r, and
therefore, according to (9.2) we have
Z Z
f · r̈c d m = f · d m r̈c = F · r̈c .
B B
8. We rotate the elements of the the triple vector product in the integrand of the eighth
integral, integrate, and apply (9.2) to obtain
Z Z Z
f · (ω̇ × r) d m = ω̇ · (r × f ) d m = ω̇ · r × f d m = M · ω̇.
B B B
1 1
G = mkr̈c k2 + ω̇ · I ω̇ + (ω × I ω) · ω̇ − F · r̈c − M · ω̇ + · · · , (9.4)
2 2
I ω̇ + ω × I ω = M .
These are known as Euler’s equations of the motion of a rigid body. Letting I1 , I2 , I3 be the
body’s principal moments of inertia, and ω = (ω1 , ω2 , ω3 ) and M = (M1 , M2 , M3 ) be the
components of the angular velocity and the applied momentum along the principal axes
of inertia, this takes the form
I1 0 0 ω̇1 ω1 I1 0 0 ω1 M1
0 I2 0 ω̇2 + ω2 × 0 I2 0 ω2 = M2 ,
0 0 I3 ω̇3 ω3 0 0 I3 ω3 M3
which simplifies to
Now let’s analyze each of the nine integrals on the right-hand side.
1. The integrand kω̇ × rk2 of the first integral may be expanded through the iden-
tity (8.5) into the form kω̇ × rk2 = ω̇ · krk2 I − r ⊗ r ω̇. The integral of parenthe-
sized expression is the moment of inertia tensor, I , with respect to the fixed point
O, and therefore Z
1 1
kω̇ × rk2 d m = ω̇ · I ω̇.
2 B 2
This expression contains no acceleration terms, and therefore it will drop out when
applying the Gibbs-Appell equations. Consequently we may safely ignore it. This
is indicated by the ellipses in the final result below.
3. To evaluate the third integral, we apply the identity
(ω̇ × r) · (ω × ṙ) = (ω̇ × ω) · krk2 I − (r ⊗ r) ω
where I is the moment of inertia tensor with respect to the fixed point O.
4. We rotate the elements of the the triple vector product in the integrand of the fourth
integral and integrate apply (9.2) to obtain
Z Z Z
f · (ω̇ × r) d m = ω̇ · (r × f ) d m = ω̇ · r × f d m = M · ω̇,
B B B
where M is the moment of the applied forces with respect to the fixed point O.
9.3. Rotation about a fixed point 61
1
G = ω̇ · I ω̇ + (ω × I ω) · ω̇ − M · ω̇ + · · · , (9.5)
2
where the ellipses indicate terms that do not contain accelerations.
Remark 9.1. If the applied force density f is constant, then there is a simple formula for
the moment M . Specifically, we have
Z Z
M= r×f dm = r d m × f = mrc × f = rc × F ,
B B
where rc is the position vector of the body’s center of mass relative to O, and F is the
resultant of the external forces applied to the body.
62 Chapter 9. Rigid body dynamics through the Gibbs-Appell formulation
Chapter 10
The Gibbs-Appell
formulation of dynamics
The goal of this chapter is to derive the Gibbs-Appell equations of motion and then show
a few applications. For now, I have two versions of the proof here, one based on the
presentation in Lurie [12] and the other based on the presentation in Gantmacher [8].
The part based on Lurie is not quite finished; the one based on Gantmacher is pretty
complete.
Due to the reliance on two different presentations, the notation in the illustrations/ex-
amples is very inconsistent. At one point I will rewrite this chapter by merging the two
presentations into one, and introduce a consistent notation.
Some of the material from the previous chapters is repeated for the sake of making
this chapter somewhat self-contained.
63
64 Chapter 10. The Gibbs-Appell formulation of dynamics
mi w i = F i + Ri , i = 1, . . . , N . (10.3)
We assume that the system’s constraints are ideal, that is, they do not perform work
in any virtual displacement. (See page 271 of Lurie’s book for a discussion.) Thus, for all
virtual displacements δri we have
N
X
Ri · δri = 0. (10.4)
i =1
As a consequence, multiplying (10.3) by δri and summing over i eliminates the the reac-
tion forces Ri :
XN XN
mi wi · δri = Fi · δri . (10.5)
i =1 i =1
The expression on the right-hand side of (10.5) is the virtual work performed by all
external forces Fi under the virtual displacements δri . It may be expressed as
N N
n
n
N n
X X X ∂ ri X X ∂ ri X
Fi · δri = Fi · δq s = Fi · δq s = Q s δq s ,
i =1 i =1 s =1 ∂ qs s =1 i =1
∂ qs s =1
∂ ri ∂ ṙi ∂ r̈i
= = .
∂ qs ∂ q̇ s ∂ q̈ s
N
X ∂ r̈i
Qs = Fi · . (10.7)
i =1 ∂ q̈ s
10.1. Gibbs-Appell according to Lurie [12] 65
10.1.4 Constraints
Suppose that the n generalized coordinates are related through l (generally nonholo-
nomic) constraints
Xn
ak s (q, t )q̇ s + ak (q, t ) = 0, k = 1, . . . , l , (10.8)
s =1
or equivalently,
n
X
ak s (q, t ) d q s + ak (q, t ) d t = 0. k = 1, . . . , l . (10.9)
s =1
Similarly, (10.10) may be solved for l of the virtual displacements in terms of the rest:
n−l
X
δq r = b r,l +s (q, t ) δq l +s , r = 1, . . . , l . (10.12)
s =1
For reasons which will become clear shortly, the terms which involve no generalized ac-
celerations in our calculations are irrelevant, therefore, to simplify the notation, we write
the above as
n−l
X
q̈ r = b r,l +s (q, t )q̈ l +s + · · · . (10.13)
s =1
From here on, the ellipsis “· · · ” in an equation indicates additive terms that involve no
generalized accelerations.
Now we apply (10.13) to eliminate the accelerations q̈ r , r = 1, . . . , l from (10.2). We
have:
n
X ∂ ri
wi = q̈ + · · ·
s =1 ∂ qs s
l n−l
X ∂ ri X ∂ ri
= q̈ r + q̈ l +s + · · ·
r =1 ∂ q r s =1 ∂ q l +s
l
n−l n−l
X ∂ ri X X ∂ ri
= b r,l +s (q, t )q̈ l +s + · · · + q̈ l +s + · · ·
r =1 ∂ q r s =1 s =1 ∂ q l +s
n−l
l
X ∂ ri X ∂ ri
= + b r,l +s (q, t ) q̈ l +s + · · · .
s =1 ∂ q l +s r =1 ∂ q r
66 Chapter 10. The Gibbs-Appell formulation of dynamics
Since the terms hidden under the ellipsis do not involve the generalized accelerations, we
conclude that
∂ wi
= ci ,l +s . (10.15)
∂ q̈ l +s
n l n−l
X ∂ ri X ∂ ri X ∂ ri
δri = δqk = δq r + δq l +s (10.16)
k=1
∂ qk r =1 ∂ q r s =1 ∂ q l +s
l
n−l n−l
X ∂ ri X X ∂ ri
= b r,l +s (q, t ) δq l +s + δq l +s
r =1 ∂ q r s =1 s =1 ∂ q l +s
n−l
n−l
X ∂ ri X ∂ ri
= + b r,l +s (q, t ) δq l +s
s =1 ∂ q l +s s =1 ∂ q r
n−l
X
= ci ,l +s δq l +s ,
s =1
n−l
X ∂ wi
δri = δq l +s , i = 1, . . . , N . (10.17)
s =1 ∂ q̈ l +s
This motivates the introduction of a quantity G known as the Gibbs function or the energy
of acceleration:
N
1X
G= m w · wi ,
2 i =1 i i
10.1. Gibbs-Appell according to Lurie [12] 67
We observe that
N
∂G X ∂ wi
= mi w i · ,
∂ q̈ l +s i =1 ∂ q̈ l +s
whence
N n−l
X X ∂G
mi wi · δri = δq l +s . (10.18)
i =1 s =1 ∂ q̈ l +s
N
l
X ∂ ri X ∂ ri
Q̃ l +s = Fi · + b r,l +s (q, t ),
i =1 ∂ q l +s r =1 ∂ q r
N l
N
X ∂ ri X X ∂ ri
= Fi · + F · b r,l +s (q, t ),
i =1 ∂ q l +s r =1 i =1 i ∂ q r
that is,
n−l
X ∂G
δq l +s − Q̃ l +s δq l +s .
s =1 ∂ q̈ l +s
∂G
= Q̃ l +s , s = 1, . . . , n − l . (10.20)
∂ q̈ l +s
68 Chapter 10. The Gibbs-Appell formulation of dynamics
10.1.9 Quasi-velocities
In a mechanical system with generalized coordinates q = 〈q1 , . . . , qn 〉, expressions of the
type
Xn
ωs = a s k (q, t )q̇k + a s (q, t ), s = 1, . . . , n (10.21)
k=1
d πs = ωs d t . (10.23)
Remark: There is no reason to expect the expression on the right to be an exact dif-
ferential, therefore although d π s makes sense as defined, it should not be assumed that it
is the differential of a function π s .
For any arbitrary function ϕ(q, t ) we have:
n
X ∂ϕ ∂ϕ
dϕ = d qr + dt
r =1 ∂ q r ∂t
n
X ∂ϕ ∂ϕ
= q̇ r d t + dt
r =1 ∂ q r ∂t
n
n
X ∂ϕ X ∂ϕ
= b r s (q, t )ω s + b r (q, t ) d t + dt
r =1 ∂ q r s =1 ∂t
n
n n
X X ∂ϕ X ∂ϕ ∂ϕ
= b r s (q, t ) d πs + b r (q, t ) dt + dt.
s =1 r =1 ∂ qr r =1 ∂ qr ∂t
and therefore we obtain an expression for the velocity vector in terms of the quasi-coordinates:
vi = ṙi = TODO . . .
ωk = 0, k = 1, . . . , l .
We solve (10.26) for the q̇ in terms of the quasi-velocities. Since the first l of the quasi-
velocities are zero, each q̇ r is a function of ω l +s , s = 1, . . . , n − l . Then we express the
Gibbs function G as a function of
q1 , . . . , q n , ω l +1 , . . . , ωn ω̇ l +1 , . . . , ω̇n .
∂G
= Q l +s , s = 1, . . . , l . (10.27)
∂ ω̇ l +s
Despite the complexity of the derivation, equations (10.27) are much easier to apply than
the original (10.20). In particular, these result in first order differential equations in quasi-
velocities, as apposed to the second order differential equations which we were getting
before.
10.2.1 Pseudocoordinates
Consider the motion of N particles of masses mν and position vectors rν subject to d
algebraic and g differential constraints. Utilizing the d algebraic constraints, we inroduce
m = 3N − d generalized idependent coordinates q = 〈q1 , . . . , q m 〉, and express the system’s
configures in them as
rν = rν (q, t ), ν = 1, . . . , N . (10.28)
From this it follows that
m
X ∂r ν ∂ rν
ṙν = q̇i + , ν = 1, . . . , N , (10.29)
i =1 ∂ qi ∂t
and m
X ∂ rν
δrν = δqi , ν = 1, . . . , N . (10.30)
i =1 ∂ qi
The vectors rν and ṙν should satisfy the g differential constraints:
m
X
lβν (rν , t ) · ṙν + Dβ (rν , t ) = 0, β = 1, . . . , g . (10.31)
ν=1
Substituting for rν and ṙν from (10.28) and (10.29), the constraint equations take the
form m
X
Aβi (q, t )q̇i + Aβ (q, t ) = 0, β = 1, . . . , g . (10.32)
i =1
Thus, the generalized coordinates q1 , . . . , q m can take on arbitrary values, but the gen-
eralized vecocities q̇1 , . . . , q̇ m are constrained by (10.32).
Assuming that the g constraints in (10.32) are indepdendent, that is, the matrix Aβi
has full rank, we may solve for g of the velocities q̇1 , . . . , q̇ m in terms of the remaining
n = m − g = 3N − d − g , the number n being the system’s degrees of freedom. Thus,
the velocities q̇1 , . . . , q̇n may take on arbitrary values, while the remaining g velocities are
determined from (10.32).
In practice, instead of the n generalized velocities noted above, it is often preferable
to use n linear combination of the generalized velocities, such as
m
X
π̇ s = f s i q̇i , i = 1, . . . , n. (10.33)
i =1
The quantities π̇ s defined here are called pseudovelocities. The notation π̇ s is entirely pro
forma—there is no requirement that the right-hand side of (10.33) be and exact derivative,
therefore although the notation π̇ s is well-defined, there is no function π s which it is the
derivative of. Nevertheless, we will have occasions to refer to the un-dotted symbol π s
which has no predefind meaning, but we will define it in an appropriate and constent way.
The symbol π s is called a pseudocoordinate.
We impose an invertibility requirement on the definitions (10.33) as follows. Con-
sider the system of g + n = m equation obtained as the union of the equations (10.32)
and (10.33), and view it as a system of m linear equaitons in the m unknowns q̇1 , . . . , q̇ m .
We require that system to be invertible. Thus, the m generalized velocities may be ex-
pressed in terms of the n pseudovelocities:
n
X
q̇i = hi s (q, t )π̇ s + hi (q, t ), i = 1, . . . , m. (10.34)
s =1
10.2. Gibbs-Appell according to Gantmacher [8] 71
Thus, any set of m generalized velocities that satify the motion constraints (10.32) define
a corresponding set of pseudovelocities π̇ s , and conversely, any set of n arbitrary pseu-
dovelocities define a set of generalized velocities that satify the motion constraints. The
important point here is that there is no constraint on the pseudovelocities.
Let us note that due to the constrain (10.32) on generalized velocities, the generalized
displacement are subject to the constraints
m
X
Aβi δqi = 0, β = 1, . . . , g . (10.35)
i =1
From what it has been said, the union of the equations (10.35) and (10.36) is invertivle
and the inverse has the form
n
X
δqi = hi s δπ s , i = 1, . . . , m. (10.37)
s =1
Thus, as argumed before, the expressions δπ s may take on arbitrary values. The corre-
sponding qi obtained from (10.37) will automatically satisfy the constraints (10.35).
N N m m XN m
X X X ∂ rν X ∂r X
δW = Fν ·δrν = Fν · δqi = Fν · ν δqi = Qi δqi , (10.38)
ν=1 ν=1 i =1 ∂ qi i =1 ν=1 ∂ qi i =1
where
N
X ∂ rν
Qi = Fν · , i = 1, . . . , m. (10.39)
ν=1 ∂ qi
Thus, we have obtained an expression for the virtual work in terms of the virtual
displacements δqi . Utilizing (10.37), we may express the virtual work in terms of the
δπ s :
m
X n
X X m
n X n
X
δW = Qi hi s δπ s , = hi s Qi δπ s , = Π s δπ s , (10.40)
i =1 s =1 s =1 i =1 s =1
The Π s defined above are called the generalized forces corresponding to the pseudocoordiante
π s , s = 1, . . . , n.
72 Chapter 10. The Gibbs-Appell formulation of dynamics
and n
X
r̈ν = eν s (q, t )π̈ s + · · · , ν = 1, . . . , N .
s =1
where the ellipsis indicate terms that are free of the pseudoaccelerations π̈ s , s = 1, . . . , n.
For future reference, let us note that
∂ r̈ν
= eν s . (10.44)
∂ π̈ s
that is
n
X n X
X N
Π s δπ s − mν r̈ν · eν s (q, t ) δπ s = 0,
s =1 s =1 ν=1
and finally
n
X X N
n X
Πs − mν r̈ν · eν s (q, t ) δπ s = 0,
s =1 s =1 ν=1
∂U
= Πs , s = 1, . . . , n, (10.47)
∂ π̈ s
∂U
= Πs , s = 1, . . . , n,
∂ π̈ s
Xm
Aβi (q, t )q̇i + Aβ (q, t ) = 0, β = 1, . . . , g ,
i =1
m
X
π̇ s = f s i q̇i , i = 1, . . . , n.
i =1
and express the result in terms of q, π̇, and π̈, then Π s may be computed more easily
through the formula
∂R
Πs = , s = 1, . . . , n. (10.49)
∂ π̈ s
Consequently, (10.47) takes on the form
∂U ∂R
= s = 1, . . . , n,
∂ π̈ s ∂ π̈ s
G = U − R, (10.50)
74 Chapter 10. The Gibbs-Appell formulation of dynamics
whence kwk2 = ℓ2 (ϕ̈ 2 + ϕ̇ 4 ). Considering that the force vector is f = 〈0, m g 〉, we are
lead to the Gibbs function
1
G = mkwk2 − f · w
2
1
= ℓ2 (ϕ̈ 2 + ϕ̇ 4 ) + m g ℓ(ϕ̈ sin ϕ + ϕ̇ 2 cos ϕ).
2
Since the Gibbs-Appell equations of motion (10.20) reduce to ∂ G/∂ ϕ̈ = 0 in our case,
we conclude that
ℓ2 ϕ̈ + m g ℓ sin ϕ = 0,
which is the familiar equation of motion of a simple pendulum.
m2
ϕ
m1
(x, y)
Figure 10.1: The Čaplygin sleigh’s configuration may be described in terms of the gen-
eralized coordinates x, y, and ϕ.
v1 = ṙ1 = 〈ẋ, ẏ〉, v2 = ṙ2 = 〈ẋ + ℓϕ̇ sin ϕ, ẏ + ℓϕ̇ cos ϕ〉.
and then
Thus, we arrive at
1 1
G = (m1 + m2 )(ẍ 2 + ÿ 2 ) + m2 ℓ2 ϕ̈ 2
2 2
1
− m2 ℓ(ϕ̈ sin ϕ + ϕ̇ 2 cos ϕ)ẍ + m2 ℓ(ϕ̈ cos ϕ − ϕ̇ 2 sin ϕ)ÿ + m2 ℓ2 ϕ̇ 4 .
2
Now, following the idea in (10.11) of eliminating redundant generalized velocities, we
solve (10.52) for ẏ, and eliminate it from the equations. Substituting ẏ = ẋ tan ϕ in the
above expression for G, we obtain
m1 + m2 2 1
G= ẍ + (m1 + m2 )ẋ sin ϕ − m2 ℓϕ̇ cos2 ϕ ϕ̇ ẍ
2
2 cos ϕ 3
cos ϕ
1 1
+ m2 ℓϕ̈ 2 + m ℓẋ ϕ̇ ϕ̈ + · · · , (10.53)
2 cos ϕ 2
where, as before, the ellipsis stands for terms that involve no accelerations.
There are no externally applied forces on the sleigh, therefore the equations of mo-
tion (10.20) reduce to
∂G ∂G
= 0, = 0,
∂ ẍ ∂ ϕ̈
that is,
m1 + m2 1
ẍ + (m1 + m2 )ẋ sin ϕ − m2 ℓϕ̇ cos2 ϕ ϕ̇ = 0, (10.54a)
2
cos ϕ 3
cos ϕ
1
m2 ℓϕ̈ + m ℓẋ ϕ̇ = 0. (10.54b)
cos ϕ 2
We may solve this system of two second order differential equations for the two un-
knowns x(t ) and ϕ(t ), subject to initial conditions x(0) = x0 , ẋ(0) = ẋ0 , ϕ(0) = ϕ0 ,
76 Chapter 10. The Gibbs-Appell formulation of dynamics
and ϕ̇(0) = ϕ̇0 . We may compute ẏ retroactively from the constraint equation ẏ(t ) =
ẋ(t ) tan ϕ(t ), and then integrate it to find y(t ) subject to the initial condition y(0) = y0 .
In practice, when solving the system on a computer, it is easier to adjoin the constraint
equation (10.52) to the two equations, as in
m1 + m2 1
ẍ + (m1 + m2 )ẋ sin ϕ − m2 ℓϕ̇ cos2 ϕ ϕ̇ = 0,
cos2 ϕ cos3 ϕ
1
m2 ℓϕ̈ + m ℓẋ ϕ̇ = 0,
cos ϕ 2
ẋ sin ϕ − ẏ cos ϕ = 0,
and solve the whole thing in one fell swoop for the three unknowns x(t ), y(t ), and ϕ(t )
by applying the initial conditions
Note that there is no initial condition on ẏ(0) since the ẋ0 and ϕ0 determine ẏ(0) through
the constraint equation (10.52).
1 1
G = (m1 + m2 )v̇ 2 + m2 ℓ2 ω̇ 2 + m2 ℓvω ω̇ − m2 ℓω 2 v̇
2 2
1
+ (m1 + m2 )v 2 + m2 ℓ2 ω 2 ω 2 .
2
Equations (10.27) in this case take the form
∂G ∂G
= 0, = 0,
∂ v̇ ∂ ω̇
and thus,
These are much more pleasant-looking equations compared to (10.54) which we had
obtained before. In fact, they are quite amenable to solving by hand. To wit, multiply the
fist equation by v, the second by ω, and add. We get
(m1 + m2 )v v̇ + m2 ℓ2 ω ω̇ = 0,
10.7. The problem from page 63 of Gantmacher 77
that is ·
1 1
(m1 + m2 )v 2 + m2 ℓ2 ω 2 = 0,
2 2
whence
1 1
E = (m1 + m2 )v 2 + m2 ℓ2 ω 2 (10.56)
2 2
is a constant of the motion. (In fact, it is the sleigh’s kinetic energy.) The value of E is
determined by the initial conditions v(0) and ω(0).
Now, multiplying (10.55a) by ℓ/2 and adding to (10.56), we get
1 1
(m1 + m2 )ℓv̇ + (m1 + m2 )v 2 = E,
2 2
that is,
2E
ℓv̇ + v 2 = α2 , where α2 = .
m1 + m2
We solve this separable equation for v subject to the initial condition v(0) = v0 , and obtain
hα v i
v(t ) = α tanh t + tanh−1 0 . (10.57)
ℓ α
We see that lim t →∞ v(t ) = α, that is, the sleigh’s speed approaches α in the long run.
Now that we have v(t ), the angular velocity ω(t ) may be computed easily. From the
1
differential equation ℓv̇ + v 2 = α2 we get v̇ = ℓ (α2 − v 2 ). We substitute this expression
for v̇ in (10.55a), then solve for ω 2 :
m1 + m2 2
ω(t )2 = α − v(t )2 , (10.58)
m2 ℓ 2
where v(t ) is given in (10.57). Note, in particular, that v(t ) → α implies ω(t ) → 0, that
is, the sleigh’s spin slows down to zero in the long run.
Remark 10.1. Let’s observe that (a) From the definition ẋ = v cos ϕ of the speed v we
see that a positive v implies motion in which m1 trails m2 ; and (b) Since tanh is positive
when its argument is positive, the solution (10.57) indicates that regardless of the initial
conditions, v(t ) will be positive for sufficiently large t . Putting these two observations
together, we conclude that regardless of the sleigh’s initial conditions, in the long run it
will orient itself so that m1 trails m2 in its motion.
π̇1 = ẋ ẋ = π̇1
Choice (1) π̇ = ϕ̇ ⇐⇒ ẏ = π̇1 tan ϕ
2
ẋ sin ϕ = ẏ cos ϕ ϕ̇ = π̇2
π̇1 = ẋ sin ϕ ẋ = π̇1 / sin ϕ
Choice (2) π̇ = ϕ̇ ⇐⇒ ẏ = π̇1 / cos ϕ
2
ẋ sin ϕ = ẏ cos ϕ ϕ̇ = π̇2
π̇1 = ẋ/ cos ϕ ẋ = π̇1 cos ϕ
Choice (3) π̇ = ϕ̇ ⇐⇒ ẏ = π̇1 sin ϕ
2
ẋ sin ϕ = ẏ cos ϕ ϕ̇ = π̇2
Table 10.1: There is no unique choice of pseudovelocities for a given problem. The
choices given here are three out of infinite such possibilities for the dumbbell
problem.
These form a set of five first order differential equations in the five unknowns x, y, ϕ, ω,
and v.
This system is solvable in terms of elementary functions, as demostrated in Gant-
macher. Plugging the raw system into Maple does not produce a good result immediately,
but we can maneuver it toward the right solution as follows.
The equations above impliy that ω is a constant, and therefore ϕ = ωt +ϕ0 . Therefore
the system shrinks to
ẋ = v cos ϕ,
ẏ = v sin ϕ.
v̇ + g sin ϕ = 0,
where ϕ = ωt +ϕ0 . In solving this reduced system, Maple distinguishes between the cases
where ω is zero and nonzero. If ω is nonzero, we get something like Gantmacher’s. In
particular, y consists of a linear term in t plus trigonometric function. If, however, ω is
zero and ϕ0 = π/2, we get a −1/2g t 2 term, as expected.
Note: In the above, I calclated the Gibbs function a la Desloge. We may calculate it a la
Gantmacher if we wish. Here is how.
What I have written above as the quasivelocity v, Gantmacher calls it π̇. The equation
ẏ = v sin ϕ then takes the form ẏ = π̇ sin ϕ, whereby we introduce the virtual displace-
ment of the quasicoordinate π through δy = (sin ϕ) δπ.
In accordance with (10.59), the virtual displacements of the masses are
ℓ ℓ
δr1 = δrc − 〈− sin ϕ, cos ϕ〉 δϕ, δr2 = δrc + 〈− sin ϕ, cos ϕ〉 δϕ,
2 2
A force of F = 〈0, −m g 〉 acts on each of the two masses therefore their virtual work
is
The previous chapter’s formulation of the Gibbs-Appell equations focused on the dy-
namics of point masses. Extending the results to rigid bodies is a matter of replacing
the summations with integrals, which is easy in principle, but provides some challenges
in practice. It is possible, with some work, to systematize the treatment and arrive at a
general formula for the Gibbs function which applies to all rigid bodies. Lurie [12] and
Desloge [5], for instance, have such formulas. Lurie’s formula is packaged quite nicely.
Desloge’s result is a little more general but it is not packaged as nicely. Furthermore,
Desloge’s formula is actually incorrect(!) due to a false assumption made in its derivation.
Desloge [6] points out his error and offers a corrected formula.
In the following sections I will offer a blending of Lurie’s and Desloge’s approaches
which combines the nice packaging of the former with the generality of the latter.
The stationary frame is defined by a triad {i, j, k} and the associated origin O. As the
name implies, the stationary frame is fixed (not moving). Typically the i and j
vectors lie in a horizontal plane and the vector k points upward, but that’s not a
requirement.
With the stationary frame we associate a Cartesian coordinate system whose x, y,
and z axes are aligned with the i, j and k vectors, respectively.
Ultimately, the purpose of rigid body dynamics is to express a body’s motion rela-
tive to the stationary frame.
The body frame of reference is defined by a triad {b1 , b2 , b3 } and an associated origin o.
The triad is firmly attached to the body, and therefore moves with it. There are
no restrictions on the choice of the origin, nor on the triad’s orientation. When-
ever possible, however, we set the origin at the body’s center of mass, and orient
the triad along the body’s principal axes of inertia, since that results in significant
simplifications.
81
82 Chapter 11. Rigid body dynamics
Since the triad {b1 , b2 , b3 } is affixed to the body, the triad’s angular velocity is exactly
the body’s angular velocity ω.
where the ellipsis indicates terms that do not involve accelerations, and are, therefore,
immaterial to the Gibbs-Appell equations of motion.
If the origin o of the body frame is chosen to coincide with the body’s center of mass,
then ρc = 0 and (11.1) reduces to
Z
1 1 1
kr̈k2 d m = mkr̈c k2 + ω̇ · Ic ω̇ + (ω̇ × ω) · Ic ω + · · · , (11.2)
B 2 2 2
where rc is the position vector, relative to the stationary frame, of the body’s center of
mass, and Ic is the moment of inertia tensor relative to the center of mass.
j
i
b2
e2
e3 , b3 y
ϕ
b1
x
ψ
e1
a
(x, y, a sin θ)
Figure 11.1: The rolling coin’s configuration is specified through five generalized coor-
dinates x, y, θ, ϕ, and ψ, as shown.. The no-slip conditions at the contact
point with the floor imposes two nonholonomic constraints.
We will need the time derivatives of these vectors shortly, so let’s compute them right
now. We have
ė1 = −ϕ̇ sin ϕ i + ϕ̇ cos ϕ j.
It follows that
ė1 · e1 = 0, ė1 · e2 = ϕ̇ cos θ, ė1 · e3 = −ϕ̇ sin θ,
therefore, according to (8.7), we have ė1 = ϕ̇ cos θ e2 − ϕ̇ sin θ e3 . In a similar manner we
compute ė2 and ė2 and express them in terms of the basis {e1 , e2 , e3 }. Here is what we
get:
Now let us examine the body frame. Referring to Figure 11.1 we have:
We differentiate these with respect to t , and substitute from (11.4) for the derivatives of
ei , and obtain
ḃ1 = −(ϕ̇ cos θ + ψ̇) sin ψ e1 + (ϕ̇ cos θ + ψ̇) cos ψ e2 + (−ϕ̇ sin θ cos ψ + θ̇ sin ψ) e3 ,
(11.6a)
ḃ2 = −(ϕ̇ cos θ + ψ̇) cos ψ e1 − (ϕ̇ cos θ + ψ̇) sin ψ e2 + (ϕ̇ sin θ sin ψ + θ̇ cos ψ) e3 ,
(11.6b)
ḃ3 = ϕ̇ sin θ e1 − θ̇ e2 . (11.6c)
Remark 11.2. The expressions for ḃ3 in (11.6c) and ė3 in (11.4c) agree since b3 = e3 .
Equations (11.6) may be used in conjunction with (7.5) to calculate the body’s angular
velocity vector. Toward that end we compute
ω1b = ḃ2 · b3 = −(ϕ̇ cos θ + ψ̇) cos ψ (e1 · b3 ) − (ϕ̇ cos θ + ψ̇) sin ψ (e2 · b3 )
+ (ϕ̇ sin θ sin ψ + θ̇ cos ψ) (e3 · b3 ),
where the superscript b is to remind us that ω1b is the component of the vector ω in
the the body frame. On the right-hand side we substitute for b3 from (11.5), simplify the
11.6. The no-slip constraint 85
result, and arrive at ω1b = ϕ̇ sin θ sin ψ+ θ̇ cos ψ. Computing the ω2b and ω3b components
in the same way, we arrive at:
Then from the definition (7.3) of the angular velocity vector we conclude that
ω = (ϕ̇ sin θ sin ψ + θ̇ cos ψ) b1 + (ϕ̇ sin θ cos ψ − θ̇ sin ψ) b2 + (ϕ̇ cos θ + ψ̇) b3 . (11.7)
This is the coin’s angular velocity vector expressed in the {b1 , b2 , b3 } basis. By substituting
for {b1 , b2 , b3 } from (11.5), we obtain and expression for ω in the {e1 , e2 , e3 } basis:
We see that the components of ω along the intermediate basis are particularly simple.
Therein lies the significance of the choice of our intermediate basis. Those components
play a central role in what comes next, therefore we name them simply ωi ,13
Note that each ωi is a linear combination of the generalized velocities θ̇, ϕ̇, and ψ̇, and
therefore each ωi is a quasi-velocity as defined in Section 10.1.9. Well, actually ω1 is not
a true quasi-velocity since it is the derivative of the generalized velocity θ̇, but the other
two are honest quasi-velocities since they are not derivatives of anything.
Solving (11.9) as a linear system of three equations for the three unknowns θ̇, ϕ̇, and
ψ̇, we get:
1
θ̇ = ω1 , ϕ̇ = ω , ψ̇ = ω3 − ω2 cot θ. (11.10)
sin θ 2
Equations (11.9) and (11.10) establish a one-to-one correspondence between the gen-
eralized velocities θ̇, ϕ̇, ψ̇, and the quasi-velocities ω1 , ω2 , ω3 . We may formulate the rest
of the analysis in terms of one or the other set of variables. We will do it in terms of the
ω’s since the derivations are easier that way.
For future reference, let us make a note of the following formula whose derivation is
left as an exercise:
rc = x i + y j + a sin θ k. (11.12)
The vector ρ = −ae2 extends from the coin’s center to its contact point with the floor.
Therefore, according to (7.7), the velocity, relative to the coin’s center, of a point on the
13
To be consistent with the b superscript introduced earlier for the components of ω in the {b1 , b2 , b3 } basis,
we should have named its {e1 , e2 , e3 } components with superscripts e. We refrain from doing that, however, to
reduce clutter.
86 Chapter 11. Rigid body dynamics
coin’s rim at the contact point is given by ω ×ρ. Consequently, the velocity of that point
relative to the stationary frame is r˙c + ω × ρ. Since the velocity of the ground at the
contact point is zero, then to prevent slippage, we need
r˙c + ω × ρ = 0. (11.13)
Upon substituting for rc from (11.12), for ω from (11.8), for the ei vectors from (11.3),
and simplifying the result, and we get
ẋ −a θ̇ sin ϕ sin θ +(ϕ̇ cos θ + ψ̇) cos ϕ i + ẏ +a θ̇ cos ϕ sin θ +(ϕ̇ cos θ + ψ̇) cos ϕ j = 0.
ẋ = −a(ω3 cos ϕ − ω1 sin ϕ sin θ), ẏ = −a(ω3 sin ϕ + ω1 cos ϕ sin θ) (11.14)
which express the rolling coin’s nonholonomic constraints. We use these to eliminate ẋ
and ẏ from the rest of the computations.
r˙c = −a(ω3 cos ϕ − ω1 sin ϕ sin θ) i − a(ω3 sin ϕ + ω1 cos ϕ sin θ) i + aω1 cos θ k.
Here we have applied (11.10) to eliminate the generalized velocities θ̇, ϕ̇, ψ̇ in favor of
the quasi-velocities ω1 , ω2 , ω3 . Finally, we compute kr¨c k2 which forms a part of the
problem’s Gibbs function:
where the ellipsis indicate terms that involve no acceleration terms ω̇1 , ω̇2 , ω̇3 .
where
1 1
α = ma 2 , β = ma 2 . (11.17)
4 2
11.8. The rotational acceleration 87
Therefore we have:
The final term is free of accelerations, therefore it may be dropped when forming the
Gibbs function.
In a similar manner we also calculate
Again, the final term is free of accelerations, therefore it may be dropped when forming
the Gibbs function.
According to (11.2) and (10.7), The Gibbs function is
Z Z
1
G= kr̈k2 d m − F · r̈ d m
B 2 B
1 1
= mkr̈c k2 + ω̇ · I ω̇ + (ω̇ × ω) · I ω − (−m g k) · r̈c ,
2 2
which may be evaluated by putting together (11.15), (11.16), (11.18), and (11.19):
1 1 1 1
G = ma 2 [ω̇12 + ω̇32 + 2ω̇1 ω2 ω3 − 2ω1 ω2 ω̇3 ] + αω̇12 + αω̇22 + βω̇32
2 2 2 2
− (ω̇1 ω2 − ω1 ω̇2 )(αω2 cot θ − βω3 ) + m ga ω̇1 cos θ + · · · ,
where, as always, we have dropped terms which do not depend on the accelerations. Fi-
nally, substituting for α and β from (11.17), we arrive at
h5 1 3 1
G = ma 2 ω̇12 + ω̇22 + ω̇32 − ω̇1 ω2 (ω2 cot θ − 6ω3 )
8 8 4 4
1 i
+ ω1 ω̇2 (ω2 cot θ − 2ω3 ) − ω1 ω2 ω̇3 + m ga ω̇1 cos θ.
4
Then from the equations ∂ G/∂ ω̇1 = 0, ∂ G/∂ ω̇2 = 0, ∂ G/∂ ω̇3 = 0 we get
5 1 g
ω̇1 − ω2 (ω2 cot θ − 6ω3 ) + cos θ = 0,
4 4 a
1 1
ω̇ + ω (ω cot θ − 2ω3 ) = 0,
4 2 4 1 2
3
ω̇ − ω1 ω2 = 0.
2 3
Clearly this set of differential equations is under-determined, since it depend on θ which
is also an unknown. However, from (11.10) we see that θ̇ = ω1 . Adjoining this to the
88 Chapter 11. Rigid body dynamics
above gives us a system of four first order differential equations in the four unknowns ω1 ,
ω2 , ω3 , and θ.
In practice, we extend the system by adjoining all three of the equations from (11.10),
and the two equations from (11.14). Thus, we obtain a system of eight first order differ-
ential equations in the eight unknowns ω1 , ω2 , ω3 , θ, ϕ, ψ, x, and y, whose solution
completely determines the coin’s motion. Here is the system in its full glory:
1 4g
ω̇1 − ω2 (ω2 cot θ − 6ω3 ) + cos θ = 0,
5 5a
ω̇2 + ω1 (ω2 cot θ − 2ω3 ) = 0,
2
ω̇3 − ω1 ω2 = 0,
3
θ̇ = ω1 , (11.20)
1
ϕ̇ = ω,
sin θ 2
ψ̇ = ω3 − ω2 cot θ.
ẋ = −a(ω3 cos ϕ − ω1 sin ϕ sin θ),
ẏ = −a(ω3 sin ϕ + ω1 cos ϕ sin θ).
Remark 11.3. We need eight initial conditions to go with these equations. The ini-
tial conditions on x, y, θ, ϕ, and ψ determine the coin’s initial position relative to the
stationary axes, so they may be specified in the obvious way. The initial conditions on
ω1 , ω2 , and ω3 require a more careful consideration. You may recall from (11.9) that the
ω’s were defined as the components of the coin’s angular velocity along the intermediate
frame {e1 , e2 , e3 }. Since the intermediate frame has no immediate physical manifesta-
tion, it is not easy to make up meaningful initial values for the ω’s. We do note, however,
that (11.9) defines the ω’s in terms of the generalized coordinates and their velocities,
which are easier to grasp. Therefore in practice you will make up initial values for θ̇, ϕ̇,
and ψ̇ as desired, then use (11.9) to determine the initial values for ω1 , ω2 , and ω3 .
Exercises
11.1. Derive (11.11).
11.2. Give the details of the computation which leads from (11.13) to (11.14).
11.3. Derive (11.19).
11.4. Equip a rigid body B with a body reference frame {b1 , b2 , b3 } whose origin co-
incides with the body’s center of mass. Additionaly, suppose that the {b1 , b2 , b3 }
is aligned with the body’s moment of inertia axes, so that the moment of inertia
tensor takes the form
I = I1 b1 ⊗ b1 + I2 b1 ⊗ b2 + I3 b1 ⊗ b3 .
Let ω1 , ω2 , ω3 be the components of the body’s angular velocity along the {b1 , b2 , b3 }
vectors, that is
ω = ω1 b1 + ω2 b2 + ω3 b3 .
Show that:
Exercises 89
11.5. Solve the system of differential equations (11.20) and produce an animation of the
coin’s motion.
90 Chapter 11. Rigid body dynamics
Chapter 12
Quaternions
cp = c[ a, u ] = [ ca, cu ], (c ∈ R) (12.1)
p + q = [ a, u ] + [ b , v ] = [ a + b , u + v ], (12.2)
p ◦ q = [ a, u ] ◦ [ b , v ] = [ ab − u · v, av + b u + u × v ]. (12.3)
Note that addition is commutative but multiplication, due to the presence of the u × v
term, is not.
As a special case of (12.3), we have
[ c, 0 ] ◦ [ a, u ] = c[ a, u ] and [ a, u ] ◦ [ c, 0 ] = c[ a, u ],
therefore the quaternion [ c, 0 ] acts exactly like the scalar c under quaternion multiplica-
tion. In particular, the quaternion [ 1, 0 ] is the multiplicative identity in the quaternion
algebra.
In view of the observation above, we adopt the convention of writing c instead of
of [ c, 0 ] when there is no chance of confusion. This is analogous to writing α, instead
of α + 0i when dealing with complex numbers. By the same token, we write u instead
of [ 0, u ]. The quaternion c = [ c, 0 ] is called a scalar quaternion or just a scalar if the
meaning is clear from the context. Similarly, the quaternion u = [ 0, u ] is called a vectorial
quaternion, or just a vector, for short.14
As a consequence of the convention adopted above, the notation u ◦ v, where u and
v are vectors, makes sense, and evaluates to
u ◦ v = [ 0, u ] ◦ [ 0, v ] = [ − u · v, u × v ]. (12.4)
You may want to think of u ◦ v as the “quaternion product” of the vectors u and v,
but note that the product is a quaternion, not a vector, unless u · v = 0, in which case
u ◦ v = u × v.
14
The terminology is not quite standardized. What we have called a scalar quaternion is also called a real
quaternion, and what we have called a vectorial quaternion is also called a pure quaternion or imaginary quater-
nion.
91
92 Chapter 12. Quaternions
which shows that the parenthesized expression is the left multiplicative inverse of p. Re-
peating the calculation, beginning with p ◦ p∗ , we see that the parenthesized expression
is also the right multiplicative inverse of p. We conclude that if |p| is nonzero, then p has
a multiplicative inverse, p−1 , given by
1 ∗
p−1 = p, (|p| 6= 0).
|p|2
Remark 12.1. This section has touched on just a few basic concepts of the quaternion
algebra which will be needed in what follows. For an in-depth study of quaternions see
Altmann [1]. For a leisurely introduction to quaternions, with applications to computer
graphics, see Hanson [9].
The right-hand side has a well-known geometric interpretation. Suppose the tails of the
vectors n and u are attached to a common point o, and let P be a plane through o and
perpendicular to n, as illustrated in Figure 12.1, and let Q be the plane of the vectors n
12.2. The geometry of the quaternions 93
u − 2(n · u)n
u
o
n
Q
and u. Then it is simple exercise to show that u − 2(n · u)n is the mirror reflection of the
vector u through the plane P . Thus, we have established the following:
Theorem 12.1. The reflection operator Rn into a plane with a unit normal n is given by
Rn u = n ◦ u ◦ n, u ∈ E3 . (12.8)
Lemma 12.2. Suppose the planes P1 and P2 , with unit normals n1 and n2 , make a dihedral
angle of ϕ/2 with each other. Let Rn1 and Rn2 be the reflection operators into the planes P1 and
P2 , and let Rn,ϕ be the rotation operator by angle ϕ about the vector n = (n1 ×n2 )/kn1 ×n2 k.
Then Rn,ϕ = Rn2 Rn1 .
P2 P2
n2 n2
R n2 R n1 u
n1 n1
R n1 u
P1 P1
n ϕ ϕ
2 2
u
(a) Planes P1 and P2 , shown edge- (b) Reflecting the arbitrary vector u into the
wise, have unit normals n1 and plane P1 produces Rn1 u. Reflecting the lat-
n2 , and make a dihedral angle ter into the plane P2 results in Rn2 Rn1 u.
of ϕ/2. The unit vector n =
(n1 × n2 )/kn1 × n2 k lies along
their intersection, and points
out of the picture, toward you,
therefore it is not visible.
P2 P2
n2 n2
P1′
P1′
R n2 R n1 u R n2 R n1 u
n1 R n2 u n1
R n1 u R n1 u
P1 P1
ϕ ϕ ϕ ϕ
2 2 2 2
u n u
(c) We introduce the auxiliary plane P1′ (shown as a (d) The vector Rn2 produced by reflecting u into P2 coin-
dashed line) which is the reflection of the plane P1 cides with the reflection of Rn2 Rn1 u into P1′ . From
into the plane P2 , therefore it makes a dihedral an- the symmetries of the diagram it is evident that a rota-
gle of ϕ/2 with P2 . tion by an angle ϕ about the vector n takes the plane
plane P1 to the plane P1′ . It follows that the same ro-
tation takes the vector u to the vector Rn2 Rn1 u, thus
proving that Rn,ϕ = Rn2 Rn1 .
Theorem 12.3. The operator Rn,ϕ of rotation by angle ϕ about a unit vector n has an
associated unit quaternion
ϕ ϕ
q = [ cos , n sin ] (12.9)
2 2
so that
Rn,ϕ u = q ◦ u ◦ q∗ , for all u ∈ E3 . (12.10)
Proof. Pick any two planes P1 and P2 which intersect along a line parallel to n, and whose
dihedral angle is ϕ/2. Then according to Theorem 12.1 we have
Rn1 u = n1 ◦ u ◦ n1 . Rn2 u = n2 ◦ u ◦ n2 ,
and according to Lemma 12.2 we have Rn,ϕ = Rn2 Rn1 . Thus, we compute
The negative sign in the final expression is immaterial since q appears twice in the rotation
formula (12.10). Finally, the assertion that q is a unit quaternion follows immediately from
the form of (12.9).
Remark 12.2. The planes P1 and P2 and their associated unit normals n1 and n2 which
enter the proof above are conveniences for the proof. They do not appear in the theorem’s
final result.
We may be more explicit by setting q0 = cos ϕ/2 and q = n sin ϕ/2 and simplifying (12.10)
to get
Rn,ϕ u = q ◦ u ◦ q∗ = (n · u)n + u − (n · u)n cos ϕ + (n × u) sin ϕ,
Suppose that the instantaneous angular velocity vector under this rotation at time t is
ω(t ). Then, according to (7.7), we have:
Comparing (12.12) and (12.13) we expect a relationship between q(t ) and ω(t ). This is
stated in the following:
Theorem 12.4. Let q(t ) be any time-varying unit quaternion, and let ω(t ) the angular
velocity of the rotation (12.12) engendered by q(t ). Then we have
ω = 2q̇ ◦ q∗ . (12.14)
Proof. Since q(t ) is a unit quaternion, we have |q(t )|2 = q(t ) ◦ q∗ (t ) = 1, and therefore
q̇ ◦ q∗ + q ◦ q̇∗ = 0. But since q ◦ q̇∗ = (q̇ ◦ q∗ )∗ , it follows that q̇ ◦ q∗ + (q̇ ◦ q∗ )∗ = 0, which
indicates that the scalar part of the quaternion q̇ ◦ q∗ is zero, that is, q̇ ◦ q∗ is a vectorial
quaternion. This justifies defining the vector ω through the quaternion product (12.14).
It remains to show that ω thus defined is indeed the motion’s angular velocity.
Toward that end, let us compute ṙ by differentiating (12.12):
ṙ = q̇ ◦ r0 ◦ q∗ + q ◦ r0 ◦ q̇∗ .
ṙ = q̇ ◦ q∗ ◦ r + r ◦ q ◦ q̇∗
= q̇ ◦ q∗ ◦ r + r ◦ (q̇ ◦ q∗ )∗
1 1 ∗ 1
= ω ◦ r+r ◦ ω = (ω ◦ r + r ◦ ω ∗ ).
2 2 2
We leave it as an easy exercise to show that evaluating the right-hand side with the help of
the identity (12.4) leads to
ṙ = ω × r (12.15)
which shows that ω defined in (12.14) is indeed the sought-after angular velocity.
12.4. A differential equation for the quaternion rotation 97
ω = ω1 q ◦ i ◦ q∗ + ω2 q ◦ j ◦ q∗ + ω3 q ◦ k ◦ q∗
= q ◦ (ω1 i + ω2 j + ω3 k) ◦ q∗ .
2q̇ ◦ q∗ = q ◦ (ω1 i + ω2 j + ω3 k) ◦ q∗ ,
whence
1
q̇ = q ◦ (ω1 i + ω2 j + ω3 k). (12.18)
2
This system of first order differential equations expresses the evolution of q in time. To-
gether with the equations in (12.16), these determine the body’s orientation in space. We
will adjoin them to the differential equations of dynamics to obtain a complete system of
differential equations that describes the body’s motion.
The differential equation’s initial condition, q(0), which specifies the body’s initial
orientation, is a unit quaternion, as any quaternion associated with a rotation ought to be.
A question arises whether the solution q(t ) of the differential equation is a unit quaternion
for all t . The answer is yes, as it follows from Theorem 12.5 below.
for the quaternion q(t ), where a(t ) is an arbitrary time-dependent vector, preserves the norm,
that is |q(t )| = |q0 | for all t . In particular, if q0 is a unit quaternion, then q(t ) is a unit
quaternion for all t .
Proof. We calculate the derivative of |q(t )|2 and substitute for q̇ from the differential
98 Chapter 12. Quaternions
equation:
d 2 d
|q| = (q ◦ q∗ ) = q̇ ◦ q∗ + q ◦ q̇∗
dt dt
= (q ◦ a) ◦ q∗ + q ◦ (q ◦ a)∗ = (q ◦ a) ◦ q∗ + q ◦ (a∗ ◦ q∗ ).
In the last step we have used the quaternion conjugation property (p ◦ q)∗ = q∗ ◦ p∗ ; see
Exercise 12.5. Now note that for any vector a we have a∗ = [ 0, a ]∗ = [ 0, −a ] = −a.
d
Then it follows that d t |q|2 = 0, therefore the norm of q remains constant.
q = [ q0 , q1 i + q2 j + q3 k ],
ẋ = R j · ω and ẏ = −R i · ω (12.20)
therefore 7
1
G1 = M R2 (ω̇12 + ω̇22 + ω̇32 ) − (k · ω̇)2 .
2 5
To compute the slug’s Gibbs function, let us write r for its position vector. We have:
r = rc + ab3 .
which we group as
r̈ = r̈c + a (ω̇2 + ω3 ω1 )b1 − (ω̇1 − ω2 ω3 )b2 − (ω12 + ω22 )b3 ,
Finally, we compute
1
G2 = mkr̈k2 − (−m g k) · r̈,
2
and then substitute for b1 , b2 , and b3 from (12.16).
The fully expanded expression of G2 is horrendously large. For all practical purposes
it is impossible to compute it with bare hands. Doing it with a symbolic computational
software, such as MAPLE or MATHEMATICA, however, is not a problem.
The composite ball’s Gibbs function os G = G1 + G2 . The equations of motion are
∂G ∂G ∂G
= 0, = 0, = 0,
∂ ω̇1 ∂ ω̇2 ∂ ω̇3
These along with the two equations (12.20) and the four equations (12.19) form a system
of nine first order differential equations in the nine unknowns x, y, ω1 , ω2 , ω3 , q0 , q1 ,
q2 , q3 .
The initial conditions for the first five have immediate physical meanings and their
specifications are intuitive and easy. The x and y place the ball’s center at a desired loca-
tion, and ω1 , ω2 , ω3 impart it an initial velocity. (Remember that are the components
of the angular velocity in the body triad. The ball’s initial orientation is specified through
q0 , q1 , q2 , q3 as follows.
Begin with the ball oriented so that the body triad {b1 , b2 , b3 } is lined up with the
stationary {i, j, k} triad. Then apply a rotation Rn,ϕ about a unit vector n and rotation
angle ϕ to orient the ball as desired. Any orientation in space may be achieved through
the appropriate choices of n and ϕ. The quaternion associated with that rotation is given
in (12.9). Therefore
ϕ ϕ ϕ ϕ
q0 (0) = cos , q1 (0) = n1 sin , q2 (0) = n2 sin , q3 (0) = n3 sin . (12.23)
2 2 2 2
Remark 12.5. To make an animation of the rolling ball, it is suffices to design the ball
graphics in a reference position, say where the body axes is aligned with the stationary
axes. Then, after having solved the differential equations and computed x(t ), y(t ), q0 (t ),
q1 (t ), q2 (t ), q3 (t ), rotate the reference ball about the vector q1 (t ) i + q2 (t ) j + q3 (t ) k by
the angle ϕ(t ) obtained from q0 (t ) = cos ϕ(t )/2, that is, ϕ(t ) = 2 cos−1 q0 (t ). Then trans-
late the rotated ball’s center to the location x(t ) i + y(t ) j + R k. MAPLE’s plottools
package provides the commands rotate() and translate() for rotating and trans-
lating graphics objects. Figure 12.3 shows the ball in its reference position (on the left),
and a frame from an animation sequence (on the right).
Exercises
12.1. Prove the identity (12.6).
12.2. In Figure 12.1, n is a unit vector to the plane P , and u is an arbitrary vector. Why
is u − 2(n · u)n the mirror reflection of u through the plane P ?
12.3. Let q = n2 ◦ n1 , where n1 and n2 are any two vectors. Show that q∗ = n1 ◦ n2 .
Exercises 101
Figure 12.3: On the left is a sample graphics of the unbalanced ball in the reference
position. On the right is a frame from an animation sequence which in-
cludes the trace of the ball’s contact point with the floor. This particular
animation was produced with the parameters R = 1, a = −3/5. M = 100,
m = 25, g = 1. The initial orientation was set through n = i, ϕ = π/2;
see (12.23). The remaining initial conditions were x(0), y(0), ω1 (0), ω3 (0)
all zeros, and ω2 (0) = 0.1.
p ◦ (q ◦ r) = (p ◦ q) ◦ r.
[1] Simon L. Altmann, Rotations, Quaternions, and Double Groups, Clarendon Press, Oxford, 1986.
[2] Stuart S. Antman, The simple pendulum is not so simple, SIAM Rev. 40 (1998), no. 4, 927–930.
[3] Roy Bowen M. and C.-C. Wang, Introduction to vectors and tensors, Vol. 1, Plenum Press, New York, 1976.
[4] Edward A. Desloge, Classical Mechanics, Vol. 1, John Wiley, 1982.
[5] Edward A. Desloge, Classical Mechanics, Vol. 2, John Wiley, 1982.
[6] Edward A. Desloge, The Gibbs-Appell equations of motion, American Journal of Physics 56 (1988), no. 9,
841–846.
[7] I. M. Gelfand and S. V. Fomin, Calculus of Variations, Prentice-Hall, 1963.
[8] F. Gantmacher, Lectures on Analytical Mechanics, Foundations of Engineering Mechanics, Mir Publishers,
Moscow, 1975. Translated from the Russian by George Yankovsky.
[9] Andrew J. Hanson, Visualizing Quaternions, Morgan Kaufmann Publishers, San Franciso, 2006.
[10] Heinrich Hertz, The Principles of Mechanics Presented in a New Form (English Translation), translated by D.
E. Jones and J. T. Walley, Macmillan and Co., New York, 1899.
[11] A. I. Lur’e, Analiticheskaya mekhanika, Gosudarstv. Izdat. Fiz.-Mat. Lit., Moscow, 1961 (Russian).
[12] A. I. Lurie, Analytical mechanics, Foundations of Engineering Mechanics, Springer-Verlag, Berlin, 2002.
Translated from the Russian by A. Belyaev.
[13] James Stewart, Multivariable Calculus, Brooks/Cole, 2008.
103