Abstract Algebra LN
Abstract Algebra LN
4CCM121A/5CCM121B
1 Introduction 5
1.1 What is abstract algebra? . . . . . . . . . . . . . . . . . . . . 5
1.2 Sets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.3 Logic and proofs . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.4 Mathematical induction . . . . . . . . . . . . . . . . . . . . . 9
1.5 Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2 The integers 13
2.1 The Division Algorithm . . . . . . . . . . . . . . . . . . . . . 13
2.2 The Euclidean Algorithm . . . . . . . . . . . . . . . . . . . . . 15
2.3 Relatively prime integers . . . . . . . . . . . . . . . . . . . . . 18
2.4 Linear Diophantine equations . . . . . . . . . . . . . . . . . . 19
2.5 Prime factorization . . . . . . . . . . . . . . . . . . . . . . . . 23
3 Binary operations 29
3.1 Binary operations . . . . . . . . . . . . . . . . . . . . . . . . . 29
3.2 Composition of functions . . . . . . . . . . . . . . . . . . . . . 33
3.3 Arithmetic modulo n . . . . . . . . . . . . . . . . . . . . . . . 35
4 Groups 41
4.1 Definition of a group . . . . . . . . . . . . . . . . . . . . . . . 41
4.2 Examples of groups . . . . . . . . . . . . . . . . . . . . . . . . 42
4.3 Permutation groups . . . . . . . . . . . . . . . . . . . . . . . . 46
4.4 Basic properties of groups . . . . . . . . . . . . . . . . . . . . 53
4.5 Powers of group elements . . . . . . . . . . . . . . . . . . . . . 56
4.6 Orders of group elements . . . . . . . . . . . . . . . . . . . . . 59
4.7 Subgroups . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 62
4.8 Cyclic groups . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
4.9 Cosets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
4.10 Lagrange’s Theorem . . . . . . . . . . . . . . . . . . . . . . . 75
4.11 Product groups . . . . . . . . . . . . . . . . . . . . . . . . . . 79
3
4 CONTENTS
4.12 Homomorphisms . . . . . . . . . . . . . . . . . . . . . . . . . 81
5 Rings 93
5.1 Definition of a ring . . . . . . . . . . . . . . . . . . . . . . . . 93
5.2 Examples of rings . . . . . . . . . . . . . . . . . . . . . . . . . 94
5.3 Basic properties of rings . . . . . . . . . . . . . . . . . . . . . 98
5.4 Subrings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
5.5 Groups of units . . . . . . . . . . . . . . . . . . . . . . . . . . 104
5.6 Types of rings . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
5.7 Matrix rings . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
5.8 Ring homomorphisms . . . . . . . . . . . . . . . . . . . . . . . 111
5.9 The Chinese Remainder Theorem . . . . . . . . . . . . . . . . 115
5.10 Polynomial rings . . . . . . . . . . . . . . . . . . . . . . . . . 120
Chapter 1
Introduction
5
6 CHAPTER 1. INTRODUCTION
The first two above are examples of rings; the third is a group. These are two
particular types of algebraic structures. There are other types of algebraic
structures, but these two are the most fundamental. We’ll focus mainly on
groups this semester. We’ll start with the precise definition of a group in
the abstract; i.e., we’ll give the axioms which must be satisfied by the binary
operation on S in order for S to be called a group. Then we’ll deduce general
consequences from the axioms, i.e., we’ll prove theorems about groups. This
is the abstraction, an important tool in mathematics, but almost worthless
without examples. So along the way we’ll consider lots of examples and see
how the theorems apply to them.
Before studying groups though, we’ll study the most familiar and funda-
mental ring, namely Z, the set of integers. We’ll be considering its algebraic
properties, i.e., how the integers behave with respect to addition and mul-
tiplication. I’ll assume the most basic properties of the integers. The rest
of this chapter and the beginning of the next comprise a review of material
I’m assuming you’ve seen before, including the fact (see Theorem 2.2.2) that
the gcd of a and b is also the least positive integer of the form ax + by for
x, y 2 Z. Recall also that in practice you can compute gcd(a, b) and find x
and y using the Euclidean Algorithm; see Examples 2.2.4 and 2.2.5.
1.2 Sets
A set is a collection of things, called elements or members of the set. Here
are some familiar sets with special names:
• The set with no elements is called the empty set and denoted ;.
B = {. . . , 4, 2, 0, 2, 4, 6, . . .}.
N \ {n 2 Z| 3 n 3 } = { 4, 5, 6, . . . }.
• 9!” means “there exists a unique.” For example, 9!n 2 N such that
n2 < 3.
1.5 Functions
Finally recall that a function from a set A to a set B is a rule that assigns
exactly one element of B to each element of A. We write f : A ! B to
mean that f is a function from A to B, and for a 2 A, we write f (a) for the
element of B that the function assigns to a. A function is often defined by a
formula. Consider for example the function f : R ! R defined by f (x) = x2 .
Note that some elements of B may be assigned to more than one element of
A, or to none at all. In the example of f (x) = x2 , we have f ( 3) = f (3) = 9,
but there is no x 2 R such that f (x) = 1.
1.5. FUNCTIONS 11
p
• the modulus function | | from C to R defined by |x + iy| = x2 + y 2 .
You can think an element a in the domain A as “input” for the function a,
and the value f (a) 2 B as “output.” You might also sometimes think of f
as a “mapping” from A to B and represent it in a diagram by an arrow from
A to B.
Note that di↵erent elements of A can be assigned the same value in B;
for example sin 0 = sin ⇡ = 0. Also, not every element of b needs to be a
value of the function; for example, there is no x 2 R such that sin x = 2.
What is required is that for each a 2 A, there is exactly one element of B
which we call f (a). So some non-examples of functions are
Some functions do have the property that no two distinct elements of the
domain are assigned the same value; i.e., if a and a0 are assigned the same
value by f , then a = a0 . Such functions are called injective is one-to-one.
a, a0 2 A, f (a) = f (a0 ) ) a = a0 .
12 CHAPTER 1. INTRODUCTION
The integers
2. If a|n, then a n.
1. m = qn + r, and
2. 0 r < n.
Moreover, q and r are the unique pair of integers such that these both hold.
1
An “algorithm” for finding quotients and remainders for integer division is embedded
in its proof. As an algorithm though it’s rather inefficient. You learned a more practical
method of finding q and r in primary school.
13
14 CHAPTER 2. THE INTEGERS
Proof. Let P (m) be the “existence” assertion, i.e., that there exist q, r 2 Z
such that 1) m = qn+r, and 2) 0 r < n. We first prove P (m) for m 0 by
induction on m (where n is now an arbitrary fixed positive integer). We start
the induction argument by noting not only that P (0) is true, but indeed that
P (0), P (1), . . . , P (n 1) are all true, for if 0 m < n, then we can take q = 0
and r = m. Now suppose that m n and that P (0), P (1), . . . , P (m 1) are
all true. Let m0 = m n. Then 0 m0 < m, so P (m0 ) is true. This means
we can write:
m = n + q 0 n + r0 = (1 + q 0 )n + r0 = qn + r,
m = q 0 n + r0 and 0 r0 < n.
m= q0n r0 = n q0n + n r0 = qn + r
m = q1 n + r1 = q2 n + r2 and 0 r1 , r2 < n.
Theorem 2.2.2 Suppose that a, b 2 Z and that a and b are not both zero,
and let g = gcd(a, b). Then
r = a mq
= a (ax + by)q
= a(1 xq) + b( yq).
Since the remainders keep decreasing, we must eventually get rn = 0 for some
n > 0.
To illustrate the roles of a and b in starting the process, we could let
r 1 = a and r0 = b, so that our sequence of equations now reads:
r 1 = q1 r 0 + r1
r0 = q2 r 1 + r2
r1 = q3 r 2 + r3
..
.
rn 3 = qn 1 rn 2 + rn 1
rn 2 = qn r n 1
Proof. If d|a and d|b, then d|c since c = a bk. So if d is a common divisor
of a and b, then it is a common divisor of a and c as well. Similarly, if d|b
2.2. THE EUCLIDEAN ALGORITHM 17
and d|c, then d|a (and d|b), so in fact the common divisors of a and b are the
same as the common divisors of b and c. Therefore gcd(a, b) = gcd(b, c). ⇤
In view of the equations ri 1 = qi+1 ri + ri+1 for i = 0, . . . , n 2, the
proposition shows that
gcd(a, b) = gcd(r 1 , r0 ) = gcd(r0 , r1 ) = · · · = gcd(rn 2 , rn 1 ).
Since rn 1 |rn 2 , we have gcd(rn 2 , rn 1 ) = rn 1 .
Example 2.2.4 We work through the Euclidean algorithm for a = 114,
b = 42:
114 = 2 · 42 + 30
42 = 1 · 30 + 12
30 = 2 · 12 + 6
12 = 2 · 6 + 0.
So the last non-zero remainder is 6, and indeed gcd(114, 42) = 6.
The Euclidean algorithm also provides an algorithm for finding integers
x and y so that gcd(a, b) = ax + by. Note that we can rewrite the first n 1
equations of the Euclidean Algorithm, in reverse order, as:
rn 1 = rn 3 qn 1 r n 2
rn 2 = rn 4 qn 2 r n 3
..
.
r 2 = r 0 q2 r 1
r 1 = r 1 q1 r 0 .
Since gcd(a, b) = rn 1 , the first equation gives gcd(a, b) in terms of rn 3 and
rn 2 . Using the next equation to substitute for rn 2 , we get gcd(a, b) in terms
of rn 4 and rn 3 . Iterating this, we eventually get it in terms of r 1 = a and
r0 = b.
Example 2.2.5 Again consider a = 114, b = 42. “Unwinding” the equa-
tions from Example 2.2.4 gives:
6 = 30 2 · 12
12 = 42 30
30 = 114 2 · 42.
Substituting each equation into the previous one gives:
6 = 30 2 · 12
= 30 2 · (42 30) = 2 · 42 + 3 · 30
= 2 · 42 + 3(114 2 · 42) = 3 · 114 8 · 42.
So we get 6 = gcd(114, 42) in the form 114x 42y by taking x = 3, y = 8.
18 CHAPTER 2. THE INTEGERS
Corollary 2.3.3 Suppose that a, b 2 Z, not both 0. Let g = gcd(a, b). Then
a/g and b/g are relatively prime.
Proof. If b|c, then c = bm for some m 2 Z. If also a|c, then a|bm. Since
a and b are relatively prime, Cor. 2.3.4 implies that a|m. This means that
m = an for some n 2 Z. Therefore c = abn, so c is divisible by ab. ⇤
Again, note that it’s not always the case that if a|c and b|c, then ab|c.
(Take for example a = b = c = 2.) We need a and b to be relatively prime to
draw this conclusion.
ax + by = c,
Theorem 2.4.1 Suppose that a, b, c 2 Z and that a and b are not both zero.
Then the equation ax+by = c has solutions x, y 2 Z if and only if gcd(a, b)|c.
Since mk and nk are integers, we’ve shown that ax+by = c for some x, y 2 Z
(namely x = mk and y = nk). ⇤
3
Recall the “if and only if” means each assertion implies the other. We’ve just shown P
) Q. Now we have to prove the converse, that Q ) P, i.e., that if g|c, then the equation
has integer solutions.
20 CHAPTER 2. THE INTEGERS
Example 2.4.2 Consider the equation 114x + 42y = 660. We know that
gcd(114, 42) = 6 (see Example 2.2.4) and 6|660, the equation has solutions.
The Euclidean Algorithm gives 114 · 3 + 42 · ( 8) = 1 (see Example 2.2.5).
Multiplying through by 660/6 = 110 gives
Example 2.4.3 The equation 21x+35y = 900 has no integer solutions since
900 is not divisible by gcd(21, 35) = 7.
Now we explain how to find all the solutions assuming we’ve found one.
b a
x = x0 + k · , y = y0 k· for k 2 Z.
g g
Example 2.4.6 Let’s apply this to find all solutions of 114x+42y = 660. We
already found one solution in Example 2.4.2, namely x0 = 330, y0 = 880.
Since a/g = 19, b/g = 7, Thm. 2.4.5 gives all the solutions as:
Example 2.4.7 Let’s find all solutions of 114x + 42y = 660 with x, y 2 N
(the set of positive integers). From Example 2.4.6, this translates into the
inequalities
x = 330 + 7k > 0, y = 880 19k > 0.
Solving the first inequality for k gives 7k > 330, so k > 330/7 ⇡ 47.1.
Solving the second gives 19k < 880, so k < 880/19 ⇡ 46.3. The only
integer satisfying these inequalities is k = 47. Substituting this into the
formulas for x and y gives the solution:
x = 1, y = 13.
Example 2.4.8 You’re in a shop where apples cost 27p and oranges cost
69p. What are all the possible ways of spending exactly £8.40 on apples and
oranges? This translates into finding all solutions of
69 = 2 · 27 + 15
27 = 1 · 15 + 12
15 = 1 · 12 + 3
12 = 4·3 + 0.
15 = 69 2 · 27
12 = 27 15
3 = 15 12.
3 = 15 12
= 15 (27 15) = 2 · 15 27
= 2 · (69 2 · 27) 27 = 2 · 69 5 · 27.
x0 = 1400, y0 = 560.
3. Since a/g = 9 and b/g = 23, Thm. 2.4.5 gives all integer solutions as:
n = p1 p2 · · · pk and n = q1 q2 · · · q` ,
n = n0 n00 = q1 q2 · · · qi r1 r2 · · · rj ,
2.5. PRIME FACTORIZATION 25
n = p 1 p 2 · · · p k = q1 q 2 · · · q`
n = p 1 p 2 · · · p k = q1 q 2 · · · q`
Corollary 2.5.5 Suppose that m and n are integers, not both 0. Then m
and n are relatively prime if and only if they have no common prime divisors.
Proof. Recall that m and n are relatively prime if their greatest common
divisor is 1. So if m and n have a common prime divisor, say p, they cannot
be relatively prime. This shows that if m and n are relatively prime, then
they have no common prime divisor.
Now suppose that m and n have no common prime divisor, and let g =
gcd(m, n). We must show that g = 1. We will assume that g > 1 and arrive
at a contradiction. If g > 1 by Thm. 2.5.4, g has a prime divisor p (take
p = p1 for example). Since p|g, and g|m and g|n, we conclude that p|m
26 CHAPTER 2. THE INTEGERS
and p|n, contradicting our assumption that m and n have no common prime
divisor. Therefore g = 1. ⇤
For example, we can see easily that 867 and 3500 are relatively prime,
without applying the Euclidean algorithm, or even finding all the prime fac-
tors of 867. We can just notice that 3500 = 102 · 35 = 22 53 7, and check
that 867 is not divisible by these primes: 2, 5 and 7. To see that 867 is not
divisible by 2 or 5, we can just look at the last digit. If 867 were divisible
by 7, then so would be 860, and therefore so would be 86 (since 10 is not
divisible by 7), but 86 = 12 · 7 + 2.
We can also give a criterion for one positive integer to be divisible by
another in terms of their prime factorizations. Suppose that m and n are
integers greater than 1. First write the prime factorization of m in the form
where p1 , p2 , . . . , pk are distinct prime numbers (no two are the same), and
r1 , r2 , . . . , rk are positive integers (for example, 3500 = 22 53 7). Note that
the primes p1 , p2 , . . . , pk are precisely the prime divisors of m. It is clear that
each pi is a prime divisor of m; conversely if p|m and p is prime, then by
Cor. 2.5.3, p|pi for some i. Since pi is prime, and p > 1, it follows that p = pi .
Similarly we can write
n = q1s1 q2s2 · · · q`s` ,
where q1 , q2 , . . . , q` are the distinct prime divisors of n and s1 , s2 , . . . , s`
are positive integers. Now in the list q1 , q2 , . . . , q` of primes dividing n, some
might already be in the list p1 , p2 , . . . , pk of primes dividing m, while others
might not. Rather than distinguish between the possibilities and try to keep
extend the list of primes p1 , p2 , . . . , pk so that it includes all the prime divisors
of n, and we would still have an expression for m of the form:
r1 = s1 + u1 , r2 = s 2 + u 2 , ..., rk = sk + uk .
• p1 = 2, p2 = 3, p3 = 5, p4 = 7,
28 CHAPTER 2. THE INTEGERS
• r1 = 2, r2 = 0, r3 = 3, r4 = 1,
• s1 = 3, s2 = 2, s3 = 0, s4 = 1.
Since s1 > r1 (or s2 > r2 ), we see by part 1 of the corollary that 3500 is
not divisible by 504. To compute gcd(m, n) using part 2 of the theorem,
we find t1 = min(2, 3) = 2, t2 = min(0, 2) = 0, t3 = min(3, 0) = 0 and
t4 = min(1, 1) = 1, so gcd(m, n) = 22 30 50 71 = 28.
Chapter 3
Binary operations
29
30 CHAPTER 3. BINARY OPERATIONS
a b c
a a b c
b b a c
c c a b,
so for example b ⇧ c = c.
(f g)(x) = f (g(x)),
• You know that addition and multiplication on Z (or N, etc.) are as-
sociative, subtraction on Z and division on the set of non-zero real
numbers are not.
a+b (a + b)/2 + c a b c
(a ⇤ b) ⇤ c = ⇤c= = + +
2 2 4 4 2
is not always the same as
b+c a + (b + c)/2 a b c
a ⇤ (b ⇤ c) = a ⇤ = = + + .
2 2 2 4 4
On the other hand, the operation a ? b = a on Z is associative since
(a ? b) ? c = a ? c = a and a ? (b ? c) = a ? b = a.
a (b c) = 3 (7 9) = 3 4 = 7.
3.2. COMPOSITION OF FUNCTIONS 33
(a b) c = (3 7) 9 = 10 9 = 7.
The proof consists simply of unravelling the definitions of all the composites.
We already noted the two functions in question both have domain A and
codomain D, and for all a 2 A, we have
So we can omit the parentheses and just write h g f for this function,
which we can think of schematically as “combining” the three arrows in the
diagram
f g h
A ! B ! C ! D.
Def inition 3.3.1 Suppose that a and b are integers. We say that a is con-
gruent to b modulo n if a b is divisible by n. The notation for this is
a ⌘ b (mod n).
Proposition 3.3.2 Suppose that a, b and n are integers and n > 0. Then
the following are equivalent:
(a) a ⌘ b mod n;
36 CHAPTER 3. BINARY OPERATIONS
{ b 2 Z | b ⌘ a (mod n)}
So for example
where
[0]3 = { . . . , 6, 3, 0, 3, 6, 9, 12, . . . },
[1]3 = { . . . , 5, 2, 1, 4, 7, 10, 13, . . . },
and [2]3 = { . . . , 4, 1, 2, 5, 8, 11, 14, . . . }.
We are now going to define binary operations, namely an addition and a
multiplication, on Zn . We begin with addition. We’d like to define the sum
of two congruence classes, say [a]n and [b]n , by the formula
(the + inside the brackets being the usual addition of integers), but there
could be a problem with this. To define the binary operation + on the set Zn ,
we have to define the element X + Y 2 Zn for each X, Y 2 Zn . But for any
given X, there are infinitely many integers a such that X = [a]n , and similarly
for Y (using capital letters here to remind us that X and Y are in fact sets).
For example, our proposed definition says that [3]10 + [8]10 = [3 + 8] = [11]10 ,
but [3]10 is the same congruence class modulo 10 as [ 17]10 , so it had better be
true that [3]10 + [8]10 = [ 17]10 + [8]10 . And indeed it is, since [11]10 = [ 9]10 .
This was just an example. We need to check that whenever [a]n = [a0 ]n , our
formula gives the same value for the sum [a0 ]n + [b]n as for the sum [a]n + [b]n .
Similarly we could have chosen another integer in the congruence class of b
to compute the sum of [a]n and [b]n , and the result shouldn’t depend on this
choice. So to check that the sum is well-defined, we have to check that if
[a]n = [a0 ]n and [b]n = [b0 ]n , then [a + b]n = [a0 + b0 ]n . Similarly, we’d like to
define the product of [a]n and [b]n by the formula
but for this to be well-defined, we need to check that if [a]n = [a0 ]n and
[b]n = [b0 ]n , then [ab]n = [a0 b0 ]n . Fortunately, we essentially proved this
already in Prop. 3.3.4.
Proof. If [an ] = [a0 ]n and [b]n = [b0 ]n , then a ⌘ a0 mod n and b ⌘ b0 mod n
by Prop. 3.3.2. So by Prop. 3.3.4,
Groups
1. ⇤ is associative;
Before considering examples, here are some remarks about the properties:
1. You might also have seen the definition of a group include a closure
axiom, stating that if g, h 2 G, then g ⇤ h 2 G. This is already included
in the definition I gave (Defn. 3.1.1) for ⇤ to be a binary operation on
G.
41
42 CHAPTER 4. GROUPS
0 + n = n + 0 = n 8n 2 Z,
n + ( n) = ( n) + n = 0.
We’ll usually just write (G, ⇤) instead of “G with the operation ⇤.” So the
preceding example would be denoted (Z, +). We’ll even often omit ⇤ when
it’s clear from the context.
Example 4.2.2 (Z, ·) is not a group. 1) and 2) are satisfied (the identity
element being 1), but not every element of Z has an inverse with respect to
multiplication. For example, there is no integer n such that 2n = 1. (In fact
the only elements with inverses are ±1.)
Example 4.2.3 Let’s try to fix the last example by considering (R, ·) instead
of (Z, ·). Now 2 has an inverse, namely 1/2. But no, it’s still not a group; 0
has no inverse.
Example 4.2.4 Let’s try again. Let R⇥ = R \ {0} be the set of non-zero
real numbers1 . Note that the product of two elements of R⇥ is again in R⇥ ,
and now 1), 2) and 3) are satisfied, so (R⇥ , ·) is a group.
Example 4.2.5 (M2 (R), +) is a group. Recall M2 (R) denotes the set
✓ of 2⇥2◆
0 0
real matrices. 1) Matrix addition is associative, 2) the matrix
✓ 0 ◆
0
a b
is an identity element, and 3) the (additive) inverse of A = is
✓ ◆ c d
a b
A= .
c d
1
See Section 1.2 for the \ notation
4.2. EXAMPLES OF GROUPS 43
gives AB = BA = I.
The first property shows that GL2 (R) is closed under matrix multiplication,
i.e., if A, A0 2 GL2 (R), then AA0 2 GL2 (R). Note also that if A 2 GL2 (R)
and B is its inverse matrix, then
• Three reflections (one axis through each vertex); call these 1, 2 and
3 (ordering the vertices clockwise).
44 CHAPTER 4. GROUPS
Z⇥
n = { [a]n 2 Zn | gcd(a, n) = 1 },
so Z⇥
n is the set of congruence classes [a]n whose elements are relatively prime
to n. (Note that we can replace a by any b congruent to a modulo n and this
won’t change the congruence class [b]n = [a]n . Either of the two preceding
propositions shows that the condition gcd(a, n) = 1 in the definition of Z⇥ n
depends only on [a]n , not on the choice of integer a in the congruence class.
Some examples are
Z⇥
6 = {1, 5}, and Z⇥
7 = {1, 2, 3, 4, 5, 6}.
Def inition 4.3.1 For any set A, we define the identity function on A as
the function
Note that the condition g f = idA means that g(f (a)) = a for all a 2 A;
and the condition f g = idB means that f (g(b)) = b for all b 2 B. For
4.3. PERMUTATION GROUPS 47
Therefore g f is injective.
2) Suppose next that f and g are surjective. We need to show that if
c 2 C, then b = (g f )(a) for some a 2 A. Since g is surjective, we know
that c = g(b) for some b 2 B. Since f is surjective, we know that this
b = f (a) for some a 2 A. So (g f )(a) = g(f (a)) = g(b) = c. Therefore g f
is surjective.
3) This is immediate from parts 1) and 2). ⇤
We are now ready to describe which functions have inverse functions.
Therefore f is injective.
Now we prove f is surjective. Suppose that b 2 B. We must show that
b = f (a) for some a 2 A. Let a = g(b). Then
f (a) = f (g(b)) = b.
2) Now suppose that f is bijective. Recall that this means for each b 2 B,
there is a unique a 2 A such that f (a) = b. We want to show that f has an
inverse function g : B ! A. We define g : B ! A as follows: For b 2 B,
we let g(b) be the unique element a 2 A such that f (a) = b. This is a
function from B to A, since for each b 2 B, we have specified a single value
a = g(b) 2 A. We now show that g is an inverse function of f . We must
again show two things: that g(f (a)) = a for all a 2 A, and that f (g(b)) = b
for all b 2 B.
Suppose first that b 2 B. Then f (g(b)) = f (a) where a = g(b) is, by the
definition of the function g, the unique element of A such that f (a) = b. So
f (g(b)) = f (a) = b.
Now suppose that a 2 A. We must show that g(f (a)) = a. We just saw
that f (g(b)) = b for all b 2 B. Applying this to b = f (a) gives f (g(f (a))) =
f (a). Thus letting a0 = g(f (a)), we have f (a0 ) = f (a), and since f is assumed
to be injective, this implies that a = a0 , i.e., g(f (a)) = a. Therefore g is an
inverse function of f . ⇤
We are now ready to define symmetric groups, or permutation groups. Let
A be any set. We’ll mainly be concerned with the case where A is a finite
set, but we’ll start out working in more generality. We define SA to be the
set of bijective functions from A to A.
Proposition 4.3.6 If A is a set, then SA is a group under .
Proof. First we have to check that if f, g 2 SA , then f g 2 SA , so that is
indeed a binary operation on SA . By definition, the composite f g is again
a function from A to A, and it is bijective by Part 3 of Prop. 4.3.4.
Now we need to check SA with the binary operation satisfies the group
axioms. We already know that is associative. we also already saw that the
identity function idA on A satisfies
idA f = f idA = f
for all functions f : A ! A, so in particular for all f 2 SA . Note also that idA
is bijective, so idA 2 SA is an identity element for the operation . Finally
Prop. 4.3.5 shows that f has an inverse function g : A ! A. We have to
check that g 2 SA , i.e., that g is bijective. One way to see that it is bijective
is to note that it has an inverse function, namely f , and apply Prop. 4.3.5
again.
Thus (SA , ) satisfies the group axioms. ⇤
The group SA under is called the symmetric group, or permutation
group, on A, and its elements are called permutations of A. Now let’s sup-
pose that A is finite. Assume even more specifically, that A = {1, 2, . . . , n}
50 CHAPTER 4. GROUPS
n! = n(n 1)(n 2) · · · 2 · 1
and chase through what happens to each integer under first ⌧ , and then ,
giving ✓ ◆
1 2 3 4 5 6
⌧= .
3 5 6 2 1 4
The other standard notation is cycle notation. If a1 , a2 , . . . , ak are distinct
elements of {1, 2, . . . , n} (so k n), we write
(a1 a2 a · · · ak )
If n > 3, then Sn has elements which are not cycles. A general fact is
that every element can be written as a product of disjoint cycles, i.e., in the
form
(a1 a2 · · · ak1 )(ak1 +1 ak1 +2 · · · ak1 +k2 ) · · · (ak1 +k2 +···+kr 1 +1 · · · ak1 +k2 +···+kr ),
where a1 , a2 , . . . , ak1 +k2 +···+kr are distinct. We won’t prove this, but here’s
how it works in practice: Take an element a 2 {1, 2, . . . , n} and list the
values a1 = a, a2 = (a1 ), a3 = (a2 ), . . . until we get back to a. If k is the
least integer such that (ak ) = a, then one of the cycles in the expression for
is (a1 a2 · · · ak ), whose e↵ect is described by:
a1 ! a2 ! · · · ! ak ! a1 .
Let’s carry this out for the cycle of Example 4.3.7. Starting from a = 1
gives:
1 ! 2 ! 3 ! 5 ! 1,
yielding the 4-cycle (1235). We have now accounted for 1, 2, 3 and 5, and
starting from 4 gives (46). So a cycle expression for is
(1235)(46).
Note that we can “cycle” through the entries in a cycle without changing
it, so we could have written, say (3512) instead of (1235). It’s also the
case that we can write disjoint cycles in any order without altering their
composite. In other words, disjoint cycles “commute.” So we could just as
well write the cycle expression for as (46)(1235). For the sake of being
systematic though, I’ve been writing the cycles starting with the smallest
integer whenever there’s a choice.
As an example of computing a composite in cycle notation, let’s redo the
calculation of ⌧ in Example 4.3.7. We’ve already seen that = (1235)(46)
and ⌧ = (1234), so we want to compute
(1235)(46)(1234).
Note that we start with the rightmost cycle (1234) and “move” to the left
since that is the convention for composition of functions, but within each
cycle, the entries “move” from left to right (until we get to the last entry).
Since we want to write our answer in cycle notation, we next compute the
value at 3:
(1234) (46) (1235)
3 ! 4 ! 6 ! 6,
and then 6:
(1234) (46) (1235)
6 ! 6 !4 ! 4,
and then 4 and so on until we get back to 1. Eventually we find ⌧ has the
e↵ect:
1 ! 3 ! 6 ! 4 ! 2 ! 5 ! 1.
Since this accounts for all the elements {1, 2, 3, 4, 5, 6}, we conclude that ⌧
is the 6-cycle (136425), which is indeed consistent with the calculation in
Example 4.3.7.
4.4. BASIC PROPERTIES OF GROUPS 53
Def inition 4.4.2 Suppose that (G, ⇤) is a group and a 2 G. Then the
inverse of a is the unique element b 2 G such that a ⇤ b = b ⇤ a = e; we
denote this element by a 1 .
Here are some more notational conventions: just as for multiplication of
real numbers, we often omit the symbol for the binary operation in a group
(especially for an “abstract” group) and simply write ab instead of a ⇤ b.
A binary operation is often denoted + When some sort of “addition” un-
derlies the definition (for example, addition of matrices). In that case, we
would denote the inverse of a by a instead of a 1 . The symbol + is also
usually reserved for commutative binary operations. Groups for which the bi-
nary operation is commutative have a special name, after the mathematician
Abel.
Def inition 4.4.3 We say a group (G, ⇤) is an abelian group if the operation
⇤ is commutative; i.e., a ⇤ b = b ⇤ a for all a, b 2 G.
Among the above examples, (Z, +), (R⇥ , ·), (M2 (R), +), (Zn , +) and (Z⇥
n , ·)
are abelian groups, but GL2 (R), Dn and Sn (for n 3) are non-abelian.
Here’s an important property that applies to all groups; it’s called the
Cancellation Law:
Proposition 4.4.4 Suppose that G is a group and a, b, c 2 G. If ab = ac or
ba = ca, then b = c.
Proof. If ab = ac, then
You can think of this as a statement about the “multiplication table” for the
group. Suppose G has only finitely many elements, say g1 , g2 , . . . , gn . The
entries in the row of a are then
ag1 , ag2 , . . . , agn .
The corollary says that each element b 2 G appears exactly once in this list.
So b appears exactly once in each row of the table. Similarly, b appears
exactly once in each column of the table.
Here are some more general properties of inverses:
Proposition 4.4.6 Suppose G is a group and g, h 2 G. Then
1
1. If ab = e, then a = b and b = a 1 .
1
2. (ab) = b 1a 1.
3. (a 1 ) 1
= a.
Proof. 1) If ab = e, then since b 1 b = e, Prop. 4.4.4 implies that a = b 1 .
Similarly, since aa 1 = e, Prop. 4.4.4 implies that b = a 1 .
2) Note that
(ab)(b 1 a 1 ) = a(b(b 1 a 1 )) = a((bb 1 )(a 1 )) = a(ea 1 ) = aa 1
= e.
So applying part 1) (with ab in place of a and b 1 a 1 in place of b) shows
that b 1 a 1 = (ab) 1 .
3) Since aa 1 = e, part 1) with a 1 in place of b shows that a = (a 1 ) 1 . ⇤
Recall that b = a 1 means that ab = e and ba = e. The content of part
1) is that it’s enough to know that either of these equalities hold; the other
follows. Also, note in part 2) that the inverse of ab is given by b 1 a 1 (the
order reverses), and this is not necessarily the same as a 1 b 1 unless G is
abelian.
Finally one more remark on notation. So far I’ve been very careful to
place parentheses and show how associativity is being applied. For example,
the proof of part 2) of the above proposition begins with 3 applications of
associativity to shift around the parentheses, replacing an expression of the
form a(bc) with (ab)c or vice-versa. By now you should realize that we don’t
really need the parentheses. Since a(bc) = (ab)c, we just write abc instead
(or a ⇤ b ⇤ c if the operation is ⇤), and more generally we write a1 a2 · · · an .
Note that the order of placement of the elements matters (unless the group is
abelian), but the order in which the operation is applied to adjacent elements
doesn’t matter. With this mind, the formula in the proof of part 2) above
becomes:
(ab)(b 1 a 1 ) = abb 1 a 1 = aea 1 = aa 1 = e.
56 CHAPTER 4. GROUPS
• For the group of non-zero real numbers under multiplication, xn has its
usual meaning for x 2 R⇥ , n 2 Z.
0 1
• Suppose G = S6 and is the 6-cycle (123456). Then = e, = ,
and 2 = = (123456)(123456) has the e↵ect:
1 ! 2 ! 3, 2 ! 3 ! 4
3 ! 4 ! 5, 4 ! 5 ! 6
5 ! 6 ! 1, 6 ! 1 ! 2,
• Consider the group (Z10 , +). Since the group operation is based on ad-
dition, we’ll use the “additive” notation. Let’s compute some multiples
of [4] in Z10 : 1 · [4] = [ 4] = [6], 0 · [4] = [0], 1 · [4] = [4], 2 · [4] = [8],
3 · [4] = [2], etc. It’s easy to see that n · [a], the nth multiple of [a] in
Zm , is given by [na]. If n > 0, this is clear from definitions since
Z⇥
100 = { [1], [3], [7], [9], [11], [13], . . . , [97], [99] },
the included residue classes being those with last digits 1, 3, 7, 9. Com-
puting a few powers of [19], we have
Just as with addition of congruence classes, it’s easy to see that [a]nm =
[an ]m is n is positive (where an is the “usual” nth power of a). Note
though that this formula makes no sense if n is negative since an is not
an integer. To see how to compute [19] 1 in Z⇥ 100 for example, let’s
recall what this means (see the proof of Prop. 4.2.13). We need to find
a congruence class [x] 2 Z⇥ 100 so that [19x] = [19][x] = [1]. In other
words, we want 19x ⌘ 1 mod 100, or 19x 100y = 1 for some y 2 Z.
We can solve this using the Euclidean Algorithm:
100 = 5 · 19 + 5, 19 = 3 · 5 + 4, 5 = 4 + 1,
which gives
Now let’s return to the general situation and show that the powers of an
element satisfy the laws of exponents.
Proof. We’ll just prove the first formula and leave the second as an exercise.
If m and n are both positive, then it is clear that g m g n = g m+n since
gg · · · g gg · · · g = gg · · · g .
| {z } | {z } | {z }
m times n times (m + n) times
g m g n = (g m
) 1 (g n
) 1
= (g n
g m
) 1
= (g (m+n)
) 1
= g m+n .
Note that when using additive notation, the laws of “exponents” become:
(m · a) + (n · a) = (m + n) · a, m · (n · a) = (mn) · a
• g 0 = e;
• g n = (g n
) 1
for n < 0.
n 4 3 2 1 0 1 2 3 4 5 6 7
···
gn ⇢2 e ⇢1 ⇢2 e ⇢1 ⇢2 e ⇢1 ⇢2 e ⇢1
Similarly, for G = Z10 under + taking powers of [4] (or “multiples” since the
operation is additive) gives
n 4 3 2 1 0 1 2 3 4 5 6 7
···
n · [4] [4] [8] [2] [6] [0] [4] [8] [2] [6] [0] [4] [8]
g n = g dm = (g d )m = em = e
e = g n = g dq+r = g dq g r = (g d )q g r = eq g r = g r .
But since r < d and d is the least positive integer such that g d = e, it follows
that r cannot be a positive integer. The only possibility then is that r = 0,
so n = dq is divisible by d. ⇤
gm = gn ) gm n = e (as in part 1)
) d|(m n) (by Thm. 4.6.2)
) m ⌘ n (mod d) by definition of congruence.
Note also that we can “reverse” the argument; i.e., each ) can be replaced
by ,, so we see in fact that g m = g n , m ⌘ n (mod d). ⇤
Note that part 2) of the corollary describes what we saw in the examples
of ⇢1 2 D3 and [4] 2 Z10 ; part 1) describes what we saw for 2 2 R⇥ .
We’ve been considering the order of an element of a group (Defn. 4.6.1).
There is also the notion of the order of a group, which is just its size.
Def inition 4.6.4 Suppose that G is a group. If G has infinitely many ele-
ments, we say G has infinite order. Otherwise we say G has finite order,
and we define the order of G to be the number of elements in G.
So for example, the group D3 has order 6; its element ⇢1 has order 3. We’ll see
later how the two notions are related. For now let’s just note the following:
Corollary 4.6.5 If a group G has finite order, then so does every element
of G.
Proof. If g 2 G has infinite order, then Cor. 4.6.3 shows that its powers g n
would give infinitely many distinct elements of G. ⇤
Let’s consider the order of some permutations. The computation of the
powers of = (123456) in S6 shows that has order 6. We saw also that its
inverse is (165432) = (654321). In fact, in general:
2. has order k.
4.7 Subgroups
Def inition 4.7.1 Suppose that (G, ⇤) is a group. A subset H ✓ G is called
a subgroup of G if H, with the operation ⇤, is a group.
e0 ⇤ h = h ⇤ e0 = h for all h 2 H.
1. h, h0 2 H ) h ⇤ h0 2 H;
2. e 2 H;
1
3. h 2 H ) h 2 H.
64 CHAPTER 4. GROUPS
Note that the set of even integers is the special case where m = 2. Taking
m = 1 instead gives all of Z; taking m = 0 would give the subgroup {0}.
The set of integer multiples of m is usually denoted mZ. We shall see that
every subgroup of Z is of this form.
Remark 4.7.4 In the statement of the proposition, we really need all three
conditions to guarantee that H is a subgroup. You might wonder if two of
the conditions imply the third and are therefore sufficient to imply that H
is a subgroup, but this isn’t the case. For example, taking G = Z:
3. The empty set ; satisfies 1) and 3) but not 2), so it is not a subgroup.
(Note that 3) for example means that n must be in the subset when-
ever n is, but n is never in this subset, so the condition is automatically
satisfied.)
H = { gn | n 2 Z }
is a subgroup of G.
Proof. As usual, we verify 1), 2) and 3) of Prop. 4.7.2.
0
1. Suppose h, h0 2 H. Then h = g n , h0 = g n for some n, n0 2 Z, so
0 0
hh0 = g n g n = g n+n by Prop. 4.5.1, so hh0 2 H.
2. By definition e = g 0 , so e 2 H.
by Prop. 4.5.1, so h 1 2 H.
So H is a subgroup. ⇤
• The subgroup of Z5 generated by [4] is {[0], [4], [3], [2], [1]}, which is all
of Z5 .
• The subgroup of Z⇥ ⇥
5 generated by [4] is {[1], [4]} since [4] = [1] in Z5 .
2
Proof. 1) Recall from Corollary 4.6.3 that if g has infinite order, then the
group elements g n for n 2 Z are distinct. Since these are in hgi, we see that
hgi has infinitely many elements.
2) The second part of Corollary 4.6.3 says that if g has order d, then g m = g n
precisely when m ⌘ n (mod d), or equivalently, when m and n have the same
remainder on division by d. Therefore the elements of hgi are precisely the
elements g r as r runs through the possible remainders {0, 1, . . . , d 1}. So
hgi = {e, g, g 2 , . . . , g d 1 } has precisely d elements. ⇤
Going back to some of the examples:
• [4] in Z10 has order 5, and so does h[4]i = {[0], [4], [8], [2], [6]}.
• [4] in Z5 has order 5, and so does h[4]i = {[0], [4], [3], [2], [1]}.
In the last example, the subgroup generated by g was the whole group G.
• Z⇥n might or might not be cyclic, depending on n. It’s easy to see that
the group is cyclic for the first few values n = 1, 2, . . . , 7; for example
Z⇥ ⇥
7 = h[3]i since the powers of [3] in Z7 are:
n 0 1 2 3 4 5
[3]n [1] [3] [2] [6] [4] [5].
• Z10 has order 10, and the orders of its elements are:
element [0] [1] [2] [3] [4] [5] [6] [7] [8] [9]
order 1 10 5 10 5 2 5 10 5 10.
So the elements of Z10 which are generators are [1], [3], [7] and [9].
Recall that an abelian group is one in which the binary operation is com-
mutative.
Proposition 4.8.6 If G is a cyclic group, then G is abelian.
Proof. If G is cyclic, then G = hgi for some g 2 G. We must show that
hk = kh for all h, k 2 G. Since G = hgi, we know that h = g m and k = g n
for some m, n 2 Z. Therefore
hgi = { g n | n 2 Z }
Proof. Recall that the order of g a is the least positive integer n such that
g an = (g a )n = e. Since we are assuming g has order d, Thm. 4.6.2 shows
that g an = e if and only if d|an. Now let b = gcd(a, d). Since b divides both
a and d, we see that d/b, a/b and na/b are all integers. Moreover
d|na , (d/b)|(a/b)n.
(To see this, note that na = kd for some k 2 Z if and only if na/b = k(d/b)
for some k 2 Z.) By Cor. 2.3.3, a/b and d/b are relatively prime, so by
Cor. 2.3.4, we see that if (d/b)|n(a/b), then (d/b)|n. Also, if n is divisible by
d/b, then so of course is n(a/b), so
(d/b)|n(a/b) , (d/b)|n.
Therefore
(g a )n = e , (d/b)|n.
So the smallest positive integer n for which (g a )n = e is the smallest positive
integer divisible by d/b, which of course is d/b itself. Therefore the order of
g a is d/b. ⇤
This gives a quick way to compute the order of any element of a cyclic
group, once we have a generator. Consider for example the group G = Z⇥ 7.
We saw this was cyclic, generated by [3]7 , so [3]7 has order 6 and each element
70 CHAPTER 4. GROUPS
can be written as a power of 3. Here then is a table with the order of each
element of Z⇥
7:
a 0 1 2 3 4 5
[3]a [1] [3] [2] [6] [4] [5]
gcd(a, 6) 6 1 2 3 2 1
order of g a 1 6 3 2 3 6.
As another example, consider the element [4]10 in Z10 . Since [1]10 has
order 10, the order of [4]10 = 4 · [1]10 is 10/ gcd(4, 10) = 10/2 = 5. In fact,
we can now easily compute the order of any element of Zn :
Proof. The element [1]n has order n, so Theorem 4.8.7 shows that [a]n =
a·[1]n has order n/ gcd(a, n). Recall from Prop. 4.8.5 that [a]n generates Zn if
and only if [a]n has order n, which by Cor. 4.8.8 is equivalent to gcd(a, n) = 1.
So we have:
hg b i = { (g b )k | k 2 Z}
4.9 Cosets
Now we’ll introduce the notion of the cosets of a subgroup. We’ll use this to
prove an important theorem about finite groups, called Lagrange’s Theorem.
g ⇤ H = {g ⇤ h|h 2 H }
If omitting the symbol for the binary operation, we would write gH instead
of g ⇤ H.
⇢1 H = { ⇢1 h | h 2 H } = {⇢1 e, ⇢1 1} = {⇢1 , 3}
are the same subsets of G, so eH = 1 H is a single coset. (Note that for any
G and H, we have H = eH, so the subgroup H is itself a left coset of H in
G.) Let’s compute all the left cosets of H in D3 . We’ve already computed
gH for g = e, 1 and ⇢1 ; the remaining values of g give
⇢2 H = {⇢2 , 2 }, 2H = { 2 , ⇢2 } and 3H = { 3 , ⇢1 }.
eH = 1H = {e, 1 },
⇢1 H = 3 H = {⇢1 , 3 }
and ⇢2 H = 2 H = {⇢2 , 2 }.
Note that each left coset of H in D3 contains the same number of elements.
Furthermore each element of D3 appears in exactly one of the left cosets of
H. We’ll see this is what happens in general, but first let’s consider another
example.
4.9. COSETS 73
a + hni = { a + kn | k 2 Z }
Recall from Prop. 3.3.2 that b is of the form a+kn for some k 2 Z if and only
if b ⌘ a mod n; i.e., if and only b is in the congrunce class [a]n . So the left
coset a + hni is simply the congruence class [a]n . In particular each integer
is in exactly one of these left cosets; there is no overlap among them.
1. g 0 H = gH;
2. g 0 2 gH;
3. g 1 g 0 2 H.
Proof. We have g = ge 2 gH, so g 2 gH. To see that this is the only left
coset of H in G containing g, suppose that g is in the left coset g 0 H, where
g 0 2 G. Prop. 4.9.4 (with the roles of g and g 0 reversed) shows that in fact
gH = g 0 H. ⇤
Since we now know that gH is the only left coset of H in G containing g,
we can call it the left coset of H in G containing g,
We saw the assertion of the corollary explicitly in Examples 4.9.2 and
4.9.3, but let’s consider one more example:
Example 4.9.6 Consider
G = Z⇥
13 = {[1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12]}
and H = h[3]i = {[1], [3], [9]}. (Since 33 ⌘ 1 mod 13, we get that [3] has
order 3 in G and there are 3 elements in h[3]i.) We know that H = eH is a
4.10. LAGRANGE’S THEOREM 75
left coset of H in G. Another left coset is [2]H = {[2], [6], [5]}. We now know
[a]H for a = [1], [2], [3], [5], [6], [9]. We find also that [4]H = {[4], [12], [10]}
and [7]H = {[7], [8], [11]}. We’ve now accounted for all the elements of G in
some left coset of H in G, so we have a complete list of the left cosets:
{[1], [3], [9]}, {[2], [5], [6]}, {[4], [10], [12]} and {[7], [8], [11]}.
gH = { gh | h 2 H }
Proof. The proof is now a simple counting argument using Cor. 4.9.5 and
Lemma 4.10.1.
Let n be the order of G and let d be the order of H. According to
Cor. 4.9.5, each element of G is in exactly one left coset of H in G. So n, the
number of elements of G, is gotten by adding up the numbers of elements in
these left cosets. But Lemma 4.10.1 states that the number of elements in
each coset is d. Therefore
n = d
|+d+{z· · · + d},
k times
where k is the number of left cosets of H in G. Therefore n = kd, so n is
divisible by d. ⇤
The idea of the proof is already visible in our computations of cosets in
Examples 4.9.2 and 4.9.3. In Example 4.9.2 where G = D3 and H = {e, 1 },
we saw that the 6 elements of D3 were divided into the 3 left cosets of H in
G, each of which had exactly 2 elements (2 being the order of H), so 6 = 3·2.
Similarly for G = Z⇥
13 and H = {1, 4, 9}, we found that the 12 elements of G
were divided into the 4 cosets of H in G, each of which had 3 elements, so
12 = 4 · 3.
Def inition 4.10.3 If H is a subgroup of a group G, then the number of left
cosets of H in G is called the index of H in G, and denoted [G : H].
If G is finite, then we see that [G : H] is the number k = n/d in the above
proof of Lagrange’s Theorem. For example, the index of {e, 1 } in D3 is 3.
But if G is infinite we can still define the index of a subgroup H, and this
index may be finite or infinite. For example, if n > 0, then the index of hni
in Z is the number of left cosets of hni in Z, i.e., the number of congruence
classes modulo n, which is simply n. The subgroup {0} has infinite index
in Z, but for a more interesting example with infinite index, consider the
subgroup SL2 (R) of GL2 (R) If A 2 GL2 (R), then the left coset
ASL2 (R) = { AB | B 2 SL2 (R) }
is the set of 2 ⇥ 2-matrices with the same determinant as A since
C 2 ASL2 (R) , A 1 C 2 SL2 (R) , det(A 1 C) = 1 , det A = det C.
So there is one left coset for each possible determinant; i.e., for each non-zero
real number.
Lagrange’s Theorem is a statement about the order of any subgroup of G,
but it also tells us something about the order of any element of G. Recall if
g 2 G, then the order of g is the smallest positive integer d such that g d = e.
4.10. LAGRANGE’S THEOREM 77
Proof. Recall (Prop. 4.8.3) that the order of g is the same as the order of
hgi, the subgroup of G generated by g. So apply Lagrange’s Theorem to
H = hgi to conclude that the order of g divides the order of G. ⇤
Recall for example that the possible orders of elements of D3 are 1 (the
identity), 2 (the three rotations) and 3 (the two rotations). These are all
divisors of 6 (the order of D3 ), confirming what Cor. 4.10.4 says in this
example. Note that Cor. 4.10.4 does not say that every positive divisor of n
is the order of an element of G. The order of D3 is 6, which has divisors 1,
2, 3 and 6. While D3 has elements of orders 1, 2 and 3, it has no element
of order of 6. (Indeed if it did, the group would have to be cyclic, but it is
not even abelian.) Similarly, Lagrange Theorem (4.10.2) does not say that
every divisor of n occurs as the order of a subgroup of G. It happens to be
the case for G = D3 that it has subgroups of orders 1, 2, 3 and 6, but we’ll
see examples later where not every divisor occurs.
Here’s another consequence of Thm. 4.10.2:
Proof. Let d denote the order of g. Cor. 4.10.4 tells us that d|n, i.e., that
n = dk for some k 2 Z. Therefore g n = g dk = (g d )k = ek = e. ⇤
The following consequence of Lagrange’s Theorem is called Fermat’s
Little Theorem:
1. ap ⌘ a mod p;
[ap 1 ]p = [a]pp 1
= [1]p ,
For another example, let’s compute 50100 mod 103. Now 103 is prime, and
50 is not divisible by 103, so by Fermat’s Little Theorem, 50102 ⌘ 1 mod 103.
At first glance that doesn’t seem to help much, but we can think of this as
saying that
50100 · 502 ⌘ 1 (mod 103),
or that [50100 ] is the inverse of [502 ] in Z⇥ 2
103 . So we could compute 50 , find
the remainder on division by 13, and then use the Euclidean Algorithm to
find the multiplicative inverse. (An alternative would be to first find the
multiplicative inverse of 50 and then square; either works.) Since 2500 ⌘
28 mod 103, we compute:
103 = 3 · 28 + 19, 28 = 19 + 9, 19 = 2 · 9 + 1,
giving
1 = 19 2 · 9 = 19 2(28 19) = 3 · 19 2 · 28
= 3(103 3 · 28) 2 · 28 = 3 · 103 11 · 28.
4.11. PRODUCT GROUPS 79
where [a] 2 means ([a]2 ) 1 , and this is computed as above using the Euclidean
algorithm.
Def inition 4.11.1 Suppose that A and B are sets. The product of A and
B is defined to be the set
A ⇥ B = { (a, b) | a 2 A, b 2 B }.
A ⇥ B = {(0, 0), (0, 1), (0, 2), (1, 0), (1, 1), (1, 2)}.
(g, h)((g 0 , h0 )(g 00 , h00 )) = (g, h)(g 0 g 00 , h0 h00 ) = (g(g 0 g 00 ), h(h0 h00 )),
and similarly
((g, h)(g 0 , h0 ))(g 00 , h00 ) = (gg 0 , hh0 )(g 00 , h00 ) = ((gg 0 )g 00 ), (hh0 )h00 ).
Since G and H are groups, the binary operations on G and H are associative,
so g(g 0 g 00 ) = (gg 0 )g 00 and h(h0 h00 ) = (hh0 )h00 . Therefore
Example 4.11.3 Suppose that m and n are positive integers, and consider
the groups Zm (under addition modulo m) and Zn (under addition modulo
n). We can then form the product group Zm ⇥ Zn . Since Zm has m elements
and Zn has n elements, it follows that Zm ⇥ Zn has mn elements.
4.12. HOMOMORPHISMS 81
4.12 Homomorphisms
Roughly speaking, a homomorphism is a function from one group to another
that is compatible with their algebraic structure. Since more than one group
is involved in defining this notion, it’s helpful at first to have symbols in place
for their binary operations.
82 CHAPTER 4. GROUPS
Def inition 4.12.1 Let (G, ⇤G ) and (H, ⇤H ) be groups. A function :G!
H is a homomorphism (of groups) if
Note that g and g 0 are elements of the group G (the domain of ), so it makes
sense to apply ⇤G to them. This gives an element g ⇤G g 0 2 G, and as this is
the domain of , we can apply to g ⇤G g 0 to get an element (g ⇤G g 0 ) of
the codomain H. On the other hand, it also makes sense to apply to g and
g 0 , giving elements (g), (g 0 ) 2 H. We can then apply the binary operation
⇤H on H to get an element (g) ⇤H (g 0 ) 2 H. The definition says that is
a homomorphism if for every pair of elements g, g 0 2 G, these two di↵erent
procedures:
(gg 0 ) = (g) (g 0 ).
for all a, b 2 Z, where the middle equality is just the definition of the opera-
tion + on Zn .
(the middle equality being a consequence of Prop. 4.5.1, the law of exponents
in groups). Note that the preceding example is a special case of this one,
with G = Z and g = n.
2. (g 1 ) = ( (g)) 1 .
(the last equality by Part 1). Therefore Prop. 4.4.6 implies that (g 1 ) =
( (g)) 1 . (Note that the inverse on the left-hand-side is with respect to the
operation on G, and on the right it is with respect to the operation on H.)
⇤
The parts of the proposition can be viewed as special cases (n = 0 and
n = 1) of the following general property of homomorphisms, for which the
proof is left as an exercise.
Proposition 4.12.10 Suppose : G ! H is a homomorphism of groups.
If g 2 G and n 2 Z, then (g)n = (g n ).
We also note another general property of homomorphisms whose proof
is left as an exercise: the composite of two homomorphisms is a homomor-
phisms.
4.12. HOMOMORPHISMS 85
f( ) f( )
e e (23)
⇢ (123) ⇢ (13)
⇢2 (132) ⇢2 (12).
Therefore is an isomorphism.
On the other hand, consider the function : R⇥ ⇥ R⇥ ! C⇥ de-
fined by ((x, y)) = x + iy. This function is not a homomorphism since
((u, v)(x, y)) = ((ux, vy)) = ux + ivy is not in general the same as
((u, v)) ((x, y)) = (u + iv)(x + iy) = (ux vy) + i(uy + vx). (The function
also fails to be surjective since 1 = 1 + i · 0 is not the value of ((x, y)) for
any x, y 2 R⇥ .)
86 CHAPTER 4. GROUPS
Example 4.12.15 Consider the groups Z⇥ 8 = {[1]8 , [3]8 , [5]8 , [7]8 } and Z2 ⇥
Z2 = {([0]2 , [0]2 ), ([0]2 , [1]2 ), ([1]2 , [0]2 ), ([1]2 , [1]2 )}. Define : Z⇥ 8 ! Z2 ⇥
Z2 by ([1]8 ) = ([0]2 , [0]2 ), ([3]8 ) = ([0]2 , [1]2 ), ([5]8 ) = ([1]2 , [0]2 ) and
([7]8 ) = ([1]2 , [1]2 ). This is clearly a bijection, and comparing the tables:
and
([0]2 , [0]2 ) ([0]2 , [1]2 ) ([1]2 , [0]2 ) ([1]2 , [1]2 )
([0]2 , [0]2 ) ([0]2 , [0]2 ) ([0]2 , [1]2 ) ([1]2 , [0]2 ) ([1]2 , [1]2 )
([0]2 , [1]2 ) ([0]2 , [1]2 ) ([0]2 , [0]2 ) ([1]2 , [1]2 ) ([1]2 , [0]2 )
([1]2 , [0]2 ) ([1]2 , [0]2 ) ([1]2 , [1]2 ) ([0]2 , [0]2 ) ([0]2 , [1]2 )
([1]2 , [1]2 ) ([1]2 , [1]2 ) ([1]2 , [0]2 ) ([0]2 , [1]1 ) ([0]2 , [0]2 ),
shows that is in fact a homomorphism. (The table for H is gotten from
the table for G by replacing each g by (g).) Therefore is an isomorphism.
Note that a group is isomorphic to itself (by the identity function). According
to Prop. 4.12.17 says that G is isomorphic to H if and only if H is isomorphic
to G. Since the order doesn’t matter, we’ll often simply say instead that G
and H are isomorphic. Prop. 4.12.16 says that if G and H are isomorphic
and H and K are isomorphic, then G and K are isomorphic.
Example 4.12.25 Consider the two groups S4 and D12 . Both are non-
abelian groups of order 24, but D12 has an element of order 12 (a 30 rotation)
and S4 does not. Therefore the two groups are not isomorphic.
(G) = { (g) | g 2 G },
ker( ) = { g 2 G | (g) = eH }.
(gg 0 ) = (g) (g 0 ) = eH eH = eH .
4.12. HOMOMORPHISMS 91
Therefore gg 0 2 ker( ).
3) Suppose that g 2 ker( ). Then (g) = eH , so
(g 1 ) = (g) 1
= eH1 = eH .
Therefore g 1 2 ker( ).
It follows that ker( ) is a subgroup of G. ⇤
ker( ) = hni ✓ Z.
Example 4.12.32 Recall that the determinant function det : GL2 (R) ! R⇥
is a homomorphism (Example
✓ ◆ 4.12.4). The function
✓ is surjective
◆ since for
x 0 x 0
any x 2 R⇥ , we have 2 GL2 (R) and det = x. Therefore
0 1 0 1
the image of det is R⇥ . The kernel of det is
ker( ) = hdi
Rings
Def inition 5.1.1 A ring is a set R with binary operations + and ⇤ satis-
fying:
93
94 CHAPTER 5. RINGS
We’ve already seen that (Zn , +) is an abelian group, and that multiplication
is associative (Prop. 3.3.8) with identity element [1]n . In order to conclude
that Zn is a ring, we just have to check that X(Y + Z) = XY + XZ for all
X, Y, Z 2 Zn , i.e., that
also by definition. The distributive law for Z states that a(b + c) = ab + ac,
so it follows that
[a]n ([b]n + [c]n ) = [a(b + c)]n = [ab + ac]n = [a]n [b]n + [a]n [c]n .
So Zn is a ring.
f (x) = am xm + am 1 xm 1
+ · · · + a1 x + a0 ,
g(x) = bn xn + bn 1 xn 1
+ · · · + b1 x + b0 , with b0 , b1 , . . . , bn 2 R.
To write down the formula for f (x) + g(x), it’s convenient to assume the
leading terms have the same degree, which we can do by writing f (x) as
0xn + 0xn 1
+ · · · + 0xm+1 + am xn m + am 1 xm 1
+ · · · + a1 x + a0 ,
(where we set ai = 0 if i > m and bj = 0 if j > n). These are just formulas
for the familiar algebraic operations on polynomials, written in a way that
shows the output is again a polynomial. For example if f (x) = x3 + 2x + 1
and g(x) = x2 + 1, then
To see that R[x] with these operations is a ring, we first check that the
operations are associative. So suppose that f (x) and g(x) are as above, and
X̀
h(x) = ck xk . Then
k=0
n
X n
X n
X
i i
(f (x) + g(x)) + h(x) = (ai + bi )x + ci x = ((ai + bi ) + ci )xi
i=0 i=0 i=0
where dr is the sum of the ai bj such that i + j = r, and es is the sum of the
dr ck such that r + k = s. Therefore es is the sum of the (ai bj )ck such that
(i + j) + k = s. (Note that we have just used the distributive law for real
numbers to rewrite each term in the sum for es as
Note also that both operations are commutative and have identity ele-
ments. (The identity elements are simply the constant polynomials 0 and
1.) To conclude that (R[x], +) is an abelian group, we just have to note that
each polynomial
f (x) = am xm + am 1 xm 1
+ · · · + a1 x + a0
( am )xm + ( am 1 )xm 1
+ · · · + ( a1 )x + ( a0 ).
which proves one of the distributive laws. I’ll leave you to check the proof of
the other, which is very similar.
Now let’s consider some non-examples of rings. Let E denote the set of
even integers under the usual binary operations of addition and multiplica-
tion. Then E is a subgroup of Z under addition, hence is an abelian group.
98 CHAPTER 5. RINGS
The operation · is associative and satisfies the distributive law with respect
to addition, but there is no identity element in E for multiplication, so E is
not a ring.
Here is an example where only the distributive law fails. Consider the set
R with the operations + and •, where + is the usual addition operation and
• is defined by
x • y = x + y + xy = (x 1)(y 1) + 1, for x, y 2 R.
and
x • (y + z) = x + (y + z) + x(y + z) = x + y + z + xy + xz,
(x • y) + (x • z) = x + y + xy + x + z + xz
2. ( x) ⇤ y = xy = x ⇤ ( y) for all x, y 2 R.
Now since (R, +) is a group, we can apply the cancellation law (Prop. 4.4.4)
to conclude that 0R = 0R ⇤ x. Similarly,
0R + (x ⇤ 0R ) = x ⇤ 0R = x ⇤ (0R + 0R ) = (x ⇤ 0R ) + (x ⇤ 0R ),
(( x) ⇤ y) + (x ⇤ y) = (( x) + x) ⇤ y (by distributivity)
= 0R ⇤ y (by definition of )
= 0R (by Part 1).
(x ⇤ ( y)) + (x ⇤ y) = x ⇤ (( y) + y) = x ⇤ 0R = 0R
Example 5.3.2 Consider for example an element [a]n in the ring Zn . Then
one finds that m · [a]n = [ma]n for m 2 Z, and [a]m
n = [a ]n for m 2 N.
m
Example 5.3.3 The nth power of the element f (x) = x in the polynomial
ring R[x] is, of course, what we’ve been denoting as xn all along.
We’ll refer to the additive version of Prop. 4.5.1 as the law of multiples:
(m · x) + (n · x) = (m + n) · x and m · (n · x) = (mn) · x
(but note that we can’t conclude anything about xn y n without assuming that
· is commutative).
We also have the following behavior of the multiples of elements with
respect to multiplication in the ring:
The proof is left as an exercise, but let’s look carefully at the meaning of
the equation. Note that (n · x)y is the product of two elements of the ring
R, namely n · x and y; the first of these elements n · x is defined as the nth
multiple of the element x. On the other hand, n · (xy) is the nth multiple of
the element xy, where xy is the product in R of the elements x and y. We
can’t simply apply associativity since · is not the multiplication operation in
R, and indeed the integer n might not even be an element of R. Consider the
case n = 2, which already gives the key idea for how to prove the proposition
in general. The first equality is saying that (x + x)y = (xy) + (xy), which is
of course a consequence of the distributive law.
It is often useful to consider the multiples of the multiplicative identity
element 1R , and there is the following notation for them: If m 2 Z, then
we let mR = m · (1R ) denote the mth multiple of 1R . (Note that for m = 1,
this gives 1R = 1R , and for m = 0, we have that 0 · (1R ) is by definition
the additive identity element 0R , so there is no conflict in notation.) So for
example
5.4. SUBRINGS 101
• for R = Z, we have mR = m;
5.4 Subrings
Recall that a subgroup of a group G is a subset of G which itself is a group
with the same binary operation as on G. There’s a similar notion of a subring
of a ring.
Z ⇢ Q ⇢ R ⇢ C,
Example 5.4.3 Identifying the real number a 2 R with the constant poly-
nomial f (x) = a, we can view R as a subring of the ring R[x] of Example 5.2.2.
1. 0R 2 S and 1R 2 S;
2. if x, y 2, then x + y 2 S and x ⇤ y 2 S;
3. if x 2 S, then x 2 S.
• if x, y 2 S, then x + y 2 S;
• 0R 2 S;
• if x 2 S, then x 2 S.
5.4. SUBRINGS 103
The fact that 1R 2 S follows from the definition of a subring, which requires
that 1R = 1S . Finally if x, y 2 S, then since ⇤ is a binary operation on S, we
must have x ⇤ y 2 S. We have now shown that 1), 2) and 3) are all satisfied.
Now suppose that 1), 2) and 3) are satisfied by S, and we will show that
S is a subring. From 2), we know that + and ⇤ define binary operations on
S. Since 0R 2 S (by 1) and x 2 S whenever x 2 S (by 3), we know that S
is a subgroup of (R, +), and so (S, +) is an abelian group. Since R is a ring,
the operation · is associative on R, hence associative on S. Since 1R 2 S, we
have that 1R is the identity element for ⇤ on S. Finally since the distributive
laws hold for the operations + and ⇤ on R, they must hold for the operations
on S as well. Therefore S is a ring under the operations + and ⇤. Since
its multiplicative identity is 1R , we have that 1S = 1R , so S is a subring of
R. ⇤
in the ring R = M2 (R) of Example 5.2.3: We verify the conditions 1), 2) and
3) of Prop. 5.4.4. ✓ ◆ ✓ ◆
0 0 1 0
1) The matrices 0R = and 1R = are both in S.
✓ ◆ 0 0 ✓ ◆ 0 1
a 0 a0 0
2) If A = and A0 = are in S, then so is
0 d 0 d0
✓ ◆
0 a + a0 0
A+A = .
0 d + d0
✓ ◆ ✓ ◆
a 0 0 a0 0
3) If A = and A = are in S, then so is
0 d 0 d0
✓ ◆
0 aa0 0
A+A = .
0 dd0
More generally, for any ring R we let R⇥ denote the set of units in R. So
for example, if R = R, then the set of units is R⇥ = {non-zero real numbers}.
In fact, whenever we used the superscript ⇥ before, it was consistent with
the definition just given.
We have seen already that Z⇥ n and R were groups under multiplication
⇥
(Prop. 4.2.13 for Zn ). These are special cases of the more general fact:
⇥
and similarly (y 0 ⇤ y) ⇤ (x ⇤ x0 ) = 1R .
We must now show that R with the binary operation ⇤ satisfies the defi-
nition of a group. First of all, since ⇤ is associative on R, it is also associative
on the subset R⇥ ✓ R. To see that there is an identity element for ⇤ on R⇥ ,
note that 1R 2 R⇥ since 1R has a multiplicative inverse in R, namely 1R .
Finally we have to show that if x 2 R⇥ , then x has an inverse element in R⇥
for ⇤. By the definition of R⇥ , we know that x has an inverse element y 2 R.
We need only show that y 2 R⇥ , i.e., that y has a multiplicative in R, and
indeed it does, namely x. We have now shown that (R⇥ , ⇤) is a group. ⇤
We saw several examples, the most basic being Z. Some others were R, Zn ,
R[x] and M2 (R), each with their addition and multiplication operations. We
defined the notions of subring and unit group of a ring, and multiples and
(non-negative) powers of elements.
Here is another example of a ring:
Example 5.6.1 Let R = Z ⇥ Z = { (m, n) | m, n 2 Z } with addition and
multiplication defined componentwise, so for (m, n), (m0 , n0 ) 2 R,
x ⇤ y = y ⇤ x for all x, y 2 R.
Def inition 5.6.3 We say that a ring R is an integral domain (or simply
a domain) if R is commutative, 0R 6= 1R and
x, y 2 R, xy = 0R ) x = 0R or y = 0R .
For example, Z, Q, R, C and R[x] are integral domains. To prove that R[x]
is a domain, suppose that
Then it’s easy to see that M2 (Z) satisfies the conditions of Prop. 5.4.4, so
is therefore a subring of M2 (R). In fact if R is any ring and n 1, we let
Mn (R) denote the set of n⇥n-matrices with entries in R. We can then define
binary operations on Mn (R) using the usual formulas for matrix addition and
multiplication. Using the subscript ij to denote the entry in the ith row and
j th column, this means that if
0 1 0 1
a11 a12 · · · a1n b11 b12 · · · b1n
B a21 a22 · · · a2n C B b21 b22 · · · b2n C
B C B C
A = B .. .. .. C and B = B .. .. .. C ,
@ . . . A @ . . . A
an1 an2 · · · ann bn1 bn2 · · · bnn
then 0 1
a11 + b11 a12 + b12 · · · a1n + b1n
B a21 + b21 a22 + b22 · · · a2n + b2n C
B C
A+B =B .. .. .. C
@ . . . A
an1 + bn1 an2 + bn2 · · · ann + bnn
and AB is the matrix
0 1
a11 b11 + a12 b21 + · · · + a1n bn1 · · · a11 b1n + a12 b2n + · · · + a1n bnn
B a21 b11 + a22 b21 + · · · + a2n bn1 · · · a21 b1n + a22 b2n + · · · + a2n bnn C
B C
B .. .. C.
@ . . A
an1 b11 + an2 b21 + · · · + ann bn1 · · · an1 b1n + an2 b2n + · · · + ann bnn
Proof. We first check that Mn (R) is an abelian group under +. To see that
matrix addition is associative, suppose A, B, C 2 Mn (R) with ij-entries are
aij , bij and cij . Then the ij-entry of (A + B) + C is (aij + bij ) + cij , which
is the same as aij + (bij + cij ) since the operation + is associative on the
original ring R. Therefore the ij-entries of A + (B + C) and (A + B) + C
are the same for all i, j (with 1 i n, 1 j n). Similarly we see that
since + is commutative on R, so is matrix addition on Mn (R). There is an
identity element, namely the matrix 0 all of whose entries are 0R . Finally
the additive inverse of A is A, whose ij-entry is aij (the negative of aij
in the ring R). Therefore Mn (R) is an abelian group under matrix addition.
Next we check that matrix multiplication on Mn (R) is associative and
has an identity element. We compute the ij-entry of (AB)C and compare to
that of A(BC). Using sumation notation, we have that the ij-entry of AB
Xn
is rij = aik bkj . Therefore the ij-entry of (AB)C is
k=1
n n n
! n n
!
X X X X X
ri` c`j = aik bk` c`j = (aik bk` )c`j
`=1 `=1 k=1 `=1 k=1
by one of the distributive laws on the original ring R. Similarly, letting sij
denote the ij-entry of BC, we see that the ij-entry of A(BC) is
n n n
! n n
!
X X X X X
aik ekj = aik bk` c`j = aik (bk` c`j )
k=1 k=1 `=1 k=1 `=1
For k 6= i, we have ik akj = 0R akj = 0R , so the only term in the sum other
than 0R is ii aij = 1R aij = aij . Therefore 1A = A. Similarly we see that
A1 = A, so 1 is an identity element.
Finally we have to check the distributive laws. We compare the ij-entries
of A(B + C) and AB + AC. Let tij denote the ij-entry of B + C. Then the
ij entry of A(B + C) is
where the first equality is from the definition of matrix addition (tkj = bkj +
ckj ), the second by a distributive law on R, and the third by commutativity
of + on R. Since ai1 b1j + ai2 b2j + · · · + ain bnj is the ij-entry of AB and
ai1 c1j +ai2 c2j +· · ·+ain cnj is the ij-entry of AC, we conclude that A(B +C) =
AB + AC. Similarly we find that (A + B)C = AC + BC for all A, B, C 2 R.
⇤
4
Consider
✓ for◆example M✓2 (Z3 ). This
◆ is a ring with 81 = 3 elements. Let
[2] [1] [1] [0]
A= and B = . Let’s compute A + B, AB and BA:
[1] [0] [1] [1]
✓
◆ ✓ ◆
[2] [1] [1] [0]
A+B = +
✓ [1] [0] [1] ◆[1]
[2] + [1] [1] + [0]
=
✓ [1] + [1] ◆ [0] + [1]
[0] [1]
= ,
[2] [1]
✓ ◆✓ ◆
[2] [1] [1] [0]
AB =
✓ [1] [0] [1] [1] ◆
[2][1] + [1][1] [2][0] + [1][1]
= ,
✓ [1][1] + [0][1]
◆ [1][0] + [0][1]
[0] [1]
= ,
[1] [0]
5.8. RING HOMOMORPHISMS 111
✓ ◆✓ ◆
[1] [0] [2] [1]
BA =
✓ [1] [1] [1] [0] ◆
[1][2] + [0][1] [1][1] + [0][0]
=
✓ [1][2] + [1][1]
◆ [1][1] + [1][0]
[2] [1]
= .
[0] [1]
It is an exercise to check that if R is a commutative ring, then the unit group
of M2 (R) is
GL2 (R) = { A 2 M2 (R) | det(A) 2 R⇥ }.
This gives a way of constructing some interesting finite groups. For example,
the above matrices A and B are elements of the group GL2 (Z3 ), which has
order 48.
Def inition 5.8.1 Suppose that (R, +R , ⇤R ) and (S, +S , ⇤S ) are rings. A
function : R ! S is a homomorphism (of rings) if all of the following
hold:
3. (1R ) = 1S .
Example 5.8.6 Recall that R[x] denotes the ring of polynomials in the vari-
able x with coefficients in R (Example 5.2.2). If ↵ is any real number, then
we can define a function
v↵ : R[x] ! R
by v↵ (f (x)) = f (↵) which we could call evaluation-at-↵. Then v↵ is a homo-
morphism since
(nR ) = (n · 1R ) = n · (1R ) = n · 1S = nS
for all n 2 Z. ⇤
The proof of the following proposition is left as an exercise:
under the multiplication operation of the ring (See Def. 5.5.1 and Thm. 5.5.3).
Example 5.8.12 Let S denote the subring of M2 (R) consisting of the diag-
onal matrices (Example 5.4.5), and let R denote the product ring R ⇥ R (see
Example
✓ ◆5.6.1). Consider the function : R ! S defined by ((x, y)) =
x 0
. We check that is a homomorphism. Let r = (x, y) and
0 y
r0 = (x0 , y 0 ) be elements of R = R ⇥ R. Then 1)
✓ ◆
0 0 0 x + x0 0
(r + r ) = ((x + x , y + y )) = 0
✓ 0 ◆y+ ✓y 0 ◆
x 0 x 0
= + = (r) + (r0 ),
0 y 0 y0
✓◆
0 0 1 0
and similarly 2) (rr ) = (r) (r ). Since 1R = (1, 1) and 1S = , we
0 1
also have 3) (1R ) = 1S . So is a homomorphism. Since it is also obviously
bijective, is an isomorphism.
1
1. The inverse function : S ! R is also an isomorphism of rings.
The proofs are almost identical to Prop. 4.12.16 and Prop. 4.12.17 and left
as an exercise.
Theorem 5.9.1 If m and n are relatively prime positive integers, then the
function
: Zmn ! Zm ⇥ Zn
defined by ([a]mn ) = ([a]m , [a]n ) is an isomorphism of rings.
Example 5.9.4 Now let’s work another example, but where a little work
is required before applying the above algorithm. We’ll find all simultaneous
solutions of the congruences:
x⌘ 14 · 23 ⌘ 20 mod 57.
For the second congruence note that gcd(22, 84) = 2, which divides 26, so
there are solutions and the congruence is equivalent to 11x ⌘ 13 mod 42.
The Euclidean Algorithm applies to 11 and 42 yields 1 = 5 · 42 19 · 11, so
the inverse of [11]42 is [ 19]42 , and the solution of the second congruence is
x⌘ 19 · 13 ⌘ 5 mod 42.
But note that 42 and 57 are not relatively prime; their gcd is 3. Since
57 = 3·19, and 3 and 19 are relatively prime, we can view the first congruence
as equivalent to the pair of congruences
or more simply x ⌘ 2 mod 3 and x ⌘ 1 mod 19. Similarly the second congru-
ence is equivalent to the pair of congruences x ⌘ 2 mod 3 and x ⌘ 5 mod 14.
We are therefore looking for simultaneous solutions of the three congruences
We already know that the simultaneous solution of the first two of these
congruences is x ⌘ 20 mod 57. So we are finally reduced to solving the pair
of congruences
We can also apply Thm. 5.9.1 to derive a formula for the order of Z⇥ n.
The order of this group is denoted '(n). We can view ' as a function from
N to N, called Euler’s '-function. Thus '(n) is the number of integers in
{0, 1, 2, . . . , n 1} which are relatively prime to n. Computing a few values
we find:
n 1 2 3 4 5 6 7 8 9
'(n) 1 1 2 2 4 2 6 4 6.
Recall that if p is prime then Z⇥p = {[0], [1], . . . , [p 1]} has order p 1, so
'(p) = p 1. It is also easy to compute the value of ' for prime powers
pr , with r 1. Indeed the only integers among {0, 1, 2, . . . , pr 1} which
are not relatively prime to pr are precisely thos which are multiples of p, of
which there are exactly pr 1 (namely 0, p, 2p, . . . , pr p). Therefore
'(pr ) = pr pr 1
= (p 1)pr 1 .
For example Z⇥ 9 = {[1], [2], [4], [5], [7], [8]} contains all 9 elements of Z9 except
for the 3 multiples of [3], namely [0], [3] and [6].
To find a general formula, we use the following corollary of Thm. 5.9.1:
Corollary 5.9.5 If m and n are relatively prime, then Z⇥
mn is isomorphic to
Zm ⇥ Zn . In particular, if gcd(m, n) = 1, then '(mn) = '(m)'(n).
⇥ ⇥
a'(n) ⌘ 1 mod n.
Example 5.10.1 Taking R = Z, the ring Z[x] can be viewed as the subring
of R[x] consisting of polynomials with integer coefficients. In general, if R is
a subring of S, then R[x] can be identified with a subring of S[x].
then
f (x) + g(x) = [2]x2 + [1]x + [2]
and f (x)g(x) = [4]x3 + [8]x2 + [5]x + [1] = [1]x + [1].
5.10. POLYNOMIAL RINGS 121
Example 5.10.3 Consider the example where R itself the polynomial ring
R[x]. Since we’re already using x as a variable in the notation for R[x], we’ll
instead consider the ring R[y] of polynomials over R = R[x] in the variable
y. The elements of R[y] = (R[x])[y] are then polynomials in the variable
y, with coefficients that are themselves polynomials in the variable x. So
Xn
an element of R[y] is an expression of the form pi (x)y i , where for each
i=0
mi
X
i = 0, 1, . . . , n, the coefficient pi (x) 2 R[x] has the form pi (x) = ai,j xj for
j=0
some ai,0 , ai,1 , . . . , ai,mi 2 R. Here mi is the degree of pi (x), but including
higher order terms to pi (x) with coefficient 0, we can assume all the mi
are the same, say m, to simplify notation. So an element of (R[x])[y] is an
expression of the form
m n
! m X n
X X X
j i
f (x, y) = ai,j x y = ai,j xj y i ,
i=0 j=0 i=0 j=0
Def inition 5.10.4 Suppose that R is a ring and f (x) 2 R[x]. We say that
f (x) has degree n, or deg(f (x)) = n, if
f (x) = an xn + an 1 xn 1
+ · · · + a1 x + a0
with an 6= 0R .
f (x, y) = x2 y 3 + xy 3 x4 y + 3xy 2
= (x2 + x)y 3 + ( x4 + 3x)y 2
= ( y)x4 + y 3 x2 + (y 3 + 3y)x 2
122 CHAPTER 5. RINGS
2. (R[x])⇥ = R⇥ .
5.10. POLYNOMIAL RINGS 123
Proof. 1) We have to prove that if f (x) and g(x) are non-zero elements
of R[x], then f (x)g(x) is also not zero. This is immediate from the part
2) of Prop. 5.10.5, since deg(f (x)) 0 and deg(g(x)) 0 implies that
deg(f (x)g(x)) = deg(f (x)) + deg(g(x)) 0.
2) Suppose that f (x) 2 (R[x])⇥ . This means that there is a polyno-
mial g(x) 2 R[x] such that f (x)g(x) = 1R . Note that deg(f (x)) 0 and
deg(g(x)) 0 (since they are non-zero polynomials). Applying part 2) of
Prop. 5.10.5 again gives that
Example 5.10.7 Since R is an integral domain (in fact, a field), Cor. 5.10.6
shows that R[x] is an integral domain and (R[x])⇥ = R⇥ . Applying Cor. 5.10.6
again (now with R = R[x]), shows that R[x, y] = R[y] is also an integral do-
main and that (R[x, y])⇥ = (R[x])⇥ = R⇥ . In fact, we see that in general if
R is an integral domain, then so is R[x1 , x2 , . . . , xn ] and its unit group is R⇥
(i.e., the constant poynomials where the constant is in R⇥ ).
We will now focus on the case where R is not just an integral domain, but
a field. Recall this means that every non-zero element of R has a multipicative
inverse; for example R, C and Zp (for p prime) are fields. If R is a field, then
the polynomial ring R[x] turns out to have some nice properties similar to
the ring Z.
Recall that the division algorithm for integers (Thm. 2.1.2) states that if
a 2 Z and b 2 N, then there are unique integers q and r such that
a = bq + r and 0 r < b.
124 CHAPTER 5. RINGS
Let’s state it slightly di↵erently to emphasize the analogy with the version
for polynomial rings which we’re about to prove: If a, b 2 Z and b 6= 0, then
there are unique integers q and r such that
There is a similar Division Algorithm for polynomial rings, with the role of
the absolute value played by the degree.
Theorem 5.10.9 Suppose that R is a field and f (x), g(x) 2 R[x] with g(x) 6=
0R . Then there are unique polynomials q(x), r(x) 2 R[x] such that
• f (x) = g(x)q(x) + r(x), and
has degree at most m 1, so the induction hypothesis implies that there are
polynomials s(x), r(x) 2 R[x] such that
• h(x) = g(x)s(x) + r(x), and
f (x) = am bn 1 xm n
g(x) + h(x)
= am b n 1 x m n
g(x) + g(x)s(x) + r(x) = g(x)q(x) + r(x)
5.10. POLYNOMIAL RINGS 125
where q(x) = am bn 1 xm n + s(x). This proves the existence of q(x) and r(x)
as in the theorem.
Now we prove uniqueness. Suppose that
with q(x), r(x), s(x) and t(x) in R[x] and the degrees of r(x) and t(x) being
less than n = deg(g(x)) (including the possibility that r(x) or t(x) be 0R ).
We then have
g(x)(q(x) s(x)) = t(x) r(x),
so if q(x) s(x) is non-zero, then it has some degree k 0, and t(x) r(x)
has degree n+k n. On the other hand, since t(x) and r(x) have degree less
than n, so does t(x) r(x). This contradiction shows that q(x) s(x) = 0R ,
so q(x) = s(x). It follows that t(x) r(x) = 0 as well, so r(x) = t(x). ⇤
You probably already know the “algorithm” for finding q(x) and r(x) by
long division of polynomials, which resembles long division of integers. Let’s
do an example, which also illustrates the idea of the proof: the successive
terms ci xi in the quotient are gotten by dividing the leading of f (x) by that
of g(x), and then replacing f (x) by the polynomial f (x) ci xi which has
lower degree than f (x).
x2 + 0x + 1
3 5 4 3
x + x ) x + 0x + 2x + x2 + 0x + 1
x5 + x3
x3 + x2 + 0x + 1
x3 + 0x2 + x
x2 x + 1.
(If R is not commutative, then there are obvious notions of left divisibility
and right divisibility.)
Moreover the polynomials q(x) and r(x) are unique in the sense that they
are the only polynomials in R[x] satisfying these criteria. So if g(x)|f (x),
then we get f (x) = g(x)q(x) and r(x) = 0R . (Recall our convention that
deg(r(x)) = 1.)
Considering the special case where g(x) = x ↵ for some ↵ 2 R, we have
the following corollary of the Division Algorithm.
Proof. The Division Algorithm applied with g(x) = x ↵ states that there
are polynomials q(x), r(x) 2 R[x] such that f (x) = (x ↵)q(x) + r(x) and
deg(r(x)) < deg(x ↵) = 1, so either r(x) = 0R or deg(r(x)) = 0. In either
case we have r(x) = for some 2 R (i.e., r(x) is a constant polynomial).
Substituting ↵ for x gives
f (↵) = (↵ ↵)q(↵) + = .
xn + 1 = (x + 1)(xn 1
xn 2
+ ··· x + 1).
Example 5.10.19 Let f (x) = x4 [1] in Z5 [x]. Then f ([1]) = [1]4 [1] = [0],
so f (x)is divisible by x [1]. In fact f ([2]) = [2]4 [1] = [15] = [0], so f (x)
is also divisible by x [2], and by x [3] since f ([3]) = f ([ 2]) = [0], and
by x [4] since f ([4]) = f ([ 1]) = [0].
In fact for any prime p and any [a] 2 Z⇥p , Fermat’s Little Theorem implies
that [a]p 1 [1] = [0], so the polynomial xp 1 [1] is divisible by x [a].
0R = f ( ) = ( ↵)g( ),
Example 5.10.22 Suppose that f (x) 2 R[x] has degree 1; i.e., f (x) is lin-
ear. Then f (x) = a1 x + a0 for some a0 , a1 2 R with a0 6= 0. Since R is a
field, a0 2 R⇥ and a0 x + a1 = 0R has a unique solution, namely x = a0 1 a1 ;
thus f (x) has exactly one root.
Example 5.10.23 A polynomial f (x) 2 R[x] can have fewer than n roots
in R (where n = deg(f (x))). In fact it might not have any roots; take for
example, f (x) = x2 + 1 2 R[x]. The polynomial x4 1 2 R[x] has two roots,
namely 1 and 1. Note that either of these polynomials has n roots in C,
since i and i are roots. An example of a polynomial in C[x] with fewer than
n roots is f (x) = x2 ; its only root in C is 0. (There is of course a notion of
a repeated root, of which this is an example.)
• If R = Z5 , then f (x) has roots [2] and [3] and f (x) = (x [2])(x [3])
is reducible.
It turns out that the polynomial ring R[x] (where R is a field) has many
properties in common with the ring Z. There is for example a Euclidean
Algorithm which computes greatest (in degree) common divisors, but we
don’t have time for this.
130 CHAPTER 5. RINGS