Numbers and Polynomials
Numbers and Polynomials
Kermit Sigmon
Department of Mathematics
University of Florida, Gainesville
c
Copyright °1990, 1996 by Kermit Sigmon
c
°1999 by Department of Mathematics
Department of Mathematics • University of Florida • Gainesville, FL 32611
revised June 2001, corrections Nov 2005
ii
You are expected to work through the notes, proving the theorems and working the ex-
ercises. Collaboration between class members in this regard is fine, indeed encouraged, but
keep in mind that independent work builds self-confidence. You should work ahead to the
extent that you have worked through the material prior to its discussion in class. In this
way, you can compare your work with that discussed in class and develop a confidence that
you can independently attack a question. This is a do-it-yourself course in which you are
expected to take your turn presenting your work to the class and to actively participate in
class discussion.
To give you this opportunity for independent discovery, the notes contain neither proofs
of the theorems, nor solutions to the exercises. It is important, therefore, for you to keep a
carefully organized record of such obtained from your work as modified after class discussion;
a ring-binder is suggested for this. The process of carefully rewriting your notes after class
discussion is an important part of the learning process in the course. Answers to exercises
marked with an asterisk, ’∗’, are at the end of the notes.
Enjoy yourself!
Kermit Sigmon
Department of Mathematics
University of Florida
(5-96)
0
Thanks go to all the instructors of the course whose many suggestions which have improved these notes.
0
These notes were typeset using TEX. Thanks go to Jean Larson for creating the figures in Chapter 6
with TEX.
0
These notes were revised in 1999.
iii
Introduction
We adopt the viewpoint that the real number system is known. More specifically, we will
assume the existence of a set R, the real numbers, equipped with two binary operations “ + ”
and “ · ” satisfying certain axioms and an order relation “ < ” satisfying further axioms. You
are to deduce (prove) properties of the real numbers from these axioms — and these axioms
alone.
The natural numbers, integers, rational numbers, and irrational numbers will be defined
as certain distinguished subsets of the real numbers. Their properties will be deduced,
therefore, from properties of the real numbers. The complex numbers will be constructed
from the real numbers and their properties deduced from properties of the real numbers.
Finally, the notion of a polynomial over a number system will be built on properties of the
number system. Thus, all “truth” about these number systems and polynomials over them
ultimately has its source in the algebraic and order axioms for the real numbers which we
will assume.
An alternate approach to the development of these number systems, which we do not adopt,
is to first assume the existence of the natural numbers (or even more primitively, the Peano
axioms). One then constructs successively the integers, rational numbers, real numbers, and
complex numbers. This route presents considerable technical difficulty, especially in building
the reals from the rationals. We choose the first approach because it permits one to focus
on the central properties of the number systems with a minimum of distractions.
iv
Contents
7 Complex Numbers 35
8 Polynomials 39
v
vi CONTENTS
1. Algebraic Properties of the Real Numbers
1.1 Axioms (Algebraic Axioms for the Real Numbers (“Field Axioms”)). We assume that
the real numbers consists of a set R equipped with two binary operations “ + ” and “ · ”
satisfying the following axioms:
AC (Commutativity of Addition)
a + b = b + a for all a, b ∈ R.
AA (Associativity of Addition)
a + (b + c) = (a + b) + c for all a, b, c ∈ R.
a + 0 = a = 0 + a for all a ∈ R.
MC (Commutativity of Multiplication)
ab = ba for all a, b ∈ R.
MA (Associativity of Multiplication)
a(bc) = (ab)c for all a, b, c ∈ R.
D (Distributivity)
1.2 Remark. Some other basic facts will be used in proofs without being formally stated
here and without citation (except as needed to clarify the exposition). These can be divided
into two categories:
1
2 1. Algebraic Properties of the Real Numbers
1. Laws of logic.
2. Laws of equality. First, we have three basic axioms: For all a, b and c we have
(i) a = a, (ii) if a = b then b = a, and (iii) if a = b and b = c then a = c. In addition
there is a general principle which we may call substitution of equals, stating that if
a = b then we may freely substitute the symbol b for a in any expression. Thus, if
a = b and c = d then a + c = b + d and ac = bd. The principle here is that a = b
means that the symbols a and b are names for the same object. All of the properties
with which we are concerned are properties of the underlying object, not of the name,
and hence are unaffected by the name we happen to use for the object.
1.3 Definition/Remark. A binary operation on a set S is a function that assigns to every
ordered pair of elements of S a unique element of S. Familiar examples of binary operations
on R are ordinary addition, subtraction, and multiplication. In particular, if we write a+b =
c, we are assigning the real number c (the “answer”) to the ordered pair (a, b) of real numbers.
One immediate consequence of this definition is the familiar “equals added to equals are
equal”. In other words, if a = b and c = d, then a + c = b + d. The justification for this
is that our binary operation of addition assigns to the ordered pair (a, c) some real number
e, let’s say. But since a = b and c = d, the ordered pair (b, d) is the same ordered pair as
(a, c), and since the operation of addition assigns a unique number e to this ordered pair, we
must have b + d = e. But since a + c = e we have a + c = b + d. In summary, we can say
that the definition of binary operation justifies the implication that if a = b and c = d, then
a + c = b + d. Similar considerations apply to subtraction, multiplication, and division.
We will use the familiar rule that multiplication takes precidence over addition, so that
ab + cd means (ab) + (cd).
1.4 Theorem. Suppose a, b, c, d ∈ R. Then
a). If a + c = b + c, then a = b.
f ). a0 = 0 = 0a.
g). If ab = 0, then a = 0 or b = 0.
b). If b 6= 0, then the equation bx = a has one and only one solution.
1.6 Definition. We define subtraction and division as follows.
a). For a, b ∈ R, a − b denotes that number x such that b + x = a.
a
b). For a, b ∈ R with b 6= 0, denotes that number x such that bx = a.
b
1.7 Theorem. Suppose a, b, c, d ∈ R.
a
a). a − b = a + (−b); also, if b 6= 0, then = ab−1 .
b
b). a(b − c) = ab − ac and −(a − b) = b − a.
1
c). If a 6= 0, then is the multiplicative inverse of a .
a
a a
d). = a; also, if a 6= 0, then = 1.
1 a
−a a ³a´ −a a
e). If b 6= 0, then = =− and = .
b −b b −b b
ac ac
f ). If b 6= 0 and d 6= 0, then = .
bd bd
a
b ad
g). If b 6= 0, c 6= 0 and d 6= 0, then c = .
d
bc
a c ad + bc
h). If b 6= 0 and d 6= 0, then + = .
b d bd
1.8 Exercise. (Limitations on definitions, axioms)
a
a). Explain why, in Definition 1.6, was not given meaning when b = 0.
b
b). Explain why, in MID, one would wish to require that 1 6= 0 by showing that, if not,
then R = {0}.
c). Explain why, in MIV, one would not wish to require that 0 have a multiplicative
inverse.
1.9 Exercise. (Redundancy of axioms)
4 1. Algebraic Properties of the Real Numbers
a). Show that Axiom AC is redundant; i.e., it can be proved from the other axioms. [Hint:
Expand (1 + 1)(a + b) in two ways]
b). Show that the uniqueness of the additive inverse in Axiom AIV is redundant; i.e.,
show from the other axioms that a ∈ R has at most one additive inverse.
c). Show that the uniqueness of the multiplicative inverse in Axiom MIV is redundant;
i.e., show from the other axioms that a non-zero a ∈ R has at most one multiplicative
inverse.
b). (a + b)2 = a2 +(2ab+b2 ) and a2 −b2 = (a−b)(a+b) (a2 := aa, b2 := bb, 2ab := ab+ab).
1.11 Notation. A := B means that the new symbol “A” is defined by “B”.
1.12 Remark. This illustrates how one deduces a few of the familiar algebraic properties
of R from the axioms. You may henceforth use any other (valid!) ones as long as you can
verify them upon demand. In view of the commutative and associative properties you may
also omit parenthesis; e.g., a2 + (2ab + b2 ) = a2 + 2ab + b2 , etc.
5
1.13 Axioms (Order Axioms for the Real Numbers). We assume that there is a binary
relation “ < ” on R satisfying the following axioms:
OTC (Trichotomy)
For any a, b ∈ R, exactly one of a < b, a = b, and b < a holds.
OTR (Transitivity)
If a < b and b < c, then a < c.
OA (Compatibility with Addition)
If a < b, then a + c < b + c.
OM (Compatibility with Multiplication)
If a < b and 0 < c, then ac < bc.
The axioms above, together with those in 1.1, assert that the reals are an example of what
is known as an ordered field. One more axiom will be presented later as Axiom 4.7. With
this axiom, the least upper bound axiom (LUB), the reals are a complete ordered field.
1.14 Notation. “a > b” means “b < a”, “a ≤ b” means “a < b or a = b”, etc.
1.15 Theorem. Suppose a, b, c ∈ R. Then
a). If a > 0 and b > 0, then a + b > 0.
d). a > 0, b > 0 imply ab > 0; a > 0, b < 0 imply ab < 0; and a < 0, b < 0 imply ab > 0.
e). ab > 0 implies that either a > 0 and b > 0 or else a < 0 and b < 0.
f ). 0 < 1.
Geometrically, we can think of the absolute value |a| as giving the length of the line
segment of the number line whose ends are a and 0. The definition by cases given above
enables us to use algebra, e.g. | − 1| := −(−1) by definition, and hence | − 1| = 1 by 1.4i)
a). |a| ≥ 0.
c). |a|2 = a2 ≥ 0.
i). |a − c| ≤ |a − b| + |b − c|.
The importance of the notion of absolute value lies in the fact that |a−b| gives the distance
between points a and b on the real number line independent of their order.
1.21 Exercise. Solve the following inequalities, using ordinary arithmetic on real numbers.
a). |x + 1| > 6.
7
c). |2x − 9| ≤ 1.
d). |2x + 1| = x − 4.
e). x2 − x − 6 > 0.
x−1
f). < 1.
x+1
8 1. Algebraic Properties of the Real Numbers
2. The Natural Numbers Induction
2.1 Notation. We use the notation a ∈ B to mean that B is a set, and a is a member of
that set. We write A ⊂ B to mean that A is a subset of B, that is, ∀x (x ∈ A =⇒ x ∈ B).
Thus if A and B are two sets then A = B if and only if A ⊂ B and B ⊂ A.
2.2 Definition. We introduce a new symbol N. Intuitively, N will be the set of natural
numbers, N = {0, 1, 2, . . . }. As we all know, every natural number is also a real number,
that is, N is a subset of the real numbers. We will introduce three new axioms for the natural
numbers:
2. (a) 0 ∈ N
(b) n + 1 ∈ N for all numbers n ∈ N.
(c) n − 1 ∈ N for all n ∈ N such that n 6= 0.
3. (The well ordering Principle WO). Every nonempty subset of N has a least member.
That is, if A is a set with at least one member, such that every member of A is in N,
then there is some number n ∈ A such that m ∈ / A for all m < n.
2.4 Exercise. Suppose that N′ is a subset of the real numbers which also satisfies the axioms
for N. Show that N′ = N.
Hint: Show that every member of N is in N′ by assuming the contrary and applying ax-
iom WO to A = N \ N′ = {n : n ∈ N and n ∈ / N′ } to get a contradiction. Then use the
same argument, applying WO for N′ , to show that every member of N′ is in N.
9
10 2. The Natural Numbers Induction
b). Each of the “Field Axioms” (see 1.1) holds in N, except AIV and MIV; and
each of the “Order Axioms” OTC, OTR, OA and OM (see 1.13) holds in N.
[Hint: They hold in R, so they hold in N as long as the relevant quantities are in N].
c). Z is closed under addition and multiplication. In addition, Z is closed under additive
inverse and subtraction.
[Hint: We know the desired sums and products exist in R. Use Definition 2.5, a case
analysis, 2.5a, and algebra to show that they must also be in Z.]
2. If a, b ∈ Z then a − b ∈ Z.
Mathematical induction. You may have noticed that almost all proofs using the ax-
iom WO are proofs by contradiction. Mathematical induction can be viewed as a positive
way of giving essentially the same proof.
2.9 Lemma. Suppose that a ∈ N and that B is a subset of N which satisfies the following
two properties:
a) a ∈ B, and
b) k + 1 ∈ B whenever k ∈ B and k ≥ a.
2.10 Theorem (First Principle of Mathematical Induction (MI1)). Suppose that a ∈ N and
that, for each natural number n ≥ a, Φn is a statement associated with n. Suppose further
that the statements Φn satisfy the following two properties:
a) Φa is true.
b) If k is any natural number with k ≥ a, and Φk is true, then Φk+1 is also true.
2.11 Theorem (Second Principle of Mathematical Induction (MI2)). Suppose for each
natural number n, Φn is a statement associated with n and the statements Φn have the
following property:
a) If Φk is true whenever Φj holds for all natural numbers j < k, then Φn holds for all
natural numbers n.
Hint: One way to prove this is to let Ψk be the statement that “Φj is true for all j < k”,
and apply MI1 to the statements Ψk .
Notice that, although MI2 does not have an explicit base case, it does have an implicit
one which will often—but not always—need to be treated as a special case. If k = 0 then
the condition “Φj holds for all natural numbers j < k” is true for the trivial reason that
there are no natural numbers j < k. Hence proving clause 2.11(a) for given statements Φn
will always require proving Φ0 , without any (nontrivial) assumptions.
2.12 Theorem. If x is any real number greater than −1 and n is any natural number greater
than 0 then (1 + x)n ≥ 1 + nx.
Proof. Fix an arbitrary real number x > −1. Let Φk be the statement
We use MI1 (with a = 1) to prove by induction on k that Φk holds for all k ∈ N with k ≥ 1.
12 2. The Natural Numbers Induction
h2 Misty h1
H, with the possible exception of the pair (h1 , h2 ), have the same color. Thus we will have
finished the induction step if we can show that h1 and h2 have the same color. To this end,
pick any horse in the herd other than h1 and h2 . Call this horse Misty. Then Misty has the
same color as h1 because both h1 and Misty are members of H1 . Similarly, Misty has the
same color as h2 because they are both members of H2 . But then h1 and h2 both have the
same color as Misty. Since H was arbitrary, we have shown that Φk implies Φk+1 .
By (1), (2) and MI1 we conclude that Φk holds for all natural numbers k ≥ 1. Hence all
horses have the same color.1 ⊣
2.14 Exercise. Use mathematical induction to show that, for each positive integer n,
a). If 0 ≤ a < b, then an < bn .
n(n + 1)
b). 1 + 2 + 3 + · · · + n = .
2
c). 1 + 3 + 5 + · · · + (2n − 1) = n2 .
1
Here is the rest of the story:
THEOREM: All horses have an infinite number of legs. Proof (by intimidation) Everyone would agree
that all horses have an even number of legs. It is also well-known that horses have fore-legs in front and
two legs in back. But 4 + 2 = 6 legs is certainly an odd number of legs for a horse to have! Now the only
number that is both even and odd is infinity; therefore all horses have an infinite number of legs. However,
suppose that there is a horse somewhere that does not have an infinite number of legs. Well, that would be
a horse of a different color; and by the Lemma, it doesn’t exist. QED
13
n(n + 1)(2n + 1)
d). 1 + 22 + 32 + · · · + n2 = .
6
n2 (n + 1)2
e). 1 + 23 + 33 + · · · + n3 = .
4
f). (a) 1 · 2 + 2 · 3 + · · · + n(n + 1) = n(n + 1)(n + 2)/3
(b) 1 · 2 · 3 + 2 · 3 · 4 + · · · + n(n + 1)(n + 2) = n(n + 1)(n + 2)(n + 3)/4
(c) Find and prove the next statement in the series above.
(d) Find and prove the k th statement in the series above, for any natural number k.
b0 = 0;
b1 = 3;
bn = 7bn−1 − 10bn−2 for n > 1.
2.16 Exercise. Use mathematical induction to show that, for each natural number n,
d).
(1 − r)(1 + r1 + · · · + rn ) = 1 − rn+1 .
(Recall that r0 = 1.)
3.1 Definition. For a, b ∈ Z, we say a divides b (in symbols; a|b) if there exists a
c ∈ Z such that b = ac. In this case, we say that a is a divisor of b, that a is a factor of b
and that b is a multiple of a.
3.2 Theorem. Suppose a, b, c ∈ Z.
a). 1|a and a|0 for all a ∈ Z.
b). If a|b, then ac|bc.
c). If a|b and a|c, then a|(b + c).
d). If a|b, then a|bc.
e). If a|b and a|c, then a|(mb + nc) for all m, n ∈ Z.
f ). If a|b and b|c, then a|c.
g). If a 6= 0, then a|b if and only if a−1 b ∈ Z.
h). If a|b and b 6= 0, then |a| ≤ |b|.
3.3 Theorem (Division Algorithm). For a, b ∈ N with b > 0, there exist unique natural
numbers q and r satisfying a = bq + r and 0 ≤ r < b.
Hint: Set R := { s ∈ N : ∃ q ∈ N such that a = qb + s }, show R 6= ∅, and apply WO to get
a least element r.
15
16 3. Elementary Number Theory
Hint for d): For J 6= {0}, let A := { x ∈ J | x > 0 }. Show A ⊆ N and A 6= ∅; apply WO to
get b. Then for x ∈ J, use the Division Algorithm to show |x| ∈ Jb , and use this claim to
show J ⊆ Jb .
3.8 Theorem. Suppose a, b ∈ N with not both a and b zero. Then there is a unique integer
d such that d is the greatest common divisor of a and b.
[Hint: Set A := { a/c : c ∈ N, c|a and c|b }. Show that A ⊂ N and A 6= ∅, and then apply
WO to A.]
[Hint (for b)): We know Ja,b = Je := {ke | k ∈ Z} for some e ∈ N by Theorem 3.6(d). Show
e = d.]
3.11 Corollary. If a, b ∈ N, not both zero, and d = gcd(a, b), then there exist integers x
and y such that ax + by = d.
3.13 Definition. Natural numbers a and b are called relatively prime if gcd(a, b) = 1.
I. Ideals, gcd’s and the Euclidean Algorithm 17
a = bq1 + r1 ; 0 ≤ r1 < b
b = r1 q 2 + r2 ; 0 ≤ r2 < r1
r1 = r2 q 3 + r3 ; 0 ≤ r3 < r2
.. ..
. .
rk−2 = rk−1 qk + rk ; 0 ≤ rk < rk−1
rk−1 = rk qk+1 .
[[wjm 8/23/01 — Is this clearer?]]
Consider the following algorithm, which defines a function GCD:
(
a if b = 0.
GCD(a, b) =
GCD(b, r) if a = bq + r with 0 ≤ r < |b|.
Thus, for example
GCD(55, 15) = GCD(15, 10) since 55 = 3 · 45 + 10
= GCD(10, 5) since 15 = 1 · 10 + 5
= GCD(5, 0) since 10 = 2 · 5 + 0
=5 since b = 0.
Show that this algorithm computes gcd(a, b). That is, show that if a, b are any two
members of N with a 6= 0 then the computation of GCD(a, b) will eventually stop, and that
the value of GCD(a, b) so obtained is gcd(a, b).
18 3. Elementary Number Theory
3.17 Exercise. Use the Euclidean algorithm to compute the gcd of the following pairs of
numbers.
a). 1001, 1815
f). Extend the algorithm GCD above to obtain a function xGCD such that whenever a, b
are in N and a 6= 0 then xGCD(a, b) = (x, y, d) where d = gcd(a, b) and d = xa + by.
(ii). p1 ≤ p2 ≤ · · · ≤ pk .
3.21 Example. 21 = 3 · 7, 252 = 2 · 2 · 3 · 3 · 7, 104 = 2 · 2 · 2 · 13.]
One often groups equal primes: 252 = 22 · 32 · 7, 104 = 23 · 13.
So n = q1 e1 q2 e2 . . . ql el , where q1 < q2 < · · · < ql with ei ≥ 0.
3.22 Theorem (Fundamental Theorem of Arithmetic). Every natural number greater than
1 has exactly one prime factorization.
3.23 Corollary. Every natural number greater than 1 has a prime factor.
3.24 Theorem. (Primes and perfect squares)
a). If a, b, and n are positive integers with n = ab, then either
a2 ≤ n or b2 ≤ n.
II. Prime and Composite Numbers 19
b). A natural number n > 1 is prime iff it has no prime divisor p with p2 ≤ n.
a). Determine the prime factorization of the following natural numbers: 391, 403, 1815,
8316, 26208, 997.
b). Describe a method for finding all prime numbers up to a fixed natural number k.
Apply your method for k = 150, k = 500. [Sieve of Eratosthenes]
a). There are arbitrarily large gaps in the primes. That is, for any natural number k > 1,
there exist k consecutive composite integers.
[Hint: 6! + 2, 6! + 3, 6! + 4, 6! + 5, 6! + 6 are each composite.]
3.28 Exercise. Use the method of Theorem 3.27 to compute the gcd and lcm of the pairs
of integers in Exercise 3.17. The results from Exercise 3.25 may help.
3.29 Remark. Although there is no known local pattern to the occurrence of the primes, it
is known that they become rarer among large integers - as one would expect since more prime
n
divisors become available. In fact, the function is a global model for the distribution
ln(n)
of the primes as the following theorem shows (we will not prove it).
3.30 Theorem (The Prime Number Theorem). For a positive integer n, let P (n) denote
the number of primes not greater than n. Then
P (n)
lim n = 1.
n→∞
ln(n)
20 3. Elementary Number Theory
4. Rational and Irrational Numbers
I. Algebraic Irrationals
In 4.3 through 4.17, we assume temporarily that for each positive integer n and
√ each a ∈ R
n
with a ≥ 0, there is exactly one b ≥ 0 such that b = a; this b is denoted by a. This fact
n
rests on the axiom LUB (4.7) and is proved for the case n = 2 in 4.16.
4.3 Theorem. (Irrational square roots)
√
a). 2 is irrational.
[Hint: Suppose not, and apply Exercise 3.15.]
√
b). 12 is irrational.
√
c). 3 2 is irrational.
√
d). 4 8 is irrational.
√
e). Suppose that 0 < n, k ∈ N and that k n is rational. Then there is some m ∈ N so that
n = mk .
√ √ √ √
f ). 2 3 and 2 + 3 are irrational.
4.4 Exercise. Suppose a, b ∈ R.
a). Suppose that a 6= 0 is rational and b is irrational. Can you tell whether a + b is rational
or irrational? What about ab?
b). Suppose that a and b are both irrational. Can you tell whether a + b is rational or
irrational? What about ab?
√ √
4.5 Theorem. a). There are no a, b ∈ Q such that 3 2 = a 2 + b.
√ √
b). If a( 3 2)2 + b 3 2 + c = 0 with a, b, c ∈ Q, then a = b = c = 0
21
22 4. Rational and Irrational Numbers
4.8 Example. The least upper bound of a set may be an element of that set but it need
not be: for example, lub[0, 1) = lub[0, 1] = lub{1 − n1 | n = 1, 2, 3, . . . } = 1.
Here is the property for the natural numbers corresponding to LUB.
4.9 Theorem. (DWO, the Order Dual of WO) If a non-empty subset A ⊆ N has an upper
bound v ∈ N, then it has a greatest element t (t ∈ A and a ≤ t for all a ∈ A).
Hint: apply WO to U = { u ∈ N : u is an upper bound of A } to find t.
We note that the LUB axiom does not hold in Q. (See 4.17)
1
Hint for b): Write a
= 1 + x (x > 0) and use 2.12.
4.14 Theorem. Between any two distinct real numbers there is both a rational number and
an irrational number – and hence infinitely many of each type.
√
1 2
Hint: Consider multiples of and for suitable n. Assume (though it has not yet been
√ n n
proven) that 2 exists.
4.15 Exercise. Use the technique of the proof of 4.14 to exhibit a rational number and an
irrational number between:
√ √
a). 11 and 13.
4 5
b). and .
7 7
√
c). 3 and 11.
√
Finally we show that 2 exists:
4.16 Theorem. For √ each a ∈ R, a ≥ 0, there exists exactly one b ≥ 0 such that b2 = a.
(This b is denoted by a.)
4.18 Remark. The set A from the hint for Theorem 4.17, paired with B = { y ∈ Q : y ≥
0 and y 2 > 2 }, gives one example of a “Dedekind cut”. With Dedekind cuts one can more
clearly see the role of LUB and how the rational and irrational numbers intermix. Suppose
A and B are disjoint subsets of Q whose union is Q and for which x < y for each x ∈ A and
y ∈ B. Such a pair (A, B) of sets is called a Dedekind cut. Each such pair corresponds to a
real number: if either A has a greatest element r or B has a least element r, the pair (A, B)
corresponds to the rational number r; a pair in which A has no greatest element and B has
no least element corresponds to an irrational number. The irrational numbers can, therefore,
be viewed as those numbers created to “plug the holes” between pairs of sets in Dedekind
cuts of the second type. Note that LUB asserts the existence of numbers to “plug” all such
“holes.”
This is what we meant earlier when we said that LUB asserted that there are no gaps in
the ordering of R.
24 4. Rational and Irrational Numbers
4.20 Theorem. If |r| < 1, then the geometric series a + ar + ar2 + · · · + arn + . . . converges
a
to 1−r .
real number less than one. That is, each decimal expansion represents a non-negative
real number less than one.
9
[Hint: Compare with 1 = 10 + 1092 + 1093 + . . . ; see a calculus book for a suitable
Comparison Theorem.]
b). Conversely, each non-negative real number less than one has a decimal expansion.
c). The decimal expansion of a non-negative real number less than one is unique.
a). Periodic if, after some point, a block of digits repeats itself indefinitely:
0.a1 a2 . . . as 000 . . . .
a). The decimal expansion of each positive rational number q < 1 is periodic.
b). Conversely, each non-zero periodic decimal expansion of a non-negative real number
b < 1 represents a positive rational number.
Hint for a): What are the possible remainders in each step of performing the long division
n m?
Hint for b): Generalize this computation: b = 0.12345 implies 103 b = 123.45 and 105 b =
12345.45. Hence 105 b − 103 b = 12345 − 123 so that
12222 12222 1358
b= 5 3 = = .
10 − 10 99000 11000
Call a fraction m
n
a proper, reduced rational number if m < n are positive integers with
gcd(m, n) = 1. [Recall that in Exercise 3.15 we showed fractions could be reduced.]
4.27 Theorem. The decimal expansion of the proper, reduced rational number m n
is termi-
α β
nating exactly when n = 2 5 for some α, β ≥ 0. In this case the length of the terminating
decimal expansion is max{α, β}.
11 11 11·52 275
Hint: = 23 5
= 23 53
= 103
= 0.275.
40
4.28 Exercise. Express as a reduced quotient of integers.
26 4. Rational and Irrational Numbers
a) ∗
0.21522
b) ∗
0.4441332
c) ∗
0.34612
Hint: b = 0.393 implies 103 b = 393.393. Then (103 − 1)b = 393, so that
393 393 131
b= 3 = = .
10 − 1 999 333
4.30 Remark. It can be shown that for any such n, a t exists such that n|(10t − 1).
4.32 Exercise. Determine, without dividing out, the form of the decimal expansion of the
following.
27 7 154 111
a)∗ b)∗ c)∗ d)∗
56 13 1680 148
5. Countable and Uncountable Sets
We now explore briefly “how many” natural numbers, integers, rational numbers, irrational
numbers, and real numbers there are.
5.1 Definition. 1. If A and B are sets, then a function f from A to B is an object which
specifies, for each element a ∈ A, an element f (a) ∈ B. We use the notation f : A → B
to say that f is a function from A to B.
We will not attempt here to say what sort of an “object” a function is, or precisely
what we mean by the word “specifies”.
For an example of a function, consider a shepherd who has made a list of the names of
the sheep in his herd. Assuming that no two sheep have the same name, the shepherd has
defined a function S which assigns to each name n on the list the sheep S(n) having that
name. The function is one to one provided that no sheep has more than one name: if the
list includes nicknames along with the official names, then it would not be one to one. It is
onto if every sheep has a name: during lambing season the function would not be onto until
all of the newborn lambs had been given names.
We say that there is a one-to-one correspondence between two sets A and B if there if
is a one-to-one function from A onto B. In the case of the sheep there is (assuming no
nicknames) a one-to-one correspondence between the set of names on the list and the set of
named sheep. Clearly, this correspondence shows (assuming no nicknames) that there are
at least as many sheep as names. If every sheep has a name then it shows that the sets have
the same size, but if this is lambing season then the set of named sheep may be a proper
subset of the set of all sheep, so we could conclude that there are fewer names than there
are sheep.
5.2 Definition. a). Two sets A and B have the same cardinal number, written A ≡ B,
if there is a one-to-one correspondence between A and B. [Example: The one-to-one
correspondence n ↔ 2n shows that the set of all natural numbers has the same cardinal
number as the set of even natural numbers.]
27
28 5. Countable and Uncountable Sets
b). We say that A has at most as many members as B, written A ¹ B, if there is a one-
to-one correspondence between A and a subset of B. [Example: the identity function,
defined by id(n) = n for each n ∈ N, shows that N ¹ N, N ¹ Q and N ¹ R. The
function f : N → Q defined by f (n) = 1/(n + 1) is another map showing that N ¹ Q.]
c). A has fewer elements than B (written A ≺ B) if A ¹ B but A 6≡ B.
d). A set is countable if it is either finite or has the same cardinal number as N, the set of
natural numbers. Otherwise, it is said to be uncountable.
e). A set is countably infinite if it is countable but not infinite.
5.3 Remark. Notice that the function D(n) = 2n of clause (a) is a 1–1 function from N
to the set of even numbers, which is a proper subset of N. This is an important difference
between finite and infinite sets. The set of sheep is (presumably) finite, so if there are sheep
without names then the set of named sheep must be strictly smaller than the set of all sheep.
In the case of the infinite set N, however, the function D shows that the set of even numbers
has the same size as the full set N.
5.4 Remark. Mathematicians often write |A| for the cardinal number of A, the “number of
elements of A”. Thus A ¹ B could be written |A| ≤ |B|. However, you should not use the
concept of “the cardinal number of A” in the proofs below. In the first place, we really have
no idea what the “cardinal number” of an infinite set would be: clearly the cardinal number
of the set {Buttercup, Silvy, Misty} is the natural number 3, but what sort of an object is
|R|, the cardinal number of the set of reals? In the second place, Remark 5.3 shows that in
at least one important aspect the cardinal numbers of infinite sets behave quite differently
from finite numbers. It seems that we should be careful not to trust the analogy too far until
we have a better understanding of the properties of the cardinal numbers.
5.5 Theorem. The relation A ≡ B is an equivalence relation. That is, for all sets A, B
and C:
a). (Reflexivity) A ≡ A.
b). (Symmetry) If A ≡ B then B ≡ A.
c). (Transitivity) If A ≡ B and B ≡ C then A ≡ C.
5.6 Theorem. N ≡ {1, 3, 5, 7, . . . } ≡ Z ≡ Q.
That is, each of N, {1, 3, 5, . . . }, Z, and Q is countably infinite.
5.7 Theorem. N ≺ R. That is, R is uncountable.
Hint: Let A := {x ∈ R : 0 < x < 1} and show N ≺ A indirectly. Suppose to the contrary
N ≡ A with a one-to-one correspondence between N and A given by
0 ←→ 0.a0,0 a0,1 a0,2 a0,3 a0,4 . . .
1 ←→ 0.a1,0 a1,1 a1,2 a1,3 a1,4 . . .
2 ←→ 0.a2,0 a2,1 a2,2 a2,3 a2,4 . . .
.. ..
. .
29
where 0.ak,0 ak,1 ak,2 . . . is the decimal expansion of the real number rk corresponding to the
natural number k. (
1 if ai,i 6= 1
Now let b := 0.b0 b1 b2 b3 . . . , where bi :=
2 if ai,i = 1.
To what natural number can b correspond?
This argument is called the Cantor Diagonalization Process.
5.8 Theorem.
a). The union of two disjoint, countably infinite sets is countably infinite.
5.9 Theorem. For any set A, A ≺ P(A), where P(A) denotes the set of all subsets of A,
called the power set of A.
Hint: Indirect proof! Suppose to the contrary that f : A → P(A) is a one-to-one correspon-
dence between A and P(A), and consider the set B := {x ∈ A : x ∈/ f (x)}. Then B ∈ P (A),
so it must correspond to some element b ∈ A. Is b ∈ B?
5.10 Exercise. Give examples of infinitely many infinite sets no two of which have the same
cardinal number.
Theorem 5.5, stating that ≡ is (like =) an equivalence relation, suggests asking the question
whether the relation ¹ is an order relation. In fact it is, as the following theorem asserts.
We will not prove parts (b) and (c).
The proof of the first clause is essentially the same as clause (c) of theorem 5.5. The
proof of the second, which is known as the Cantor-Bernstein theorem, is somewhat more
challenging. The proof of the third requires a new axiom: the Axiom of Choice.
To get a start on the theory of arithmetic for infinite cardinal numbers, notice that if O is
the set of odd natural numbers and E is the set of even natural numbers, then N = O∪E and
hence we must have |N| = |O| + |E|. But |O| = |E| = |N|, so it follows that |N | = |N | + |N |.
In fact one can prove, using the axiom of choice, that for any infinite cardinal number κ we
have κ + κ = κ and κ · κ = κ.
30 5. Countable and Uncountable Sets
6. Fields and Subfields
6.1 Definition. A field is a set F equipped with binary operations + and · satisfying the
“field axioms” given in 1.1: AC, AA, AID, AIV, MC, MA, MID, MIV, and D. A subset
K of a field F is a subfield of F if K itself forms a field under the operations defined on F .
Recall that the closure of a set under an operation follows from the meaning of a binary
operation on the set (see 1.3)]
b). The rational numbers Q form a field (see 4.2), and Q is a subfield of R.
then K is a subfield of F .
31
32 6. Fields and Subfields
√ √ √ 2
e). Show that Q[ 3 2] := {a + b 3 2 + c( 3 2) | a, b, c ∈ Q} is a subfield of R.
√ √ 2 √ √ 2
[Hint: Compute [a + b 3 2 + c( 3 2) ][d + e 3 2 + f ( 3 2) ],
where d := a2 − 2bc, e := 2c2 − ab, and f := b2 − ac. To show that the product is
nonzero, use 1.4g).
6.5 Definition. A field F is an ordered field if it satisfies the order axioms OTC, OTR,
OA, and OM given in 1.13. An ordered field is called completely ordered if, in addition, it
satisfies the least upper bound axiom (4.7). (We have assumed that the real numbers R
form a completely ordered field.)
6.8 Remark. One can show (and we will assume) that, for each n, all of the field axioms
hold in Zn , except possibly MIV.
Note that for n = 12, the operations defined above form the arithmetic of a clock.
a). Compute in Z8 : 5 + 7∗ , 5 · 7, 72 ∗ .
∗
2
b). Compute in Z5 : −2∗ , 2 − 4, 2−1 ∗ , 3
.
c). Which elements of Z10 have a multiplicative inverse? Compute those that exist∗ .
a). The set of those elements of Zn which are relatively prime to n is closed under multi-
plication.
Hint: See 3.14(d), 3.14(f).
b). An element a in Zn has a multiplicative inverse iff a and n are relatively prime. Hint:
See 3.14(a) and 6.11 below.
6.11 Exercise. If a ∈ Zn with a relatively prime to n, then, using the Euclidean algorithm
one can find integers r and k such that ar + nk = 1. If r is then reduced via the division
algorithm, r = nq + b, 0 ≤ b < n, then b is the multiplicative inverse of a in Zn . Show that
this is indeed true and then use this method to compute:
C := {a + bi | a, b ∈ R}
For a complex number z = a + bi, a is called the real part of z and b is called the imaginary
part of z. The complex numbers are represented geometrically by the Cartesian coordinate
plane: z = a + bi is paired with the point having coordinates (a, b) as indicated below.
6 imaginary axis
i
real
−1 1 a axis-
.
.
.
−i .
.
.
.
.
.
.
.
b •
. . . . . . . . . . . . . . . . . . . . .
a + bi
a). Show that the sum and product of complex numbers defined above is the same as if
one treated the numbers as polynomials in i and let i 2 = −1. Hence by an abuse of
notation, we write, for example, 2 − 3i for 2 + (−3)i.
b). Find the sum, product and quotient (See 7.5d) of:
(i) 2 + 3i and −2 − 5i ∗ (ii) 5 and −2 + i (iii) 4i and 6i.
35
36 7. Complex Numbers
√ 3∗
d). Compute (− 12 + 1
2
3 i) .
a). What are the relative positions of z, −z, z̄, and −z̄ in the complex plane?
a). z1 + z2 = z 1 + z 2 .
b). z1 z2 = z 1 z 2 .
7.8 Remark. In doing arithmetic with complex numbers it is useful to have the exponential
function ez (for real and complex numbers) and the trigonometric functions sin(x) and cos(x)
for real numbers.
In the following, assume that these functions have been defined for real numbers and
assume (without proof) that they have their familiar properties for real numbers. We will
use this assumption in order to extend the definition of ez to complex numbers z and to
prove that it still satisfies the familiar properties for complex numbers z.
7.9 Definition. For z = a + bi ∈ C, one can write a = r cos θ and b = r sin θ , where r and
θ are as shown below. Hence z = r(cos θ + i sin θ), the polar form of z. Notice that r = |z|
and θ = tan−1 ( ab ) (or, if a < 0, then θ = π + tan−1 ( ab )). The angle θ is called the argument
of z.
The exponential function can be extended from R to C by defining eiθ := cos(θ) + i sin(θ).
Thus can write z = r(cos(θ) + i sin(θ)) = reiθ .
6
r (cos θ + i sin θ) = re i θ = a + bi
©© •.
. . . . . . . . . . . . . . . . . . . . .
©
r ©
.
©©
.
.
©©
.
.
©
.
© θ .
x
© -
.
.
Examples: √
2−
√ 2i = 2 2(cos(− π4 ) + i sin(− π4 ))
−1 + 3i = 2(cos 2π3
+ i sin 2π
3
)
π π
i = 1(cos 2 + i sin 2 ).
7.10 Theorem. If z1 = r(cos θ + i sin θ) and z2 = s(cos φ + i sin φ), then
a). z1 z2 = rs(cos(θ + φ) + i sin(θ + φ)) = rsei(θ+φ) and
z1 r
b). = (cos(θ − φ) + i sin(θ − φ)) = rs ei(θ−φ) .
z2 s
c). e(z1 +z2 ) = ez1 ez2 .
7.11 Exercise. (Polar form and geometric interpretation of multiplication)
√
3
a). Write in polar form: 2 + 2i ∗ , − 12 + 2
i, −2i ∗ , −3.
b). Interpret complex number multiplication geometrically in the plane.
7.12 Theorem (DeMoivre’s Theorem). If z = r(cos θ + i sin θ) and n is a positive integer,
then z n = rn (cos nθ + i sin nθ).
7.13 Theorem. Let z = r(cos θ + i sin θ) be a nonzero complex number.
Then z has exactly n n-th roots; namely:
µ µ ¶ µ ¶¶
√ θ 2π θ 2π
wk := r cos
n
+ k + i sin + k , for k = 0, 1, 2, ..., n − 1.
n n n n
38 7. Complex Numbers
w1 −4i
'$
• '$
2i• w0
A
• ¡
&% &%
¢ 1 = w0 ¡ 2
• •
w 1
w2
7.15 Exercise. Determine and graph in the complex plane all
a). x4 − 16 = 0 d) x3 − 64i = 0 ∗
b). x6 + 64 = 0 ∗ e) x4 + 81 = 0
c). x3 + 8i = 0 f) x5 − 243 = 0.
8. Polynomials
P (x) = a0 + a1 x + a2 x2 + · · · + an xn + . . .
in which ai ∈ F (or ai ∈ Z) for each i and ai = 0 for all i from some point on. The
set of all polynomials over F (or over Z) is denoted by F [x] (or Z[x]) and is called the
polynomial ring over F .
√
Examples: 2 + 2x + x2 is in R[x] and C[x] but not in Q[x] or Z[x];
1 + x + 2x2 + 3x3 + · · · + nxn + . . . is not a polynomial (why?).
8.2 Exercise. If P (x) = 1 + 4x + 2x2 and Q(x) = 4 − x + x2 + 3x3 , compute P (x) + Q(x)
and P (x)Q(x)
a). Using the formal definitions given in 8.1c).
39
40 8. Polynomials
a). deg(P (x) + Q(x)) ≤ max{deg P (x), deg Q(x)}, provided P (x) + Q(x) 6= 0.
[Give an example to show that strict inequality can hold.]
b). deg P (x)Q(x) = deg P (x) + deg Q(x).
8.4 Theorem. For any field F , F [x] (and Z[x])
a). Satisfies all the field axioms except MIV.
b). Has no divisors of zero. That is,
P (x)Q(x) = 0 implies P (x) = 0 or Q(x) = 0.
c). P (x)Q(x) = P (x)R(x) (P (x) 6= 0) implies Q(x) = R(x).
8.5 Theorem (Division Algorithm). Let P (x) and D(x) be polynomials over a field F with
D(x) 6= 0. Then there exist unique polynomials Q(x) and R(x) such that
P (x) = D(x)Q(x) + R(x) with either deg R(x) < deg D(x) or R(x) = 0.
8.6 Exercise. Compute via long division Q(x) and R(x) in the division algorithm if
a). P (x) = x4 − 3x2 + x + 3 and D(x) = 2x2 − 1. ∗
b). P (x) = 5x4 − 2x3 + 12x − 6 and D(x) = x2 + 4.
c). P (x) = 2x4 + 2x3 + 3x2 + x + 1 and D(x) = 2x2 + 1.
8.7 Definition. Suppose that P (x) = a0 + a1 x + a2 x2 + · · · + an xn is a polynomial over a
field F and c ∈ F .
a). The value of P (x) at c is defined to be
P (c) := a0 + a1 c + a2 c2 + · · · + an cn ∈ F .
b). c is called a zero of P (x) (or a root of P (x) = 0) if P (c) = 0.
8.8 Theorem (Remainder Theorem). If the polynomial P (x) over a field F is divided by
x − c (c ∈ F ) to give P (x) = (x − c)Q(x) + R, then R = P (c). [R = R(x) is constant. Why?]
8.9 Notation (Synthetic Division/Substitution). The computations involved in dividing a
polynomial P (x) by x − c can be placed in a convenient format (synthetic division). For
example, to divide 3x4 − x2 + 5x + 2 by x − 2 one can write:
coefficients of P (x)
z }| {
2 3 0 −1 5 2 ¾
c −→ ¾ ←− add
middle = 6 12 22 54
bottom × c
ր ր ր ր
3 6 11 27 56 = remainder = P (c)
| {z }
quotient =
3x3 + 6x2 + 11x + 27
Notice that this process computes the remainder R, which by theorem 8.8 is equal to P (c).
Hence the process of synthetic division is also a convenient method for computing P (c).
When used in this way, it can be called synthetic substitution.
41
(i) x2 + 4x + 13 ∗
(ii) x2 − 6x + 4
(iii) 2x3 + x2 + x − 1 ∗
b). Use the method suggested by the hint in 8.18 to find all zeros in C of the following
polynomials over C.
(i) x2 + 4x + (4 + 9i)
(ii) x2 − 4ix − 13 ∗ .
P (x) = an (x − r1 )(x − r2 ) · · · (x − rn ).
8.22 Definition. A polynomial over a field F of positive degree is called irreducible over F
if it is not the product of two polynomials in F [x] of lesser degree.
8.23 Example. x2 − 2 is irreducible over Q but not over R. x2 + 1 is irreducible over R but
not over C.
8.24 Theorem. (Irreducible polynomials)
a). Every linear polynomial is irreducible.
b). The only irreducible polynomials over C[x] are the linear ones.
8.26 Theorem. Every polynomial over R is the product of polynomials over R of degree at
most 2. [Hint: First factor in C[x]; (x − z)(x − z̄) =?]
b). Write the following polynomials as a product of irreducible polynomials over C. Over
R.
(i) x3 + x2 − x + 1 ∗
(ii) x4 − x2 − 2 ∗
(iii) x4 + 1 ∗
(iv) x6 − 2 ∗
a) 5 · 7 = 3, 72 = 1
2
b) −2 = 3, 3
= 4, 2−1 = 3
c) 1, 3, 7, 9; 3−1 = 7, 9−1 = 9
6.11 a) 9 b) 91
7.2 b) (2 + 3i)(−2 − 5i) = 11 − 16i d) 1
√
7.11 a) 2 + 2i = 2 2(cos π4 + i sin π4 ), −2i = 2(cos 3π
2
+ i sin 3π
2
)
7.15
3π 5π 7π
a) |w| = 1; θ = π4 , 4
, 4
, 4
5π 9π 13π
d) |w| = 2; θ = π8 , 8
, 8
, 8
7π
e) |w| = 2; θ = π6 , 6
7.16
5π 7π 3π 11π
b) |x| = 2; θ = π6 , π
2
, 6
, 6
, 2
, 6
5π 3π
a) |x| = 4; θ = π6 , 6
, 2
45
46 9. Answers to selected exercises
8.19
1
a)i −2 + 3i, −2 − 3i a)iii 2
is a zero
b)ii 3 + 2i, −3 + 2i
8.27