0% found this document useful (0 votes)
25 views31 pages

Naturalnumbers 07

This document outlines a construction of the real numbers from the natural numbers using set theory. It begins by defining the natural numbers using the Peano axioms and discusses proofs by induction. Subsequent sections will define the integers, rational numbers, and real numbers in a systematic progression.

Uploaded by

lawrence momanyi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views31 pages

Naturalnumbers 07

This document outlines a construction of the real numbers from the natural numbers using set theory. It begins by defining the natural numbers using the Peano axioms and discusses proofs by induction. Subsequent sections will define the integers, rational numbers, and real numbers in a systematic progression.

Uploaded by

lawrence momanyi
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

MATH 304: CONSTRUCTING THE REAL NUMBERS∗†

Peter Kahn

Spring 2007

Contents
1 The Natural Numbers 1
1.1 The Peano Axioms . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Proof by induction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Comparing integral systems . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 Definition by Induction . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.5 Addition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.6 Ordering the natural numbers . . . . . . . . . . . . . . . . . . . . . . 15
1.7 Variations on induction . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.8 Well-ordering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.9 Multiplication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
1.10 The natural numbers and cardinality . . . . . . . . . . . . . . . . . . 24

Except for illustrative examples, the following sections describe a systematic con-
struction of the real numbers using only the basic concepts of set theory. They follow
the standard progression of such constructions: natural numbers, integers, rational
numbers, and real numbers. Exercises with asterisks are considered challenging.

1 The Natural Numbers

The natural numbers N consist of the positive integers 1, 2, 3, . . ., together with the

number 0. Whereas small positive integers have been used, with varied symbolic

representation, since prehistoric times, the number 0 is a relatively late addition


The construction of the real numbers described in Chapter 3 of Robert Stoll’s 1963 book Set
Theory and Logic, W.H. Freeman and Company, San Francisco, London, inspired the development
in these notes. There will be occasional references to Stoll’s book.
† c
°May 21, 2007.

1
(estimated at around 300 B.C.), hence slightly less “natural” than the others. Thus,
some mathematicians exclude it from N. This doesn’t change the theory in any
significant way, just the form of certain definitions and proofs. It turns out to be

slightly more convenient to include 0 in N, so this is what we (and most contemporary

mathematicians) do.
There are a number of different approaches to constructing the natural numbers.

Our point of view will be that the entire set N = {0, 1, 2, 3, . . .} is given as a member of

our universe, together with one crucial property of N. We codify this property in the

form of an axiom, which we shall use to analyze N and to derive new properties. In this

approach, we are reminded of the dictum of the distinguished German mathematician


Leopold Kronecker (1823-1891):

God created the integers, all else is the work of man.

In our derivations of new properties of N, we shall insist on using only the basic
axiom, together with the rules of symbolic logic and introductory set theory as pre-
sented in the earlier chapters. Of course, once we have proved a result, we may use it

as well. By deriving all the usual features of N in this way, not only do we place them

on a solid logical foundation but we also validate our choice of initial assumptions.

1.1 The Peano Axioms

To focus on key properties of N, we make the following definitions:

Definition 1. An integral system < X, x0 , s > consists of a set X, a distinguished

element x0 ∈ X, called the initial element, and a function s : X → X, called the


successor function, satisfying:

2
a. s is an injection.

b. x0 6∈ R(s).

c. Suppose that Y is a subset of X containing x0 such that, for every x ∈ Y we

must have s(x) ∈ Y . Then Y = X.

Property (c) is known as the Principle of Induction (for < X, x0 , s >).

The stated property of Y (namely, that if x ∈ Y , then s(x) ∈ Y ) is sometimes

described by the statement “Y is closed under s.” It is also sometimes abbreviated


by writing s(Y ) ⊆ Y , where s(Y ) is shorthand for {z : z = s(x), for some x ∈ Y }.
The Principle of Induction states that there is only one subset of X that contains x0

and is closed under s: namely X itself. We’ll see shortly how this translates into the

usual method of proof by induction.

Let us now define the function σ : N → N by the rule σ(n) = n + 1. We shall

assume
The Axiom of Integrality: < N, 0, σ > is an integral system.
The ba-
The Axiom of Integrality is a fundamental assumption that we are making about
sic
the natural numbers; its three features, as given in Definition 1, are part of our axiom
for the
concept of N. A nearly identical presentation of these features (written as five items) natural
numbers
is known as the Peano Axioms, named after the Italian mathematician G. Peano,

who published them in 1889.

In the definition of the successor function σ, we used the notation n + 1 because it

is the common way that we write the successor of n. We do not mean by this that
we have yet defined or have available a notion of addition of natural numbers, and

3
that n + 1 is the result of adding 1 to n. Addition is a secondary concept that we
shall define later. To emphasize this, we shall for the time being use the notation
σ(n) = n0 , avoiding n + 1 until later.

1.2 Proof by induction

The student may recognize the Principle of Induction from other courses, perhaps
notably beginning calculus. Anticipating the way we are going to be using this prin-

ciple for proofs later in these notes, let us formulate the general idea here in the

terminology and symbolism of the Symbolic Logic notes and then reformulate it in
more familiar terms.

Proofs by induction involve statements of the form (∀n)P (n), where P is a predicate
involving the single variable n, which is assumed to range over N. Let T (P ) be the
truth set of P . By this we mean the set of all specializations a of the variable n for
which P (a) is true. To prove the given statement, we must show that T (P ) = N.

According to the Principle of Induction for < N, σ, 0 >, this can be accomplished
³¡
by showing two things: namely, that (i) 0 ∈ T (P ), and that (ii) (∀k ∈ N) k ∈
¢ ¡ ¢´
T (P ) ⇒ k 0 ∈ T (P ) .

Reformulating this, we get the familiar version of the induction principle: namely,
Familiar
to prove that (∀n)P (n) is true, first prove that P (0) is true, and then prove
version
of
¡ ¢ induction
(∀k ∈ N) P (k) is true ⇒ P (k 0 ) is true .

The first part is often called the initial step of the induction proof, and the second

is called the inductive step. The hypothesis in the inductive step is often called the

4
inductive hypothesis or induction hypothesis.
We present one example of this method that should be familiar to someone who has

studied calculus. The example is intended as an illustration, not as part of the logical
development we are presenting. So, we shall be assuming the standard properties of

the natural numbers as they are used in arithmetic, elementary algebra, and calculus
courses.
For any natural number n, let P (n) be the predicate 1+3+. . .+(2n+1) = (n+1)2 .

We use induction to prove (∀n)P (n). P (0) is the assertion 1 = (0 + 1)2 , which is
¡ ¢
obviously true. We must now show that (∀k) P (k) ⇒ P (k + 1) . So, choose any

natural number k and assume P (k), i.e., assume 1 + 3 + . . . + (2k + 1) = (k + 1)2 . We


must prove P (k+1). Begin by writing P (k+1): 1+3+. . .+(2(k+1)+1) = ((k+1)+1)2 .
¡ ¢
The left-hand side of this, can be written as 1 + 3 + . . . + (2k + 1) + (2(k + 1) + 1).
Familiar
Since P (k) is true, by the induction assumption, we may replace the part in large
example
¡ ¢ 2
parentheses by (k + 1)2 , obtaining (k + 1)2 + 2(k + 1) + 1, which equals (k + 1) + 1 . of an
induction
This calculation shows that proof

¡ ¢2
1 + 3 + . . . + (2(k + 1) + 1) = (k + 1) + 1 ,

which is just the assertion P (k + 1). Therefore induction step is complete. The
principle of induction now implies that P (n) is true for all n, so the desired result is

proved.

We now return to the logical development and illustrate method of proof by induc-

tion by proving the following simple but useful statement:

Proposition 1. Every natural number is either equal to 0 or is the successor of

5
another natural number.

This proposition may seem obvious to us, but it is not contained directly in the

definition we have given, so it needs to be proved.

Proof. The predicate in the statement of the proposition is n ∈ {0} ∪ R(σ), so its

truth set is obviously {0} ∪ R(σ). Let us call this X for short. We show that X
Another
satisfies the hypotheses of the induction principle. The first property, that 0 ∈ X,
example
is true by definition. Now choose any k ∈ X. Since k = σ(k), we obviously have of an
0
induction
0 0
k ∈ R(σ). But R(σ) ⊆ X, by definition, so k ∈ X. By the principle of induction, proof
X = N, as desired.

We remark that the Axiom of Integrality asserts in part that σ is injective. Together
with the foregoing proposition, this means that every non-zero natural number n can
be written in the form m0 , with m uniquely determined by n. We may call m the

predecessor of n.

Exercise 1. Using the standard facts about natural numbers that you have learned
in earlier courses, prove the following by induction on n:

a. 2 + 4 + . . . + (2n) = n(n + 1).

¡ ¢2
b. 13 + 23 + . . . + n3 = n(n + 1)/2 .

1.3 Comparing integral systems

We now wish to compare different integral systems, and for this, we use the following
definition:

6
Definition 2. Let < X, x0 , s > and < Y, y0 , t > be two integral systems. An integral
isomorphism of integral systems < X, x0 , s >' < Y, y0 , t > is a function f : X → Y
such that

a. f is a bijection.

b. f (x0 ) = y0 .

c. (∀x ∈ X), f (s(x)) = t(f (x)).

If an integral isomorphism exists, we say that < X, x0 , s > is integrally isomorphic to


< Y, y0 , t >.

An integral isomorphism in this sense, simply establishes a bijection between el-

ements of X and Y in such a way as to match initial elements and preserve the

successor relation. Thus, isomorphic integral systems are just copies of one another.

Exercise 2. Prove that this notion of integral isomorphism is an equivalence relation.

Exercise 3. Prove that for every integral system < X, x0 , s >, there is at most one
integral isomorphism < N, 0, σ >'< X, x0 , s >. (Hint: Suppose that f and g are

integral isomorphisms, and prove by induction that (∀n) f (n) = g(n).) The existence

of at least one isomorphism requires the notion of definition by induction, which is


discussed below.

Exercise 4. In this exercise you may use the standard properties of numbers (real,

rational, natural, etc.). For each of the following examples, state which is an integral
system and which isn’t. In either case, justify your answer.

7
a. < Z, 0, σ >, where Z denotes the set of all (i.e., positive, negative and zero)
integers, and 0 and σ are as before.

b. < Q+ , 0, σ >, where Q+ is the set of non-negative rational numbers, and 0 and

σ are as before.

c. < N(10), 10, σ >, where N(10) is the set of natural numbers ≥ 10, and σ is as

before.

d. < N− , 0, σ >, where N− denotes the non-positive integers, and 0 and σ are as

before.

1.4 Definition by Induction

The principle of induction is used not only to prove statements of the sort already

indicated, but it is also used to define functions that have N as their domain.

For example, assuming for the moment that we have full use of the system of real
numbers R, choose any real number a. We might try to define an “exponential”
Two
function f : N → R as follows: f (0) = 1; f (n0 ) = af (n). Indeed, for the first few familiar

values of n, we get f (0) = 1, f (1) = af (0) = a, f (2) = a · a = a2 , f (3) = a · f (2) = examples


of
2 3
a · a = a , . . ., etc. For those familiar with computer science, this is just the standard definition
by
recursive definition of an . The definition of f at the successor of n is defined in terms induction

of its value at n.
As another example, we might try to define the function g : N → R as follows:

g(0) = 1; g(n0 ) = n0 g(n). The first few values of this function are 1, 1, 2, 6, 24, 120, . . . .

Most students have seen this function before: it is just n!, the factorial function. Here,

8
the definition of g at the successor of n depends both on n itself and on the value of
g at n.
These definitions work fine if one is interested only in finitely many values of the
function—and this is all that comes into play in computer science or combinatorics.

By themselves, these rules for defining (or, more precisely, attempting to define) the
function, do not actually give a function with domain N. They give only a finite

sequence of function values.


This gap is filled by the following theorem, which shows that the examples just

given do indeed determine functions with domain N. It also gives a general criterion

for making valid definitions “by induction.” The theorem can be proved using the
principle of induction, but the proof is fairly lengthy, so we shall omit it. A proof can

be found in the first two chapters of Stoll’s book. We shall use the result stated by
the theorem, however.

Theorem 1. Suppose that A is a given set with a fixed element a0 selected in A.


Suppose further that we have functions hn : A → A, one for each n ∈ N. Then, there
exists a unique function k : N → A satisfying (i) k(0) = a0 , and (ii) k(n0 ) = hn (k(n)).

Notice that the theorem requires us to have available the element a0 and the stated

functions hn . It then gives us the desired function k. The functions hn are either

given as part of the background information or can be defined by some method other

than induction.

Let us illustrate this by showing how it applies to the two examples above.
For the first example, let A be the set of real numbers R, and let a be a real number

as before. For each n, let hn : R → R be given by the formula hn (x) = ax. (In this
case, all these functions hn are equal.) Notice that the definition of hn does not use

9
induction. Then, the theorem tells us that there exists a unique function k such that
k(0) = a and k(n0 ) = hn (k(n)) = ak(n). This is precisely the exponential function f
of the first example.
For the second example, let A = N and now choose a = 1. Define hn by the formula

hn (x) = n0 x. The reader should convince himeself/herself that this definition does
not use induction. In this case, all the functions hn are different from each other.

The theorem now tells us that there exists a unique function k : N → N satisfying
k(0) = 1 and k(n0 ) = n0 k(n). This is precisely the factorial function g of the second

example.

Exercise 5. Use the method of definition by induction to prove that for each integral

system, < X, x0 , s >, there exists at least one integral isomorphism f :< N, 0, σ >→<
X, x0 , s >.

Exercises 3 and 5 tell us that every integral system is a copy of < N, 0, σ > in a
unique way, or, to put it another way, there is only one integral isomorphism class
of integral systems, and every representative is isomorphic to < N, 0, σ > in a unique
Uniqueness
way. Because of this , we say that the properties in Definition 1 characterize (or of

determine, or define) < N, 0, σ > Henceforth, we write < N, 0, σ > more briefly as N natural
numbers
We now use this method of definition by induction to define addition of natural

numbers.

1.5 Addition

We have started with the most primitive features that lie at the heart of our concept
of the natural numbers, encoded as The Integrality Axiom (or Peano’s Axioms). No-

10
tice that, except for the operation of succession, (i.e., counting), we are not assuming
anything about operations with the natural numbers (such as addition and multipli-
cation). In particular, we are in a pre-arithmetic mode with respect to N. We now

remedy this by defining the operation of addition , +, for N. Induction can then be

used to verify that this operation has all the familiar properties. The definition and

derivation of the properties of multiplication are similar; we do this later.


Choose any m ∈ N. We shall use induction to define a function am : N → N, and

we shall denote the function value am (n) by m + n. To define am we make use of

Theorem 1, and this requires us to define functions hn : N → N. In this case, the

definition is easy: we define each hn to be the successor function σ. We must also


choose a fixed member of N, which in this case, we choose to equal m. Then Theorem

1 tells us that there exists a unique function N → N, which we shall denote am such
that

a. am (0) = m.

b. am (n0 ) = (am (n))0 .

Definition 3. For any m and n in N, we denote am (n) by m + n and call it the sum

of m and n ( or the result of adding m to n, or m plus n).

In terms of the + notation, the two defining features of am may be written as

follows:

a. m + 0 = m.

b. m + n0 = (m + n)0 .

11
We are left with the important task of justifying this definition, which amounts to
showing that it gives us the operation of addition with which we are all familiar, i.e.,
that it has all the familiar properties. Those that are not immediate consequences of

the definition are all proved by induction. We sketch a few proofs by induction for

illustration.

Lemma 1. Choose any natural numbers `, m, and n. Then: (i) m+0 = m = 0+m;
(ii) m + 1 = m0 = 1 + m; (iii) ` + (m + n) = (` + m) + n.

Proof. (i) m + 0 = am (0) = m, by definition, proving the first equality. The second
equality, namely, m = a0 (m) must be proved by induction on m. By definition,

a0 (0) = 0, so the assertion is true for m = 0. Now assume it is true for m = k:

that is, k = a0 (k). Evaluate a0 (k 0 ) = (a0 (k))0 = k 0 , the first equality following from
property (b) of a0 and the second from the induction hypothesis. This completes the
proof of (i).

(ii) Evaluate m + 1 = am (1) = am (00 ) = (am (0))0 = m0 . (The reader should fill in the
reasons for each equality.) The second equality, namely, m0 = a1 (m), is proved by
induction. When m = 0, both sides reduce to 1, so the assertion is true for m = 0.

Assume it for m = k, i.e., k 0 = a1 (k), and then evaluate a1 (k 0 ) = (a1 (k))0 = (k 0 )0 .

Switching the order of the equality, we obtain (k 0 )0 = a1 (k 0 ). But this is precisely the

second equality in assertion (ii) of the lemma, with m = k 0 , as desired. Therefore,

the induction, hence the proof of (ii), is complete.

(iii) Let P (n) be the predicate

(∀`)(∀m)(` + (m + n) = (` + m) + n).

12
We prove (∀n)P (n) by induction on n. P (0) is the statement (∀`)(∀m)(` + (m + 0) =
(` + m) + 0). Both sides reduce to ` + m, by what has already been proved, so P (0) is
true. Now assume P (k) for some k. We evaluate the left-hand side of P (k 0 ) for any

selected ` and m:

` + (m + k 0 ) = ` + (m + k)0
¡ ¢0
= ` + (m + k)
¡ ¢0
= (` + m) + k

= (` + m) + k 0 .

Again, we leave it to the reader to supply reasons for each step. The resulting equality
is precisely P (k 0 ), so the induction proof of (iii) is complete.

The reader may have noticed that statement (i) of the lemma is the familiar additive
identity rule for arithmetic and that the third statement of the lemma is the familiar
associative law for addition. Usually, these are stated as axioms or properties that we
assume. The lemma shows, however, that these can be proved from more rudimentary

concepts.

The associative law is needed because addition is defined to be a binary operation


on natural numbers. Once more than two numbers are to be added, we must decide

how to group them so that we can do the addition two numbers at a time. In the

case of three numbers, there are two ways of grouping the numbers (without changing
their order)—or, as we may say, there are two ways of associating the numbers (into

pairs). The associative law tells us that these two ways give equal sums.

Exercise 6. Show that there are five ways of associating four numbers into pairs,

13
and then prove, using the associative law, that all five ways produce the same sum.
How many ways of associating five numbers are there? List them.

This exercise shows that the associative law can be extended to the case of four

numbers, and similar arguments extend the law to five numbers, etc. Indeed, there is
a general inductive argument (which we do not go into in these notes) that extends

the result to any number of numbers. This means that, for addition, we do not need
to use parentheses to express arbitrarily long sums (e.g., a0 + a1 + . . . + an ), since any

association into pairs yields the same result. So, from now on we feel free to omit

parentheses in these situations.

Incidentally, earlier, we expressed some discomfort in using the notation “n + 1” to


represent the successor of n, since we had not yet defined addition. However, now, in
light of assertion (ii) of the lemma, we are able to use either n + 1 or 1 + n in place

of n0 .

Exercise 7. Use the definition of addition and the method of proof by induction to

prove the following statements:

a. (∀m)(∀n)(m + n = n + m). the commutative law for addition

¡ ¢
b. (∀k)(∀m)(∀n) (m + k = n + k) ⇒ (m = n) . the cancellation law for addition

The statements in the earlier lemma and in the exercise give the most important

features of the additive algebra of the natural numbers, so we summarize them in the

following theorem. Further algebra will come into play when we discuss multiplication
later.

14
Basic
Theorem 2. Let x, y, z be any natural numbers. Then the following hold: properties
of
a. x + (y + z) = (x + y) + z. associativity of addition addition

b. (x + z = y + z) ⇒ x = y. cancellation for addition

c. x + 0 = 0 + x = x identity law for addition

d. x + y = y + x. commutativity of addition

1.6 Ordering the natural numbers

Although we have spoken informally about a sequential aspect to the natural numbers,
the concept of an integral system does not, by itself, contain any explicit information
about an ordering of the elements, except for the reference to one element as the

initial element. But the natural ordering of the counting numbers is one of their
basic features, so we now introduce this into the picture with a precise definition.

Definition 4. Given m, n ∈ N, we write m ≤ n (or, equivalently, n ≥ m) if and only


if there exists a k ∈ N such that m + k = n.

For later use, it is convenient to rephrase this definition in the following way: Let

m and n be any natural numbers. The equation

m + x = n has a solution in N if and only if m ≤ n.

Exercise 8. Prove the following addendum to the foregoing: If the equation has a

solution in N, then the solution is unique.

15
Exercise 9. Show that ≤ is a partial order on N.

Exercise 10. Let < be the strict ordering corresponding to ≤. Show that m < n if
and only if there exists a k ∈ N such that k 6= 0 and m + k = n (or, equivalently,

k > 0 and m + k = n).

Exercise 11. Let m, n, and p be any natural numbers. Prove that m ≤ n ⇐⇒

m + p ≤ n + p.

Exercise 12. Suppose that m, n, p and q are natural numbers such that m+n = p+q.

Prove that if m < p, then n > q.

Exercise 13. * Let m be any natural number. Prove by induction on n, that no

natural number n satisfies m < n < m + 1.

Again, this is an example of one of those so-called “obvious” statements that is not
so obvious to prove. Another way of stating this fact is the following: For all m and
n, n < m + 1 if and only if n ≤ m. Convince yourself that this is, indeed, equivalent
to the assertion in the preceding exercise.

Proposition 2. ≤ is a linear order on N.

Proof. (a) A linear order is a partial order for which any two elements are compa-

rable (cf. Set Theory notes). So, we have to show that for any m and n in N, either

m ≤ n or n ≤ m. We prove this by induction on m. So, let P (m) be the assertion

³ ´
(∀n) (m ≤ n) ∨ (n ≤ m) .

Since, for any n, 0 + n = n, it follows that 0 ≤ n, and so P (0) is true.

16
Now assume P (m) is true for m = k. We prove P (k 0 ) by first choosing any n. We
must show k 0 ≤ n or n ≤ k 0 . By the induction hypothesis, there are two cases, k ≤ n
or n ≤ k: (i) Suppose n ≤ k . Since k 0 = k + 1, we have k ≤ k 0 , so, by the transitivity

of ≤, n ≤ k 0 . So P (k 0 ) holds in this case. (ii) Now suppose that k ≤ n. Case (i)

includes the case n = k, so we may as well assume k < n. We shall show that k 0 ≤ n.

Since k < n, it cannot be the case that n = 0. Therefore, n has a unique predecessor

p, i.e., n = p + 1. By Exercise 13, the strict inequality k < n = p + 1 is equivalent to


k ≤ p, i.e., p = k + `, for some ` ∈ N. Therefore, n = p + 1 = k + 1 + `, which implies

k 0 ≤ n, as desired. This completes the induction.

Often the fact that ≤ is a linear order is phrased in terms of <, which we then also Strict
linear
call a (strict) linear ordering. In terms of <, the condition is phrased as follows: For ordering
any natural numbers m and n, exactly one of the following holds: m < n, m = n, or
n < m.

Exercise 14. Prove the following, for all m, n, p ∈ N:

a. m < n ∧ n<p ⇒ m < p ; This is just the transitivity of <. Since


the transitivity of ≤ has already been proved, you may use that for this proof.

b. n = 0 ∨ n > 0.

c. ¬(n < n).

d. m < n ⇒ σ(m) = n ∨ σ(m) < n.

17
1.7 Variations on induction

Using the linear ordering of N, it is now easy to formulate two variations of the usual
method of induction.
Variation 1: This method does not have a commonly used name since it amounts

to little more than a re-labeling. We might call it shifted induction, however, when
we refer to it. This kind of induction arises when the predicate one would like to

verify is not usefully defined for some small values of n. Perhaps P (n) is defined only
Shifted
for n ≥ some fixed number n0 which depends on the particular problem. We can
induction
still do induction in this case, except that our initial value is no longer 0 but is n0 .
¡
The situation we are facing then is that P (n0 ) is known to be true and (∀k) P (k) ⇒
¢
P (k + 1) is known to be true, provided k ≥ n0 . We would then like to be able to
conclude that P (n) is true for all n ≥ n0 . Essentially, our entire frame of reference is
shifted over by n0 units.

This variant of induction is easy to derive from the standard induction principle
via the following re-labeling trick. Define a new predicate Q by the formula Q(n) =
P (n0 + n). Since, by definition, k ≥ n0 if and only if there is a natural number m such
that k = n0 + m, k ranges over all natural number ≥ n0 precisely when m ranges over

all natural numbers. Therefore, the situation we confront can now be reformulated as
¡ ¢
follows: Q(0) is true, and (∀m) Q(m) ⇒ Q(m + 1) is true. So, ordinary induction

implies that Q(n) is true for all natural numbers n, and, hence, P (n) is true for all

natural numbers n ≥ n0 .

Variation 2: This method is sometimes called strong induction or complete induction


Strong
even though it is equivalent to ordinary induction. In this form of induction, we still induction

18
have the same initial step. But the induction step is replaced by the following

³ ¡ ¢ ´
(∀k) (∀i) (i ≤ k) ⇒ P (i) ⇒ P (k + 1) .

That is, if, for all k, P (1), P (2), . . . , P (k) are all true, then P (k + 1) is true. The

initial step plus the induction step then imply the usual conclusion: (∀n)P (n) is true.

Exercise 15. Given a predicate P = P (n), formulate a predicate Q such that by


applying ordinary induction to Q, you get strong induction for P .

Of course, Variation 1 can be combined with strong induction: i.e., you don’t have
to start at 0 in strong induction.

Here is an example showing how strong induction, although logically equivalent to


ordinary induction, is a convenient tool in proofs. We shall be referring to standard

multiplicative properties of natural numbers, which have not yet been established in
our logical development, but we are doing so only for illustration.

Proposition 3. Every natural number n > 1 can be written as a product of primes.

Proof. This proof is just an illustrative sketch. Since there are no natural numbers

n satisfying 1 < n < 2, we may start the strong induction at n = 2. When n = 2,


the proposition is immediate, since 2 itself is a prime, so it is a one-fold product of

primes. Suppose the proposition is true for all natural numbers i ≤ k, and consider
Example
the natural number k + 1. If it is prime, we are done. If not, then by definition,
of
it can be written as a product of two natural numbers k + 1 = ab, with neither a proof
by
nor b equal to k + 1 or 1. Therefore, both a and b are < k + 1. Since there are strong
induction
no natural numbers strictly between k and k + 1, both a and b are ≤ k, hence the

19
strong induction hypothesis may be applied to them. They are thus both products
of primes, and so, clearly their product is a product of primes.

The point here is that since we are dealing with a multiplicative property, simply
knowing the proposition for k may not be adequate for proving it for k + 1, since

these two numbers are not related multiplicatively in a nice way. However, knowing
the proposition for all natural numbers ≤ k gives us the flexibility we need for the

proof. This concludes our discussion of strong induction.

1.8 Well-ordering

Finally, here is another basic and useful fact about N, which plays a role in proofs
later on.

Proposition 4. Every non-empty subset S of N has a least element (i.e., an element

that is minimal among all elements in S).

Proof. Let S be any subset of N. The proof will use the contrapositive method.
Assume that S has no minimal element, and let T be the set difference N \ S. Let
¡ ¢
P (n) be the predicate (∀k) (k ≤ n) ⇒ (k ∈ T ) . This asserts that the set of all

natural numbers less than or equal to n is a subset of T .

Clearly P (0) is true, since it asserts simply that 0 ∈ T . If 0 were not in T , it

would be in the complementary set S, and it would clearly be a minimal member of


S, contradicting our assumption about S.

Exercise 16. Prove the induction step (∀m)P (m) ⇒ P (m + 1).

The exercise concludes the induction proof that (∀n)P (n) is true. But this means
that T = N, hence S = ∅.

20
Thus, assuming that S has no minimal element, the above proof shows that it must
be empty, which is precisely the contrapositive of the statement of the proposition.

This property of N is known as the well-ordering principle. We sometimes say, The


natural
simply, the natural numbers are well-ordered. numbers
are
Let us be more precise about the definition of well-ordering. We say a linear ordering well-
ordered
on a set S (or a strict linear ordering on S) is a well-ordering provided that every

non-empty subset of S has a minimal element. So, to prove that a relation on S is

a well-ordering, first, one has to prove that it is a linear ordering (or a strict linear
ordering), and then one has to prove that it satisfies this extra minimality property.
The following exercise shows that there are well-ordered sets that do not satisfy the

Axiom of Integrality (i.e., induction does not hold).

Exercise 17. * Consider the cartesian product N × N = N2 . Using the ordering


Example
of N just defined, we now endow N2 with the associated (strict) lexicographic order :
of a
namely, we write (a, b) < (c, d) if and only if (a < c) or ((a = c) ∧ (b < d)). well-
ordered
set
a. Show that this defines a strict linear order on N2 , and then show that this linear for
which
order well-orders N2 . induction
fails
b. Define a function τ : N2 → N2 as follows: For any (a, b) ∈ N2 , let T (a, b) be

the set of all (c, d) > (a, b). Since N2 is well-ordered, T (a, b) has a minimal

element, say (m, n). Define τ (a, b) = (m, n). Prove that τ is injective and
that (0, 0) 6∈ R(τ ). From the definition, it should be clear that τ is a kind of

“successor function.” (In fact, it is not hard to prove that τ (a, b) = (a, b + 1),

but, unless you prove this, do not use this fact for this part of the exercise.)

21
c. Show that < N2 , τ, (0, 0) > fails to be an integral system. That is, show why
the induction principle fails. (Here, you may want to prove and use the formula
for τ given above.)

1.9 Multiplication

We proceed similarly to the case of addition, using the method of induction to define
multiplication.

Choose any natural number m. We shall define a function bm : N → N which is

supposed to represent multiplication by m. To do this by the method of induction, we


need to define functions hn : N → N, which we define as follows: hn (x) = x + m, for

each n. Note that we use addition in this definition, but that’s okay because addition
has already been defined. The method of induction also requires us to choose a
particular element in N: our choice is the natural number 0.

Then, Theorem 1 tells us that there exists a unique function bm : N → N satisfying

bm (0) = 0, and

bm (n0 ) = bm (n) + m.

The first few values of bm are:

22
bm (0) = 0

bm (1) = m

bm (2) = m + m

bm (3) = (m + m) + m

and so on. Clearly, bm represents repeated addition of m, which coincides with our
intuitive understanding of multiplication.

We write bm (n) as mn and call it the product of m by n or m times n. (Occasionally,


we interpose a dot, as in a · b, if this makes the product more readable.)

In terms of the product notation that we just introduced, the key defining properties
of bm can be written as

m · 0 = 0, and

m · n0 = mn + m.

We now list the basic properties of multiplication as a theorem.

Theorem 3. For any natural numbers `, m, and n, the following are true:

a. (`m)n = `(mn). the associative law for multiplication


Basic
b. `m = m`. the commutative law for multiplication properties
of
multi-
c. ` · 1 = ` = 1 · `. the multiplicative identity law plication

23
d. 0 · m = 0 = m · 0.

e. Suppose that ` 6= 0. Then, we have (` · m = ` · n) ⇒ m = n.

the cancellation law for multiplication

f. Suppose that ` 6= 0. Then we have (` · m ≤ ` · n) ⇐⇒ m ≤ n.

g. `(m + n) = `m + `n and (` + m)n = `n + mn. the distributive laws

Except for the second equality in item (d), which is true by definition, all of the
statements in this theorem can be proved by the method of induction. The proofs are

not very different from those already done for the facts about addition. However, the

first equality in (d) can also be proved directly (i.e., without induction) , provided
one has first proved the second distributive law. The proof is a little trick involving
the evaluation of 0 + 0 · m as follows: 0 + 0 · m = 0 · m = (0 + 0)m = 0 · m + 0 · m. Thus,
we get 0 + 0 · m = 0 · m + 0 · m, which, by additive cancellation, yields 0 = 0 · m. We

give this proof because a nearly identical proof can be used in more general contexts,

which we’ll encounter later.

Exercise 18. Prove the statements of Theorem 3.

1.10 The natural numbers and cardinality

In this section we discuss cardinality issues related to N, making use of the material

in Section 6 of the Set Theory notes. The student should refer to that section when

needed for reminders about definitions.

We begin by defining the initial segments of the set N.

Definition. Let n be any natural number, and define the nth initial segment Nn to
be {k : (k ∈ N) ∧ (k < n)}.

24
Obviously, N0 is the empty set. Moreover, since, for any natural number m, there
are no natural numbers strictly between m and m + 1 (Exercise 13), we have N1 =
{0}, N2 = {0, 1}, N3 = {0, 1, 2}. More generally, for n > 0, we could write Nn =

{0, 1, . . . , m}, where here m is the predecessor of n. To promote ease of reading, it

is useful to give this number its usual name, namely n − 1. However, we need to

remember that subtraction has not yet been defined, so at this point, the use of n − 1

does not indicate that we are subtracting 1 from n, only that we are considering the
immediate predecessor of n. With this notation understood, we may sometimes write

Nn as

{0, 1, 2, . . . , n − 1}.

25
Definition. A set S is called finite if and only if #(S) = #(Nn ), for some natural
Finite
number n. A set that is not finite is called infinite. and
infinite
The reader should have no trouble with the following exercise. sets

Exercise 19. Suppose that S is a set such that #(S) = #(Nn ), for some natural
number n, and suppose that x is an object not in S. Then #(S ∪ {x}) = #(Nn+1 )

We now recall the definition of the relation ≺ defined in the chapter Set Theory. If

S and T are two sets, we write #(S) ≺ #(T ) to indicate that there is an injection
S → T but no bijection S → T . In Set Theory, we cite some theorems that imply
that ≺ is a strict linear order.

We wish to compare the strict order relation < on N with the relation ≺ on the
collection of initial segments Nn . In order to keep assumptions to a minimum, we

shall use only the fact that ≺ is transitive and irreflexive in this context, not that it
is a linear order. In particular, we do not assume the Schröder-Bernstein Theorem or
the fact that any two cardinalities are comparable.

Lemma 2. Let m and n be any natural numbers such that m < n. Then there is no
injective map Nn → Nm .

Proof. The proof is by induction on m. When m = 0, the result is obvious, because

Nm is empty and Nn is not, and there is no map whatsoever from a nonempty set to

the empty set, much less an injective map.


So, assume inductively that the result holds for m = k. We show that when n >

k + 1, there is no injective map Nn → Nk+1 . This proof will proceed by contradiction.

Accordingly, suppose that n > k + 1 and there is an injective map f : Nn → Nk+1 .


Since n > k + 1 > k, we may use the inductive hypothesis to conclude that the range

26
of f cannot be contained in Nk = {0, 1, . . . , k − 1}. That is, there must be some a
such that f (a) = k.
If a = n − 1, i.e., if f (n − 1) = k, then f maps all the other elements, i.e., the
elements of Nn−1 , injectively into Nk . But, since n > k + 1, we have n − 1 > k, so we

may use the induction hypothesis to conclude that this injective mapping Nn−1 → Nk
cannot exist. Therefore, f cannot satisfy f (n − 1) = k. This means that f (a) = k,

for some a other than n − 1; it also means, since f is injective, that f (n − 1) = b,


where b is some element of Nk+1 other than k.

We now define a new map f 0 : Nn → Nk+1 as follows: f 0 (x) = f (x), as long as x

is neither equal to a or to n − 1; f 0 (a) = b, and f 0 (n − 1) = k. That is, f 0 is almost


identical with f except that it switches the values of f at a and n − 1. Therefore,

f 0 is an injective map from Nn to Nk+1 satisfying f 0 (n − 1) = k. But we have just


shown in the preceding paragraph that such a map cannot exist. This is the desired
contradiction.

Corollary 3. For every pair of natural numbers m and n,


Comparing
<
m<n ⇐⇒ #(Nm ) ≺ #(Nn ). and

Proof. We first prove the implication in the =⇒ direction. Let any pair of natural

numbers m and n be given such that m < n. Then Nm ⊆ Nn , and so #(Nm ) ¹

#(Nn ). Lemma 2 implies that we cannot have #(Nm ) = #(Nn ), which proves that
#(Nm ) ≺ #(Nn ).

Now we prove the converse. Suppose #(Nm ) ≺ #(Nn ). Since this implies that there

is an injective map Nm → Nn , Lemma 2 implies that we cannot have m > n. But the
assumption #(Nm ) ≺ #(Nn ) also implies that there is no bijection Nm → Nn , so we

27
cannot have m = n. The only remaining possibility, then, is that m < n.

Exercise 20. For all natural numbers m and n, m = n if and only if #(Nm ) = #(Nn ).

We may conclude, therefore, that the set of natural numbers corresponds bijectively
to the set of all finite cardinalities in such a way as to preserve the strict orderings

defined on these sets. In particular, this shows that ≺ is a well-ordering on the set of
all cardinalities Nn , n ranging over N, indeed a copy of the strict linear ordering <

on N.
New notation: It is clearly convenient to have some more compact notation for

the cardinality #(Nn ), and, of course, we all know that the conventional terminology

and notation fits the bill: namely, instead of “#(Nn ),” we write simply “n,” and we
refer to any set with that cardinality as a “set with cardinality n.” The results just
proved provide logical justification for adopting this convention. That is, even though
the concept of n as an element of the set N is very different from the concept of the

cardinality of the set Nn , since the two correspond bijectively in an order-preserving

way, there is little cost and considerable convenience in conflating the two. In a
similar spirit, we shall use “<” instead of “≺” when talking about finite cardinalities.

That is, we shall use “m < n” to mean either that the natural number m precedes
the natural number n or that the cardinality m is strictly less than the cardinality n.

Proposition 5. Let n be any natural number, and let S be any subset of Nn . Then,

there exists a natural number m ≤ n such that #(S) = #(Nm ).

Exercise 21. * Prove the above proposition by induction on n. The initial step of

the induction is almost obvious. We give a hint for the inductive step. Assume that

28
the result holds for n = k, and let S be a subset of Nk+1 = {0, 1, . . . , k}. If S is
already contained in Nk = {0, 1, . . . , k − 1}, then the induction hypothesis applies
immediately. If not, this means that k must be a member of S. Let T = S \ {k},

show that the induction hypothesis can be applied to T , and then use this, together

with Exercise 19 to get the desired result for S.

This proposition gives us good information about subsets of Nn , but it doesn’t tell Bounded
and
us about finite sets in general. To approach the general case, we make a definition: unbounded
subsets
we call a subset S of N bounded if there exists a natural number n such that S ⊆ Nn . of N
Otherwise, we say it is unbounded. Proposition 5 tells us that bounded sets are finite.

Exercise 22. Prove that if f : N → N is an injective map, then its range R(f ) is
unbounded. (Hint: Use Lemma 2.) Use this to conclude that N is infinite.

Proposition 6. If S is a finite subset of N, then S is bounded.

Proof. We prove the contrapositive: If S is unbounded, then S is infinite. So, suppose


that S is unbounded. We begin by defining a function f : S → S. Note that this is
not an inductive definition.

First observe that S 6= ∅, since ∅ is obviously bounded. Next, choose any n ∈ S.

We shall define f (n). Let Tn be the set

{m : (m ∈ S) ∧ (m > n)}.

If Tn = ∅, this would mean that every m ∈ S satisfied m ≤ n, i.e., S ⊆ Nn+1 ,

contradicting the assumed unboundedness of S. So Tn is non-empty. By the well-

ordering on N, Tn has a smallest element, say n1 .

Define f (n) = n1 . Notice that this definition insures that f (n) > n, for every n ∈ S.

29
This defines the function f : S → S. Next, we define a function g : N → S
inductively as follows: Let g(0) be the smallest element of S, say g(0) = n0 . Assuming
that g(k) has been defined, let g(k + 1) = f (g(k)). This gives an inductive definition

for g. Notice that, for every natural number n, g(n + 1) = f (g(n)) > g(n).

Exercise 23. Prove by induction that if m < n, then g(m) < g(n). (Hint: Since

m < n, we have n = m + k, for some k ≥ 1. Proceed by induction on k If k = 1, the

previous paragraph gives the desired inequality, hence proving the base case. Now do

the inductive step.)

The exercise proves that g : N → S is an injection. If S were finite, we would have

a bijection h : S → Np , for some natural number p. But then the composite h ◦ g


would be an injection N → Np . Exercise 22 shows that this is a contradiction.

The proof of Proposition 6 shows slightly more than what the proposition asserts.
Not only does it show that every unbounded subset S of N is infinite, it also shows that
#(N) ¹ #(S). Since S ⊆ N, we also have #(S) ¹ #(N). Therefore, now applying
the Schröder-Bernstein Theorem, we may conclude that any unbounded subset of

N has the same cardinality as #(N). Since every subset of N is either bounded or

unbounded, we see that the subsets of N have either finite cardinality or the same

cardinality as that of N (which is infinite).

We can state this in a slightly stronger way as follows: Let S be any set such that
there is an injective map f : S → N. Then, of course, f defines a bijection between
The
S and the subset R(f ) of N. So, the above statement can be rephrased as: Let S be
smallest
any set such that #(S) ¹ #(N). Then, either #(S) is finite or #(S) = #(N). In infinite
cardinality
other words #(N) is the smallest infinite cardinality. It is often denoted ℵ0 . ℵ0 .

30
This leaves open one obvious question: Are there infinite cardinalities  ℵ0 ? The
answer to this is “yes,” and a proof was given via the extra-credit exercise in the
homework assignment for Week 6: specifically, for any set A it was shown that #(A) ≺

#(P(A)) (a result of G. Cantor).


It follows that ℵ0 ≺ #(P(N)). If we apply the exercise (Cantor’s Theorem) with
X = P(N), we get a strictly larger cardinality, #(P(P(N))), and so on. This shows

that there are infinitely many infinite cardinalities.


Of course, most of the mathematics we do involves more familiar sets, such as the

set of real numbers R, the set of points in the plane, etc. The above exercise does

not clarify what the cardinalities of these sets are. For example, is it possible that
ℵ0 = #(R)? Another important result of Cantor shows that the answer is “no.” We

shall go through his proof later in the course.

31

You might also like