Naturalnumbers 07
Naturalnumbers 07
Peter Kahn
Spring 2007
Contents
1 The Natural Numbers 1
1.1 The Peano Axioms . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Proof by induction . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.3 Comparing integral systems . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 Definition by Induction . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.5 Addition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
1.6 Ordering the natural numbers . . . . . . . . . . . . . . . . . . . . . . 15
1.7 Variations on induction . . . . . . . . . . . . . . . . . . . . . . . . . . 18
1.8 Well-ordering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
1.9 Multiplication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
1.10 The natural numbers and cardinality . . . . . . . . . . . . . . . . . . 24
Except for illustrative examples, the following sections describe a systematic con-
struction of the real numbers using only the basic concepts of set theory. They follow
the standard progression of such constructions: natural numbers, integers, rational
numbers, and real numbers. Exercises with asterisks are considered challenging.
The natural numbers N consist of the positive integers 1, 2, 3, . . ., together with the
number 0. Whereas small positive integers have been used, with varied symbolic
The construction of the real numbers described in Chapter 3 of Robert Stoll’s 1963 book Set
Theory and Logic, W.H. Freeman and Company, San Francisco, London, inspired the development
in these notes. There will be occasional references to Stoll’s book.
† c
°May 21, 2007.
1
(estimated at around 300 B.C.), hence slightly less “natural” than the others. Thus,
some mathematicians exclude it from N. This doesn’t change the theory in any
significant way, just the form of certain definitions and proofs. It turns out to be
mathematicians) do.
There are a number of different approaches to constructing the natural numbers.
Our point of view will be that the entire set N = {0, 1, 2, 3, . . .} is given as a member of
our universe, together with one crucial property of N. We codify this property in the
form of an axiom, which we shall use to analyze N and to derive new properties. In this
In our derivations of new properties of N, we shall insist on using only the basic
axiom, together with the rules of symbolic logic and introductory set theory as pre-
sented in the earlier chapters. Of course, once we have proved a result, we may use it
as well. By deriving all the usual features of N in this way, not only do we place them
on a solid logical foundation but we also validate our choice of initial assumptions.
2
a. s is an injection.
b. x0 6∈ R(s).
and is closed under s: namely X itself. We’ll see shortly how this translates into the
assume
The Axiom of Integrality: < N, 0, σ > is an integral system.
The ba-
The Axiom of Integrality is a fundamental assumption that we are making about
sic
the natural numbers; its three features, as given in Definition 1, are part of our axiom
for the
concept of N. A nearly identical presentation of these features (written as five items) natural
numbers
is known as the Peano Axioms, named after the Italian mathematician G. Peano,
is the common way that we write the successor of n. We do not mean by this that
we have yet defined or have available a notion of addition of natural numbers, and
3
that n + 1 is the result of adding 1 to n. Addition is a secondary concept that we
shall define later. To emphasize this, we shall for the time being use the notation
σ(n) = n0 , avoiding n + 1 until later.
The student may recognize the Principle of Induction from other courses, perhaps
notably beginning calculus. Anticipating the way we are going to be using this prin-
ciple for proofs later in these notes, let us formulate the general idea here in the
terminology and symbolism of the Symbolic Logic notes and then reformulate it in
more familiar terms.
Proofs by induction involve statements of the form (∀n)P (n), where P is a predicate
involving the single variable n, which is assumed to range over N. Let T (P ) be the
truth set of P . By this we mean the set of all specializations a of the variable n for
which P (a) is true. To prove the given statement, we must show that T (P ) = N.
According to the Principle of Induction for < N, σ, 0 >, this can be accomplished
³¡
by showing two things: namely, that (i) 0 ∈ T (P ), and that (ii) (∀k ∈ N) k ∈
¢ ¡ ¢´
T (P ) ⇒ k 0 ∈ T (P ) .
Reformulating this, we get the familiar version of the induction principle: namely,
Familiar
to prove that (∀n)P (n) is true, first prove that P (0) is true, and then prove
version
of
¡ ¢ induction
(∀k ∈ N) P (k) is true ⇒ P (k 0 ) is true .
The first part is often called the initial step of the induction proof, and the second
is called the inductive step. The hypothesis in the inductive step is often called the
4
inductive hypothesis or induction hypothesis.
We present one example of this method that should be familiar to someone who has
studied calculus. The example is intended as an illustration, not as part of the logical
development we are presenting. So, we shall be assuming the standard properties of
the natural numbers as they are used in arithmetic, elementary algebra, and calculus
courses.
For any natural number n, let P (n) be the predicate 1+3+. . .+(2n+1) = (n+1)2 .
We use induction to prove (∀n)P (n). P (0) is the assertion 1 = (0 + 1)2 , which is
¡ ¢
obviously true. We must now show that (∀k) P (k) ⇒ P (k + 1) . So, choose any
¡ ¢2
1 + 3 + . . . + (2(k + 1) + 1) = (k + 1) + 1 ,
which is just the assertion P (k + 1). Therefore induction step is complete. The
principle of induction now implies that P (n) is true for all n, so the desired result is
proved.
We now return to the logical development and illustrate method of proof by induc-
5
another natural number.
This proposition may seem obvious to us, but it is not contained directly in the
Proof. The predicate in the statement of the proposition is n ∈ {0} ∪ R(σ), so its
truth set is obviously {0} ∪ R(σ). Let us call this X for short. We show that X
Another
satisfies the hypotheses of the induction principle. The first property, that 0 ∈ X,
example
is true by definition. Now choose any k ∈ X. Since k = σ(k), we obviously have of an
0
induction
0 0
k ∈ R(σ). But R(σ) ⊆ X, by definition, so k ∈ X. By the principle of induction, proof
X = N, as desired.
We remark that the Axiom of Integrality asserts in part that σ is injective. Together
with the foregoing proposition, this means that every non-zero natural number n can
be written in the form m0 , with m uniquely determined by n. We may call m the
predecessor of n.
Exercise 1. Using the standard facts about natural numbers that you have learned
in earlier courses, prove the following by induction on n:
¡ ¢2
b. 13 + 23 + . . . + n3 = n(n + 1)/2 .
We now wish to compare different integral systems, and for this, we use the following
definition:
6
Definition 2. Let < X, x0 , s > and < Y, y0 , t > be two integral systems. An integral
isomorphism of integral systems < X, x0 , s >' < Y, y0 , t > is a function f : X → Y
such that
a. f is a bijection.
b. f (x0 ) = y0 .
ements of X and Y in such a way as to match initial elements and preserve the
successor relation. Thus, isomorphic integral systems are just copies of one another.
Exercise 3. Prove that for every integral system < X, x0 , s >, there is at most one
integral isomorphism < N, 0, σ >'< X, x0 , s >. (Hint: Suppose that f and g are
integral isomorphisms, and prove by induction that (∀n) f (n) = g(n).) The existence
Exercise 4. In this exercise you may use the standard properties of numbers (real,
rational, natural, etc.). For each of the following examples, state which is an integral
system and which isn’t. In either case, justify your answer.
7
a. < Z, 0, σ >, where Z denotes the set of all (i.e., positive, negative and zero)
integers, and 0 and σ are as before.
b. < Q+ , 0, σ >, where Q+ is the set of non-negative rational numbers, and 0 and
σ are as before.
c. < N(10), 10, σ >, where N(10) is the set of natural numbers ≥ 10, and σ is as
before.
d. < N− , 0, σ >, where N− denotes the non-positive integers, and 0 and σ are as
before.
The principle of induction is used not only to prove statements of the sort already
indicated, but it is also used to define functions that have N as their domain.
For example, assuming for the moment that we have full use of the system of real
numbers R, choose any real number a. We might try to define an “exponential”
Two
function f : N → R as follows: f (0) = 1; f (n0 ) = af (n). Indeed, for the first few familiar
of its value at n.
As another example, we might try to define the function g : N → R as follows:
g(0) = 1; g(n0 ) = n0 g(n). The first few values of this function are 1, 1, 2, 6, 24, 120, . . . .
Most students have seen this function before: it is just n!, the factorial function. Here,
8
the definition of g at the successor of n depends both on n itself and on the value of
g at n.
These definitions work fine if one is interested only in finitely many values of the
function—and this is all that comes into play in computer science or combinatorics.
By themselves, these rules for defining (or, more precisely, attempting to define) the
function, do not actually give a function with domain N. They give only a finite
given do indeed determine functions with domain N. It also gives a general criterion
for making valid definitions “by induction.” The theorem can be proved using the
principle of induction, but the proof is fairly lengthy, so we shall omit it. A proof can
be found in the first two chapters of Stoll’s book. We shall use the result stated by
the theorem, however.
Notice that the theorem requires us to have available the element a0 and the stated
functions hn . It then gives us the desired function k. The functions hn are either
given as part of the background information or can be defined by some method other
than induction.
Let us illustrate this by showing how it applies to the two examples above.
For the first example, let A be the set of real numbers R, and let a be a real number
as before. For each n, let hn : R → R be given by the formula hn (x) = ax. (In this
case, all these functions hn are equal.) Notice that the definition of hn does not use
9
induction. Then, the theorem tells us that there exists a unique function k such that
k(0) = a and k(n0 ) = hn (k(n)) = ak(n). This is precisely the exponential function f
of the first example.
For the second example, let A = N and now choose a = 1. Define hn by the formula
hn (x) = n0 x. The reader should convince himeself/herself that this definition does
not use induction. In this case, all the functions hn are different from each other.
The theorem now tells us that there exists a unique function k : N → N satisfying
k(0) = 1 and k(n0 ) = n0 k(n). This is precisely the factorial function g of the second
example.
Exercise 5. Use the method of definition by induction to prove that for each integral
system, < X, x0 , s >, there exists at least one integral isomorphism f :< N, 0, σ >→<
X, x0 , s >.
Exercises 3 and 5 tell us that every integral system is a copy of < N, 0, σ > in a
unique way, or, to put it another way, there is only one integral isomorphism class
of integral systems, and every representative is isomorphic to < N, 0, σ > in a unique
Uniqueness
way. Because of this , we say that the properties in Definition 1 characterize (or of
determine, or define) < N, 0, σ > Henceforth, we write < N, 0, σ > more briefly as N natural
numbers
We now use this method of definition by induction to define addition of natural
numbers.
1.5 Addition
We have started with the most primitive features that lie at the heart of our concept
of the natural numbers, encoded as The Integrality Axiom (or Peano’s Axioms). No-
10
tice that, except for the operation of succession, (i.e., counting), we are not assuming
anything about operations with the natural numbers (such as addition and multipli-
cation). In particular, we are in a pre-arithmetic mode with respect to N. We now
remedy this by defining the operation of addition , +, for N. Induction can then be
used to verify that this operation has all the familiar properties. The definition and
1 tells us that there exists a unique function N → N, which we shall denote am such
that
a. am (0) = m.
Definition 3. For any m and n in N, we denote am (n) by m + n and call it the sum
follows:
a. m + 0 = m.
b. m + n0 = (m + n)0 .
11
We are left with the important task of justifying this definition, which amounts to
showing that it gives us the operation of addition with which we are all familiar, i.e.,
that it has all the familiar properties. Those that are not immediate consequences of
the definition are all proved by induction. We sketch a few proofs by induction for
illustration.
Lemma 1. Choose any natural numbers `, m, and n. Then: (i) m+0 = m = 0+m;
(ii) m + 1 = m0 = 1 + m; (iii) ` + (m + n) = (` + m) + n.
Proof. (i) m + 0 = am (0) = m, by definition, proving the first equality. The second
equality, namely, m = a0 (m) must be proved by induction on m. By definition,
that is, k = a0 (k). Evaluate a0 (k 0 ) = (a0 (k))0 = k 0 , the first equality following from
property (b) of a0 and the second from the induction hypothesis. This completes the
proof of (i).
(ii) Evaluate m + 1 = am (1) = am (00 ) = (am (0))0 = m0 . (The reader should fill in the
reasons for each equality.) The second equality, namely, m0 = a1 (m), is proved by
induction. When m = 0, both sides reduce to 1, so the assertion is true for m = 0.
Switching the order of the equality, we obtain (k 0 )0 = a1 (k 0 ). But this is precisely the
(∀`)(∀m)(` + (m + n) = (` + m) + n).
12
We prove (∀n)P (n) by induction on n. P (0) is the statement (∀`)(∀m)(` + (m + 0) =
(` + m) + 0). Both sides reduce to ` + m, by what has already been proved, so P (0) is
true. Now assume P (k) for some k. We evaluate the left-hand side of P (k 0 ) for any
selected ` and m:
` + (m + k 0 ) = ` + (m + k)0
¡ ¢0
= ` + (m + k)
¡ ¢0
= (` + m) + k
= (` + m) + k 0 .
Again, we leave it to the reader to supply reasons for each step. The resulting equality
is precisely P (k 0 ), so the induction proof of (iii) is complete.
The reader may have noticed that statement (i) of the lemma is the familiar additive
identity rule for arithmetic and that the third statement of the lemma is the familiar
associative law for addition. Usually, these are stated as axioms or properties that we
assume. The lemma shows, however, that these can be proved from more rudimentary
concepts.
how to group them so that we can do the addition two numbers at a time. In the
case of three numbers, there are two ways of grouping the numbers (without changing
their order)—or, as we may say, there are two ways of associating the numbers (into
pairs). The associative law tells us that these two ways give equal sums.
Exercise 6. Show that there are five ways of associating four numbers into pairs,
13
and then prove, using the associative law, that all five ways produce the same sum.
How many ways of associating five numbers are there? List them.
This exercise shows that the associative law can be extended to the case of four
numbers, and similar arguments extend the law to five numbers, etc. Indeed, there is
a general inductive argument (which we do not go into in these notes) that extends
the result to any number of numbers. This means that, for addition, we do not need
to use parentheses to express arbitrarily long sums (e.g., a0 + a1 + . . . + an ), since any
association into pairs yields the same result. So, from now on we feel free to omit
of n0 .
Exercise 7. Use the definition of addition and the method of proof by induction to
¡ ¢
b. (∀k)(∀m)(∀n) (m + k = n + k) ⇒ (m = n) . the cancellation law for addition
The statements in the earlier lemma and in the exercise give the most important
features of the additive algebra of the natural numbers, so we summarize them in the
following theorem. Further algebra will come into play when we discuss multiplication
later.
14
Basic
Theorem 2. Let x, y, z be any natural numbers. Then the following hold: properties
of
a. x + (y + z) = (x + y) + z. associativity of addition addition
d. x + y = y + x. commutativity of addition
Although we have spoken informally about a sequential aspect to the natural numbers,
the concept of an integral system does not, by itself, contain any explicit information
about an ordering of the elements, except for the reference to one element as the
initial element. But the natural ordering of the counting numbers is one of their
basic features, so we now introduce this into the picture with a precise definition.
For later use, it is convenient to rephrase this definition in the following way: Let
Exercise 8. Prove the following addendum to the foregoing: If the equation has a
15
Exercise 9. Show that ≤ is a partial order on N.
Exercise 10. Let < be the strict ordering corresponding to ≤. Show that m < n if
and only if there exists a k ∈ N such that k 6= 0 and m + k = n (or, equivalently,
m + p ≤ n + p.
Exercise 12. Suppose that m, n, p and q are natural numbers such that m+n = p+q.
Again, this is an example of one of those so-called “obvious” statements that is not
so obvious to prove. Another way of stating this fact is the following: For all m and
n, n < m + 1 if and only if n ≤ m. Convince yourself that this is, indeed, equivalent
to the assertion in the preceding exercise.
Proof. (a) A linear order is a partial order for which any two elements are compa-
rable (cf. Set Theory notes). So, we have to show that for any m and n in N, either
³ ´
(∀n) (m ≤ n) ∨ (n ≤ m) .
16
Now assume P (m) is true for m = k. We prove P (k 0 ) by first choosing any n. We
must show k 0 ≤ n or n ≤ k 0 . By the induction hypothesis, there are two cases, k ≤ n
or n ≤ k: (i) Suppose n ≤ k . Since k 0 = k + 1, we have k ≤ k 0 , so, by the transitivity
includes the case n = k, so we may as well assume k < n. We shall show that k 0 ≤ n.
Since k < n, it cannot be the case that n = 0. Therefore, n has a unique predecessor
Often the fact that ≤ is a linear order is phrased in terms of <, which we then also Strict
linear
call a (strict) linear ordering. In terms of <, the condition is phrased as follows: For ordering
any natural numbers m and n, exactly one of the following holds: m < n, m = n, or
n < m.
b. n = 0 ∨ n > 0.
17
1.7 Variations on induction
Using the linear ordering of N, it is now easy to formulate two variations of the usual
method of induction.
Variation 1: This method does not have a commonly used name since it amounts
to little more than a re-labeling. We might call it shifted induction, however, when
we refer to it. This kind of induction arises when the predicate one would like to
verify is not usefully defined for some small values of n. Perhaps P (n) is defined only
Shifted
for n ≥ some fixed number n0 which depends on the particular problem. We can
induction
still do induction in this case, except that our initial value is no longer 0 but is n0 .
¡
The situation we are facing then is that P (n0 ) is known to be true and (∀k) P (k) ⇒
¢
P (k + 1) is known to be true, provided k ≥ n0 . We would then like to be able to
conclude that P (n) is true for all n ≥ n0 . Essentially, our entire frame of reference is
shifted over by n0 units.
This variant of induction is easy to derive from the standard induction principle
via the following re-labeling trick. Define a new predicate Q by the formula Q(n) =
P (n0 + n). Since, by definition, k ≥ n0 if and only if there is a natural number m such
that k = n0 + m, k ranges over all natural number ≥ n0 precisely when m ranges over
all natural numbers. Therefore, the situation we confront can now be reformulated as
¡ ¢
follows: Q(0) is true, and (∀m) Q(m) ⇒ Q(m + 1) is true. So, ordinary induction
implies that Q(n) is true for all natural numbers n, and, hence, P (n) is true for all
natural numbers n ≥ n0 .
18
have the same initial step. But the induction step is replaced by the following
³ ¡ ¢ ´
(∀k) (∀i) (i ≤ k) ⇒ P (i) ⇒ P (k + 1) .
That is, if, for all k, P (1), P (2), . . . , P (k) are all true, then P (k + 1) is true. The
initial step plus the induction step then imply the usual conclusion: (∀n)P (n) is true.
Of course, Variation 1 can be combined with strong induction: i.e., you don’t have
to start at 0 in strong induction.
multiplicative properties of natural numbers, which have not yet been established in
our logical development, but we are doing so only for illustration.
Proof. This proof is just an illustrative sketch. Since there are no natural numbers
primes. Suppose the proposition is true for all natural numbers i ≤ k, and consider
Example
the natural number k + 1. If it is prime, we are done. If not, then by definition,
of
it can be written as a product of two natural numbers k + 1 = ab, with neither a proof
by
nor b equal to k + 1 or 1. Therefore, both a and b are < k + 1. Since there are strong
induction
no natural numbers strictly between k and k + 1, both a and b are ≤ k, hence the
19
strong induction hypothesis may be applied to them. They are thus both products
of primes, and so, clearly their product is a product of primes.
The point here is that since we are dealing with a multiplicative property, simply
knowing the proposition for k may not be adequate for proving it for k + 1, since
these two numbers are not related multiplicatively in a nice way. However, knowing
the proposition for all natural numbers ≤ k gives us the flexibility we need for the
1.8 Well-ordering
Finally, here is another basic and useful fact about N, which plays a role in proofs
later on.
Proof. Let S be any subset of N. The proof will use the contrapositive method.
Assume that S has no minimal element, and let T be the set difference N \ S. Let
¡ ¢
P (n) be the predicate (∀k) (k ≤ n) ⇒ (k ∈ T ) . This asserts that the set of all
The exercise concludes the induction proof that (∀n)P (n) is true. But this means
that T = N, hence S = ∅.
20
Thus, assuming that S has no minimal element, the above proof shows that it must
be empty, which is precisely the contrapositive of the statement of the proposition.
a well-ordering, first, one has to prove that it is a linear ordering (or a strict linear
ordering), and then one has to prove that it satisfies this extra minimality property.
The following exercise shows that there are well-ordered sets that do not satisfy the
the set of all (c, d) > (a, b). Since N2 is well-ordered, T (a, b) has a minimal
element, say (m, n). Define τ (a, b) = (m, n). Prove that τ is injective and
that (0, 0) 6∈ R(τ ). From the definition, it should be clear that τ is a kind of
“successor function.” (In fact, it is not hard to prove that τ (a, b) = (a, b + 1),
but, unless you prove this, do not use this fact for this part of the exercise.)
21
c. Show that < N2 , τ, (0, 0) > fails to be an integral system. That is, show why
the induction principle fails. (Here, you may want to prove and use the formula
for τ given above.)
1.9 Multiplication
We proceed similarly to the case of addition, using the method of induction to define
multiplication.
each n. Note that we use addition in this definition, but that’s okay because addition
has already been defined. The method of induction also requires us to choose a
particular element in N: our choice is the natural number 0.
bm (0) = 0, and
bm (n0 ) = bm (n) + m.
22
bm (0) = 0
bm (1) = m
bm (2) = m + m
bm (3) = (m + m) + m
and so on. Clearly, bm represents repeated addition of m, which coincides with our
intuitive understanding of multiplication.
In terms of the product notation that we just introduced, the key defining properties
of bm can be written as
m · 0 = 0, and
m · n0 = mn + m.
Theorem 3. For any natural numbers `, m, and n, the following are true:
23
d. 0 · m = 0 = m · 0.
Except for the second equality in item (d), which is true by definition, all of the
statements in this theorem can be proved by the method of induction. The proofs are
not very different from those already done for the facts about addition. However, the
first equality in (d) can also be proved directly (i.e., without induction) , provided
one has first proved the second distributive law. The proof is a little trick involving
the evaluation of 0 + 0 · m as follows: 0 + 0 · m = 0 · m = (0 + 0)m = 0 · m + 0 · m. Thus,
we get 0 + 0 · m = 0 · m + 0 · m, which, by additive cancellation, yields 0 = 0 · m. We
give this proof because a nearly identical proof can be used in more general contexts,
In this section we discuss cardinality issues related to N, making use of the material
in Section 6 of the Set Theory notes. The student should refer to that section when
Definition. Let n be any natural number, and define the nth initial segment Nn to
be {k : (k ∈ N) ∧ (k < n)}.
24
Obviously, N0 is the empty set. Moreover, since, for any natural number m, there
are no natural numbers strictly between m and m + 1 (Exercise 13), we have N1 =
{0}, N2 = {0, 1}, N3 = {0, 1, 2}. More generally, for n > 0, we could write Nn =
is useful to give this number its usual name, namely n − 1. However, we need to
remember that subtraction has not yet been defined, so at this point, the use of n − 1
does not indicate that we are subtracting 1 from n, only that we are considering the
immediate predecessor of n. With this notation understood, we may sometimes write
Nn as
{0, 1, 2, . . . , n − 1}.
25
Definition. A set S is called finite if and only if #(S) = #(Nn ), for some natural
Finite
number n. A set that is not finite is called infinite. and
infinite
The reader should have no trouble with the following exercise. sets
Exercise 19. Suppose that S is a set such that #(S) = #(Nn ), for some natural
number n, and suppose that x is an object not in S. Then #(S ∪ {x}) = #(Nn+1 )
We now recall the definition of the relation ≺ defined in the chapter Set Theory. If
S and T are two sets, we write #(S) ≺ #(T ) to indicate that there is an injection
S → T but no bijection S → T . In Set Theory, we cite some theorems that imply
that ≺ is a strict linear order.
We wish to compare the strict order relation < on N with the relation ≺ on the
collection of initial segments Nn . In order to keep assumptions to a minimum, we
shall use only the fact that ≺ is transitive and irreflexive in this context, not that it
is a linear order. In particular, we do not assume the Schröder-Bernstein Theorem or
the fact that any two cardinalities are comparable.
Lemma 2. Let m and n be any natural numbers such that m < n. Then there is no
injective map Nn → Nm .
Nm is empty and Nn is not, and there is no map whatsoever from a nonempty set to
26
of f cannot be contained in Nk = {0, 1, . . . , k − 1}. That is, there must be some a
such that f (a) = k.
If a = n − 1, i.e., if f (n − 1) = k, then f maps all the other elements, i.e., the
elements of Nn−1 , injectively into Nk . But, since n > k + 1, we have n − 1 > k, so we
may use the induction hypothesis to conclude that this injective mapping Nn−1 → Nk
cannot exist. Therefore, f cannot satisfy f (n − 1) = k. This means that f (a) = k,
Proof. We first prove the implication in the =⇒ direction. Let any pair of natural
#(Nn ). Lemma 2 implies that we cannot have #(Nm ) = #(Nn ), which proves that
#(Nm ) ≺ #(Nn ).
Now we prove the converse. Suppose #(Nm ) ≺ #(Nn ). Since this implies that there
is an injective map Nm → Nn , Lemma 2 implies that we cannot have m > n. But the
assumption #(Nm ) ≺ #(Nn ) also implies that there is no bijection Nm → Nn , so we
27
cannot have m = n. The only remaining possibility, then, is that m < n.
Exercise 20. For all natural numbers m and n, m = n if and only if #(Nm ) = #(Nn ).
We may conclude, therefore, that the set of natural numbers corresponds bijectively
to the set of all finite cardinalities in such a way as to preserve the strict orderings
defined on these sets. In particular, this shows that ≺ is a well-ordering on the set of
all cardinalities Nn , n ranging over N, indeed a copy of the strict linear ordering <
on N.
New notation: It is clearly convenient to have some more compact notation for
the cardinality #(Nn ), and, of course, we all know that the conventional terminology
and notation fits the bill: namely, instead of “#(Nn ),” we write simply “n,” and we
refer to any set with that cardinality as a “set with cardinality n.” The results just
proved provide logical justification for adopting this convention. That is, even though
the concept of n as an element of the set N is very different from the concept of the
way, there is little cost and considerable convenience in conflating the two. In a
similar spirit, we shall use “<” instead of “≺” when talking about finite cardinalities.
That is, we shall use “m < n” to mean either that the natural number m precedes
the natural number n or that the cardinality m is strictly less than the cardinality n.
Proposition 5. Let n be any natural number, and let S be any subset of Nn . Then,
Exercise 21. * Prove the above proposition by induction on n. The initial step of
the induction is almost obvious. We give a hint for the inductive step. Assume that
28
the result holds for n = k, and let S be a subset of Nk+1 = {0, 1, . . . , k}. If S is
already contained in Nk = {0, 1, . . . , k − 1}, then the induction hypothesis applies
immediately. If not, this means that k must be a member of S. Let T = S \ {k},
show that the induction hypothesis can be applied to T , and then use this, together
This proposition gives us good information about subsets of Nn , but it doesn’t tell Bounded
and
us about finite sets in general. To approach the general case, we make a definition: unbounded
subsets
we call a subset S of N bounded if there exists a natural number n such that S ⊆ Nn . of N
Otherwise, we say it is unbounded. Proposition 5 tells us that bounded sets are finite.
Exercise 22. Prove that if f : N → N is an injective map, then its range R(f ) is
unbounded. (Hint: Use Lemma 2.) Use this to conclude that N is infinite.
{m : (m ∈ S) ∧ (m > n)}.
Define f (n) = n1 . Notice that this definition insures that f (n) > n, for every n ∈ S.
29
This defines the function f : S → S. Next, we define a function g : N → S
inductively as follows: Let g(0) be the smallest element of S, say g(0) = n0 . Assuming
that g(k) has been defined, let g(k + 1) = f (g(k)). This gives an inductive definition
for g. Notice that, for every natural number n, g(n + 1) = f (g(n)) > g(n).
Exercise 23. Prove by induction that if m < n, then g(m) < g(n). (Hint: Since
previous paragraph gives the desired inequality, hence proving the base case. Now do
The proof of Proposition 6 shows slightly more than what the proposition asserts.
Not only does it show that every unbounded subset S of N is infinite, it also shows that
#(N) ¹ #(S). Since S ⊆ N, we also have #(S) ¹ #(N). Therefore, now applying
the Schröder-Bernstein Theorem, we may conclude that any unbounded subset of
N has the same cardinality as #(N). Since every subset of N is either bounded or
unbounded, we see that the subsets of N have either finite cardinality or the same
We can state this in a slightly stronger way as follows: Let S be any set such that
there is an injective map f : S → N. Then, of course, f defines a bijection between
The
S and the subset R(f ) of N. So, the above statement can be rephrased as: Let S be
smallest
any set such that #(S) ¹ #(N). Then, either #(S) is finite or #(S) = #(N). In infinite
cardinality
other words #(N) is the smallest infinite cardinality. It is often denoted ℵ0 . ℵ0 .
30
This leaves open one obvious question: Are there infinite cardinalities  ℵ0 ? The
answer to this is “yes,” and a proof was given via the extra-credit exercise in the
homework assignment for Week 6: specifically, for any set A it was shown that #(A) ≺
set of real numbers R, the set of points in the plane, etc. The above exercise does
not clarify what the cardinalities of these sets are. For example, is it possible that
ℵ0 = #(R)? Another important result of Cantor shows that the answer is “no.” We
31