Mathematical Analysis Volume I Overview
Mathematical Analysis Volume I Overview
Volume I
January 1, 2024
Contents i
Contents
Contents i
Preface iv
References 566
Preface iv
Preface
Chapter 1
In this section, we give a brief review of propositional logic, sets and functions.
It is assumed that students have taken an introductory course which covers these
topics, such as a course in discrete mathematics [Ros18].
From two propositions p and q, we can apply logical operators and obtain a
compound proposition.
1. p → q ≡ ¬p ∨ q
2. De Morgan’s Law
(i) ¬(p ∨ q) ≡ ¬p ∧ ¬q
(ii) ¬(p ∧ q) ≡ ¬p ∨ ¬q
∀x ∃y x + y = 0
∃y ∀x x + y = 0.
When the domains for x and y are both the set of real numbers, the first statement
is true, while the second statement is false.
For a set A, we use the notation x ∈ A to denote x is an element of the set A;
and the notation x ∈ / A to denote x is not an element of A.
x ∈ A ⇐⇒ x ∈ B.
x ∈ A ∪ B ⇐⇒ (x ∈ A) ∨ (x ∈ B).
x ∈ A ∩ B ⇐⇒ (x ∈ A) ∧ (x ∈ B).
x ∈ A \ B ⇐⇒ (x ∈ A) ∧ (x ∈
/ B).
x ∈ AC ⇐⇒ (x ∈ U ) ∧ (x ∈
/ A).
Chapter 1. The Real Numbers 5
Since a universal set can vary from context to context, we will usually avoid
using the notation AC and use U \ A instead for the complement of A in U . The
advantage of using the notation AC is that De Morgan’s law takes a more succint
form.
1. (A ∪ B)C = AC ∩ B C
2. (A ∩ B)C = AC ∪ B C
Notice that f −1 (D) is a notation, it does not mean that the function f has an
inverse.
Next, we turn to discuss injectivity and surjectivity of functions.
Chapter 1. The Real Numbers 6
a1 ̸= a2 =⇒ f (a1 ) ̸= f (a2 ).
f (a1 ) = f (a2 ) =⇒ a1 = a2 .
∀b ∈ B, ∃a ∈ A, f (a) = b.
In this section, we introduce the set of real numbers using an intuitive approach.
We will also use the notation Z+ to denote the set of positive integers.
The set of real numbers contains the set of rational numbers Q as a subset. If
a real number is not a rational number, we call it an irrational number. The set
of irrational numbers is R \ Q.
It has been long known that there are real numbers that are not rational numbers.
√
The best example is the number 2, which appears as the length of the diagonal
of a unit square (see Figure 1.1).
√
Figure 1.1: The number 2.
The addition and multiplication operations defined on the set of natural numbers
can be extended to the set of real numbers consistently.
If a and b are real numbers, a + b is the addition of a and b, and ab is the
multiplication of a and b.
If a and b are positive real numbers, a+b and ab are also positive real numbers.
The set of real numbers with the addition and multuplication operations is
a field, which you will learn in abstract algebra. These operations satisfy the
following properties.
Chapter 1. The Real Numbers 9
1. Commutativity of Addition
a+b=b+a
2. Associativity of Addition
(a + b) + c = a + (b + c)
3. Additive Identity
a+0=0+a=a
0 is called the additive identity.
4. Additive Inverse
For every real number a, the negative of a, denoted by −a, satisfies
a + (−a) = (−a) + a = 0
5. Commutativity of Multiplication
ab = ba
6. Associativity of Multiplication
(ab)c = a(bc)
7. Multiplicative Identity
a·1=1·a=a
8. Multiplicative Inverse
For every nonzero real number a, the reciprocal of a, denoted by 1/a,
satisfies
1 1
a· = ·a=1
a a
9. Distributivity
a(b + c) = ab + ac
Chapter 1. The Real Numbers 10
The set of complex numbers C is the set that contains all numbers of the
form a + ib, where a and b are real numbers, and i is the purely imaginary
number such that i2 = −1. It contains the set of real numbers R as a subset.
Addition and multiplication can be extended to the set of complex numbers. These
two operations on complex numbers also satisfy all the properties listed above.
Nevertheless, we shall focus on the set of real numbers in this course.
There are special subsets of real numbers which are called intervals. There
are nine types of intervals, four types are finite, five types are semi-infinite or
infinite. Their definitions are as follows.
Finite Intervals
1. (a, b) = {x ∈ R | a < x < b}
2. [a, b) = {x ∈ R | a ≤ x < b}
3. (a, b] = {x ∈ R | a < x ≤ b}
4. [a, b] = {x ∈ R | a ≤ x ≤ b}
For the intervals (a, b), [a, b), (a, b], [a, b], the points a and b are the end points
of the interval, while any point x with a < x < b is an interior point.
6. [a, ∞) = {x ∈ R | x ≥ a}
7. (−∞, a) = {x ∈ R | x < a}
8. (−∞, a] = {x ∈ R | x ≤ a}
9. (−∞, ∞) = R.
For the intervals (a, ∞), [a, ∞), (−∞, a) and (−∞, a], a is the end point of
the interval, while any other points in the interval besides a is an interior point.
The set of natural numbers is a well-ordered set. Every nonempty subset
of positive integers has a smallest element. This statement is equivalent to the
Chapter 1. The Real Numbers 11
Let P (n) be a sequence of statements that are indexed by the set of positive
integers Z+ . Assume that the following two assertions are true.
2. For every positive integer n, if the statement P (n) is true, the statement
P (n + 1) is also true.
Then we can conclude that for all positive integers n, the statement P (n) is
true.
Before ending this section, let us discuss the absolute value and some useful
inequalities.
In particular, | − x| = |x|.
1. If r is a positive number,
2. If r is a nonnegative number,
|x − a| ≤ r ⇐⇒ −r ≤ x − a ≤ r ⇐⇒ x ∈ [a − r, a + r].
Proposition 1.6
Given real numbers x and y,
|xy| = |x||y|.
|x + y| ≤ |x| + |y|.
Proposition 1.8
Given real numbers x and y,
|x − y| ≥ ||x| − |y|| .
Proof
Since |x − y| ≥ 0, the statement is equivalent to
By triangle inequality,
|x − y| + |y| ≥ |x − y + y| = |x|.
Hence,
|x| − |y| ≤ |x − y|.
By triangle inequality again,
Hence,
−|x − y| ≤ |x| − |y|.
This completes the proof.
Example 1.1
If |x − 5| ≤ 2, show that
9 ≤ x2 ≤ 49.
Solution
|x − 5| ≤ 2 implies 3 ≤ x ≤ 7. This means that x is positive. The
inequality x ≥ 3 then implies that x2 ≥ 9, and the inequality x ≤ 7 implies
that x2 ≤ 49. Therefore,
9 ≤ x2 ≤ 49.
a2 + b 2
ab ≤ .
2
Proof
This is just a consequence of (a − b)2 ≥ 0.
Exercises 1.2
Question 1
Use induction to show that for any positive integer n,
n! ≥ 2n−1 .
(1 + a)n ≥ 1 + na
Question 3
Let n be a positive integer. If c1 , c2 , . . . , cn are numbers that lie in the
interval (0, 1), show that
(1 − c1 )(1 − c2 ) . . . (1 − cn ) ≥ 1 − c1 − c2 − · · · − cn .
Chapter 1. The Real Numbers 16
x ≤ c for all x ∈ S.
x ≥ b for all x ∈ S.
Example 1.2
Determine whether each of the following sets of real numbers is bounded
above, whether it is bounded below, and whether it is bounded.
(a) A = {x | x < 2}
Solution
(a) The set A is bounded above since every element of A is less than or
equal to 2. It is not bounded below, and so it is not bounded.
(b) The set B is bounded below since every element of B is larger than or
equal to −2. It is not bounded above, and so it is not bounded.
(c) The set C is equal to A∩B. So it is bounded above and bounded below.
Therefore, it is bounded.
−S = {−x | x ∈ S} .
For example, the set B = {x | x > −2} is the negative of the set A = {x | x < 2},
the set C = {x | − 2 < x < 2} is the negative of itself (see Figure 1.2). It is
obvious that S is bounded above if and only if −S is bounded below.
Next, we recall the definition of maximum and minimum of a set.
Chapter 1. The Real Numbers 18
Example 1.3
For the set S1 = [−2, 2], −2 is the minimum, and 2 is the maximum.
For the set S2 = [−2, 2), −2 is the minimum, and there is no maximum.
This example shows that a bounded set does not necessarily have maximum
or minimum. However, a finite set always have a maximum and a minimum.
Proposition 1.11
If S is a finite set, then S has a maximum and a minimum.
u = sup S.
Example 1.4
sup S1 = sup S2 = 2.
One natural question to ask is, if S is a nonempty subset of real numbers that
is bounded above, does S necessarily have a least upper bound. The completeness
axiom asserts that this is true.
Completeness Axiom
If S is a nonempty subset of real numbers that is bounded above, then S
has a least upper bound.
The reason this is formulated as an axiom is we cannot prove this from our
intuitive definition of real numbers. Therefore, we will assume this as a fact for
the set of real numbers. A lots of theorems that we are going to derive later is a
consequence of this axiom.
Actually, the set of real numbers can be constructed axiomatically, taken it
Chapter 1. The Real Numbers 20
to be a set that contains the set of rational numbers, satisfying all properties
of addition and multiplication operations, as well as the completeness axiom.
However, this is a tedious construction and will drift us too far.
To show that the completeness axiom is not completely trivial, we show in
Example 1.6 that if we only consider the set of rational numbers, we can find a
subset of rational numbers A that is bounded above but does not have a least upper
bound in the set of rational numbers. We look at the following example first.
Example 1.5
Define the set of real numbers S by
S = x ∈ R | x2 < 2 .
Show that S is nonempty and is bounded above. Conclude that the set
A = x ∈ Q | x2 < 2
Solution
The number 1 is in S, and so S is nonempty. For any x ∈ S, x2 < 2 < 4,
and hence x < 2. This shows that S is bounded above by 2. Since 1 and 2
are rational numbers, the same reasoning shows that the set A is nonempty
and is bounded above by a rational number.
Example 1.6
Consider the set
A = x ∈ Q | x2 < 2 .
UA = {c ∈ Q | x ≤ c for all x ∈ A} .
Solution
We use proof by contradiction. Assume that UA has a smallest element c1 ,
which is an upper bound of A that is smaller than or equal to any upper
bound of A. Then for any x ∈ A,
x 2 ≤ c1 .
p2 − 2q 2
c21 − 2 = ,
q2
and
4p2 + 8pq + 4q 2 − 2(p2 + 4pq + 4q 2 ) 2(p2 − 2q 2 )
c22 − 2 = = .
(p + 2q)2 (p + 2q)2
In the solution above, the construction of the positive rational number c2 seems
a bit adhoc. In fact, we can define c2 by
mp + 2nq
c2 =
np + mq
for any positive integers m and n with m2 > 2n2 . Then the proof still works.
Now let us see how completeness axiom is used to guarantee that there is a
real number whose square is 2.
Example 1.7
Use completeness axiom to show that there is a positive real number c such
that
c2 = 2.
Solution
Define the set of real numbers S by
S = x ∈ R | x2 < 2 .
d d2 d d
c21 = c2 + + 2
≤ c2 + + < c2 + d = 2.
2 16c 2 16
This implies that c1 is an element of S that is larger than c, which contradicts
to c is an upper bound of S.
Case 2: c2 > 2.
Let d = c2 − 2. Then d > 0. Define the number c1 by
d
c1 = c − .
2c
Then c1 < c, and
d2
c21 = c2 − d + 2
> c2 − d = 2.
4c
This implies that c1 is an upper bound of S that is smaller than c, which
contradicts to c is the least upper bound of S.
Since we obtain a contradiction if c2 ̸= 2, we must have c2 = 2.
In fact, the completeness axiom can be used to show that for any positive real
number a, there is a positive real number c such that
c2 = a.
√
We denote this number c as a, called the positive square root of a. The number
√
b = − a is another real number such that b2 = a.
More generally, if n is a positive integer, a is a positive real number, then there
is a positive real number c such that cn = a. We denote this number c by
√
n
c= a,
Chapter 1. The Real Numbers 24
ℓ = inf S.
Theorem 1.13
If S is a nonempty subset of real numbers that is bounded below, then S
has a greatest lower bound.
For a nonempty set S that is bounded, it has a least upper bound sup S and a
greatest lower bound inf S. The following is quite obvious.
Proposition 1.14
If S is a bounded nonempty subset of real numbers, it has a least upper
bound sup S and a greatest lower bound inf S. Moreover,
inf S ≤ sup S,
Let us emphasize again the characterization of the least upper bound and
greatest lower bound of a set.
Chapter 1. The Real Numbers 25
Example 1.8
For each of the following set of real numbers, determine whether it has a
least upper bound, and whether it has a greatest lower bound.
(a) A = {x ∈ R | x3 < 2}
Solution
(a) The set A is bounded above, since if x ∈ A, then x3 < 2 < 23 , and so
x < 2. The set A is not bounded below since it contains all negative
numbers. Hence, A has a least upper bound, but it does not have a
greatest lower bound.
(b) If x2 < 10, then x2 < 16, and so −4 < x < 4. This shows that B
is bounded. Hence, B has a least upper bound, and a greatest lower
bound.
Finally, we want to highlight again Proposition 1.12 together with its lower
bound versus infimum counterpart.
Chapter 1. The Real Numbers 26
Exercises 1.3
Question 1
For each of the following sets of real numbers, find its least upper bound,
greatest lower bound, maximum, and minimum if any of these exists. If
any of these does not exist, explain why.
(b) B = [−3, ∞)
Question 2
Use completeness axiom to show that there is a positive real number c such
that
c2 = 5.
Question 3
For each of the following set of real numbers, determine whether it has a
least upper bound, and whether it has a greatest lower bound.
Proposition 1.15
2. For any real number c, there is exactly one integer in the interval [c, c +
1), and there is exactly one integer in the interval (c, c + 1].
These statements are quite obvious. For any real number c, the integer in the
interval [c, c + 1) is ⌈c⌉, called the ceiling of c. It is the smallest integer larger than
or equal to c. For example ⌈−2.5⌉ = −2, ⌈−3⌉ = −3. The integer in the interval
(c, c + 1] is ⌊c⌋ + 1, where ⌊c⌋ is the floor of c. It is the largest integer that is less
than or equal to c. For example, ⌊−2.5⌋ = −3, ⌊−3⌋ = −3.
In Section 1.3, we have seen that a nonempty subset of real numbers that
is bounded above does not necessary have a maximum. Example 1.6 shows
that a nonempty subset of rational numbers that is bounded above also does not
necessary have a maximum. However, for nonempty subsets of integers, the same
is not true.
Proposition 1.16
Let S be a nonempty subset of integers.
The two statements are equivalent, and the second statement is a generalization
of the well-ordered principle for the set of positive integers. It can be proved using
mathematical induction.
Next we discuss another important property called the Archimedean property.
First let us show that the set of positive integers Z+ is not bounded above.
Chapter 1. The Real Numbers 29
Theorem 1.17
The set of positive integers Z+ is not bounded above.
Proof
Assume to the contrary that the set of positive integers Z+ is bounded
above. By completeness axiom, it has a least upper bound u. Since
u − 1 < u, u − 1 is not an upper bound of Z+ . Hence, there is a positive
integer n such that
n > u − 1.
It follows that
n + 1 > u.
Since n + 1 is also a positive integer, this says that there is an element of
Z+ that is larger than the least upper bound of Z+ . This contradicts to the
definition of least upper bound. Hence, Z+ cannot be bounded above.
The proof uses the key fact that any number that is smaller than the least upper
bound of a set is not an upper bound of the set. This is a standard technique in
proofs.
1. For any positive number M , there is a positive integer n such that n >
M.
2. For any positive number ε, there is a positive integer n such that 1/n <
ε.
These two statements are equivalent, and the first statement is equivalent to
the fact that the set of positive integers is not bounded above.
In the following, we consider another property called denseness.
A key fact we want to prove is that the set of rational numbers Q is dense in
the set of real numbers.
Proof
Let (a, b) be an open interval. Then ε = b − a > 0. By the Archimedean
property, there is a positive integer n such that 1/n < ε. Hence,
nb − na = nε > 1,
and so
na + 1 < nb.
Consider the interval (na, na + 1]. There is an integer m that lies in this
interval. In other words,
Dividing by n, we have
m
a< < b.
n
This proves that the open interval (a, b) contains the rational number m/n,
and thus completes the proof that the set of rational numbers is dense in the
set of real numbers.
Proposition 1.20
The set of integers Z and the set of rational numbers Q are countable, while
the set of real numbers R is not countable.
Since the union of countable sets is countable, this proposition implies that the
set of irrational numbers is uncountable. Therefore, there are far more irrational
Chapter 1. The Real Numbers 31
numbers than rational numbers. Hence, it should not be surprising that the set of
irrational numbers is also dense in the set of real numbers. To prove this, let us
recall the following facts.
Proof
Let (a, b) be an open interval. Define
a b
c= √ , d= √ .
2 2
Then c < d. By the denseness of rational numbers, there is a rational
number u that lies in the interval (c, d). Hence,
a b
√ =c<u<d= √ .
2 2
√
Let v = 2u. Then v is an irrational number satisfying
a < v < b.
This proves that the open interval (a, b) contains the irrational number v,
and thus completes the proof that the set of irrational numbers is dense in
the set of real numbers.
Example 1.9
Is the set of integers Z dense in R? Justify your answer.
Chapter 1. The Real Numbers 32
Solution
(0, 1) is an open interval that does not contain any integers. Hence, the set
of integers is not dense in R.
Exercises 1.4
Question 1
Let S = Q \ Z. Is the set S dense in R? Justify your answer.
Chapter 1. The Real Numbers 33
a1 , a2 , . . . , an , . . .
Example 1.10
an = an−1 + 3.
Solution
We compute recursively.
a1 = 2
a2 = a1 + 3 = 5
a3 = a2 + 3 = 8
a4 = a3 + 3 = 11
a5 = a4 + 3 = 14
Chapter 1. The Real Numbers 34
an = 3n − 1.
Example 1.11
1
sn = sn−1 + .
2n
Find the first 5 terms of the sequence.
Solution
We compute recursively.
1
s1 =
2
1 3
s2 = s1 + =
22 4
1 7
s3 = s2 + 3 =
2 8
1 15
s4 = s3 + 4 =
2 16
1 31
s5 = s4 + 5 =
2 32
sequence {sn } in Example 1.11 is the partial sum of the geometric sequence
The
1
. One can prove by induction that
2n
1
sn = 1 − .
2n
Chapter 1. The Real Numbers 35
Example 1.12
For the sequence {sn } defined in Example 1.12, the general term sn cannot be
expressed as an explicit elementary function of n.
Example 1.13
Solution
We compute recursively.
a1 = 2 < 3
a2 = a1 + 1 = 3 ≥ 3
1 5
a3 = a2 − = < 3
2 2
1 17
a4 = a3 + = <3
3 6
1 37
a5 = a4 + = ≥3
4 12
1 173
a6 = a5 − =
5 60
From the examples above, we observe that some sequences are monotone.
Chapter 1. The Real Numbers 36
Example 1.14
In analysis, we are often led to consider the behavior of a sequence {an } when
n gets larger than larger. We are interested to know whether the sequence would
approach a fixed value. This leads to the idea of convergence.
|an − a| < ε.
Here the positive number ε is used to measure the distance from the term an
to the number a. Since ε can be any positive number, the distance can get as small
as possible.
One question that is natural to ask is whether a sequence {an } can converge to
Chapter 1. The Real Numbers 37
Theorem 1.22
A sequence cannot converge to two different numbers.
Proof
This is proved by contradiction. Assume that there is a sequence {an }
which converges to two different numbers b and c. Let
|b − c|
ε= .
2
Since b and c are distinct, |b − c| > 0 and so ε > 0. By definition of
convergence, there is a positive integer N1 such that for all n ≥ N1 ,
|an − b| < ε.
|an − c| < ε.
Figure 1.4: A sequence {an } cannot converge to two different numbers b and c.
Limit of a Sequence
If a sequence {an } converges to a number a, we say that the sequence is
convergent. Otherwise, we say that it is divergent. Theorem 1.22 says
that for a convergent sequence {an }, the number a that it converges to is
unique. We call this unique number a the limit of the convergent sequence
{an }, and express the convergence of {an } to a as
lim an = a.
n→∞
Example 1.15
Let c be a real number and let {an } be the sequence with an = c for all
n ∈ Z+ . Then for any ε > 0, we take N = 1. For all n ≥ N = 1, we have
|an − c| = |c − c| = 0 < ε,
which shows that the limit of the constant sequence {an } is c. Namely,
lim c = c.
n→∞
1
Another simple example is the sequence {an } with an = .
n
Chapter 1. The Real Numbers 39
Example 1.16
Use the definition of convergence to show that
1
lim = 0.
n→∞ n
Solution
Given ε > 0, the Archimedean property asserts that there is a positive
integer N such that 1/N < ε. If n ≥ N , we have
1 1
0< ≤ < ε.
n N
This gives
1
−0 <ε for all n ≥ N.
n
By definition, we conclude that
1
lim = 0.
n→∞ n
Example 1.17
Proof
Let {ank } be a subsequence of {an }. Notice that for all k ∈ Z+ ,
nk ≥ k.
|an − a| < ε.
|ank − a| < ε.
Example 1.18
Find the limit
1
lim
n→∞ 2n
if it exists.
Solution
1 1
Notice that is a subsequence of with nk = 2k . By Example
2n n
1.16,
1
= 0. lim
n→∞ n
Example 1.19
Solution
n
Let an = (−1) . Then for any positive integer n, a2n−1 = −1, and a2n = 1.
The subsequence {a2n−1 } of {an } converges to −1, while the subsequence
{a2n } of {an } converges to 1. Since there are two subsequences of {an }
that converge to two different limits, by Theorem 1.23, the sequence {an }
is not convergent.
For the sequence {an } defined in Example 1.10, we can see that the set {an | n ∈
+
Z } is not bounded above. Therefore, we would expect that the sequence does
not converge to any number.
For simplicity, we say that a sequence {an } is bounded above/bounded below/
bounded if the set {an | n ∈ Z+ } is bounded above/bounded below/bounded . If
the sequence {an } is bounded above, we denote the supremum of the set {an | n ∈
Z+ } as sup{an }. If the sequence {an } is bounded below, we denote the infimum
of the set {an | n ∈ Z+ } as inf{an }.
We have the following theorem which guarantees that a convergent sequence
Chapter 1. The Real Numbers 42
must be bounded.
Proof
Let {an } be a convergent sequence that converges to the limit a. By
definition of convergence with ε = 1, there is a positive integer N such
that for all n ≥ N ,
|an − a| < 1.
This implies that
Define
M = max {|a1 |, |a2 |, . . . , |aN −1 |, |a| + 1} .
Then
|an | ≤ M for all n ∈ Z+ .
This shows that the sequence {an } is bounded.
Example 1.20
Proposition 1.25
Proof
Given ε > 0, the number
ε
ε1 =
|c| + 1
is also positive. Since {an } converges to a, there is a positive integer N
such that for all n ≥ N ,
ε
|an − a| < ε1 = .
|c| + 1
Example 1.21
By Proposition 1.25, we find that for any constant c,
c
lim = 0.
n→∞ n
Let {an }, {bn } and {cn } be three sequences. Assume that there is a positive
integer N0 such that for all n ≥ N0 ,
b n ≤ an ≤ c n .
If both the sequences {bn } and {cn } converge to ℓ, then the sequence {an }
also converges to ℓ.
Chapter 1. The Real Numbers 44
Proof
For a positive number ε, since the sequence {bn } converges to ℓ, there is a
positive integer N1 such that for all n ≥ N1 ,
|bn − ℓ| < ε.
bn − ℓ > −ε.
|cn − ℓ| < ε.
cn − ℓ < ε.
an − ℓ ≥ bn − ℓ > −ε,
and
an − ℓ ≤ cn − ℓ < ε.
This proves that for all n ≥ N ,
|an − ℓ| < ε.
When applying the squeeze theorem, we are interested in the limit of the
sequence {an }. It is not enough to find two seqeunces {bn } and {cn } satisfying
b n ≤ an ≤ c n
for all n greater than or equal to a fixed N0 . The two sequences {bn } and {cn }
must have the same limit.
Chapter 1. The Real Numbers 45
Example 1.22
(−1)n
an = ,
n
we have
1 1
− ≤ an ≤ .
n n
Since
1
lim = 0,
n→∞ n
we have
1
lim − = 0.
n→∞ n
By squeeze theorem,
(−1)n
lim = 0.
n→∞ n
Theorem 1.27
The sequence {an } converges to 0 if and only if the sequence {|an |}
converges to 0.
|an − 0| < ε.
Chapter 1. The Real Numbers 46
Notice that
|an | − 0 = |an | = | an − 0 |.
Hence, for all n ≥ N ,
|an | − 0 < ε.
This proves that the sequence {|an |} converges to 0.
Next, we assume that the sequence {|an |} converges to 0. Then the
sequence {−|an |} also converges to 0. Since
−|an | ≤ an ≤ |an |,
In the following, we discuss two useful results that can be deduced from
specific information about a convergent sequence. They will be useful in the
proofs of other theorems that we are going to discuss.
One can easily formulate a counterpart of this lemma for a sequence with
negative limit.
Proof
Take ε = a/2. Then ε > 0. Hence, there is a positive integer N so that for
all n ≥ N ,
a
|an − a| < .
2
This implies that for all n ≥ N ,
a
an − a > − .
2
Thus, for all n ≥ N ,
a
an > > 0.
2
Chapter 1. The Real Numbers 47
Lemma 1.29
b ≤ an ≤ c for all n ∈ Z+ .
Proof
Given that
|an − a| < ε.
an − a > −ε = c − a when n ≥ N.
Hence,
an > c when n ≥ N.
This contradicts to an ≤ c for all n ∈ Z+ . Therefore, we must have a ≤ c.
lim an = a, lim bn = b,
n→∞ n→∞
Notice that Proposition 1.25 is actually a special case of this theorem when
{bn } is a constant sequence.
|an | ≤ M, |bn | ≤ M.
By Lemma 1.29,
|a| ≤ M, |b| ≤ M.
Now we want to show that the difference of an bn and ab aproaches zero
when n gets large. This should be achieved by the fact that |an − a| and
|bn − b| both approach 0 when n gets large. To compare an bn − ab to an − a
and bn − b, we do some manipulations as follows.
Now we can show that an bn converges to ab. Given ε > 0, since ε/(2M ) is
also positive, there exists a positive integer N1 such that
ε
|an − a| < when n ≥ N1 .
2M
Similarly, there exists a positive integer N2 such that
Chapter 1. The Real Numbers 50
ε
|bn − b| < when n ≥ N2 .
2M
Take N = max{N1 , N2 }. When n ≥ N , n ≥ N1 and n ≥ N2 . It follows
from (1.1) that ε ε
|an − bn | < M + = ε.
2M 2M
This completes the proof that the sequence {an bn } converges to ab.
Proof
Without loss of generality, assume that a > 0. Lemma 1.28 implies that
there is a positive integer N1 such that
a
an > >0 when n ≥ N1 .
2
Given ε > 0, a2 ε/2 is also positive. By definition of convergence, there is
a positive integer N2 such that when n ≥ N2 ,
a2 ε
|an − a| < .
2
Take N = max{N1 , N2 }. If n ≥ N ,
1 1 |an − a| 2 a2 ε
− = < 2× = ε.
an a |an ||a| a 2
The results about sums, products and quotients of convergent sequences can
be summarized in the following.
2. lim an bn = ab.
n→∞
an a
3. If bn ̸= 0 for all n ∈ Z+ and b ̸= 0, lim = .
n→∞ bn b
These will be used repeatedly in the future. Let us now look at some examples
how these properties are applied.
Example 1.23
Let m be a positive integer. Product rule of limits implies that
1 1 1
lim m
= lim × · · · × lim = 0.
n→∞ n n→∞ n
| {z n→∞ n}
m terms
Chapter 1. The Real Numbers 52
Example 1.24
Determine whether the limit exists. If it exists, find the limit.
(−1)n
(a) lim 3 +
n→∞ 3n − 2
2n2 + 3n + 4
(b) lim
n→∞ 5 − 7n2
n+1
(c) lim
n→∞ n2 + 1
n2 + 1
(d) lim
n→∞ n + 1
Solution
(a) Since {1/(3n − 2)} is a subsequence of the sequence {1/n}, it
converges to 0. By Theorem 1.27,
(−1)n
lim = 0.
n→∞ 3n − 2
Hence,
(−1)n (−1)n
lim 3 + = lim 3 + lim = 3 + 0 = 3.
n→∞ 3n − 2 n→∞ n→∞ 3n − 2
(c) Divide the numerator and the denominator by n2 and then apply the
rules for limits, we have
1 1
n+1 + 2 0+0
lim 2 = lim n n = = 0.
n→∞ n + 1 n→∞ 1 1+0
1+ 2
n
(d) Since the reciprocal of the sequence has limit 0 by part (c), we find
that
n2 + 1
lim
n→∞ n + 1
We have seen in Section 1.3 that the supremum or infimum of a set is not
necessarily an element of the set. The supremum of a set is an element of the set
if and only if the set has a maximum. Analogously, the infimum of a set is an
element of the set if and only if the set has a minimum.
Even though the supremum and infimum of a set might fail to be an element
of the set, they are always limits of sequences in that set.
Example 1.25
Consider the set S = (−∞, π). It is bounded above with sup S = π. The
sequence {un } with
1
un = π −
n
is a sequence in S that converges to π = sup S.
Chapter 1. The Real Numbers 54
lim un = u.
n→∞
Exercises 1.5
Question 1
1
Let a be a positive integer that is larger than 1. Show that lim = 0.
n→∞ an
Question 2
If {an } is a sequence that converge to a negative number a, show that there
is a positive integer N such that an < a/2 < 0 for all n ≥ N .
Question 3
Determine whether the limit exists. If it exists, find the limit.
3n + (−1)n
(a) lim
n→∞ n+2
4n + 2
(b) lim
n→∞ 7n2 + 3n
n
(c) lim 2
n→∞ 2n + n + 5
n2 + 4n
(d) lim
n→∞ n + 3
Question 4
If {an } is a sequence that converges to a, use the definition of convergence
to show that the sequence {|an |} converges to |a|.
Question 6
Let {an } and {bn } be sequences. Assume that there is a real number a such
that
|an − a| ≤ bn for all n ∈ Z+ .
If lim bn = 0, show that
n→∞
lim an = a.
n→∞
1
(a) Show that |an+1 − 3| ≤ for all n ∈ Z+ .
n
[Hint: Use induction.]
(b) Show that the sequence {an } is convergent and find its limit.
Chapter 1. The Real Numbers 57
In this section, we use sequences to define and study some properties of subsets
of real numbers. Given a subset S of real numbers, we say that a sequence {an }
is in S if each of the terms an is a point in S. In other words, the sequence {an }
is in S means that the set {an | n ∈ Z+ } is a subset of S. We will abuse notation
and write this as {an } ⊂ S when there is no confusion. We start with a simple but
useful lemma.
Lemma 1.35
Let S be a subset of real numbers. If {an } is a sequence in S that converges
to a, then every neighbourhood of a contains a point of S.
Chapter 1. The Real Numbers 58
Proof
Let (b, c) be a neighbourhood of a. Since a is in (b, c), b < a < c, and
hence the number
ε = min{a − b, c − a}
is positive. By definition, a − b ≥ ε, c − a ≥ ε. Since {an } converges to a,
there is a positive integer N such that for all n ≥ N ,
|an − a| < ε.
In particular,
b ≤ a − ε < aN < a + ε ≤ c.
This shows that aN is a point in S that is in the neighbourhood (b, c) of a.
Theorem 1.36
Let S be a subset of real numbers. Then S is dense in R if and only if every
real number x is the limit of a sequence in S.
Since we have proved that each of the set of rational numbers and the set of
irrational numbers is dense in the set of real numbers, we immediately obtain the
following.
Corollary 1.37
Let x be a real number.
Example 1.26
√
Let x = 2, and define the sequences {pn } and {qn } by
√
⌊10n 2⌋ √
pn = , q n = 2.
10n
Here ⌊a⌋ is the floor of a. By definition,
√ √ √
10n 2 − 1 < ⌊10n 2⌋ ≤ 10n 2.
Therefore,
√ 1 √
2 − n < pn ≤ 2.
10
√ √
By squeeze theorem, {pn } converges to 2. Since ⌊10n 2⌋ is an integer,
pn is a rational number. Hence, {pn } is a sequence of rational numbers that
√
converges to x = 2. Obviously, {qn } is a sequence of irrational numbers
√
that converges to x = 2.
√
expansion of 2 to give a number with n decimal places. The first 7 terms of
the sequence {pn } are
Example 1.27
The three statements in Lemma 1.29 imply that intervals of the form
(−∞, a], [a, ∞) and [a, b] are closed subsets of R. In particular, we call
them closed intervals, and [a, b] is a closed and bounded interval.
Remark 1.2
1. By definition, R is closed in R.
Example 1.28
Solution
The sequence {1/n} is a sequence in the interval (0, 2) that converges to
the point 0 that is not in (0, 2). Hence, the interval (0, 2) is not closed in R.
Chapter 1. The Real Numbers 61
Remark 1.3
One can prove that if S is an interval of the form (a, b), or (a, b], or [a, b),
or (−∞, a), or (a, ∞), then S is not closed in R.
Example 1.29
Is the set of rational numbers Q closed in R?
Solution
We have seen in Example 1.26 that there is a sequence in the set Q that
√
converges to 2, which is not in Q. Hence, Q is not closed in R.
Example 1.30
In the solution of Example 1.28, we have seen that the sequence {1/n} in
(0, 2) converges to the point 0. Since none of the an is 0, 0 is a limit point
of the set (0, 2).
Example 1.31
Solution
We claim that every point in [0, 2] is a limit point of the set (0, 2).
Example 1.30 shows that 0 is a limit point of (0, 2). The sequence {2−1/n}
is a sequence in (0, 2) that converges to 2. Hence, 2 is also a limit point of
(0, 2).
For any c ∈ (0, 2), c > 0. Let m be a positive intger such that 1/m < c.
Then {c − 1/(n + m)} is a sequence in (0, 2) that converges to c. Hence, c
is a limit point of (0, 2).
This completes the proof that the set of limit points of (0, 2) is [0, 2].
Remark 1.4
1. For intervals of the form (a, b), (a, b], [a, b) or [a, b], the set of limit
points is [a, b].
2. For intervals of the form (−∞, a) or (−∞, a], the set of limit points is
(−∞, a].
3. For intervals of the form (a, ∞) or [a, ∞), the set of limit points is
[a, ∞).
Example 1.32
Show that the set Z does not have limit points.
Solution
If n is an integer, x is contained in the open interval (n − 1, n + 1) that does
not contain any integer other than n itself. Hence, there is no sequence in
Z \ {n} that converges to n. Therefore, an integer n is not a limit point of
Z.
Chapter 1. The Real Numbers 63
If x is not an integer, it is contained in the interval (⌊x⌋, ⌈x⌉) that does not
contain any integers. By Lemma 1.35, x is not a limit of a sequence in Z.
Therefore, x is not a limit point of Z.
(a) x is in S;
Example 1.33
By Example 1.32, every point in the set of integers Z is an isolated point of
the set.
The following is quite obvious from the definition of isolated points and Lemma
1.35.
Theorem 1.38
Let S be a subset of real numbers. A point x in S is an isolated point if and
only if there is a neighbourhood (a, b) of x that intersects the set S only at
the point x.
We have seen that a limit point of a set is not necessarily a point of that set.
The following gives a characterization of closed sets in terms of limit points.
Theorem 1.39
Let S be a subset of real numbers. The set S is closed in R if and only if it
contains all its limit points.
Chapter 1. The Real Numbers 64
Proof
Assume first S is closed in R. Let x be a limit point of S. Then there is
a sequence {an } in S \ {x} that converges to x. In particular, {an } is a
sequence in S that converges to x. Since S is closed in R, x is in S. This
proves that S contains all its limit points.
Now assume that S is not closed in R. Then there is a sequence {an } in S
that converges to a point x, but x is not in S. Since x is not in S, none of
the terms in the sequence {an } is in S. Therefore, x is a limit point of S.
This shows that S does not contain all its limit points.
Chapter 1. The Real Numbers 65
Exercises 1.6
Question 1
Show that every real number is a limit point of the set of rational numbers.
Question 2
Let S be the set
1
S= n ∈ Z+ ∪ {0}.
n
(a) Find the set of limit points and the set of isolated points of S.
Question 3
Determine whether each of the following is a closed set.
(c) C = R \ (−1, 1)
Proof
First suppose that {an } is an increasing sequence that is convergent. Then
{an } is bounded. So it is bounded above.
Conversely, suppose that {an } is an increasing sequence that is bounded
above. Then a = sup{an } exists. Now we use the same argument as in
the proof of Lemma 1.34. Given ε > 0, since a − ε is less than a, it is not
an upper bound of the set S = {an | n ∈ Z+ }. Hence, there is a positive
integer N such that
aN > a − ε.
Chapter 1. The Real Numbers 67
It follows that
Example 1.34
lim an = 0.
n→∞
Remark 1.5
It follows from Theorem 1.27 that for any a in the interval (−1, 1),
lim an = 0.
n→∞
an+1 = an × a < an .
Hence, the sequence {an } is decreasing. On the other hand, an > 0 for all
n ∈ Z+ . Hence, {an } is a decreasing sequence that is bounded below. By
the monotone convergence theorem, {an } converges to a number ℓ.
Chapter 1. The Real Numbers 68
Example 1.35
Solution
First notice that an > 0 for all n ∈ Z+ . When n ≥ 2,
2an + 2 2an−1 + 2 2(an − an−1 )
an+1 − an = − = .
an + 2 an−1 + 2 (an + 2)(an−1 + 2)
Now, a2 = 4/3 > a1 . Hence, we deduce that an+1 − an > 0 for all n ∈ Z+ .
In other words, {an } is an increasing sequence. For all n ≥ 1,
2
an+1 = 2 − < 2.
an + 2
Hence, {an } is bounded above by 2. Since {an } is an increasing sequence
that is bounded above, by monotone convergence theorem, it converges
to a limit u = sup{an }. Since {an+1 } is a subsequence of {an }, it also
converges to u. Apply the limit laws to
2an + 2
an+1 = ,
an + 2
we find that
2u + 2
u= .
u+2
Chapter 1. The Real Numbers 69
Notice that Example 1.35 is closely related to Example 1.6. The sequence
{an } defined in Example 1.35 is another sequence of rational numbers which
√
converges to 2.
The next example is a classical one.
Example 1.36
Show that the limit n
1
lim 1 +
n→∞ n
exists.
Solution
Let n
1
an = 1 + .
n
Given a positive integer n, notice that
n
an+1 n+2 (n + 2)n
= × .
an n+1 (n + 1)2
It follows that
an+1 (n + 2)(n2 + n + 1) n3 + 3n2 + 3n + 2
≥ = 3 > 1.
an (n + 1)3 n + 3n2 + 3n + 1
Therefore,
1 1 1
an ≤ 1 + 1 + + · · · + n−1 = 3 − n−1 ≤ 3.
2 2 2
This proves that {an } is bounded above by 3. Since {an } is an increasing
sequence that is bounded above, the monotone convergence theorem asserts
that the limit n
1
lim an = lim 1 +
n→∞ n→∞ n
exists.
The Number e
The number e is defined as
n
1
e = lim 1 + .
n→∞ n
e = 2.718281828459046
1
bn = bn−1 + for all n ≥ 1,
n!
also converges to e. In series notation,
1 1 1 1
e=1+ + + + ··· + + ··· .
1! 2! 3! n!
Chapter 1. The Real Numbers 71
Exercises 1.7
Question 1
Given that the sequence {an } is defined by a1 = 2, and for all n ≥ 1,
3an + 1
an+1 = .
an + 2
Show that {an } is convergent and find its limit.
Question 2
For n ≥ 1, let n
1
an = 1 + .
n
Define the sequence {bn }∞
n=0 by b0 = 1, and for all n ≥ 1,
1
bn = bn−1 + .
n!
(a) Show that the sequence {bn } is convergent.
(b) For a positive integer n, use the binomial expansion of an to show that
an ≤ bn and
3
b n − an ≤ .
2n
(c) Conclude that the sequence {bn } converges to e.
Chapter 1. The Real Numbers 72
Example 1.37
In this example, we find that although the sequence {an } is not convergent, it
has convergent subsequences. In this section, we are going to prove that every
bounded sequence has a convergent subsequence. By monotone convergence
theorem, it is sufficient to prove that every sequence has a monotone subsequence.
It can be achieved via a concept called peak index.
am ≥ an for all n ≥ m.
In other words, there is no term after the mth term that is larger than am .
Theorem 1.41
Every sequence has a monotone subsequence.
Proof
Given a sequence {an }, let S be the set of its peak indices. It is a subset of
positive integers. We discuss the cases where S is infinite and S is finite.
Case 1: S is infinite.
Let n1 , n2 , n3 , . . . be the elements of S arranged in increasing order, namely,
This is a subsequence of {n}. For any positive integer k, since nk+1 > nk
and nk is a peak index, we have
ank+1 ≤ ank .
Now nk is not a peak index implies that there is a positive integer nk+1
larger than nk such that
ank+1 > ank .
This procedure constructs the increasing subsequence {ank } inductively.
In both cases, we have shown that {an } has a monotone subsequence.
A sequence {an } is called a Cauchy sequence provided that for any ε > 0,
the is a positive integer N such that for all m ≥ n ≥ N ,
|am − an | < ε.
Example 1.38
n+1
For the sequence {an } with an = , it is easy to check that it is a
n
Cauchy sequence. Notice that if m ≥ n,
1 1 1 1 1
|am − an | = − = − < .
n m n m n
Given ε > 0, the Archimedean property says that there is a positive integer
N such that 1/N < ε. Hence, if m ≥ n ≥ N ,
1 1
|am − an | < ≤ < ε.
n N
Theorem 1.43
If a sequence {an } is convergent, then it is a Cauchy sequence.
Proof
Let a be the limit of the convergent sequence {an }. Given ε > 0, there is a
positive integer N such that for all n ≥ N ,
ε
|an − a| < .
2
It follows from triangle inequality that if m ≥ n ≥ N ,
ε ε
|am − an | ≤ |am − a| + |an − a| < + = ε.
2 2
Hence, {an } is a Cauchy sequence.
The converse is also true in the set of real numbers. It is proved using the fact
that every bounded sequence has a convergent subsequence.
Proof
First we prove that {an } is a Cauchy sequence implies that it is bounded.
The proof is almost identical to the proof that a convergent sequence is
bounded. Take ε = 1. There is a positive integer N0 such that for all
m ≥ n ≥ N0 ,
|am − an | < 1.
This implies that
Let
M = max{|a1 |, . . . , |aN0 −1 |, |aN0 | + 1.}
Then |an | ≤ M for all n ∈ Z+ , proving that it is bounded. Since {an } is a
bounded sequence, it has a convergent subsequence {ank } which converges
to a limit a. We want to prove that the sequence {an } also converges to a.
Given ε > 0, there is a positive integer N such that for all m ≥ n ≥ N ,
ε
|am − an | < .
2
There is a positive integer K such that for all k ≥ K,
ε
|ank − a| < .
2
Now let n be an integer such that n ≥ N . Since {nk } is a subsequence of
{n}, there is an integer k such that k ≥ K and nk ≥ n. Then
ε ε
|an − a| ≤ |ank − an | + |ank − a| < + = ε.
2 2
This proves that for all n ≥ N ,
|an − a| < ε.
Theorem 1.44 is proved using the fact that every bounded sequence has a
convergent subsequence. The latter is a consequence of the monotone convergence
theorem, whose validity relies on the completeness axiom for real numbers. Hence,
the fact that every Cauchy sequence of real numbers is convergent is a consequence
of the completeness axiom.
If we consider the set of rational numbers, the assertion is not true. For
example, we have shown that there is a sequence of rational numbers {an } that
√
converges to 2. Therefore, the sequence {an } is a Cauchy sequence that does
not converge in the set of rational numbers.
The following combines the results of Theorem 1.43 and Theorem 1.44.
Chapter 1. The Real Numbers 77
Example 1.39
For a positive integer n, let
1 1
sn = 1 + + ··· + .
2 n
Show that the sequence {sn } is divergent.
Solution
We prove that {sn } is not a Cauchy sequence, by showing that for ε = 1/2,
for any positive integer N , there are integers m and n with m ≥ n ≥ N
such that
1
|sm − sn | ≥ .
2
For a given positive integer N , let n = N and m = 2N . Then m ≥ n ≥ N
and m − n = N . Notice that
1 1 1
sm − sn = + + ··· +
N +1 N +2 2N
1 1 1
≥ + + ··· +
|2N 2N {z 2N}
m−n=N terms
1
= .
2
This shows that {sn } is not a Cauchy sequence. Hence, it is not convergent.
Theorem 1.45
If S is a closed and bounded subset of real numbers, then it is sequentially
compact.
Proof
Let S be a subset of R that is closed and bounded. Given a sequence {an }
in S, since S is bounded, the sequence {an } is bounded. Therefore, there
is a subsequence {ank } that converges to a number a. Since {ank } is a
sequence in the set S that converges to a, and S is closed, the limit a must
be in S. In other words, we have shown that the sequence {an } in S has a
subsequence {ank } that converges to a point a that is in S. This proves that
S is sequentially compact.
Example 1.40
Theorem 1.46
Let S be a subset of R. If S is sequentially compact, then it is closed and
bounded.
which in turn is equivalent to (¬q → ¬p) ∧ (¬r → ¬p). Hence, we will prove the
following two statements: if S is not closed, it is not sequentially compact; and if
S is not bounded, it is not sequentially compact.
Proof
First, we prove that if S is not closed, it is not sequentially compact. If S is
not closed, there is a sequence {an } in S which converges to a point a but
a is not in S. For this sequence, every subsequence is convergent with limit
a. Hence, this sequence does not have a convergent subsequence with limit
in S. This proves that S is not sequentially compact.
Next, we prove that if S is not bounded, it is not sequentially compact. If S
is not bounded, for each integer n, there is a point an in S such that
|an | ≥ n.
|ank | ≥ nk .
Hence, the sequence {ank } is not bounded, and thus it is not convergent.
This shows that the sequence {an } does not have any convergent
subsequence. Therefore, S is not sequentially compact.
Notice that the only type of intervals that is both closed and bounded is the
type [a, b]. Hence, this is the only type of intervals that are sequentially compact.
Chapter 1. The Real Numbers 80
Example 1.41
Determine whether each of the following sets is sequentially compact.
(a) Z
Solution
(a) The set Z is not bounded. Hence, it is not sequentially compact.
(b) 3 is a limit point of the set A but it is not in A. Hence, A is not closed,
and so it is not sequentially compact.
Theorem 1.47
Let S be a subset of real numbers. If S is closed and bounded, then it has a
maximum and a minimum. Equivalently, if S is sequentially compact, then
it has a maximum and a minimum.
Proof
Since S is bounded, S has a least upper bound u and a greatest lower bound
ℓ. By Lemma 1.34, there are sequences {un } and {ℓn } in S that converge
to u and ℓ respectively. Since S is closed, u and ℓ are in S. Since u = sup S
is in S, S has a maximum. Since ℓ = inf S is in S, S has a minimum.
Chapter 1. The Real Numbers 81
Exercises 1.8
Question 1
Given that the sequence {an } is defined by
1 1
an = 1 + + ... + .
3 2n − 1
Show that {an } is not a Cauchy sequence. Then conclude that the sequence
{an } is divergent.
Question 2
Determine whether each of the following sequence is a Cauchy sequence.
n + (−1)n
(a) The sequence {an } with an =
n − (−1)n
1+n
(b) The sequence {bn } with bn =
1 − (−1)n n
Question 3
Determine whether each of the following sets is sequentially compact.
Question 4
Show that the union of two sequentially compact sets is sequentially
compact.
Chapter 2. Limits of Functions and Continuity 82
Chapter 2
p(x)
f (x) = ,
q(x)
where p(x) and q(x) are polynomials, and q(x) is not the zero polynomial.
The domain of this function is the set D = R \ S, where S is the finite point
set containing all x for which q(x) = 0.
In Section 1.6, we have defined the concept of limit points of a set D. The point
x0 is a limit point of the set D if there is a sequence of points in D \ {x0 } that
converges to x0 . A limit point of a set is not necessarily in that set. A set that
contains all its limit points is a closed set. If a point x0 is in a set D but is not a
limit point of D, it is called an isolated point of D. If x0 is an isolated point of D,
there is a neighbourhood (a, b) of x0 which intersects the set D only at the point
x0 .
Limits of functions can be defined using the ε − δ language or using limits of
sequences. We will define the concept using limits of sequences first, and then
show that it is equivalent to the ε − δ definition.
lim f (x) = ℓ.
x→x0
Notice that we do not define lim f (x) if x0 is not a limit point of the domain
x→x0
where the function is defined.
Chapter 2. Limits of Functions and Continuity 84
lim f (x) = ℓ ⇐⇒
x→x0
Example 2.1
Find the limit if it exists.
2x + 3
(a) lim
x→1 x2 + 1
x2 − 1
(b) lim
x→1 x − 1
x2 + 1
(c) lim
x→1 x − 1
Solution
(a) The function
2x + 3
f (x) =
x2 + 1
is defined on D = R. If {xn } is a sequence in R \ {1} that converges
to 1, limit laws imply that
2xn + 3 2×1+3 5
lim f (xn ) = lim 2
= 2
= .
n→∞ n→∞ xn + 1 1 +1 2
Hence,
2x + 3 5
lim = .
x→1 x2 + 1 2
Chapter 2. Limits of Functions and Continuity 85
x2n − 1
lim f (xn ) = lim = lim (xn + 1) = 2.
n→∞ n→∞ xn − 1 n→∞
Hence,
x2 − 1
lim = 2.
x→1 x − 1
2. lim (f g)(x) = ℓ1 ℓ2 .
x→x0
f (x) ℓ1
lim = .
x→x0 g(x) ℓ2
From this proposition, it follows that we can take limits of a rational function
easily at a point which is not a zero of the polynomial in the denominator.
Proposition 2.2
Let p(x) and q(x) be polynomials. If x0 is a real number such that q(x0 ) ̸=
0, then
p(x) p(x0 )
lim = .
x→x0 q(x) q(x0 )
Example 2.2
Show that for any real number x0 ,
Solution
Let {xn } be a sequence in R\{x0 } that converges to x0 . By Question 1.5.4,
the sequence {|xn |} converges to |x0 |. This proves that
lim f (x) = ℓ.
x→x0
(ii) For any ε > 0, there is a δ > 0 such that if the point x is in D and
0 < |x − x0 | < δ, then |f (x) − ℓ| < ε.
lim f (x) = ℓ ⇐⇒
x→x0
Namely, there is an ε > 0 such that for any δ > 0, there is a point x in
D \ {x0 } with |x − x0 | < δ but |f (x) − ℓ| ≥ ε. For this ε > 0, we
construct a sequence {xn } in D \ {x0 } in the following way. For each
positive integer n, there is a point xn in D \ {x0 } such that |xn − x0 | < 1/n
but |f (xn ) − ℓ| ≥ ε. Then {xn } is a sequence in D \ {x0 } that satisfies
1
|xn − x0 | < .
n
Since lim 1/n = 0, we find that the sequence {xn } converges to x0 . Since
n→∞
|f (xn ) − ℓ| ≥ ε for all n ∈ Z+ , the sequence {f (xn )} cannot converge to
ℓ. Hence, we have shown that there is a sequence {xn } in D \ {x0 } that
converges to x0 but {f (xn )} does not converge to ℓ. This proves that (i)
does not hold.
Chapter 2. Limits of Functions and Continuity 89
For any real number x0 , determine whether the limit lim H(x) exists.
x→x0
Solution
We consider the cases where x0 > 0, x0 < 0 and x0 = 0.
Case 1: x0 > 0. In this case, we claim that lim f (x) = 1.
x→x0
Given ε > 0, take δ = x0 . Then δ > 0. If x is in R and 0 < |x − x0 | < δ =
x0 , we have x − x0 > −x0 and hence x > 0. Thus f (x) = 1 and
|f (x) − 1| = 0 < ε.
|f (x) − 0| = 0 < ε.
Case 3: x0 = 0. In this case, we claim that lim f (x) does not exist. Let
x→x0
{un } and {vn } be the sequences {1/n} and {−1/n} respectively. They are
both sequences in R \ {0} that converge to 0.
Therefore,
In this example, we can also use the ε−δ definition to show that lim f (x) does
x→0
not exist. Assume that lim f (x) exists and is equal to ℓ. Take ε = 1/2. There
x→0
exists δ > 0 such that for any x ∈ R, if 0 < |x − 0| < δ, then
|f (x) − ℓ| < ε.
Now the points x = x1 = −δ/2 and x = x2 = δ/2 both satisfy 0 < |x − 0| < δ.
We have f (x1 ) = 0 and f (x2 ) = 1. By triangle inequality,
This gives
1 = |f (x1 ) − f (x2 )| < 1,
which is a contradiction. Hence, lim f (x) does not exist.
x→0
In calculus, we have defined the concepts of left limits and right limits to deal
with functions like the Heaviside function, which is defined by cases. Given a
subset of real numbers D and a point x0 , define
For example, consider D = [0, 2). If x0 = 1, then D1,− = [0, 1) and D1,+ =
(1, 2). If x0 = 0, then D0,− = ∅ and D0,+ = (0, 2). If x0 = 2, then D2,− = [0, 2)
and D2,+ = ∅.
Chapter 2. Limits of Functions and Continuity 91
Notice that even though x0 is a limit point of D, it might not be a limit point of
Dx0 ,− or Dx0 ,+ . We define the left limit and right limit of a function f : D → R
when x approaches x0 in the following way.
1. If x0 is a limit point of Dx0 ,− , Dx0 ,− is not an empty set. We say that the
limit of the function f : D → R as x approaches x0 from the left exists
provided that the limit of the function f : Dx0 ,− → R as x approaches
x0 exists. If the left limit exists, it is denoted by
2. If x0 is a limit point of Dx0 ,+ , Dx0 ,+ is not an empty set. We say that the
limit of the function f : D → R as x approaches x0 from the right exists
provided that the limit of the function f : Dx0 ,+ → R as x approaches
x0 exists. If the right limit exists, it is denoted by
1. If x0 is a limit point of both Dx0 ,+ and Dx0 ,− , then lim f (x) exists if
x→x0
and only if both lim− f (x) and lim+ f (x) exist and they are equal.
x→x0 x→x0
Example 2.4
For the Heaviside function, we have
Since the left and right limits are not equal, lim H(x) dos not exist.
x→0
For any real number x0 , determine whether the limit lim f (x) exists.
x→x0
Solution
Fixed a real number x0 . For any positive integer n, there is a rational
number pn and an irrational number qn in the open interval (x0 − 1/n, x0 ).
The sequences {pn } and {qn } are in R \ {x0 } and converge to x0 . Since
we find that
Since the sequence {f (xn )} has different limits when we consider two
different sequences {xn } in R \ {x0 } that converge to x0 , we conclude
that lim f (x) does not exist.
x→x0
For this example, if one wants to use the ε − δ definition of limits, one can
proceed in the following way. For fixed x0 in R, assume that lim f (x) = ℓ.
x→x0
When ε = 1/2, there is a δ > 0 such that for any x with 0 < |x − x0 | < δ,
Chapter 2. Limits of Functions and Continuity 93
|f (x) − ℓ| < ε. The open interval (x0 − δ, x0 ) contains a rational number x1 and
an irrational number x2 . Notice that f (x1 ) = 1 and f (x2 ) = 0. Both x = x1 and
x = x2 satisfy 0 < |x − x0 | < δ. By triangle inequality,
This gives
1 = |f (x1 ) − f (x2 )| < 1,
which is a contradiction. Hence, lim f (x) does not exist.
x→x0
Next, we consider composite functions.
Proposition 2.4
then
lim h(x) = lim (g ◦ f )(x) = ℓ.
x→x0 x→x0
Proof
Let {xn } be a sequence in D\{x0 } that converges to x0 , and let yn = f (xn )
for all n ∈ Z+ . By assumption, {yn } is a sequence in U \ {y0 }. Since
lim f (x) = y0 , the sequence {f (xn )} converges to y0 . Since lim g(y) =
x→x0 y→y0
ℓ, the sequence {g(yn )} converges to ℓ. In other words, the sequence {(g ◦
f )(xn )} converges to ℓ.
Since we have proved that whenever {xn } is a sequence in D \ {x0 } that
converges to x0 , the sequence {(g ◦ f )(xn )} converges to ℓ, we conclude
that
lim (g ◦ f )(x) = ℓ.
x→x0
Corollary 2.5
Let D be a subset of real numbers. Given a function f : D → R, if x0 is a
limit point of D and lim f (x) = ℓ, then
x→x0
Example 2.6
For any x0 ≥ 0, show that
√ √
lim x = x0 .
x→x0
Solution
Let us use the ε−δ definition of limits. Consider the case x0 = 0 first. Given
ε > 0, take δ = ε2 . Then δ > 0. If x ≥ 0 is such that 0 < |x − 0| < δ = ε2 ,
√
we have 0 < x < ε2 , which implies that 0 < x < ε. Hence, if x ≥ 0 and
0 < |x − 0| < δ,
√ √
| x − 0| < ε.
This proves that
√ √
lim x=0= x0 .
x→0
1 2
√√ < √ .
x + x0 3 x0
3 3 √
Given ε > 0, let δ = min x0 , ε x0 . Then δ > 0. If x ≥ 0 and
4 2
3 1
0 < |x − x0 | < δ, then |x − x0 | < x0 and so x > x0 . Therefore,
4 4
Chapter 2. Limits of Functions and Continuity 95
√ √ |x − x0 | 2
x−x0 = √ √ < δ × √ ≤ ε.
x + x0 3 x0
√ √
This proves that lim x = x0 .
x→x0
√ √
Figure 2.2: (a) The function f (x) = x. (b) The function f (x) = 3 x.
Using similar methods, one can prove that if n is an integer, then for any x0 in
√
the domain of the function f (x) = n x,
√
n
√
lim x = n x0 .
x→x0
Now we want to give a brief discussion about limits that involve infinities.
Example 2.7
lim f (x) = ℓ
x→∞
are equivalent.
(ii) For any ε > 0, there is a positive number M such that if the point x is
in D and x > M , then
|f (x) − ℓ| < ε.
lim f (x) = ℓ.
x→−∞
(ii) For any ε > 0, there is a positive number M such that if the point x is
in D and x < −M , then
|f (x) − ℓ| < ε.
Example 2.8
1
Show that lim = 0.
x→∞ x
Solution
We use both definitions to prove the statement.
Using the sequence definition, let {xn } be a sequence of nonzero real
numbers that diverges to ∞. We want to show that the sequence {1/xn }
converges to 0. Given ε > 0, the number M = 1/ε is also positive. Since
the sequence {xn } diverges to ∞, there is a positive integer N such that for
all n ≥ N ,
1
xn > M = .
ε
1
In particular, for all n ≥ N , xn > 0 and 0 < < ε. This proves that the
xn
1
sequence {1/xn } converges to 0. Therefore, lim = 0.
x→∞ x
Now consider the definition in terms of ε. Given ε > 0, let M = 1/ε. Then
M is a positive number. If x in R \ {0} is such that x > M , then
1 1
0< < = ε.
x M
1
This proves that lim = 0.
x→∞ x
Example 2.9
1
For any positive integer n, lim = 0.
x→∞ xn
Figure 2.3: (a) The function f (x) = 1/x. (b) The function f (x) = 1/x2 .
Example 2.10
Determine whether the limit
2x2 + 3x + 4
lim
x→∞ x2 + 7
exists. If it exists, find the limit.
Solution
Divide the numerator and denominator by x2 , we have
3 4
2
2x + 3x + 4 2+ + 2
= x x .
x2 + 7 7
1+ 2
x
Using limit laws and the fact that lim 1/x = 0, we find that
x→∞
2x2 + 3x + 4 2+0+0
lim = = 2.
x→∞ x2 + 7 1+0
Chapter 2. Limits of Functions and Continuity 100
Example 2.11
Determine whether the limit
x
lim √
x→−∞ x2 + 1
exists. If it exists, find the limit.
Solution
√
Notice that x2 = |x|. Hence,
x x
√ = r .
x2+1 1
|x| 1 + 2
x
When x < 0,
x
= −1.
|x|
Therefore,
x
lim = −1.
x→−∞ |x|
we find that
1
lim r = 1.
x→∞ 1
1+ 2
x
Hence,
x
lim √ = −1.
x→−∞ x2+1
Chapter 2. Limits of Functions and Continuity 101
x
Figure 2.4: The function f (x) = √ .
x2+1
Remark 2.1
Using similar ideas, one can formulate analogous definitions for the
following limits.
then
lim f (x) = ℓ.
x→x0
Exercises 2.1
Question 1
Find the limit if it exists.
x2 + 3x + 2
(a) lim
x→−1 x2 + 1
x2 − 3x − 4
(b) lim
x→−1 x+1
x2 + 1
(c) lim
x→−1 x + 1
x2 − 3x − 4
(d) lim
x→−1 x+1
Chapter 2. Limits of Functions and Continuity 103
Question 2
Define the function f : R → R by
x + 3, if x ≥ 1,
f (x) =
5 − x, if x < 1.
For any real number x0 , determine whether the limit lim f (x) exists.
x→x0
Question 3
Define the function f : R → R by
x, if x is rational;
f (x) =
−x, if x is irrational.
(a) Use squeeze theorem to show that lim f (x) exists and find the limit.
x→0
Question 4
Determine whether the limit exists. If it exists, find the limit.
2x2 + x + 4
(a) lim
x→−∞ 5x2 + 2
2x + 3
(b) lim √
x→∞ 4x2 + 1
Question 5
For any x0 ≥ 0, show that
√
4
√
lim x = 4 x0 .
x→x0
Chapter 2. Limits of Functions and Continuity 104
The definitions of limit and continuity are very similar. However, there is a
slight difference. To define continuity at a point x0 , x0 must be a point in the
domain of the function D. To define limit, x0 does not need to be a point in the
domain D but has to be a limit point of D. When the point x0 is in D and is also
a limit point of D, the relation between limit and continuity is as follows.
if and only if
lim |f (x) − f (x0 )| = 0.
x→x0
Proposition 2.8
Let D be a subset of real numbers and let f : D → R be a function defined
on D. If f : D → R is continuous, then for any subset A of D, the function
f : A → R, which is the restriction of f to A, is also continuous.
Example 2.12
Proposition 2.2 says that a rational function is continuous.
Example 2.13
Example 2.14
Example 2.5 says that the Dirichlet’s function is nowhere continuous.
Example 2.15
√
Example 2.6 says that the function f (x) = x is continuous. In general,
√
for any positive integer n, the function f (x) = n x is continuous.
Lemma 2.9
Let D be a subset of real numbers and let x0 be an isolated point of D.
If {xn } is a sequence of points in D that converges to x0 , then there is a
positive integer n such that xn = x0 for all n ≥ N .
Chapter 2. Limits of Functions and Continuity 106
Proof
By Theorem 1.38, there is a neighbourhood (a, b) of x0 which intersects D
at x0 only. Let ε = min{x0 − a, b − x0 }. Then ε > 0 and (x0 − ε, x0 +
ε) ⊂ (a, b). Since the sequence {xn } converges to x0 , there is a positive
integer N such that for all n ≥ N , |xn − x0 | < ε. Hence, for all n ≥ N ,
xn ∈ (x0 − ε, x0 + ε) ⊂ (a, b). Since (a, b) ∩ D = {x0 }, we find that
xn = x0 for all n ≥ N .
Proof
If {xn } is a sequence in D that converge to x0 , Lemma 2.9 says that there is
a positive integer N such that xn = x0 for all n ≥ N . Therefore, f (xn ) =
f (x0 ) for all n ≥ N . This implies that the sequence {f (xn )} converges to
f (x0 ). By the definition of continuity, f is continuous at x0 .
Example 2.16
Since every point of the set of positive integers Z+ is an isolated point, any
function f : Z+ → R defined on the set of positive integers is continuous.
This conclusion might be a bit counter intuitive for students that see it for the
first time. One can think about it naively in the following way. For an isolated
point, it has no close neighbours to be compared to. Hence, the limit operation
does not work, and thus the function is continuous by default.
Let us summarize again the continuity of a function at a point.
Chapter 2. Limits of Functions and Continuity 107
(ii) For any ε > 0, there is a δ > 0 such that if the point x is in D and
|x − x0 | < δ, then |f (x) − f (x0 )| < ε.
The proof of Theorem 2.11 is almost identical to the proof of Theorem 2.3.
Example 2.17
Solution
The domain of the function is D = R \ {0}. Let x0 be a point in D. Then
x0 ̸= 0. Notice that
Chapter 2. Limits of Functions and Continuity 108
1 1 |x − x0 |
|f (x) − f (x0 )| = − = . (2.1)
x x0 |x||x0 |
If
|x0 |
|x − x0 | < ,
2
then
|x0 |
|x| > > 0.
2
Given ε > 0, let
|x0 | |x0 |2
δ = min , ε .
2 2
|x0 | |x0 |
If x in D is such that |x − x0 | < δ, then |x − x0 | < and so |x| > .
2 2
It follows from (2.1) that
2
|f (x) − f (x0 )| < δ × ≤ ε.
|x0 |2
Proposition 2.12
Let D be a subset of real numbers that contains the point x0 . Given that the
functions f : D → R and g : D → R are continuous at x0 .
Proposition 2.13
Corollary 2.14
Let D be a subset of real numbers that contains the point x0 . If the function
f : D → R is continuous at x0 , then the function |f | : D → R is also
continuous at x0 .
Example 2.18
Solution
The domain of the function f is D = [−2, 3]. First we show that if x0 ∈
D \ {1}, then f is continuous at x0 .
If x0 ∈ [−2, 1), then x0 < 1. If {xn } is a sequence in D \ {x0 } that
converges to x0 , then there is a positive integer N such that xn < 1 for all
n ≥ N . This implies that for all n ≥ N , f (xn ) = 2x2n − 3. Hence, the
sequence {f (xn )} converges to f (x0 ) = 2x20 − 3. This proves that f is
continuous at x0 .
Using similar arguments, we can show that if x0 ∈ (1, 3], f is continuous
at x0 .
Chapter 2. Limits of Functions and Continuity 110
Remark 2.2
We can formulate a general theorem from Example 2.18 as follows.
Let D be a subset of real numbers that contains the point x0 , and let Dx0 ,−
and Dx0 ,+ be the intersection of D with the sets {x | x < x0 } and {x | x >
x0 } respectively. Suppose that x0 is a limit point of both Dx0 ,− and Dx0 ,+ ,
and f : D → R is a function such that its restrictions to Dx0 ,− and Dx0 ,+
are continuous. If
Example 2.19
Example 2.20
Solution
For any x1 and x2 in [−10, 8],
Hence,
|f (x1 ) − f (x2 )| ≤ 20|x1 − x2 |.
This shows that f is a Lipschitz function with Lipschitz constant 20.
Example 2.21
Solution
If f is a Lipschitz function, there is a positive constant c such that
Here we see that whether a function is Lipschitz or not depends on the domain.
Finally we prove that a Lipschitz function is continuous.
Chapter 2. Limits of Functions and Continuity 113
Theorem 2.15
Let D be a subset of real numbers. If f : D → R is a Lipschitz function,
then it is continuous.
Proof
Since f : D → R is Lipschitz, there is a positive constant c such that for
any x1 and x2 in D,
Let x0 be a point in D. Given ε > 0, take δ = ε/c. Then δ > 0 and for any
x ∈ D, if |x − x0 | < δ,
Exercises 2.2
Question 1
Consider the function f : R → R defined by
2, if x > 2,
f (x) = .
x, if x ≤ 2
Question 2
Consider the function f : R → R defined by
x 2 , if x is rational,
f (x) =
−x2 , if x is irrational.
Question 3
Consider the function f : R → R defined by
2x + 5, if x < −1,
f (x) = .
ax2 + x, if x ≥ −1
Question 4
Question 5
√
Let f : [1, ∞) → R be the function defined by f (x) = x. Show that f is
a Lipschitz function.
Chapter 2. Limits of Functions and Continuity 115
For a real-valued function f : D → R, the maximum value is the largest value the
function can assume; while the minimum value is the smallest value the function
can assume.
Extreme Values
The maximum value of a function f : D → R is the maximum of the set
f (D); while the minimum value is the minimum of the set f (D).
A maximum value or a minimum value of a function is called an extreme
value of the function.
Example 2.22
(a) For the function f : [−1, 2] → R, f (x) = 2x, D = [−1, 2] and f (D) =
[−2, 4]. Thus, f has minimum value −2 and maximum value 4.
(b) For the function g : [−1, 2) → R, g(x) = 2x, D = [−1, 2) and g(D) =
[−2, 4). Thus, g has minimum value −2, but it does not have maximum
value.
Chapter 2. Limits of Functions and Continuity 116
Figure 2.6: The functions f (x), g(x) and h(x) defined in Example 2.22.
Example 2.22 shows that the existence of extreme values depends on the
domain of the function.
For a set to have maximum and minimum values, it is necessary (but not
sufficient) that the set is bounded. Let us first define what it means for a function
to be bounded.
Example 2.23
All the three functions defined in Example 2.22 are bounded.
Example 2.24
1
Consider the function f : (0, 1) → R defined by f (x) = . Although the
x
domain of the function D = (0, 1) is bounded, the range of the function
f (D) = (1, ∞) is not bounded.
This example shows that a continuous function does not necessarily map a
bounded set to a bounded set.
Example 2.25
1
Consider the function f : [1, ∞) → R defined by f (x) = . Although
x
the domain of the function D = [1, ∞) is closed, the range of the function
f (D) = (0, 1] is not closed.
This example shows that a continuous function does not necessarily map a
closed set to a closed set.
The situation changes when we combine closed and bounded. Recall that
we have defined the concept of sequential compactness in Chapter 1, Section
1.8. A set D is sequentially compact if every sequence in D has a subsequence
that converges to a point in D. We have proved that a subset of real numbers is
sequentially compact if and only if it is closed and bounded.
The following theorem says that a continuous function maps a closed and
bounded set to a closed and bounded set.
Theorem 2.16
Let D be a closed and bounded subset of R. If f : D → R is a continuous
function, then the set f (D) is closed and bounded.
Using the fact that a subset of real numbers is sequentially compact if and only
if it is closed and bounded, Theorem 2.16 is equivalent to the following.
Theorem 2.17
Let D be a sequentially compact subset of R. If f : D → R is a continuous
function, then the set f (D) is sequentially compact.
Chapter 2. Limits of Functions and Continuity 118
Proof
We use the definition of sequential compactness to prove this theorem. Let
{yn } be a sequence in f (D). We need to prove that there is a subsequence
of {yn } that converges to a point in f (D).
For each positive integer n, since yn is in f (D), there is an xn in D such
that f (xn ) = yn . This gives a sequence {xn } in D. Since D is sequentially
compact, there is a subsequence {xnk } of {xn } that converges to a point
x0 in D. Since f is continuous at x0 , the sequence {f (xnk )} converges to
f (x0 ). In other words, we have shown that the subsequence {ynk } of {yn }
converges to the point f (x0 ) in f (D).
Corollary 2.19
Example 2.26
Show that the function f : R → R defined by
f (x) = |x − 1| + |x − 2| + |x − 3| + |x − 4| + |x − 5|
Solution
In this example, the domain of the function is not closed and bounded. We
cannot apply the extreme value theorem directly. However, we can proceed
in the following way. First, we justify that the function f : R → R is
continuous. A function of the form g(x) = x − a is continuous since it
is a polynomial. Absolute value of a continuous function is continuous.
Hence, a function of the form h(x) = |x − a| is continuous. Being a sum
of continuous functions, f (x) is a continuous function.
To prove the existence of a minimum value, we notice that for x ≥ 5,
f (x) = x − 1 + x − 2 + x − 3 + x − 4 + x − 5 = 5x − 15 ≥ 10.
For x ≤ 1,
f (x) = 1 − x + 2 − x + 3 − x + 4 − x + 5 − x = 15 − 5x ≥ 10.
Now restrict the domain to [1, 5], the function f : [1, 5] → R is continuous.
Hence, it has a minimum value at some x0 ∈ [1, 5]. It follows that
In particular,
f (x0 ) ≤ f (1) = 10.
This proves that for all x ∈ R, f (x) ≥ f (x0 ). Hence, the function f : R →
R has a minimum value.
Chapter 2. Limits of Functions and Continuity 120
Exercises 2.3
Question 1
Determine whether the function is bounded.
x
(a) f : R → R, f (x) = √ .
x2+4
1
(b) f : (0, 1) → R, f (x) = x + .
x
Question 2
If a function f : D → R is continuous and bounded, does it necessarily
have a maximum value and a minimum value? Justfiy your answer.
Question 3
Let f : [−4, 4] → R be the function defined by
x2 + x + 1
f (x) = √ .
4x2 + 9
Show that it has a maximum value and a minimum value.
Chapter 2. Limits of Functions and Continuity 121
In this section, we are going to discuss the intermediate value theorem, which is
an important theorem for continuous functions. It is essentially a theorem about
existence of solutions for equations defined by continuous functions.
f (c) = w.
Proof
The proof is using bisection method, which provides a constructive way to
find the point c.
Without loss of generality, assume that f (a) < w < f (b).
We construct two sequences {an } and {bn } recursively. Define a1 = a,
b1 = b, and let
a1 + b 1
m1 =
2
be the midpoint of a1 and b1 . The interval [a, b] = [a1 , b1 ] is bisected into
two subintervals [a1 , m1 ] and [m1 , b1 ] by the point m1 .
We want to define the interval [a2 , b2 ] to be one of these, based on the value
of f (m1 ).
By definition,
a1 ≤ a2 < b2 ≤ b1 ,
f (a2 ) < w ≤ f (b2 ),
and the length of the interval [a2 , b2 ] is half the length of the interval [a1 , b1 ].
Chapter 2. Limits of Functions and Continuity 122
By definition,
an ≤ an+1 < bn+1 ≤ bn ,
f (an+1 ) < w ≤ f (bn+1 ),
and
b n − an
bn+1 − an+1 = .
2
This constructs the sequences {an } and {bn }. Notice that {an } is an
increasing sequence that is bounded above by b, while {bn } is a decreasing
sequence that is bounded below by a.
By monotone convergence theorem, the sequence {an } converges to a
number c1 = sup{an } and the sequence {bn } converges to a number
c2 = inf{bn }. By induction, we find that
b−a
b n − an = .
2n−1
Taking n → ∞ limits, we conclude that
c2 − c1 = 0.
Chapter 2. Limits of Functions and Continuity 123
and
lim an = c = lim bn . (2.3)
n→∞ n→∞
Eq. (2.2) shows that c is in [a, b]. The continuity of the function f and (2.3)
implies that
f (c) = lim f (an ) = lim f (bn ).
n→∞ n→∞
Since
f (an ) < w and f (bn ) ≥ w for all n ∈ Z+ ,
we find that
f (c) ≤ w and f (c) ≥ w.
This proves that f (c) = w, and hence completes the proof of the theorem.
Example 2.27
Show that the equation
x6 + 6x + 1 = 0
has a real root.
Solution
6
Let f (x) = x + 6x + 1. Since f (x) is a polynomial, it is a continuous
function. Notice that
Hence, f (−1) < 0 < f (0). Namely, 0 is a value between f (−1) and f (0).
By intermediate value theorem, there is a point c in the interval (−1, 0) such
that f (c) = 0. Then x = c is a root of the equation
x6 + 6x + 1 = 0.
Example 2.28
Let n be a positive integer, and let c be a positive number. Use the
intermediate value theorem to show that there is a positive real number
x such that
xn = c.
Chapter 2. Limits of Functions and Continuity 125
Solution
Take a = 0, b = c + 1, and consider the function f : [a, b] → R defined by
f (x) = xn . Then,
f (a) = f (0) = 0,
f (b) = f (c + 1) = (1 + c)n ≥ 1 + nc > c.
Hence,
f (a) < c < f (b).
Since f is a continuous function, intermediate value theorem asserts that
there is a number x in the interval [0, c + 1] such that f (x) = xn = c.
The equivalence of the two definitions is seen by observing that when t changes
from 0 to 1, (1 − t)u + tv goes through all the points in the interval [u, v].
Obviously, an interval is a convex set. The converse is also true.
Theorem 2.21
Let S be a subset of real numbers. If S is a convex set, then S is an interval.
Chapter 2. Limits of Functions and Continuity 126
Sketch of Proof
If S is bounded below, let a = inf S. Otherwise, set a = −∞. If S is
bounded above, let b = sup S. Otherwise, set b = ∞.
If c is a point in (a, b), then a < c < b. In particular, since c > a, it is not
a lower bound of S. Hence, there is a point u in S such that a ≤ u < c.
Since c < b, c is not an upper bound of S. Hence, there is a point v in S
such that c < v ≤ b. Since u and v are in S and S is convex, all points in
the interval [u, v] are in S. By construction, u < c < v. Hence, c is in S.
This proves that all the points in (a, b) are in S.
Finally, we just need to consider whether S contains a, and whether it
contains b.
1. If S is bounded, S does not contain inf S and sup S, then S = (a, b).
4. If S is bounded, and S contains both inf S and sup S, then S = [a, b].
5. If S is bounded below but not bounded above, and S does not contain
inf S, then S = (a, ∞).
6. If S is bounded below but not bounded above, and S contains inf S, then
S = [a, ∞).
7. If S is bounded above but not bounded below, and S does not contain
sup S, then S = (−∞, b).
Chapter 2. Limits of Functions and Continuity 127
Proof
To show that f (I) is an interval, take two distinct points u and v in f (I).
We need to show that any w in between u and v is in f (I). Since u and v are
in f (I), there exist a and b in I such that u = f (a) and v = f (b). Without
loss of generality, assume that a < b. Since I is an interval, it contains the
interval [a, b]. Since f is continuous on [a, b], and w is in between f (a) and
f (b), the version of the intermediate value theorem that we have proved
implies that there is a point c in the interval (a, b) such that f (c) = w. This
shows that w is also in f (I).
Chapter 2. Limits of Functions and Continuity 128
Exercises 2.4
Question 1
√
Show that the equation 2x + x2 + 1 = 0 has a real solution.
Question 2
Given that f : [−2, 10] → [−2, 10] is a continuous function. Show that
there is a point x in the interval [−2, 10] such that f (x) = x.
Question 3
Suppose that f : R → R is a bounded continuous function. Show that there
is a real number x such that f (x) = x.
Question 4
Let n be an odd positive integer, and let
Theorem 2.24
Let D be a subset of real numbers. If f : D → R is a uniformly continuous
function, it is continuous.
Chapter 2. Limits of Functions and Continuity 130
Let us compare the definitions of continuity and uniform continuity using the
definitions in terms of ε − δ.
• A function f : D → R is continuous if
Example 2.29
1
Show that the function f : (0, 1) → R, f (x) = is not uniformly
x
continuous.
Solution
For a positive integer n, let
1 1
un = , vn = .
n+1 n+2
Then {un } and {vn } are sequences in the domain D = (0, 1), and
1 1
lim (un − vn ) = lim − lim = 0.
n→∞ n→∞ n + 1 n→∞ n + 2
Chapter 2. Limits of Functions and Continuity 131
Example 2.30
Solution
For a positive integer n, let
1
un = n + , vn = n.
n
Then {un } and {vn } are sequences in the domain D = (0, ∞), and
1
lim (un − vn ) = lim = 0.
n→∞ n→∞ n
Since 2
1 1
f (un ) − f (vn ) = n + − n2 = 2 + 2 ,
n n
we find that
lim f (un ) − f (vn ) = 2 ̸= 0.
n→∞
Example 2.31
Solution
In the solution of Example 2.20, we have shown that for any x1 and x2 in
the domain D = [−10, 8],
Example 2.31 is a function that is Lipschitz. In fact, the proof of Theorem 2.15
can be easily modified to prove that a Lipschitz function is uniformly continuous.
Theorem 2.25
Let D be a subset of real numbers. If f : D → R is a Lipschitz function,
then it is uniformly continuous.
√
The converse is not true. For example, the function f : [0, 1] → R, f (x) = x
is not Lipschitz, but it is uniformly continuous. We leave this to the exercise.
In the following, we give a sufficient condition for a function to be uniformly
continuous.
Theorem 2.26
Let D be a closed and bounded subset of real numbers. If f : D → R is a
continuous function, then it is uniformly continuous.
Lemma 2.27
Let S be a sequentially compact set in R, and let {an } and {bn } be two
sequences in S. There is strictly increasing sequence of positive integers
{n1 , n2 , n3 , . . .} such that each of the subsequences {an1 , an2 , an3 , . . .} and
{bn1 , bn2 , bn3 , . . .} converges to a point in S.
Proof
First, the sequentially compactness of S guarantees that there is a
strictly increasing sequence of positive integers {k1 , k2 , k3 , . . .} so that the
subsequence {ak1 , ak2 , ak3 , . . .} converges to a point a in S. For a positive
integer j, let
c j = bk j .
Consider the sequence {cj } indexed by j ∈ Z+ . It is a subsequence of
{bn }, and it is also a sequence in S. Since S is sequentially compact, there
is a strictly increasing sequence of positive integers {j1 , j2 , j3 , . . .} such
that the subsequence {cj1 , cj2 , cj3 , . . .} = {bkj1 , bkj2 , bkj3 , . . .} converges
to a point b in S. For a positive integer m, let nm = kjm . Then
{n1 , n2 , n3 , . . .} is a strictly increasing sequence of positive integers.
The sequence {an1 , an2 , an3 , . . .} is a subsequence of {ak1 , ak2 , ak3 , . . .}.
Hence, it converges to a. The sequence {bn1 , bn2 , bn3 , . . .} is the sequence
{cj1 , cj2 , cj3 , . . .} which converges to b.
Example 2.32
√
Show that the function f : (0, 100) → R, f (x) = x is uniformly
continuous.
Solution
The domain of the function Df = (0, 100) is not closed and bounded. We
cannot apply Theorem 2.26 directly. Consider the function g : [0, 100] → R
√
defined by g(x) = x. It is a continuous function. Since the domain
Dg = [0, 100] is closed and bounded, g is uniformly continuous.
Since f : (0, 100) → R is the restriction of the function g to Df , it is also
uniformly continuous.
Exercises 2.5
Question 1
Question 2
x
Show that the function f : (0, 20) → R, f (x) = √ is uniformly
x+1
continuous.
Question 3
√
Let f : [0, 1] → R be the function defined by f (x) = x. Show that f is
not Lipschitz, but it is uniformly continuous.
Question 4
1
Determine whether the function f : (0, 1) → R, f (x) = √ is uniformly
x
continuous.
Chapter 2. Limits of Functions and Continuity 136
Proposition 2.28
Let D be a subset of real functions and let f : D → R be a function defined
on D. If f is strictly monotonic, then it is one-to-one.
Chapter 2. Limits of Functions and Continuity 137
Example 2.33
It is an increasing function.
It is a decreasing function.
Figure 2.8: The functions f (x) and g(x) defined in Example 2.33.
Chapter 2. Limits of Functions and Continuity 138
Example 2.34
Figure 2.9: The functions f (x) and g(x) defined in Example 2.34.
Theorem 2.29
Let f : [a, b] → R be a monotonic function. For any x0 in (a, b], the left
limit
f− (x0 ) = lim− f (x)
x→x0
exists. Define
f− (a) = f (a) and f+ (b) = f (b).
Then the function f : [a, b] → R is continuous at the point x0 in [a, b] if and
only if
f− (x0 ) = f (x0 ) = f+ (x0 ).
Otherwise, f has a jump discontinuity at x0 with jump
Proof
If f is decreasing, then −f is increasing. Hence, we only need to consider
the case where f : [a, b] → R is increasing. Fixed x0 in (a, b]. Define the
nonempty set S− by
S− = {f (x) | a ≤ x < x0 } .
Since f is increasing, f (x) ≤ f (x0 ) for any x in [a, x0 ). Therefore the set
S− is bounded above by f (x0 ). Let u = sup S− . Then u ≤ f (x0 ). We
claim that
u = lim− f (x) = f− (x0 ).
x→x0
f (x1 ) > u − ε.
Chapter 2. Limits of Functions and Continuity 140
Let δ = x0 −x1 . Then δ > 0. If x is a point in [a, x0 ) such that |x−x0 | < δ,
then x1 < x < x0 , and thus
Using similar argument, we find that for any x0 in [a, b), the right limit
lim+ f (x) exists, and
x→x0
Corollary 2.30
Let I be an interval. If f : I → R is monotonic, then f is continuous if and
only if f (I) is an interval.
Proof
If f : I → R is continuous, intermediate value theorem implies that f (I) is
an interval.
If f : I → R is not continuous, Theorem 2.29 implies that there is a point
x0 in the interval I for which either f− (x0 ) ̸= f (x0 ) or f+ (x0 ) ̸= f (x0 ). In
any case, f (I) cannot be an interval.
Theorem 2.31
Let f : I → R be a function defined on an interval I. If f is continuous
and one-to-one, then f is strictly monotonic.
Proof
If f fails to be strictly monotonic, there exist three points a, x0 , b in I such
that a < x0 < b and one of the following holds.
Consider case (i) where f (a) < f (b) < f (x0 ). Since w = f (b) is a value
between f (a) and f (x0 ), intermediate value theorem implies that there is
a point c in the interval (a, x0 ) for which f (c) = w. But then c ̸= b, but
f (c) = f (b). This contradicts to f is one-to-one.
Using the same argument, we will reach a contradiction for the other three
cases. This proves that f must be strictly monotonic.
Example 2.35
Consider the functions f and g that are defined in Example 2.34.
Theorem 2.32
Let I be an interval and let f : I → R be a continuous function defined
on I. If f : I → R is one-to-one, then f −1 : f (I) → R exists, and it is
continuous.
Chapter 2. Limits of Functions and Continuity 143
Proof
By Theorem 2.31, f is strictly monotonic. Without loss of generality, we
assume that f is strictly increasing.
Given a point y0 in the interval f (I), let x0 be the unique point in I such
that f (x0 ) = y0 . Given ε > 0, we need to prove that there is a δ > 0 such
that if y is a point in f (I) with |y − y0 | < δ, then |f −1 (y) − f −1 (y0 )| < ε.
For simplicity, assume that x0 is an interior point of I. Then there is a r > 0
such that [x0 − r, x0 + r] is in I. Take
ε1 = min{ε, r}.
Remark 2.3
If I = (a, b) is an open interval and the function f : (a, b) → R is
continuous and one-to-one, we have seen in Theorem 2.31 that f is strictly
monotonic. In fact, one can prove that f (I) is also an open interval.
Without loss of generality, assume that f is strictly increasing. Since f
is continuous, f (I) is an interval. If f (I) is not an open interval, either
inf f (I) or sup f (I) is in f (I). If c = inf f (I) is in f (I), there is a point
u+a
u in (a, b) such that f (u) = c. But then u > a and so u1 = is also
2
a point in (a, b). Since u1 < u, f (u1 ) < f (u) = c. This contradicts to
c = inf f (I). In the same way, one can show that sup f (I) is not in f (I).
Hence, f (I) must be an open interval.
Although we can use limits to show that when n is a positive integer, the
√
function f (x) = n x is continuous, it is tedious. Using Theorem 2.32 is much
more succint.
Example 2.36
Let n be a positive integer.
It is easy to check that the two expressions for xr are equal. Using the fact that
composition of continuous functions is continuous, we obtain the following.
Chapter 2. Limits of Functions and Continuity 145
Theorem 2.33
Let r be a rational number.
Exercises 2.6
Question 1
2x + 1
Show that the function f : (−1, ∞) → R, f (x) = is strictly
x+1
monotonic, and find the inverse function f −1 .
Chapter 3. Differentiating Functions of a Single Variable 146
Chapter 3
The simplest function is the constant function f (x) = c, whose function value
does not vary with the input. The next class of functions that are relatively easy
to study is a polynomial of degree one f (x) = ax + b, where a ̸= 0. Sometimes
we also call any function of the form f (x) = ax + b as a linear function, as its
graph y = ax + b is a straight line in the xy-plane. However, this should not be
confused with a linear function that are considered in linear algebra, which in the
single variable case, refers to a function of the form f (x) = ax.
Gf = {(x, y) | x ∈ D, y = f (x)} .
Figure 3.1: The graph of the function f (x) = ax + b when (i) a < 0, (ii) a = 0
and (iii) a > 0.
For the function y = f (x) = ax + b, we find that for any two distinct points
Chapter 3. Differentiating Functions of a Single Variable 147
x1 and x2 ,
f (x2 ) − f (x1 ) a(x2 − x1 )
= = a.
x2 − x1 x2 − x1
In other words, the change in the y values,
∆y = y2 − y1 = f (x2 ) − f (x1 )
∆x = x2 − x1 ,
with propotionality constant a. This constant a is called the rate of change of the
function, and it is the slope of the line y = ax + b. Its magnitude |a| characterizes
how fast y is changing with respect to x, and its sign determine the way y changes.
When a > 0, y increases as x increases. When a < 0, y decreases as x increases.
For a function that is more complicated, such as a quadratic function y =
f (x) = x2 , we find that
3.1 Derivatives
f (x) − f (x0 )
lim
x→x0 x − x0
if it exists. If the limit exists, we say that f is differentiable at x0 , and its
derivative a x0 is denoted by f ′ (x0 ). Namely,
f (x) − f (x0 )
f ′ (x0 ) = lim .
x→x0 x − x0
f (x) − f (x0 )
g(x) = ,
x − x0
which is defined on the set D = (a, b) \ {x0 }. It is easy to check that x0 is
indeed a limit point of the set D. The function g(x) is the quotient of the function
p(x) = f (x) − f (x0 ) and the function q(x) = x − x0 . It is not defined at x = x0
since q(x0 ) = 0. Moreover, since lim q(x) = q(x0 ) = 0, a necessary condition
x→x0
for f to be differentiable at the point x0 is lim p(x) = 0, which says that the
x→x0
function f (x) is continuous at x0 .
Let x0 be a point in the open interval (a, b). If the function f : (a, b) → R
is differentiable at x0 , it is continuous at x0 .
Chapter 3. Differentiating Functions of a Single Variable 149
Proof
If f is differentiable at x0 , the limit
f (x) − f (x0 )
f ′ (x0 ) = lim
x→x0 x − x0
exists. By limit laws,
f (x) − f (x0 )
lim (f (x) − f (x0 )) = lim lim (x − x0 ) = f ′ (x0 ) × 0 = 0.
x→x0 x→x0 x − x0 x→x0
Hence,
lim f (x) = f (x0 ),
x→x0
f (x0 + h) − f (x0 )
f ′ (x0 ) = lim .
h→0 h
The continuity of the function f at the point x0 is then equivalent to
f (x + h) − f (x)
f ′ (x) = lim .
h→0 h
Example 3.1
f (x) − f (x0 )
f ′ (x0 ) = lim = a.
x→x0 x − x0
Hence, f is a differentiable function and its derivative is
Example 3.2
Solution
For any real number x,
f (x + h) − f (x)
f ′ (x) = lim
h→0 h
(x + h)2 − x2
= lim
h→0 h
2xh + h2
= lim
h→0 h
= lim (2x + h)
h→0
= 2x.
f (x) − f (x0 )
mx;x0 =
x − x0
Chapter 3. Differentiating Functions of a Single Variable 151
as x approaches x0 . Since mx;x0 is the slope of the secant line joining the two
points (x0 , f (x0 )) and (x, f (x)) on the graph of the function, in the limit x → x0 ,
we obtain a straight line that only touches the graph in a neighbourhood of the
point (x0 , f (x0 )) at this point. This line is called the tangent line of the curve
y = f (x) at the point (x0 , f (x0 )).
Example 3.3
We have found in Example 3.2 that the derivative of the function f (x) = x2
is f ′ (x) = 2x. At the point x = 3, f (3) = 9 and f ′ (3) = 6. Hence, the
equation of the tangent line to the curve y = x2 at the point (3, 9) is
y = 9 + 6(x − 3) = 6x − 9.
Example 3.4
f (x) − f (x0 ) x − x0
lim = lim = 1.
x→x0 x − x0 x→x 0 x − x0
f (x) − f (0) −x
lim− = lim− = −1.
x→0 x−0 x→0 x
This implies that the limit
f (x) − f (0)
lim
x→0 x−0
does not exist. Hence, f is not differentiable at x = 0.
Graphically, we find that the curve y = |x| has a "sharp turn" at the point
(0, 0), and there is no well-defined tangent there (see Figure 3.3).
Chapter 3. Differentiating Functions of a Single Variable 153
Figure 3.3: The graph of the function f (x) = |x| has a "sharp turn" at x = 0.
f (x) − f (x0 )
f−′ (x0 ) = lim− ,
x→x0 x − x0
exists.
f (x) − f (x0 )
f+′ (x0 ) = lim+ ,
x→x0 x − x0
exists.
Proposition 3.2
Proof
We use the formula
f (x) − f (x0 )
f ′ (x0 ) = lim
x→x0 x − x0
xn − xn0
= lim
x→x0 x − x0
= nxn−1
0 .
Example 3.6
√
Determine the points where the function f : [0, ∞) → R, f (x) = x is
differentiable, and find the derivatives at those points.
Solution
First we consider the case where x0 > 0. When x > 0 and x ̸= x0 ,
√ √
f (x) − f (x0 ) x − x0 1
= =√ √ . (3.1)
x − x0 x − x0 x + x0
Hence,
f (x) − f (x0 ) 1 1
f ′ (x0 ) = lim = lim √ √ = √ .
x→x0 x − x0 x→x0 x + x0 2 x0
1
This shows that f is differentiable at x0 with derivative f ′ (x0 ) = √ .
2 x0
For x0 = 0, we can only consider the right derivative. When x > 0, the
formula (3.1) still holds. However, the limit
f (x) − f (0) 1
lim+ = lim+ √
x→0 x−0 x→0 x
√
does not exist. Hence, the function f (x) = x is not differentiable at
x = 0.
Chapter 3. Differentiating Functions of a Single Variable 156
√
Figure 3.4: The function f (x) = x is not differentiable at x = 0.
Using limit laws, one can find derivatives of linear combinations, products and
quotients of functions.
Let x0 be a point in (a, b). Given that the functions f : (a, b) → R and
g : (a, b) → R are differentiable at x0 . For any constants α and β, the
function αf + βg : (a, b) → R is also differentiable at x0 and
Proof
This is straightforward derivation from the limit laws. By assumption, we
have
f (x) − f (x0 ) g(x) − g(x0 )
f ′ (x0 ) = lim and g ′ (x0 ) = lim .
x→x0 x − x0 x→x0 x − x0
It follows that
(αf + βg)(x) − (αf + βg)(x0 )
(αf + βg)′ (x0 ) = lim
x→x0 x − x0
f (x) − f (x0 ) g(x) − g(x0 )
= lim α +β
x→x0 x − x0 x − x0
f (x) − f (x0 ) g(x) − g(x0 )
= α lim + β lim
x→x0 x − x0 x→x0 x − x0
= αf ′ (x0 ) + βg ′ (x0 ).
Chapter 3. Differentiating Functions of a Single Variable 157
Let x0 be a point in (a, b). Given that the functions f : (a, b) → R and
g : (a, b) → R are differentiable at x0 , the function (f g) : (a, b) → R is
also differentiable at x0 and
Proof
Again, we are given that
Just like the proof of the product rule for limits, we need to do some
manipulations.
It follows that
f (x)g(x) − f (x0 )g(x0 )
(f g)′ (x0 ) = lim
x→x0 x − x0
f (x) − f (x0 ) g(x) − g(x0 )
= lim g(x0 ) + f (x)
x→x0 x − x0 x − x0
f (x) − f (x0 ) g(x) − g(x0 )
= lim lim g(x0 ) + lim f (x) lim
x→x0 x − x0 x→x0 x→x0 x→x0 x − x0
′ ′
= f (x0 )g(x0 ) + f (x0 )g (x0 ).
Chapter 3. Differentiating Functions of a Single Variable 158
Let x0 be a point in (a, b). Given that the functions f : (a, b) → R and
g : (a, b) → R are differentiable at x0 , and g(x) ̸= 0 for all x in (a, b).
Then the function (f /g) : (a, b) → R is differentiable at x0 and
′
f f ′ (x0 )g(x0 ) − f (x0 )g ′ (x0 )
(x0 ) = .
g g(x0 )2
Chapter 3. Differentiating Functions of a Single Variable 159
The assumption g(x) ̸= 0 for all x in (a, b) is to make sure that the function
f /g is well-defined on (a, b). In practice, we only need g(x0 ) ̸= 0 and g is
differentiable at x0 . For then we find that g is continuous at x0 . The assumption
g(x0 ) ̸= 0 will imply that g(x) ̸= 0 in a neighbourhood of x0 .
Proof
First, notice that
f (x) f (x0 ) f (x)g(x0 ) − f (x0 )g(x)
− =
g(x) g(x0 ) g(x)g(x0 )
(f (x) − f (x0 ))g(x0 ) − f (x0 )(g(x) − g(x0 ))
= .
g(x)g(x0 )
Using the same reasoning as in the proof of the product rule, we obtain
′
f 1
(x0 ) = lim
g x→x0 g(x)g(x0 )
f (x) − f (x0 ) g(x) − g(x0 )
× g(x0 ) lim − f (x0 ) lim
x→x0 x − x0 x→x0 x − x0
′ ′
f (x0 )g(x0 ) − f (x0 )g (x0 )
= .
g(x0 )2
Proposition 3.6
For any integer n,
d n
x = nxn−1 . (3.2)
dx
Chapter 3. Differentiating Functions of a Single Variable 160
Proof
We have proved the formula (3.2) when n ≥ 0. When n < 0, let m = −n.
Then m is a positive integer. By quotient rule, we have
d d
d n d 1 xm 1 − xm mxm−1 m
x = = dx dx = − = − m+1 = nxn−1 .
dx dx x m x 2m x 2m x
Hence, the formula (3.2) also holds when n is a negative integer.
dn y
f (n) (x) or
dxn
to denote the nth -derivative of the function y = f (x). It is defined
recursively by
Example 3.7
Polynomial functions are infinitely differentiable. Moreover, if the degree
of a polynomial p(x) is n, then p(k) (x) = 0 for all k ≥ n + 1.
Chapter 3. Differentiating Functions of a Single Variable 161
Example 3.8
Define the function f : R → R by
ax2 , if x < 1,
f (x) =
x + b , if x ≥ 1.
x
Find the values of a and b so that f is differentiable.
Solution
The function f is differentiable at any point x0 in the interval (−∞, 1) or
the interval (1, ∞).
For f to be differentiable, f has to be continuous and differentiable at x =
1. For f to be continuous at x = 1, we must have
This gives
a = 1 + b.
For f to be differentiable at x = 1, we must have
Exercises 3.1
Question 1
Define the function f : R → R by
ax2 + x, if x < 1,
f (x) =
bx + 3 , if x ≥ 1.
x2
Find the values of a and b so that f is differentiable.
Question 2
Let x0 be a point in (a, b). Given that f : [a, b] → R is a continuous function
defined on [a, b] and differentiable at x0 . Let g : [a, b] → R be the function
defined by
f (x) − f (x0 ) ,
if x ∈ [a, b] \ {x0 }
g(x) = x − x0
f ′ (x0 ),
if x = x0 .
Question 3
Let x0 be a point in (a, b) and let f : (a, b) → R be a function defined on
(a, b).
f (x0 + h) − f (x0 − h)
lim = f ′ (x0 ).
h→0 2h
In this section, we are going to derive derivative formulas for composite functions
and inverse functions. First we discuss a different perspective for differentiability
of a function at a point.
Differentiability
Let x0 be a point in the interval (a, b) and let f : (a, b) → R be a function
defined on (a, b). If f is differentiable at x0 , then
f (x0 + h) − f (x0 )
f ′ (x0 ) = lim .
h→0 h
This implies that
f (x0 + h) − f (x0 ) − ch
lim = 0,
h→0 h
limit laws imply that
f (x0 + h) − f (x0 )
c = lim .
h→0 h
This implies that f is differentiable at x0 and f ′ (x0 ) = c. In other words,
the function f is differentiable at x0 if and only if there is a number c such
that
f (x0 + h) − f (x0 ) − ch
lim = 0.
h→0 h
Since x0 ∈ (a, b), there is an r > 0 such that (x0 − r, x0 + r) ⊂ (a, b). For
a given real number c, let ε : (−r, r) → R be the function defined by
f (x0 + h) − f (x0 ) − ch
ε(h) = .
h
Chapter 3. Differentiating Functions of a Single Variable 165
and
ε(h) → 0 when h → 0.
Proof
Let y0 = f (x0 ), and define the functions ε1 (h) and ε2 (k) by
where
k(h)
ε3 (h) = g ′ (y0 )ε1 (h) + ε2 (k(h)) .
h
Since f is differentiable at x0 ,
Heuristically, if we let u = f (x) and y = g(u) = g(f (x)), chain rule says that
dy dy du
= × ,
dx du dx
which is the limit of
∆y ∆y ∆u
= ×
∆x ∆u ∆x
when ∆x → 0. The rigorous proof we give above do not use this because we
might face the problem that ∆u = f (x) − f (x0 ) can be zero even when x ̸= x0 .
Example 3.9
Let f : R → R be a differentiable function and let a be a constant. Show
that the function g : R → R defined by g(x) = f (ax) is differentiable, and
g ′ (x) = af ′ (ax).
Chapter 3. Differentiating Functions of a Single Variable 167
Solution
The function u : R → R, u(x) = ax is differentiable with u′ (x) = a. By
chain rule, the function g(x) = (f ◦ u)(x) is also differentiable and
Example 3.10
f (x3 ) − f (1)
lim
x→1 x−1
in terms of a.
Solution
3
Let g(x) = x . Then g(1) = 1 and g is differentiable at x = 1 with
g ′ (1) = 3.
f (x3 ) − f (1)
lim = (f ◦ g)′ (1) = f ′ (g(1))g ′ (1) = 3f ′ (1) = 3a.
x→1 x−1
be zero. In the following theorem, we show that this condition is also sufficient.
The formula for (f −1 )′ (y0 ) would follow from the chain rule if we know
apriori that f −1 is differentiable at y0 . The gist of this theorem is to state that
f −1 is indeed differentiable at y0 .
Proof
Without loss of generality, assume that f is strictly increasing. By Theorem
2.32, f −1 : f (I) → R is also continuous. There is a δ > 0 so that [x0 −
δ, x0 + δ] ⊂ I. Then (f (x0 − δ), f (x0 + δ)) is an open interval in f (I)
containing the point y0 . This implies that there is an r > 0 so that (y0 −
r, y0 + r) ⊂ f (I). For any k ∈ (−r, r), let
Hence, by limit laws for quotients and composite functions, we find that
f −1 (y0 + k) − f −1 (y0 ) 1
lim =
k→0 k f (x0 + h(k)) − f (x0 )
lim
k→0 h(k)
1
=
f (x0 + h) − f (x0 )
lim
h→0 h
1
= ′ .
f (x0 )
Corollary 3.9
Let I be an open interval, and let f : I → R be a strictly monotonic
differentiable function. If f ′ (x) ̸= 0 for all x ∈ I, then the inverse function
f −1 : f (I) → R is also a strictly monotonic differentiable function with
1
(f −1 )′ (x) = .
f ′ (f −1 (x))
Example 3.11
f ′ (x) = rxr−1 .
Chapter 3. Differentiating Functions of a Single Variable 170
Solution
First we consider the case r = 1/n, where n is a positive integer. The
function f (x) = x1/n is the inverse of the function g(x) = xn , which is
differentiable and strictly increasing. Hence, f (x) = x1/n is differentiable
and strictly increasing. Moreover, since g ′ (x) = nxn−1 , we have
1 1 1 1 1 −1
f ′ (x) = = = = xn .
g ′ (f (x)) g ′ (x1/n ) n(x1/n )n−1 n
where
h(x) = xp , g(x) = x1/q .
By Proposition 3.6, h′ (x) = pxp−1 . We have just shown that g ′ (x) =
1 1q −1
x . By chain rule,
q
1 p 1q −1 p p
f ′ (x) = g ′ (h(x))h′ (x) = (x ) × pxp−1 = x q −1 = rxr−1 .
q q
Chapter 3. Differentiating Functions of a Single Variable 171
Exercises 3.2
Question 1
Given that the function f : (0, ∞) → R is defined by
1
f (x) = √ .
4 + x2
(a) Show that f is one-to-one.
Question 2
Let a be a positive number. Recall that a function f : (−a, a) → R is even
if and only if
The mean value theorem is one of the most important theorems in analysis. We
will first prove a special case of the mean value theorem called Rolle’s theorem.
To prove this, we need the extreme value theorem, which asserts the existence
of global maximum and global minimum for a continuous function defined on a
closed and bounded interval. As a matter of fact, what we actually need is a local
extremum, which we define as follows.
f (x) ≤ f (x0 ).
f (x) ≥ f (x0 ).
Figure 3.6: The function y = f (x) has local maxima at the points B and D, and
local minima at the points A and C. The point A is also where global minimum
appears; while the point B is where the global maximum appears.
at that point, the derivative of the function can only be zero at that point.
Theorem 3.10
Let (a, b) be a neighbourhood of the point x0 , and let f : (a, b) → R
be a function defined on (a, b). If x0 is a local extremizer of f , and f is
differentiable at x0 , then f ′ (x0 ) = 0.
Proof
Without loss of generality, assume that x0 is a local maximizer. Then there
is a δ > 0 such that (x0 − δ, x0 + δ) ⊂ (a, b), and for all x in (x0 − δ, x0 + δ),
f (x) ≤ f (x0 ). Since f is differentiable at x0 , the limit
f (x) − f (x0 )
lim
x→x0 x − x0
exists and is equal to f ′ (x0 ). This implies that the left limit and the right
limit both exist and both equal to f ′ (x0 ). Namely,
For the left limit, when x is in (x0 −δ, x0 ), x−x0 < 0 and f (x)−f (x0 ) ≤ 0.
Therefore,
f (x) − f (x0 )
≥0 when x ∈ (x0 − δ, x0 ).
x − x0
Taking the x → x− ′
0 limit, we find that f (x0 ) ≥ 0. For the right limit, when
x is in (x0 , x0 + δ), x − x0 > 0 but f (x) − f (x0 ) ≤ 0. Therefore,
f (x) − f (x0 )
≤0 when x ∈ (x0 , x0 + δ).
x − x0
Taking the x → x+ ′
0 limit, we find that f (x0 ) ≤ 0. Since the left limit shows
that f ′ (x0 ) ≥ 0 while the right limit shows that f ′ (x0 ) ≤ 0, we conclude
that f ′ (x0 ) = 0.
Hence, Theorem 3.10 says that if x0 is an interior point of D, and the function
f : D → R is differentiable at x0 , a necessary condition for x0 to be a local
extremum of the function f is x0 must be a stationary point. Nevertheless, this
condition is not sufficient. For example, the function f (x) = x3 has a stationary
point at x = 0, but x = 0 is not a local extremizer of the funnction.
Now let us return to the mean value theorem. As a motivation, let us consider
the distance s travelled by an object as a function of time t. We have discussed in
Section 3.1 that to find the instantaneous speed of the object at a particular time
t0 , we first find the average speed over the time interval from t0 to t0 + ∆t, and
take the limit ∆t → 0. Namely, the instantaneous speed at time t0 is
s(t0 + ∆t) − s(t0 )
lim ,
∆t→0 ∆t
Chapter 3. Differentiating Functions of a Single Variable 175
which is precisely s′ (t0 ), the derivative of s(t) at t = t0 . The mean value theorem
asserts that the average speed of the object in a time interval [t1 , t2 ] must equal
to the instantaneous speed s′ (t0 ) for some t0 in that interval. Intuitively, this is
something one would expect to be true.
Now let us prove a special case of the mean value theorem.
Proof
Since f : [a, b] → R is a continuous function defined on a closed
and bounded interval, the extreme value theorem says that it must have
minimum value and maximum value. In other words, there are two points
x1 and x2 in [a, b] such that
Notice that x1 and x2 are also local extremizers of the function f : [a, b] →
R. If f (x1 ) = f (x2 ), then f is a constant function. In this case, f ′ (x0 ) = 0
for all x0 in (a, b). If f (x1 ) ̸= f (x2 ), then f (x1 ) < f (x2 ). Since f (a) =
f (b), either x1 or x2 must be in the open interval (a, b). In other words,
there is a local extremizer x0 in the interval (a, b). Since f is differentiable
at x0 , Theorem 3.10 says that we must have f ′ (x0 ) = 0. In either case,
there is an x0 in (a, b) satisfying f ′ (x0 ) = 0.
f (b) − f (a)
f ′ (x0 ) = .
b−a
The mean value theorem stated in Theorem 3.12 is also referred to as Lagrange’s
mean value theorem. Notice that Rolle’s theorem is a special case of the mean
value theorem where f (a) = f (b). The quantity
f (b) − f (a)
b−a
gives the average rate of change of the function f (x) over the interval [a, b],
and the mean value theorem says that this average rate of change is equal to
the rate of change at a particular point. To prove the mean value theorem, we
apply a transformation to the function f (x) to get a function g(x) that satisfies the
conditions in the Rolle’s theorem.
Chapter 3. Differentiating Functions of a Single Variable 177
Proof
Let g : [a, b] → R be the function defined by
and so
f (b) − f (a)
m= .
b−a
Notice that the function g : [a, b] → R is continuous, and g : (a, b) → R is
differentiable with
f (b) − f (a)
g ′ (x) = f ′ (x) − m = f ′ (x) − .
b−a
By construction, g(a) = g(b). Hence, we can apply Rolle’s theorem to the
function g and conclude that there is a point x0 in (a, b) such that g ′ (x0 ) =
0. For this point x0 ,
f (b) − f (a)
f ′ (x0 ) = .
b−a
This proves the mean value theorem.
Notice that for the mean value theorem to hold, the function f : [a, b] → R do
Chapter 3. Differentiating Functions of a Single Variable 178
not need to be differentiable at the end points of the interval [a, b], and the point
x0 is guaranteed to be a point in the interior of the interval.
The mean value theorem has very wide applications. We will discuss a few in
this section.
Recall that the derivative of a constant function is 0. The converse is not
obvious, but it is an easy consequence of the mean value theorem.
Lemma 3.13
If the function f : [a, b] → R is continuous on [a, b], differentiable on (a, b),
and f ′ (x) = 0 for all x ∈ (a, b), then f is a constant function.
Take any x ∈ (a, b]. Then f is continuous on [a, x], differentiable an (a, x).
Therefore, we can apply mean value theorem to conclude that there is a
point c in (a, x) such that
f (x) − f (a)
= f ′ (c) = 0.
x−a
This proves that f (x) = f (a). Therefore, the function f is a constant.
Theorem 3.14
Assume that the functions f : [a, b] → R and g : [a, b] → R are continuous
on [a, b], differentiable on (a, b), and
Proof
Define the function h : [a, b] → R by
Then the function h is continuous on [a, b], differentiable on (a, b), and
h′ (x) = 0 for all x ∈ (a, b). By Lemma 3.13, h is a cosntant function.
Namely, there is a constant C so that h(x) = C for all x ∈ [a, b]. Therefore,
Theorem 3.14 implies the identity criterion, which says that if two functions
are differentiable in an open interval, their derivatives are the same, and
their values at a single point in the interval coincide, then these two
functions must be identical.
Example 3.12
Let n be a nonnegative integer. Assume that the function p : R → R is
(n + 1) times differentiable and p(n+1) (x) = 0 for all real numbers x. Then
p(x) is a polynomial of degree at most n.
Proof
We prove this by induction on n. When n = 0, the statement says that
if p : R → R is a differentiable function and p′ (x) = 0 for all x ∈ R,
then p(x) is a polynomial of degree 0. Since a polynomial of degree 0 is a
constant, this statement is true by Lemma 3.13.
Now let n ≥ 1, and assume that we have proved that for any k < n, if
q : R → R is a function that is (k+1) times differentiable and q (k+1) (x) = 0
for all real numbers x, then q(x) is a polynomial of degree at most k.
Chapter 3. Differentiating Functions of a Single Variable 180
Example 3.13
Solution
Let x be point in (0, 10]. By mean value theorem, there is a c ∈ (0, x) such
that
f (x) − f (0)
= f ′ (c).
x−0
Since −3 < f ′ (c) < 8, we find that
The next example shows that the mean value theorem can be used to determine
the number of solutions of an equation.
Example 3.14
Recall that in Example 2.27, we have shown that the equation
x6 + 6x + 1 = 0
has a real root. Determine the exact number of real roots of this equation.
Solution
Let f : R → R be the function f (x) = x6 + 6x + 1. This is a differentiable
function with
f ′ (x) = 6x5 + 6.
From this, we find that f ′ (x) = 0 if only if x5 = −1, if and only if x = −1.
If x1 and x2 are two points such that x1 < x2 and f (x1 ) = f (x2 ) = 0,
Rolle’s theorem says that there is a point u in (x1 , x2 ) such that f ′ (u) = 0.
Chapter 3. Differentiating Functions of a Single Variable 182
If f (x) = 0 has three distinct real roots, we can assume that these real roots
are x1 , x2 and x3 with x1 < x2 < x3 . Then there is a u1 in (x1 , x2 ), and
a u2 in (x2 , x3 ) such that f ′ (u1 ) = f ′ (u2 ) = 0. In other words, f ′ (x) = 0
has two distinct real roots u1 and u2 . But we have shown that there is only
one x such that f ′ (x) = 0. Therefore, f (x) = 0 can have at most two real
solutions.
Since f (0) = 1, we have f (−1) < 0 < f (1). By intermediate value
theorem, there is a c1 ∈ (−1, 0) such that f (c1 ) = 0.
Since f (−2) = 53, we have f (−1) < 0 < f (−2). By intermediate value
theorem, there is a c2 ∈ (−2, −1) such that f (c2 ) = 0.
We conclude that f (x) = 0 has exactly two real roots.
Theorem 3.15
Given that f : [a, b] → R is a function continuous on [a, b], and
differentiable on (a, b).
1. If f ′ (x) > 0 for all x ∈ (a, b), then f : [a, b] → R is a strictly increasing
function.
2. If f ′ (x) < 0 for all x ∈ (a, b), then f : [a, b] → R is a strictly decreasing
function.
Notice that we only assume that f ′ is positive or negative on the open interval
(a, b). If f ′ exists at the end points, it can be 0 there, and the conclusion about the
strict monotonicity still holds for the entire closed interval [a, b].
Proof
It suffices for us to prove the first statement. Given any two points x1 and
x2 in the closed interval [a, b] with x1 < x2 , the function f is continuous on
[x1 , x2 ], differentiable on (x1 , x2 ), and f ′ (x) > 0 for any x ∈ (x1 , x2 ). By
mean value theorem, there is a point c in (x1 , x2 ) such that
Chapter 3. Differentiating Functions of a Single Variable 183
f (x2 ) − f (x1 )
= f ′ (c).
x2 − x 1
Since f ′ (c) > 0 and x2 − x1 > 0, we conclude that
Example 3.15
Remark 3.2
Let f : [a, b] → R be a function defined on [a, b], and let x1 , . . . , xn be
points in (a, b) such that the following conditions are satisfied.
Using the same reasoning as in Example 3.15, one can prove that f is
strictly increasing on [a, b].
relax the strict monotonicity to monotonicity, we will find that f ′ (x) ≥ 0 for all
x ∈ (a, b) is sufficient and necessary for f to be increasing.
Theorem 3.16
Given that the function f : [a, b] → R is continuous on [a, b], and
differentiable on (a, b).
Proof
Again, let us consider the first statement. If f ′ (x) ≥ 0 for all x ∈ (a, b),
the proof that f is increasing is almost verbatim the proof in Theorem 3.15,
with > replaced by ≥. For the converse, if f is increasing on [a, b], we want
to show that f ′ (x0 ) ≥ 0 for any x0 in (a, b). This follows from the fact that
f (x) − f (x0 )
≥0
x − x0
for any x in (a, b)\{0} since f is increasing. Taking limit gives f ′ (x0 ) ≥ 0.
Example 3.16
Consider the function f : R → R defined by
x
f (x) = .
x2 +1
Find the local maximum value and the local minimum value of f , and find
the range of the function f .
Solution
Since f is a rational function, it is differentiable, and
x
Figure 3.9: The function f (x) = .
x2 +1
Take ε to be the positive number f ′′ (x0 )/2. The definition of limit implies
that there is a δ > 0 such that (x0 − δ, x0 + δ) ⊂ (a, b), and for all the points
x in (x0 − δ, x0 ) ∪ (x0 , x0 + δ),
f ′ (x) f ′′ (x0 )
− f ′′ (x0 ) < .
x − x0 2
Combining together, we find that f (x) ≥ f (x0 ) for all x in (x0 − δ, x0 + δ).
This proves that x0 is a local minimizer of f .
Chapter 3. Differentiating Functions of a Single Variable 188
Example 3.17
For the function f (x) considered in Example 3.16, we have shown that the
stationary points are x = −1 and x = 1. A tedious computation gives
′′ 2x(x2 − 3)
f (x) = .
(x2 + 1)3
Hence,
1 1
f ′′ (−1) = , f ′′ (1) = − .
2 2
The second derivative test can then be used to conclude that x = −1 is a
local minimizer, and x = 1 is a local maximizer.
Proof
The proof is an again an application of the extreme value theorem.
Without loss of generality, assume that f+′ (a) < w < f−′ (b). Since f :
[a, b] → R is a differentiable function, it is continuous. Define the function
g : [a, b] → R by
g(x) = f (x) − wx.
Chapter 3. Differentiating Functions of a Single Variable 189
g ′ (x) = f ′ (x) − w.
′
g+ (a) = f+′ (a) − w < 0, ′
g− (b) = f−′ (b) − w > 0.
By definition,
′ g(x) − g(a)
g+ (a) = lim+ .
x→a x−a
′
Taking ε to be the positive number −g+ (a)/2, we find that there is a δ > 0
such that δ ≤ b − a, and for all x ∈ (a, a + δ),
Remark 3.3
As a consequence of the Darboux’s theorem, we find that if a function f :
(a, b) → R is differentiable and f ′ (x) ̸= 0 for any x ∈ (a, b), then either
f ′ (x) > 0 for all x ∈ (a, b), or f ′ (x) < 0 for all x ∈ (a, b). In any case, this
means that such a function must be strictly monotonic.
Exercises 3.3
Question 1
Given that the function f : [−5, 8] → R is continuous on [−5, 8],
differentiable on (−5, 8), and −4 < f ′ (x) < 4 for all x in (−5, 8). If
f (0) = 2, find a range for the values of f (x).
Question 2
Show that the function f : R → R,
x3
f (x) =
x2 + 1
is strictly increasing, and find the range of the function.
Question 3
Show that the equation
x5 + x + 32 = 0
has exactly one real solution.
Question 4
Find the number of real solutions of the equation
32x
= 1.
x4+ 16
Question 5
Let n be a nonnegative integer, and let f : (a, b) → R be a differentiable
function. If the equation f ′ (x) = 0 has n distinct real roots in the interval
(a, b), show that the equation f (x) = 0 has at most (n + 1) distinct real
roots in the interval (a, b).
Chapter 3. Differentiating Functions of a Single Variable 192
Question 6
Consider the function f : R → R defined by
x+1
f (x) = .
x2 + 15
(a) Find the local maximum value and the local minimum value of f .
Question 7
Let f : [a, b] → R be a function such that the limit
f (x) − f (b)
L = lim−
x→b x−b
exists. If L > 0, show that there is a δ > 0 such that δ ≤ b − a and for all
x ∈ (b − δ, b),
f (x) < f (b).
Question 8
Let f : (a, b) → R be a differentiable function. Suppose that f ′ : (a, b) →
R is monotonic, show that f ′ : (a, b) → R is continuous.
Chapter 3. Differentiating Functions of a Single Variable 193
In previous section, we have seen that the mean value theorem is very useful in
analysing the behavior of a differentiable function. For future applications, we
will often quote it in the following form.
To see this, let x1 = x0 + h. If h = 0, (3.4) is obviously true for any c in (0, 1).
If h ̸= 0, then when c runs through all values from 0 to 1, x0 + ch runs through
all points in the open interval I with x0 and x1 as endpoints. Thus, (3.4) says that
for some u in the open intefval I, which is precisely the statement of the mean
value theorem.
When finding limits of functions, we often encounter situations like
f (x)
lim
x→x0 g(x)
where both lim f (x) and lim g(x) are zero. For example, let
x→x0 x→x0
Then
lim f (x) = f (1) = 0 and lim g(x) = g(1) = 0.
x→1 x→1
Observe that
f (x) f (1 + h)
lim = lim .
x→1 g(x) h→0 g(1 + h)
Chapter 3. Differentiating Functions of a Single Variable 194
Since we are only interested in the limit when x approaches 1, we are prone to use
the mean value theorem in the form (3.4) and conclude that there are c1 and c2 in
(0, 1) such that
f (x) f (1 + h) − f (1) f ′ (1 + c1 h)
lim = lim = lim ′ . (3.5)
x→1 g(x) h→0 g(1 + h) − g(1) h→0 g (1 + c2 h)
For the functions f and g that we consider above, f ′ and g ′ are both continuous at
x = 1 and g ′ (1) ̸= 0. Hence, we find that
f ′ (1 + c1 h) f ′ (1)
lim = ′ .
h→0 g ′ (1 + c2 h) g (1)
For general differentiable functions f (x) and g(x) with f (1) = g(1) = 0, if we
do not assume that f ′ and g ′ are continuous, we cannot conclude the limit from
(3.5) since c1 and c2 are in general different functions of h.
In this section, we are going to prove a generalization of the mean value
theorem, called the Cauchy mean value theorem, which ensures that we can have
the same value for c1 and c2 .
Notice that when g(x) = x, we have the Lagrange’s mean value theorem.
Chapter 3. Differentiating Functions of a Single Variable 195
Proof
The proof uses the same idea as the proof of Lagrange’s mean value
theorem, with the function g(x) = x replaced by a general g(x). By
Remark 3.3, the condition g ′ (x) ̸= 0 for all x ∈ (a, b) implies that g is
strictly monotonic. Hence, g(a) ̸= g(b).
Define the function h : [a, b] → R by
which gives
f (b) − f (a)
m= .
b−a
Again, the function h : [a, b] → R is continuous on [a, b], differentiable on
(a, b), and satisfies h(a) = h(b). By Rolle’s theorem, there is a point x0 in
(a, b) such that h′ (x0 ) = 0. For this x0 ,
f ′ (x0 ) − mg ′ (x0 ) = 0.
Example 3.18
f (7) − f (1)
2c1 = f ′ (c1 ) = = 8,
7−1
and
g(7) − g(1)
3c22 − 18c2 = g ′ (c2 ) = = −15.
7−1
Chapter 3. Differentiating Functions of a Single Variable 196
Theorem 3.20
Let n be a positive integer, and let (a, b) be an open interval that contains
the point x0 . If the function f : (a, b) → R is n times differentiable, and
hn (n)
f (x) = f (x0 + ch), where h = x − x0 . (3.6)
n!
Proof
We apply the Cauchy mean value theorem n times to the given function
f : (a, b) → R and the function g : (a, b) → R defined by g(x) = (x−x0 )n .
Notice that g is also n times differentiable,
and
g (n) (x) = n! for all x ∈ (a, b).
Since f (x0 ) = 0, eq. (3.6) obviously holds for x = x0 with any c ∈ (0, 1).
So we only need to consider a point x = x1 in (a, b) \ {x0 }. First assume
that x1 > x0 . For any 1 ≤ k ≤ n, g (k) (x) ̸= 0 for any x ∈ (x0 , x1 ). Thus
we can apply the Cauchy mean value theorem for the pairs (f, g), (f ′ , g ′ ),
. . ., (f (n−1) , g (n−1) ) over the interval (x0 , x1 ).
Chapter 3. Differentiating Functions of a Single Variable 197
Since f (x0 ) = g(x0 ) = 0, Cauchy mean value theorem implies that there
exists a point u1 in (x0 , x1 ) such that
Continue with this n times, we find that there are points u1 , . . . , un such
that x0 < un < un−1 < · · · < u1 < x1 , and
Example 3.19
Let x0 be a point in the interval (a, b), and assume that the function f :
(a, b) → R is twice continuously differentiable. Prove that
Solution
Let r = min{x0 − a, b − x0 }. Then r > 0 and (x0 − r, x0 + r) ⊂ (a, b).
Define the function g : (−r, r) → R by
h2 ′′
g(h) = g (c(h)h).
2
Hence,
g(h) f ′′ (x0 + c(h)h) + f ′′ (x0 − c(h)h)
= .
h2 2
Now since c(h) ∈ (0, 1),
|c(h)h| ≤ |h|.
Therefore,
lim c(h)h = 0.
h→0
Since f ′′ is continuous,
Exercises 3.4
Question 1
Given that p(x) is a polynomial of degree at most 5, and
p(1) = p(1) (1) = p(2) (1) = p(3) (1) = p(4) (1) = 0, p(5) (1) = 1200.
Question 2
Let (a, b) be an interval that contains the point x0 . Given that the function
f : (a, b) → R is three times continuously differentiable, find the limit
Up to now we have only dealt with algebraic functions, which are functions that
can be obtained by performing algebraic operations of addition, multiplication,
division and taking roots on polynomials. In this section, we introduce other
useful elementary functions – the class of transcendental functions which includes
exponential, logarithmic and trigonometric functions. These functions have been
introduced in a pre-calculus course, but not rigorously.
In this section, we are going to define these functions and derive their properties
using calculus. Everything would be done rigorously using the analytic tools that
we have developed so far, except for an existence theorem that we are going to
prove in Chapter 4.
Let us first state this existence theorem.
Let (a, b) be an open interval that contains the point x0 , and let y0 be any
real number. Given that f : (a, b) → R is a continuous function, there
exists a unique differentiable function F : (a, b) → R such that
Hence, only the existence part of Theorem 3.21 is pending a proof. The uniqueness
follows from what we have discussed.
It is easy to check that for any integer n that is not equal to −1, an antiderivative
of the function f (x) = xn is the function
xn+1
F (x) = .
n+1
So far we haven’t seen any algebraic function whose antiderivative is equal to
1
f (x) = . We define one such function and call it the natural logarithm function.
x
Definition 3.10 The Natural Logarithm Function
1
Since g : (0, ∞) → R, g(x) = is a continuous function, the existence and
x
uniqueness of the function f (x) = ln x is guaranteed by Theorem 3.21.
The following gives some useful properties of the natural logarithmic function.
Chapter 3. Differentiating Functions of a Single Variable 202
(a) ln(xy) = ln x + ln y
x
(b) ln = ln x − ln y
y
(c) ln xr = r ln x
Proof
To prove (a), we fixed y > 0 and define the function f : (0, ∞) → R by
f (x) = ln(xy) − ln y.
1 rxr−1 1
f ′ (x) = × r
= .
r x x
This allows us to conclude that f (x) = ln x, and (c) is thus proved.
From part (b) of Proposition 3.22, we find that for any x > 0,
1
ln = ln x−1 = − ln x;
x
and if n is a positive integer,
ln xn = n ln x.
Proposition 3.23
d
(c) exp(x) = exp(x).
dx
Chapter 3. Differentiating Functions of a Single Variable 205
Proof
(a) and (b) are obvious from the corresponding properties of ln x. For part
(c), we employ the derivative formula for inverse function. To make it less
confusing, let y = exp(x). Then
ln y = x.
Proof
Let u = exp(x) and v = exp(y). Then u and v are positive numbers and
x = ln u, y = ln v.
By Proposition 3.22,
ln(uv) = ln u + ln v = x + y.
Chapter 3. Differentiating Functions of a Single Variable 206
Therefore,
exp(x) exp(y) = uv = exp(x + y).
Part (b) is proved in the same way. For part (c), Proposition 3.22 implies
that
ln(ur ) = r ln u = rx.
Therefore,
exp(x)r = ur = exp(rx).
Notice that part (c) says that for any postive number u, and any rational number
r,
ur = exp(r ln u).
We can use this to define power functions with irrational powers.
xr = exp(r ln x).
We have seen that this definition coincides with the old definition when r is
a rational number. For r > 0, since ln x → −∞ as x → 0+ , r ln x → −∞
as x → 0+ . Since exp(x) → 0 as x → −∞, we conclude that xr → 0 as
x → 0+ . Therefore, the definition f (0) = 0 makes the function f (x) = xr
continuous. Using the fact that exp(x) and ln x are inverses of each other, we
have the following.
ln(xr ) = r ln x.
Since both ln x and exp(x) are strictly increasing functions, it is easy to deduce
Chapter 3. Differentiating Functions of a Single Variable 207
√ √
Figure 3.12: (a) The function y = x 2 . (b) The function y = x− 2 .
the following.
Proposition 3.26
For any positive numbers x and y, and any real numbers r and s,
(a) (xy)r = xr y r
r
x xr
(b) = r
y y
(c) xr+s = xr xs
xr
(d) xr−s =
xs
(e) (xr )s = xrs
Chapter 3. Differentiating Functions of a Single Variable 208
Proof
For part (a), we have
Proposition 3.27
For any real number r and any positive number x,
d r
x = rxr−1 .
dx
Proof
This follows from straightforward computation.
d r d d r
x = exp (r ln x) = exp(r ln x) (r ln x) = xr × = rxr−1 .
dx dx dx x
in Chapter 1.
Chapter 3. Differentiating Functions of a Single Variable 209
Theorem 3.28
We have n
1
exp(1) = lim 1+ = e.
n→∞ n
This imples that
ln e = 1.
Proof
We consider the differentiable function g(x) = ln(1 + x), x > −1, whose
derivative is
1
g ′ (x) = .
1+x
By definition of derivative,
By definition,
n
1 1
exp n ln 1 + = 1+ .
n n
ax = exp (x ln a) , x ∈ R.
ex = exp(x).
Proposition 3.29
Let a be a positive number.
Proposition 3.30
Proposition 3.31
Let a be a positive number such that a ̸= 1. For any real numbers x and y,
1. ax+y = ax ay
x−y ax
2. a = y
a
3. (ax )y = axy
Chapter 3. Differentiating Functions of a Single Variable 211
7π 3π
Figure 3.15: The definitions of sin θ and cos θ for (a) θ = and (b) θ = .
3 4
the definitions, it is obvious that sin θ and cos θ are periodic functions of periodic
2π.
From the definitions, it is quite obvious that sin θ and cos θ are continuous
functions. A rigorous proof is tedious. To show that these two functions are
Chapter 3. Differentiating Functions of a Single Variable 213
Again, the proof of the existence requires knowledge from later chapters. We
will prove uniqueness here. We begin by a lemma that will be useful later.
Lemma 3.33
Let f : R → R be a twice differentiable function that satisfies
f ′′ (x) + f (x) = 0.
1. f is infinitely differentiable.
2. For any positive integer n, the nth derivative of f , g(x) = f (n) (x),
satisfies
g ′′ (x) + g(x) = 0.
Proof
Since f is twice differentiable, f is continuous and differentiable. Since
f ′′ (x) = −f (x), f ′′ is continuous and differentiable. This implies that f is
three times differentiable and f ′′′ = −f ′ . Continue arguing in this way, we
find that f is infinitely differentiable, and for any nonengative integer n,
g ′′ (x) + g(x) = 0.
These prove the first and second statements. For the third statement, we
notice that
d
f ′ (x)2 + f (x)2 = f ′ (x)f ′′ (x) + f (x)f ′ (x)
dx
= 2f ′ (x) (f ′′ (x) + f (x)) = 0.
f ′ (x)2 + f (x)2 = C.
f ′ (x)2 + f (x)2 = 0.
Chapter 3. Differentiating Functions of a Single Variable 215
Notice that once we prove the existence of the function S(x) = sin x, then the
function C(x) = cos x exists. One can then check that the function
In other words, to prove the existence part in Theorem 3.32, we only need to
establish the existence of the function S(x) = sin x.
In the following, we establish the properties of the functions S(x) and C(x).
Theorem 3.34
The functions S(x) and C(x) are infinitely differentiable functions that
satisfy the following.
Proof
S ′ (x) = C(x) is by the definition of C(x). Differentiating gives C ′ (x) =
S ′′ (x) = −S(x). To prove (b), one check that the function f (x) = −S(−x)
satisfies f ′′ (x) + f (x) = 0, f (0) = 0 and f ′ (0) = 1. By uniquess of the
function S(x), we have f (x) = S(x), which proves that S(x) is an odd
function. Since C(x) = S ′ (x), C(x) is an even function. Lemma 3.33 says
that S(x)2 + S ′ (x)2 is a constant. Hence, there is a constant A such that
Setting x = 0 gives A = 1. This proves part (c). For part (d), fixed a real
number y and consider the function
We find that
f ′ (x) = S ′ (x + y) = C(x + y),
f ′′ (x) + f (x) = S ′′ (x + y) + S(x + y) = 0,
and
f (0) = S(y), f ′ (0) = C(y).
Since the function g(x) = S(y)C(x) + C(y)S(x) satisfies
Here we have used advanced analytic tools to prove the identities (3.9) in a
simple way. Part (c) in Theorem 3.34 says that
Theorem 3.35
There is a smallest positive number u such that C(u) = 0.
Chapter 3. Differentiating Functions of a Single Variable 218
Proof
Since S(x) is differentiable, we can apply mean value theorem to conclude
that there is a point v in (0, 2) such that
S(2) − S(0)
= S ′ (v) = C(v).
2−0
This gives
1 1
|C(v)| = |S(2)| ≤ .
2 2
By part (e) and part (c) in Theorem 3.34,
1
C(2v) = C(v)2 − S(v)2 = 2C(v)2 − 1 ≤ − 1 < 0.
2
Since C(2v) < 0 < C(0), and C(x) is a continuous function, intermediate
value theorem implies that there is a point w in (0, 2v) such that C(w) = 0.
Let
A = {w > 0 | C(w) = 0.} .
We have just shown that A is a nonempty set. By definition, A is bounded
below by 0. Hence, u = inf A exists. By Lemma 1.34, there is a sequence
{wn } in A that converges to u. Since C(x) is continuous, the sequence
{C(wn )} converges to C(u). But C(wn ) = 0 for all n. Hence, C(u) = 0.
Since C(0) = 1, u ̸= 0. Hence, u > 0. This proves that u is the smallest
positive number such that C(u) = 0.
Let u be the smallest positive number such that C(u) = 0. Then we must have
C(x) > 0 for all x ∈ [0, u). Since S ′ (x) = C(x), S(x) is strictly increasing on
[0, u]. Thus, S(x) > 0 for all x ∈ (0, u]. This, and S(u)2 + C(u)2 = 1, implies
that S(u) = 1. From part (d) and part (e) in Theorem 3.34, we find that
It follows that
The last pair of equations show that S(x) and C(x) are periodic functions of
period 4u. Since S(x) > 0 and C(x) > 0 for x ∈ (0, u), we have the following.
x = C(θ), y = S(θ).
What is not obvious is that this θ is exactly the radian of the angle that the line
segment OP makes with the positive x-axis. To show this, we can argue in the
following way. Assume that an object is travelling on the circle x2 + y 2 = 1, and
its position at time t is (x(t), y(t)), where
x = C(t), y = S(t).
It follows that the velocity of the object at time t is (x′ (t), y ′ (t)), where
2u = π.
There are four other trigonometric functions. They are defined in terms of
sin x and cos x in the usual way.
Chapter 3. Differentiating Functions of a Single Variable 221
Proposition 3.36
tan x, cot x, sec x and csc x are infinitely differentiable functions with
d d
tan x = sec2 x, cot x = − csc2 x,
dx dx
d d
sec x = sec x tan x, csc x = − csc x cot x.
dx dx
Chapter 3. Differentiating Functions of a Single Variable 222
Before closing this subsection, we want to prove some important limits and
inequalities for the function sin x.
Theorem 3.37
Proof
When x = 0, sin x = 0 and | sin x| ≤ |x| is obviously true. If x ̸= 0, mean
value theorem implies that there is a number c in (0, 1) such that
sin x sin x − sin 0
= = cos(cx). (3.12)
x x−0
Hence,
sin x
= | cos(cx)| ≤ 1,
x
which implies that | sin x| ≤ |x|. This proves the first statement.
Chapter 3. Differentiating Functions of a Single Variable 223
sin x , π
if 0 < x ≤ ,
g(x) = x 2
1, if x = 0.
As before, for each x ∈ (0, π2 ], mean value theorem implies that there is a
u ∈ (0, x) such that
sin x
= cos u.
x
Since 0 < u < x and the cosine function is strictly decreasing on (0, π2 ),
we find that
′ 1 sin x 1
g (x) = cos x − = (cos x − cos u) < 0.
x x x
2 sin x
≤ ≤ 1.
π x
Thus, for all x ∈ [0, π/2],
2
x ≤ sin x ≤ x.
π
In this section, we are going to define inverse functions for sin x, cos x and tan x.
Since trigonometric functions are periodic functions, they are not one-to-one.
Chapter 3. Differentiating Functions of a Single Variable 224
Hence, we cannot find their inverses over the whole domain of their definitions.
However, we can restrict each of their domains to an interval on which each of
them is one-to-one to define the inverse. Such interval should contain the interval
(0, π/2) which is where these functions are classically defined.
• The largest interval that contains the interval (0, π/2) and on which sin x is
one-to-one is [− π2 , π2 ].
• The largest interval that contains the interval (0, π/2) and on which cos x is
one-to-one is [0, π].
• The largest interval that contains the interval (0, π/2) and on which tan x is
one-to-one is (− π2 , π2 ).
Theorem 3.38
sin−1 : (−1, 1) → R is a differentiable function with
d 1
sin−1 x = √ .
dx 1 − x2
Theorem 3.39
cos−1 : (−1, 1) → R is a differentiable function with
d 1
cos−1 x = − √ .
dx 1 − x2
Theorem 3.40
tan−1 : R → R is a differentiable function with
d 1
tan−1 x = .
dx 1 + x2
Chapter 3. Differentiating Functions of a Single Variable 226
Exercises 3.5
Question 1
Determine the following limits.
n
1
(a) lim 1 −
n→∞ n
n
2
(b) lim 1 +
n→∞ n
n
2
(c) lim 1 −
n→∞ n
Question 2
For any x ∈ [−1, 1], show that sin−1 x + cos−1 x is a constant and find this
constant.
Question 3
Determine the following limits.
(a) lim
π−
tan x
x→ 2
Question 4
Consider the function f : R → R defined by
sin 1 ,
if x ̸= 0,
f (x) = x
0,
if x = 0.
Question 5
Consider the function f : R → R defined by
x sin 1 ,
if x ̸= 0,
f (x) = x
0,
if x = 0.
Question 6
Consider the function f : R → R defined by
x2 sin 1 ,
if x ̸= 0,
f (x) = x
0,
if x = 0.
g(x) = x + f (x).
(c) Show that g ′ (0) = 1, but for any neighbourhood (a, b) of 0, g : (a, b) →
R is not increasing.
Chapter 3. Differentiating Functions of a Single Variable 228
In this section, we will apply the Cauchy mean value theorem to prove the l’
Hôpital’s rules. The latter are useful rules for finding limits of the form
f (x)
lim ,
x→x0 g(x)
Theorem 3.41
Let f : (a, b) → R and g : (a, b) → R be differentiable functions that
satisfy the following conditions.
f ′ (x)
(ii) lim+ = L.
x→a g ′ (x)
f (x)
Then lim+ = L.
x→a g(x)
Proof
The condition (i) implies that we can extend f and g to be continuous
functions on [a, b) by defining f (a) = g(a) = 0. Then by Cauchy mean
value theorem, for any x ∈ (a, b), there is a point u(x) ∈ (a, x) such that
Since
a < u(x) < x,
squeeze theorem implies that
lim u(x) = a.
x→a+
f ′ (u(x)) f ′ (u)
lim+ = lim = L.
x→a g ′ (u(x)) u→a+ g ′ (u)
f (x)
lim+ = L.
x→a g(x)
It is easy to see that an analogue of Theorem 3.41 holds for left limits. Combine
the left limit and the right limit, we have the following.
Let x0 be a point in the open interval (a, b), and let D = (a, b)\{x0 }. Given
that f : D → R and g : D → R are diferentiable functions that satisfy the
following conditions.
f ′ (x)
(ii) lim = L.
x→x0 g ′ (x)
f (x)
Then we have lim = L.
x→x0 g(x)
Example 3.20
Determine the limit
x20 + 2x9 − 3
lim
x→1 x7 − 1
if it exists.
Solution
Let f (x) = x20 + 2x9 − 3 and g(x) = x7 − 1. Then
Since
f ′ (x) 20x19 + 18x8 38
lim ′
= lim 6
= ,
x→1 g (x) x→1 7x 7
l’ Hôpital’s rule implies that
x20 + 2x9 − 3 38
lim 7
= .
x→1 x −1 7
Example 3.21
Determine whether the limit exists. If it exists, find the limit.
ex − 1 − x
(a) lim
x→0 x2
sin 2x
(b) lim
x→0 3x
cos 2x − 1
(c) lim
x→0 x2
Chapter 3. Differentiating Functions of a Single Variable 231
Solution
(a) This is a limit of the form 0/0. Applying l’ Hôpital’s rule, we have
ex − 1 − x ex − 1
lim = lim .
x→0 x2 x→0 2x
Again, we have a limit of the form 0/0. Applying l’ Hôpital’s rule
again, we have
ex − 1 − x ex − 1 ex 1
lim = lim = lim = .
x→0 x2 x→0 2x x→0 2 2
(b) This is a limit of the form 0/0. Apply l’ Hôpital’s rule, we have
sin 2x 2 cos 2x 2
lim = lim = .
x→0 3x x→0 3 3
(c) This is a limit of the form 0/0. Applying l’ Hôpital’s rule twice, we
have
cos 2x − 1 −2 sin 2x −4 cos 2x
lim 2
= lim = lim = −2.
x→0 x x→0 2x x→0 2
Example 3.22
Since f is continuous, we have
Since we also have lim (2h) = 0, applying l’ Hôpital’s rule again give
h→0
f ′′ (x0 + h) + f ′′ (x0 − h)
lim = f ′′ (x0 ).
h→0 2
These prove that
f ′ (x)
(ii) lim = L.
x→∞ g ′ (x)
f (x)
Then we have lim = L.
x→∞ g(x)
Chapter 3. Differentiating Functions of a Single Variable 233
Proof
Let b = 1/a, and define the functions f1 : (0, b) → R and g1 : (0, b) → R
by
1 1
f1 (x) = f , g1 (x) = g .
x x
Then f1 and g1 are differentiable functions and
′ 1 ′ 1 ′ 1 ′ 1
f1 (x) = − 2 f , g1 (x) = − 2 g .
x x x x
Moreover,
and
f′ 1
f ′ (x) f ′ (x)
lim+ 1′ = lim+ ′ x
1
= lim = L.
x→0 g1 (x) x→0 g x→∞ g ′ (x)
x
By Theorem 3.41,
f1 (x)
lim+ = L.
x→0 g1 (x)
This implies that
f (x)
lim = L.
x→∞ g(x)
Example 3.23
x+1
x
Determine whether the limit lim exists. If it exists, find the
x→∞ x+2
limit.
This is not of the type 0/0. But the logarithm of it can be turned into that form.
Chapter 3. Differentiating Functions of a Single Variable 234
Solution
Consider the function
x+1
x x
g(x) = ln = (x + 1) ln .
x+2 x+2
This is a limit of the form ∞/∞. One may say that we can turn it to a limit of the
form 0/0 by writing
x e−x
= .
ex x−1
Then l’ Hôpital’s rule says that if the limit
e−x
lim (3.15)
x→∞ −x−2
Chapter 3. Differentiating Functions of a Single Variable 235
exists and is equal to L, the limit (3.14) also exists and is equal to L. However, the
limit (3.15) is more complicated than the limit (3.14). So this strategy is useless.
Hence, there is a need for us to consider the ∞/∞ indeterminate case. We only
prove the theorem in the case x0 is finite. The case where x0 is infinite can be
dealt with in the same way as in the proof of Theorem 3.43.
Let x0 be a point in the open interval (a, b), and let D be the set D =
(a, b) \ {x0 }. Given that f : D → R and g : D → R are diferentiable
functions that satisfy the following conditions.
f ′ (x)
(ii) lim = L.
x→x0 g ′ (x)
f (x)
Then we have lim = L.
x→x0 g(x)
The proof of this theorem is technical because of the infinite limits. The
strategegy to rewrite this as
1/g(x)
lim
x→x0 1/f (x)
is not useful, as have been demonstrated in our discussion before this theorem.
Proof
f (x)
We will prove that the right limit lim+ is equal to L. The proof that
x→x0 g(x)
the left limit is equal to L is similar. Observe that if we fix a point u in
(x0 , b), then for any x in (x0 , u), Cauchy mean value theorem asserts that
there is a cx in (x, u) such that
Thus,
f ′ (cx )
f (x) g(x) − g(u) f (u) − Lg(u)
−L= − L + . (3.16)
g(x) g(x) g ′ (cx ) g(x)
f ′ (x)
lim+ = L,
x→x0 g ′ (x)
there exsits a δ1 > 0 such that (x0 , x0 + δ1 ) ⊂ (a, b), and for any x ∈
(x0 , x0 + δ1 ),
f ′ (x) ε
′
−L < .
g (x) 3
Take u = x0 + δ1 /2. Since lim g(x) = ∞, we find that
x→x0
Therefore, there exists a number δ such that 0 < δ ≤ δ1 /2, and for all
x ∈ (x0 , x0 + δ),
If x is in (x0 , x0 + δ), x0 < x < u and hence x0 < cx < u < x0 + δ1 . This
implies that
f ′ (cx ) ε
′
−L < .
g (cx ) 3
Eq. (3.16) then implies that for all x ∈ (x0 , x0 + δ),
Notice that in the proof, we do not use the assumption that lim f (x) = ∞.
x→x0
Hence, this can be ommited from the conditions in the theorem. If f (x) is bounded
in a neighbourhood of x0 , there is no need to apply l’ Hôpital’s rule.
Let us now look at some examples.
Example 3.24
Let r be a positive number. Prove that
ln x
lim = 0.
x→∞ xr
lim xs e−x = 0.
x→∞
Solution
The limit
ln x
lim
x→∞ xr
is of the form ∞/∞. Apply l’ Hôpital’s rule, we have
1
ln x 1 1
lim r = lim xr−1 = lim r = 0.
x→∞ x x→∞ rx r x→∞ x
Since lim ex = ∞, and the function f (x) = xs is a continuous function,
x→∞
we find that
(ln u)s
lim xs e−x = lim
x→∞ u→∞ u
s
ln u
= lim 1/s
u→∞ u
= 0s = 0.
Chapter 3. Differentiating Functions of a Single Variable 238
Example 3.25
Show that there exists a number c so that the function
x x , if x > 0,
f (x) =
c, if x = 0,
is continuous.
Solution
Since
f (x) = xx = ex ln x when x > 0,
f (x) is continuous on (0, ∞). To make f continuous, f must be continuous
at x = 0. This means
Exercises 3.6
Question 1
Determine whether the limit exists. If it exists, find the limit.
x100 + x50 − 2
(a) lim
x→1 3x101 + 4x55 − 7x
2e−x − 2 + 2x
(b) lim
x→0 x2 + 3x3
tan−1 x
(c) lim
x→0 x
tan x − x
(d) lim
x→0 x3
Question 2
Find the limit 2x+1
3x − 1
lim .
x→∞ 3x + 1
Question 3
Let r be a positive number. Prove that
lim+ xr ln x = 0.
x→0
Question 4
Determine whether the limit exists. If it exists, find the limit.
xx − 1
(a) lim
x→1 x − 1
xx − 1
(b) lim+
x→0 x ln x
xx ln(1 + x)
(c) lim+
x→0 x
Chapter 3. Differentiating Functions of a Single Variable 240
runs through all points in the interval [x1 , x2 ]. We say that a subset S of R is
convex if and only if for any two points x1 and x2 in S, and for any t in [0, 1], the
point (1 − t)x1 + tx2 is also in S. We have proved that a subset of R is convex if
and only if it is an interval.
Hence, (x0 , g(x0 )) is point on the line L. Therefore, a function y = f (x) is strictly
concave up if its graph is always below a line segment joining two points on the
graph; and it is strictly concave down if its graph is always above a line segment
joining two points on the graph.
Figure 3.19: (a) A strictly concave up function. (b) A strictly concave down
function.
Example 3.26
Example 3.27
Solution
Let x1 and x2 be any two distinct real numbers, and let t be a number in the
interval (0, 1). Then
We say that the graph of f is strictly above the tangent line at x = x0 except at the
tangential point provided that
Similarly, one can define what it means for the graph of f to be below a tangent
line, or strictly below.
Chapter 3. Differentiating Functions of a Single Variable 243
Theorem 3.45
Let f : [a, b] → R be a function that is continuous on [a, b], and
differentiable on (a, b). The following three conditions are equivalent.
(b) The graph of f is strictly above every tangent line except at the
tangential point.
Proof
First we prove (a) =⇒ (b). Take any x0 ∈ (a, b). The equation of the
tangent line at x = x0 is
If x ∈ [a, b],
Therefore
f (x) − g(x) = (x − x0 )(f ′ (u) − f ′ (x0 )).
If a ≤ x < x0 , u < x0 and so f ′ (u) < f ′ (x0 ). This implies that f (x) −
g(x) > 0. If x > x0 , u > x0 and so f ′ (u) > f ′ (x0 ). Then we also have
f (x) − g(x) > 0. In other words, we have proved that for any x0 in (a, b),
for any x ∈ [a, b] \ {x0 },
This proves that the graph of f is strictly above every tangent line except at
the tangential point.
Next, we prove (b) =⇒ (c). Given x1 and x2 in [a, b] with x1 < x2 , and
t ∈ (0, 1), let x0 = (1 − t)x1 + tx2 . Then x1 < x0 < x2 , and
x1 − x0 = −t(x2 − x1 ), x2 − x0 = (1 − t)(x2 − x1 ).
By assumption,
Letting x → x+
1 in (3.17), we find that
f (x2 ) − f (x1 )
f ′ (x1 ) ≤ .
x2 − x1
Chapter 3. Differentiating Functions of a Single Variable 245
Letting x → x−
2 in (3.17), we find that
f (x2 ) − f (x1 )
f ′ (x2 ) ≥ .
x2 − x1
These prove that
f ′ (x2 ) ≥ f ′ (x1 ).
In other words, we have proved that f ′ is increasing on (a, b). If f ′ is not
strictly increasing on (a, b), there exist x1 and x2 in (a, b) with x1 < x2 but
f ′ (x1 ) = f ′ (x2 ). Since f ′ is increasing, we will have f ′ (x) = f ′ (x1 ) =
f ′ (x2 ) = m for all x ∈ (x1 , x2 ). This implies that
Theorem 3.46
Let f : [a, b] → R be a function that is continuous on [a, b], and
differentiable on (a, b). The following three conditions are equivalent.
(b) The graph of f is strictly below every tangent line except at the
tangential point.
In Theorem 3.45, if we relax the strictness, the proofs are actually easier.
Chapter 3. Differentiating Functions of a Single Variable 246
Theorem 3.47
Let f : [a, b] → R be a function that is continuous on [a, b], and
differentiable on (a, b). The following three conditions are equivalent.
Theorem 3.48
Let f : [a, b] → R be a function that is continuous on [a, b], and
differentiable on (a, b). The following three conditions are equivalent.
Theorem 3.49
Let f : [a, b] → R be a function that is continuous on [a, b], and twice
differentiable on (a, b).
2. f (x) is concave down if and only if f ′′ (x) ≤ 0 for all x ∈ (a, b).
3. If f ′′ (x) > 0 for all x ∈ (a, b), then f is strictly concave up.
4. If f ′′ (x) < 0 for all x ∈ (a, b), then f is strictly concave down.
Chapter 3. Differentiating Functions of a Single Variable 247
Proof
For (a) and (b), this is just the fact that f ′′ (x) ≥ 0 for all x ∈ (a, b) if and
only if f ′ is increasing; f ′′ (x) ≤ 0 for all x ∈ (a, b) if and only if f ′ is
decreasing.
For (c) and (d), we note that f ′′ (x) > 0 for all x ∈ (a, b) implies that f ′ is
strictly increasing on (a, b); while f ′′ (x) < 0 for all x ∈ (a, b) implies that
f ′ is strictly decreasing on (a, b)
Example 3.28
Solution
′′
Since f (x) = − sin x < 0 for all x ∈ (0, π), we find that f : [0, π] → R,
f (x) = sin x is strictly concave down.
Example 3.29
Solution
Notice that since p and q are positive, we have 1/p < 1 and 1/q < 1. This
implies that p > 1 and q > 1. Consider the function f : (0, ∞) → R,
f (x) = ln x. We find that
1 1
f ′ (x) = , f ′′ (x) = − <0 for all x > 0.
x x2
Hence, the function f : (0, ∞) → R, f (x) = ln x is strictly concave down.
This implies that for any two positive numbers x1 and x2 , and any t ∈ (0, 1),
The equality can hold if and only if x1 = x2 . Now, let t = 1/q. Then
t ∈ (0, 1) and 1 − t = 1/p. Given the positive numbers a and b, let
x 1 = ap , x2 = bq .
Exercises 3.7
Question 1
(a) Show that the function f : R → R, f (x) = e−x is strictly concave up.
1
(b) Show that the function f : (0, π) → R, f (x) = is strictly concave
sin x
up.
Question 2
Let f : (a, b) → R be a twice differentiable function. If f : (a, b) → R
is concave down, and f (x) > 0 for all x ∈ (a, b), prove that the function
g : (a, b) → R,
1
g(x) =
f (x)
is concave up.
Question 3
Given that the function f : [a, b] → R is concave down. Show that for any
x1 , x2 , . . . , xn in [a, b], if t1 , t2 , . . . , tn are nonnegative numbers satifying
t1 + t2 + . . . + tn = 1,
then
Chapter 4
The concept of integrals arises naturally when one wants to compute the area
bounded by a curve, such as the area of a circle. Since the ancient time, our
ancestors have found a good strategy to deal with such problems. For example,
they used the area of polygons to approximate the area of a circle. The circle is
partitioned into sectors, and the area of each sector is approximated by the area
of the inscribed triangle (see Figure 4.1). When the circle is partitioned into more
sectors, better approximation is obtained.
The same idea can be used to find the area enclosed by any curves. This
motivated the definition of integrals. For curves defined by continuous functions,
it is not difficult to formulate a well-defined definition for integrals. However,
mathematicians soon discovered that we need to work with functions that are not
continuous as well. The process to make integrals rigorously defined is long and
tedious. We will follow the historical path and study the Riemann integrals in
this course. This lays down the foundation for advanced theory of integration à
la Lebesgue. For practical applications and computations, Riemann integrals are
sufficient and easier to calculate.
Chapter 4. Integrating Functions of a Single Variable 252
In this section, we define the Riemann integral for a function f : [a, b] → R that
is defined on a closed and bounded interval [a, b]. For this purpose, the function
is necessarily bounded. In Section 4.6, we will discuss how to deal with functions
that are not necessarily bounded, via some limiting processes.
For a closed and bounded interval [a, b], we will always assume that a < b.
We start by a few definitions.
We have slightly abused notation and used set notation for a partition.
Example 4.1
[0, 2], [2, 3], [3, 5], [5, 9] and [9, 10].
We use the lengths of the subintervals to measure how fine a partition is.
Let P = {x0 , x1 , . . . , xk } be a partition of the interval [a, b]. The gap of the
partition P , denoted by |P | or gap P , is the length of the largest subinterval
in the partition. Namely,
Example 4.2
|P | = max{2, 1, 2, 4, 1} = 4.
Let [a, b] be a closed and bounded interval. A regular partition of [a, b] into
k intervals is the partition P = {x0 , x1 , . . . , xk }, where
b−a
|P | = x1 − x0 = x2 − x1 = · · · = xk − xk−1 = .
k
b−a
This implies that xi = x0 + i , 1 ≤ i ≤ k.
k
Example 4.3
The regular partition of the interval [0, 10] into 5 intervals is the partition
P = {0, 2, 4, 6, 8, 10}.
10 − 0
The gap of this partition is |P | = = 2.
5
Example 4.4
A = {1, 3, 4, 5} .
Then
R(f, P, A) = 5 × 2 + 9 × 1 + 8 × 2 + 5 × 1 = 40.
As shown in Figure 4.2, the Riemann sum R(f, P, A) is the sum of the areas
of rectangles that are used to approximate the region bounded by the curve y =
6x − x2 and the x-axis.
The Darboux lower sum L(f, P ) and the Darboux upper sum U (f, P ) are
defined by
k
X
L(f, P ) = mi (xi − xi−1 ),
i=1
Xk
U (f, P ) = Mi (xi − xi−1 ).
i=1
Remark 4.1
For convenience, we denote
Example 4.5
i interval xi − xi−1 mi Mi
1 [0, 2] 2 0 8
2 [2, 3] 1 8 9
3 [3, 5] 2 5 9
4 [5, 6] 1 0 5
Hence, the Darboux lower sum L(f, P ) and the Darboux upper sum
U (f, P ) are
L(f, P ) = 0 × 2 + 8 × 1 + 5 × 2 + 0 × 1 = 18,
U (f, P ) = 8 × 2 + 9 × 1 + 9 × 2 + 5 × 1 = 48.
Example 4.6
Proposition 4.1
For any partition P = {xi }ki=0 of the interval [a, b], and any choice of
intermediate points A = {ξi }ki=1 for the partition P , we have
Proof
For any 1 ≤ i ≤ k, let
Then
m ≤ mi ≤ f (ξi ) ≤ Mi ≤ M.
Therefore,
Example 4.7
P ∗ = {0, 1, 2, 3, 5, 6, 8, 9, 10}
is a refinement.
Chapter 4. Integrating Functions of a Single Variable 258
Example 4.8
For the partition P and P ∗ in Example 4.7, P ∗ induces the partition P1 =
{0, 1, 2}, P2 = {2, 3}, P3 = {3, 5}, P4 = {5, 6, 8, 9} and P5 = {9, 10} of
the intervals [0, 2], [2, 3], [3, 5], [5, 9] and [9, 10] respectively.
Proposition 4.2
Theorem 4.3
Let f : [a, b] → R be a bounded function, and let P and P ∗ be partitions of
[a, b]. If P ∗ is a refinement of P , then
Proof
Let P = {xi }ki=0 . For each 1 ≤ i ≤ k, let
Since
mi ≤ f (x) ≤ Mi for all x ∈ [xi−1 , xi ],
we find that
Figure 4.5: When a partition is refined, Darboux lower sum gets larger.
Figure 4.6: When a partition is refined, Darboux upper sum gets smaller.
Corollary 4.4
L(f, P1 ) ≤ U (f, P2 ).
Proof
∗
Take a common refinement P of the partitions P1 and P2 . By Theorem
4.3,
L(f, P1 ) ≤ L(f, P ∗ ) ≤ U (f, P ∗ ) ≤ U (f, P2 ).
This implies that the sets SL (f ) and SU (f ) are bounded. When we use the
Darboux lower sums and upper sums to approximate areas, we are interested in
the least upper bound of the lower sums and the greatest lower bound of the upper
sums.
Chapter 4. Integrating Functions of a Single Variable 261
Z b
2. The upper integral of f , denoted by f , is defined as the greatest lower
a
bound of the Darboux upper sums.
Z b
f = inf SU (f ) = inf {U (f, P ) | P is a partition of [a, b]} .
a
Example 4.9
Proposition 4.5
Proof
By definitions of infimum and supremum, for any positive integer n, there
are partition P1 and P2 such that
Z b Z b
1 1
L(f, P1 ) > f− , U (f, P2 ) < f+ .
a n a n
Z b Z b
2. 0 ≤ f − f ≤ U (f, P ) − L(f, P ).
a a
L(f, P ) ≤ w ≤ U (f, P ).
less than or equal to all the Darboux upper sums. This is the case if and only if
the lower and the upper integrals are the same.
It is the unique number that is larger than or equal to all the Darboux lower
sums, and less than or equal to all the Darboux upper sums.
Remark 4.2
If f : [a, b] → R is a continuous nonnegative function, we are going to
prove that f is Riemann integrable. It follows from our discussions above
Z b
that the integrable f is the area bounded by the curve y = f (x), the
a
x-axis, and the lines x = a and x = b.
Leibniz Notation
In Leibniz notation, the integral of f : [a, b] → R over [a, b] is denoted by
Z b
f (x)dx.
a
Example 4.10
Solution
Let P = {xi }ki=0 be any partition of the interval [0, 1]. For any 1 ≤ i ≤
k, by denseness of the set of rational numbers and the set of irrational
numbers, there exist a rational number and an irrational number in the
interval [xi−1 , xi ]. This shows that
mi = 0, Mi = 1, for all 1 ≤ i ≤ k.
Hence,
k
X
L(f, P ) = mi (xi − xi−1 ) = 0,
i=1
Xk
U (f, P ) = Mi (xi − xi−1 ) = 1.
i=1
SL (f ) = {0}, SU (f ) = {1}.
Lemma 4.6
Let f : [a, b] → R be a bounded function. Then the following are
equivalent.
(b) For any ε > 0, there exists a partition P of [a, b] such that
Proof
First, let us prove (a) implies (b). If f is Riemann integrable,
Z b Z b Z b
f= f= f.
a a a
Conversely, assume that (b) holds. Then for every positive integer n, there
is a partition Pn of [a, b] such that
Chapter 4. Integrating Functions of a Single Variable 266
1
0 ≤ U (f, Pn ) − L(f, Pn ) < .
n
Therefore,
Z b Z b
1
0≤ f − f< .
a a n
Taking the n → ∞ limit, squeeze theorem implies that
Z b Z b
f = f.
a a
This theorems says that the Riemann integrability of a function can be checked
by the existence of a sequence of partitions satisfying (4.1). Such sequence of
partitions can also be used to compute the Riemann integral. Thus, we give such
sequence a special name.
Hence, Theorem 4.7 says that f : [a, b] → R is Riemann integrable if and only
if it has an Archimedes sequence of partitions.
Chapter 4. Integrating Functions of a Single Variable 267
the definition of limit of sequences implies that for every ε > 0, there is a
positive integer N such that for all n ≥ N ,
Z b
0 ≤ U (f, Pn ) − f ≤ U (f, Pn ) − L(f, Pn ).
a
Z b
lim L(f, Pn ) = lim U (f, Pn ) = f.
n→∞ n→∞ a
Chapter 4. Integrating Functions of a Single Variable 268
Example 4.12
2
Let f : [1, 4] → R be the function
Z 4 f (x) = x . Show that f is Riemann
integrable and find the integral f (x)dx.
1
Solution
Let n be a positive integer, and let Pn = {x0 , x1 , . . . , xn } be the regular
partition of [1, 4] into n intervals. Then
3i
xi = 1 + .
n
Notice that the function f : [1, 4] → R, f (x) = x2 is an increasing function.
Therefore, on the interval [xi−1 , xi ],
2 2
3i − 3 3i
mi = f (xi−1 ) = 1 + , Mi = f (xi ) = 1+ .
n n
n n
6i 9i2
X 3X
U (f, Pn ) = Mi (xi − xi−1 ) = 1+ + 2
i=1
n i=1
n n
3 3(n + 1)(2n + 1)
= n + 3(n + 1) +
n 2n
3
= 2 (14n2 + 15n + 3).
2n
Moreover,
45
U (f, Pn ) − L(f, Pn ) = .
n
It follows that
lim (U (f, Pn ) − L(f, Pn )) = 0.
n→∞
(ii) For any ε > 0, there exists a δ > 0 so that if P = {xi }ki=0 is a partition
of [a, b] with |P | < δ, then
Proof
(ii) implies (i) follows trivially from Lemma 4.6.
Now assume that (i) holds. Since f : [a, b] → R is bounded, there exists a
positive number M such that
P0 = {e
x0 , x es }
e1 , . . . , x
Here
mi = inf f (x), Mi = sup f (x).
xi−1 ≤x≤xi xi−1 ≤x≤xi
Let
E1 = {1 ≤ i ≤ k | ∃j, x
ej ∈ [xi−1 , xi ]}
be the set that contains those indices i where the interval [xi−1 , xi ] contains
a partition point of P0 , and let E2 = {1, 2, . . . , k} \ E1 be the set of those
indices that are not in E1 .
Chapter 4. Integrating Functions of a Single Variable 271
By definition, the point xe0 can only be in [x0 , x1 ], and the point x
es can only
be in [xk−1 , xk ]. For any 1 ≤ j ≤ s − 1, xej can be in at most two different
subintervals of P . Hence, E1 contains at most 2s elements.
Splitting the sum over i to a sum over E1 and a sum over E2 , we have
X X
U (f, P )−L(f, P ) = (Mi −mi )(xi −xi−1 )+ (Mi −mi )(xi −xi−1 ).
i∈E1 i∈E2
0 ≤ Mi − mi ≤ 2M.
Lemma 4.9
Let f : [a, b] → R be a bounded function, and let P = {xi }ki=0 be a partition
of [a, b]. For every ε > 0, there exist choices of intermediate points A and
B for the partition P such that
Proof
For 1 ≤ i ≤ k, let mi = inf f (x) and Mi = sup f (x). By
xi−1 ≤x≤xi xi−1 ≤x≤xi
definitions of infimum and supremum, for each 1 ≤ i ≤ k, there are points
ξi and ηi in [xi−1 , xi ] such that
ε
mi ≤ f (ξi ) < mi + ,
(b − a)
ε
Mi − < f (ηi ) ≤ Mi .
(b − a)
Let A = {ξi }ki=1 and B = {ηi }ki=1 . They are choices of intermediate points
for the partition P . The two inequalities above give
(ii) There is a number I such that for any ε > 0, there exists a δ > 0 so
that if P = {xi }ki=0 is a partition of [a, b] with |P | < δ, A = {ξi }ki=1
is a choice of intermediate points for P , then
|R(f, P, A) − I| < ε.
Note that statement (ii) can be expressed as saying the limit of Riemann sums
I = lim R(f, P, A)
|P |→0
exists.
Proof
Assume that (i) holds. Let
Z b Z b
I= f= f.
a a
Given ε > 0, by Theorem 4.8, there exists a δ > 0 such that if P = {xi }ki=0
is a partition of [a, b] with |P | < δ, then
Given ε > 0, (ii) implies there is a δ > 0 such that if P = {xi }ki=0 is a
partition of [a, b] with |P | < δ, A = {ξi }ki=1 is a choice of intermediate
points for the partition P , then
ε
|R(f, P, A) − I| < . (4.3)
4
Here I is the limit of Riemann sums implied by (ii). Let P = {xi }ni=0 be a
regular partition into n intervals, where n is large enough so that
b−a
|P | = < δ.
n
By Lemma 4.9, there exist choices of intermediate points A and B for the
partition P which satisfy
ε ε
U (f, P ) < R(f, P, A) + , L(f, P ) > R(f, P, B) − .
4 4
These imply that
ε
U (f, P ) − L(f, P ) < R(f, P, A) − R(f, P, B) + .
2
Chapter 4. Integrating Functions of a Single Variable 275
By (4.3),
ε
|R(f, P, A) − R(f, P, B)| ≤ |R(f, P, A) − I| + |R(f, P, B) − I| < .
2
This proves that
U (f, P ) − L(f, P ) < ε,
which completes the proof that (ii) implies (i).
Corollary 4.11
lim |Pn | = 0.
n→∞
Then
Z b
(a) f = lim U (f, Pn ) = lim L(f, Pn )
a n→∞ n→∞
Z b
(b) f = lim R(f, Pn , An ), where for each n ∈ Z+ , An is a choice of
a n→∞
intermediate points for the partition Pn .
Proof
Z b
Let I = f . Given ε > 0, Theorem 4.8 and Theorem 4.10 imply that
a
there is a δ > such that for any partition P with |P | < δ, and any choice of
intermdiates points A for the partition P ,
For every positive integer n, take Pn to be the regular partition of [a, b] into
n intervals. This gives a sequence of partitions {Pn } with
b−a
lim |Pn | = lim = 0.
n→∞ n→∞ n
For the choices of intermediate points An , one can take the left end point of
each interval, or the right end point, or the midpoint.
Example 4.13
We are going to prove in Section 4.3 that a continuous function is
integrable. The function f : [0, 6] → R,Zf (x) = 6x − x2 is continuous. Use
6
Riemann sums to evaluate the integral f (x)dx.
0
Solution
For a positive integer n, let Pn = {xi }ni=0 be the regular partition of [0, 6]
into n intervals. Then
6i
xi = , 0 ≤ i ≤ n.
n
Chapter 4. Integrating Functions of a Single Variable 277
Exercises 4.1
Question 1
(c) Use partZ (b) to conclude that f is Riemann integrable and find the
2
integral f (x)dx.
0
Question 2
In this section, we derive some properties of the Riemann integrals. First we show
that integral of a nonnegative function is nonnegative.
Theorem 4.12
If f : [a, b] → R is a bounded function that is Riemann integrable, and
then Z b
f ≥ 0.
a
Proof
Since f is Riemann integrable,
Z b
f = lim L(f, Pn ),
a n→∞
Therefore, Z b
f ≥ 0.
a
Proof
Here we use the fact that a function h : [a, b] → R is Riemann integrable
if and only if the limit lim R(h, P, A) exists. Since f : [a, b] → R and
|P |→0
g : [a, b] → R are bounded, αf + βg is also bounded. Since f : [a, b] → R
and g : [a, b] → R are Riemann integrable,
Z b Z b
f = lim R(f, P, A), g = lim R(g, P, A).
a |P |→0 a |P |→0
Notice that for any partition P = {xi }ki=0 of [a, b], and any choice of
intermediate points A = {ξi }ki=1 for the partition P ,
k
X
R(αf + βg, P, A) = (αf (ξi ) + βg(ξi )) (xi − xi−1 )
i=1
From the previous two theorems, we obtain a comparison theorem for integrals.
Chapter 4. Integrating Functions of a Single Variable 281
then Z b Z b
f≥ g.
a a
Proof
Define h : [a, b] → R to be the function h(x) = f (x) − g(x). Then
h(x) ≥ 0 for all x ∈ [a, b]. By Theorem 4.13, h is Riemann integrable and
Z b Z b Z b
h= f− g.
a a a
Z b
By Theorem 4.12, h ≥ 0. Hence,
a
Z b Z b
f≥ g.
a a
We can apply the monotonicity theorem to obtain bounds for an integral from
the lower bound and the upper bound of the function.
Example 4.14
Then Z b
m(b − a) ≤ f ≤ M (b − a).
a
In either case, Z b Z c Z b
f= f+ f.
a a c
Proof
We use Lemma 4.6. First we prove (a). Given ε > 0, since f : [a, b] → R
is Riemann integrable, there is a partition P of [a, b] such that
Therefore,
If a > b, define Z b Z a
f =− f.
a b
Z b Z c
Then one can check that as long as two of the three integrals f, f,
Z b a a
Exercises 4.2
Question 1
Given that f : [2, 7] → R is a function satisfying
Question 2
Given that f : [a, b] → R and g : [a, b] → R are bounded functions, P is a
partition of [a, b], and c and d are two points in [a, b] with c < d. Prove the
following.
Then use (c) and (d) to give a proof of the following statement: If f :
[a, b] → R and g : [a, b] → R are Riemann integrable, then f + g : [a, b] →
R is also Riemann integrable, and
Z b Z b Z b
(f + g) = f+ g.
a a a
Chapter 4. Integrating Functions of a Single Variable 286
Theorem 4.17
Let f : [a, b] → R be a continuous function. Then f : [a, b] → R is
Riemann integrable.
Proof
Since f : [a, b] → R is a continuous function defined on a closed and
bounded interval, it is uniformly continuous. Given ε > 0, there exists
δ > 0 such that for any u and v in [a, b] with |u − v| < δ,
ε
|f (u) − f (v)| < .
b−a
Let P = {xi }ki=0 be a partition of [a, b] with |P | < δ. For any 1 ≤ i ≤ k,
f : [xi−1 , xi ] → R is continuous. By extreme value theorem, there exists ui
and vi in [xi−1 , xi ] such that
Then
|ui − vi | ≤ xi − xi−1 ≤ |P | < δ.
Therefore,
ε
Mi − mi = f (vi ) − f (ui ) < .
b−a
Chapter 4. Integrating Functions of a Single Variable 287
Hence,
k
X
U (f, P ) − L(f, P ) = (Mi − mi )(xi − xi−1 )
i=1
k
ε X
< (xi − xi−1 ) = ε.
b − a i=1
It follows from this theorem that all the following classes of functions
are integrable on a closed and bounded interval that is contained in their
domains.
• Polynomials
• Rational Functions
• Exponential Functions
• Logarithmic Functions
• Trigonometric Functions
Let us revisit the concept of area, which is our original motivation to define
integrals.
Figure 4.7: Darboux lower sum underestimates area while Darboux upper sum
overestimates area.
Given P a partition of [a, b], the Darboux lower sum L(f, P ) is a sum of
areas of rectangles that are inside R. The Darboux upper sum U (f, P ) is a
sum of areas of rectangles whose union contains R. Therefore, if R has a
area A, L(f, P ) is less than or equal to A, while U (f, P ) is larger than or
Z b
equal to A. Since f is continuous, the Riemann integral I = f (x)dx
a
exists. By definition, I is the unique number such that
L(f, P ) ≤ I ≤ U (f, P )
for all partitions P of [a, b]. Therefore, we define the area of R to be this
number I. Namely, Z b
Area of R = f (x)dx.
a
There are also other classes of functions that are Riemann integrable, which
are useful. First, we relax the continuity condition slightly in the previous theorem.
Chapter 4. Integrating Functions of a Single Variable 289
Theorem 4.18
Let f : [a, b] → R be a bounded function that is continuous on (a, b). Then
f : [a, b] → R is Riemann integrable.
Here we only assume f is continuous on (a, b). The function can take on any
values on the boundary points a and b.
Proof
Since f : [a, b] → R is bounded, there is a positive constant M such that
Z b
As we can see in the proof above, the integral f does not depend on the
a
Chapter 4. Integrating Functions of a Single Variable 290
function value at the end points. In fact, this is true for any finite number of points.
Theorem 4.19
Let f : [a, b] → R be a bounded function that is Riemann integrable.
Assume that g : [a, b] → R is a function and S = {a1 , a2 , . . . , ak } is a
finite subset of [a, b] such that
Proof
Let h : [a, b] → R be the function h(x) = g(x) − f (x). Then h(x) = 0 for
x ∈ [a, b] \ S. Since S is a finite set, h is bounded, and so there is a positive
constant M such that |h(x)| ≤ M for all x ∈ [a, b]. Given a positive integer
n, let Pn = {x0 , x1 , . . . , xn } be the regular partition of [a, b] into n intervals.
There are at most 2k of the intervals [xi−1 , xi ] that contains a point of S. In
these intervals,
n
X 2M k(b − a)
L(h, Pn ) = inf h(x)(xi − xi−1 ) ≥ − .
i=1
xi−1 ≤x≤xi n
Chapter 4. Integrating Functions of a Single Variable 291
Therefore,
2M k(b − a) 2M k(b − a)
− ≤ L(h, Pn ) ≤ U (h, Pn ) ≤ .
n n
Taking n → ∞ limits, we find that
Remark 4.4
If f : (a, b) → R is a bounded function, we can extend the function to
[a, b] and discuss its integrability. By Theorem 4.19, this is not affected by
how we define the function at x = a and x = b. In Z case the extension is b
Riemann integrable, we still denote the integral by f.
a
Using the general additivity theorem (Corollary 4.16) and Theorem 4.18, we
obtain the following immediately.
Theorem 4.20
Let f : [a, b] → R be a function that is bounded and piecewise continuous.
Then f : [a, b] → R is Riemann integrable.
Chapter 4. Integrating Functions of a Single Variable 292
Example 4.15
Proposition 4.21
Example 4.16
Solution
The function f is given explicitly by
1, if 0 ≤ x ≤ 1,
1 < x ≤ 5/4,
4, if
f (x) = 3, if 5/4 < x ≤ 5/3,
2, if 5/3 < x ≤ 5/2,
1, if 5/2 < x ≤ 5.
The following theorem shows that monotonic functions are also Riemann
Chapter 4. Integrating Functions of a Single Variable 294
integrable.
Theorem 4.22
If f : [a, b] → R is a monotonic function, then it is Riemann integrable.
Proof
Without loss of generality, assume that f : [a, b] → R is an increasing
function. If P = {xi }ki=0 is a partition of [a, b], then for any 1 ≤ i ≤ k,
Therefore,
n
X
U (f, P ) − L(f, P ) = (f (xi ) − f (xi−1 ) (xi − xi−1 ).
i=1
For each positive integer n, let Pn be the regular partition of [a, b] into n
intervals. Then
n
b−aX
U (f, Pn ) − L(f, Pn ) = (f (xi ) − f (xi−1 ))
n i=1
(b − a) (f (b) − f (a))
= .
n
Chapter 4. Integrating Functions of a Single Variable 295
(b − a) (f (b) − f (a))
lim (U (f, Pn ) − L(f, Pn )) = lim = 0.
n→∞ n→∞ n
In other words, {Pn } is an Archimedes sequence of partitions for f . By the
Archimedes-Riemann theorem, this proves that f is Riemann integrable.
Example 4.17
Theorem 4.23
Let f : [a, b] → R be a bounded function. If f : [a, b] → R is Riemann
integrable, then the function |f | : [a, b] → R is Riemann integrable.
Proof
We will first prove the following: For any c and d in [a, b] with c < d,
There are two sequences of points {un } and {vn } in [c, d] such that
But then
U (|f |, P ) − L(|f |, P )
Xn
= sup |f (x)| − inf |f (x)| (xi − xi−1 )
xi−1 ≤x≤xi xi−1 ≤x≤xi
i=1
Xn
≤ sup f (x) − inf f (x) (xi − xi−1 )
xi−1 ≤x≤xi xi−1 ≤x≤xi
i=1
Remark 4.5
The converse of Theorem 4.23 is not true. Namely, if a function f : [a, b] →
R is bounded, |f | : [a, b] → R is Riemann integrable does not imply that
f : [a, b] → R is Riemann integrable. For a counter example, consider the
function f : [0, 1] → R defined as
1, if x is rational,
f (x) =
−1, if x is irrational.
One can prove that f : [0, 1] → R is not integrable, exactly the same way
as in Example 4.11. On the other hand, since |f | : [0, 1] → R is a constant
function, it is Riemann integrable.
The following theorem says that the product of Riemann integrable functions
is Riemann integrable.
Theorem 4.24
Let f : [a, b] → R and g : [a, b] → R be bounded functions. If f :
[a, b] → R and g : [a, b] → R are Riemann integrable, then the function
(f g) : [a, b] → R is also Riemann integrable.
Proof
We will apply Lemma 4.6 to prove the Riemann integrability of the function
h = (f g) : [a, b] → R. Since f : [a, b] → R and g : [a, b] → R are bounded
functions, there is a positive number M so that
There are two sequences of points {un } and {vn } in [c, d] such that
Notice that
|h(vn ) − h(un )| = |g(vn )(f (vn ) − f (un )) + f (un )(g(vn ) − g(un ))|
≤ |g(vn )||f (vn ) − f (un )| + |f (un )||g(vn ) − g(un )|
Therefore,
|h(vn ) − h(un )|
≤ M sup f (x) − inf f (x) + sup g(x) − inf g(x) .
c≤x≤d c≤x≤d c≤x≤d c≤x≤d
It follows that
U (h, P ∗ ) − L(h, P ∗ )
Xk
= sup h(x) − inf h(x) (xi − xi−1 )
xi−1 ≤x≤xi xi−1 ≤x≤xi
i=1
k
X
≤M sup f (x) − inf f (x) (xi − xi−1 )
xi−1 ≤x≤xi xi−1 ≤x≤xi
i=1
k
X
+M sup g(x) − inf g(x) (xi − xi−1 )
xi−1 ≤x≤xi xi−1 ≤x≤xi
i=1
Exercises 4.3
Question 1
Given that f : [−1, 1] → R is the function defined by
sin 4 ,
if x ̸= 0,
f (x) = x
0,
if x = 0.
Question 2
Let f : [a, b] → R be a bounded function. If f : [a, b] → R is Riemann
integrable, show that
Z b Z b
f (x)dx ≤ |f (x)|dx.
a a
Question 3
Let f : [0, 6] → R be the function defined as
−2, if 0 ≤ x < 1,
f (x) =
⌊4/x⌋ , if 1 ≤ x ≤ 6.
Z 6
Show that f is Riemann integrable and find f.
0
Question 4
Let f : R → R be the function defined as
f (x) = (x − ⌊x⌋)2 .
Question 5
Let f : [a, b] → R and g : [a, b] → R be bounded functions. Define the
function h : [a, b] → R by
By definition, F (a) = 0. For any c and d in [a, b], one can check that
Z d Z c Z d
F (d) − F (c) = f (u)du − f (u)du = f (u)du.
a a c
Theorem 4.25
Let f : [a, b] → R be a bounded function that is Riemann integrable, and
let F : [a, b] → R be the function defined by
Z x
F (x) = f (u)du.
a
Proof
It is sufficient to prove that F : [a, b] → R is Lipschitz. The continuity
follows. Since f : [a, b] → R is bounded, there is a positive constant M
such that
|f (x)| ≤ M for all x ∈ [a, b].
Chapter 4. Integrating Functions of a Single Variable 303
Therefore,
Z x2 Z x2
|F (x2 ) − F (x1 )| = f (u)du ≤ |f (u)|du
x1 x1
Z x2
≤ M du = M (x2 − x1 ) = M |x2 − x1 |.
x1
Proof
Since f : [a, b] → R is a continuous function, the extreme value theorem
says that there are points u and v in [a, b] such that
satisfies
f (u) ≤ w ≤ f (v).
By intermediate value theorem, there is a point c in [a, b] such that f (c) =
w. This gives Z b
1
f (x)dx = f (c).
b−a a
In fact, one can argue that the number c can be chosen to be in (a, b).
Example 4.18
Solution
Let g : [a, b] → R be the function g(x) = f (x) − m. Then g : [a, b] → R is
a continuous function and g(x) ≥ 0 for all x ∈ [a, b]. Moreover,
Z b Z b Z b
g(x)dx = f (x)dx − mdx = 0.
a a a
We want to show that g(x) = 0 for all x ∈ [a, b]. Suppose to the contrary
that there is a point x0 in [a, b] such that g(x0 ) ̸= 0. Then g(x0 ) > 0. Since
g is continuous, there exists a δ > 0 such that for all x ∈ (x0 − δ, x0 + δ) ∩
(a, b),
g(x0 )
g(x) > .
2
Chapter 4. Integrating Functions of a Single Variable 305
b−a
Without loss of generality, assume that δ < . Then either x0 − δ > a
2
or x0 + δ < b. In any case, (x0 − δ, x0 + δ) ∩ (a, b) = (c, d) is an interval
of length at least δ. But then
Z b Z c Z d Z b
g(x)dx = g(x)dx + g(x)dx + g(x)dx
a a c d
g(x0 )
≥ 0 × (c − a) + (d − c) + 0 × (b − d)
2
g(x0 )
= (d − c) > 0,
2
which is a contradiction. Therefore, we must have g(x) = 0 for all x ∈
[a, b].
Remark 4.6
In the mean value theorem for integrals, we can strengthen the theorem to
have the point c being a point in the open interval (a, b). In the proof, we
Z b
have shown that for w = f (x)dx,
a
f (u) ≤ w ≤ f (v).
Now we turn to the fundamental theorem of calculus. Consider the case that an
object is moving with speed v(t) at time t. To find s(t), the distance travelled up to
time t, we can partition the time interval [0, t] into a finite number of subintervals
[0, t1 ], [t1 , t2 ], . . . , [tk−1 , tk ], where tk = t. For each time interval [ti−1 , ti ], where
1 ≤ i ≤ k, take a point t∗i ∈ [ti−1 , ti ], and approximate the average speed over the
time interval [ti−1 , ti ] by the speed at time t∗i , v(t∗i ). Then the distance travelled up
Chapter 4. Integrating Functions of a Single Variable 306
to time t is approximately
k
X
v(t∗i )(ti − ti−1 ).
i=1
We recognize that this is a Riemann sum of the speed function v(t). The distance
travelled s(t) should be calculated as the limit where the gap of the partition goes
to zero. In other words, Z t
s(t) = v(τ )dτ.
0
which means that differentiation and integration are inverse processes of each
other. The fundamental theorem of calculus gives a rigorous setting for this.
F (x0 + h) − F (x0 ) ε
− f (x0 ) ≤ < ε.
h 2
F (x0 + h) − F (x0 )
lim = f (x0 ).
h→0 h
Example 4.19
We find that
Z x −x, if 0 ≤ x < 1,
F (x) = f (u)du =
0 2x − 3, if 1 ≤ x ≤ 2.
Chapter 4. Integrating Functions of a Single Variable 308
Example 4.20
Evaluate the following derivatives.
Z x
d
(a) sin(u2 )du
dx 0
Z 1
d
(b) sin(u2 )du
dx x
Z x2
d
(c) sin(u2 )du
dx x
Solution
The function f : R → R, f (x) = sin(x2 ) is continuous. Hence, it is
Riemann integrable over any closed and bounded intervals. Let
Z x Z x
F (x) = f (u)du = sin(u2 )du.
0 0
Chapter 4. Integrating Functions of a Single Variable 309
Then Z b
f (x)dx = F (b) − F (a).
a
We will present two proofs of the fundamental theorem of calculus II. The first
one uses fundamental theorem of calculus I.
Chapter 4. Integrating Functions of a Single Variable 310
G(x) = F (x) + C.
and so Z b
f (x)dx = G(b) = F (b) − F (a).
a
we find that
L(f, P ) ≤ F (b) − F (a) ≤ U (f, P ).
Notice that this is true for any partition P of [a, b]. By definitions of the
lower integral and the upper integral, we find that
Z b Z b
f ≤ F (b) − F (a) ≤ f.
a a
Since f : [a, b] → R is Riemann integrable, the lower integral and the upper
integral are the same. Thus,
Z b Z b Z b
f= f= f = F (b) − F (a).
a a a
then Z b
f (x)dx = F (b) − F (a).
a
Proof
We can assume that
Applying
Z ai the fundamental theorem of calculus II to each of the integrals
f (x)dx, we find that
ai−1
Z b k
X
f (x)dx = (F (ai ) − F (ai−1 )) = F (b) − F (a).
a i=1
This completes the proof. Note that it is crucial here that F is continuous
on [a, b].
Example 4.21
Solution
Using additivity,
Z 2 Z 0 Z 2
f (x)dx = f (x)dx + f (x)dx.
−1 −1 0
is continuous on [a, b], differentiable on (a, b), and F ′ (x) = f (x) for all
x ∈ (a, b). By Lagrange mean value theorem, there is a c ∈ (a, b) such that
b
F (b) − F (a)
Z
1
f (x)dx = = F ′ (c) = f (c).
b−a a b−a
Let (a, b) be an open interval that contains the point x0 , and let y0 be any
real number. Given that f : (a, b) → R is a continuous function, there
exists a unique differentiable function F : (a, b) → R such that
Proof
As we mentioned before, the uniqueness follows from the identity criterion.
For the existence, notice that f is continuous on any closed and bounded
interval that is contained in (a, b). Hence, we can define the function F :
(a, b) → R by Z x
F (x) = f (u)du + y0 .
x0
Example 4.22
Find the limit n
1X πk
lim sin .
n→∞ n n
k=1
Solution
We try to identify
n
1X πk
sin
n k=1 n
πk
as a Riemann sum. For 1 ≤ k ≤ n, let ξk = . These are equally
n
spaced points in the interval [0, π]. This motivates us to define the function
f : [0, π] → R, f (x) = sin x. Since f is a continuous function, it is
Riemann integrable. Let Pn be the regular partition of [0, π] into n intervals.
Then with An = {ξk }nk=1 , we have
n
X πk π
R(f, Pn , An ) = sin .
k=1
n n
Therefore,
n
1X πk 1 2
lim sin = lim R(f, Pn , An ) = .
n→∞ n n π n→∞ π
k=1
Chapter 4. Integrating Functions of a Single Variable 316
Exercises 4.4
Question 1
Evaluate the following derivatives.
Z x
d 2
(a) eu du
dx 0
Z 1
d
(b) cos(u2 )du
dx x
Z x3
d √
(c) 2 + sin u du
dx x
Question 2
Let f : [−2, 6] → R be the function defined by
x2 − x, if − 2 ≤ x < 1,
f (x) =
x − 1 , if 1 ≤ x ≤ 6.
x
Find a continuous function F : [−2, 6] → R such that F is differentiable
on (−2, 6), F (0) = 0, and
Question 3
Find the limit
17 + 27 + · · · + n7
lim .
n→∞ n8
Question 4
Find the limit n
1X 2 2πk
lim cos .
n→∞ n n
k=1
Chapter 4. Integrating Functions of a Single Variable 317
Then g maps the interval [a, b] onto a closed and bounded interval [c, d] with
end points g(a) and g(b). If f : [c, d] → R is a function that is bounded and
continuous on (c, d), then the function h : [a, b] → R,
This is equivalent to
Z d Z b
f (u)du = f (g(x))|g ′ (x)|dx. (4.8)
c a
The function g : [a, b] → R that satisfies all the three given conditions defines
a smooth change of variables u = g(x) from x to u, in the sense that g is
continuously differentiable on (a, b).
Chapter 4. Integrating Functions of a Single Variable 318
Proof
Since g is one-to-one, we have g((a, b)) ⊂ (c, d). Therefore, the function
h : [a, b] → R, h(x) = f (g(x))g ′ (x) is continuous and bounded on (a, b),
and hence, it is Riemann integrable. For any x ∈ [a, b], let
Z x Z x
H1 (x) = h(u)du = f (g(u))g ′ (u)du,
a
Z x a
F (x) = f (u)du,
c
Z g(x)
H2 (x) = f (u)du = F (g(x)) − F (g(a)).
g(a)
Since H1′ (x) = H2′ (x) for all x ∈ (a, b), and H1 (a) = H2 (a), we conclude
that H1 (x) = H2 (x) for all x ∈ [a, b]. Namely,
Z b Z g(b)
′
f (g(x))g (x)dx = f (u)du.
a g(a)
From this, we see that integration by substitution is just the inverse of the
chain rule for differentiation. To prove the equivalence of (4.7) and (4.8),
we consider two cases.
Case I: g is strictly increasing on [a, b].
In this case, g ′ (x) ≥ 0, and c = g(a), d = g(b). So (4.7) is equivalent to
(4.8).
Case II: g is strictly decreasing.
In this case, g ′ (x) ≤ 0, g(a) = d and g(b) = c. Therefore,
Z b Z b
′
f (g(x))|g (x)|dx = − f (g(x))g ′ (x)dx
a a
and Z g(b) Z c Z d
f (u)du = f (u)du = − f (u)du.
g(a) d c
Example 4.23
Z 3 √
Evaluate the integral x 16 + x2 dx.
−2
Solution
√
Let f (x) = x and g(x) = 16 + x2 . The function g is continuously
differentiable, with g ′ (x) = 2x, and it maps the interval [−2, 3] onto
the interval [16, 25]. However, it is not one-to-one. The function f is
continuous on [16, 25], so we can apply the integration by substitution. In
practice, we will do substitution by letting u = 16 + x2 , and find that
du
= 2x.
dx
Chapter 4. Integrating Functions of a Single Variable 320
This implies that we can replace xdx by du/2. When x = −2, u = 20;
when x = 3, u = 25. Thus,
Z 3 √ 25 √
1 25 √
125 − 20 20
Z
2
1 3
x 16 + x dx = udu = u 2 = .
−2 2 20 3 20 3
Students are invited to split the integral into a sum of two integrals, one
over the interval [−2, 0], and one over the interval [0, 3]. The function g(x)
is one-to-one on each of these two intervals. Check that the same answer is
obtained.
Example 4.24
Solution
We consider the change of variables u = g(x) = a − x. This is a strictly
monotonic function with g ′ (x) = −1. Therefore, du = −dx. When x = 0,
u = a; when x = a, u = 0. Hence,
Z a Z 0 Z a
f (x)dx = f (a − u)(−du) = f (a − x)dx.
0 a 0
Chapter 4. Integrating Functions of a Single Variable 321
Example 4.25
Solution
Notice that Z a Z 0 Z a
f (x)dx = f (x)dx + f (x)dx.
−a −a 0
Z 0
For the integral f (x)dx, we consider the change of variables u =
−a
g(x) = −x. This is a strictly monotonic function with g ′ (x) = −1.
Therefore, du = −dx. When x = −a, u = a; when x = 0, u = 0.
Hence, Z 0 Z 0 Z a
f (x)dx = f (−u)(−du) = f (−x)dx.
−a a 0
(a) When f is an even function, f (−x) = f (x) for all x ∈ [0, a].
Therefore,
Z a Z a Z a Z a
f (x)dx = f (x)dx + f (x)dx = 2 f (x)dx.
−a 0 0 0
(b) When f is an odd function, f (−x) = −f (x) for all x ∈ [0, a].
Therefore,
Z a Z a Z a
f (x)dx = − f (x)dx + f (x)dx = 0.
−a 0 0
Chapter 4. Integrating Functions of a Single Variable 322
Solution
A circle of radius r with center at the origin has equation x2 + y 2 = r2 .
By symmetry, it is enough for us to find the area in the first quadrant, and
then multiply by 4. The sector in the first quadrant is bounded by the curve
√
y = r2 − x2 , the lines x = 0, x = r, and the x-axis. Hence, the area of a
circle of radius r is Z r√
A=4 r2 − x2 dx.
0
Making a change of variables x = r sin θ, we find that
dx
= r cos θ.
dθ
When x = 0, θ = 0; when x = r, θ = π/2. Therefore,
Z π
2 p
A=4 r2 − r2 sin2 θ r cos θ dθ
0
Z π
2
2
= 4r cos2 θ dθ.
0
Proof
Since f and g are continuous on [a, b], they are bounded. Since f ′ (x) and
g ′ (x) are conitnuous and bounded on (a, b), f g ′ and f ′ g are continuous and
bounded on (a, b). Therefore, f g ′ and gf ′ are Riemann integrable on [a, b].
By product rule, for any x ∈ (a, b),
So (f g)′ is also bounded and continuous on (a, b), and hence Riemann
integrable on [a, b]. Since f g is also continuous on [a, b], we can apply
fundamental theorem of calculus, which gives
Z b
(f g)′ (x)dx = (f g)(b) − (f g)(a).
a
Therefore,
Z b Z b
′
f (x)g (x)dx + g(x)f ′ (x)dx = f (b)g(b) − f (a)g(a).
a a
In a nutshell, the integration by parts formula is just the inverse of the product
rule of differentiation. But it is a very useful integration technique.
Integration by Parts
The integration by parts formula is often expressed as
Z Z
udv = uv − vdu.
Example 4.27
Let n be a positive integer. Evaluate the integral
Z e
ln x
n
dx.
1 x
Solution
If n = 1, we use integration by substitution with u = ln x. Then
du 1
= .
dx x
When x = 1, u = 0; when x = e, u = 1. Therefore,
Z e Z 1 2 1
ln x u 1
dx = udu = = .
1 x 0 2 0 2
Therefore,
Z e e Z e
ln x 1 ln x 1 1
dx = − × + dx
1 xn n − 1 xn−1 1 n − 1 1 xn
e
1 1 1 1
=− × −
n − 1 en−1 (n − 1)2 xn−1 1
1 n 1
= 2
− 2 n−1
.
(n − 1) (n − 1) e
Example 4.28
Let I be an open interval that contains the point x0 , and let f : I → R
be a continuous function. Given a positive integer n, define the function
F : I → R by
1 x
Z
F (x) = (x − t)n f (t)dt.
n! x0
Prove that F is (n + 1) times continuously differentiable,
and
F (n+1) (x) = f (x) for all x ∈ I.
Solution
Define the function g : R → R by
Z x
g(x) = f (t)dt.
x0
du
= −1, v(t) = g(t).
dt
It follows that
h it=x Z x Z x
F (x) = (x − t)g(t) + g(t)dt = g(t)dt.
t=x0 x0 x0
du
= −k(x − t)k−1 , v(t) = g(t).
dt
It follows that
Z x
1 t=x 1
F (x) = (x − t)k g(t) t=x0 + (x − t)k−1 g(t)dt
k! (k − 1)! x0
Z x
1
= (x − t)k−1 g(t)dt.
(k − 1)! x0
and
F (k) (x) = g(x) for all x ∈ I.
The latter implies that F (k) (x0 ) = g(x0 ) = 0, and F (x) is (k + 1) times
differentiable, with
F (k+1) (x) = g ′ (x) = f (x)
a continuous function. Therefore, when n = k+1, the statement also holds.
By principle of mathematical induction, the statement is true for all positive
integers n.
Chapter 4. Integrating Functions of a Single Variable 329
Exercises 4.5
Question 2
Let f : [a, b] → R be a bounded function that is Riemann integrable. Show
that for any real number c,
Z b Z b+c
f (x)dx = f (x − c)dx.
a a+c
Question 2
Explain why Z 1 Z 1
−x2 2
(x + 1)e dx = 2 e−x dx.
−1 0
Question 3
Let a be a positive number. Assume that the functions f : [0, a] → R and
g : [0, a] → R are bounded and piecewise continuous, prove that
Z a Z a
f (x)g(a − x)dx = f (a − x)g(x)dx.
0 0
Question 4
Let m and n be nonnegative integers. Show that
Z 1
m! n!
xm (1 − x)n dx = .
0 (m + n + 1)!
Chapter 4. Integrating Functions of a Single Variable 330
Question 5
Let f : [a, b] → R be a continuous and strictly increasing function which
maps the interval [a, b] bijectively onto the interval [c, d], where c = f (a),
and d = f (b). Denote by g : [c, d] → R the inverse function of f . Notice
that f : [a, b] → R and g : [c, d] → R are Riemann integrable. This
question is regarding the proof of the formula
Z b Z d
f (x)dx = bf (b) − af (a) − g(x)dx. (4.9)
a c
(c) Let Pn = {xi }ni=0 be the regular partition of the interval [a, b] into n
intervals. For 0 ≤ i ≤ n, let yi = f (xi ). Then Pen = {yi }ni=0 is a
partition of [c, d].
(ii) Show that lim |Pn | = 0 and lim |Pen | = 0. You might want to
n→∞ n→∞
use uniform continuity.
(iii) Use part (i) and part (ii) to prove the formula (4.9).
Chapter 4. Integrating Functions of a Single Variable 331
have values
Z x Z x
1 π 1 π
lim− √ du = and lim + √ du = −
x→1 0 1 − u2 2 x→−1 0 1 − u2 2
lim F (x)
x→a+
Example 4.29
Z 1
1
√ dx is an improper integral as the function f : [0, 1) → R,
0 1 − x2
1
f (x) = √ is not bounded. We have seen that this improper integral
1 − x2
π
is convergent and has value .
2
Example 4.30
Let p be a positive Z number. Determine those values of p for which the
1
1
improper integral p
dx is convergent. Find the value of the improper
0 x
integral when it is convergent.
Chapter 4. Integrating Functions of a Single Variable 334
Solution
For p > 0, define the function F : (0, 1] → R by
Z 1
1
F (x) = p
du.
x u
Then
− ln x,
if p = 1,
F (x) = 1 − x1−p
, if p ̸= 1.
1−p
From this, we see that lim+ F (x) exists if and only if 0 < p < 1. Hence,
Zx→0
1
1
the improper integral p
dx is convergent if and only if 0 < p < 1. In
0 x
this case, Z 1
1 1
p
dx = , 0 < p < 1.
0 x 1−p
Z 1
When r ≥ 0, the integral xr dx is just an ordinary integral. However, we
0 Z 1
will sometimes abuse terminology and say that the integral xr dx is convergent
0
if and only if r > −1.
Z b
lim f (x)dx
a→−∞ a
Remark 4.8
Z ∞
To make the integral f (x)dx well defined when it is convergent, we
−∞
need to check that the right hand side of (4.10) does not depend on the
point c. In fact, we can show that if there is a real number c0 so that both
the improper integrals
Z c0 Z ∞
f (x)dx and f (x)dx
−∞ c0
lim F (x)
x→∞
Example 4.31
Let p be any realZnumber. Determine those values of p for which the
∞
1
improper integral dx is convergent. Find the value of the improper
1 xp
integral when it is convergent.
Solution
For a fixed real number p, define the function F : [1, ∞) → R by
Z x
1
F (x) = p
dx.
1 u
Chapter 4. Integrating Functions of a Single Variable 338
Then
ln x,
if p = 1,
F (x) = x1−p − 1
, if p ̸= 1.
1−p
From this, we see that the limit lim F (x) exists if and only if p > 1. Hence,
Z ∞ x→∞
1
the improper integral dx is convergent if and only if p > 1, and
1 xp
Z ∞
1 1
p
dx = , p > 1.
1 x p−1
Example 4.32
Determine whether the improper integral is convergent. If yes, find the
value of the integral.
Z ∞
1
(a) dx
0 1 + x2
Z 0
(b) ex dx
−∞
Z ∞
x
(c) dx
−∞ x2 +1
Solution
d 1
(a) Since tan−1 x = , we find that
dx 1 + x2
Z b
1
2
dx = tan−1 b − tan−1 0 = tan−1 b.
0 1+x
Since
π
lim tan−1 b = ,
b→∞ 2
Z ∞
1
the improper integral dx is convergent and its value is
0 1 + x2
Chapter 4. Integrating Functions of a Single Variable 339
Z ∞ Z b
1 1 −1 π
dx = lim dx = lim tan b = .
0 1 + x2 b→∞ 0 1 + x2 b→∞ 2
(b) Since ea → 0 as a → −∞, we have
Z 0 Z 0
x
e dx = lim ex dx = lim (1 − ea ) = 1.
−∞ a→−∞ a a→−∞
Z 0
The improper integral ex dx is convergent and is equal to 1.
−∞
Since
d 2x
ln(1 + x2 ) = ,
dx 1 + x2
we find that Z b
x 1
2
dx = ln(1 + b2 ).
0 1+x 2
But
lim ln(1 + b2 ) = ∞.
b→∞
Z ∞
x
Hence, the improper integral 2
dx is divergent. So, the
Z ∞ 0 x +1
x
improper integral 2
dx is also divergent.
−∞ x + 1
Z ∞
One is tempted to define the improper integral f (x)dx as
−∞
Z a
lim f (x)dx
a→∞ −a
x
if it exists. For part (c) in the example above, f (x) = is an odd function.
Z a 1 + x2
x
Thus, 2
dx = 0 for any a, and so
−a 1 + x
Z a
x
lim dx = 0.
a→∞ −a 1 + x2
Chapter 4. Integrating Functions of a Single Variable 340
results, such as that the integral xdx is convergent. Nevertheless, the limit
−∞
Z a
lim f (x)dx,
a→∞ −a
if
Z ∞it exists, has some applications. It is called the Cauchy principal value of
f (x)dx.
−∞
Proposition 4.34
Z ∞
If the improper integral f (x)dx is convergent, then its Cauchy
−∞
principal value exists, and is equal to the improper integral. Namely,
Z ∞ Z ∞
P.V. f (x)dx = f (x)dx.
−∞ −∞
Chapter 4. Integrating Functions of a Single Variable 341
Proof
Z ∞
If the improper integral f (x)dx is convergent, then the limits
−∞
Z 0 Z b
lim f (x)dx and lim f (x)dx
c→−∞ c b→∞ 0
exists and
Z ∞ Z 0 Z b
f (x)dx = lim f (x)dx + lim f (x)dx.
−∞ c→−∞ c b→∞ 0
are convergent.
Another natural question to ask is whether one can determine whether an
improper integral is convergent without explicitly computing the integral. There
are some partial solutions to this.
Z ∞ Z ∞
For instance, the integral f (x)dx is divergent if the integral f (x)dx
0 1
is divergent.
The next proposition says that linear combination of convergent integrals must
be convergent.
convergent, then for any constants α and β, the improper integral (αf +
I
βg) is also convergent, and
Z Z Z
(αf + βg) = α f + β g.
I I I
This follows easily from limit laws. Now we want to prove some comparison
theorems for improper integrals. We start with integrals of nonnegative functions.
If a function f is nonpositive, one just consider the function −f , which is then
nonnegative.
Lemma 4.36
Let I be an interval. Given that f : I → R is a nonnegative function that is
bounded and Riemann integrable on any closed and bounded intervals that
are contained in I. Fixed x0 in I and define the function F : I → R by
Z x
F (x) = f (u)du.
x0
Z
1. If I = (a, b] or I = (−∞, b], then the integral f (x)dx is convergent
I
if and only if the function F (x) is bounded below.
Z
2. If I = [a, b) or I = [a, ∞), then the integral f (x)dx is convergent if
I
and only if the function F (x) is bounded above.
Chapter 4. Integrating Functions of a Single Variable 343
Proof
Notice that since f (u) ≥ 0 for all u ∈ I, for any x1 and x2 in I, if x1 < x2 ,
then Z x2
F (x2 ) − F (x1 ) = f (u)du ≥ 0.
x1
1. If I = (a, b] or I = (−∞, b], the limit lim+ F (x) or the limit lim F (x)
x→a x→−∞
exists if and only if F (x) is bounded below.
2. If I = [a, b) or I = [a, ∞), the limit lim− F (x) or the limit lim F (x)
x→b x→∞
exists if and only if F (x) is bounded above.
Z ∞
In Proposition 4.34, we have stated that if the improper integral f (x)dx is
Z ∞ −∞
convergent, then the Cauchy principal value P.V. f (x)dx exists. The converse
−∞
is true if the function f : R → R is nonnegative.
Theorem 4.37
Let f : R → R be a nonnegative function that is bounded and Riemann
integrable on any closed and bounded intervals. The improper integral
Z ∞
f (x)dx is convergent if and only if the Cauchy principal value
−∞Z
∞
P.V. f (x)dx exists. Moreover,
−∞
Z ∞ Z ∞
f (x)dx = P.V. f (x)dx.
−∞ −∞
Proof
Z ∞
We just need to show that if the Cauchy principal value P.V. f (x)dx
Z ∞ −∞
Z ∞
Assume that the Cauchy principal value P.V. f (x)dx exists and is equal
−∞
to I. As in the proof of Lemma 4.38, the function
Z x
F (x) = f (u)du
0
−a ≤ b ≤ c ≤ a.
Hence, Z c Z a
F (c) − F (b) = f (x)dx ≤ f (x)dx ≤ I.
b −a
Proof
Notice that the second statement is the contrapositive of the first statement.
Hence, we only need to prove the first statement. Fixed x0 in the interval I,
and define
Z x Z x
F (x) = f (u)du, G(x) = g(u)du.
x0 x0
If x > x0 ,
0 ≤ F (x) ≤ G(x).
Therefore, G is bounded above implies F is bounded above. If x < x0 ,
Z x0 Z x0
F (x) = − f (u)du, G(x) = − g(u)du.
x x
Since Z x0 Z x0
0≤ f (u)du ≤ g(u)du,
x x
we find that
0 ≥ F (x) ≥ G(x).
Therefore, G is bounded below implies that F is bounded below. The
assertions about the convergence of the integrals then follow from Lemma
4.36.
Example 4.33
Z ∞
x
We can show that the integral dx is divergent without explicitly
0 x2 + 1
computing the integral. Notice that for x ≥ 1,
1 x
0≤ ≤ 2 .
2x x +1
Z ∞ Z ∞
1 x
Since the integral dx is divergent, the integral 2
dx is also
1 x Z ∞ 1 x +1
x
divergent. Hence, the integral 2
dx is divergent.
0 x +1
Chapter 4. Integrating Functions of a Single Variable 346
Example 4.34
Z ∞
1
Determine whether the improper integral √ dx is convergent.
0 x(x + 1)
Solution
We determine the convergence of the two improper integrals
Z 1 Z ∞
1 1
√ dx and √ dx
0 x(x + 1) 1 x(x + 1)
Z 1
Since the integral x−p dx is convergent when p < 1, while the integral
Z ∞ 0 Z ∞
−p
x dx is convergent if p > 1, x−p dx is not convergent for any values of
1 0
p. Hence, to determine the convergence of the integral in the example above, we
need to split the integral into two parts and compare to different g(x) = x−p . For
x → 0+ , we ignore the part 1/(x + 1) which has a finite limit. For x → ∞, the
leading term of 1/(x + 1) is 1/x. This is how we identify the correct values of p
to compare to.
Theorem 4.38 provides a useful strategy to determine the convergence of an
Chapter 4. Integrating Functions of a Single Variable 347
integral in the case that the function is nonnegative. For a function that can take
both positive and negative values, we need other strategies.
Theorem 4.39
Let I be an interval. Assume that f : I → R is a function that is
bounded and Riemann integrable on any closed
Z and bounded intervals that
are contained in I. If the improper integral |f (x)|dx is convergent, then
Z I
Proof
Define the functions f+ : I → R and f− : I → R by
In other words,
f (x), if f (x) ≥ 0,
f+ (x) =
0, if f (x) < 0,
−f (x), if f (x) ≤ 0,
f− (x) =
0, if f (x) > 0.
Notice that f+ and f− are nonnegative functions, and
Example 4.35
Z ∞
sin x
Show that the improper integral dx is convergent.
1 x2
Solution
For any x ≥ 1,
sin x 1
≤ .
x2 x2
Z ∞ Z ∞
1 sin x
Since the integral dx is convergent, the integral dx is
1 x2 1 x2
convergent.
There are some important special functions in mathematics and physics which
are defined in terms of improper integrals. One such function is the gamma
function, which students have probably seen in probability theory. In fact, gamma
function is ubiquitous in mathematics.
Chapter 4. Integrating Functions of a Single Variable 349
Example 4.36
Z ∞
Let s be a real number. Show that the improper integral ts−1 e−t dt is
0
convergent if and only if s > 0.
Solution
Z 1 Z ∞
s−1 −t
We split the integral into the two integrals t e dt and ts−1 e−t dt.
0 1
Notice that
Therefore, there is a number t0 > 1 such that for all t ≥ t0 , ts−1 e−t/2 ≤ 1.
Now the function
g(t) = ts−1 e−t/2
is continuous on the interval [0, t0 ]. Hence, it is bounded on [0, t0 ]. These
imply that there is a number M ≥ 1 such that
Hence,
0 ≤ ts−1 e−t ≤ M e−t/2 for all t ≥ 1.
Z ∞ Z ∞
−t/2
Since the integral e dt is convergent, the integral ts−1 e−t dt is
1 1
convergent. Z ∞
Hence, the integral ts−1 e−t dt is convergent if and only if s > 0.
0
Chapter 4. Integrating Functions of a Single Variable 350
When s > 0, using integration by parts with u(t) = ts and v(t) = −e−t ,
we have
Z b
Γ(s + 1) = lim+ ts e−t dt
a→0 a
b→∞
s −t b
Z b
s−1 −t
= lim+ −t e a + s t e dt
a→0 a
b→∞
= sΓ(s).
Γ(n + 1) = n!.
Hence, Z ∞
1 √
Γ = t−1/2 e−t dt = π.
2 0
In the future, we are going to explore more about the gamma function. For
example, we will prove the useful formula for the beta integral, which says
that if α > 0, β > 0,
Z 1
Γ(α)Γ(β)
tα−1 (1 − t)β−1 dt = .
0 Γ(α + β)
Exercises 4.6
Question 1
Z ∞
Let a be a positive real number. Show that the integral e−ax dx is
0
convergent and find its value.
Question 2
Z ∞
2
Let n be a positive integer. Find the value of the integral xn e−x dx.
0
Question 3
Explain why the given integral is an improper integral, and determine
whether it is convergent. If yes, find the value of the integral.
Z 0
x
(a) √ dx
−3 9 − x2
Z 1
√
(b) x ln xdx
0
Z 2
dx
(c)
0 (x − 1)2
Question 4
Determine whether the improper integral is convergent. If yes, find its
value.
Z ∞ √
x
(a) dx
0 x+1
Z ∞
ln x
(b) dx
1 x2
Chapter 4. Integrating Functions of a Single Variable 353
Question 5
Determine whether the improper integral is convergent.
Z ∞
1
(a) √ dx
0 ( x + 1)2
Z 1 x
e
(b) √ dx
0 x
Z 2π
sin x
(c) dx
0 x3/2
Z ∞
x3
(d) 2 2
dx
−∞ (x + x + 1)
Chapter 5. Infinite Series of Numbers and Infinite Products 354
Chapter 5
Theorem 5.1
Let {an } be a sequence of real numbers.
1. The sequence {an } is not bounded above if and only if there is a strictly
increasing subsequence {ank } such that lim ank = ∞.
k→∞
2. The sequence {an } is not bounded below if and only if there is a strictly
decreasing subsequence {ank } such that lim ank = −∞.
k→∞
Chapter 5. Infinite Series of Numbers and Infinite Products 355
Proof
It is sufficient to prove the first statement. If there is a subsequence {ank }
of {an } such that lim ank = ∞, it is obvious that {an } is not bounded
k→∞
above.
Conversely, given that {an } is not bounded above, we want to construct a
strictly increasing subsequence {ank } such that lim ank = ∞. Let n1 = 1.
k→∞
Since {an } is not bounded above, there is a n2 > 1 such that an2 ≥ an1 + 1.
Assume we have found n1 , n2 , . . . , nk−1 , such that
and
anj+1 ≥ anj + 1, for all 1 ≤ j ≤ k − 2.
Since {an } is not bounded above, there is an nk > nk−1 such that ank ≥
ank−1 + 1. This constructs a strictly increasing sequence {ank } inductively
which satisfies
ank ≥ an1 + k − 1.
Associated with a given sequence {an }, we can define two sequences {bn }
and {cn }.
Definition 5.1
Given a sequence {an }, we can define two sequences {bn } and {cn } as
follows. For each positive integer n,
Example 5.1
1
For the sequence {an } with an = ,
n
1
bn = 0, cn = for all n ≥ 1.
n
Example 5.2
bn = n, cn = ∞ for all n ≥ 1.
Example 5.3
The following are obvious from the definitions and Theorem 5.1.
Proposition 5.2
Notice that using extended definitions of infimum and supremum, the limit
infimum and limit supremum of a sequence always exist, either as a finite number,
or ±∞.
Example 5.4
1
1. For the sequence {an } with an = defined in Example 5.1,
n
lim inf an = 0, lim sup an = 0.
n→∞ n→∞
Since
bn ≤ c n for all n ∈ Z+ ,
we obtain the following immediately.
Proposition 5.3
Proposition 5.4
Let {un } and {vn } be sequences of real numbers. If un ≤ vn for all positive
integers n, then
Example 5.5
Find lim inf an and lim sup an for the sequence {an } defined by
n→∞ n→∞
n 1
an = (−1) 1 + .
n
Solution
Notice that for any n ≥ 1,
1 1
a2n−1 = −1 − , a2n = 1 + .
2n − 1 2n
Chapter 5. Infinite Series of Numbers and Infinite Products 359
We observe that
The sequence {a2n−1 } increases to −1, while the sequence {a2n } decreases
to 1. Therefore,
1 1
b2n = b2n+1 = −1 − , c2n−1 = c2n = 1 + .
2n + 1 2n
It follows that
Example 5.6
Let {an } be the sequence defined by an = (−1)n n. Find lim inf an and
n→∞
lim sup an .
n→∞
Solution
For any n ≥ 1,
The sequence {an } is not bounded below nor bounded above. Therefore,
bn = −∞, cn = ∞.
It follows that
For a monotoic sequence, it is easy to find its limit inferior and limit superior.
Chapter 5. Infinite Series of Numbers and Infinite Products 360
Theorem 5.5
Let {an } be a monotonic sequence.
In other words, for monotonic sequence, the limit inferior, limit superior, and
the limit are all the same.
In fact, if a sequence {an } has a finite limit, then its limit inferior, limit
superior and limit are all the same.
Theorem 5.6
Let {an } be a sequence, and let a be a finite number. Then the following
two statements are equivalent.
(a) lim an = a.
n→∞
Proof
For a positive integer n, define bn = inf ak , cn = supk≥n an . Then
k≥n
By definition,
b n ≤ an ≤ c n .
Hence, (b) implies (a) follows from squeeze theorem.
Chapter 5. Infinite Series of Numbers and Infinite Products 361
Now we prove that (a) implies (b). Given ε > 0, since lim an = a, there
n→∞
is a positive integer N such that for all n ≥ N ,
ε ε
a− < an < a + .
2 2
Hence, for all n ≥ N ,
ε ε ε ε
a− ≤ bn ≤ a + and a− ≤ cn ≤ a + .
2 2 2 2
These prove that for all n ≥ N ,
Thus,
lim inf an = lim sup an = a.
n→∞ n→∞
Hence, we are left to consider sequences which does not have a finite limit.
Let us first characterize when the limit inferior and the limit superior of a sequence
can be −∞ or ∞.
Theorem 5.7
Let {an } be a sequence of real numbers. Then the following three
statements are equivalent.
(c) There is a strictly increasing subsequence {ank } such that lim ank =
k→∞
∞.
Proof
Notice that lim sup an = lim cn , where cn = sup ak . Since {cn } is a
n→∞ n→∞ k≥n
decreasing sequence, lim cn = ∞ if and only if cn = ∞ for all n ≥ 1.
n→∞
Hence, this theorem follows from Theorem 5.1 and Proposition 5.2.
Chapter 5. Infinite Series of Numbers and Infinite Products 362
Theorem 5.8
Let {an } be a sequence of real numbers. Then the following three
statements are equivalent.
(c) There is a strictly decreasing subsequence {ank } such that lim ank =
k→∞
−∞.
Theorem 5.9
Let {an } be a sequence of real numbers.
Proof
Notice that if lim sup an = −∞, we must have lim inf an = −∞.
n→∞ n→∞
Similarly, if lim inf an = ∞, it is necessary that lim sup an = ∞.
n→∞ n→∞
It is enough for us to prove the second statement. Let bn = inf ak . Notice
k≥n
that bn ≤ an . If lim inf an = lim bn = ∞, we must have lim an = ∞.
n→∞ n→∞ n→∞
Conversely, assume that lim an = ∞. Given M > 0, there is a positive
n→∞
integer N such that
an ≥ M for all n ≥ M.
Example 5.7
Consider the sequence {an } with an = n + (−1)n . The first few terms
are given by 0, 3, 2, 5, 4, 7, . . .. This sequence is neither increasing nor
decreasing. For any n ≥ 1,
Combining Theorem 5.7, Theorem 5.8 and Theorem 5.9, we can summarize
the cases where the limit inferior or the limit superior is −∞ or ∞.
3. lim inf an = −∞ and lim sup an = ∞ if and only if {an } is not bounded
n→∞ n→∞
above nor bounded below.
4. −∞ < lim inf an < ∞ and lim sup an = ∞ if and only if {an } is
n→∞ n→∞
bounded below but not bounded above.
5. lim inf an = −∞ and −∞ < lim sup an < ∞ if and only if {an } is
n→∞ n→∞
bounded above but not bounded below.
The following gives a relation of limit inferior and limit superior with limits
of subsequences.
Chapter 5. Infinite Series of Numbers and Infinite Products 364
Theorem 5.10
Let {an } be sequence with
b ≤ ℓ ≤ c.
Proof
For every positive integer n, let
bn = inf ak , cn = sup an .
k≥n k≥n
Then
b = lim bn , c = lim cn .
n→∞ n→∞
we find that
b ≤ ℓ ≤ c.
Now we turn to the case of finite limit superior and finite limit inferior. We
have the following equivalence.
Chapter 5. Infinite Series of Numbers and Infinite Products 365
Theorem 5.11
Let {an } be a sequence of real numbers. Then the following two statements
are equivalent.
(i) there exists a positive integer N such that for all n ≥ N , an <
c + ε; and
(ii) for every positive integer N , there exists an integer n ≥ N , such
that an > c − ε.
Proof
For a positive integer n, define cn = sup an , so that lim sup an = lim cn .
k≥n n→∞ n→∞
Let us first prove that (a) implies (b). Given ε > 0, since lim cn = c, there
n→∞
is a positive integer N such that for all n ≥ N , |cn − c| < ε. If n ≥ N , we
find that
an ≤ sup ak = cn < c + ε.
k≥n
This proves (b)(i). Now given ε > 0, there is a positive integer N0 such
that c − ε < cn < c + ε for all n ≥ N0 . For any positive integer N , let
N ′ = max{N, N0 }. Then N ′ ≥ N0 . Hence,
sup ak = cN ′ > c − ε.
k≥N ′
c − ε < cn < c + ε.
Theorem 5.12
Let {an } be a sequence of real numbers. Then the following two statements
are equivalent.
(i) there exists a positive integer N such that for all n ≥ N , an >
b − ε; and
(ii) for every positive integer N , there exists an integer n ≥ N , such
that an < b + ε.
The following theorem says that the limit inferior and the limit superior are
limits of subsequences.
Theorem 5.13
Let {an } be a sequence.
Proof
It is sufficient for us to prove the first statement. We use (b)(ii) of Theorem
5.11. Take ε = 1 and N = 1. There is an integer n1 ≥ 1 such that
an1 > c − 1. Suppose we have chosen n1 , n2 , . . . , nk−1 such that n1 <
n2 < . . . < nk−1 and
1
anj > c − for all 1 ≤ j ≤ k − 1.
j
lim ank = c.
k→∞
Theorem 5.7, Theorem 5.8, Theorem 5.10 and Theorem 5.13 give
characterization of limit superior and limit inferior as follows.
1. The sequence {an } is not bounded above if and only if lim sup an = ∞,
n→∞
if and only if there is a strictly increasing subsequence {ank } such that
lim ank = ∞.
k→∞
2. The sequence{an } is not bounded below if and only if lim inf an = −∞,
n→∞
if and only if there is a strictly decreasing subsequence {ank } such that
lim ank = −∞.
k→∞
Chapter 5. Infinite Series of Numbers and Infinite Products 368
5. The limit of any subsequence {ank } must be between lim inf an and
n→∞
lim sup an .
n→∞
Example 5.8
Find lim inf an and lim sup an for the sequence {an } defined by
n→∞ n→∞
2πn
an = sin .
5
Solution
For any n ≥ 1,
2π 4π
a5n = 0, a5n+1 = sin , a5n+2 = sin ,
5 5
6π 8π
a5n+3 = sin, a5n+4 = sin .
5 5
Hence, the limit of a convergent subsequence of {an } can and can only be
2π 4π 6π 8π
0, sin , sin , sin , sin .
5 5 5 5
Since
8π 6π 4π 2π
sin < sin < 0 < sin < sin ,
5 5 5 5
we find that
8π 2π
lim inf an = sin , lim sup an = sin .
n→∞ 5 n→∞ 5
Chapter 5. Infinite Series of Numbers and Infinite Products 369
Exercises 5.1
Question 1
Find lim inf an and lim sup an for the sequence {an }, where
n→∞ n→∞
2n
an = .
n+1
Question 2
Find lim inf an and lim sup an for the sequence {an }.
n→∞ n→∞
2n
(a) an = (−1)n
2n + 1
2n + 1
(b) an = (−1)n
2n
Question 3
Find lim inf an and lim sup an for the sequence {an }.
n→∞ n→∞
(b) an = n − 2(−1)n n
Question 4
Find lim inf an and lim sup an for the sequence {an } defined by
n→∞ n→∞
2πn
an = cos .
9
Question 5
Prove or disprove: Given two sequences {an } and {bn },
In this section, we consider infinite series and its convergence. A series is a sum
of the form ∞
X
an = a1 + a2 + · · · + an + · · · ,
n=1
where {an }∞
n=1 is an infinite sequence. Sometimes a series might start with the
n = 0 term. Since a series is an infinite sum, we need to study whether the sum
makes sense. The natural thing to do is to define it using limits.
For convenience, we will deal with series that starts with the n = 1 term in
this chapter. When necessary, we will explain what changes need to be made if
the series starts with the n = 0 term.
n
X
sn = ak .
k=1
We say that the series is convergent or has a finite sum, if the sequence
{sn } has a finite limit. Otherwise, we say that the series is divergent. If the
series is convergent, we define its sum by
∞
X n
X
an = s = lim sn = lim ak .
n→∞ n→∞
n=1 k=1
∞
X
If the infinite series an starts with the n = 0 term, we still define its nth
n=0
partial sum by
n
X
sn = ak when n ≥ 0.
k=0
Chapter 5. Infinite Series of Numbers and Infinite Products 371
∞
X
The convergence of a series an is not affected by a finite number
n=1
∞
X
of terms in the series. Given a positive integer n0 , the series an is
n=n0
∞
X
convergent if and only if the series an is convergent. In case they are
n=1
convergent,
∞
X ∞
X 0 −1
nX
an − an = an .
n=1 n=n0 n=1
Example 5.9
∞
X 1 1
For the series n
, an = n , and the nth partial sum is
n=1
2 2
1 1 1 1
sn = + 2 + ··· + n = 1 − n.
2 2 2 2
∞
1 X 1
Since lim n = 0, we find that lim sn = 1. Hence, the series is
n→∞ 2 n→∞
n=1
2n
convergent and
∞
X 1
n
= 1.
n=1
2
Solution
The nth partial sum of the series is
1 1
sn = 1 + + ··· + .
2 n
Chapter 5. Infinite Series of Numbers and Infinite Products 372
∞
X ∞
X ∞
X
(αan + βbn ) = α an + β bn .
n=1 n=1 n=1
Let us look at a simple criteria that can be used to conclude that a series is
divergent.
Theorem 5.15
∞
X
If a series an is convergent, then lim an = 0. Equivalently, if
n→∞
n=1
∞
X
lim an ̸= 0, then the series an is divergent.
n→∞
n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 373
Proof
∞
X
We just need to prove the first statement. If an is convergent, the
n=1
sequence of partial sums {sn } converges to a number s. Notice that
an = sn − sn−1 , where s0 = 0 by default. Therefore,
∞
X
When one is determining the convergence of a series an , it is always good
n=1
to start with checking whether the limit lim an is zero.
n→∞
Example 5.11
∞
X
The series (−1)n is divergent since the limit lim (−1)n does not exist.
n→∞
n=1
Example 5.12
Determine the convergence of the series
∞ n
X 1
1+ .
n=1
n
Solution
Since n
1
lim 1 + = e ̸= 0,
n→∞ n
the series ∞ n
X 1
1+
n=1
n
is divergent.
The geometric series is a series which we can find the partial sums explicitly.
It is useful for comparisons.
Chapter 5. Infinite Series of Numbers and Infinite Products 374
∞
X 1
rn = when |r| < 1.
n=0
1−r
Proof
∞
X
n
When |r| ≥ 1, the limit lim r is not 0. Thus, the series rn is divergent.
n→∞
n=0
When |r| < 1, the nth partial sum is
1 − rn+1
sn = 1 + r + r 2 + · · · + r n = .
1−r
1 − rn+1 1
lim sn = lim = .
n→∞ n→∞ 1 − r 1−r
∞
X
Hence, the series rn is convergent when |r| < 1, and
n=0
∞
X 1
rn = when |r| < 1.
n=0
1−r
∞
X
arn−1 , where a is
A general geometric series is a series of the form
n=1
the first term of the series. It is convergent if and only if |r| < 1.
∞
X
If all the terms an in the series an are nonnegative, we notice that the
n=1
partial sums {sn } form an increasing sequence. For an increasing sequence, we
have the monotone convergence theorem. Applying to the sequence of partial
sums, we have the following.
Chapter 5. Infinite Series of Numbers and Infinite Products 375
Theorem 5.17
∞
X
If an ≥ 0 for all n ≥ 1, then the series an is convergent if and only if
n=1
the sequence of partial sums {sn } is bounded above.
(i) f is continuous.
Proof
Since f (x) decreases to 0 monotonically, f (x) ≥ 0 for all x ≥ 1, and so
{an } is a nonnegative decreasing sequence with lim an = 0. Let sn = a1 +
n→∞ Z x
th
a2 +· · ·+an be the n partial sum of the series, and let F (x) = f (u)du
1
when x ≥ 1.
Chapter 5. Infinite Series of Numbers and Infinite Products 376
This gives
F (n + 1) ≤ sn ≤ a1 + F (n). (5.1)
Z ∞
If the improper integral f (x)dx is convergent, {F (n)} is bounded
1
above by a number M . Therefore,
s n ≤ a1 + M for all n ≥ 1,
∞
X
and so the sequence {sn } is bounded above. Therefore, the series an is
n=1
convergent. Z ∞
If the improper integral f (x)dx is divergent, lim F (n + 1) = ∞. By
1 n→∞
(5.1), we find that the sequence {sn } is not bounded above. Thus, the series
X∞
an is divergent.
n=1
Let us now use the integral test to determine the convergence of the p-series.
Proof
Define the function f : [1, ∞) → R by
1
f (x) = .
xp
Then f is a continuous function that Z decreases monotonically to 0. By
∞
1
Example 4.30, the improper integral dx is convergent if and only if
1 xp
∞
X 1
p > 1. By integral test, the series p
is convergent if and only if p > 1.
n=1
n
Example 5.13
∞
X 1 1
The series √ is divergent since it is a p-series with p = 2
≤ 1.
n=1
n
Chapter 5. Infinite Series of Numbers and Infinite Products 378
tn + f (n + 1) ≤ sn ≤ a1 + tn .
∞
X
If we use sn to approximate the sum s = an , the error is
n=1
∞
X
s − sn = ak .
k=n+1
0.577215664901532
Now we return to the comparison test. Using Theorem 5.17, we obtain the
following test for nonnegative series.
Chapter 5. Infinite Series of Numbers and Infinite Products 380
0 ≤ an ≤ b n for all n ≥ 1.
∞
X ∞
X
1. If bn is convergent, an is convergent.
n=1 n=1
∞
X ∞
X
2. If an is divergent, bn is divergent.
n=1 n=1
Proof
Let sn = a1 + . . . + an and tn = b1 + . . . + bn be respectively the nth partial
X∞ X∞
sums of the series an and bn . Then {sn } and {tn } are increasing
n=1 n=1
sequences and
sn ≤ tn .
∞
X
1. If bn is convergent, the sequence {tn } is bounded above. Then the
n=1
∞
X
sequence {sn } is also bounded above. Hence, an is convergent.
n=1
∞
X
2. If an is divergent, the sequence {sn } is not bounded above. Then the
n=1
∞
X
sequence {tn } is also not bounded above. Hence, bn is divergent.
n=1
Example 5.15
Determine the convergence of the series
∞
X 2n
.
n=1
3n − 1
Chapter 5. Infinite Series of Numbers and Infinite Products 381
Solution
For n ≥ 1,
1
3n − 1 ≥ × 3n .
2
Therefore,
2n 2n+1
≤ .
3n − 1 3n
Since the series ∞ ∞
X 2n+1 X 2n
=2
n=1
3n n=1
3n
is a geometric series with r = 2/3, it is convergent. By comparison test,
∞
X 2n
the series is convergent.
n=1
3n − 1
Example 5.16
Determine the convergence of the series
∞
X n
√ .
n=1
n n+1
Solution
For n ≥ 1,
n n 1
√ ≥ √ √ = √ .
n n+1 n n+n n 2 n
∞
X 1
Since the series √ is a p-series with p = 1/2 ≤ 1, it is divergent.
n=1
n
∞
X 1
So the series √ is also divergent. By comparison test, the series
n=1
2 n
∞
X n
√ is divergent.
n=1
n n+1
∞
X 2n
and the denominator of an when n is large. Since we know that the series
3n n=1
is convergent, we need to prove that an is up to a constant, less than or equal to
∞
X
bn , in order to use the comparison test to conclude that an is convergent.
n=1
n n
Simiarly, for Example 5.16, we compare an = √ to bn = √ since
√ n n+1 n n
n and n n are respectively the leading terms of the numerator and denominator
X∞ X∞
of an . Since bn is divergent, so we want to conclude that an is divergent.
n=1 n=1
For this, we need to show that an is larger than a constant times bn .
Proving strict inequalities is tedious, and we see that it might not be necessary.
In fact, we obtain the series to compare to by investigating the leading terms.
This is somehow a limit. Hence, we can replace the comparison test by limit
comparison test.
an
Since ≥ 0, we must have L ≥ 0.
bn
∞
X ∞
X
1. If L = 0, and the series bn is convergent, then the series an is
n=1 n=1
convergent.
∞
X ∞
X
2. If L > 0, the series an is convergent if and only if the series bn
n=1 n=1
is convergent.
The condition (ii) says that when n is large, an is smaller than or equal to a
multiple of bn .
Chapter 5. Infinite Series of Numbers and Infinite Products 383
Proof
First consider the case where L = 0. By definition of limit with ε = 1,
there is a positive integer N such that for all n ≥ N ,
an
< 1.
bn
Therefore,
0 ≤ an ≤ b n
for all n ≥ N.
∞
X X∞
Since the series bn is convergent, the series bn is convergent. Then
n=1 n=N
∞
X
comparison test implies that the series an is convergent. Therefore, the
n=N
∞
X
series an is convergent.
n=1
Now for the case L > 0, take ε = L/2. There is a positive integer N such
that for all n ≥ N ,
an L
−L < .
bn 2
This implies that
L 3L
0≤ b n ≤ an ≤ bn for all n ≥ N.
2 2
∞
X
Comparison test then shows that the series an is convergent if and only
n=1
∞
X
if the series bn is convergent.
n=1
Then
an 1
lim = lim = 1.
n→∞ bn n→∞ 1
1−
3n
∞ ∞
X 2n X 2n
Since the series is convergent, the series is convergent.
n=1
3n n=1
3n − 1
Let us now turn to series that can have negative terms. First we formulate
a Cauchy criterion for convergence of series. Recall that a sequence {sn } is a
Cauchy sequence if for every ε > 0, there is a positive integer N so that for all
m ≥ n ≥ N,
|sm − sn | < ε.
Applying the Cauchy criterion for convergence of sequences (see Theorem 1.43),
and the fact that if m ≥ n > 1,
sm − sn−1 = an + an+1 + · · · + am ,
Theorem 5.23
∞
X ∞
X
If the series |an | is convergent, then the series an is convergent.
n=1 n=1
Proof
∞
X
Given ε > 0, since the series |an | is convergent, there is a positive
n=1
integer N such that for all m ≥ n ≥ N ,
∞
X
The converse of Theorem 5.23 is not true. Namely, there exists series an
n=1
∞
X
which is convergent but the corresponding absolute series |an | is not convergent.
n=1
Therefore, let us make the following definitions.
Chapter 5. Infinite Series of Numbers and Infinite Products 386
∞
X ∞
X
1. We say that the series an converges absolutely if the series |an |
n=1 n=1
is convergent.
∞
X
2. We say that the series an converges conditionally if the series
n=1
∞
X
|an | is divergent.
n=1
Example 5.19
∞
X (−1)n
If p > 1, the series converges absolutely.
n=1
np
Example 5.20
Show that the series ∞
X 2n + (−1)n 3n
n=1
5n + 1
is convergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 387
Solution
Let
2n + (−1)n 3n 3n
an = , bn = n .
5n + 1 5
X∞
Then bn > 0 for all n ∈ Z+ and bn is convergent. Now,
n=1
n
2
n
1 + (−1)
|an | 3
lim = lim = 1.
n→∞ bn n→∞ 1
1+ n
5
∞
X 2n + (−1)n 3n
Therefore, the series converges absolutely, and thus is
n=1
5n + 1
convergent.
Example 5.21
The series
1 1 1
1− + − + ···
2 3 4
is an alternating series.
∞
X
A necessary condition for an alternating series (−1)n−1 bn to be convergent
n=1
is lim bn = 0. The following theorem says that if {bn } is also decreasing, then
n→∞
the alternating series is convergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 388
Proof
Since {bn } decreases monotonically to 0, bn ≥ 0 for all n ∈ Z+ . Let
∞
X
an = (−1)n−1 bn be the nth term of the series (−1)n−1 bn , and let sn =
n=1
a1 + a2 + · · · + an be the nth partial sum. We are given that
b1 ≥ b2 ≥ · · · ≥ bn ≥ bn+1 · · · .
Therefore,
exist. Since
−b2n = a2n = s2n − s2n−1 ,
taking the n → ∞ limits give
so = se .
Chapter 5. Infinite Series of Numbers and Infinite Products 389
This proves that the sequence {sn } has a limit s = s0 = se , and thus the
∞
X
alternating series (−1)n−1 bn is convergent.
n=1
∞
X
Notice that the sum of the alternating series s = (−1)n−1 bn is the least
n=1
upper bound of {s2n }, and the greatest lower bound of {s2n−1 }.
n
X
sn = (−1)k−1 bk
k=1
|s − sn | ≤ |bn+1 |.
Example 5.22
For the alternating series
∞
X (−1)n−1 1 1 1
=1− + − + ···
n=1
n 2 3 4
1
in Example 5.21, bn = . Since {bn } decreases monotonically to 0, by the
n
alternating series test, the series
Chapter 5. Infinite Series of Numbers and Infinite Products 390
∞
X (−1)n−1 1 1 1
=1− + − + ···
n=1
n 2 3 4
∞
X 1
is convergent. Since the harmonic series is divergent, the series
n=1
n
∞
X (−1)n−1
converges conditionally.
n=1
n
Example 5.23
Now we turn to two useful tests that are used for testing convergence of power
series. They both based on comparisons with geometric series. We first prove the
following.
Theorem 5.26
Let {an } be a sequence of positive numbers. Then
an+1 √ √ an+1
lim inf ≤ lim inf n an ≤ lim sup n an ≤ lim sup .
n→∞ an n→∞ n→∞ n→∞ an
an+1 √
Hence, if the limit lim exists, the limit lim n an also exists, and the
n→∞ an n→∞
two limits are equal.
Since an > 0 for all n ∈ Z+ , all the four limits in the theorem are nonnegative.
Proof
If {cn } is a sequence of postive numbers, it is easy to verify that
1 1 1 1
sup = , inf = .
cn inf{cn } cn sup{cn }
Chapter 5. Infinite Series of Numbers and Infinite Products 391
∞
X
1. If ρ < 1, the series an converges absolutely.
n=1
∞
X
2. If ρ > 1, the series an is divergent.
n=1
Proof
1−ρ
If ρ < 1, take ε = in (b)(i) of Theorem 5.11. There is a positive
2
integer N such that
p
n
|an | < ρ + ε = ρ1 for all n ≥ N.
Thus, we have
|an | < ρn1 for all n ≥ N.
Notice that
1+ρ
ρ1 =
< 1.
2
X
Therefore, the geometric series ρn1 is convergent. By comparison
n=N
∞
X ∞
X
test, the series |an | is convergent. Thus, the series an converges
n=1 n=1
absolutely.
ρ−1
If ρ > 1, take ε = in (b)(ii) of Theorem 5.11. There are positive
2
integers n1 , n2 , . . . such that 1 ≤ n1 < n2 < . . . and
q
nk
|ank | > ρ − ε = ρ2 for all k ∈ Z+ .
Chapter 5. Infinite Series of Numbers and Infinite Products 393
Thus, we have
|ank | > ρn2 k for all k ∈ Z+ . (5.3)
Since
1+ρ
ρ2 =
> 1,
2
and nk → ∞ as k → ∞, we find that lim ρn2 k = ∞. In other words, the
k→∞
sequence {ρn2 k } is not bounded above. Eq. (5.3) then implies that {|ank |}
is also not bounded above. Therefore, the limit lim an is not zero. Hence,
n→∞
X∞
the series an is divergent.
n=1
Now, let us look at some examples where ρ = 1. First, notice that
√
ln n ln x
n
lim n = lim exp = exp lim = e0 = 1. (5.4)
n→∞ n→∞ n x→∞ x
∞
X 1
For the p-series p
, an = n−p . Thus,
n=1
n
√
n
√ −p
ρ = lim n−p = lim n
n = 1.
n→∞ n→∞
Example 5.24
∞ n
X 1−n
Determine the convergence of the series .
n=1
2n + 1
Solution
Applying root test,
s n
n
1−n n−1 1
ρ = lim sup = lim = .
n→∞ 2n + 1 n→∞ 2n + 1 2
an+1 an+1
r = lim inf , R = lim sup .
n→∞ an n→∞ an
∞
X
1. If R < 1, the series an converges absolutely.
n=1
∞
X
2. If r > 1, the series an is divergent.
n=1
Proof
p
n
If R < 1, Theorem 5.26 implies that ρ = lim sup |an | < 1. Theorem
n→∞
∞
X
5.27 implies that an converges absolutely.
n=1 p
n
If r > 1, Theorem 5.26 implies that ρ = lim sup |an | > 1. Theorem 5.27
n→∞
∞
X
implies that an is divergent.
n=1
∞
X1
The p-series provides examples of r = R = 1, but the series is
n=1
np
convergent if p > 1, divergent when p ≤ 1. Hence, ratio test is also
inconclusive when r ≤ 1 ≤ R.
Ratio test is useful to determine the convergence of power series. We are going
to study this in Chapter 6.
Chapter 5. Infinite Series of Numbers and Infinite Products 395
Example 5.25
Determine whether the series is convergent.
∞
X 2n
(a) (−1)n−1
n=1
n+1
∞
X n+1
(b) (−1)n−1
n=1
2n
Solution
2n
(a) Using ratio test with an = (−1)n−1 , we find that
n+1
an+1 2n+1 n+1 n+1
r = R = lim = lim × n = 2 lim = 2.
n→∞ an n→∞ n + 2 2 n→∞ n + 2
∞
X 2n
Therefore, the series (−1)n−1 is divergent.
n=1
n + 1
n+1
(b) Using ratio test with an = (−1)n−1 , we find that
2n
an+1 n+2 2n 1 n+2 1
r = R = lim = lim n+1 × = lim = .
n→∞ an n→∞ 2 n+1 2 n→∞ n + 1 2
∞
X n+1
Therefore, the series (−1)n−1 n is convergent.
n=1
2
Convergence Tests
In this section, we have explored various strategies to determine the
∞
X
convergence of a series an . We make a summary as follows. This is
n=1
a useful manual for beginners, but it is not binding.
∞
X
1. Check whether lim an is 0. If not, the series an is divergent.
n→∞
n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 396
∞ ∞
X X 1
2. Check whether it is a geometric series arn−1 or a p-series p
.
n=1 n=1
n
∞
X
A geometric series arn−1 is convergent if and only if |r| < 1. A
n=1
∞
X 1
p-series p
is convergent if and only if p > 1.
n=1
n
6. For alternating series which does not converge absolutely, check whether
alternating series test can be applied.
7. If an = bnn for each n ∈ Z+ , and lim sup bn exists, use root test.
n→∞
Theorem 5.29
Let r be a real number with |r| < 1. For any real number α,
lim nα rn = 0.
n→∞
Proof
If r = 0, the limit is trivial. Hence, we consider the case |r| < 1 and r ̸= 0.
If α ≤ 0, the statement is also easy to prove since lim rn = 0 and
n→∞
Chapter 5. Infinite Series of Numbers and Infinite Products 397
0, if α < 0,
lim nα =
n→∞ 1, if α = 0.
The highly nontrivial case is when α > 0. In this case, lim nα = ∞. Since
n→∞
nα |r|n = nα en ln |r| ,
xα
and ln |r| < 0, we can deduce that lim nα |r|n = 0 from lim x =
n→∞ x→∞ e
0. Nevertheless, let us present an alternative argument here which is
interesting by its own.
∞
X
Consider the series nα rn with an = nα rn . When |r| < 1 and r ̸= 0,
n=1
α α
an+1 n+1 n+1
lim = lim |r| = |r| lim = |r| < 1.
n→∞ an n→∞ n n→∞ n
∞
X
By ratio test, the series nα rn is convergent. Therefore,
n=1
lim nα rn = lim an = 0.
n→∞ n→∞
Chapter 5. Infinite Series of Numbers and Infinite Products 398
Exercises 5.2
Question 1
∞
X n2 + 1
Determine whether the series is convergent.
n=1
3n2 + n + 1
Question 2
∞
X ln n
Let p be a positive number. Show that the series is convergent if
n=1
np
and only if p > 1.
Question 3
Let p be a positive number. Show that the series
∞
X ln n
(−1)n−1 p
n=1
n
is convergent.
Question 4
Determine whether the series is convergent.
∞
X 3n + (−1)n 4n
(a)
n=1
5n + 2n
∞
X 2n − 5n
(b)
n=1
4n + 3n + 1
Question 5
∞ √
X
n−1 n
Determine whether the series (−1) is convergent.
n=1
n+1
Chapter 5. Infinite Series of Numbers and Infinite Products 399
Question 6
Determine whether the series is convergent.
∞ √
X 2n n + 3
(a)
n=1
5n2 − 2
∞
X 4n2 − 7
(b) √
3 n+1
n=1
6n
Question 7
Use Theorem 5.26 to determine
√
n
lim n!.
n→∞
Question 8
Determine whether the series is convergent.
∞ √ n
X 2 n−1
(a) √
n=1
n+1
∞ √ n
X 2 n−1
(b) √
n=1
3 n+1
Question 9
Determine whether the series is convergent.
∞ √ n
n−1 n2
X
(a) (−1)
n=1
3n
∞
X 4n
(b) (−1)n−1
n=1
3n n2
Chapter 5. Infinite Series of Numbers and Infinite Products 400
In this section, we want to explore more about the difference between a series that
converges absolutely and one that converges conditionally.
∞
X
Given a series an with terms {an }, define
n=1
|an | + an an , if an ≥ 0,
pn = =
2 0, if an < 0;
|an | − an −an , if an ≤ 0,
qn = = .
2 0, if an > 0.
Then 0 ≤ pn ≤ |an |, 0 ≤ qn ≤ |an |, and
|an | = pn + qn , an = pn − qn .
Example 5.26
∞
X (−1)n−1 1 1 1
For the series =1− + − + ···,
n=1
n 2 3 4
1 1
p2n−1 = , p2n = 0; q2n−1 = 0, q2n = .
2n − 1 2n
Theorem 5.30
∞
X
Let an be a convergent series.
n=1
∞
X ∞
X
1. If the series an converges absolutely, then the series pn and the
n=1 n=1
∞
X
series qn are convergent.
n=1
∞
X ∞
X
2. If the series an converges conditionally, then the series pn and
n=1 n=1
∞
X
the series qn are divergent.
n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 401
Proof
X∞ ∞
X
First we show that the two series pn and qn can only be both
n=1 n=1
convergent or both divergent. We have
p n = an + q n , qn = pn − an ,
∞
X
and we are given that the series an is convergent. Therefore, the series
n=1
∞
X ∞
X
qn is convergent implies that the series pn is convergent. Similarly,
n=1 n=1
∞
X ∞
X
the series pn is convergent implies that the series qn is convergent.
n=1 n=1
∞
X ∞
X
If the series an converges absolutely, the series |an | is convergent.
n=1 n=1
Since
0 ≤ pn ≤ |an |, 0 ≤ qn ≤ |an |,
∞
X ∞
X
comparison test implies that the series pn and qn are convergent.
n=1 n=1
∞
X ∞
X
Conversely, if the series pn and qn are convergent, since
n=1 n=1
|an | = pn + qn ,
∞
X ∞
X
the series |an | must be convergent. Therefore, if the series an
n=1 n=1
∞
X
converges conditionally, which means the series |an | is divergent, then
n=1
∞
X ∞
X
the series pn and the series qn must be both divergent.
n=1 n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 402
Example 5.27
∞
X (−1)n−1 1 1 1
For the series = 1−
+ − + · · · in Example 5.26, the
n=1
n 2 3 4
∞ ∞ ∞ ∞
X X 1 X X 1
series pn = and the series qn = are divergent.
n=1 n=1
2n − 1 n=1 n=1
2n
Example 5.28
Namely,
4k − 3, if n = 3k − 2,
π(n) = 4k − 1, if n = 3k − 1,
2k,
if n = 3k.
∞
X (−1)n−1 1 1 1
The rearrangement of the series = 1− + − + ···
n=1
n 2 3 4
induced by π is
1 1 1 1 1
1+ − + + − + ··· .
3 2 5 7 4
is convergent. We can find its sum in the following way. Let sn = a1 +a2 +· · ·+an
Chapter 5. Infinite Series of Numbers and Infinite Products 403
1 1 1
b3k−2 = , b3k−1 = , b3k = − for all k ∈ Z+ .
4k − 3 4k − 1 2k
∞
X
th
Let tn = b1 + b2 + · · · + bn be the n partial sum of the series bn . Now
n=1
n n
X 1 1 X 1
t3n = + − .
k=1
4k − 3 4k − 1 k=1
2k
As k runs from 1 to n, 4k − 3 and 4k − 1 run through all positive odd integers
between 1 and 4n. Therefore,
2n n 4n 2n n
X 1 X 1 X1 X 1 X 1
t3n = − = − − .
k=1
2k − 1 k=1 2k k=1
k k=1 2k k=1 2k
Chapter 5. Infinite Series of Numbers and Infinite Products 404
∞
X
This proves that the series bn is convergent, and it converges to lim t3n =
n→∞
n=1
3
ln 2.
2 ∞ ∞
X X
Hence, although the series bn is a rearrangement of the series an , it
n=1 n=1
has a different sum.
In the following, we prove that rearrangement of a nonnegative series would
not lead to different sums.
Lemma 5.31
∞
X
+
If an ≥ 0 for all n ∈ Z and the series an is convergent, then any
n=1
rearrangement of the series has the same sum. Namely, for any bijecion
π : Z+ → Z+ ,
X∞ ∞
X
aπ(n) = an .
n=1 n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 405
Proof
In a nutshell, this is just the fact that a nonnegative series is convergent if
and only if the sequence of partial sums is bounded above, and the sum of
the series is the least upper bound of the sequence of partial sums.
For a rigorous argument, define sn = a1 +· · ·+an to be the nth partial sum of
X∞
the series an , and tn = aπ(1) + · · · + aπ(n) to be the nth partial sum of the
n=1
∞
X
series aπ(n) . Notice that both {sn } and {tn } are increasing sequences.
n=1
We are given that s = sup{sn } exists. For any positive integer n, the
set {π(1), π(2), . . . , π(n)} has a maximum Nn . This means that the set
{π(1), π(2), . . . , π(n)} is contained in the set {1, 2, . . . , Nn }. Therefore,
tn ≤ sNn ≤ s.
Proof
∞
X ∞
X
Define the nonnegative series pn and qn by
n=1 n=1
|an | + an |an | − an
pn = , qn = .
2 2
Then
an = p n − q n .
∞
X
Since the series an converges absolutely, Theorem 5.30 says that the
n=1
∞
X ∞
X
series pn and qn are convergent.
n=1 n=1
∞
X
+ +
Lemma 5.31 says that for any bijecion π : Z → Z , the series pπ(n)
n=1
∞
X
and qπ(n) are convergent, and
n=1
∞
X ∞
X ∞
X ∞
X
pπ(n) = pn , qπ(n) = qn .
n=1 n=1 n=1 n=1
∞
X
Therefore, the series aπ(n) is convergent and
n=1
∞
X ∞
X ∞
X ∞
X ∞
X ∞
X
aπ(n) = pπ(n) − qπ(n) = pn − qn = an .
n=1 n=1 n=1 n=1 n=1 n=1
Finally, we come to the celebrated Riemann’s theorem for series that converges
conditionally.
Chapter 5. Infinite Series of Numbers and Infinite Products 407
Theorem 5.33
Riemann’s Theorem for Conditionally Convergent Series
∞
X
Let an be a series that converges conditionally, and let b and c be two
n=1
extended real numbers with b ≤ c. There exists a bijection π : Z+ → Z+
∞
X
such that for the series aπ(n) with partial sums tn = aπ(1) + · · · + aπ(n) ,
n=1
Here an extended real number is either an ordinary real number or ±∞. This
theorem implies that one can have a rearrangement of a conditionally convergent
series that diverge to ±∞ or converge to any real number.
Proof
+
For n ∈ Z , let
|an | + an |an | − an
pn = , qn = .
2 2
∞
X
Since the series an converges conditionally, Theorem 5.30 says that the
n=1
∞
X X∞
series pn and qn are divergent. Let
n=1 n=1
S+ = n ∈ Z+ | an ≥ 0 , S− = n ∈ Z+ | an < 0 .
Then
S+ ∪ S− = Z+ , S+ ∩ S− = ∅.
There are strictly increasing maps π1 : Z+ → Z+ and π2 : Z+ → Z+ , such
that π1 (Z+ ) = S+ and π2 (Z+ ) = S− .
X∞ ∞
X
Define the nonnegative series un and vn by
n=1 n=1
Then the sequences {un } and {vn } are obtained from the sequences {pn }
and {qn } by removing some zero terms. Hence, both nonnegative series
X∞ X∞
un and vn are divergent.
n=1 n=1
Now we start to define the bijection π : Z+ → Z+ . Construct two
sequences of real numbers {bn } and {cn } such that c1 > 0, bn ≤ cn for
all n ∈ Z+ , and
lim bn = b, lim cn = c.
n→∞ n→∞
C1 = u1 + u2 + · · · + uk1 > c1 .
Then define
π(1) = π1 (1), . . . , π(k1 ) = π1 (k1 ).
Take l1 to be the smallest positive integer such that
Then define
Then define
Then define
∞
X ∞
X
Continue this construction inductively. Since un and vn are
n=1 n=1
nonnegative sequences that diverges to ∞, and bm ≤ cm for all positive
integers m, the existence of the positive integers km and lm at each step is
guaranteed. It is easy to see that the map π : Z+ → Z+ is a bijection. For
∞
X
the series aπ(n) , let tn = aπ(1) + · · · + aπ(n) be its nth partial sum. Set
n=1
α0 = β0 = 0, and for m ≥ 1, let
αm = k1 + k2 + · · · + km , βm = l1 + l2 + · · · + lm ,
δm = αm−1 + βm−1 + km , λm = δm + lm = αm + βm .
Then
By construction,
δm ≤ n < δm+1 .
Then
tn ≤ max{cm + uαm , cm+1 + uαm+1 }.
Since m ≥ M2 , cm and cm+1 are less than c + ε/2. Since m ≥ M2 ≥ M1 ,
uαm and uαm+1 are less than ε/2. These imply that for all n ≥ N ,
tn < c + ε.
Hence,
lim sup tn ≤ c.
n→∞
lim inf tn ≥ b.
n→∞
For all m ∈ Z+ ,
Exercises 5.3
Question 1
Show that the series ∞ ∞
X X (−1)n−1
an = √
n=1 n=1
n+1
+
is convergent. If π : Z → Z+ is a bijective correspondence, consider
∞
X X∞
the rearrangement of the series an given by aπ(n) . Does the series
n=1 n=1
∞
X ∞
X
aπ(n) necessarily converge to the same number as the series an ?
n=1 n=1
Question 2
Show that the series
∞ ∞ √
X X (−1)n−1 n
an =
n=1 n=1
n2 + 1
Question 3
Show that the series
∞ ∞ √
X X (−1)n−1 n
an =
n=1 n=1
n+1
In this section, we consider infinite products and study its convergence. An infinite
product is a product of the form
∞
Y
un = u1 u2 · · · un · · · ,
n=1
(a) If infinitely many of the terms un ’s are zero, then we say that the infinite
Y∞
product un is divergent.
n=1
(b) If only finitely many of the un ’s are zero, there is a positive integer ℓ
such that un is nonzero for all n ≥ ℓ. Form the partial product
n
Y
P [ℓ]n = uk , for n ≥ ℓ.
k=ℓ
(i) If the limit lim P [ℓ]n does not exist or the limit is 0, we say that
n→∞
Y∞
the infinite product un is divergent.
n=1
(ii) If the limit lim P [ℓ]n exists and is equal to a nonzero number
n→∞
∞
Y
P [ℓ], we say that the infinite product un converges to
n=1
ℓ−1
Y
P = P [ℓ] uk .
k=1
n
Y
P [1]n = uk simply as Pn .
k=1
∞
Y
By definition, if the infinite product un converges to 0, then at least one
n=1
of the un is equal to 0, and there are only finitely many of the un ’s that are
equal to 0.
Example 5.29
∞
Y 1
Determine the convergence of the infinite product 1+ .
n=1
n
Solution
1
For n ≥ 1, un = 1 + ̸= 0. Notice that
n
n
Y 1 2 3 n+1
Pn = 1+ = × × ··· × = n + 1.
k=1
k 1 2 n
∞
Y 1
Since lim Pn = ∞, the infinite product 1+ is divergent.
n→∞
n=1
n
Example 5.30
∞
Y 1
Determine the convergence of the infinite product 1− .
n=1
n
Chapter 5. Infinite Series of Numbers and Infinite Products 415
Solution
1
For n ≥ 1, un = 1 − . We find that u1 = 0 and un > 0 for all n ≥ 2.
n
n
Y 1 1 2 n−1 1
P [2]n = 1− = × × ··· = .
k=2
k 2 3 n n
∞
Y 1
Since lim P [2]n = 0, the infinite product 1− is divergent.
n→∞
n=1
n
Example 5.31
∞
Y 1
Determine the convergence of the infinite product 1− 2 .
n=1
n
Solution
1
For n ≥ 1, un = 1 − . We find that u1 = 0 and un > 0 for all n ≥ 2.
n2
n Y n n
Y 1 1 Y 1
P [2]n = 1− 2 = 1− 1+
k=2
k k=2
k k=2 k
1 2 n−1 3 n+1 n+1
= × × ··· × × ··· × = .
2 3 n 2 n 2n
∞
1 Y 1
Since P [2] = lim P [2]n = , the infinite product 1 − 2 is
n→∞ 2 n=1
n
convergent, and it converges to u1 P [2] = 0.
Proposition 5.34
∞
Y
If the infinite product un is convergent, then lim un = 1.
n→∞
n=1
The proof of this is more complicated than its infinite series counterpart.
Proof
n
Y
Let Pn = uk be the nth partial product. Then Pn > 0 for all n ∈ Z+ . If
k=1
∞
Y
the infinite product un is convergent, then the sequence {Pn } converges
n=1
to a positive number P . This implies that there is a positive integer N1 such
that
P
Pn > for all n ≥ N1 .
2
Given ε > 0, apply Cauchy criterion to the convergent sequence {Pn }, we
find that there is a positive integer N2 such that for all m ≥ n ≥ N2 ,
Pε
|Pn − Pm | < .
2
Chapter 5. Infinite Series of Numbers and Infinite Products 417
Hence, {Pn } is a Cauchy sequence, and thus it is convergent. Eq. (5.5) then
implies that
1
lim Pn ≥ PN1 > 0.
n→∞ 2
This proves that {Pn } does not converge to 0. Therefore, the infinite
Y∞
product un is convergent.
n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 418
The following gives a relation between the convergence of the infinite product
with the convergence of infinite series.
Theorem 5.36
Let {un } be a sequence of positive numbers. Then the infinite product
Y∞ ∞
X
un is convergent if and only if the infinite series ln un is convergent.
n=1 n=1
Proof
∞
Y
First assume that the infinite product uk is convergent. Given ε > 0,
k=1
since lim ln x = 0, there exists a δ such that 0 < δ < 1 and if |x − 1| < δ,
x→1
then | ln x| < ε. By the Cauchy criterion for infinite products, there is a
positive integer N such that for all m ≥ n ≥ N ,
"m #
Y
uk − 1 < δ.
k=n
∞
X
This proves that the infinite series ln un satisfies the Cauchy criterion.
n=1
Hence, it is convergent.
∞
X
Conversely, assume that the infinite series ln un is convergent. Given
n=1
ε > 0, since lim ex = 1, there exists δ > 0 such that if |x| < δ, then
x→0
|ex − 1| < ε.
Chapter 5. Infinite Series of Numbers and Infinite Products 419
Using Cauchy criterion for infinite series, we find that there is a positive
integer N such that for all m ≥ n ≥ N ,
m
X
ln uk < δ.
k=n
∞
Y
This shows that the infinite product un satisfies the Cauchy criterion.
n=1
Hence, it is convergent.
Example 5.32
∞
Y a
For any nonzero real number a, the infinite product exp is
n=1
n
∞
X a
divergent since the infinite series is divergent; while the infinite
n=1
n
∞ a ∞
Y X a
product exp 2 is convergent since the infinite series is
n=1
n n=1
n2
convergent.
ln(1 + a)
Since lim = 1, it is natural to compare the convergence of the
∞
a→0 a ∞
Y X
product (1 + an ) to the convergence of the series an .
n=1 n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 420
Theorem 5.37
Let {an } be a sequence of real numbers such that 0 < an < 1 for all
n ∈ Z+ . Then the following three statements are equivalent.
∞
X
(a) The series an is convergent.
n=1
∞
Y
(b) The infinite product (1 + an ) is convergent.
n=1
∞
Y
(c) The infinite product (1 − an ) is convergent.
n=1
Proof
Since 0 < an < 1 for all n ∈ Z+ , we find that 1 + an > 0 and 1 − an > 0
∞
X
for all n ∈ Z+ . A necessary condition for the convergence of either an ,
n=1
∞
Y ∞
Y
or (1 + an ), or (1 − an ), is
n=1 n=1
lim an = 0.
n→∞
v
By Theorem 5.36, it is then sufficient to prove that if {an } is a sequence of
real numbers satisfying 0 < an < 1 for all n ≥ 1, and lim an = 0, then
n→∞
the following three statements are equivalent.
∞
X
(a) The series an is convergent.
n=1
∞
X
(b′ ) The series ln(1 + an ) is convergent.
n=1
∞
X
′
(c ) The series ln(1 − an ) is convergent.
n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 421
bn ln(1 + an ) ln(1 + x)
lim = lim = lim = 1,
n→∞ an n→∞ an x→0 x
cn − ln(1 − an ) − ln(1 − x)
lim = lim = lim = 1.
n→∞ an n→∞ an x→0 x
∞
X
By limit comparison test for positive series, we find that an is
n=1
∞
X ∞
X
convergent if and only if bn is convergent, and an is convergent
n=1 n=1
∞
X
if and only if cn is convergent. These establish the equivalence of (a)
n=1
and (b′ ), and the equivalence of (a) and (c′ ).
Example 5.33
Theorem 5.37 can be used to deduce the following.
∞
Y 1
1. The infinite product 1+ considered in Example 5.29 is
n=1
n
∞
X 1
divergent since the infinite series is divergent.
n=1
n
∞
Y 1
2. The infinite product 1− considered in Example 5.30 is
n=1
n
∞
X 1
divergent since the infinite series is divergent.
n=1
n
Chapter 5. Infinite Series of Numbers and Infinite Products 422
∞
Y 1
3. The infinite product 1 − 2 considered in Example 5.31 is
n=1
n
∞
X 1
convergent since the infinite series 2
is convergent.
n=1
n
Theorem 5.38
∞
Y
If the infinite product (1 + |an |) is convergent, then the infinite product
n=1
∞
Y
(1 + an ) is convergent.
n=1
Proof
Without loss of generality, we can assume that |an | < 1 for all n ≥ 1.
∞
Y
Given ε > 0, since the infinite product (1 + |an |) is convergent, Cauchy
n=1
criterion says that there is a positive integer N such that for all m ≥ n ≥ N ,
"m #
Y
(1 + |ak |) − 1 < ε.
k=n
∞
Y
This proves that the infinite product (1 + an ) satisfies the Cauchy
n=1
criterion. Hence, it is convergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 423
Corollary 5.39
∞
X
Let an be a series that converges absolutely. Then the infinite product
n=1
∞
Y
(1 + an ) converges absolutely.
n=1
Proof
∞
X
Since an converges absolutely, lim an = 0. Without loss of generality,
n→∞
n=1
∞
X
we can assume that |an | < 1 for all n ≥ 1. Since |an | is convergent,
n=1
∞
Y
Theorem 5.37 implies that the infinite product (1 + |an |) is convergent.
n=1
∞
Y
Theorem 5.38 then implies that the infinite product (1 + an ) converges
n=1
absolutely.
Example 5.34
∞
(−1)n−1
Y
The infinite product 1+ is convergent since the series
n=1
n2
∞
X (−1)n−1
2
converges absolutely.
n=1
n
Now it is natural to ask the following question. Is it true that the infinite
Chapter 5. Infinite Series of Numbers and Infinite Products 424
∞
Y ∞
X
product (1 + an ) is convergent if and only if the series an is convergent?
n=1 n=1
The following two examples show that neither one implies the other.
Example 5.35
n n+1
Y
Y 1 1 1
P2n−1 = 1− 1+ √ , P2n = 1− .
k=2
k n + 1 k=2
k
∞
Y 1
Since the infinite product 1− is divergent, the infinite product
n=2
n
Y∞
(1 + an ) is divergent.
n=1
∞
X ∞
Y
This gives an example where an is convergent but (1 + an ) is
n=1 n=1
divergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 425
Example 5.36
Then √ √
n+1 n
1 + a2n−1 = √ , 1 + a2n =√ .
n n+1
n √
Y n+1
If Pn = (1 + ak ), we find that P2n−1 = √ and P2n = 1 for all
k=1
n
∞
Y
n ≥ 1. Hence, the infinite product (1 + an ) converges to 1.
n=1
n
X
If sn = ak , then
k=1
n X n
X 1 1 1
s2n = √ −√ = √ √ .
k=1
k k+1 k=1
k( k + 1)
∞ ∞
X 1 X 1
Compare to the series , we find that the series √ √ is
kk=1 k=1
k( k + 1)
divergent. Therefore, lim s2n = ∞, which implies that lim sn does not
n→∞ n→∞
∞
X
exist. Hence, the series an is divergent.
n=1
∞
Y ∞
X
This gives an example where (1 + an ) is convergent but an is
n=1 n=1
divergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 426
Exercises 5.4
Question 1
Question 2
∞
Y 1
Let s be a positive number. Show that the infinite product 1 − s is
n=2
n
convergent if and only if s > 1.
Question 3
For n ≥ 1, let
1 1
un = 1+ exp − .
n n
∞
Y
Show that the infinite product un is convergent and find its value.
n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 427
Example 5.37
∞
n(m + 1)
(a)
m(n + 1) m,n=1
∞
mn
(b)
m2 + n2 m,n=1
provided that for every ε > 0, there is a positive integer N so that for all
positive integers m and n with m ≥ N , n ≥ N ,
|am,n − a| < ε.
Example 5.38
∞
n(m + 1)
For the double sequence considered in Example 5.37,
m(n + 1) m,n=1
notice that
n(m + 1) 1 1 1 1 1
am,n = = 1− 1+ = 1− + − .
m(n + 1) n+1 m n + 1 m m(n + 1)
∞
mn
Before we study the convergence of the double sequence ,
m2 + n2 m,n=1
let us prove the following lemma, which says that for a double sequence {am,n }∞
m,n=1
to be convergent, it should approach the same limit regardless of how m and n
goes to infinity.
Lemma 5.40
Let {am,n }∞
m,n=1 be a double sequence that converges to a number a, and let
g : Z+ → Z+ be a function such that lim g(n) = ∞. Define the sequence
n→∞
{bn }∞
n=1 by
bn = ag(n),n .
Then the sequence {bn }∞
n=1 also converges to a.
Proof
Given ε > 0, there is a positive integer N1 such that for all (m, n) ∈ Z+ ×
Z+ with m ≥ N1 and n ≥ N1 ,
|am,n − a| < ε.
Example 5.39
∞
mn
For the double sequence considered in Example 5.37,
m2 + n2 m,n=1
assume that it converges to a. Take g1 : Z+ → Z+ to be the function
g1 (n) = n. Then we find that
n2 1
a = lim 2 2
= .
n→∞ n + n 2
Take g2 : Z+ → Z+ to be the function g2 (n) = 2n. Then we find that
2n2 2
a = lim 2 2
= .
n→∞ 4n + n 5
It is easy to prove that linearity also holds for limits of double sequences.
Chapter 5. Infinite Series of Numbers and Infinite Products 430
{αam,n + βbm,n }∞
m,n=1
Proof
Let a = lim am,n and b = lim bm,n . Given ε > 0, there are positive
m,n→∞ m,n→∞
integers N1 and N2 such that
ε
|am,n − a| < , for all m ≥ N1 , n ≥ N1 ;
2(|α| + 1)
ε
|bm,n − b| < , for all m ≥ N2 , n ≥ N2 .
2(|β| + 1)
Let N = max{N1 , N2 }. For all positive integers m and n with m ≥ N and
n ≥ N , we have
The question is whether we can find the limit of a double seqeunce {am,n } by
taking the limit n → ∞ first, and then take the limit m → ∞, or in the opposite
order.
Chapter 5. Infinite Series of Numbers and Infinite Products 431
∞
mn
For the double sequence , for fixed m ≥ 1, taking the n →
m + n2
2
m,n=1
∞ limit, we have
mn
lim = 0.
n→∞ m2 + n2
Hence,
mn
lim lim = 0.
m→∞ n→∞ m2 + n2
∞
mn
But we have shown that the double sequence is divergent.
m2 + n2 m,n=1
Therefore, we find that to study the limit of a double sequence, in general we
cannot take one limit first before the other. The following theorem says that if one
knows apriori that the double sequence is convergent, one can take iterated limits
under some conditions.
Theorem 5.42
Assume that the double sequence {am,n }∞
m,n converges to a, and for each
m ∈ Z+ , the limit
bm = lim am,n
n→∞
Proof
Given ε > 0, there exists a positive integer N such that for all (m, n) ∈
Z+ × Z+ with m ≥ N and n ≥ N ,
ε
|am,n − a| < .
2
Hence, for fixed m ≥ N , taking the n → ∞ limit gives
ε
|bm − a| ≤ < ε.
2
This proves that lim bm = a.
n→∞
Chapter 5. Infinite Series of Numbers and Infinite Products 432
The assumption that the limit lim am,n exists for each m ∈ Z+ in Theorem
n→∞
5.42 is needed, as the convergence of the double sequence {am,n }∞ m,n does not
guarantee that the limit lim am,n exists. An example is shown below.
n→∞
Example 5.40
m + (−1)n−1
am,n = .
m2
n−1 ∞
m + (−1)
For fixed m ∈ Z+ , the sequence does not have a limit
m2 n=1
m+1 m−1
since it is oscillating between 2
and . But the double sequence
∞
m m2
{am,n }m,n=1 converges to zero. This can be proved in the following way.
m+1
Given ε > 0, since lim = 0, there exists a positive integer N so
m→∞ m2
that for all m ≥ N .
m+1
0< < ε.
m2
This implies that if m ≥ N , n ≥ N , then
m+1
0 ≤ am,n ≤ < ε.
m2
Hence, the double sequence {am,n }∞
m,n=1 converges to zero.
am,n | (m, n) ∈ Z+ × Z+
is bounded.
Chapter 5. Infinite Series of Numbers and Infinite Products 433
Remark 5.3
If a double sequence {am,n }∞m,n=1 is convergent, it is not necessarily
bounded. For example, consider the double sequence {am,n }∞
m,n=1 with
n, if m = 1,
am,n =
1, if m ≥ 2.
If a double sequence {am,n }∞ m,n=1 is increasing in both indices, for any positive
integers m1 , m2 , n1 , n2 , if m2 ≥ m1 and n2 ≥ n1 , then
Figure 5.2: An illiustration of the relative positions of (m1 , n1 ) and (m2 , n2 ) when
m2 > m1 and n2 > n1 .
Chapter 5. Infinite Series of Numbers and Infinite Products 434
Example 5.41
mn
am,n =
(m + 1)(n + 1)
Let {am,n }∞
m,n=1 be a double sequence that is increasing in both indices.
Then the double sequence {am,n }∞ m,n=1 is convergent if and only if it is
bounded above. In case it is convergent, it converges to sup {am,n }.
(m,n)∈Z+ ×Z+
Proof
If the sequence {am,n }∞
m,n=1 converges to a, then there is a positive integer
N such that for all (m, n) ∈ Z+ × Z+ with m ≥ N and n ≥ N ,
|am,n − a| < 1.
This prove that the double sequence {am,n }∞ m,n=1 is bounded above by a+1.
In fact, the same reasoning shows that it is bounded above by a + ε for any
ε > 0, but we do not need this.
Conversely, if {am,n }∞
m,n=1 is bounded above, then
a= sup {am,n }
(m,n)∈Z+ ×Z+
where {am,n }∞ + +
m,n=1 is a double sequence. For each (m, n) ∈ Z × Z , we define
the (m, n) partial sum sm,n by
m X
X n
sm,n = ak,l .
k=1 l=1
Chapter 5. Infinite Series of Numbers and Infinite Products 436
Therefore,
m
X m−1
X
sm,n − sm,n−1 − sm−1,n + sm−1,n−1 = ak,n − ak,n = am,n .
k=1 k=1
Proposition 5.44
X
If the double series am,n is convergent, then the double
(m,n)∈Z+ ×Z+
sequence {am,n }∞
m,n=1 converges to 0.
X
If am,n is a double series with am,n ≥ 0 for all (m, n) ∈ Z+ ×
(m,n)∈Z+ ×Z+
+
Z , then the double sequence of partial sums {sm,n } is a double sequence that is
increasing in both indices. From Theorem 5.43, we obtain the following.
Theorem 5.45
X
If am,n is a double series with am,n ≥ 0 for all (m, n) ∈ Z+ ×
(m,n)∈Z+ ×Z+
+
Z , then it is convergent if and only if the double sequence of partial sums
{sm,n } is bounded above.
Chapter 5. Infinite Series of Numbers and Infinite Products 437
Figure 5.3: An illiustration of those terms ak,l that involved in sm2 ,n2 − sm1 ,n1
when m2 > m1 and n2 > n1 .
Corollary 5.46
X
If am,n is a double series with am,n ≥ 0 for all (m, n) ∈ Z+ ×
(m,n)∈Z+ ×Z+
Z , then it is convergent if and only if the sequence {sn,n }∞
+
n=1 is convergent.
In this case,
X X n
n X
am,n = lim sn,n = lim ak,l .
n→∞ n→∞
(m,n)∈Z+ ×Z+ k=1 l=1
Proof
Since am,n ≥ 0 for all (m, n) ∈ Z+ × Z+ , the double sequence {sm,n } is
Xwhile the sequence {sn,n } is increasing.
increasing in both indices,
If the double series am,n is convergent, Theorem 5.45 implies
(m,n)∈Z+ ×Z+
that the double sequence of partial sums {sm,n } is bounded above. Being
a subset, the sequence {sn,n }∞
n=1 is also bounded above. By monotone
convergence theorem, the sequence {sn,n }∞n=1 is convergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 438
This implies that the double sequence {sm,n } is bounded above by t. Hence,
X
the double series am,n is convergent. From the argument above,
(m,n)∈Z+ ×Z+
we also find that
Example 5.42
Show that the double series
X 1
(m2 + n2 )2
(m,n)∈Z+ ×Z+
is convergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 439
Solution
Notice that
n X
n n k n l
X 1 XX 1 XX 1
sn,n = 2 2 2
≤ 2 2 2
+
k=1 l=1
(k + l ) k=1 l=1
(k + l ) l=1 k=1
(k + l2 )2
2
n X k n ∞
X 1 X k X 1
≤2 4
= 2 4
= 2 3
.
k=1 l=1
k k=1
k k=1
k
∞
X 1
Since the series 3
is convergent, the sequence {sn,n } is bounded
k=1
k
X 1
above. Hence, the double series is convergent.
+ +
(m + n2 )2
2
(m,n)∈Z ×Z
Next, we consider double series that have negative terms. Given a double
sequence {am,n }∞ ∞ ∞
m,n=1 , let {pm,n }m,n=1 and {qm,n }m,n=1 be double sequences
defined by
|am,n | + am,n |am,n | − am,n
pm,n = , qm,n = .
2 2
Then
|am,n | = pm,n + qm,n , am,n = pm,n − qm,n .
{pm,n }∞ ∞
m,n=1 and {qm,n }m,n=1 are nonnegative double sequences with
Theorem 5.47
X
If the double series am,n converges absolutely, then it is
(m,n)∈Z+ ×Z+
convergent.
Chapter 5. Infinite Series of Numbers and Infinite Products 440
Proof
+ +
For (m, n) ∈ Z × Z , let
− −
tm,n = s+
m,n + sm,n , sm,n = s+
m,n − sm,n .
Moreover,
0 ≤ s+
m,n ≤ tm,n , 0 ≤ s−
m,n ≤ tm,n . (5.6)
Since {pm,n }, {qm,n } and {|am,n |} are nonnegative double sequences,
−
{s+
m,n }, {sm,n } and {tm,n } are nonnegative double sequences that
are increasing in both indices. By assumption, the double series
X
|am,n | is convergent. Therefore, the double sequence {tm,n }
(m,n)∈Z+ ×Z+
is bounded above. Eq. (5.6) implies that the double sequences {s+m,n } and
{s− +
m,n } are also bounded above. Hence, the double sequences {sm,n } and
{s−m,n } are convergent. By linearity, the double sequence {sm,n } is also
convergent and
−
lim sm,n = lim s+
m,n − lim sm,n .
m,n→∞ m,n→∞ m,n→∞
X
This proves that the double series am,n is convergent, and
(m,n)∈Z+ ×Z+
X X X
am,n = pm,n − qm,n .
(m,n)∈Z+ ×Z+ (m,n)∈Z+ ×Z+ (m,n)∈Z+ ×Z+
There is a simpler proof of this theorem using the same idea as we prove the
Chapter 5. Infinite Series of Numbers and Infinite Products 441
case for single series. The ideas in the proof that we present above have been
used when we prove that any rearrangement of an absolutely convergent single
series is convergent and has the same sum. It is a useful technique for dealing
with absolutely convergent series. One should compare this proof to the proof
of Theorem 4.39 for convergence of improper integrals. In fact,Zinfinite series
∞
and improper integrals are closely related. An improper integral f (x)dx is
−∞
convergent if and only if the double limit
Z b
lim f (x)dx
a→−∞,b→∞ a
exists. This can be rephrased as for any two sequences {am } and {bn } satisfying
lim am = −∞ and lim bn = ∞, the double sequence {Fm,n }, with
m→∞ n→∞
Z bn
Fm,n = f (x)dx
am
Example 5.43
∞ −1, ∞
X if m = 1, X 1, if n = 1,
am,n = am,n =
n=1
0, if m ≥ 2; m=1
0, if n ≥ 2.
Chapter 5. Infinite Series of Numbers and Infinite Products 442
Therefore,
∞ ∞
! ∞ ∞
!
X X X X
am,n = −1, am,n = 1.
m=1 n=1 n=1 m=1
Figure 5.4: An illiustration of the terms in the double series defined in Example
5.43.
Theorem 5.48
X
Assume that the double series am,n converges to s, and for
(m,n)∈Z+ ×Z+
∞
X
+
every fixed m ∈ Z , the series am,n is convergent with sum um . Then
n=1
the series
∞ ∞ ∞
!
X X X
um = am,n
m=1 m=1 n=1
Proof
Let m X
n
X
sm,n = ak,l .
k=1 l=1
This shows that for fixed m, the limit bm = lim sm,n exists and it equal to
n→∞
Xm
uk . By Theorem 5.42, the sequence {bm } converges to s. Therefore,
k=1
∞
X
the series um is convergent and has sum s.
m=1
Theorem 5.49
X
Given that am,n is a double series with am,n ≥ 0 for all
(m,n)∈Z+ ×Z+
(m, n) ∈ Z+ × Z+ , and it is convergent with sum s. We have the following.
∞
X
(a) For all m ∈ Z+ , um = am,n is finite.
n=1
∞
X
+
(b) For all n ∈ Z , vn = am,n is finite.
m=1
∞
X ∞
X
(c) The series um and the series vn both converge to s. Namely,
m=1 n=1
∞ X
X ∞ ∞ X
X ∞ X
am,n = am,n = am,n .
m=1 n=1 n=1 m=1 (m,n)∈Z+ ×Z+
Chapter 5. Infinite Series of Numbers and Infinite Products 444
Proof
Given positive integers m and n, let
X n
m X n
X m
X
sm,n = am,n , um,n = am,l , vm,n = ak,n .
k=1 l=1 l=1 k=1
Then m n
X X
sm,n = uk,n = vm,l . (5.7)
k=1 l=1
+
Since am,n ≥ 0 for all m, n ∈ Z , we have
Now let m n
X X
bm = uk and cn = vl
k=1 l=1
∞
X ∞
X
be the partial sums of the series um and vn . From (5.7), we find
m=1 n=1
that
m
X n
X
lim sm,n = u k = bm , lim sm,n = vl = cn .
n→∞ m→∞
k=1 l=1
From these, we find that the sequences {bm } and {cn } are also increasing
sequences that are bounded above by s.
Chapter 5. Infinite Series of Numbers and Infinite Products 445
Therefore,
b = lim bm and c = lim cn
m→∞ n→∞
from Corollary 5.46, we find that there is a positive integer N such that
sN,N > s − ε.
But then
N X
X N ∞
N X
X
sN,N = am,n ≤ am,n = bN .
m=1 n=1 m=1 n=1
Theorem 5.50
X
Given that am,n is a double series with am,n ≥ 0 for all
(m,n)∈Z+ ×Z+
∞
X
(m, n) ∈ Z+ × Z+ . Assume that for each m ∈ Z+ , the series am,n
n=1
∞
X
converges to um . If the series um is convergent, then the double series
X m=1
am,n is convergent, and
(m,n)∈Z+ ×Z+
∞ ∞ ∞
!
X X X X
am,n = um = am,n .
(m,n)∈Z+ ×Z+ m=1 m=1 n=1
Chapter 5. Infinite Series of Numbers and Infinite Products 446
Proof
∞
X
It is sufficient to prove that the convergence of the series um implies
X m=1
the convergence of the double series am,n . The last statement
(m,n)∈Z+ ×Z+
∞
X
then follows from Theorem 5.48. Assume that the series um converges
m=1
to u. Using the same notations as in the proof of Theorem 5.49, we find
that for each positive integer m, the sequence {um,n }∞n=1 increases to um .
From (5.7), we find that for any positive integers m and n,
m
X
sm,n ≤ uk ≤ u.
k=1
Remark 5.4
Putting together Theorem 5.49 and Theorem 5.50, we conclude the
X
following. Given a double series am,n with nonnegative terms
(m,n)∈Z+ ×Z+
am,n , we can determine its convergence and find its sum by first checking
X∞
whether for each fixed m, the series am,n is convergent. If yes, find
n=1
∞
X
the sum, call it as um , and check whether the series um is convergent.
X m=1
If yes, then the double series am,n is convergent and its sum is
(m,n)∈Z+ ×Z+
∞
X X
given by um . Namely, the sum of the double series am,n
m=1 (m,n)∈Z+ ×Z+
can be obtained by iterated summation.
Chapter 5. Infinite Series of Numbers and Infinite Products 447
∞
X
We can also start with the series am,n for each fixed n. This shows that
m=1
for double series with nonnegative terms, we can interchange the orders of
summation. In fact, with slightly more effort, one can prove that we can
sum in any orders.
∞
X
If for some integer m, the series am,n is divergent, then the double series
n=1
X ∞
X
am,n is divergent. Even if the series am,n is convergent for
(m,n)∈Z+ ×Z+ n=1
∞
X
all positive integers m, the series um can still be divergent. In this latter
X m=1
case, the double series am,n is divergent. An example is given
(m,n)∈Z+ ×Z+
by the double series
X 1
.
m2 + n2
(m,n)∈Z+ ×Z+
∞
X 1
For fixed positive integer m, comparison with the series shows that
n=1
n2
∞
X 1
the series is convergent. By integral test, we find that
n=1
m + n2
2
∞ Z ∞
X 1 1 1 π 1
um = ≥ dx − 2 = − 2 > 0.
n=1
m + n2
2
0 m2 +x 2 m 2m m
∞ ∞
X 1 X 1
Since the series is divergent but the series is convergent,
m=1
m n=1
m2
∞ ∞
X π 1 X
the series − is divergent. Hence, the series um is
m=1
2m m2 m=1
divergent.
Finally, we can come back to series with negative terms. From Theorem 5.49,
we have the following.
Chapter 5. Infinite Series of Numbers and Infinite Products 448
Theorem 5.51
X
Given that am,n is a double series that converges absolutely,
(m,n)∈Z+ ×Z+
and it is convergent with sum s. We have the following.
∞
X
(a) For all m ∈ Z+ , um = am,n is finite.
n=1
∞
X
+
(b) For all n ∈ Z , vn = am,n is finite.
m=1
∞
X ∞
X
(c) The series um and the series vn both converge to s. Namely,
m=1 n=1
∞ X
X ∞ ∞ X
X ∞ X
am,n = am,n = am,n .
m=1 n=1 n=1 m=1 (m,n)∈Z+ ×Z+
Proof
Using the same
X notations as in the proof of Theorem 5.47, since
X the double
series |am,n | is convergent, the double series pm,n
(m,n)∈Z+ ×Z+ (m,n)∈Z+ ×Z+
X
and qm,n are convergent. Applying Theorem 5.49 to the
(m,n)∈Z+ ×Z+
X X
nonnegative series pm,n and qm,n , we conclude that
(m,n)∈Z+ ×Z+ (m,n)∈Z+ ×Z+
X∞ ∞
X ∞
X
+ +
for all m ∈ Z and all n ∈ Z , the series pm,n , qm,n , pm,n and
n=1 n=1 m=1
∞
X
qm,n are convergent. Since
m=1
This theorem says that absolutely convergent double series enjoys almost the
Chapter 5. Infinite Series of Numbers and Infinite Products 449
same privileges as the nonnegative double series. The following theorem gives a
summary.
Theorem 5.52
X
Given that am,n is a double series that satisfies the following
(m,n)∈Z+ ×Z+
conditions.
∞
X
(i) For each fixed m ∈ Z+ , the series |am,n | is convergent.
n=1
∞ X
X ∞
(ii) |am,n | is convergent.
m=1 n=1
∞
X
(b) For each fixed n ∈ Z+ , the series am,n converges absolutely.
m=1
∞
X
+
(c) For each fixed m ∈ Z , the series am,n converges absolutely.
n=1
∞ X
X ∞ ∞ X
X ∞
(d) Both the series am,n and am,n are convergent.
m=1 n=1 n=1 m=1
(e) The sum of the double series can be computed by iterated summation.
Namely,
X ∞ X
X ∞ ∞ X
X ∞
am,n = am,n = am,n .
(m,n)∈Z+ ×Z+ m=1 n=1 n=1 m=1
Proof
By X
Theorem 5.50, (i) and (ii) implies that the double series
|am,n | is convergent, which gives (a).
(m,n)∈Z+ ×Z+
Chapter 5. Infinite Series of Numbers and Infinite Products 450
By Theorem 5.49, (a) implies (b) and (c). Theorem 5.49 also implies that
the two series
∞ ∞
! ∞ ∞
!
X X X X
|am,n | and |am,n |
m=1 n=1 n=1 m=1
∞ X
X ∞ ∞ X
X ∞
comparison test shows that the series am,n and am,n are
m=1 n=1 n=1 m=1
convergent. This gives (d). The statement (e) follows from (a) and Theorem
5.51.
Chapter 5. Infinite Series of Numbers and Infinite Products 451
Exercises 5.5
Question 1
If a and b are positive constants, show that the double series
X 1
am2 + bn2
(m,n)∈Z+ ×Z+
is divergent.
Question 2
Given that a and b are positive constants, u and v are real numbers, and α
is a number larger than 1. Show that the double series
X sin(mu + nv)
(am2 + bn2 )α
(m,n)∈Z+ ×Z+
is convergent.
Chapter 6. Sequences and Series of Functions 452
Chapter 6
In this chapter, we study sequences and series whose terms depend on a variable.
In this case, we also say that the function f : D → R is the pointwise limit
of the sequence of functions {fn : D → R}.
Example 6.1
Solution
Notice that
0, if 0 ≤ x < 1,
lim xn =
n→∞ 1, if x = 1.
Therefore, the sequence of functions {fn } converges pointwise to the
function f : [0, 1] → R, where
0, if 0 ≤ x < 1,
f (x) =
1, if x = 1.
Example 6.2
Solution
For each x ∈ [0, 1], the sequence {fn (x)} is convergent. For any x ∈ (1, 2],
the sequence {fn (x)} is divergent. Hence, the sequence of functions {fn }
does not converge pointwise.
Chapter 6. Sequences and Series of Functions 454
We have seen that the answer to the first question is no, as given by Example
6.1. The answers to the second and third questions are also no. We will look
at some examples.
Example 6.3
For each positive integer n, let fn : R → R be the function
1
fn (x) = .
1 + nx2
Study the pointwise convergence of the sequence of functions {fn }.
Solution
Since fn (0) = 1 for all n ∈ Z+ ,
lim fn (0) = 1.
n→∞
If x ̸= 0,
1
0 ≤ fn (x) ≤ .
nx2
Chapter 6. Sequences and Series of Functions 455
By squeeze theorem,
Example 6.4
For each positive integer n, let fn : R → R be the differentiable function
2
fn (x) = xe−nx .
Solution
(a) Since fn (0) = 0 for all n ∈ Z+ ,
lim fn (0) = 0.
n→∞
Chapter 6. Sequences and Series of Functions 456
2
lim fn (x) = x lim e−nx = x lim e−u = 0.
n→∞ n→∞ u→∞
(b) For n ∈ Z+ ,
2
fn′ (x) = (1 − 2nx2 )e−nx .
Since fn′ (0) = 1 for all n ∈ Z+ ,
2
lim fn′ (x) = lim (1 − 2nx2 )e−nx = lim (1 − 2u)e−u = 0.
n→∞ n→∞ u→∞
Example 6.5
For each positive integer n, let
p
Sn = p, q ∈ Z, 0 ≤ p ≤ q ≤ n, q ≥ 1 .
q
Solution
If x is a rational number in [0, 1], there exists a nonnegative integer p and a
positive integer q such that 0 ≤ p ≤ q and x = p/q. Therefore, x ∈ Sn for
all n ≥ q. This implies that fn (x) = 1 for all n ≥ q. Hence,
These show that the sequence of functions {fn } converges pointwise to the
Dirichlet function f : R → R,
1, if x is rational,
f (x) = .
0, if x is irrational.
For each n ∈ Z+ , the set Sn which fn (x) ̸= 0 is finite. Thus the function
fn : [0, 1] → R is Riemann integrable. But the Dirichlet function f is not
Riemann integrable.
Example 6.6
For each positive integer n, let
n2 x(1 − nx), if 0 ≤ x ≤ n1 ,
fn (x) =
0, otherwise.
Z 1
Notice that fn is integrable on [0, 1]. Let cn = fn (x)dx.
0
Solution
(a) Since fn (0) = 0 for all n ∈ Z+ ,
lim fn (0) = 0.
n→∞
Chapter 6. Sequences and Series of Functions 459
If x > 0, there is a positive integer N so that x > 1/N . This implies that
fn (x) = 0 for all n ≥ N . Hence, we also have
lim fn (x) = 0.
n→∞
∞
X
Hence, the series of functons xn converges pointwise on the interval
n=0
1
(−1, 1) to the function s(x) = .
1−x
Example 6.8
∞
2
X
Determine the domain of convergence of the series e−n x .
n=1
Solution
2x
If x ≤ 0, −n2 x ≥ 0 for all n ∈ Z+ . Therefore, lim e−n ̸= 0, and so the
n→∞
∞
2
X
series e−n x is divergent.
n=1
2
If x > 0, lim e−n x = 0. In this case, notice that n2 ≥ n for all n ∈ Z+
n→∞
implies that
2
0 ≤ e−n x ≤ e−nx for all n ∈ Z+ .
∞
X
Since the series e−nx is a geometric series with positive constant ratio
n=1
∞
2x
X
r=e −x
< 1, it is convergent. By the comparison test, the series e−n
n=1
is also convergent.
∞
2x
X
Therefore, the domain of convergence of the series e−n is D =
n=1
(0, ∞).
Chapter 6. Sequences and Series of Functions 462
Exercises 6.1
Question 1
For each positive integer n, let fn : R → R be the function defined by
2
fn (x) = e−nx .
Question 2
For each positive integer n, let fn : (0, ∞) → R be the function defined by
1
fn (x) = .
1 + xn
(a) Determine the pointwise convergence of the sequence of functions
{fn }.
Question 3
For each positive integer n, let fn : [0, ∞) → R be the function defined by
x
fn (x) = ,
1 + nx
Z 1
and let cn = fn (x)dx.
0
Question 4
For each positive integer n, let fn : R → R be the function defined by
x
fn (x) = n sin ,
n
Z 1
and let cn = fn (x)dx.
0
Question 5
∞
2
X
Find the domain of convergence of the series of functions ne−nx .
n=1
Question 6
∞
2
X
Find the domain of convergence of the series of functions ne−n x .
n=1
Chapter 6. Sequences and Series of Functions 464
lim lim fn (x) does not necessary equal to lim lim fn (x).
x→x0 n→∞ n→∞ x→x0
d d
lim fn (x) does not necessary equal to lim fn (x).
dx n→∞ n→∞ dx
3. The integral of f over an interval [a, b] does not necessary equal to the
limit the integrals of fn over [a, b].
Z b Z b
lim fn (x)dx does not necessary equal to lim fn (x)dx.
a n→∞ n→∞ a
Since derivatives and integrals are also limits, all these pathological
behaviors have the same root. Namely, one cannot simply interchange the
orders of two limits, as have been shown in Section 5.5.
For the point x = 1, {fn (1)} converges to f (1) = 1. For any ε > 0, we can take
N = 1. Then for all n ≥ N ,
The same goes for the point x = 0. For any other x in the interval (0, 1), {fn (x)}
converges to f (x) = 0. Given ε > 0, if ε < 1, the smallest N such that
Proposition 6.1
One sees that it is a matter of the ordering of the quantifiers, but it makes
a significant difference when we interchange the orders of a universal quantifier
with a existential quantifier.
One should also compare the definition of uniform convergence to uniform
continuity that we discussed in Section 2.5. In both cases, the uniformity is with
Chapter 6. Sequences and Series of Functions 467
Non-Uniform Convergence
In logical expressions, the sequence of functions {fn : D → R} does not
converge uniformly to the function f : D → R is expressed by
Proposition 6.2
Example 6.9
Solution
In Example 6.1, we have seen that the sequence {fn } converges pointwise
to the function f : [0, 1] → R, where f (x) = 0 for x ∈ [0, 1) and f (1) = 1.
If {fn : [0, 1] → R} converges uniformly, it must converge to the same
function f :: [0, 1] → R. Take ε = 12 . There must be a positive integer N
such that for all n ≥ N , for all x ∈ [0, 1],
1
|fn (x) − f (x)| < .
2
Chapter 6. Sequences and Series of Functions 468
Example 6.10
2
For n ≥ 1, let fn : R → R be the function fn (x) = xe−nx . Show that the
sequence {fn } converges uniformly.
Solution
In Example 6.4, we have seen that the sequence {fn } converges pointwise
to the function f : R → R that is identically 0. Notice that
2
fn′ (x) = (1 − 2nx2 )e−nx .
√
This shows that fn′ (x) > 0 for |x| < 1/ 2n, and fn′ (x) < 0 for |x| >
√
1/ 2n. Since
1 1
|fn (x) − f (x)| = |fn (x)| ≤ √ ≤ √ < ε.
2n 2N
This proves that the sequence of functions {fn } converges uniformly to the
function f that is identically 0.
Theorem 6.3
Let D be a subset of real numbers, and let {fn : D → R} be a sequence of
functions defined on D.
Proof
We have addressed I. and II. (a). Let us now consider II. (b). If the sequence
of functions {fn : D → R} converges uniformly to the function f : D →
R, given ε > 0, there is a positive integer N ≥ N0 such that for all n ≥ N
and for all x ∈ D,
ε
|gn (x)| = |fn (x) − f (x)| < .
2
This gives
ε
0 ≤ Mn = sup |gn (x)| ≤ <ε for all n ≥ N.
x∈D 2
Therefore, lim Mn = 0.
n→∞
Conversely, if lim Mn = 0, given ε > 0, there is a positive integer N ≥ N0
n→∞
such that
Mn < ε for all n ≥ N.
It follows that for all n ≥ N , for all x ∈ D,
Example 6.11
For the sequence of functions discussed in Example 6.9,
x n , if 0 ≤ x < 1,
fn (x) − f (x) =
0, if x = 1.
Therefore,
Mn = sup |fn (x) − f (x)| = 1.
0≤x≤1
Example 6.12
To apply Theorem 6.3, we need to know apriori the pointwise limit of the
sequence of functions {fn } to be able to conclude the uniform convergence of the
sequence. Sometimes it could be difficult to find the limit function. To circumvent
this problem, we introduce the concept of uniformly Cauchy.
Theorem 6.4
Cauchy Criterion for Uniform Convergence of Sequences of Functions
Proof
If the sequence of functions {fn : D → R} converges uniformly to f :
D → R, given ε > 0, there is a positive integer N such that for all n ≥ N ,
ε
|fn (x) − f (x)| < for all x ∈ D.
2
Chapter 6. Sequences and Series of Functions 472
Using Theorem 6.4, Theorem 6.3 can be finetuned as follows. The proof is
straightforward and we leave it to the exercises.
Theorem 6.5
Given that {fn : D → R} is a sequence of functions defined on the subset
D of real numbers, for each pair of (m, n) ∈ Z+ × Z+ , define the extended
real number Mm,n as
In most cases, such as Example 6.8, we can only justify a series of functions
converges pointwise, but we cannot find an explicit close form for the sum s(x).
In this case, a Cauchy criterion becomes useful.
Chapter 6. Sequences and Series of Functions 474
Theorem 6.6
Cauchy Criterion for Uniform Convergence of Series of Functions
∞
X
A series of functions fn (x) converges uniformly on a set D if and only
n=1
if for every ε > 0, there is a positive integer N such that for all m ≥ n ≥ N ,
m
X
fk (x) < ε for all x ∈ D.
k=n
Example 6.13
∞
2x
X
For the series e−n considered in Example 6.8, we have shown that
n=1
it converges pointwise on the interval (0, ∞). Let us prove that the
convergence is uniform on any set D of the form D = [a, ∞), where a
is a positive constant.
We notice that if m ≥ n and x ≥ a,
m m ∞ ∞
X
−k2 x
X
−kx
X
−kx
X
−ka e−na
0≤ e ≤ e ≤ e ≤ e = .
k=n k=n k=n k=n
1 − e−a
e−na
0≤ < ε.
1 − e−a
Chapter 6. Sequences and Series of Functions 475
∞
2x
X
By Theorem 6.6, the series e−n converges uniformly on [a, ∞).
n=1
∞
2x
X
The readers are invited to show that the series e−n does not converge
n=1
uniformly on the set (0, ∞). It is a typical situation that allthough the series
converges pointwise on a set A, it fails to converge uniformly on A, but it converges
uniformly on subsets of A. Most of the time, we do not need uniform convergence
on A, but uniform convergence on a collection of subsets of A whose union is A
is enough. In the example above, C = {[a, ∞) | a ≥ 0} is a collection of subsets
of A = (0, ∞) whose union is A.
Theorem 6.7
Let {fn : D → R} be a sequence of functions defined on D. If the series
X∞ ∞
X
|fn (x)| converges uniformly on D, then the series fn (x) converges
n=1 n=1
absolutely and uniformly on D.
Chapter 6. Sequences and Series of Functions 476
Proof
∞
X
Since the series |fn (x)| converges uniformly on D, it also converges
n=1
∞
X
pointwise. Hence, the series fn (x) converges absolutely on D.
n=1
∞
X
Since |fn (x)| converges uniformly on D, it is uniformly Cauchy. Given
n=1
ε > 0, there is a positive integer N such that for all m ≥ n ≥ N ,
m
X
|fk (x)| < ε for all x ∈ D.
k=n
∞
X
Hence, the series fn (x) is also uniformly Cauchy on D. Therefore, it
n=1
also converges uniformly.
(i) For each n ∈ Z+ , there is a positive constant Mn such that |fn (x)| ≤
Mn for all x ∈ D.
∞
X
(ii) The series Mn is convergent.
n=1
∞
X
Then the series fn (x) converges absolutely and uniformly on D.
n=1
Chapter 6. Sequences and Series of Functions 477
Proof
∞
X
By Theorem 6.7, we only need to show that the series |fn (x)| converges
n=1
∞
X
uniformly on D. Given ε > 0, since the series Mn is convergent, there
n=1
is a positive integer N such that for all m ≥ n ≥ N ,
m
X
Mk < ε.
k=n
∞
X
By Theorem 6.6, the series |fn (x)| converges uniformly on D.
n=1
Example 6.14
∞
2
X
Let a be a positive number. Show that the series (−1)n−1 e−n x
n=1
converges absolutely and uniformly on [a, ∞).
Solution
2
For n ∈ Z+ , let fn (x) = (−1)n−1 e−n x . For x ∈ [a, ∞), x ≥ a. Hence, for
n ∈ Z+ ,
2
|fn (x)| = e−n x ≤ e−nx ≤ e−na .
∞
X
Since r = e −na
< 1, the geometric series e−na is convergent. By
n=1
∞
2x
X
Weierstrass M -test, the series (−1)n−1 e−n converges absolutely and
n=1
uniformly on [a, ∞).
Chapter 6. Sequences and Series of Functions 478
Exercises 6.2
Question 1
2
For n ≥ 1, let fn : [0, 1] → R be the function fn (x) = e−nx . Show that
the sequence of functions {fn } does not converge uniformly.
Question 2
x
For n ≥ 1, let fn : R → R be the function fn (x) = n sin . Show that
n
the sequence of functions {fn } does not converge uniformly.
Question 3
x
For n ≥ 1, let fn : [0, 2π] → R be the function fn (x) = n sin . Show
n
that the sequence of functions {fn } converges uniformly.
Question 4
x
For n ≥ 1, let fn : [0, ∞) → R be the function fn (x) = .
1 + nx
Determine whether the sequence of functions {fn } converges uniformly.
Question 5
∞
2x
X
Let a be a positive constant. Show that the series (−1)n−1 ne−n
n=1
converges absolutely and uniformly on the set [a, ∞).
Chapter 6. Sequences and Series of Functions 479
In this section, we are going to see how uniform convergence can avoid the
pathological behaviours we mentioned in the beginning of Section 6.2. First we
show that uniform limit of continuous functions is continuous. This is a very
important result in mathematical analysis.
Proof
The proof is a standard 1/3 argument. Given x0 ∈ D, we want to show
that f is continuous at x0 using the ε − δ argument. Given ε > 0, there is a
positive integer N such that for all n ≥ N ,
ε
|fn (x) − f (x)| < for all x ∈ D.
3
We are only going to use this statement when n = N . Since fN is
continuous at x0 , there is a δ > 0 such that for all x ∈ D, if |x − x0 | < δ,
then
ε
|fN (x) − fN (x0 )| < .
3
From these, we find that if x is in D and |x − x0 | < δ, then
|f (x) − f (x0 )| ≤ |f (x) − fN (x)| + |fN (x) − fN (x0 )| + |fN (x0 ) − f (x0 )|
ε ε ε
< + + = ε.
3 3 3
This proves that f is continuous at x0 .
Chapter 6. Sequences and Series of Functions 480
Example 6.15
Corollary 6.10
Proof
We apply Theorem 6.9 to the sequence of partial sums {sn (x)}. Since
n
X
sn (x) = fk (x) is a finite sum of continuous functions, it is continuous.
k=1
By Theorem 6.9, s(x) = lim sn (x) is continuous.
n→∞
Theorem 6.11
Assume that for each n ∈ Z+ , the funtion fn : [a, b] → R is Riemann
integrable. If the sequence of functions {fn : [a, b] → R} converges
uniformly to the function f : [a, b] → R, then f : [a, b] → R is also
Riemann integrable, and the orders of the limit operation and the integration
operation can be interchanged. Namely,
Z b Z b Z b
lim fn (x)dx = f (x)dx = lim fn (x)dx. (6.2)
n→∞ a a a n→∞
Proof
Given ε > 0, there is a positive integer N such that for all n ≥ N ,
ε
|fn (x) − f (x)| < for all x ∈ [a, b]. (6.3)
3(b − a)
Therefore,
ε ε
U (f, P ) ≤ U (fN , P ) + , L(f, P ) ≥ L(fN , P ) − .
3 3
Thus,
2ε ε 2ε
U (f, P ) − L(f, P ) ≤ U (fN , P ) − L(fN , P ) + < + = ε.
3 3 3
From this, we conclude that f is Riemann integrable. This in turn implies
that for any n ∈ Z+ , the function fn − f is Riemann integrable on [a, b],
and so is the function |fn − f |. With the same ε > 0, we find from (6.3)
that for any n ≥ N ,
Z b Z b Z b
fn (x)dx − f (x)dx = (fn (x) − f (x))dx
a a a
Z b
ε
≤ |fn (x) − f (x)| dx ≤ < ε.
a 3
This proves that (6.2) holds.
Corollary 6.12
Proof
We apply Theorem 6.11 to the sequence of partial sums {sn (x)}. Since
n
X
sn (x) = fk (x) is a finite sum of Riemann integrable functions, it is
k=1
Riemann integrable. The rest follows from Theorem 6.11.
Example 6.16
Theorem 6.13
Given that {fn : (a, b) → R} is a sequence of functions which satisfies the
following conditions.
(i) There is a point x0 in the interval (a, b) such that the sequence
{fn (x0 )} converges to a number y0 .
Proof
+
For each n ∈ Z , since fn : I → R is differentiable, it is continuous.
Given a point c in the interval (a, b), let {hn,c : (a, b) → R} be a sequence
of functions defined by
fn (x) − fn (c) ,
if x ̸= c,
hn,c (x) = x−c (6.4)
′
fn (c), if x = c.
Chapter 6. Sequences and Series of Functions 485
′
|hm,c (x) − hn,c (x)| ≤ sup |fm (x) − fn′ (x)|.
a<x<b
′
sup |hm,c (x) − hn,c (x)| ≤ sup |fm (x) − fn′ (x)|. (6.5)
a<x<b a<x<b
fm (x)−fn (x) = fm (x0 )−fn (x0 )+(x−x0 ) (hm,x0 (x) − hn,x0 (x)) . (6.6)
Given ε > 0, since the sequence {fn (x0 )} is convergent, there is a positive
integer N1 such that for all m ≥ n ≥ N1 ,
ε
|fm (x0 ) − fn (x0 )| < .
2
Chapter 6. Sequences and Series of Functions 486
Eq. (6.6) implies that for all m ≥ n ≥ N , and for all x ∈ (a, b),
|fm (x) − fn (x)| ≤ |fm (x0 ) − fn (x0 )| + |x − x0 ||hm,x0 (x) − hn,x0 (x)|
ε ε
< + (b − a) × = ε.
2 2(b − a)
By Theorem 6.4, this proves that the sequence of functions {fn : (a, b) →
R} converges uniformly. Let
f (x) − f (c) ,
if x ̸= c,
hc (x) = lim hn,c (x) = x−c .
n→∞
g(c), if x = c.
Remark 6.1
Corollary 6.14
Proof
Applying Theorem 6.13 to the sequence of partial sums {sn (x)}. Since
Xn
sn (x) = fk (x) is a finite sum of differentiable functions, it is
k=1
differentiable. The rest follows from Theorem 6.13.
Chapter 6. Sequences and Series of Functions 488
Example 6.17
∞
2x
X
Consider the series e−n discussed in Example 6.13. We have shown
n=1
that it converges uniformly on [a, ∞) when a is a positive number. For each
1 2
n ∈ Z+ , fn (x) = − 2 e−n x is a differentiable function with derivative
n
d 1 −n2 x 2
− 2e = e−n x .
dx n
Notice that
1
|fn (x)| ≤ for all x ∈ [0, ∞).
n2
∞
X 1
Since the series is convergent, Weierstrass M -test shows that the
n=1
n2
∞ ∞
X X 1 2
series fn (x) = − 2 e−n x converges absolutely and uniformly on
n=1 n=1
n
[0, ∞). Corollary 6.14 shows that for any x ∈ (a, ∞), we can do term by
term differentiation and obtain
∞ ∞
d X 1 −n2 x X −n2 x
− e = e . (6.7)
dx n=1 n2 n=1
Since a > 0 is arbitrary, eq. (6.7) holds for any x > 0. However, this is not
true for x = 0 even if we only consider right derivatives, as the right hand
side of the equation is divergent when x = 0.
Chapter 6. Sequences and Series of Functions 489
Exercises 6.3
Question 1
∞
2x
X
(a) Show that the series n2 e−n defines a continuous function on
n=1
(0, ∞).
∞
2x
X
(b) Show that the series e−n defines a differentiable function on
n=1
(0, ∞), and for each x > 0,
∞ ∞
d X −n2 x X 2
e =− n2 e−n x .
dx n=1 n=1
Question 2
Let {fn : (a, b) → R} be a sequence of continuously differentiable
functions. Assume that there is a point x0 ∈ [a, b] such that the sequence
{fn (x0 )} converges to a point y0 , and the sequence of functions {fn′ :
(a, b) → R} converges uniformly to a function g : (a, b) → R. Use
the fundamental theorems of calculus and Theorem 6.11 to prove that
the sequence of functions {fn : (a, b) → R} converges uniformly to a
differentiable function f : (a, b) → R, and f ′ (x) = g(x) for all x ∈ (a, b).
Chapter 6. Sequences and Series of Functions 490
∞
X
Each term in a power series cn (x − x0 )n is a simple polynomial cn (x −
n=0
x0 )n which is infinitely differentiable. However, as an infinite series, we need
X∞
to address the convergence issue. Obviously, the power series cn (x − x0 )n
n=0
converges when x = x0 .
Recall that in Chapter 5, we have discussed the ratio test in Theorem 5.28.
X∞
Given an is a series with an ̸= 0 for all n ∈ Z+ , let
n=1
an+1 an+1
r = lim inf and R = lim sup .
n→∞ an n→∞ an
∞
X
Then the series an is divergent if r > 1, convergent if R < 1, but inconclusive
n=1
an+1
if r ≤ 1 ≤ R. This test is useful if the limit lim exists. For then r = R and
n→∞ an
we only left with finitely many points which we cannot conclude the convergence
of the power series. Let us look at some examples.
Chapter 6. Sequences and Series of Functions 491
Example 6.18
∞
X xn
Find the domain of convergence of the power series .
n=0
n!
Solution
The power series is convergent when x = 0. When x ̸= 0, using ratio test
xn
with an = , we find that
n!
an+1 |x|
lim = lim = 0.
n→∞ an n→∞ n + 1
Therefore, the series is convergent for all real numbers x. The domain of
convergence is R, the set of real numbers.
Example 6.19
∞
X
Find the domain of convergence of the power series n!xn .
n=0
Solution
The power series is convergent when x = 0. When |x| ̸= 0, using ratio test
with an = n!xn , we have
an+1
lim = lim (n + 1)|x| = ∞.
n→∞ an n→∞
∞
X
Hence, the series is divergent if x ̸= 0. We conclude that the series n!xn
n=0
is only convergent when x = 0. The domain of convergence is the set {0}.
Example 6.20
∞
X xn
Find the domain of convergence of the power series .
n=1
n2
Chapter 6. Sequences and Series of Functions 492
Solution
The power series is convergent when x = 0. When x ̸= 0, using ratio test
xn
with an = 2 , we have
n
2
n2
an+1 n
lim = |x| lim = |x| lim = |x|.
n→∞ an n→∞ (n + 1)2 n→∞ n+1
Therefore, the series is convergent if |x| < 1, and divergent if |x| > 1.
When |x| = 1, the test is inconclusive.
∞ ∞
X 1 X (−1)n
But we know that the series and the series are
n=1
n2 n=1
n2
∞
X xn
convergent. Therefore, the series is convergent if and only if
n=1
n2
|x| ≤ 1. The domain of convergence is the set [−1, 1].
In the examples above, we apply the ratio test to determine the domain of
convergence. This works fine when all the coefficients cn in the power series
X∞
cn (x − x0 )n are nonzero, or only finitely many of them are zero. We need to
n=0
an+1
find the limit inferior and limit superior of the sequence , where an is
an
the nth term cn (x − x0 )n in the power series. Since
essentially
we need to find the limit inferior and limit superior of thesequence
cn+1 cn+1
, then multiply by |x − x0 |. If the limit of the sequence
cn cn
exists, the limit inferior and limit superior of this sequence are the same, and the
domain of convergence can be determined up to the end points of an interval. We
apply other convergence test to check the convergence at these end points.
There are two problems with using the ratio test for determining the domain
of convergence.
cn+1
1. If the limit inferior and limit superior of the sequence are not the
cn
same, the ratio test is inconclusive for x in an interval.
Chapter 6. Sequences and Series of Functions 493
∞
X
2. When infinitely many of the coefficients cn in the power series cn (x − x0 )n
n=0
are zero, the ratio test cannot be applied. This problem can be circumvented
if there is some patterns on the indices n for which cn is 0. For example, if
c2n = 0 for all n ∈ Z+ , the series only contains the odd terms, and it can be
written as ∞
X
c2n−1 (x − x0 )2n−1 .
n=1
In this case, we can apply the ratio test with an = c2n−1 (x − x0 )2n−1 . However,
the first problem might still be present.
To resolve these problems, we find that the root test (Theorem 5.27) is better
∞
X
from the theoretical point of view. Given a series an , let
n=1
p
n
ρe = lim sup |an |.
n→∞
∞
X
The root test says that the series an is convergent if ρe < 1, divergent if ρe > 1,
n=1
and inconclusive if ρe = 1.
Applying the root test to a power series, we have the following.
Corollary 6.16
∞
X
Given a power series cn (x − x0 )n such that cn ̸= 0 for all n, assume
n=0
that the limit
cn+1
ρ = lim
n→∞ cn
exists in the extended sense.
Proof
cn+1
By Theorem 5.26, we find that lim exists implies that
n→∞ cn
p
n
p cn+1
lim sup |cn | = lim n |cn | = lim .
n→∞ n→∞ n→∞ cn
Domain of Convergence
Theorem 6.15 shows that the domain of convergence of a power series
centered at x0 can only be one of the following cases:
1. R 2. {x0 }
3. (x0 − R, x0 + R) 4. [x0 − R, x0 + R)
5. (x0 − R, x0 + R] 6. [x0 − R, x0 + R]
Example 6.21
∞
X
Find the domain of convergence of the power series cn x n .
n=1
Solution
Notice that
cn+1 n + 1, if n is odd,
= 1
cn , if n is even.
n
Applying ratio test with an = cn xn , we find that if x ̸= 0,
an+1 cn+1
lim inf = |x| lim inf =0
n→∞ an n→∞ cn
an+1 cn+1
lim sup = |x| lim sup = ∞.
n→∞ an n→∞ cn
This shows that the ratio test is inconclusive for any x except x = 0.
Let us turn to root test. By (5.4), we have
√
n
lim n = 1.
n→∞
This example shows that applying ratio test naively will leads to inconclusive
scenario, but the root test has rescued the problem. In practice, we always want
to avoid applying the root test because it is difficult to find the limit superior of
p
the sequence { n |cn |} when the coefficients cn . In the example above, we can
avoid using root test by writing the power series as a sum of two power series, and
apply the ratio test to the two power series separately. In any case, the root test
has given a theoretical decisive conclusion about the possible types of domain of
convergence for a power series.
For a power series whose radius of convergence R is 0, it only converges at
a single point x = x0 . So there is no point to consider such power series. If the
X∞
radius of convergence R of a power series cn (x − x0 )n is positive, the power
n=0
series defines a function on the open interval (x0 − R, x0 + R). We want to study
the continuity, differentiability and integrability of such a power series. Therefore,
we need to determine whether the power series converges uniformly.
X∞
Unfortunately, in general, a power series cn (x − x0 )n does not converge
n=0
uniformly on the interval (x0 − R, x0 + R). For example, consider the series
X∞ ∞
X
s(x) = xn . In Example 6.7, we have seen that xn is convergent when
n=0 n=0
|x| < 1, and divergent when |x| > 1. Hence, its radius of convergence is R = 1.
X∞
When |x| < 1, the power series xn defines the function
n=0
∞
X 1
s(x) = xn = .
n=0
1−x
we find that
sup |s(x) − sn (x)| = ∞.
|x|<1
∞
X
Hence, the series xn does not converge uniformly on (−1, 1). However, if a
n=0
is a number such that 0 < a < 1, then for |x| ≤ a,
xn+1 an+1
≤ .
1−x 1−a
Therefore,
an+1
sup |s(x) − sn (x)| ≤ .
|x|≤a 1−a
This implies that
lim sup |s(x) − sn (x)| = 0.
n→∞ |x|≤a
∞
X
Hence, the series xn converges uniformly on [−a, a].
n=0
A general power series also have similar behavior.
Proof
∞
R + R1 X
Let R2 = . Then R1 < R2 < R. Hence, the series cn (x − x0 )n
2 n=0
X∞
is convergent when |x − x0 | = R2 . Let x2 = x0 + R2 . Then cn (x2 −
n=0
∞
X
x0 )n = cn R2n is convergent. This implies that lim cn R2n = 0.
n→∞
n=0
Chapter 6. Sequences and Series of Functions 499
Proof
Given any x1 ∈ D = {x | |x − x0 | < R}, R1 = |x1 − x0 | < R.
Theorem 6.17 says that the power series converges uniformly on the set
D1 = {x | |x − x0 | ≤ R1 }, which contains the point x1 .
For n ≥ 0, the function fn (x) = cn (x − x0 )n is continuous. By Corollary
∞
X
6.10, the power series cn (x − x0 )n is continuous at x = x1 .
n=0
is Riemann integrable on [a, b], and we can integrate term by term. Namely,
Z b ∞
Z bX ∞
X Z b
n
f (x)dx = cn (x − x0 ) dx = cn (x − x0 )n dx. (6.8)
a a n=0 n=0 a
Chapter 6. Sequences and Series of Functions 501
Proof
Let R1 = max{|a − x0 |, |b − x0 |}. Then 0 < R1 < R and [a, b] is contained
∞
X
in [x0 − R1 , x0 + R1 ]. By Theorem 6.17, the power series cn (x − x0 )n
n=0
converges uniformly on [x0 −R1 , x0 +R1 ], and hence on [a, b]. For any n ≥
0, the function fn (x) = cn (x − x0 )n is Riemann integrable. By Corollary
X∞
6.12, the function f (x) = cn (x − x0 )n is Riemann integrable on [a, b]
n=0
and (6.8) holds.
Lemma 6.20
Given that {an } is a sequence of nonnegative numbers,
√
n
lim sup n an = lim sup an .
n→∞ n→∞
Proof
√ √
For all n ∈ Z+ , n ≥ 1. By (5.4), we have lim n n = 1. Hence, given
n
n→∞
ε > 0, there is a positive integer N such that for all n ≥ N ,
√
1≤ n
n < 1 + ε.
Theorem 6.21
∞
X
Let cn (x − x0 )n be a power series with a positive radius of convergence
n=0
∞
X
R. Then the radius of convergence of the derived series ncn (x − x0 )n−1
n=1
is also R.
Proof
∞
X
′
Let R be the radius of convergence of the derived series ncn (x−x0 )n−1 .
n=1
It is not difficult to see that it is the same as the radius of convergence of
X∞
the series ncn (x − x0 )n . By Lemma 6.20,
n=1
1 p
n
p
n 1
′
= lim sup |nc n | = lim sup |cn | = .
R n→∞ n→∞ R
Notice that if k ∈ Z+ ,
dk
(x − x0 )n = n(n − 1) · · · (n − k + 1)(x − x0 )n−k .
dxk
By induction, we can deduce the following.
Corollary 6.22
∞
X
Given that cn (x − x0 )n is a power series whose radius of convergence
n=0
R is positive. For any k ∈ Z+ , the series
∞
X
n(n − 1) · · · (n − k + 1)cn (x − x0 )n−k
n=k
Proof
∞
X
The k = 1 case, which says that the series ncn (x − x0 )n−1 has radius of
n=1
convergence R, is given by Theorem 6.21. Applying Theorem 6.21 to the
∞
X ∞
X
n−1
series ncn (x−x0 ) , we find that the series n(n−1)cn (x−x0 )n−2
n=1 n=2
also has radius of convergence R. This is the statement we need to prove
for the k = 2 case. For general k ∈ Z+ , we proceed by induction.
The next theorem says that we can differentiate a power series term by term.
∞ ∞
′ d X X
f (x) = cn (x − x0 )n = ncn (x − x0 )n−1 . (6.9)
dx n=0 n=1
Proof
By Theorem 6.21, the radius of convergence of the derived series
X∞ ∞
X
ncn (x − x0 )n−1 is also R. By Theorem 6.17, the series ncn (x −
n=1 n=1
x0 )n−1 converges absolutely and uniformly on [x0 −R1 , x0 +R1 ] if R1 < R.
Given x1 ∈ (x0 − R, x0 + R), |x1 − x0 | < R. Choose R1 such that
|x1 − x0 | < R1 < R. Then x1 ∈ (x0 − R1 , x0 + R1 ). Corollary 6.14 implies
that the function
Chapter 6. Sequences and Series of Functions 504
∞
X
f (x) = cn (x − x0 )n
n=0
Corollary 6.24
∞
X
Let cn (x − x0 )n be a power series with a positive radius of convergence
n=0
R. Then the function
∞
X
f (x) = cn (x − x0 )n
n=0
In particular,
f (k) (x0 ) = k!ck .
∞
X
We say that cn (x − x0 )n is a power series expansion or power series
n=0
representation of the function f (x) on the interval (x0 − R, x0 + R).
Example 6.22
1
When |x| < 1, the function f (x) = has a power series expansion
1−x
given by
∞
1 X
= xn = 1 + x + x2 + · · · + xn + · · · . (6.11)
1 − x n=0
Theorem 6.25
Let k be a nonnegative integer. Then for |x| < 1,
∞
1 X n n−k
k+1
= x . (6.12)
(1 − x) n=k
k
n n(n − 1) · · · (n − k + 1)
Here = are the binomial coefficients.
k k!
Proof
The k = 0 case is just the formula (6.11). By Corollary 6.24, we can
differentiate term by term k times and (6.10) gives
∞
k! X
= n(n − 1) · · · (n − k + 1)xn−k when |x| < 1.
(1 − x)k+1 n=k
Example 6.23
In probability theory, a geometric random variable is a random variable
X that depends on a parameter p where 0 < p < 1. If one performs a
series of identical and independent Bernoulli trials, each has a probability
p to be a success, then X is the number of these Bernoulli trials need to be
performed until the first success occurs. For any n ∈ Z+ , the probability
that X is equal to n is
P (X = n) = (1 − p)n−1 p.
Chapter 6. Sequences and Series of Functions 507
The expected number of Bernoulli trials need to be performed until the first
success is
∞
X ∞
X
E(X) = nP (X = n) = p n(1 − p)n−1 .
n=1 n=1
Thus,
∞
X ∞
X ∞
X
2 n−1 n−1
n (1 − p) = n(n − 1)(1 − p) + n(1 − p)n−1
n=1 n=1 n=1
2(1 − p) 1 2−p
= 3
+ 2 = .
p p p3
Therefore, the variance of X is
2−p 1 1−p
Var (X) = E(X 2 ) − E(X)2 = 2
− 2 = .
p p p2
1
Recall that the logarithm function f (x) = ln x is defined so that f ′ (x) = .
x
This gives
d 1
ln(1 + x) = .
dx 1+x
Using term by term integration, we can obtain power series representation for the
logarithm function.
Chapter 6. Sequences and Series of Functions 508
∞
X xn x2 x3 xn
ln(1 + x) = (−1)n−1 =x− + + · · · + (−1)n−1 + · · · .
n=1
n 2 3 n
Proof
Given any x1 with |x1 | < 1, let R1 = |x1 |. Since the geometric series
X∞
xn has radius of convergence 1, Theorem 6.19 says that we can integrate
n=0
(6.11) term by term over the interval with 0 and x1 as end points.
Z x1 Z ∞
x1 X ∞ Z x1
1 n
X
dx = x dx = xn dx.
0 1−x 0 n=0 n=0 0
This gives
∞ ∞
X xn+1
1
X xn1
− ln(1 − x1 ) = = .
n=0
n + 1 n=1 n
Replacing x1 by −x, we find that if |x| < 1, then
∞ n ∞
X
nx
X xn
ln(1 + x) = − (−1) = (−1)n−1 . (6.13)
n=1
n n=1
n
The theories that we have deveoped so far do not allow us to take the limit
x → 1− term by term on the right hand side of (6.13). However, we can go
around this problem in another way.
Example 6.24
Show that
X (−1)n−1 ∞
1 1 1
1− + − + ··· = = ln 2. (6.14)
2 3 4 n=1
n
Chapter 6. Sequences and Series of Functions 509
Solution
Notice that if x ̸= −1, then for any n ∈ Z+ ,
1 − (−x)n 1 xn
1−x+x2 −x3 +· · ·+(−1)n−1 xn−1 = = +(−1)n−1 .
1+x 1+x 1+x
Each of the functions is continuous on [0, 1]. Therefore,
Z 1
1 − x + x2 − x3 + · · · + (−1)n−1 xn−1 dx
0
Z 1 Z 1
1 xn
= dx + (−1)n−1 dx.
0 1+x 0 1+x
This gives
n
X (−1)k−1 1 1 1 1
=1− + − + · · · + (−1)n−1 = ln 2 + Rn ,
k=1
k 2 3 4 n
where
1
xn
Z
n−1
Rn = (−1) dx.
0 1+x
When 0 ≤ x ≤ 1, 1 + x ≥ 1, and so
xn
≤ xn when 0 ≤ x ≤ 1.
1+x
Therefore,
1 1
xn
Z Z
1
|Rn | ≤ dx ≤ xn dx = .
0 1+x 0 n+1
n
1 1 1 X (−1)k−1
1− + − + · · · = lim = ln 2.
2 3 4 n→∞
k=1
k
Next, we give the power series that represents the exponential function.
Chapter 6. Sequences and Series of Functions 510
Proof
∞
X xn
We have shown in Example 6.18 that the power series is convergent
n=0
n!
for all real numbers x. Corollary 6.24 says that it defines an infinitely
differentiable function f : R → R by
∞
X xn x x2 x3 x4
f (x) = =1+ + + + + ··· .
n=0
n! 1! 2! 3! 4!
f (0) = 0, f ′ (0) = 1.
We define this function as sin x. We can now prove the existence. This is actually
the power series method for solving differential equations. Assume that f (x) can
be written as a power series
∞
X
f (x) = cn x n .
n=0
(−1)n−1
c2n−1 = , c2n = 0 for all n ∈ Z+ .
(2n − 1)!
Now we are left to justify this is indeed the solution to our problem.
Chapter 6. Sequences and Series of Functions 512
Theorem 6.28
The power series
∞
X (−1)n−1 2n−1 x3 x5 x7
x =x− + − + ···
n=1
(2n − 1)! 3! 5! 7!
Proof
First, we need to show that the power series is convergent everywhere. We
(−1)n−1 2n−1
can use the ratio test with an = x . Then if x ̸= 0,
(2n − 1)!
an+1 1
lim = x2 lim = 0.
n→∞ an n→∞ 2n(2n + 1)
This shows that the power series is convergent for all x ∈ R. By Corollary
6.24, it defines an infinitely differentiable function f : R → R,
∞
X (−1)n−1 2n−1 x3 x5 x7
f (x) = x =x− + − + ··· .
n=1
(2n − 1)! 3! 5! 7!
x2 x4 x6
f ′ (x) = 1 − + − + ··· .
2! 4 6!
This gives f ′ (0) = 1. Differentiate term by term again, we have
∞
′′
X (−1)n−1 2n−3
f (x) = (2n − 1)(2n − 2) x
n=2
(2n − 1)!
∞ ∞
X (−1)n−1 2n−3 X (−1)n−1 2n−1
= x =− x = −f (x).
n=2
(2n − 3)! n=1
(2n − 1)!
This proves that f ′′ (x) + f (x) = 0, and thus the proof is completed.
Chapter 6. Sequences and Series of Functions 513
As a byproduct, we obtain the power series expansion for the functions sin x
and cos x.
The power series for cos x is obtained by term by term differentiating the
power series for sin x.
Finally, we want to consider the multiplication of two power series. Given that
p(x) and q(x) are polynomials of degree k and l respectively, with
p(x) = a0 + a1 x + · · · + ak xk and q(x) = b0 + b1 x + · · · + bl xl .
The product p(x)q(x) is a polynomial of degree k + l, with
p(x)q(x) = c0 + c1 x + · · · + ck+1 xk+1
= (a0 b0 ) + (a0 b1 + a1 b0 )x + · · · + ak bl xk+l .
For 0 ≤ n ≤ max{k, l}, we find that
n
X
cn = a0 bn + a1 bn−1 + · · · + an−1 b1 + an b0 = am bn−m .
m=0
This motivates the following.
n
X
cn = am bn−m .
m=0
Chapter 6. Sequences and Series of Functions 514
Proof
Without loss of generality, assume that x0 = 0.
It is suficient to prove that for any x1 satisfying 0 < R1 = |x1 | < Rc , the
X∞
series cn xn1 is convergent, and it converges to AB, where
n=0
∞
X ∞
X
A= an xn1 , B= bn xn1 .
n=0 n=0
∞
X
This would imply that the series cn xn is convergent on (−Rc , Rc ),
n=0
which proves that its radius of convergence R is at least Rc .
Chapter 6. Sequences and Series of Functions 515
Take an R2 such that R1 < R2 < Rc . Then R2 < Ra and R2 < Rb . Using
the same reasoning as in the proof of Theorem 6.17, we find that there is a
positive constant M such that
X l
n X n
X n
X
Cn = ak bl−k xl1 = ak xk1 bl−k xl−k
1
l=0 k=0 k=0 l=k
Xn n−k
X Xn
= ak xk1 bl xl1 = ak xk1 Bn−k
k=0 l=0 k=0
n
X
= BAn − ak xk1 (B − Bn−k ).
k=0
Example 6.25
Solution
We know that
∞
x
X xn x2 x3 x4 x5
e = =1+x+ + + + + ··· for all x ∈ R,
n=0
n! 2 6 24 120
∞
X x2n−1 x3 x5
sin x = (−1)n−1 =x− + + ··· for all x ∈ R.
n=1
(2n − 1)! 6 120
By Theorem 6.30,
x2 x3 x 4 x5 x3 x5
x
e sin x = 1 + x + + + + + ··· x− + + ···
2 6 24 120 6 120
x3 x4 x 5 x3 x4 x5 x5
= x + x2 + + + − − − + + ···
2 6 24 6 6 12 120
x3 x5
= x + x2 + − + ··· .
3 30
This gives the power series expansion of f (x) up to the x5 term. From this,
we find that
(5) 1
f (0) = 5! × − = −4.
30
Chapter 6. Sequences and Series of Functions 517
Exercises 6.4
Question 1
Let p be a positive number. Determine the domain of convergence of the
∞
X xn
power series .
n=0
np
Question 2
Show that when |x| < 1,
∞
−1
X x2n−1 x3 x5
tan x= (−1)n−1 =x− + + ··· .
n=1
2n − 1 3 5
Question 4
∞
X
Find a closed form formula for the sum of the series n3 xn when |x| < 1.
n=1
Question 5
Consider the function f : R → R defined by f (x) = ex cos x. Find the
power series expansion of f (x) up to the x5 term, and find f (5) (0).
Chapter 6. Sequences and Series of Functions 518
In Section 6.4, we have seen that the exponential function, logarithm function, sine
and cosine functions have power series representations that are valid on its domain
or a subset of its domain. Power series are limits of sequences of polynomials.
They are infinitely differentiable, and they can be differentiated term by term
and integrated term by term. Thus, they are very useful. Hence, we can ask
the following two questions.
∞
X
f (x) = cn (x − x0 )n when |x − x0 | < R.
n=0
For the first question, Corollary 6.24 says that if such a representation exists,
then we must have
Here the Taylor series is defined as a power series as long as the function is
infinitely differentiable in an open interval I that contains the point x0 . We do not
assume any convergence. Even though the Taylor series is convergent, we cannot
assume that it converges to the function f (x) itself. In Section 6.6.3, we are going
to see a classical example of an infinitely differentiable function whose Taylor
series converges but to a different function.
Nevertheless, for functions that are defined by a power series centered at x0 ,
Corollary 6.24 gives the following.
Theorem 6.31
∞
X
Assume that the power series cn (x − x0 )n has positive radius of
n=0
convergence R. If f (x) is the function defined by the power series
X∞
cn (x − x0 )n on the interval (x0 − R, x0 + R), then the Taylor series
n=0
∞
X
of f (x) at x0 is cn (x − x0 )n . Namely,
n=0
∞
X f (n) (x0 )
f (x) = (x − x0 )n when x ∈ (x0 − R, x0 + R).
n=0
n!
This shows that the Taylor series of f (x) converges to the function f (x).
It also says that the power series expansion of a function at a point x0 , if
exists, is unique, which is the Taylor series of the function at x0 .
Remark 6.3 Maclaurin Series for Odd Functions and Even Functions
Let a be a positive number and let f : (−a, a) → R be an infinitely
differentiable function. Since the derivative of an odd function is even,
and the derivative of an even function is odd, the following holds.
differentiation and integration to obtain the power series for new functions from
old ones. This is a useful tactic to find Taylor series of a large class of functions
from the few elementary ones listed above.
Example 6.26
x+1
Find the power series expansion of the function f (x) = at x = 0,
4 + x2
and find the largest open interval where this series is convergent.
Solution
Applying the formula
∞
1 X
= xn , |x| < 1
1 − x n=0
gives
∞ 2n
1 1 1X nx
= = (−1) , |x| < 2.
x2
4 + x2 4 n=0 22n
4 1+
4
The largest open interval where this series is convergent is (−2, 2).
Example 6.27
Let f : R → R be the function defined by
sin x ,
if x ̸= 0,
f (x) = x
1, if x = 0.
Show that f is infinitely differentiable, and find f (n) (0) for all n ≥ 0.
Chapter 6. Sequences and Series of Functions 522
Solution
For any real number x, the series
∞
X x2n−1 x3 x5 x7
(−1)n−1 =x− + − + ···
n=1
(2n − 1)! 3! 5! 7!
∞
sin x X x2n
converges to . Therefore, the power series (−1)n
x n=0
(2n + 1)!
converges for all x, and when x ̸= 0, it is equal to f (x). When x = 0,
it has value 1, which is equal to f (0). This proves that
∞
X x2n
f (x) = (−1)n for all x ∈ R.
n=0
(2n + 1)!
Since the function f (x) has a power series expansion that converges
everywhere, it is an infinitely differentiable function. From the power series
expansion, we find that
(−1)n
f (2n+1) (0) = 0, f (2n) (0) = for all n ≥ 0.
2n + 1
An important power series expansion that cannot be derived from the list of
Taylor series for elementary functions is the binomial series. Recall that if n is a
positive integer, the binomial expansion of (1 + x)n is given by
n
n
X n
(1 + x) = xk .
k=0
k
and for k ≥ 1,
α α(α − 1) · · · (α − k + 1)
= .
k k!
Example 6.28
Let α be a real number. Show that the Maclaurin series of the function
f : (−1, 1) → R, f (x) = (1 + x)α is
∞
X α
xk .
k=0
k
Solution
The function f is infinitely differentiable. By straightforward computation,
we have
f (k) (x) = α(α − 1) · · · (α − k + 1)(1 + x)α−k .
This gives,
f (k) (0) = α(α − 1) · · · (α − k + 1).
Therefore, the Maclaurin series of f is
∞ ∞ ∞
X f (k) (0) k
X α(α − 1) · · · (α − k + 1) k
X α k
x = x = x .
k=0
k! k=0
k! k=0
k
ck+1 α−k
lim = lim = 1,
k→∞ ck k→∞ k + 1
In the example above, we have shown that the Maclaurin series of f (x) =
∞
α
X α k
(1 + x) is x , which is a power series that converges on (−1, 1). But
k=0
k
we have not shown that the Maclaurin series converges to f (x) on (−1, 1), except
when α is an integer. To prove the convergence of the Maclaurin series to the
function, we will study the convergence of the sequence of partial sums.
The partial sums of Taylor series are called Taylor polynomials. They are
important in their own right.
Chapter 6. Sequences and Series of Functions 525
In particular,
Example 6.29
T1 (x) = T2 (x) = x,
x3
T3 (x) = T4 (x) = x − ,
2
x 3 x5
T5 (x) = T6 (x) = x − + ,
2 24
and so on.
Chapter 6. Sequences and Series of Functions 526
In this example, we notice that T2n−1 (x) = T2n (x) for all n ∈ Z+ . This is
because f (x) is an odd function.
We have noticed that the first Taylor polynomial
f (x) − g(x)
lim = 0.
x→x0 (x − x0 )n
We will show that the nth Taylor polynomial of a function f (x) at x0 is an nth -
order approximation of the function at x0 . First, we prove the following lemma
which says that for any real number x0 , any polynomial of degree n can be written
Xn
in the form ck (x − x0 )k .
k=0
Lemma 6.32
Given a real number x0 , and a polynomial p(x) of degree n, we have
n
X p(k) (x0 )
p(x) = (x − x0 )k .
k=0
k!
In other words, the nth Taylor polynomial of p(x) is p(x) itself, and the
Taylor series of p(x) is also p(x).
Chapter 6. Sequences and Series of Functions 527
Proof
Let
p(x) = a0 + a1 x + · · · + an xn ,
and let h = x − x0 . Then x = h + x0 . Substitute x by x0 + h, we have
p(x) = a0 + a1 (h + x0 ) + · · · + an (h + x0 )n .
a0 + a1 (h + x0 ) + · · · + an (h + x0 )n
Corollary 6.33
Corollary 6.34
Let I be an interval that contains the point x0 , and let n be a positive integer.
Given that f : I → R is a function that is n times differentiable, let
n
X f (k) (x0 )
Tn (x) = (x − x0 )k
k=0
k!
Proof
By Lemma 6.32, we have
n (k)
X Tn (x0 )
Tn (x) = (x − x0 )k .
k=0
k!
Theorem 6.35
Let I be an open interval that contains the point x0 , and let n be a positive
integer. Assume that the function f : I → R is n times differentiable.
Proof
Let us consider (a) first. If n = 1, we need to show that
By l’ Hôpital’s rule,
f (x) − p(x)
lim = 0.
x→x0 (x − x0 )n
f (x) − Tn (x)
lim = 0.
x→x0 (x − x0 )n
Chapter 6. Sequences and Series of Functions 530
These give
Notice that
n (k)
X f (x0 )
Tn (x) − p(x) = − ck (x − x0 )k .
k=0
k!
Theorem 6.35 says that the nth Taylor polynomial of a function f (x) at
a point x0 is the unique polynomial of degree at most n which is an nth -
order approximation of f (x) at x0 . Hence, we also called Tn (x) the Taylor
polynomial of f (x) of order n at x0 . One should avoid calling it the nth
degree Taylor polynomial as we have seen that Tn (x) does not necessary
have degree n.
The function
Rn (x) = f (x) − Tn (x)
Chapter 6. Sequences and Series of Functions 531
is called the remainder when we approximate the function f (x) by its nth Taylor
polynomial Tn (x) at x0 . Eq (6.16) says that when x approaches x0 , the order of
Rn (x) is smaller than the order of |x − x0 |n . If we assume that f has one more
derivative, we can say more.
We will first prove the Lagrange remainder theorem which assumes that f :
I → R is (n + 1) times differentiable.
f (n+1) (ξ)
f (x) − Tn (x) = (x − x0 )n+1 , where ξ = x0 + c(x − x0 ).
(n + 1)!
Proof
The proof is a straightforward application of Theorem 3.20, which is a
consequence of Cauchy mean value theorem. Let g : I → R be the function
defined by
n
X f (k) (x0 )
g(x) = f (x) − Tn (x) = f (x) − (x − x0 )k .
k=0
k!
Applying Theorem 3.20 to the function g, we find that for any x ∈ I \ {x0 },
there is a number c ∈ (0, 1) such that
g (n+1) (ξ)
f (x)−Tn (x) = g(x) = (x−x0 )n+1 , where ξ = x0 +c(x−x0 ).
(n + 1)!
(n+1)
Since Tn (x) is a polynomial of degree n, Tn (x) = 0 for all x ∈ I.
Therefore, g (n+1) (x) = f (n+1) (x) for all x ∈ I. This concludes the proof.
The Lagrange remainder theorem also holds in the n = 0 case. This is just the
Lagrange mean value theorem. Thus Lagrange remainder theorem is an extension
of the Lagrange mean value theorem. It gives useful estimates on the error term
in approximating a function by its Taylor polynomial, especially if f (n+1) (x) is
always positive or always negative in a neighbourhood of x0 .
Example 6.30
In this example, we demonstrate how we can use the Lagrange remainder
theorem to show that the Taylor series of the function f : R → R, f (x) =
ex , converges to f (x) for all real numbers x. The nth Taylor polynomial of
f (x) = ex at x = 0 is
x2 xn
Tn (x) = 1 + x + + ··· + .
2 n!
Since f (n) (x) = ex for any n ∈ Z+ , Lagrange remainder theorem says that
for any n ≥ 0, for any real number x ̸= 0, there is a number ξ strictly
between 0 and x such that
f (n+1) (ξ) n+1 eξ
ex − Tn (x) = x = xn+1 . (6.17)
(n + 1)! (n + 1)!
For fixed x, ξ depends on n but we can use ξ < |x| to get the estimate
eξ ≤ e|x| that is independent of n. This implies that
|x|n+1
|ex − Tn (x)| ≤ e|x| .
(n + 1)!
Chapter 6. Sequences and Series of Functions 533
∞
X |x|n
We have proved in Example 6.18 that the power series is
n=0
n!
|x|n+1
convergent. Therefore, lim = 0. This allows us to conclude
n→∞ (n + 1)!
that
x2 xn
ex = lim Tn (x) = 1 + x + + ··· + + ··· . (6.18)
n→∞ 2 n!
This is an alternative way to prove that the Taylor series of ex converges
to ex , instead of the aproach used in the proof of Theorem 6.27. From the
series expansion (6.18), it is easy to deduce that for all x > 0, and all n ≥ 1,
x2 xn
ex > 1 + x + + ··· + .
2 n!
In particular, we have
ex > 1 + x,
x x2
e >1+x+ ,
2
2
x x3
ex > 1 + x + + ,
2 6
x2 x3 x4
ex > 1 + x + + + .
2 6 24
For x < 0, the series (6.18) is alternating. The sequence {bn } with bn =
|x|n
is not decreasing. However, since x2n−1 < 0 and x2n > 0 for all
n!
n ≥ 1, and eξ > 0 for all ξ, we can use (6.17) to conclude that for x < 0,
ex > 1 + x,
x2
ex < 1 + x + ,
2
x2 x3
ex > 1 + x + + ,
2 6
x2 x3 x4
ex < 1 + x + + + .
2 6 24
In Theorem 3.37, we apply mean value theorem to prove that | sin x| ≤ |x| for
all real numbers x. In the following example, we extend this result partially.
Chapter 6. Sequences and Series of Functions 534
Example 6.31
x3
Show that for x ∈ (0, π), sin x > x − .
6
Solution
Let f (x) = sin x. Then f (x) is infinitely differentiable, with the third
Taylor polynomial at x = 0 given by
x3
T3 (x) = x − .
6
Apply the Lagrange remainder theorem, we find that for any x ∈ (0, π),
there is a ξ ∈ (0, x) ⊂ (0, π) so that
x3
sin x > x − for x ∈ (0, π).
6
Figure 6.8: The function f (x) = sin x and its Taylor polynomials at x = 0.
Theorem 6.37
Let I be an open interval, and let n be a positive integer. Assume that the
function f : I → R is (n + 1) times differentiable, and f (n+1) (x) = 0 for
all x ∈ I. Then f (x) is a polynomial of degree at most n.
Proof
If f (n+1) (x) = 0 for all x ∈ I, take any point x0 in I, and let Tn (x) be the
nth Taylor polynomial of f at x0 . By definition, f (x0 ) = Tn (x0 ). Given
x ∈ I \ {x0 }, the Lagrange remainder theorem implies that there is a point
ξ ∈ I such that
f (n+1) (ξ)
f (x) − Tn (x) = (x − x0 )n+1 .
(n + 1)!
Since f (n+1) (x) is identically 0, we find that for all x ∈ I, f (x) = Tn (x).
This proves that f is a polynomial of degree at most n.
Corollary 6.38
Let I be an open interval, and let n be a positive integer. Assume that
f : I → R and g : I → R are (n + 1) times differentiable functions such
that
f (n+1) (x) = g (n+1) (x) for all x ∈ I,
then there is a polynomial p(x) of degree at most n such that
1 x
Z
G(x) = (x − t)n g(t)dt
n! x0
and
G(n+1) (x) = g(x) for all x ∈ I.
Theorem 6.39 The Cauchy Remainder Formula
Let I be an open interval that contains the point x0 , and let n be a positive
integer. Given that f : I → R is a function that is (n+1) times continuously
differentiable, let
n
X f (k) (x0 )
Tn (x) = (x − x0 )k
k=0
k!
1 x
Z
f (x) − Tn (x) = (x − t)n f (n+1) (t)dt.
n! x0
Chapter 6. Sequences and Series of Functions 537
Proof
Let H : I → R be the function defined by
1 x
Z
H(x) = f (x) − Tn (x) − (x − t)n f (n+1) (t)dt.
n! x0
Then by the result proved in Example 4.28, we find that H is a function that
is (n + 1) times continuously differentiable,
and
Theorem 6.40
Let α be a real number. For |x| < 1,
∞
α
X α k
(1 + x) = x .
k=0
k
Proof
∞
X α
Let f (x) = (1 + x)α , −1 < x < 1. We have seen that xk is the
k
k=0
th
Maclaurin series of f (x). The n Taylor polynomial of f (x) at x = 0 is
n
X α
Tn (x) = xk .
k=0
k
Chapter 6. Sequences and Series of Functions 538
We need to show that lim Tn (x) = f (x) for all |x| < 1. It is easy to verify
n→∞
that
(n+1) α
f (x) = (n + 1)! (1 + x)α−n−1 .
n+1
By Cauchy remainder formula, for x ∈ (−1, 1),
1 x
Z
f (x) − Tn (x) = (x − t)n f (n+1) (t)dt
n! 0
Z x
α
= (n + 1) (x − t)n (1 + t)α−n−1 dt.
n+1 0
(1 − τ )n (1 + xτ )α−n−1 ≥ 0.
1 + xτ ≥ 1 − τ ≥ 0.
Exercises 6.5
Question 1
Let f : R → R be the function defined by
2 − 2 cos x ,
if x ̸= 0,
f (x) = x2
1, if x = 0.
Show that f is infinitely differentiable, and find f (n) (0) for all n ≥ 0.
Question 2
Show that for all x > 0,
x2
x− < ln(1 + x) < x.
2
Question 3
Show that for all x > 0,
x x2 √ x x2 x3
1+ − < 1+x<1+ − + .
2 8 2 8 16
Question 4
Show that for all x ∈ (−π, π),
x2
cos x ≥ 1 − .
2
Chapter 6. Sequences and Series of Functions 541
Question 5
Let α be a real number. Assume that α is not an integer. In Example 6.28,
∞
X α k
we have shown that the power series x , which is the Maclaurin
k=0
k
series of the function f (x) = (1 + x)α , has radius of convergence 1. Define
the function g : (−1, 1) → R by
∞
X α k
g(x) = x .
k=0
k
In this question, you are asked to show that g(x) = (1+x)α for x ∈ (−1, 1),
without using the Cauchy remainder formula.
(a) Show that (1 + x)g ′ (x) = αg(x) for all x ∈ (−1, 1).
In Example 1.36, we have defined the number e as the limit of the increasing
n
1
sequence {an }, where an = 1 + . We have proved that an ≤ 3 for all
n
n ∈ Z+ . This implies that e ≤ 3. In Theorem 6.27, we proved that
∞
X 1 1 1 1
e= = 1 + + + ··· + + ··· .
n=0
n! 1! 2! n!
Proof
Assume to the contrary that e is rational. Then since e is positive, there are
positive integers a and b such that
a
e= .
b
For any positive integer n, we apply Lagrange remainder theorem to the nth
Taylor polynomial of ex at the point x0 = 0. With x = 1, we find that there
is a number cn in the interval (0, 1) such that
1 1 1 ecn
e=1+ + + ··· + + . (6.20)
1! 2! n! (n + 1)!
ecn
n! n! e 3
0 < n!e − n! + n! + + ··· + = < ≤ .
2! n! n+1 n+1 n+1
is a positive integer that is less than 3/(n + 1). For n > 3, 3/n + 1 is less
than 1. This gives a contradiction. Hence, e must be irrational.
As in the case of the number e, we will show that π is an irrational number using
proof by contradiction. We begin by two lemmas.
Chapter 6. Sequences and Series of Functions 544
Lemma 6.42
Given that f : R → R and g : R → R are two infinitely differentiable
functions. For any n ∈ Z+ , and any numbers α and β,
Z β Z β
(2n+1)
f (x)g(x)dx + f (x)g (2n+1) (x)dx
α α
2n
X 2n
X (6.21)
k (k) (2n−k)
= (−1) f (β)g (β) − (−1)k f (k) (α)g (2n−k) (α).
k=0 k=0
Proof
+
Given n ∈ Z , define the function F : R → R by
2n
X
F (x) = (−1)k f (k) (x)g (2n−k) (x).
k=0
Then
2n
X 2n
X
F ′ (x) = (−1)k f (k+1) (x)g (2n−k) (x) + (−1)k f (k) (x)g (2n−k+1) (x)
k=0 k=0
Lemma 6.43
Let a, b and n be positive integers. Define the polynomial p : R → R by
xn (a − bx)n
p(x) = .
n!
For any integer k satisfying 0 ≤ k ≤ 2n, p(k) (0) and p(k) (a/b) are integers.
Proof
Using binomial expansion, we have
n
xn X n n−m
p(x) = a (−1)m bm xm .
n! m=0 m
By Lemma 6.32,
2n
X p(k) (0)
p(x) = xk .
k=0
k!
Comparing the coeficients, we find that
0,
if 0 ≤ k ≤ n − 1,
(k)
p (0) = k!
n
a2n−k (−1)k−n bk−n , if n ≤ k ≤ 2n.
n! k − n
By Lemma 6.32,
2n
X p(k) (a/b) a k
p(x) = x− .
k=0
k! b
Chapter 6. Sequences and Series of Functions 546
Proof
Assume that π is a rational number. Then there are positive integers a and
b such that
a
π= .
b
+
For n ∈ Z , define the polynomial pn (x) by
xn (a − bx)n bn xn (π − x)n
pn (x) = = ,
n! n!
and let Z π
In = pn (x) sin xdx.
0
(k) (k)
By Lemma 6.43, pn (0) and pn (π) are integers for all 0 ≤ k ≤ 2n. Since
are integers, we find that g (k) (0) and g (k) (π) are integers for all k ≥ 0. The
right hand side of (6.22) shows that In is an integer for all n ∈ Z+ . On the
other hand, for all 0 ≤ x ≤ π,
π 2
0 ≤ x(π − x) ≤ .
2
Therefore, for 0 ≤ x ≤ π,
n
π2b
1
0 ≤ pn (x) ≤ .
n! 4
which gives
0 ≤ In < 1 for all n ≥ N.
Chapter 6. Sequences and Series of Functions 548
We will show that this function is infinitely differentiable and f (n) (0) = 0 for all
n ≥ 0.
1
Figure 6.9: The function f (x) = exp − 2 .
x
Lemma 6.45
If p(x) is a polynomial, then
1 1
lim p exp − =0 (6.23a)
x→0+ x x
1 1
lim p exp − 2 = 0. (6.23b)
x→0 x x
Proof
In Example 3.24, we have shown that for any real number s, lim y s e−y =
y→∞
0. From this, we find that if k is an integer,
1 1
lim exp − = lim y k e−y = 0,
x→0+ xk x y→∞
and
1 1
lim exp − 2 = lim y k e−y = 0.
x→0 x2k x y→∞
Theorem 6.46
Let f : R → R be the function defined by
exp − 1
, if x ̸= 0,
f (x) = x2 (6.24)
0,
if x = 0.
Proof
We claim that for each positive integer n, there is a polynomial pn (x) of
degree 3n such that
pn 1 exp − 1 ,
if x ̸= 0,
f (n) (x) = x x2 (6.25)
0,
if x = 0.
When x ̸= 0,
1 1 2 1 1
f (n)
(x) = − 2 p′n−1 + 3 pn−1 exp − 2 .
x x x x x
Chapter 6. Sequences and Series of Functions 551
where
pn (x) = −x2 p′n−1 (x) + 2x3 pn−1 (x).
By inductive hypothesis, p′n−1 (x) is a polynomial of degree 3n − 4. Thus,
x2 p′n−1 (x) is a polynomial of degree 3n − 2. Since 2x3 pn−1 (x) is a
polynomial of degree 3n, pn (x) is a polynomial of degree 3n. For the
derivative at 0, Lemma 6.45 implies that
This proves the statement for the n case, and thus completes the induction.
Theorem 6.47
Let I be an open interval that contains the point x0 . There is an infinitely
differentiable function f : I → R whose Taylor series at the point x = x0
is convergent pointwise on I, but it does not converge to f (x) pointwise on
I.
Proof
Define the function f : I → R by
exp − 1
, if x ̸= x0 ,
(x − x0 )2
0,
if x = x0 .
∞
X f (n) (x0 )
(x − x0 )n ,
n=0
n!
is the series that is identically 0. Therefore, it converges everywhere, but it
does not converge to f (x) except at the point x = x0 .
Using almost the same proof as for Theorem 6.46, we obtain the following.
Theorem 6.48
Given a real number x0 , the function g : R → R defined by
exp − 1
, if x > x0 ,
g(x) = x − x0 (6.26)
0, if x ≤ x0 .
is infinitely differentiable.
The function g(x) defined by (6.26) is also not analytic. Nevertheless, it has
some important applications. It is usually used to "smooth" up a function or
truncate a function smoothly.
Theorem 6.49
Given two real numbers a and b with a < b, define the function h : R → R
by
g(x − a)
h(x) =
g(x − a) + g(b − x)
0, if x ≤ a,
1
exp − (6.27)
x−a
= , if a < x < b,
1 1
exp − + exp −
x−a b−x
1, if x ≥ b.
Proof
We just need to show that g(x − a) + g(b − x) is nonzero for all x ∈ R.
The rest follows from Theorem 6.48 and the definition of h(x). Since the
function g is nonnegative, in order for g(x − a) + g(b − x) = 0, we must
have g(x − a) = g(b − x) = 0. But we know that g(x − a) = 0 only when
x ≤ a, and g(b − x) = 0 only when x ≥ b. Since the set {x | x ≤ a} and
the set {x | x ≥ b} are disjoint, we conclude that g(x − a) + g(b − x) is
never 0.
Remark 6.4
The function h(x) defined by (6.27) is an example of an infinitely
diferentiable function that is increasing but assume constant values outside
a bounded interval.
Chapter 6. Sequences and Series of Functions 554
and
hm (x + 2m) = h(x) for all x ∈ R.
1
Figure 6.12: The functions hm (x) when m = 1, 2
and 14 .
Lemma 6.50
Given a positive number m, define xm,k = mk for all k ∈ Z+ . The function
hm defined in Definition 6.15 has the following properties.
Proof
Part (a) follows from hm (−m) = hm (m). Part (b) can be proved by
induction on k ≥ 0, using the periodicity of hm and the fact that hm is
an even function. Part (c) follows from the definition of hm and periodicity.
Lemma 6.51
Given a positive number ℓ and a point x ∈ R, let
Proof
The points nℓ, n ∈ Z, partition the real line into subintervals of the form
[nℓ, (n + 1)ℓ], each of length ℓ. Since U and V are adjacent intervals of
length ℓ/2, one of them must lie entirely inside one of the intervals of the
form [nℓ, (n + 1)ℓ].
By part (b) in Lemma 6.50, the graph of hℓ : [nℓ, (n + 1)ℓ] → R is a line
segment of slope 1 or −1. This proves the assertion when k = 0. To prove
the assertion for k ≥ 1, we notice that to obtain the graph of the function hℓ
from the graph of the function h2ℓ , we divide each line segment in the graph
of h2ℓ into two equal parts, one of the parts change slope from 1 to −1 or
from −1 to 1. Hence, if W is an interval and the graph of hℓ : W → R is
a line segment, the graph of h2k ℓ : W → R must also be a line segment for
any k ∈ Z+ . This completes the proof the the lemma.
Theorem 6.52
For a positive number m, let hm : R → R be the function
hm (x) = |x| for |x| ≤ m, and h(x + 2m) = h(x) for all x ∈ R.
Proof
+
By Lemma 6.50, for each n ∈ Z , the function gn : R → R is continuous
and
1
|gn (x)| ≤ n for all x ∈ R.
4
∞
X 1
Since the series is convergent, Weierstrass M -test implies that
n=0
4n
∞
X
the series gn (x) converges uniformly on R. Since each gn (x) is a
n=0
continuous function, Corollary 6.10 implies that the function f (x) =
X∞
gn (x) is continuous.
n=0
Now we are left to prove that f (x) is not diferentiable at any x ∈ R. Given
x0 ∈ R, assume that
f (x) − f (x0 )
f ′ (x0 ) = lim
x→x0 x − x0
exists. Then for any sequence {xk }∞
k=0 in R \ {x0 }, if lim xk = x0 , then
k→∞
the limit
f (xk ) − f (x0 )
lim
k→∞ x − x0
exists and is equal to f ′ (x0 ).
Chapter 6. Sequences and Series of Functions 557
gn (xk ) − gn (x0 )
xk − x0
is equal to 1 or −1 for each 0 ≤ k ≤ n. The sum of an odd number of 1
or −1 must be odd. The sum of an even number of 1 or −1 must be even.
Therefore,
f (xk ) − f (x0 )
ck =
xk − x0
is odd when k is even, and is even when k is odd. This implies that the
sequence {ck }∞k=0 is an integer sequence that is alternatingly odd and even.
Hence, it does not have a limit. This is a contradiction, which allows us to
conclude that f cannot be differentiable at x0 .
Chapter 6. Sequences and Series of Functions 558
Figure 6.13: The functions gn (x) for n = 0, 1, 2, 3, and the function f (x).
Lemma 6.53
The following identities hold.
n
X n k
(a) For n ≥ 0, x (1 − x)n−k = 1.
k=0
k
n
X k n k
(b) For n ≥ 1, x (1 − x)n−k = x.
k=1
n k
n
k2 n k
X x(1 − x)
(c) For n ≥ 2, 2
x (1 − x)n−k = x2 + .
k=1
n k n
n 2
X k n k x(1 − x)
(d) For n ≥ 2, x− x (1 − x)n−k = .
k=0
n k n
Proof
The first identity (a) is just a consequence of the binomial expansion
theorem.
Chapter 6. Sequences and Series of Functions 559
Therefore,
n n
X k n k n−k
X n − 1 k−1
x (1 − x) =x x (1 − x)n−k
k=1
n k k=1
k − 1
n−1
X n − 1
=x xk (1 − x)n−1−k = x.
k=0
k
It follows that
n n−2
X k(k − 1) n k n−k 2
X n−2 k
x (1 − x) =x x (1 − x)n−2−k = x2 .
k=2
n(n − 1) k k=0
k
n k
Figure 6.14: The polynomials pn,k (x) = x (1 − x)n−k when n = 2 and
k
n = 3, for all 0 ≤ k ≤ n.
n k
Figure 6.15: The polynomials pn,k (x) = x (1 − x)n−k when n = 4 and
k
n = 5, for all 0 ≤ k ≤ n.
n k
Figure 6.16: The polynomials pn,k (x) = x (1 − x)n−k when n = 6 and
k
n = 7, for all 0 ≤ k ≤ n.
Proof
We first consider the case where [a, b] = [0, 1]. Since f : [0, 1] → R is
continuous on a closed and bounded interval, it is uniformly continuous
and bounded. The boundeness of f implies that there is a positive number
M such that
|f (x)| ≤ M for all x ∈ [0, 1].
Given ε > 0, since f is uniformly continuous, there is a δ > 0 such that for
all x1 and x2 in [0, 1], if |x1 − x2 | < δ, then
ε
|f (x1 ) − f (x2 )| < .
2
For any positive integer n, we construct a polynomial pn (x) to be a
polynomial of degree at most n given by the following linear combination
of Bernstein basis polynomials.
Chapter 6. Sequences and Series of Functions 562
n n
X k X k n k
pn (x) = f pn,k (x) = f x (1 − x)n−k .
k=0
n k=0
n k
Let us estimate the supremum of |f (x) − pn (x)| on [0, 1]. For fixed x ∈
[0, 1], part (a) in Lemma 6.53 implies that
X
k n k
f (x) − pn (x) = f (x) − f x (1 − x)n−k .
k=0
n k
k
For 0 ≤ k ≤ n, if x − < δ, then
n
k ε
f (x) − f < .
n 2
k
If x − ≥ δ, then
n
2
k k 2M k
f (x) − f ≤ |f (x)| + f ≤ 2M ≤ 2 x − .
n n δ n
Therefore,
n
" 2 #
X ε 2M k n k
|f (x) − pn (x)| < + 2 x− x (1 − x)n−k .
k=0
2 δ n k
we find that
ε 2M x(1 − x) ε M
|f (x) − pn (x)| < + 2 ≤ + 2 .
2 δ n 2 2δ n
M M ε
If n ≥ 2
, then 2 ≤ . For any such n, we find that
εδ 2δ n 2
|f (x) − pn (x)| < ε for all 0 ≤ x ≤ 1.
Let
−1 x−a
p(x) = q(u (x)) = q .
b−a
Then p(x) is also a polynomial, and p(u(t)) = q(t). Therefore,
This completes the proof of the Weierstrass approximation theorem for the
general case.
One cannot extend the Weierstrass approximation theorem to the case where
f : I → R is a continuous function defined on an unbounded interval I. This is
because a non-constant polynomial would approach ∞ or −∞ when x approaches
Chapter 6. Sequences and Series of Functions 564
Remark 6.5
In probability theory, a binomial random variable X with parameters n and
p counts the number of successes in n independent and identical Bernoulli
trials, each has a probability p ∈ (0, 1) of being a success. X can take
integer values between 0 and n. The probability that X = k is
n k
P (X = k) = p (1 − p)n−k , 0 ≤ k ≤ n.
k
which reflects that the total probability is 1. The identity in part (b) gives
n
X n k
E(X) = k p (1 − p)k = np,
k=0
k
The identity in part (d) of Lemma 6.53 is just another way of computing
the variance. Using part (d), we have
n
X n k
2
Var (X) = (k − np) p (1 − p)k
k=0
k
n 2
2
X k n k
=n −p p (1 − p)k = np(1 − p).
k=0
n k
References 566
References
[Ros18] Kenneth Rosen, Discrete mathematics and its applications, eighth ed.,
Mc Graw Hill, 2018.
[SCW20] James Stewart, Daniel K. Clegg, and Saleem Watson, Calculus, ninth
ed., Cengage Learning, 2020.
[Tao14] Terence Tao, Analysis. II, third ed., Texts and Readings in
Mathematics, vol. 38, Hindustan Book Agency, New Delhi, 2014. MR
3310023