Complex Analysis
Complex Analysis
Dan Romik
About this document. These notes were created for use as primary reading
material for the graduate course Math 205A: Complex Analysis at UC Davis.
The current 2020 revision (dated June 6, 2020) updates my earlier version
of the notes from 2018. With some exceptions, the exposition follows the
textbook Complex Analysis by E. M. Stein and R. Shakarchi (Princeton Uni-
versity Press, 2003).
Cover figure: a heat map plot of the entire function z 7→ z(z − 1)π −z/2 Γ(z/2)ζ(z).
Created with Mathematica using code by Simon Woods, available at
http://mathematica.stackexchange.com/questions/7275/how-can-i-generate-this-domain-coloring-plot
Contents
1 Introduction: why study complex analysis? 1
3 Analyticity 7
4 Power series 13
5 Contour integrals 16
6 Cauchy’s theorem 21
Problems 92
References 121
Index 122
1 1 INTRODUCTION: WHY STUDY COMPLEX ANALYSIS?
z1 = 2 cos(2π/9) − 2,
z2 = 2 cos(8π/9) − 2,
z3 = 2 sin(π/18) − 2.
√
• Proving Stirling’s formula: n! ∼ 2πn(n/e)n . Here, an ∼ bn is the stan-
dard “asymptotic to” relation, defined to mean limn→∞ an /bn = 1.
• Proving the prime number theorem: π(n) ∼ logn n , where π(n) denotes
the number of primes less than or equal to n (the prime-counting func-
tion).
2 1 INTRODUCTION: WHY STUDY COMPLEX ANALYSIS?
• It was proved in 2016 that the optimal densities for sphere packing in
8 and 24 dimensions are π 4 /384 and π 12 /12!, respectively. The proofs
make spectacular use of complex analysis (and more specifically, a
part of complex analysis that studies certain special functions known
as modular forms).
where z0 = arg min |p(z)|, and the minimum exists because p(z) is a continu-
|z|≤R
ous function on the disc DR (0).
Denote w0 = p(z0 ), so that m0 = |w0 |. We now claim that m0 = 0. As-
sume by contradiction that it doesn’t, and examine the local behavior of p(z)
around z0 ; more precisely, expanding p(z) in powers of z − z0 we can write
n
X
p(z) = w0 + cj (z − z0 )j = w0 + ck (z − z0 )k + . . . + cn (z − z0 )n ,
j=1
5 2 THE FUNDAMENTAL THEOREM OF ALGEBRA
When r is very small, the power r k dominates the other terms r j with k < j ≤
n, i.e.,
Exercise 1. Complete the last details of the proof (for which r are the in-
equalities valid, and why?) Note that “complex analysis” is part of “analysis”
— you need to develop facility with such estimates until they become second
nature.
f (z + h) − f (z)
f 0 (z) := lim
h→0 h
exists. In this case we call f 0 (z) the derivative of f at z .
In the case when f 0 (z) 6= 0, the existence of the derivative has a geometric
meaning: if we write the polar decomposition f 0 (z) = reiθ of the derivative,
then for points w that are close to z , we will have the approximate equality
f (w) − f (z)
≈ f 0 (z) = reiθ ,
w−z
or equivalently
where “lower order terms” refers to a quantity that is much smaller in mag-
nitude that |w − z|. Geometrically, this means that to compute f (w), we start
from f (z), and move by a vector that results by taking the displacement vec-
tor w − z , rotating it by an angle of θ , and then scaling it by a factor of r
(which corresponds to a magnification if r > 1, a shrinking if 0 < r < 1, or
doing nothing if r = 1). This idea can be summarized by the slogan:
f (z + h) − f (z)
f 0 (z) = lim
h→0 h
u(x + h + iy) − u(x + iy) v(x + h + iy) − v(x + iy)
= lim +i
h→0, h∈R h h
∂u ∂v
= +i .
∂x ∂x
On the other hand also
f (z + h) − f (z)
f 0 (z) = lim
h→0 h
u(x + h + iy) − u(x + iy) v(x + h + iy) − v(x + iy)
= lim +i
h→0, h∈iR h h
u(x + iy + ih) − u(x + iy) v(x + iy + ih) − v(x + iy)
= lim +i
h→0, h∈R ih ih
∂u ∂v ∂v ∂u
= −i −i·i = −i .
∂y ∂y ∂y ∂y
Since these limits are equal, by equating their real and imaginary parts we
get a famous system of coupled partial differential equations, the Cauchy-
Riemann equations:
∂u ∂v ∂v ∂u
= , =− .
∂x ∂y ∂x ∂y
We have proved that if f is analytic at z = x + iy then the components
u, v of f satisfy the Cauchy-Riemann equations. Conversely, we now claim
if f = u + iv is continuously differentiable at z = x + iy (in the sense that
each of u and v is a continuously differentiable function of x, y as defined in
ordinary real analysis) and satisfies the Cauchy-Riemann equations there, f
is analytic at z .
10 3 ANALYTICITY
Proof. The assumption implies that f has a differential at z , i.e., in the no-
tation of vector calculus, denoting f = (u, v), z = (x, y)> , ∆z = (h1 , h2 )> , we
have !
∂u ∂u
u(z) ∂x ∂y h1
f (z + ∆z) = + ∂v ∂v + E(h1 , h2 ),
v(z) ∂x ∂y
h2
where E(h1 , h2 ) = o(|∆z|) as |∆z| → 0. Now, by the assumption that the
Cauchy-Riemann equations hold, we also have
∂u ∂u
! !
∂u
∂x ∂y h1 h + ∂u
∂x 1
h
∂y 2
∂v ∂v = ∂u ∂u ,
∂x ∂y
h2 − ∂y h1 + ∂x h2
which is the vector calculus notation for the complex number
∂u ∂u ∂u ∂u
−i (h1 + ih2 ) = −i ∆z.
∂x ∂y ∂x ∂y
So, we have shown that (again, in complex analysis notation)
f (z + ∆z) − f (z) ∂u ∂u E(∆z) ∂u ∂u
lim = lim −i + = −i .
∆z→0 ∆z ∆z→0 ∂x ∂y ∆z ∂x ∂y
This proves that f is holomorphic at z with derivative given by f 0 (z) = ∂u
∂x
−
i ∂u
∂y
.
With the help of the Cauchy-Riemann equations, we can now prove our
earlier claim that conformality implies analyticity.
Theorem 3. If f = u + iv is conformal at z , continuously differentiable in the
real analysis sense, and satisfies det Jf > 0 (i.e., f preserves orientation as a
planar map), then f is holomorphic at z .
a b
Lemma 2 (Conformality lemma.). Assume that A = is a 2 × 2 real
c d
matrix. The following are equivalent:
(a) A preserves orientation (that is, det A > 0) and is conformal, that is
for all w1 , w2 ∈ R2 .
a b
(b) A takes the form A = for some a, b ∈ R with a2 + b2 > 0.
−b a
cos θ − sin θ
(c) A takes the form A = r for some r > 0 and θ ∈ R. (That
sin θ cos θ
is, geometrically A acts by a rotation followed by a scaling.)
Proof that (a) =⇒ (b). Note that both columns of A are nonzero vectors by
the assumption that det A > 0. Now applying the conformality assumption
with w1 = (1, 0)> , w2 = (0, 1)> yields that (a, c) ⊥ (b, d), so that (b, d) = κ(−c, a)
for some κ ∈ R \ {0}. On the other hand, applying the conformality assump-
tion with w1 = (1, 1)> and w2 = (1, −1)> yields that (a + b, c + d) ⊥ (a − b, c − d),
which is easily seen to be equivalent to a2 + c2 = b2 + d2 . Together with the
previous
relation
implies that κ = ±1. So A is of one of the two forms
that
a −c a c
or . Finally, the assumption that det A > 0 means it is the
c a c −a
first of those two possibilities that must occur.
Exercise 3. Complete the proof of the lemma above by showing the implica-
tions (b) ⇐⇒ (c) and that (b) =⇒ (a).
4 4 4
2 2 2
-4 -2 2 4 -4 -2 2 4 -4 -2 2 4
-2 -2 -2
-4 -4 -4
Figure 3: The level curves for the (a) real and (b) imaginary parts of z 2 =
(x2 − y 2 ) + i(2xy). (c) shows the superposition of both families of level curves.
2
1.0 1.0
1
0.5 0.5
0.0 0 0.0
-0.5 -1 -0.5
-1.0 -2 -1.0
-1.0 -0.5 0.0 0.5 1.0 -2 -1 0 1 2 -1.0 -0.5 0.0 0.5 1.0
Figure 4: The level curves for the real and imaginary parts of z −1 = x
x2 +y 2
−
y
i x2 +y 2.
4u = 0,
∂ ∂ 2 2
where 4 = ∂x 2 + ∂y 2 is the two-dimensional Laplacian operator. Similarly
That is, we have shown that u and v are harmonic functions. This is an
extremely important connection between complex analysis and the theory of
partial differential equations, which also relates to many other areas of real
analysis.
We will later see that the assumption of twice continuous differentiability
is unnecessary, but proving this requires some subtle complex-analytic ideas.
A final remark related to analyticity and the Cauchy-Riemann equation is
the observation that if f = u + iv is analytic then its Jacobian (in the sense of
multivariate calculus when we consider it as a map from R2 to R2 ) is given
by
ux uy
Jf = det = ux vy − uy vx = u2x + vx2 = |ux + ivx | = |f 0 (z)|2 .
vx vy
This can also be understood geometrically. (Exercise: how?)
4 Power series
Until now we have not discussed any specific examples of functions of a com-
plex variable. Of course, there are the standard functions that you probably
encountered already in your undergraduate studies: polynomials, rational
functions, ez , the trigonometric functions, etc. But aside from these exam-
ples, it would be useful to have a general way to construct a large family
of functions. Of course, there is such a way: power series, which—non-
obviously—turn out to be essentially as general a family of functions as one
could hope for.
To make things precise, a power series is a function of a complex variable
z that is defined by
∞
X
f (z) = an z n
n=0
where (an )∞
n=0 is a sequence of complex numbers, or more generally by
∞
X
g(z) = f (z − z0 ) = an (z − z0 )n
n=0
where (an )∞
n=0 is again a a sequence and z0 is some fixed complex number.
These functions are defined wherever the respective series converge.
For which values of z does this formula make sense? It is not hard to see
that it converges absolutely precisely for 0 ≤ |z| < R, where the value of R is
14 4 POWER SERIES
given by
−1
1/n
R = lim sup |an | .
n→∞
R is called the radius of convergence of the power series.
Proof. Assume 0 < R < ∞ (the edge cases R = 0 and R = ∞ are left as
an exercise). The defining property of R is that for all > 0, we have that
n
|an | < R1 + if n is large enough, and R is the minimal number with that
1
property. Let z ∈ DR (0). Since |z| < R, we have |z| R + < 1 for some
fixed > 0 chosen small enough. That implies that for n > N (for some large
enough N as a function of ),
∞ ∞ n
X
n
X 1
|an z | < + |z| ,
n=N n=N
R
so the series is dominated by a convergent geometric series, and hence con-
verges.
Conversely, if |z| > R, then, |z| R1 − > 1 for some small enough fixed
nk
> 0. Taking a subsequence (ank )∞ 1
k=1 for which |ank | > R − (guaranteed
to exist by the definition of R), we see that
∞ ∞ nk
X
n
X 1
|an z | ≥ |z| − = ∞,
n=0 k=1
R
so the power series diverges.
Exercise 4. Complete the argument in the extreme cases R = 0, ∞.
Another important theorem is:
Theorem 4. Power series are holomorphic functions in the interior of the
disc of convergence and can be differentiated termwise.
Proof. Denote
∞
X
f (z) = an z n = SN (z) + EN (z),
n=0
XN
SN (z) = an z n ,
n=0
X∞
EN (z) = an z n ,
n=N +1
∞
X
g(z) = nan z n−1 .
n=1
15 4 POWER SERIES
The claim is that f is differentiable on the disc of convergence and its deriva-
tive is the power series g . Since n1/n → 1 as n → ∞, it is easy to see that f (z)
and g(z) have the same radius of convergence. Fix z0 with |z| < r < R. We
f (z0 +h)−f (z0 )
wish to show that h
converges to g(z0 ) as h → 0. Observe that
f (z0 + h) − f (z0 ) SN (z0 + h) − SN (z0 ) 0
− g(z0 ) = − SN (z0 )
h h
EN (z0 + h) − EN (z0 ) 0
+ + (SN (z0 ) − g(z0 ))
h
The first term converges to 0 as h → 0 for any fixed N . To bound the second
term, fix some > 0, and note that, if we assume that not only |z0 | < r but
also |z0 + h| < r (an assumption that’s clearly satisfied for h close enough to
0) then
∞
(z0 + h)n − z0n
EN (z0 + h) − EN (z0 ) X
≤ |an |
h
n=N +1
h
∞
P
h n−1 hk (z + h)n−1−k
0
X
k=0
= |an |
h
n=N +1
∞
X
≤ |an |nrn−1 ,
n=N +1
The proof above can be thought of as a special case of the following more
conceptual result: if gn is a sequence of holomorphic functions on a region Ω,
16 5 CONTOUR INTEGRALS
P∞
Corollary 2. For a power series g(z) = n=0 an (z −z0 )n with a positive radius
of convergence, we have
g (n) (z0 )
an = .
n!
In other words g(z) satisfies Taylor’s formula
∞
X g (n) (z0 )
g(z) = (z − z0 )n .
n=0
n!
5 Contour integrals
We now introduce contour integrals, which are another fundamental building
block of the theory.
Contour integrals, like many other types of integrals, take as input a func-
tion to be integrated and a “thing” (or “place”) over which the function is
integrated. In the case of contour integrals, the “thing” is a contour, which
is (for our current purposes at least) a kind of planar curve. We start by de-
veloping some terminology to discuss such objects. First, there is the notion
of a parametrized curve, which is simply a continuous function γ : [a, b] → C.
The value γ(a) is called the starting point and γ(b) is called the ending point.
Two curves γ1 : [a, b] → C, γ2 : [c, d] → C are called equivalent, which is
denoted γ1 ∼ γ2 , if γ2 (t) = γ1 (I(t)) where I : [c, d] → [a, b] is a continuous,
one-to-one, onto, increasing function. A “curve” is an equivalence class of
parametrized curves with respect to this equivalence relation.
In practice, we will usually refer to parametrized curves as “curves”,
which is the usual abuse of terminology that one sees in various places in
mathematics, in which one blurs the distinction between equivalence classes
and their members, remembering that various definitions, notation, and proof
arguments need to “respect the equivalence” in the sense that they do not
depend of the choice of member. (Meta-exercise: think of 2–3 other examples
of this phenomenon.)
17 5 CONTOUR INTEGRALS
where the limit is a limit of Riemann sums with respect to a family of parti-
tions of the interval [a, b] over which the curve γ is defined, as the norm of the
partitions shrinks to 0. Here the partition points are a = t0 < t1 < . . . < tn = b,
the points zj = f (tj ) are their images on the curve γ , and the symbols ∆sj
refer to finite line elements, namely ∆sj = |zj − zj−1 |.
The line integral of the second kind is defined for a vector field F = (P, Q)
(the more traditional notation from calculus for what we would denote in the
current context as the complex-valued function F = P + iQ) by
Z Z n
X
F · ds = P dx + Q dy = lim P (zj )∆xj + Q(zj )∆yj ,
γ γ max ∆sj →0
j j=1
A special case of an arc length integral is the length of the curve, defined
as the integral of the constant function 1:
Z Z b
len(γ) = |dz| = |γ 0 (t)| dt.
γ a
= f (z) dz.
γ1
19 5 CONTOUR INTEGRALS
Exercise 5. Show that the integral with respect to arc length similarly does
not depend on the parametrization.
Contour integrals have many surprising properties, but the ones on the
following list of basic properties are not of the surprising kind:
Exercise 6. Prove this result (part of the exercise is to define precisely the
notions of “composition of curves” and “reverse curve”).
Proof. For smooth curves, an easy application of the chain rule gives
Z Z b Z b
0 0 0
F (z) dz = F (γ(t))γ (t) dt = (F ◦ γ)0 (t) dt = (F ◦ γ)(t)|t=b
t=a
γ a a
= F (γ(b)) − F (γ(a)) = F (w2 ) − F (w1 ).
For piecewise smooth curves, this is a trivial extension that is left as an
exercise.
Proof. Fix some z0 ∈ Ω. For any z ∈ Ω, there is some path γ(z0 , z) connecting
z0 and z (since Ω is connected and open, hence pathwise-connected — a
standard exercise in topology, see the exercises in Chapter 1 of [11]). Define
Z
F (z) = f (w) dw.
γ(z0 ,z)
By the assumption, this integral does not depend on which contour γ(z0 , z)
connecting z0 and z was chosen, so F (z) is well-defined. We now claim that
F is holomorphic and its derivative is equal to f . To see this, note that
F (z + h) − F (z)
− f (z)
h Z Z
1
= f (w) dw − f (w) dw − f (z)
h γ(z0 ,z+h) γ(z0 ,z)
Z Z
1 1
= f (w) dw − f (z) = (f (w) − f (z)) dw
h γ(z,z+h) h γ(z,z+h)
21 6 CAUCHY’S THEOREM
by continuity of f .
Remark 3. Note that with the last result, if we knew that holomorphic func-
tions are differentiable infinitely many times (the so-called regularity theo-
rem), we could conclude that a function that satisfies the assumption that
all its contour integrals on closed contours were 0 is holomorphic. This is in
fact true, and is called Morera’s theorem (and is an important fact in com-
plex analysis), but we won’t be able to prove it until we’ve proved Cauchy’s
theorem.
H
Example 2. Compute |z|=1 z n dz for n ∈ Z. What do we learn from the fact
that the integral is not zero for n = −1? (Hint: something; but what?) And
what do we learn from the fact that it’s 0 when n 6= −1? (Hint: nothing; but
why?)
Lemma 3. If f is holomorphic on Ω and f 0 ≡ 0 then f is a constant.
6 Cauchy’s theorem
One of the central results in complex analysis is Cauchy’s theorem.
Theorem 6 (Cauchy’s theorem.). If f is holomorphic on a simply-connected
region Ω, then for any closed curve in Ω we have
I
f (z) dz = 0.
γ
22 6 CAUCHY’S THEOREM
The challenges facing us are: first, to prove Cauchy’s theorem for curves
and regions that are relatively simple (where we do not have to deal with
subtle topological considerations); second, to define what simply-connected
means; third, which will take a bit longer and we won’t do immediately, to
extend the theorem to the most general setting.
Two other theorems that are closely related to Cauchy’s theorem are
Goursat’s theorem, a relatively easy special case of Cauchy’s theorem, and
Morera’s theorem which is a kind of converse to Cauchy’s theorem.
Theorem 7 (Goursat’s theorem).H If f is holomorphic on a region Ω, and T
is a triangle contained in Ω, then ∂T f (z) dz = 0 (where T refers to the “full”
triangle, and ∂T refers to its boundary considered as a curve oriented in the
usual positive direction).
Theorem 8 (Morera’s theorem). If f : Ω → C is a continuous function on a
region Ω such that I
f (z) dz = 0
γ
holds for any closed contour in Ω, then f is holomorphic on Ω.
(n)
Here, the triangles Tj1 ,...,jn for jn = 1, 2, 3, 4 are obtained by subdividing the
(n−1)
order n − 1 triangle Tj1 ,...,jn−1 into 4 subtriangles whose vertices are the ver-
(n−1)
tices and/or edge bisectors of Tj1 ,...,jn−1 (see Figure 1 on page 35 of [11]).
Now, given the way this subdivision was done, it is clear that we have the
equality
I 4 I
X
f (z) dz = f (z) dz
(n−1) (n)
∂Tj jn =1 ∂Tj
1 ,...,jn−1 1 ,...,jn
That is, the integral along the boundary of the original triangle is equal to
the sum of the integrals over all 4n triangles of order n. Now, the crucial
observation is that one of these integrals has to have a modulus that is at
least as big as the average. That is, we have
I I
I 4
X
f (z) dz ≤ f (z) dz ≤ 4n f (z) dz
(n)
∂T (0) ∂Tj ,...,j ∂T (n)
j1 ,...,jn =1 1 n j(n)
(n) (n)
where j(n) = (j1 , . . . , jn ) is some n-tuple chosen such that the second in-
equality holds. Moreover, we can choose j(n) inductively in such a way that
(n) (n) (n−1)
the triangles Tj(n) are nested — that is, Tj(n) ⊂ Tj(n−1) for n ≥ 1, or equiva-
(n−1) (n−1)
lently j(n) = (j1 , k) for some 1 ≤ k ≤ 4 — to make this happen,
, . . . , jn−1
H
choose k to be such that ∂T (n) f (z) dz is greater than (or equal to) the
(j(n−1),k)
average I
4
1 X
f (z) dz ,
4
(n)
∂T
d=1 (j(n−1),d)
Now observe that the sequence of nested triangles shrinks to a single point.
That is, we have
∞
\ (n)
Tj(n) = {z0 }
n=0
24 6 CAUCHY’S THEOREM
for some point z0 ∈ T . This is true because the diameter of the triangles goes
to 0 as n → ∞, so certainly there can’t be two distinct points in the inter-
section; whereas, on the other hand, the intersection cannot be empty, since
the sequence (zn )∞
n=0 of centers (in some obvious sense, e.g., intersection of
the angle bisectors) of each of the triangles is easily seen to be a Cauchy se-
quence (and hence a convergent sequence, by the completeness property of
the complex numbers), whose limit must be an element of the intersection.
Having defined z0 , write f (z) for z near z0 as
where
f (z) − f (z0 )
ψ(z) = − f 0 (z0 ).
z − z0
The holomorphicity of f at z0 implies that ψ(z) → 0 as z → z0 . Denote by d(n)
(n)
the diameter of Tj(n) and by p(n) its perimeter. Each subdivision shrinks both
the diameter and perimeter by a factor of 2, so we have
It follows that
Z Z
0
(n) f (z) dz = (n) f (z0 ) + f (z0 )(z − z0 ) + ψ(z)(z − z0 ) dz
∂T ∂T
j(n)
Z j(n)
= ψ(z)(z − z0 ) dz ≤ p(n) d(n) sup |ψ(z)|
∂T (n) (n)
j(n) z∈T j(n)
H R
Finally, combining this with the relationship between ∂T (0)
f (z) dz and | ∂T (n) f (z) dz|,
j(n)
we get that Z
f (z) dz ≤ p(0) d(0) sup |ψ(z)| −−−→ 0,
∂T (0) (n) n→∞
z∈Tj(n)
Corollary
H 6 (Cauchy’s theorem for a disc.). If f is holomorphic on a disc,
then γ f dz = 0 for any closed contour γ in the disc.
Proof. f has a primitive, and we saw that that implies the claimed conse-
quence.
Theorem 9 (Cauchy’s
R theorem for a region enclosed by a “toy contour”).
The statement γ f (z)dz = 0 is also true for a function that’s analytic in a
region enclosed by a contour that is simple enough that the method of proof
used for the disc above can be extended to it.2
Proof. Repeat the same ideas, going from Goursat’s theorem, to the fact
that the function has a primitive, to the fact that its contour integrals along
closed curves vanish. The difficulty as the toy contour gets more complicated
is to make sure that the geometry works out when proving the existence of
the primitive — see for example the (incomplete) discussion of the case of
“keyhole contours” on pages 40–41 of [11].
2
The book [11] calls such a contour a “toy contour”, leaving the term as a somewhat
vaguely defined meta-concept.
26 7 CONSEQUENCES OF CAUCHY’S THEOREM
As δ → 0, the two parts of the integral along the “neck” of the contour
Γ,δ cancel out in the limit because Fz is continuous, and hence uniformly
continuous, on the compact set D \ D(z, ). So we can conclude that
I I
Fz (w) dw = Fz (w) dw.
C |w−z|=
The next, and final, step, is to take the limit as → 0 of the right-hand side
of this equation, after first decomposing Fz (w) as
f (w) − f (z) 1
Fz (w) = + f (z) · ,
w−z w−z
27 7 CONSEQUENCES OF CAUCHY’S THEOREM
z0
z
Figure 5: The keyhole contour used in the proof of Cauchy’s integral formula.
(by aHstandard calculation, see Example 2 above). Putting all together gives
1
that C 2πi Fz (w) dw = f (z), which was the formula to be proved.
Example 3. in the case when z is the center of the circle C = {w : |w − z| =
r}, Cauchy’s formula gives that
I Z 2π
1 dw 1
f (z) = f (w) = f (z + reit )dt.
2π |w−z|=r i(w − z) 2π 0
Theorem 11 (the mean value property for holomorphic functions). The value
of a holomorphic function f at z is equal to the average of its values around
a circle |w − z| = r (assuming it is holomorphic on an open set containing the
disc |w − z| ≤ r ).
28 7 CONSEQUENCES OF CAUCHY’S THEOREM
Considering what the mean value property means for the real and imag-
inary parts of f = u + iv , which are harmonic functions, we see that they in
turn also satisfy a similar mean value property:
Z 2π
1
u(x, y) = u(x + r cos t, y + r sin t) dt.
2π 0
This is in fact true for all harmonic functions — a fact, known as the mean
value property for harmonic functions, that can be proved separately us-
ing PDE/real analysis methods, or derived from the above considerations by
proving that every harmonic function in a disc is the real part of a holomor-
phic function.
Proof. The key observation is that the expression on the right-hand side of
Cauchy’s integral formula for f (z) (which is the case n = 0 of the “extended”
version) can be differentiated under the integral sign. To make this precise,
let n ≥ 1, and assume inductively that we proved
(n − 1)!
I
(n−1) f (w)
f (z) = dw.
2πi C (w − z)n
Then
(n − 1)!
I
f (n)
(z) = f (w)n(w − z)−n−1 dz,
2πi C
that is convergent for all z ∈ DR (z0 ), where (of course) an = f (n) (z0 )/n!.
but for large enough k this is impossible, since wk − z0 6= 0 for all k and
g(wk ) → g(z0 ) = 0 as k → ∞.
The conclusion is that f is identically zero at least in a neighborhood of
z0 . But now we claim that that also implies that f is identically zero on all of
Ω, because Ω is a region (open and connected). More precisely, denote by U
the set of points z ∈ Ω such that f is equal to 0 in a neighborhood of z . It is
obvious that U is open, hence also relatively open in Ω since Ω itself is open;
U is also closed, by the argument above; and U is nonempty (it contains z0 ,
again by what we showed above). It follows that U = Ω by the well-known
characterization of a connected topological space as a topological space that
has no “’clopen” (closed and open) sets other than the empty set and the
entire space.
An alternative way to finish the proof is the following. For every point
z ∈ Ω, let r(z) be the radius of convergence of the power series expansion
of f around z . Thus the discs {Dr(z) (z) : z ∈ Ω} form an open covering of
Ω. Take w ∈ Ω (with z0 being as above), and take a path γ : [a, b] → Ω con-
necting z0 and w (it exists because Ω is open and connected, hence pathwise-
connected). The open covering of Ω by discs is also an open covering of the
compact set γ[a, b] (the range of γ ). By the Heine-Borel property of compact
sets, it has a finite subcovering {Dr(zj ) (zj ) : j = 0, . . . , m} (where we take
w = zm+1 . The proof above shows that f is identically zero on Dr(z0 ) (z0 ), and
32 7 CONSEQUENCES OF CAUCHY’S THEOREM
also shows that if we know f is zero on Dr(zj ) (zj ) then we can conclude that
it is zero on the next disc Dr(zj+1 ) (zj+1 ). It follows that we can get all the way
to the last disc Dr(w) (w). In particular, f (w) = 0, as claimed.
Remark 5. The above result is also sometimes described under the heading
zeros of holomorphic functions are isolated, since it can be formulated
as the following statement: if f is holomorphic on Ω, is not identically zero on
Ω, and f (z0 ) = 0 for z0 ∈ Ω, then for some > 0, the punctured neighborhood
D (z0 ) \ {z0 } of z0 contains no zeros of f . In other words, the set of zeros of f
contains only isolated points.
Remark 6. The condition that the limit point z0 be in Ω is needed. Note that
it is possible to have a sequence zn → z0 of points in Ω such that f (zn ) = 0
for all n. For example, consider the function e1/z − 1 — it has zeros in every
neighborhood of z0 = 0.
Corollary 7. If f, g are holomorphic on a region Ω, and f (z) = g(z) for z in a
set with limit point in Ω (e.g., an open disc, or even a sequence of points zn
converging to some z ∈ Ω), then f ≡ g everywhere in Ω.
is satisfied for all z ∈ D \ {z0 }. Once we show this, we will set f˜(z0 ) to be
defined by the same integral representation, and it will be easy to see that
that gives the desired extension.
To prove that the representation above holds, consider a “double keyhole”
contour Γ,δ that surrounds most of circle C = ∂D but makes diversions to
avoid the points z0 and z , circling them in the negative direction around most
of a circle of radius . After applying a limiting argument similar to the one
used in the proof of Cauchy’s integral formula, we get that
I I I
1 f (w) 1 f (w) 1 f (w)
= + .
2πi C w−z 2πi C (z) w − z 2πi C (z0 ) w−z
On the right-hand side, the first term is f (z) by Cauchy’s integral formula
(since f is known to be holomorphic on an open set containing D (z)). The
second term can be bounded in magnitude using the assumption that f is
bounded in a neighborhood of z0 ; more precisely, we have
I
f (w) 1
≤ 2π sup |f (w)| · −−→ 0.
C (z0 ) w−z
w∈C (z0 ) |z − z0 | − →0
34 7 CONSEQUENCES OF CAUCHY’S THEOREM
where m is the smallest index ≥ 0 such that am 6= 0. This gives the desired
representation. On the other hand, given a representation of this form, ex-
panding g(z) as a power series shows that m has to be the smallest index of
a nonzero coefficient in the power series expansion of f (z), which proves the
uniqueness claim.
Remark 8. As with the case of zeros, one can extend this definition in an
obvious way to define a notion of a “pole of order 0”. If f (z) is actually
holomorphic and nonzero at z0 (or has a removable singularity at z0 and can
be made holomorphic and nonzero by defining its value at z0 appropriately),
we define the order of the pole as 0 and consider f to have a pole of order 0
at z0 .
Exercise 7. The definitions of the order of a zero and a pole can be con-
sistently unified into a single definition of the (generalized) order of a zero,
where if f has a pole of order m at z0 then we say instead that f has a zero
of order −m. Denote the order of a zero of f at z0 by ordz0 (f ). With these
definitions, prove that
(can you give a useful condition when equality holds?), and that
for f into its principal part and a remaining holomorphic part. Integrating
termwise gives 0 for the integral of G(z), by Cauchy’s theorem; 0 for the
integral powers (z − z0 )k with −m ≤ k ≤ −2, by a standard computation;
and 2πia−1 = 2πi Resz0 (f ) for the integral of r(z − z0 )−1 , by the same standard
computation. This gives the result.
Proof. The idea is the same, except one now uses a contour with multiple
keyholes to deduce after a limiting argument that
I N I
X
f (z) dz = f (z) dz
∂D k=1 C (zk )
Theorem 23 (The residue theorem (version for general toy contours).). As-
sume that f is holomorphic in a region containing a toy contour γ (oriented
in the positive direction) and the region Rγ enclosed by it, except for poles
at the points z1 , . . . , zN ∈ Rγ . Then
I N
X
f (z) dz = 2πi Reszk (f ).
γ k=1
• Topologically, we think of C
b as the one-point compactification of C; that
is, we add to C an additional element ∞ (called “the point at infinity”)
and say that the neighborhoods of ∞ are the complements of compact
sets in C. This turns C
b into a topological space in a simple way.
See page 88 in [11] for a more detailed explanation. One can check that
this geometric identification is a homeomorphism between S 2 (equipped
with the obvious topology inherited from R3 ) and C
b (with the one-point
compactification topology defined above).
• poles;
Proof. Assume that the closure f (Dr (z0 ) \ {z0 }) does not contain a point w ∈
1
C. Then g(z) = f (z)−w is a function that’s holomorphic and bounded in Dr (z0 )\
{z0 }. By Riemann’s removable singularity theorem, its singularity at z0 is
removable, so we can assume it is holomorphic at z0 after defining its value
there. It then follows that
1
f (z) = w +
g(z)
has either a pole or a removable singularity at z0 , depending on whether
g(z0 ) = 0 or not.
41 10 THE ARGUMENT PRINCIPLE
Remark 9. By similar reasoning, the theorem also holds when the circle is
replaced by a toy contour γ .
The proof above hides, as some slick mathematical proofs have a way of
doing, the fact that the formula (1) has a fairly simple intuitive explanation.
Start by noting that the integral in the argument principle can be repre-
sented as
Z b 0 Z b
f 0 (z) f (γ(t))γ 0 (t) (f ◦ γ)0 (t)
I
1 1 1
dz = dt = dt
2πi γ f (z) 2πi a f (γ(t) 2πi a (f ◦ γ)(t)
Z
1 1
= dw,
2πi f ◦γ w
that is, an integral of dw/w over the contour f ◦ γ — the image of γ under f .
Now note that the differential form dw/w has a special geometric meaning
in complex analysis, namely we have
dw
= “d (log w) ” = “d (log |w| + i arg w) ”.
w
We put these expressions in quotes since the logarithm and argument are
not single-valued functions so it needs to be explained what such formulas
mean. However, at least log |w| is well-defined for a curve that does not cross
0, so when integrating over the closed curve f ◦ γ , the real part is zero by
the fundamental theorem of calculus. The imaginary part (which becomes
real after dividing by 2πi) can be interpreted intuitively as the change in the
argument over the curve — that is, initially at time t = a one fixes a specific
value of arg w = arg γ(a); then as t increases from t = a to t = b, one tracks the
increase or decrease in the argument as one travels along the curve γ(t); if
this is done correctly (i.e., in a continuous fashion), at the end the argument
must have a well-defined value. Since the curve is closed, the total change in
the argument must be an integer multiple of 2π , so the division by 2πi turns
it into an integer.
Of course, this explanation also explains the name “the argument princi-
ple,” which may sound arbitrary and uninformative when you first hear it.
Connection to winding numbers. What the above reasoning shows is
that in general, an integral of the form
I
1 f (w)
dw
2πi γ w
over a closed curve γ that does not cross 0 carries the meaning of “the total
number of times the curve γ goes around the origin,” with the number being
43 10 THE ARGUMENT PRINCIPLE
positive if the curve goes in the positive direction around the origin; negative
if the curve goes in the negative direction around the origin; or zero if there
is no net change in the argument. This number is more properly called the
winding number of f around w = 0 (also sometimes referred to as the
index of the curve around 0), and denoted
I
1 f (z)
Ind0 (f ) = dz.
2πi γ z
More generally, one can define the winding number at z = z0 as the number
of times a curve γ winds around an arbitrary point z0 , which (it is easy to
see) will be given by I
1 f (z)
Indz0 (f ) = dz,
2πi γ z − z0
assuming that γ does not cross z0 .
Note that winding number is a topological concept of planar geometry
that can be considered and studied without any reference to complex anal-
ysis; indeed, in my opinion that is the correct approach. It is possible, and
not especially difficult, to define it in purely topological terms without men-
tioning contour integrals, and then show that the complex analytic and topo-
logical definitions coincide. Try to think what such a definition might look
like.
Proof. Define ft (z) = f (z) + tg(z) for t ∈ [0, 1], and note that f0 = f and
f1 = f + g , and that the condition |f (z)| > |g(z)| on γ implies that ft has no
zeros on γ for any t ∈ [0, 1]. Denote
ft0 (z)
I
1
nt = dz,
2πi γ ft (z)
To prove continuity of nt , note that the function g(t, z) = ft0 (z)/ft (z) is
continuous, hence also uniformly continuous, on the compact set [0, 1] × γ .
For s, t ∈ [0, 1] satisfying |t − s| < δ , we can write
I
1
|nt − ns | ≤ |g(t, z) − g(s, z)| · |dz|
2πi γ
1
≤ len(γ) sup{|g(u, z) − g(v, z)| : z ∈ γ, u, v ∈ [0, 1], |u − v| < δ}.
2πi
Given > 0, we can choose δ that ensures that this expression is < if
|t − s| < δ , by the uniform continuity. This is precisely what is needed to
show that t 7→ nt is continuous.
As with the argument principle, Rouché’s theorem also has a rather amus-
ing intuitive explanation, which I learned from the book Visual Complex Anal-
ysis by Tristan Needham. The slogan to remember is “walking the dog”.
Imagine that you are walking in a large empty park containing at some “ori-
gin” point 0 a large pole (in the English sense of a metal post sticking out
of the ground, not the complex analysis sense). You start at some point X
and go for a walk along some curve, ending back at the same starting point
X . Let N denote your winding number around the pole at the origin — that
is, the total number of times you went around the pole, with the appropriate
sign.
Now imagine that you also have a dog that is walking alongside you in
some erratic path that is sometimes close to you, sometimes less close. As
you traverse your curve C1 , the dog walks along on its own curve C2 , which
also begins and ends in the same place. Let M denote the dog’s winding
number around the pole at the origin. Can we say that N = M ? The answer
is: yes, we can, provided that we know the dog’s distance to you was always
less than your distance to the pole. To see this, imagine that you had the dog
on a leash of variable length; if the distance condition was not satisfied, it
would be possible for the dog to reach the pole and go in a short tour around
it while you were still far away and not turning around the pole, causing an
entanglement of the leash with the pole.
Amazingly, the above scenario maps in a precise way to Rouché’s theo-
rem, using the following dictionary: the curve f ◦ γ represents your path; the
curve (f +g)◦γ represents the dog’s path; g ◦γ represents the vector pointing
from you to the dog; the condition |f | > |g| along γ is precisely the correct
condition that the dog stays closer to you than your distance to the pole; and
the conclusion that the two winding numbers are the same is precisely the
theorem’s assertion that f and f + g have the same number of generalized
45 11 APPLICATIONS OF ROUCHÉ’S THEOREM
The idea is now to apply Rouché’s theorem to F (z) and G(z). Fix > 0
small enough so that the disc D (z0 ) is contained in Ω and does not contain
solutions of the equation f (z) = w0 other than z0 (this is possible, by the
property that zeros of holomorphic functions are isolated). Defining
(or equivalently f (z) = w) has the same number in solutions (in particular,
at least one solution) as the equation f (z) = w0 in the disc D (z0 ). This was
precisely the claim to be proved.
i) F is continuous.
Proof. This proof is based on the idea of translating the global statement
about the equality of the two contour integrals into a local statement, simi-
larly to the proof of Goursat’s theorem but in a more general setting. (See
also pages 93–95 in [11] for a variant of the proof presented below.)
Denote by F : [0, 1] × [0, 1] → Ω the homotopy between γ0 and γ1 , and for
any s ∈ [0, 1] denote by γs : [0, 1] → C the curve γs (t) = F (s, t). The strategy
of the proof is to show that there are values 0 = s0 < s1 < s2 < . . . < sn = 1
such that
Z Z Z Z
f (z) dz = f (z) dz = . . . = f (z) dz = f (z) dz.
γs0 γs1 γsn−1 γsn
That is, we will show that a slight perturbation of the s parameter does not
change the value of the integral. To this end, for two fixed values 0 ≤ s <
s0 ≤ 1 that are close to each other (in a sense we will make precise shortly),
we break up the t-interval [0, 1] over which the curves s, s0 are defined into
very small segments by fixing points 0 = t0 < t1 < . . . < tm = 1 that are very
close together (in a sense that, again, we will need to make precise below
once we understand what is needed to make the argument work), and then
write
Z m Z
X Z m Z
X
f (z) dz = f (z) dz, f (z) dz = f (z) dz.
γs j=1 γs ([tj−1 ,tj ]) γs0 j=1 γs0 ([tj−1 ,tj ])
We will now show that these two integrals are equal by exploiting our knowl-
edge of local properties of f that follow from its analyticity. Specifically,
assume that for each 1 ≤ j ≤ m we know that there exists an open disk Dj
48 12 SIMPLY-CONNECTED REGIONS AND CAUCHY’S THEOREM
containing the convex hull of the union of the two curve segments γs ([tj−1 , tj ])
and γs0 ([tj−1 , tj ]). For each 0 ≤ j ≤ m, let ηj denote a straight line segment
(considered as a parametrized curve) from γs (tj ) to γs0 (tj ), and for each 1 ≤
j ≤ m let Γj denote the closed curve γs ([tj−1 , tj ]) + ηj − γs0 ([tj−1 , tj ]) − ηj−1 (the
concatenation of the four curves γs ([tj−1 , tj ]), “the reverse of ηj ”, γs0 ([tj−1 , tj ]),
and “the reverse of ηj−1 ”). By Cauchy’s theorem on a disc, we have
I
f (z)dz = 0,
Γj
since in the next-to-last step the sum is telescoping, and in the last step
we note that η0 and ηm are both degenerate curves each of which simply
stays at a single point (γs (0) = F (s, 0) = γs0 (0), and γs (1) = F (s, 1) = γs0 (1),
respectively). This is precisely the equality we wanted.
We still need to justify the assumption about the discs Dj . This can be
made to work if s and s0 are sufficiently close to each other and the points
0 < t0 < t1 < . . . < tm = 1 are sufficiently densely spaced, using an argument
involving continuity and compactness. Here is one way to make the argu-
ment: fix 0 ≤ s ≤ 1. At each 0 ≤ t ≤ 1, by continuity of the homotopy function
F there exists a number δ > 0 and a disc Ds,t centered at γs (t) = F (s, t) such
that for any s0 , t0 ∈ [0, 1] with |s0 − s| < , |t0 − t| < δ , we have γs0 (t0 ) ∈ Ds,t . The
family of discs Ds,t for 0 ≤ t ≤ 1 are an open cover of the curve γs (or more
precisely of its image γs ([0, 1]), which is a compact set), so by the Heine-Borel
property we will have a finite subcovering Ds,t0 , . . . , Ds,tm . This is enough to
prove the assumption for s0 sufficiently close to s.
Finally, for each s denote by δ(s) the value of δ chosen above as a function
of s. The collection of open intervals {(s − δ(s), s + δ(s)) : s ∈ [0, 1]} form an
49 12 SIMPLY-CONNECTED REGIONS AND CAUCHY’S THEOREM
open covering of the interval [0, 1], so again using the Heine-Borel property,
we can extract a finite subcovering. This enables us to find a sequence 0 =
s0 < s1 < . . . < sn = 1 that we claimed exist at the beginning of the proof,
namely where the relation
Z Z
f (z) dz = f (z) dz
γsj−1 γsj
holds for each j = 1, . . . , n (with sj−1 playing the role of s and sj playing the
role of s0 in the discussion above).
Proof. Assume for simplicity that γ is parametrized as a curve on [0, 1]. Then
it can be thought of as the concatenation of two curves γ1 and −γ2 , where
γ1 = γ|[0,1/2] and γ2 is the “reverse” of the curve γ|[1/2,1] . Note that γ1 and γ2
have the same endpoints. By the invariance property of contour integrals
under homotopy proved above, we have
Z Z Z Z
f (z) dz = f (z) dz = f (z) dz − f (z) dz = 0.
γ γ1 −γ2 γ1 γ2
Exercise 10. The proof of Theorem 28 above still involves a minor amount of
what I call “dishonesty”; that is, the proof is not actually formally correct as
written but contains certain inconsistencies between what the assumptions
of the theorem are and what we end up actually using in the body of the
proof. Can you identify those inconsistencies? What additional work might
be needed to fix these problems? And why do you think the author of these
notes, and the authors of the textbook [11], chose to present things in this
way rather than treat the subject in a completely rigorous manner devoid of
any inaccuracies? (The last question is a very general one about mathemati-
cal pedagogy; coming up with a good answer might help to demystify for you
a lot of similar decisions that textbook authors and course instructors make
in the teaching of advanced material, and make the study of such topics a bit
less confusing in the future.)
50 13 THE LOGARITHM FUNCTION
on any region Ω that does not contain 0 and where one can make a consistent,
smoothly varying choice of arg z as z ranges over Ω. It is easy to see that this
formula gives an inverse to the exponential function.
For example, if
Ω = C \ (−∞, 0]
(the “slit complex plane” with the negative real axis removed), we can set
where Arg z is set to take values in (−π, π). This is called the principal branch
of the logarithm — basically a kind of standard version of the log function
that complex analysts have agreed to use whenever this is convenient (or
not too inconvenient). However, sometimes we may want to consider the
logarithm function on more strange or complicated regions. When can this
be made to work? The answer is: precisely when Ω is simply-connected.
i) F is holomorphic in Ω.
iii) F (r) = log r (the usual logarithm for real numbers) for all real numbers
r ∈ Ω sufficiently close to 1.
d
ze−F (z) = e−F (z) − zF 0 (z)e−F (z) = e−F (z) (1 − z/z) = 0,
dz
51 13 THE LOGARITHM FUNCTION
Exercise 11. Prove that the principal branch of the logarithm has the Taylor
series expansion
∞
X (−1)n−1
Log z = (z − 1)n (|z| < 1).
n=1
n
Exercise 12. Modify the proof above to prove the existence of a branch of
the logarithm function in any simply-connected region Ω not containing 0,
without the assumption that 1 ∈ Ω. In what way is the conclusion weakened
in that case?
Exercise 13. Explain in what sense the logarithm functions F (z) = logΩ (z)
satisfying the properties proved in the theorem above (and its generalization
described in the previous exercise) are unique.
eg(z) = f (z).
Note that if f (z) = z 1/n is one choice of an nth root function, then for any 0 ≤
k ≤ n − 1, the function g(z) = e2πik/n f (z) will be another function satisfying
g(z)n = z . Conversely, it is easy to see that those are precisely the possible
choices for an nth root function.
52 14 THE EULER GAMMA FUNCTION
4. Integral representation:
Z ∞
Γ(s) = e−x xs−1 dx (Re s > 0).
0
1
.
where γ = limn→∞ 1 + 2
+ 13 + . . . + 1
n
− log n = 0.577215 is the Euler-
Mascheroni constant.
n! ns
Γ(s) = lim (s ∈ C).
n→∞ s(s + 1) · · · (s + n)
8. Zeros: the gamma function has no zeros (so Γ(s)−1 is an entire func-
tion).
9. Poles: the gamma function has poles precisely at the non-positive inte-
gers s = 0, −1, −2, . . ., and is holomorphic everywhere else. The pole at
s = −n is a simple pole with residue
(−1)n
Ress=−n (Γ) = (n = 0, 1, 2, . . .).
n!
Functional equation:
I leave it as an exercise to check (or read the easy explanation in [11]) that
the function it defines is holomorphic in that region.
Next, perform an integration by parts, to get that, again for Re(s) > 0, we
have
Z ∞ Z ∞
−x s x=∞
−e−x xs x=0 e−x sxs−1 dx = s Γ(s),
Γ(s + 1) = e x dx = +
0 0
Γ(s + 1)
Γ1 (s) =
,
s
which is a function that is holomorphic on Re(s) > −1, s 6= 0 and coincides
with γ(s) for Re(s) > 0. By the principle of analytic continuation this provides
a unique extension of Γ(s) to the region Re(s) > −1. Because of the factor
1/s and the fact that Γ(1) = 1 we also see that Γ1 (s) has a simple pole at s = 0
with residue 1.
Next, for Re(s) > −2 we define
Γ1 (s + 1) Γ(s + 2)
Γ2 (s) = = ,
s s(s + 1)
a function that is holomorphic on Re(s) > −2, s 6= 0, −1, and coincides with
Γ1 (s) for Re(s) > −1, s 6= 0. Again, this provides an analytic continuation of
Γ(s) to that region. The factors 1/s(s + 1) show that Γ2 (s) has a simple pole
at s = −1 with residue −1.
Continuing by induction, having defined an analytic continuation Γn−1 (s)
of Γ(s) to the region Re(s) > −n + 1, s 6= 0, −1, −2, . . . , −n + 2, we now define
Γn−1 (s + 1) Γ(s + n)
Γn (s) = = ... = .
s s(s + 1) · · · (s + n − 1)
55 14 THE EULER GAMMA FUNCTION
and to note that the integral over [1, ∞) converges (and defines a holomor-
phic function of s) for all s ∈ C, and the integral over [0, 1] can be computed
by expanding e−x as a power series in x and integrating term by term. That
is, for Re(s) > 0 we have
1 ∞
1X ∞ 1
(−1)n n+s−1 X (−1)n
Z Z Z
−x s−1
e x dx = x dx = xn+s−1 dx
0 0 n=0
n! n=0
n! 0
∞ n
X (−1)
=
n=0
n!(n + s)
which is easy to verify directly. For the general claim, using a linear change
of variables and an integration by parts we see that
Z n Z 1
x n s−1
1− x dx = ns (1 − t)n ts−1 dt
0 n
0 s t=1 Z 1 s
s nt n−1 t
= n (1 − t) + n(1 − t) dt
s t=0 0 s
Z 1
s n
=n · (1 − t)n−1 t(s+1)−1 dt,
s 0
so the claim follows by induction on n.
n! ns
Γ(s) = lim .
n→∞ s(s + 1) · · · (s + n)
Proof of the infinite product representation for Γ(s). For Re(s) > 0 we have
s(s + 1) · · · (s + n)
Γ(s)−1 = lim
n→∞ n!ns
s s s
= s lim e−s log n 1 + 1+ ··· 1 +
n→∞ 1 2 n
n
Pn 1
Y s −s/k
= s lim es( k=1 k −log n) 1+ e
n→∞
k=1
k
∞
Y s −s/n
= seγs 1+ e .
n=1
n
We now check that the infinite product actually converges absolutely and
uniformly on compact subsets in all of C, so defines an entire function. Let’s
start with some preliminary elementary observations on infinite products.
Proof. Under the assumption, there exists some large enough N0 ≥ 1 such
that |an | < 1/2 for all n ≥ N0 , which in particular implies that 1 + an =
exp(Log(1 + an )), where Log(z) is the principal branch of the logarithm func-
tion. Now, by the assumption, clearly it is enough to prove the convergence
Q∞
of the infinite product n=N0 (1 + an ), and this can be written as
∞
Y n
Y n
Y
(1 + an ) = lim (1 + ak ) = lim exp (Log(1 + ak ))
n→∞ n→∞
n=N0 k=N0 k=N0
n
!
X
= lim exp Log(1 + ak ) .
n→∞
k=N0
P∞ Pn
If we knew that the infinite series n=N0 Log(1 + an ) = limn→∞ k=N0 Log(1 +
ak ) converged, we could continue the above chain of equalities as
n
! ∞
!
X X
= exp lim Log(1 + ak ) = exp Log(1 + an ) ,
n→∞
k=N0 n=N0
Q∞
and since ez is never 0, that would mean that the product n=N0 (1+an ) exists
and is non-zero (that is, it converges), hence as was mentioned above the
Q∞
infinite product n=1 (1 + an ) also converges and the claim would be proved.
P∞
To see the convergence of the series n=N0 Log(1 + an ), recall that the
function z 7→ Log(z) has the convergent Taylor expansion
∞
X (−1)m−1
Log(z) = (z − 1)m (|z − 1| < 1).
m=1
m
∞ ∞ ∞ ∞ ∞
!2
X X X X X
| Log(1 + an )| ≤ |an | + C |an |2 ≤ |an | + C |an | < ∞,
n=N0 n=N0 n=N0 n=N0 n=N0
Q∞
Proof. Lemma 8 above implies that the infinite product n=1 (1 + fn (z)) con-
verges to a nonzero limit for any z ∈ Ω. By repeating the same estimates
in the proof of that lemma in the context of z being allowed to range on
a compact subset K ⊂ Ω, one sees that the sequence of partial products
Qn
k=1 (1 + fn ) actually converges uniformly on compacts, so the limiting func-
tion is holomorphic.
Q∞ z
e−z/n is an entire function.
Proof that n=1 1+ n
∞ ∞ 2
X z −z/n X z z z
1+ e − 1 = 1+ 1− +O − 1
n n n n 2
n=1 n=1
∞ 2
X z
= O <∞
n=1
n2
As with the gamma function, the Riemann zeta function is usually defined
on only part of the complex plane and its definition is then extended by an-
alytic continuation. Again, I will formulate this as a theorem asserting the
existence of the zeta function and its various properties.
Theorem 32 (Riemann zeta function). There exists a unique function, de-
noted ζ(s), of a complex variable s, having the following properties:
3. Euler product formula: for Re(s) > 1, ζ(s) also has an infinite product
representation
Y 1
ζ(s) = ,
p
1 − p−s
where the product ranges over the prime numbers p = 2, 3, 5, 7, 11, . . ..
5. ζ(s) has no zeros on the line Re(s) = 1 (this requires a separate proof
from the previous claim).
6. The “trivial” zeros: the zeros of ζ(s) in the region Re(s) ≤ 0 are pre-
cisely at s = −2, −4, −6, . . ..
Bn+1
ζ(−n) = − (n = 1, 2, 3, . . .).
n+1
(Note that for negative even integers, this coincides with the property
stated above about the trivial zeros at s = −2, −4, −6, . . ., since it is an
easy fact that the Bernoulli numbers satisfy B2k+1 = 0 for integer k ≥ 1.
But this formula adds information about the values of ζ(s) at negative
odd integers.)
ζ ∗ (1 − s) = ζ ∗ (s),
11. Mellin transform representation: an expression for ζ(s) valid for all
s ∈ C is
s
π −s/2 Γ ζ(s)
2 Z ∞
1 1 1 s+1 s−2
=− − + t− 2 +t 2 (ϑ(t) − 1) dt,
1−s s 2 1
where the sum is over all prime powers less than or equal to x. Then
for non-integer x > 1,
X xρ
ψ(x) = x − − log(2π),
ρ
ρ
where the sum ranges over all zeros ρ of the Riemann zeta function
counted with their respective multiplicities. (In most textbooks the sum
is separated into two sums, one ranging over the trivial zeros which can
be evaluated explicitly, and the other ranging over the much less trivial
zeros in the strip 0 < Re(s) < 1. Also the sum is only conditionally
convergent; refer to a book on analytic number theory for the proper
way to interpret it to get a convergent sum.)
The explicit formula of number theory illustrates that knowing where the
zeros of ζ(s) are has important consequences for prime number enumeration.
In particular, proving that Re(s) has no zeros in Re(s) ≥ 1 will enable us to
prove one of the most famous theorems in mathematics.
Theorem 33 (Prime number theorem). Let π(x) denote the number of prime
numbers less than or equal to x. Then we have
π(x)
lim = 1.
x→∞ x/ log x
Conjecture 1 (The Riemann hypothesis). All the nontrivial zeros of ζ(s) are
on the “critical line” Re(s) = 1/2.
To begin the proof of Theorem 32, again, let’s take as the definition of
ζ(s) the standard representation
∞
X 1
ζ(s) = .
n=1
ns
−s − Re(s)
P P
Since n |n | = nn , we see that the series converges absolutely
precisely when Re(s) > 1, and that the convergence is uniform on any half-
plane of the form Re(s) > α where α > 1. In particular, it is uniform on
compact subsets, so ζ(s) is holomorphic in this region.
63 15 THE RIEMANN ZETA FUNCTION
Proof of the Euler product formula. We now prove that Z(s) = ζ(s). This can
be done by manipulating the partial products associated with the infinite
product defining Z(s), as follows:
Y 1 Y
ζN (s) := = (1 + p−s + p−2s + p−3s + . . .)
p≤N
1 − p−s p≤N
X 1
= ,
j j
ns
n=p11 ···pkk
p1 ,...,pk primes ≤N
where the last equality follows from the fundamental theorem of arithmetic,
together with the fact that when multiplying two (or a finite number of) in-
finitely convergent series, the summands can be rearranged and summed in
any order we desire. So, we have represented ζN (s) as a series of a simi-
lar form as the series defining ζ(s), but involving terms of the form n−s only
for those positive integers n whose prime factorization contains only primes
≤ N . It follows that
X 1
|ζ(s) − ζN (s)| ≤ s
.
n>N
n
Taking the limit as N → ∞ shows that Z(s) = limN →∞ ζN (s) = ζ(s). This
proves the validity of the Euler product formula.
Proof. The Euler product formula gives a convergent product for ζ(s) in this
region where each factor (1 − p−s )−1 has no zeros.
where Z ∞
fˆ(k) = f (x)e−2πikx dx.
−∞
is the Fourier transform of f .
In particular, setting x = 0 in the formula for g(x) gives the basic fact that
∞
X
g(0) = ĝ(k).
k=−∞
P∞
However, note that g(0) = n=−∞ f (n), the quantity on the left-hand side of
the Poisson summation formula. On the other hand, the Fourier coefficient
ĝ(k) can be expressed in terms of the Fourier coefficients of the original
function f (x):
Z 1 Z 1 ∞
X
−2πikx
ĝ(k) = g(x)e dx = f (x + n)e−2πikx dx
0 0 n=−∞
∞
X Z 1 ∞ Z
X n+1
−2πikx
= f (x + n)e dx = f (u)e−2πiku du
n=−∞ 0 n=−∞ n
Z ∞
= f (u)e−2πiku du = fˆ(k).
−∞
Combining these observations gives the result, modulo a few details we’ve
glossed over concerning the precise assumptions that need to be made about
f (x) (we will only apply the Poisson summation formula for one extremely
well-behaved function, so I will not bother discussing those details).
65 15 THE RIEMANN ZETA FUNCTION
Theorem 35. The Jacobi theta function ϑ(t) satisfies the functional equation
1
ϑ(t) = √ ϑ(1/t) (t > 0).
t
Remark 11. Equations of this form are studied in the theory of modular
forms, an area of mathematics combining number theory, complex analysis
and algebra in a very surprising and beautiful way.
Proof. The idea is to apply the Poisson summation formula to the function
2
f (x) = e−πtx ,
2
(that is, the fact that the function e−πx is its own Fourier transform); this
evaluation appears in Example 1, Chapter 2, pages 42–44 in [11]. With
the above substitution for f (x) and fˆ(k), the Poisson summation formula be-
comes precisely the functional equation for ϑ(t).
Exercise 15. (a) Use the residue theorem to evaluate the contour integral
2
e−πz t
I
dz,
γN e2πiz − 1
valid for Re(s) > 0. A linear change of variable x = πn2 t brings this to the
form s Z ∞
2
π −s/2 Γ n−s = e−πn t ts/2−1 dt.
2 0
Summing the left-hand side over n = 1, 2, . . . gives π −s/2 Γ 2s ζ(s) — the func-
tion we denoted ζ ∗ (s) — adding the stronger assumption that Re(s) > 1. For
the right-hand side we have that
∞ Z ∞
!
∞ Z ∞
2 2t
X X
e−πn t ts/2−1 dt. = e−πn ts/2−1 dt
n=1 0 0 n=1
∞
ϑ(t) − 1 s/2−1
Z
= t dt,
0 2
where the estimates in the lemma are needed to justify interchanging the or-
der of the summation and integration, and show that the integral converges
for Re(s) > 1. Thus we have obtained the representation
Z ∞ Z ∞
∗ 1 s/2−1
ζ (s) = (ϑ(t) − 1)t dt = ϕ(t)ts/2−1 dt,
2 0 0
1
where we denote ϕ(t) = − 1). Next, the idea is to use the functional
2
(ϑ(t)
equation for ϑ(t) to bring this to a new form that can be seen to be well-
defined for all s ∈ C except s = 1. Specifically, we note that the functional
equation for can be expressed in the form
ϕ(t) = t−1/2 ϕ(1/t) + 12 t−1/2 − 12 .
67 15 THE RIEMANN ZETA FUNCTION
We have derived a formula for ζ ∗ (s) (one of the formulas claimed in the main
theorem above) that is now seen to define a meromorphic function on all of C
— the integrand decays rapidly as t → ∞ so actually defines an entire func-
tion, so the only poles are due to the two terms −1/s and 1/(s − 1). We have
therefore proved that ζ(s) can be analytically continued to a meromorphic
function on C.
ζ ∗ (1 − s) = ζ ∗ (s).
Corollary 14. The only pole of ζ(s) is a simple pole at s = 1 with residue 1.
has a pole at s = 1 with residue π 1/2 Γ(1/2)−1 = 1, and a pole (that turns out
to be a removable singularity) at s = 0 with residue π 0 Γ(0)−1 = 0. (That is,
the pole of ζ ∗ (s) at s = 0 is cancelled out by the zero of Γ(s/2).)
Corollary 16. The zeros of ζ(s) in the region Re(s) < 0 are precisely the
trivial zeros s = −2, −4, −6, . . ..
Proof. We already established the existence of the trivial zeros. The fact that
there are no other zeros also follows easily from the functional equation and
is left as an exercise.
This representation is certainly valid for Re(s) > 1. However, note that we
have the bound
−s
x − bxc−s ≤ |s| · bxc− Re(s)−1
(x ≥ 1)
by the mean value theorem. Thus, the integral is actually an absolutely con-
vergent integral in the larger region Re(s) > 0, and the representation we
derived gives an analytic continuation of ζ(s) to a meromorphic function on
Re(s) > 0, which has a single pole at s = 1 (a simple pole with residue 1) and
is holomorphic everywhere else.
An elaboration of this idea using what is known as the Euler-Maclaurin
summation formula can be used to perform the analytic continuation of ζ(s)
to a meromorphic function on C by extending it inductively from each region
Re(s) > −n to Re(s) > −n − 1, as we saw could be done for the gamma
function. Another approach is to use the analytic continuation for Re(s) > 0
shown above, then prove that the functional equation ζ ( 1 − s) = ζ ∗ (s) holds in
the region 0 < Re(s) < 1, and then use the functional equation to analytically
continue ζ(s) to Re(s) ≤ 0 (which is the reflection of the region Re(s) ≥ 1
under the transformation s 7→ 1 − s).
Proof. For this proof, denote s = σ + it, where we assume σ > 1 and t is
real and nonzero. The proof is based on investigating simultaneously the
behavior of ζ(σ + it), ζ(σ + 2it), and ζ(σ), for fixed t as σ & 1. Consider the
following somewhat mysterious quantity
We can evaluate “X ” as
where Log(·) denotes the principal branch of the logarithm function. Now
note that for z = a + ib with a > 1 and p prime we have |p−z | = p−a < 1, so
∞
X
−z p−mz
− Log(1 − p ) = ,
m=1
m
and
∞
h X
−zp−ma
i h i
− Re Log(1 − p ) = Re cos(mb log p) + i sin(mb log p)
m=1
m
∞
X p−ma
= cos(mb log p).
m=1
m
So we can rewrite X as
∞
X
X= cn n−σ (3 + 4 cos θn + cos(2θn ))
n=1
where θn = t log n and cn = 1/m if n = pm for some prime p. We can now use
a simple trigonometric identity
eX = |ζ(σ)3 ζ(σ + it)4 ζ(σ + 2it)| = O((σ − 1)−3 (σ − 1)4 ) = O(σ − 1).
In particular, eX → 0 as σ & 1, in contradiction to the result we proved above
that eX ≥ 1. This proves the claim that ζ(s) cannot have a zero on the line
Re(s) = 1.
Exercise 16. The above proof that eX ≥ 1 (which immediately implied the
claim of the theorem) relied on showing that for any prime number p, the
corresponding factors in the Euler product formula satisfy the inequality
(1 − x)3 |1 − zx|4 |1 − z 2 x| ≤ 1
for all x ∈ [0, 1] and z satisfying |z| = 1. Since this is an elementary inequality,
it seems like it ought to have an elementary proof (i.e., a proof that does not
involve logarithms and power series expansions). Can you find such a proof?
of the Riemann zeta function as a tool for counting prime numbers. (This
was the only number theory paper Riemann wrote in his career!) The his-
tory (including all the technical details) of these developments is described
extremely well in the classic textbook [4], which I highly recommend.
The original proofs of the prime number theorem were very complicated
and relied on the “explicit formula of number theory” (that I mentioned in
the previous section) and some of its variants. Throughout the 20th cen-
tury, mathematicians worked hard to find simpler ways to derive the prime
number theorem. This resulted in several important developments (such as
the Wiener tauberian theorem and the Hardy-Littlewood tauberian theorem)
that advanced not just the state of analytic number theory but also complex
analysis, harmonic analysis and functional analysis. Despite all the efforts
and the discovery of several paths to a proof that were simpler than the orig-
inal approach, all proofs remained quite difficult. . . until the year 1980, when
the mathematician Donald Newman discovered a wonderfully simple way to
derive the theorem using a completely elementary use of complex analysis.
It is Newman’s proof (as presented in the short paper [13] by D. Zagier) that
I present here.
Define the weighted prime counting functions
X
π(x) = #{p prime : p ≤ x} = 1,
p≤x
X X log x
ψ(x) = log p = log p ,
p≤x
log p
pk ≤x
with the convention that the symbol p in a summation denotes a prime num-
ber, and pk denotes a prime power, so that summation over p ≤ x denotes
summation over all primes ≤ x, and the summation over pk denotes summa-
tion over all prime powers ≤ x. Another customary way to write the function
ψ(x) is as X
ψ(x) = Λ(n),
n≤x
where the function Λ(n), called the von Mangoldt function, is defined by
Lambda (
log p if n = pk , p prime,
Λ(n) =
0 otherwise.
x
Lemma 11. The prime number theorem π(x) ∼ log x
is equivalent to the
statement that ψ(x) ∼ x.
73 16 THE PRIME NUMBER THEOREM
In the opposite direction, we have a similar (but slightly less elegant) in-
equality, namely that for any 0 < < 1 and x ≥ 2,
X X X
log x1−
ψ(x) ≥ log p ≥ log p ≥
p≤x x1− <p≤x x1− <p≤x
= (1 − ) log x π(x) − π(x1− ) ≥ (1 − ) log x π(x) − x1− .
Now assume that ψ(x) ∼ x as x → ∞. Then the first of the two bounds above
implies that
ψ(x)
π(x) ≥ ,
log x
so
x
lim inf π(x)/ ≥ 1.
x→∞ log x
On the other hand, the second of the two bounds implies that
1 ψ(x)
π(x) ≤ · + x1− ,
1 − log x
x 1 log x 1
which implies that lim supx→∞ π(x)/ log x ≤ 1− +lim supx→∞ x
= 1−
. Since
was an arbitrary number in (0, 1), it follows that
x
lim sup π(x)/ ≤ 1.
x→∞ log x
Combining the two results about the lim inf and the lim sup gives that π(x) ∼
x/ log x.
Now assume that π(x) ∼ logx x , and apply the inequalities we derived above
in the opposite direction from before. That is, we have
so
lim sup ψ(x)/x ≤ 1.
x→∞
On the other hand,
implies that
log x
lim inf ψ(x)/x ≥ lim (1 − ) 1 − = 1 − .
x→∞ x→∞ x
ψ(x)
Again, since ∈ (0, 1) was arbitrary, it follows that lim inf x→∞ = 1. Com- x
ψ(x)
bining the two results about the lim inf and lim sup proves that limx→∞ x =
1, as claimed.
Lemma 12. For Re(s) > 1 we have
∞
ζ 0 (s) X
− = Λ(n)n−s .
ζ(s) n=1
Proof. Using the Euler product formula and taking the logarithmic deriva-
tive (which is an operation that works as it should when applied to infinite
products of holomorphic functions that are uniformly convergent on compact
subsets), we have
ζ 0 (s) X d
ds
(1− p−s ) X log p · p−s
− = =
ζ(s) p
1 − p−s p
1 − p−s
X ∞
X X
−s −2s −3s
= log p (p +p +p + . . .) = log p · p−ks
p p prime k=1
∞
X
= Λ(n)n−s .
n=1
Lemma 13. There is a constant C > 0 such that ψ(x) < Cx for all x ≥ 1.
√
(The estimate O( n log2 n) for the sum of log p for prime powers higher than
1 is easy and is left as an exercise.) Taking the logarithm of both sides, this
gives the bound
√
ψ(2n) − ψ(n) ≤ 2n log 2 + C1 n log n ≤ C2 n,
for T > 0, which for any T is an entire function of z . Our goal is to show
that limT →∞ gT (0) = g(0). This can be achieved using a clever application of
Cauchy’s integral formula. Fix some large R > 0 and a small δ > 0 (which
depends on R in a way that will be explained shortly), and consider the con-
tour C consisting of the part of the circle |z| = R that lies in the half-plane
Re(z) ≥ −δ , together with the straight line segment along the line Re(z) = −δ
connecting the top and bottom intersection points of this circle with the line
(see Fig. 6(a)). Assume that δ is small enough so that g(z) (which extends an-
alytically at least slightly to the right of Re(z) = 0) is holomorphic in an open
76 16 THE PRIME NUMBER THEOREM
C C+ C- ' C+
C-
R R R
δ δ
set containing C and the region enclosed by it. Then by Cauchy’s integral
formula we have
z 2 dz
Z
1 Tz
g(0) − gT (0) = (g(z) − gT (z))e 1+ 2
2πi C R z
z 2 dz
Z Z
1 Tz
= + (g(z) − gT (z))e 1+ 2 ,
2πi C+ C− R z
where we separate the contour into two parts, a semicircular arc C+ that
lies in the half-plane Re(z) > 0, and the remaining part C− in the half-plane
Re(z) < 0 (Fig. 6(b)). We now bound the integral separately on C+ and on
C− . First, for z lying on C+ we have
∞ ∞
Be− Re(z)T
Z Z
−zt
|e−zt | dt =
|g(z) − gT (z)| = f (t)e dt ≤ B
,
T T Re(z)
where B = supt≥0 |f (t)|, and
z 2
Tz 2 Re(z)
e 1 + 2 = eRe(z)T
R R
(by the trivial identity |1 + eit |2 = |eit (eit + eit )|2 = 2 cos(t), valid for t ∈ R). So
in combination we have
z 2 dz
Z
1 Tz 2B B
2πi (g(z) − gT (z))e 1+ 2 ≤ (πR) 2
= .
C+ R z 2πR R
0
can deform the contour, replacing it with the semicircular arc C− = {|z| =
R, Re(z) < 0} (Fig. 6(c)). On this contour we have the estimate
T T
Be− Re(z)T
Z Z
−zt
|e−zt | dt =
|gT (z)| = f (t)e dt ≤ B ,
0 −∞ | Re(z)|
2B
lim sup |g(0) − gT (0)| ≤ .
T →∞ R
Since R was an arbitrary positive number, the lim sup must be 0, and the
theorem is proved.
which is bounded by the lemma we proved above, as our function f (t). The
associated function g(z) is then
Z ∞ Z ∞
−t −ztψ(x)
g(z) = t
(ψ(e )e − 1)e dt = − 1 x−z−1 dx
0 1 x
Z ∞ Z ∞ X !
1 1
= ψ(x)x−z−2 dx − = Λ(n) x−z−2 dx −
1 z 1 n≤x
z
∞ Z ∞ ∞ ∞
x−z−1
X
−z−2 1 X 1
= Λ(n) x dx − = Λ(n) −
n=1 n z n=1
−z − 1 n
z
∞
1 X 1 1 ζ 0 (z + 1) 1
= Λ(n)n−z−1 − = − · − (Re(z) > 0).
z + 1 n=1 z z + 1 ζ(z + 1) z
78 16 THE PRIME NUMBER THEOREM
Proof of the prime number theorem. We will prove that ψ(x) ∼ x, which we
already showed is equivalent to the prime number theorem. Assume by con-
ψ(x) ψ(x)
tradiction that lim supx→∞ x > 1 or lim inf x→∞ x < 1. In the first case,
that means there exists a number λ > 1 such that ψ(x) ≥ λx for arbitrarily
large x. For such values of x it then follows that
λx λx λ
ψ(t) − t λ−t λx − t
Z Z Z
dt ≥ dt =: A > 0, dt =
x xt2 1 t2 t2
R∞
but this is inconsistent with the fact that the integral 1 (ψ(x) − x)x−2 dx
converges.
ψ(x)
Similarly, in the event that lim inf x→∞ x < 1, that means that there exists
a µ < 1 such that ψ(x) ≤ µx for arbitrarily large x, in which case we have
that
x x 1
ψ(t) − t λx − t λ−t
Z Z Z
dt ≤ dt = dt =: B < 0,
λx t2 λx t2 λ t2
again giving a contradiction to the convergence of the integral.
79 17 INTRODUCTION TO ASYMPTOTIC ANALYSIS
and more. At the heart of many such results is an important technique known
as the saddle point method. Some related techniques (that are all minor
variations on the same theme) are Laplace’s method, the steepest descent
method and the stationary phase method.
d −x n
e x = e−x −xn + nxn−1 = e−x xn−1 (−x + n),
0=
dx
3
As a general rule of problem-solving, it’s often helpful to start attacking a problem by
thinking about really easy things you can say about it before moving on to advanced tech-
niques. It’s a great way to develop your intuition, and sometimes you discover that the easy
techniques solve the problem outright.
80 17 INTRODUCTION TO ASYMPTOTIC ANALYSIS
i.e., when x = n. Plugging this value into the inequality gives the bound
n! ≥ (n/e)n (n ≥ 1).
This is of course a standard and very easy result, but it’s intriguing to note
that it’s brought us quite close to the “true” asymptotics given by (2). The
point of this trivial calculation is that, as we shall see below, there is some-
thing special about the value x = n that resulted from this maximization op-
eration; when interpreted in the context of complex analysis, it corresponds
to a so-called “saddle point,” since it is a local minimum of ex /xn as one
moves along the real axis, but it will be a local maximum when one moves
in the orthogonal direction parallel to the imaginary axis. Thus, the trivial
approach has revealed the kernel of a deeper, much more powerful one.
Now let’s move on to the more powerful approach. Our “toy” approach
was based purely on real analysis, making use of the Taylor expansion of
the function x 7→ ex ; it turns out we could do better by thinking complex-
analytically. Start with the power series expansion
∞
z
X zn
e = .
n=0
n!
As we know very well from our study of Cauchy’s integral formula and the
residue theorem, the nth Taylor coefficient can be extracted from the func-
tion by contour integration, that is, by writing
ez
I
1 1
= dz,
n! 2πi |z|=r z n+1
term that was already canceled out. For convenience, rewrite this as
π
nn
Z
1
exp n(eit − 1 − it) dt.
n
=
e n! 2π −π
For I2 , we have
√
π n
Z
√ iu
iu/ n
|I2 | ≤ 2 exp n e − 1 − √ du
M
n
Z π √n √
=2 exp n Re eiu/ n − 1 du
M
Z π √n
u
=2 exp n cos √ −1 du
M n
Now use the elementary fact that cos(t) ≤ 1 − t2 /8 for x ∈ [−π, π] (see Fig. 7)
to infer further that
√
Z π n √
exp −u2 /8 du ≤ 2π n exp −n1/5 = O(n−1/5 ).
|I2 | ≤ 2
M
Combining the above results, we have proved the following version of Stir-
ling’s formula with a quantitative (though suboptimal) bound:
1.0
0.5
-3 -2 -1 1 2 3
-0.5
-1.0
4n
2n
= (1 + o(1)) √ .
n πn
(Note that this is not too far from the trivial upper bound 2n
n
≤ (1 + 1)2n =
2n
2 .) It is instructive to rederive this result using the saddle-point method,
starting from the expansion
2n
2n
X 2n n
(1 + z) = z ,
k=0
k
(1 + z)2n
I
2n 1
= dz.
n 2πi |z|=r z n+1
By the same trivial method for deriving upper bounds that we used in the
case of the Taylor coefficients 1/n! of the function ez , we have that for each
x > 0,
2n
≤ (1 + x)2n /xn = exp (log(1 + x) − n log x) .
n
We optimize over x by differentiating the expression log(1 + x) − n log x inside
the exponent and setting the derivative equal to 0. This gives x = 1, the
location of the
saddle point. For this value of x, we again recover the trivial
2n 2n
inequality n ≤ 2 .
84 17 INTRODUCTION TO ASYMPTOTIC ANALYSIS
Now note that the expression in the exponent has the Taylor expansion
1
n(2 log(1 + eit ) − t) = 2 log 2 − nt2 + O(nt4 ) as t → 0.
4
√
Again, we see that a change of variables u = t/ n will bring the integrand
to an asymptotically scale-free form. More precisely, we have
Z π !
2n 1 1 2
= exp n 2 log 2 − nt + O(nt4 ) dt
n 2π −π 4
Z π
4n
4
1 2 u
= √ exp − u + O du.
2π n −π 4 n
Exercise 17. Complete this analysis to give a rigorous proof using this
√
method of the asymptotic formula 2n
n
= (1 + o(1))4n
/ πn.
Exercise 18. Repeat this analysis for the sequence (bn )∞ n=1 of central trino-
mial coefficients, where bn is defined as the coefficient of xn in the expansion
of (1+x+x2 )n , a definition that immediately gives rise to the contour integral
representation
(1 + z + z 2 )n
I
1
bn = dz.
2πi |z|=r z n+1
85 17 INTRODUCTION TO ASYMPTOTIC ANALYSIS
2 5
1 4
3
10 20 30 40
-1 2
-2 1
-3
10 20 30 40
-4 -1
-5 -2
(a) (b)
Like their more famous cousins the central binomial coefficients, these coef-
ficients are important in combinatorics and probability theory. Specifically,
an and bn correspond to the numbers of random walks on Z that start and end
at 0 and have n steps, where in the case of the central binomial coefficients
the allowed steps of the walk are −1 or +1, and in the case of the central
trinomial coefficients the allowed steps are −1, 0 or 1; see Fig. 8.
Using a saddle point analysis, show that the asymptotic behavior of bn as
n → ∞ is given by √
3 · 3n
bn ∼ √ .
πn
in the form
e−ng(z) dz
I I
1 1 dz
a(n) = = exp − n(g(z) + log z)
2πi |z|=r z n z 2πi |z|=r z
Z π
1 it
= e−ng(re ) r−n e−int dt
2π −π
Z π
1
it
= exp − n(g(re ) + it − log r) dt.
2π −π
d 1
(g(z) + log z) = g 0 (z) + = 0.
dz z
This causes the first-order term in the Taylor expansion of g(z) + log z around
z = r to disappear. One is then left with a constant term, that can be pulled
outside of the integral; a second order term, which (in favorable circum-
stances where this technique actually works) causes the integrand to be
2
well-approximated by a Gaussian density function e−u /2 near z = r ; and
lower-order terms which can be shown to be asymptotically negligible.
Geometrically, if one plots the graph of |g(z) + 1/z| then one finds the
emergence of a saddle point at z = r , and this is the origin of the term
“saddle point method.” This phenomenon is illustrated with many beautiful
examples and graphical figures in the lecture slides [6] prepared by Flajolet
and Sedgewick as an online resource to accompany their excellent textbook
Analytic Combinatorics [5].
Exercise 19. It is instructive to see an example where the saddle point anal-
ysis fails if applied mindlessly without checking that the part of the integral
that is usually assumed to make a negligible contribution actually behaves
that way. A simple example illustrating what can go wrong is the function
∞ ∞
z2
X z 2n X
f (z) = e = = bn z n ,
n=0
n! n=0
Clearly any analysis, asymptotic or not, needs to address and take into ac-
count the fact that bn behaves differently according to whether n is even or
odd. Try to apply the method we developed to derive an asymptotic formula
for bn . The method fails, but the failure can easily be turned into a success
by noting that there are actually two saddle points, each of which makes a
contribution to the integral, in such a way that for odd n the contributions
cancel and for even n they reinforce each other. This shows that periodicities
are one common pitfall to look out for when doing asymptotic analysis.
Exercise 20. As another amusing example, apply the saddle point method
P∞
to the function f (z) = 1/(1 − z) = n=0 dn z n , for which the Taylor coefficients
dn = 1 are all equal to 1. Can you succeed in deriving an asymptotic formula
for the constant function 1?
where we define
Φ(u) = u − 1 − log u,
Z ∞
du
I(t) = e−tΦ(u) .
0 u
(Again, note that we massaged the integrand to cancel the Taylor expansion
of − log u around u = 1 up to the first order.) Our goal is to prove that
r
2π
I(t) = + O(t−7/10 ) as t → ∞.
t
As before, this will be done by splitting the integral into a main term and
error terms. The idea is that for large t, the bulk of the contribution to
the integral comes from a region very near the point where Φ(u) takes its
minimum. It is easy to check by differentiation that this minimum is obtained
at u = 1, and that we have
so that I(t) = I1 + I2 + I3 . The main contribution will come from I2 , the part
of the integral that contains the critical point u = 1, so let us examine that
term first. Expanding Φ(u) in a Taylor series around u = 1, we have
(u − 1)2
Φ(u) = + O((u − 1)3 )
2
(u−1)2
for u ∈ [1/2, 2] (in fact the explicit bound Φ(u) − 2 ≤ (u − 1)3 on this
interval can be easily checked). As before, noting that
√
(u − 1)2 1 √ ( t(u − 1))3
3 2
t + O((u − 1) ) = ( t(u − 1)) + O √ ,
2 2 t
89 17 INTRODUCTION TO ASYMPTOTIC ANALYSIS
3.0
2.5
2.0
1.5
1.0
0.5
1 2 3 4
√
we see that it is natural to apply a linear change of variables v = t(u − 1) to
bring the integrand to a scale-free, centered form. This results in
√
Z t
1 v 1
I2 = √ √
exp −tΦ 1 + √ √ dv
t 1 t 1 + v/ t
− t
2
Z √t 2 3
1 v v t
=√ √
exp − + O √ 1+O √ dv.
t − 12 t 2 t t
As before, we actually need to split
√ up this integral into two parts to take into
3
account the fact that the O(v / t) term can blow up when v is large enough.
Let M = t1/10 , and denote
Z M
1 v 1
J1 = √ exp −tΦ 1 + √ √ dv,
t −M t 1 + v/ t
Z
1 v 1
J2 = √ exp −tΦ 1 + √ √ dv,
t [− 21 √t,√t]\[−M,M ] t 1 + v/ t
so that I2 = J1 + J2 . For J1 we have
Z M
1 v 1
J1 = √ exp −tΦ 1 + √ √ dv
t −M t 1 + v/ t
Z M 3
1 −v 2 /2 v v
=√ e 1+O √ 1+O √ dv
t −M t t
Z M r
1 2 2π
= √ 1 + O(t−1/5 ) e−v /2 dv = 1 + O(t−1/5 ) ,
t −M t
in the last step using a similar estimate as the one we used in our proof of
Stirling’s approximation for n!. Next, for J2 we use the elementary inequality
90 17 INTRODUCTION TO ASYMPTOTIC ANALYSIS
(prove it as an exercise)
(u − 1)2
Φ(u) ≥ (0 ≤ u ≤ 1),
2
√ √ √
and the more obvious fact that 1/(1 + v/ t) ≤ 2 for v ∈ [− 12 t, t] to get that
Z Z ∞
2 −v 2 /2 4 2 /2
J2 ≤ √ e dv ≤ √ e−v dv
t 1√ √ t
[− t, t]\[−M,M ] M
2
1
= O(e−M ) = √ O(t−1/5 ).
t
as in our earlier proof. Combining the above results, we have shown that
r
−1/5
2π
I2 = 1 + O(t ) .
t
Next, we bound I1 . Here we use a different method since there is a different
source of potential trouble near the left end u = 0 of the integration inter-
val. Considering first a truncated integral over [ε, 1/2] and performing an
integration by parts, we have
Z 1/2 Z 1/2
−tΦ(u) du 1 d −tΦ(u) 1
e =− e du
ε u t ε du Φ0 (u)u
u=1/2
1 e−tΦ(u) 1 1/2 −tΦ(u) du
Z
=− − e .
t u − 1 u=ε t ε (u − 1)2
Taking the limit as ε → 0 (and noting that Φ(ε) → +∞ in this limit) yields the
formula
Z 1/2
2 1 du 1
I1 = e−tΦ(1/2) − e −tΦ(u)
=O as t → ∞.
t t 0 (u − 1)2 t
Sδ = {z ∈ C : | arg z| ≥ π − δ}
91 17 INTRODUCTION TO ASYMPTOTIC ANALYSIS
Here, ss−1/2 is defined as exp((s − 1/2) Log s), where Log denotes as usual
the principal branch of the logarithm function. This sort of approximation is
important in certain applications of complex analysis, for example to analytic
number theory.
92 PROBLEMS
Problems
1. Below is a list of basic formulas in complex analysis (think of them as
“formulas you need to know like the back of your hand”). Review each
of them, making sure that you understand what it says and why it is
true—that is, if it is a theorem, prove it, or if it is a definition, make
sure you understand that that is the case.
4. For each of the following functions u(x, y), determine if there exists
a function v(x, y) such that f (x + iy) = u(x, y) + iv(x, y) is an entire
function, and if so, find it, and try to find a formula for f (z) directly in
terms of z rather than in terms of its real and imaginary parts. (Hint4 )
-3 -2 -1 1 2 3
-1
-2
-3
a. w = 12 z d. w = (2 + i)z − 3
b. w = iz e. w = 1/z
c. w = z f. w = z 2 − 1
for some z1 , . . . , zn ∈ C (these are the roots of p(z) counted with multi-
plicities). Use this to prove that any such polynomial where the coeffi-
cients a0 , . . . , an are real has a factorization
where w1 , . . . , wn−1 denote the roots of p0 (z). Prove that w1 , . . . , wn−1 all
lie in the convex hull of z1 , . . . , zn (see Figure 1 for an illustration). That
is, each wk can be expressed as a convex combination
wk = α1 z1 + α2 z2 + . . . + αn zn ,
P
where α1 , . . . , αn are nonnegative real numbers and j αj = 1. (To be
clear, there are different coefficients for each k .)
z3
z4
w3
w1
z1
w2
z2
Figure 10: An example of the roots of a complex polynomial and of its deriva-
.
tive. Here z1 = 0, z2 = 3 − i, z3 = 2 + 2i, z4 = 1+3i
2
and w1 = 0.375 + 0.586i,
. .
w2 = 2.336 − 0.335i, w3 = 1.414 + 1.624i.
p = −3uv, (6)
q = −(u3 + v 3 ) (7)
are satisfied.
(c) Explain why, in order to solve the pair of equations (6)–(7), one can
alternatively solve
p3
= −RS, (8)
27
q = −(R + S), (9)
(e) Using the above reductions, show that the three solutions of the
simplified cubic (5) can be expressed as
w1 = u + v,
w2 = ζ u + ζ v,
w3 = ζ u + ζ v,
√
where ζ = e2πi/3 = 21 (−1 + i 3) (a cube root of unity) and u, v are
properly chosen cube roots of R, S obtained as solutions to (10).
(f) Illustrate the above procedure by applying it to get formulas for
the roots of the cubic equation
z 3 + 6z 2 + 9z + 3 = 0.
Bring the formulas to a form that makes it clear that the roots are
real numbers.
a b
11. Let A = be a 2 × 2 real matrix. Prove the “conformality lemma”
c d
from page 11, which asserts the equivalence of the following three con-
ditions:
(a) A as a linear map preserves orientation (that is, det A > 0) and is
conformal, that is
hAw1 , Aw2 i hw1 , w2 i
=
|Aw1 | |Aw2 | |w1 | |w2 |
for all w1 , w2 ∈ R2 . (Here hw1 , w2 i denotes the standard inner prod-
uct in R2 , and |w| = hw, wi1/2 is the usual two-dimensional norm of
a vector in R2 .)
a b
(b) A takes the form A = for some a, b ∈ R.
−b a
cos θ − sin θ
(c) A takes the form A = r for some r > 0 and θ ∈ R.
sin θ cos θ
(That is, geometrically A acts by a rotation followed by a scaling.)
z = x + iy, z = x − iy
98 PROBLEMS
Show that, from this somewhat strange point of view, the Cauchy-
Riemann equations
∂u ∂v ∂u ∂v
= , =−
∂x ∂y ∂y ∂x
can be rewritten in the more concise equivalent form
∂f
= 0,
∂z
assuming that it is okay to apply the chain rule from multivariable cal-
culus; and moreover, that in this notation we also have the identity
∂f
f 0 (z) = .
∂z
15. Cauchy’s theorem and irrotational vector fields. Recall from vec-
tor calculus that a planar vector field F = (P, Q) defined on some region
∂g ∂g
Ω ⊂ C = R2 is called conservative if it is of the form F = ∇g = ∂x , ∂y
(the gradient of g ) for some scalar function g : Ω → R. By the funda-
mental theorem of calculus for line integrals, for such a vector field we
have I
F · ds = 0
γ
for any closed curve γ . Recall also that (as is easy to check) any con-
servative vector field is irrotational, that is, it satisfies
curl F = 0
17. Bessel functions. The Bessel functions are a family of functions (Jn )∞
n=−∞
of a complex variable,defined by
∞
X (−1)k z 2k+n
Jn (z) = .
k=0
k!(k + n)! 2
√ x
P∞ k
(For example, note that J0 (−2 x) = k=0 (k!)2 , which is reminiscent of
the exponential function and already seems like a fairly natural function
to study.) Find the radius of convergence of the series defining Jn (z),
and prove that the Bessel functions satisfy the following properties:
interesting):
∞
z 1 X
exp t− = Jn (z)tn ,
2 t n=−∞
∞
X
cos(z sin t) = J0 (z) + 2 J2n (z) cos(2nt),
n=1
∞
X
sin(z sin t) = 2 J2n+1 (z) sin((2n + 1)t),
n=0
∞
X
cos(z cos t) = J0 (z) + 2 (−1)n J2n (z) cos(2nt),
n=1
∞
X
sin(z cos t) = 2 (−1)n J2n+1 (z) sin((2n + 1)t),
n=0
Z π
1
Jn (z) = cos (z sin t − nt) dt.
π 0
Hint for the last equation: cos(a − b) = cos(a) cos(b) + sin(a) sin(b).
Remark. The Bessel functions are very important functions in
mathematical physics, and appear naturally in connection with
various problems in diffusion, heat conduction, electrodynamics,
quantum mechanics, Brownian motion, probability, and more. More
recently they played an important role in some problems in com-
binatorics related to longest increasing subsequences (a subject I
wrote a book about, available to download from my home page).
Their properties as analytic functions of a complex variable are
also a classical, though no longer very fashionable, topic of study.
18. Show that Liouville’s theorem (“a bounded entire function is constant”)
can be proved directly using the “simple” (n = 0) case of Cauchy’s in-
tegral formula, instead of using the case n = 1 of the extended formula
as we did in the lecture.
19. Show that Liouville’s theorem can in fact be deduced even just from
the mean value property of holomorphic functions, which is the special
case of Cauchy’s integral formula in which z is taken as the center of
the circle around which the integration is performed.
103 PROBLEMS
|f (z)| ≤ A + B|z|n
prove that p(z) has exactly n zeros (counting multiplicities) in the unit
disc |z| < 1.
Note. This is a special case of a less elementary fact that can be proved
using Rouché’s theorem; see problem 28
23. Spend at least 5–10 minutes thinking about the concept of a toy con-
tour. Specifically, for the case of a keyhole contour we discussed in
the context of the proof of Cauchy’s integral formula, think carefully
about the steps that are needed to get a proof of Cauchy’s theorem for
104 PROBLEMS
Guidance. This is not a trivial exercise, but is not very difficult when
broken down into the following elementary substeps:
sinh2 y
| cot(x + iy)| = ≤ 1,
1 + sinh2 y
and that when y = N and x is arbitrary,
1 + sinh2 N
| cot(x + iy)| ≤ ≤ 2 (if N > 10);
sinh2 N
then use these estimates to bound the integral.
iv. By comparing the two results about IN , deduce (11).
(b) Integrate the identity (11) to deduce (using some additional fairly
easy reasoning) the formulas
N ∞
X 1 1 X 2z
π cot(πz) = lim = + (z ∈ C \ Z). (12)
N →∞
n=−N
z+n z n=1 z 2 − n2
107 PROBLEMS
Note that the function on the right-hand side is (or can be easily
checked to be) an entire function of z with a simple zero at any
integer z = n ∈ Z, and whose Taylor expansion around z = 0 starts
with πz + O(z 3 ); thus it is a natural guess for an infinite product
expansion of sin(πz), although the fact that this guess is correct is
far from obvious; for example one can multiply the right-hand side
by an arbitrary function of the form eg(z) and still have an entire
function with the same set of zeros.
Hint. Compute the logarithmic derivatives of both sides of (13).
You may want to review some basic properties of infinite products,
as discussed for example on pages 140–142 of [Stein-Shakarchi].
(Spoiler alert: pages 142–144 contain a solution to this subexer-
cise, starting with an independent proof of (12) and proceeding
with a derivation of (13) along the same lines as I described above.)
(b) By specializing the value of z in (13) to an appropriate specific
value, obtain the following infinite product formula for π , known
as Wallis’ product (first proved by John Wallis in 1655):
π 2 2 4 4 6 6 8 8
= · · · · · · · ··· .
2 1 3 3 5 5 7 7 9
(c) By comparing the first terms in the Taylor expansion around z = 0
of both sides of (13), derive the well-known identities
∞ ∞
X 1 π2 X 1 π4
2
= , 4
= .
n=1
n 6 n=1
n 90
(d) More generally, one can use (13), or more conveniently (12), to
obtain closed formulas for all the series
∞
X 1 1 1 1
ζ(2k) = (k = 1, 2, . . .) = 1 + + + + ...,
n=1
n2k 22k 32k 42k
108 PROBLEMS
that is, the special values of the Riemann zeta function ζ(s) =
P∞ 1
n=1 ns at the positive even integers. To see this, first, rewrite
(12) as
∞
1 X 1 1
π cot(πz) = + − (z ∈ C \ Z). (14)
z n=−∞ z + n n
n6=0
27. Let f (z) = p(z)/q(z) be a rational function such that deg q ≥ deg p + 2
(where deg p denotes the degree of a polynomial p). Prove that the sum
of the residues of f (z) over all its poles is equal to 0.
28. (A generalization of the result from problem 21) If p(z) = an z n +an−1 z n−1 +
. . . + . . . + a0 is a polynomial of degree n such that for some 0 ≤ k ≤ n
we have X
|ak | > |aj |,
0≤j≤n
j6=k
prove that p(z) has exactly k zeros (counting multiplicities) in the unit
disk |z| < 1.
30. Show how Rouché’s theorem can be used to give yet another proof
of the fundamental theorem of algebra. This proof is one way to make
precise the intuitively compelling “topological” proof idea we discussed
at the beginning of the course.
i. Values at half-integers:
1
(2n)! √
Γ n+ 2
= π (n = 0, 1, 2, . . .).
4n n!
110 PROBLEMS
Γ s Γ s + k1 Γ s + k2 · · · Γ s + k−1
= (2π)(k−1)/2 k 1/2−ks Γ(ks).
k
33. For n ≥ 1, let Vn denote the volume of the unit ball in Rn . By evaluating
the n-dimensional integral
ZZ Z n
!
X
An = ... exp − 12 x2j dx1 dx2 . . . dxn
Rn j=1
π n/2
Vn = .
Γ n2 + 1
34. The beta function is a function B(s, t) of two complex variables, defined
for Re(s), Re(t) > 0 by
Z 1
B(s, t) = xs−1 (1 − x)t−1 dx.
0
Γ0 (s)
ψ(s) =
Γ(s)
of the gamma function, also considered as a somewhat important spe-
cial function in its own right.
Pn (x) = x(x − 1) . . . (x − n) (n = 0, 1, 2, . . .)
Prove this.
1.0 1.0
0.8 0.8
0.6 0.6
0.4 0.4
0.2 0.2
(a) (b)
1.0
0.8
1 2 3 4 5 6 7
0.6
-200
0.4
-400
0.2
-600
0.0 0.2 0.4 0.6 0.8 1.0
(c) (d)
Figure 11: (a) A plot of the fractional parts of the roots of Qn (x) for n = 50.
(b) The limiting function R(t). (c) The two previous plots combined. (d) The
polynomial P7 (x). Note that the roots of Q7 (x) correspond to the local minima
and maxima of P7 (x), which are highlighted.
1.0
0.8 α=1
0.6
0.4 α=2
α=3
α=4
0.2 α=5
2 4 6 8 10 12
37. (a) Show that the Laurent expansion of Γ(s) around s = 0 is of the form
1
Γ(s) = − γ + O(s)
s
(where γ is the Euler-Mascheroni constant). If you’re feeling especially
energetic, derive the more detailed expansion
γ 2 π2
1
Γ(s) = − γ + + s + O(s2 )
s 2 12
and proceed to derive (by hand, or if you prefer using a symbolic math
software application such as SageMath or Mathematica) as many addi-
tional terms in the expansion as you have the patience to do.
(b) Show that the Laurent expansion of ζ(s) around s = 1 is of the form
1
ζ(s) = + γ + O(s − 1).
s−1
115 PROBLEMS
38. Show that the symmetric version of the functional equation for the zeta
function
ζ ∗ (1 − s) = ζ ∗ (s),
where ζ ∗ (s) = π −s/2 Γ(s/2)ζ(s), can be rewritten in the equivalent form
πs
ζ(s) = 2s π s−1 sin Γ(1 − s)ζ(1 − s).
2
Γ0 (s)
39. Show that the Taylor expansion of the digamma function ψ(s) = Γ(s)
around s = 1 is given by
∞
X
ψ(s) = −γ − (−1)n−1 ζ(n + 1)(s − 1)n (|s − 1| < 1).
n=1
(a) Prove that the series defining D(s) converges uniformly on any
half-plane of the form Re(s) ≥ α where α > 0, and conclude that
D(s) is defined and holomorphic in the half-plane Re(s) > 0.
(b) Show that D(s) is related to the Riemann zeta function by the for-
mula
D(s) = (1 − 21−s )ζ(s) (Re(s) > 1).
(c) Using this relation, deduce a new proof that the zeta function can
be analytically continued to a meromorphic function on Re(s) > 0
that has a simple pole at s = 1 with residue 1 and is holomorphic
everywhere else in the region.
P
41. Let ψ(x) = pk ≤x log p denote von Mangoldt’s weighted prime count-
ing function. Show that ψ(n) = log lcm(1, 2, . . . , n), where for integers
a1 , . . . , ak , the notation lcm(a1 , . . . , ak ) denotes the least common multi-
ple of a1 , . . . , ak .
Note that this implies that an equivalent formulation of the prime num-
ber theorem is the interesting statement that
lcm(1, . . . , n) = e(1+o(1))n as n → ∞.
116 PROBLEMS
(where the product is over all prime numbers p that are ≤ x).
(b) Pass to the logarithm and deduce that for some constant K > 0 we
have the bound
X1
≥ log log x − K (x ≥ 1).
p≤x
p
1
P
That is, the harmonic series of primes p p diverges as log log x, in
contrast to the usual harmonic series which diverges as log x.
44. (a) Reprove Theorem 36 (the “toy Riemann hypothesis” — the result
that the Riemann zeta function has no zeros on the line Re(s) = 1) by
considering the behavior of
(b) Try to reprove the same theorem in yet a third way by considering
We saw that the zeta function and its logarithmic derivative have the
Dirichlet series representations
∞
X
ζ(s) = n−s ,
n=1
0 ∞
ζ (s) X
− = Λ(n)n−s .
ζ(s) n=1
Use the Euler product formula for the zeta function or other elementary
manipulations to prove the following identities (valid for Re(s) > 1):
∞
X
0
ζ (s) = − log n · n−s ,
n=1
∞
1 X
= µ(n)n−s ,
ζ(s) n=1
∞
ζ(s) X
= |µ(n)|n−s .
ζ(2s) n=1
Other famous Dirichlet series representations you may want to think
about or look up are
∞
X
2
ζ(s) = d(n)n−s ,
n=1
∞
ζ(s − 1) X
= φ(n)n−s ,
ζ(s) n=1
∞
X
ζ(s)ζ(s − 1) = σ(n)n−s .
n=1
6. The Jordan curve theorem. The Jordan curve theorem states that a
continuous, simple planar curve separates the plane into two connected
components, precisely one of which is unbounded. The theorem plays
120 PROBLEMS
References
[1] T. Apostol. Modular Functions and Dirichlet Series in Number Theory.
Springer, 1990.
[3] H. Duminil-Copin,
p S. Smirnov. The connective constant of the honeycomb
√
lattice equals 2+ 2. Ann. Math. 175 (2012), 1653–1665.
[7] A. Ivić. The Riemann Zeta-Function: Theory and Applications. Dover Pub-
lications, 2003.
[9] R. Maehara. The Jordan curve theorem via the Brouwer fixed point theo-
rem.. Amer. Math. Monthly 91 (1984), 641–643.
[13] D. Zagier. Newman’s short proof of the prime number theorem. Amer.
Math. Monthly 104 (1997), 705–708.
Index
absolutely convergent, 13 closed
H curve, 18
Airy function, 79 , 18
an -formula, 16 closed disc, 39
analytic continuation, 32 compact set, 31
analytic function, 7 compactification
geometric view, 7 one point, 39
angle preserving map, 8 complex differentiable, 7
arc length integral complex dynamics, 3
definition, 18 complex-valued function, 17
iϕ
argument: arg w ; w = re , arg w = ϕ complicated integrals, 2
, 42 conformal map, 8
argument principle, 41 conformal maps, 2
Arg z , 50 conformality lemma, 11
asymptotic analysis, 79 constant function
criterion, 21
Basel problem, 60 contour integral, 16
Bernoulli numbers, 60 definition, 18
contour integrals
calculus
Fundamental Theorem of Calcu-
fundamental theorem, 18
lus, 19
Cardano
properties, 19
cubic equation, 1 0
contours C , C+ , C− , C− , 76
Cardy-Smirnov formula, 2
convergence
Casorati-Weierstrass
uniform, 28
theorem, 40
convergence disc, 15
Cauchy inequalities, 29
convergent
Cauchy’s integral formula, 26
absolutely, 13
extended, 28
covering, 31
Cauchy’s theorem, 21
cubic equation, 1
consequences, 26
curve
for a disc, 25
closed, 18
general version, 49
equivalence class, 16
toy contour (?), 25
equivalent, 16
Cauchy-Riemann equations, 9
length len γ , 18
central binomial coefficient, 83
parametrization, 18
change of variables, 18
piecewise continuously differentiable,
clopen
17
connected and open set, 31
122
123 INDEX