Introduction to Fourier Series
Introduction to Fourier Series
net/publication/344486946
CITATIONS READS
2 1,879
1 author:
Zhuo Chen
Imperial College London
1 PUBLICATION 2 CITATIONS
SEE PROFILE
All content following this page was uploaded by Zhuo Chen on 06 October 2020.
Abstract
Fourier series has always been a heated topic in mathematics and physics.
This article summarizes contents in Fourier Analysis: an Introduction,
from the discovery that some functions can be written as the sum of sin
and cos, into the smoothness condition of its convergence and provides an
insight into its application in different fields such as number theory and
physics, which works as the guide for the development of Fourier analysis.
Contents
1 Introduction 3
1
3.2 Fourier Exponential Series . . . . . . . . . . . . . . . . . . . . . . 16
3.3 Restrictions on f . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
3.7 Convolutions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
5.1.2 Curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
2
5.1.4 Green’s theorem . . . . . . . . . . . . . . . . . . . . . . . 54
1 Introduction
Although the idea was motivated only to solve the heat equation, Fourier series
was then proven to be useful in many fields such as differential equations, algebra
and electrical engineering. Based on Fourier Analysis: an Introduction, we will
introduce the motivation, properties, convergence and applications of Fourier
series.
Not only to the heat equation, Fourier series applies to wave equation, Schrödinger
equation and so on. In fact, most linear differential equations with solutions in
trigonometric functions or exponentials have solutions in the form of Fourier
series, which we will explore in section 2.
Also, Fourier series of a given function doesn’t always converge to it. In sections
3 and 4, we will explore the smoothness conditions and we will discuss the
pointwise and uniform convergence.
3
Acknowledgement
We would like to express our thanks to Doctor Disheng Xu for his help in reading
and guidance for writing the report. We also want to thank Imperial College
for this wonderful experience in UROP.
This section introduces how mathematicians approached the idea of Fourier se-
ries. Along with the development of physics, mathematicians found many linear
differential equations having solutions in the form of trigonometric functions,
which can be sum up to a series.
Travelling waves consist of an initial profile F (x), the profile at t = 0, and are
travelling at a rate of c as t changes. It is characterized as the following.
u(x, t) = F (x − ct)
Standing waves consist of an initial profile φ(x) and an amplifying factor ψ(t).
The displacement u(x, t) is therefore their product.
u(x, t) = φ(x)ψ(t)
4
2.1.1 Derivation of wave equation
Wave equation concerns with the displacement x, the height u(x, t) and time t
of a string hanging with both ends fixed. We can assume without the loss of
generality that it is fixed at x = 0 and x = π, which gives us the first initial
conditions. The heights at the two points are zero.
Moreover, the string is steady at the beginning, that is, the initial velocity is
zero.
u(x, 0) = 0 (2.2)
We now start the derivation. We will first look at the discrete case, that is, we
consider several N particles with little masses connected by sticks. This is a
microcosm of the string. These particles have horizontal coordinates xi equally
distributed in x = 0 and x = π. y = u(x, t) denotes the displacement in given x
π
and t. The distance between each neighbor particle is h = N : x0 , x1 , ...xN and
xn = nh, yn = u(xn , t).
Our final result is based on Newton’s Second Law F = ma. The total force F is
the sum of the forces from the left and right, which can be obtained by Hooke’s
Law. For the nth particle,
τ
FL = (yn+1 − yn )
h
τ
FR = (yn−1 − yn )
h
Assume the density of the sticks with respect to the horizontal axis is ρ. The
right hand side is then
ρhyn00 (t)
τ
(yn+1 − 2yn + yn−1 ) = ρhyn00 (t)
h
5
yn+1 − 2yn + yn−1
τ = ρyn00 (t)
h2
q
τ
Let h → 0 and we have the following equation with c = ρ.
∂2u ∂2u
τ = ρ
∂x2 ∂t2
1 ∂2u ∂2u
2 2
=
c ∂t ∂x2
∂2u ∂2u
= (2.3)
∂T 2 ∂X 2
Now we are able to apply the definition of two kinds of waves into the problems as
each of them represents a solution. The one using the travelling waves is more
comprehensive, while the one with standing waves introduce us into Fourier
series.
∂u 1 ∂u 1 ∂u
= +
∂m 2 ∂x 2 ∂t
Therefore, all solutions can be written as travelling waves. Let’s then consider
the initial conditions. We have u(x, 0) = f (x) as f (x) is the initial profile.
u(x, 0) = u(π, t) = 0 as the string is fixed at x = 0 and x = π. Next, ∂u
∂t (x, 0) =
g(x), in which g(x) represents the initial velocity and g(0) = g(π) = 0, which
leads to the following equations (note that these should only apply to interval
[0, π], but we can extend the function to be odd and periodic of 2π).
6
F (x) + G(x) = f (x)
F 0 (x) − G0 (x) = g(x)
Z x
1
F (x) = f (x) + g(u) du + C1
2 0
Z x
1
G(x) = f (x) − g(u) du + C2
2 0
Z x+t
1 1
u(x, t) = f (x + t) + f (x − t) + g(u) du (2.4)
2 2 x−t
We now turn to the second method. When one guesses for solutions of a dif-
ferential equation, it is common to use separation of variables. In this case, we
can assume the solution u(x, t) can be written as φ(x)ψ(t), which coincides our
definition of standing waves. Substitute and we get
Note that the left-hand side only depends on x and the other side only depends
on t. The equality holds only when both are a constant λ.
ψ 00 (t) − λψ(t) = 0
(2.5)
φ00 (x) − λφ(x) = 0
7
As the string is fixed at x = 0 and x = π, φ(0) = φ(π) = 0 ⇒ Ā = 0. Then
B = 0 is trivial so assume B 6= 0 ⇒ m ∈ Z. Next, m = 0 is trivial and m ≤ −1
can be ignored by symmetry, so assume m ≥ 1 and arrange the above results as
Recall the definition of an equation being linear: if u and v are two solutions,
so is αu + βv with α and β two constants. Due to the linearity of derivatives,
the wave equation is linear as well, so we can superimpose the above results as
a sum.
∞
X
u(x, t) = (Am cos mt + Bm sin mt) sin mx (2.6)
n=1
∞
X
f (x) = u(x, 0) = Am sin mx
m=1
Z π ∞ Z π
X π
f (x) sin nx dx = Am sin mx sin nx dx = An ·
0 m=1 0 2
Z π
2
An = f (x) sin nx dx (2.7)
π 0
Example 2.1 (the plucked string). Let’s consider an example of plucked string
as in the diagram and with c = 1. The string has initial velocity 0 and is plucked
at x = π/4, y = 1, given by the following formula.
4
π x,
0 ≤ x ≤ π/4
f (x) = − π4 x + 2h, π/4 ≤ x ≤ π/2 (2.8)
0, π/2 ≤ x ≤ π
8
h
O
π/4 π/2 π
We are given the initial position f (x) and initial velocity g(x) = 0 to find Am
and Bm in (2.6). First, we have
∞
X
f (x) = u(x, 0) = An sin nx
n=1
π
8 2 sin nπ/4 − sin nπ/2
Z
2
An = f (x) sin nx dx =
π 0 π2 n2
∞
∂u X
(x, 0) = mBm sin mx = 0
∂t n=1
⇒ Bm = 0
∞
X 8 2 sin nπ/4 − sin nπ/2
u(x, t) = Am cos mt sin mx, An =
n=1
π2 n2
The heat equation deals with the temperature u(x, y, t) on a plane, which also
leads to Fourier series. First, we will derive the time-dependent heat equation
and then solve it when t = 0, which is known as the steady-state heat equation.
9
2.2.1 Derivation of heat equation
∂u ∂u
(x0 , y0 + h/2, t) − (x0 , y0 − h/2, t)
∂y ∂y
where κ > 0 is the conductivity of the material. Let h → 0 and we have
!
∂H 2 ∂2u ∂2u
= κh + 2
∂t ∂x2 ∂y
Therefore, we have the following equation and arrange into the time-dependent
heat equation. !
∂H 2 ∂u 2 ∂2u ∂2u
= σh = κh + 2
∂t ∂t ∂x2 ∂y
σ ∂u ∂2u ∂2u
= + 2 (2.9)
κ ∂t ∂x2 ∂y
Steady state means the heat is in equilibrium and no heat flow, which gives the
steady-state heat equation.
∂2u ∂2u
+ 2 =0 (2.10)
∂x2 ∂y
10
Let’s consider the equation in a unit disc. This is known as the Dirichlet prob-
lem. Although we can solve the equation in rectangular coordinates, it is too
complicated to consider the boundary conditions. Therefore, we first change to
polar coordinates. Using chain rules, we have the following results.
∂u ∂u ∂x ∂u ∂y ∂u ∂u
= + = −r sin θ + r cos θ
∂θ ∂x ∂θ ∂y ∂θ ∂x ∂y
∂u ∂u ∂x ∂u ∂y ∂u ∂u
= + = cos θ + sin θ (2.11)
∂r ∂x ∂r ∂y ∂r ∂x ∂y
∂2u ∂2u ∂2u
2
= cos2 θ 2 + sin2 θ 2 (2.12)
∂r ∂x ∂y
∂2u ∂u ∂u 2
2
2 ∂ u 2
2
2 ∂ u
= −r cos θ − r sin θ + r sin θ + r cos θ (2.13)
∂θ2 ∂x ∂y ∂x2 ∂y 2
∂2u ∂u ∂2u
r2 2
+r =− 2
∂r ∂r ∂θ
The left and right hand sides are each function of r and θ, so the equality holds
only when they are both equal to λ, a constant.
dF dF
= e−x
dr dr
11
d2 F dF d2 F
2
= −e−2x + e−2x 2
dr dx dx
Thus,
d2 F
− λF = 0
dx2
dx
We further substitute u = dF and the equation is now easy to solve.
d2 F 1 du 1 dF du 1 du
2
=− 2 =− 2 =− 3
dx u dx u dx dF u dF
d2 F
= λF
dx2
Two simple solutions for F are rm and r−m when m 6= 0, 1 and log r when
m = 0. Note that F (r) is unbounded at the origin if the exponential is negative
which disobeys our intuition. We assume positive exponential. Following the
same reason, we reject m = log r.
F (r) = r|m|
We yield solutions in the forms of um (r, θ) = r|m| eimθ As the heat equation is
linear, we superimpose these solutions as an infinite sum. Again, we leave the
convergence to be proven in the next section.
∞
X imθ
u(r, θ) = am r|m|e (2.15)
m=−∞
∞
X
u(1, θ) = am eimθ
m=−∞
Exactly the same mechanism works for any harmonic functions as all of them
satisfy Laplace’s equation (2.8). Therefore, we will be seeing this solution in
Fourier series also in many other fields such as electromagnetism, fluid dynamics
and complex analysis.
12
2.3 Schrödinger equation
~2 2 ∂Ψ
∇ Ψ + V Ψ = i~ (2.16)
2m ∂t
We solve it using separation of variables. Write the wave function as the follow-
ing and substitute into (2.16)
~2 1 2
2m ψ ∇ ψ +V =E
(2.17)
i~ T1 dT
dt =E
Let’s consider the problem in V = 0 and in one dimension, that is, ψ = ψ(x).
The solutions are then given by
T = e−iEt/~
2mE
ψ = A sin kx + B cos kx, k 2 =
~2
Ψn (x, t) = (An sin nx + Bn cos nx)e−iEn t/~
∞
X
Ψ(x, t) = (An sin nx + Bn cos nx)e−iEn t/~
n=1
13
Example 2.2 (Particle in a box). We need to solve Schrödinger equation with
V = 0 on (0, π) and Ψ vanishes at x = 0 and x = π. Substitutions give the
following results.
∞
X
Ψ(x, t) = An (sin nx)e−iEn t/~ (2.18)
n=1
Moreover, given Ψ(x, t) = f (x), we can find An exactly the same as in (2.7).
Recall that in geometry, we discompose a vector into basis vectors and solve
for the coefficients by taking their inner products. Similarly, the essence of
Fourier series is to discompose a function into exponential or trigonometric
functions(which forms basis in Hilbert Space) and the coefficients can be cal-
culated by taking the integral of their products(which is the definition of inner
product for functions). We will explore the convergence of the series and make
these intuitions into rigorous definitions in the rest of the article.
Fourier analysis is the study of how general functions can be decomposed into
trigonometric or exponential functions with definite frequencies.There are two
types of Fourier expansions:
In the following sections, we will introduce basic Fourier series and Fourier
transform and investigate some detailed questions about Fourier analysis.
The first question is that, what does ”well-behaved” in the above text mean for
a function? In other words, what are the restrictions on the functions that can
be decomposed into Fourier series?
Next, we will investigate the uniqueness of Fourier series. Are two functions
with the same Fourier coefficient necessarily equal?
14
After that, we will take a closer look at the partial sum of Fourier series and
try to solve the question arisen in the last section: in what sense does SN (f )
converge to f as N → ∞?
Finally, to better apply Fourier series to solve mathematics and physics prob-
lems, we will introduce convolution, good kernels and Cesàro and Abel summa-
bility.
By Euler’s Theorem, where eiθ = cos θ+i sin θ, we can also write a trigonometric
series in exponential form
∞
2πinx
X
cn e L
n=−∞
If a trigonometric series involves only finitely many non-zero terms, that is,
cn = 0 for all large |n|, it is called a trigonometric polynomial; its degree
is the largest value of |n| for which cn 6= 0.
Definition 3.2. Consider a function f (x) that is periodic on the interval [0, L],
then the Fourier series of F can be expressed as
∞
X 2πnx 2πnx
a0 + (an cos ( ) + bn sin ( )) (3.1)
n=1
L L
where Z L
1
a0 = f (x)dx,
L 0
Z L
2 2πnx
an = f (x) cos ( )dx,
L 0 L
Z L
2 2πnx
bn = f (x) sin ( )dx.
L 0 L
15
3.2 Fourier Exponential Series
By Euler’s theorem, given a function f (x) which is periodic on [0, L], its Fourier
series can also be written as
∞
2πinx
X
cn e L
n=−∞
Remark. Even though we are given the Fourier series of f , we cannot say for
sure that they are necessarily equal, so we will temporarily use ∼ to represent
the relations between them, written as
∞
2πinx
X
f (x) ∼ cn e L .
n=−∞
N
2πinx
X
SN (f )(x) = fˆ(n)e L .
−N
3.3 Restrictions on f
16
XN
U =[ sup f (x)](xj − xj−1 )
j=1 xj−1 ≤x≤xj
and
XN
L=[ inf f (x)](xj − xj−1 ),
xj−1 ≤x≤xj
j=1
Remark. From the definition of Riemann integrability, we can see that the func-
tion is not necessarily continuous. It can be everywhere continuous, piecewise
continuous (meaning that it has finitely many discontinuities), or even with
infinitely many discontinuities as long as it is Riemann integrable.
Finally, we say that a complex-valued function is integrable if its real and imagi-
nary parts are integrable. Also, the sum and product of two integrable functions
are integrable.
17
restrict it to any interval of length 2π
2π periodic functions functions on an 2π-interval
extend it to a periodic function on R
Figure 2: relations
Example 3.1 (Sawtooth function). Find the Fourier series for the periodic
function. It is defined by f (x) = Ax for −L L
2 < x < 2 and it has period L.
The answer to this question could be quite long but is a great example to show
the detailed process of finding both Fourier trigonometric and exponential series.
Z L
2 2 2πnx
bn = Ax sin dx
L −L
2
L
Z Z
1 1
x sin(rx)dx = x − cos(rx) − − cos(rx)dx
r r
x 1
= cos(rx) + 2 sin(rx)
r r
AL
=− cos(πn)
πn
AL
= (−1)n+1 .
πn
18
Therefore, the Fourier trigonometric series of f is
∞
AL X 1 2πnx
f (x) ∼ (−1)n+1 sin
π n=1 n L
We can also obtain the Fourier series using exponentials. By definition of Fourier
coefficients, we get
Z L
1 2
cn = Axe−i2πnx/L dx (3.3)
L −L
2
Z
x 1
xe−rx = − e−rx − 2 e−rx .
r r
This is valid unless r = 0, in which case the integral equals x = x2 /2. For the
R
A xL −i2πnx/L L L 2 −i2πnx/L L
cn = − e 2
−L + e 2
−L .
L i2πn 2 i2πnx 2
The second of these terms yields zero because te limits produce equal terms.
The first term yields
A (L/2)L −iπn
+ eiπn .
cn = − · e
L i2πn
The sum of the exponentials is simply 2(−1)n . So we have (getting the i out of
the denominator)
iAL
cn = (−1)n forn 6= 0.
2πn
L
If n = 0, then the integral yields c0 = (A/L)(x2 /2)|−2 L = 0. Basically, the area
2
under the curve is zero since Ax is an odd function. Putting everything together
gives the Fourier exponential series,
19
X iAL i2πnx/L
f (x) ∼ (−1)n e .
2πn
n6=0
The sum runs over all the integers (positive or negative) with the exception of
0.
Example 3.2. Define f (θ) = (π − θ)2 /4 for 0 ≤ θ ≤ 2π. Then successive
integration by parts similar to that performed in the previous example yield
∞
π 2 X cos nθ
f (θ) ∼ + . (3.4)
12 n=1 n2
Actually this solves the Basel problem, which asks for the precise summation
of the reciprocals P
of the squares of the natural numbers, i.e. the precise sum of
∞
the infinite series n=1 .
Let θ = 0. we get
π2
f (0) =
4
∞
π2 X 1
= + ,
12 n=1 n2
∞
X 1 π2
2
= .
n=1
n 6
Dirichlet kernel is of the fundamental importance of the whole theory. Its Fourier
coefficients an have the property that an = 1 if |n| ≤ N and an = 0 otherwise.
Theorem 3.1. Dirichlet kernel can be expressed as
20
sin (N + 21 )x
DN (x) = .
sin(x/2)
N
X −1
X
ωn and ωn
n=0 n=−N
1 − ω N +1 ω −N − 1
and .
1−ω 1−ω
∞
X
Pr (θ) = r|n| einθ .
n=−∞
Proof. Similar to the previous proof, we first split Pr (θ) into two parts,
∞
X ∞
X
Pr (θ) = ωn + ω̄ n with ω = reiθ ,
n=0 n=1
where both series converge absolutely. The first sum (an infinite geometric
progression) equals 1/(1 − ω), and likewise, the second is ω̄/(1 − ω̄).
21
Together, they combine to give
1 − ω̄ + (1 − ω)ω̄ 1 − |ω|2 1 − r2
= = .
(1 − ω)(1 − ω̄) |1 − ω|2 1 − 2r cos θ + r2
If f and g have the same Fourier coefficients, then f and g are necessarily equal.
By taking the difference f − g, this proposition can be reformulated as : if
fˆ(n) = 0 for all n ∈ Z, then f = 0.
Theorem 3.3. Suppose that f is an integrable function on the circle with
fˆ(n) = 0 for all n ∈ Z. Then f (θ0 ) = 0 whenever f is continuous at the
point θ0 .
Our contradiction aims to prove that when assuming fˆ(n) = 0, f (θ0 ) 6= 0 gives
a contradiction. Therefore when fˆ(n) = 0, there should be f (θ0 ) = 0 whenever
f is continuous at the point θ0 .
• First we put restrictions on f . Choose 0 < δ ≤ π/2, so that f (θ) > f (0)/2
whenever |θ| < δ.
• Then we define p(θ). Let
p(θ) = + cos θ
where > 0 is chosen so small that |p(θ)| < 1 − /2 whenever δ ≤ |θ| ≤ π.
• Choose η > 0 with η < δ so that
p(θ) ≥ 1 + for |θ| < η.
2
22
• Finally, Let pk (θ) = [p(θ)]k .
O η δ π π x
2
Since Z π
1
fˆ(n) = f (θ)e−inθ dθ = 0,
2π −π
Therefore, we have
Z π Z π N
X
pk (θ)f (θ) dθ = f (θ) c¯n e−inθ dθ
−π −π −N
N
X Z π (3.5)
= c¯n f (θ)e−inθ dθ
−N −π
= 0.
Also, our choice of δ guarantees that p(θ) and f (θ) are non-negative whenever
|θ| < δ, thus:
23
Z
f (θ)pk (θ) dθ ≥ 0, (3.7)
η≤|θ|≤δ
and Z k
f (0)
f (θ)pk (θ) dθ ≥ 2η 1+ . (3.8)
|θ|<η 2 2
This corollary can be derived from Theorem 3.3 and it indicates the uniqueness
of Fourier series.
Theorem 3.5. Suppose that f is a continuous function on the circle and that
the Fourier series of f is absolutely convergent. Then, the Fourier series
converges uniformly to f , that is,
P∞ P∞
Proof. Let g(x) = n=−∞ fˆ(n)einx . Since n=−∞ |fˆ(n)| < ∞, g(x) is defined
everywhere.
24
Since the P
Fourier series of f is bounded, for any > 0, we can find large N
such that |n|>N |fˆ(n)| < . Hence, SN (f )(x) converges uniformly to g(x) as
N → ∞. Note that for large N , the Fourier coefficients of g(x) are
Z 2π
1
ĝ(n) = g(x)e−inx dx
2π 0
Z 2π
1 X
fˆ(n)einx e−inx dx
=
2π 0 n≤N
= fˆ(n).
Definition 3.8. The Hölder condition of f of order α, with α > 1/2 indicates,
We say fˆ(n) = O(1/|n|2 ) as |n| → ∞ when the left hand side is bounded by
a constant multiple of the right hand side, i.e. there exists C > 0 such that
|fˆ(n)| ≤ C/|n|2 . Particularly, f (x) = O(1) means that f is bounded.
25
Corollary 3.6. Suppose that f is a twice continuously differentiable function
on the circle. Then
−1 2π 00
Z
ˆ
2π f (n) = 2 f (θ)e−inθ dθ
n 0
Therefore,
Z 2π Z 2π
00 00
2π|n|2 |fˆ(n)| ≤ f einθ dθ ≤ |f (θ|dθ ≤ C,
0 0
There are also stronger versions of 3.6. Generally, the Fourier series of f con-
verges absolutely if f satisfies Hölder condition with α > 1/2.
26
3.7 Convolutions
Z π
1
f ∗ g (x) = f (y)g(x − y) dy. (3.9)
2π −π
= (f ∗ DN )(x),
27
Theorem 3.9. Suppose that f , g, and h are 2π-periodic integrable functions.
Then:
f ∗ (g + h) = (f ∗ g) + (f ∗ h).
(cf ) ∗ g = c(f ∗ g) = f ∗ (cg) for any c ∈ C.
f ∗ g = g ∗ f.
(f ∗ g) ∗ h = f ∗ (g ∗ h).
f ∗ g is continuous.
∗ g(n) = fˆ(n)ĝ(n).
f[
Proof. The first two properties, as the linearity of convolutions, can be proved
by the linearity of the integral.
The fifth and the sixth properties are extremely important. The fifth one shows
that f ∗ g is more ”regular” than f and g, meaning that without restrictions
on the continuity of f and g, only assuming that they are Riemann integrable,
f ∗ g is continuous.
We will first separate the proof into two parts. First we assume that f and g
are continuous, we may write
Z π
1
(f ∗ g)(x1 ) − (f ∗ g)(x2 ) = f (y)[g(x1 − y) − g(x2 − y)] dy.
2π −π
28
Now it comes to the sixth property, which is essential for interpreting the con-
volution. It shows relation between convolution and pointwise product. The
proof is quite simple
Z π
1
∗ g(n) =
f[ (f ∗ g)(x)e−inx dx
2π −π
Z π Z π
1 1
= f (y)g(x − y) dy e−inx dx
2π −π 2π −π
Z π Z π
1 1
= f (y)e−iny g(x − y)e−in(x−y) dx dy
2π −π 2π −π
Z π Z π
1 1
= f (y)e−iny g(x)e−inx dx dy
2π −π 2π −π
ˆ
= f (n)ĝ(n).
Property (a): The area under the kernels is always fixed, specifically, 2π. We
can rewrite it as
Z π
1
KN (x)dx = 1, for all natural numbers N
2π −π
This property gives us an intuition that good kernels act like weight distribution
function. Moreover, the weighted average of good kernels is always 1, indicating
that good kernels assign unit mass to the whole circle. The “unit” doesn’t mean
1, but a fixed value, in this case 2π.
Property (b): The integral of the absolute value of good kernels is bounded,
Z π
1
|KN (x)|dx ≤ M, for some M ∈ R.
2π −π
A trivial result is that if the good kernel is positive for all x ∈ [−π, π], this prop-
erty is actually a consequence of the previous one. In this case, the infimum of
M is 1.
29
Property (c): As N increases, the tail of good kernels gets slimmer. Mathe-
matically, it can be expressed as followed:
Z
0 0
∀ > 0, ∀δ > 0, ∃N ∈ N, such that N ≥ N =⇒ |KN (x)|dx < .
δ<|x|≤π
The weight is more concentrated near the origin and to be more extreme, the
good kernel will look alike a continuous straight line segment on the x-axis,
except for the discontinuity at the origin, of which the value will be infinity.
Usually, we call this function as Dirac Delta function.
Theorem 3.10. For any function f on the circle, given a family of good kernels
{KN (x)}∞
N =1 , we have
Also, f is integrable on the circle, then ∃B ∈ R, such that |f (x)| ≤ B. Hence for
any two points on the good kernels, the difference between their values cannot
be greater than 2B. Also, say the integral of the absolute value of good kernels
is bounded by M .
Now let’s take a look at the difference between the convolution and the function
30
itself.
Z π
1
(KN ∗ f )(x) − f (x) = f (x − y)KN (y)dy − f (x)
2π −π
Z π
1
= f (x − y) − f (x) KN (y)dy, by the first property of good kernels
2π −π
Z
1
= f (x − y) − f (x) KN (y)dy
2π δ≤|y|≤π
Z
1
+ f (x − y) − f (x) KN (y)dy
2π |y|<δ
Z
1
≤ f (x − y) − f (x) KN (y) dy
2π δ≤|y|≤π
Z
1
+ f (x − y) − f (x) KN (y) dy
2π |y|<δ
Z Z
1 1
≤ 2B KN (y) dy + KN (y) dy
2π δ≤|y|≤π 2π |y|<δ
2B
< + M, for all N ≥ N 0 for some N 0
2π
2π
2B M
= + −→ 0
2π 2π
Here is another way to understand this theorem. Since the weight of good
kernels concentrates on the origin. Therefore, for the following integral
Z π
1
(f ∗ Kn )(x) = f (x − y)Kn (y)dy
2π −π
If the Dirichlet kernel was a good kernel, then the theorem could apply to it and
as a consequence, the Fourier series, which is the convolution of the Dirichlet
kernel and a given function would converge to itself at points of continuity.
However, this is demonstrated to be false.
31
Another reason can be the Dirichlet kernel fails to satisfy the second property,
i.e. the Dirichlet kernel is not uniformly bounded.
Z π
4
|Dn (x)|dx = 2 lnN + O(1) −→ ∞
−π π
Proposition 3.12. The Dirichlet kernel satisfies the property (a) of good ker-
nels.
PN
Proof. Nevertheless, using the exponential form of DN = n=−N einx , the
integral of itself is 2π, satisfying property (a).
Z π Z π N
1 1 X
DN (x)dx = einx dx
2π −π 2π −π n=−N
N Z π
1 X
= einx dx
2π −π
n=−N
N π Z π
1 X 1 inx 1
= e + 1dx
2π in −π 2π −π
n=−N,n6=0
1
=0+ [π − (−π)]
2π
=1
This can be applied to diverging sequences, especially ones that are not divergent
to +∞ or −∞, which gives a more precise average value of sequences.
Example 3.5. One of the most classic examples is Grandi’s series.
32
In terms of the Dirichlet kernel, we define its Cesàro mean of the Dirichlet
kernels, what we call, the Fejér kernel, as followed,
Proof.
N −1
1 X
FN (x) = Dk (x)
N
k=0
N −1
1 X sin(k + 21 )x
=
N
k=0
sin 21 x
N −1
1 1 X 1
= sin k + x
N sin 21 x 2
k=0
N −1
1 X 1 1
= 2 1 2 sin (k + )x sin x
2N sin 2 x 2 2
k=0
N −1
1 X
= cos kx − cos(k + 1)x
2N sin2 21 x k=0
1
= (1 − cos N x)
2N sin2 21 x
2 Nx
1 sin 2
=
N sin2 ( x2 )
(3.10)
Start from proving the Fejér kernel satisfies the three properties.
33
Property (a):
Z π Z π N −1
1 1 1 X
FN (x) dx = Dk (x) dx
2π −π 2π
−π N k=0
N −1 Z
1 X π
= Dk (x) dx
2N π
k=0 −π
N −1 Z π !
1 X 1
= Dk (x) dx
N 2π −π
k=0
N −1
1 X
= 1=1
N
k=0
(3.11)
sin2 (N x/2)
Property (b): Since FN (x) = N1 sin2 (x/2) , trivially, the numerator and the de-
nominator are always non-negative, we therefore can deduce that property (b)
is satisfied.
Property (c): As the function sin2 (x) is not only bounded above but also con-
tinuous, given an arbitrary δ > 0, ∀|x| ∈ [δ, π] we have 0 < cδ ≤ sin2 ( x2 ).
Then
1 sin2 N x/2
Z Z
1 1
2 dx ≤ dx → 0
δ≤|x|≤π N sin (x/2) δ≤|x|≤π N cδ
as N → ∞.
A trivial result from the last theorem is about the convergence of the convolution
of the Fejér kernel and functions. Details will be discussed in the following
theorem.
Theorem 3.14. For any integrable 2π-periodic function f , the Fourier series
of f is Cesàro summable to f at every point of continuity, in other words,
the convolution of Fejér kernel and f is convergent to f . Moreover, if f is
continuous everywhere, the convergence is uniform.
34
Hence, the Fourier series of f is Cesàro summable to f .
Corollary 3.15. If the nth Fourier coefficients of an integrable function f are
0 for all natural numbers n, then f = 0 at all points of continuity.
Proof.
N
1 X ˆ
SN (f )(x) = f (n)einx
2π
n=−N
N
1 X
= 0
2π
n=−N
=0
By Fejér’s theorem,
σN (f )(x) = 0 → f
for all points of continuity of f . If f is continuous, f = 0 everywhere.
Understanding this corollary can be started from another view: Since f is Rie-
mann integrable, it is continuous almost everywhere. S∞ That is, for a fixed
> 0, the set of discontinuity points is covered by n−1 Un , where Un is an
open interval and the total length of Un is less than .
Corollary 3.16. If f is continuous on the circle,f can be uniformly approxi-
mated by trigonometric polynomials.
PN
Proof. SN (f )(x) = 1
2π n=−N fˆ(n)einx is a trigonometric polynomial, then so
is σN (f )(x).
35
If limr→1 A(r) converges to some constant s ∈ R, we say the series is Abel
summable and the limit of the Abel mean is s.
Notice that we take the absolute value of n is because we want it to fit in the
definition of Abel summability of series which requires the power of r to be non-
negative. Hence, we don’t simply write ck = ak eikθ , but ck = ak eikθ + a−k einθ .
P∞
Theorem 3.17. Let n=0 an be a series and the Abel mean is A(r), converging
for every r ∈ [0, 1). If the series is Abel summable to s, and an = o( n1 ), then
we have
X∞
an = s.
n=0
1
rN := 1 − .
N
The little o-notation indicates that limn→∞ nan = 0, so
Our aim is to show that for arbitrary > 0, there exists N ∈ N and N ≥ M ,
PN P∞ n
such that limr→1− n=0 an − n=0 an r < . This is equivalent to show that
36
PN P∞
limN →∞ n=0 an − n=0 an (rN )n = 0.
N
X ∞
X N
X ∞
X
an − an (rN )n = an (1 − (rN )n ) − an (rN )n
n=0 n=0 n=0 n=N +1
M
X N
X ∞
X
= an (1 − (rN )n ) + an (1 − (rN )n ) − an (rN )n
n=0 n=M +1 n=N +1
M
X N
X ∞
X
≤ an (1 − (rN )n ) + an (1 − (rN )n ) + an (rN )n
n=0 n=M +1 n=N +1
= S1 + S2 + S3
N
X
S2 = (1 − rN ) an (1 + rN + ... + (rN )n−1 )
n=M +1
N
X
< (1 − rN ) an n
n=M +1
< (1 − rN )N
1
= (1 − 1 + )N
N
=
37
∞
X
S3 = an (rN )n
n=N +1
∞
(rN )n
X
= an n
n
n=N +1
∞
X (rN )n
<
n
n=N +1
∞
X
< (rN )n
N +1
n=N +1
∞
X
< (rN )n
N + 1 n=0
1
= <
N + 1 1 − rN
by the definition of rN = 1 − N1 .
Therefore, we are able to deduce that for arbitrary < 0, by choosing suitable
PN P∞ n
N , we have n=0 an − n=0 an (rN ) < . Then the proof is finished.
1 − r2
Pr (θ) =
1 − 2r cos(θ) + r2
38
Proof.
−1
X X∞
Pr (θ) = r|n| einθ + r|n| einθ + 1
n=∞ n=1
X∞ ∞
X
= rn e−inθ + rn einθ + 1
n=1 n=1
X∞ n ∞
X n
= r e−iθ + r eiθ + 1
n=1 n=1
r e−iθ r eiθ
= + +1
1 − r e−iθ 1 − r eiθ
1 − r2 1
= , since cos θ = (eiθ + e−iθ )
1 − 2r cos θ + r2 2
We then prove it meets the three requirements of good kernels.
Property (a):
Z π Z π ∞
1 1 X
P r(θ)dθ = r|n| einθ dθ
2π −π 2π −π n=−∞
∞ Z π !
1 X |n|
= r einθ dθ
2π n=−∞ −π
∞ Z π
1 X 1
= r|n| 0 + 1 e0 dθ
2π 2π −π
n=∞,n6=0
=1
Property (b): The expression of P r(θ) and the condition that 0 < r < 1 tell us
P r(θ) ≥ 0, which implies property (b) is satisfied.
Property (c): Since we are considering the case when r is close to 1, so we can
assume that 12 < r < 1, then
39
Theorem 3.19. The Fourier series of an integrable function on the circle is
Abel summable to f at every point of continuity. If the function f is continuous
on the circle, then the Fourier series is Abel summable uniformly to f .
P∞
Proof. The Abel mean of SN (f )(x) is Ar(f )(x) = n=−∞ an r|n| einθ . Since
P r(x) is a good kernel, then
We have already talked a lot about convergence of the Fourier series at the point
of continuity, but what happens at the point of discontinuity? We will briefly
introduce this phenomenon.
There are some “damped oscillations” of the truncated Fourier series near the
discontinuity points. As n increases, the oscillating intervals are getting nar-
rower, but the overshoot remains the same size about 9%. Equivalently, from
the other side of the jump, the truncated Fourier series undershoot about 9%.
Hence in total, the partial Fourier series is 18% larger than the discontinuity of
the function. Moreover, as n increases, the truncated Fourier series at the jump
converges to the midpoint of the jump.
Informally, the Fourier series has the big “jump” at the discontinuity because
most of the amplitudes of sine and cosine overlap at that point in a perfect
way. But since the truncated Fourier series has only finite terms of sine and
cosine, the amplitudes cannot cancel out perfectly near the jump, leading to a
damped oscillation. As more terms are added, there will be more cancellations,
hence the truncated Fourier series can approximate the function better, which
explains why the damping intervals become narrower.
We now will check if the solution to the steady-heat equation we obtained before
is true and unique.
40
Theorem 3.20. Let f be an integrable function on the unit circle (the heat
distribution function on the unit circle). Then the heat distribution function u
defined in the unit disc
u(r, θ) = (f ∗ Pr )(θ)
has the following properties:
Proof. Property 1: We should first notice that in the unit disc problem, we
defined r ∈ [0, 1), with 1 excluded. However, when we prove the sequence of
derivatives of functions converges to the derivatives of limit of function sequence,
we need the function variables to be defined in a compact set. P∞As a result, we
should introduce ρ s.t. 0 < ρ < 1. Then ∀r ∈ [0, ρ], u(r, θ) = n=−∞ an r|n| einθ
can be differentiated once, twice and so on term by term.
=0
41
Both of the solutions equal to f (θ) at the disc boundary, where u1 − u2 = 0. So
now you probably have a picture in your mind: the boundary has value 0 and
the derivative over the disc is also 0. By intuition, you may guess the whole
disc has zero-value of u. We finish the proof if we use this lemma:
For any point p in the unit disc, any subdiscs at p contained in the unit disc, we
call them Dp and the average value of u in Dp is u(p). The proof of this lemma
is out of the scope of this section so we won’t look much into it.
Then the final result we get is that the whole disc will have u = 0, which is
a contradiction. Therefore, we cannot have two solutions unless they are equal.
In this section, we will focus on the convergence of the Fourier series. The
convergence of the Fourier series are divided into two parts: The mean-square
convergence on the entire interval and the pointwise convergence in separate
intervals.
In this section, we will discover the overall behavior of a function f on the entire
interval [0, 2π]. We will first lay out the definitions and notations that will be
used throughout this section.
42
• Associative law: For all vectors u, v, w ∈ V, u + (v + w) = (u + v) + w
• Additive identity: The set V contains an additive identity element, de-
noted by 0, such that for any vector v ∈ V , 0 + v = v and v + 0 = v.
• Existence of negatives: For every u ∈ V , there is a vector in V , written
−u and called the negative of u, which has the property that u+(−u) = 0.
• Associativity of multiplication: (ab)u = a(bu) for any a, b ∈ F and u ∈ V .
• Distributivity: (a + b)u = au + bu and a(u + v) = au + av for all a, b ∈ F
and u, v ∈ V
• Unitarity: 1u = u for all u ∈ V
Definition 4.2. An inner product space is a vector space V over the field
F together with an inner product, i.e., with a map (. , .) : V × V → F that
satisfies the following three properties for all vectors x, y, z ∈ V and all scalars
αβ ∈ F :
Given an inner product (·, ·) we may define the norm of X by kXk = (X, X)1/2
The presence of an inner product on a vector space allows one to define the
geometric notion of “orthogonality.” Let V be a vector space (over R or C)
with inner product (·, ·) and associated norm k.k Two elements X and Y are
orthogonal if (X, Y ) = 0, and we write X ⊥ Y . Three theorems can be derived
from this notion of orthogonality:
Theorem 4.1. The Pythagorean theorem: if X and Y are orthogonal, then
2 2 2
kX + Y k = kXk +kY k (4.1)
Proof.
2
kX + Y k = (X + Y, X + Y ) = (X, X) + (X, Y ) + (Y, X) + (Y, Y ) (4.2)
2 2
= (X, X) + (Y, Y ) = kXk +kY k (4.3)
43
Proof. If kY k =0, we want to show that for all X, (X,Y )=0. For all real t we
have:
2 2
0 ≤ kX + tY k = kXk + 2tRe(X, Y ) (4.5)
So if Re(X, Y ) 6= 0 contradicts the inequality if we pick some big enough t.
Then we have Re(X, Y ) = 0. Similarly, we have Im(X, Y ) = 0.We finished the
case for kY k =0.
If kY k =
6 0,we may set c = (X, Y )/(Y, Y ); then X − cY is orthogonal to Y , and
therefore also to cY . If we write X = X − cY + cY and apply the Pythagorean
theorem, we get
2 2 2 2
kXk = kX − cY k +kcY k ≥ |c|2 kY k (4.6)
So we have
2 2 2
kX + Y k ≤ kXk + 2kXkkY k +kY k = (kXk +kY k)2 (4.10)
44
Theorem 4.4. The vector space l2 (Z) over C is the set of all (two-sided) infinite
sequences of complex numbers
such that X
|an |2 < ∞ (4.12)
n∈Z
Proof. We will first prove that l2 (Z) is a vector space. That is, if A and B are
two elements in l2 (Z), then A + B is an element in l2 (Z). We assume
Which we have an = 0 when |n| > N Then according to the triangle inequality,
we have
kAN + BN k ≤ kAN k +kBN k ≤ kAk +kBk (4.16)
According to the definition of the norm, we have
X 2
|an + bn |2 = kAN + bN k ≤ (kAk +kBk)2 (4.17)
|n|≤N
When N → ∞, X
|an + bn |2 < ∞ (4.18)
n∈Z
45
Lemma 4.5. : The vector space l2 (Z) is complete
for all M. PM
For each k > N and M , we let k 0 → ∞ to get n=1 |ak,n − bn |2 < 2 . Since
M is arbitrary, one has shown that kAk − Bkl2 < as k ≥ N , where B =
(..., b−1 , b0 , b1 , ...). Also, b ∈ l2 (Z). Since is arbitrary, Ak → B in l2 (Z) as
k→∞
Z 2π
1 ¯
(f, g) = f (θ)g(θ)dθ (4.23)
2π 0
46
Proof. It is easy to find that the analogue of the Cauchy-Schwarz and triangle
inequalities hold in this example.
For condition (i), it fails since kf k = 0 implies only that f vanishes at its points
of continuity.
which C is a constant
Rb
Therefore, we have a ((log(θ))2 dθ) → 0 Thus, for all > 0, there exist a N > 0,
for n > m > N ,
Z 1/m Z 1/m
1 1
fn (θ) − fm (θ) = ( (log(1/θ))2 dθ)1/2 = ( (logθ)2 dθ)1/2 <
2π 1/n 2π 1/n
(4.27)
showing that {fn }∞n=1 is a Cauchy sequence in R and lim n→∞ nf = f . However,f ∈
/
R, since it is not bounded.
47
Consider the space R of integrable functions on the circle with inner product
Z 2π
1
(f, g) = f (θ)g(θ)dθ (4.28)
2π 0
Proof. When n = m,
Z 2π
1
(en , em ) = einθ eimθ dθ (4.30)
2π 0
Z 2π
1
= ei(n−m)θ dθ (4.31)
2π 0
Z 2π
1
= 1dθ (4.32)
2π 0
=1 (4.33)
When n 6= m,
Z 2π
1
(en , em ) = einθ eimθ dθ (4.34)
2π 0
Z 2π
1
= ei(n−m)θ dθ (4.35)
2π 0
i(n−m)θ
1 e
= |2π (4.36)
2π i(n − m) 0
1 ei(n−m)2π − e0
= (4.37)
2π i(n − m)
1 1+0−1
= (4.38)
2π i(n − m)
=0 (4.39)
48
Besides for the family {en }n∈Z , we can find the Fourier coefficient can be rep-
resented by an inner product:
Z 2π
1
an = f (θ)einθ dθ = (f, en ) (4.40)
2π 0
P
Proof. We will first show (f − |n|≤N an en ) ⊥ en
X X
(f − an en , en ) = (f, en ) − ( an en , en ) (4.43)
|n|≤N |n|≤N
= an − an (4.44)
=0 (4.45)
P P
So for any complex number bn , we have (f − |n|≤N an en ) ⊥ |n|≤N bn en
P P
Therefore if we choose an as bn , we have (f − |n|≤N an en ) ⊥ |n|≤N an en .
Then by the Pythagoras theorem, we have
2 2
2
X X
kf k = f − an en + an en (4.46)
|n|≤N |n|≤N
49
Proof. Let bn =an -cn , so we have
X X X
f− cn en = f − an en + bn e n (4.49)
|n|≤N |n|≤N |n|≤N
When cn = an , bn = 0,
X
f − (f − cn en ) = f − SN (f ) (4.52)
|n|≤N
When cn 6= an ,
2
Z 2π
X 1
bn en = |bn en |2 dθ > 0 (4.53)
2π 0
|n|≤N
Therefore, X X
(f − an en ) ⊥ (f − bn e n ) (4.54)
|n|≤N |n|≤N
Proof. Apply Lemma from section 3 and choose a continuous function g on the
circle such that
sup |g(θ)| ≤ sup |f (θ)| = B (4.56)
θ∈[0,2π] θ∈[0,2π]
50
and
Z 2π
|f (θ) − g(θ)|dθ < 2 (4.57)
0
Where C is a constant
kg − P k < (4.63)
2 2 X
kf k = (f − SN (f )) + |an |2 (4.64)
|n|≤N
51
4.2 Pointwise Convergence
For a small δ the function F is integrable on [−π, −δ] ∪ [δ, π], therefore the
function F is integrable on all of [−π, π].
Also,
t
tDN (t) = sin((N + 1/2)t) (4.69)
sin(t/2)
t
And the quotient sin(t/2) is continuous in the interval [−π, π], so
Then we apply the Riemann-Lebesgue Lemma to the functions F (t)t cos(t/2)/ sin(t/2)
and F (t)t:
Z π
1
F (t)t cos(t/2)/ sin(t/2)) sin(N t)dt → 0 (4.71)
2π −π
and
Z π
1
F (t)t cos(N t)dt → 0 (4.72)
2π −π
52
Therefore,
Z π
1
SN (f )(θ0 ) − f (θ0 ) = F (t)tDN (t)dt (4.73)
2π −π
Z π
1
= F (t)(t cos(N t) + t cos(t/2)/ sin(t/2) sin(N t))dt
2π −π
(4.74)
Z π
1
= F (t)t cos(t/2)/ sin(t/2)) sin(N t)dt (4.75)
2π −π
Z π
1
+ F (t)t cos(N t)dt (4.76)
2π −π
→0 (4.77)
In this section, we will investigate among all simple closed curves of length `,
in the plane R2 , which one encloses the largest area. The intuition tells us the
answer should be a circle, because if the region enclosed by the curve is concave,
then we can reflect the concave part to be convex and we can convince ourselves
that the flatter the curve is in some portion, the less efficient it is in an enclosing
area. We are going to prove it is the case rigorously later.
5.1.2 Curves
In this section, we only focus on curves that are closed and simple. This means
the curves don’t intersect themselves and the endpoints are joined. For example,
a square and a circle are both closed and simple.
For any point on the simple closed curve Γ, we define its orientation by a
parametrized expression, γ(s) : [a, b] → R2 .
53
x(s) and y(s) denote the x and y axis positions respectively. For simplicity, we
limit γ to be in class C 1 , that is, γ is continuously differentiable.
Theorem 5.1 (Green’s theorem). Given a closed simple curve C in the vector
field f~(x, y) = P (x, y)~i+Q(x,y)~j and the region R bounded by C, where the
position vector along the curve C is defined as ~r(t) = x(t)~i + y(t)~j, then closed
curve line integral of C in the vector field is
I ZZ
~ ∂Q ∂P
f · d~r = − dxdy
C ∂x ∂y
R
54
g(x) and f (x). WLOG, say C1 and C2 are the upper and the lower half of the
curve, and the direction along C is anticlockwise. Then,
I Z b Z a
f~ · d~r = f~(x, g(x))d~r + f~(x, f (x))d~r
C a b
Z b Z b
= f~(x, g(x))d~r − f~(x, f (x))d~r
a a
Z b h i
=−
~ d~r
f~ x, f (x) x, g(x)
a
Z b
f (x)
=− f~(x, y) g(x) · d~r
a
Z b
f (x)
=− P (x, y) g(x) dx
a
!
Z b Z f (x)
∂P
=− dy dx
a g(x) ∂y
ZZ
∂P
=− dydx
∂y
R
Now suppose the vector field only has ~j-component f~ = Q(x, y)~j, and repeat.
RR RR
We know the area of the region R is A = 1dA = 1 dydx. In order to use
R R
Green’s theorem, we need
∂Q ∂P
− = 1.
∂x ∂y
Easily, we can choose Q := x2 and P := − y2 , then
y x
f~(x, y) = − ~i + ~j,
2 2
and the closed line integral in this vector field is
I I 1I
~ y~ x ~ ~ ~
f · d~r = − i + j · dxi + dy j = xdy − ydx
C C 2 2 2 C
55
To avoid negative values of the area, we put absolute value sign over the integral.
I
1
A= xdy − ydx
2 C
`2
A≤
4π
and the equality holds if and only if Γ is a circle.
Proof. Let ` = 2π and the aim is now to show the curve enclosed the largest
area is the unit circle. General cases can be done by rescaling.
Since we have shown that the Fourier series can be differentiated term by term,
hence their derivatives can be written as
∞
X ∞
X
x0 (s) ∼ inan eins and y 0 (s) ∼ inbn eins ,
n=−∞ n=−∞
and
x0 (s) = x0 (s) and y 0 (s) = y 0 (s)
56
as x(s) and y(s) are real functions from which we deduce that an eins +a−n e−ins
is real, and hence an = a−n
∞
X ∞
X
x0 (s) ∼ (inan )eins and y 0 (s) ∼ (inbn )eins
n=−∞ n=−∞
∞
X
A=π n(an bn − bn an ) .
n=−∞
Apparently, |n| < |n|2 if n 6= −1, 0 and 1, hence it is without difficulty to deduce
that
∞
X
A≤π n2 (an bn − bn an ) . (∗)
n=−∞
∞
X ∞
X ∞
X
n2 (an bn − bn an ) ≤ n2 (an an + bn bn ) = n2 (|an |2 + |bn |2 ) = 1, (∗∗)
n=−∞ n=−∞ n=−∞
P∞
recalling Parseval’s identity, which implies that n=−∞ n2 (|an |2 + |bn |2 ) =
|x0 (s)2 + y 0 (s)2 | as n2 |an |2 is the nth-Fourier coefficient of x0 (s) and n2 |bn |2 is
that of y 0 (s).
To show the curve enclosed the greatest area is the unit circle starts from as-
suming the area A is π. In this case, (*) and (**)must be equality, so there
must be finite terms. Then only n = −1, 0 and 1 are valid, so write
and
1
X 1
X
n(an bn − bn an ) = |n|2 (|an |2 + |bn |2 ) = 1,
n=−1 n=−1
57
So (|a−1 |2 + |b−1 |2 ) + (|a1 |2 + |b1 |2 ) = 1, and we deduce that |a1 | = |a−1 | = 12
and |b1 | = |b−1 | = 12 . Since we don’t know if they are real values, then we can
write them as
1 1
a1 = eiα and b1 = eiβ .
2 2
P1
The relationship between α and β is connected by | sin (α − β)| = 1 as n=−1 n(an bn − bn an ) =
| − a−1 b1 + b1 a1 + a1 b1 − b1 a1 | = 2|a1 b1 − a1 b1 | = 1, then eα−β − e−α+β = 1,
hence |sin(α − β)| = 1. So α − β = kπ + π2 and k is any integer.
We now look at the application of Fourier series into number theory, starting
with the following definitions.
Definition 5.1. For x ∈ R, its integer part [x] is defined as the greatest
integer less than x. Its frictional part hxi is then defined as x − [x]. For
a, b ∈ R, they are equivalent module over Z if and only if their frictional
parts are equal, denoted by
a ≡ b mod Z
58
is to 2nπ for any n, the closer cos k is to 1. So the problem is now if 2nπ can
be as close as to an integer, that is, having a frictional part that is as small as
possible. Equivalently, we want to prove that
The integer part of a real might not be the closest integer to it, but in fact, the
above statement can be changed to be stronger
Remark. The left hand side refers to the proportion of the terms in the interval
(a, b) among ξ1 , ξ2 , ξ3 , ..., ξN . The right hand side is the proportion of the
length of the interval (a, b) in [0, 1). Informally, ξn is the microcosm of [0, 1)
such that its cover rate of the interval (a, b) is the same as that of [0, 1).
It is not hard to discover that if γ is rational, the sequence repeats itself and
obviously cannot be equidistributed, but what about γ is irrational? This is
what Weyl’s equidistribution theorem discusses.
Theorem 5.3. hnγi is equidistributed in [0, 1) if γ is irrational.
Proof. Before we start the proof, we need a few lemmas. The definition of
equidistribution involves the countability of a set, which is hard to use in our
59
proof. Our first job is to translate it into analytical language. For fixed (a, b) ⊂
[0, 1), we define its characteristic function χ(a,b) : [0, 1) → R as
1, x ∈ (a, b)
χ(a,b) (x) =
0, x ∈ [0, 1) − (a, b)
This function works as indicative variable that tells whether the input is in
(a, b). We extend it to be periodic 1. Now we can translate the countability
into
N
X
| {ξ1 , ξ2 , ..., ξN } ∩ (a, b)| = χ(a,b) (nγ)
n=1
Moreover, the RHS is translated into
Z 1
χ(a,b) (x) dx
0
Proof. Now we are free to apply Fourier series in this question. We will first
prove that the statement holds for all exponentials, therefore all sums of expo-
nentials that are Fourier series of the continuous function. Initially, we compute
the LHS when f (x) = e2πikx for k ∈ N ∪ {0}. The case when k = 0 is obvious
so we assume k 6= 0.
N
1 X 2πikγ n e2πikγ 1 − e2πikN γ
(e ) =
N n=1 N 1 − e2πikγ
60
From section 3, we know continuous functions f (x) can be approximated by
trigonometric polynomials, that is, there exists trigonometric polynomials P (x)
such that |f (x) − P (x)| < /3, so we have
N
1 X
f (nγ) − P (nγ) < /3 (5.3)
N n=1
Z 1
f (nγ) − P (nγ) dx < /3 (5.4)
0
N Z 1 N
1 X 1 X
f (nγ) − f (x) dx ≤ f (nγ) − P (nγ) +
N n=1 0 N n=1
Z 1
f (nγ) − P (nγ) dx+
0 (5.5)
N Z 1
1 X
P (nγ) − P (x) dx
N n=1 0
<
To prove that the statement also applies to χ(a,b) , we need to bound it with
two continuous functions. As in the figure. They agree with χ(a,b) (x) but for
intervals of length 2. So,
f+
f−
Figure 4: approximate f
61
Z 1 Z 1
b − a − 2 ≤ f− (x) dx and f+ (x) dx ≤ b − a + 2 (5.6)
0 0
N N N
1 X − 1 X 1 X +
f (nγ) ≤ χ(a,b) (nγ) ≤ f (nγ)
N n=1 N n=1 N n=1
N Z 1
1 X +
f (nγ) → f+ (x) dx as N → ∞
N n=1 0
Therefore,
Z 1 N Z 1
1 X
f− (x) dx ≤ lim χ(a,b) (nγ) ≤ f+ (x) dx
0 N →∞ N 0
n=1
Since this is true for all , the limit exists and tends to a − b, proving Theorem
5.1.
We can generalize Theorem 5.1 that says (5.1) holds for χ(a,b) (x). In fact, (5.1)
for all integrable functions on [0, 1).
Corollary 5.5. (5.1) holds for merely integrable f of period 1 and irrational γ,
Proof. The proof is quite similar to that of Theorem 5.1, so we will just outline
it. Define functions fU (x) = supxn <x<xn+1 f (x) for x ∈ (xn , xn +1) and fL (x) =
inf xn <x<xn+1 f (x) for x ∈ (xn , xn + 1). We now have f bounded by fU and fL .
Z 1 Z 1 Z 1
fL (x) dx ≤ f (x) dx ≤ fU (x) dx
0 0 0
62
Moreover, recall the definition of integrability and we have
Z 1 Z 1
fU (x) dx − fL (x) dx <
0 0
We now come back to the example in the beginning, where we are to prove that
We can assume there is 0 that doesn’t satisfy the above statement and hξn i ≥
for all n. Also, ξn satisfies the following for all (a, b) ⊂ [0, 1).
Take (a, b) = (0, ) and the left hand side is 0. However, the right hand side is
obviously not, a contradiction.
χ(a,b) is periodic and if the above equation holds for exponentials, we can deduce
that it also holds for χ(a,b) and prove equidistribution. However, can we go
reversely? This question leads to the following theorem.
Theorem 5.6. A sequence of reals ξ1 , ξ2 , ξ3 , ... in [0,1) is equidistributed if and
only if for all integers k 6= 0,
N
1 X 2πinkξn
e →0 as N → ∞ (5.8)
N n=1
63
P P
Since we have discussed one direction of the theorem, we will focus on the other
one when ξn is equidistributed and we want to deduce (5.8).
Let the slope of the ray be γ. Consider the grids formed by the reflections. In
fact, if γ is rational, the path repeats. If γ is irrational, the path is equidis-
tributed in the rectangle.
64
Remark. Imagine a group of parallel lines that ”equally fill” the plane. Each of
them can be uniquely represented by the intercept with an axis. We can general-
ize that the equidistribution of these lines is the equidistribution of coordinates
of these intercepts on the axis. In this case, the reflection of rays produce two
group of parallel lines(labelled blue and red in figure 6) and actually both of
them are equidistributed given irrational slope. As in the figure, each red line
has an intercept with the axis and if these red lines ”equally fill” the rectangle,
their intercepts will be equidistributed along the axis.
The proof of theorem can be split into two parts. First start with continuity
Lemma 5.8. If 0 < α < 1, then the function
∞
X n
fα (x) = f (x) = 2−nα e−i2 x
(5.10)
n=0
is continuous.
65
Then we come to the vital part of this theorem, it is nowhere differentiable.
an einx ,then
P
We suppose g(x) ∼
X
SN (g)(x) = an einx ,
n≤N
and
According to the figures,we find out the delayed means are essentially equal to
the partial sums.
66
1
−N 0 N
an einx
P
Figure 7: SN (g)(x) = |n|≤N
−N 0 N
P |n| iex
Figure 8: σN (g)(x) = |n|≤N (1 − N )an e
Proof. We have
∆2N (f ) = S2N (f )
∆N (f ) = SN (f )
Therefore,
∆2N (f ) − ∆N (f ) = S2N (f ) − SN (f )
n
= 2−nα ei2 x
Then, we will use the proof by contradiction to finish the proof of the theorem.
We assume f 0 (x0 ) exists for some x0
Lemma 5.10. Let g be any continuous function that is differentiable at x0 .
Then, the Cesaro means satisfy σN (g)0 (x0 ) = O(log N ), therefore
67
1
−2N −N 0 N 2N
|n|
an eiex + iex
P P
Figure 9: ∆N (g) = |n|≤N N ≤|n|≤2N 2(1 − 2N )an e
Z π Z π
0
|σN (g) (x0 )| = | FN0 (t)[g(x0 − t) − g(x0 )]dt| ≤ C ||FN0 (t)||t|dt
−π −π
Also,
1 sin2 (N t/2) 0
|FN (t)0 | = | |
N sin2 (t/2)
sin(N t/2) cos(N t/2) 1 cos(t/2) sin2 (N t/2)
= 2 −
sin (t/2) N sin3 (t/2)
C2
≤ 2
|t|
68
By using these two inequalities above, we have
Z Z
0 0
|σN (g) (x0 )| ≤ C |FN (t)||t|dt + C |FN0 (t)||t|dt
|t|≥1/n |t|≤1/n
Z Z
1
≤ CC1 dt + CC1 N dt
|t|≥1/n |t| |t|≤1/n
= O(log N ) + O(1)
= O(log N )
Now we know
∆2N (f )0 (x0 ) − ∆N (f 0 )(x0 ) = O(log N )
The investigation of heat flow is a vital part of the application of Fourier analysis.
This brings us a special partial differential equation, the heat equation.
Definition 5.3. Consider an infinite metal plate which we model as the plane
R2 , and suppose we are given an initial heat distribution at time t = 0. Let the
temperature at the point (x, y) at time t be denoted by u(x, y, t).
Consider a small square centered at (x0 , y0 ) with sides parallel to the axis and
of side length h. Using skills of calculus and the mean value theorem, we find
the time-dependent heat equation:
σ ∂u ∂2u ∂2u
= + 2 (5.16)
κ ∂t ∂x2 ∂y
where σ > 0 is a constant called the specific heat of the material and κ > 0 is
the conductivity of the material.
69
When the whole system reaches thermal equilibrium and ∂u
∂t = 0, the time-
dependent heat equation reduces to the steady-state heat equation
∂2u ∂2u
+ 2 =0 (5.17)
∂x2 ∂y
u=0
If we rewrite the Laplacian in polar coordinates and apply the chain rule, we
get
∂ 2 u 1 ∂u 1 ∂2u
u= 2
+ + 2 2
∂r r ∂r r ∂θ
∂ 2 u 1 ∂u 1 ∂2u ∂2u
u= + + = (5.18)
∂r2 r ∂r r2 ∂θ2 ∂(rθ)2
Model the ring by the unit circle. A point on the circle is described by its angle
∂2u ∂2u
θ = 2πx, where the variable x lies between 0 and 1. Then we get ∂(rθ) 2 = ∂x2 .
∂u ∂2u
=c 2
∂t ∂x
where the constant c is a positive physical constant which depends on the ma-
terial of the ring.
70
∂ ∂
(L, t) = (−L, t)
∂x ∂x
To solve this partial differential equation, we will need a useful technique, sep-
aration of variables.
Firstly, separate variables and look for special solutions of the form
u(x, t) = A(x)B(t).
Plugging this into the two sides of the heat equation gives
∂
(x, t) = A(x)B 0 (t)
∂t
∂2
(x, t) = A00 (x)B(t)
∂x2
Therefore
B 0 (t) A00 (x)
=
B(t) A(x)
The left side of this equation is a function only of the variable t, while the
right side is a function only of the variable x. The equality exists only if these
functions are constant functions. Therefore, we deduce that there must exist a
constant λ such that
B 0 (t) A00 (x)
= =λ
B(t) A(x)
B 0 (t) = λB(t),
71
Theorem 5.12. Let k be a square root of the constant c. If f is a solution of
the equation f 00 (x) = cf (x), then there exists constants α and β such that
The key to the proofs of these theorems is to define a new function based on
the equation to prove, say φ(x), and use this to investigate the relation between
f (x) and ekx .
B(t) = aeλt
and √ √
A(x) = αe λx
+ βe− λx
.
√ √
(α − β)(e λx
− e− λx
)=0
Since x is a variable in the interval [0, 1], we get α = β.
Also, since the point lies on the unit circle, A must be periodic of period 1,
which means
A(x) = A(x + 1)
√ √ √ √ √ √
e λx
+ e− λx
=e λx
e λ
+ e− λx − λ
e
√ √
λ(x+1) − λx
e =e
This is not possible just in R since the value of x varies in the interval [0, 1].
√
As a result, we can deduct that λ ∈ C and assume that λ = −k 2 where k is a
constant.
eik(x+1) = e−ikx
eik(2x+1) = 1.
So the only possibility is k = 2πn and λ = −4π 2 n2 where n ∈ Z.
72
Then a question arises: do we have u(x, t) → f (x) as t tends to 0? For a better
understanding of our above solution and to investigate this problem, we will
define the heat kernel in the following section.
Definition 5.4. The heat kernel for the circle, written as Ht , is given by
∞
2
n2 t 2πinx
X
Ht (x) = e−4π e ,
−∞
u(x, t) = (f ∗ Ht )(x).
2 2
Using the property of convolution, û = fˆĤt = fˆ(x)e−4π n t , combined with
Parseval’s identity, we get
1
2
n2 t 2
X
||u(x, t) − f (x)||2 = |fˆ(ω)|2 |1 − e−4π |
n=0
2
n2 t
When t → 0, (1 − e−4π ) → 0, meaning that ||u(x, t) − f (x)||2 → 0 and we
are done!
The heat kernel can actually be considered as an analogue of the Poisson kernel.
A change of variables in (5) leads to the solution
X 2
u(θ, τ ) = an e−n τ einθ = (f ∗ hr )(θ)
73
of the equation
References
[1] Elias M. Stein and Rami Shakarchi. Fourier Analysis, an introduction.
Princeton University Press, New Jersey 08540.
74