Modular Forms Updated
Modular Forms Updated
Sarah Zerbes
2
Contents
0 Prologue 4
3 Hecke operators 44
3.1 Double cost operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
3.2 The Hecke algebra of Γ1 (N ) . . . . . . . . . . . . . . . . . . . . . . . . . 48
3.2.1 Diamond operators . . . . . . . . . . . . . . . . . . . . . . . . . . 50
3.2.2 Hecke operators on q-expansions . . . . . . . . . . . . . . . . . . . 52
3.2.3 The Hecke algebra . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.3 The Petersson product . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
3.4 Hecke operators and the Petersson product . . . . . . . . . . . . . . . . . 63
3.5 Old and new modular forms . . . . . . . . . . . . . . . . . . . . . . . . . 67
3.6 L-functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
3.6.1 Basic definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73
3.6.2 L-functions of cusp forms . . . . . . . . . . . . . . . . . . . . . . 75
3.6.3 Relation to elliptic curves . . . . . . . . . . . . . . . . . . . . . . 77
3
0 Prologue
Example 0.0.1. Let z ∈ C, =(z) > 0. Let q = e2πiz and define Ramanujan’s tau
function Y
∆(z) = q · (1 − q n )24 .
n∈N
This is one of the simplest examples of a modular form. Note that we can ”multiply
out” the product above which leads us to
X
∆(z) = τ (n)q n
n∈N
(2) Ramanujan proved in 1916 that the integers τ (n) satisfy the equation
X
τ (n) = d11 mod 691.
d|n
(3) Ramanujan also conjectured τ (nm) = τ (n)τ (m) for n, m coprime. This was proved
by Mordell in 1917.
(4) In 1972 Swinnerton-Dyer proved τ (n) satisfies congruences like (2) modulo 2, 3, 5,
7, 23 and 691 but no other primes.
(5) Ramanujan conjectured in 1916 for p prime holds |τ (p)| < 2 p11/2 . This was proved
in 1974 by Deligne.
(6) The quantity
τ (p)
∈ [−1, 1]
2p11/2
√
is distributed in the interval [−1, 1] with density function proportional to 1 − x2 .
This was conjectured by Sato and Tate (1960s) and proved by Barnet-Lamb, Ger-
aghty, Harris and Taylor in 2009 using Bau Chau Ngo’s Fundamental Lemma which
got Ngo the 2010 Fields Medal.
4
Example 0.0.3. We now consider another modular form
∞
Y
f (z) = q (1 − q n )2 (1 − q 11n )2
n=1
= q − 2q 2 − q 3 + 2q 4 + q 5 + 2q 6 + . . .
X∞
0
= a(n)q n with a(n) ∈ N
n=1
Theorem.
It turns out that this modular form is closely related to the elliptic curve
E : Y 2 + Y = X 3 − X 2 − 10X − 20.
For p prime, denote by N (p) the number of points on the elliptic curve in Fp . It is easy
to see heuristically tat N (p) ' p.
The theory of modular forms allows one to prove that the elliptic curve E and the
modular form f ‘correspond’ to each other in the following sense:
a(p) = p − N (p).
In particular, using the properties of the modular form f , we can easily calculate the
quantity N (p) for all p, so f ‘knows’ about the behaviour of the elliptic curve over Fp .
We say that the elliptic curve E is modular. It is generally not too difficult to attach
an elliptic curve to a modular form (this is called ”Eichler–Shimura”); however, it is
very difficult indeed to reverse this process, and this is the basis of Andrew Wiles’ work
on Fermat’s Last Theorem. The proof of this result was later completed by Breuil–
Conrad–Diamond–Taylor. I will talk a bit more about this when we discuss L-functions
of modular forms.
5
1 The modular group
1.1 The upper half-plane
Definition 1.1.1. Let H = {z ∈ C : =(z) > 0} the upper half-plane.
Proposition 1.1.2. The special linear group SL2 (R) = {A ∈ GL2 (R) : det(A) = 1}
acts on H via
a b az + b
.z = .
c d cz + d
Proof. For z ∈ H is =(z) > 0 and either c or d is nonzero, so cz + d 6= 0. Moreover
az + b 1
= = = ((az + b)(cz + d)) .
cz + d |cz + d|2
Say z = x + iy for x, y ∈ R.
az + b 1 2
= = = (ax + b)(cx + d) + acy +i (ad − bc) y
cz + d |cz + d|2 | {z } | {z }
∈R =1
1
= =(z) > 0
|cz + d|2
Therefore az+b
a b ∈ SL (R).
∈ H for any z ∈ H, 2
cz+d
1 0
c d
Also it is easy to check that 0 1 z = z and A(Bz) = (AB)z for any z ∈ H and for
any A, B ∈ SL2 (R). Thus SL2 (R) acts on H.
Note 1.1.3. The matrix −1 0
0 −1 ∈ SL2 (R) acts trivially on H, so the action of SL2 (R)
on H factors through the quotient PSL2 (R) = SL2 (R)/(±1), the projective special
linear group.
Definition 1.1.4. The automorphy factor is the function
j : SL2 (R) × H → C,
a b
(g, z) 7→ cz + d for g =
c d
Proposition 1.1.5. For any k ∈ Z, we can define a right action of SL2 (R) on the set
of holomorphic functions H → C given by
(f |k g) (z) := j(g, z)−k f (gz)
where f : H → C holomorphic, g = ac db ∈ SL2 (R). We will call this the weight k
action.
6
Proof. Firstly we need to show that f |k g is a well-defined holomorphic function H → C.
But this is obvious since cz + d 6= 0 and gz ∈ H for all z ∈ H. Clearly also the
equation f |k 1 = f holds. Therefore it remains to show (f |k g)|k h = f |k (gh) for arbitrary
g, h ∈ SL2 (R). The left hand side of the equation can be rewritten as
(f |k g)|k h = j(h, z)−k ((f |k g)(hz))
= j(h, z)−k j(g, hz)−k f (g(hz))
and the right hand side results in
f |k (gh) = j(gh, z)−k f ((gh)z).
We already know (gh)z = g(hz). So it remains to show j(gh, z) = j(h, z)j(g, hz). This
is the so called cocycle relation and can be checked easily.
7
Step 2. If z ∈ D and gz ∈ D, where g ∈ G, then one of the following holds:
1. g = ±Id
2. g = ±S and |z| = 1
4. g = ±ST = ± 01 −1 or g = ±T −1 S = ± −1 −1
or g = ±ST −1 S = ± −1 0
1 1 0 −1 −1
and z = ρ
5. g = ±T S = ± 11 −1 or g = ±ST −1 = ± 01 −1 or g = ±ST S = ± −1 0
0 −1 1 −1 and
z =ρ+1
Since there are no further cases this shows Step 2 (it remains to check the matrices in
case 4 and 5 – see exercise sheet 1) and therefore part (c) of the theorem.
8
Step 3. Let z ∈ D such that the stabilizer Gz of z is not ±Id. Then z = i, z = ρ or
z = ρ + 1.
Proof of Step 3. This follows directly from Step 2 by checking gz = z for all possible
g’s. Step 3 proves part (d) of the theorem.
Proof of Step 4. Let g ∈ G and let z be an arbitrary point of the interior of D. Then
gz ∈ H and by Step 1 exists g 0 ∈ G0 such that g 0 (gz) ∈ D. Moreover Step 2 implies that
either g 0 g ∈ {± Id} or z is on the boundary of D which is by assumption not the case.
Thus either g 0 g = Id or g 0 g = − Id. Since S 2 = − Id ∈ G0 , we deduce that g ∈ G0 , so
SL2 (Z) is generated by S and T . This proves part (a) of the theorem.
Remark 1.2.3. We have seen in the proof of Theorem 1.2.2 that SL2 (Z) is generated
by the elements S and T . These satisfy the relations
S 4 = Id (ST )3 = S 2 ,
and one can show that these generate all the relations, i.e. that
hS, T | S 4 , S −2 (ST )3 i
Remark 1.2.4. The set D is called the fundamental domain. The figure below
represents D itself and the transforms of D by some group elements of SL2 (Z). Part (c)
of the theorem shows that two sets gD and g 0 D where g, g 0 ∈ SL2 (Z) are either equal
(if g 0 = ±g) or only intersect along their edges. Furthermore part (a) implies that H is
covered by the sets {gD : g ∈ SL2 (Z)}: they form a tesselation of H.
9
1.3 Modular forms and modular functions
Definition 1.3.1. A weakly modular function of weight k and level 1 is a mero-
morphic function H → C such that f |k α = f for all α ∈ SL2 (Z), or equivalent
az + b
f = (cz + d)k f (z)
cz + d
for all z ∈ H and for all ac db ∈ SL2 (Z).
Note 1.3.2. Since SL2 (Z) is generated by the matrices S and T , it is sufficient to check
invariance under these two matrices, i.e. that
for all z ∈ H.
Lemma 1.3.3. There are no nonzero weakly modular functions of odd weight.
Proof. Let k be odd and let f be a weakly modular function of weight k. As shown
in (2) we have f (z) = f (z + 1) for all z ∈ H. Moreover we get f (z) = −f (z + 1) for
all z ∈ H, since f |k −1 −1
0 −1 = −f (· + 1). So f (z) = −f (z) and thus f (z) = 0 for all
z ∈ H.
q : H → C,
z 7→ exp(2πiz).
Note 1.3.4. Now let f be weakly periodic of weight k. Then f is periodic with period
1, so it can be written in the form
f (z) = f˜(exp(2πiz)),
Observe that the logarithm is multi-valued, but choosing a different value of the logar-
ithm is the same as adding an integer to log
2πi
q
. The periodicity of f hence implies that
f˜(q) does not depend on the chosen value of the logarithm.
10
Note 1.3.6. Any weakly modular function can be written as
∞
X
f (z) = an q n
n=−∞
for some an ∈ C where q = e2πiz ; we call this the q-expansion of f . This is just the
Laurent series of f˜ around q = 0, which converges for 0 < |q| < ε for ε sufficiently small
(⇔ =(z) 0)
Definition 1.3.7.
Note 1.3.9. If f and g are modular forms (resp. modular functions) of level 1 and
weights k and `, then the product f g is a modular form (resp. modular function) of
weight k + `.
11
Lemma 1.4.3. The series defining Gk (z) converges absolutely and uniformly on subsets
of H of the form
Rr,s = {x + iy : |x| ≤ r, y ≥ s}.
It hence converges to a holomorphic function on H.
For fixed m and n, we distinguish the cases |n| ≤ 2r|m| and |n| ≥ 2r|m|. In the first
case, we have
s2 2 s2
2 2
2 2 2 2 s s
|mz + n| ≥ m s ≥ m + 2
n ≥ min , 2 · (m2 + n2 ).
2 2(2r) 2 8r
s2 s2 1 2
c = min , , ,s ,
2 8r2 4
1 X 1
Gk (z) ≤ .
ck/2 (m2 + n2 )k/2
(m,n)6=(0,0)
We rearrange the sum by grouping together, for each fixed j = 1, 2, 3, . . . , all pairs (m, n)
with max{|m|, |n|} = j. We note that for each j there are 8j such pairs (m, n), each of
which satisfies
j 2 ≤ m2 + n2 .
Hence
∞ ∞
1 X 8j 8 X 1
|Gk (z)| ≤ k/2 = k/2 ,
c j=1
jk c j=1
j k−1
which is finite and independent of z ∈ Rr,s , so Gk (z) converges absolutely and uniformly
on Rr,s . Since every compact subset of H is contained in some Rr,s , this finishes the
proof by Proposition 1.4.2.
12
Remark 1.4.4. This proof clearly fails for k = 2. One can show that for k = 2, the
series (1.1) is conditionally but not absolutely convergent. We will come back to this
issue later in the course.
Proposition 1.4.5. For every even integer k ≥ 4, the function Gk is a modular form
of weight k and level 1. The q-expansion of Gk is given by
∞
2 · (2πi)k X
Gk (z) = 2 ζ(k) + · σk−1 (n)q n
(k − 1)! n=1
P∞ 1
dk−1 .
P
where ζ(k) = n=1 nk (the Riemann zeta function) and σk−1 (n) = d|n
∂
where q = e2πiz . Differentiating (k − 1) times with respect to z, and using that ∂z
=
∂
2πiq ∂q , leads to
∞
!
X −(k − 1)! ∂ k−1 X
= k−1 iπ − 2πi qn
n∈Z
(z + n)k ∂z n=0
∞
X
= −2πi (2πin)k−1 q n
n=1
∞
X
k
= −(2πi) nk−1 q n
n=1
(We are using here that k is even; for k odd we get an additional − sign.)
Hence we get
∞
X 1 (2πi)k X k−1 2πinz
tk (z) := = n e .
n∈Z
(z + n)k (k − 1)! n=1
13
Now we can split up the original sum of the function Gk into two parts, one where
m = 0 and one where m 6= 0. Afterwards we will simplify both parts using symmetry
(remember again that k is even) of the sums and the above formula:
X 1 X X 1
Gk (z) = k
+
n n∈Z
(mz + n)k
n∈Z\{0} m∈Z\{0}
∞ ∞ X
X 1 X 1
=2 k
+2
n=1
n m=1 n∈Z
(mz + n)k
∞
X
= 2ζ(k) + 2 tk (mz)
m=1
∞ ∞
(2πi)k X k−1 2πinmz
X
= 2ζ(k) + 2 n e
m=1
(k − 1)! n=1
∞ ∞
2 · (2πi)k X X k−1 nm
= 2ζ(k) + n q
(k − 1)! m=1 n=1
From there we obtain the proposed q-expansion by resorting the last sum:
∞
2 · (2πi)k X X k−1 l
Gk (z) = 2ζ(k) + d q
(k − 1)! l=1
d|l
| {z }
σk−1 (l)
Remark 1.4.7. The Bernoulli numbers are of great importance in mathematics. Barry
Mazur once said: “When a Bernoulli number sneezes, the tremors can be felt in all of
mathematics.”
Lemma 1.4.8. We have
Bk 6= 0 ⇔ k = 1 or k is even.
Proof. Exercise sheet 2.
Example 1.4.9. The first few non-zero Bernoulli numbers
1 1 1 1
B0 = 0, B1 = − , B2 = , B4 = − , B6 = ,
2 6 3 42
1 5 691
B8 = − , B10 = , B12 = − .
30 66 2730
14
Lemma 1.4.10. If k ≥ 2 is an even integer, then
(2πi)k Bk
ζ(k) = − .
2 · k!
Proof. Exercise sheet 2.
ResP (f /f 0 ) = vP (f ).
Proof. Exercise.
We recall without proof the following results from complex analysis:
Corollary 1.5.5. Let C(P, r, α) be an arc of a circle of radius r and angle α around a
point P . If g is holomorphic at P , then
Z
g(z)
lim dz = αi · g(P ).
r→0 C(P,r,α) z − P
15
The following result relates the contour integral of the logarithmic derivative of f to
the orders of f at the interior points:
Theorem 1.5.6. (Argument principle) Let f be a meromorphic function on an open
subset U ⊆ C, and let C be a contour in U not passing through any zeros or poles of f .
Then Z 0
f (z) X
dz = 2πi vP (f ).
C f (z) P ∈int(C)
Corollary 1.5.8. Let C(P, r, α) be an arc of a circle of radius r and angle α around a
point P . If f is meromorphic at P , then
f 0 (z)
Z
lim dz = αi · vP (f ).
r→0 C(P,r,α) f (z)
Now assume that f is a weakly modular funktion (of weight k and level 1).
Remark 1.5.9. Since f |k α = f for all α ∈ SL2 (Z), we have vαP (f ) = vp (f ). Hence
vP (f ) is well-defined for P being a SL2 (Z) orbit in H.
Moreover, if f is meromorphic at ∞, we can define the order of f at ∞ by
v∞ (f ) := v0 (f˜).
The following theorem is fundamental for studying the spaces of modular forms:
Theorem 1.5.10. (The valence formula) Let f 6= 0 be a modularfunction of weight k
and level 1. Then f has finitely many SL2 (Z)-orbits of zeros and poles in H, and
1 1 X k
v∞ (f ) + vi (f ) + vρ (f ) + vP (f ) = , (1.3)
2 3 P ∈W
12
where ρ = e2πi/3 and W is the set of all SL2 (Z)-orbits in H except the orbits of i and ρ.
Proof. Recall the fundamental domain from 1.2.2 and let C be the contour as shown in
the figure below. Here =(A) = =(E) = R (we will later let R → +∞) and the three
small circles have radius r. We assume that R is sufficiently large and r sufficiently small
that the interior of C contains all the zeros and poles of f except those at i, ρ, ρ + 1 and
∞.
Simplifying assumption: We assume for simplicity f has no zeros or poles on the
boundary of the fundamental domain, except possibly at i and ρ. (In the case where it
does contain zeros or poles of f , the contour has to be modified using additional small
arcs going around these zeros or poles in the counterclockwise direction.)
R 0 (z)
We will now calculate C ff (z) dz in two different ways and compose the results after-
wards.
16
(1) Computing the integral using Theorem 1.5.6, we get
f 0 (z)
Z X X
dz = 2πi vP (f ) = 2πi vP (f ),
C f (z) P ∈W
P ∈interior(C)
where W is the set described in the stated theorem. The last equality is satisfied
by the simplifying assumption, so the interior of the fundamental domain contains
exactly one representative of every pole or zero SL2 (Z)-orbit of H.
(2) Secondly, we estimate the integral by splitting up the contour in 8 parts. Let C1
be the part from E to A, C2 be the part from A to B, and so on, such that in the
end C8 is the part from D0 to E.
(i) Note that since f is a modular function, we have f (z) = f (z + 1). Hence also
f 0 (z) = f 0 (z + 1), and we have
f 0 (z) f 0 (z + 1) f 0 (z)
Z Z Z
dz = dz = − dz,
C2 f (z) C2 f (z + 1) C8 f (z)
so
f 0 (z) f 0 (z)
Z Z
dz + dz = 0.
C2 f (z) C8 f (z)
(ii) Now we consider C1 and change the variable by q(z) = e2πiz . This maps C1 to
a clockwise oriented circle around the origin with radius e−2πR . Furthermore
we have f (z) = f˜(q(z)), thus f 0 (z) = f˜0 (q(z)) q 0 (z) and since f is a modular
17
function, f˜ is meromorphic at 0. Therefore
Z ˜0
f 0 (z) f (q(z))q 0 (z)
Z
dz = dz
C1 f (z) C1 f˜(q(z))
f˜0 (q)
Z
= dq
˜
q(C1 ) f (q)
!
f˜0
= −2πi Res0
f˜
= −2πi v0 (f˜)
= −2πi v∞ (f ).
Similarly we get
f 0 (z)
Z
1
lim dz = − 2πi vρ (f )
r→0 f (z) 6
ZC3 0
f (z) 1 1
lim dz = − 2πi vρ+1 (f ) = − 2πi vρ (f ).
r→0 C7 f (z) 6 6
and thus
f 0 (z) f 0 (z)
Z Z
k
dz + dz = 2πi .
C4 f (z) C6 f (z) 12
Composing (i) to (iv) yields
Z 0
f (z) k 1 1
dz = 2πi − vρ (f ) − vi (f ) − v∞ (f ) .
C f (z) 12 3 2
Combining this with the result in (1) gives us exactly the proposed formula.
18
1.6 Applications to modular forms
The valence formula provides some interesting consequences to spaces of modular forms
which we will investigate below.
Definition 1.6.1. Let Mk be the set of all modular forms of weight k and level 1 and
let Sk be the set of all cusp forms of weight k and level 1.
Remark 1.6.2. It can be easily checked that these are both vector spaces over C.
Lemma 1.6.3.
Note 1.6.6. Since E4 and E6 are modular forms of weight 4 and 6, respectively, ∆ is
a modular form of weight 12. Since the q-expansion has zero constant coefficient, it is
indeed a cusp form.
Lemma 1.6.7. The modular form ∆ has a simple zero at ∞ and no other zeros.
Proof. Using the known q-expansions of E4 and E6 , one can compute the q-expansion
of ∆ as
so ∆ has a simple zero at ∞. Now since ∆ is a modular form, all the quantities v? (∆)
occurring in Theorem 1.5.10 are non-negative, so the only way to get equality is if there
are no zeros apart from the one at ∞.
19
Proposition 1.6.8. S12 is one-dimensional over C and spanned by ∆.
Proof. Let f ∈ S12 and define a function g by
f (i)
g(z) = f (z) − ∆(z).
∆(i)
This function is well-defined since ∆ does not vanish on H, so ∆(i) 6= 0. Clearly g ∈ S12
and g(i) = 0. Using the valence formula yields
1 1 X
v∞ (g) + vi (g) + vρ (g) + vp (g) = 1.
2 3 p∈W
But this is a contradiction since v∞ (g) ≥ 1 and vi (g) ≥ 1. Therefore g has to be zero
and
f (i)
f= ∆ ∈ C · ∆.
∆(i)
Corollary 1.6.9.
1. For all k ∈ Z, the map
Mk → Sk+12 , f 7→ f · ∆
is an isomorphism.
2. For k ≥ 4 we have Mk = Sk ⊕ (C · Ek ).
Proof. The first statement is trivial for k < 0 since then Mk = Sk+12 = {0} by Lemma
1.6.3 (a), (b). So let k ≥ 0. As ∆ is non-vanishing the given map is clearly an injection.
Now let g ∈ Sk+12 . Then ∆g is weakly modular of weight (k + 12) − 12 = k and
holomorphic on H since ∆ is non-vanishing. Furthermore v∞ (g) ≥ 1 by assumption, so
g
v∞ = v∞ (g) − v∞ (∆) = v∞ (g) − 1 ≥ 0.
∆
So ∆g ∈ Mk . Therefore the given map is also onto, thus bijectiv.
For the second part of the corollary we just have to note that Sk is the kernel of the
linear map Mk → C, f 7→ f (∞). Thus we have dim(Mk /Sk ) ≤ 1. On the other hand
we know that Ek ∈ Mk \ SK since Ek (∞) 6= 0. So Mk = Sk ⊕ (C Ek ).
Theorem 1.6.10.
(a) The space Mk is finite dimensional over C for all k ∈ Z.
(b) Let k ≥ 0 even. Then
( k
1 + 12 , k=6 2 mod 12,
dim(Mk ) = k
12
, k = 2 mod 12.
Otherwise Mk = {0}.
20
(c) A basis for Mk is given by {E4a E6b : a, b ∈ N0 , 4a + 6b = k}.
Proof. (a) This is a consequence of part (b).
(b) We will prove this by induction on k. First of all note that the statement is clear
for odd k since there aren’t any nonzero weakly modular functions of odd weight.
Moreover we already know that dim(M0 ) = 1, dim(M2 ) = 0 and dim(Mk ) = 0
for k < 0 by Lemma 1.6.3 (a) and (c). In addition we have dim(Mk ) = 1 for
k = 4, . . . , 10 since dim(Mk ) = dim(Sk ) + 1 by Corollary 1.6.9 and Sk = {0} for
these k’s by Lemma 1.6.3 (b). Hence the statement is true for k = 0, . . . , 10.
Let now k ≥ 12. Then
dim(Mk ) = dim(Mk−12 ) + 1
since dim(Sk ) = dim(Mk−12 ) by Corollary 1.6.9. So the statement is true for all k
by induction in steps of 12.
(c) We will use again induction to prove the statement. Note that there is nothing to
show for odd k, k < 0 and k = 2 since in these cases Mk = {0}. The case k = 0
is also trivial because M0 is the set of all constant functions, hence generated by
1 = E40 E60 .
Let now k ≥ 4 be even. Obviously there is always a pair (a, b) such that a, b ∈ Z≥0
and 4a + 6b = k. Pick such a pair. Let f ∈ Mk . Then f can be written in the form
f = λ E4a E6b + g
for some λ ∈ C and g ∈ Sk since the modular form E4a E6b is in Mk and does not
vanish at infinity. So there is an h ∈ Mk−12 such that g = h · ∆ by corollary 1.6.9
and by induction we may assume h to be a linear combination of E4r E6s where
r, s ∈ Z≥0 and 4r + 6s = k − 12. Hence
3
E4 − E62
h·∆=h·
1726
is a linear combination of E4r+3 E6s and E4r E6s+2 and since
4(r + 3) + 6s = 4r + 6(s + 2) = k
the function h is a linear combination of E4p E6q with 4p + 6q = k. So the linear
span of these functions contains g and hence also f . Therefore
Mk = span{E4a E6b : a, b ∈ N0 , 4a + 6b = k}.
To show that the given set is indeed a basis of Mk it suffices to check that
(a, b) ∈ Z2≥0 : 4a + 6b = k = dim(Mk ).
21
Example 1.6.11. For the first few values of k, the dimensions of Mk and Sk are given
by
k dim Mk dim Sk
0 1 0
2 0 0
4 1 0
6 1 0
8 1 0
10 1 0
12 2 1
14 1 0
16 2 1
Example 1.6.12. Both, E42 and E8 are in M8 . But dim(M8 ) = 1 by Theorem 1.6.10 (b).
Hence E42 and E8 are linearly dependent and as both are 1 at infinity, we can conclude
that E42 and E8 are equal. So
∞
!2 ∞
X X
1 + 240 σ3 (n)q n = E42 = E8 = 1 + 480 σ7 (n)q n
n=1 n=1
, so
n−1
X
σ7 (n) = σ3 (n) + 120 σ3 (m)σ3 (n − m).
m=1
This is very hard to prove (or even conjecture!) without using the theory of modular
forms.
Example 1.6.13. From the theorem, we deduce that
Note that these forms are linearly independent (exercise), so since dim(M30 ) = 3, they
form a basis.
The following theorem is a very useful consequence of the fact that the spaces of
modular forms are finite-dimensional:
Theorem 1.6.14. Let f be a modular form of weight k and level 1 with q-expansion
P∞ n
n=0 an q . Suppose that
Then f = 0.
22
Proof. Suppose that f 6= 0. Then the hypothesis implies that
v∞ (f ) ≥ bk/12c + 1 > k/12.
Hence the left-hand side of (1.3) is strictly greater than k/12, which gives a contradiction.
Corollary 1.6.15.
P∞ Letn f, g be
P∞ modular forms of the same weight k and level 1, with
q-expansions n=0 an q and n=0 bn q n , respectively. Suppose that
aj = bj for all j = 0, ..., bk/12c.
Then f = g.
Corollary 1.6.15 is a very powerful tool: it allows us to conclude that two modular
forms are identical if we only know a priori that their q-expansions agree to a certain
finite precision.
3
and E2 (z) = π2
· G2 (z).
Lemma 1.7.2.
1. The series in Definition 1.7.1 is convergent, but not absolutely convergent, and
defines a holomorphic function on H1 .
2. We have ∞
X
G2 (z) = 2ζ(2) − 8π σ1 (n)q n .
n=1
Proof. 1. Exercise.
2. Argue as in the proof of proposition 1.4.5.
Proposition 1.7.3. The functions G2 and E2 satisfies the transformation property
−2 1
z G2 − = G2 (z) − 2πiz, (1.4)
z
−2 1 6i
z E2 − = E2 (z) − . (1.5)
z πz
1
It is not a modular form, however: it can’t be, since M2 = {0}.
23
The proof of this result is based on the following lemma, which gives an example of
two double series that contain the same terms but sum to different values due to the
order of summation being different.
Lemma 1.7.4. For all z ∈ H, we have
XX 1 1
− = 0, (1.6)
m6=0 n∈Z
mz + n mz + n + 1
XX 1 1
2πi
− =− . (1.7)
n∈Z m6=0
mz + n mz + n + 1 z
Also, we have
XX 1
z −2 G2 (−1/z) = 2ζ(2)z −2 + (1.12)
m6=0 n∈Z
(nz − m)2
XX 1
= 2ζ(2) + (1.13)
m∈Z n6=0
(nz − m)2
XX 1
= 2ζ(2) + 2
; (1.14)
n∈Z m6=0
(mz + n)
24
note that in the second equality we just relabelled the parameters, but did not change
the order of summation.
Subtracting (1.7) and simplifying, we obtain
2πi XX 1
z −2 G2 (−1/z) + = 2ζ(2) + 2
, (1.15)
z n∈Z m6=0
(mz + n) (mz + n + 1)
and by imitating the proof of Lemma 1.4.3 one can show that the sum on the right-hand
side is absolutely convergent. We can hence change the order of summation, and we see
that (1.15) is equal to (1.11).
Corollary 1.7.5. The q-expansion of ∆ is given by
Y
∆=q (1 − q n )24 .
n≥1
Let D(z) = q · ∞ n 24
where q = e2πiz as usual. We can check that this
Q
n=1 (1 − q )
product converges sufficiently fast for D to be defined and holomorphic on H. Evidently
D(z + 1) = D(z) and D(z) → 0 as =(z) → ∞. So to check that it is a modular form of
weight 12 (clearly cuspidal), it suffices to show that D(− z1 ) = z 12 D(z). The result then
follow immediately, since we already know that S12 is 1-dimensional.
Recall that ∂d
∂z
∂
= 2πiq ∂q . Then
∞
!
∂ ∂ X
(log(D(z))) = log(q) + 24 log(1 − q n )
∂z ∂z n=1
∞
X −2πinq n
= 2πi + 24
n=1
1 − qn
∞ ∞
!
X X
= 2πi 1 − 24 nq n qr
n=1 r=0
∞ ∞
!
XX
nr
= 2πi 1 − 24 nq
n=1 r=0
∞
!
X
= 2πi 1 − 24 σ1 (n)q n
n=1
= 2πi E2 (z).
Hence finally
∂ D(−1/z) 1 1 12
log 12
= 2 2πi E2 − − − 2πi E2 (z)
∂z z D(z) z z z
2πi 1 2 6z
= 2 E2 − − z E2 (z) +
z z iπ
= 0.
25
So there is a constant λ sucht that D(− z1 ) = λz 12 D(z) for all z ∈ H. For z = i solves
this to D(i) = D(− 1i ) = λD(i), and since D(i) 6= 0 we have λ = 1, and therefore
D(− z1 ) = z 12 D(z).
We can now expand the product formula for ∆(z) as
X
∆(z) = τ (n)q n for some τ (n) ∈ Z.
n≥1
11
3. We have |τ (p)| ≤ 2p 2 for all primes p.
We will see a proof of properties 1) and 2) later in the course, in the section on Hecke
operators. Property 3) was proved by Deligne in 1974 as a consequence of his proof of
the Weil conjectures, for which he was awarded the Fields medal in 1978.
26
2 Modular forms of higher level
The idea is to look at functions transforming nicely under subgroups of SL2 (Z).
Note 2.1.2. Γ(N ) is the kernel of the group homomorphism induced by the reduction
map Z → Z/N Z:
πN : SL2 (Z) → SL2 (Z/N Z).
It is hence a normal subgroup of finite index. (Ex: show that πN is sujective. This
statement goes by the name of ”strong approximation for SL2 ”. It can be shown to be
false for GL2 (Z).)
Lemma 2.1.4. Any congruence subgroup has finite index in SL2 (Z).
Proof. It sufficies to show that [SL2 (Z) : Γ(N )] < ∞ for all N ∈ N. But this is clear as
SL2 (Z)/Γ(N ) ,→ SL2 (Z/N Z) and SL2 (Z/N Z) is finite.
Remark 2.1.5. The converse to Lemma 2.1.4 is false. There exist finite index Γ ⊆
SL2 (Z) which don’t contain Γ(N ) for any N . (For example there is one of index 7.) But
every finite index subgroup of SLn (Z) is congruence for n ≥ 3. So SL2 is quite unusual.
(Bass-Serre-Milnor theorem)
27
Note 2.1.7. We have a chain of inclusions
Γ(N ) ⊆ Γ1 (N ) ⊆ Γ0 (N ) ⊆ SL2 (Z).
These inclusions are in general strict; however, all of them are equalities for N = 1, and
Γ0 (2) = Γ1 (2).
Lemma 2.1.8. For N ≥ 1, we have
Y 1
[Γ1 (N ) : Γ(N )] = N, [Γ0 (M ) : Γ1 (N )] = N 1− ,
p
p|N
Y 1
[SL2 (Z) : Γ0 (M )] = N 1+ .
p
p|N
Definition 2.1.9. Let Γ be a congruence subgroup. We say that Γ is even (resp. odd)
if − Id ∈ Γ (resp. Id 6∈ Γ). We define the projective index of Γ to be
dΓ = [PSL2 (Z) : Γ̄],
where Γ̄ is the image of Γ in PSL2 (Z).
has the property that for any z ∈ H there exists γ ∈ Γ such that γz ∈ DΓ . Furthermore,
γ is unique up to multiplication by an element of Γ ∩ {± Id}, except possibly if γz lies
on the boundary of D. We call DΓ a fundamental domain for Γ.
Proof. If z ∈ H, thern there exists g ∈ SL2 (Z) and z0 ∈ D such that g.z = z0 . The coset
decomposition implies that we can express g uniquely as γ −1 γ 0 with γ ∈ Γ and γ 0 ∈ R.
We now have
γ.z = γg.z0 = γ 0 .z0 ∈ DΓ .
The uniqueness is left as an exercise.
Example 2.2.2. Let Γ = Γ0 (2). A system of representatives for the quotient Γ\ SL2 (Z)
is
1 0 0 −1 0 −1
, , = {Id, S, ST }.
0 1 1 0 1 1
Using this, one can draw the fundamental domain for Γ.
Note that there are now two points in its closure which do not belong to H: the cusp
∞, as well as 0.
28
Definition 2.2.3. The set P1 (Q), the projective line over Q, consists of Q ∪ {∞}.
We give this an action of SL2 (Z) via
a b ax + b
.x =
c d cx + d
a
where the right-hand-side is interpreted as c
if x = ∞, and as ∞ if cx + d = 0.
Proof. Clearly it sufficies to show that for any x ∈ P1 (Q) we can map ∞ to x. For
x = ∞ we have ∞.1 = ∞. So letx = ac with a, c ∈ Z coprime.
Then there are r, s ∈ Z
a −s a −s
such that ar + cs = 1, thus c r ∈ SL2 (Z) and c r .∞ = x.
Note 2.2.5. An easy computation shows that the stabiliser of ∞ in SL2 (Z) is the
subgroup
1 b
SL2 (Z)∞ = ± : b∈Z .
0 1
It follows from Proposition 2.2.4 that we hence have a bijection
Definition 2.2.6. For Γ ≤ SL2 (Z) of finite index we define the set of cusps of Γ,
denoted by Cusps(Γ), as the set of Γ-orbits in P1Q .
29
∈ Γ0 (p), so uv ∈ [∞]. Conversly, if γ ∈ Γ0 (p) then p
(1) If p divides v then uv −sr
divides the denominator of γ.∞ by definition. So the orbit of ∞ is given by all
fractions uv with p dividing the denominator v.
So this is everything and there are exactly two distinct orbits as claimed.
h ∈ Γt ⇔ h.t = t
⇔ hγt (∞) = γt (∞)
⇔ γt−1 hγt (∞) = ∞
⇔ γt−1 hγt ∈ SL2 (Z)∞
⇔ h ∈ γt SL2 (Z)∞ γP t−1.
30
Lemma 2.2.12. The subgroup
does not depend on the choice of representative for P , and it has finite index in SL2 (Z)∞ .
Proof. We first show that if we have elements γt and γ̃t in SL2 (Z) such that γt .∞ = t
and γ̃t .∞ = t, then
Note that γt−1 γ̃t fixes ∞, so it is an element in SL2 (Z)∞ , say γt−1 γ̃t = g ∈ SL2 (Z)∞ .
Then
Here, we get the last equality since γt−1 Γγt ∩ g SL2 (Z)∞ g −1 ⊆ SL2 (Z)∞ and hence is
commutative, so in particular its elements commute with g.
Suppose now that we choose another element t in the Γ-orbit of t, and let γt0 ∈ SL2 (Z)
such that γt0 .∞ = t0 . Then we can write γt0 = gγt for some g ∈ Γ which satisfies g.t = t0 .
Then
γt−1 −1 −1
0 Γγt0 = γt g Γgγt = γt−1 Γγt−1 ,
and hence
γt−1 −1
0 Γγt ∩ SL2 (Z)∞ = γt Γγt0 ∩ SL2 (Z)∞ .
Lemma 2.2.13. Let H be a subgroup of finite index in SL2 (Z)∞ , and let h be the index
of ±H in SL2 (Z)∞ . Then H is one of the following:
1 h
h( 0 1 )i
−1 h
H= 0 −1 = {(−1)t ( 10 th
1 ) : t ∈ Z}
1 h
{± Id} × h( 0 1 )i
Proof. Exercise.
31
Remark 2.2.15. If Γ is normal in SL2 (Z), the subgroup HP does not depend on the
cusp P , and hence all the cusps have the same width and regularity.
Example 2.2.16. Let us determine the width of the two cusps in Cusps(Γ0 (p)).
We now want to count the number of cusps for a given congruence subgroup. We need
the following group-theoretic result:
(i) For any x ∈ X, StabH (x) has finite index in StabG (x), and we have an injection
H\G H\X,
Hg 7→ Hg.x0
and for each x ∈ X, the cardinality of the fibre of this map over Hx equals the
index [StabG (x) : StabH (x)].
32
For (ii), the transitivity of the G-action on X implies that for all x ∈ X, we can choose
an element gx ∈ G such that gx .x0 = x, so the map H\G → H\X is surjective. Denote
by THx the fibre of this map over Hx, i.e.
THx ∼
= {Hg 0 ∈ H\G | Hg 0 gx .x0 = H.x}
= {Hg 0 ∈ H\G | Hg 0 .x = Hx}
= H\(H StabG (x))
∼
= StabH (x)\ StabG (x),
Remark 2.3.2. Let k be odd and Γ be even. Let f be a weakly modular function of
weight k and level Γ. By Lemmas 2.2.12 and 2.2.13 there is h ∈ N such that ± 10 h1 ∈ Γ,
so
f = f |k −1 −h
f = f |k 10 h1 = f (· + h) and 0 −1 = −f (· + h).
Example 2.3.3. Let f be weakly modular of level SL2 (Z) and weight k. Then f (N z)
is weakly modular of level Γ0 (N ) and weight k.
33
Proof. We have
az + b aN z + bN aN z + bN
f N =f =f c .
cz + d cz + d N
Nz + d
a Nb
a b
If c d ∈ Γ0 (N ) then c/N d ∈ SL2 (Z) and hence
aN z + bN c k
f c = (N z) + d f (N z) = (cz + d)k f (N z)
N
Nz + d N
2.4 q-expansion at ∞
Proposition 2.4.1. Let f : H → C be weakly modular of weight k and level Γ and let
h = hΓ (∞).
(z) = (±1)k f (z + h)
f (z) = f |k ± 1 h
0 1
for all z ∈ H.
If k is even then (±1)k = 1, so f = f (· + h), and if Γ is odd and ∞ is regular, then
1 h ∈ Γ, so we also have f = f (· + h). In both cases we can argue as in section 1.3.
0 1
If k is odd and Γ is odd but ∞ is irregular, then − 10 h1 ∈ Γ and therefore
f (z) = −f (z + h) ∀z ∈ H.
So we can argue for F as before and get f (z) = eπiz/h F̃ (qh (z)).
34
Definition 2.4.3. Let f : H → C be weakly modular of weight k and level Γ. We say
that f is meromorphic at ∞ if f˜ is meromorphic at 0. Similarly we define f to be
holomorphic at ∞ if f˜ is holomorphic at 0. If f is meromorphic at ∞, we define
1
v∞,Γ (f ) = min{n ∈ Z : a∞,n 6= 0.}
2
We then say f is vanishing at ∞ if v∞,Γ (f ) > 0. If f is holomorphic at ∞ we define
(
f˜(0) if k is even or if k is odd, Γ is odd and ∞ is regular
f (∞) =
0, if k is odd and Γ is odd and irregular at ∞.
1
Remark 2.4.4. To motivate the definition v∞,Γ (f ) = v0 (F̃ ) + 2
in the irregular case
note that the additional 12 term ensures
since this would fail for f , g with f (z) = eπiz/h f˜(qh ) and g(z) = eπiz/h g̃(qh ) without this
extra term. Moreover, note that in the irregular case f being holomorphic at ∞ implies
f vanishes at ∞.
Lemma 2.5.1. Let f : H → C be weakly modular of weight k and level Γ and let g ∈
SL2 (Z)∞ but not necessarily in H∞ . Then f |k g is meromorphic at ∞ if and only if f is.
Moreover v∞,g−1 Γg (f |k g) = v∞,Γ (f ) and (f |k g)(∞) = f (∞) if defined and if k is even.
Proof. We check that f |k g is indeed weakly modular of weight k and level g −1 Γg since
(f |k g) |k g −1 γg = (f |k γ) |k g = f |k g.
Moreover we have
h i h i
hg−1 Γg (∞) = SL2 (Z)∞ : g −1 H∞ g = SL2 (Z)∞ : H ∞
1 t
and g ∈ SL2 (Z)∞ .
since SL2 (Z)∞ is abelian
Now let g = ± 0 1 . Then
(
(±1)k f˜ e2πit/h q ,
if k is even or if k is odd, Γ is odd and ∞ is regular,
(f |k g)(z) = k it/h 2πit/h
.
(±1) e F̃ e q , if k is odd and Γ is odd and irregular at ∞.
35
Definition 2.5.2. Let f be weakly modular of weight k and level Γ. Let P ∈ Cusps(Γ)
be represented by an element t ∈ P1 (Q) and choose some γt ∈ SL2 (Z) such that γt .∞ = t.
Define vP,Γ (f ) = v∞,γt−1 Γγt (f |k γt ).
36
The proof of this will take us a while.
Definition 2.6.4. Let VΓ (f ) = z∈Γ\H nvzΓ(f )
P P
(z)
+ P ∈Cusps(Γ) vP,Γ (f ).
Hence we have
!
X vw (f ) X X 1 dΓ0 X vz (f )
= vz (f ) = .
nΓ0 (w) nΓ0 (w) dΓ nΓ (z)
w∈H\X z∈Γ\H w∈Γ0 \H z∈Γ\H
[w]=[z] mod Γ
Similarily we can argue at the cusps: If P ∈ Cusps(Γ) and Q ∈ Cusps(Γ0 ) which maps
to P under the natural map Cusps(Γ0 ) → Cusps(Γ), then we have by definition
hΓ0 (Q)
vQ,Γ0 (f ) = vP,Γ (f ).
hΓ (P )
Therefore we get again by Proposition 2.2.17
X X hΓ0 (Q) d Γ0
vQ,Γ0 (f ) = vP,Γ (f ) = vP,Γ (f ) .
hΓ (P ) dΓ
Q∈Cusps(Γ0 ) Q∈Cusps(Γ0 )
Q=P in Cusps(Γ) Q=P in Cusps(Γ)
and thus
X X X dΓ0 X
vQ,Γ0 (f ) = vQ,Γ0 (f ) = vP,Γ (f ).
dΓ
Q∈Cusps(Γ0 ) P ∈Cusps(Γ) Q∈Cusps(Γ0 ) P ∈Cusps(Γ)
Q=P in Cusps(Γ)
37
Lemma 2.6.6. For any g ∈ SL2 (Z) we have
Vg−1 Γg (f |k g) = VΓ (f ).
Proof. We clearly have vz (f |k g) = vgz (f ) for any z ∈ H and ng−1 Γg (z) = nΓ (gz) since
stabΓ (gz) = g(stabg−1 Γg (z))g −1 . Hence
X vz (f |k g) X vgz (f )
= .
ng−1 Γg (z) nΓ (gz)
z∈(g −1 Γg)\H gz∈Γ\H
This deals with the non-cusp terms in the valence formular. But similarily we can check
that vP (f |k g) = vgP (f ) for all P ∈ Cusps(Γ), so the cusp terms in Vg−1 Γg (f |k g) and
VΓ (f ) are also equal.
Now we can finally proof the valence formula.
Proof of theorem 2.6.3. Let Γ0 be any finite index subgroup of SL2 (Z) which is normal
and contained in Γ. (Note that such a group exists since Γ is a congruence subgroup.)
Then
dΓ
VΓ (f ) = · VΓ0 (f )
dΓ0
by Lemma 2.6.5. Let d = dΓ0 and choose g1 , . . . , gd ∈ SL2 (Z) such that g1 , . . . , gd are
coset representatives for PSL2 (Z)/Γ0 . Define
d
Y
F (z) = (f |k gi )(z).
i=1
Then F is weakly modular of weight dk for the full modular group SL2 (Z), and mero-
morphic at ∞. Hence by Theorem 1.5.10, we have
dk k
VSL2 (Z) (F ) = ⇒ VΓ0 (F ) = d2
12 12
since VΓ0 (F ) = d VSL2 (Z) (F ) by Lemma 2.6.5 But we can easily check that
d
X d
X
VΓ0 (F ) = VΓ0 (f |k gi ) = Vgi−1 Γ0 gi (f |k gi ) = dVΓ0 (f )
i=1 i=1
where we obtain the last two equalities since Γ0 is normal and applying Lemma 2.6.6.
Hence
dk kdΓ
VΓ0 (f ) = ⇒ VΓ (f ) = ,
12 12
which finishes the proof.
Corollary 2.6.7. Mk (Γ) is empty for any k < 0 and for any Γ.
Proof. Clear since the left hand side of the valence formula must be non-negative.
38
Corollary 2.6.8 (”The unreasonable effectiveness of modular forms in number theory”).
Let k ∈ Z and suppose f and g are modular forms of weight k and level Γ, and their
q-expansions agree up to degree k12dΓ , so up to and including qhm where m = b k12dΓ c and
h = h∞ (Γ). Then f = g.
Proof. We have v∞,Γ (f − g) ≥ 1 + b k12dΓ c > k dΓ
12
, which yields a contradiction to Theorem
2.6.3 unless f − g = 0.
Corollary 2.6.9. For any k ≥ 0 and any finite index subgroup Γ ≤ SL2 (Z) we have
k dΓ
dim (Mk (Γ)) ≤ 1 + .
12
In particular Mk (Γ) is finite dimensional.
Proof. Let m = b k12dΓ c and h = h∞ (Γ). Consider the linear map Mk (Γ) → Cm+1 mapping
f to the coefficients up to qhm in its q-expansion. By Corollary 2.6.8 this map is injective,
hence dim(Mk (Γ)) ≤ m + 1.
Remark 2.6.10.
(i) It can be shown that if −1 ∈ Γ and k is any non-negative even integer or if Γ is
odd and k is any non-negative integer then
k
dim(Mk (Γ)) ≥ ( − 1)dΓ .
12
(ii) In Diamond & Shurman there are precise formulae for the dimsion of Mk (Γ).
(b) For (c, d) ∈ Z2 there exists γ ∈ SL2 (Z) with bottom row (c, d) if and only if
gcd(c, d) = 1.
Proof. For (a) note that
0 0 0
a d − b0 c −a0 b + b0 a 1 ab0 − a0 b
0 −1 a b d −b
gg = = = ∈ SL2 (Z)+
∞.
c d −c a 0 −cb + da 0 1
SL2 (Z)+ 2
∞ \ SL2 (Z) → {(c, d) ∈ Z : gcd(c, d) = 1}.
39
We will now motivate the definition of a generalised Eisenstein series using this bijec-
tion.
Proposition 2.7.5. The function Gk,Γ,∞ is a weakly modular function of weight k and
level Γ.
Proof. It can be shown that the sum defining Gk,Γ,∞ converges absolutely and uniformly
on compact subsets of H. Thus Gk,Γ,∞ is well-defined and holomorphic. Moreover,
Gk,Γ,∞ is also clearly Γ-invariant under the weight k action.
for all γ ∈ Γ. Hence the terms in the sum defining Gk,Γ,∞ cancel out, so Gk,Γ,∞ = 0.
Now let k be even or let k be odd and Γ regular at ∞. We compute Gk,Γ,∞ (∞). We
have (
d−k if c = 0
lim (cz + d)−k = (2.1)
=(z)→∞ 0 if c 6= 0
40
Note also that for γ = ( ac db ), we have c = 0 if and only if γ ∈ Γ∞ . Thus
But X X
(Gk,Γ,∞ |k γt ) (z) = j(γγt , z)−k = j(γ, z)−k .
γ∈Γ+
∞ \Γ γ∈Γ+
∞ \Γγt
In particular Gk,Γ,∞ |k g is bounded as =(z) → ∞ for all g ∈ SL2 (Z), so Gk,Γ,∞ is indeed
a modular form.
Note 2.7.7. We have constructed a modular form that doesn’t vanish at ∞ for all pairs
(k, Γ) where this isn’t trivially impossible.
Proof. Let t be a representative of P , and choose γt ∈ SL2 (Z) such that γt .∞ = t. Define
Then Gk,Γ,P is a modular form of weight k and level Γ which does not vanish at P but
at all other cusps, by Proposition 2.7.6.
41
Note 2.7.9. The Eisenstein series Gk,Γ,P is well-defined if k is even, and in this case
independent of the choice of t. But if k is odd Gk,Γ,P is only well-defined up to sign.
Definition 2.7.10. We define Ek (Γ) as the subspace of Mk (Γ) spanned by the Gk,Γ,P ’s.
Note 2.7.11. We have
(
| Cusps(Γ)|, if k is even
dim(Ek (Γ)) = .
| Cuspsreg (Γ)|, if k is odd and Γ is odd
Example 2.7.12. Let p be prime and Γ = Γ0 (p). Then Cusps(Γ) = {0, ∞}, both cusps
are regular (see Example 2.2.16), and Γ is even. So the case k odd is trivial. For k ≥ 4
an even integer there are two Eisenstein series: Gk,Γ,∞ and Gk,Γ,0 .
Hence
0 −1
Gk,Γ,0 (z) = Gk,Γ0 (p),∞ |k (z)
1 0
X 1
= z −k −1
2
(−cz + d)k
(c,d)∈Z
gcd(c,d)=1
p-d
X 1
=
(−c + dz)k
(c,d)∈Z2
gcd(c,d)=1
p-d
X 1
= .
(cz + d)k
(c,d)∈Z2
gcd(c,d)=1
p-c
42
Thus we have
Finally consider
X 1
2Ek (pz) = .
(cpz + d)k
(c,d)∈Z2
gcd(c,d)=1
which gives us
X 1
2Ek (pz) = Gk,Γ,∞ + = Gk,Γ,∞ + p−k Gk,Γ,0 .
(pcz + pd)k
(c,d)∈Z2
gcd(c,d)=1
p-c
Hence Ek (Γ) is spanned by Ek (z) and Ek (pz). Note that we have also shown that
Ek (pz) is p−k at cusp 0.
43
3 Hecke operators
3.1 Double cost operators
It turns out that the space MK (Γ) has a very interesting structure: it is a module over
a commutative ring, classed the Hecke algebra.
Lemma 3.1.1.
1. If Γ is a congruence subgroup and α ∈ GL2 (Q)+ , then SL2 (Z) ∩ α−1 Γα is also a
congruence subgroup.
2. Any two congruence subgroups are commensurable: we have
[Γ1 : Γ1 ∩ Γ2 ] < ∞ and [Γ2 : Γ1 ∩ Γ2 ].
Proof. 1. Let N ≥ 1 such that Γ(N ) ⊆ Γ, and such that N α ∈ M2 (Z) and N α−1 ∈
M2 (Z). Then one can check that
αΓ(N 3 )α−1 ⊆ Γ(N ) ⊆ Γ,
so Γ(N 3 ) ⊆ α−1 Γα.
2. Note that there is some M ≥ 1 such that Γ(M ) ⊆ Γ1 ∩ Γ2 .
Definition 3.1.2. Let GL+ 2 (Q) denote the set of invertible 2 × 2 matrices over Q with
positive determinant. Let Γ1 , Γ2 be congruence subgroups, and let α ∈ GL+ 2 (Q). The
double coset Γ1 αΓ2 is the set
Γ1 αΓ2 = {γ1 αγ2 | γ1 ∈ Γ1 , γ2 ∈ Γ2 }.
Note 3.1.3. Multiplication gives a left (resp. right) action by Γ1 (resp. by Γ2 ) on
Γ1 αΓ2 . We can hence decompose the double coset into Γ1 -orbits:
[
Γ1 αΓ2 = Γ1 βj .
j
44
Proof. Consider the map
The map is clearly surjective, and two elements γ2 , γ20 get mapped to the same element
if and only if
Γ1 αγ2 = Γ1 αγ20 ⇔ γ20 γ2−1 ∈ α−1 Γ1 α ∩ Γ2 .
Note 3.1.7. Note that the action of SL2 (R) on H extends naturally to GL+
2 (R).
Definition 3.1.8.
g ∈ GL+
(ii) Let Γ1 , Γ2 ≤ SL2 (Z) of finite index, S 2 (Q) and let β1 , . . . , βr ∈ GL2 (Q)
+
Proposition 3.1.9. f |k [Γ1 gΓ2 ] is independent of the choice of the βi ’s, and it is weakly
modular of weight k and level Γ2 .
Proof. If β10 , . . . , βs0 is another set of coset representatives then we see that s = r. So we
can reorder such that βi = γi βi0 for some γi ∈ Γ1 . Hence f |k βi = f |k βi0 , so f |k [Γ1 gΓ2 ] is
independent of the choice of the βi ’s.
In particular, if β1 , . . . , βr is one such choice then so is β1 γ, . . . , βr γ2 for any γ2 ∈ Γ2 .
Hence the sum r r r
X X X
f |k βi = f |k (βi γ) = ( f |k βi )|k γ,
i=1 i=1 i=1
Pr
so i=1 f |k βi is weakly modular of weight k and level Γ2 .
45
Note 3.1.10. Note that acting on the right of f |k [Γ1 gΓ2 ] by Γ2 is effectively permuting
summands.
[Γ1 gΓ2 ]
Mk (Γ1 ) −−−−→ Mk (Γ2 ).
(4) Suppose Γ1 ⊆ Γ2 and g = 1. Then the αi ’s are just coset representatives for Γ1 \Γ2
and we are sending
X
f 7→ f |k γ.
γ∈Γ1 \Γ2
(5) The lastexample is a much more subtle one. Let Γ = Γ1 = Γ2 = SL2 (Z) and
g = 10 p0 for some prime p. Then
−1
0 a b
Γ ∩ g Γg = Γ (p) = : p divdes b .
c d
46
One can check that Γ0 (p)\Γ is given by the coset representatives 10 1j j=0,...,p−1 and
0 −1
1 0 . So for f ∈ Mk (Γ) we have
p−1
X 1 0 1 j 1 0 0 −1
f |k [ΓgΓ] = f |k + f |k
0 p 0 1 0 p 1 0
j=0
p−1
X 1 j 0 −1
= f |k + f |k
0 p p 0
j=0
p−1
X
k−1 −k z+j k−1 −k 1
= p p f +p (pz) f − .
j=0
p pz
0 −1
1 j
where ᾱj = 0 1 for j = 0, . . . , p − 1 and β̄ = 1 0 .
Definition 3.1.15. (a) Let Γ1 , Γ2 ≤ SL2 (Z) of finite index. We define R(Γ1 , Γ2 ) to
be the C-vector space with basis the symbols [Γ1 gΓ2 ] for each g ∈ Γ1 \ GL+
2 (Q)/Γ2 .
We define X
[Γ1 gΓ2 ] × [Γ2 hΓ3 ] := cγ · [Γ1 γΓ3 ]
γ∈Γ1 \ GL+
2 (Q)/Γ3
where
cγ := |{(i, j) ∈ {1, . . . , s} × {1, . . . , t} : λi µj ∈ Γ1 γ}| .
47
Remark 3.1.16. It is tedious to check that this definition is indeed well-defined, so
independent of the choice of λi and µj , and that this multiplication is associative, so
[Γ1 gΓ2 ] × [Γ2 hΓ3 ] × [Γ3 jΓ4 ] = [Γ1 gΓ2 ] × [Γ2 hΓ3 ] × [Γ3 jΓ4 ].
In particular, R(Γ) := R(Γ, Γ) is a ring and Mk (Γ) and Sk (Γ) are right modules over it.
a b
where γ = 1 in the case of Γ0 (N ) and γ = N p in the case of Γ1 (N ) with a, b
being any integers such that ap − bN = 1.
48
1 0
Proof. Let g = . For Γ = Γ0 (N ) or Γ = Γ1 (N ), let
0 p
Γ0 = Γ ∩ g −1 Γg .
By Lemma 3.2.1, we know that there exists lifts of the coset representatives to Γ.
For ᾱj , this is easy: we take the lift αj = 10 1j .
For Γ̄0 β̄, we need to find an element β of Γ0 (N ) or Γ1 (N ) whose reduction (mod p)
lies in the coset 0 −1
∗ 0 = 0? ?? SL2 (Z/pZ).
∗ ∗ 1 0
This will be satisfied by any matrix β ∈ Γ which
• for Γ = Γ0 (N ), is of the form Npac db with a, b, c, d ∈ Z such that pad − N bc =
1;
• for Γ = Γ1 (N ), is of the form Npac db with a, b, c, d ∈ Z such that pad − N bc =
49
Now write
1 0 pa b a b p 0
= ,
0 p Nc d N c pd 0 1
so we can write
1 0
a b p 0
Γ 0 p β=Γ .
N c pd 0 1
a b
In the case Γ = Γ0 (N ), the matrix is an element of Γ, so we have
N c pd
1 0
p 0
Γ 0 p β=Γ .
0 1
1. If p divides N , then
p−1
1 0 1X z+i
Γ Γ (f ) = f
0 p p i=0 p
50
Example 3.2.6. The map
(Z/4Z)× → C× , 1 7→ 1, 3 7→ −1
is a Dirichlet character mod 4. In particular, it is the only non-trivial character mod 4.
An example of a character mod 13 is the map
(Z/13Z)× → C× , 2 7→ e2πi/12 ,
which is well-defined since 2 generates (Z/13Z)× .
Note 3.2.7. If M divides N any Dirichlet character mod M induces a character mod
N.
Definition 3.2.8. We say a character χ is primitive if it is not induced from a character
(mod M ) for any M dividing N , M < N .
Example 3.2.9. The characters in Example 3.2.6 above are primitive characters. How-
ever, the character
χ : (Z/8Z)× → C× , 1, 5 7→ 1, 3, 7 7→ −1
is not primitve since it comes from the above character mod 4.
Note 3.2.10. If χ is a Dirichlet character (mod N ), it can be extended to a map
χ : Z → C by the recipe
(
χ(d (mod N )) if (d, N ) = 1
χ(d) =
0 if (d, N ) > 1
The resulting function is multiplicative: it satisfies
χ(d1 d2 ) = χ(d1 )χ(d2 ) ∀d1 , d2 ∈ Z.
Lemma 3.2.11. The map
ι : Γ0 (N ) → (Z/N Z)× , ( ac db ) 7→ d (mod N )
is well-defined, and it induces an isomorphism
Γ0 (N )/Γ1 (N ) ∼
= (Z/N Z)× .
Definition 3.2.12. Let d ∈ (Z/N Z)× , and let g ∈ Γ0 (N ) such that ι(g) = d (mod N ).
Then the diamond operator hdi is the double coset operator Γ1 (N )gΓ1 (N ) ∈ R(Γ1 (N )).
Note 3.2.13. Since Γ1 (N ) is normal in Γ0 (N ), we have
Γ1 (N )gΓ1 (N ) = Γ1 (N )g = gΓ1 (N ).
The map
(Z/N Z)× → R(Γ1 (N )), d 7→ hdi
is hence a group homomorphism, and we get an action of (Z/N Z)× by linear operators
on Mk (Γ1 (N )) and Sk (Γ1 (N )).
51
Recall the following result from the representation theory of finite groups:
Proposition 3.2.14. Let V be any complex vector space with an action of (Z/N Z)× by
linear operators. Then
M
V = Vχ
χ : (Z/N Z)× →C×
where
V χ = v ∈ V : g.v = χ(g) · v for all g ∈ (Z/N Z)× .
This is called the space of modular forms of weight k, level N and character χ.
We similarly define Sk (Γ1 (N ), χ).
f |k g = hι(g)if = f.
p−1 X p−1
1X z+j 1 j
Up .f = f = f |k ,
p j=0 p 0 p
j=0
p 0
Vp .f = p1−k f |k .
0 1
52
2πi
Proof. Note that if ζp = e p , then
p−1
(
X p if p|n
ζpnj =
j=0
0 if p - n
Now
p−1 p−1
X 1 j k−1 −k
X z+j
f |k =p p f
0 p p
j=0 j=0
p−1 ∞
1 X X z+j
= an e2πin p
p j=0 n=0
∞ p−1
!
X 2πinz 1X
= an e p ζpnj
n=0
p j=0
= Up .f
Tp .f = Up .f + χ(p)pk−1 Vp .f.
53
For n ≥ 1, we get
2k
σk−1 (np) + pk−1 σk−1 (n/p) ,
an (Tp .Ek ) = −
Bk
where σk−1 (n/p) = 0 if p - n. We now want to show that
since p and m are coprime. But (3.1) can easily seen to be true.
54
Proof of claim. Note that αβ has determinant pq, so by the Smith normal form we
1 ∗
1 0
have αβ ∈ SL2 (Z) 0 pq SL2 (Z). One can show that since αβ = 0 pq mod N we in
fact have
1 ∗
αβ ∈ Γ Γ.
0 pq
This proves that the product Tp Tq is a constant multiple of Γ1 (N ) 10 pq 0
Γ1 (N ) and
one can check that this constant is indeed one.
It remains to check that Tp hdi = hdiTp . Let γ = ac db as in the definition of hdi. Note
that since Γ1 (N ) is normal in Γ0 (N ) we have
1 0 −1
hdi Tp = (Γ1 (N )γ) Γ1 (N )γ γ Γ1 (N )
0 p
−1
1 0 −1
= Γ1 (N ) γΓ1 (N )γ γ γ γΓ1 (N )
0 p
1 0 −1
= Γ1 (N )γ γ Γ1 (N ) hdi .
0 p
Proposition 3.2.28. Let f ∈ Mk (Γ1 (N )), and let m, n be comprime. Then am (Tn f ) =
amn (f ). In particular, we have a1 (Tn f ) = an (f ).
Proof. First a prime power n = pr . By induction and using proposition 3.2.20 we get
∞
X ∞
X
Tpr (f ) = anpr (f )q n + pk−1 anpr−1 (hpi f ) q np
n=0 n=0
∞
X 2
+ p 2(k−1)
anpr−2 hpi2 f q np
n=0
∞
r
X
+ ... + p r(k−1)
an (hpir f ) q np .
n=0
55
If n = pr11 . . . prkk with gcd(n, m) = 1, then
Remark 3.2.29. For general m, n (not necessarily comprime), one can show (exercise)
that X
am (Tn f ) = dk−1 a mn
d2
(hdif ) .
d|gcd(m,n)
Proposition 3.2.30. For all χ mod N the operators Tn preserve the subspaces Mk (Γ1 (N ), χ)
and Sk (Γ1 (N ), χ) of Mk (Γ1 (N )) and Sk (Γ1 (N )).
Proof. This follows from the commutativity of T (Γ1 (N )) as commuting operators pre-
serve each others eigenspaces.
Definition 3.2.31. We say f ∈ Mk (Γ1 (N )) is an Hecke eigenform (or just eigenform)
if it is a simultaneous eigenvector for all the operators in T (Γ1 (N )) (i.e. for all the Tn0 s
and hdi’s).
A normalized Hecke eigenform is an eigenform satisfying a1 (f ) = 1.
Note 3.2.32. Let f ∈ Mk (Γ1 (N )) be an eigenform, say Tn .f = λn f for all n. Then
an (f ) = a1 (Tn f ) = λn a1 (f ) ∀n ≥ 1.
iii apr (f ) = ap (f )apr−1 (f ) − pk−1 χ(p)apr−2 (f ) for all primes p and all r ≥ 2.
Proof. The implication ⇒ follows directly from Definition 3.2.26 and Theorem 3.2.33.
Conversely, if f ∈ Mk (Γ1 (N ), χ) satisfies properties (i)-(iii), then f is already normalized,
so we need to show that it satisfies
56
If p - m, then it follows from Proposition 3.2.28 that am (Tp .f ) = amp (f ), which by (ii)
is equal to am (f )ap (f ). If m = pr m0 with p - m0 , then by Remark 3.2.29 we have
am (Tp .f ) = apr+1 m0 (f ) + χ(p)pk−1 apr−1 m0 (f ).
Using (ii) and (ii), this can be shown to be equal to ap (f )am (f ) as required.
Question. Do such normalized eigenforms actually exist?
Example 3.2.35.
1. A non-Eisenstein eigenform is given by ∆ ∈ S12 (SL2 (Z)). This is clear since all Tn
preserve S12 and S12 is spanned by ∆. Moreover ∆ is obviously normalized. Let
τ (n) = an (∆). Then
τ (mn) = τ (m)τ (n)
for m and n coprime by Proposition 3.2.34. This shows a statement which was
made in the prologue of this lecture.
2. Similarly we can show that the cusp forms E4 ∆, E6 ∆, E42 ∆, E4 E6 ∆ and E42 E6 ∆
of level (SL2 (Z)) and weight 16, 18, 20, 22 and 26 are normalized eigenforms since
the corresponding spaces of cusp forms are one-dimensional.
3. More interesting is the case k = 24 since S24 (SL2 (Z)) is two-dimensional. It can
easily be shown that S24 (SL2 (Z)) is spanned by f1 = E43 ∆ and f2 = ∆2 . The
q-expansion of these are given by
f1 = q + 696q 2 + 162252q 3 + 128318089q 4 + . . .
f1 = q 2 − 48q 3 + 1080q 4 + . . . .
We want to know how T2 acts on this basis. By the formula in the proof of
Proposition 3.2.28, we have
T2 (f1 ) = 696q + 128318089q 2 + . . . + 223 q 2 + 696q 4 + . . .
= 696q + 136706697q 2 + . . .
and
T2 (f2 ) = q + 1080q 2 + . . . + 223 q 4 + . . .
= q + 1080q 2 + . . .
In terms of the given basis we therefore have
T2 (f1 ) = 696f1 + 136222281f2
T2 (f2 ) = f1 + 384f2 .
Thus T2 is given by the matrix
696 1
.
136222281 384
Open conjecture (Maeda’s conjecture). The Galois group of the splitting field of the
characteristic polynomial of T2 on Sk (SL2 (Z)) is as large as possible, so isomorphic to
the symmetric group Sym(d) where d = dim(Sk ).
57
3.3 The Petersson product
The aim of this section is to define a Hermitian inner product on the space Sk (Γ1 (N )).
Lemma 3.3.1. Let U ⊆ H be a closed set whose boundary consists of finitely many line
segments and circle arcs. Let, f : U → C be a continuous function, and let γ ∈ SL2 (R).
Then Z Z
dx dy dx dy
f (z) 2 = f (γ.z) 2 ,
z∈U y γ −1 U y
where we write z = x + iy.
Proof. We view H as an open subset of R2 with coordinates (x, y) and γ = a b
c d as a
differentiable map H → H. Write
and we have
∂γ1 ∂γ2
Γ0 (z) = +i .
∂x ∂x
Hence ∂γ1 ∂γ2
= |γ 0 (z)|2 .
∂x
|Jγ | = ∂γ ∂x
1 ∂γ2
∂y ∂y
58
dx∧dy
Remark 3.3.2. What we have shown that the differential 2-form y2
is SL2 (R)-
invariant.
Notation. Let Γ ≤ SL2 (Z) be a finite index subgroup, let R be a set of coset repres-
entatives for Γ\ PSL2 (Z) and let S
D be the fundamental domain as defined in theorem
1.2.2. We then denote the union γ∈R γD by DΓ .
Lemma 3.3.3. Let F : H → C be a continuous function with is Γ-invariant, i.e.
F (γ.z) = F (z) ∀ γ ∈ Γ, z ∈ H.
UY = {x + iy ∈ H : |x| ≤ 1/2, y ≥ Y }.
Note 3.3.5. The fundamental domain DΓ is the union of some compact set K ⊂ H and
the set
γ UY {γ.z|γ ∈ R, z ∈ UY }.
Lemma 3.3.6. Let F be as in Lemma 3.3.3. Suppose that for all γ ∈ SL2 (Z) there exist
real numbers cγ > 0 and eγ < 1 such that
= cγ y eγ −2 dy.
y=Y
59
Note 3.3.7. Condition (3.2) is in particular satisfied if F tends to 0 exponentially at
the cusps.
Proposition 3.3.8. Let Γ ≤ SL2 (Z) be a finite index subgroup, k ≥ 1 and f, g ∈ Mk (Γ).
Define F : H → C by
F (z) = f (z)g(z)(=(z))k .
Then F is Γ-invariant. If at least one of f and g vanishes at each cusp then F tends
exponentially to 0 at each cusp and hence
Z
dx dy
F (x + iy) 2
DΓ y
converges.
=(z)
Proof. F as defined is of weight 0 Γ-invariant since f, g ∈ Mk (Γ) and =(gz) = |j(g,z)| 2.
Moreover, the decay at the cusps can easily be shown by considering the product of the
q-expansions of f and g at the corresponding cusps: the product will be a function in q
without a constant term, so it certainly tends to 0 exponentially. It hence follows from
Lemma 3.3.6 that the integral converges.
Definition 3.3.9. Let Γ ≤ SL2 (Z) be a congruence subgroup, k ≥ 1 and f, g ∈ Mk (Γ),
at least one of f and g vanishing at every cusp. Then we define the Petersson product
as Z
hf, giΓ = f (z)g(z)(=(z))k−2 dx dy.
DΓ
2. hf, gi = hg, f i;
60
Proof. Assume first that Γ = SL2 (Z). Let f ∈ Sk (Γ). Define F (z) = |f (z)|=(z)k/2 . We
can check that g is SL2 (Z)-invariant. So it is bounded on H if and only if it is bounded
on the funcamental domain D. But D ∩ {z ∈ C : |=(z)| ≤ R} is compact and F (z)| → 0
as =(z) → ∞ since f˜ is holomorphic at 0 and vanishes there, so |f˜(q)| < C|q| for small
q and some C > 0. But q = e2πiz decreases faster than =(z)k/2 increases.
The argument easily generalizes to general congruence subgroups.
Proof. Let f ∈ Sk (Γ) and Cy be a small circle around the origin described by e2πi(x+iy)
where y is fixed and 0 ≤ x ≤ 1. Since
Using Lemma 3.3.13 and using that qh = e2πiz/h with z = x + iy, we get
Z 1
C −2πin/h(x+iy)
|an (f )| ≤ k/2
e dx
0 y
= Cy −k/2 e2πyn/h .
Proof. It can easily be checked that hf, giΓ = hf |k γ, g|k γiγ −1 Γγ for all γ ∈ SL2 (Z). So
we can assume that c = ∞ without loss of generality. Let f ∈ Sk (Γ). By definition we
have
Z !
X
hf, Gk,Γ,∞ iΓ = f (z) 1|k γ(z) (=(z))k−2 dx dy.
DΓ
γ∈Γ+
∞ \Γ
61
Now we can interchange integral and sum and afterwards apply lemma 3.3.1. This yields
X Z
hf, Gk,Γ,∞ iΓ = f (z)(1|k γ(z))(=(z))k−2 dx dy
DΓ
γ∈Γ+
∞ \Γ
X Z
= (f |k γ −1 )(z)1(z)(=(z))k−2 dx dy
γDΓ
γ∈Γ+
∞ \Γ
X Z
= f (z)(=(z))k−2 dx dy
γDΓ
γ∈Γ+
∞ \Γ
Z
= f (z)(=(z))k−2 dx dy.
Γ+
∞ \H
Rh 1
R1
But 0
e2πinx/h dx = h 0
e2πinx dx = 0. So the product we started with was 0.
Remark 3.3.15. If c and d are distinct cusps then hGk,Γ,c , Gk,Γ,d i is well-defined, but it
is not generally 0. Moreover, for k ≥ 3 can be shown that Ek (Γ), the subspace of Mk (Γ)
spanned by the Gk,Γ,c ’s, is exactly given by the set
{f ∈ Mk (Γ) : hf, gi = 0 for all g ∈ Sk (Γ)} .
In an abuse of notation we say Ek (Γ) is ”the orthogonal complement of Sk (Γ)”, which is
not correct since h·, ·i is not well-defined on all Mk (Γ). However h·, ·i certainly defines
a positive definite innder product on Sk (Γ). In particular we can take the above set as
the definition of Ek (Γ) for k = 1, 2.
62
3.4 Hecke operators and the Petersson product
Definition 3.4.1. Let V be a finite-dimensional C-vector space equipped with a positive
definite inner product h , , i, and let T : V → V be a linear operator. The adjoint of
T is the linear operator T ∗ : V → V such that hT x, yi = hx, T ∗ yi for all x, y ∈ V .
1. We have Z Z
dx dy dx dy
= dΓ · ,
DΓ y2 D y2
hwereD is the fundamental domain for SL2 (Z) and dΓ is the projective index of Γ.
2. Let g ∈ GL+ −1
2 (Q) such that g Γg ⊆ SL2 (Z). Then
Z Z
dx dy dx dy
2
= ,
Dg−1 Γg y DΓ y2
Proof. Exercise.
Definition 3.4.4. We normalize the Petersson inner product as follows: for Γ a con-
gruence subgroup, we let
Z
1
hf, giΓ = f (z)g(z)(=(z))k−2 dx dy,
covol(Γ) DΓ
Proof.
The aim of this section is to compute the adjoints of Hecke operators with respect to
the Petersson inner product. More precisely, we want to prove the following theorem:
Theorem 3.4.6. Let f1 , f2 ∈ Mk (Γ) for some Γ ≤ SL2 (Z) of finite index such that at
least one of f1 , f2 is a cusp form and let g ∈ GL+
2 (Q). Then
63
To prove this theorem we first need several technical results:
Proposition 3.4.7. Let Γ ≤ SL2 (Z) be a congruence subgroup and let g ∈ GL+
2 (Q) such
−1 −1
that g Γg ⊆ SL2 (Z). Then for any f1 ∈ Mk (Γ) and f2 ∈ Mk (g Γg) we have
where g 0 = det(g) · g −1 as above and f1 , f2 are such that both sides are defined.
If we let U grow into a fundamental domain for g −1 Γg then gU grows into a fundamental
domain for Γ and we are done.
Lemma 3.4.9. For any Γ ≤ SL2 (Z) of finite index and any g ∈ GL+
2 (Q) we have
Γ : Γ ∩ g −1 Γg = Γ : Γ ∩ gΓg −1 .
Proof. We first check the index. Let Γ0 = Γ ∩ gΓg −1 . Then g −1 Γ0 g = Γ ∩ g −1 Γg. Since
these are both contained in SL2 (Z), we deduce from Lemma 3.4.3 that
Γgγi ∩ γ̃j gΓ 6= ∅.
64
Proof of claim. Suppose that Γgγi ∩ γ̃j gΓ = ∅ for some i, j. Then
a
Γgγi ⊆ γ̃k gΓ.
k6=j
Proof. The operator Γg 0 Γ preserves the cusp forms. Using theorem 3.4.6 we can pass to
the orthogonal complement: We have for any f1 ∈ Ek (Γ) and for any f2 ∈ Sk (Γ) that
hf1 |k [ΓgΓ], f2 iΓ = hf1 , f2 |k [Γg 0 Γ]iΓ = 0
since f2 |k [Γg 0 Γ] is in Sk (Γ). Thus f1 |k [ΓgΓ] is orthogonal to every cusp form and therefore
is in Ek (Γ).
65
Definition 3.4.11. In the setting of Definition 3.4.1, the operator T is normal if it
commutes with its adjoint T ∗ : T T ∗ = T ∗ T .
Theorem 3.4.12. Let N ≥ 1, Γ = Γ1 (N ), and consider the C-vector space Sk (Γ). Let
p be a prime not dividing N . Then
Tp∗ = Γ p0 01 Γ
−1 1 0 pa b
= Γ Na pb 0 p N 1
Γ
−1
= Γ Na pb Γ · Tp
−1
= hp iTp .
66
Remark 3.4.17. Considering Γ0 (N ) instead of Γ1 (N ) the same logic applies, but as
the hdi operators are trivial in this case the Tp are self-adjoint. Hence their eigenvalues
are real. Therefore Sk (Γ0 (N ) has a basis of modular forms with real eigenvalues for the
Tp ’s. In particular this applies to Γ = SL2 (Z).
The following example shows that the Tp , p not dividing N , can indeed fail to be
diagonalisable.
Example 3.4.18. Let f ∈ Sk (Γ1 (N )) and p prime not dividing N . Assume f is an
eigenvector for Tp . Now look at the space Sk (Γ1 (N p)). It contains f1 (z) := f (z) and
f2 (z) := f (pz). By comparing formulae for the Tp -action on the q-expansions (Theorem
3.2.20), which are not the same at N and N p, we find that
Tp f1 = λf1 − pk−1 χ(p)f2 , Tp f2 = f1
if f is a Tp -eigenform with eigenvalue λ and f has character χ. More generally at level
pj N , the space spanned by f (z), f (pz), . . . , f (pj z) is Tp -stable and the matrix of Tp looks
like
λ 1 0 0 ··· 0
−pk−1 χ(p) 0 1 0 0
0 0 0 1 0
.. .
.. .. .. . .
. . . . .
0 0 0 0 1
0 0 0 0 ··· 0
Exercise 3.4.19. This matrix is not diagonalisable for j ≥ 3, independent of λ, χ and
k.
So there is an obstruction to diagonalise Tp for p dividing N coming from forms of
small level at p.
67
2. The space of all old modular forms is defined to be y
X
Sk (Γ1 (N ))old = Sk (Γ1 (N ))p-old .
p prime, p|N
3. Define Sk (Γ1 (N ))p-new as the orthogonal complement of Sk (Γ1 (N ))p-old , and define
the space of new forms
\
Sk (Γ1 (N ))new = Sk (Γ1 (N ))p-new .
p prime, p|N
Remark 3.5.2. The space Sk (Γ1 (N ))new is precisely the orthogonal complement of
Sk (Γ1 (N ))old .
Proposition 3.5.3. The subspaces of Sk (Γ1 (N )) in Definition 3.5.1 are stable under
the operators Tn for all n ≥ 1 and hdi for all d ∈ (Z/N Z)× .
We first recall the formulae from Theorem 3.2.20: If f = n≥0 an q n ∈ Mk (Γ1 (N )),
P
then
T` .f = U` .f if `|N (3.5)
k−1
T` .f = U` .f + ` V` h`i.f if ` - N (3.6)
Here, X X
U` .f = an` q n , and V` .f = an q n` .
Note 3.5.4.
Proof. It sufficies to show that the old subspaces are stable under the operators hdi, Tp
and their adjoints. By Theorem 3.4.12, the adjoints of Tp for p not dividing N and of hdi
for all d ∈ (Z/N Z)× are in the subalgebra T (Γ1 (N )), so we don’t need to worry about
them.
Firstly consider T` for ` not dividing N . Then the action of T` on Sk (Γ1 (N )) and
Sk (Γ1 (N/p)) is given by the formulae (3.6). Hence
for all f ∈ Sk (Γ1 (N/p)), so T` preserves Sk (Γ1 (N ))p-old for all p dividing
N.
× a b
Now consider hdi for some d ∈ (Z/N Z) . Choose some γ = N c d ∈ Γ0 (N ). Then
γ ∈ Γ0 (N/p) also represents hdi ∈ R(Γ1 (N/p)). As functions on H we clearly have
68
Furthermore
1−k p 0 1−k p 0 a b
hdi (i2,p f ) = hdi p f |k =p f |k |k .
0 1 0 1 Nc d
Note that
p 0 a b a pb p 0
= .
0 1 Nc d N c/p d 0 1
Hence
1−k a pb p 0
hdi (i2,p f ) = p f |k |k
N c/p d 0 1
1−k p 0
= p (hdi f )|k
0 1
= i2,p (hdi f ).
2. If p2 is not dividing N (so p does not divide N/p), then for f ∈ Sk (Γ1 (N/p)) we
haveTp (i1,p f ) = Up (i1,p f ) by (3.5). On the other hand, using (3.6), we have
Hence
Tp (i1,p f ) = i1,p (Tp f ) − pk−1 i2,p (hpi f ) ∈ Sk (Γ1 (N ))p-old .
On the other hand we have
3. If p2 divides N , then formula (3.5) applies for Tp both at level N and at level N/p.
So we have
69
We hence deduce that Tq preserves Sk (Γ1 (N ))p-old for all primes p, q.
Therefore we are left with checking that the adjoints Tq∗ , for q dividing N , preserve
old forms (equivalently, that the space of new forms is preserved by Tq ).
0 −1
Let wN = N 0 . This normalises Γ1 (N ), so Γ1 (N )wN Γ1 (N ) is a single left or right
coset (as for hdi’s), and defines an element of R(Γ1 (N )). We check that
∗ q 0 −1
Tq = Γ1 (N ) Γ1 (N ) = wN Tq wN
0 1
as −1
0 −1 1 0 0 −1 p 0
= .
N 0 0 q N 0 0 1
We will show that wN preserves the p-old subspace for all primes p dividing N , from
which it follows that Tq preserves the p-new subspace for all primes p, q dividing N .
As before, we compare wN with the corresponding operator at level N/p, namely wN/p .
k−1 −k 1
wN (i1,p f )(z) = N (N z) f −
Nz
k−1 −k !
N N 1
= pk−1 pz f −N
p p p
pz
| {z }
=wN/p (f )(pz)
k−1
=p i2,p (wN/p f )(z).
Sk (Γ1 (N ))p−old = i1,p (Sk (Γ1 (N/p))) + i2,p (Sk (Γ1 (N/p))) + · · · + ij,p Sk (Γ1 (N/pj )) ,
70
(ii) wN does not preserve character subspaces: wN maps Sk (Γ1 (N ), χ) to Sk (Γ1 (N ), χ).
Moreover, note that χ = χ−1 .
Proposition 3.5.8. Let χ be a primitive character mod N . Then
Proof. The action of the hdi’s on Sk (Γ1 (N/p)), p dividing N , must factor through
(Z/(N/p)Z)× . The maps i1,p and i2,p commute with the hdi’s, so any hdi eigenvector in
an old subspace must have a character factoring through Z/(N/p)Z for some p. Thus it
will not be primitive.
Warning. The converse is not true: in general, it is not possible to tell from its
character whether a modular form is new.
Proposition 3.5.9 (An alternative definition of the new subspace). For p being a prime
dividing N , and assume2 that N/p 6∈ {1, 2}. Define maps
δ
1X
tr1,p : Mk (Γ1 (N )) → Mk (Γ1 (N/p)), f 7→ f |k γi ,
δ i=1
−1
tr2,p : Mk (Γ1 (N )) → Mk (Γ1 (N/p)), f 7→ p wN/p ◦ tr1,p ◦wN (f ),
`δ
where δ = [Γ1 (N/p) : Γ1 (N )] and γ1 , . . . , γδ such that Γ1 (N/p) = i=1 Γ1 (N )γi . Then
tr1,p ◦i1,p = id, tr2,p ◦i2,p = id and we have
Note that we can see δ tr1,p as the element [Γ1 (N ) · Γ1 (N/p)] in R(Γ1 (N ), Γ1 (N/p))
(c.f. Example 3.1.13 (4)).
Proof. We have for f ∈ Sk (Γ1 (N/p)) and g ∈ Sk (Γ1 (N )) that
so the kernel of tr1,p is the orthogonal complement of the image of i1,p . Similarly
−1
hi2,p (f ), giΓ1 (N ) = hp−1 wN (i1,p (wN/p f )), giΓ1 (N )
−1 ∗
= p hwN (i1,p (wN/p f )), giΓ1 (N )
= p−1 hi1,p (wN/p f ), wN giΓ1 (N )
= p−1 hwN/p f, tr1,p (wN g)iΓ1 (N/p)
−1
= p−1 hf, wN/p (tr1,p (wN g))iΓ1 (N/p)
= p−2 hf, tr2,p (g)iΓ1 (N/p) .
71
Remark 3.5.10. It is an amusing fact, that this definition of ”new” and ”old” works
also for Eisenstein series, but there is an Eisenstein series of level 6 which is new and
old simultaneously.
(b) If f ∈ Sk (Γ1 (N ))new is an eigenvector for all T` with ` not dividing N , then f is a
scalar multiple of a primitive form.
(c) If f ∈ Sk (Γ1 (N )) and g ∈ Sk (Γ1 (M )) are primitive forms with a` (f ) = a` (g) for
all but finitely many primes `, then N = M and f = g.
We are not going to proof this theorem in the lecture. There is a nearly (but not
quite) complete proof in Diamond & Shuman and a different one in Miyake. For a full
proof see the paper of Atkin & Lehner, 1970.
Moreover, a form g ∈ Sk (Γ1 (N )) is an eigenvector for T` for all ` not dividing N if and
only if it lies in one of the subspaces Sk (Γ1 (N ))[f ].
Proof. We have seen that Sk (Γ1 (N ))new has a basis of primitive forms. By induction on
the number of divisors of N , the subspaces Sk (Γ1 (N ))[f ], f primitive of level dividing
N , span Sk (Γ1 (N )).
Suppose the sum is not a direct sum. Then there is a nontrivial linear relation
X
ci,j fi (di,j z) = 0
i,j
with scalars ci,j , primitive forms fi and factors di,j dividing N/level(fi ). We can suppose
without loss of generality that this relation has the least possible number of nonzero
ci,j ’s. Then all the fi ’s such that ci,j 6= 0 must have the same Tl eigenvalue for all l not
dividing N , since otherwise apllying Tl − λ for some λ would give a relation with fewer
terms. Hence all the fi ’s with ci,j 6= 0 for some j have some Tl eigenvalue for all l not
72
dividing N and thus they are equal by the Strong Multiplicity One theorem. So any
linear relation between vectors in
X
Sk (Γ1 (N ))[f ]
f primitive of
level dividing N
comes from a relation in Sk (Γ1 (N ))[f ] for a single f . Hence the sum is direct.
Note that this also shows that the vectors {f (dz) : d dividing N/level(f )} are linearly
independent. So the set
is a basis. So it remains to show that any g ∈ Sk (Γ1 (N )) being an eigenvector for all T` ,
` not dividing N , is in Sk (Γ1 (N ))[f ] for some
Pmf . Suppose g is such an eigenvector for
all T` , ` not dividing N . We can write g = i=1 µi gi with gi ∈ Sk (Γ1 (N ))[fi ] for some
fi . if T` g = αg then
Xm
0 = (T` − α)(g) = µi (T` − α)(gi ).
i=1
Since vectors in subspaces Sk (Γ1 (N ))[f ] for distinct f ’s are linearly independent, all the
vectors (T` − α)(gi ) are zero. Since this holds for all ` not dividing N , Theorem 3.5.13
implies there is at most one nonzero µi , so g = µi gi ∈ Sk (Γ1 (N ))[fi ]. This finishes the
proof.
3.6 L-functions
3.6.1 Basic definitions
Definition 3.6.1. Let f ∈ Mk (Γ1 (N )) be a modular form with q-expansion f =
n
P
n≥0 an q . The L-function of f is the function in one complex variable s given by
∞
X
L(f, s) = an n−s .
n=0
Proposition 3.6.2.
1. If f ∈ Sk (Γ1 (N )), then L(f, s) converges absolutely for all s with <(s) > k/2 + 1.
2. If f ∈ Mk (Γ1 (N )) is not a cusp form, then L(f, s) converges absolutely for all s
with <(s) > k.
Proof. (1) By Proposition 3.3.14, we know that |an (f )| ≤ M nk/2 for some M > 0.
Hence, if <(s) > k2 + 1, then
X X k
−s
an n ≤ M n 2 −<(s) < ∞.
n≥1 n≥1
73
(2) For f ∈ Mk (Γ1 (N )), one can show that there exists M > 0 such that |an (f )| ≤
M nk for all n. The proof of the statement is analogous.
i a1 (f ) = 1;
iii apr (f ) = ap (f )apr−1 (f ) − pk−1 χ(p)apr−2 (f ) for all primes p and all r ≥ 2.
are equivalent to L(f, s) having an Euler product. Suppose first that the conditions are
satisfied. Multiplying (iii) by tr and summing over all r ≥ 2 we see that (ii) is equivalent
to
∞
X ∞
X ∞
X
r r k−1 2
a (f )t = ap (f )t
pr a (f )t − p
pr χ(p)t apr (f )tr
r=2 r=1 r=0
!
X
⇔ apr (f )tr (1 − ap (f )t + χ(p)pk−1 t2 ) = a1 (f ) + ap (f )t(1 − a1 (f )).
r≥0
Conversely, if this equality holds, then letting s → ∞ we get a1 (f ) = 1, and the other
implications can also be reversed to show that (3.7) is equivalent to conditions (i) and
(iii).
The Fundamental Theorem of Arithmetic implies that if g is any function of prime
powers, then
YX ∞ ∞ Y
X
r
g(p ) = g(pr ).
p r=0 n=1 pr ||n
Using this fact, it is easy to see that (3.7) and condition (ii) are equivalent to the
existence of the Euler product.
74
3.6.2 L-functions of cusp forms
We now focus on cusp forms. We will prove that for f ∈ Sk (Γ1 (N )), L(f, s) satisfies a
very important symmetry property, the so-called functional equation.
Definition 3.6.4. Define the Gamma-function
Z ∞
dt
Γ(s) = ts e−t .
0 t
Note 3.6.5. We have Γ(n) = n! for all n ≥ 1.
Definition 3.6.6. The completed L-function of f ∈ Sk (Γ1 (N )) is defined as
k
which converges for <(s) > 2
+ 1. Now we compute
Z ∞ X
−1 s−1 −t
Λ(f, s) = (2π) t e dt an n−s
0 n≥1
Z ∞ s
X t dt
= an e−t .
n≥1 0 2πn t
t
Via the change of variables t 7→ 2πn , we get
X Z ∞ t s X Z ∞
−t dt
an e = an ts−1 2−2πnt dt
n≥1 0 2πn t n≥1 0
Z ∞ X !
= an e−2πnt ts−1 dt
0 n≥1
Z ∞
= f (it)ts−1 dt
0
as required.
Definition 3.6.8. Λ(f, s) is called the Mellin transform of f .
75
Definition 3.6.9. Define the operator WN = ik N 1−k/2 wN .
1 1
f = (f + WN .f ) + (f − WN .f ),
2 2
and it is clear that 12 (f ± WN .f ) ∈ Sk (Γ1 (N ))± . Moreover, if f ∈ Sk (Γ1 (N ))+ , g ∈
Sk (Γ1 (N ))− , then
so hf, gi = 0.
Theorem 3.6.12. Let f ∈ Sk (Γ1 (N ))± . Then the function Λ(f, s) extends to entire
function on C, which satisfies the functional equation
k
Λ(f, s) = ±N s− 2 Λ(f, k − s).
s
Proof. Let ΛN (s) = N 2 Λ(f, s), so we need to show that ΛN (s) = ±ΛN (k − s). By
making the change of variables t 7→ √tN , we get
Z ∞ Z ∞
s
s−1 it
ΛN (s) = N 2 f (it)t dt = f √ ts−1 dt.
0 0 N
We now break the integral at 1:
Z ∞ Z 1 Z ∞
it s−1 it s−1 it
f √ t dt = f √ t dt + f √ ts−1 dt.
0 N 0 N 1 N
R∞
Note that 1 f √itN ts−1 dt converges to an entire function of s, since
√
it
f √ = O(e−2πt/ N
)
N
76
as t → ∞. For the other part, compute that
i k it
(WN .f ) √ =t f √
Ny N
Z 1 Z 1
it s−1 i
⇒ f √ t dt = (WN .f ) √ ts−k−1 dt
0 N Nt
Z0 ∞
it
= (WN .f ) √ tk−s−1 dt,
1 N
where the second equality follows from the change of variables t 7→ t−1 . Since WN .f =
±f , this integral converges to en antire function.
To obtain the functional equation, note that in total the integral is
Z ∞
it s it k−s dt
ΛN (s) = f √ t + (WN .f ) √ t .
1 N N t
Since WN .f = ±f , this is ±ΛN (s), completing the proof of the functional equation.
Here, ap (E) = 1+p−|E(Fp )|, where E(Fp ) denotes the number of points of the reduction
of E (mod p) over the finite field Fp . (We include here the point at ∞.)
Theorem 3.6.13 (Eichler–Shimura). Let f ∈ Sk (Γ1 (N )) be a normalized eigenform
whose Fourier coefficients are all integers. Then there exists an elliptic curve Ef defined
over Q such that
L(Ef , s) = L(f, s).
So does the converse of this theorem hold? In other words, given an elliptic curve over
Q of conductor NE , can we find a cusp form of level NE having the same L-function as
E?
Definition 3.6.14. An elliptic curve E is modular if there is a newform f ∈ Sk (Γ1 (NE ))
with ap (E) = ap (f ) for all p, i.e. L(E, s) = L(f, s).
77
The following theorem, which relies on the work of Andrew Wiles on Fermet’s Last
Theorem, gives a postive answer to this question.
Corollary 3.6.16. Let E/Q be an elliptic curve. Then L(E, s) has analytic continuation
to C, and it satisfies a functional equation relating L(E, s) and L(E, 2 − s).
78