analytic-number-theory-notes
analytic-number-theory-notes
AARON LANDESMAN
1. I NTRODUCTION
Kannan Soundararajan taught a course (Math 249A) on Analytic
Number Theory at Stanford in Fall 2017.
These are my “live-TeXed“ notes from the course. Conventions
are as follows: Each lecture gets its own “chapter,” and appears in
the table of contents with the date.
Of course, these notes are not a faithful representation of the course,
either in the mathematics itself or in the quotes, jokes, and philo-
sophical musings; in particular, the errors are my fault. By the same
token, any virtues in the notes are to be credited to the lecturer and
not the scribe. 1 Please email suggestions to [email protected]
1This introduction has been adapted from Akhil Matthew’s introduction to his
notes, with his permission.
1
2 AARON LANDESMAN
2. 9/26/17
2.1. Overview. This will be somewhat of an introductory course in
analytic methods, but more like a second introduction. We’ll assume
familiarity with the prime number theorem, connecting contribu-
tions of primes from zeros of the zeta function. You might look at the
first half of Davenport’s book or so as a prerequisite. We’ll assume
the students know how to prove there are infinitely many primes in
arithmetic progressions.
To get started, we’ll do the first four or five lectures proving Vino-
gradov’s three prime theorem:
Theorem 2.1 (Vinogradov). Every large odd number is the sum of three
primes.
When we say “large,” one can actually compute the bound explic-
itly (i.e., it is effective).
Remark 2.2. Helfgott, a few years back, made the bound accessible
so that one could compute exactly which odd numbers were not ex-
pressible as the sum of three primes. He showed something like all
primes more than 7 could be written as the sum of three primes.
To start the proof, write N = p1 + p2 + p3 , and we’ll count the
number of ways to write N as such. In fact, we’ll consider
∑ Λ ( n1 ) Λ ( n2 ) Λ ( n3 )
N = n1 + n2 + n3
where
(
log p if n = pk
Λ(n) :=
0 else
If we define
√
Ψ( x ) := ∑ Λ(n) = x + O(xe −c log x
).
n≤ x
0
3
S(α) e(− Nα)dα = ∑
0 n ,n ,n ≤ N
Λ(n1 )Λ(n2 )Λ(n3 )e (n1 + n2 + n3 − N )α) dα
1 2 3
= R3 ( N ).
To bound this, note that S(0) = Ψ( N ) ∼ N. We’d like to bound it by
about N 2 .
Also, S(1/2) is pretty big because
S(1/2) = (Λ(2) + λ(4) + · · · ) − ∑ Λ(n)
n≤ N,n odd
because e( x ) = −1 if x is odd.
Then, for all
10−6
|λ| ≤ ,
N
we have <S(α) > .99N. Then,
Z
S(α)3 e(− Nα)dα ' 10−6 N 2 .
|α|≤10−6 /N
We could similarly make an argument in a small neighborhood of
1
2 . This gives an analytic reason that the number of representations
might be on the scale of N 2 . So there are portions of the integral
which give the correct answer.
4 AARON LANDESMAN
∑ Λ(n)e(n/3).
n≤ N
Remark 2.6 (Insight). S(α) is big near most rational numbers with
small denominators. It’s not big near 1/4, so we’re only saying it’s
big near certain ones. This might have something to do with whether
the denominator of the rational number is square free: if it is not
square free, you essentially get translates over roots of unity of that
prime whose square divides q, and things cancel out
Exercise 2.7. Show
∗
ak
∑ e(
q
) = µ ( q ),
k mod q
0
2
|S(α)| dα = ∑ Λ ( n1 ) Λ ( n2 )
0
e ((n1 − n2 )α) dα
n1 ,n2 ≤ N
= ∑ Λ ( n )2
n≤ N
√
= ∑ log nΛ(n) + O( N log N )
n≤ N
= N log N.
√
Exercise 2.10. Verify the above, where the O( N log N ) difference
is coming from prime powers. The idea for the last step is that most
6 AARON LANDESMAN
numbers less than N are on the order of N. One might use partial
summation, which is integration by parts.
Usually,
p
|S(α)| ∼ N log N.
If α is far from every rational number, such as the golden ratio, φ,
1
we might try to compute S(φ). We might expect that S(φ) N 2 +ε .
We don’t know whether this is true, but we do know
∑ Λ(n)e(nφ) N 1−δ
n≤ N
for some δ > 0 (and it will probably even be a pretty large δ). We will
now develop a technique saying that once you are far away from a
rational number, you can get this sort of power saving.
2.4. Strategy for determining asymptotics for R3 ( N ). We have the
integral
Z 1 Z Z
3
S(α) e(− Nα)dα = ( + )S(α)3 e(− Nα)dα
0 M m
We want the second part over the minor arcs to be smaller than N 2 .
The idea is that we can bound
!Z
Z 1
m
S(α)3 e(− Nα)dα ≤ ∑ |S(α)|
0
|S(α)|2 dα
α∈m
( N log N ) ∑ |S(α)|.
m
So, it is enough to have
εN
∑ |S(α)| ≤
log N
.
α∈m
This will show the contribution from the minor arcs is less than that
of the major arcs, assuming we know the major arcs contribute N 2 .
Exercise 2.11 (Roth). For all δ > 0, there is N0 = N (δ) so that for all
N ≥ N0 , such that every subset A ⊂ [1, N ] with |A | ≥ δN has a
(nontrivial) three term arithmetic progression.
Letting
A (α) = ∑ e( aα),
a ∈A
ANALYTIC NUMBER THEORY NOTES 7
we obtain
Z 1
A (α)2 A (−2α)dα
0
counts the number of triples ( x, y, z) with x + z = 2y. This includes
|A | trivial solutions, so we want to see this integral is larger. We
might expect δ3 N 2 solutions. But now, it’s a bit hard to see how to
actually bound this integral.
Exercise 2.12 (Vague exercise). If, “away from 0,”
|A (α)| ≤ ε|A |
then the contribution of that portion of the integral
Z 1
A (α)2 A (−2α)dα
0
is bounded by ε|A |2 . (We’d like to know something like ε ≤ δ/106 .)
The idea is that either we have this bound above, or else we get
some additive structure in A which we exploit to get a bigger den-
sity set.
There are notes on this on Sound’s web-page from a course he
taught on additive combinatorics.
Now, we want to focus on showing that for some definition of the
minor arcs, the sum S(α) has a little bit of cancellation.
2.5. Vinogradov’s method. Here is the key idea from Vinogradov’s
method. This comes up many times throughout analytic number
theory. We’d like to understand the sequence
S(α) = ∑ Λ(n)e(nα).
n≤ N
for looking at 0’s of the zeta function. Let’s start with the simplest
version of these, where instead of summing over primes and prime
powers, we only sum over all the integers. Say we want to consider
∑ e(nα).
n≤ N
This is a geometric progression, so it is easy to sum:
e(α) (1 − e( Nα))
∑ e(nα) =
1 − e(α)
n≤ N
x
=
sin πα
where x is bounded by 2, and the numerator is approximately sin πNα.
3. 9/28/17
Recall last time we had
R3 ( N ) : = ∑ Λ ( n1 ) Λ ( n2 ) Λ ( n3 ).
n1 + n2 + n3 = N
and found
Z 1
R3 ( N ) = S(α)3 e(− Nα)dα.
0
The idea is to show that S(α) is large only near rational numbers
with small denominators (the minor arcs). On the complement, we
want to show |S(α)| is small, and then bound
Z Z 1
3
| S(α) e(− Nα)dα| ≤ sup |S(α)| |S(α)|2 dα.
m α∈m 0
n≤ N
log ne(nα) =
1−
log td ∑ e(nα)
n≤t
Z N
1
= log N ∑ e(nα) −
1− t ∑ e(nα)dt
n≤ N n≤t
1
(log N ) min N, .
||α||
Then,
∑ Λ(n) = ∑ µ( a) ∑ log b
n≤ N a b≤ N/a
If we knew ∑∞
µ( a)
a=1 a = 0 we could then prove the prime number
theorem. This is essentially equivalent to proving the prime number
theorem, so it would take some work. Things are good when a is
small, but there is a problem when a is big.
Goal 3.3. Our overall aim is to bound S(α).
3.1. Vinogradov’s idea. We’d like to somehow decompose Λ(n) into
pieces, where either we use a simple exponential sum, or using the
following idea. The idea has to do with bilinear forms.
We notate m ∼ M meaning M < m ≤ 2M.
B( M, N ) := ∑ ∑ am bn · f (m, n),
m∼ M n∼ N
Let ∗ denote
2
∑ ∑ bn f (m, n)
m∼ M n∼ N
Then,
2
∗= ∑ ∑ bn f (m, n)
m∼ M n∼ N
≤ ∑ bn 1 b n 2 ∑ f (m, n1 ) f (m, n2 ).
n1 ,n2 ∼ N m∼ M
We might
√ √ try setting all | an | = |bn | = 1, 1and then
now
1
our bound
is M N + N M instead of MN so we save √ + √ . Again, this
M N
bound holds under various assumptions that the f (m, n) are approx-
imately orthogonal.
This is the key strategy. We now want to implement the above
strategy in the situation we are in. The key point of the strategy
is that we have transferred the problem from understanding the un-
known an , bm to the known problem of understanding the correlation
of f (m, n).
∑ ∑ am bn e(mnα).
m∼ M n∼ N
∗= ∑ | bn 1 bn 2 | ∑ e ( m ( n1 − n2 ) α ) .
n1 ,n2 ∼ N m∼ M
We conclude
!1/2 1/2
1/2
1
∑ am bn e(mnα) ≤ ∑ | a m |2 ∑ | bn | 2 ∑ min M,
||kα||
.
m,n n |k|≤ N
We have proven:
a 1
Proposition 3.6. If |α − q < q2
and ( a, q) = 1, then
∑ ∑ am bn e(mnα)
m∼ M n∼ N
1/2 1/2 log q 1/2 √
∑ | an | 2
∑ | bn | 2
p p
MN + Mq + Nq + q .
q
Question 3.7. Why is the above bound useful?
Just to summarize, this might be helpful to think of the case that
the ai , b j are bounded in norm by 1. We then get approximately a
bound by
√
MN √ √ √ √ MN
1 1
√ √
√ MN + q M + N + q = √ + MN √ + √ + q MN.
q q M N
The middle term is what we would get from the orthogonality re-
lation. If q is small or large, we don’t beat the trivial bound, as ex-
pected. This is the crucial bound for our particular bilinear form.
Next time, we’ll try and rewrite the coefficients as a bilinear form
as a function of multiple summands. The final ingredient is to write
a combinatorial identity to express Λ(n) in terms of things we un-
derstand. That is, we want to write it as something like
4. 10/3/17
4.1. Review. Last time, we were trying to bound sums like
! 2
= ∑ | bn 1 bn 2 | ∑ f (m, n1 ) f (m, n2 )
n1 ,n2 m
= ∑ | bn 1 | 2 + | bn 2 | 2 ∑ f (m, n1 ) f (m, n2 ) ,
n1 ,n2 m
ANALYTIC NUMBER THEORY NOTES 15
and we hoped that the correlations, i.e., the terms ∑m f (m, n1 ) f (m, n2 )
were bounded by M if n1 = n2 and O(1) if n1 6= n2 . We then could
bound the above by ( M + N ) ∑i |bn |2 .
Recall last time, we were trying to bound sums like
∑ ∑ am bn e(mnα)
m∼ M n∼ N
∑ ∑ am bn e(mnα)
m∼ M n∼ N
!1/2 !1/2
log q √ √ √ √
√
q ∑ | am | 2
∑ | bn | 2
MN + q M+ N +q .
m n
We had
Λ(n) = ∑ µ(a) log b
S(α) = ∑ Λ(n)e(nα).
n≤ N
∑ log ne(nα)
n≤ x
∑ ∑ am bn e(mnα)
m∼ M n∼ N
4.2. Mollifying ζ (s). We now want to Mollify ζ (s). For this, we will
use Selberg’s sieve.
One way to study the zeta function could be an appropriate trun-
cation. We may consider
µ(n)
M(s) = ∑ ns
.
n ≤U
a(n) = ∑ µ(d)
d|n,d≤U
1
if n = 1
= 0 if 1 < n ≤ U
the norm is bounded by d(n) if n > U
We have
ζ0 ζ0
− ( s ) = − ( s ) (1 − ζ M + ζ M )
ζ ζ
ζ0
= −ζ (s) M(s) − (s) (1 − ζ M(s)) .
ζ
First, we should be fairly happy with the term −ζ (s) M(s) which
has Dirichlet series given by
!
log b M(n)
∑ bs ∑ ns ,
n ≤U
and we’ll have a long sum in the b’s, where we can hope to get some
cancellation. Next, to understand
ζ0
(s) (1 − ζ M(s)) .
ζ
We try to think of this product as a sort of bilinear form. The terms
from 1 − ζ M(s) only matter for n larger than U. Thinking of this
term as a bilinear term, we’re happy because 1 − ζ M(s) is large. But,
we have to ensure that ζ 0 /ζ is not too “skinny.” To deal with this, we
can subtract out the small primes, and then later add them back.
To accomplish this, we define
Λ(n)
P(s) := ∑ ns
.
n ≤V
Then,
ζ0 ζ0
− (s) = − (s) − P(s) + P(s)
ζ ζ
Λ(n) Λ(n)
= ∑ + ∑
n >V
ns n ≤V
ns
Then,
ζ0 0 ζ0
− (s) = −ζ (s) M(s) − (s) (1 − ζ M(s))
ζ ζ
0
−ζ
0
= −ζ (s) M(s) + (s) − P(s) (1 − ζ M(s)) + P(s) (1 − ζ M(s))
ζ
0
−ζ
0
= −ζ (s) M(s) + (s) − P(s) (1 − ζ M(s)) + P(s) − ζ (s) M(s) P(s)
ζ
18 AARON LANDESMAN
The point of this breakdown is that we now have three terms we can
handle using our two tools we have.
The middle term decomposes into two parts, both of which are
big, which gives a bilinear form.
The last term has a long sum from the ζ (s) term in simple coeffi-
cients. For P(s), we can just ignore it because V is small. The first
term is similarly a long some from the ζ 0 term.
Remark 4.3. The first term which we handle via our first summation
technique is called a “type 1 sum” and the second handled via our
second bilinear form summation technique is called a “type 2 sum.”
Example 4.4. Let’s say we want to write
−ζ 0 −ζ 0
(s) = (s) (1 − ζ M)2 + 2ζ M − ζ 2 M2
ζ ζ
0
ζ
= − + ζ M (1 − ζ M) − 2ζ 0 M + ζζ 0 M2 .
0
ζ
The first is a type 2 sum, the second is a type 1. The ζ and ζ 0 are both
somewhat a simple divisor function because if the product of ζ, ζ 0
goes in a long range then at least one of them must be summed in a
long range. So, the third term is also a type 1 sum.
Remark 4.5 (Heath Brown identity, aka binomial theorem). Given
−ζ 0
( s ) (1 − ζ M ) k
ζ
one can try expanding this in k via the binomial theorem, and try to
bound various terms.
4.3. Proving Vinogradov’s theorem. Recall we have
a 1
α− ≤ 2.
q q
with (α, q) = 1. Our goal is to bound S(α) in terms of q. Trivially we
know S(α) is bounded by N, and we want to save a bit more than
one log on the minor arcs, and then we’ll have to concentrate on the
major arcs.
Using Vaughan’s identity, (where we have not yet specified U and
V). There are three type 1 sums and one type 2 sum (from the bilinear
form. Recall we have
ζ0
0
−ζ
0
− (s) = −ζ (s) M(s) + (s) − P(s) (1 − ζ M(s)) + P(s) − ζ (s) M(s) P(s)
ζ ζ
ANALYTIC NUMBER THEORY NOTES 19
and we are trying to bound the four sums. First we deal with the
P(s) term, which is
∑ Λ(n)e(nα) V.
n ≤V
Exercise 4.6. Carry out the argument from week 1 for dealing with
sums like
1
∑ min N, ||nα|| .
|n|≤ N
There is a small lie in what we will next do, and your job is to fix it
by Thursday. You should check what happens for smaller n as well.
Now, we’ll aim to attack the last type 1 sum, which is the term
corresponding to ζ (s) M(s) P(s) which is
∑ ∑ µ(n)Λ(`) ∑ e(n`rα)
n≤U `≤V r ≤ N/n`
Exercise 4.7. Verify the above bounds using a method similar to the
type 2 bound of the first ζ 0 M term.
Adding up our three type one sums, and removing terms trivially
bounded by others, we get
2 N N
(log N ) + q + UV + .
q U
This handles three of the four terms. The last term remaining to be
handled is the type 2 sum corresponding to
0
−ζ
(s) − P(s) (1 − ζ M(s))
ζ
The first sum only contains terms larger than V and the second only
contains terms larger than U. Using
1
1 − ζ M(s) = ∑ s ∑ µ(d)
n n d|n,d>U
with mn ≤ N.
Remark 4.8. We now have two terms with variables in our bilinear
form, both with large values. Both will range over dyadic intervals.
It starts to look like a bilinear form, though there is the caveat that the
two variables are connected by the condition that mn ≤ N. Hence,
we need some technical device to separate the variables.
Essentially, this is saying these are like points lying below a hy-
perbola and we would like to approximate the hyperbola by some
rectangle.
ANALYTIC NUMBER THEORY NOTES 21
Morally,
∑ Λ(n) ∑ ∑ µ(s)e(mnα)
n >V m>U d|m,d>U
∑ Λ(n)2 B log B.
n∼ B
Next,
∑ a ( m )2 ∑ d ( m )2
m∼ A m∼ A
5. 10/5/17
5.1. Recap of last time. Recall that we have defined
S(α) := ∑ Λ ( n ) e ( n ).
n≤ N
and
µ(n)
M(s) = ∑ ns
.
n ≤U
The first three terms are type 1 sums, and the last term is a type 2
sum. Last time, we discussed the bound for the type 1 sums. We
saw they were bounded by
2 N N
(log N ) + q + UV + .
q U
For example, to bound the term ζ 0 (s) M(s), we had to bound
N 1
∑ min n , ||nα|| .
n ≤U
We could split this into dyadic blocks and carry out the usual sum.
When 1 ≤ n ≤ q, we cannot split it into intervals of length q.
Exercise 5.1. For the blocks, we should take a dyadic sum over inter-
vals 2k q ≤ n ≤ 2k+1 q, and then we should pay attention to the case
1 ≤ n ≤ q, and we should get a bound around q log q or something
like that for the sum of the first q terms.
ANALYTIC NUMBER THEORY NOTES 23
At the end of last class, we were discussing the type 2 sums. There
were many small things we needed to keep track of. We wrote the
sum as
= ∑ d k −1 ( b ),
ab=n
24 AARON LANDESMAN
and use induction. If we knew ∑b≤y dk−1 (b), we could then use the
hyperbola method for a ≤ A, ab ≤ B and choose the parameters A, B
with AB = x.
5.3. A second method for bounding the sum of the divisor func-
tion. We now want a second method of calculating this. We are try-
ing to bound
d(n)
ζ ( s )2 = ∑ ns
.
We have
xs
Z c+i∞
1
∑ d(n) = 2πi c−i∞
ζ ( s )2
s
ds
n≤ x
for c > 1.
Exercise 5.4. Show
(
1 ds 1 if y > 1
Z
ys =
2πi (c) s 0 if y < 1
(see davenport’s book) where the path (c) means that from c − i∞ to
c + i∞. Essentially, one can prove this by noting the integral is 0 for
very small y, and there is only one pole at y = 1 which has residue 1.
When we expand
x s = x (1 + (s − 1) log x + · · · )
and
1
ζ (s) = + γ + O ( s − 1).
s−1
The residue of the pole at s = 1 is
x log x + (2γ − 1) x.
It would be useful, and can be done easily, to have some bounds for
|ζ (s)| (1 + |t|)1/2
where s = σ + iτ, 0 ≤ σ ≤ 1.
We are trying to bound
1 xs
Z
ζ (s)2 ds
2πi (c) s
xs xc
Z c+iT
1
= ζ (s)2 ds + O( ).
2πi c−iT s T
ANALYTIC NUMBER THEORY NOTES 25
∑ d k ( n ),
n≤ x
and we consider
∞
dk (n)
ζ (s)k = ∑ n s
.
n =1
x (log x )k
( k − 1) !
with some lower order terms we can work out. The main term will
be a polynomial of log x of degree k − 1.
Exercise 5.5. Show we end up getting a residue of the form
In both cases, the main term is the area of the region you are consid-
ering. The best error known is only about O( x1/3−δ ).
26 AARON LANDESMAN
Note that this will converge absolutely whenever we are to the right
of 1. We have
1 1
ζ (s) = ∏ 1 + s + 2s + · · · .
p p p
We can approximate
∞
d ( n )2
4 9
∑ ns = ∏ 1 + s + 2s + · · ·
p p
n =1 p
= ζ ( s )4 F ( s )
where
α
F (s) = ∏ 1 + 2s + · · ·
p
p
for some α, which converges absolutely if Re(s) > 1/2. One then
obtains that
∑ d(n)2 ∼ Cx (log x)3 ,
n≤ x
∑ di ( n )
n≤ x
where dπ (n) are the coefficients of the Dirichlet series of ζ (s)π . Re-
latedly, one might try to count
∑ 1,
n≤ x,n= a2 +b2
and one can work out
∑ 1 = ζ (s)1/2 L(s, χ−4 )1/4 F (s),
n = a2 + b2
where f (s) is regular to the left of 1.
It’s not completely obvious how these functions continue analyti-
cally. We could make sense of
ζ (s) = exp (π log ζ (s)) ,
which makes sense to the right of 1. But, it also can be extended to
regions where there are no zeros or poles of the ζ function. If we
understand the zero-free region of the zeta function, then we can
make sense of this function in this zero-free region.
In the region
c
γ > 1− ,
log T
28 AARON LANDESMAN
∑ d ( n )2 .
n≤ x
This yields x (log x )4 as a bound, and you are only off by one log.
Exercise 5.11. Verify this.
Exercise 5.12. Let p(n) denote the number of partitions of n. Prove
that
√
p(n) ≤ exp π 2/3n .
Moreover, find the optimal constant so that p(n) ≤ eαn . (Sound thinks
the constant above is optimal).
ANALYTIC NUMBER THEORY NOTES 29
∑ Λ(n)ein
n≤ N
using that
1 a C
− ≥ 20 ,
π q q
so one can always find a pretty good approximation to π. So, we get
a bound of about
∑ Λ(n)ein N .99
n≤ N
ANALYTIC NUMBER THEORY NOTES 31
6. 10/10/17
6.1. Exercises and questions. Last time, we let q be a number with
a 1
α− < 2
q q
N
∑ Λ(n)e(nα) (log N ) 3 4/5
p
√ + qN + N .
n≤ N
q
∑ ∑ f 1 f 2 e(mnα),
m∼ A n∼ B
∑ Λ(n)e(nφ) N 4/5+ε ,
n≤ N
for φ the golden ratio. Can one say something better? Presumably
the right answer is N 1/2 , though that may be hard. Maybe one could
show something like N 3/4 . The key is that we have rational approx-
imations at every scale.
32 AARON LANDESMAN
We split this up into major and minor arcs. On the minor arcs, we
bounded this by
Z
| S(α)3 e(− Nα)dα| < max |S(α)| N log N.
m m
q ≥ (log N )10 .
Otherwise, there exists α ∈ M (in a major arc).
That is,
a 1
α− ≤
q qQ
with q < (log N )10 . The major arcs M are disjoint. The total measure
of the major arcs is
φ ( q )2 C (log N )10
|M| = ∑ qQ
∼
Q
,
q≤(log N )10
where
k
τ (χ) = ∑ χ(k) exp .
k mod q
q
Then,
an 1
∑ Λ(n) exp q
= ∑ τ (χ) ∑ Λ(n)χ(an).
φ(q) χ mod
n≤ x q n≤ x
Define
ψ( x; χ) := ∑ Λ ( n ) χ ( n ).
n≤ x
Then,
Ψ ( x; q, a) = ∑ Λ( x )
n≤ x,n≡ a mod q
1
= ∑ χ(a)Ψ(x, χ).
φ(q) χ mod q
We have
Ψ ( x, χ0 ) = Ψ( x ) + O (log x )2 .
for some c > 0. The key step in the proof of this is that the region
σ > 1 − log 2c+|t| has no zeros of the zeta function ζ (s), where s =
σ + it.
We therefore get a bound
Ψ ( x, χ0 ) = Ψ( x ) + O (log x )2
xρ x
∼ x− ∑ +O .
|ρ|≤ T
ρ T
7. 10/12/17
7.1. Review. Let χ be some character Z/qZ → C× . We have
Ψ( x, χ) = ∑ Λ ( n ) χ ( n ).
n≤ x
If χ = χ0 , we have
p
Ψ( x ) + O (log x )2 = x + O x exp −c log x .
38 AARON LANDESMAN
If χ 6= χ0 , GRH implies
|Ψ( x, χ)| x1/2+ε .
We would like an unconditional bound around
p
(7.1) | Ψ ( x, χ )| x exp − c log x
p
and we would like to say when q ≤ exp log x . We have
an 1
∑ Λ(n) exp q = φ(q) ∑ τ (χ)χ (a) Ψ(x, χ).
n≤ x χ mod q
log x
into the error term. In the presence of the Siegel zero, we can only
get this
p uniform desired result for q ≤ (log x ) A , but not for q ≤
exp c log x . This is also ineffective.
Therefore, we obtain
an µ(q) p
∑ Λ ( n ) exp
q
φ ( q )
x + O x exp − c log x .
n≤ x
We know
3 Z N
a µ(q)
3
p
S +β = exp ( xβ ) dx + O N exp − c log N
q φ ( q )3 0
ANALYTIC NUMBER THEORY NOTES 41
The error term in the integral over the major arcs is then
p 1 p
O ∑ N 3
exp − c log N
Q
= O N 2
exp − c log N .
10
q≤(log N )
So the error terms are under control. We now want to understand the
main term.The is almost independent of a except for the
main term
a
factor exp − N q +β . We want to understand the main term of
Z
S(α)3 e (− Nα) dα.
M
which is
∗ Z 1/qQ Z N
µ(q) a
∑ ∑ φ ( q )3
exp ( xβ) dx exp − N
q
+β dβ.
q≤(log N ) 10 a mod q −1/qQ 0
q≤(log N )10
!
N2
=O .
(log N )10
So, we can replace our integral from − N/qQ to N/qQ by an inte-
gral going off to infinity. This integral is essentially computing the
number of ways to write N as a sum of three numbers, which is es-
sentially N 2 /2. But, we can also compute it since this is essentially a
Fourier transform. That is,
Z ∞ Z 1 3
2
N exp ( Nxβ) dx e ( β) dβ
−∞ 0
Then one can use Parseval’s identity to compute the Fourier trans-
form of this. Here, Parseval is counting the number of ways of writ-
ing 1 as a sum of three real numbers. Before we were writing N as a
sum of three integers.
Now, let’s finish our calculation. We were trying to compute
Z 1/qQ Z N 3
µ(q)
∑ φ(q)3 cq ( N ) −1/qQ 0 exp (xβ) dx e (− Nβ) dβ.
10
q≤(log N )
µ(q) N2
∑ φ ( q )3
c q ( N )
2
.
q≤(log N )10
ANALYTIC NUMBER THEORY NOTES 43
This sum is called the singular sum The tail of this sum is roughly
!
1 2
∑ O φ(q) (log N )−10 .
10
q>(log N )
Let
∞
µ(q)
S ( N ) := ∑ φ ( q )
c ( N ).
3 q
q =1
Then,
p −1
aN
cp(N) = ∑ exp p
a =1
(
−1 if p - N
=
p−1 if p | N
Then,
1
S (n) = ∏ 1 − c p (n)
p ( p − 1) s
!
1 1
= ∏ 1 + 3 ∏
· 1− 2
.
p-N ( p − 1) p | N ( p − 1)
Finally, we have
Z 1
0
S(α)3 e (− Nα) = ∑
n1 + n2 + n3 = N
N2
N
= S (N) + O .
2 log N
Because the contribution of prime squares and cubes is negligible,
we get that every sufficiently large odd number is the sum of three
primes (and in fact it is the sum of three primes in many ways),
where here we are using that
S (N) ≥ c
for all N where c is some universal constant bounded below by
1
2· ∏ 1− .
p ≥3 ( p − 1)2
0
|S(α)|2 = ∑ Λ(n)2 ∼ 2N log N.
n≤2N
N = x1k + · · · + xsk .
ANALYTIC NUMBER THEORY NOTES 45
8. 10/17/17
8.1. Exercises to solidify the ideas thus far. Here are two exercises,
which are a bit longer and harder than usual.
Exercise 8.1 (Difficult exercise). Assume GRH. Give a bound for
∑ Λ(n) exp (nα)
n≤ x
for α − qa ≤ q12 without using bilinear forms, but instead using GRH
and thinking about the prime number theorem and arithmetic pro-
gressions. Hint: We discussed how to write
na 1
∑ Λ(n) exp q ∼ φ(q) ∑ χ(a)τ (χ)Ψ(x, χ),
n≤ x
Ψusing GRH.
and one can input information about
na
One would then try to write exp q in terms of exp (nβ) with
1
| β| ≤
qQ
and then one should try to obtain good minor arc estimates for this
summation using a “quasi-Riemann hypothesis” (i.e., assuming there
46 AARON LANDESMAN
x
sup ∑ µ(n) exp (nα) A
α ∈R n ≤ x (log x ) A
ANALYTIC NUMBER THEORY NOTES 47
for any A > 0. You will have to figure out what happens when α is
on a major or minor arc.
When α is on a minor arc, you will want to use Vinogradov’s
√ √
method and use a bilinear estimate, you will have x/ q + q · x.
0
We use Vaughn’s identity for obtaining bilinear forms for − ζζ (s),
and we would need an analog for identifying 1ζ (s). One would take
ζ · M for M a modifier, and play around with powers of that.
To deal with the case when α is on a major arc. This has to do with
understanding
an
∑ µ(n) exp q ,
n≤ x
0
∑ µ(n) exp (nα) dα = ∑ µ ( n )2 .
n≤ x n≤ x
48 AARON LANDESMAN
Remark 8.5. The minor arc technology gave use an estimate of the
form
N p
√ + qN + E
q
where E is some error term endemic to the method.
√ The first two
terms are optimized when q is on the order of N, in which case the
first two terms give N 3/4 , so you cannot really do better than N 3/4
with this minor arcs method. We can write exp(nα) in terms of
Z
∑ t
χ(n)nit f ,
χ
To make
|ζ (1 + it)|
large, we would like
pit ∼ 1
and to make it small we would like
pit ∼ −1
for many small primes p.
Exercise 8.6 (Simple exercise). Show that for any N, there are qua-
dratic characters χ with χ( p) = 1 for all p ≤ N (and similarly
χ( p) = −1 for all p ≤ N).
Remark 8.7. Then, χ will occur to some modulus q, and q might be
very large in terms of N. If one tries to compute one of these via
the Chinese remainder theorem, one might then have the modulus
exponentially large in N.
Conjecture 8.8 (Vinogradov). If χ( p) = 1 for all p ≤ N, then q > N A
for arbitrarily large A. Conversely, χ( p) = −1 for all p ≤ N then
q > N A for all large A.
Example 8.9. The least quadratic non-residue must be smaller than
qε , if Conjecture 8.8 were to hold true.
Remark 8.10. The chance that all the first N primes land heads, one
should expect the chance is around 1/2 N . So, one would expect the
number of primes would have to be exponentially large.
Every once in a while, there can be a surprise. For example, if
D = −163. Then,
−163
= −1
p
for p < 41. There are 12 such primes. If you think of things as coin
tosses, there would only be a 1/212 (since there are 12 primes up to
and including 37) but 163 is substantially smaller than 212 = 4096.
L(1, χ−163 ) should be very small. Indeed, the Class number for-
mula gives
√
πh Q
−163 π
L (1, χ−163 ) √ ∼ .
163 13
50 AARON LANDESMAN
and the class number is 1 here (and h denotes class √number). Recall
that the class number formula implies that for Q
−D ,
√
πh(Q
−D
L (1, χ D ) = √ .
D
Goldfeld and Gross-Zagier’s result implies
C log D
L 1, χ− D ≤ √ .
D
Siegel’s theorem implies that if χ is a quadratic character mod q, then
L(1, χ) ≥ C (ε) q−ε
for all ε > 0.
Remark 8.11. The zero free region is determined as follows. If L ( β, χ) =
0, then
L (1, χ) = (1 − β) L0 (σ, χ)
for some β ≤ σ ≤ 1.
Exercise 8.12. Prove that if
1
1 ≤ σ ≤ 1− ,
log q
then
L0 (σ, χ) ≤ C (log q)2 .
See Davenport. Essentially you can make sense of this a little left of
the 1 line. Then you can differentiate it and deduce this. So, you
have a bound on how close a zero can be to 1. So, if there’s a bad
Siegel 0 it has to be bounded away from 1.
8.3. The zero-free region of the Zeta function. We’d like to instead
look at the completed zeta function
ξ (s) = s (s − 1) π −s/2 Γ(s/2)ζ (s).
The functional equation says
ξ ( s ) = ξ ( s − 1).
The Hadamard product formula gives
s
e A+ Bs
∏ 1 − ρ es/ρ .
ρ
The trivial zeros come because the Γ(s/2) function has zeros at s =
0, −2, −4, · · · .
ANALYTIC NUMBER THEORY NOTES 51
8.4. Siegel zero situation. We’ll now discuss a proof due to Gold-
feld of Siegel’s theorem. We want to show that a lower bound for
L(1, χ)
L(1, χ) C (ε)q−ε
for χ a quadratic character modq. We look at the region
h ε i
1− ,1 .
10
Either
(1) All quadratic Dirichlet L-functions have no zero in this region
We take β = 1 − 10ε
. We define Ψ to some character mod3.
(2) There is some quadratic character Ψ mod r for some r with
L( β, Ψ) = 0 with 1 ≥ β ≥ 1 − 10ε
.
Consider
a(n)
ζ (s) L(s, χ) L(s, Ψ) L(s, χΨ) = ∑ ns
.
Exercise 8.15. Check a(n) ≥ 0 for all n by just expanding the defini-
tions of Dirichlet characters for the various L functions.
This function above is the Dedekind ζ function for the biquadratic
extension defined by χ and Ψ. This function is always non-negative
on primes because
(1 + χ( p)) (1 + Ψ( p)) ≥ 0
and both χ, Ψ takes values ±1, 0.
Then, for c > 1, consider
1
Z
I := ζ (s + β) L(s + β, χ) L(s + β, Ψ) L(s + β, χΨ)Γ(s) X s ds
2πi (c)
Look at the 0’s of Γ, compute the residues, and you will see the Taylor
expansion for e−1/x .
ANALYTIC NUMBER THEORY NOTES 53
as above.
Exercise 8.17. Indeed show that for χ a character modq, show
| L (1, χ) | log q.
for X a large power of qr (using Re(s) = − β + 1/2).
Therefore, we would conclude a bound of the form
L (1, χ) (qr )−ε .
We could get an effective bound, but we don’t know what r is. In
case 1, r = 3, so things would be fine. But, if there is some violation
to the Riemann hypothesis, then r depends on what the violation to
the Riemann hypothesis is. So this r is the source of the ineffectivity
in Siegel’s theorem.
Next time, we’ll discuss effectivity of the 3-prime theorem. Then
we’ll move on to discussing a theorem of Maynard:
Theorem 8.18. There are infinitely many primes with no 7 in their decimal
expansion.
9. 10/24/17
9.1. Quick recap of the proof of Siegel’s theorem. Recall that last
time we proved Siegel’s theorem:
Theorem 9.1 (Siegel). We have
C (ε)
L(1, χ) >
qε
(with C (ε) ineffective).
ANALYTIC NUMBER THEORY NOTES 55
enough bounds. But, in any case, after making this precise, we can
bound Γ by a sterling approximation, and then bound
|ζ (1/2 + it)| (1 + |t|) .
2πi (c)
ζ (s + β) L(s + β, χ) L(s + β, Ψ) L(s + β, χΨ) X s Γ(s)ds = ∑ nβ
≥ e−1/x
Instead, consider
then
ξ0 1
ξ
(s) = ∑ s − ρ.
ρ
Then,
1 σ−β
Re = .
s−ρ | s − ρ |2
with s = σ + it, ρ = β + iγ. On the one hand, the expression
ξ0 ξ0 ξ0 ξ0
(σ) + (σ, χ1 ) + (σ, χ2 ) + (σ, χ1 χ2 )
ξ ξ ξ ξ
We also know
ξ0 ξ0 ξ0 ξ0
(σ) + (σ, χ1 ) + (σ, χ2 ) + (σ, χ1 χ2 )
ξ ξ ξ ξ
1 1 1 1
+ log q1 + log q2 + log q1 q2 + O(1)
1−σ 2 2 2
ζ 0 L 0 L 0 L0
+ (σ) + (σ, χ1 ) + (σ, χ2 ) + (σ, χ1 χ2 )
ζ L L L
1
≤ + log q1 q2 + O(1).
σ−1
If β 1 , β 2 were close to 1, we have a lower bound by something close
to 2/1 − σ.
Exercise 9.6. If q1 , q2 are comparable to each other, say q1 ≤ q1002 , q2 <
1 1
q100
1 , then we can’t have both a lower bound by σ− β1 + σ− β 2 and an
upper bound by
1
+ log(q1 q2 ) + O(1)
θ−1
10−6
Here, we are choosing σ to be around 1 + log q1 q2 .
60 AARON LANDESMAN
Then,
ξ K0 −1 1 Γ0 ζ0
1 1 1
(σ) = + + log dK + r1 log π + (σ/2) + r2 (· · · ) + K (σ).
ξK σ σ−1 2 2 2Γ ζK
ζK0
Using that the last term ζ K (σ) is negative, and the whole sum is pos-
0
itive, choosing σ near 1 optimally and knowledge of ΓΓ (1/2) and
Γ0
Γ (1) gives a lower bound for the discriminant. One must choose an
appropriate value of σ, Sound suggests something like σ = 1 + nc .
So, if you have a field with small discriminant, this also means
there are not many primes of small norm. The zeros of such an L
function are then also nicely behaved.
Exercise 9.10. Work out the details in the above remark.
ANALYTIC NUMBER THEORY NOTES 61
M
3
S(α) exp (− Nα) dα = ∑ 1
| β|≤ qQ
S
q
+β exp − N
q
+β dβ.
10
q≤(log N ) ,q0 |q
Then, to find the contribution of the cube of this main term, we find
∗
τ ( χ )3 N 3β0 − aN
∑ φ(q)3 χ(a) β3 exp q
a mod q 0
From χ( a) exp −aN q we get another Gauss sum so
∗
τ ( χ )3 N 3β0 − aN τ (χ)4 N 3β0 −1
∑ φ(q)3 χ(a) β3 exp q ∼ φ(q)3 β3 .
a mod q 0 0
62 AARON LANDESMAN
10. 10/26/17
Remark 10.1. Goldfeld Gross Zagier says
c log | D |
L (1, χ) ≥ p
|D|
for imaginary quadratic fields. Then, effectively, h(− D ) > c log | D |.
√
Remark 10.2 (Euler’s idoneal numbers). Consider Q
p
− D . Can it
be that all p ≤ | D | are either ramified or inert?
Gauss’ genus theorem tells us
h (− D ) ≥ 2 part of the class group = 2# primes | D = d (| D |) .
The divisor function grows as O(| D |ε ). The problem is to find all
discriminants − D < 0 where
√
cl Q − D = (Z/2Z)r .
√ There
Remark10.3. is work of Biro on class numbers of fields of the
form Q 2
n +4 .
Then,
∑ Λ ( n ) ∼ κ a0 ( q ) ( q − 1 ) k .
n<qk
n ∈A
Remark 10.14. There is also a more involved version where one ob-
tains a lower bound of the form (q − 1)k for q = 10.
More generally, this can be done with any base replacing 2, with
obvious exceptions.
There is also the following cute result: One might ask if one can
find Fermat primes, with two 1’s in the binary expansion and all
other digits 0. This might be a hard problem because the set is quite
sparse, but, one can try to further ask if there are infinitely many
primes with k 1’s. One might try an easier problem asking if there
simply exist primes with exactly k 1’s in their binary expansion. The
following theorem shows the answer is yes.
has an asymptotic formula, with about 1/k of the numbers in this set prime.
K 2K
Remark 10.17. One would expect about (K/2 )∼ √
K
Theorem 10.18 (Bourgain). There are primes p ≤ 2k for which you can
specify any αK of the binary digits, for some fixed α > 0, where the last
digit must be 1 (so that the number is not even).
and
A (−α) = ∑ exp (−nα) .
n<qk
n ∈A
In our situation, we can understand the Fourier transform of A
so well that we can actually understand its L1 norm. It is binary be-
cause we are intersecting two sets here: primes and integers without
a specified digit.
We could still use the circle method here, but it is a little easier to
apply the circle method in a discrete setting.
10.3. The circle method in a discrete setting. Consider
q k −1
−a
1 a
∑ S q k A q k = ∑ Λ ( m )1 A ( n )
q k a =0 k m,n<q
m≡n mod qk
= ∑ Λ ( n )1 A ( n ).
n≤qk
The last equality holds because m, n < qk . For the penultimate one,
writing
a am
S = ∑ Λ(m) exp
qk m qk
and
−a −na
A = ∑ 1 A (n) exp ,
qk n qk
and the only terms that survive are m ≡ n mod qk . This is an excel-
lent approximation to the integral from the circle method.
Exercise 10.19. Verify the above equalities.
66 AARON LANDESMAN
We now separate terms into major and minor arcs, as in the circle
method.
We now write
a `
k
= +β
q d
1
with d ≤ qk/2 and | β| ≤ dqk/2
.
We try to approximate a/qk using rational numbers with denomi-
k/2
nator at most q . By Dirichlet’s theorem, we can always write num-
bers in this form.
We write A as some large positive number. The major arcs are
those values of a with
k
A (log qk ) A
d ≤ log q , | β| ≤ .
qk
The minor arcs are the remaining values of a. The major arcs are
distinct because the denominators are small and we are taking small
intervals around each rational number.
We’ll first deal with the major arcs. The harder part will come later
when we deal with the minor arcs.
Then,
!
qk
` b µ(d) nb
S + k = ∑ exp +O .
d q φ(s)
n<qk
qk (log k)3A
qk
a
If b 6= 0, the main term vanishes and S 3A . So,
qk
= O
(log qk )
we will only have to worry about the terms d = 1 or d = q.
Let’s now sum all these major arcs. The contribution from the error
terms of case 1 is
!
1 1 qk k
+O ( q − 1)
qk qk log qk A
11. 10/31/17
11.1. review. Let q be a large but fixed base and let A be the set of
numbers missing a0 ∈ [0, q − 1].
Goal 11.1. Count
∑ Λ(n)
n<qk
n ∈A
and show it is asymptotically c a0 (q) (q − 1)k for some constant c a0 (q) >
0.
Let
A ( α ) Ak ( α ) : = ∑ 1A (n)e (nα) ,
n<qk
with
S (α) = ∑ Λ(n)e (nα) .
n≤qk
We can approximate
a ` 1
− <
qk d dqk/2
with d ≤ qk/2 . Last time, we were trying to work out the major arc
A
contributions over intervals with d ≤ log qk and
A
a ` log qk
− ≤ .
qk d qk
ANALYTIC NUMBER THEORY NOTES 69
Last time we computed the major arc contribution in the case d was
A
a power of q less than log qk where we got
( q
q −1 if a0 = 0 mod q
c a0 ( q ) ( q − 1 ) k =
1 − 1 2 if a0 6= 0.
( q −1)
11.2. Remaining major arcs. We next deal with the remaining major
arcs. Namely, we show those centered at d` for d not a power of q are
negligible. We’ll put a trivial bound on S and the cancellation will
come form A (α).
We know
|S(α)| ≤ qk (1 + o (1))
trivially. We now look for cancellation in
a `
A =A +c
qk d
A
(log qk )
for c at most qk
, as we are on a major arc.
We have
k −1 q −1
Ak ( α ) = ∏ ∑ e nj qj α .
j =0 n j =0,n j 6= a0
∑
e nj θ ≤ (q − 3) + |e(nθ ) + e ((n + 1)θ )|
n j 6 = a0
≤ (q − 3) + 2 |cos(πθ )|
= (q − 1) − 2 (1 − | cos πθ |)
2
≤ (q − 1) exp −cq ||θ || .
for some small cq > 0, where
|| x || = min | x − n|.
n ∈Z
Then,
!
k −1 2
|Ak (α)| ≤ (q − 1) exp −cq k
∑ qj α .
j =0
70 AARON LANDESMAN
A
` (log qk )
Assuming α = d +O qk
, we get
!
k −1 2
|Ak (α)| ≤ (q − 1)k exp −cq ∑ qj α
j =0
!
k/2 2
`
(q − 1)k exp −cq ∑ qj .
j =0
d
1
Remark 11.2. Note that if ||θ || ≤ 2q then ||qθ || = q ||θ ||.
log d
Lemma 11.3. For k in an interval of length log q + 1, we can find k0 with
` 1
qk0 ≥ .
d 2q
Proof. We have
` 1
qk ≥
d d
and now using Remark 11.2, we see that powers of q increase, and
eventually the value “wraps around 1” so we get some term which
is not too small.
Therefore,
k k
|Ak (α)| (q − 1) exp −c1 .
log k
Therefore, these other major arcs contribute
!
1 2A −ckq
∑ k k k k
q (q − 1) exp −cq k/ log k (q − 1) log q exp .
qk A log k
d<( log qk
)
(`,d)=1,
d not a power of q
or
Z 1
|Ak (α)| dα.
0
There will be a huge amount of cancellation here.
Let’s look at
k −1 q −1
|Ak (α)| = ∏ ∑ e ( n j q j α ) − e ( a0 q j α ) .
j =0 n =0
Let’s write
b1 b2 b3
α= + 2 + 3 +···
q q q
for 0 ≤ b j ≤ q − 1. Multiplying by q j α yields
b j +1
z+ + εj
q
72 AARON LANDESMAN
1
with z and integer and 0 < ε j ≤ q. Therefore, the distance to the
nearest integer of q j α is well determined by b j+1 . We have good
bounds whenever b j+1 6= 0, q − 1, while at b j+1 = 0, q − 1, we need
to use the rather poor bound of q − 1.
Putting the above together, we have
!
k −1
1
|Ak (α)| ≤ ∏ min q − 1, 1 +
j =0 2 qj α
q−1 if b j+1 = 0 or q − 1
q q −1
= 1 + 2bj+1 if 1 ≤ b j+1 ≤ 2
q q −1
1 + if 2 ≤ b j+1 ≤ q − 2.
2( q −1− b ) j +1
and
Z 1
|Ak (α)| dα ≤ (3 + log q)k .
0
Here |α − qa | = q12 for x.9x ≥ q > x.1 . On the L1 norm in this lemma,
we’re only using a very small power of x. As long as the denom-
inators are not too small or large, we are doing well on the minor
arcs.
Then, we want to bound
1
k
q a
∑ Ak ( a/qk )S(− a/qk )
Writing
k −1
n= ∑ nj qj
j =0
we have
!
k −1 k −1
A j (α) ∑ ∑ nj qj e nj qj α ∏ ∑ e ni qi α
j =06 = a0 n j i =0,i 6= j n i 6 = a0
qk · B
where B was the bound for A (α). Then, integrating it out, we again
get a bound qk (3 + log q)k .
The idea to finish the proof is to use the above lemma’s bound
with a different k. We will continue bounding the minor arcs next
time using our lemmas with other values of k.
12. 11/2/17
12.1. Review. Last time, we wanted to evaluate
−a
1 a
∑ Ak q k S q k .
qk a mod qk
We also found
Z 1
|Ak (α)| dα (3 + log q)k .
0
By a Sobolev type argument, we saw that if
α1 , . . . , α N
are δ spaced (i.e., αi − α j ≥ δ if i 6= j.
Then,
N
1
∑ Ak (α j ) ≤ δ + q (3 + log q)k .
k
j =1
a `
where α = q = d + η.
Proposition 12.2. We have
`
∑ ∑ ∑ Ak
d
+η
D ≤d≤2D (`,d)=1 η
qk |η |∼ B
qk (η +`/d)∈Z
αq
k 2
( q − 1) D B
where
q
log q −1 ( B + log q)
αq =
log q
ANALYTIC NUMBER THEORY NOTES 77
Then,
αq q
D 2αq
= q k1
= (3 + log q)k1 .
q−1
where qk1 ∼ D2 , qk2 ∼ B.
Proof. We’ll now split this sum into
the first k1 digits
the middle k − k1 − k2 digits
the last k digits
2
with k1 − 4k2 ≤ k.
The first k1 digits is dominated by Ak1 (α) . For the middle digits,
there are (q − 1)k−k1 −k2 , each bounded by 1, so we get (q − 1)k−k1 −k2
as a bound. For the last digits, we get
Remark 12.3. Thinking about what we are doing, there are D2 points
of the form `/d and about B points η near each `/d. Given a fixed
`/d, we are multiplying it by something corresponding to each of
the B well spaced B intervals. Then, we choose qk1 on the scale of D2
and qk2 on the scale of B.
We can bound
!
` ` 1 `
Ak + η ( q − 1) k − k 1 − k 2 sup Ak 1 +η Ak 2 q k
+η .
d |η |∼ B/qk
d qk2 d
Note that the last Ak2 term corresponds to B well spaced points mod
1. That is there are B 1k2 (since we chose qk2 ∼ B, and in fact we will
q
need qk2
< B).
Now, we sum this over `, d, η. By a lemma from last time, fixing
`, d and summing over η, we get
1 `
∑ Ak 2 q k 2 q d + η
k
∼ qk2 (3 + log q)k2 .
η
78 AARON LANDESMAN
∑ λ(n)e (nα)
n≤ x
1
We have d ≤ qk/2 and |η | ≤ dqk/2
. Therefore,
!
qk
` 3
q
4k/5 k
S +η q +√ + qk D log q .
d D
103 qk 1 qk
v≥ .
BD 10 BD
So, in this case, we can redo our previous argument with a larger
denominator. Using the bound from before, we see
!
`
q k 3
s + η q4k/5 + √ log qk .
d BD
using that
√
qk BD
q3k/4
qk/2
and so we can absorb the third term into q4k/5 .
80 AARON LANDESMAN
Now, DB ≤ qk/2 and D ≤ qk/2 , so we just need αq < 15 for the first
term to be sufficiently small and we need αq < 41 for the second term
to be sufficiently small. We we just need to check αq < 15 .
Let’s now examine this constraint. Recall
q
log q−1 ( B + log q)
αq =
log q
If q is sufficiently large, this will hold. Indeed, for q > 2 · 106 this will
hold.
For example, if αq = .19, the first term is bounded by q−.01k and
−.12A
the second term is bounded by ( DB)−.12 ≤ log qk , and we
can choose A as large as we want so that this savings dominates
5
log qk .
Exercise 12.5. Work out any changes in the case that q is composite.
Hint: There is essentially no difference. We only used some sim-
plifications for computing the major arcs. We divided major arcs
into cases that the denominators are powers of the modulus q. One
would then have to work out differences when the denominator di-
vides q, or something like that.
Remark 12.6. For further ideas along this line, look at Piatetski-Shapiro
yielding primes of the form bnα c for α = 1.01.
13. 11/7/17
Today we’ll start talking about something new, the Bombieri-Vinogradov
Theorem. This tells us about the distribution of primes in arithmetic
progressions. Let Ψ( x; q, a) denote the number of primes up to x
congruent to a mod q. Let ( a, q) = 1. We are looking to estimate
x
E ( x; q, a) := Ψ ( x; q, a) −
φ(q)
ANALYTIC NUMBER THEORY NOTES 81
for an ∈ C.
Instead of taking the sum in the above goal from n = 1 to N we
can re-parameterize the sum as
M+ N N
∑ an e (nα) = ∑ a M+ N e (( M + n)α)
n = M +1 n =1
N
= ∑ a M+ N e( Mα) · e (nα)
n =1
82 AARON LANDESMAN
2 !
R N
1 N
∑ ∑ an e ( N + ) ∑ | a n |2
nα j
j =1 n =1
δ n =1
2
N Z α j +δ/2
1 2
∑ an e ∑ an e(nα)
nα j dα
n =1
δ α j −δ/2
Z α j +δ/2
! !
+
α j −δ/2
∑ an e(nα) ∑ nan e(nα) dα.
n n
ANALYTIC NUMBER THEORY NOTES 83
(2)
!1/2 !1/2
M N
∑ ∑ amn xm yn C ∑ | xm | 2
∑ |yn | 2
m =1 n =1 m n
(3)
2
N M
∑ ∑ amn xm ≤ C ∑ | x m |2 .
n =1 m =1 m
Exercise 13.8. Show one of the above three inequalities holds for all
choices of x, y if and only if the other two do. I.e., show the above
three statements are equivalent.
By duality, in order to give the desired bound, it suffices to bound
N R
∑ ∑ br e(nαr )
n =1 r =1
84 AARON LANDESMAN
N R N
∑ ∑ br e(nαr ) = ∑ br bs ∑ e (n (αr − αs )) .
n =1 r =1 r,s≤ R n =1
Since αr and αs are all well spaced, the terms in the exponentials
won’t be close to integers very often.
There are two types of terms, those with r = s, in which case we
get a contribution of N ∑r |br |2 .
There are also the off-diagonal terms with r 6= s. Here,
N
1
∑ e (nθ )
||θ ||
n =1
where ||θ || is the integer nearest to θ. We can then estimate the sum
of the off diagonal terms by
!
1 1
∑ |br |2 + |bs |2 ||αr − αs || ∑ |br |2 ∑ ||αr − αs ||
r 6=s r s 6 =r
! !
R
1
∑ |br |2 ∑ jδ
r j =1
!
1
∑ |br | 2
log R
r δ
!
1 1
∑ |br |2 log .
r δ δ
We now have an extra factor of log, and we will now explain how
to remove this factor of log . We will set it up, but won’t really carry
it out.
ANALYTIC NUMBER THEORY NOTES 85
= ∑ br bs ∑ Φ̂ (k + αr − αs ) .
r,s k
using Poisson summation. The Fourier transform is large at 0 (around
N + 2δ ). You can get the rate of decay by integrating by parts many
times. One can learn about the decay from the derivative of Φ.
The Fourier transform is approximately supported on an interval of
length δ , apart from some small fluctuations. Since Φ̂(k + αr − αs )
never gets within δ of an integer, it is always close to 0 when r 6= s.
Therefore, including the contribution at r = s, we get the sum is well
estimated by
R
Φ̂(0) ∑ |br |2
r =1
and we save the log term.
Exercise 13.9 (Involved exercise). Complete the above sketch into a
proof
Remark 13.10. Here is a problem: Can on e choose Φ ≥ 0, Φ ≥ χ[1,N ]
and Φ̂ supported in (−δ, δ) minimizing Φ̂(0)?
86 AARON LANDESMAN
We will next deduce the large sieve from the above theorem. Say
we have
a
: q ≤ Q, ( a, q) = 1
q
1
which is about Q2 points each Q2
spaced.
Q 2
∗ M+ N M+ N
an
∑ ∑ ∑ an e
q
≤ N + O ( Q2 ) ∑ | a n |2 .
q=1 a mod q n= M+1 n = M +1
= ∑ µ(q)
n∈[ M+1,M+ N ],n prime
= µ(q) (π ( M + N ) − π ( M))
Combining the above, the left hand side of the Large sieve is bounded
by
Q 2
∗ M+ N
µ ( q )2
an
∑ ∑ ∑ an e
q
≥ ∑ φ
(q) (π ( M + N ) − π ( M))2
q=1 a mod q n= M+1 q≤ Q
µ ( q )2
∑ φ
(q) (π ( M + N ) − π ( M))2 ≤ N + O( Q2 ) (π ( M + N ) − π ( M)) .
q≤ Q
ANALYTIC NUMBER THEORY NOTES 87
If we make the Q too big, this O( Q2 ) term will start to dominate. So,
we might want Q2 to be something like Q = N 1/2−ε . The bound then
becomes
1
N (1 + o (1)) .
∑q≤ N 1/2−ε µ(q)2 /φ(q)
x xβ
Ψ( x; q, a) = − χ( a)
φ(q) φ(q) β
for χ a quadratic character. If one could replace the 2 by 1.99 one
would imply there are no Siegel zeros.
Exercise 13.17 (What is large about the large sieve). For primes we
used that one residue class is forbidden and so we get some im-
balances. Now, more generally, suppose we have S ⊂ [1, N ] with
p +1
|S( mod p)| ≤ 2 . Use the Large sieve to show
|S| ≤ N 1/2+ε .
Here, the sieve is large because we are forbidding a large
√ number of
residue classes. Say here the primes p range up to p ≤ N.
Remark 13.18. This bound is tight because if we take S to be the set
of squares, we get the claimed number of residue classes.
Remark 13.19. There is a conjecture of Helfgott and Venkatesh say-
ing that if one is missing half the residue classes and do have half the
residue classes, it should look like some quadratic polynomial.
14. 11/9/17
Last time we discussed the Large sieve in its additive form. That
is, if α1 , . . . , α R are δ well spaced, then
R M+ N
1
∑ ∑ an e(nαr ) ≤ N + O δ ∑ |an |2 .
r =1 M +1
ANALYTIC NUMBER THEORY NOTES 89
a
One way we’ll apply this is by taking q with ( a, q) = 1, q ≤ Q, R =
1
Q2 , δ ∼ Q2
and obtaining
2
∗ M+ N
an
∑ ∑ ∑ a(n)e
q
≤ N + O( Q2 ) ∑ | a(n)|2 .
q≤ Q a mod q n= M +1
We have
M+ N M+ N
1 an
∑ a(n)χ(n) = τ (χ) ∑ χ( a) ∑ a(n)e q
.
n = M +1 a mod q n = M +1
Let
M+ N
a an
S := ∑ a(n)e .
q n = M +1
q
We wanted to bound
2
∗
2 1 ∗ a
∑ ∑ a(n)χ(n) = ∑
q χ mod ∑ χ( a)S( q )
χ mod q q a mod q
2
1 a
≤ ∑
q χ mod ∑ χ( a)S( q )
q
a mod q
2
φ(q) ∗ a
= ∑
q a mod q
S
q
.
√
using that |τ (χ)| = q.
So, using the above and the large sieve,
2 2
∗ M+ N ∗
q a
∑ φ(q) ∑ ∑ a(n)χ(n) ≤ ∑ ∑ S
q
q≤ Q χ mod q n = M +1 q≤ Q a mod q
M+ N
≤ N + O ( Q2 ) ∑ | a(n)|2 .
n = M +1
and so
1
φ(q) ∑
Ψ( x; q, a) = χ ( a ) ∑ Λ ( n ) χ ( n ).
χ n≤ x
Then, we get
x 1
φ(q) χ∑
max Ψ( x; q, a) − ≤ |Ψ( x, χ)|
( a,q)=1 φ(q) 6=χ 0
We have χ(n) = χe(n) if (n, q) = 1. If (n, q) > 1 bun (n, qe) = 1 then
the two could be different.
92 AARON LANDESMAN
(log x )2 .
by
∗
1
(14.1) ∑ ∑ ∑ φ(q)
| Ψ ( x, χ
e )| + O Q ( log x ) 3
14.4. The case R is large. Now, let’s deal with the range
√
10A x
(log x ) ≤R≤ .
(log x ) B
We now use the trick of decomposing Λ(n) via Vaughan’s identity.
We have
∑ Λ(n)e (nα)
which by Vaughan’s identity yields a good bound
∑ ∑ am bn e(mnα).
m n
we have
−ζ 0 −ζ 0
(s) = (s) − P(s) (1 − ζ (s) M(s)) + P(s) − ζ 0 (s) M(s) − ζ (s) M(s) P(s)
ζ ζ
with the first term on the right a type 2 sum and the latter three terms
Type 1 sums.
We’d now like to try to bound what all these terms give us.
First, let’s consider the type 2 sum.
14.5. Bounding the type 2 sum. Recall we are trying to bound some
sum of terms of the form ∑n Λ(n)χ(n).
Expanding Λ(n) using Vaughan’s identity, we get some bound of
the form
∗
∑ ∑ ∑ ∑ Λ(m) + ∑ µ ( d ) χ ( m ) χ ( n ).
R≤q≤2R χ mod q m>U n>V,mn≤ x d|n,d>V
∑ µ(d)
d|n,d>V
Next time we’ll deal with the other two terms. Let’s just give an
idea of how to deal with one of them now. We’re trying to bound the
term corresponding to
ζ ( s ) M ( s ) P ( s ).
We want to bound
∗
∑ ∑ ∑ Λ(m)µ(n) ∑ χ ( k ) χ ( m ) χ ( n ).
q∼ R χ mod q m≤U,n≤V k≤ x/mn
∑ χ(k)
k≤ x/mn
15. 11/14/17
15.1. Brun-Titchmarsh. Recall a few classes ago, we were trying to
bound π ( M + N ) − π ( M). We wanted to bound
∗
ap
∑ ∑ e
q
= µ(q) (π ( M + N ) − π ( M)) .
a mod q M+1≤ p≤ M+ N
One then gets a bound to which one can now use the large sieve.
We had two ranges. If R small, like R ≤ (log x )10A we could use our
bounds for |Ψ ( x, χ)|. To conclude, we only needed to deal with
√
10A x
(log x ) ≤R≤
(log x ) B
using Vaughan’s identity and the large sieve.
ANALYTIC NUMBER THEORY NOTES 97
Recall
−ζ 0 ζ0
(s) − P(s) (1 − ζ M(s)) = − (s) − P(s) + ζ 0 (s) M(s) + ζ (s) M(s) P(s).
ζ ζ
with
Λ(n)
P= ∑ ns
n ≤U
µ(n)
M= ∑ ns
n ≤V
We’ll also bound this crudely, forgetting about the sums over M and
N, and get cancellation from the sum over k.
√
max
x
∑ χ(n) q log q
n≤ x
This yields,
1 an
χ(n) = ∑ χ( a)e
τ (χ) a mod q
.
q
therefore, we have
1 an
∑ χ(n) = τ (χ) ∑ χ( a) ∑ e q
.
n≤ x a mod q n≤ x
Summing
an
∑e q
n≤ x
as a progression, we have
an 1
∑e q
≤ min x,
a
.
n≤ x q
ANALYTIC NUMBER THEORY NOTES 99
We also have
1
|τ (χ)| = O( √ ).
q
a | a|
We know ∑n≤ x χ(n) ≤ x. Say −q/2 ≤ a ≤ q/2. Then, q ∼ q
q
and we use the bound x if | a| ≤ a and the bound q/| a| if | a| > q/x.
Therefore, we obtain a bound
1 an
∑ χ(n) = τ (χ) ∑ χ( a) ∑ e q
n≤ x a mod q n≤ x
1
√ (q log q)
q
√
= q log q.
n 1 n an
∑ χ(n)Φ x
= ∑ χ( a) ∑ Φ x e q
τ (χ) a mod
n q n
1 a
= ∑ χ( a) ∑ x Φ x k + q
τ (χ) a mod
b
q k
1 kq + a
= ∑ χ(a) ∑ xΦb x
τ (χ) a mod q
q k
=
1
∑ b xm .
χ(m) x Φ
τ (χ) m q
∑ χ(n) = o(x)
n≤ x
if x ≥ q1/4+ε .
Exercise 15.9. If χ is quadratic modq for q a prime, then then the
least quadratic non-residue (lqnr) modq is at most q1/2 log q. Gauss
1/2
√ ≤ q . A trick of Vinogradov allows you to save a
showed lqnr
factor of e and Polya Vinogradov yields
√
lqnr ≤ q1/(2 e)+ε
Burgess implies
√
lqnr ≤ q1/(4 e)+ε
.
15.4. Some more extended exercises.
Exercise 15.10 (Difficult, theorem of Goldfeld).
Question 15.11 (Open question, Sophie Germian). Are there infin-
itely many primes p with p − 1 = 2q with q a prime.
A weakening of the above statement would be to find primes p so
that p − 1 has a large prime factor.
Let
P( x )
be the largest prime dividing x The exercise is to show that there are
lots of primes p ≤ x so that
P( p − 1) ≥ x1/2+δ
for some δ > 0. Hint: The point of this exercise is to use Bombieri
Vinogradov. Here is the idea of the proof. Suppose there were lots of
such primes, say
∑ ∑ Λ(d) ∼ x
p≤ x q | p −1
102 AARON LANDESMAN
One can keep track of small prime factors, but occasionally it might
have a very large prime factors.
Conjecture 15.16 (Montgomery’s conjecture). We have
!
Ψ( x ) x1/2+ε
Ψ( x; q, a) = +O √ .
φ(q) q
Remark 15.17. There reasoning behind this is that the error term is
approximately
1
φ(q) χ mod∑
χ(n)Ψ( x, χ)
q,χ6=χ 0
16. 11/16/17
Today we’ll begin a discussion of gaps between primes. Let pn be
the nth prime. By the prime number theory, pn ∼ n log n. Hence, on
average, pn+1 − pn ∼ log n.
Question 16.1 (Open question). What can we say about the distribu-
tion of
p n +1 − p n
log n
as n varies?
To make sense of this question, we can pick an interval (α, β) ⊂
R>0 and ask about
p n +1 − p n
1
lim # n ≤ N : ∈ (α, β) .
n→∞ N log pn
How might we guess this? There is a naive model called the Cramer
model. This is clearly bogus, but also reasonably successful. Define
the random variable Xn by
(
1 with probability log1 n
Xn : =
0 with probability 1 − log1 n
for n ≥ 3.
Now, let’s count the probability
Prob ( Xn+1 = Xn+2 = · · · = Xn+h−1 = 0, Xn+h = 1, given Xn = 1) .
This is asking for the chance of a gap of size h.
To calculate this, thinking of h as small compared to log n. we see
this is approximately
h −1
1 1 1
1− ∼ e−h/ log n .
log n log n log n
Thinking of this a different way, looking at the interval [n, n + h],
we can ask for the chance there are exactly k values for which Xm =
1. We can also handle this quite easily. We can pick k numbers to be
ANALYTIC NUMBER THEORY NOTES 105
Example 16.2. So the guess one might obtain from this is that in the
interval
[n, n + log n]
the chance of finding k primes is about
1 −1
e
k!
So, we might make the following conjecture.
Conjecture 16.3. If h = λ log n, then as n → ∞ chosen randomly,
then The number of primes in [n, n + h] is Poisson with parameter λ.
That is,
1 λk −λ
# {n ≤ N : [n, n + h] contains k primes } ∼ e .
N k!
Remark 16.4. Saying this another way,
p n +1 − p n
Z β
1
# n≤N: ∈ (α, β) ∼ e− x dx.
N log pn α
Example 16.5. One could ask the same sort of question about any
subset of the integers (or discrete subset of the real numbers). For
example, say we would like to know the spacing of the zeros of the
zeta function. Say they are of the form 1/2 + iγn with
2πn
γn ∼ .
log n
The spacings of
log n
γn + 1 − γn ∼1
2π
on average. We can ask now about the distribution. These are not
expected to behave like a Poisson process.
Question 16.6. Why should we believe the above conjecture?
Well, there is in fact a better conjecture going back to Hardy and
Littlewood.
106 AARON LANDESMAN
1 −k
νH ( p)
S(H ) : = ∏ 1 − 1− .
p p p
0
∑ Λ(n)e(nα) ∑ Λ(m)e(−mα) e(2α)dα.
n≤ N m≤ N
Compute what the major arc contribution is. When one computes
this, one might have a guess as to what the answer should be. There
will be a major arc around 0 and a major arc around 1. Hint: Here,
take α close to qa with error roughly n1 . Consider
a
∑ Λ(n)e q n + nβ .
n
Assume Λ(n) behaves like log n from the prime number theorem,
and similarly put in an estimate from the prime number theorem on
arithmetic progressions. One should also check we get 0 as our main
term if we put e(α) instead of e(2α) above.
ANALYTIC NUMBER THEORY NOTES 107
0
∑ Λ(n)Λ(n + 2)e(nα) ∑ Λ(m)e(−mα)e(6α .
n≤ N m≤ N
Remark 16.14. Hardy and Littlewood did not like this probabilistic
argument because it is assuming primes
are independent.
That is,
1 2e−γ N
when one considers π ( N ) ∼ N ∏ p≤√ N 1 − p ∼ log N .
17. 11/28/17
Last time, we were discussing the Hardy Littlewood conjecture:
Conjecture 17.1 (Hardy-Littlewood). If you have a set
H : = { h1 , . . . , h k }
then
x
# {n ≤ x : n + h primes } ∼ S(H )
(log x )k
where
−k
1 ν ( p)
S(H ) = ∏ 1− 1− H
p p p
ANALYTIC NUMBER THEORY NOTES 111
with
νH ( p) = #H mod p
Remark 17.3. Sieve methods can give upper bounds for the number
x
of prime k-tuples asymptotic to k . We have already seen one
(log x )
version of this, which is the large sieve.
Exercise 17.4 (Extended exercise). Use the large sieve to give such
an upper bound. Recall the large sieve looked at some exponential
sum. One can try to bound the number of inadmissible tuples over
all possible primes.
√
∑ ∑ λd
x ≤n≤ x d|(n+h1 )···(n+hk )
√
∑ ∑ λd ≥ # { R < n ≤ x : n + h1 , . . . , n + hk are all prime } .
x ≤n≤ x d|(n+h1 )···(n+hk )
112 AARON LANDESMAN
∑ ∑ ∑ ∑
λd = λ d1 λ d2 1
.
√ √
x ≤n≤ x d|(n+h1 )···(n+hk ) d1 ,d2 x ≤n≤ x
[d1 ,d2 ]|(n+h1 )···(n+h2 )
We’d now like to diagonalize this quadratic form and read of the
diagonal entries by Cauchy Schwartz. Now, d1 , d2 are tied together
by the lcm function. Let (d1 , d2 ) denote the gcd of d1 , d2 . Let a =
(d1 , d2 ) be the gcd. Then, let d1 = ar1 , d2 = ar2 . We obtain
λ ar1 λ ar2
∑ ∑ ar1 r2
f ( ar1 r2 ) .
a r1 ,r2 ,
(r1 ,r2 )=1
By multiplicativity, we have
f ( ar1 r2 ) = f ( a) f (r1 ) f (r2 ).
Next, use Möbius inversion to remove the coprimality condition. We
have
(
1 if (r , r2 ) = 1
∑ µ(b) = 0 else 1
b|(r ,r )1 2
Then,
λ ar1 λ ar2
∑ ∑ ar1 r2
f ( ar1 r2 )
a r1 ,r2 ,
(r1 ,r2 )=1
f ( a ) µ ( b ) f ( b )2 λ abs1 λ abs2
= ∑∑ ∑ f ( s1 ) f ( s2 )
a b
a b2 s1 ,s2 s1 s2
f ( a ) µ ( b ) f ( b )2 λ abs
= ∑∑ ∑ f (s)
a b
a b2 s s
f ( a ) f ( b )2
= ∑ ξ d2 ∑ µ(b)
d ab=d
ab2
f (d) f (b)
= ∑ ξ d2 ∑ ∑ µ(b) .
d ab=d
d b|d
b
Then, let
f (b)
h(d) := ∑ µ(b) .
b|d
b
114 AARON LANDESMAN
We may observe
f ( p)
h(d) = ∏ 1−
p
.
p|d
f (d) f (b) f (d)
∑ ξ d2 ∑ d ∑ b
µ(b) = ∑ d
h(d)ξ d2 .
d ab=d b|d d
λdbt f (bt)
= ∑ µ(b) ∑
b t bt
µ(b) f (b)
=∑ ξ db .
b
b
We have
!2 ! !
µ(b) µ ( b )2 f ( b ) f (d)
1= ∑ b f (b)ξ b ≤ ∑ b h(b) ∑ d
h(d)ξ d2
b b d
The equality case of Cauchy Schwartz occurs when the vectors are
proportional to each other, which occurs when
µ(b)
ξb ∼ .
h(b)
ANALYTIC NUMBER THEORY NOTES 115
(log R)k
∼ .
k!
So, we should expect
µ ( b )2 f ( b ) (log R)k
∑ bh(b) ∼ k!
b≤ R
up to multiplying by some convergent Euler factor to mitigate our
estimates above.
Exercise 17.6. Verify that this Euler factor is S(H )−1 , meaning
k
µ ( b )2 f ( b ) −1 (log R )
∑ bh(b) = S ( H )
k!
+ lower order terms .
b≤ R
116 AARON LANDESMAN
with
log( R/d)
λd = µ(d) .
log R
ANALYTIC NUMBER THEORY NOTES 117
Then,
λd λd µ ( a )2 µ (r ) µ ( s )
∑ [d11, d22] = ∑ a r s
.
d1 ,d ≤ R
2
a,r,s
r ≤ R/a
s≤ R/a
for R/2 ≤ a ≤ R.
Exercise 17.13 (difficult extended exercise). Then,
µ ( d1 ) µ ( d2 )
∑ [ d 1 , d 2 ]
→ c 6= 0
d1 ,d2 ≤R
as R → ∞.
The sieve accounts for the above by replacing
µ ( a )2 µ (r ) µ ( s )
∑ a r s
.
a,r,s
r ≤ R/a
s≤ R/a
by
µ ( a )2 µ (r ) µ ( s )
log R/ar log R/as
∑ a r s log R log R
.
a,r,s
r ≤ R/a
s≤ R/a
∑ ∑ λd
n + h1 = p ≤ x d|(n+h2 )···(n+hk )
18. 11/30/17
Last time, we discussed Selberg’s sieve. We proved bounds like
S(H ) x
# {n ≤ x : n + h1 , . . . , n + hk are all prime } ≤ (1 + o (1)) 2k k! .
(log x )k
This was shown by considering a quadratic form
2
∑ λd
d|(n+h1 )···(n+hk )
with
k
log R/d
λd ∼ µ(λ) .
log R
with R = x1/2−ε .
Exercise 18.1. Another way to find this is to consider
2
∑ ∑ λd
n≤ x,n+h1 prime d|(n+h1 )···(n+hk )
taking R = x1/4−ε . Then, show that one gets a bound which is better
when k = 2, but not for other k, of the form
2
S(H ) x
∑ ∑ λd ≤ (1 + o (1)) 4k−1 (k − 1)! .
n≤ x,n+h1 prime d|(n+h1 )···(n+hk ) (log x )k
Exercise 18.2. Use Selberg’s sieve to bound
n o
# n ≤ x : n2 + 1 is prime .
Use sieve weights summing over polynomial values. That is, bound
2
n o
# n ≤ x : n2 + 1 is prime ≤ ∑ ∑ λd
n≤ x d | n2 +1
ANALYTIC NUMBER THEORY NOTES 119
a(n) = ∑ λd
d|(n+h1 )···(n+hk )
and
Q2 ( λ ) : = ∑ a ( n ).
n≤ x,n+h j prime
120 AARON LANDESMAN
f ([d1 , d2 ])
x· ∑ λ d1 λ d2
[ d1 , d2 ]
+ O ( R2 x ε )
d1 ,d2
with
f ( p) := νH ( p)
k ∑ λ d1 λ d2 ∑ 1
d1 ,d2 n≤ x,
n+h1 prime ,
(n+h2 )···(n+hk )≡0 mod [d1 ,d2 ]
x g ([d1 , d2 ])
.
log x φ ([d1 , d2 ])
x λ d1 λ d2
k ∑
log x d ,d φ ([d1 , d2 ])
g ([d1 , d2 ]) .
1 2
λds f (s) 2
f (d) f ( p)
=∑
d ∏ ∑ s
1− .
d p|d
p
λds g(s) 2
kx g(d) g( p)
log x ∑ φ(d) ∏ ∑ φ(s)
= 1− .
d p|d
φ( p)
For both these cases, the first step is to understand the rightmost
terms
λds f (s) 2 λds g(s) 2
∑ s and ∑
φ(s)
.
g(l )l
where h(l ) is either f (l ) or φ(l ) . In both cases, h is multiplicative.
Usually h( p) ∼ w with w = k − 1 or k.
Take
p ( t ) (0) t
P(y) = ∑ t!
y
t≥k
Lemma 18.4. For c > 0 and (c) the corresponding vertical line in the
complex plane,
(log z)t
(
1 zs
Z
t! if z > 1
ds =
2πi (c) st+1 0 if z < 1
122 AARON LANDESMAN
Proof. Either move the line of integration to the left picking up the
pole at t = 0. If z < 1 move the line of integration to the right, and
there is no pole so the integral is 0.
We now want to understand
∞
!
P ( t ) (0) µ(dl ) R s ds 1
Z
∑ 2πi (c)
∑ l 1+ s h ( l ) d st+1 (log R)t
.
t≥k `=1
Using the lemma, we can evaluate this, which only gives a nonzero
result if d < R. We have
∞
µ(dl )
∑ 1+ s
h(l )
`=1 l
for T the tame Euler factor from above, and T |s=0 denoting the eval-
uation of T at s = 0.
We can simplify the above to
T | s =0
(w) log R/d
P .
(log R)w log R
To finish this argument, we compute
1 s
h( p)
T | s =0 = ∏ 1 −
p ∏
1− µ(d)
p p-d
p
−w −w !
1 h ( p ) 1
= ∏ µ( p) 1 − ∏ 1− 1− .
p|d
p p-d
p p
Let’s plug this in to our first quadratic form and see what we get.
ANALYTIC NUMBER THEORY NOTES 123
= ζ (s + 1)k T2 |s=0
and then we want to understand the other corresponding term. We
have
( )
1 k 1 −2k f ( p) 2 1 2k
f ( p) f ( p)
∏ 1− p 1−
p
1−
p
+
p
1−
p
1−
p
.
p
1 −k
f ( p)
= ∏ 1− 1−
p p p
= S(H ).
Then, by partial summation,
−2k
x 2 f (d) f ( p) 1
∑ µ ( d ) ∏ 1 − 1 −
(log R)2k d d p|d
p p
! !
f ( p) 2
2
1 −2k
(k) log R/d
· ∏ 1− 1− P
p-d
p p log R
!
Z R 2 k
x log R/z log R ( log z )
∼ 2k 1
P(k) d S(H )
(log R) k!
xS(H ) y k −1
Z 1
∼ P(k) (1 − y)2 dy.
(log R)k 0 ( k − 1) !
124 AARON LANDESMAN
One does a similar calculation for Q2 (λ). One can similarly compute
the Euler product, and one again gets
xS(H ) y k −2
Z 1
Q2 ( λ ) = k P(k−1) (1 − y)2 dy.
log x (log R)k−1 0 ( k − 2) !
All we need is to find a polynomial P to make the ratio Q2 (λ)/Q1 (λ) >
1 subject to the condition that R < x1/4 . It’s advantageous to make
R as large as possible in terms of x but R < x1/4 . These quadratic
forms we have only depend on the polynomial P. Next time, we’ll
finish this.
It will end up happening that when R = x1/4−ε you get this ratio
to be just under 1.
But, one can even get this ratio to tend to infinity thinking of this
as a higher dimensional problem, using an argument of Maynard.
19. 12/5/17
Let H := {h1 , . . . , hk } be an admissible tuple so that S(H ) 6= 0.
We want to compare
2
Q1 : = ∑ ∑ λd
n≤ N d|(n+h1 )···(n+hk )
with
2
k
Q2 : = ∑ ∑ ∑ λd .
j=1 n≤ N,n+h j prime d|(n+h1 )···(n+hk )
Therefore,
1
Z
1 a+b
= xna (1 − xn )b .
a+1 a 0
Simplifying (19.1) we get that it suffices to check
k log R (k − 2)! (2l )! (k − 1)!l 2 (2l − 2)!
( k − 1) < .
log N (k − 1 + 2l )! (k + 2l − 2)!
126 AARON LANDESMAN
Q1 = ∑ ∑ ∑ λd
h1 ,...,hk ≤ H distinct n≤ N d|(n+h1 )···(n+h+k)
2
Q2 = ∑, ∑ ∑ ∑ λd .
h≤ H h1 ,...,hk ≤ H distinct n≤ N,n+h prime d|(n+h1 )···(n+hk )
be admissible. Consider
2
∑ ∑ d1,...,dk
λ
n∼ x d1 |(n+h1 )
d2 |(n+h2 )
. ..
dk |(n+hk )
and compare it to
2
∑ ∑ λd1 ,...,dk .
n∼ x di |(n+hi )
n+h j prime
Before we had
∑ λd
d|(n+h1 )···(n+hk )
and
λd = ∑ λd1 ,...,dk
d1 ···dk =d
∑ 1
n≤ x
p|n =⇒ p>z
128 AARON LANDESMAN
To count this, if
∏p
p≤z
is very small, we can easily sieve this. The product above is around
log x
ez . For example if z ≤ 106 this is very easy to sieve. For example,
take
W= ∏ p,
p≤log log log x
and compare it to
2
∑ ∑ λd1 ,...,dk .
n∼ x di |(n+hi )
n+h j prime
n≡ν mod W
Then, n + hi , n + h j = 1 for all i and j with n ≡ ν mod W.
So, we want to understand the quadratic form
We the set
!
k λd1 ,...,dk
y f1 ,..., f k = ∏ µ ( f i ) φ ( f i ) ∑ d1 · · · d k .
i =1 f |d i i
Then,
λd1 ,...,dk ∼ µ(d1 ) · · · µ(dk ).
130 AARON LANDESMAN
The Möbius function of the f i then cancel out and the f i mostly cancel
with the φ( f i ). Therefore, the choice of y f1 ,..., f k will look like
log f 1 log f n
y f1 ,..., f k ∼ F ,..., .
log R log R
Then, after an invertible change of variables,
y a1 ,...,ak
λd1 ,...,dk = ∏ µ(d j )d j ∑
.
d |a
φ ( x1 ) · · · φ ( x k )
j j
Then,
x y2f1 ,..., f
Q1 ∼
W ∑ φ( f 1 ) · · · φ( f k )
k
.
f 1 ,..., f k
x φ (W ) k
Z
k
(log R) F ( x1 , . . . , xk )2 dx1 · · · dxk
R W x1 ,...,xk
where
F ( x1 , . . . , x k ) = 0
unless x1 + · · · xk ≤ 1.
Remark 19.3. The (log R)k in the numerator (instead of the denomi-
nator) is a scaling fact relating to how we chose the y f1 ,..., f k .
Let’s now start understanding Q2 , which we will finish on Thurs-
day. This will be similar. We have
2
Q2 ∼ k ∑ ∑ λd1 ,...,dk
n∼ x di | n + hi
n+hk prime
n≡ν mod W
=k ∑ λd1 ,...,dk λe1 ,...,ek ∑ 1
d1 ,...,dk n∼ x
e1 ,...,ek n≡ν mod W
d k =1 n+hk prime
e k =1 n+hi ≡0 mod [di ,ei ]
ANALYTIC NUMBER THEORY NOTES 131
20. 12/7/17
Recall last time we had an admissible set
H = { h1 , . . . , h k }
and we chose
W= ∏ p
p≤log log log x
where
ν mod W with (ν + hi , W ) = 1 for all i
We had
2
Q1 = ∑ ∑ λd1 ,...,dk .
n≤ x d | n + hi
n≡ν mod W
with
λd1 ,...,dk
∏ µ (ri ) φ (ri ) ∑
yr1 ,...,rk =
ri | di
d1 · · · d k
where
yr1 ...,rk
∏ µ ( di ) di ∑
λd1 ,...,dk = .
di |ri
φ (r1 ) · · · φ (r k )
We chose
log r1 log rk
yr1 ,...,rk = F ,...,
log R log R
132 AARON LANDESMAN
with
F ( x1 , . . . , x k )
supported on x1 + · · · + xk ≤ 1.
We then get
x φ (W ) k
Z
k
Q1 = (log R) F ( x1 , . . . , xk )2 dx1 · · · dxk .
W W x1 ,...,xk
We have
1 φ ((d, e))
=
φ ([d, e]) φ(d)φ(e)
using that
φ(n) = ∑ g(d)
d|n
(k)
By convention we set y f
1 ,..., f k
= 0 unless f k = 1.
ANALYTIC NUMBER THEORY NOTES 133
We then have
2
(k)
x yf
1 ,..., f k
Q2 =
φ(W ) log x ∑ k
.
f 1 ,..., f k ∏ j=1 g ( f j )
Proof. We have
λd1 ,...,dk
∏ µ( f i ) g( f i ) ∑
(k)
yf
1 ,..., f k
=
d1 ,...,dk
φ ( d1 ) · · · φ ( d k )
f i | di
1 yr1 ,...,rk
∏ µ( f i ) g( f i ) ∑ ∏ µ(d j )d j ∑
=
f i | di
φ ( d1 ) · · · φ ( d k ) r1 ,...,rk φ (r1 ) · · · φ (r k )
d k =1 d j |r j
k
µ(d j )d j
∑ ∏ φ ( d j )
.
d1 ,...,d1 =1 j =1
f i | di |ri
f j µ( f j )
p
φ( f j ) ∏ 1−
p−1
.
p |r j / f j
p
Then, 1 − p−1 the above term is relatively small, unless r j = f j , in
which case the product is the empty product and goes away.
134 AARON LANDESMAN
We then see that this almost matches up with our first quadratic
form Q1 . The only difference is that we have two different quadratic
forms based on the function F we are choosing. So, we have boiled
everything down to a problem of optimizing F.
Recall we want F ( x1 , . . . , xk ) to be symmetric and x1 + · · · + xk ≤
1. We will choose
k
F ( x1 , . . . , x k ) = ∏ g(kxi )
i =1
with kxi = ui .
Then, the denominator is similarly given by
1
Z
kk u1 ,...,uk ∏ g(u j )2 du j .
u1 +···+uk ≤k