Bernoulli Numbers and Zeta Functions
Bernoulli Numbers and Zeta Functions
Tsuneo Arakawa
Tomoyoshi Ibukiyama
Masanobu Kaneko
Bernoulli
Numbers
and Zeta
Functions
Springer Monographs in Mathematics
123
Tsuneo Arakawa Tomoyoshi Ibukiyama (emeritus)
(deceased 2003) Osaka University
Osaka, Japan
Masanobu Kaneko
Kyushu University Don Zagier (Appendix)
Fukuoka, Japan Max Planck Institute for Mathematics
Bonn, Germany
Mathematics Subject Classification: 11B68, 11B73, 11M06, 11L03, 11M06, 11M32, 11M35
Two subjects are treated in this book. The main subject is the theory of Bernoulli
numbers, which are a series of rational numbers that appear in various contexts of
mathematics, and the other subject is the related theory of zeta functions, which are
very important in number theory. We hope that these are enjoyable subjects both for
amateur mathematics lovers and for professional researchers. There are easy parts
as well as difficult parts in this book, but we believe that, according to the taste of
the readers, they can enjoy at least some parts of this attractive mathematics. Since
the logical relations between the chapters are rather loose and not a straight course
from the beginning to the end, it would still be worthwhile for readers who pick
only some chapters which fit their tastes and background knowledge. As far as we
know, books whose main subject is Bernoulli numbers are rare. Some parts of this
book consist of rather standard number theory, but our expositions on these subjects
are not always so standard, and some parts are completely new and have not been
written in any reference before.
Now, there are many numbers in the world which have names, but what is a
Bernoulli number? In high-school we learn the formula for the sum of the integers
from 1 to n. Or maybe the formula of the sum of squares 12 C 22 C C n2 is
also written in the standard textbooks for high schools. But if we make the powers
bigger, for example, if we take
16 C 26 C C n6 ;
then what kind of formula can we have? It seems that many people hit upon this kind
of question and it often happens that ambitious high-school students try and find the
solution. In fact, the formula was already known for any power at the beginning
of the eighteenth century. (See Chap. 1, p. 2.) Bernoulli introduced some special
numbers to express this formula. These are the Bernoulli numbers. (For a more
precise history, see Chap. 1.) In this book, this formula of the sum of powers appears
in various places. In fact, in this book, four alternative proofs of this formula will be
given in different places (cf. Sects. 1.2, 4.3, 5.2, 8.3).
v
vi Preface
Well, is it so interesting to give this formula of the sum of powers? Indeed it is.
But if the meaning of Bernoulli numbers were only this, we would not have written
this book. The real meaning of Bernoulli numbers lies in a different place. Let’s
look at the following formulas:
1 1 1 1 1 2
1C 2
C 2 C 2 C 2 C 2 C D ;
2 3 4 5 6 6
1 1 1 1 1 4
1C 4
C 4 C 4 C 4 C 4 C D ;
2 3 4 5 6 90
1 1 1 1 1 1
1 C C C D :
3 5 7 9 11 13 4
(You can find the proofs in Corollary 4.12, p. 61, and Theorem 9.6, p. 148.) At
first glance, these kinds of formulas could give you a mysterious impression, since
a regular sum of simple rational numbers suddenly changes into a transcendental
number like . The left-hand sides of these formulas are values at some special
points of so-called zeta functions, or L-functions, which are another important
subject of the book. An example of zeta functions is given by
1 1 1 1 1
.s/ D 1 C C s C s C s C s C :
2s 3 4 5 6
Here the sum is regarded as a function with respect to the complex variable s. This
function is called the Riemann zeta function. The left-hand sides of the first two
formulas above are the values of .s/ at s D 2 and s D 4, namely .2/ and .4/. The
right-hand sides are related to Bernoulli numbers and generalized Bernoulli numbers
(see below). The true reason that Bernoulli numbers are indispensable for number
theory lies in this type of relation with special values (values at special points) of
zeta functions. A huge number of functions called zeta functions appear in number
theory, and they always contain very important information. We can even say that
there appears a different zeta function for each important number theoretical object.
The zeta functions which are related to Bernoulli numbers are just a part of them.
But still they have something to do with many number theoretical phenomena. For
example, they are connected with a kind of volume in non-Euclidean geometry, with
dimension formulas of spaces of modular forms, and with the number of equivalence
classes of quadratic forms (class number formula), and they appear as constant terms
of Eisenstein series, which are important in the theory of automorphic forms. Also
they are related to the number theory of cyclotomic fields, that is, the systems of
numbers obtained by adjoining the roots of unity to the rational numbers. If we
wanted to explain all of this, we would need several more books, so it is not possible
to do it here, but we have tried to give some hints about those relations.
Now, for the reader’s convenience, we sketch the content of each chapter.
Comments below written in parentheses will not be explained in this book in detail.
Preface vii
In Chap. 1, after explaining the history of how Bernoulli numbers were intro-
duced and giving the definition of Bernoulli numbers using recurrence relations,
we prove the formula for the sum of powers of consecutive natural numbers.
In Chap. 2, we give a formula to write down Bernoulli numbers in a simple
way. There we use Stirling numbers, whose definition is comparatively simple. In
Chap. 3, we will prove the Clausen–von Staudt theorem which gives information
on the denominators of Bernoulli numbers, and also Kummer’s congruence which
gives a congruence relation between Bernoulli numbers. (This latter theorem is
needed when we define p-adic zeta functions.) In Chap. 4, we define generalized
Bernoulli numbers associated to Dirichlet characters. (Through this extension,
the connection with quadratic fields and cyclotomic fields will appear.) Also the
Bernoulli polynomials (a system of polynomials similar to the generating function
of Bernoulli numbers) are defined, and as an application, we give a second proof
of the formula of the sum of powers. The formula expressing the values of the
Riemann zeta function at even positive integers by Bernoulli numbers is also given
here. In Chap. 5, we give the Euler–Maclaurin summation formula, which is very
useful to evaluate sum of values of functions at integers. In the coefficients of this
formula, Bernoulli numbers or Bernoulli polynomials appear. As an application, the
analytic continuation of the Riemann zeta function to the whole plane is proved
and the values of the Riemann zeta function at non-positive integers are beautifully
expressed by Bernoulli numbers. In Chap. 6, the relation between ideals of a
quadratic field and quadratic forms is given and a relation between the generalized
Bernoulli numbers and the class numbers of positive definite quadratic forms is
explained. In Chap. 7, a congruence relation between the class number of imaginary
quadratic fields and the Bernoulli numbers will be given. In Chap. 8, Gauss sums
are introduced and then formulas to describe various sums of the roots of unity with
characters by Gauss sums and generalized Bernoulli numbers are given. Here we
get a third proof of the formula for sums of powers. In Chap. 9, L-functions will
be introduced by modifying the usual definition of the Riemann zeta function by
adjoining characters; then their functional equation is shown by using a contour
integral expression. We will also give a formula to describe values at non-positive
integers of L-functions in terms of generalized Bernoulli numbers. This formula
gives a “raison d’être” for the generalized Bernoulli numbers. In Chap. 10, among
the zeta functions of prehomogeneous vector spaces, we pick an example which
apparently does not seem so simple but can be shown to be a very easy function.
(This is an example connected with the appearance of Bernoulli numbers in the
dimension formula for modular forms.) In Chap. 11, we give an alternative proof of
Kummer’s congruence by considering p-adic measures. (This has a deep connection
with p-adic L-functions.) In Chap. 12, we review the fact that Bernoulli numbers
appear in the Taylor expansion of the (co)tangent function, and ask what would
emerge if we replace the (co)tangent with elliptic functions (a well-worn device
to generalize phenomena on trigonometric functions), and explain the theory of
Hurwitz numbers. In Chap. 13, we show the analytic continuation of the Barnes
multiple zeta function, which is a generalization of the Hurwitz zeta function, and
explain relations between their special values and Bernoulli polynomials. We also
viii Preface
show the functional equation of the double zeta functions in the same way as in
Chap. 9. In Chap. 14, we define poly-Bernoulli numbers, which are obtained by
replacing the usual generating function of Bernoulli numbers by polylogarithm
functions. In an Appendix by Don Zagier, some “curious and exotic” identities for
Bernoulli numbers are given.
As explained above, Bernoulli numbers are related with a great many things.
But Bernoulli numbers do not have so many of their own problems to be solved,
except for a few very difficult problems. So, maybe there exist few mathematicians
who study only Bernoulli numbers. Since the situation is like that, there are not
many books similar to this one, and it was not so easy to have a perspective on
the wide range of relations for Bernoulli numbers. Although the present authors
are specialists of number theory, we dare not say we are specialists of Bernoulli
numbers. This book consists partly of quotations from the papers by authors
who occasionally encountered Bernoulli numbers from their different interests,
and partly of various memoranda (namely these are observations which are not
necessarily very original results worthy to be written as careful scientific papers,
but rather lightly checked, interestingly interpreted, or calculated for fun and worth
being taken note of so as not to be forgotten). And this kind of connection is
indeed the usual common connection of mathematicians to the Bernoulli numbers.
In this sense, we believe that this book is useful for any reader who is interested in
mathematics.
The original edition of this book was published in Japanese by Makino Publisher
in Japan in 2001. We would like to give our sincere thanks to Professor Don B.
Zagier for recommending us enthusiastically to write this English version based
on that Japanese book, for recommending this book to Springer-Verlag to be
published, and also for writing the nice Appendix. We would like to thank Springer-
Verlag for publishing this book, in particular Dr. Joachim Heinze of Springer
and Ms. Chino Hasebe and Mr. Masayuki Nakamura of Springer Tokyo for their
valuable help for the arrangement, and Dr. Alexander Weisse for his help with TeX.
We would also like to thank anonymous referees of the English version for a lot
of valuable comments. We would like to give our deep thanks to Mr. Suenobu
Makino, the owner of Makino Publishers, who originally suggested us the theme
of Bernoulli numbers and recommended to write the book in Japanese. Without his
recommendation, the Japanese version would not have appeared. Tsuneo Arakawa,
one of the co-authors for the Japanese version, passed away on October 3, 2003 at
the age of 54 before this English project started. Since some parts of this book are
based on his Japanese manuscript, we would like to keep his name as one of the
authors. The second- and the third-named authors express their profound regret and
sadness over the early death of Tsuneo Arakawa.
1 Bernoulli Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 1
1.1 Definitions: Introduction from History . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 1
1.2 Sums of Consecutive Powers of Integers
and Theorem of Faulhaber .. . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 6
1.3 Formal Power Series . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 13
1.4 The Generating Function of Bernoulli Numbers . . . . . . . . . . . . . . . . . . . 20
2 Stirling Numbers and Bernoulli Numbers . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 25
2.1 Stirling Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 25
2.2 Formulas for the Bernoulli Numbers Involving
the Stirling Numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 34
3 Theorem of Clausen and von Staudt, and Kummer’s
Congruence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 41
3.1 Theorem of Clausen and von Staudt . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 41
3.2 Kummer’s Congruence .. . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 43
3.3 Short Biographies of Clausen, von Staudt and Kummer . . . . . . . . . . . 46
4 Generalized Bernoulli Numbers . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 51
4.1 Dirichlet Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 51
4.2 Generalized Bernoulli Numbers . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 53
4.3 Bernoulli Polynomials . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 55
5 The Euler–Maclaurin Summation Formula and the
Riemann Zeta Function .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 65
5.1 Euler–Maclaurin Summation Formula . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 65
5.2 The Riemann Zeta Function . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 67
6 Quadratic Forms and Ideal Theory of Quadratic Fields .. . . . . . . . . . . . . . 75
6.1 Quadratic Forms .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 75
6.2 Orders of Quadratic Fields . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 77
6.3 Class Number Formula of Quadratic Forms . . . .. . . . . . . . . . . . . . . . . . . . 87
ix
x Contents
References .. .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 263
Index . . . . . . . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . . . . . . . . . . . . . 269
Chapter 1
Bernoulli Numbers
In his posthumous book Ars Conjectandi [16] published in 1713 (the law of large
numbers in probability theory is stated in this book), Jakob Bernoulli1 introduced
the Bernoulli numbers in connection to the study of the sums of powers of
consecutive integers 1k C 2k C C nk . After listing the formulas for the sums
of powers
X
n.n C 1/ X 2 n.n C 1/.2n C 1/ X 3
n n n
n.n C 1/ 2
iD ; i D ; i D ;:::
i D1
2 i D1
6 i D1
2
1
Born on December 27, 1654 in Basel, Switzerland—died on August 16, 1705 in Basel,
Switzerland. Jakob is the eldest among the mathematicians in the famous Bernoulli family. It
is said that Jakob, his younger brother Johann (born on July 27, 1667 in Basel, Switzerland—
died on January 1, 1748 in Basel, Switzerland), and his second son, Daniel (born on February 8,
1700 in Groningen, Netherlands—died on March 17, 1782 in Basel, Switzerland) are the most
distinguished among them.
k
where j
is the binomial coefficient
!
k k.k 1/.k 2/ .k j C 1/
D ;
j jŠ
It is this Bj that is subsequently called a Bernoulli number,2 and is the main theme
of this book.
Bernoulli does not give symbols for B0 and B1 , and he writes A; B; C; D; : : :
for B2n . As we will see later, B2nC1 D 0 except for B1 . In his book, the right-hand
side of (1.1) is written as
where c D k.
We would like to mention here that, in the book called Katsuyo Sanpo (“essentials
of the art of calculation”) by the outstanding Japanese mathematician Takakazu
Seki,3 published also posthumously, in 1712 (and thus 1 year before Bernoulli!), the
formula for the sums of powers and the inductive definition of the Bernoulli numbers
are given. His formula and definition are completely the same as Bernoulli’s.
2
It was apparently de Moivre (Abraham, born on May 26, 1667 in Vitry-le-Francois, Champagne,
France—died on November 27, 1754 in London, England) who first called this number a Bernoulli
number in the book Miscellanea analytica de seriebus et quadraturis (London, 1730). De Moivre
is famous for de Moivre’s formula in trigonometry.
3
Born in 1642(?) in Kohzuke(?), Japan—died on October 24, 1708 in Edo, Japan. He is usually
considered Japan’s greatest mathematician of the Edo period (1600–1857, when the country
was closed to essentially all foreign contact). Hardly anything is known about his mathematical
education, and he seems to have been largely self-taught. He served under the shoguns Tsunashige
Tokugawa and Ienobu Tokugawa, occupying the post of Controller of the Treasury Office, and
wrote several treatises in higher mathematics. Apart from discovering Bernoulli numbers simul-
taneously with or before Bernoulli, he discovered determinants and the rules for calculating them
simultaneously with or before Leibniz, solved extraordinarily difficult problems of elimination
theory for systems of polynomial equations in many variables, and began a study of calculation
procedures for the arc of a circle that was continued and completed by his disciple Katahiro Takebe
with the discovery of infinite series expansions for various trigonometric functions.
1.1 Definitions: Introduction from History 3
B0 D 1:
B0 C 2B1 D 2;
4
Johann Faulhaber (born on May 5, 1580 in Ulm, Germany—died in 1635 in Ulm, Germany).
5
Carl Gustav Jacob Jacobi (born on December 10, 1804 in Potsdam, Prussia (now Germany)—died
on February 18, 1851 in Berlin, Germany).
4 1 Bernoulli Numbers
Fig. 1.1 Katsuyou Sanpou, Seki’s tabular presentation of the power-sum formula, written not in
Chinese characters but using the notation of Sangi or counting rods. The table (which is reproduced
here with a 90ı rotation) contains each of the ingredients of the formula in parentheses in Eq. (1.1):
The number designated by “power” in the translation overleaf is the k of this formula, the binomial
coefficients are tabulated in the “Pascal’s triangle” in the right part of the table (D left part in the
translation), the Bernoulli numbers Bj are given on the left (the right in the English translation),
the missing coefficient kC1
kC1
in (1.1) corresponds to the crossed out “1” at the beginning of each
column of the Pascal triangle, and the “denominator” is the number k C 1 by which the whole
expression has to be divided at the end of the calculation
5
1
1 12 level 12 void
1 11 66 level 11 take 5/66 and add
1 10 55 220 level 10 void
1 9 45 165 495 level 9 take 1/30 and subtract
1 8 36 120 330 792 level 8 void
1 7 28 84 210 462 992e level 7 take 1/42 and add
1 6 21 56 126 252 462 792 level 6 void
1 5 15 35 70 126 210 330 495 level 5 take 1/30 and subtract
1 4 10 20 35 56 84 120 165 220 level 4 void
.4 B0 4B1 6B2 / D 0:
1a 3 6 10 15 21 28 36 45 55 66 level 3 take 1/6 and add
B0 C 3B1 C 3B2 D 3;
1 1 1 1 1 1 1 1 1 1 1 1 level 1 all
0 1 2 3 4 5 6 7 8 9 10 11 power
1c 2d 4 5 6 7 8 9 10 11 12 denominator
1
3
d: This number should be 3.
B2 D
1
4
Putting n D 2 in (1.2) we have
e: This number should be 924.
B3 D
(English translation, reflected
with respect to the original)
Fig. 1.2 Katsuyou Sanpou
which gives
6 1 Bernoulli Numbers
which gives
1 1
B4 D .5 B0 5B1 10B2 10B3 / D ;
5 30
and so on. Table 1.1 is the list of Bernoulli numbers Bn up to n D 18. It follows from
the definition that Bn is a rational number. Mathematical software packages such as
Mathematica and Maple include the algorithm for computing Bernoulli numbers,
and it is not hard to obtain several hundred Bn ’s in a moment.
Remark 1.2. There is another convention on Bernoulli numbers: B1 D 12 and
everything else is the same. We adopt the first definition because this is the original
definition of Seki and Bernoulli for one thing, and it is better suited to the special
values of the Riemann zeta function for another, that is, the formula in Theorem 5.4
is valid also for m D 1. Although the difference is minuscule, we call readers’
attention to it.
Since Bn D 0 for every odd n greater than 1 (we prove this later), it is not difficult
to translate formulas from one definition to the other by replacing Bn by .1/n Bn .
In this section we first prove the Seki–Bernoulli formula (1.1) for the sum
X
n
Sk .n/ WD ik (k; n integers, k 0; n 1):
i D1
Then we will state Faulhaber’s theorem and give a recurrence formula for Bernoulli
numbers at the end of this section.
Before going into the proof, let us give several explicit formulas for Sk .n/:
S0 .n/ D n;
n2 n
S1 .n/ D C
2 2
1.2 Sums of Consecutive Powers of Integers and Theorem of Faulhaber 7
1
D n.n C 1/;
2
n3 n2 n
S2 .n/ D C C
3 2 6
1
D n.n C 1/.2n C 1/;
6
n4 n3 n2
S3 .n/ D C C
4 2 4
1 2
D n .n C 1/2 ;
4
n5 n4 n3 n
S4 .n/ D C C
5 2 3 30
1
D n.n C 1/.2n C 1/.3n2 C 3n 1/;
30
n6 n5 5n4 n2
S5 .n/ D C C
6 2 12 12
1 2
D n .n C 1/2 .2n2 C 2n 1/;
12
n7 n6 n5 n3 n
S6 .n/ D C C C
7 2 2 6 42
1
D n.n C 1/.2n C 1/.3n4 C 6n3 3n C 1/:
42
S0 .n/ D n:
n2 n n3 n2 n
S1 .n/ D C ; S2 .n/ D C C ;::::
2 2 3 2 6
By induction we see that
1
Sk .n/ is a polynomial of degree k C 1 in n, whose leading term is nkC1 .
kC1
Our objective is to write down this polynomial explicitly.
Consider the polynomial Sk .x/.D kC1 1
x kC1 C / obtained by replacing
the variable n by x. In general, two polynomials f .x/; g.x/ are identical if
f .n/ D g.n/ for all positive integers n. By definition we have
Sk .n C 1/ Sk .n/ D .n C 1/k .n D 1; 2; 3; : : : /:
Thus, we obtain
Sk .x C 1/ Sk .x/ D .x C 1/k :
Sk .0/ D 0:
This yields the constant term of Sk .x/. The other coefficients may be obtained
.j /
from the derivatives Sk .0/; 1 j k. Taking the derivative of the relation
Sk .x C 1/ Sk .x/ D .x C 1/k , we obtain
This holds for any positive integer n. If we let Sk0 .0/ D bk (b0 D 1), then we have
the relation
Putting x D 0, we have
Taking the derivative of (1.4) once again, and using (1.4), we have
Sk000 .x/ D kSk1
00 0
.x/ D k.k 1/Sk2 .x/:
Putting x D 0, we have
Sk000 .0/ D k.k 1/bk2 :
Similarly, taking the derivatives successively, we obtain
.j /
Sk .0/ D k.k 1/ .k j C 2/bkj C1 .2 j k C 1/:
Finally, we have
X
kC1 .j /
Sk .0/ j
Sk .x/ D x
j D0
jŠ
!
X
kC1
1 kC1 .0/
D bkj C1 x j .Sk .0/ D 0/
j D1
k C 1 j
!
1 X kC1
k
D bj x kC1j :
k C 1 j D0 j
which is nothing but the recurrence for the Bernoulli numbers. We therefore
conclude bj D Bj . In view of the identity
! !
1 kC1 1 k
D ;
kC1 j kC1j j
we obtain (1.1). t
u
10 1 Bernoulli Numbers
This Bk .x/ is called the kth Bernoulli polynomial, which leads to various appli-
cations. We will give its definition and some properties in Sect. 4.3 (p. 55). The
formula for sums of consecutive powers may be obtained easily from the Bernoulli
polynomial. We will show it in Sect. 4.3.
Let us prove the fact that Bk D 0 for odd k greater than 1 using the polynomial
Sk .x/. We will prove this fact later more easily using the generating function, but
we prove it here because we need to use it before that.
Proposition 1.4. If n is an odd integer greater than or equal to 3, then Bn D 0.
As a consequence, .1/n Bn D Bn for all positive integers n except for n D 1.
Proof. Suppose k 1. Putting x D 1 in the formula Sk .xC1/Sk .x/ D .xC1/k
and using the fact Sk .0/ D 0, we obtain Sk .1/ D 0. Thus, by putting x D 1 in
the formula
!
X k
kC1
.k C 1/Sk .x/ D Bj x kC1j ;
j D0
j
we obtain
!
X k
kC1
.1/j Bj D 0:
j D0
j
Œ k1 !
X 2
kC1
2 B2j C1 D k C 1;
j D0
2j C 1
where Œx stands for the greatest integer less than or equal to x. Since B1 D 1=2, the
term for j D 0 in the left-hand side cancels with the right-hand side, and we obtain
1.2 Sums of Consecutive Powers of Integers and Theorem of Faulhaber 11
Œ k1 !
X 2
kC1
B2j C1 D 0 .k 3/:
j D1
2j C 1
Then we have
u2 X .k/ k1i
k1
S2kC1 .n/ D A u
2k C 2 i D0 i
and
v X k1
.k/
S2k .n/ D .k C 1 i /Ai uk1i ;
.2k C 1/.2k C 2/ i D0
.k/
where Ai is a number determined by the formula
i 1
!
.k/ .k/ 1 X kC1j .k/
A0 D 1; Ai D Aj .1 i k 1/:
k C 1 i j D0 k C j 2i
kC1j
(For k C j 2i < 0, we put 2i C12j
D 0.)
For the proof we refer to [30] and [60]. We encourage readers to attempt to prove
it by themselves.
Before ending this section we prove an amusing recurrence formula for Bernoulli
numbers using Sk .x/ [55].
In the defining recurrence formula for Bernoulli numbers (1.2), if we move the
right-hand side to the left, and use the fact that .1/i Bi D Bi for i ¤ 1 and
B1 D 1=2, then we obtain
!
X
n
nC1
.1/ i
Bi D 0 (1.5)
i D0
i
e
Remark 1.7. Though this formula is similar to (1.5), it requires only half the number
of terms to compute B2n (and thus B2n ).
Proof. From the binomial theorem we have
!
X
nC1
n C 1 nCi C1
x nC1
.1 x/ nC1
D .1/ i
x :
i D0
i
X
m X
m
y 2i C1 D .2x 1/2i C1 D S2i C1 .2m/ 22i C1 S2i C1 .m/
xD1 xD1
to obtain
1.3 Formal Power Series 13
!
X n
2n i n
2 .n C 1/ .1/ S2i C1 .2m/ 22i C1 S2i C1 .m/
i D0
i
!
X n
i nC1
D .1/ .2n C 2/S2nC1.m/ C
nC1
.1/ .n C i C 1/SnCi .m/:
i D0
i
Since this holds for all positive integers m, we can use the same argument as before
to conclude that the formula obtained by replacing m by x,
!
X n
2n i n
2 .n C 1/ .1/ S2i C1 .2x/ 22i C1 S2i C1 .x/
i D0
i
!
X n
i nC1
D .1/ .2n C 2/S2nC1 .x/ C
nC1
.1/ .n C i C 1/SnCi .x/;
i D0
i
holds as a formula for polynomials. We take the derivative of both sides of this
with respect to x and set x D 0. The desired formula follows from the facts
that Sk0 .0/ D Bk (previous section), that the term for i D 0 in the left-hand side
vanishes due to the fact 2S10 .0/ 2S10 .0/ D 0, and that Bk D 0 for odd k 3
(Proposition 1.4). t
u
For the other definition of Bn we mentioned in Remark 1.2 in Sect. 1.1, we need to
t
t
replace the left-hand side by et 1 D ett e1 t. In fact, the generating function is very
useful for the study of various properties of Bernoulli numbers.
In this section we will explain fundamental facts about formal power series for
those who are not familiar with such objects. In the next section we will prove the
above formula as Theorem 1.12. Those who are not familiar with abstract algebra
are advised to take a glance at the next section and see how computations go before
reading this section.
Let R be a commutative integral domain with unit (written 1). (Although we
write in this general fashion, readers may think of R as the rational number field
Q for the time being. In this book R is either Q, or its finite extension, except
for Chap. 11, where we need p-adic numbers. A commutative integral domain is
14 1 Bernoulli Numbers
and
X1
tn t2 t3
log.1 C t/ D .1/n1 D t C :
nD1
n 2 3
and
X
1 X
1 X1 X
n
an t n bn t n D cn t n ; cn D ai bni :
nD0 nD0 nD0 i D0
The product is defined using the distributive law formally, and the first few terms of
it are
.a0 C a1 t C a2 t 2 C a3 t 3 C / .b0 C b1 t C b2 t 2 C b3 t 3 C /
D a0 b0 C .a0 b1 C a1 b0 /t C .a0 b2 C a1 b1 C a2 b0 /t 2
C .a0 b3 C a1 b2 C a2 b1 C a3 b0 /t 3 C :
It is readily verified that RŒŒt is a commutative ring with unit. The zero element
and the unit are the formal power series
0 C 0 t C 0 t2 C 0 t3 C
1.3 Formal Power Series 15
and
1 C 0 t C 0 t2 C 0 t3 C ;
respectively. They are denoted simply by 0 and 1, respectively. The set of formal
power series all of whose coefficients are 0 except for the constant term is naturally
identified with R. Furthermore, regarding a polynomial P .t/ with coefficients in
R as a formal power series whose coefficients an are 0 when n is greater than the
degree of P .t/, we can consider RŒt as a subring of RŒŒt. The definitions of the
sum and the product of RŒŒt are natural generalizations of those of polynomials.
Since we assume that R is an integral domain, RŒŒt is also an integral domain.
Namely, we have
Proposition 1.8. The ring RŒŒt does not have a zero divisor (the product of
two non-zero elements is again non-zero). In particular, if A ¤ 0 and AB D
AC .A; B; C 2 RŒŒt/, then B D C .
Proof. Let A D a0 C a1 t C a2 t 2 C a3 t 3 C and B D b0 C b1 t C b2 t 2 C b3 t 3 C
be non-zero elements, and let ak ; bl be the first non-zero coefficients. Then the
coefficient ckCl of t kCl in A B D c0 C c1 t C c2 t 2 C is
Next, we show that we can define an infinite sum of elements of RŒŒt under
P1 .k/ i
certain conditions. Let fAk .t/g1
kD1 ; Ak .t/ D i D0 ai t be an infinite family of
formal power series. Suppose that these series satisfy the condition
.k/
for each i , there exist only finitely many k such that ai ¤ 0.
.k/ .k/
In other words, if we write Ak .t/ D ank t nk C .higher-order terms/ .ank ¤ 0/, we
have nk ! 1 as k ! 1. Then we define A1 C A2 C A3 C to be
1
X
A1 C A2 C A3 C D ai t i ;
i D0
where
1
X .k/
ai D ai :
kD1
In short, we take the sum of the coefficients in a common degree, and the above
condition ensures that each sum is a finite sum.
As a special case, we can define the “substitution” of a formal power series
P1 vanishing
with term into another formal power series. Namely, if A.t/ D
constant P
1
i D0 ai t i
, and B.t/ D i D1 bi t .B.0/ D b0 D 0/, then we can give a meaning
i
to the expression
1
X
A.B.t// D ai .B.t//i ;
i D0
e log.1Ct / D 1 C t
1.3 Formal Power Series 17
and
log.1 C .e t 1// D t
can be considered as the identities obtained by substituting one formal power series
to the other. (We will give proofs of these identities after Remark 2.7 on p. 33.)
We can also compute the reciprocal of a formal power series with constant term 1
by “substitution”. First we remark that the reciprocal of 1 t is 1 C t C t 2 C :
1
D 1 C t C t2 C :
1t
We can see this by computing the product .1t/.1Ct Ct 2 C /. Now, let 1CB.t/
be a formal power series with constant term 1. Then B.t/ is a formal power series
without constant term (B.0/ D 0), and thus the reciprocal of 1 C B.t/ can be
computed by
1 1
D D 1 C .B.t// C .B.t//2 C :
1 C B.t/ 1 .B.t//
As an example, we compute the first few Bernoulli numbers using (1.7). Since
t2 t3 t4 t5
et 1 D t C C C C C
2Š 3Š 4Š 5Š
t t2 t3 t4
D t 1C C C C C ;
2 6 24 120
we have
1 t t2 t3 t4
D 1 C C C C
1C t
C t2
C t3
C t4
C 2 6 24 120
2 6 24 120
2
t t2 t3 t4
C C C C C
2 6 24 120
3
t t2 t3 t4
C C C C
2 6 24 120
4
t t2 t3 t4
C C C C C C
2 6 24 120
t t2 t4
D 1 C C 0 t3 C :
2 12 720
18 1 Bernoulli Numbers
if and only if
a0 D 0 and a1 is invertible.
In this case B.t/ is unique, and we have B.A.t// D t. In other words A.t/ and B.t/
are inverse to each other with respect to composition.
Proof. If B.t/ D b1 t C b2 t 2 C exists and satisfies A.B.t// D t, then by
comparing the constant terms and the degree 1 terms, we have a0 D 0 and a1 b1 D 1.
This shows that the condition is necessary.
Conversely, suppose that A.t/ satisfies a0 D 0 and a1 is invertible. We would
like to determine the coefficients of B.t/ D b1 t C b2 t 2 C b3 t 3 C satisfying
A.B.t// D t. We first see from the coefficient of t that a1 b1 D 1. Since a1 is
invertible, we put b1 D a11 . For n 2, the coefficient of t n in A.B.t// equals the
coefficient of t n in
since there is no term t n after this due to the fact that B.0/ D 0. This can be
written as
a1 bn C .polynomial in a2 ; a3 ; : : : ; an ; b1 ; b2 ; : : : ; bn1 /:
The B.t/ obtained in this way satisfies B.0/ D 0, and b1 is invertible. We thus
see that there is a C.t/ (C.0/ D 0) such that
B.C.t// D t:
Then we substitute C.t/ for t in the formula t D A.B.t//, and use the fact that
B.C.t// D t to get
Thus, we have
B.A.t// D t:
t
u
The formulas we mentioned earlier,
can be interpreted as saying log.1 C t/ and e t 1 are inverse to each other with
respect to composition. P1
n d P1 n
The
P1derivative of formal power series nD0 an t , written dt nD0 an t
(or . nD0 an t n /0 ), is defined formally by term-by-term differentiation:
1
! 1
d X X
an t n
D nan t n1 D a1 C 2a2 t C 3a3 t 2 C 4a4 t 3 C :
dt nD0 nD1
1
! !0 1
!
X X
n X X
n
0
.f .t/g.t// D ai bni t n
D n ai bni t n1
nD0 i D0 nD1 i D0
1
!
X X
n X
n
D i ai bni C .n i /ai bni t n1
nD1 i D0 i D0
1
! 1
!
X X
n X X
n1
D i ai bni t n1
C ai .n i /bni t n1
nD1 i D1 nD1 i D0
0 0
D f .t/g.t/ C f .t/g .t/:
20 1 Bernoulli Numbers
Z tX
1
Also, when R Q, the integral an t n dt is defined formally by term-by-term
0 nD0
integration:
Z tX
1 1
X t nC1 t2 t3 t4
an t dt D
n
an D a0 t C a1 C a2 C a3 C :
0 nD0 nD0
nC1 2 3 4
is denoted by R..t//.
The sum and the product in R..t// are defined in the same way as in RŒŒt and with
these operations R..t//
P is a commutative integral domain that contains RŒŒt as a
subdomain. Also, 1 i
i DN ai t is invertible in R..t// if and only if the first non-zero
coefficient aN is invertible. These can be proved in the same way as in the proof of
Propositions 1.8 and 1.9. In particular, if R is a field, then so is R..t//. Furthermore,
the formal derivative and integral (when R Q) are also defined, and the quotient
rule .f .t/=g.t//0 D .f 0 .t/g.t/ f .t/g 0 .t//=g.t/2 holds. This follows from the
product rule. (Put f .t/=g.t/ D h.t/, and take the derivative of f .t/ D g.t/h.t/.)
As a matter of fact, the right-hand side belongs to QŒŒt, but we consider the
quotient of the formal power series te t by e t 1 in Q..t//.
6
Pierre Alphonse Laurent (born on July 18, 1813 in Paris, France—died on September 2, 1854 in
Paris, France).
1.4 The Generating Function of Bernoulli Numbers 21
P
Proof. It is sufficient to show . 1 tn
nD0 Bn nŠ /.e 1/ D te . The formula in question
t t
1
! 1
! 1 !
X tn X Bn n X tn
Bn .e 1/ D
t
t
nD0
nŠ nD0
nŠ nD1
nŠ
1
!
X X
n1
Bi 1
D tn
nD1 i D0
i Š .n i /Š
1
! !
X X
n1
n tn
D Bi :
nD1 i D0
i nŠ
P n
From the recurrence formula (1.2) we have in1
D0 i Bi D n for all n 1. Thus
we have
1
! ! 1
X X
n1
n tn X tn
Bi D D te t ;
nD1 i D0
i nŠ nD1
.n 1/Š
Expanding the left-hand side as in the above proof, and comparing it with the right-
hand side, we obtain the recurrence formula (1.2). This shows that Definition 1.1
and the definition using the generating function in Theorem 1.12 are equivalent.
Using the generating function, we can give a simpler proof of Proposition 1.4.
Proposition 1.14 (Proposition 1.4 revisited). If n is an odd integer greater than
or equal to 3, then Bn D 0.
t et
Proof. It suffices to show that the formal power series et 1
t
2
does not have any
odd-degree terms. Since we have
te t t t.e t 1 C 1/ t t t
D D t C
e 1 2
t e 1
t 2 e 1 2
and
.t/e t .t/ t t t t
D C D t C ;
e t 1 2 1 et 2 e 1 2
22 1 Bernoulli Numbers
t et
et 1
2t is invariant under the substitution t ! t. This shows that the coefficients
of odd-degree terms are all 0. t
u
Looking at Bn for even n in Table 1.1, we see that they are non-zero and, from
B2 on, the signs alternate. This P
may be seen from the formula for the value of the
Riemann zeta function .s/ D 1 1
nD1 ns at positive even integers (replace k by 2k
on p. 61) (4.4):
.1/k1 .2/2k
.2k/ D B2k .k 1/:
2 .2k/Š
But here, we give a direct proof using the following recurrence formula for the
Bernoulli numbers due to Euler7 [33].
Proposition 1.15.
!
X
n1
2n
.2n C 1/B2n D B2m B2.nm/ .n 2/:
mD1
2m
Proof. By the previous proposition, if we subtract the term of degree 1 from the
generating function of the Bernoulli numbers, we obtain the generating function of
even indexed B2n :
X 1
te t t t 2n
D B2n :
e 1 2
t
nD0
.2n/Š
Let f .t/ be the left-hand side. Taking the derivative of f .t/ as was explained on
p. 19, we have
t2
f .t/ tf 0 .t/ D f .t/2 :
4
P1 t 2n
Substituting nD0 B2n .2n/Š for f .t/, we have
1 1
! !
X t 2n X Xn
2n t 2n t2
.1 2n/B2n D B2m B2.nm/ :
nD0
.2n/Š nD0 mD0
2m .2n/Š 4
7
Leonhard Euler (Born on April 15, 1707 in Basel, Switzerland—died on September 18, 1783 in
St. Petersburg, Russia).
1.4 The Generating Function of Bernoulli Numbers 23
for n 2. Since the terms for m D 0 and m D n yield 2B2n , we subtract them from
both sides. Multiplying by 1, we obtain the formula in the proposition. t
u
Corollary 1.16. For n 1, we have .1/n1 B2n > 0.
Proof. We prove this by induction. For n D 1, B2 D 16 > 0. Suppose that the
proposition is true for all integers less than n. By multiplying the formula in the
proposition by .1/n1 , we get
!
X
n1
2n
.2n C 1/ .1/ B2n D
n1
.1/m1 B2m .1/nm1 B2.nm/ :
mD1
2m
The right-hand side is positive by assumption. Thus, .1/n1 B2n > 0, which settles
the proof. t
u
To conclude this chapter, we give expansions of tan x and cot x as alternative
generating functions for Bernoulli numbers.
Proposition 1.17. The Taylor expansion of tan x and the Laurent expansion of cot x
around x D 0 are
X1
x 2n1
tan x D .1/n1 .22n 1/22n B2n ;
nD1
.2n/Š
1
1 X x 2n1
cot x D C .1/n 22n B2n :
x nD1 .2n/Š
have
x x X 1
x 2n
coth D B2n ;
2 2 nD0
.2n/Š
1
cot 2x D .cot x tan x/ ;
2
24 1 Bernoulli Numbers
we have
This shows that the Taylor expansion of tan x can be obtained from the expansion of
cot x. As for the radii of convergence, we see it from the fact that the poles nearest
to 0 are located at ˙=2 and ˙, respectively. t
u
x 2n1
Remark 1.18. The coefficient of .2n1/Š in the Taylor expansion of tan x,
B2n
Tn WD .1/n1 .22n 1/22n ;
2n
is sometimes called the tangent number. It is a positive integer. The fact that it is
positive is seen from Corollary 1.16. We will prove that it is an integer in Remark 7.6
on p. 98. Knuth and Buckholtz [61] give another proof of this fact and a method to
compute Bernoulli numbers using Tn . See also Exercise 3.6 in Chap. 3.
Exercise 1.19. Compute the Bernoulli numbers Bn up to n D 12 by using
Theorem 1.6.
Exercise 1.20. Guess any pattern of prime numbers appearing in denominators of
Bernoulli numbers. (This is one of the topics covered in Chap. 3.)
Exercise 1.21. Compute the first several terms of the formal power series e log.1Ct /
and log.1 C .e t 1// using the definition of composition.
Exercise 1.22. Prove the quotient rule
We first define the Stirling numbers of the second kind, restricting ourselves to the
case where a combinatorial meaning can be given. We adopt Knuth’s notation [59].
Definition 2.1 (Stirling numbers of the second kind (Stirling’s subset
numbers)).
For positive integers n and m, define
n
WD the number of ways to divide a set of n elements into m
m nonempty sets.
1
According to Knuth [59], Stirling (James, born in May, 1692 in Garden, Scotland—died on
December 5, 1770 in Edinburgh, Scotland) first introduced the second kind. The names “first kind”
and “second kind” are due to Nielsen (Niels, born on December 2, 1865 in Orslev, Denmark—died
on September 16, 1931 in Copenhagen, Denmark), who first used these names in his book on the
Gamma function [73, §26].
For example, there are seven ways to divide the set {1, 2, 3, 4} into two non-
empty sets:
f1g [ f2; 3; 4g; f2g [ f1; 3; 4g; f3g [ f1; 2; 4g; f4g [ f1; 2; 3g;
f1; 2g [ f3; 4g; f1; 3g [ f2; 4g; f1; 4g [ f2; 3g:
4
Thus, 2 D 7.
From the definition, mn D 0 if m > n. Also, from the definition we have the
recurrence formula:
nC1 n n
D Cm : (2.1)
m m1 m
n
Table 2.1
m
mnn 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7
7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1
6 0 0 0 0 0 0 0 0 0 0 0 0 0 1 21
5 0 0 0 0 0 0 0 0 0 0 0 0 1 15 140
4 0 0 0 0 0 0 0 0 0 0 0 1 10 65 350
3 0 0 0 0 0 0 0 0 0 0 1 6 25 90 301
2 0 0 0 0 0 0 0 0 0 1 3 7 15 31 63
1 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1
0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0
1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0
2 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0
3 0 0 0 0 1 3 2 0 0 0 0 0 0 0 0
4 0 0 0 1 6 11 6 0 0 0 0 0 0 0 0
5 0 0 1 10 35 50 24 0 0 0 0 0 0 0 0
6 0 1 15 85 225 274 120 0 0 0 0 0 0 0 0
7 1 21 175 735 1624 1764 720 0 0 0 0 0 0 0 0
Putting n D 0 in this formula, we obtain 11 D 1. Also, since n0 D 0, we have
nC1 n
1 D 1 , and thus n1 D 1 .n 1/. Similarly, m1 D 0 .m 2/ can be verified
by putting n D 0 in (2.1).
Remark 2.3. Logically, we only need Definition 2.2. However, in order to make the
meaning clearer, we began with Definition 2.1.
We now define the Stirling numbers of the first kind.
Definition 2.4 (Stirling numbers of the first kind (Stirling’s cycle numbers)).
For positive integers n and m, define
n
WD number of permutations of n letters .elements of the sym-
m metric group of degree n/ that consist of m disjoint cycles.
We can see this in the same way as before. A permutation of n C 1 letters may fix
n C 1. In this case the letter n C 1 forms a cycle by itself, and there are
then letter
m1
permutations of the remaining n letters that decompose into m 1 disjoint
cycles. A permutation that moves the letter n C 1 can be obtained by inserting the
letter n C 1 in one of the m disjoint cycles in the decomposition of permutations
of n letters. There are j ways to form a cycle of length j C 1 out of a cycle of
length j by inserting another letter.
Therefore,
there are n ways to insert the letter
n C 1, and thus there are in total n mn permutations that move the letter n C 1. This
shows (2.2).
Just as we did for the Stirling numbers of the second kind, we define mn for any
m and n as follows.
Definition 2.5 (Stirling numbers
of the first kind (general case)). For any
n
integers n and m, define by the recurrence formula (2.2) with the initial
m
0 n 0
conditions D 1, and D D 0 . n, m ¤ 0 /.
0 0 m
Table 2.2 shows the values of mn in the same range as Table 2.1. Readers will
certainly notice the remarkable relations between these two tables. As Knuth asserts,
there is only one kind of Stirling number, and the recurrence formulas (2.1) and (2.2)
are essentially the same formula (see the next proposition).
We list some formulas which Stirling numbers satisfy. There are more formulas,
but we only list the ones we need later in this book. For others, see [38] for example.
Proposition 2.6.
n m
.1/ D :
m n
Xn
n m
.2/ xn D x .n 0/;
mD0
m
2.1 Stirling Numbers 29
hni
Table 2.2
m
mnn 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7
7 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1
6 0 0 0 0 0 0 0 0 0 0 0 0 0 1 21
5 0 0 0 0 0 0 0 0 0 0 0 0 1 15 175
4 0 0 0 0 0 0 0 0 0 0 0 1 10 85 735
3 0 0 0 0 0 0 0 0 0 0 1 6 35 225 1624
2 0 0 0 0 0 0 0 0 0 1 3 11 50 274 1764
1 0 0 0 0 0 0 0 0 1 1 2 6 24 120 720
0 0 0 0 0 0 0 0 1 0 0 0 0 0 0 0
1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0
2 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0
3 0 0 0 0 1 3 1 0 0 0 0 0 0 0 0
4 0 0 0 1 6 7 1 0 0 0 0 0 0 0 0
5 0 0 1 10 25 15 1 0 0 0 0 0 0 0 0
6 0 1 15 65 90 31 1 0 0 0 0 0 0 0 0
7 1 21 140 350 301 63 1 0 0 0 0 0 0 0 0
X
n
n m
.3/ x n D .1/n .1/m x :
mD0
m
Xn
d n n m d m
.4/ x D x .n 1/:
dx mD1
m dx
X
n l
.5:1/ For m, n 0, .1/l D .1/m ım;n :
l m
l0
X
l n l
.5:2/ For m, n 0, .1/ D .1/m ım;n :
l m
l0
Here, ım;n stands for Kronecker’s delta; i.e., ım;n D 1 for m D n and ım;n D 0 for
m ¤ n. The sums on the left-hand sides of .5.1/ and .5.2/ are finite sums.
30 2 Stirling Numbers and Bernoulli Numbers
!
.1/m X
m
n l m
.6/ For m, n 0, D .1/ l n:
m mŠ l
lD0
X1
n
.e t 1/m n t
.7/ D .m 0/:
mŠ nDm
m nŠ
X1
tm n n
.8/ D t .m 1/:
.1 t/.1 2t/ .1 mt/ nDm
m
X1 n
. log.1 t//m n t
.9/ D .m 0/:
mŠ nDm
m nŠ
Proof. (1) Write an;m D m . It suffices to verify that an;m satisfies the same
n
recurrence formula with the same initial conditions as those for mn . It is easy
for the initial conditions. In the recurrence formula (2.1), replacing m by n
and n by m, we have
m C 1 m m
D n :
n n 1 n
Adding n m
n to both sides, we have
m C 1 m m
Cn D :
n n n 1
This implies an;m1 C nan;m D anC1;m , which is nothing but the recurrence
formula (2.2). P
(2) We use the same strategy. Namely, write x n D nmD0 an;m x m , and verify that
an;m satisfies the same recurrence formula as mn within the range m; n 0
and the same initial conditions. It is easy for the initial conditions (put a0;m D 0
.m > 0/). From x mC1 D x m .x m/ and x m x D x mC1 C mx m , we have
X
nC1
anC1;m x m D x nC1 D x n x
mD0
X
n
D an;m x m x
mD0
2.1 Stirling Numbers 31
X
n
D an;m .x mC1 C mx m /
mD0
X
nC1 X
n
D an;m1 x m C man;m x m :
mD1 mD0
mD1
m dx dx
X
n
m X n
n m d n mC1 d mC1
D m x C x
mD1
m dx mD1
m dx
nC1
X
m
n n d
D m C xm
mD1
m m 1 dx
nC1
X m
nC1 d
D xm :
mD1
m dx
n
(To prove the next-to-last equality, we use nC1 D 0 and n0 D 0.) This shows
that the formula is true for n C 1.
(5) We prove it using (2) and (3). For (5.1), we put the formula
X
l
l
x l D .1/l .1/m xm
mD0
m
n
X Xl
n l
xn D .1/l .1/m xm
l mD0
m
lD0
X
n X
n
n l
D .1/ m
.1/ l
xm:
mD0
l m
lDm
Comparing the coefficients on both sides, we obtain (5.1). The formula (5.2)
can be obtained by substituting (2) into (3).
(6) We prove
it by verifying that the right-hand side satisfies the recurrence formula
for mn . Denote the right-hand side by an;m once again. It is easy to see that
0;0 D 1 and an;0 D 0 formn 1. (Define 0 D 1.) If n D 0, and m 1, then
0
a
P m l m
lD0 .1/ l D .1 1/ D 0. Thus, we have a0;m D 0. As for the recurrence
formula, we verify it as follows:
man;m C an;m1
! !
.1/m X .1/m1 X
m m1
l m l m1
D .1/ l C
n
.1/ ln
.m 1/Š l .m 1/Š l
lD0 lD0
! !
.1/m X
m
m m1
D .1/l ln
.m 1/Š l l
lD0
!
m X m
.1/ l l m n
D .1/ l
.m 1/Š m l
lD0
!
.1/m X
m
l m
D .1/ l nC1
mŠ l
lD0
D anC1;m :
One can prove (6) more naturally using the difference calculus. See for example
[53, §58].
(7) First, note that the right-hand
side of (7) can be expressed as the sum from
n D 0. (If n < m, then mn D 0.) Substitute (6) into the right-hand side and
simplify:
1
!
X .1/m X
m
m tn
r.h.s D .1/l ln
nD0
mŠ l nŠ
lD0
! 1 !
.1/m X X .lt/n
m
l m
D .1/
mŠ l nD0
nŠ
lD0
2.1 Stirling Numbers 33
!
.1/m X
m
l m
D .1/ e lt
mŠ l
lD0
m
.1/
D .1 e t /m
mŠ
.e t 1/m
D
mŠ
D l.h.s.
P
It can also be proved by writing .e t 1/m =mŠ D 1
n
a t , and verifying
n nDm n;m nŠ
that an;m satisfies the same recurrence formula as m .
(8) Denote the right-hand side by fm . From the recurrence formula (2.1) for mn ,
n
we have (noting that m D 0 for n < m)
1
X n
fm D tn
nDm
m
1
X
n1 n1
D Cm tn
nDm
m1 m
X1
X1
n n n
Dt t C mt
n
t
nDm1
m1 nDm
m
D tfm1 C mtfm :
Thus, fm D 1mtt
fm1 . From n1 D 1 .n 1/, we havef1 D 1t t
, which shows
that fm is equal to the left-handP
side of (8).
(9) Writing . log.1 t//m=mŠ D 1 tn
nDm an;m nŠ , and taking the derivatives of both
sides, we have
X1
. log.1 t//m1 1 t n1
D an;m :
.m 1/Š 1t nDm
.n 1/Š
Using (5.1) and (7) in this proposition, we give a proof of the formulas that were
left unproved in the previous chapter (p. 17). We give a proof of
log.1 C .e t 1// D t:
e log.1Ct / 1 D t
follows from Proposition 1.10. (Of course, we can prove it similarly using (9) in this
proposition.)
1
X .e t 1/m
log.1 C .e t 1// D .1/m1
mD1
m
1
X 1
n
X n t
D .1/m1 .m 1/Š
mD1
m nŠ nD0
X1 X1
! n
m1 m n t m
D .1/ .m 1/Š D
nD0 mD1
1 m nŠ 1
X1
tn
D .1/n1 ı1;n
nD0
nŠ
D t:
A formula expressing the Bernoulli numbers as a finite sum involving the Stirling
numbers of the second kind has been known at least since Kronecker2 [62]. (See
also an account of von Staudt in Sect. 3.3.) By Proposition 2.6 (6), the Stirling
numbers can be expressed as a finite sum involving only the binomial coefficients.
Therefore, the Bernoulli numbers can be expressed as an elementary (i.e., involving
only binomial coefficients and simple polynomials) double sum. Gould [36] studied
various historical sources for analogous formulas, and he concluded that, at least
until around 1970, the very fact that there are closed finite formulas (without
involving infinite sums and integrals) for the Bernoulli numbers was not widely
known. He also conjectures that there is no “elementary” (in the above sense)
2
Leopold Kronecker (born on December 7, 1823 in Liegnitz, Prussia (now Legnica, Poland)—died
on December 29, 1891 in Berlin, Germany).
2.2 Formulas for the Bernoulli Numbers Involving the Stirling Numbers 35
formula for the Bernoulli numbers expressed as a finite single sum. As far as we
know, such a formula has not been discovered.
We prove the following formula.
Theorem 2.8.
Xn
.1/m mŠ mn
Bn D .1/ n
.n 0/:
mD0
mC1
te t t log .1 .1 e t //
D D ;
et 1 1 e t 1 e t
P
and substituting 1 e t .D t C / for t in log.1 t/ D 1 mD1
tm
m
, we have
1
X tn log .1 .1 e t //
Bn D
nD0
nŠ 1 e t
X1
.1 e t /m1
D
mD1
m
X1
.1/m .e t 1/m
D
mD0
mC1
X1 1
.1/m mŠ X n .t/n
D (Proposition 2.6 (7))
mD0
m C 1 nDm m nŠ
1 !
X Xn
.1/m mŠ mn tn
D .1/ n
:
nD0 mD0
mC1 nŠ
tn
Comparing the coefficients of on both sides, we obtain the formula. t
u
nŠ
Remark 2.9. By Proposition 2.6 (6), we can also write
!
Xn
1 X
m
l m
Bn D .1/ n
.1/ l n:
mD0
m C 1 l
lD0
Proof. We can prove this by computing the generating function on the right-hand
side. Here, we useTheorem2.8.
From the recurrence formula (2.1) for the Stirling
nC1 n
numbers we have mC1 D mn C .m C 1/ mC1 , and thus
n n
X
n
.1/m mŠ m
C .m C 1/ mC1
r.h.s. D :
mD0
mC1
h i
mC1
On the other hand, since 1 D mŠ, it follows from Proposition 2.6 (5.1) that
n
X
n
.1/m mŠ.m C 1/ mC1
X
n
mC1 n
D .1/m D ı1;n :
mD0
mC1 mD0
1 mC1
Then,
X
n
nC1
an;0 D .1/ mŠ m
a0;m :
mD0
mC1
2.2 Formulas for the Bernoulli Numbers Involving the Stirling Numbers 37
1 1 1 1 1 1 1 1 1
2 3 4 5 6 7 8 9
1 1 1 1 1 1 1 1
2 3 4 5 6 7 8 9
1 1 3 2 5 3 7
6 6 20 15 42 28 72
0 1 1 2 5 5
30 20 35 84 84
1 1 3 1
− 30 −
30
−
140
−
105
0
0 − 1 − 1 − 4
42 28 105
1 1 1
42 42 140
0 1
30
1
− 30
1
X
gn .t/ D .m C 1/.an1;m an1;mC1 /t m
mD0
1 1
d X d X
D . an1;m t mC1 / . an1;mC1 t mC1 /
dt mD0 dt mD0
d d
D .tgn1 .t// .gn1 .t/ an1;0 /
dt dt
38 2 Stirling Numbers and Bernoulli Numbers
d
D gn1 .t/ C .t 1/ .gn1 .t//
dt
d
D ..t 1/gn1 .t//:
dt
d
hn .t/ D .t 1/ .hn1 .t// .n 1/:
dt
From this we obtain
d n
hn .t/ D .t 1/ .h0 .t//:
dt
Xn
m
n d
hn .t/ D .t 1/ m
h0 .t/:
mD0
m dt
Putting t D 0, we have
n
X n
an;0 D .1/m mŠ.a0;m1 a0;m /
mD0
m
n1
X Xn
n n
D .1/mC1 .m C 1/Ša0;m .1/m mŠa0;m
mD0
mC1 mD0
m
X
n
n n
D .1/ mŠa0;m .m C 1/
m
C
mD0
m C 1 m
X
n
nC1
D .1/ mŠ m
a0;m :
mD0
mC1
1
(2) m D .m 1/Š for m D 1; 2; 3; : : :.
nC1 n
(3) n
D n1 D n.nC1/2
for n D 0; 1; 2; : : :.
Exercise 2.14. Formulate the statements corresponding to the previous exercise for
the Stirling numbers of the first kind, and prove them.
Exercise 2.15. Prove the formula e log.1Ct / 1 D t for the composition of formal
power series by using Proposition 2.6 (9).
Exercise 2.16. Prove the following formula for the sum of powers:
!
X
n X
n
k n C 1
ik D jŠ :
i D1 j D0
j j C1
Exercise 2.17. Compute the first several Bn using the formula in Theorem 2.8.
Exercise 2.18. In the Akiyama–Tanigawa triangle (on p. 37), what are the numbers
next to the leftmost entries?
Chapter 3
Theorem of Clausen and von Staudt,
and Kummer’s Congruence
Here, the sum runs over all the prime numbers p such that p 1 divides n. (For
integers a and b, the symbol a j b means that a divides b, and a − b means that a
does not divide b.)
Proof. The statement is clearly true for n D 1. Let n be an even integer greater
than or equal to 2. Since the numerator of each term .1/m mŠ mn =.m C 1/ in the
right-hand side of the formula in Theorem 2.8 is an integer, the only prime numbers
1
Thomas Clausen (born on January 16, 1801 in Snogebaek, Denmark—died on May 23, 1885 in
Dorpat, Russia (now Tartu, Estonia)).
2
Karl Georg Christian von Staudt (born on January 24, 1798 in Imperial Free City of Rothenburg
(now Rothenburg ob der Tauber, Germany)—died on June 1, 1867 in Erlangen, Bavaria (now
Germany).
3
Ferdinand Gotthold Max Eisenstein (born on April 16, 1823 in Berlin, Germany—died on October
11, 1852 in Berlin, Germany).
D 0 3 C 3 2n 3n
1 .1/n 0 mod 4:
(For integers a; b and a natural number4 n, if nj.a b/, then we say that a is
congruent to b modulo n, and we write a b mod n. If n − .a b/, then we
write a 6 b mod n.) This shows that .1/m mŠ mn =.m C 1/ is an integer. We have
thus shown that if m C 1 is a composite number, .1/m mŠ mn =.m C 1/ is an integer.
Next, suppose that m C 1 equals a prime number p. Since p 1 n, it follows
from Proposition 2.6 (6) that
Xp1
!
n l p1
m
.1/ mŠ D .1/ ln
m l
lD0
X
p1
p1
l n mod p (since l
.1/l mod p)
lD0
1 mod p if p 1 divides n,
(3.1)
0 mod p if p 1 does not divide n.
The last congruence can be seen as follows. If p 1 divides n, then the fact that
P p1 n
lD0 l p 1 1 mod p follows from Fermat’s5 Little Theorem; i.e.,
l p1
1 mod p if l and p are relatively prime. If p 1 does not divide n,
then choose c such that c n 6 1 mod p (for example, a primitive root modp).
When l runs from 1 through p 1, cl mod p also
Pp1 runs from the classes of 1
Pp1
through p 1, and thus we have lD0 .cl/n n
lD0 l . Therefore, we have
4
Throughout the book, the term “natural number” means a positive integer.
5
Pierre de Fermat (born on August 17, 1601 in Beaumont-de-Lomagne, France—died on January
12, 1665 in Castres, France).
3.2 Kummer’s Congruence 43
Pp1 Pp1
.c n 1/ lD0 l n 0 mod p, and lD0 l n 0 mod p. We thus see that if
n
m C 1 D p, .1/m mŠ m =.m C 1/ is an integer if p 1 does not divide n. If, on
the other hand, p 1 divides n, then .1/m mŠ mn =.m C 1/ is not an integer but the
above congruence shows that it becomes an integer if we add p1 to it. This completes
the proof. t
u
The theorem of Clausen and von Staudt completely describes the denominators
of Bernoulli numbers. Then how about the numerators? Not much is known about
this. Here we briefly explain the notion of (ir)regular primes, which are related to
the numerators of Bernoulli numbers.
A prime p is said to be regular if p does not divide any of the numerators of the
Bernoulli numbers B2 ; B4 ; : : : ; Bp3 . Otherwise p is called irregular. According
to Kummer, p is regular if and only if the class number of the pth cyclotomic field
Q.p / is not divisible by p (cf. Washington [100, §5.3]). It is well known that this
work of Kummer originated from his attempt to solve Fermat’s last theorem. For
this topic, see [31] or [79].
For instance, there are only three irregular primes less than 100: 37; 59; 67.
Kummer determined (by hand!) all irregular primes less than or equal to 163, and
moreover determined the indices of Bn whose numerators are divisible by a given
irregular prime. The next few irregular primes after 67 are 101; 103; 131; 149; 157,
and p D 157 is the first prime which divides two of the numerators of
B2 ; B4 ; : : : ; Bp3 (see the table in [100] for irregular primes p less than 4,001
and indices of Bernoulli numbers divisible by p).
The goal of this section is to prove the following theorem, which plays a crucial role
in the arithmetic of cyclotomic fields and the theory of p-adic L-functions. For a
prime number p, we denote by Z.p/ the ring of rational numbers whose denominator
is prime to p. Using such a ring, it is easy to describe how many times p divides the
numerator of a rational number, or decide whether or not two rational numbers are
congruent modulo a power of p. The set of invertible elements, Z .p/ , of the ring Z.p/
is the set of rational numbers whose numerator and denominator are both relatively
prime to p. For two elements x and y, we say that the congruence relation x y
mod p a holds if the numerator of x y is divisible by p a .
Theorem 3.2. Let p be an odd prime.
(1) Suppose n is a positive even integer not divisible by p 1. Then we have
Bn
2 Z.p/ :
n
44 3 Theorem of Clausen and von Staudt, and Kummer’s Congruence
(2) Let a 1 be a natural number and m; n be two even integers that satisfy
a C 1 m n. Suppose that m and n are not divisible by p 1, and that
n m mod .p 1/p a1 . Then we have
Bn Bm
mod p a :
n m
AmC.p1/pa1 Am mod p a
holds.
P1
Proof. Write '.t/ D kD0 ak t
k
. Then, we have
1
X
'.e rt e st / D ak .e rt e st /k
kD0
1
!
X X
k
k rt .kh/ st h
D ak .1/h e e
h
kD0 hD0
1 X
!
X k
k .r.kh/Csh/t
D .1/ akh
e :
h
kD0 hD0
6
Ernst Eduard Kummer (born on January 29, 1810 in Sorau, Brandenburg, Prussia (now
Germany)—died on May 14, 1893 in Berlin, Germany).
3.2 Kummer’s Congruence 45
1 X
!
X k
k
Am D h
.1/ ak .r.k h/ C sh/m :
h
kD0 hD0
Since .e rt e st /k begins with the term t k , the sum with respect to h in the above
formula equals 0 if k > m. Thus, the above sum with respect to k is a finite sum.
Replacing m by m C .p 1/p a1 , and subtracting Am , we have
AmC.p1/pa1 Am
1 X
!
X k
k a1
D h
.1/ ak .r.k h/ C sh/m ..r.k h/ C sh/.p1/p 1/:
h
kD0 hD0
AmC.p1/pa1 Am mod p a
holds. t
u
Proof (of Theorem 3.2). For a natural number c ¤ 1 relatively prime to p, consider
1 c
'.t/ D :
t .1 C t/c 1
Since c 2 Z
.p/ , the expression
.1 C t/c 1 c1 2
DtC t C
c 2
belongs to Z.p/ ŒŒt, and it is invertible in Z.p/ ..t// (at the end of Sect. 1.3 on p. 20).
From
c 1 c1 c2 1
D C t C ;
.1 C t/ 1
c t 2 12
X tn 1
t te t t
D t D 1 C Bn ;
et 1 et 1 2 nD2 nŠ
46 3 Theorem of Clausen and von Staudt, and Kummer’s Congruence
we have
1 c 1 t ct
'.e 1/ D t
t
D
e 1 e ct 1 t e t 1 e ct 1
1
1c X Bn t n1
D C .1 c n / :
2 nD2
n .n 1/Š
Bn Bm
.1 c n / .1 c m / mod p a :
n m
Here, we choose c to be a primitive root modulo p (c mod p is a generator of
.Z=pZ/ ). If n is not divisible by p 1, then m is not divisible by p 1, and
we have 1 c n ; 1 c m 2 .Z=p a Z/ with 1 c n 1 c m mod p a . This gives
n ; m 2 Z.p/ and the congruence in the theorem. t
u
Bn Bm
Thomas Clausen, an astronomer and a mathematician, was born on January 16, 1801
in Snogbæk, Denmark.7 He was educated by a local pastor Georg Holst before he
got a position as an assistant at the Altona observatory, where Schumacher8 was the
head. He then moved to the Joseph von Utzschneider Optical Institute in Munich as
the successor to Fraunhofer,9 who is famous for “Fraunhofer lines” in physics and
optics. In 1842, he was appointed observer at the Dorpat observatory in Russia, and
there he remained until his retirement in 1872. The Copenhagen Academy awarded
Clausen a prize for his work on the determination of the orbit of the 1770 comet
(published in Astronomische Nachrichten in 1842). Bessel10 highly praised this
7
The description in this section is based on Biermann [17, 18], Noether [74], Hensel [40], and
Lampe [65].
8
Heinrich Christian Schumacher (born on September 3, 1780 in Bramstedt, Germany—died in
1850). His first degree was in law, and after that he studied astronomy under Gauss and became
an astronomer. He launched the journal Astronomische Nachrichten, which is famous for the
contributions of Abel and Jacobi on the theory of elliptic functions, as the managing editor.
Incidentally, Abel once met Clausen during his stay in Hamburg (cf. [19]).
9
Joseph von Fraunhofer (born on March 6, 1787 in Straubing, Germany—died on June 7, 1826 in
Munich, Germany).
10
Friedrich Wilhelm Bessel (born on July 22, 1784 in Minden, Westphalia (now Germany)—died
on March 17, 1846 in Königsberg, Prussia (now Kaliningrad, Russia)), an astronomer, who is
famous for the Bessel function.
3.3 Short Biographies of Clausen, von Staudt and Kummer 47
11
Johann Carl Friedrich Gauss (born on April 30, 1777 in Brunswick, Duchy of Brunswick (now
Germany)—died on February 23, 1855 in Göttingen, Hanover
(now Germany)).
2˛;2ˇ;˛Cˇ
12
2 F1 .˛; ˇ; ˛ C ˇ C 12 I x/2 D 3 F2 ˛CˇC 12 ;2˛C2ˇ
Ix
13
It was correct up to 248 decimal places. This was the world record from 1847 to 1853. For a
history of computation of , see for example [13].
n
14
Fermat conjectured that all the numbers of the form 22 C 1 (which are called Fermat numbers)
are prime. This is true for n 4, but Euler showed in 1732 that this is not the case for n D 5 by
n
giving the factorization. As of November 2013, it is known that every Fermat number 22 C 1 with
5 n 32 is not prime. (Complete factorization is known up to n D 11.)
15
Max Noether (born on September 24, 1844 in Mannheim, Germany—died on December 13,
1921 in Erlangen, Germany).
48 3 Theorem of Clausen and von Staudt, and Kummer’s Congruence
congruence (modulo prime number case). He presented these results in two papers16
submitted to Erlangen University on the occasion of becoming a member of the
senate and the faculty. These results, however, caught no attention because, as
Noether put it, von Staudt was so modest and did not care about the circulation of
the papers (cf. [88] for the contents of these two papers). He also published a paper
[97] on a proof of the “fundamental theorem on algebra” of Gauss (Clausen also
wrote a paper on the same subject [25]). He wrote up a paper one week before his
death, while suffering from asthma. He died on June 1, 1867. Noether quotes from
the funeral address the passage “his endurance, prepared for the lifelong effort, not
for months or years, to pursue a distinct goal”, which he thinks describes well the
character of von Staudt.
Ernst Eduard Kummer was born on January 29, 1810 in Sorau, Germany
(now Zary, Poland). Kummer entered the Gymnasium in Sorau in 1819 and
the University of Halle in 1828. He soon gave up his original study, Protestant
theology, under the influence of the mathematics professor Scherk17 and applied
himself to mathematics. In 1831 he received a prize for his essay on the question
posed by Scherk: “De cosinuum et sinuum potestatibus secundum cosinus et sinus
arcuum multiplicium evolvendis”.18 Kummer taught from 1832 until 1842 at the
Gymnasium in Liegnitz (now Legnia, Poland), mainly mathematics and physics.
His students during this period included Kronecker and Joachimsthal,19 both of
whom became interested in mathematics through Kummer’s encouragement and
stimulation. In particular, the influence of Kummer upon Kronecker was enormous
and they became lifelong friends. One can see the fragments of their friendship in
their correspondence collected in Kummer’s Werke.
The subjects of Kummer’s mathematical study are rather clearly divided accord-
ing to the periods. The first is in function theory, the second in number theory
and the third in geometry. His period of Gymnasium teaching coincided with his
creative period in function theory. The most important was the famous paper on
the hypergeometric series [63]. Kummer sent this paper to Jacobi, which led to his
scientific connection with Jacobi and with Dirichlet.20 In 1839, through Dirichlet’s
proposal, Kummer became a member of the Berlin Academy of Sciences, and in
1840, he married Ottilie Mendelssohn, a cousin of Dirichlet’s wife.21 (She died in
1848 and later Kummer married Bertha Cauer.) On the recommendation of Dirichlet
and Jacobi, Kummer was appointed full professor at the University of Breslau (now
16
Under the same title “De numeris Bernoullianis”, Erlangen, 1845.
17
Heinrich Ferdinand Scherk (born on October 27, 1798 in Poznan, Poland—died on October 4,
1885 in Bremen, Germany), he also wrote a paper on Bernoulli numbers [81].
18
“On expansions of powers of cosine and sine by cosine and sine with their arguments multiplied.”
19
Ferdinand Joachimsthal (born on March 9, 1818 in Goldberg, Prussian Silesia (now Zlotoryja,
Poland)—died on April 5, 1861 in Breslau, Germany (now Wroclaw, Poland)).
20
Johann Peter Gustav Lejeune Dirichlet (born on February 13, 1805 in Düren, French Empire
(now Germany)—died on May 5, 1859 in Göttingen, Hanover (now Germany)).
21
Dirichlet’s wife Rebecca Mendelssohn is a younger sister of the composer Felix Mendelssohn.
3.3 Short Biographies of Clausen, von Staudt and Kummer 49
22
Lazarus Immanuel Fuchs (born on May 5, 1833 in Moschin, Prussia (now Poznan, Poland)—died
on April 26, 1902 in Berlin, Germany).
23
Paul David Gustav du Bois-Reymond (born on December 2, 1831 in Berlin, Germany—died on
April 7, 1889 in Freiburg, Germany).
24
Paul Albert Gordan (born on April 27, 1837 in Breslau, Germany (now Wroclaw, Poland)—died
on December 21, 1912 in Erlangen, Germany).
25
Paul Gustav Heinrich Bachmann (born on June 22, 1837 in Berlin, Germany—died on March
31, 1920 in Weimar, Germany).
26
Karl Herman Amandus Schwarz (born on January 25, 1843 in Hermsdorf, Silesia (now
Poland)—died on November 30, 1921 in Berlin, Germany).
27
Georg Ferdinand Ludwig Philipp Cantor (born on March 3, 1845 in St. Petersburg, Russia—died
on January 6, 1918 in Halle, Germany).
28
Arthur Moritz Schönflies (born on April 17, 1853 in Landsberg an der Warthe, Germany (now
Gorzów, Poland)—died on May 27, 1928 in Frankfurt am Main, Germany).
Chapter 4
Generalized Bernoulli Numbers
Let us define a Dirichlet character as a map from the set of integers Z to the set of
complex numbers C.
Definition 4.1. Let f be a natural number. A function W Z ! C is called a
Dirichlet character modulo f if it satisfies the following three conditions:
(i) is multiplicative; i.e. .ab/ D .a/.b/ for any a; b 2 Z.
(ii) The value of .a/ only depends on a mod f ; i.e. .a C bf / D .a/ for any
a; b 2 Z.
(iii) If .a; f / D 1, then .a/ ¤ 0, and if .a; f / ¤ 1, then .a/ D 0. Here, .a; f /
stands for the greatest common divisor of a and f .
In particular, the function 0 defined by
1 .a; f / D 1;
0 .a/ D
0 .a; f / ¤ 1
satisfies the three conditions. We call this the trivial character (modulo f ).
For a natural number f , let '.f / be the number of integers from 1 to f which
are relatively prime to f . This is called the Euler (totient) function. If a is an integer
relatively prime to f , then we have
a'.f / 1 mod f:
This is known as Euler’s theorem in elementary number theory. For a proof see for
example Ireland and Rosen [50].
A Dirichlet character modulo f satisfies the following properties.
Lemma 4.2. (1) If a 1 mod f , then .a/ D 1.
(2) If .a; f / D 1, then .a/'.f / D 1.
(3)
(3) Since '.f / is the number of a mod f such that .a; f / D 1, the assertion
holds if is the trivial character. If is not thePtrivial character, then there
exists b such
P that .b/ ¤ 0; 1. If we
P put S D a mod f .a/, then we have
.b/S D a mod f .b/.a/ D a mod f .ba/. When a runs through all
the representatives modf , so does ba since .b; f / D 1. Thus, the sum on the
right equals S . This implies .1.b//S D 0 and hence S D 0, as 1.b/ ¤ 0.
t
u
We note that a Dirichlet character defines naturally by definition a character of
the multiplicative group .Z=f Z/ .
Now suppose f 0 j f , and let 0 be a Dirichlet character modulo f 0 . If satisfies
then we say that is defined mod f 0 . For a Dirichlet character modulo f , the
smallest f 0 such that is defined modf 0 is called the conductor of , and is
written as f . By definition, f is a divisor of f .
Definition 4.3 (primitive character). A Dirichlet character modulo f is called
primitive if f D f .
4.2 Generalized Bernoulli Numbers 53
.a/ D .a/;
Xf
X1
.a/te at tn
D Bn; :
aD1
ef t 1 nD0
nŠ
In the following we assume f > 1 and is not the trivial character unless otherwise
stated.
Since we have the expansion
te at 1 a 1
D C t
ef t 1 f f 2
2 3
a f t2 a 3a2 a f t3
C aC C C C ;
f 6 2 f 2 2 6
54 4 Generalized Bernoulli Numbers
B0; D 0;
1 X
f
B1; D .a/a;
f aD1
1 X X
f f
B2; D .a/a2 .a/a;
f aD1 aD1
1 X 3X f X
f f f
B3; D .a/a
3
.a/a C
2
.a/a:
f aD1 2 aD1 2 aD1
X
f
Bn; D f n1
.a/Bn .a=f / ; (4.1)
aD1
where Bn .x/ is the Bernoulli polynomial, which will be defined in the next section.
(See Remark 4.10 (2) in the next section.)
We have seen that all the Bernoulli numbers with odd indices greater than 1 are 0.
For generalized Bernoulli numbers we have the following.
Proposition 4.5. Let be a non-trivial character. Then, for any n satisfying
.1/n1 D .1/, we have Bn; D 0. In other words, if is an even character,
then Bn; with odd indices n are 0; if is an odd character, then Bn; with even
indices n are 0.
Proof. Since is non-trivial, we can rewrite the generating function as follows:
f 1 f 1
X .a/te at X .f a/te .f a/t
D
aD1
ef t 1 aD1
ef t 1
f 1
X .a/te at
D .1/
aD1
1 e f t
f 1
X .a/.t/e a.t /
D .1/ :
aD1
e f .t / 1
It follows immediately from this that the generating function is an even function if
.1/ D 1, and an odd function if .1/ D 1. t
u
4.3 Bernoulli Polynomials 55
Remark 4.6. For any n satisfying .1/n D .1/, it is known that Bn; is non-zero.
This can be seen as follows. From Theorem 9.6 on p. 148, Bn; is non-zero if and
only if L.n; / is non-zero. We can prove L.n; / ¤ 0 using the Euler product
formula for the L-function with n > 1. The fact that L.1; / ¤ 0, i.e., B1; ¤ 0 for
all , is essentially equivalent to Dirichlet’s theorem on prime numbers in arithmetic
progressions. Dirichlet proved his theorem using the class number formula. See
Dirichlet [29] or Serre [83, Chap. 6].
1
I.x n / D .x C 1/nC1 x nC1 D x n C .lower terms/;
nC1
Bn .x/ WD I 1 .x n /:
1
This was suggested by Don Zagier.
56 4 Generalized Bernoulli Numbers
Example 4.8.
1 1 3 1
B0 .x/ D 1; B1 .x/ D x ; B2 .x/ D x 2 x C ; B3 .x/ D x 3 x 2 C x;
2 6 2 2
1 5 5 1
B4 .x/ D x 4 2x 3 C x 2 ; B5 .x/ D x 5 x 4 C x 3 x:
30 2 3 6
(7)
X
k1 i
Bn x C D k 1n Bn .kx/ .k 1/:
i D0
k
Z xC1
Proof. (2) It is obtained by differentiating the defining equation Bn .y/ dy D
x
x n with respect to x.
(3) It follows from (2) that
Z xC1
Bn0 .y/ 1
dy D .Bn .x C 1/ Bn .x// D x n1 :
x n n
Bn0 .x/
D Bn1 .x/;
n
or
the right-hand side of (4.2) as a formal power series, and applying I term by
term, we have I.te xt =.e t 1// D tI.e xt /=.e t 1/ because of the linearity.
Furthermore, since we have
Z xC1
e .xC1/t e xt e t 1 xt
I.e xt / D e yt dy D D e ;
x t t
we see I.te xt =.e t 1// D e xt . The injectivity of I implies that both sides are
equal.
(1) The first half of (1) can be seen by putting x D 1 in (5) and by using the
generating function of Bernoulli numbers (Theorem 1.12 on p. 20). Putting
x D 0 in (2), we see that Bn .1/ D Bn .0/ for n ¤ 1. The second half of the
assertion follows from this and the first half.
(6) We compute the product t=.e t 1/e xt on the right-hand side of the generating
function (4.2). Since
X 1
t .t/e t tn
D D .1/ n
Bn ;
et 1 e t 1 nD0
nŠ
we have
1
! 1 !
t X tn X .xt/n
e D
xt n
.1/ Bn
et 1 nD0
nŠ nD0
nŠ
0 ! 1
X1 Xn
@ .1/j n tn
D Bj x nj A :
nD0 j D0
j nŠ
Z x
k C1
Dk Bn .y/ dy
x
k
x n
Dk D k 1n x n :
k
From this we have
X
k1
x i
Bn C D k 1n Bn .x/:
i D0
k k
1
xk D .BkC1 .x C 1/ BkC1 .x// :
kC1
X
n
1
ik D .BkC1 .n C 1/ BkC1 .1// :
i D1
kC1
k 1
1 kC1
Since kC1 j
D j kC1j , the formula (1.1) follows.
(2) The formula (4.1) which expresses generalized Bernoulli numbers in terms of
Bernoulli polynomials is obtained as follows. In (4.2) replace x by a=f and t
by f t. Then we obtain
1
te t a 1 X .f t/n
D Bn .a=f / :
ef t 1 f nD0 nŠ
4.3 Bernoulli Polynomials 59
Thus, we have
1
X X X 1
1 X
f f
tn .a/te t a .f t/n
Bn; D D .a/ Bn .a=f /
nD0
nŠ aD1
e 1
f t
aD1
f nD0 nŠ
1 X
X f
f n1 t n
D .a/Bn .a=f / :
nD0 aD1
nŠ
X
f
Bn; D f n1 .a/Bn .a=f /:
aD1
kŠ X e 2 i nx
Bk .x Œx/ D (4.3)
.2 i /k nk
n2Z
n¤0
holds for all real numbers x if k 2; it holds for all real numbers x 62 Z, if k D 1.
Here, the sum is taken for all integers different from 0. If k D 1, the infinite sum on
the right-hand side should be understood as
X
N
e 2 i nx
lim :
N !1 n
nDN
n¤0
If k 2, then the right-hand side of (4.3) converges absolutely and uniformly with
respect to x.
Proof. Let 0 < x < 1. Decompose the function
e x
f ./ D
e 1
into partial fractions as a meromorphic function in . This f ./ has poles at
D 2 i n .n 2 Z/ and all of them are of order 1. The residue at D 2 i n is
given by
e x.t C2 i n/
lim . 2 i n/f ./ D lim t D e 2 i nx :
!2 i n t !0 e t C2 i n 1
60 4 Generalized Bernoulli Numbers
Let N be a sufficiently large natural number, and put R D 2.N C1=2/. Let CN be a
square path passing through four corner points RCiR; RCiR; RiR; RiR
in this order in the -plane. If t is a point inside CN such that t ¤ 2 i n .n 2 Z/,
then it follows from the residue theorem that
Z !
f ./ X
N
e 2 i nx
d D 2 i f .t/ C
CN t nDN
2 i n t
!
1 XN
e 2 i nx
D 2 i f .t/ :
t t 2 i n
nDN
n¤0
1 X
N
e 2 i nx
f .t/ D C lim
t N !1 t 2 i n
nDN
n¤0
1 X
N
e 2 i nx
D lim
t N !1 2 i n 1 2ti n
nDN
n¤0
X 1 k1
e 2 i nx X
N
1 t
D lim :
t N !1 2 i n 2 i n
nDN kD1
n¤0
X1 X1
e 2 i nx k1
t
nD1
.2 i n/k
kD2
n¤0
X
N 1 X
X 1
1 e 2 i nx e 2 i nx k1
f .t/ D lim t :
t N !1 2 i n nD1 .2 i n/k
nDN kD2
n¤0 n¤0
4.3 Bernoulli Polynomials 61
Since the generating function (4.2) of Bernoulli polynomials can be regarded as the
Taylor expansion of te xt =.e t 1/ at t D 0 for each real number x, we have
1
X t k1
f .t/ D Bk .x/ :
kŠ
kD0
X
N
e 2 i nx
B1 .x/ D lim ;
N !1 2 i n
nDN
n¤0
1
X e 2 i nx
Bk .x/ D kŠ .k 2/:
nD1
.2 i n/k
n¤0
Since the sum on the right-hand side converges uniformly over the closed interval
Œ0; 1 if k 2, and since Bk .x/ is continuous in the same interval, the above formula
holds in Œ0; 1. Furthermore, the right-hand side is periodic with respect to x with
period 1. Thus, (4.3) is obtained by extending the left-hand side. t
u
From this theorem we can obtain the values of the Riemann zeta function at
positive even integers. (The Riemann zeta function will be treated in the next
section.)
Corollary 4.12. Let k be an even integer greater than or equal to 2. Then we have
X1
1 1 Bk
k
D .2 i /k : (4.4)
nD1
n 2 kŠ
For example,
X1 X1 X1 X1
1 2 1 4 1 6 1 8
D ; D ; D ; D ; etc:
nD1
n2 6 nD1
n4 90 nD1
n6 945 nD1
n8 9450
Proof. It suffices to put x D 0 in the theorem. The left-hand side is equal to Bk from
Proposition 4.9 (1). Since the right-hand side is twice the sum over the positive n
since k is even, the formula in question holds. t
u
Remark 4.13. The formula in the theorem can be rewritten as follows.
(1) If k is even,
X1
cos 2 nx
Bk .x Œx/ D 2.1/k=21 kŠ :
nD1
.2 n/k
62 4 Generalized Bernoulli Numbers
(2) If k is odd,
X1
sin 2 nx
Bk .x Œx/ D 2.1/.kC1/=2kŠ : (4.5)
nD1
.2 n/k
pqr S.p; q; rI a; b/ 2 Q:
X Z 1
e..am C bn C l/x/
D dx
0 mp nq l r
m; n; l2Z
m; n; l¤0
Z X
1
e..am C bn C l/x/
D dx
0 mp nq l r
m; n; l2Z
m; n; l¤0
Z 1 X e.amx/ X e.bnx/ X e.lx/
D dx:
0 mp nq lr
m2Z n2Z l2Z
m¤0 n¤0 l¤0
P
Here, we used the fact that the series m¤0 e.amx/=mp converges uniformly over
the closed interval Œ0; 1, because p; q; r 2. Using the formula (4.3), we have
2
This is based on a lecture of Don Zagier at Kyushu University in 1999.
4.3 Bernoulli Polynomials 63
pCqCr2 Z 1
.1/ 2 .2/pCqCr
S.p; q; rI a; b/ D Bp .axŒax/Bq .bxŒbx/Br .x/ dx:
pŠqŠrŠ 0
Thus, the value of the integral is a rational number (recall that the Bernoulli
polynomial is a polynomial with rational coefficients), and the assertion of the
proposition follows. t
u
Example 4.15. (p D q D r D 2, a D b D 1.) From the above formula,
Z 1
.2/6
S.2; 2; 2I 1; 1/ D B2 .x/3 dx:
8 0
Integration by parts using B10 .x/ D 1; B20 .x/ D 2B1 .x/ gives
Z 1 Z 1
B2 .x/3 dx D B10 .x/B2 .x/3 dx
0 0
Z
1 1
D B1 .x/B2 .x/3 0 6B1 .x/2 B2 .x/2 dx
0
Z 1 0
D B23 2 B1 .x/3 B2 .x/2 dx
0
Z
1 8 1 0
D B23 2 B1 .x/3 B2 .x/2 0 C B1 .x/5 B2 .x/ dx
5 0
Z
1 8
1 16 1
D B22 C
B23 B1 .x/5 B2 .x/ 0 B1 .x/6 dx
2 5 5 0
1 1 16
1
D B23 B22 C B2 B1 .x/7 0
2 10 35
1
D 2 3 :
2 3 57
Thus, we have
2 6
S.2; 2; 2I 1; 1/ D :
33 5 7
Exercise 4.16. For a non-trivial character modulo f > 1, write B4; and B5; by
Pf
aD1 .a/a with i D 1, 2, 3, 4, 5.
i
Exercise 4.17. Prove that the Bernoulli polynomial Bn .x/ for odd n > 1 is always
divisible by x.x 12 /.x 1/.
Chapter 5
The Euler–Maclaurin Summation Formula
and the Riemann Zeta Function
Aside from the formula for the sum of powers, the most basic topics in which
Bernoulli numbers appear is the Euler–Maclaurin1 summation formula and the
values of the Riemann zeta function at integer arguments. In this chapter we survey
these topics.
The Euler–Maclaurin summation formula gives a very effective tool for evaluating
a sum of values of a function at integers.
Theorem 5.1. Let a and b be integers satisfying a b, and let M be a natural
number. Suppose f .x/ is an M times continuously differentiable function2 over
Œa; b. Then, we have
X
b Z b X BkC1 M 1
1
f .n/ D f .x/dxC .f .a/Cf .b// C .f .k/ .b/ f .k/ .a//
nDa a 2 .k C 1/Š
kD1
M Z b
.1/
BM .x Œx/f .M / .x/dx:
MŠ a
Here, BkC1 is the Bernoulli number and BM .x/ is the Bernoulli polynomial, and
the sum on the right-hand side is understood to be 0 if M D 1.
1
Colin Maclaurin (born in February 1698 in Kilmodan, Cowal, Argyllshire, Scotland—died on
June 14, 1746 in Edinburgh, Scotland).
2
Differentiable M times and the M th derivative is continuous.
Remark 5.2. (1) Regard f .n/ as the area of a rectangle whose base is the interval
Œn 12 ; n C 12 and whose height is f .n/. Then the sum of the first two terms
on the right-hand side is the first approximation, so to say. The next term can
be seen as the higher approximation in terms of the data at the end points of the
interval.
(2) If we use the values of the Riemann zeta function at negative integers .k/ D
BkC1 =.k C 1/, which will be proved later, then the formula looks like
X
b Z b X .k/ M 1
1
f .n/ D f .x/dxC .f .a/Cf .b// .f .k/ .b/f .k/ .a//
nDa a 2 kŠ
kD1
Z b
.1/M
BM .xŒx/f .M / .x/dx:
MŠ a
Proof. Let g.x/ be an M times continuously differentiable function on Œ0; 1. Since
B1 .x/ D x 1=2, we have B10 .x/ D 1. Thus, using integration by parts, we have
Z 1 Z 1
g.x/dx D ŒB1 .x/g.x/10 B1 .x/g 0 .x/ dx
0 0
Z 1
1
D .g.1/ C g.0// B1 .x/g 0 .x/ dx:
2 0
0
Iterating integration by parts using Bk .x/ D BkC1 .x/=.k C 1/ (Proposition 4.9 (3)
on p. 56), we obtain
Z Z
1
1 1
1 1 1
g.x/dx D .g.1/ C g.0// B2 .x/g 0 .x/ 0 C B2 .x/g 00 .x/ dx
0 2 2 2 0
1 1
1
D .g.1/ C g.0// B2 .x/g 0 .x/ 0
2 2
Z 1
1
1 1
C B3 .x/g 00 .x/ 0 B3 .x/g 000 .x/ dx
23 23 0
D
X
M 1
1 .1/k
1
D .g.1/ C g.0// C BkC1 .x/g .k/ .x/ 0
2 .k C 1/Š
kD1
M Z 1
.1/
C BM .x/g .M / .x/ dx:
MŠ 0
Z 1 X
M 1
1 BkC1
g.x/dx D .g.1/ C g.0// .g .k/ .1/ g .k/ .0//
0 2 .k C 1/Š
kD1
M Z 1
.1/
C BM .x/g .M / .x/ dx:
MŠ 0
Therefore,
Z 1 X
M 1
1 BkC1
.g.1/ C g.0// D g.x/dx C .g .k/ .1/ g .k/ .0//
2 0 .k C 1/Š
kD1
M Z 1
.1/
BM .x/g .M / .x/dx:
MŠ 0
The formula in the theorem is obtained by adding up the above formula from n D a
to n D b 1 and then adding 12 .f .a/ C f .b// to both sides. t
u
The exponential function e z with the complex variable z is defined by the formula
z2 zn
ez D 1 C z C C C C :
2Š nŠ
The power series on the right-hand side converges absolutely for any z 2 C, and
thus it is a holomorphic function over the entire z-plane. From the formula for the
product of two power series, we have the exponential law
e zCw D e z e w :
For any positive real number a, the complex power as for s 2 C is defined by
as D e s log a :
jas j D e log a ;
where D Re.s/.
Let s be a complex number. Consider the sum
XN
1 1 1
s
D 1C s C C s:
nD1
n 2 N
f .x/ D x s :
XN
1 1 1
D 1
nD1
ns s 1 N s1
M 1
!
1 1 X BkC1 sCk1 1
C 1C s C 1 sCk
2 N kC1 k N
kD1
!Z
sCM 1 N
BM .x Œx/x sM dx; (5.1)
M 1
where xi is the binomial coefficient x.x 1/ .x i C 1/= i Š, and we used the
RN
fact .1/k BkC1 D BkC1 . If s D 1, then it follows from 1 x 1 dx D log N that
5.2 The Riemann Zeta Function 69
XN MX1
1 1 1 BkC1 1
D log N C 1C C 1 1Ck
nD1
n 2 N kC1 N
kD1
Z N
BM .x Œx/x 1M dx: (5.2)
1
XN ˇ ˇ
X
ˇ1ˇ N
1
Proof. (1) Let Re.s/ D . Since ˇ ˇ
ˇ ns ˇ D n
, by putting s D , M D 1
nD1 nD1
in formula (5.1), we obtain
XN Z N
1 1 1 1 1
D 1 1
C 1 C B1 .x Œx/x 1 dx:
nD1
n 1 N 2 N
1
3
It is unknown whether D 0:5772156649015328606065120900824 is an irrational number
or not.
70 5 The Euler–Maclaurin Summation Formula and the Riemann Zeta Function
XN MX1
1 1 1 BkC1 1
aN D log N D 1C C 1 1Ck
nD1
n 2 N kC1 N
kD1
Z N
BM .x Œx/x 1M dx:
1
Similarly to (1), the last integral converges, and thus faN g converges.
Letting N ! 1, we obtain
X
M 1 Z 1
1 BkC1
D lim aN D C BM .x Œx/x 1M dx
N !1 2 kC1 1
kD1
X
M 1 Z 1
BkC1
D BM .x Œx/x 1M dx:
kC1 1
kD0
The famous Riemann4 zeta function .s/ is defined for all complex numbers s
satisfying Re.s/ > 1 by
X1
1
.s/ D s
:
nD1
n
As we have already seen, the infinite series on the right-hand side converges
absolutely for Re.s/ > 1. If is a positive number greater than 1, then this series
converges uniformly in Re.s/ , and thus it is a holomorphic function of s in
Re.s/ > 1. Let us study properties of .s/, using the summation formula (5.1).
Suppose Re.s/ > 1, and let N ! 1 in (5.1). Then,
1 1
!
X 1 1 1 X
M
BkC1 sCk1
D C C (5.5)
nD1
ns s1 2 kC1 k
kD1
!Z
1
sCM 1
BM .x Œx/x sM dx:
M 1
Now, take the difference between (5.1) and (5.5). Then, for Re.s/ > 1, we have
1
!
X N
1 1 1 1 X
M
BkC1 sCk1 1
.s/ D s1 C sCk
nD1
n s s 1 N 2N s k C 1 k N
kD1
!Z
1
sCM 1
BM .x Œx/x sM dx: (5.6)
M N
Since the integral of the right-hand side converges absolutely in the region Re.s/ >
1 M , the formula (5.6) gives an analytic continuation of .s/ to a meromorphic
function in the region Re.s/ > 1 M . In other words, .s/ can be extended to
Re.s/ > 1 M through the formula (5.6). Since M can be any natural number,
.s/ has an analytic continuation to the entire s-plane as a meromorphic function.
It also follows from (5.6) that its only pole is a simple pole located at s D 1, and the
residue at s D 1 equals 1.
We summarize all this, and find the values of .s/ at integers less than or equal
to 0.
Theorem 5.4. (1) The function .s/ has an analytic continuation to the entire
s-plane as a meromorphic function, is holomorphic if s ¤ 1, and has a pole
of order 1 at s D 1 with residue 1.
4
Georg Friedrich Bernhard Riemann (born on September 17, 1826 in Breselenz, Germany—died
on July 20, 1866 in Selasca, Italy).
72 5 The Euler–Maclaurin Summation Formula and the Riemann Zeta Function
Suppose m 2. Now,
by using the
m1formulas
.1 m/.2 m/ .k m/ D
.1/k kŠ m1
k , 1 m1
kC1 k D 1
mk1 kC1 if k < m 1, and .1/k Bk D Bk for
k 2, we have
X
m1
BkC1
.1 m/.2 m/ .k m/ N mk1
.k C 1/Š
kD1
!
X
m1
k BkC1 m 1
D .1/ N mk1
kC1 k
kD1
!
X
m2
m 1 N mk1 Bm
D .1/k BkC1 C .1/m1
kC1 mk1 m
kD1
!
X
m1
k1 m 1 N mk Bm
D .1/ Bk C .1/m1
k mk m
kD2
!
X m1
m1
N mk Bm
D Bk :
k mk m
kD2
(since B0 D 1; B1 D 1=2). On the other hand, from the formula for the sum of
powers (1.1) on p. 1, we have
!
X
N X
m1
m1 N mk
m1
n D Bk : (5.8)
nD1
k mk
kD0
Bm
.1 m/ D :
m
t
u
Remark 5.5. The value in (2) was first given by Euler ([32], see also Weil5 [103]).
The values of the Riemann zeta function at positive even integers in Corollary 4.12
(p. 61)
.1/k1 B2k
.2k/ D .2/2k .k 2 N/
2 .2k/Š
were also computed first by Euler (ibid.). There is not much known about the values
at positive odd integers. It is conjectured that all .2k C 1/ are transcendental
numbers.6 Apéry7 [5] proved that .3/ is an irrational number, and Rivoal [80]
proved that there are infinitely many irrational numbers among .2k C 1/ .k D
1; 2; 3; : : :/ (For more recent results, see [99].)
Remark 5.6. The formula (5.8) for the sum of powers can also be obtained
P usingm1
the
Euler–Maclaurin summation formula (Theorem 5.1). Indeed, the sum N nD1 n
can be obtained by letting f .x/ D x m1 , a D 1, and b D N in the summation
formula.
Exercise 5.7. Use (5.6) (N D M D 1) and (5.3) to prove
1
lim .s/ D :
s!1 s1
5
André Weil (born on May 6, 1906 in Paris, France—died on August 6, 1998 in Princeton, USA).
6
Since is a transcendental number [71], .2k/ are all transcendental numbers.
7
Roger Apéry (born on November 14, 1916 in Ruen, France—died on December 18, 1994 in Caen,
France).
74 5 The Euler–Maclaurin Summation Formula and the Riemann Zeta Function
There are close relations between Bernoulli numbers and quadratic fields or
quadratic forms. In order to explain those, we give a survey of the ideal theory
of quadratic fields and quadratic forms. Since Gauss, it is well known that there is a
deep relation between the ideal theory of quadratic fields (i.e. quadratic extensions
of the rational number field) and integral quadratic forms. This is obvious for
specialists, but textbooks which explain this in detail are rare. In most textbooks,
they treat the correspondence of ideals only in the case of maximal orders (the
ring of all integers) for which the description is simple and easy. This is very
disappointing. We cannot see the whole picture of the theory of quadratic forms by
such a restricted treatment and sometimes it causes misunderstanding. We need the
full description of this kind of relation when we explain later the relation between
L-functions of prehomogeneous vector spaces and the Bernoulli numbers. So it
would be a good chance to try to explain the full relation here.
Q.x; y/ D ax 2 C bxy C cy 2
of degree two an integral binary quadratic form (or a quadratic form for short, since
we treat only the integral and binary case in this book). In particular, in case a, b, c
are mutually coprime (namely the greatest common divisor, g.c.d., of a, b, c is one),
we say Q.x; y/ is primitive. If we define the matrix S by
a b=2
SD ;
b=2 c
Often this expression helps easy understanding. A symmetric matrix like S whose
diagonal components are integers and the other components are half-integers1 is
called a half-integral symmetric matrix. We denote by L the set of all half-integral
symmetric matrices of order two:
a b=2
L D I a; b; c 2 Z :
b=2 c
where M2 .Z/ is the set of all integral 2 2 matrices. The group SL2 .Z/ is called the
(full) modular group and this is the subgroup of GL2 .Z/ of index two. The group
GL2 .Z/ (and hence also SL2 .Z/) acts on L from the right by
Two elements of L or the corresponding quadratic forms are said to be SL2 .Z/-
equivalent, or just equivalent, when they belong to the same SL2 .Z/-orbit under this
action. Two elements of L are said to be GL2 .Z/-equivalent when they belong to
the same GL2 .Z/ orbit by the above action. (In some books, SL2 .Z/-equivalence
is called a proper equivalence, but we do not add the word “proper” for this in this
book). Usually we call the number b 2 4ac D det.2S / the discriminant of the
quadratic form. We denote this number by D.S /. It is obvious that D.S / 0
or 1 mod 4. On the other hand, if D is an integer such that D 0 or 1 mod 4,
there exists S 2 L such that D.S / D D. The discriminant is invariant by the
action of GL2 .Z/. The cases where D is a square (i.e. a square of integers, e.g.
0, 1, 4, 9 . . . ) are exceptional, so in this chapter, we assume that D is not a square
unless otherwise stated. We fix such a non-square integer D 0 or 1 mod 4 and put
L .D/ D fS 2 L I D.S / D D; S primitiveg. Then it is known that both SL2 .Z/-
equivalence classes and GL2 .Z/-equivalence classes in L .D/ are finite. (The proof
1
Here half-integers mean their doubles are integers.
6.2 Orders of Quadratic Fields 77
of the finiteness of the class number and an explicit formula for the class number
will be given later.) When D.S / > 0, we call these numbers of equivalence classes
the class number and the class number in the wide sense of quadratic forms of
discriminant D for SL2 .Z/-equivalence and for GL2 .Z/ equivalence, respectively.
When D.S / < 0, we have ac > 0, and this means that S is positive definite
or negative definite. We denote by LC .D/ the subset of positive definite quadratic
forms of L .D/ and by L .D/ negative definite forms. When D.S / < 0, we say
that the number of SL2 .Z/ equivalence classes in LC .D/ the class number of the
quadratic forms of discriminant D. In this case, the number of GL2 .Z/ equivalence
classes in L .D/ and the number of SL2 .Z/ equivalence classes in LC .D/ are the
same. Indeed this is proved as follows. Any S 2 L .D/ is GL2 .Z/ equivalent to
1 0
an element of LC .D/ since for D , we have det. /t S D t S 2
0 1
LC .D/. If S1 and S2 2 LC .S / are GL2 .Z/ equivalent and S2 D det. /t S1 for
2 GL2 .Z/, then since S2 and t S1 are positive definite, we have det. / > 0,
hence det. / D 1, and S1 and S2 are SL2 .Z/ equivalent.
Now we review the theory of quadratic fields, but there are many books describing
this theory, so here we write only briefly. Also, we describe the theory mostly from
the viewpoint of quadratic forms. In some parts this makes our explanation more
complicated. But this is still useful since this kind of explanation is seldom found in
books nowadays. p p
When m is a non-square integer, we say that K D Q. m/ D Q C Q m is a
quadratic field. When m > 0, we call K a real quadratic field and when m < 0, we
call it an imaginary quadratic field. If m has a square factor, we can bring this factor
outside the square root, so we can assume that m is an integer without square factor.
We assume this from now on. An element ˛ in this field K satisfying ˛ 2 Cb˛Cc D 0
for some rational integers b, c (i.e. integers in the usual sense) is called an integer
of K. The set Omax of all integers in K form a ring. If we put
⎧ p
⎨ 1C m if m 1 mod 4;
2
!D p (6.1)
⎩ m otherwise,
then one can show Omax D ZpC Z!. The ring Omax is called the maximal order (or
the ring of all integers) of Q. m/.
If we define the linear map ˛ ! ˛ of ˛ 2 K to ˛ 2 K as Q-vector space by
p p
a C b m D a b m (a, b 2 Q), which is called a conjugation, then this is an
automorphism as a field. For any element ˛ of K, we write N.˛/ D ˛˛, T r.˛/ D
˛ C ˛, the former is called the norm and the latter the trace of the element ˛.
78 6 Quadratic Forms and Ideal Theory of Quadratic Fields
Of D Z C f !Z:
p p
Then this is an order of Q. m/. Conversely, for any order O of Q. m/, there exists
a natural number f such that O D Of .
Proof. To show that Z C f !Z is an order, we must show that this is a subring. This
is shown by the following calculation.
⎧
⎨ m1 2
f .f !/ C f if m 1 mod 4;
.f !/2 D 4
⎩ f 2m otherwise.
Of D Z C f Omax :
6.2 Orders of Quadratic Fields 79
a D Za C Z.d C ef !/:
a D Za C Z.d C ef !/:
that the module M D Za C Z.d C f !/ is always an ideal for any random choice
of a, d 2 Z. The module M is an Of -ideal only when a, d , f satisfy some special
conditions. We shall describe these conditions below.
Lemma 6.3. The module M D Za C Z.d C f !/ is an ideal of Of if and only if
ajN.d C f !/:
b 2 f 2 DK D 4ac;
b 2 f 2 DK D 4ac
for some c 2 Z. t
u
Remark 6.4. (1) Notation being the same as in the last lemma, we have
p
bC D
d C f! D ;
2
the term proper ideal is often used to indicate the ideal which is not the whole
ring itself. Note that our usage here is different from that.) Next, we shall see the
condition that a primitive ideal of Of is a proper Of -ideal.
Lemma 6.5. For a primitive ideal a of Of , we write a D Za C Z.d C f !/ by the
standard basis and define integers b, c as in Lemma 6.3. Then the ideal a is a proper
Of -ideal if and only if gcd of a, b, c is one.
Proof. Assume that a is an Of 0 module for a divisor f 0 of f . If a is an Of 0
module, then af 0 ! 2 a, hence we have af 0 ! D xa C y.d C f !/. Comparing
the coefficients, we see a D y.f =f 0 / and xa C yd D 0. But since a > 0,
we have y ¤ 0 and so d D x.f =f 0 /. Hence we have a; d 2 .f =f 0 /Z. So
bf 0 =f D df 0 =f C f 0 T r.w/ 2 Z and b 2 .f =f 0 /Z. Multiplying the Eq. (6.2) in
the proof of Lemma 6.3 by f 0 =f , we have
p
b 2 4ac 0 mod 4, so we have e 2. If e is odd, then put K D Q. 2m0 /.
Then we have DK D 8m0 and f D 2.e3/=2 f0 satisfies the demand. If e is even, put
p
K D Q. m0 /. If m0 1 mod 4 then we have DK D m0 , and then put f D 2e=2 f0 .
If m0 3 mod 4 then we have DK D 4m0 and put f D 2.e2/=2 f0 . These satisfy
the demand of the lemma. t
u
This lemma suggests that there is a deep relation between binary quadratic forms
and quadratic fields. For each proper primitive ideal a of an order of a quadratic
field, using a, b, c of Lemma 6.3, we define a quadratic form Qa by
Qa .x; y/ D ax 2 C bxy C cy 2 :
the set of equivalence classes in the narrow (wide) sense of proper Of ideals to
the set of SL2 .Z/-equivalence (GL2 .Z/-equivalence) classes of indefinite primitive
quadratic forms with discriminant f 2 DK .
Proof. For simplicity, we write L.D/ D L .D/ if D > 0 and L.D/ D LC .D/
if D < 0. We denote by L.D/=
the set of GL2 .Z/ equivalence classes in L.D/
and by L.D/= the set of SL2 .Z/ equivalence classes. Since any Of ideal is
equivalent (both in the wide sense and narrow sense) to the primitive ideal, it is
sufficient to consider primitive ideals. For a given proper primitive Of ideal, we
define a, b, c as before. Then since .a; b; c/ D 1, the quadratic form Qa .x; y/ D
ax 2 C bxy C cy 2 is primitive and the discriminant of Qa is f 2 DK . By our choice
of the standard basis of a, we have a > 0. So if D.S / < 0, then Qa .x; y/ is
positive definite. So the mapping a to Qa .x; y/ induces mappings to L.D/=
and to L.D/= . We first show that this mapping is surjective. Take a primitive
quadratic form Q.x; y/ D ax 2 Cbxy Ccy 2 . If this is positive definite, then we have
a > 0 automatically. If this is indefinite, then we might have a < 0, but changing the
quadratic form by SL2 .Z/ equivalence, we may assume that a > 0. This is proved
as follows. Assume that a < 0. For a given x, y, we have 4a.ax 2 C bxy C cy 2 / D
.2ax C by/2 C .4ac b 2 /y 2 . We have 4ac b 2 < 0 by the assumption
p that the
quadratic form is indefinite. We fix an integer y such that jaj= b 2 4ac < y.
Choosing x suitably, we can assume that 2ax C by is in any set of representatives
modulo 2a. So we can assume that j2ax C byj jaj. So we have .2ax C by/2
.b 2 4ac/y 2 < 0 for some x. This means that there exists .x; y/ 2 Z2 such
that ax 2 C bxy C cy 2 > 0. This property still holds if we divide x, y by the
g.c.d. of x and y, so we may assume that x and y are coprime. We can take an
A of SL2 .Z/ such that .x; y/ is the first row of A. So the (1,1) component
element
a b=2 t
of A b=2 c A is positive. This proves the assertion. We assume a > 0 from
now on. Let D be the discriminant of Q, that is, D D b 2 4ac D f 2 DK . Since we
have .f T r.!//2 D f 2 .DK C 4N.!//, we have .f T r.!//2 f 2 DK b 2 mod 4
and we have b f T r.!/ mod 2. So if we define d by b D 2d C f T r.!/, then
p
bC D
a D Za C Z.d C f !/ D Za C Z
2
Qa .x; y/ D a0 x 2 C b0 xy C c0 y 2
D a.x ky/2 C b.x ky/y C cy 2 ;
84 6 Quadratic Forms and Ideal Theory of Quadratic Fields
so this is SL2 .Z/ equivalent (a fortiori, GL2 .Z/-equivalent) to Q.x; y/. So the
mapping to L.D/=
or L.D/= is surjective. Next we see that these mappings
to L.D/=
and L.D/= induce well defined mappings from ideal classes in the
wide sense and in the narrow sense, respectively.pWrite two proper primitive Of
ideals by standard bases as ai D Zai C Z.bi C D/=2,pwhere D D bi2 4ai ci
(i D 1, 2), and assume that a2 D p a1 ˛ for some ˛ 2 Q. D/ p . Then there exists
A 2 GL2 .Z/ such that .a2 ; .b2 C D/=2/ D .a1 ˛; .b1 C D/˛=2/A. Since we
have
p ! p !
a2 b2 C2p D a1 b1 C2p D ˛0
D A;
a2 b2 D
2
a1 b1 D 2
0˛
D a1 N.˛/.a1 X 2 C b1 X Y C c1 Y 2 /
D a2 det.A/.a1 X 2 C b1 X Y C c1 Y 2 /:
Since ai > 0 for i D 1, 2, we have N.˛/ det.A/ > 0. So we are done. Next
we see that the mapping is injective. For two primitive proper Of ideals a1 D
Za1 C Z.d1 C f !/ and a2 D Za2 C Z.d2 C f !/, we define
So we have
So adding !1 ! 2 C!1 !2 D det.A/b2 a1 D det.A/a1 .2d2 CT r.f !// to the both sides
and dividing by 2, we have !2 ! 1 D det.A/a1 .d2 C f !/ D .N.!1 /=a2 /.d2 C f !/.
So we have a2 D a1 ˛. Hence a1 and a2 are equivalent in the wide sense, and if
det.A/ D 1, then equivalent in the narrow sense. Hence the mapping is injective.
t
u
When DK > 0, the equivalences of proper Of -ideals in the wide sense and in
the narrow sense are the same if and only if there exists " 2 Of with N."/ D 1, as
we can see easily. We give the corresponding fact in the case of symmetric matrices
(or quadratic forms) in the following proposition.
Proposition 6.8. Assume that D D f 2 DK is positive and is not a square. Assume
that for S1 , S2 2 L .D/ we have S2 D t BS1 B for some B 2 GL2 .Z/ with
det.B/ D 1. Then there exists C 2 SL2 .Z/ such that S2 D t CS1 C if and only if
there exists " 2 O
f such that N."/ D 1.
Proof. We may assume that S1 and S2 are primitive. First we assume the existence
of C . Then if we write A D BC 1 2 GL2 .Z/, then we have det.A/ D 1 and S1 D
det.A/ t AS1 A. We have shown in the proof of the last theorem that S1 is SL2 .Z/
equivalent to a matrix S in L.D/ corresponding to a quadratic form Qa .x; y/ for
some primitive Of proper ideal a D Za C Z.d C f !/. Now we put .!1 ; !2 / D
.a; d Cf !/A and ˛ D a=!1 , and apply the argument of the proof of the last theorem
for a1 D a2 D a. Then by (6.3) we have N.˛/ < 0 and we have a˛ D a. Since a is a
proper Of -ideal, we have ˛ 2 Of . We also have a D a˛ 1 , so ˛ 1 2 Of and N.˛/
and N.˛ 1 / are integers. So we have N.˛/ D 1 and ˛ 2 Of . Conversely, if there
exists
2 Of with N.
/ D 1, then we have a
D a. If we define A 2 GL2 .Z/ by
.a; d C f !/
D .a; d C f !/A;
86 6 Quadratic Forms and Ideal Theory of Quadratic Fields
then det.A/ D N.
/ D 1 and Qa ..x; y/A/ D det.A/Qa .x; y/, that is, AS t A D
det.A/S . Assuming that S1 D PS t P for P 2 SL2 .Z/, C D PAP 1 satisfies the
condition in the proposition. t
u
p p
Example 6.9. (1) If we pput K D Q. m/ and a D Zm C Z m, then Qa .x; y/ D
jmj1 N.jmjx C y pm/ D jmjx 2 sgn.m/y 2 . p
(2) We takep K D Q. 5/. For an ideal2 a D 3Z C .12 C 5/Z of Omax D
Z C Z 5, we have Qa .x; y/ D 3x C 2xy C 2y . On the other hand, for
Omax D .1/ itself, we have Q.1/ D x 2 C 5y 2 . Any primitive quadratic form
with discriminant 20 is equivalent to one of these two.
Remark 6.10. In the above, we did not treat negative definite quadratic forms. But
this causes no problem since all these can be obtained by multiplying 1 to the
positive definite ones. On the other hand, we excluded primitive quadratic forms
with square discriminant. These quadratic forms are (as far as discriminant is not
zero) very special quadratic forms in various senses. (For example, the volume of
the fundamental domain with respect to the unit group (the automorphism group)
is not finite etc.) We would like to omit such details, but at least we try to give a
classification of equivalence classes below.
Let Q.x; y/ D ax 2 C bxy C cy 2 be a primitive quadratic form and assume that
b 2 4ac D f 2 for a positive integer f . First we see that we may assume a D 0 by
replacing Q by an equivalent quadratic form. Indeed, if a ¤ 0, then noting that the
equation ax 2 C bx C c D 0 has a solution x D .b ˙ f /=2a, we denote by e the
g.c.d. of b C f and 2a and define x0 , y0 by b C f D x0 e and 2a D y0 e. Then
x0 and y0 are coprime and there exist w0 ; z0 2 Z such that x0 w0 y0 z0 D 1. Since
2
we have a xy00 C b xy00 C c D 0, we get ax02 C bx0 y0 C cy02 D 0 and
2
x0 y0 a b=2 x0 z0 ax0 C bx0 y0 C cy02 0
D D :
z0 w0 b=2 c y0 w0
c z 1 z
AD˙ or ˙ :
b w 0 ˙1
with det.A/ D ˙1. For the former one, if we take bz C cw D 1, then we have
det.A/ D 1 and
! ! !
t 0 b=2 0 b.bz C cw/=2 0 b=2
A AD D det.A/ ;
b=2 c b.bz C cw/=2 bzw C cw2 b=2 w
so we can choose a representative of SL2 .Z/ equivalence classes such that b > 0.
On the other hand, we have
10 0 b=2 1 z 0 b=2
D
z1 b=2 c 01 b=2 c C bz
with 0 c f 1 and 0 c f =2, respectively. The fact that these are not
equivalent with each other can be seen easily by the above calculation. In particular,
the number of SL2 .Z/- or GL2 .Z/- equivalence classes of quadratic forms with a
fixed square discriminant is finite. The number of SL2 .Z/ equivalence classes is f ,
and if we restrict to the primitive classes, the number is '.f /, where '.f / is the
Euler function. The case where the discriminant is not a square will be explained in
the next section.
In this section, first we prove the finiteness of the class number (the number of
equivalence classes) of primitive quadratic forms. As we mentioned in the last
section, this is equivalent to the finiteness of the class number (in the narrow sense or
in the wide sense) of quadratic orders Of , which are not necessarily maximal. Next
we state a formula of the class numbers (in the wide sense) of quadratic orders. (The
proof is postponed until Chap. 10.) As we saw in the last section, the class number
88 6 Quadratic Forms and Ideal Theory of Quadratic Fields
in the narrow sense is easily obtained from the one in the wide sense. Actually, once
we know the class number formula for the maximal order, the class number of an
arbitrary order is determined. The reason for this can be explained if we use group-
theoretic language, but in Chap. 10, we will give a different direct proof using zeta
functions.
Proposition 6.11. The number of SL2 .Z/ equivalence classes of binary primitive
quadratic forms of a given discriminant is finite. In particular, the class number of
binary primitive quadratic forms with discriminant f 2 DK is finite.
Proof. We already explained the case where the discriminant is square. So, here
we treat only those with discriminant f 2 DK , where K is a quadratic field and DK
is the fundamental discriminant of K. We fix f 2 DK now and we consider only
those quadratic forms which have this number as discriminant. We say first that any
primitive quadratic form is equivalent to ax 2 C bxy C cy 2 with jbj jaj jcj.
Indeed, we take a quadratic form ax 2 C bxy C cy 2 so that jaj, the absolute value
of the coefficient of x 2 , is the smallest among those equivalent to the original
one. Then by the transformation .x; y/ ! .y; x/, the coefficient x 2 becomes c,
so by our choice of a, we have jaj jcj. For some integer m 2 Z, we have
jaj b 2ma jaj. So, changing the quadratic form to the equivalent one by the
transformation .x; y/ ! .x my; y/, we have a.x my/2 C b.x my/y C cy 2 D
ax 2 C .b 2am/xy C .c bm C am2 /y 2 , we may assume that jaj b jaj
from the first. So we can assume that jbj jaj jcj. (In particular, if DK < 0, we
may assume that the quadratic form is positive definite, so we have jbj a c.)
On the other hand, we have b 2 4ac D f 2 DK by definition, so we have
Hence the number of b is also finite by jbj jaj and c D .b 2 f 2 DK /=4a is also.
Hence the assertion is proved. t
u
Let DK be the fundamental discriminant of a quadratic field and put D D f 2 DK .
We denote by h.D/ the class number in the wide sense of the order Of of the
quadratic field K D Q C Q!. To describe the class number formula, we introduce
symbol. For an odd prime p and an integer a, the quadratic
the quadratic residue
residue symbol pa with respect to p is defined as follows.
⎧
⎪
⎪
⎨1 if a 62 pZ and there exists x 2 Z such that x 2 a mod p,
a
D 1 if a 62 pZ and there is no x 2 Z such that x 2 a mod p,
p ⎪
⎪
⎩0 if a 2 pZ:
The following three relations are known as the reciprocity law of the quadratic
residue symbols.
6.3 Class Number Formula of Quadratic Forms 89
Theorem 6.12. (1) The class number of the quadratic field K with discriminant
DK is given by
⎧ jDK j1
⎪
⎪
⎪ w X DK
⎪
⎨ DK u if DK < 0,
u
h.DK / D uD1
K 1
⎪
⎪
DX
DK
⎪
⎪
⎩ log."/ log.sin.u=DK // if DK > 0.
uD1
u
Here w is the half of the number of units of Omax and when DK > 0, we denote
by " the fundamental unit of K.
(2) The class number of the order Of of K with discriminant D D f 2 DK is
given by
h.DK /f Y 1 DK
h.D/ D 1 :
ŒOmax W Of p p
pjf
Here for any ring R, the notation R means the group of all units of R.
As we mentioned before, the class number in the narrow sense can be easily
obtained by this formula, and the class number of primitive quadratic forms also.
The proof of the class number formula will be given in Chap. 10 when K is an
imaginary quadratic field. When K is a real quadratic field, the proof will be omitted
from this book.
The class numbers of imaginary quadratic
fields have a relation to the Bernoulli
numbers. Indeed, if we put .u/ D DuK , then by Sect. 4.2, we have
jDK j1
1 X
B1; D .u/u; (6.4)
DK uD1
so if DK < 0, we see
p1
1X u
h.p/ D u: (6.6)
p uD1 p
By the above formula, we see that h.D/ D 1 for D D 3, 4, 7, 8,
11, 19, 43, 67 and 163. It is known that there are no other (fundamental
6.3 Class Number Formula of Quadratic Forms 91
discriminants of) imaginary quadratic fields such that h.DK / D 1. Even if we take
the non-maximal orders of imaginary quadratic fields into account, there are only
four more orders with class number one, namely D D 12, 16, 27 and 28. In
contrast to this, it is conjectured that there are infinitely many real quadratic fields
with class number one,2 but no proof is known.
Remark 6.13. For S1 and S2 2 L .D/, we can also define an equivalence class by
S2 D t AS1 A for some A 2 GL2 .Z/. If S1 and S2 are SL2 .Z/ equivalent in the sense
we defined before, then of course these are equivalent in the above sense. But the
equivalences S2 D t A1 S1 A1 and S2 D t A2 S1 A2 for some A1 and A2 2 GL2 .Z/
with det.A1 / D det.A2 / D 1 are different conditions in general. For example, for
two symmetric matrices S1 , S2 2 L .12/ given by
3 0 1 0
S1 D and S2 D ;
0 1 0 3
there do not exist A 2 GL2 .Z/ such that S2 D t AS1 A. This can be seen from the
fact that the diophantine equation a2 3b 2 D 1 does not have integer solutions.
On the other hand, each p correspondspto the symmetric
p matrix
p obtained
p from the
standard basis of 3Z C 3Z or Z C Z 3. Since 3.Z C Z 3/ D 3Z C 3Z, these
ideals are equivalent in the wide sense. In fact, we have
01 3 0 01 1 0
D ;
10 0 1 10 0 3
01
so we have S2 D det.A/t ASA for A D 2 GL2 .Z/. We will give one more
10
example. Take S3 , S4 2 L .23/ given by
3 1=2 3 1=2
S3 D and S4 D :
1=2 2 1=2 2
Then we have
1 0 1 0
S4 D S3
0 1 0 1
but there exists no A 2 GL2 .Z/ such that S4 D det.A/ t AS3 A. Indeed if there exists
such A, then since S3 and S4 are positive definite, we should have det.A/ > 0.
Then comparing diagonal components of both sides, we see that we should have
A D ˙12 . This is a contradiction since .1; 2/ components do not coincide for
A D ˙12 . We explain shortly the difference between SL2 .Z/ equivalence and the
equivalence in the above sense. For any S1 2 L .D/, define
2
In Section 304 of his book on number theory, Disquisitiones Arithmeticae [35], Gauss stated his
speculation on this and also his reasonable insight on the relation to the size of fundamental units.
92 6 Quadratic Forms and Ideal Theory of Quadratic Fields
1 0 1 0
S2 D S1 :
0 1 0 1
If S2 is SL2 .Z/ equivalent to S1 , then S1 (or the corresponding SL2 .Z/ equivalence
class, or ideal or class in the narrow sense) is called ambig. In this case, for any
S 2 L .D/ such that S D t AS1 A for some A 2 GL2 .Z/, we see that S
is SL2 .Z/ equivalent to S1 . So there is no difference between two equivalences.
In the case when S2 is not SL2 .Z/ equivalent to S1 , then the GL2 .Z/ equivalence
class which contains S1 contains two different SL2 .Z/ equivalent classes. To count
the equivalence classes of L .D/ up to the equivalence that S2 D t AS1 A for
A 2 GL2 .Z/, we must count ambig classes, which is a part of the genus theory
of Gauss. Since this is complicated in general, we do not explain this theory in this
book. (See [29]. See also Exercises 6.21 and 10.25.)
integer m, show that 1 and ! in (6.1) gives a basis
Exercise 6.14. For a square-free p
of the maximal order Omax of Q. m/.
Exercise 6.15. Show that the quadratic form Q.x; y/ D x 2 C bxy C cy 2 with
discriminant D is equivalent to the following quadratic form.
x 2 D4 y 2 if b is even,
x 2 C xy C 1D 2
4 y if b is odd.
p
Exercise 6.16. (1) Let a D Za C Z bC2 D be a proper primitive ideal of order Of
of discriminant D D f 2 DK , where DK is a discriminant of a quadratic field
K. We assume a > 0. Show that a is a principal ideal of Of (i.e. a D Of ˛
for some element ˛ 2 Of ) if and only if there exists integers x and y such that
ax 2 C bxy C cy 2 D ˙1.
(2) Show that the class of quadratic forms in the wide sense corresponding to a
principal ideal a contains the one given in Exercise 6.15.
(3) Notation being as above, give an example of a principal ideal a such that there
exists no x, y 2 Z with ax 2 C bxy C cy 2 D 1.
(4) Give an example of an equivalence class of quadratic forms Q.x; y/ in the
narrow sense corresponding to a principal ideal (in the wide sense) such that
Q.x; y/ is not SL2 .Z/ equivalent to any form given in Exercise 6.15.
Exercise 6.17. Fix a positive discriminant D D f 2 DK , where K is a real quadratic
field, and consider an integral quadratic form Q.x; y/ D Ax 2 C Bxy C Cy 2 whose
discriminant is D.
(1) Let jaj be minimum among coefficients of x 2 of quadratic forms which are
p Q2 .x; y/ D ax C
2
equivalent to Q. Show that there exists
p a quadratic form
bxy C y equivalent to Q such that D 2jaj < b < pD and jaj
2
p jcj.
(2) Taking Q2 as in (1), show that ac < 0 and 0
p p < b. (Use . D C b/.p D b/ D
4ac.) In particular, show that 0 < b < D, D b < 2jaj < D C b.
6.3 Class Number Formula of Quadratic Forms 93
Exercise 6.18. Show that for any integral quadratic form Q, there exists a quadratic
form SL2 .Z/ equivalent to Q such that the coefficient of x 2 is positive. (See the
proof of Theorem 6.7, for example.)
Exercise 6.19. (1) Show that the following pairs of quadratic forms in (i) and
(ii) are SL2 .Z/ equivalent respectively.
(i) ax 2 C bxy C ay 2 and ax 2 bxy C ay 2 :
(ii) ax 2 C axy C cy 2 and ax 2 axy C cy 2 :
(2) Assume that D D f 2 DK < 0 and a positive definite quadratic form
Q.x; y/ D ax 2 C bxy C cy 2 has discriminant D. As shown in the proof
of Proposition 6.11, by replacing SL2 .Z/ equivalence, we may assume that
jbj a c. Such quadratic forms are called reduced. Conversely, assume that
there are two SL2 .Z/ equivalent positive definite quadratic forms Qi .x; y/ D
ai x 2 C bi xy C ci y 2 with discriminant D with jbi j ai ci (1 i 2).
Show that .a1 ; b1 ; c1 / D .a2 ; b2 ; c2 / except for the following pairs.
(i) a1 D c1 and .a2 ; b2 ; c2 / D .a1 ; b1 ; a1 /.
(ii) a1 D b1 and .a2 ; b2 ; c2 / D .a1 ; a1 ; c1 /.
Hint: Write down the transformation explicitly as a2 D a1 ˛ 2 Cb1 ˛ Cc1 2 and
assuming that a2 a1 and using the condition of a1 , b1 , c1 , show that j˛ j 1.
Exercise 6.20. (1) Show that h.D/ D 1 for D D 3, 4, 7, 8, 11, 12,
16, 19, 27, 28, 43, 67 and 163.
(2) Show that h.20/ D 2 and give a complete set of representatives of the ideal
classes.
(3) Show that h.40/ D 2 and give a complete set of representatives of the ideal
classes.
a b=2
Exercise 6.21. Fix a discriminant D D f 2 DK and consider S D 2
b=2 c
L .D/. Show that the following conditions on S are equivalent (the class to which
such an S belongs is called an ambig class).
(1) There exists A 2 GL2 .Z/ with det.A/ D 1 such that t ASA D S .
(2) The matrix S is SL2 .Z/ equivalent to
a b=2
:
b=2 c
a0 b 0 =2
(3) There exists S 0 D 2 L .D/ which is SL2 .Z/ equivalent to S
b 0 =2 c 0
such that b 0 is divisible by a0 . (Here we may take b 0 D 0 or b 0 D a0 .)
10
Hint: If t ASA D S with det.A/ D 1, then show that T r.A/ D 0 and A2 D .
01
Then reduce the case when A is upper triangular.
Chapter 7
Congruence Between Bernoulli Numbers
and Class Numbers of Imaginary
Quadratic Fields
For a prime p satisfying p 3 mod 4, recall that h.p/ denotes the class number
of the imaginary quadratic field of discriminant p.
Theorem 7.1. Let p > 3 be a prime, p 3 mod 4. We have the congruence2
Remark 7.2. (1) By the theorem of Clausen and von Staudt (Theorem 3.1), the
denominator of B pC1 is prime to p.
2
(2) Using Euler’s formula .1 k/ D Bk =k (Theorem 5.4 (2), p. 72), we have
1
Adolf Hurwitz (born on March 26, 1859 in Hildesheim, Germany—died on November 18, 1919
in Zürich, Switzerland).
2
This is due to Augustin Louis Cauchy (born on August 21, 1789 in Paris, France—died on May 23,
1857 in Sceaux, France). He expressed the left-hand side by a difference of numbers of quadratic
residues and non-residues in the interval .0; p=2/. That this is equal to the class number is nothing
but the class number formula of Dirichlet.
B pC1
1p
D pC1
2
2B pC1 mod p
2 2
2
The class number formula (6.6) at the end of the last chapter gives
ˇp1 ˇ
ˇX u ˇ 1X
p1
p1
1 ˇ ˇ
1 h.p/ D ˇ uˇ uD :
p ˇ p ˇ p 2
uD1 uD1
This shows that the class number h.p/ is uniquely determined by its value
mod p. Therefore, the above congruence asserts that the value ..1 p/=2/ of
the Riemann zeta function “knows completely” the class number h.p/.
(3) For a more advanced interpretation of this theorem using p-adic modular forms,
see [82].
(4) An analogous result in the real quadratic case is known as the following
Ankeny3–Artin4 –Chowla5 congruence [4]: Let p be a prime number congruent
p
to 1 modulo 4, and " D .t C u p/=2 be the fundamental unit. Then, we have
u
h.p/ B p1 mod p:
t 2
Example 7.3. Let us compute several class numbers by using the theorem.
(1) For p D 7,
1 1
2B 7C1 D 2B4 D 2 D 1 mod 7;
2 30 15
so h.7/ D 1:
(2) For p D 11,
1 1 1
2B 11C1 D 2B6 D 2 D D 1 mod 11;
2 42 21 1 22
so h.11/ D 1:
3
Nesmith Cornett Ankeny (born in 1927 in Walla Walla, USA—died on August 4, 1993 in Seattle,
USA).
4
Emil Artin (born on March 3, 1898 in Vienna, Austria—died on December 20, 1962 in Hamburg,
Germany).
5
Sarvadaman D. S. Chowla (born on October 22, 1907 in London, England—died on December
10, 1995 in Laramie, USA).
7.2 “Hurwitz-integral” Series 97
Definition
P1 7.4. A formal power series with rational number coefficients of the form
n
c
nD0 n t =nŠ such that all cn are integers is said to be a Hurwitz-integral series.
The set of all Hurwitz-integral series is denoted by H.
Note that P .t/ 2 H if and only if P .0/; P 0 .0/; P 00 .0/; P .3/ .0/; : : : are all in Z.
Typical examples of elements of H are e t and log.1 C t/.
Proposition 7.5. (1) H is an integral domain. Z t
d
(2) H is closed under term-by-term differentiation and integration .
dt 0
(3) The set of invertible elements of H is H D fP .t/ 2 HjP .0/ D ˙1g:
Proof. (1) We only need to show that the set H is closed under P multiplication,
1
other properties
P1 being trivial to verify. For two series P .t/
P D n
nD0 cn t =nŠ,
1
Q.t/ D n
nD0 dn t =nŠ, write the product P .t/Q.t/ as
n
nD0 en t =nŠ. The
numbers en are given by
!
X n
n
en D ci dni
i D0
i
1
X X 1
0 t n1 tn
P .t/ D cn D cnC1 ;
nD1
.n 1/Š nD0
nŠ
Z t 1
X X1
t nC1 tn
P .t/dt D cn D cn1 :
0 nD0
.n C 1/Š nD1
nŠ
P
(3) In order that P .t/ D 1 nD0 cn t =nŠ is invertible in QŒŒt, the condition c0 ¤ 0
n
and so, if c0 D P .0/ D ˙1, then the value of this at t D 0 is an integer since
P .0/; P 0 .0/; P 00 .0/; 2 Z. We therefore have P .t/ 2 H . Conversely, if
P .t/ 2 H , then from P .t/1 jt D0 D c01 2 Z we have c0 D ˙1.
t
u
Remark 7.6. The series sin.t/; cos.t/ (Taylor expansions of sin.x/; cos.x/ at the
origin, viewed as formal power series in x D t) are both in H, and cos.t/ D
1 t 2 =2Š C t 4 =4Š C is invertible by proposition (3) above. Hence the series
tan.t/ D sin.t/= cos.t/ is Hurwitz-integral. This shows that the tangent numbers Tn
in Remark 1.18 (p. 24) are integers.
Proposition 7.7. Suppose P .t/ 2 H; P .0/ D 0. Then for any N 2 N we have
P .t/N
2 H:
NŠ
Proof. We proceed by induction on N . The case N D 1 is trivial. Assume the
0
proposition is valid up to N 1. Using P .t/N =N Š D P .t/N 1 P 0 .t/=.N 1/Š
and P .0/ D 0, we have
Z
P .t/N t
P .t/N 1 0
D P .t/dt:
NŠ 0 .N 1/Š
The induction hypothesis and Proposition 7.5 (1), (2) show the right-hand side
belongs to H. t
u
Definition 7.8. For series P .t/.¤ 0/; Q.t/ 2 H, we write P .t/jH Q.t/ or
Q.t/ H 0 mod P .t/ if the condition Q.t/=P .t/ 2 H holds. Also we write
Q1 .t/ H Q2 .t/ mod P .t/ if P .t/jH .Q1 .t/ Q2 .t//.
In particular, when m is an integer (this is also an element in H), the congruence
P
Q1 .t/ H Q2 .t/ mod m means, if we write Q1 .t/ D 1
.1/ n
nD0 cn t =nŠ, Q2 .t/ D
P1 .2/ n .1/ .2/
nD0 cn t =nŠ, that the congruence cn cn mod m holds for every n.
7.3 Proof of Theorem 7.1 99
Put .p 1/=2 D m. We have m > 1 because of our assumption p > 3. In the class
number formula ((6.6) on p. 90)
p1
1X u
h.p/ D u;
p uD1 p
the sum into two parts according to u < p=2 and u > p=2, and note
divide
1
p D 1 because p 3 mod 4. Then we obtain
X u X p u
ph.p/ D u .p u/
p p
0<u<p=2 0<u<p=2
X u X u
D 2 uCp : (7.1)
p p
0<u<p=2 0<u<p=2
On the other hand, if we divide the sum according to the parity of u, then we have
X 2u X p 2u
ph.p/ D .2u/ .p 2u/
p p
0<u<p=2 0<u<p=2
X 2u X 2u
D 4 uCp :
p p
0<u<p=2 0<u<p=2
and hence
X m
1 a
h.p/ D :
2 2
aD1
p
p
Applying Euler’s criterion a
p am mod p (cf. [50, Proposition 5.1.2]) here, we
have
1 X m
m
h.p/ a mod p:
2 2m aD1
100 7 Congruence Between Bernoulli Numbers and Class Numbers of Quadratic Fields
Hence, the proof of the theorem h.p/ 2BmC1 mod p boils down to showing
X
m
am 2.2 2m /BmC1 mod p:
aD1
Further, by 2 mC1
1
mod p and .2 2m / .22mC1 2m / 2m .1 2mC1 /
mod p, this reduces to showing
X
m
BmC1
am 2m .1 2mC1 / mod p:
aD1
mC1
P1 n
First, by t
et 1 D1 t
2 C nD2 Bn tnŠ , we have
1 1 2
D t 2t
et C1 e 1 e 1
1
! 1
!!
1 t X tn 2t X .2t/n
D 1 C Bn 1 C Bn
t 2 nD2 nŠ 2 nD2
nŠ
1
1 X t n1
D C .1 2n /Bn
2 nD2 nŠ
1
1 X BnC1 t n
D C .1 2nC1 / :
2 nD1 n C 1 nŠ
Now since
2 3
2 1 1 e 2t 1 e 2t 1 e 2t
D D1C C C C
e C1
2t
1 1e2
2t
2 2 2
P
and D 1
1e2t
2 nD1 2
n1 t n
nŠ
2 H, we have e2t2C1 2 H. Also, by Proposition 7.7,
2t N 2t N
we have 1e2 H 0 mod N Š. In particular, if N 2, we have 1e2 H
tn 1e2t
0 mod 2. By this, combined with the fact that the coefficient of nŠ in 2 is
divisible by 2 if n 2, we have
2
H 1 C t mod 2: (7.4)
e 2t C1
7.3 Proof of Theorem 7.1 101
BnC1
2n .1 2nC1 / 2Z
nC1
if n 2. Next observe
X
m X
m
4at X
2mC1
.1 C e 2t / e 4at D e C e .4aC2/t D e 2jt
aD0 aD0 j D0
p 1
X X .2jt/n 1 X
X p
tn
D D .2j / n
j D0 nD0
nŠ nD0 j D0
nŠ
1
X X
p
tn
DpC1C 2 n
j n
nD1 j D1
nŠ
X t n
H 1 2n mod p:
n1
nŠ
p1jn
Pp
(Here we have used (3.1) on p. 42, i.e., j D1 j n 1 mod p if p 1jn and
Pp
j D1 j 0 mod p otherwise.) Multiplying both sides by e2t C1 .2 H/, we have
n 2
X
m X tn
2
2 e 4at
H 1 2n mod p:
aD0
e 2t C 1 n1
nŠ
p1jn
tm p1
Comparing the coefficients of mŠ on both sides, we obtain by (7.3) (note m D 2 <
p 1)
X
m
BmC1
2 .4a/m 2mC1 .1 2mC1 / mod p:
aD0
mC1
X
m
BmC1
am 2m .1 2mC1 / mod p:
aD0
mC1
Exercise 7.10. Let p be a prime congruent to 3 modulo 4. Prove that the numerator
of B pC1 can never be divisible by p.
2
Prove that all En are integers. Moreover, prove that E2n is odd for all n 1. (E2nC1
is easily seen to be 0.) Hint: Use (7.4).
Chapter 8
Character Sums and Bernoulli Numbers
We would like to explain arithmetic identities between Bernoulli numbers and the
root of unity. Namely we shall explain certain small collections of formulas between
exponential sums or character sums and Bernoulli numbers. We often encounter
such formulas when we compare the dimension formulas of modular forms obtained
by the Riemann–Roch theorem and by the trace formula. Often, the exponential
sums appear in the first method and the Bernoulli numbers appear in the second
method. But sometimes it is not easy to prove these relations in an elementary way,
and there are several cases that no elementary proofs are known. It seems that there
is no general elementary way to study these relations. Our elementary method in
this section is very restrictive and clearly not enough to attack difficult problems,
but still, it should give us some general feeling of what are easy exponential sums.
Before going to easy things treated in this section, we give here one example for
which no elementary proof is known.
Let p be a prime such that p 3 mod 4. We put D e 2 i=p . Then the following
formula holds.
X
.abc/ p pC1 1
D p B1; C B3; :
.1 a /.1 b /.1 c / 4 6
.a;b;c/2S
Here we put S D .a; b; c/ 2 .F p / I ab C bc C ca D 0 ; Fp is the prime field
3
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 103
in Mathematics, DOI 10.1007/978-4-431-54919-2__8, © Springer Japan 2014
104 8 Character Sums and Bernoulli Numbers
The results we shall give below are much easier compared with this example.
But we see that even researchers sometimes fail to recognize that these are easy,
so we think it is of some use to explain such elementary formulas here. To fix our
standpoint, we assume here that we are satisfied if an exponential sum or a character
sum can be written by the generalized Bernoulli numbers.
We start from simple examples which can be treated easily.PFor a prime p, we put
p D e 2 i=p . As one of our examples, we consider the sum aD1 .1 pa /1 . In fact
p1
X
p1
1 p1
D : (8.1)
aD1
1 pa 2
Xp 1 Y
p1
D X p1 C X p2 C C 1 D .X pa /:
X 1 aD1
Take the logarithm of both sides and compare their derivatives with respect to X .
Then the left-hand side is
Pp1
d aX a
log.X p1 C X p2 C C 1/ D p1aD1 ;
dX X CC1
and the right-hand side is
d X X
p1 p1
1
log.X pa / D :
dX aD1 aD1
X pa
Pp1
Here if we put X D 1, then we get the relation to be proved since p 1 aD1 aD
.p 1/=2. t
u
8.1 Simplest Examples 105
Proof 2. The next proof might appear in an exercise of the elementary algebra.
Here we use a little tool of algebra (Galois theory of cyclotomic fields). We never
use this argument later, so if the readers are not familiar with this tool, they are
recommended to skip this and to go directly to Proof 3. We call the field generated by
the nth roots of unity over the rational number field a cyclotomic field. Gauss studied
cyclotomic fields deeply [35, Chapter 7]. The cyclotomic field Q.p / is a Galois
extension of Q, and the set of the mappings a : p 7! pa for integers a with 1
a p1 gives the Galois group of Q.p / over Q. So the sum which appears in (8.1)
Pp1
is equal to the trace of x D 1=.1p / 2 Q.p / over Q, that is, the sum aD1 a .x/,
where a .x/ is the action of the element a in the Galois group on x. This trace is
invariant by each element a of the Galois group, so by the general Galois theory,
we see this is a rational number. The degree of extension of Q.p / over Q (namely
the dimension as a vector space over Q) is p 1, but since we have p D 1 1=x, x
also generates Q.p /, so the minimal polynomial of x over Q is also of degree p 1.
The minimal polynomial of p is 1 C X C X 2 C C X p1 D .X p 1/=.X 1/,
so the minimal polynomial of 1=x D p 1 is given by ..X C 1/p 1/=X . If
we put X D 1=Y , then this is written as Y 1p ..Y 1/p Y p /, so the minimal
polynomial of x whose coefficient of the highest degree is one is given by
1 p
.Y .Y 1/p /:
p
1 p.p 1/ p1
D :
p 2 2
1
:
1 p t
1 1 p1 Pp1
1 X XX c c
cD0 p t
D pi t i D pc t cCpj D :
1 p t i D0 j D0 cD0
1 tp
106 8 Character Sums and Bernoulli Numbers
If jtj < 1, this calculation is valid also as a convergent power series. If we take here
t ! 1, then by the definition of differentiation (or the theorem of l’Hôpital1 ), we
easily see that
1X c
p1
1
D c :
1 p p cD1 p
p
In the above calculation, we used only the properties that p D 1 and p ¤ 1, so
for any a prime to p, we can apply the same calculation to pa instead of p , so we
get also
1 X ac
p1
1
D c :
1 pa p cD1 p
X
p1
X
p1
1 if c 6 0 mod p,
pca D pca 1 D
p 1 if c 0 mod p,
aD1 aD0
so we get
X
p1
1 1X
p1
1 p.p 1/ p1
D .c/ D D :
aD1
1 pa p cD1 p 2 2
1
Guillaume François Antoine de l’Hôpital, (born in 1661 in Paris, France—died on February 2,
1704 in Paris, France).
8.2 Gaussian Sum 107
Before we start explaining the relation between exponential sums and the general-
ized Bernoulli numbers, we explain about Gaussian sums. The simplest definition
of a Gaussian sum is given by
X
p1
2 2 i
g.p/ D px p D e p
xD0
for an odd prime p. This number was calculated by Gauss and the result is very
beautiful. Namely we have
q
g.p/ D .1/.p1/=2 p :
p
Here the choice of the square root is given as follows. If p 1 mod 4, we take p
p
as a positive number and if p 3 mod 4 we take p so that its argument is =2.
As we shall see later, it is very easy to prove that g.p/2 D .1/.p1/=2 p, but it is
considerably difficult to determine the sign of the square root in g.p/. It is said that
Gauss spent 4 years getting the proof. 2
At the present time, there are many known ways to determine this sign, but it
would be beyond the scope or different from the aim of this book to carry those
proofs here. So we would like to ask readers to refer to some other books on the
determination of the sign. (For example, many proofs are introduced in [15].)
Now, this Gaussian sum can be also expressed in a slightly different way. Let
be the quadratic residue symbol modulo p defined on p. 88 (i.e. .n/ D pn )
and put
2
Gauss wrote in his letter (Collected Works X, p. 24) to Olbers dated September 3, 1805: “Since
four years ago, a week has seldom passed when I had not made one or another attempt to solve this
difficulty, especially lively now also again in the latest period. But all pondering, all search have
been in vain, and each time sadly I must have laid down my pen again. At last a couple of days
ago it succeeded, not for my wearisome search, but only through the grace of God, I would like
to say. As the lightning strikes, the problem has been solved; Strangely enough, the solution
of this problem appears now easier than many others which did not well keep me so many days as
these years, and certainly no one will, when I give a lecture on this matter some day, get suspicious
of the long dilemma in which it brought me.” Incidentally, Olbers (Heinrich Wilhelm Matthias
Olbers, 1758–1840) was a medical doctor and an astronomer. The biggest asteroid, Ceres, which
made Gauss leap to fame, was discovered by Italian astronomer Giuseppe Piazzi (1746–1826) on
New Year’s day in 1801, but became lost through the difficulty of observation, then was accurately
located by Gauss by calculation. It was rediscovered by Olbers on New Year’s day in 1802. Also,
he newly discovered the asteroid Pallas in 1802 and Vesta in 1807. Olbers was a friend of Gauss
whom he visited most often.
108 8 Character Sums and Bernoulli Numbers
X
p1
g. / D .n/pn :
nD0
X
p1
X
p1
X
p1
g.p/ D .1 C .n//pn D pn C .n/pn :
nD0 nD0 nD0
f 1
X
g./ D .n/fn
nD0
X
f
.a/g./ D .n/fan :
nD1
The second relation is obtained by putting a D 1 and taking the conjugate of both
sides in the first relation. Namely we have .1/g./ D g./ and since .1/2 D
.1/ D 1, we have .1/1 D .1/. t
u
Since the Gaussian sum is never 0 as we see below, the above formula can be
written also as
X
f
.a/ D g./1 .n/fan : (8.2)
nD1
g./g./ D f:
f 1
X a.nm/
.a/.a/g./g./ D .n/.m/f :
n;mD1
f 1
X a.nm/ 0 if n ¤ m,
f D
aD0 f if n D m,
f 1
X
f .n/.n/ D f '.f /:
nD1
Next we will give a formula for the Gaussian sum for a primitive Dirichlet
character with 2 D 1.
Proposition 8.4. Let be a non-trivial primitive Dirichlet character such that
2 D 1. Then there exists a quadratic field K such that .u/ D K .u/ D DuK ,
where DK is the fundamental discriminant of K. Furthermore, for such , we have
p
g.K / D DK :
By using the Chinese remainder theorem, we decompose the sum in the definition
of g.K / into powers of primes and we reduce the formula to those in the case of
primes. In the middle of the proof, we need the reciprocity law of quadratic residues.
We omit the details here. We refer to [15]. See also Exercises 8.26 and 8.27.
In this section, we consider sums similar to those in Sect. 8.1, attaching characters to
them. In fact, these new sums are far more interesting. In the previous sections, we
took only the prime power root of unity, but in this section we consider the f th root
f D e 2 i=f of unity. Since it is troublesome to write the suffix f always, we fix a
natural number f once and for all and we denote f simply by . We will explain
below how to obtain formulas for sums like those in the last section by generalized
Bernoulli numbers and simple combinatorial numbers.
First of all, when a is not divisible by f , we get the formula given below. (This
formula will be used also in Chap. 11, Sect. 11.2.)
Lemma 8.5. If f =j a, then we have
f 1
1 1 X ac
D c :
1 a f cD1
Proof. This is obtained by the same calculation as in Proof 3 in Sect. 8.1, but here
we give more direct proof. By expanding polynomials, we have
f 1 f 1
X X
.1 X / cX c D .f 1/X f C .c .c 1//X c
cD1 cD1
Xf
D f X f C Xc:
cD1
8.3 Exponential Sums and Generalized Bernoulli Numbers 111
If we substitute as X D a in the above, then the second term of the right-hand side
Pf
is zero since cD1 ac D a .1 af /=.1 a / D 0, and we have
f 1
X
.1 a / c ac D f af D f;
cD1
X
f
.a/
Pk ./ D
aD1
.1 a /k
and try to describe this. Of course if f D 1, then the denominator becomes zero so
this has no meaning. So we assume that f > 1 from now on. As in the last section,
we define the Gaussian sum with respect to by
X
f
g./ D .a/ a :
aD1
X
f
.a/
Pk; .t/ D :
aD1
.1 a e t /k
f 1
1 X X X
f f
1
P1; .0/ D .a/ c ac D g./ .c/c D g./B1; :
f aD1 cD1
f cD1
In the last equality, we used the formula (4.1) for B1; in Sect. 4.2 (p. 54). t
u
In fact, P1; .t/ is almost identical to the definition of the generalized Bernoulli
numbers. Indeed, calculating the development, we have
1 1 Pf 1
X
f
X X j D0 .j /e
jt
P1; .t/ D aj jt
.a/ e D g./ .j /e jt
D g./ :
aD1 j D0 j D0
ef t 1
112 8 Character Sums and Bernoulli Numbers
Hence we have
d l P1; .l/ 1
.0/ D P1; .0/ D g./ BlC1; : (8.4)
dt l l C1
Next we want to give a formula for Pk; .0/. Now we put
1
Qk;a .t/ D :
.1 a e t /k
Then we have
X
f
Pk; .t/ D .a/Qk;a .t/;
aD1
and
d
Qk;a .t/ D k.QkC1;a .t/ Qk;a .t//;
dt
so inductively Qk;a .t/ is expressed by higher-order derivatives of Q1;a .t/, so Pk; .t/
is also expressed by derivatives of higher order of P1; .t/. More explicitly, by
using the Stirling number of the first kind (as for the definition, see Sect. 2.1), it
is expressed as follows.
Proposition 8.6.
k1
X
1 k .l/
Qk;a .t/ D Q1;a .t/;
.k 1/Š l C1
lD0
k1
X
1 k .l/
Pk; .t/ D P1; .t/:
.k 1/Š l C1
lD0
Proof. This is proved by induction with respect to k. The assertion is true for
k D 1. We assume it is true up to k. Since QkC1;a .t/ D Qk;a .t/ C k1 dt d
Qk;a .t/,
we can express QkC1;a .t/ by the linear combination of derivatives of Qk;1 .t/ by the
.l/
inductive assumption, and the coefficient of Q1;a .t/ in QkC1;a .t/ in this expression
is given by
8.3 Exponential Sums and Generalized Bernoulli Numbers 113
1 k 1 k
C
.k 1/Š l C1 k l
for 0 l k. By the recurrence relation of the Stirling number of the first kind
(p. 28, Eq. (2.2)), this is equal to
1 kC1
:
kŠ l C 1
X k1
g./ X
f
.a/ k BlC1;
Pk; .0/ D D : (8.5)
aD1
.1 /
a k .k 1/Š l C1 l C1
lD0
Proof. In the second formula in the proposition, put t D 0 and then use (8.4). t
u
Example 8.8. For example, we have
f 1
X .a/
D g./B1; ;
aD1
.1 a /
f 1
X
.a/ 1
D g./ B1; C B2; ;
aD1
.1 a /2 2
f 1
X
.a/ 3 1
D g./ B1; C B2; C B3; :
aD1
.1 a /3 4 6
f 1 f 1
X X 1 f 1
Q1;a .t/ D D
aD1 aD1
1 a et 1 ef t 1 et
f ef t et
D f ft 1 t
e 1 e 1
1
X .1 f n /t n1
D f 1C Bn :
nD1
nŠ
114 8 Character Sums and Bernoulli Numbers
(where ıl0 is Kronecker’s delta, that is, 1 for l D 0 and 0 otherwise), we get the
following formula.
Lemma 8.9.
f 1 f 1
X 1 X
D Qk;a .0/
aD1
.1 /
a k
aD1
k1
X
1 k 1
D .1 f lC1
/BlC1 C .f 1/ıl0 :
.k 1/Š l C1 l C1
lD0
f 1
X 1 f 1
D ;
aD1
1 a 2
f 1
X 1 .f 1/.f 5/
D ;
aD1
.1 /
a 2 12
f 1
X 1 .f 1/.f 3/
D ;
aD1
.1 /
a 3 8
f 1
X 1 .f 1/.f 3 C f 2 109f C 251/
D :
aD1
.1 a /4 720
Using similar methods, we will give formulas for the following sums:
f 1
X
c k ca :
cD0
8.3 Exponential Sums and Generalized Bernoulli Numbers 115
When f divides a, then we have ac D 1, so this should give the formula to describe
sums of powers. Also under the assumption that f does not divide a, we give
formulas to express these sums by Qk;a .0/. This is a generalization of the formula to
express the right-hand side of Lemma 8.5 by the left-hand side. (These formulas will
be applied to calculations
of exponential sums later.) As usual, for natural numbers
k, j , we denote by jk the binomial coefficient
!
k k.k 1/ .k j C 1/
D :
j jŠ
k
Here for j D 0, we put 0 D 1.
Lemma 8.11. (1) When a is divisible by f , we have
!
X
f
X
f
X k
k f kj C1
k ac
c D c D k
Bj :
cD0 cD0 j D0
j k j C 1
To show the last equality, we used the Leibniz rule3 on the higher-order derivatives
of products.
3
Gottfried Wilhelm Freiherr von Leibniz (born on July 1, 1646 in Leipzig, Saxony (now
Germany)—died on November 14, 1716 in Hannover, Hanover (now Germany)).
116 8 Character Sums and Bernoulli Numbers
X1
1 et Bn n1
Q1;a .t/ D Q1;0 .t/ D D 1 D 1 t :
e 1
t e 1
t
nD0
nŠ
This has a pole at t D 0. So comparing the constant terms of both sides of (8.7) as
a power series expansion with respect to t and noting that
.0/ 1 B2
Q1;a .t/ D C .1 B1 / t ;
t 2
.j / .1/j j Š Bj C1 Bj C2
Q1;a .t/ D t .j 1/;
t j C1 j C 1 .j C 2/
1
X f nt n
ef t D ;
nD0
nŠ
we get
f 1
! !
X BkC1 X k kj Bj C1 X k kj f j C1
k k
c D
k
C f C f .1/j j Šf k
cD0
k C 1 j D0
j j C 1 j D0
j .j C1/Š
!
X
k1
k Bj C1 f kC1
D f kj C fk
j D0
j j C 1 k C 1
!
X
k1
k Bj C1 f kC1
D f kj C fk
j D0
j C1 kj kC1
!
X k
k Bj f kC1
D f kj C1 C f k:
j D1
j k j C 1 k C 1
Pk 1
j k
(In the second equality, we used the fact j D0 .1/ j j C1 D 1
kC1 .) So we get
!
X
f
X k
k f kj C1
c D k
Bj :
cD0 j D0
j kj C1
In this way, we obtained the formula for the sum of powers again.
.n/
Next, we assume that a is not divisible by f . Here we want to express Q1;a .t/
by Qk;a .t/. This is a converse of the formula in Proposition 8.6 and this time, it is
described by using the Stirling number of the second kind. (As for definition, see
Sect. 2.1.)
8.3 Exponential Sums and Generalized Bernoulli Numbers 117
X
kC1
kC1 l
ımC1;kC1 .D ım;k / D .1/kC1 .1/l :
l mC1
lDmC1
.k/
X
k
.m/
Q1;a .t/ D ım;k Q1;a .t/
mD0
X
k X
kC1
kC1 l .m/
D .1/ kC1
.1/l
Q1;a .t/
mD0
l mC1
lDmC1
X
kC1
X
l1
kClC1 k C 1 l .m/
D .1/ Q1;a .t/
l mD0
m C 1
lD1
X
kC1
kC1
D .1/kClC1 .l 1/Š Ql;a .t/:
l
lD1
f 1
X X
kC1
kC1
c k ac D .1/kCmC1 .m 1/Š Qm;a .0/
cD0 mD1
m
! kj C1
X k
k X kj C1
C fj .1/kj Cm.m 1/Š Qm;a .0/
j D0
j mD1
m
! kj C1
X k
k X kj C1
D fj .1/kj Cm .m 1/Š Qm;a .0/:
j D1
j mD1
m
0 !
1
X k X
kmC1
k k j C 1
D .1/m .m 1/ŠQm;a .0/ @ .1/kj f j A:
mD1 j D1
j m
t
u
By the above calculation, we get the following examples.
118 8 Character Sums and Bernoulli Numbers
Example 8.12.
f 1
X f
c ac D ;
cD0
1 a
f 1
X f 2 C 2f 2f
c 2 ac D ;
cD0
1 a .1 a /2
f 1
X f 3 C 3f 2 3f 3f 2 C 9f 6f
c 3 ac D C C :
cD0
1 a .1 /
a 2 .1 a /3
In this section, we treat several sporadic examples. Sometimes we can use the
similar method for sums on character values as for sums on roots of unity. We
shall see this. Let be a primitive Dirichlet character modulo f . We consider the
following sum.
f 1
X
Sk ./ D .a1 C C ak /a1 ak :
a1 ;:::;ak D0
Here we should note that the sum depends on the range of natural numbers ai
(namely, if we change them to other representatives modf , then the value of the
sum would change). Even for k D 2, it is not an efficient method to calculate this
kind of sum by dividing the sum into various pieces and to seek the answers by
case-by-case analysis. First of all, let us recall the following relation (cf. (8.2)).
X
f
.a/ D g./1 .n/ an :
nD1
f 1
X X
f
Sk ./ D g./1 .n/ n.a1 CCak / a1 ak : (8.8)
a1 ;:::;ak D0 nD1
we obtain
X
f
.n/
Sk ./ D .f /k g./1
nD1
.1 n /k
f 1
X
S.; e1 ; : : : ; ek / D .a1 C C ak /a1e1 akek :
a1 ;:::;ak D1
The formula to describe this by Bernoulli numbers can be obtained by almost the
same method as before. Indeed, as before, writing the character values by Gaussian
sum and roots of unity, we have
0 1
f 1
X
f
Y
k X
S.; e1 ; : : : ; ek / D g./1 .n/ @ c ej cn A
nD1 j D1 cD1
X
f
X
Dg./1 .n/ .1/e1 CCek l1 lk Cm1 CCmk f l1 CClk
nD1 1lj ej
1mj ej lj C1
1j k
! !
e1 ek
Qm1 CCmk ;n .0/.m1 1/Š .mk 1/Š
l1 lk
e1 l1 C 1 ek lk C 1
m1 mk
! !
X
e1 CCek l1 lk l1 CClk e1 ek
D .1/ f
1l e
l1 lk
j j
1mj ej lj C1
1j k
120 8 Character Sums and Bernoulli Numbers
.1/m1 CCmk
.m1 1/Š .mk 1/Š
.m1 C C mk 1/Š
e1 l1 C 1 ek lk C 1
m1 mk
!
X k 1 1 m1 C C mk
m1 CCm
BlC1; :
l C1 l C1
lD0
e1 l
X 1 C1 ek l
X k C1 Y k
.1/m1 CCmk ej lj C 1
.mj 1/Š
m1 D1 mk D1
.m1 C C mk 1/Š j D1 mj
m1 CCm !
X k 1 1 m1 C C mk
BlC1; :
l C1 l C1
lD0
‚ …„ ƒ
k
.1/kC1 f k X 1
k1
k
Sk ./ D S.; 1; : : : ; 1/ D BlC1; ;
.k 1/Š l C1 l C1
lD0
f 1
!
X X
e1
f el e
S.; e/ D .n/ne D BlC1; ;
nD0
l C1 l
lD0
f 1
X
.f C 1/ 1
S.; 2; 1/ D .m C n/m2 n D f 2 f B1; C B2; C B3; :
m;nD0
2 3
X
f
Dm ./ D .a C m/a:
aD1
Then we have
X
f
N 1
Dm ./ D g./ N
a.n/ .aCm/n
a;nD1
8.5 Sporadic Examples: Using Functions 121
0 1
X
f
f
N 1 @
D g./ N
.n/ mn
Cf A;
.1 n /
nD1;.n;f /D1
X
f
X
m1 X
f
X
m1
N
.n/.1 mn /.1 n /1 D N
.n/ nl
D g./
N .l/;
nD1;.n;f /D1 lD0 nD1;.n;f /D1 lD1
Pf
taking the relation N
nD1 .n/.1 n /1 D g./B
N 1; in the last section into
account, we get
!
X
m
Dm ./ D f .l/ C B1; :
lD1
Lemma 8.15. Let f > 1 be an odd number and let be a primitive character with
conductor f . Then we have
f 1
X .a/
aD1
.1 a /.1 2a /2
1 1 1
D g./ B3;N C .1 C .2//B2;N C .3 C 5.2//B1;N :
24 4 8
Proof. If we put
X
f
.a/.1 C a e t /
Qk; .t/ D ;
aD1
.1 2a e 2t /k
X
f
.a/
Pk; .t/ D ;
aD1
.1 a e t /k
then we get
1
X t n1
Q1; .t/ D P1; .t/ D g./ Bn;N :
nD1
nŠ
122 8 Character Sums and Bernoulli Numbers
Since
d
Qk; .t/ D 2kQkC1; .t/ C .1 2k/Qk; .t/ .2/Pk; .2t/:
dt
we can write Qk; .0/ inductively by Pk; .0/, and then by Bernoulli numbers. For
example, if we put k D 1 in the above relation and take the derivatives, we get
0
Q1; .t/ D 2Q2; .t/ Q1; .t/ .2/P1; .2t/;
0 1 00 0 0
Q2; .t/ D .Q .t/ C Q1; .t// C .2/P1; .2t/;
2 1;
and
g./1
Q2; .0/ D B2; C .1 C .2//B1; ;
2 2
0 g./ 1 1
Q2; .0/ D B3; C C .2/ B2; :
2 3 2
and
1 0
Q3; .0/ D .Q .0/ C 3Q2; .0/ C .2/P2; .0//:
4 2;
Since the left-hand side of the lemma is equal to Q3; .0/, substituting this for the
above value and using the relation P2; .0/ D g./.B1; C B2; =2/, we get the
right-hand side. t
u
X
p1
I D .a/a:
aD1
X
p1
X
p1
I D .p a/.p a/ D .a/.p a/:
aD1 aD1
X
p1
X
p1
2I D .a/.a C p a/ D p .a/ D 0:
aD1 aD1
X
p1
I D .l/lmn:
l;m;nD1;4lnm2 mod p
X
p1
2I D .l/ln.m C p m/
l;m;nD1;4lnm2 mod p
X
p1
Dp .l/l. .ln/ C 1/n
l;nD1
X
p1
Dp .l .n/n C .l/ln/
l;nD1
D p 3 .p 1/B1; :
124 8 Character Sums and Bernoulli Numbers
So we have
p 3 .p 1/
I D B1; :
2
Example 8.18. The next sum was first considered in connection
with dimension
formulas of automorphic forms [92]. We put .n/ D pn , where p is a prime and
X .s 2 rt/
I D :
. rCs 1/. sCt 1/. s 1/
.s;t;r/2S
At first glance, it might seem difficult to calculate this sum in an elementary way,
but actually this can be easily calculated as follows. If we put a D r C s, b D s C t,
c D s, then s 2 rt D c 2 .a C c/.b C c/ D ab bc ca. Hence if we put
T D F3 p , then substitute a, b, c for a, b, c, we get
X .ab bc ca/
I D
. a 1/. b 1/. c 1/
.a;b;c/2T
X .ab bc ca/
D
. a 1/. b 1/. c 1/
.a;b;c/2T
X .ab bc ca/ a b c
D
.1 a /.1 b /.1 c /
.a;b;c/2T
So by expanding the numerator and transposing the same term to the left-hand
side, we get
X
2I D .ab bc ca/
.a;b;c/2T
1 1 1
C C
.1 a /.1 b / .1 b /.1 c / .1 c /.1 a /
1 1 1
C 1 :
1 a 1 b 1 c
8.6 Sporadic Examples: Using the Symmetry 125
X
p1
X
.ab bc ca/ D .p 1/ .ab/ D p:
b;cD1 b¤a; b¤0
We have also
X .ab bc ca/ p.p 1/
D :
1 a 2
.a;b;c/2T
X
.ab bc ca/ D p.p 1/:
.a;b;c/2T
So as a total, we have
X
I D .abc/abc:
.a;b;c/2S
The condition in the sum I is complicated and it is not easy to calculate I . But if
we assume that .1/ D 1, then I can be calculated as follows. By the relation
X
I D .abc/.p a/.p b/.p c/
.a;b;c/2S
we get
X
2I D .abc/..p a/.p b/.p c/ C abc/
.a;b;c/2S
X
D .abc/.p 3 p 2 .a C b C c/ C p.ab C bc C ca//:
.a;b;c/2S
But we have .abc/ D .c 2 .a C b// D .a C b/ and since the condition of the
definition of S is symmetric with respect to a, b, c, we have
X X
.abc/.ab C bc C ca/ D 3 .a C b/ab
.a;b;c/2S .a;b;c/2S
X
D3 .a C b/ab
a;b;aCb60 mod p
X
p1
1
D3 .a C b/ab D 3p 2 B1; C B2; :
2
a;bD1
X
p1
D 3 .a/a D 3pB1;
aD1
X X
p1
X
p1
.a C b/ D .a C b/ D . .a// D 0:
a;b;c2S a;bD1 aD1
8.7 Sporadic Example: Symmetrize Asymmetry 127
So we have
3p 3
I D B2; :
4
Remark 8.20. When .1/ D 1, a simple formula to write down I by general-
ized Bernoulli numbers is known only conjecturally.
Let .x/ be the quadratic residue character modulo p (p is a prime) and consider
the following special sum.
X
p1
I D .m 4n/ .m/mn:
m;nD1
A formula for this sum was first given in [91] and [39] by comparing the trace
formula and algebraic geometry, but this can be calculated in an elementary way as
well. We illustrate this calculation below.
In the above definition, if we take n instead of m 4n, then the sum is very
simple and given by
!2
X
p1
X
p1
.mn/mn D .m/m D p 2 B1;2 :
m;nD1 mD1
But the sum I we actually defined above is subtly breaking its symmetry and that
obstructs an easy calculation. We get over this by changing it to an exponential sum
and transforming it to a symmetric one. By the relation
X
p1
X
p1
1 1
.m/ D g. / .a/ am
.m 4n/ D g. / .b/ b.m4n/ ;
aD1 bD1
we have
X
p1
X
p1
I D g. /2 .ab/ mn amCb.m4n/ :
a;bD1 m;nD1
Pp1 p
But for c 62 pZ, we have mD1 m
cm
D 1 c , hence we have
128 8 Character Sums and Bernoulli Numbers
0 1
B X p1
.ab/ X p1
C
I D g. /2 @p 2 C .1/ mn 4nb A
a;bD1
.1 4b /.1 aCb /
m;n;bD1
a6b mod p
X
p1
.ab/ p.p 1/2
D p .1/ ;
a;bD1
.1 4b /.1 aCb / 4
aCb¤p
X
p1
.ab/
J D .1/ :
a;bD1
.1 4b /.1 aCb /
aCb¤p
Now a direct calculation of J seems difficult partly because the sum in the definition
of J is not symmetric with respect to a and b. So we try to rewrite this in a
symmetric way. If we exchange a and b, this is just a change of notation, so we
have
X
p1
.ab/
J D .1/ :
a;bD1
.1 4a /.1
bCa /
aCb¤p
Now we will add this to the original expression and take the average of both. To
calculate this, we use the following equality.
X
p1
.ab/
J1 D ;
a;bD1
1 aCb
aCb¤p
X
p1
.ab/.1 4aC4b /
J2 D ;
a;bD1
.1 4a /.1 4b /.1 aCb /
aCb¤p
8.7 Sporadic Example: Symmetrize Asymmetry 129
then we have
.1/
J D .J1 J2 /:
2
Here we have
1X X
p1 p1
J1 D .ab/k k.aCb/
p a;bD1
kD1
aCb¤p
1 p.p 1/ .1/.p 1/
D .g. /2 .1/.p 1// D :
p 2 2
Also we have
X
p1
1 4aC4b
J2 D .ab/
a;bD1
.1 4a /.1 4b /.1 aCb /
aCb¤p
p1 P3
X .ab/ l.aCb/ X
p1
4 .1/
D lD0
4a /.1 4a /
:
.1 4a /.1 4b / aD1
.1
a;bD1
p1 .p 1/.p 5/ p2 1
D C D :
2 12 12
Furthermore we have
X
p1
.a/ la 1 X
p1
1 X
p1
D .a/c 4acCal
D g. / .4c C l/c:
aD1
1 4a p a;cD1 p cD1
Pp1
Hence if we put S.l/ D cD1 .4c C l/c, then we have
.1/ X
3
.1/.p 2 1/
J2 D S.l/2 :
p 3
lD0
1 X 1 X
3 3
p1 p2 1 .p 1/.2p 1/
J D C S.l/2 D S.l/2 :
4 6 2p 12 2p
lD0 lD0
130 8 Character Sums and Bernoulli Numbers
There are many ways to evaluate this, but here we calculate it just directly. (As for
more general method of calculation, see [45].) First we have
X
p1
. .2c/c C .2c C 1/c/
cD1
!
1 X X
p1 p1
D . .2c/ .2c/ C .2c C 1/.2c C 1// .2c C 1/
2 cD0 cD0
!
1 X
2p1
D .c/c
2 cD1
1X X
p1 p1
D .c/.c C c C p/ D .c/c:
2 cD1 cD1
Hence we have
X
p1
X
p1
.2c C 1/c D .1 .2// .c/c:
cD1 cD1
X
p1
1 X
4p1
. .4c/c C .4c C 1/c C .4c C 2/c C .4c C 3/c/ D .c/c
cD1
4 cD1
X
p1
D .c/c:
cD1
X
p1
X
p1
. .4c C 1/ C .4c C 3//c D .1 .2// .c/c:
cD1 cD1
X
p1
X
p1
X
p1
.4c C 3/c D .4c C 1/.p 1 c/ D .4c C 1/c:
cD0 cD0 cD0
X
p1
X
p1
1 .2/ X
p1
.4c C 1/c D .4c C 3/c D .c/c:
cD1 cD1
2 cD1
So we have
X !2
X
3
.1 .2//2
p1
S.l/ D 1 C .1 .2//2 C
2
.c/c
2 cD1
lD0
D .4 3 .2//p 2 B1;2 :
X
4p1
X
p1
4pB1; ı D .c/ı.c/c D . .4c C 1/.4c C 1/ .4c C 3/.4c C 3//
cD0 cD0
p2
S.1/2 C S.3/2 D .B1; ı /2 :
2
Also we have h.p/ D B1; ı . Summing up all the above results, we get
⎧
⎪
⎪7=2 if pD3 ;
p.p1/.p2/ ⎨ 2
I D p h.p/2 =4 if p 1 mod 4;
12 ⎪
⎪
⎩.4 3 .2//p 2 h.p/2 =2 if p 3 mod 4 and p ¤ 3:
By the way, as for further generalization of the results in this section, see [45].
132 8 Character Sums and Bernoulli Numbers
The example given below is not related to Bernoulli numbers so much, but since the
method is similar, we take this opportunity
to explain it. Here too, we assume that
p is an odd prime and .x/ D p is the quadratic residue character modulo p.
x
Pp1
Let P .x/ be a polynomial in a variable x. If we consider the sum xD0 .P .x//,
then this is related with the number of pairs .x; y/ 2 F2p of integral solutions of
y 2 D P .x/, so related with the congruence zeta function of algebraic curves and
a complicated quantity. There is no simple formula for general P .x/. But in the
special case where P .x/ is a quadratic polynomial, it corresponds to a conic. In this
case the structure of the curve itself is simple and there is a simple formula for the
sum as above. Let us practice the calculation. We write a quadratic polynomial of
x as P .x/PD ax 2 C bx C c (a, b, c 2 Z, a 6 0 mod p). It is not so difficult to
calculate x mod p .P .x//. Since we take p to be an odd prime, we consider the
finite field Fp and rewrite P .x/ D a..x d /2 C e/. Then we have
X X
.P .x// D .a/ .x 2 C e/:
x mod p x mod p
1 X
p1
.k/ k.x Ce/ ;
2
.x 2 C e/ D
g. /
kD0
X
p1
1 X
p1
X
p1
k.x Ce/ :
2
.x C e/ D
2
.k/
xD0
g. / xD0 kD0
Pp1 2
But we have xD0 kx D .k/g. /, so we get the following result.
1
X X
p1
1 if e 6 0 mod p,
2 ke
.k/ D ke
D
kD0 kD1 p1 if e 0 mod p.
t
u
8.9 A Sum with Quadratic Conditions 133
By the way, the result above has of course a relation to the number of pairs
.x; y/ 2 F2p such that x 2 Ce D y 2 . Namely, for each x 2 Fp , there are 1C .x 2 Ce/
number of y which satisfy this equation. So p 1 pairs of .x; y/ are solutions of
this equation if e 6 0 mod p.
In general, also for a quadratic form Q.x/ of n variables, that is, a homogeneous
polynomial of variables
P xi of x D .x1 ; x2 ; : : : ; xn / of degree two, we sometimes
consider the sum x mod p Q.x/ and the results are well known, but this is remote
from Bernoulli numbers and we omit it here.
Pp1 Pp1
Next we treat the sum xD0 .x 2 C e/x. (Note that the sum xD0 .P .x//x
for a general quadratic polynomial P .x/ does not reduce to this sum.) Here we have
X
p1
X
p1
X
p1
.x 2 C e/x D .p x/ ..p x/2 C e/ D .p x/ .x 2 C e/;
xD1 xD1 xD1
so we have
!
X
p1
X
p1
X
p1
2 .x C e/x D p
2
.x C e/ D p
2
.x C e/ .e/ :
2
Hence we have
X
p1
.1C 2.e//p if e 6 0 mod p;
.x 2 C e/x D p.p1/
xD0 2 if e 0 mod p:
X
p1
.1 C .e//p 2 .p 1/
.x 2 C ey 2 /xy D :
x;yD0
4
X .a/ p1
D .1 C .e//g. /B1; :
.1 a /.1 b /.1 c / 4
.a;b;c/2M.he /
.1 b /1 C .1 b /1 D 1;
X
p1
.a/
2I D
a;bD1
.1 a /.1 aN eb
N 2/
!
X
p1
.a/ 1 X
p1
D m aN eb
N 2m
.1 /
a p mD1
a;bD1
1 X
p1
.a/
D m. .aN em/g.
N / 1/
p a;mD1 .1 a /
p1 p1
D N
.e/g. /B1; g. /B1;
2 2
p1
D .1 C .e//g.
N /B1; :
2
t
u
As an application of this formula, we can show the following.
Corollary 8.23.
X p 3 .p 1/
.n/lmn D .1 C .e//B1; :
4
.l;m;n/2M.he /
Proof. For a while, for any rational number ˛ whose denominator is not divisible
by p, we understand that we define ˛ by regarding ˛ as an element of Fp . Then
we have
X
p1
X
p 3 I D lmn .a/ alCbmCcn
l;m;nD1 .a;b;c/2M.he /
X
p1
X
p1
ae m2 X
p1
n ae m 2
D lmn .a/ al 4n ae .bC 2n /
X
p1
lmn al .a/ aem
2 =4n 2 =4n
D . .n=.ae//g. / aem /
l;m;n;aD1
X
p1
lmn al . .ne/ aem
2 =4n
D g. / .a//
l;m;n;aD1
X
p1
X
D .g. / .ne/ .l/g. //lmn C pg. / .n/lmn:
l;m;nD1 4lnem2 mod p
Hence we have
0 1
g. / @ p 2 .p 1/2 X
I D .1 C .e//pB1; p .n/lmnA :
p3 4
.l;m;n/2M.h4=e /
Pp1 2
In the above calculation, we used xD0 cx D .c/g. /. Here in the final
expression of I and in the first term of the right-hand side above, the number e
appears only as .e/. Since we have .4=e/ D .e/, we can replace the condition
M.h4=e / of the sumPon the right-hand side by M.he /. By these considerations, we
get the formula for .l;m;n/2M.he / .n/lmn. t
u
Several such sums with quadratic conditions have something to do with special
values of zeta functions of prehomogeneous vector spaces. But there are a lot of
mysteries around this in general. It has not been clarified when such formulas for
exponential sums can exist or what essential reasons there are for that. For some
special quadratic conditions, we have several conjectures but proofs are not known.
It seems premature to treat this problem here, so we omit the concrete description of
conjectures or other things, but it would be a very interesting problem to study these
sums from the viewpoint of relations between exponential sums and special values
of zeta functions of prehomogeneous vector spaces.
Exercise 8.24. Give an example of non-trivial primitive Dirichlet character such
that 3 is trivial and for that , calculate the Gaussian sum g./ explicitly.
Exercise 8.25. (1) Assume that the cardinality of a finite group G is odd. Then
show that any character such that 2 D 1 (the identity character) is a trivial
character.
(2) Assume that G is a finite cyclic group of even order. Then show that the non-
trivial character of G such that 2 D 1 exists uniquely.
Exercise 8.26. (1) Let p be an odd prime. For any natural number r, it is known
that .Z=p r Z/ Š Z=p r1 Z Z=.p 1/Z. By using this, show that if a non-
trivial Dirichlet character modulo p r satisfies .a/2 D 1 for any a with
.a; p/ D 1, then the conductor of is p.
136 8 Character Sums and Bernoulli Numbers
(2) Let r be a natural number with r 2. Itis known that .Z=2r Z/ Š Z=2Z
Z=2r2 Z (or Š ..1 C 2Z/=.1 C 4Z// .1 C 4Z/=.1 C 2r2 Z/ , the latter is
cyclic generated by 5.) Show that if a non-trivial Dirichlet character modulo
2r satisfies .a/2 D 1 for any a with .a; 2/ D 1, then the conductor of is
either 4 or 8.
(3) Show that if is any non-trivial Dirichlet character such that 2 is trivial, then
the conductor of is jDK j for a fundamental discriminant of some quadratic
field K.
(4) Show that if is any non-trivial primitive Dirichlet character such that 2
is trivial, then D K for some quadratic field K, where K is defined as in
Sect. 6.3.
Exercise 8.27. Let be a primitive Dirichlet character with conductor f . Assume
that f D m1 m2 with .m1 ; m2 / D 1. We regard as a character of .Z=f Z/ Š
.Z=m1 Z/ .Z=m2 Z/ and denote by i the restriction to Z=mi Z for i D 1, 2.
We also denote by i the Dirichlet character modulo mi associated with i .
i is mi .
(1) Prove that the conductor of
(2) For any n, put n D exp 2n i . We take integers x, y with xm1 C y m2 D 1.
Show that for any a 2 Z, we have
fa D m
ay ax
:
1 m2
f 1
X .a/ a 1
D g./B2; :
aD1
.1 /
a 2 2
f 1
X .a/ a .1 C a / 1
D g./B3; :
aD1
.1 a /3 3
8.9 A Sum with Quadratic Conditions 137
X
p1
1 p.p 1/.7p C 1/
.m C n/ .m/mn D p 2 B1;2 :
m;nD0
2 12
(3) We have
X
p1
.lm C mn C nl/ D .1/p.p 1/:
l;m;nD0
X
p1
p 2 .p 1/
.lm C mn C nl/l D .1/ :
2
l;m;nD0
X
p1
p 2 .p 1/.2p 1/
.lm C mn C nl/l 2 D .1/ :
6
l;m;nD0
X
p1
p 3 .p 1/2
.lm C mn C nl/l 3 D .1/ :
4
l;m;nD0
X
p1
p 2 .p 2 1/
.lm C mn C nl/lm D .1/ :
6
l;m;nD0
X
p1
p 3 .p 2 1/
.lm C mn C nl/l 2 m D .1/ :
12
l;m;nD0
X
p1
3p 3 p 3 .p 1/
.lm C mn C nl/lmn D .B1; /2 C .1/ :
2 4
l;m;nD0
Pp1
(Hint: nD0 .n.l C m/ C lm/ is 0 if l C m 6 0 mod p. In the last relation,
substitute l, m, n by p l, p m, p n and add that to the original expression.)
Chapter 9
Special Values and Complex Integral
Representation of L-Functions
Since the series on the right-hand side converges absolutely for Re.s/ > 1, .s; a/
is defined in this range. This function .s; a/ is called a Hurwitz zeta function.
(Hurwitz [43], Whittaker1 and Watson2 [104, Part II]). In particular, when a D 1,
.s; 1/ is nothing but the Riemann zeta function .s/. In this section, we shall give
an integral representation of .s; a/.
We denote by .s/ the gamma function. This is defined by
1
Edmund Taylor Whittaker (born on October 24, 1873 in Southport, England—died on March 24,
1956 in Edinburgh, Scotland).
2
George Neville Watson (born on January 31, 1886 in Devon, England—died on February 2, 1965
in Warwickshire, England).
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 139
in Mathematics, DOI 10.1007/978-4-431-54919-2__9, © Springer Japan 2014
140 9 Special Values and Complex Integral Representation of L-Functions
Z 1
.s/ D e t t s1 dt (9.1)
0
for complex numbers s such that Re.s/ > 0. The integral on the right-hand side
converges absolutely and uniformly on any compact sets in Re.s/ > 0, and so we
see that .s/ is a holomorphic function for Re.s/ > 0. By integration by parts, we
get the most important property
.s C 1/ D s .s/ (9.2)
of the gamma function. Through this equality (9.2), .s/ can be continued
analytically to a meromorphic function on the whole s-plane. More concretely, if
we rewrite (9.2) as
.s C 1/
.s/ D ;
s
then since the right-hand side is a meromorphic function for Re.s/ > 1, .s/ is
continued analytically to Re.s/ > 1 and s D 0 is a pole of order 1 with residue 1.
We also have
.s C 2/
.s/ D ;
s.s C 1/
.s C M /
.s/ D (9.3)
s.s C 1/ .s C M 1/
and for the range Re.s/ > 1, we see that the right-hand side converges absolutely, so
the above calculation is justified. Hence, for the range Re.s/ > 1, we get an integral
representation of .s; a/ as
Z 1 Z 1
1 e at s1 1 te .1a/t s2
.s; a/ D t dt D t dt: (9.4)
.s/ 0 1 e t .s/ 0 et 1
In the definition of I."; 1/, the parts for u < " and u > " overlap, but we interpret
it that for u < " we take the path above the real axis and for u > " below the real
axis. This path is illustrated in Fig. 9.1.
Now consider the complex curvilinear integral
Z
te .1a/t s2
t dt:
I.";1/ et 1
Since we have to treat t s2 on I."; 1/, we shall choose a complex power t s for
s 2 C. Denoting the argument of t by arg t, we can define a single-valued function
log t on C fz D x C iy j y D 0; x 0g by
0 ε
t s D e s log t :
and on C2 by
Z Z 1
te .1a/t s2 te .1a/t .s2/.log t C2 i /
t dt D e dt
C2 et 1 " et 1
Z 1 .1a/t
te
De 2 i.s2/
t s2 dt:
" et 1
Z Z Z
t e .1a/t s2 1
t e .1a/t s2 e .1a/t s1
t dt D e 2 i s 1 t dt C t dt:
I.";1/ e 1
t
" e t 1 I."/ e t 1
(9.5)
Z 2 ˇˇ .1a/"ei ˇ
ˇ
ˇe i s ˇ
D" Re.s/
ˇ "ei e ˇ d: (9.6)
0 ˇe 1 ˇ
9.2 Contour Integral 143
"
We see is bounded as a function of " and , so if we take the limit " ! 0
1
e "ei
then, for Re.s/ > 1, by (9.6) we get
Z
e .1a/t s1
lim t dt D 0: (9.7)
"!0 I."/ et 1
On the other hand, in (9.5), by Cauchy’s integral formula, the curvilinear integral
on the left-hand side does not depend on the choice of sufficiently small positive
number " (" < 2), for if we take another "0 > 0, then in the region surrounded by
two paths, the integrand is a single-valued holomorphic function. Now we assume
that Re.s/ > 1. In (9.5), since the left-hand side does not depend on " > 0, we fix
" on the left-hand side, and on the right-hand side we take a limit as " ! 0. Then
using (9.7), we get
Z Z
te .1a/t s2 1
te .1a/t s2
t dt D e 2 i s 1 t dt: (9.8)
I.";1/ e 1
t
0 et 1
1
As is well known in complex analysis, is a holomorphic function on the
.s/
whole s-plane, so by Proposition 9.1 and Lemma 9.2, poles of .s; a/ result from
zeros of the function e 2 i s 1. Since .s; a/ converges absolutely for Re.s/ > 1
and is holomorphic there, possible poles are s D 1 or s D 1 m (m 2 N). Among
these, at s D 1 m the function
1
.s/.e 2 i s 1/
is holomorphic. Hence .s; a/ has a unique pole s D 1 and its order is 1. The residue
is given by
Z Z
.s 1/ te .1a/t s2 1 te .1a/t 1
lim t dt D t dt
s!1 .s/.e 2 i s 1/ I.";1/ e t 1 2 i I.";1/ e t 1
Z .1a/t
1 te .1a/t 1 e
D t dt D Res D 1:
2 i I."/ e 1 t t D0 et 1
The above calculation of the last integral is due to the residue theorem. The only
e .1a/t
pole of the integrand t inside I."/ is t D 0 and it is of order 1, so it is enough
e 1
to calculate its residue.
We have finished the proof of the latter half of Proposition 9.1. t
u
Next, we calculate the special values of .s; a/ at non-positive integers s D 1m
(m 2 N) by using the contour integral representation.
Proposition 9.3. Let m be a natural number. Then for a > 0, we have
Bm .a/
.1 m; a/ D :
m
Here Bm .a/ is the Bernoulli polynomial defined in Sect. 4.3 (p. 55).
Proof. By Proposition 9.1 we have
Z
1 te .1a/t m1
.1 m; a/ D lim t dt:
s!1m .s/.e 2 i s 1/ I.";1/ et 1
te .1a/t m1
In the last curvilinear integral, the integrand t becomes a single-valued
et 1
holomorphic function on C n f0g. Notation being as before, in the integral for the
contour I."; 1/ D C1 C I."/ C C2 , the curvilinear integrals on C1 and C2 cancel
each other. That is,
9.3 The Functional Equation of .s; a/ 145
Z Z
te .1a/t m1 te .1a/t m1
t dt C t dt D 0:
C1 et 1 C2 et 1
1 s.s C 1/ .s C m 1/
lim D lim
s!1m .s/.e 2 i s 1/ s!1m .s C m/ e 2 i.sCm1/ 1
.1/.2/ .1 m/
D
2 i
.1/m1 .m 1/Š
D :
2 i
So we have
te .1a/t m1
.1 m; a/ D .1/ m1
.m 1/Š Res t :
t D0 et 1
We fix t0 < 0, and denote by C.t0 / the line on the t-plane defined by Re.t/ D t0
oriented upwards:
We assume Re.s/ < 0. In the contour integral representation of the Hurwitz zeta
function .s; a/ (Proposition 9.1), we cut out the contour I."; 1/ as in Fig. 9.2, and
shift it over the imaginary axis to the line C.t0 /. Here we are taking 0 < a 1.
When it goes beyond the imaginary axis, it passes the pole t D 2 i n(n 2 Z;
te .1a/t s2
n ¤ 0) of the integrand t t , so the residues appear there. The assumptions
e 1
Re.s/ < 0 and 0 < a 1 are conditions for the integral to converge. Under the
condition 0 < a 1, the curvilinear integral along C.t0 / converges absolutely.
Since we can transform as
146 9 Special Values and Complex Integral Representation of L-Functions
−C(t0)
=⇒
0 t0 0
.s; a/
1 X e .1a/2 i n Z te .1a/t s2
D .2 i / t dt
.s/.e 2 i s 1/ .2 i n/1s C.t0 / e 1
t
n2Z
n¤0
1
1 X i 3 i
e 2 i na e 2 .s1/ C e 2 i na e 2 .s1/
D .2 i /
.s/.e 2 i s 1/ nD1
.2 n/1s
Z
te .1a/t s2
t dt ;
C.t0 / et 1
where the notation fag denotes the fractional part of a: 0 fag < 1, a fag 2 Z.
This transformation is valid for Re.s/ < 0. By these relations (9.9), (9.10), we get
1
is is .2/s X e 2 i na
e 2 .s; a/ C e 2 .s; 1 fag/ D ; (9.11)
.s/ nD1 n1s
.2/s .2/s
.s/ D s .1 s/ D .1 s/:
2 .s/ cos 2 2 .s/ sin .sC1/
2
Adjusting a little, this gives the usual functional equation of the Riemann zeta
function.
Theorem 9.5 (Functional equation of Riemann zeta function). The Riemann
zeta function .s/ can be continued analytically to a meromorphic function on the
whole s-plane. If we put
s
.s/ D s=2 .s/;
2
it satisfies the functional equation
In this section, our aim is to define a Dirichlet L-function and to look for its
functional equation and special values.
Let be a Dirichlet character modulo f . The Dirichlet L-function associated
with the character is defined by
X1
.n/
L.s; / D :
nD1
ns
The series on the right-hand side converges absolutely for Re.s/ > 1 and it is
holomorphic there.
First, we consider the special values at positive integers.
The Gaussian sum g./ associated with is defined by
X
f
g./ D .a/e 2 i a=f
aD1
X
f
.n/g./ D .a/e 2 i an=f (9.12)
aD1
.1/k1 .2 i /k
L.k; / D g./Bk; :
2kŠf k
Example 9.7. We give examples first. Let be the primitive character modulo 4 and
let k be an odd positive integer. Then we have D and g./ D 2i , so we have
kC1
.1/ 2 k
L.k; / D Bk; :
2k kŠ
We write two of them explicitly below. By the expression in (4.1) in Sect. 4.2, we
have B1; D 12 and B3; D 32 , so we get
1 1 1
L.1; / D 1 C C D ;
3 5 7 4
1 1 1 3
L.3; / D 1 C C D :
33 53 73 32
Next, let be the primitive character modulo 3. Since we have B1; D 13 , B3; D
p
3 , g./ D
2
3i , we get
1 1 1 1 1
L.1; / D 1 C C C D p ;
2 4 5 7 8 3 3
1 1 1 1 1 22 3
L.3; / D 1 C C C D p :
23 43 53 73 83 34 3
X
f 1
X e 2 i an=f
L.k; /g./ D .a/ :
aD1 nD1
nk
Noting the assumption .1/ D .1/k , the right-hand side is transformed into
1X X e 2 i an=f
f
.a/ :
2 aD1 nk
n2Z
n¤0
150 9 Special Values and Complex Integral Representation of L-Functions
By virtue of Theorem 4.11 (p. 59) and (4.1) (p. 54), this value can be rewritten as
.2 i /k X
f
.2 i /k
.a/Bk .a=f / D Bk; :
2kŠ aD1 2kŠf k1
Noting g./1 D .1/g./=f by Lemmas 8.2 and 8.3 in the last section, we can
derive the theorem from this. t
u
We express L.s; / by using Hurwitz zeta functions. Hurwitz zeta functions
are very useful to deduce various properties of L-functions. Under the assumption
Re.s/ > 1, we can transform L.s; / as
Xf
X1
.a C f m/
L.s; / D
aD1 mD0
.a C f m/s
X
f 1
X 1
D .a/
aD1 mD0
.a C f m/s
X
f 1
X
s 1
Df .a/
aD1 mD0
.m C a=f /s
X
f
s a
L.s; / D f .a/ s; :
aD1
f
X
f
f 1 .a/ D 0
aD1
Let us derive the functional equation of L.s; / from the functional equation of
Hurwitz zeta functions (Theorem 9.4). We use the following notation ı D ı./,
defined by
0 if .1/ D 1;
ıD
1 if .1/ D 1:
where we put
g./
W ./ D p :
f iı
Proof. By using Proposition 9.8 and Theorem 9.4, we continue L.s; / analytically
to the region Re.s/ < 0, and get
X
f
s a
L.s; / D f .a/ s;
aD1
f
⎧ ⎫
. i /.2/s1 ⎨ i s X X .a/e 2 i n f 2 i n fa ⎬
1 f 1 f
i s X X .a/e
a
D s e 2 e 2 :
f .s/ sin s ⎩ nD1 aD1
n1s
nD1 aD1
n1s ⎭
s
1=2 s1 sC1
.s/ D 2
2 2
sCı
D 1=2 2s1 1sCı
2 cos .sı/ 2 2
and then substituting this into (9.13) and simplifying the result, we get the desired
functional equation. t
u
Finally, let us look at special values of L.s; / at negative integers.
Theorem 9.10. Let be a primitive character modulo f . For any positive integer
m, we have the following formula:
Bm;
L.1 m; / D :
m
Proof. By Propositions 9.8 and 9.3, we have
X
f
a
L.1 m; / D f m1
.a/ 1 m;
aD1
f
X
f
Bm .a=f /
D f m1
.a/ :
aD1
m
Bm;
By (4.1) on p. 54, this is equal to . t
u
m
Remark 9.11. We note that Theorem 9.6 can be derived also from Theorem 9.10
and the functional equation (Theorem 9.9).
Exercise 9.12. Let a be any complex number. Put f .s/ D .s/.s a/. Find a
functional equation of f .s/, that is, give a relation between f .s/ and f .b s/ for a
suitably chosen number b.
Exercise 9.13. Let a be a positive integer. Determine the location of poles of the
function .s/.s a/ and their residues for each case when a is even or odd.
Exercise 9.14. (1) Let be the primitive character with conductor 4. Show that
L.1; / D 4 by using the integral
Z 1
1
dx
0 1 C x2
Z 1
1
dx:
0 1 C x C x2
Exercise 9.15. (1) Let be any Dirichlet character modulo f . Show that for any
x 2 C with Re.s/ > 1, the infinite product
Y
.1 .p/p s /1
pWpri me;p−f
We first review the prime ideal decomposition of the maximal order of quadratic
fields. Since there are many good books on prime ideal decompositions of the
Dedekind domain, we do not give proofs here. For details, we refer to Lang [67],
Weber1 [101]. Then we supply the theory of ideals of a quadratic order which is not
maximal. Since such orders are not Dedekind domains, the theory of ideals is not
covered by the above general theory, so we explain the difference.
We fix a quadratic field K of discriminant DK and denote by Omax the maximal
order of K. For any non-zero ideals a and b of Omax , we write
X
ab D ai bi I ai 2 a; bi 2 b; is an arbitrary natural number
i D1
and call this a product of a and b. It is clear that this is also a non-zero ideal of
Omax . We give the following theorem without proof. We denote by K the Dirichlet
character associated with K defined in Sect. 6.3.
1
Heinrich Martin Weber (born on May 5, 1842 in Heidelberg, Germany—died on May 17, 1913 in
Strasbourg, Germany (now France)).
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 155
in Mathematics, DOI 10.1007/978-4-431-54919-2__10, © Springer Japan 2014
156 10 Class Number Formula and an Easy Zeta Function of the Space of Quadratic Forms
Theorem 10.1. (1) Any .non-zero/ ideal of the maximal order Omax of a quadratic
field K is a product of prime ideals. This factorization is unique. For any ideals
a and b of Omax , we have N.ab/ D N.a/N.b/.
(2) For any prime p 2 Z, the ideal pOmax is decomposed as follows.
(i) If K .p/ D 1, then pOmax D p1 p2 , where p1 and p2 are different prime
ideals related by p2 D p1 . Here we put
p1 D f˛I
N ˛ 2 p1 g
pOmax D p2
Here the sum runs over all non-zero ideals of Omax and it converges absolutely for
Re.s/ > 1. The existence and uniqueness of the prime ideal factorization gives the
Euler product decomposition
Y 1
K .s/ D ;
p
1 N.p/s
again for Re.s/ > 1. Here the product runs over all prime ideals of Omax . Also the
decomposition of primes of Z in K (Theorem 10.1) gives
K .s/ D .s/L.s; K /:
(1) The Dirichlet character K .n/ D DnK is a primitive Dirichlet character
modulo jDk j and satisfies the above property.
(2) Conversely, any non-trivial primitive Dirichlet character modulo jDK j which
satisfies the above property is equal to K .
Now in the rest of this section, we explain a multiplicative structure of ideals
including the case of non-maximal orders.
p f 1 and we consider ideals of an order Of of a quadratic
We fix an integer
field K D Q. m/. We extend the notion of ideals so that we can consider a
group of ideals. For this purpose, we introduce Of -modules, called fractional ideals,
which are not necessarily contained in Of . To define this, we first consider a free
Z-submodule L of the quadratic field K. The rank of L as a free Z-module is at
most 2. Indeed, if we assume that rankZ L 3, then there are elements !1 , !2 ,
!3 2 L which are the part of a basis over Z. If c1 !1 C c2 !2 C c3 !3 D 0 for some
ci 2 Q, then there is some non-zero r 2 Z such that rci 2 Z for all i , so we have
rci D 0 and hence ci D 0 for i D 1, 2, 3. Since L K and K is of dimension two
as a vector space over Q, this is a contradiction.
p
Definition 10.3. A free Z-submodule of rank 2 in the quadratic field K D Q. m/
is called a lattice of K.
For lattices L1 and L2 , we define their product by
X
L1 L2 D ai bi I ai 2 L1 ; bi 2 L2 ; is an arbitrary natural number :
i D1
p p
(a, b 2 Z), but .5 C 5 2/˛ D .a 10b/ C .a C 5b/ 2 and since this should
p
belong to O5 , we have a C 5b 2p5Z, so a 2 5Z. So we get L Z C Z 2,
therefore we have aL 5.Z C Z 2/ ¤ O5 . Notice in this example that a is not
a proper ideal.
Lemma 10.5. Let a be an Of ideal. There exists the fractional ideal L of Of such
that aL D Of if and only if a is a proper Of ideal. If a is proper, L is uniquely
determined by a.
Proof. First assume that a is proper and we construct L such that aL D Of
explicitly. It is sufficient to give it when a is a primitive ideal. Let f1; !g be the
standard basis of the maximal order of K and express the ideal as a D Za C Z.d C
f !/ in the standard basis. We denote by x 7! x the non-trivial automorphism of
K=Q, and put M D Z C Za1 .d C f !/ D a1 a. This is obviously an Of module
and hence a fractional ideal of Of . Taking the product, we have
f 2 KI L Lg D Of :
For any proper ideal a of Of , a1 is also a proper fractional ideal. For any proper
fractional ideal L, we have rL Of for some integer r, and rL is obviously a
proper ideal of Of , so L has also an inverse M , which is a fractional Of ideal such
that LM D Of .
Lemma 10.6. The product L1 L2 of proper fractional ideals L1 and L2 of Of is a
proper fractional ideal.
Proof. It is sufficient to prove when L1 and L2 are (integral) ideals. Let a and b
be proper ideals and put ab D c. For ˛ 2 K, assume that ˛c c. Since a is a
proper ideal, there exists a1 and we have a1 c D a1 ab D b. Hence we have
˛b D a1 .˛c/ a1 c D b and since b is a proper ideal, we have ˛ 2 Of . t
u
10.1 Ideal Class Groups of Quadratic Fields 159
a C mOf D Of :
a C f Of a C f Omax Of :
Lemma 10.10. Let m be an arbitrary natural number. For any proper ideal a of
Of , there exists a proper ideal of Of which is equivalent to a and coprime to m.
Proof. It is enough to prove it when a is a proper primitive ideal. We write a
primitive quadratic form corresponding to the class of a by
a b=2
:
b=2 c
It is enough to see that we can make a prime to m by changing the quadratic form to
an equivalent quadratic form. Let S be the set of all primes which divide m and put
S1 D fp 2 S I p − ag;
S2 D fp 2 S I pja and p − cg;
S3 D fp 2 S I pja and pjcg:
Q Q
Put x D p2S2 p and y D p2S1 p. Then we have .x; y/ D 1, and if we put
l WD ax 2 C bxy C cy 2 , then l is coprime to m. Indeed, if p 2 S1 then ax 2 is
coprime to p, but pj.bxy C cy 2 /, so p − l, and if p 2 S2 then pj.ax 2 C bxy/ but
p − cy 2 , so p − l. If p 2 S3 , then since we assumed first that a, b, c are coprime, we
have p − b, and since p − xy, again l is coprime to p. Taking a matrix A 2 SL2 .Z/
whose first row is .x; y/, we have
a b=2 t l
A AD ;
b=2 c
so we are done. t
u
We denote by I0 .Of ; f / the set of ideals of Of which are coprime to f and by
I0 .Omax ; f / the set of ideals of Omax which are coprime to f .
Lemma 10.11. The mapping W I0 .Omax ; f / ! I0 .Of ; f / defined by .a/ D
a \ Of is a bijection. This maps a product to a product and does not change the
norm.
Proof. First we show that this mapping does not change the norm. Let a be an ideal
of Omax coprime to f . By the isomorphism theorem of rings, we have Of =a \
Of Š .Of C a/=a. Since we assumed here that a is coprime to f , we have a C
f Omax D Omax , but since f Omax Of , we also have Of C a D Omax . Namely
we get
and this means that the norms are equal. In particular, we have seen that the
image of the mapping is contained in I0 .Of ; f / since the norm is coprime to f .
10.1 Ideal Class Groups of Quadratic Fields 161
The surjectivity of the mapping is proved as follows. For a0 2 I0 .Of ; f /, the Omax
ideal a0 Omax is coprime to f , since 1 2 Of D a0 C f Of a0 Omax C f Omax .
We have
h.DK /. This calculation is not so difficult and is explained for example in [101] or
[67], but we omit it here. In next section, we use a different method to obtain this
difference in the case of imaginary quadratic fields.
We prove that the class number of an imaginary quadratic field can be written in
terms of a generalized Bernoulli number. There are many ways to prove this; here
we use properties of theta functions.
Theorem 10.12. Let K be an imaginary quadratic field, K be the character
corresponding with K defined in Sect. 6.3 and w be the half of the number of roots
of unity in the maximal order Omax of K. Then the class number of OK is given by
h.DK / D wB1;K :
For the proof, we review the transformation formula of theta functions. Let a, b,
c be integers, and we assume that the quadratic form Q.x; y/ D ax 2 Cbxy Ccy 2 is
positive definite. We define Q1 by Q1 .x; y/ D 4.4ac b 2 /1 .cx 2 bxy Cay 2 /.
We put
It is easy to see that this converges absolutely and uniformly on compact sets in H,
and is a holomorphic function on H.
Proposition 10.13. For 2 H, the following transformation formula holds.
p
1
Q1 D 4ac b 2 .=2i / Q ./:
Proof. A general proof can be found for example in [3, p. 25]. Here we give a direct
proof restricted to this case. We use the Poisson2 summation formula which is well
2
Siméon Denis Poisson (born on June 21, 1781 in Pithiviers, France—died on April 25, 1840 in
Sceaux, France).
10.2 Proof of the Class Number Formula of Imaginary Quadratic Fields 163
known in the theory of Fourier3 transforms. To simplify the notation, for arbitrary
complex number z 2 C, we write e.z/ D e 2 i z . For ˛ > 0, 2 H and , 2 R, the
following formula is well known:
Z 1
e..2˛/1 2 /
e.21 ˛ 2 /e. /d D p :
1 i˛
Here as thepsquare root of i˛ (whose real part is positive since 2 H), we take
the branch ˛ > 0 for D i . When we put
the Fourier transform of this function with respect to the variables .; / 2 R2 is
calculated as follows:
fO. ; /
Z 1Z 1
D f .; /e. /d d
1 1
Z 1 Z 1
D e.21 .a. C .b=2a//2 C .c b 2 =4a/2 //e. /d d
1 1
Z 1 Z 1
D e.21 .a 2 C .c b 2 =4a/2 //e. . .b=2a/ //d d
1 1
1
D p p
ia i.c b 2 =4a/
e..2a/1 2 /e..2.c b 2 =4a//1 . .b=2a/ /2 /
2i 1 2 2
D p e .a b C c / :
.4ac b 2 / 2.ac b 2 =4/
By this result and the Poisson summation formula (e.g. [102]), we get
X X
f .; / D fO. ; /:
;2Z ; 2Z
3
Jean Baptiste Joseph Fourier (born on March 21, 1768 in Auxerre, France—died on May 16, 1830
in Paris, France).
164 10 Class Number Formula and an Easy Zeta Function of the Space of Quadratic Forms
h.DK /
lim .s 1/K .s/ D p
s!1 w jDK j
and
g./ i B1;
L.1; K / D B1;K D p K ;
jDK j jDK j
lim .s 1/.s/ D 1:
s!1
The latter two equalities are in Theorems 5.4 and 9.6. The point here is the
evaluation of lims!1 .s 1/K .s/. We fix an ideal class of Omax and denote by C the
set of all (integral) ideals of Omax belonging to this class. We define the following
zeta function which is a part of K .s/:
X 1
.s; C / D :
a 2C
N.a/s
N.a/s X 1
.s; C / D :
2w ˛2a N.˛/s
We write a D Za C Z.d C !/ using the standard basis and put N.d C !/ D ac,
b D 2d C T r.!/ as usual. Then for ˛ D xa C y.d C !/ we have N.˛/ D
a.ax 2 C bxy C cy 2 /. Since N.a/ D a, we can cancel as in the numerator and the
denominator. So we have
1 X
.s; C / D .ax 2 C bxy C cy 2 /s :
2w
x;y2ZI .x;y/¤.0;0/
We have also 4ac b 2 D jDK j. Now our aim is to calculate the residue of this
function at s D 1. We use theta functions. Since .s; C / converges absolutely
10.2 Proof of the Class Number Formula of Imaginary Quadratic Fields 165
we get
Z 1 X
1
e .a
2 CbCc2 /y
.s/.s; C / D y s1 dy
2w 0
.;/2Z2 .0;0/
s Z 1
D .Q .iy/ 1/y s1 dy:
2w 0
Therefore we have
h.DK /
lim .s 1/K .s/ D p :
s!1 w jDK j
Comparing this with the value of L.1; K /, we get the class number formula. t
u
Since .s/ and L.s; / have functional equations, K .s/, which is a product of
these two, also has a functional equation and it is easy to calculate its explicit form.
But as a matter of fact, if we use the above integral representation, we can obtain the
analytic continuation and the functional equation of .s; C /, and through this we
can get an alternative proof of the analytic continuation and the functional equation
of K .s/. Namely:
Proposition 10.15. Let C be an ideal class of the maximal order Omax of a
imaginary quadratic field K, and let the other notation be the same as above.
Then .s; C / can be continued analytically to a meromorphic function on the
whole s-plane. If we put .s; C / D .2/s jDK js=2 .s/.s; C /, then it satisfies
the following functional equation.
.1 s; C / D .s; C /:
In the same way, if we put K .s/ D .2/s jDK js=2 .s/K .s/, then we have
K .1 s/ D K .s/:
Proof. For the proof, we use the integral representation mentioned before. For
Re.s/ > 1, we have
Z 1 Z 1
2
.2w/ s .s/.s; C / D .Q .iy/ 1/y s1 dy C p .Q1 .iy/ 1/y s dy
1 jDK j 1
2 1
p (10.1)
jDK j.1 s/ s
but the integral on the right-hand side is holomorphic on the whole s-plane. Hence
the right-hand side is a meromorphic function on the whole complex s-plane. So the
analytic continuation is proved.
The functional equation is shown as follows. Multiplying both sides of the
integral representation before by 2s jDK js=2 and changing s ! 1 s, we get
.2w/.1 s; C /
D .2w/jDK j.1s/=2 .2/s1 .1 s/.1 s; C /
Z 1
D 2s1 jDK j.1s/=2 .Q .iy/ 1/y s dy
1
10.2 Proof of the Class Number Formula of Imaginary Quadratic Fields 167
Z 1
1=2
C 2 jDK j
s
jDK j .1s/=2
.Q1 .iy/ 1/y s1 dy
1
Here the integrals in the parentheses are almost identical to the integral expres-
sion (10.1) of .s; C / if we exchange Q and Q1 . We defined
4
Q1 .x; y/ D .cx 2 bxy C ay 2 /;
jDK j
and the quadratic forms ax 2 C bxy C cy 2 and cx 2 bxy C ay 2 are equivalent since
it is the same if we replace .x; y/ ! .y; x/. Since we have
(If we note that the discriminant of Q1 is 16jDK j1 , we see the symmetry to the
case Q.) Therefore comparing this with (10.2), we have
Z 1
.2w/.1 s; C / D 2s jDK js=2 .Q1 .iy/ 1/y s1 dy
0
Z 1
D .2/s jDK js=2 22s jDK js s .Q1 .iy/ 1/y s1 dy
0
Hence the proposition is proved. The functional equation of K .s/ is obvious from
this. t
u
As for the formula for the class number h.f 2 DK / of the ideal class group of a
general order Of , it is easiest to compare it with the ideal class group of Omax by
a group theoretic method. (For example, see Weber [101, p. 366], Lang [67, p. 95],
Zagier [106].) But here we introduce a different method.
We first give the formula for the zeta functions of Of ideals for general f 1.
Here we do not assume that K is imaginary, since the theory is the same for real
quadratic fields.
We fix a natural number f and consider the zeta function of ideals of the order
Of D Z C Zf ! of K, which is defined by
X 1
.s; f 2 DK / D ;
a
N.a/s
where the sum runs over arbitrary ideals of Of , which are not necessarily proper.
Proposition 10.16. (1) For a positive integer a, let r.a/ be a number of residue
classes modulo 2a represented by integers b such that b 2 f 2 DK mod 4a.
Then we have
1
X
.s; f 2 DK / D .2s/ r.a/as :
aD1
Here Œe=2 denotes the biggest integer which does not exceed e=2, that is, we put
Œe=2 D e0 if e D 2e0 or 2e0 C 1 for some integer e0 . Also when p D 2, we
obtain the same formula (10.3) by similar consideration. For example, assume that
0 e 2m. If e D 2e0 is even, then b 2 22m f02 DK mod 2eC2 means that
b D 2e0 b0 and b02 f02 DK mod 4. This last relation imposes the condition that
b0 is even or odd. So the number of solutions b mod 2eC1 is 2e0 C1 =2 D 2e0 . If
e D 2e0 C 1 is odd, then 2e0 C 2 2m, so b D 2e0 C1 b0 . Again the relation b02
f02 DK mod 2 tells that b0 is odd or even. So the number of solutions b mod 22e0 C2
is 22e0 C2.e0 C1/ =2 D 2e0 . The case e 2m C 1 is calculated similarly. Here we
note that if DK is odd, then K .2/ D 1 if DK 1 mod 8 and K .2/ D 1 if
DK 5 mod 8, and that the congruence equation x 2 y mod 2lC2 with l 1
for a fixed y and unknown x has no solution when y 5 mod 8 but two solutions
x mod 2lC1 when y 1 mod 8. We should also note that, by definition, if DK is
even then DK 0 mod 4, and if DK 4 mod 8 then DK =4 3 mod 4. The rest
is the same and is omitted here. By using (10.3), we see
170 10 Class Number Formula and an Easy Zeta Function of the Space of Quadratic Forms
1
X .1 C p s / .1 C p s1 /p .12s/.mC1/
rp .p e /p es D
eD0
1 p 12s
⎧
⎪
⎨p m.12s/s if pjDK ,
C .1 C K .p//p m.12s/s
⎪
⎩ if p − DK .
1 p s
The last term written for each case pjDK or p − DK separately has a unified
expression
.1 C K .p//p m.12s/s
:
1 K .p/p s
t
u
Next we put
X 1
.s; Of / D :
a
N.a/s
But this time, we define the above sum so that a runs only over proper ideals of Of .
4
August Ferdinand Möbius (born on November 17, 1790 in Schulpforta, Saxony (now Germany)—
died on September 26, 1868 in Leipzig, Germany).
10.2 Proof of the Class Number Formula of Imaginary Quadratic Fields 171
If m1 and m2 are coprime natural numbers, then we have .m1 /.m2 / D .m1 m2 /.
The next formula is known as the Möbius inversion formula.
Lemma 10.17. Let F .n/ and G.n/ be functions defined for all natural numbers n.
We assume that, for an arbitrary natural number n, the following equality holds.
X
F .n/ D G.d /:
d jn
.s; Of / D .s/L.s; K /
Y .1 p s /.1 K .p/p s / p m12sm .1 p 1s /.K .p/ p 1s /
:
pjf
1 p 12s
p m kf
Proof. The first equality of (1) comes from the facts that any ideal of Of is a
proper ideal of Oe for some ejf and that if the ideal is regarded as an ideal of
Oe , then the norm becomes f =e times the original one. The second equality follows
from the Möbius inversion formula, applying it to F .f / D f s .s; f 2 DK / and
G.e/ D e s .s; Oe /. In order to show (2), we calculate the right-hand side of
the second equality of (1) by using Proposition 10.16. Since the Möbius function
Qtmultiplicative,
is it is enough to calculate the Euler p-factors. Here when f D
mi
p
i D1 i , by the definition of the Möbius function, we can assume that d runs only
Q m j
over the numbers pi i i with ji D 0; 1. So if we write
then we have
Y
.s; Of / D .s/L.s; K / .f .m; p s / p s f .m 1; p s // :
pjf
X 1
.s; C; Of / D :
a 2C
N.a/s
Let wf be half the number of roots of unity in Of . For a primitive quadratic form
Q.x; y/ D ax 2 Cbxy Ccy 2 with discriminant f 2 DK corresponding to C , we have
X
.2wf /.s; C; Of / D .ax 2 C bxy C cy 2 /s :
.x;y/2Z2 .0;0/
This is proved in the same way as in the case of the maximal order. If we use the
transformation formula of Q ./, the residue at s D 1 of this Dirichlet series is
obtained in the same way as before, independently of C , and given by
lim .s 1/.s; C; Of / D p :
s!1 wf f jDK j
By this, we have
lim .s 1/.s; Of / D h.f 2 DK / p :
s!1 wf f jDK j
On the other hand, using the Euler product expression in the formula for .s; Of /
in Proposition 10.18, we have
Y
lim .s 1/.s; Of / D .lim .s 1/K .s// .1 K .p/p 1 /
s!1 s!1
pjf
h.DK / Y
D p .1 K .p/p 1 /:
w jDK j pjf
10.3 Some L-Functions Associated with Quadratic Forms 173
h.DK / Y
h.f 2 DK / D f .1 K .p/p 1 /
w=wf
pjf
We introduce here one of the simplest example of zeta functions associated with so-
called prehomogeneous vector spaces. The content of this section is based on [47].
that p 3 mod 4. We define a Dirichlet character
Let p be a prime. We assume
modulo p by .a/ D pa . From this, we define a function Q on the set L of
all 2 2 half-integral symmetric matrices as follows. First, for
a b=2
T D 2 L ;
b=2 c
Indeed since the rank of T mod p is 1, there exists g such that the second column
of T g mod p is 0, and so is the second column of t gT g, but since this is symmetric,
the .2; 1/ component is also 0. We define .T / D .e/ for such T . This is well
defined since if we assume that
0
xy e0 x z e 0
D
zw 00 yw 0 0
0 xy
for e, e 6 0 mod p and 2 GL2 .Z=pZ/, then we have e 0 D x 2 e (and z D 0,
z w
x 2 .Z=pZ/ ).
We denote by LC the set of all positive definite half-integral symmetric matrices
in L . Recall that T1 and T2 2 L are said to be equivalent if we have t gT1 g D T2
for some g 2 SL2 .Z/. We define an L-function of LC with character Q by
174 10 Class Number Formula and an Easy Zeta Function of the Space of Quadratic Forms
X Q .T /
L.s; LC ; / D :
.T / det.T /s
T 2L
C =
Here LC =
denotes a complete set of representatives of LC up to the equivalence
and
.T / the order of the finite group fg 2 SL2 .Z/I t gT g D T g. (This L-function
was defined first by K. Hashimoto in relation to dimension formulas of automorphic
forms. This is also an example of L-functions of prehomogeneous vector spaces.
As we see later, Q can be regarded as a character of some ideal class group, so the
term “L-function with character Q ” is not strange.)
a b=2
Let us try to calculate an example. If T D 2 L and rank.T mod
b=2 c
p/ D 1, then Q .T / D .a/ if p − a and Q .T / D .c/ if pja. Indeed if pja, then
by the condition on the rank, we have pjb and c 2 .Z=pZ/ , and if p − a, then we
have
!
1 0 a b=2 1 b=2a a 0 a0
D b2 mod p:
b=2a 1 b=2 c 0 1 0 c 4a 00
22s1 B1;
L.s; LC ; / D .2s 1/:
ps
10.3 Some L-Functions Associated with Quadratic Forms 175
p m1
L.1 m; LC ; / D B2m B1; :
22m m
Before proving this, we prove a lemma. For an imaginary quadratic field K, its
discriminant DK and a positive integer f , we put d D f 2 DK and
a b=2 ;pri m
P.d / D T D 2 LC I 4ac b D d :
2
b=2 c
;pri m
Here LC is the subset of LC consisting of primitive ones. We also put S.d / D
P.d /=
.
Lemma 10.21. We assume that d (< 0) is divisible by p.
p p X
(1) If Q. d / ¤ Q. p/, then Q .T / D 0.
T 2S.d /
p p
(2) If Q. d / D Q. p/, then Q .T / D 1 for any T 2 P.d /.
Proof. We first see that Q .T / is the value of at the norm of the ideal correspond-
ing to T if rank.T mod p/ D 1. In this case, by definition, there exist integers x,
y coprime with each other such that Q .T / D .e/ for e D ax 2 C bxy C cy 2 .
Since 4ae D .2ax C by/2 C .4ac b 2 /y 2 and since we assumed that 4ac b 2
is divisible by p, we have .ae/ D .2ax C by/2 . By assumption, we have
p − e, and replacing by an equivalent one if necessary, we may assume that
p − a. Hence we have .2ax C by/ ¤ 0 and .2ax C by/2 D 1, so we
have .a/ D .e/. p This is the value at the norm. Therefore, since we have
p
K D if K D Q. d / D Q. p/, we get Q .T / D 1 by Proposition 10.2
p p
and Lemma 10.11. If K D Q. d / ¤ Q. p/, then K ¤ . So Q .T / takes 1
on some elements of S.d / again by Proposition 10.2. That is, Q is a character of
the group S.d / which is not trivial, so the sum over the group becomes 0. t
u
Proof of Theorem. First, it is obvious that Q .eT / D .e/ Q .T / for any integer e
prime to p and any T 2 LC , and since
.eT / D
.T /, we reduce the L-function to
the sum over primitive quadratic forms. Namely if we put
X Q .T /
Lpri m .s; LC ; / D ;
;pri m
.T / det.T /s
T 2LC =
then we have
176 10 Class Number Formula and an Easy Zeta Function of the Space of Quadratic Forms
1
X X .e/ Q .T /
L.s; LC ; / D
eD1 T 2L;pri m =
.eT / det.eT /s
C
1
X .e/ pri m
D L .s; LC ; /
eD1
e 2s
Here L.2s; / is the Dirichlet L-function. Next, if we apply the previous lemma to
;pri m
LC , only the part for T with 4 det.T / D d D f 2 p (f are positive integers)
remains alive. Since
.T / has a common value for all the elements in P.d /, we
denote this by
.d /. Then we have
1
X jS.d /j
Lpri m .s; LC ; / D 22s :
p s f 2s
.d /
f D1
By virtue of the relation between primitive quadratic forms and proper primitive
ideals of an order of a quadratic field, jS.d /j is the class number h.f 2 DK / of the
p
order Of of the imaginary quadratic field K D Q. p/. By the class number
formula, we have
1
h.DK / X 12s Y 1
Lpri m .s; LC ; / D 22s f 1 .q/ :
jO
max jp
s q
f D1
qjf
qWpri me
h.DK / B1;
D s :
jO
max jp s 2p
1
X X 1
D f 12s .m/.m/:
m
f D1 mjf
1
X Y
n12s m2s .m/.m/ D .2s 1/ .1 .q/q 2s /
m;nD1 qWpri me
aa D aOf :
b1 b2
˛Dx C c1 z;
2
b2 b1
ˇDy C c2 z;
2
D .c2 x C c1 y/
10.3 Some L-Functions Associated with Quadratic Forms 179
and N.a1 a2 / D a1 a2 .
b2 D
(4) If we put c0 D 0 , then this should be an integer. Show that
4a1 a2
2
b1 C b2
c0 D .c1 z C b1 x/.c2 z C b2 y/ C .a1 x C a2 y/.c2 x C c1 y/ xy
2
D ˛ˇ :
b1 Cb2
We write a1 D la10 , a2 D la20 and 2 D lB.
(1) Put
b1 b2 C D
b0 D xa1 b2 C ya2 b1 C z
2
b1 b2
˛Dx C c1 z;
2
b2 b1
ˇDy C c2 z;
2
D .xc2 C yc1 /
b1 b2
D a10 ˛ a20 ˇ 2 mZ:
2
Hence c1 z 2 mZ.
(ii)
b2 b1
B.c1 z/ a20 D c1 C Bx:
2
Hence c1 2 mZ.
(iii) We have b1 Cb
2
2
D lB 2 mZ and hence b1 2 mZ. Also we have
a1 D la10 2 mZ.
(iv) m D 1 since gcd.a1 ; b1 ; c1 / D 1.
10.3 Some L-Functions Associated with Quadratic Forms 181
where b0 = l is an integer.
(5) Show that
(6) Find integers a3 , b3 , c3 , X and Y similarly as in the last exercise such that
(Hint: Write down the product of general elements of a1 and a2 by the basis
of a1 a2 and take the norms. For example, put a3 D a10 a20 , b3 D b0 = l, c3 D
b32 D
, and X D lx1 x2 C ˛x2 y1 C ˇx1 y2 C y1 y2 , Y D a10 x2 y1 C a10 x1 y2 C
4a10 a20
By1 y2 . )
Exercise 10.29. (1) We fix D D f 2 DK . Show that for a natural number m and a
proper Of ideal a, there exists a proper Of ideal b which is equivalent to a and
whose norm N.b/ is prime to m.
(2) For two proper Of ideals a1 , a2 , show that N.a1 a2 / D N.a1 /N.a2 / by using
Exercise 10.27 but not using Exercise 10.28.
p
Exercise 10.30. (1) For K D Q. 5/, let Of be the order of K of conductor f .
By using Proposition 10.18, count the number of proper Of ideals of norm 4
and 16 for f p D 1, 2, 3. Also, give them all explicitly.
(2) For K D Q. 11/ and the order O3 of K of conductor 3, give all the proper
ideals of norm 9 explicitly.
p
p 10.31. For K D Q. 7/, we take the order O3 of conductor 3. Put L D
Exercise
3Z C 7Z. Show that L is a proper O3 lattice, though it is not an integral ideal of
O3 (i.e. not contained in O3 ).
p p
Exercise 10.32. p For K D Q. 82/, let Omax D Z C Z 82 be the maximal order
and O2 D Z C 2 82Z the order of conductor 2.
p
(1) Show that a D 2Z C 82Z is an ideal of Omax .
(2) Show that a is not prime to 2 and that a \ O2 is not a proper ideal of O2 .
Exercise 10.33. Assume that a is a proper Of ideal. We write a by the standard
basis as a D l.Za C Z.d C f !// with N.d C f !/ D ac. We assume that a C
f Omax D Of . Prove that a C f Of D Of by the following steps.
(1) Two numbers l and f are coprime.
(Hint: If there exists a prime p such that pjl and pjf , then a C f Omax
pOmax , which is a contradiction.)
182 10 Class Number Formula and an Easy Zeta Function of the Space of Quadratic Forms
In modern number theory, the p-adic method or p-adic way of thinking plays an
important role. As an example, there are objects called p-adic L-functions which
correspond to the Dirichlet L-functions, and in fact the natural setup to understand
the Kummer congruence described in Sect. 3.2 is in the context of the p-adic
L-functions. To be precise, a modified version (by a suitable “Euler factor”) of
Kummer’s congruence guarantees the existence of the p-adic L-function.
To discuss this aspect fully is beyond the scope of this book, but in this chapter we
explain the p-adic integral expression of the Bernoulli number and prove Kummer’s
congruence using it. Interested readers are advised to read books such as Iwasawa
[51], Washington [100], Lang [66].
We assume the basics of p-adic numbers. For this we refer readers to Serre [83,
Ch. 1] or Gouvea [37]. The results in this chapter are not used in other chapters.
In this section we review the general correspondence between measures on the ring
of p-adic integers Zp and the ring of formal power series. We use this setup in the
next section to define the Bernoulli measure on Zp and to express Bernoulli numbers
as integrals. This expression turns out to be very useful in proving Kummer’s
congruence relation.
Let Qp be the algebraic closure of the field Qp of p-adic numbers. The p-adic
absolute value j j of Qp (normalized by jpj D 1=p) is extended uniquely to Qp . We
use the same notation j j for this extension. Then Qp is not complete with respect
to this absolute value, and the completion is denoted by Cp . The absolute value j j
also extends naturally to Cp . Let Op be the ring of integers of Cp :
Op D fx 2 Cp j jxj 1g:
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 183
in Mathematics, DOI 10.1007/978-4-431-54919-2__11, © Springer Japan 2014
184 11 p-adic Measure and Kummer’s Congruence
Remark 11.1. Like the complex number field C, the field Cp is complete and
algebraically closed. To do analysis in the p-adic setting, we need this big field.
First we review the general theory of measures on Zp .
Denote the Z-module Z=p n Z by Xn and the canonical map from XnC1 to Xn by
nC1 , so nC1 W XnC1 ! Xn is defined by
The system of pairs .Xn ; n / gives a projective system and we have the projective
limit lim Xn :
Y
lim Xn D .xn / 2 Xn j nC1 .xnC1 / D xn :
n1
The ring of p-adic integers Zp is identified with this projective limit lim Xn .
holds.
The set of Op -valued measures on Zp is denoted by M.Zp ; Op / . This has an
Op -module structure. Further, the norm of D fn g 2 M.Zp ; Op / is defined as
Z pn 1
X
'.x/d.x/ D lim '.r/n .r/:
Zp n!1
rD0
11.1 Measure on the Ring of p-adic Integers and the Ring of Formal Power Series 185
(We use the abbreviated notation n .r/ for n .r mod p n /. A similar abbreviation
will be used in the following.) The convergence of the limit on the right-hand side
is guaranteed by the following estimate: when n < m, we have
ˇ pX
n 1 pm 1
X ˇ
ˇ ˇ
ˇ '.r/n .r/ '.l/m .l/ ˇ
rD0 lD0
ˇ pn 1 pmn 1 ˇ
ˇ X X ˇ
D ˇˇ '.r/n .r/ '.r C p n q/m .r C p n q/ ˇ
rD0 qD0
0 1
ˇ pX
n 1 pmn 1
X ˇ
ˇ @ ˇ
Dˇ .'.r/ '.r C p n q// m .r C p n q/A ˇ
rD0 qD0
in t is a continuous function on Zp .
To D fn g 2 M.Zp ; Op / we associate f 2 Op ŒŒX in the following manner.
n
Set D Op ŒŒX , n D ..1 C X /p 1/ and consider the projective system
f.=n ; $n /g by the natural map $n W =n ! =n1 . Define fn .X / 2
=n by
pn 1 pn 1 r
! pn 1
X XX r X
fn .X / D n .r/.1 C X / D r
n .r/ X D
k
cn;k X k :
rD0 rD0 kD0
k
kD0
pn 1
!
X r
cn;k D n .r/ :
rD0
k
Since we have
0 1
pn 1
X
.$n fn /.X / D $n @ n .r/.1 C X /r A
rD0
186 11 p-adic Measure and Kummer’s Congruence
0 1
pn1 1 p1
X X 0
D $n @ n .r 0 C p n1 l/.1 C X /r .1 C X /p A
n1 l
r 0 D0 lD0
pn1 1
X 0
D n1 .r 0 /.1 C X /r
r 0 D0
D fn1 .X /;
the system .fn / is an element in the projective limit lim =n . Now we have the
isomorphism
where
pn 1
!
X r
cm D lim n .r/
n!1
rD0
m
Z !
x
D d.x/:
Zp m
1 X r
n .r/ D f . 1/ .r 2 Xn /; (11.1)
pn pn
D1
the sum running over all p n -th roots of 1. Since j 1j < 1, f . 1/ converges.
For each m 0, we have
!
1 X r 1 X X r m
m
. 1/ D n m
.1/mj j
pn pn p pn j D0 j
D1 D1
!
X m
D .1/mj :
j
0j m
j r mod pn
11.1 Measure on the Ring of p-adic Integers and the Ring of Formal Power Series 187
is convergent and the value is in Op . To check the distribution property (ii) of the
measure, we need to calculate the following value:
X X
nC1 .y/ D nC1 .x C p n a/
y2XnC1 ; nC1 .y/Dx a mod p
0 1
1 X X
D @ .xCp
n a/
A f . 1/:
p nC1 a mod p
p nC1 D1
n
a mod p p D1
nC1
p D1
so we have
X
nC1 .y/ D n .x/
y2XnC1
nC1 .y/Dx
188 11 p-adic Measure and Kummer’s Congruence
pn 1
!
X r
ck0 D lim n .r/
n!1
rD0
k
1 pn 1
! !
X X r X m
D lim cm .1/mj :
n!1
mD0 rD0
k 0j m
j
j r mod p n
0
We fix k. To calculate the coefficient
r of cm in the expression of ck in the right-hand
side above, we fix m. We have k D 0 for k > r so we may assume that k r.
Taking n big enough, we assume that m < p n . Then, if j r mod p n for some j
with 0 j m, we have j D r since we also have 0 r p n 1 by definition.
So we may assume that k r D j m. So the coefficient of cm is given by
! ! ! !
X
m
r m X mk m
mk
1 if m D k;
.1/ mr
D .1/mki D
k r i k 0 if m ¤ k:
rDk i D0
Hence we have ck0 D ck . So we have fQ D f and two mappings are inverse with
each other and we see that the set M.Zp ; Op / of Op -valued measures and the space
of formal power series Op ŒŒX are bijective.
More precisely, we can introduce a product for both spaces and show that these
are isomorphic as Op algebras, as given in the following theorem whose complete
proof is omitted (see e.g. Lang [66, Ch.4]).
For two measures ; 2 M.Zp ; Op /, we define an Op -valued function ./n
on Xn by
pn 1
X
. /n .x/ D n .y/n .x y/ .x 2 Xn /: (11.2)
yD0
1
X
P
M.Zp ; Op / 3 D fn g 7! f .X / D cm X m 2 Op ŒŒX :
mD0
11.1 Measure on the Ring of p-adic Integers and the Ring of Formal Power Series 189
Here, cm is determined by :
Z !
x
cm D d.x/;
Zp m
1 X x
n .x/ D f . 1/:
pn pn
D1
If this is the case, the coefficients an are uniquely determined by ' and given by
!
X
n
n
an D .1/ nk
'.k/:
k
kD0
P1 x
So if we replace '.x/ by the expansion '.x/ D mD0 am m in Theorem 11.4, we
have
1 1
!
X X X
m
m
n .x0 / D am cm D cm .1/ mk
'.k/:
mD0 mD0
k
kD0
1
Kurt Mahler (born on July 26, 1903 in Krefeld, Prussian Rhineland—died on February 25, 1988
in Canberra, Australia).
190 11 p-adic Measure and Kummer’s Congruence
n
Now, for any with p D 1, we have
1 1
!
X X
mXm
cm . 1/ D m
cm .1/mk k :
mD0 mD0
k
kD0
This is obvious for x; x 0 2 N, and the general case for x; x 0 2 Zp follows from the
fact that the set N of natural numbers is dense in Zp .
In the following, we list several properties of measures and corresponding power
series, which will be used later.
Property (1). Let z 2 P. If corresponds to f (i.e. P D f ), then
Z
f .z/ D .1 C z/x d.x/:
Zp
In particular, by putting z D 0,
Z
f .0/ D d.x/:
Zp
11.1 Measure on the Ring of p-adic Integers and the Ring of Formal Power Series 191
1
X
Proof. Writing f .X / D cn X n , we have by Theorem 11.3
nD0
Z 1 Z !
X x n
.1 C z/ d.x/ D
x
z d.x/
Zp Zp nD0 n
1 Z !
X x
D zn d.x/
nD0 Zp n
1
X
D cn zn D f .z/:
nD0
t
u
We call the map
from C.Zp ; Op / to Op a bounded linear functional on
C.Zp ; Op / if the following conditions (i), (ii) are satisfied:
(i) For any '; ' 0 2 C.Zp ; Op / and any a; b 2 Op ,
(ii) There exists a positive constant M > 0 such that for any ' 2 C.Zp ; Op /,
j .'/j M k'k:
j
.'/j
k
k D sup :
'2C.Zp ;Op / k'k
'¤0
n .x/ D .'x; n /;
1 X
.Uf /.X / D f .X / f ..1 C X / 1/: (11.4)
p p
D1
Since
1 X
..1 C X / 1//l 2 Zp ŒX
p p
D1
X
O ./ D 1 .a/ a ;
p
a2Z=pZ
X Z x
D O ./ 1 C ..1 C z/ 1/ df .x/
p D1 Zp
X X
D O ./f ..1 C z/ 1/ D f .z/ 1 f ..1 C z/ 1/:
p p
p D1 D1
1
!
X x
D .1 C 1/ D
x x
. 1/n :
nD0
n
d
D D .1 C X /DX ; where DX D :
dX
194 11 p-adic Measure and Kummer’s Congruence
Property (3). For f 2 Op ŒŒX , the power series corresponding to the
measure xf is Df . Hence the power series corresponding to the measure
x k f .k natural number/ is D k f and the equalities
Z Z
x k df .x/ D dD k f .x/ D .D k f /.0/
Zp Zp
hold.
Proof. It is enough to show this when k D 1. Let g 2 Op ŒŒX be the power series
corresponding to the measure xf . By Property (1), we have for z 2 P
Z
g.z/ D x.1 C z/x df .x/:
Zp
P1 P1
Put f .X / D nD0 an X n ; g.X / D nD0 bn X n . Using
! ! !
X X X
X D .n C 1/ Cn
n nC1 n
D .n C 1/anC1 C nan :
This gives g D Df . t
u
In general, for a power series f .X /, we define a new power series f .Z/ in Z
by setting X D e Z 1:
we have
X1
an Z n
f .Z/ D e aZ D :
nD0
nŠ
since
DZ f .Z/ D .1 C X /DX f .X / D Df .X /:
The next property is the basis of the fact that the isomorphism P in Theorem 11.3
is an Op algebra isomorphism.
Property (4). Let the measures ; correspond respectively to the power series
f; g 2 Op ŒŒX (i.e., D f ; D g ). Then the power series corresponding to
the convolution is fg:
f g D fg :
1 X r
n .r/ D f . 1/;
pn pn
D1
1 X kCr
n .k r/ D n g. 1/:
p pn
D1
pn 1
X 1 X 1 X kCr
. /n .k/ D n
r f . 1/ n g. 1/
p pn p pn
rD0 D1 D1
p 1 n
1 XX k 1 X
D n f . 1/g. 1/ n .=/r
p p rD0
196 11 p-adic Measure and Kummer’s Congruence
1 X
D f . 1/g. 1/ k
pn
D fg; n .k/:
Here and run through all p n -th roots of 1. From this, Property (4) follows. t
u
We define a specific measure called the Bernoulli measure. Recall that the first
Bernoulli polynomial is by definition equal to
1
B1 .x/ D x :
2
In the following, p denotes an odd prime. For each natural number n and x 2 Xn D
Z=p n Z, set
x
En .x/ D B1 ;
pn
Z
Bk .1/k
D x k1 dEc :
k 1 ck Zp
Proof.(1) Since c 2 Z
p , we see fc 2 Zp ŒŒX , the first two terms of fc .X / being
c1 1 c2
fc .X / D C X C :
2 12
1 X r
n .r/ D fc . 1/
pn pn
D1
X
1 1 1 c
D fc .0/ C n r c :
pn p n 1 1
p D1; ¤1
n
Now we use Lemma 8.5 on p. 110. For p D 1; ¤ 1 and f D p n , the lemma
gives
f 1
1 1 X cj
D j
c 1 f j D1
cl c.f 1/
D
f 2f
1
c k c c
Dc n
C
p 2 2f
and by substituting this into the formula for n .r/ above and noting that fc .0/ D
.c 1/=2, we have
1
c1 r 1 1 c r c c
n .r/ D C C c C
2f pn 2 2f pn 2 2f
1
r 1 c r 1
D n
c n
:
p 2 p 2
By the definition of the Bernoulli measure, we conclude n .r/ D Ec;n .r/, i.e.,
D Ec and the power series corresponding to Ec is fc .
The proof of (2) goes as follows. By Property (3) and Eq. (11.6) we have
198 11 p-adic Measure and Kummer’s Congruence
Z
x k1 dEc D .D k1 fc /.0/ D .DZk1 fc /.0/:
Zp
1 c
fc .Z/ D fc .e Z 1/ D
e Z 1 e cZ 1
1
X Z n1
D .1 c n /.1/n Bn ;
nD1
nŠ
so we have
Bk
.DZk1 fc /.0/ D .1 c k /.1/k
k
and thus
Z
Bk
x k1 dEc D .1 c k /.1/k :
Zp k
Bm Bn
.1 p m1 / .1 p n1 / mod p a :
m n
To prove this, we need the following integral expression of the Bernoulli number,
a refined version of Proposition 11.5 (2).
Proposition 11.7. Let k be a positive even integer and take c 2 Z
p . Then we have
Z
Bk
.1 c k /.1 p k1 / D x k1 dEc :
k Z
p
11.3 Kummer’s Congruence Revisited 199
1 X
g.X / D Ufc .X / D fc .X / fc ..1 C X / 1/:
p p
D1
Here, since
1 X 1 1
D p ;
p p X 1 X 1
D1
we get
1 c 1 c
g .Z/ D cZ cpZ
e 1 e 1
Z e 1 e
pZ 1
1
X 1
X
k Bk Bk
D .1 c /.1/
k
Z k1 .1 c k /.1/k .pZ/k1
kŠ kŠ
kD0 kD0
1
X Bk k1
D .1 c k /.1 p k1 /.1/k Z :
kŠ
kD1
Proof of Theorem 11.6. The first assertion is already given in Theorem 3.2, but we
give here an alternative proof for that too. Since we assumed m 6 0 mod p 1, we
can take c 2 Z such that .c; p/ D 1 and c m 6 1 mod p. For instance one may take
a primitive root mod p. From the proposition above, we have
Z
Bn
.1 c n /.1 p n1 / D x n1 dEc
n Z
p
and
Z
Bm
.1 c /.1 p
m m1
/ D x m1 dEc :
m Z
p
The assumption m n mod .p 1/p a1 gives c nm 1 mod p a , and since we
assumed .1c m ; p/ D 1, we have also .1c n ; p/ D 1. Since Ec is an Op measure,
the above integral values are in Op and we see that Bn =n and Bm =m 2 Z.p/ . Since
x m1 x n1 mod p a if x 2 Z p , and since Ec is an Op -valued measure, we have
n1 Bn m1 Bm
.1 c / .1 p /
n
.1 p / 2 p a Op :
n m
Bn Bm
.1 p n1 / .1 p m1 / 2 p a Zp :
n m
This proves the theorem. t
u
Theorem 3.2 is a corollary of Theorem 11.6. Indeed, if a < m n, then by
Theorem 11.6, we have
Bm Bn
.1 p m1 / .1 p n1 /
m n
Bm Bn Bn nm
D .1 p m1 / C .p 1/p m1
m n n
0 mod p a :
Bm Bn
0 mod p a :
m n
11.3 Kummer’s Congruence Revisited 201
p 1
m
1 X n
lim m i :
m!1 p
i D0
p 1m
1 X
lim m f .i /
m!1 p
i D0
is sometimes referred to as the Volkenborn integral of f over Zp . See [94, 95] for
details.)
Chapter 12
Hurwitz Numbers
r 2 D cos.2/;
and is analogous to the similar quantity for the case of the circle:
Z 1
dx
D2 p :
0 1 x2
1
Niels Henrik Abel (born on August 5, 1802 in Frindoe, Norway—died on April 6, 1829 in Froland,
Norway).
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 203
in Mathematics, DOI 10.1007/978-4-431-54919-2__12, © Springer Japan 2014
204 12 Hurwitz Numbers
p
Let }.z/ be the Weierstrass }-function with periods $ and $ 1 defined by
X
1 1 1
}.z/ D 2 C 2 : (12.1)
z p .z
/ 2
2Z$ CZ$ 1
¤0
and
} 00 .z/ D 6}.z/2 2:
which can be obtained from the expansion of cot given in Proposition 1.17 by
differentiation, since cot0 .x/ D sin21.x/ . In a similar manner to the proof of
Proposition 1.15, namely by comparing the coefficients of
we have
1
H4 D ;
10
and the recursion
!
X
n1
4n
.2n 3/.4n 1/.4n C 1/H4n D3 .4i 1/.4n 4i 1/ H4i H4.ni /
i D1
4i
for n 2. In particular, we see from this that H4n is a positive rational number.
Several values are in Table 12.1.
The analogy between Hurwitz and Bernoulli numbers is more apparent when we
look at the special values of zeta functions. Namely, first we rewrite the formula
(4.4) as
X 1 2n
n1 .2/
D .1/ B2n .n 1/:
r2Z
r 2n .2n/Š
r¤0
Note the sum on the left runs over all non-zero integers. If we replace the rational
integers here by Gaussian integers, we have the formula
X 1 .2$/4n
D H4n :
p
4n .4n/Š
2ZCZ 1
¤0
To deduce this, expand the right-hand side of the definition (12.1) of }.z/ by using
2
1
.z
/2
D
12 C
2z3 C 3z
4
C and compare with (12.2).
Hurwitz proved a Clausen–von Staudt type theorem for Hn . To state this, we
need the following classical theorem of Fermat and Euler.
Theorem 12.1. Any prime number congruent to 1 modulo 4 can be written as a
sum of two squares essentially in the unique way.
Various proofs of this theorem are known since Euler. See [50] or [77] for some
of the standard proofs. For Zagier’s “One-sentence proof”, which simplifies a proof
by Heath-Brown, see [108].
Suppose now that p is a prime number congruent to 1 modulo 4 and write
p D a2 C b 2 :
206 12 Hurwitz Numbers
a b C 1 .mod 4/:
(Note that the right-hand side is independent of the choice of the sign of b since b
is even.) These conditions uniquely determine a and so we denote it by ap to make
the dependence on p explicit. For instance, we have
5 D .1/2 C 22 ; 13 D 32 C 22 ; 17 D 12 C 42 ;
so
X
4n
1 .2ap / p1
H4n D C C Gn
2 p1 mod 4
p
p1j4n
with an integer Gn . In the sum, p runs over prime numbers congruent to 1 modulo
4 such that p 1 divides 4n.
As examples of the theorem, we have
1 1 .2/
H4 D D C ;
10 2 5
3 1 .2/2
H8 D D C 1;
10 2 5
567 1 .2/3 6
H12 D D C C C 5;
130 2 5 13
43659 1 .2/4 2
H16 D D C C C 253;
170 2 5 17
392931 1 .2/5
H20 D D C C 39299;
10 2 5
1724574159 1 .2/6 62
H24 D D C C C 13265939:
130 2 5 13
Hurwitz also proved that every Gn for n > 1 is odd and determined its class
modulo 8. The proof [44] of Theorem 12.2 is not as elementary as the original
Clausen–von Staudt theorem. It uses the complex multiplication of the }-function
12.2 A Short Biography of Hurwitz 207
as well as the notion of Hurwitz integral series (cf. Chap. 7). See [27, 58] for a
modern interpretation of the theorem and Kummer-type congruence for Hn .
Adolf Hurwitz was born on March 26, 1859 in Hildesheim, Germany.2 His teacher
at the Gymnasium was Schubert,3 who is famous for his enumerative geometry
(“Schubert calculus”). Impressed by Hurwitz’s genius, Schubert came every Sunday
as a house tutor and persuaded Hurwitz’s father, who was not at all affluent, to
send his son to the university to study mathematics. Already during his time at
the Gymnasium, while he was still 17, Hurwitz wrote a joint paper with Schubert
on enumerative geometry. In the spring of 1877, he entered Munich Technical
University, where Klein4 was. From the fall of the same year until the spring
of 1879, he attended the lectures of Kummer, Weierstrass, and Kronecker at
Berlin University. Returning once again to Munich, and then moving to Leipzig
together with Klein, he obtained his doctoral degree there with research on modular
functions. He became a “Privatdozent” at Göttingen University in 1882 and then, on
the recommendation of Lindemann,5 a professor at Königsberg University in 1884.
Hilbert6 and Minkowski7 were among the students there. The exchanges among the
three of them in those early days and the influence which this had on Hilbert are
vividly described in [78]. In 1892, he moved to the ETH in Zürich as a successor
of Frobenius,8 a position he was to hold until his death. At the same time he was
offered a professorship at Göttingen as the successor of Schwarz, who moved to
Berlin, but he had already accepted the position at Zürich (the director at Zürich at
that time, Bleuler, went especially to Königsberg to make the contract).
Hurwitz’s health was not good. He got typhoid fever twice and suffered from
frequent migraines. And in 1905, one of his kidneys had to be removed. In the midst
of all this, he continued to do research and published in a large number of areas.
2
The description in this section is based on Freudenthal [34], Hilbert [41], and Pólya [76].
3
Hermann Cäsar Hannibal Schubert (born on May 22, 1848 in Potsdam, Germany—died on July
20, 1911 in Hamburg, Germany).
4
Felix Christian Klein (born on April 25, 1849 in Düsseldorf, Prussia (now Germany)—died on
June 22, 1925 in Göttingen, Germany).
5
Carl Louis Ferdinand von Lindemann (born on April 12, 1852 in Hannover, Hanover (now
Germany)—died on March 6, 1939 in Munich, Germany) who proved in 1882 that is a
transcendental number.
6
David Hilbert (born on January 23, 1862 in Königsberg, Prussia (now Kaliningrad, Russia)—died
on February 14, 1943 in Göttingen, Germany).
7
Hermann Minkowski (born on June 22, 1864 in Alexotas, Russian Empire (now Kaunas,
Lithuania)—died on January 12, 1909 in Göttingen, Germany).
8
Ferdinand Georg Frobenius (born on October 26, 1849 in Berlin-Charlottenburg, Prussia (now
Germany)—died on August 3, 1917 in Berlin, Germany).
208 12 Hurwitz Numbers
His articles have been collected and comprise two volumes. As well as the Hurwitz
zeta functions and Hurwitz numbers treated in this book, there are many other topics
which carry his name, such as the Hurwitz class number relations, Hurwitz’s genus
formula, the Hurwitz estimates for the size of automorphism groups of Riemann
surfaces and his investigations on the quaternions and other algebras. And according
to Hilbert, continued fractions were also a favorite topic of Hurwitz. (He wrote
approximately five articles on the subject.)
Hurwitz seems to have been an exceptionally modest and courteous man.
According to Pólya,9 who was the main editor of his collected works, he never
failed to tip his hat in greeting even to service people whom he met on the street.
Moreover, he was so skilled at the piano that in his youth he hesitated between a
career as a mathematician or as a pianist. He died on the 18th of November, 1919,
at the age of 60.
Exercise 12.3. Deduce the recursion of Hn given in the text from (12.3).
Exercise 12.4. Compute the residues 2Hn mod 16 for first several n, and find the
pattern. (For a proof, see [44].)
9
George Pólya (born on December 13, 1887 in Budapest, Hungary—died on September 7, 1985 in
Palo Alto, USA).
Chapter 13
The Barnes Multiple Zeta Function
1
Mathias Lerch (born on February 20, 1860 in Milinov, Bohemia (now Czech Republic)—died on
August 3, 1922 in Susice, Czechoslovakia (now Czech Republic)).
2
Takuro Shintani (born on February 4, 1943 in Kita-kyushu, Japan—died on November 14, 1980
in Tokyo, Japan).
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 209
in Mathematics, DOI 10.1007/978-4-431-54919-2__13, © Springer Japan 2014
210 13 The Barnes Multiple Zeta Function
This zeta function converges absolutely for Re.s/ > r. This kind of zeta function
was systematically studied by Barnes3 [11, 12].
For a while, we assume Re.s/ > r. We see that .s; w; a/ has an integral
representation:
1 1 Z 1
1 X X
.s; w; a/ D t s1 e .aCm1 w1 CCmr wr /t dt
.s/ m D0 m D0 0
1 r
Z 1 1
X 1
X
1
D t s1 e .aCm1 w1 CCmr wr /t dt
.s/ 0 m1 D0 mr D0
Z 1
1 e at
D t s1 dt
.s/ 0 .1 e w1 t / .1 e wr t /
Z 1
1 e .w1 CCwr a/t
D t s1 dt:
.s/ 0 .e w1 t 1/ .e wr t 1/
Since the series .s; w; a/ converges absolutely for Re.s/ > r, by virtue of
Lebesgue’s dominant convergence theorem for the Lebesgue4 integral, the above
calculation is justified for the same range of s. When r D 1, this is essentially the
Hurwitz zeta function. Namely, for w1 > 0, we have
a
.s; w1 ; a/ D ws
1 s; :
w1
Now, a multiple zeta-function .s; w; a/ as well as the Hurwitz zeta function, can be
represented by a contour integral. Indeed, taking " > 0 small enough and denoting
3
Ernest Barnes (born on April 1, 1874 in Birmingham, England—died on November 29, 1953 in
Sussex, England).
4
Henri Léon Lebesgue (born on June 28, 1875 in Beauvais (Oise), France—died on July 26, 1941
in Paris, France).
13.1 Special Values of Multiple Zeta Functions and Bernoulli Polynomials 211
s D r; r 1; : : : ; 0; 1; 2; : : :
of e 2 i s 1 which are not in the range of absolute convergence. But .s/ has poles
of order 1 at integer points less than or equal to 0, so points s D 1 m .m 2 N/
cannot appear as poles and .s; w; a/ is holomorphic there.
Let us look for special values of .s; w; a/ at integer points s D 1 m .m 2 N/.
For any integer k 0, and for any fixed a and w, we define constants Ck .w; a/ by
X 1
e at t r
Qr D Ck .w; a/t k ;
i D1 .e w i t 1/
kD0
a D a1 w1 C a2 w2 C C ar wr .a1 ; a2 ; : : : ; ar 2 R/;
we have
.1 m; w; a/
0 1
X Y r
Bmj .aj /
D .1/r .m 1/Š @ A w1m1 1 wrmr 1
j D1
m j Š
m1 C Cmr DmCr1
m1 0; :::; mr 0
Proof. The proof is based on the same method as the one used for looking at
special values of the Hurwitz zeta functions. In the same way as in the proof of
Proposition 9.3, we have
Z
1 e .w1 CCwr a/t
.1 m; w; a/ D lim t m dt
s!1m .s/.e 2 i s 1/ I."/ .e w1 t 1/ .e wr t 1/
212 13 The Barnes Multiple Zeta Function
Z Y r
.1/m1 .m 1/Š e .1aj /wj t
D t m dt:
2 i I."/ j D1
e wj t 1
Now by using the generating function of Bernoulli polynomials (p. 56, (4.2)) and
Proposition 4.9 (4), the Laurent expansion of the integrand is obtained as follows:
Y r X1 X1 Y r
m e .1aj /wj t m .wj t/mj 1
t Dt Bmj .1 aj /
j D1
e wj t 1 m D0 m D0 j D1
mj Š
1 r
0 1
1 1 m 1
X X Y
r
.1/mj Bmj .aj /wj j
D @ A t m1 C Cmr rm :
m1 D0 mr D0 j D1
mj Š
Y
r
.wi t/e ai wi t t r e at
D D .w1 wr / Qr ;
i D1
e wi t 1 i D1 .e
wi t 1/
Now we set w D .w1 ; w2 /. It will be a very interesting problem to study what kind
of zeta functions appear if we cut out the contour integral representation (13.1) of
the Barnes double zeta function .s; w; a/, in the same way as we did when we
obtained the functional equation of the Hurwitz zeta functions (Theorem 9.4).
In order to answer this problem, we introduce a new Dirichlet series. For each
real algebraic irrational number 5 ˛, let us consider the following Dirichlet series:
5
A real algebraic number which is not a rational number.
13.2 The Double Zeta Functions and Dirichlet Series 213
X1
cot n˛
.s; ˛/ D :
nD1
ns
Since cot.x/ is unbounded, we need to show that this converges when Re.s/ is big
enough.
By virtue of Roth’s theorem on Diophantine approximation in transcendental
number theory, the following things are known. Assume that ˛ is an algebraic
irrational number. When we take any positive number ", there exists a positive
constant C."/ such that
ˇ m ˇˇ
ˇ
ˇ˛ ˇ > C."/n2" (13.2)
n
for arbitrary integers m; n (n > 0). Now for a real number x, we use the notation
hhxii for the distance to the nearest integer from x. We have 0 hhxii 1=2.
Using this notation, for any integer n, (13.2) is expressed as
1 1
j cot n˛j <
sin hhn˛ii 2hhn˛ii
1
< n1C" :
2C."/
as a majorant series of .s; ˛/, and it converges for > 2 C ". Since we can take
arbitrary ", .s; ˛/ converges for6 Re.s/ > 2, and moreover it converges uniformly
on compact sets, so it is a holomorphic function of s in that range.
We denote by SL2 .Z/the modular group which appeared before (p. 76). If we
ab
let an element V D of SL2 .Z/ act on a real algebraic irrational number ˛ by
cd
6
In fact, if we use Roth’s theorem more efficiently, we can show absolute convergence for Re.s/ >
1 (see [6]).
214 13 The Barnes Multiple Zeta Function
a˛ C b
V˛ D ;
c˛ C d
In the former notation, we have 2 .s; !; a/ D .s; .1; !/; a/. We put
.2/s
.s/ D :
.s/ sin.s=2/
The zeta function 2 .s; !; a/ converges absolutely for Re.s/ > 2, is holomorphic
there, and is continued analytically to a meromorphic function on the whole s-plane,
and its poles are s D 1; 2 and of order 1.
Using the contour integral representation (13.1), the principal part of the Laurent
expansion at s D 1 is given by
1C! a 1
2 .s; !; a/ D C : (13.3)
2! ! s1
Proposition
13.2.
Let ˛ > 0 be a real algebraic irrational number. We put
n 1
V D .n 2 Z/. Then for the range Re.s/ > 2, we get the following
1 0
transformation formula:
Proof. We start from the contour integral representation of the double zeta function
2 .s; ˛; 1/ given by
Z
1 e ˛t
2 .s; ˛; 1/ D t s1 dt: (13.5)
.s/.e 2 i s 1/ I.";1/ .e t 1/.e ˛t 1/
In the same way as in the proof of Theorem 9.4, shifting the contour I."; 1/ to
the left to C.t0 / and using the residue theorem, we can write
Z
1 e ˛t
2 .s; ˛; 1/ D t s1 dt
.s/.e 2 i s 1/ C.t0 / .e t 1/.e ˛t 1/
!
X .2 i n/s1 .2 i n=˛/s1 1
2 i 2
C 2 i n=˛ ;
1e i n˛ e 1 ˛
n2Zf0g
1 1 1 1 1 n 1
2
D cot n˛ C and D cot ;
1e i n˛ 2i 2 e 2 i n=˛ 1 2i ˛ 2
we have
2 .s; ˛; 1/
2 i.2/s1 e i s=2 .1 C e i s / s 1
D .1 s; ˛/ C ˛ 1 s;
.s/.e 2 i s 1/ 2 ˛
e i s=2 .1 e i s / s
C .1 ˛ /.1 s/ :
2i
The transformation up to here is valid for Re.s/ < 1. Substituting 1 s for s and
using .s/ .1 s/ D = sin s, in the range Re.s/ > 2 we have
2 .1 s; ˛; 1/
s e i s=2 .1 e i s / 1
D .2/ .s/ sin s .s; ˛/ C ˛ s;
s1
1 e 2 i s ˛
i e i s=2 .1 C e i s /
C .1 ˛ s1 /.s/
1 e 2 i s
1 1
D .s; ˛/ ˛ s;
s1
C .1 ˛ / cot.s=2/.s/ ;
s1
.s/ ˛
Therefore, noting .˛ C n; s/ D .˛; s/, we obtain the assertion of the theorem.
t
u
The transformation formula (13.4) can be extended to the following form. For
x 2 R, we denote by fxg the real number which satisfies x fxg 2 Z, 0 fxg < 1
(fractional part of x).
ab
Theorem 13.3. For a real algebraic irrational number ˛ and V D 2
cd
SL2 .Z/ which satisfies c > 0 and c˛Cd > 0, the following transformation formula
holds in the range Re.s/ > 2.
Here we put D c˛ C d and j runs over integers which represent residue classes
modulo c, and for each j mod c, we put xj D 1 fjd=cg, yj D fj=cg.
Proof. See [6] and
[7].
n 1
If we put V D in this theorem, we get the transformation formula in
1 0
Proposition 13.2. t
u
From now on, we always
assume that ˛ is a real quadratic irrational number.
ab
Then there exists V D 2 SL2 .Z/ which satisfies the conditions
cd
V ˛ D ˛; c˛ C d > 0: (13.6)
This is easily understood if we think as follows. Let F be the real quadratic field
Q.˛/. Let L D Z˛ C Z be a lattice generated by ˛ and 1 and O.L/ be an order
associated to L, that is, O.L/ D fu 2 F j uL Lg. We denote by E.L/ the group
of all totally positive units of O.L/. We take an element of E.L/ such that ¤ 1.
Since 2 L and ˛ 2 L, if we define a 2 2 matrix V with coefficients in Z by
˛ ˛ ab
DV ; V D ;
1 1 cd
˛ ˛ ab
DV ; V D 2 SL2 .Z/
1 1 cd
.s/ X
.s; ˛/ D cot.s=2/.s/ C 2 .1 s; ; xj C yj /: (13.7)
1 s1
j mod c
Z; n ¤ 0/, and s D 0. For s D 2; 4; 6; : : : we have .s/ D 0, so we note that
these are not poles. As for s D 2, as we remarked in the footnote on p. 213, .s; ˛/
converges absolutely for Re.s/ > 1 and is holomorphic there, so this cannot be a
pole. Now we consider s D 0. Taking the result that the principal part of 2 .s; !; a/
at the pole s D 1 is given by (13.3) and the fact .0/ D 12 into consideration, the
residue of the function .s; ˛/ at s D 0 (possible pole of order at most 1) is given by
X
1 2 1 C xj C yj
1
:
.1 / j mod c 2
.1/m1 .2/2m1 X
.2m 1; ˛/ D 2 .2 2m; ; xj C yj /
.1 2m2 /.2m 2/Š j mod c
7
When ˛ is a unit of a real quadratic field, this kind of special value has been calculated by Berndt
[14].
218 13 The Barnes Multiple Zeta Function
1
ˇ D b1 .bj 2 Z; b2 ; b3 ; : : : 2/:
1
b2
1
b3
b4 :
::
It is important that all the coefficients bj (j 2) except for the first one are integers
bigger than or equal to 2. Here the sequence fb1 ; b2 ; : : :g is periodic, namely, there
exist some natural numbers and r such that for j not less than , we have
brCj D bj .j /:
We call the smallest such natural number r the period of ˇ. We write this continued
fraction expansion as
! D ŒŒ n1 ; n2 ; : : : ; nr .nj 2/:
Then we have
1
D ŒŒ nr ; nr1 ; : : : ; n1
!
[107, p. 136, (17)]. By this expression, we have
8
The continued fractions treated here are different from the usual continued fractions of real
quadratic irrational numbers.
13.3 .s; ˛/ and Continued Fractions 219
F ./ D .; / 2 R R;
! D ŒŒ n1 ; n2 ; : : : ; nr :
1
!i D ni D V .ni /!i C1 ;
!i C1
so we have
L D Z! C Z
and take a generator " of the group E.L/ of totally positive units of O.L/ so that
" > 1. We define V by the relation
9
This is different from the usual definition of reduced for the setting of ordinary continued
fractions. Usually a real quadratic irrational number ˛ is called reduced when ˛ > 1 and
0 > ˛ > 1.
220 13 The Barnes Multiple Zeta Function
! ! ab
" DV ; V D 2 SL2 .Z/:
1 1 cd
For any V0 2 SL2 .Z/ and a real algebraic irrational number ˛, we use the notation
of an automorphy factor
J.V0 ; ˛/ D c˛ C d: (13.11)
As in the case when ˛ is in the complex upper half plane, this has a property
J.V1 V2 ; ˛/ D J.V1 ; V2 ˛/J.V2 ; ˛/ for any V1 , V2 2 SL2 .Z/. Then using this
property of the automorphy factor (13.11), we have
D !2 !3 !r !:
So we have
" D !1 !2 !r : (13.12)
D Zi C Zi C1
X
L .s/ D s : (13.14)
T
2L C.1;"/
As we shall see later, L .s/ is written as a finite sum of double zeta functions
explained before, so it converges absolutely for Re.s/ > 2 and is holomorphic in
that range. Let RC be the set of positive numbers. The group of totally positive units
E.L/ acts on RC RC by
[
r
C.1; "/ D fxk1 C yk j x > 0; y 0g .disjoint union/;
kD1
so by (13.13) we get
[
r
L \ C.1; "/ D fmk1 C nk j m 2 Z>0 ; n 2 Z0 g .disjoint union/: (13.15)
kD1
This decomposition was given in Zagier [105]. Here we denote by Z>0 or Z0 the set
of positive integers, or integers not less than 0, respectively. By the decomposition
(13.15), we get
X
r
L .s/ D .k1 /s 2 .s; !rC2k ; 1/: (13.16)
kD1
By this expression, L .s/ converges absolutely for Re.s/ > 2 and is continued
analytically to a meromorphic function which is holomorphic in the whole s-plane
except for s D 1; 2.
Between the functions .s; !/ and L .s/, the following functional equation
holds.
Theorem 13.5. Let ! be a reduced number of F . Put L D Z C Z! and let " > 1
be a totally positive fundamental unit of the order O.L/ of L. Then we have
.s/
.s; !/ D cot.s=2/.s/ L .1 s/: (13.17)
"s11
222 13 The Barnes Multiple Zeta Function
We can also show through this equation that .s; !/ can be continued analytically
to a meromorphic function of s.
Proof. Since we have now !i D V .ni /!i C1 and !i > 0, by Proposition 13.2, we
have an equality
for 1 k r. Multiplying .k1 /s1 by both sides of this equality, then taking the
sum from k D 1 to k D r and using (13.12) and (13.16), we get
t
u
Remark 13.6. If we cut out the contour integral representation, not only of a double
zeta function, but also of a multiple zeta function in the same way as in Sect. 13.2,
then a kind of Lerch type zeta function appears. It would be an interesting problem
to investigate what kind of modular properties it has.
Chapter 14
Poly-Bernoulli Numbers
X 1
Lik .1 e t / .k/ t
n
D B :
1 e t nD0
n
nŠ
X1
tn
Here, for any integer k, Lik .t/ stands for the formal power series .
nD1
nk
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 223
in Mathematics, DOI 10.1007/978-4-431-54919-2__14, © Springer Japan 2014
224 14 Poly-Bernoulli Numbers
.k/
Table 14.1 Bn .0 k 5; 0 n 7/
k\n 0 1 2 3 4 5 6 7
0 1 1 1 1 1 1 1 1
1 1 1
2
1
6
0 30
1
0 1
42
0
2 1 1
4
36
1
24
1 7
450
1
40
2205
38
168
5
3 1 1
8
216
11
288
1 1243
54000
7200
49
3704400
75613 599
35280
4 1 1
16
1296
49 41
3456
26291
3240000
144000
1921 845233
1555848000
1048349
59270400
5 1 1
32
7776
179 515
41472
194400000
216383
25920000
183781 4644828197
653456160000
153375307
49787136000
Lik .1 e t /
D
1 e t
Z t Z tk Z tk1 Z t3 Z t2
1 dtk dtk1 dt2
et t dt1 : (14.1)
e 1 0 e tk 1 0 e tk1 1 0 0 e 1 0
t2
P
This shows that, in the case of k 1, the generating function 1
.k/ t n
nD0 Bn nŠ of
.k/
Rpoly-Bernoulli numbers Bn is obtained, starting with 1, by applying the integration
t
0 dt and division by e 1 successively k times, and finally by multiplying e .
t t
Remark 14.2. (1) Our poly-Bernoulli number is different from the “Bernoulli
numbers of higher order” defined by Nörlund [75, Chapter 6]. The generating
t k
function of the Bernoulli numbers of higher order is et 1 and this can be
t
written as a linear combination of derivatives of et 1 . Hence the Bernoulli
number of higher order is essentially a linear combination of classical Bernoulli
.k/
numbers. As is shown below, our Bn can also be expressed in terms of classical
Bernoulli numbers, but we need to take products successively.
(2) There is a very interesting object called “multiple zeta values” (cf. [57,98,109]),
which generalize the values at positive integer arguments of the Riemann zeta
function. The poly-Bernoulli numbers are in some way related to the multiple
zeta values [9].
.k/
Table 14.1 gives some values of Bn for k 0.
.k/
The next two propositions give recurrence formulas for Bn , which can be
derived from the definition and the iterated integral expression of the generating
function.
Proposition 14.3. For any k 2 Z and n 0, we have
!
1 X
n1
n
B.k/ D Bn
.k1/
B.k/
:
n
nC1 mD1
m1 m
1
!
X t n
t t
Proof. We differentiate the relation Lik .1 e / D .1 e / which B.k/
n
nŠ nD0
is obtained from the definition by multiplying by 1 e t . By the relation Lik0 .t/ D
1
Lik1 .t/, we have
t
X1 X1
Lik1 .1 e t / t t .k/ t
n
t .k/ t
n1
e D e B C .1 e / B :
1 e t nD0
n
nŠ nD1
n
.n 1/Š
X 1
Lik1 .1 e t / tn
Multiplying by e t on both sides and using t
D Bn.k1/ , we have
1e nD0
nŠ
1
X X1 X1
tn tn .k/ t
n1
Bn.k1/ D B.k/ C .e t
1/ B
nD0
nŠ nD0
n
nŠ nD1
n
.n 1/Š
1
X 1 1
t n X t l X .k/ t n1
D B.k/ C B
nD0
n
nŠ lŠ nD1 n .n 1/Š
lD1
1 1
!
X t n X X
n1
1 .k/
D C
B.k/ BmC1
tn
nD0
nŠ n
nD1 mD0
.n m/ŠmŠ
1
! !
.k/
X X
n1
n .k/ tn
D B0 C n C
B.k/ BmC1 :
nD1 mD0
m nŠ
.k1/ .k/
Hence we have B0 D B0 if n D 0, and if n 1 we obtain
! !
X
n1
n .k/ X
n1
n
Bn.k1/ D B.k/ C B D .n C 1/Bn C
.k/
B.k/
m :
n
mD0
m mC1 mD1
m 1
Proof. We use the iterated integral expression of the generating function, which
shows
Z t
Lik .1 e t / et Lik1 .1 e s /
t
D e s ds:
1e e 1 0
t 1 e s
226 14 Poly-Bernoulli Numbers
nD0 lD0
l C1 l mD0
m nŠ
1
! ! !!
X Xn Xn
.1/l n l tn
D .1/m Bm.k1/
Bnl :
nD0 mD0
l C1 l m nŠ
lDm
n l n nm
Use l m D m nl and replace l ! n l and then m ! n m to obtain
! ! !
X
n X n
.1/l n l
.1/m Bm
.k1/
Bnl
mD0
l C1 l m
lDm
! ! !
X
n
n .k1/ X .1/l
m
m
D .1/m B Bl ;
mD0
m nm nl C1 l
lD0
Proof. The proof goes in the same way as Theorem 2.8. In fact, by using
Proposition 2.6 (7) (p. 30) we have
14.2 Theorem of Clausen and von Staudt Type 227
1
X X1
tn Lik .1 e t / .1 e t /m1
B.k/ D t
D
nD0
n
nŠ 1e mD1
mk
X1 1
X 1
.1/m .e t 1/m .1/m mŠ X n .t/n
D D
mD0
.m C 1/k mD0
.m C 1/k nDm m nŠ
1 !
X Xn
.1/m mŠ mn tn
D .1/ n
:
nD0 mD0
.m C 1/k nŠ
tn
Comparing coefficients of nŠ
on both sides, we obtain the theorem. t
u
p k B.k/
n 1 mod p:
.k/
(2) When p 1 − n, p k1 Bn is in Z.p/ and
⎧
⎪
⎪ 1 n n
⎨ p p 1 2k mod p
⎪ if n 1 mod .p 1/;
p k1 B.k/
n
⎪
⎪ .1/n1 n
⎪
⎩ mod p otherwise:
p p1
.k/
Write m C 1 D ap e ; .a; p/ D 1; e 0. If e D 0, then bn .m/ 2 Z.p/
.k/
and p k1 bn .m/ 0 mod p by the assumption k 2, hence this term can be
discarded. Note that, since the Stirling number mn is an integer, we have
mŠ
ordp .bn.k/.m// ordp :
.m C 1/k
.k/
First assume e 2. We show the congruence p k bn .m/ 0 mod p and further
k1 .k/
P1 h m i
p bn .m/ 0 mod p if p 1 − n. Using ordp .mŠ/ D j D1 pj , we estimate
1
X
mŠ m
ordp D ek
.m C 1/k j D1
pj
e
m ap 1
ek D ek D ap e1 1 ek
p p
p e1 1 ek D .1 C p 1/e1 1 ek
1 C .e 1/.p 1/ 1 ek D .e 1/.p 1/ ek
.e 1/.k C 1/ ek D k C e 1
k C 1:
.k/
mŠ
From this we obtain ordp .mC1/ k k C 1 and thus p k1 bn .m/ 2 Z.p/ .
.k/
Therefore p k bn .m/ 0 mod p holds. If at least one inequality in the above
.k/
chain of inequalities is strict, we get p k1 bn .m/ 0 mod p. If all the inequalities
become equalities, we have e D 2; m C 1 D p 2 , and p D k C 2. In this case, the
.k/
lemma below (the case a D p) shows p k1 bn .m/ 0 mod p if p 1 − n.
Lemma 14.8. Let n and a be natural numbers. We have the congruence
⎧ !
⎪
⎪ c 1
n ⎨ mod p if n D a 1 C c.p 1/ for some c a;
a1
ap 1 ⎪
⎪
⎩
0 mod p otherwise:
n
We use the generating function of the Stirling numbers m (Proposition 2.6 (8)
on p. 30)
X1
n n tm
t D : (14.2)
nDm
m .1 t/ .1 2t/ .1 mt/
1
! 1
!
t ap1 X a C i 1 i.p1/ X a C i 1 a1C.aCi /.p1/
Dt ap1
t D t :
.1 t p1 /a i D0
i i D0
a1
0 Bn
p k1 B.k/
n .n n / mod p:
n
.k/
If in particular n is odd, we have p k1 Bn 0 mod p.
This comes from the congruence
1
John Wilson (born on August 6, 1741 in Applethwaite, England—died on October 18, 1793 in
Kendal, England).
230 14 Poly-Bernoulli Numbers
.1/n1 n Bn
.n n0 / mod p;
p p1 n
2p1 !
X
n l 2p 1
.1/ mŠ m
D .1/ ln
m l
lD1
! !
!
X
p1
2p 1 n 2pl 2p 1 p 2p 1
D .1/l
l C.1/ .2p l/ C.1/
n
pn
l 2p l p
lD1
14.2 Theorem of Clausen and von Staudt Type 231
! !
X
p1
2p 1 n l 2p 1
.1/ l
l C.1/ .2npl n1
Cl /
n
mod p 2 :
l l 1
lD1
Combining
!
2p 1
.1/l1 mod p
l 1
t
u
Proof of Theorem 14.10. The iterated integral expression (14.1) of the generating
P
function of Bn and es s1 D 1 sl
.2/ l
lD0 .1/ Bl lŠ give
1
X Z tX
1
tn et sl
B.2/ D t .1/l Bl ds
nD0
n
nŠ e 1 0 lD0 lŠ
1
X 1
t m1 X t lC1
D Bm .1/l Bl :
mD0
mŠ .l C 1/Š
lD0
n .n 2/
n D Bn1 B1 C B1 Bn1 D
B.2/ Bn1 :
2 4
This also holds for n D 1, and thus (1) is proved.
232 14 Poly-Bernoulli Numbers
(2-1) is clear from the formula in Theorem 14.6. By elementary computation, one
2 in that formula is not an integer are m C 1 D
mŠ
sees that the only cases where .mC1/
8; 9; prime; 2 prime. By Lemma 14.12, one sees that p does not appear in the
.1/m mŠf n g
denominator of .mC1/2m if m C 1 D 2p; (p: prime 5). Next consider the case
of m C 1 D p (p: prime 5). In this case, we have
X !
p1
X
p1
n l p1
.1/ mŠm
D .1/ ln ln mod p:
m l
lD1 lD1
X
p1
l n pBn mod p 2
lD1
holds (cf. Ireland and Rosen [50, Cor. of Prop. 15.2.2]). On the other hand, if n n0
mod p 1; 1 < n0 < p 1, then n0 is also even and by Vandiver2 [93, (63)] (see
also Remark 14.24) we have
X
p1
X
l
1 X
p1
0
X
l
1
ln ln mod p
i D1
i i D1
i
lD1 lD1
Bn0 mod p:
2
Harry Schultz Vandiver (born on October 21, 1882 in Philadelphia, USA, died on January 9, 1973
in Austin, USA).
14.3 Poly-Bernoulli Numbers with Negative Upper Indices 233
This gives
n
.1/m mŠ p .Bn Bn0 / mod p 2 :
m
and (2-2)-(b) follows. As for the 3-order, the terms in the formula of Theorem 14.6
which have 3 in the denominators are
2Š n2 5Š n5 8Š n8
; ; :
32 62 92
Computation using Proposition 2.6 (6) (p. 30) gives (2-3). The determination of the
2-order is similar and is omitted. t
u
By noting the appearance of the condition p j Bnn in (2-2)-(b-1) of Theorem 14.10,
we can state the irregularity of a prime p in terms of the denominator of the di-
.2/
Bernoulli number Bn .
Corollary 14.13. A prime p . 5/ is irregular if and only if there exists an even n
.2/
with p C 1 n 2p 4 such that the denominator of Bn is not divisible by p.
For classical Bernoulli numbers, odd indexed ones are 0 except for B1 , denomi-
nators of even indexed ones are easily determined (Clausen and von Staudt), and
the numerators of even indexed ones are mysterious quantities that are closely
related to the arithmetic of cyclotomic fields. As for the di-Bernoulli numbers,
the odd indexed ones are essentially the classical Bernoulli numbers and the
determination of denominators of even indexed ones requires the numerators of the
classical Bernoulli numbers. Can one expect any interesting arithmetic concerning
the numerators of the di-Bernoulli numbers? We do not know.
.k/
When k is 0 or negative, the poly-Bernoulli number Bn is a positive integer
.k/
(Table 14.2). We give yet another formula for Bn in this case, and introduce
.k/
two different combinatorial interpretations of Bn for non-positive k which were
discovered recently.
We have the following formula.
234 14 Poly-Bernoulli Numbers
.k/
Table 14.2 Bn .5 k 0; 0 n 7/
knn 0 1 2 3 4 5 6 7
0 1 1 1 1 1 1 1 1
1 1 2 4 8 16 32 64 128
2 1 4 14 46 146 454 1394 4246
3 1 8 46 230 1066 4718 20266 85310
4 1 16 146 1066 6902 41506 237686 1315666
5 1 32 454 4718 41506 329462 2441314 17234438
X
min.n;k/
nC1 kC1
Bn.k/ D .j Š/ 2
:
j D0
j C1 j C1
.k/
In particular, Bn is positive and symmetric in n and k.
Corollary 14.15.
.n/
Bn.k/ D Bk :
.k/
Proof. 3 We compute the two-variable generating function of Bn with the aid of
Theorem 14.6 and Proposition 2.6 (7), in the following way.
X 1
1 X n X1 X1 Xn
x yk n xn y k
Bn.k/ D .1/ n m
.1/ mŠ .m C 1/k
nD0 kD0
nŠ kŠ nD0 mD0
m nŠ kŠ
kD0
1
X X
n
n .mC1/y x n
D .1/n .1/m mŠ e
nD0 mD0
m nŠ
1
X 1
X
n
n x
D .1/m mŠe .mC1/y .1/n
mD0 nDm
m nŠ
1
X .e x 1/m
D .1/m mŠe .mC1/y
mD0
mŠ
1
D ey
1 .e y .e x 1//
e xCy
D :
ex C e y e xCy
Furthermore, we have
3
The following proof, which greatly simplifies the original proof in [10], is due to Hiroyuki Ochiai.
The authors would like to thank him for providing this simple proof.
14.3 Poly-Bernoulli Numbers with Negative Upper Indices 235
e xCy e xCy
D
e x C e y e xCy 1 .e x 1/.e y 1/
1
X
D e xCy .e x 1/j .e y 1/j
j D0
1
X 1 d x d
D .e 1/j C1 .e y 1/j C1 :
j D0
.j C 1/ dx
2 dy
X kC1
d @ n C 1 x A d @ k C 1 y A
D .j Š/2
j D0
dx nDj
j C 1 .n C 1/Š dy j C 1 .k C 1/Š
kDj
X 1
1 X
1 X
nC1 k C 1 xn y k
D .j Š/2 :
j D0 nDj kDj
j C1 j C 1 nŠ kŠ
nC1 kC1
Comparison of the coefficients on both sides gives the theorem (note j C1 j C1
D0
when j > min.n; k/). t
u
.k/
By the theorem, we also have a different generating function for Bn .
Corollary 14.16. We have
X 1
1 X 1
X
Bn.k/ x n y k D pj .x/pj .y/;
nD0 kD0 j D0
where
j Šx j
pj .x/ D :
.1 x/ .1 2x/ .1 .j C 1/x/
X
nC1
pj .x/ D j Š xn
nDj
j C1
X
n
.l/
.1/l Bnl D 0:
lD0
Proof. Denote the left-hand side of Corollary 14.16 by B.x; y/. By Theorem 14.6
and Proposition 2.6 (8), we have
X 1
1 X X
n
n
B.x; y/ D .1/n
.1/ mŠ m
.m C 1/k x n y k
nD0 kD0 mD0
m
1
X 1
X
n n 1
D .1/m mŠ .1/n x
mD0 nDm
m 1 .m C 1/y
1
X mŠx m
D :
mD0
.1 C x/ .1 C 2x/ .1 C mx/ .1 .m C 1/y/
f 2 SnCk j n i .i / k; 1 8i n C kg
.k/
is equal to Bn .
14.3 Poly-Bernoulli Numbers with Negative Upper Indices 237
p2
X 1 1
Bi 1C CC .m C 1/i mod p:
mD1
2 m
.1/ .2p/
Bi D Bi Bi mod p:
.i /
By Corollary 14.15, the right-hand side is equal to Bp2 . Applying Theorem 14.6
again, we obtain
X
p2
p2
Bi .1/m mŠ .m C 1/i mod p:
mD0
m
p2
It therefore suffices to show the following lemma (note 0 D 0).
Lemma 14.23. If 1 m p 2, then
p2 1 1
.1/ m1
mŠ 1C CC mod p:
m 2 m
Write .1/m1 mŠ p2
m
D bm . The recursion of the Stirling numbers (2.1) (p. 26)
gives
p1
.1/ m1
mŠ D m.bm1 C bm / .m 2/:
m
On the other hand, by Proposition 2.6 (6) (p. 30), we have
!
p1 Xm
l m
.1/ m1
mŠ D .1/ l p1
m l
lD1
!
Xm
l m
.1/ mod p
l
lD1
1 mod p
Remark 14.24. For 1 < i p 2, the right-hand side of the theorem is congruent
modulo p to
p1
X 1 1
1C CC mi :
mD1
2 m
Vandiver’s original theorem (a special case of [93, (63)]) states that this is congruent
modulo p to Bi .
Exercise 14.25. Prove that if mŠ
.mC1/2
is not an integer, then m C 1 D
8; 9; prime; or 2 prime.
.k/
Exercise 14.26. Define the numbers Cn by the generating function
X 1
Lik .1 e t / .k/ t
n
D C :
et 1 nD0
n
nŠ
.k/ .n/
Cn1 D Ck1 .n; k 1/:
.k/
Exercise 14.27. Prove that the number Bn is always even when n; k 1. Hint:
Use Theorem 14.14.
Exercise 14.28. List all n by k lonesum matrices for small n and k, and check
Theorem 14.20.
Exercise 14.29. Prove the congruence stated in Remark 14.24.
Appendix
Curious and Exotic Identities for Bernoulli
Numbers
Don Zagier
defining the Bernoulli numbers, the less familiar (and divergent) ordinary generat-
ing series
1
X x x2 x4 x6
ˇ.x/ D Bn x n D 1 C C (A.2)
nD0
2 6 30 42
also has many virtues and is often just as useful as, or even more useful than, its
better-known counterpart (A.1). As a first application, in Sect. A.2 we discuss the
“modified Bernoulli numbers”
!
X n
nCr Br
Bn D n1 : (A.3)
rD0
2r nCr
1
Here, and throughout this appendix, we use the convention B1 D 1=2, rather than the
convention B1 D 1=2 used in the main text of the book.
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 239
in Mathematics, DOI 10.1007/978-4-431-54919-2, © Springer Japan 2014
240 Appendix: Curious and Exotic Identities for Bernoulli Numbers
These numbers, which arose in connection with the trace formula for the Hecke
operators acting on modular forms on SL.2; Z/, have several unexpected properties,
including the surprising periodicity
BnC12 D Bn (n odd) (A.4)
and a modified form of the classical von Staudt–Clausen formula for the value
of Bn modulo 1. The following section is devoted to an identity discovered by
Miki [A10] (and a generalization due to Gessel P [A4]) which has Pthe
n striking
property of involving Bernoulli sums both of type Br Bnr and r
Br Bnr ,
i.e., sums related to both the generating functions (A.1) and (A.2). In Sect. A.4 we
look at products of Bernoulli numbers and Bernoulli polynomials in more detail.
In particular, we prove the result (discovered by Nielsen) that when a product
of two Bernoulli polynomials is expressed as a linear combination of Bernoulli
polynomials, then the coefficients are themselves multiples of Bernoulli numbers.
This generalizes to a formula for the product of two Bernoulli polynomials in two
different arguments, and leads to a further proof, due to I. Artamkin, of the Miki–
Gessel identities. Finally, in Sect. A.5 we discuss the continued fraction expansions
of various power series related to both (A.1) and (A.2) and, as an extra titbit,
describe an unexpected appearance of one of these continued fraction expansions
in connection with some recent and amazing discoveries of Yu. Matiyasevich
concerning the non-trivial zeros of the Riemann zeta function.
This appendix can be read independently of the main text and we will recall
all facts and notations needed. We should also add a warning: if you don’t like
generating functions, don’t read this appendix!
Given a sequence of interesting numbers fan gn0 , one often tries to understand
them by using the properties of the corresponding generating P1functions. The two
n
most popular choices for theseP1 generating functions are nD0 na x (“ordinary
n
generating function”) and nD0 an x =nŠ (“exponential generating function”).
Usually, of course, at most one of these turns out to have useful properties. For the
Bernoulli numbers the standard choice is the exponential generating function (A.1)
because it has an expression “in closed form.” What is not so well known is that the
ordinary generating function of the Bernoulli numbers, i.e., the power series (A.2),
even though it is divergent for all non-zero complex values of x, also has extremely
attractive properties and many nice applications. The key property that makes it
useful, despite its being divergent and not being expressible as an elementary
function, is the following functional equation:
A.1 The “Other” Generating Function(s) for the Bernoulli Numbers 241
Proposition A.1. The power series (A.2) is the unique solution in QŒŒx of the
equation
1 x
ˇ ˇ.x/ D x : (A.5)
1x 1x
Proof. Let fBn g be unspecified numbers and define ˇ.x/ by the first equality
in (A.2). Then comparing the coefficients of x m in both sides of (A.5) gives
!
X
m1
m 1 if m D 1;
Bn D (A.6)
nD0
n 0 if m > 1:
This is the same as the standard recursion for the Bernoulli numbers obtained by
multiplying both sides of (A.1) by e x 1 and comparing the coefficients of x m =mŠ
on both sides. t
u
The functional equation (A.5) can be rewritten in a slightly prettier form by
setting
1
X
ˇ1 .x/ D x ˇ.x/ D Bn x nC1 ;
nD0
! !
x X1
nCr 1
1
X `
ˇr ˇr .x/ D Bn x `C1
1x nD0
n nCr 1
`DnCr
1
! ! !
X ` X
`r
`r C1
D x `C1
Bn D r x rC1 :
r 1 nD0
n
`Dr
Alternatively, we can deduce (A.9) from (A.7) by induction on r by using the easily
checked identity
d x x 2 0 x
x2 F D F (A.11)
dx 1x 1x 1x
1
! !
X nk1 Xk
n k Bn
ˇk .x/ D Bn x nk
D .1/
nD0
n nD0
n x kn
1 k 1 1
D Bk C D Bk C 1 D .1/k Bk ;
x x k1 x x
where Bk .X / is the kth Bernoulli polynomial. (One can also prove these identities
by induction on k, using either (A.10) or else (A.9) together with the uniqueness
statement in Proposition A.2 and the corresponding well-known functional equation
for the Bernoulli polynomials.) However, there is a different and more interesting
way to extend the definition of ˇr to non-positive integral values of r. For k 2 Z,
define
X .n 1/Š
k .x/ D BnCk x n 2 x QŒŒx :
.n C k/Š
n max.1;k/
Then one easily checks that r .x/ D .r 1/Š ˇr .x/ for r > 0, so that the
negative-index power series k are just renormalized versions of the positive-index
power series ˇr . But now we do get interesting power series (rather than merely
polynomials) when k 0, e.g.
2
Or even in C if we work formally in x r QŒŒx.
A.1 The “Other” Generating Function(s) for the Bernoulli Numbers 243
1
X X1 X1
Bn x n BnC1 x n BnC2 x n
0 .x/ D ; 1 .x/ D ; 2 .x/ D :
nD1
n nD1
n.n C 1/ nD1
n.n C 1/.n C 2/
(A.12)
The properties of these new functions corresponding to (A.10) and (A.9) are
given by:
Proposition A.3. The power series k .x/ satisfy the differential recursion
Bk
x 2 k0 .x/ D k1 .x/ x .k 0/ (A.13)
kŠ
.with 1 .x/ D ˇ1 .x// as well as the functional equations
x
0 0 .x/ D log.1 x/ C x ; (A.14)
1x
x 1 1
1 1 .x/ D log.1 x/ 1 ;
1x x 2
x 1
.1/k 1
k k .x/ D Bk log.1 x/ C Pk1 ; (A.15)
1x kŠ x x
where Pk1 .X / is a polynomial of degree k 1, the first few values of which are
P0 .X / D 1, P1 .X / D X 12 , P2 .X / D X 2 X C 12 1
, P3 .X / D X 3 32 X 2 C
1
3
X C 12 and P4 .X / D X 4 2X 3 C 4 X 2 C 4 X 360 .
1 3 1 13
Proof. Equation (A.13) follows directly from the definitions, and then Eqs. (A.14)
and (A.15) (by induction over k) follow successively from (A.7) using the general
identity (A.11). t
u
We end this section with the observation that, although ˇ.x/ and the related
power series ˇr .x/ and k .x/ that we have discussed are divergent and do not give
the Taylor or Laurent expansion of any elementary functions, they are related to
the asymptotic expansions of very familiar, “nearly elementary” functions. Indeed,
Stirling’s formula in its logarithmic form says that the logarithm of Euler’s Gamma
function has the asymptotic expansion
1 1 X1
Bn
log .X /
X log X X C log.2/ C X nC1
2 2 nD2
n.n 1/
0 .X / 1 X Bn 1 1
.X / WD
log X X n D log X 0
.X / 2X nD2 n X
244 Appendix: Curious and Exotic Identities for Bernoulli Numbers
as X ! 1, with 0 .x/ defined as in Eq. (A.12), and the functions ˇr .x/ correspond
similarly to the derivatives of .x/ (“polygamma functions”). The transformation
x 7! x=.1x/ occurring in the functional equations (A.5), (A.9), (A.14) and (A.15)
corresponds under the substitution X D 1=x to the translation X 7! X C1, and the
compatibility equation (A.11) simply to the fact that this translation commutes with
the differential operator d=dX , while the functional equations themselves reflect
the defining functional equation .X C 1/ D X .X / of the Gamma function.
with Bn as in (A.3). On the other hand, the functional equation (A.14) applied to
x D t=.1
t C t 2 /, together with the parity property 0 .x/ C x D 0 .x/, which
is a restatement of (A.16), implies the two functional equations
t
g
C1 .t/ D g
.t/ C D g
.t/
1
t C t 2
and comparing this with (A.20) immediately gives the desired formula (A.18) for
Bn , n odd. u
t
We mention one further result about the modified Bernoulli numbers from [A15].
The ordinary Bernoulli numbers satisfy the asymptotic formula
2 nŠ
Bn
.1/.n2/=2 (n ! 1, n even). (A.21)
.2/n
As one might expect, the modified ones have asymptotics given by a very similar
formula:
.n 1/Š
Bn
.1/.n2/=2 (n ! 1, n even). (A.22)
.2/n
The (small) surprise is that, while the asymptotic formula (A.21) holds to all orders
in 1=n (because the ratio of the two sides equals .n/ D 1 C O.2n /), this is not
246 Appendix: Curious and Exotic Identities for Bernoulli Numbers
true of the new formula (A.22), which only acquires this property if the right-hand
side is replaced by .1/n=2 Yn .4/, where Yn .x/ is the nth Bessel function of the
second kind.
Here is a small table of the numbers Bn and BQn D 2nBn Bn for n even:
n 2 4 6 8 10 12 14 16 18 20 22
Bn 1
24
27
80
1260
29 451
1120
264
65
12012
6571 571
312
181613
38080
23663513
1220940
10188203
83600
564133
552
BQn 0 83 10
3 136
21
5 330
4249 651
13
3056
21
109269
170
247700
57
38775
The surprising identity described in this section was found and proved by
Miki [A10] in an indirect and non-elementary way, using p-adic methods. In
this section we describe two direct proofs of it, or rather, of it and of a very similar
identity discovered by Faber and Pandharipande in connection with Chern numbers
of moduli spaces of curves. The first, which is short but not very enlightening,
is a variant of a proof I gave of the latter identity [A2] (but which with a slight
modification works for Miki’s original identity as well). The second one, which
is more natural, is a slight reworking of the proof given by Gessel [A4] based on
properties of Stirling numbers of the second kind. In fact, Gessel gives a more
general one-parameter family of identities, provable by the same methods, of which
both the Miki and the Faber–Pandharipande identities are special cases. In Sect. A.4
we will give yet a third proof of these identities, following I. Artamkin [A1].
Proposition A.5 (Miki). Write Bn D .1/n Bn =n for n > 0. Then for all n > 2 we
have
!
X
n2 X
n2
n
Bi Bni D Bi Bni C2Hn Bn ; (A.23)
i D2 i D2
i
where Hn D 1 C 1
2 CC 1
n denotes the nth harmonic number.
B2g
(Faber–Pandharipande). Write bg D .2 22g / for g 0. Then for all
.2g/Š
g > 0 we have
First proof. We prove (A.24), following [A2]. Write the identity as a.g/ D b.g/ C
P
1 P
1
c.g/ in the obvious way, and let A.x/ D a.g/ x 2g1 , B.x/ D b.g/ x 2g1
gD1 gD1
P
1
and C.x/ D c.g/ x 2g1 be the corresponding odd generating functions. Using
gD1
P
1 1
the identity bg x 2g1 D , we obtain
gD0 sinh x
X Z x
1
A.x/ D bg 1 bg 2 t 2g1 1 .x t/2g2 1 dt (by Euler’s beta integral)
2 g1 ; g2 >0 0
Z 1
1 x
1 1 1
D dt ;
2 0 t sinh t x t sinh.x t/
1
1 X 22n B2n 2n 1 sinh x
B.x/ D x D log ;
sinh x nD1 2n .2n/Š sinh x x
Z Z
1 1 1
1
X x
x 2g1 t 2g1 x
1
C.x/ D bg dt D C dt ;
gD1 0 xt 0 x t sinh x sinh t xt
A similar proof can be given for Miki’s original identity (A.23), with “sinh” replaced
by “tanh”. t
u
Second proof. Now we prove (A.23), following the method in [A4]. Recall that the
Stirling number of the second kind S.k; m/ is defined as the number of partitions
of a set of k elements into m non-empty subsets or, equivalently, as 1=mŠ times the
number of surjective maps from the set f1; 2; : : : ; kg to the set f1; 2; : : : ; mg. It can
be given either by the closed formula
248 Appendix: Curious and Exotic Identities for Bernoulli Numbers
!
1 X
m
m` m
S.k; m/ D .1/ `k (A.25)
mŠ `
`D0
(this follows immediately from the second definition and the inclusion-exclusion
principle, since `k is the number of maps from f1; 2; : : : ; kg to a given set of `
elements) or else by either of the two generating functions
1
X xm
S.k; m/ x k D ;
.1 x/.1 2x/ .1 mx/
kD0
(A.26)
1
X xk .e x 1/m
S.k; m/ D ;
kŠ mŠ
kD0
both of which can be deduced easily from (A.25). (Of course all of these formulas
are standard and can be found in many books, including Chap. 2 of this one, where
S.k; m/ is denoted using Knuth’s notation mk .) From either generating function one
finds easily that S.k; m/ vanishes for k < m, S.m; m/ D 1, S.m C 1; m/ D m 2Cm ,
2
X
n2 m2
S.m C n; m/ D Bn mC nBn1 C Bi Bni C .n 3/ ; (A.27)
i D2
2
while
P from the second formula in (A.26) and the expansion log .e x 1/=x D
Bn x n =nŠ we get
n>0
A.4 Products and Scalar Products of Bernoulli Polynomials 249
S.m C n; m/
m m m x n e x 1 m
D 1C 1C 1C Coefficient of in
1 2 n nŠ x
X n1
! 2
n m
D 1 C Hn m C Bn m C Bi Bni C
i D1
i 2
!
X
n1
n m2
D Bn m C 2Hn Bn C Bi Bni C .n 1/ : (A.28)
i D1
i 2
Gessel does not actually write out the proof of this identity, saying only that it can
be obtained in the same way as his proof of (A.23) and pointing out that, because
Bn .1/ D Bn and 22g B2g .1=2/ D .2g 1/Š bg , it implies (A.23) and (A.24) by
specializing to x D 1 and x D 1=2, respectively.
a precise form in the following proposition. The formula turns out to be somewhat
simpler if we use the renormalized Bernoulli polynomials Bn .x/ D Bnn.x/ rather
than the Bn .x/ themselves when n > 0. (For n D 0 there is nothing to be calculated
since the product of any Bi .x/ with B0 .x/ D 1 is just Bi .x/.)
Proposition A.7. Let i and j be strictly positive integers. Then
! !
X 1 i 1 j
Bi .x/ Bj .x/ D C B2` Bi Cj 2` .x/
i Cj
i 2` j 2`
0`< 2 (A.30)
i 1
.1/ .i 1/Š .j 1/Š
C Bi Cj :
.i C j /Š
Note that, despite appearances, the (constant) second term in this formula is
symmetric in i and j , because if Bi Cj ¤ 0 then i and j have the same parity.
Proof. Write Bi;j .x/ for the right-hand side of (A.30). We first show that the
difference between Bi;j .x/ and Bi .x/Bj .x/ is constant. This can be done in two
different ways. First of all, using Bn .x C 1/ Bn .x/ D x n1 we find
! !
X 1 i 1 j
Bi;j .x C 1/ Bi;j .x/ D C B2` x i Cj 2`1
i Cj
i 2` j 2`
0`< 2
1 1
D x j 1 Bi .x/ C x i 1 C x i 1 Bj .x/ C x j 1
2 2
D Bi .x C 1/Bj .x C 1/ Bi .x/Bj .x/ :
It follows that the Bi;j .x/ Bi .x/Bj .x/ is periodic and hence, since it is also
polynomial, constant. Alternatively, we can use that Bn0 .x/ equals 1 for n D 1 and
.n1/Bn1 .x/ for n > 1 to show by induction on i Cj that Bi;j .x/ and Bi .x/Bj .x/
have the same derivative (we omit the easy computation) and hence again that their
difference is constant. To show that this constant vanishes, it suffices to show that the
integrals of the two sides of (A.30) over the interval [0,1] agree. Since the integral
of Bn .x/ over this interval vanishes for any n >p0, this reduces to the following
statement, in which to avoid confusion with i D 1 we have changed i and j to
r and s. t
u
Proposition A.8. Let r and s be positive integers. Then
Z 1
rŠ sŠ
Br .x/ Bs .x/ dx D .1/r1 BrCs : (A.31)
0 .r C s/Š
Proof. Here again we give two proofs. The first uses the Fourier development
kŠ X e 2 i nx
Bk .x/ D .0 < x < 1; k 1/ (A.32)
.2 i /k n2Z nk
n¤0
A.4 Products and Scalar Products of Bernoulli Polynomials 251
discussed in Chap. 4, Theorem 4.11 of this book. (For k D 1 the sum converges
R1
only conditionally and one has to be a little careful.) Since the integral 0 e 2 i kx dx
equals ık;0 , this gives
Z 1
rŠ sŠ X 1 rŠ sŠ
Br .x/ Bs .x/ dx D .1/r D .1/r1 BrCs
0 .2 i / rCs
n2Z
nrCs .r C s/Š
n¤0
X Z
t r1 us1 1
e xt e xu 1 1 e t Cu 1
Ir;s D dx D
r; s0
rŠ sŠ 0 et 1 eu 1 et 1 eu 1 t C u
X1
1 1 1 Bk t k1 .u/k1
D u
D
t Cu e 1 e 1
t kŠ t Cu
kD0
1 X Bk X
D C t r1 .u/s1 ;
tu kŠ r; s1
k2
rCsDk
and
X X n1 Z
.i 1/Š .j 1/Š 1
.1/i 1 D .x/i 1 .1 x/ni 1 dx
i; j 1
.n 1/Š i D1 0
i Cj Dn
Z 1
1 C .1/n
D .1 x/n1 .x/n1 dx D
0 n
252 Appendix: Curious and Exotic Identities for Bernoulli Numbers
Using this, one finds, with almost the same proof as before, the following
generalization of Proposition A.7:
Proposition A.10. Let i and j be positive integers. Then for any two variables x
and y we have
max.i;j /
! !
X 1 i .1/m j
Bi .x/ Bj .y/ D Bi Cj m.y/ C Bi Cj m.x/ BmC .x y/
mD0
i m j m
.i 1/Š .j 1/Š C
C .1/j 1 Bi Cj .x y/ ; (A.35)
.i C j /Š
X
Bi .x/Bj .y/ Hn1 Bn .x/ C Bn .y/
i; j 1
i Cj Dn
!
X n1 B C .x y/
n1
D Bnm .y/ C .1/m Bnm .x/ m
mD1
m m
1 C .1/n C
C Bn .x y/ : (A.36)
n2
We observe that Eq. (A.36) was also found by Hao Pan and Zhi-Wei Sun [A12] in a
slightly different form, the right-hand side in their formula being
!
X n1 Bm .y x/
n
Bm .x y/
Bnm .y/ C Bnm .x/
mD1
m1 m2 m2
1 Bn .x/ Bn .y/
C ; (A.37)
n xy
which is easily checked to be equal to the right-hand side of (A.36); their formula
has the advantage of being more visibly symmetric in x and y and of using only the
Bernoulli polynomials Bm .x/ rather than the symmetrized Bernoulli polynomials
BmC .x/, but the disadvantage of having a denominator x y (which of course
disappears after division into the numerator Bn .x/Bn .y/) rather than being written
in an explicitly polynomial form.
We end this section by giving a beautifully symmetric version of the multiplica-
tion law for Bernoulli polynomials given by the same authors in [A13].
Proposition
A.11 (Sun–Pan). For each integer n 0 define a polynomial
r s
in four variables r, s, x and y by
xy n
! !
X
r s i r s
D .1/ Bj .x/ Bi .y/ : (A.38)
xy n i j
i; j 0
i Cj Dn
Then for any six variables r, s, t, x, y and z satisfying r CsCt D n and xCyCz D 1
we have
r s s t t r
t Cr Cs D0: (A.39)
xy n yz n zx n
First proof (sketch). We can prove (A.39) in the same way as (A.36) was proved
above, replacing the product Bj .x/Bi .y/ in (A.38) for i and j positive using
formula (A.35) (with x and y replaced by 1 y and x) and then using elementary
254 Appendix: Curious and Exotic Identities for Bernoulli Numbers
binomial coefficient identities to simplify the result. We do not give the full
calculation, which is straightforward but tedious. t
u
Second proof. An alternative, and easier, approach is to notice that, since the left-
hand side of (A.39) is a polynomial in the variables x, y and z D 1 x y, it is
enough to prove the identity for x; y; z > 0 with x C y C z D 1. But for x and y
between 0 and 1 we have from (A.32)
X
r s
.2 i / n
D Cn .r; sI a; b/ e 2 i.bxay/
xy n
a; b2Z
with
⎧P
⎪
⎪
i j
i; j 1; i Cj Dn .r/i .s/j a b if a ¤ 0, b ¤ 0
⎪
⎪
⎨ .r/n an if a ¤ 0, b D 0
Cn .r; sI a; b/ D
⎪
⎪ .s/n b n if a D 0, b ¤ 0
⎪
⎪
⎩
0 if a D 0, b D 0
where B 1 .x/ as usual is the periodic version of the first Bernoulli polynomial (equal
to x 12 if 0 < x < 1, to 0 if x D 0, and periodic with period 1), and satisfy the
famous Dedekind reciprocity relation
b2 C c2 C 1 1
s.b; c/ C s.c; b/ D :
12bc 4
This was generalized by Rademacher, who discovered that if a, b and c are pairwise
coprime integers then the sum
X ah bh
s.a; bI c/ D B1 B1 (A.40)
c c
h .mod c/
A.4 Products and Scalar Products of Bernoulli Polynomials 255
which equals s.a0 ; c/ for any a0 with aa0 b .mod c/ or ba0 a .mod c/, satisfies
the identity
a2 C b 2 C c 2 1
s.a; bI c/ C s.b; cI a/ C s.c; aI b/ D : (A.41)
12abc 4
in which X , Y and Z (which does not appear explicitly on the right) are formal
variables satisfying X C Y C Z D 0, then we have the following relation:
Proposition A.12 ([A5]). Let a; b; c be three natural numbers with no common
factor, x; y; z three elements of T, and X; Y; Z three formal variables satisfying
X C Y C Z D 0. Then
0 1 0 1 0 1
a b c b c a c a b
1=4 if .x; y; z/ 2 .a; b; c/T ;
S@ x y z A C S@ y z x A C S@ z x y A D
X Y Z Y ZX ZX Y 0 otherwise.
We do not give the proof of this relation, since three different proofs (all similar
in spirit to various of the proofs that have been given in this appendix) are given
in [A5], but we wanted to at least mention this generalized Dedekind–Rademacher
reciprocity law because of its formal resemblance, and perhaps actual relationship,
to the Sun–Pan reciprocity law (A.39).
256 Appendix: Curious and Exotic Identities for Bernoulli Numbers
There are several classical formulas expressing various versions of the standard
(exponential) generating functions of the Bernoulli numbers as continued fractions.
A simple example is
X 2n .2n 1/Bn
x
tanh x D x n1 D ; (A.44)
nŠ x2
n2 1C
x2
3C
x2
5C
::
:
whose proof is recalled below, and a somewhat more complicated one, whose proof
we omit, is
X B2n
x=2 1
D x 2n
D (A.45)
tanh x=2 .2n/Š a1 x 2
n0 1C
a2 x 2
1C
::
:
with an defined by
⎧
⎪
⎪ 1
⎪
⎪ if n D 1 ,
⎪
⎪ 12
⎪
⎪
⎪
⎨ .n C 1/.n C 2/
an D if n is even ,
⎪
⎪ .2n 2/.2n 1/.2n/.2n C 1/
⎪
⎪
⎪
⎪ .n 2/.n 1/
⎪
⎪
⎪
⎩ .2n 1/.2n/.2n C 1/.2n C 2/ if n > 1 is odd .
It was discovered by M. Kaneko that the convergents Pn .x/=Qn .x/ of the continued
fraction (A.45) could be given in a simple closed form, namely
! !1
X
n=2
n 2n C 1 xi
Pn .x/ D
i D0
2i 2i .2i C 1/Š
! !1
X
n=2
nC1 2n C 2 xi
Qn .x/ D
i D0
2i 2i .2i /Š
A.5 Continued Fraction Expansions for Generating Functions of Bernoulli. . . 257
x3 x5 x7
1xC2 C 16 C 272 C D tan x
3Š 5Š 7Š
x
1 x C 2 x 3 C 16 x 5 C 272 x 7 C D
1 2 x2
1
2 3 x2
1
3 4 x2
1
1
The numbers 1, 2, 16, 272, . . . defined by the first of these two formulas are just
the numbers .4n 2n /jBn j=n, so the second formula gives a continued fraction
expansion for the non-exponential generating function for essentially the Bernoulli
numbers. Again we omit the proof, referring for this to the book cited, mentioning
only the following alternative and in some ways prettier form of the formula:
1 2 16 272 1
3 C 5 7 C D (A.46)
X X X X 1
XC
X 1
C
2 X
C
3
X C4
in which the continued fraction is convergent and equal to 1 X2 4 X C2
4
for all X > 0.
Other continued fraction expansions for non-exponential Bernoulli number
generating functions that can be found in the literature include the formulas
3
In the English translation [A8] (which we highly recommend to the reader) this formula has been
relegated to the exercises: Chapter 5, Problem 5.6, page 85.
258 Appendix: Curious and Exotic Identities for Bernoulli Numbers
1
X x
B2n .4x/n D ;
1 x
nD1 1C C
2 1 1 x
C C
2 3 1 1 x
C C
3 4 ::
:
and
1
X x
.2n C 1/ B2n x n D
x
nD1 1C1C
1 x
1C C
2 1 1 x
C C
2 2 1 1 x
C C
2 3 1 1 x
C C
3 3 ::
:
all given by J. Frame [A3] in connection with a statistical problem on curve fitting.
For good conscience’s sake we give the proofs of one continued fraction of each
of the two above types, choosing for this purpose the two simplest ones (A.44)
and (A.46). We look at (A.44) first. Define functions I0 ; I1 ; : : : on .0; 1/ by
Z a
t n .1 t=a/n t
In .a/ D e dt n 2 Z0 ; a 2 R>0 :
0 nŠ
A.5 Continued Fraction Expansions for Generating Functions of Bernoulli. . . 259
1 e 2x C 1 1 I1 .2x/ 1 1
D 2x D C D C ;
tanh x e 1 x I0 .2x/ x 3 1
C
x 5 1
C
x ::
:
This time J0 .X / is simply the constant function 1, while J1 .X / has the exact
evaluation
X X X X 1
J1 .X / D 1 C1 C C ; (A.47)
2 4 2 4 2
as is easily deduced from Euler’s integral representation
Z 1
1 t x1
.x/ D C dt ;
0 1t
as well as the asymptotic expansion
Z 1
1 2 t3 16 t 5 272 t 7
J1 .X /
t 3 C 5 7 C e t dt
0 X X 3Š X 5Š X 7Š
1 2 16 272
3 C 5 7 C
X X X X
260 Appendix: Curious and Exotic Identities for Bernoulli Numbers
Jn1 .X / X JnC1 .X /
for n > 0, and rewriting this as D C we obtain that J1 .X / D
Jn .X / n Jn .X /
J1 .X /
has the continued fraction expansion given by the right-hand side of (A.46),
J0 .X /
as claimed. t
u
We end this appendix by describing an appearance of the continued frac-
tion (A.46) in connection with the fantastic discovery of Yuri Matiyasevich that
“the zeros of the Riemann zeta function know about each other.” Denote the zeros
of .s/ on the critical line <.s/ D 12 by n and n with 0 < =.1 / =.2 /
and for M 1 consider the finite Dirichlet series M .s/ defined as the N N
determinant4
ˇ ˇ
ˇ 1 1 1 1 ˇˇ
ˇ 1
ˇ : :: : : : :: :: ˇˇ
ˇ :: : ::
ˇ : : : ˇ
ˇ 1 1 ˇ
ˇn n nM nM ns ˇ
M .s/ D ˇ : :: ˇˇ ;
ˇ : :: : : :: ::
ˇ : : : : : : ˇ
ˇ ˇ
ˇ ˇ
ˇ ˇ
ˇN 1 N 1 N M N M N s ˇ
4
We have changed Matiyasevich’s notations slightly for convenience of exposition.
References 261
1
then it turns out that the function cM M .s/ not only has almost the same zeros,
but is itself a very close approximation to .1 21s /.s/ over a long interval of the
critical line.
In studying this latter function, Matiyasevich
P was led to consider the real
numbers M defined by M D 4M 2M nD1 M;n =n, where M;n denotes the
coefficient of ns in the Dirichlet series cM 1
M .s/=.s/. Since by the nature
of his investigation he was working to very high precision, he obtained very
precise decimal expansions of these numbers, and in an attempt to recognize
them, he computed the beginning of their continued fraction expansions. (Recall
that rational numbers and real quadratic irrationalities can be recognized
numerically by the fact that they have terminating or periodic continued
fraction expansions.) To his surprise, when M was highly composite these
numbers had very exceptional continued fraction expansions. For instance, for
2M D l.c.m.f1; 2; : : : ; 10g D 2520, the number M has a decimal expansion
beginning 0:9998015873172093 and a continued fraction expansion beginning
Œ0; 1; 5039; 2520; 1680; 1260; 1008; 840; 720; 630; 560; 504. In view of the
fact that nearly all real numbers (in a very precise metrical sense) have continued
fraction expansions with almost all partial quotients very small, this is certainly
not a coincidence, and it is even more obviously not one when we notice that
the numbers 5040, 2520, . . . 504 are 5040=n for n D 1; 2; : : : ; 10. This leads
one immediately to the continued fraction (A.46) with X D 4M and hence, in
view of the evaluation of that continued
fraction
given
above, to the (conjectural)
approximation M 12 M C 1 12 M C 12 , which turns out indeed to be a
very good one for M large, the two numbers differing only by one part in 10108 in
the above-named case 2M D 2520. We take this somewhat unusual story as a fitting
place to end our survey of curious and exotic identities connected with Bernoulli
numbers.
References
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 263
in Mathematics, DOI 10.1007/978-4-431-54919-2, © Springer Japan 2014
264 References
22. Carlitz, L.: Arithmetic properties of generalized Bernoulli numbers. J. Reine Angew. Math.
202, 174–182 (1959)
ˇ ˇ:ˇC1
23. Clausen, T.: Ueber die Fälle, wenn die Reihe von der Form y D 1C ˛1 xC ˛:˛C1
1:2
: C1 x 2 C
0 ˇ0 0 0 0 ˇ 0 :ˇ 0 C1 0 0
etc: ein Quadrat von der Form z D 1 C ˛1 0 ı"0 x C ˛ :˛1:2C1 0 : 0 C1 ı"0 :ı:"0 C1
C1 2
x C etc: hat.
J. Reine Angew. Math. 3, 89–91 (1828)
24. Clausen, T.: Über die Function sin ' C 212 sin 2' C 312 sin 3' C etc:. J. Reine Angew. Math.
8, 298–300 (1832)
25. Clausen, T.: Beweis, daß die algebraischen Gleichungen Wurzeln von der Form a Cbi haben.
Astron. Nachr. 17, 325–330 (1840)
26. Clausen, T.: Lehrsatz aus einer Abhandlung über die Bernoullischen Zahlen. Astron. Nachr.
17, 351–352 (1840)
27. Coates, J., Wiles, A.: Kummer’s criterion for Hurwitz numbers, in Algebraic number theory
(Kyoto Internat. Sympos., RIMS, Kyoto, 1976), pp. 9–23. Japan Soc. Promotion Sci., Tokyo
(1977)
28. Dilcher, K.: A Bibliography of Bernoulli Numbers, available online, http://www.mscs.dal.
ca/~dilcher/bernoulli.html
29. Dirichlet, P.G.L.: Lectures on Number Theory, with Supplements by R. Dedekind. History of
Mathematics Sources, vol. 16. Amer. Math. Soc. (1999)
30. Edwards, A.W.F.: A quick route to sums of powers. Am. Math. Monthly 93, 451–455 (1986)
31. Edwards, H.W.: Fermat’s Last Theorem. A Genetic Introduction to Algebraic Number Theory.
Springer, New York-Berlin (1977)
32. Euler, L.: Remarques sur un beau rapport entre les séries des puissances tant directes que
réciproques. Opera Omnia, series prima XV, 70–90 (1749)
33. Euler, L.: De numero memorabili in summatione progressionis harmonicae naturalis occur-
rente. Opera Omnia, series prima XV, 567–603 (1785)
34. Freudenthal, H.: Hurwitz, Adolf, in Dictionary of Scientific Biography, vols. 5 & 6. Charles
Scribner’s Sons, New York (1981)
35. Gauss, C.F.: DisquisitionesArithmeticae (1801)
36. Gould, H.W.: Explicit formulas for Bernoulli numbers. Am. Math. Monthly 79, 44–51 (1972)
37. Gouvéa, F.Q.: p-adic Numbers, an Introduction. Springer
38. Graham, R., Knuth, D., Patashnik, O.: Concrete Mathematics. Addison-Wesley (1989)
39. Hashimoto, K.: Representation of the finite symplectic group Sp.2; Fp / in the space of Siegel
modular forms. Contemporary Math. 53, 253–276 (1986)
40. Hensel, K.: Festschrift zur Feier des 100. Geburtstages Eduard Kummers, B.G. Teubner, 1910.
(Kummer’s collected papers, 33–69)
41. Hilbert, D.: Adolf Hurwitz, Gedächtnisrede, Nachrichten von der k. Gesellshaft der Wis-
senschaften zu Göttingen (1920), pp. 75–83 (Hurwitz’s Mathematische Werke, I xiii–xx)
42. Hungerford, T.W.: Algebra, Graduate Texts in Mathematics, vol. 73. SpringerP(1974) D 1
43. Hurwitz, A.: Einige Eigenschaften der Dirichlet’schen Funktionen F .s/ D n
ns , die
bei der Bestimmung der Klassenanzahlen binärer quadratischer Formen auftreten. Zeitschrift
für Math. und Physik 27, 86–101 (1882). (Mathematische Werke I, 72–88)
44. Hurwitz, A.: Über die Entwicklungskoeffizienten der lemniskatischen Funktionen. Math.
Ann. 51, 196–226 (1899). (Mathematische Werke II, 342–373)
45. Ibukiyama, T.: On some elementary character sums. Comment. Math. Univ. St. Pauli 47, 7–13
(1998)
46. Ibukiyama, T., Saito, H.: On zeta functions associated to symmetric matrices and an explicit
conjecture on dimensions of Siegel modular forms of general degree. Int. Math. Res. Notices
8, 161–169 (1992)
47. Ibukiyama, T., Saito, H.: On a L-functions of ternary zero forms and exponential sums of Lee
and Weintraub. J. Number Theory 48-2, 252–257 (1994)
48. Ibukiyama, T., Saito, H.: On zeta functions associated to symmetric matrices I. Am. J. Math.
117-5, 1097–1155 (1995); II: Nagoya Math. J. 208, 265–316 (2012); III: Nagoya Math. J.
146, 149–183 (1997)
References 265
49. Ibukiyama, T., Saito, H.: On “easy” zeta functions (trans. by Don Zagier). Sugaku Exposition,
14(2), 191–204 (2001). Originally in Sugaku, 50–1, 1–11 (1998)
50. Ireland, K., Rosen, M.: A Classical Introduction to Modern Number Theory, 2nd edn.
Graduate Texts in Mathematics, vol. 84. Springer (1990)
51. Iwasawa, K.: Lectures on p-adic L-functions. Annals of Math. Studies, vol. 74. Princeton
University Press, Princeton (1972)
52. Jacobi, C.G.J.: De usu legitimo formulae summatoriae Maclaurinianae. J. Reine Angew.
Math. 12, 263–272 (1834). (Mathematische Werke VI, 64–75)
53. Jordan, Ch.: Calculus of Finite Differences, Chelsea Publication, New York (1965). (First
edition, Budapest, 1939)
54. Kaneko, M.: A generalization of the Chowla-Selberg formula and the zeta functions of
quadratic orders. Proc. Jpn. Acad. 66(A)-7, 201–203 (1990)
55. Kaneko, M.: A recurrence formula for the Bernoulli numbers. Proc. Jpn. Acad. 71(A)-8,
192–193 (1995)
56. Kaneko, M.: Poly-Bernoulli numbers. J. Th. Nombre Bordeaux 9, 199–206 (1997)
57. Kaneko, M.: Multiple zeta values. Sugaku Expositions 18(2), 221–232 (2005)
58. Katz, N.: The congruences of Clausen-von Staudt and Kummer for Bernoulli-Hurwitz
numbers. Math. Ann. 216, 1–4 (1975)
59. Knuth, D.: Two notes on notation. Am. Math. Monthly 99, 403–422 (1992)
60. Knuth, D.: Johann Faulhaber and sum of powers. Math. Comp. 61(203), 277–294 (1993)
61. Knuth, D., Buckholtz, T.J.: Computation of tangent Euler and Bernoulli numbers. Math.
Comp. 21, 663–688 (1967)
62. Kronecker, L.: Bemerkungen zur Abhandlung des Herrn Worpitzky. J. Reine Angew. Math.
94, 268–270 (1883). (Mathematische Werke II, 405–407)
˛ˇ ˛.˛C1/ˇ.ˇC1/
63. Kummer, E.E.: Über die hypergeometrische Reihe 1 C 1 x C 12. C1/ x 2 C
˛.˛C1/.˛C2/ˇ.ˇC1/.ˇC2/ 3
123. C1/. C2/
x C : : : . J. Reine Angew. Math. 15, 39–83, 127–172 (1836).
(Collected papers II, 75–166)
64. Kummer, E.E.: Über eine allgemeine Eigenschaft der rationalen Entwicklungscoefficienten
einer bestimmten Gattung analytischer Functionen. J. Reine Angew. Math. 41, 368–372
(1851). (Collected papers I, 358–362)
65. Lampe, E.: Nachruf für Ernst Eduard Kummer. Jahresbericht der Deutschen Mathematiker-
vereinigung 3, 13–21 (1894). (Kummer’s collected papers I, 15–30)
66. Lang, S.: Cyclotomic Fields, Graduate Texts in Mathematics, vol. 59. Springer (1980)
67. Lang, S.: Elliptic Functions, 2nd edn. Graduate Texts in Mathematics, vol. 112. Springer
(1987). (Original edition, Addison-Wesley Publishing, 1973)
68. Launois, S.: Combinatorics of H-primes in quantum matrices. J. Algebra 309(1), 139–167
(2007)
69. Lee, R., Weintraub, S.H.: On a generalization of a theorem of Erich Hecke. Proc. Natl. Acad.
Sci. USA 79, 7955–7957 (1982)
70. Leopoldt, H.-W.: Eine Verallgemeinerung der Bernoullischen Zahlen. Abh. Math. Sem. Univ.
Hamburg 22, 131–140 (1958)
71. Lindemann, F.: Ueber die Zahl . Math. Ann. 20, 213–225 (1882)
72. Meissner, E.: Gedächtnisrede auf Adolf Hurwitz. Vierteljahrsschrift der Naturforschenden
Gesellschaft in Zürich. 64, 855–857 (1919)
73. Nielsen, N.: Handbuch der Theorie der Gammafunktion. Chelsea Publication, New York
(1965). (First edition, Leipzig, 1906)
74. Noether, M.: Zur Erinnerung an Karl Georg Christian von Staudt. Jahresbericht d. Deutschen
Mathem.-Vereinigung 32, 97–119 (1923)
75. Nörlund, N.E.: Vorlesungen über Differenzenrechnung. Chelsea Publication, New York
(1954). (First edition, Springer, Berlin, 1924)
76. Pólya, G.: Some mathematicians I have known. Am. Math. Monthly 76, 746–753 (1969)
77. Rademacher, H.: Lectures on Elementary Number Theory, Die Grundlehren der mathematis-
chen Wissenschaften, vol. 169. Springer (1773)
266 References
106. Zagier, D.: Modular forms whose Fourier coefficients involve zeta functions of quadratic
fields, in Modular functions of one variable VI. Lect. Notes in Math., vol. 627, pp. 105–169.
Springer (1977)
107. Zagier, D.: Zetafunktionen und Quadratische Körper. Springer (1981)
108. Zagier, D.: A one-sentence proof that every prime p 1 .mod 4/ is a sum of two squares.
Am. Math. Monthly 97–2, 144 (1990)
109. Zagier, D.: Values of zeta functions and their applications, in ECM volume. Progress Math.
120, 497–512 (1994)
Index
Symbols B
L-function Bachmann, 49
p-adic —, 43, 183 Barnes, 210
— of prehomogeneous vector space, 174, — double zeta function, 212, 214
175 contour integral representation of —,
Dirichlet —, 148, 176 214
p-adic transformation formula of —, 214, 216
— L-function, 43, 183 — multiple zeta function, 209–211
— integral, 44 contour integral representation of —,
— modular form, 41, 96 210
— number field, 183, 184 Berndt, 217
measure on — number field, 184 Bernoulli measure, 196, 199
ring of — integers, 183 Bernoulli number
p-adic integral expression of —, 198
— and class number, 95
A — and class number of imaginary quadratic
Abel, 46, 203 field, 90
Academia Algebrae, 3 — of higher order, 224
Akiyama, 36 definition of —, 3
Akiyama–Tanigawa algorithm, 36 denominator of —, 41
algebraic irrational number, 212 different definition of —, 6
ambig, 92, 178 formula of — involving Stirling number, 35
Ankeny, 96 generalized —, 53, 58, 107, 110, 111, 127
Ankeny–Artin–Chowla congruence, 96 generating function of —s, 53
Ars Conjectandi, 1 generating function of —s, 13, 20, 23
Artin, 96 poly- —, 223
asteroid recurrence formula of —, 3, 12, 22
Ceres, 107 sign of —, 23
Pallas, 47, 107 table of —, 6
Vesta, 107 Bernoulli polynomial, 10, 54, 55, 65, 141, 144,
Astronomische Nachrichten, 46 209, 217
automorphic factor, 220 definition of —, 55
automorphic form, 209 Fourier expansion of —, 59
dimension formula of —s, 124, 174, 177 generating function of —s, 56
T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 269
in Mathematics, DOI 10.1007/978-4-431-54919-2, © Springer Japan 2014
270 Index
H
F Hashimoto, 174
Faulhaber, 3 Hensel, 46
Faulhaber’s theorem, 11 Hilbert, 207
Fermat, 42, 47, 205 Holst, 46
— number, 47 Hurwitz, 95, 139, 203, 207
—’s little theorem, 42, 237 — integral series, 97
formal Laurent series, 20 — number, 203
formal power series, 13, 14 — zeta function, 139, 209, 210
derivative of —, 19 contour integral representation of —,
integral of —, 20 143
inverse of —, 18 functional equation of —, 147
product of —, 14 values at non-positive integers of —,
substitution of —, 16 144
sum of —, 14 hyperbolic function, 23
Fourier, 163 hypergeometric series, 47, 48
Fourier transform, 163, 193
Freudenthal, 207
Frobenius, 207 I
Fuchs, 49 ideal
fundamental discriminant — and quadratic form, 82
— of quadratic field, 78 — class group, 159
fundamental domain, 221 — group, 159
fundamental period of continued fraction — of order of quadratic field, 79
expansion, 218 equivalence of — (in the narrow, wide
Fundamental theorem of finitely generated sense), 82
abelian groups, 78 fractional —, 157
fundamental theorem on algebra, 48 inverse —, 158
fundamental unit, 90, 91, 219 norm of —, 79
primitive —, 79
proper —, 80
G proper fractional —, 158
Galois theory, 105 standard basis of —, 79
gamma function, 25, 139, 140, 147, 165 imaginary quadratic field, 77
duplication formula of —, 147 indeterminate, 14, 105
reflection formula of —, 147 inverse Fourier transform, 193
Gauss, 47, 47–49, Ireland, 230, 232
75, 91, 92, 105, 107 irregular prime, 43
—ian sum, 104, 107, 108, 111 Iwasawa, 183
— of quadratic form, 108 Iwasawa isomorphism, 186, 188
formula of —, 110
sign of —, 107
conjecture of —, 91 J
Gauss symbol, 59 Jacobi, 3, 46, 48, 203
generalized Bernoulli number, 53, 58, 107, Joachimsthal, 48
110, 111, 127 Joseph von Utzschneider Optical Institute, 46
272 Index
L
O
l’Hôpital, 106
Ochiai, 234
Lampe, 46
odd character, 53
Lang, 155, 168, 183, 188, 189
Olbers, 107
lattice, 157, 216
order
— of quadratic field, 157
— of lattice, 216
double zeta function of —, 220
— of quadratic field, 78
order of —, 216
product of —, 157
Law of Large Numbers, 1
P
Lebesgue, 210
Pólya, 207, 208
— dominant convergence theorem, 210
period of continued fraction expansion, 218
Lebesgue integral, 210
Peters, 47
Leibniz, 115
Piazzi, 107
Leibniz rule, 115
Poisson, 162
Lerch, 209
Poisson summation formula, 163
Lerch type zeta function, 209
poly-Bernoulli number, 223
Lindemann, 73, 207
denominator of —, 227
formula of — in terms of Stirling number,
226
M generating function of —s, 223
Möbius, 170 recurrence formula of —, 224, 225
— function, 170 table of —, 224, 234
— inversion formula, 171 polylogarithm, 223
Maclaurin, 65 prehomogeneous vector space
Mahler, 189 zeta function of —, 135, 155, 177
theorem of —, 189 primitive
Maple, 6 — character, 52
Mathematica, 6 — ideal, 79
maximal order, 77 principal ideal domain, 78
measure projective geometry, 47
— and formal power series, 185, 186, 188, proper ideal, 80
190 purely periodic continued fraction expansion,
— on p-adic number field, 184 218
convolution of —, 188, 195
Mendelssohn, 48
Minkowski, 207 Q
modular form quadratic field, 77
p-adic —, 96 (fundamental) discriminant of —, 78
modular group, 76, 213 class number of order of —, 82
modulus of Dirichlet character, 51 discriminant of order of —, 79
multiple gamma function, 209 ideal group of —, 155
Index 273
imaginary —, 77 Seki, 2, 3, 6
integer of —, 77 Serre, 55, 183
maximal order of —, 77 Shintani, 209
norm of element of —, 78 simplicial cone, 221
order of —, 78 standard basis, 79
class number formula of —, 89 Stark, 209
conductor of —, 79 Stark–Shintani conjecture, 209
ideal of —, 79 Stirling, 25
real —, 77 Stirling number, 25
class field construction over —, 209 — of the first kind, 27, 28, 112, 113
ring of integers of —, 77 recurrence formula of —, 28
trace of element of —, 78 table of —, 29
quadratic form, 75 — of the second kind, 25, 26, 116
— and ideal, 82 recurrence formula of —, 26
— with square discriminant, 86 table of —, 27
class number formula of —s, 87 alternative definition of —, 33
class number of —s, 77 various formulas of —, 28
discriminant of —, 76 structure theorem for finitely generated abelian
equivalence of —, 76 groups, 157
primitive —, 75 sum of powers, 1, 10
quadratic residue proof of formula of —, 6, 58, 73, 116
— symbol, 88 symmetric group, 27
reciprocity law of —, 89 symmetric matrix
quartic surface, 49 equivalence of —, 76
half-integral —, 76, 173
prehomogeneous vector space of symmetric
R matrices, 177
real quadratic field, 77 primitive —, 76
real quadratic irrational number, 216
continued fraction expansion of —, 218
reduced —, 219 T
reduced real quadratic irrational number, 219 tangent number, 24, 98
regular prime, 43 Tanigawa, 36
residue theorem, 60, 141 theorem of Clausen and von Staudt, 25, 41, 95
Riemann, 71 theorem of l’Hôpital, 106
Riemann zeta function, 22, 67, 71, 139, 174 theta function, 108, 162, 164
analytic continuation of —, 71 transformation formula of —, 162, 165
functional equation of —, 148 totally real algebraic number field, 209
special values of —, 6 trace formula, 127
values at negative integers of —, 66, 72, 96 trace of element of quadratic field, 78
values at positive even integers of —, 22, transcendental number, 73, 207
61, 73 trivial character, 51
ring of all integers, 77
Rosen, 230, 232 U
Roth, 213 unique factorization domain, 78
Roth’s theorem, 213 uniqueness of prime ideal decomposition, 156
upper half plane, 162
Utzschneider, 46
S
Scherk, 48
Schönflies, 49 V
Schubert, 207 Vandiver, 232, 237, 238
Schumacher, 46 von Staudt, 34, 41, 47
Schwarz, 49, 207 theorem of Clausen and —, 25, 41, 95
274 Index