Cotangent sums, quantum modular forms, and the generalized Riemann hypothesis

Lewis, John; Zagier, Don

doi:10.1007/s40687-018-0159-8

Cotangent sums, quantum modular forms, and the generalized Riemann hypothesis

Research
Open access
Published: 15 November 2018

Volume 6, article number 4, (2019)
Cite this article

Download PDF

You have full access to this open access article

Research in the Mathematical Sciences Aims and scope Submit manuscript

Cotangent sums, quantum modular forms, and the generalized Riemann hypothesis

Download PDF

John Lewis¹ &
Don Zagier²

2466 Accesses
6 Citations
Explore all metrics

Abstract

We show that an asymptotic property of the determinants of certain matrices whose entries are finite sums of cotangents with rational arguments is equivalent to the GRH for odd Dirichlet characters. This is then connected to the existence of certain quantum modular forms related to Maass Eisenstein series.

On a Category of Cotangent Sums Related to the Nyman-Beurling Criterion for the Riemann Hypothesis

Second-Order Modular Forms with Characters

Rational functions, cotangent sums and Eichler integrals

Article Open access 17 March 2021

1 Introduction and main results

To answer immediately a question that the reader may be asking, we should say from the outset that the generalized Riemann hypothesis (henceforth, just GRH) of the title refers only to the L-series associated with odd real Dirichlet characters and that we do not aim to prove it, but merely to show its equivalence to an asymptotic statement about the determinants of certain matrices whose entries $c_{m,n}$ are given by finite sums of cotangents of rational multiples of $\pi $. To answer a second natural question, since it may seem unusual for an author to contribute to the proceedings volume of a conference partly intended to celebrate his own birthday, we should also say a few words about the genesis of this joint paper. Both the definition of the cotangent sums considered and their relation to GRH were found by the first author and presented in his talk at the conference in question. In the course of that talk he mentioned the homogeneity property $c_{\ell m,\ell n}=\ell c_{m,n}$, leading the second author to ask whether the function $C:{\mathbb {Q}}\rightarrow {\mathbb {R}}$ defined by $C(m/n)=c_{m,n}/n$ might be related to a quantum modular form in the sense of [16]. The answer to this question turned out to be positive. Furthermore, and unexpectedly, this quantum modular aspect turned out also to be related to the earlier joint work of the authors [13] on the analogs of classical period polynomials for Maass waveforms. A glance at the title of the final section of this paper will show why it seemed natural, and even irresistible, to publish both parts of the story in these proceedings.

We now describe the main results of the paper in more detail. We denote by

$$\begin{aligned} \chi (n) = {\left\{ \begin{array}{ll} (-1)^{(n-1)/2} &{} \text {if }n\text { is odd,} \\ 0 &{} \text {if }n\text { is even,} \end{array}\right. } \qquad L(s) = 1 - \frac{1}{3^s} + \frac{1}{5^s} - \frac{1}{7^s} +\cdots . \end{aligned}$$

(1)

the primitive Dirichlet character of conductor 4 and its associated L-series. (In Sect. 5 we will describe the generalization to other odd primitive real Dirichlet characters.) For positive integers m and n we define a real number $c_{m,n}$ by

$$\begin{aligned} c_{m,n} := \frac{4}{\pi }\,\sum _{j,\,k>0} \frac{\chi (j)\,\chi (k)}{\max (j/m,k/n)}, \end{aligned}$$

(2)

where the conditionally convergent sum is to be interpreted as the limit for $N\rightarrow \infty $ of the sum over the rectangle $1\le j\le mN$, $1\le k\le nN$. These numbers are studied in more detail in Sect. 2, where we will see that they are algebraic numbers (in fact, algebraic integers) that can be expressed as finite sums of cotangents of odd multiples of $\pi /4m$ or $\pi /4n$, a typical example being

$$\begin{aligned} c_{3,4} = \cot \left( \frac{\pi }{12}\right) + \cot \left( \frac{5\pi }{12}\right) + \cot \left( \frac{9\pi }{16}\right) + \cot \left( \frac{13\pi }{16}\right) = 6 - 2\,\sqrt{2+\sqrt{2}}. \end{aligned}$$

For each $N\in {\mathbb {N}}$ we define the two symmetric matrices

$$\begin{aligned} \mathsf{C}_N := \begin{pmatrix} c_{1,1} &{} \cdots &{} c_{1,N} \\ \vdots &{} \ddots &{} \vdots \\ c_{N,1} &{} \cdots &{} c_{N,N} \end{pmatrix}, \qquad \widehat{\mathsf{C}}_N := \left( \begin{array}{ccc|c} &{} &{} &{} 1 \\ &{} \mathsf{C}_N &{} &{} \vdots \\ &{} &{} &{} N \\ \hline 1 &{} \cdots &{} N &{} 4N/\pi \\ \end{array}\right) , \end{aligned}$$

(3)

which we will show later are positive definite, and define a function $R:{\mathbb {N}}\rightarrow {\mathbb {R}}$ by

$$\begin{aligned} R(N) := N\,\frac{\det \mathsf{C}_N}{\det \widehat{\mathsf{C}}_N}. \end{aligned}$$

(4)

The first main result of this paper is then the following.

Theorem 1

The following two statements are equivalent:

(a)
The function R(N) is unbounded.
(b)
The L-series L(s) has no zeros in the half-plane $\sigma :=\mathfrak {R}(s)>\frac{1}{2}$ (GRH).

The proof of this theorem will be given in Sect. 3. The direction (a)$\,\Rightarrow \,$(b) is fairly elementary, while the reverse direction uses functional analysis and techniques coming from old work of Beurling. The result that we actually prove (Theorem 3 in §3) is in fact stronger than Theorem 1 in both directions: if GRH holds, then R(N) is not merely unbounded, but actually tends to infinity as $N\rightarrow \infty $, and if L(s) has a zero $\rho $ with $\mathfrak {R}(\rho )>\frac{1}{2}$, then the function R(N) is bounded by a number depending explicitly on $\rho $. Of course we believe that the two statements in Theorem 1 are both true rather than both false, the evidence being on the one hand the generally held belief in the validity of (b) and on the other hand the plot of the function R(N) for $N\le 100{,}000$ shown in Fig. 1, which suggests that R(N) tends to infinity and perhaps even grows roughly linearly with $\log N$. (The straight-line fit shown in the picture is the graph of the function $\,5.18\,\log N$.)

The second main result concerns the quantum modular nature of certain functions related to the coefficients $c_{m,n}$. On the one hand, the numbers $c_{m,n}$ defined by (2) have the obvious homogeneity property $c_{\ell m,\ell n}=\ell \,c_{m,n}$ and can be rewritten in the form

$$\begin{aligned} c_{m,n}= \frac{4n}{\pi }\,\,C\left( \frac{m}{n}\right) , \qquad C(x) := \sum _{j,\,k>0} \chi (j)\,\chi (k)\,\min \left( \frac{x}{j},\,\frac{1}{k}\right) \qquad (x\in {\mathbb {Q}}_{>0}). \end{aligned}$$

(5)

On the other hand, by splitting the sums in (2) or (5) into two pieces depending on which its two arguments realize the “max” or “min,” we obtain the decompositions

$$\begin{aligned} c_{m,n} = h_{m,n}+h_{n,m}, \qquad C(x) = H(x)+x\,H\left( \frac{1}{x}\right) , \end{aligned}$$

(6)

where the function $H:{\mathbb {Q}}_{>0}\rightarrow {\mathbb {R}}$ and numbers $h_{m,n}\in {\mathbb {R}}$ are defined by

(7)

(Here and later an asterisk on a summation sign means that terms with equality—in this case, those with $j=kx$—are to be counted with multiplicity 1/2.) We will show in Sect. 2 that $h_{m,n}$ (or its sum with 1/2 if mn is odd) is a sum of cotangents of odd multiples of $\pi /4n$ and hence is an algebraic integer, refining the corresponding properties for $c_{m,n}$ given above.

We can now come to the quantum modular form property mentioned in the opening paragraph. The graph of the function H on ${\mathbb {Q}}$ is shown in Fig. 2.

This graph suggests that the function H is only naturally defined on ${\mathbb {Q}}$ and is not well behaved as a function on ${\mathbb {R}}$, having a possibly dense set of discontinuities and certainly not differentiable at generic points. The graph of the function C shown in Fig. 3, on the other hand, suggests that it is everywhere continuous and possibly even differentiable except at rational points with odd numerator and denominator, where it seems to be left and right differentiable but with a discontinuous derivative. Our second main result, proved in Sect. 4, refutes the last of these statements and confirms the others.

Theorem 2

The function H(x) is discontinuous at all rational points x with odd numerator and denominator and is continuous but not left or right differentiable at other rational points. The function C(x) is continuous everywhere, is not left or right differentiable at rational points with odd numerator and denominator, but is differentiable at all other rational points.

The similarity between the continuity and differentiability statements for the two functions H(x) and C(x) in this theorem is not coincidental, since we will see in Sect. 4 that the derivative of C(x) is equal to H(1 / x).

We now discuss the “quantum modular form” aspect of Theorem 2. Note first that the function H is periodic of period 4 and hence extends to a periodic function on all of ${\mathbb {Q}}$. This function is even, so we could also extend H from ${\mathbb {Q}}_{>0}$ to ${\mathbb {Q}}$ by $H(x):=H(|x|)$. One might then think of extending C to negative values of the arguments by setting $C(x)=H(x)+xH(1/x)$ for all $x\ne 0$, but if we did that then C(x) for $x<0$ would exhibit oscillatory behavior of the same sort as H(x). Instead, we extend C to all of ${\mathbb {Q}}$ as an even function, i.e., we set $C(x)=C(|x|)$ for $x\ne 0$ and extend this by continuity to $C(0)=0$. Then the relationship between C and H becomes

$$\begin{aligned} H(x) + |x|\,H(1/x) = C(x) \qquad (x\ne 0). \end{aligned}$$

(8)

On the other hand, the 4-periodicity property of H(x) can be strengthened to the statement

$$\begin{aligned} H(x)+ H(x+2)= \frac{\pi }{4}, \end{aligned}$$

(9)

saying that up to sign and up to a constant H actually has period 2. Also, since H is even, we can replace the term H(1 / x) in (8) by $H(-1/x)$. But the two matrices $S=\left( {\begin{matrix}0&{}-1\\ 1&{}0\end{matrix}}\right) $ and $T^2=\left( {\begin{matrix}1&{}2\\ 0&{}1\end{matrix}}\right) $ generate the subgroup $\Gamma _\vartheta $ of index 3 in the full modular group $\,\text {SL}(2,{\mathbb {Z}})=\langle S,T\rangle $ consisting of matrices congruent to $\left( {\begin{matrix}1&{}0\\ 0&{}1\end{matrix}}\right) $ or $\left( {\begin{matrix}0&{}1\\ 1&{}0\end{matrix}}\right) $ modulo 2 (this is the so-called theta group, under which the Jacobi theta function $\vartheta (z)=\sum _ne^{\pi in^2z}$ transforms like a modular form of weight 1/2), so Eqs. (8) and (9) can be combined to the following statement:

Corollary

The function $H:{\mathbb {Q}}\rightarrow {\mathbb {R}}$ satisfies the transformation property

$$\begin{aligned} \varepsilon (\gamma )\,|cx+d|\,H\left( \frac{ax+b}{cx+d}\right) = \,H(x) +\hbox {(continuous function)} \end{aligned}$$

for every matrix $\gamma =\left( {\begin{matrix}a&{}b\\ c&{}d\end{matrix}}\right) \in \Gamma _\vartheta $, where $\varepsilon :\Gamma _\vartheta \rightarrow \{\pm 1\}$ is the homomorphism sending both generators $\left( {\begin{matrix}1&{}2\\ 0&{}1\end{matrix}}\right) $ and $\left( {\begin{matrix}0&{}-1\\ 1&{}0\end{matrix}}\right) $ of $\Gamma _\vartheta $ to $-1$.

This corollary tells us precisely that the function H is a quantum modular form in the sense of [16] (except that there is an absolute value sign here that was not present there). Recall that a “quantum modular form” is by definition a function on ${\mathbb {Q}}$ that does not extend nicely to ${\mathbb {R}}$ but which modulo (the restriction from ${\mathbb {R}}$ to ${\mathbb {Q}}$ of) better-behaved (here, continuous) functions on ${\mathbb {R}}$ transforms like a modular form with respect to some subgroup of finite index of $\text {SL}(2,{\mathbb {Z}})$ (here the theta group $\Gamma _\vartheta $) with some weight (here $-\,1$) and character (here $\varepsilon $). In Sect. 6 of this paper we will show that the quantum modularity property of H is related to an actual modular form (specifically, a Maass Eisenstein series of eigenvalue 1/4) on the group $\Gamma _\vartheta $.

The authors would like to thank Steven Johnson (MIT) and Alexander Weisse (MPI) for invaluable help with the numerical aspects of this paper.

2 Properties of the numbers $c_{m,n}$ and $h_{m,n}$

In this section we give an integral representation for the numbers $c_{m,n}$ that will be used later both to extend the function C(x) continuously from ${\mathbb {Q}}$ to ${\mathbb {R}}$ and as the key to the proof in Sect. 3 of the relation between the numbers $c_{m,n}$ and the zeros of the Dirichlet L-series L(s). We will also prove the formulas for $c_{m,n}$ and $h_{m,n}$ as algebraic numbers (finite sums of cotangents).

The first step is to rewrite the definition of H given in (7) more compactly in the form

$$\begin{aligned} H(x) = \sum _{k=1}^\infty \frac{\chi (k)}{k}\,S(kx) \qquad (x\in {\mathbb {Q}}), \end{aligned}$$

(10)

where S(x) is the function defined on ${\mathbb {R}}_{>0}$ by

(11)

The function S, like H itself, is periodic of period 4, and we can use this property to extend it to all of ${\mathbb {R}}$. (Equivalently, we can extend it by $S(-x)=S(x)$ and $S(0)=0$, or by observing that the summation condition in (11) can be replaced by $J<j\le x$ for any integer $J<x$ divisible by 4, which works for all real x.) The function S is then the step function given by

$$\begin{aligned} S(x) = {\left\{ \begin{array}{ll} 1 &{} \text {if }4k+1<x<4k+3\text { for some }k\in {\mathbb {Z}}, \\ 0 &{} \text {if }4k-1<x<4k+1\text { for some }k\in {\mathbb {Z}}, \\ \frac{1}{2} &{} \text {if }x\text { is an odd integer,} \end{array}\right. } \end{aligned}$$

(12)

as pictured in Fig. 4. Note that the sum in (10) converges (conditionally) for $x\in {\mathbb {Q}}$ by summation by parts, because the function $\chi (k)S(kx)$ is then an odd periodic function of k and hence has average value zero, so that the sums $\sum _{k=1}^K\chi (k)S(kx)$ are bounded.

We use the bounded periodic function S to prove the following formula for the $c_{m,n}\,$:

Proposition 1

The numbers $c_{m,n}$ defined by (2) have the integral representation

$$\begin{aligned} c_{m,n} = \frac{4}{\pi }\,\int _0^\infty S(mt)\,S(nt)\,\frac{\mathrm{d}t}{t^2} \qquad (m,\,n\in {\mathbb {N}}). \end{aligned}$$

(13)

Proof

From the definition of S we find

and since the second sum is identically zero for N divisible by 4, Eq. (13) follows by letting N tend to infinity in $4{\mathbb {Z}}$. $\square $

Our next result expresses $c_{m,n}$ as a finite sum of algebraic cotangent values indexed by lattice points in the rectangle $[0,m/2]\times [0,n/2]$ satisfying a certain “selection rule.”

Proposition 2

Let m and n be positive integers. Then the number defined by (2) is given as the following integral linear combination of cotangents of rational multiples of $\pi $:

$$\begin{aligned} c_{m,n}= \sum _{\begin{array}{c} 0\le j\le m/2\\ 0\le k\le n/2 \end{array}} {\left\{ \begin{array}{ll} \cot (\pi a)-\cot (\pi b) &{} \text {if }a:=\max \left( \frac{4j+1}{4m},\frac{4k+1}{4n}\right) \\ &{}\quad < b:=\min \left( \frac{4j+3}{4m},\frac{4k+3}{4n},\frac{1}{2}\right) , \\ 0 &{}\text {if }a\ge b. \end{array}\right. } \end{aligned}$$

Corollary

The numbers $c_{m,n}$ are algebraic integers for all $m,\,n>0$.

Using this proposition we can calculate any $c_{m,n}$ in closed form as an algebraic number. For example, the first $4\times 4$ matrix of values [cf. Eq. (3)] is given by

$$\begin{aligned} \mathsf{C}_4 = \begin{pmatrix} 1&{}2-\sqrt{2}&{}2-\sqrt{3}&{}2-2\sqrt{2-\sqrt{2}} \\ 2-\sqrt{2}&{}2&{}\sqrt{2}&{}4-2\sqrt{2}\\ 2-\sqrt{3}&{}\sqrt{2}&{}3&{}6-2\sqrt{2+\sqrt{2}}\\ \,2-2\sqrt{2-\sqrt{2}}&{}4-2\sqrt{2}&{}6-2\sqrt{2+\sqrt{2}}&{}4 \end{pmatrix}. \end{aligned}$$

Proof

Since S(x) is even and periodic of period 4, we can rewrite the integral representation (13) using Euler’s formula $\sum _{k\in {\mathbb {Z}}}(k+t)^{-2}=\pi ^2/\sin ^2(\pi t)$ as

$$\begin{aligned} c_{m,n} = \pi \,\int _0^{1/2} S(4mx)\,S(4nx)\,\frac{\mathrm{d}x}{\sin ^2(\pi x)}. \end{aligned}$$

But from the definition of S(4mx) and S(4nx) as characteristic functions, it follows that the restriction of S(4mx)S(4nx) to $[0,\frac{1}{2}]$ is the characteristic function of the union of all intervals [a, b] with $a<b$ as given in the proposition. This proves the proposition because $\pi \int _a^b\csc ^2(\pi x)\,\mathrm{d}x = \cot (\pi a)-\cot (\pi b)$. The corollary then follows because by elementary algebraic number theory the number $\cot (\pi \alpha )$ is an algebraic integer for every rational number $\alpha $ having an even denominator. $\square $

We next give the corresponding formula for the numbers $h_{m,n}$, which is in fact simpler.

Proposition 3

Let m and n be positive integers. Then the number defined by (7) is given as the following half-integral linear combination of cotangents of odd multiples of $\pi /4n$:

$$\begin{aligned} h_{m,n} = \sum _{k=1}^{2n-1} \chi (k)\,S\left( \frac{km}{n}\right) \,\cot \left( \frac{\pi k}{4n}\right) . \end{aligned}$$

(14)

Proof

Since the summand in (10) is an even function of k, we can rewrite the sum as a sum over all nonzero integral values of k and then use the periodicity (with period at most 4n) of the factor $\chi (k)S(km/n)$ together with Euler’s formula $\sum _{n\in {\mathbb {Z}}}(x+n)^{-1}=\pi \cot (\pi x)$ (with the sum being interpreted as a Cauchy principal value) to get

$$\begin{aligned} \begin{aligned} h_{m,n}&= \frac{2n}{\pi }\,\sum _{k\in {\mathbb {Z}}/\{0\}} \frac{\chi (k)}{k}\,S\left( \frac{km}{n}\right) = \frac{2n}{\pi }\,\sum _{k (\text {mod}~4n),\;k\ne 0} \chi (k)\,S\left( \frac{km}{n}\right) \,\sum _{r\in {\mathbb {Z}}}\frac{1}{4nr+k} \\&= \frac{1}{2}\,\sum _{0<k<4n} \chi (k)\,S\left( \frac{km}{n}\right) \,\cot \left( \frac{\pi k}{4n}\right) = \sum _{0<k<2n} \chi (k)\,S\left( \frac{km}{n}\right) \,\cot \left( \frac{\pi k}{4n}\right) \end{aligned} \end{aligned}$$

as claimed. $\square $

Remark

One can also deduce the formula for $c_{m,n}$ in Proposition 2 from (14); this is left as an exercise to the reader. We also observe that the terms on the right-hand side of (14) (assuming, as we can, that m and n are coprime) are integral multiples of cotangents of odd multiples of $\pi /4n$, and hence are algebraic integers (in some cyclotomic field), except for the $k=n$ term if m and n are odd, which is half-integral because then $S(km/n)=\frac{1}{2}$ and $\cot (\pi k/4n)=1$. Hence $h_{m,n}$ is an algebraic integer unless both m and n are odd, in which case it equals an algebraic integer plus 1 / 2, and the sum $c_{m,n}=h_{m,n}+h_{n,m}$ is always an algebraic integer.

3 The connection with the generalized Riemann hypothesis

In the first subsection of this section we outline the basic strategy for proving Theorem 1. The easier direction (a)$\,\Rightarrow \,$(b) of this theorem is then proved in Sect. 3.2 and the more difficult converse direction in Sect. 3.3, using ideas from two classical papers by A. Beurling. As already stated in introduction, we will actually prove the following stronger result:

Theorem 3

Let L(s) be the L-series defined in (1) and R(N) be the function defined in (4).

(a)
If L(s) has no zeros with $\mathfrak {R}(s)>\frac{1}{2}$, then R(N) tends to infinity as N tends to infinity.
(b)
If $L(\rho )=0$ for some $\rho \in {\mathbb {C}}$ with $\mathfrak {R}(\rho )>\frac{1}{2}$, then R(N) satisfies the bound
$$\begin{aligned} R(N) \le \frac{\pi }{8}\,\frac{|\rho |^2}{\mathfrak {R}(\rho ) - \frac{1}{2}} \qquad (\forall N\ge 1). \end{aligned}$$
(15)

3.1 The square-wave function ${\textit{G}}$ and its multiplicative shifts

Set $L^2 = L^2(0, 1)$ and let $\xi $ be the characteristic function of the interval (0, 1] and ${\textit{G}}:{\mathbb {R}}_{>0}\rightarrow {\mathbb {R}}$ the function

$$\begin{aligned} {\textit{G}}(x) = S(1/x) = {\left\{ \begin{array}{ll} 1 &{}\text {if }\frac{1}{4k+3}<x<\frac{1}{4k+1}\text { for some }k\in {\mathbb {Z}}_{\ge 0},\\ 0 &{} \text {otherwise} \end{array}\right. } \end{aligned}$$

(16)

(and ${\textit{G}}(x)=1/2$ if 1 / x is an odd integer, but we are working in $L^2$). Clearly both $\xi $ and ${\textit{G}}$ belong to $L^2$, since they are bounded with support in the interval (0, 1]. On the other hand, if we identify $L^2$ with the space of square-integrable functions on ${\mathbb {R}}_{>0}$ that vanish on $(1,\infty )$, then we have bounded operators $\tau _a$ on $L^2$ defined by

$$\begin{aligned} \tau _af(x) = f(x/a) \qquad (0<a\le 1,\;x\in {\mathbb {R}}_{>0}) \end{aligned}$$

We set $G_a=\tau _aG$ and for any ${\mathcal {D}}\subseteq {\mathbb {R}}$ and $N\in {\mathbb {N}}$ define subspaces ${\mathcal {V}}_{\mathcal {D}}$ and ${\mathcal {V}}_N$ of $L^2$ by

$$\begin{aligned} {\mathcal {V}}_{\mathcal {D}}= \langle G_a|a\in {\mathcal {D}}\cap (0,1]\rangle , \quad {\mathcal {V}}_N = \langle G_{n/N}|1\le n\le N\rangle , \end{aligned}$$

(17)

where $\langle S\rangle $ denotes the linear span of a subset S of a linear space. Finally, we denote by $\mathsf{d}(\xi ,{\mathcal {V}}_N)$ the $L^2$ distance from $\xi \in L^2$ to the subspace ${\mathcal {V}}_N $, viz.

$$\begin{aligned} \mathsf{d}(\xi ,{\mathcal {V}}_N) = \left\| \xi - {\mathcal {V}}_N\right\| = \inf _{x\in {\mathbb {R}}^N}\left\| \xi - \sum _{n=1}^Nx_n\,\tau _{n/N}{\textit{G}}\right\| . \end{aligned}$$

(18)

The key connection is then given by the following proposition.

Proposition 4

The distance $\mathsf{d}(\xi ,{\mathcal {V}}_N)$ is related to the number R(N) defined in (4) by

$$\begin{aligned} \mathsf{d}(\xi ,{\mathcal {V}}_N)^2 = \frac{\pi /4}{R(N)} \qquad (N\in {\mathbb {N}}). \end{aligned}$$

(19)

The proof of this, which is a straightforward computation, will be given in Sect. 3.2. On the other hand, a quite elementary argument, also given in Sect. 3.2, proves the following

Proposition 5

If $L(\rho )=0$ with $\mathfrak {R}(\rho )>\frac{1}{2}$ then $\mathsf{d}(\xi ,{\mathcal {V}}_N)^2 \ge \frac{2\mathfrak {R}(\rho )-1}{|\rho |^2}$ for all $N\in {\mathbb {N}}$ .

Together these two results prove statement (b) of Theorem 3. The harder statement (a) follows from the two following results, proved in Sects. 3.2 and 3.3, respectively.

Proposition 6

The space ${\mathcal {V}}_{\mathcal {D}}$ is dense in ${\mathcal {V}}_{\mathbb {R}}$ for any set ${\mathcal {D}}$ that is dense in (0, 1).

Proposition 7

If L(s) has no zeros in $\sigma >\frac{1}{2}$, then ${\mathcal {V}}_{\mathbb {R}}$ is dense in $L^2$.

The whole argument is summarized by the following diagram, where the flow of the double arrows shows that the six boxed statements are mutually equivalent.

(The implications labeled “Prop. 4” follow in one direction because each ${\mathcal {V}}_N$ is contained in ${\mathcal {V}}_{\mathbb {Q}}$ and in the other because if R(N) were bounded for some infinite sequence of integers $\{N_i\}$ then the set $\cup _i{\mathcal {V}}_{N_i}$ would be dense in ${\mathcal {V}}_{\mathbb {R}}$ but have a nonzero distance from $\xi $.)

3.2 Proof of Propositions 4, 5 and 6

Recall from linear algebra that the distance d(v, V) from a vector v in a (real) Hilbert space to a finite-dimensional vector space V with basis $\{v_1,\dots ,v_N\}$ is given by the formula $d(v,V)^2=\det (\widehat{G})/\det (G)$, where G is the $N\times N$ Gram matrix with (m, n)-entry $\langle v_m,v_n\rangle $ (scalar product of the basis elements $v_m$ and $v_n$) and $\widehat{G}$ the augmented $(N+1)\times (N+1)$ matrix obtained from G by adding a row and column with entries $\langle v,v_n\rangle $ and a diagonal entry $\langle v,v\rangle $. We apply this to the vector $v=\xi \in L^2$ and the space ${\mathcal {V}}_N$ defined as above as the span of the vectors $G_{n/N}\in L^2$ ($n=1,\dots ,N$). (Here we mean the real rather than the complex span, i.e., we work in the real Hilbert space of real-valued square-integrable functions on (0, 1).) We also denote by $\widehat{{\mathcal {V}}}_N$ the span of ${\mathcal {V}}_N$ and $\xi $. We will show in a moment that the vectors $G_{n/N}$ and $\xi $ are linearly independent, so $\dim \widehat{{\mathcal {V}}}_N=\dim {\mathcal {V}}_N+1=N+1$. We can then compute the entries of the Gram matrices as

$$\begin{aligned} \langle \xi ,\xi \rangle = \int _0^1\mathrm{d}x = 1,\qquad \langle \xi ,G_{n/N}\rangle = \int _0^1S\left( \frac{n}{Nx}\right) \,\mathrm{d}x = \frac{n}{N}\,L(1) = \frac{\pi n}{4N} \end{aligned}$$

[the first is trivial and the second a special case of Eq. (21)] and

$$\begin{aligned} \langle G_{m/N},\,G_{n/N}\rangle = \int _0^1 S\left( \frac{m}{Nx}\right) \,S\left( \frac{n}{Nx}\right) \,\mathrm{d}x = \frac{m}{N}\,C\left( \frac{m}{n}\right) = \frac{\pi }{4N}\,c_{m,n} \end{aligned}$$

[combine Eqs. (25) and (5)]. Proposition 4 follows.

To find the connection with the L-series L(s) and its complex zeros, we next consider the “complex monomial functions” $\mathsf{p}_s$ on (0, 1] defined by

$$\begin{aligned} \mathsf{p}_s(t) = t^{s-1} \xi (t) = t^{s-1} \qquad (0<t\le 1,\; s\in {\mathbb {C}}). \end{aligned}$$

Note the key fact that $\mathsf{p}_s \in {L}^2$ if and only if $\sigma > \frac{1}{2}$, and also that for $f \in L^2$ and $\sigma > \frac{1}{2}$, we have $\langle \mathsf{p}_s, f \rangle $ = $\widetilde{f}(s)$, where $\widetilde{f}(s)$ denotes the Mellin transform

$$\begin{aligned} \widetilde{f}(s) = \int _0^\infty f(x)\, x^{s-1}\,\mathrm{d}x = \int _0^1 f(x)\, x^{s-1}\,\mathrm{d}x. \end{aligned}$$

(Here we are again identifying $L^2=L^2(0,1)$ with the space of functions $L^2({\mathbb {R}}_{>0})$ supported on (0, 1].) Then obviously

$$\begin{aligned} \langle \mathsf{p}_s,\xi \rangle = \int _0^1 x^{s-1}\,\mathrm{d}x =\frac{1}{s}, \end{aligned}$$

(20)

and the Mellin transform of $G_a$ is almost equally easy to calculate using (12):

$$\begin{aligned} \langle \mathsf{p}_s,{\textit{G}}_a\rangle =\int _0^\infty S\left( \frac{a}{x}\right) \,x^{s-1}\,\mathrm{d}x = \frac{1}{s}\,\sum _{k=0}^\infty \,\left( \frac{a^s}{(4k+1)^s} - \frac{a^s}{(4k+3)^s}\right) = \frac{a^s}{s}\, L(s), \end{aligned}$$

(21)

the calculation being valid for all s in $\mathfrak {R}(s)>0$ since the integral then converges absolutely. One consequence of this is that the functions $G_a$ ($0<a\le 1$) are all linearly independent, since if some (finite) linear combination of them vanished then the vanishing of its Mellin transform would imply that the product of L(s) with a finite Dirichlet series vanished identically (first for $\mathfrak {R}(s)>0$, and then for all s by analytic continuation). In particular, the space ${\mathcal {V}}_N$ has dimension exactly N. In fact, almost the same argument also shows that the functions $G_a$ and $\xi $ are linearly independent, since a linear relation among them would imply that the product of L(s) with a finite Dirichlet series was constant, which is obviously impossible. It follows that the extended space $\widehat{{\mathcal {V}}}_N$ spanned by the functions ${\textit{G}}_{n/N}$ ($1\le n\le N$) and $\xi $ has dimension $N+1$, as claimed above.

The proof of the direction (a)$\,\Rightarrow \,$(b) of Theorem 1 now follows easily. If there exists $\rho $ with $ \mathrm {Re}\,(\rho ) > \frac{1}{2}$ and $L(\rho ) = 0$ then by (21) we have $\mathsf{p}_{\rho } \perp {\mathcal {V}}_N $ for all N. Since ${\mathcal {V}}_N$ is finite-dimensional, the infimum in Eq. (18) is attained, so that R(N) is unbounded (and hence $\mathsf{d}(\xi ,{\mathcal {V}}_N)$ has lim inf equal to 0) if and only if there is a sequence of functions $h_N \in {{\mathcal {V}}}_N$ converging in $L^2$ to $\xi $, and this would contradict $\mathsf{p}_{\rho } \perp {\mathcal {V}}_N $ for all N since $\langle \mathsf{p}_\rho ,\xi \rangle =\rho ^{-1}\ne 0$.

This argument actually proves the stronger statement given in Proposition 5 and in part (b) of Theorem 3. Indeed, if $\rho $ is a zero of L(s) with $\mathfrak {R}(\rho )>\frac{1}{2}$, then on the one hand $\mathsf{p}_\rho $ belongs to $L^2$ as already noted, with $L^2$ norm

$$\begin{aligned} \left\| \mathsf{p}_\rho \right\| ^2 = \int _0^{1}\left| x^{\rho -1}\right| ^2\,\mathrm{d}x = \frac{1}{2\,\mathfrak {R}(\rho ) - 1}, \end{aligned}$$

and on the other hand $\langle \mathsf{p}_\rho ,f\rangle =0$ for any $f\in {\mathcal {V}}_N$ by virtue of (21). We apply this to the function $f=f_N\in {\mathcal {V}}_N$ attaining the minimum distance to $\xi $. Then applying the Cauchy–Schwarz inequality to the scalar product $\langle \mathsf{p}_\rho ,\xi -f_N\rangle $ we get [using (20)]

$$\begin{aligned} \frac{1}{|\rho |^2} = \left| \langle \mathsf{p}_\rho ,\xi \rangle \right| ^2 = \left| \langle \mathsf{p}_\rho ,\xi -f_N\rangle \right| ^2 \le \Vert \mathsf{p}_\rho \Vert ^2\,\Vert \xi -f_N\Vert ^2 = \frac{d(\xi ,{\mathcal {V}}_N)^2}{2\,\mathfrak {R}(\rho ) - 1}, \end{aligned}$$

and (15) now follows immediately from (19).

Finally, we prove Proposition 6 (“density lemma”). We first note that the $L^2$ norm $\Vert {\textit{G}}_a-{\textit{G}}_b\Vert $ tends to 0 as a tends to b, since by the calculation $\langle {\textit{G}}_a,{\textit{G}}_b\rangle =aC(a/b)$ given above (for rational numbers, but true in general by the same argument) we have

$$\begin{aligned} \Vert {\textit{G}}_a-{\textit{G}}_b\Vert ^2 = \langle {\textit{G}}_a,{\textit{G}}_a\rangle +\langle {\textit{G}}_a,{\textit{G}}_b\rangle - 2\,\langle {\textit{G}}_a,{\textit{G}}_b\rangle = (a+b)\,C(1) - 2a\,C(b/a), \end{aligned}$$

which tends to 0 as $b\rightarrow a$ because C(x) is a continuous function. It follows immediately that if ${\mathcal {D}}$ is dense in (0, 1) then the vector ${\textit{G}}_a\in L^2$ for any $a<1$ lies in the closure of ${\mathcal {V}}_{\mathcal {D}}$, and since ${\mathcal {V}}_{\mathbb {R}}$ is spanned by such vectors, this shows that $\overline{{\mathcal {V}}_{\mathbb {R}}}=\overline{{\mathcal {V}}_D}$ as claimed.

3.3 Proof of Proposition 7

It remains to show that $\mathsf{GRH}$ implies that $\overline{{\mathcal {V}}_{\mathbb {R}}}=L^2$.

Let ${H}^2 = {H}^2({\mathcal {R}}) $ denote the Hardy space of the right half-plane ${\mathcal {R}}=\{s\mid \mathfrak {R}(s)>\frac{1}{2}\}$, that is, the space of all holomorphic functions F(s) on ${\mathcal {R}}$ that are square-integrable on the lines $\mathfrak {R}(s)=c$ for all $c>\frac{1}{2}$ and with these $L^2$ norms $\Vert \ \Vert _c$ bounded in c, and then equipped with the Hilbert space norm $\Vert F\Vert = \sup \{\Vert F\Vert _c \mid c>\frac{1}{2}\}$. We quote some results from the standard analysis of these spaces, citing as we go the relevant sections in the books by Hoffman [11] and Garnett [8] for the details.

The Paley–Wiener theorem [11, pp. 131–132] gives the fact that the Mellin transform is (up to scalar multiple) an isometry between the spaces $L^2 $ and ${H}^2$. Applying this to the function ${\textit{G}}$ implies that its Mellin transform $\,\widetilde{\textit{G}}(s)=L(s)/s\,$ lies in the space ${H}^2$, a fact which is also easy to see directly from the definition of ${H}^2$. Next, we note that if we assume that $\mathsf{GRH}$ holds, so that $\widetilde{\textit{G}}(s)$ has no zeros in ${\mathcal {R}}$, then the factorization theorem for functions in the space ${H}^2$ [11, pp. 132–133] applied to the function $\widetilde{\textit{G}}(s)$ gives the implication

$$\begin{aligned} L(s)\textit{ has no zeros in } \sigma >\tfrac{1}{2} \quad \Longrightarrow \quad \widetilde{\textit{G}}(s)\textit{ is an outer function.} \end{aligned}$$

(22)

Indeed, the factorization theorem represents any element F of ${H}^2$ as the product of a “Blaschke product,” a “singular function” and an “outer function.” Here we can omit the definitions of all three of these since the Blaschke product is defined as a product over the zeros of F in ${\mathcal {R}}$ and hence is equal to 1 if F has no such zeros, and the singular function factor is also constant because $\widetilde{\textit{G}}(s)$ both continues across the line $\mathfrak {R}(s)=\frac{1}{2}$ and has slow rate of convergence to zero as $s \rightarrow \infty $ on the positive reals (cf. 3.14 in [1] and Chapter II, Theorem 6.3, of [8]). This leaves only the outer function factor of $\widetilde{\textit{G}}(s)$, and here we can apply a corollary of a theorem of Beurling on shift-invariant subspaces in ${H}^2$ ([4, 11, pp. 99–101]), which tells us that a function $F\in {H}^2$ is outer if and only if the space

$$\begin{aligned} {\mathcal {W}}(F) := {\mathbb {C}}[1/s]\,F(s) = \left\langle s^{-n}F(s)\mid n\ge 0\right\rangle \end{aligned}$$

is dense in ${H}^2$. (Notice that ${\mathcal {W}}(F)$ is contained in ${H}^2$ because the definition of the ${H}^2$ norm implies that $\Vert s^{-1}F(s)\Vert _{{H}^2}\le 2\Vert F\Vert _{{H}^2}<\infty $ for $F\in {H}^2$, so $s^{-1}{H}^2\subset {H}^2\,$.)

It follows that GRH implies that ${\mathcal {W}}(\widetilde{\textit{G}})$ is dense in ${H}^2$. We can now apply the inverse Mellin transform from ${H}^2$ to $L^2$ to obtain the isometrically equivalent picture in $L^2$. An easy calculation shows that $s^{-1}\widetilde{f}(s)$ is the Mellin transform of If for any $f\in L^2$, where

$$\begin{aligned} If(x) := \int _x^\infty f(t) \, \frac{\mathrm{d} t}{t} , \end{aligned}$$

and hence that the space ${\mathcal {W}}(\widetilde{f})$ is the Mellin transform of the space

$$\begin{aligned} W(f) := {\mathbb {C}}[I]\,f = \left\langle I^n f\mid n\ge 0\right\rangle \qquad (f\in L^2). \end{aligned}$$

It is not actually obvious that I preserves the space $L^2$, but this follows from the Paley–Wiener theorem together with the fact just shown that multiplication by $s^{-1}$ preserves ${H}^2$. A more direct argument is that the operator I is the adjoint $\mathsf{A}^{*}$ of the averaging operator

$$\begin{aligned} \mathsf{A}h(x) = \frac{1}{x}\,\int _0^x h(t)\,\mathrm{d}t , \end{aligned}$$

as is easily checked, and since $\mathsf{A}$ is a bounded operator on $L^2(0,\infty )$ (cf. [10, Example 5.4, p. 23]), its adjoint I is also bounded on $L^2(0,\infty )$. But I also preserves the condition of having support in (0,1), so it is a bounded operator on $L^2$.

As a consequence implication (22) translates on the $L^2$ side to the statement

$$\begin{aligned} L(s)\textit{ has no zeros in }\sigma >\tfrac{1}{2} \quad \Longrightarrow \quad W({\textit{G}})\textit{ is dense in}~L^2. \end{aligned}$$

(23)

Proposition 7 is then obtained by applying to $f={\textit{G}}$ the following lemma, in which ${\mathcal {V}}(f)$ for any $f\in L^2$ denotes the span of the multiplicative translates $\tau _af$ ($0<a\le 1$).

Lemma

For $f \in L^2$ we have $W(f)\subset \overline{{\mathcal {V}}(f)}\,$.

Proof

We first note that it suffices just to show that $If \in \overline{{\mathcal {V}}(f)}$ for all $f\in L^2$, because then $I^n f \in \overline{{\mathcal {V}}(f)}$ for all $n\ge 1$ by induction and the remark that $g\in \overline{{\mathcal {V}}(f)}\implies \overline{{\mathcal {V}}(g)}\subseteq \overline{{\mathcal {V}}(f)}$. Since $(W^\perp )^\perp = {\overline{W}}$ for all subspaces W, we need to show that

$$\begin{aligned} \langle h, \tau _af \rangle = 0 \quad \text {for all }a\in (0,1] \implies \langle If,\,h \rangle = 0 . \end{aligned}$$

(24)

But since $a\tau _{a^{-1}}$ is the adjoint of $\tau _a$ and $\mathsf{A}$ is the adjoint of I, this statement just says that any function that is orthogonal to all h(ax) with $0<a\le 1$ is also orthogonal to $\mathsf{A}h$, and this is obvious because $\mathsf{A}h(x)=\int _0^1h(ax)\,\mathrm{d}a$. $\square $

4 Analytic properties of C(x) and H(x)

In this section we study the analytic properties of C(x) and H(x), discussing in particular the question of their extendability from ${\mathbb {Q}}$ to ${\mathbb {R}}$ (which we answer completely only in the case of C(x)) and describing the asymptotic properties in neighborhoods of rational points needed to establish Theorem 2 and its corollary (quantum modularity). We also discuss the relation of these two functions to the Dirichlet series L(s). It should be mentioned in passing that much of the material in this section is similar in spirit to the papers [2, 3] of Bettin and Conrey, with $\chi $ replaced by the trivial character and L(s) by the Riemann zeta function.

4.1 Sum and integral representations of C(x) and H(x)

Proposition 8 gives a number of formulas for these two functions. However, the word “function” has a slightly different meaning in the two cases. We have already defined them both as functions on ${\mathbb {Q}}$. Part (i) of the proposition says that C extends continuously to ${\mathbb {R}}$, and parts (iv) and (v) give further properties of that function. But in the case of H we do not know to what extent it can be defined as a function on ${\mathbb {R}}$. (This question is discussed in Sect. 4.2, though not in great detail since this is not our main interest.) However, it can certainly be defined as a distribution on ${\mathbb {R}}$, in fact even as the derivative of a continuous function, since by integrating (10) we get the absolutely convergent sum representation

$$\begin{aligned} \int _0^xH(x')\,\mathrm{d}x'=\frac{\pi x}{8} - \frac{1}{2}\sum _{k=1}^\infty \frac{\chi (k)}{k^2}\,A\left( \frac{kx}{2}\right) \end{aligned}$$

for its integral, where A is the periodic continuous function on ${\mathbb {R}}$ defined by $A(n+\epsilon )=(-1)^n\epsilon $ for $n\in {\mathbb {Z}}$ and $|\epsilon |\le \frac{1}{2}$. Parts (ii) and (iii) of the proposition are then to be interpreted in the sense of distributions, one giving the Fourier expansion of the even periodic distribution H and the other its relation, already mentioned in introduction, with the derivative of C.

Proposition 8

(i)
The function C(x) extends continuously from ${\mathbb {Q}}$ to ${\mathbb {R}}$ via the formula
$$\begin{aligned} C(x) = \int _0^\infty S(t)\,S(xt)\,\frac{\mathrm{d}t}{t^2}\qquad (x\in {\mathbb {R}}). \end{aligned}$$
(25)
(ii)
The distribution H has the Fourier expansion
$$\begin{aligned} H(x)=\frac{\pi }{8} - \frac{2}{\pi }\,\sum _{n=1}^\infty \frac{\chi (n)\,d(n)}{n}\,\cos \left( \frac{\pi nx}{2}\right) , \end{aligned}$$
(26)
where d(n) denotes the number of positive divisors of n.
(iii)
The function C(x) and the distribution H(x) are related by the equation
$$\begin{aligned} C'(x) = H(1/x)\qquad (x>0). \end{aligned}$$
(27)
(iv)
The function C(x) is also given by the sum
$$\begin{aligned} C(x)=\frac{\pi }{8} + x\,\sum _{n=1}^\infty \chi (n)\,d(n)\,J\left( \frac{\pi nx}{2}\right) \qquad (x>0), \end{aligned}$$
(28)
where J(x) denotes the modified cosine integral
$$\begin{aligned} J(x) = -\int _x^\infty \frac{\cos (t)}{t^2}\,\mathrm{d}t\qquad (x\in {\mathbb {R}}_{>0}). \end{aligned}$$
(29)
(v)
The Mellin transform of C(x) is given by
$$\begin{aligned} \widetilde{C}(s) = -\,\frac{L(-s)\,L(s+1)}{s\,(s+1)} \qquad (-1<\sigma <0), \end{aligned}$$
(30)
where L(s) is the analytic continuation of the L-series defined in (1).

Proof

Formula (25) for $x\in {\mathbb {Q}}_{>0}$ is precisely the statement of Proposition 1 after an obvious change of variables, and the convergence and continuity of the integral for all x in ${\mathbb {R}}_{>0}$ are easy consequences of the facts that S is supported on $[1,\infty )$ and is locally constant and bounded. To prove (ii), we insert the (standard) Fourier expansion

$$\begin{aligned} S(x)=\frac{1}{2} - \frac{2}{\pi }\sum _{m=1}^\infty \frac{\chi (m)}{m}\,\cos \left( \frac{\pi mx}{2}\right) , \end{aligned}$$

(31)

of the even periodic function S(x) into sum (10), combine the double sum into a single one and use Leibniz’s formula $L(1)=\pi /4$. For (iii), we insert (11) into (25) to get

$$\begin{aligned} C(x) = \int _1^\infty \sum _{0<k<xt}\chi (k)\,S(t)\,\frac{\mathrm{d}t}{t^2} = \sum _{k=1}^\infty \chi (k)\,\int _{k/x}^\infty \frac{S(t)}{t^2} \,\mathrm{d}t , \end{aligned}$$

(32)

from which Eq. (27) follows immediately by differentiating and using (10). (The easy justification of these steps in the sense of distributions by integrating against smooth test functions is left to the reader.) Finally, to prove (iv) we combine (27) and (6) to get

$$\begin{aligned} -\,x^2\,\frac{ \mathrm{d}}{\mathrm{d}x}\left( \frac{C(x)}{x}\right) = C(x) - x\,C'(x) = C(x) - x\,H\left( \frac{1}{x}\right) = H(x) \end{aligned}$$

and then insert Fourier development (26) and integrate term-by-term. This gives (28) up to an additive term $\lambda x$ that can be eliminated by noting that C is bounded. Note that the sum in (28) is uniformly and absolutely convergent since a simple integration-by-parts argument shows that $J(x)=\text {O}(x^{-2})$ as $x\rightarrow \infty $. Finally, for (v) we note first that the Mellin transform $\widetilde{S}(s):=\int _0^\infty S(x)\,x^{s-1}\mathrm{d}x$ of S is given according to Eq. (21) by $\widetilde{S}(s)=-L(-s)/s$ for $\sigma =\mathfrak {R}(s)<0$, and then use (25) to get

$$\begin{aligned} \widetilde{C}(s) = \int _0^\infty S(t)\,\frac{\widetilde{S}(s)}{t^s}\,\frac{\mathrm{d}t}{t^2} = \widetilde{S}(s)\,\widetilde{S}(-s-1) = -\,\frac{L(-s)\,L(s+1)}{s\,(s+1)} \end{aligned}$$

for $-1<\sigma <0$, as claimed. (The Mellin transform exists in this strip because $C(x)=\text {O}(\min (|x|,1))\,$ as an easy consequence of (25).) In particular, $\widetilde{C}(s)$ extends meromorphically to all of ${\mathbb {C}}$ and is invariant under $s\mapsto -s-1$. We also note that by the well-known functional equation of L(s), Eq. (30) can be written in the alternative form

$$\begin{aligned} \widetilde{C}(s)= -\,\frac{(2/\pi )^{s+1}\Gamma (s)\,\cos (\pi s/2)}{s+1}\,L(s+1)^2. \end{aligned}$$

We can then use this to give a second derivation of (28), not making use of the distribution H, by first applying the Mellin inversion formula to write C(x) as an integral over a vertical line $\sigma =c$ with $-1<c<0$ and then shifting the path of integration to the right, picking up the term $\pi /8$ in (28) from the residue at $s=0$ and permitting us to use the convergent Dirichlet series representation $L(s)^2 = \sum _{n=1}^\infty \chi (n)d(n)n^{-s}$ for $\sigma >1$. (We omit the details of this calculation.) $\square $

4.2 The function H on the real line

In this subsection we give two one-parameter generalizations of H(x) and discuss possible ways to define this function at irrational arguments.

We first generalize the definitions of H(x) and C(x) and the assertions of Proposition 8 to families depending on a complex parameter s. Specifically, we can generalize (10) to

$$\begin{aligned} H_s(x) := \sum _{k=1}^\infty \frac{\chi (k)}{k^s}\,S(kx)\qquad (x\in {\mathbb {R}},\;\sigma =\mathfrak {R}(s)>1), \end{aligned}$$

(33)

which now converges absolutely. The same calculation as for H(x) then gives the Fourier expansion

$$\begin{aligned} H_s(x)=\frac{L(s)}{2} - \frac{2}{\pi }\,\sum _{n=1}^\infty \frac{\chi (n)\,\sigma _{s-1}(n)}{n^s}\,\cos \left( \frac{\pi nx}{2}\right) , \end{aligned}$$

where L(s) is the Dirichlet L-function defined in (1) and $\sigma _\nu $ is the sum-of-divisors function $\sigma _\nu (n)=\sum _{k|n,k>0}k^\nu $. Similarly, we define $C_s(x)=x^sC_s(1/x)$ for all $s\in {\mathbb {C}}$ with $\sigma >0$ by

$$\begin{aligned} C_s(x) = s\int _0^\infty S(t)\,S(xt)\,\frac{\mathrm{d}t}{t^{s+1}} = \sum _{j,\,k>0} \frac{\chi (j)\,\chi (k)}{\max (j/x,k)^s}, \end{aligned}$$

(34)

where the equality of the two expressions is proved by the same calculation as for the proof of (25). Then $C_s(x)$ and $H_s(x)$ are related by

$$\begin{aligned} C_s(x) = H_s(x)+x^s\,H_s\left( \frac{1}{x}\right) , \qquad C_s'(x) = s\,x^{s-1}H_s\left( \frac{1}{x}\right) \qquad (x>0) \end{aligned}$$

(35)

by the same calculations as in the special case $s=1$ and one has an expansion like (28) but with J(x) replaced by $-\int _x^\infty t^{-s-1}\cos (t)\,\mathrm{d}t$. As long as $\sigma >1$ all of the sums and integrals involved are absolutely convergent and all steps are justified.

A different possible way to regularize H(x) for $x\in {\mathbb {R}}$ is to define

$$\begin{aligned} H(x) = \lim \limits _{\varepsilon \searrow 0}\left( \dfrac{8}{\pi }-\dfrac{2}{\pi }T(x,\varepsilon )\right) , \quad \text {where}\quad T(x,\varepsilon ) = \sum _{n=1}^\infty \frac{\chi (n)\,d(n)}{n}\,e^{-n\varepsilon }\,\cos (n\pi x/2).\nonumber \\ \end{aligned}$$

(36)

Using the identity $\sum _{n>0}\dfrac{d(n)}{n}\,x^n = \sum _{k>0}\dfrac{1}{k}\,\log \left( \dfrac{1}{1-x^k}\right) $ and standard trigonometric identities, we find after a short calculation that

$$\begin{aligned} T(x,\varepsilon ) = \frac{1}{2}\,\sum _{k=1}^\infty \frac{\chi (k)}{k}\,\arctan \left( \frac{\cos (k\pi x/2)}{\sinh (k\varepsilon )}\right) . \end{aligned}$$

Both this series and the original one defining T converge exponentially fast for any positive $\varepsilon $, giving us another possible approach to the analytic properties of the limiting function H.

Summarizing this discussion, we have at least five potential definitions of H(x) for $x\in {\mathbb {R}}$:

Definition 1

Define H(x) by series (10), if this sum converges.

Definition 2

Define H(x) by Fourier series (26), if this series converges.

Definition 3

Define H(x) as the limit for $s\searrow 1$ of series (33), if this limit exists.

Definition 4

Define H(x) as the limit in (36), if this limit exists.

Definition 5

Define H(x) as the limit of $H(x')$ as $x'\in {\mathbb {Q}}$ tends to x, if this limit exists.

We can then ask—but have not been able to answer—the question whether any or all of these definitions converge for irrational values of x or, if they do, whether they give the same value. The last definition is in the sense the strongest one, since any definition of H on ${\mathbb {R}}$ or a subset of ${\mathbb {R}}$ that agrees with the original definition on ${\mathbb {Q}}$ must coincide with the value in Definition 5 at any argument x at which this function is continuous. In any case, we pose the explicit question:

Question

Are the five definitions given above convergent and equal to one another for all irrational values of x, or for all x belonging to some explicit set of measure 1?

It seems reasonable to expect the answer to the second question to be affirmative with the set of measure 1 being the complement of some set of irrational numbers having extremely good rational approximations, like the well-known “Brjuno numbers” in dynamics.

4.3 Asymptotic behavior of H(x) and C(x) near rational points

The results of Sect. 4.1 prove the continuity of C(x), which is part of the statement of Theorem 2 in Introduction. We now discuss the remaining statements there, concerning the behavior of C and H near rational points.

We begin by looking numerically at the asymptotic properties of C(x) near $x=1$. Computing the values of $C\left( 1\pm \frac{1}{n}\right) $ for $1\le n\le 1000$ and using a numerical interpolation technique that is explained elsewhere (see, e.g., [9]), we find empirically an expansion of the form

$$\begin{aligned} C\left( 1\pm \frac{1}{n}\right) \sim c_0 + \frac{1}{n}\,\left( -\frac{1}{4}\,\log n+c_1^\pm \right) \pm \frac{c_2}{n^2}+\frac{c_3}{n^3} \pm \frac{c_4}{n^4}+\frac{c_5}{n^5} \pm \cdots \end{aligned}$$

(37)

with the first few coefficients having numerical values given by

$$\begin{aligned} \begin{aligned}&c_0=\,C(1)=\frac{\pi }{4},\quad c_1^+=c_1^-+c_0=-0.23528274848426799887\ldots , \quad c_2 = -\frac{1}{8}, \\&c_3 = 0.058801396529669\ldots ,\quad c_4 = \frac{1}{48} - c_3,\quad c_5 = 0.01927655829\ldots . \end{aligned} \end{aligned}$$

These numerics become clearer if we work with H(x) instead, where we find the simpler expansion

$$\begin{aligned} H\left( 1\pm \frac{1}{n}\right) \sim \pm \,\left( \frac{1}{4}\,\log n + h_0^\pm + \frac{h_2}{n^2}+\frac{h_4}{n^4} + \cdots \right) \end{aligned}$$

(38)

with no odd powers of 1 / n and with coefficients given numerically by

$$\begin{aligned} h_0^+= & {} h_0^-+c_0=0.7706809118817\ldots , \quad h_2=0.01713472986\ldots , \\ h_4= & {} -0.00345272385\ldots , \end{aligned}$$

which then gives expansion (37) with $c_1^\pm =-h_0^\mp -\dfrac{1}{4}$, $c_3=h_2+\dfrac{1}{24}$, and more generally $c_r=(-1)r\sum _{0<i<r/2}\left( {\begin{array}{c}r-1\\ 2i-1\end{array}}\right) h_{2i}$ for $r>1$. Moreover, on calculating the next few values of $h_{2i}$ numerically to high precision we are able to guess the closed formulas

$$\begin{aligned} h_0^\pm = \pm \frac{\pi }{8}+\frac{\gamma }{4} + \frac{1}{4}\log \frac{8}{\pi }, \qquad h_{2i} = (-1)^{i-1}\,\frac{(2^{2i-1}-1)^2\,B_{2i}^2}{2i\,(2i)!}\,\pi ^{2i}\;\quad (i\ge 1) \end{aligned}$$

(39)

for the coefficients in (38), where $\gamma $ is Euler’s constant and $B_n$ the nth Bernoulli number. In fact, it is not difficult to prove (38) [and hence also (37)] with these values of $h_i$ by applying a twisted Euler–Maclaurin formula (giving the asymptotics as $\varepsilon \rightarrow 0$ of sums over an interval of $\chi (n)f(n\varepsilon )$ for a smooth function f) to the closed formula

$$\begin{aligned} H\left( 1\pm \frac{1}{n}\right) = \frac{4n}{\pi }\,\sum _{\begin{array}{c} 0<k<2n\\ k\equiv 1\pmod 4 \end{array}}\cot \frac{\pi k}{4n} \end{aligned}$$

which follows easily from (14), though we will not carry this out here.

The “$\log n$” terms in Eqs. (37) and (38) already show that C(x) is not differentiable at $x=1$ and H(x) is not continuous at $x=1$, but from these formulas one might imagine that the functions $C(x)-|x-1|\log (|x-1|)/4$ and $H(x)-\log (|x-1|)/4$ are $C^\infty $ to both the right and the left of this point. However, this is not the case, because both Eqs. (37) and (38) are valid only when n is an integer and change in other cases. For instance, if n tends to infinity in ${\mathbb {N}}+\frac{1}{2}$ rather than ${\mathbb {N}}$, then $H(1\pm 1/n)$ has an expansion of the same form as (38) and with the same constants $h_0^\pm $, but with $h_2=\pi ^2/576$ replaced by $-7\pi ^2/1152$, $h_4=-49\pi ^4/1{,}382{,}400$ replaced by $127\pi ^4/2{,}764{,}800$, etc. This statement, which again can be proved using an appropriate twisted version of the Euler–Maclaurin formula, is a typical phenomenon of quantum modular forms.

If we look at the asymptotics of C(x) and H(x) as x tends to any rational number $\alpha $ with odd numerator and denominator, then we find a similar behavior, which is not surprising since any such $\alpha $ is $\Gamma _\vartheta $-equivalent to 1 and both H and C have transformation properties, modulo functions with better smoothness properties, under the action of $\Gamma _\vartheta $. More precisely, if we write $\alpha =a/c$ with a and $c>0$ odd and coprime and complete $\left( \begin{matrix}a\\ c\end{matrix}\right) $ to a matrix $\left( {\begin{matrix}a&{}b\\ c&{}d\end{matrix}}\right) \in \text {SL}(2,{\mathbb {Z}})$, then we find asymptotic expansions

$$\begin{aligned} \begin{aligned} H\left( \frac{an+b}{cn+d}\right)&\sim \pm \,\frac{\log |cn+d|}{4c} + \sum _{i=0}^\infty \frac{h^\pm _{i,\alpha }}{(cn+d)^i}, \\ C\left( \frac{an+b}{cn+d}\right)&\sim C\left( \frac{a}{c}\right) \pm \frac{\log |cn+d|}{4(cn+d)} + \sum _{i=1}^\infty \frac{c^\pm _{i,\alpha }}{(cn+d)^i} \end{aligned} \end{aligned}$$

as $n\rightarrow \pm \infty $ with $n\in {\mathbb {Z}}$, and similar expansions with other coefficients $h^\pm _{i,\alpha ,\beta }$ and $c^\pm _{i,\alpha ,\beta }$ if $n\rightarrow \pm \infty $ with $n\in {\mathbb {Z}}+\beta $ for some fixed rational number $\beta $.

Similar statements hold for x tending to rational numbers $\alpha $ with an even numerator or denominator, but now without the log term, the simplest case being

$$\begin{aligned} C\left( \frac{1}{n}\right)\sim & {} \frac{\pi }{8n} + \chi (n)\,\frac{\pi ^2}{16n^2} + \chi (n-1)\,\frac{\pi ^3}{64n^3} + \cdots \\&+ \frac{\chi (n-k+1)}{4}\,\frac{A_k^2}{k!}\,\left( \frac{\pi }{2n}\right) ^{k+1} + \cdots \end{aligned}$$

where $A_k$ is the number of “up-down” permutations of $\{1,\dots ,k\}$ (permutations $\pi $ where $\pi (i)-\pi (i-1)$ has sign $(-1)^i$ for all i), which is also equal to the coefficient of $x^k/k!$ in $\tan x+\sec x$. (Note that $A_{k-1}=(4^k-2^k)|B_k|/k$ for k even, so that the coefficients here are closely related to those in (39).) The difference between the two cases is due to the fact that the action of $\Gamma _\vartheta $ on ${\mathbb {P}}^1({\mathbb {Q}})$ has two orbits (cusps), and comes from the relation that we will see in the final section between the quantum modularity properties of C and H and a specific Maass modular form on $\Gamma _\vartheta $ that is exponentially small at the cusp $\infty $ but has logarithmic growth at the cusp 1.

5 Generalization to other odd Dirichlet characters

In this section we discuss the changes that are needed when the character defined in (1) is replaced by an arbitrary odd primitive Dirichlet character. Suppose that $D<0$ is the discriminant of an imaginary quadratic field K (or equivalently, that D is either square-free and congruent to 1 mod 4 or else equal to 4 times a square-free number not congruent to 1 mod 4), and let $\chi =\chi _D=\left( \frac{D}{\cdot }\right) $ be the associated Dirichlet character, the case studied up to now corresponding to $K={\mathbb {Q}}(i)$ and $D=-4$. The Dirichlet L-series $L(s,\chi )$ is defined as $\sum _{n>0}\chi (n)n^{-s}$ for $\sigma >1$ and by analytic continuation otherwise, and is equal to the quotient of the Dedekind zeta function of K by the Riemann zeta function. Its value at $s=1$ is well known to be nonzero and given by

$$\begin{aligned} L(1,\chi ) = \frac{\pi }{\sqrt{|D|}}\,h'(D), \end{aligned}$$

where $h'(D)$ is 1 / 3 or 1 / 2 if $D=-3$ or $D=-4$ and otherwise is the class number of K.

We now define S(x) by the same formula (11) as before, with $\chi =\chi _D$. It is still periodic (with period |D|) and even and hence bounded, but now has average value $h'(D)$ rather than 1/2 and also no longer takes on only the values 0 and 1, as one sees in the pictures of the graphs of this function for $D=-3$ and $D=-7$ shown in Fig. 5.

We define $H(x)=H_D(x)$ and $C(x)=C_D(x)$ for $x\in {\mathbb {Q}}$ by the same formulas (8) and (10) as before, but we define $h_{m,n}$ and $c_{m,n}$ by (5) and (7) with a factor $|D|/\pi $ instead of $4/\pi $ in order to get integral linear combinations of cotangents. Fourier expansions (26) and (31) are then replaced by

$$\begin{aligned}&S(x) = h'(D) - \frac{\sqrt{|D|}}{\pi }\sum _{m=1}^\infty \frac{\chi (m)}{m}\,\cos \left( \frac{2\pi mx}{|D|}\right) , \end{aligned}$$

(40)

$$\begin{aligned}&H(x) = \frac{\pi \,h'(D)^2}{\sqrt{|D|}} - \frac{\sqrt{|D|}}{\pi }\, \sum _{n=1}^\infty \frac{\chi (n)\,d(n)}{n}\,\cos \left( \frac{2\pi nx}{|D|}\right) , \end{aligned}$$

(41)

and the formula corresponding to (14) now reads

$$\begin{aligned} h_{m,n}=\sum _{0<k<|D|n/2} \chi (k)\,S\left( \frac{km}{n}\right) \,\cot \left( \frac{\pi k}{|D|n}\right) \end{aligned}$$

(42)

by the same calculations as before. Finally we also define the matrices $\mathsf{C}_N$ and $\widehat{\mathsf{C}}_N$ exactly as we did in the special case $D=-4$ (except for replacing the lower right-hand entry in $\widehat{\mathsf{C}}_N$ in (3) by $|D|N/\pi $), and then from the scalar product calculations

$$\begin{aligned} \langle \xi ,\xi \rangle = 1,\qquad \langle \xi ,{\textit{G}}_a\rangle = L(1)\,a,\qquad \langle {\textit{G}}_a,{\textit{G}}_b\rangle = a\,C(a/b), \end{aligned}$$

which are proved exactly as before, we deduce the same connection as in Theorems 1 and 3 between the unboundedness of the function R(N) and the Riemann hypothesis for $L(s,\chi )$.

The only real difference with the case $D=-4$ is in the argument for quantum modularity. The function H(x) on ${\mathbb {Q}}$ is still even and periodic of period |D| (and also anti-periodic up to a constant with period |D| / 2 if D is even), it again has discontinuities at infinitely many rational points by an argument similar to the one given in Sect. 4.3, and by the analog of Proposition 8 the function $C:{\mathbb {Q}}\rightarrow {\mathbb {R}}$ again has a continuous extension to ${\mathbb {R}}$ and is therefore much better behaved analytically than H(x), as illustrated by the following graphs of these functions for $D=-3$ (Figs. 6, 7), which look qualitatively much like their $D=-4$ counterparts in Figs. 2 and 3.

The difference is that this longer suffices to prove the quantum modularity of H because the subgroup of $\text {SL}(2,{\mathbb {Z}})$ generated by the matrices $T^{|D|}$ (or $T^{|D|/2}$ if D is even) and S now no longer has finite index. Instead, we need a statement like the corollary to Theorem 2 with $\Gamma _\vartheta $ replaced by the congruence subgroup $\Gamma _{(D)}=\Gamma _0(D)\cup S\Gamma _0(D)$ of $\text {SL}(2,{\mathbb {Z}})$ (or by $\Gamma _{(D/2)}$ if D is even, which is indeed $\Gamma _\vartheta $ if $D=-4$). This statement is given in the following theorem.

Theorem 4

Let $D<0$ and $H(x)=\sum _{k>0}\chi _D(k)S_D(kx)/k$ be as above. Then the function

$$\begin{aligned} C_\gamma (x) := H(x) - \varepsilon (\gamma )\,|cx+d|\,H\left( \frac{ax+b}{cx+d}\right) \qquad (x\in {\mathbb {Q}}) \end{aligned}$$

(43)

extends continuously to ${\mathbb {R}}$ for all matrices $\gamma =\left( {\begin{matrix}a&{}b\\ c&{}d\end{matrix}}\right) \in \Gamma _{(D)}$, where $\varepsilon :\Gamma _{(D)}\rightarrow \{\pm 1\}$ is the homomorphism mapping $\Gamma (D)$ to 1 and S to $-1$.

This theorem is illustrated for $D=-3$ and a typical element of $\Gamma _{(-3)}$ in Fig. 8.

We will deduce Theorem 4 as a consequence of the following proposition, which is a generalization to arbitrary elements $\gamma \in \Gamma _{(D)}$ of Eq. (32) for $D=-4$ and $\gamma =S$.

Proposition 9

Let $\gamma =\begin{pmatrix} a&{}b\\ c&{}d\end{pmatrix}\in \Gamma _{(D)}$ with $c\ne 0$. Then

$$\begin{aligned} C_\gamma (x) = {\left\{ \begin{array}{ll} {\displaystyle H(-d/c) - |c|\,\varepsilon (\gamma )\,\sum _{k=1}^\infty \chi (k)\,\int _{k\,\gamma x}^\infty \frac{S(t)\,\mathrm{d}t}{(ct-ak)^2}} &{} \text { if }x<-d/c, \\ {\displaystyle H(-d/c) - |c|\,\varepsilon (\gamma )\,\sum _{k=1}^\infty \chi (k)\,\int _{-\infty }^{k\,\gamma x} \frac{S(t)\,\mathrm{d}t}{(ak-ct)^2}} &{} \text { if }x>-d/c. \end{array}\right. } \end{aligned}$$

(44)

To see that this implies the continuity, we note that we can rewrite the right-hand side by interchanging the summation and integration in the form [generalizing (25)]

$$\begin{aligned} C_\gamma (x) = H\left( -d/c\right) - |c|\,\varepsilon (\gamma )\,\int _{-\infty }^\infty \, S(t)\,\left( \sum _{k>0,\, k\gamma x\,\gtrless \,t} \frac{\chi (k)}{(ak-ct)^2}\right) \,dt \end{aligned}$$

(45)

for $x\gtrless -d/c$. The right-hand side of this formula is continuous because S(t) is piecewise continuous and bounded and has support in $|t|\ge 1$, while the second factor of the integrand is piecewise continuous and bounded by a constant times $t^{-2}$, as one sees easily by partial summation. Another argument is that (44) is equivalent to the equality of distributions

$$\begin{aligned} C'_\gamma (x)= & {} -\,\varepsilon (\gamma )\,|c|\,{\mathrm{sgn}}(x+d/c)\sum _{k=1}^\infty \chi (k)\,\frac{S(k\,\gamma x)}{k}\nonumber \\= & {} -\,\varepsilon (\gamma )\,c\;{\mathrm{sgn}}(cx+d)\,H\left( \frac{ax+b}{cx+d}\right) , \end{aligned}$$

(46)

[generalizing (27)], and we already know that the distribution H is locally the derivative of a continuous function.

For the proof of (44) we will restrict to the case when the matrix $\gamma $ belongs to $\Gamma (D)$, which is sufficient for the proof of Theorem 4 because $\gamma \mapsto C_\gamma $ is a cocycle and we already know the continuity of $C_\gamma $ ($=C$) for the special case $\gamma =S$ representing the nontrivial coset of $\Gamma (D)$ in $\Gamma _{(D)}$. To avoid distracting case distinctions, we consider only the case $x+d/c<0$ and $\gamma x>0$. The argument is the same in principle in all other cases, but we are actually working with ordered tuples of points on the 1-manifold ${\mathbb {P}}^1({\mathbb {R}})$, on which the group $\Gamma _{(D)}$ acts in an orientation-preserving way, and it is notationally simpler to fix the positions of the points occurring with respect to the point at infinity, so that we can work on ${\mathbb {R}}$ instead. We may also assume (by replacing $\gamma $ by $-\gamma $ if necessary, but then remembering that $\gamma $ may be congruent to $-\mathbf 1 _2$ rather than $\mathbf 1 _2$ modulo D) that $c>0$. Finally, to be able to work with absolutely convergent sums and to reorder the terms freely, we will replace H(x) by the function $H_s(x)$ ($s>1$) introduced in Sect. 4.2 and $C_\gamma $ by the corresponding cocycle

$$\begin{aligned} C_{\gamma ,s}(x) = H_s(x) - \varepsilon (\gamma )\,|cx+d|^s\,H_s\left( \frac{ax+b}{cx+d}\right) , \end{aligned}$$

(47)

and only set $s=1$ at the end. The equation we have to prove then becomes

$$\begin{aligned} C_{\gamma ,s}(x) = H_s(-d/c) - c\,s\,\sum _{k=1}^\infty \chi (k)\,\int _{k\,\gamma x}^\infty \frac{S(t)\,\mathrm{d}t}{(ct-ak)^{s+1}}. \end{aligned}$$

(48)

We denote the last term (without the minus sign) by A. Since the variable t in the integral is always positive (because we are considering the case $\gamma x>0$), we can replace S(t) by its definition (11) and interchange the summation and integration to getf

In the second sum we replace the vector $\begin{pmatrix} j\\ k \end{pmatrix}$ by the vector $-\gamma \begin{pmatrix} j\\ k \end{pmatrix}=\begin{pmatrix} -aj-bk\\ -cj-dk \end{pmatrix}$. This does not change the product $\chi (j)\chi (k)$ because $\gamma $ is congruent to plus or minus the identity modulo the period |D| of $\chi $, changes the expression $cj-ak$ in the denominator to k, and changes the inequalities $k>0,\,j\ge k\gamma x$ to $kx\le j<-kd/c$ (which imply $k>0$). Hence

$$\begin{aligned} \begin{aligned} A&= |cx+d|^s\,\sum _{k=1}^\infty \frac{\chi (k)}{k^s}\,S(k\gamma x) + \sum _{k=1}^\infty \frac{\chi (k)}{k^s}\,\left( S(-kd/c) - S(kx)\right) \\&= H_s(-d/c) - C_{\gamma ,s}(x)\, \end{aligned} \end{aligned}$$

completing the proof of Eq. (48) and hence of the proposition and theorem.

6 Modular forms are everywhere

As one can already see from the examples in the original article [16] where this notion was introduced, quantum modular forms are sometimes related to actual modular forms. These modular forms may be either holomorphic or Maass forms, with the quantum modular form in the latter case being related to the “periods” of Maass forms in the sense developed in [7, 13]. The quantum modular form H(x) that we have been studying in this paper turns out to be of this latter type, with the associated Maass form being an Eisenstein series with eigenvalue 1/4 for the hyperbolic Laplace operator. In this final section we explain how this works, first for the case $D=-4$ studied in the first four sections of this paper. In that case the relevant modular group $\Gamma _\vartheta $ is generated by a translation and an inversion, so that for both the period theory of the Maass form u and the quantum modularity of H one needs only the functional equation of the associated L-series. Then at the end we indicate how the quantum modularity of H for general D follows from the full theory of periods of Maass forms.

We start with the case $D=-4$, so that $\chi $ is the character given by (1). The relevant modular form here is the Maass Eisenstein series

$$\begin{aligned} u(z) = y^{1/2}\sum _{n=1}^\infty \chi (n)\,d(n)\, K_0\left( \frac{\pi ny}{2}\right) \,\sin \left( \frac{\pi nx}{2}\right) \qquad (z=x+iy\in {\mathfrak {H}}), \end{aligned}$$

(49)

where ${\mathfrak {H}}$ denotes the upper half-plane and $K_0$ the usual K-Bessel function of order 0. This is an eigenfunction with eigenvalue 1/4 with respect to the hyperbolic Laplace operator $\Delta =-y^2\,\left( \frac{\partial ^2}{\partial x^2}+\frac{\partial ^2}{\partial y^2}\right) $ and is a modular function with character $\varepsilon $ for the group $\Gamma _\vartheta $ defined in Sect. 1, meaning that $u(\gamma z)=\varepsilon (\gamma )u(z)$ for all $\gamma \in \Gamma _\vartheta $ or, more explicitly, that

$$\begin{aligned} u(z+2) = -\,u(z) = u(-1/z) \qquad (z\in {\mathfrak {H}}). \end{aligned}$$

(50)

To see this, we observe that u(z) is proportional to $E\left( \frac{z+1}{4},\frac{1}{2}\right) -E\left( \frac{z-1}{4},\frac{1}{2}\right) $, where E(z, s) is the usual nonholomorphic Eisenstein series of weight 0 and eigenvalue $s(1-s)$ for the Laplace operator with respect to the full modular group $\Gamma _1=\text {SL}(2,{\mathbb {Z}})$. (This follows easily from the well-known Fourier expansion of $E(z,\frac{1}{2})$ as a linear combination of the three functions $\sqrt{y}$, $\sqrt{y}\log y$ and $\sqrt{y}\sum _{n\ne 0} d(n)K_0(2\pi |n|y)e^{2\pi inx}$.) Transformation equations (50) then follow from the $\text {SL}(2,{\mathbb {Z}})$-invariance of $E(z,\frac{1}{2})$, the first one trivially since $E(z+1,s)=E(z,s)$ and the second by using the invariance of E(z, s) under $\left( {\begin{matrix}1&{}0\\ \mp 4&{}1\end{matrix}}\right) $ to get

$$\begin{aligned} E\left( \frac{-1/z\pm 1}{4},\,\frac{1}{2}\right) = E\left( \frac{(-1/z\pm 1)/4}{\mp 4((-1/z\pm 1)/4)+1},\,\frac{1}{2}\right) = E\left( \frac{z\mp 1}{4},\,\frac{1}{2}\right) . \end{aligned}$$

We now associate to u(z) the periodic holomorphic function f on ${\mathbb {C}}/{\mathbb {R}}={\mathfrak {H}}^+\cup {\mathfrak {H}}^-$ (where ${\mathfrak {H}}^\pm =\{z\in {\mathbb {C}}\mid \pm \mathfrak {I}(z)>0\}$) having the same Fourier coefficients as u, i.e.,

$$\begin{aligned} f(z) = \sum _{n=1}^\infty \chi (n)\,d(n)\,q^{\pm n/4} \qquad \left( z\in {\mathfrak {H}}^\pm ,\; q=e^{2\pi iz}\right) . \end{aligned}$$

(51)

Proposition 10

The period function $\psi (z)$ defined by

$$\begin{aligned} \psi (z)= f(z) + \frac{1}{z}\,f\left( -\frac{1}{z}\right) \qquad (z\in {\mathbb {C}}/{\mathbb {R}}) \end{aligned}$$

(52)

extends holomorphically from ${\mathbb {C}}/{\mathbb {R}}$ to ${\mathbb {C}}'={\mathbb {C}}/(-\infty ,0]\,$.

Proof

We follow the proof of the corresponding result given in Chapter 1 of [13] for Maass cusp forms on the full modular group $\Gamma _1$. (See Theorem on p. 202 of [13], which also gives a converse statement characterizing cusp forms in terms of holomorphic functions $\psi $ on ${\mathbb {C}}'$ satisfying a certain functional equation.) That proof required only the functional equation of the L-series associated with the Maass cusp form, which worked because the group $\Gamma _1$ is generated by the translation T and the inversion S, and can be applied here because $\Gamma _\vartheta =\langle S,T^2\rangle $ has a similar structure. Our situation is also a little different because our function u(z) is an Eisenstein series rather than a cusp form, but since Fourier expansion (49) of u(z) at infinity has no constant term this has no effect on the proof.

The argument in [13] was first to write the L-series of the Maass form u(z) as the Mellin transform of the restriction of u (or of its normal derivative in the case of an odd cusp form) to the imaginary axis, multiplied by a suitable gamma factor, and to deduce from this relationship and the S-invariance of u a functional equation for the L-series. One then observed that the Mellin transforms of the restrictions to the positive or negative imaginary axis of both the associated periodic holomorphic function f and the associated period function $\psi $ were also equal, up to different gamma factors, to the same L-series, and the functional equation of this L-series combined with Mellin inversion then led to a formula for $\psi $ that applied in all ${\mathbb {C}}'$ rather than just on ${\mathbb {C}}/{\mathbb {R}}$. Here the first step, which would require the normal derivative since the restriction of function (49) to $i{\mathbb {R}}_+$ vanishes, can be skipped since the L-series $L(u;s)=\sum \chi (n)d(n)n^{-s}$ is simply the square of Dirichlet L-series (1) and hence has a known functional equation. The Mellin transforms of the restrictions to f and $\psi $ to the positive or negative imaginary axis are then given by

$$\begin{aligned} \widetilde{f}_\pm (s):= & {} \int _0^\infty f(\pm iy)\,y^{s-1}\,\mathrm{d}y = \frac{\Gamma (s)}{(\pi /2)^s}\,L(s)^2\qquad (\mathfrak {R}(s)>0),\\ \widetilde{\psi }_\pm (s)= & {} \int _0^\infty \psi (\pm iy)\,y^{s-1}\,\mathrm{d}y = \int _0^\infty \left( f(\pm iy) \,\mp \, \frac{i}{y}\,f(\pm i/y)\right) \,y^{s-1}\,\mathrm{d}y \\= & {} \widetilde{f}_\pm (s) \,\mp \,i\,\widetilde{f}_\pm (1-s) = \frac{\Gamma (s)\,L(s)^2}{(\pi /2)^s\,\cos (\pi s/2)}\,e^{\mp i\pi s} \qquad (0<\mathfrak {R}(s)<1), \end{aligned}$$

where in the last line we have used the functional equation of L(s) and standard identities for gamma functions. By the Mellin inversion formula we deduce that

$$\begin{aligned} \psi (\pm iy)=\frac{1}{2\pi i}\,\int _{\mathfrak {R}(s)=c} \frac{\Gamma (s)\,L(s)^2}{(\pi /2)^s\,\cos (\pi s/2)}\,(\pm iy)^{-s}\,\mathrm{d}s \qquad (y>0) \end{aligned}$$

for any $c\in (0,1)$, and analytic continuation from $i{\mathbb {R}}/\{0\}$ to ${\mathbb {C}}/{\mathbb {R}}$ then gives the formula

$$\begin{aligned} \psi (z) = \frac{1}{2\pi i}\,\int _{\mathfrak {R}(s)=c} \left[ \frac{\Gamma (s)\,L(s)^2}{(\pi /2)^s\,\cos (\pi s/2)}\right] \,z^{-s}\,\mathrm{d}s \end{aligned}$$

for all $z\in {\mathbb {C}}/{\mathbb {R}}\,$. But the factor in square brackets is bounded by a power of s times $e^{-\pi |s|}$ for $|s|\rightarrow \infty $ on the vertical line $c+i{\mathbb {R}}$, so the integral on the right-hand side of this equation is absolutely convergent for all $z\in {\mathbb {C}}$ with $|\arg (z)|<\pi $, i.e., for all $z\in {\mathbb {C}}'\,$. $\square $

We now show how to connect the functions f(z) and $\psi (z)$ to the functions H(x) and C(x), respectively, and to deduce the continuity of C—and hence the quantum modularity of H—from Proposition 10 for the period function $\psi $. The first step is easy, since from Fourier expansions (51) and (26) and standard integral formulae we find the relation

$$\begin{aligned} f(z) \doteq \int _{-\infty }^\infty \frac{H(t)}{(t-z)^2}\,\mathrm{d}t \qquad (z\in {\mathbb {C}}/{\mathbb {R}}) \end{aligned}$$

between the periodic function f(z) and the periodic distribution H(x). (Here and in what follows the symbol $\,\doteq \,$ denotes equality up to easily computed scalar factors whose values are irrelevant for the argument.) Replacing t by 1 / t in the integral, we obtain

$$\begin{aligned} f(z) \doteq \int _{-\infty }^\infty \frac{C'(t)}{(1-tz)^2}\,\mathrm{d}t \doteq z\,\int _{-\infty }^\infty \frac{C(t)}{(1-tz)^3}\,\mathrm{d}t , \end{aligned}$$

(53)

where we have used (27) to get the first equality and integration by parts for the second. But now replacing t by 1 / t and using the functional equation $C(t)=|t|C(1/t)$ we find

$$\begin{aligned} \frac{1}{z}\,f\left( -\frac{1}{z}\right) = \frac{1}{z}\,f\left( \frac{1}{z}\right) \doteq -\,z\,\int _{-\infty }^\infty \frac{{\mathrm{sgn}}(t)\,C(t)}{(1-tz)^3}\,\mathrm{d}t \end{aligned}$$

(54)

with the same proportionality constant as in (53), and adding these two equations gives the relation

$$\begin{aligned} \psi (z) \doteq z\,\int _{-\infty }^0 \frac{C(t)}{(1-tz)^3}\,\mathrm{d}t \doteq z\,\int _{-\infty }^0 \frac{C(t)}{(t-z)^3}\,\mathrm{d}t \qquad (z\in {\mathbb {C}}/{\mathbb {R}}) \end{aligned}$$

(55)

between the period function of the Maass waveform u and the function C(x). This establishes the desired connection between the continuity of $C(x)=H(x)+|x|H(1/x)$, which expresses the quantum modularity of H, and Proposition 10, which expresses the modularity of u: in one direction, if we know that C(x) is continuous (and bounded by $\,\min (1,|x|)$), then (55) immediately gives the analytic continuation of $\psi (z)$ from ${\mathbb {C}}/{\mathbb {R}}$ to ${\mathbb {C}}'$, and conversely, the inversion formula for the Stieltjes transform given in [14] lets us invert (55) to get

$$\begin{aligned} C(t) \doteq t\, \int _{{\mathcal {C}}} \, \psi (tz) \frac{1 + z}{z}\,\mathrm{d}z, \end{aligned}$$

(56)

where ${{\mathcal {C}}}$ is any contour with endpoints at $z = -1$ which encloses the origin, so that the continuity of C is a direct consequence of the holomorphy of $\psi $ in ${\mathbb {C}}'$.

We observe that the entire discussion given here applies in an essentially unchanged form to the more general functions $C_s$ and $H_s$ discussed in Sect. 4.2, with Maass form (49) replaced by the form $E\left( \frac{z+1}{4},\frac{s}{2}\right) -E\left( \frac{z-1}{4},\frac{s}{2}\right) $ with spectral parameter s / 2 instead of 1 / 2.

Finally, we consider the case when the character $\chi $ in (1) is replaced by an arbitrary primitive odd Dirichlet character $\chi _D$. Then all of the above calculations still go through: the function u is defined by Eq. (49) with $\chi $ replaced by $\chi _D$ and $\pi /2$ by $2\pi /|D|$, which is again a Maass form (with spectral parameter $\frac{1}{2}$ and character $\varepsilon $ on the group $\Gamma _{(D)}$) because it is proportional to $\sum _{r\pmod D}\chi (r)E\left( \frac{z+r}{|D|},\frac{1}{2}\right) $; the associated periodic function f and period function $\psi $ are defined by (51) (with the new $\chi $ and with $q^{n/4}$ replaced by $q^{n/|D|}$) and (52) (with no change at all); and Proposition 10 remains true with the same proof. The difference, however, is that this proposition is no longer equivalent to the modularity of u, but only to its invariance (up to sign) under the transformations S and $T^D$ (or $T^{D/2}$ if D is even), which in general generate a subgroup of $\Gamma _{(D)}$ of infinite order, as already discussed in Sect. 5. To get the full modularity (of this u or any other potential Maass form on $\Gamma _{(D)}$), we need to generalize Proposition 10 to the statement that for any matrix $\gamma =\left( {\begin{matrix}a&{}b\\ c&{}d\end{matrix}}\right) \in \Gamma _{(D)}$ the function

$$\begin{aligned} \psi _\gamma (z)= f(z) - \frac{\varepsilon (\gamma )}{cz+d}\,f\left( \frac{az+b}{cz+d}\right) \end{aligned}$$

extends holomorphically from ${\mathbb {C}}/{\mathbb {R}}$ to ${\mathbb {C}}/(-\infty ,-d/c]\,$ if $c>0$ or ${\mathbb {C}}/[-d/c,\infty )$ if $c<0$. This statement can be proved in several ways and can be linked by a discussion similar to the one above to the continuity property of $C_\gamma (x)$ stated in Theorem 4. But the simplest approach is to relate the functions $C_\gamma (x)$ directly to the invariant distribution associated with u in the sense developed in [7, 12, 13]. Specifically, this theory says that an eigenfunction u of $\Delta $ with spectral parameter s and Fourier expansion $u(x+iy)=\sqrt{y}\,\sum _{n\ne 0}A_n\,K_{s-1/2}(\lambda |n|y)e^{i\lambda nx}$ is invariant (possibly with character) under the action of a Fuchsian group $\Gamma $ if and only if the associated distribution $U(x)=\sum _{n\ne 0}|n|^{s-1/2}\,A_n\,e^{i\lambda nx}$ on ${\mathbb {P}}^1({\mathbb {R}})$ is invariant (with the same character) with respect to the group action $(U|\gamma )(x)=|cx+d|^{-2s}U(\gamma x)$ for $\gamma =\left( {\begin{matrix}a&{}b\\ c&{}d\end{matrix}}\right) \in \Gamma $. (Here the word “distribution” must be interpreted correctly, namely, as a functional on the space of test functions $\phi (x)$ that are smooth on ${\mathbb {R}}$ and for which $|x|^{-2s}\phi (1/x)$ is smooth near $x=0$.) In our case the distribution U associated with u is given by $U(x)=\sum _{n=1}^\infty \chi (n)d(n)\sin (2\pi nx/|D|)$ and is related to distribution (41) by $U(x)\doteq H'(x)$. But that means that if we differentiate definition (43) of $C_\gamma (x)$ to get

$$\begin{aligned} C_\gamma '(x) = H'(x) - \frac{\varepsilon (\gamma )}{|cx+d|}\,H'(\gamma x) - \varepsilon (\gamma )\,|c|\,{\mathrm{sgn}}(x+d/c)\,H\gamma x), \end{aligned}$$

then the first two terms on the right cancel and we recover Eq. (46).

Acknowledgements

Open access funding provided by Max Planck Society.

References

Balazard, M., Saias, E.: The Nyman–Beurling equivalent form for the Riemann hypothesis. Expo. Math. 18, 131–138 (2000)
MathSciNet MATH Google Scholar
Bettin, S., Conrey, J.B.: A reciprocity formula for a cotangent sum. Int. Math. Res. Not. IMRN 24, 5709–5726 (2013)
Article MathSciNet Google Scholar
Bettin, S., Conrey, J.B.: Period functions and cotangent sums. Algebra Number Theory 7, 215–242 (2013)
Article MathSciNet Google Scholar
Beurling, A.: On two problems concerning linear transformations in Hilbert space. Acta Math. 81, 239–255 (1949)
Article MathSciNet Google Scholar
Beurling, A.: A closure problem related to the Riemann zeta-function. Proc. PNAS 41, 312–314 (1955)
Article MathSciNet Google Scholar
Bruggeman, R., Lewis, J.: Function theory related to the group $PSL(2,{\mathbb{R}})$. In: Farkas, H.M., Gunning, R.C., Knopp, M.I., Taylor, B.A. (eds.) From Fourier Analysis and Number Theory to Radon Transforms and Geometry. In Memory of Leon Ehrenpreis. Developments in Mathematics, vol. 28, pp. 107–201. Springer, Berlin (2013)
Google Scholar
Bruggeman, R., Lewis, J., Zagier, D.: Period functions for Maass wave forms and cohomology. Mem. Am. Math. Soc. 237, xii+132 (2015)
MathSciNet MATH Google Scholar
Garnett, J.B.: Bounded Analytic Functions. Springer, Berlin (2007)
MATH Google Scholar
Grünberg, D., Moree, P.: Sequences of enumerative geometry: congruences and asymptotics, with an appendix by D. Zagier. Exp. Math. 17, 409–426 (2008)
Article Google Scholar
Halmos, P., Sunder, V.: Bounded Integral Operators on $L^2$-Spaces. Springer, Berlin (1978)
Book Google Scholar
Hoffman, K.: Banach Spaces of Analytic Functions. Prentice Hall, Upper Saddle River (1962)
MATH Google Scholar
Lewis, J.: Eigenfunctions on symmetric spaces with distribution-valued boundary forms. J. Func. Anal. 29, 287–307 (1978)
Article MathSciNet Google Scholar
Lewis, J., Zagier, D.: Period functions for Maass wave forms I. Ann. Math. 153, 191–258 (2001)
Article MathSciNet Google Scholar
Schwarz, J.H.: The generalized Stieltjes transform and its inverse. J. Math. Phys. 46, 013501 (2005)
Article MathSciNet Google Scholar
Zagier, D.: Eisenstein series and the Riemann zeta function. In: Automorphic Forms, Representation Theory and Arithmetic, pp. 275–301. Springer-Verlag, Berlin, Heidelberg, New York (1981)
Zagier, D.: Quantum modular forms. In: Étienne, B., David, E., Masoud, K., Matilde, M., Henri, M., Sorin, P. (eds.) Quanta of Maths: Conference in Honor of Alain Connes, Clay Mathematics Proceedings, vol. 11, pp. 659–675. AMS, Providence and Clay Mathematics Institute, Peterborough (2010)

Download references

Author information

Authors and Affiliations

Massachusetts Institute of Technology, Cambridge, MA, USA
John Lewis
Max Planck Institute for Mathematics, Bonn, Germany
Don Zagier

Authors

John Lewis
View author publications
You can also search for this author in PubMed Google Scholar
Don Zagier
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Don Zagier.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Lewis, J., Zagier, D. Cotangent sums, quantum modular forms, and the generalized Riemann hypothesis. Res Math Sci 6, 4 (2019). https://doi.org/10.1007/s40687-018-0159-8

Download citation

Received: 19 May 2018
Accepted: 30 August 2018
Published: 15 November 2018
DOI: https://doi.org/10.1007/s40687-018-0159-8

Cotangent sums, quantum modular forms, and the generalized Riemann hypothesis

Abstract

Similar content being viewed by others

On a Category of Cotangent Sums Related to the Nyman-Beurling Criterion for the Riemann Hypothesis

Second-Order Modular Forms with Characters

Rational functions, cotangent sums and Eichler integrals

1 Introduction and main results

Theorem 1

Theorem 2

Corollary

2 Properties of the numbers \(c_{m,n}\) and \(h_{m,n}\)

Proposition 1

Proof

Proposition 2

Corollary

Proof

Proposition 3

Proof

Remark

3 The connection with the generalized Riemann hypothesis

Theorem 3

3.1 The square-wave function \({\textit{G}}\) and its multiplicative shifts

Proposition 4

Proposition 5

Proposition 6

Proposition 7

3.2 Proof of Propositions 4, 5 and 6

3.3 Proof of Proposition 7

Lemma

Proof

4 Analytic properties of C(x) and H(x)

4.1 Sum and integral representations of C(x) and H(x)

Proposition 8

Proof

4.2 The function H on the real line

Definition 1

Definition 2

Definition 3

Definition 4

Definition 5

Question

4.3 Asymptotic behavior of H(x) and C(x) near rational points

5 Generalization to other odd Dirichlet characters

Theorem 4

Proposition 9

6 Modular forms are everywhere

Proposition 10

Proof

Acknowledgements

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation