Abstract
We investigate the norm of a degree 2 Siegel modular form of asymptotically large weight whose argument is restricted to the 3-dimensional subspace of its imaginary part. On average over Saito–Kurokawa lifts an asymptotic formula is established that is consistent with the mass equidistribution conjecture on the Siegel upper half space as well as the Lindelöf hypothesis for the corresponding Koecher–Maaß series. The ingredients include a new relative trace formula for pairs of Heegner periods.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
1.1 Restriction norm of eigenfunctions
The question, ‘to what extent can the mass of a Laplace eigenfunction \(\phi \) on a Riemannian manifold X localize?’, is a classical problem in analysis and is often quantified by upper (or lower) bounds for \(L^p\)-bounds for the restriction of \(\phi \) to suitable submanifolds \(Y \subseteq X\). The prototypical example is the case where X is a surface and Y is a curve, often a geodesic; see e.g. [13, 15, 18, 30, 62, 67, 68] and references therein.
If X is the quotient of a symmetric space by an arithmetic lattice (often called an arithmetic manifold), an additional layer of number theoretic structure enters. Not only can this be used to obtain stronger bounds [61], but sometimes the period integrals can be expressed in terms of special values of L-functions. A typical such case is the \(L^2\)-restriction of a Maaß form for the group \(\mathrm{SL}_{n+1}(\mathbb {Z})\) to its upper left n-by-n block, which can be expressed as an average of central values of \(\mathrm{GL}(n) \times \mathrm{GL}(n+1)\) Rankin–Selberg L-functions [56, 57]. Other potential cases arise from the Gross-Prasad conjecture [38], see [69] for an illuminating discussion. Often in this context, optimal restriction norm bounds are equivalent to the Lindelöf conjecture on average over the spectral family of L-functions in question.
In this paper we consider a new and somewhat different instance of the interplay between mass equidistribution and L-functions in the context of certain Siegel modular forms F for the symplectic group \(\mathrm{Sp}_4(\mathbb {Z})\): we investigate the \(L^2\)-restriction of a Saito–Kurokawa lift F(Z) on the 6-dimensional Siegel upper half space \(\mathbb {H}^{(2)}\) to the 3-dimensional subspace where the argument \(Z = X+iY \in \mathbb {H}^{(2)}\) is restricted to its imaginary part. This is geometrically a very natural set-up, it is a direct higher-dimensional analogue of the classical problem of bounding a cusp form f for \(\mathrm{SL}_2(\mathbb {Z})\) on the vertical geodesic, mentioned at the beginning; cf. [11, Sect. 7]. While the latter leads, via Hecke’s integral representation, directly to the corresponding L-function L(f, s), things become much more involved for Siegel modular forms.
We start by stating the corresponding period formula. For an even positive integer k let \(S_k^{(2)}\) denote the space of Siegel modular forms of degree 2 of weight k for the group \(\mathrm{Sp}_4(\mathbb {Z})\), equipped with the standard Petersson inner product; see Sect. 2. We think of k as tending to infinity and are interested in asymptotic results with respect to k. We restrict the argument of a cusp form \(F\in S_k^{(2)}\) to its imaginary part iY with \(Y \in \mathcal {P}(\mathbb {R})\) where \(\mathcal {P}(\mathbb {R})\), equipped with the measure \(dY/(\det Y)^{3/2}\), is the set of positive definite symmetric 2-by-2 matrices. Consider the restriction norm
where \(\mathrm{SL}_2(\mathbb {Z})\) acts on \(\mathcal {P}(\mathbb {R})\) by \(\gamma \mapsto \gamma ^{\top } Y \gamma \). Letting \(\mathbb {H}\) denote the usual upper half plane, we observe that \(\mathrm{SL}_2(\mathbb {Z}) \backslash \mathcal {P}(\mathbb {R}) \cong \mathrm{SL}_2(\mathbb {Z}) \backslash \mathbb {H} \times \mathbb {R}_{>0}\) has infinite measure; see (6.1) below. The factor
accounts for the fact that, in accordance with the literature, we choose the standard measures on \(\mathrm{Sp}_4(\mathbb {Z}) \backslash \mathbb {H}^{(2)}\) and \(\mathrm{SL}_2(\mathbb {Z}) \backslash \mathbb {H}\) which are not probability measures.
Let \(\Lambda \) denote a set of spectral components of \(L^2(\mathrm{SL}_2(\mathbb {Z})\backslash \mathbb {H})\) consisting of the constant function \(\sqrt{3/\pi }\), an orthonormal basis of Hecke–Maaß cusp forms and the Eisenstein series \(E(., 1/2 + it)\) for \(t \in \mathbb {R}\). The set \(\Lambda \) is equipped with the counting measure on its discrete part and with the measure \(dt/4\pi \) on its continuous part. We denote by \(\int _{\Lambda }\) the corresponding combined sum/integral. For \(F \in S_k^{(2)}\) and \(\mathtt{u} \in \Lambda \) let \(L(F \times \mathtt{u}, s)\) denote the Koecher–Maaß series defined in (6.3). This series has a functional equation featuring the gamma factors \(G(F \times \mathtt{u}, s)\) as defined in (6.4), but has no Euler product. The following proposition is proved in Sect. 6.
Proposition 1
For \(F \in S_k^{(2)}\) with even k we have
where \(\Lambda _{\text { ev}}\) denotes the set of all even \(\mathtt{u} \in \Lambda \).
An arithmetically interesting subfamily of Siegel modular forms are the Saito–Kurokawa lifts \(F_h\) (sometimes called the Maaß Spezialschar) of half-integral weight modular forms \(h \in S^+_{k-1/2}(4)\) in Kohnen’s plus-space or equivalently their Shimura lifts \(f_h \in S_{2k-2}\) (see Sect. 2 for details). In this case, the Koecher–Maaß series \(L(F_h \times \mathtt{u}, s)\) roughly becomes a Rankin–Selberg L-function of two half-integral weight cusp forms, namely of h and the weight 1/2 automorphic form whose Shimura lift equals \(\mathtt{u}\); see Proposition 16 below and cf. [24]. Of course, this series also has no Euler product. The convexity bound for L-functions along with trivial bounds implies
whilst the statement
would follow from the Lindelöf hypothesis for these L-functions. It should be noted, however, that in absence of an Euler product it is not expected that these L-functions satisfy the Riemann hypothesis, but one may still hope that the Lindelöf hypothesis is true; see [49] for some support of this conjecture. However, even if it is, then proving (1.2) appears to be far out of reach by current technology—it corresponds to an average of size \(k^{3/2}\) of a family of L-functions of conductor \(k^8\). (This is analogous to the genus 1 situation in which the \(L^2\)-restriction norm of a holomorphic cusp form of weight k leads to an average of size \(k^{1/2}\) of a family of L-functions of conductor \(k^4\); see [11, (1.12)].) These problems belong to the hard cases where sharp bounds for the \(L^2\)-restriction norm imply very strong subconvexity bounds.
1.2 The main result and mass equidistribution in higher rank
Fix a smooth, non-negative test function W with non-empty support in [1, 2]. Let \(\omega := \int _1^2 W(x) x\, dx\) and consider
for a large parameter K and a Hecke eigenbasis \(B_{k-1/2}^+(4)\) of \(S_{k-1/2}^+(4)\). Note that \(\dim S_{k-1/2}^+(4) \sim k/6\). The first main result of this paper is the following asymptotic formula.
Theorem 2
We have \(\mathcal {N}_{\text {{ av}}}(K) = 4\log K + O(1)\) as \(K \rightarrow \infty \).
This may be interpreted as an asymptotic average version of the Lindelöf hypothesis for twisted Koecher–Maaß series. This restriction problem, however, is structurally quite different from all previously considered restriction problems with connections to L-functions: (a) the period formula features L-functions that are not in the Selberg class and (b) Theorem 2 reveals that the restriction norm does generally not remain bounded.
There is a strong connection between Theorem 2 and the mass equidistribution conjecture that we now explain. Let g be a test function on \(\mathrm{Sp}_4(\mathbb {Z}) \backslash \mathbb {H}^{(2)}\). Then the (arithmetic) mass equidistribution conjecture for the Siegel upper half space states that
as F traverses a sequence of Hecke–Siegel cusp forms of growing weight. While the corresponding statement for classical cusp forms of degree 1 was proved by Holowinsky and Soundararajan [36], no such statement has been obtained for Siegel modular forms of higher degree (but see [70] for certain cases of the quantum unique ergodicity conjecture in higher rank and [19] for an averaged version). Nevertheless, one may even go one step further and conjecture that the above limit holds when one restricts the full space \(\mathrm{Sp}_4(\mathbb {Z}) \backslash \mathbb {H}^{(2)}\) to a lower-dimensional submanifold. In particular, one might conjecture that
holds. As the right hand side has infinite measure, we cannot simply replace g with the constant function. This is precisely the reason why \(\mathcal {N}_{\text {av}}(K)\) is unbounded as \(K \rightarrow \infty \). However, since F is a cusp form, the \(L^2\)-normalized and \(\mathrm{Sp}_4(\mathbb {Z})\)-invariant function \(|F(iY)|^2 (\det Y)^{k}/\Vert F \Vert ^2\) decays exponentially quickly if Y is (in a precise sense) very large or very small. So effectively g may be restricted to the characteristic function of a compact set depending on k. We quantify this in Appendix C and show that, for such g, the right hand side equals
In this case the previous asymptotic reads
as \(k \rightarrow \infty \). The asymptotic (1.4) is, of course, highly conjectural, and as mentioned above even the ordinary higher rank mass equidistribution conjecture (without restricting to a thin subset) is currently out of reach. As an aside we remark that one has to be careful with such conjectures. For instance, the \(L^2\)-norm restriction of even Maaß forms for \(\mathrm{SL}_2(\mathbb {Z})\) to the vertical geodesic is twice as big as one might think [74].
Theorem 2 provides an unconditional proof of (1.4) on average over Saito–Kurokawa lifts in agreement with the mass equidistribution conjecture for the rank two Siegel upper half space.
In particular, the constant 4 in Theorem 2 is very relevant, and this constant has a story of its own. It is the outcome of several archimedean integrals, numerical values in period formulae and a gigantic Euler product whose special value can be expressed in terms of zeta values (cf. (12.8)). In deducing its value we have corrected several numerical constants in the literature. We shall come back to this point in due course; see for instance the remark after Lemma 8. The authors would like to thank Gergely Harcos for useful and clarifying discussions in this respect.
Theorem 2 opens the door for several other related problems. The reader may wonder as to how the norms of the other Siegel cusp forms behave; that is, those forms which are not obtain via global parabolic induction (non-CAP), unlike the Saito–Kurukawa lifts. Any reasonable spectral average would include at least the space of Siegel modular forms \(S_k^{(2)}\) of weight k which is of dimension \(\sim ck^{3}\) for some constant c (in fact \(c = 1/8640\)). This leads to a bigger average than the one presently considered over about \(k^2\) Saito–Kurokawa lifts. The starting point for the \(L^2\)-restriction norm of all Siegel modular forms is again the period formula in Proposition 1. Coupled with an approximate functional equation (as in Lemma 17), this is amenable to the Kitaoka–Petersson formula [50] and an analysis along the lines of [9]. We hope to return to this interesting problem soon.
A different symplectic restriction problem was treated in [59] and [11], where the argument \(Z \in \mathbb {H}^{(2)}\) of Saito–Kurokawa lifts was restricted to the diagonal, a four-dimensional subspace of \(\mathbb {H}^{(2)}\). The corresponding analogue of Proposition 1, due to Ichino [37], leads to an average of size k of Langlands L-functions of conductor \(k^4\).
Whilst the proof of Theorem 2 rests on many ingredients which we address in detail in the coming sections, we would like to highlight a few results which may be of stand alone interest. We describe these in the remainder of the introduction.
1.3 A relative trace formula for pairs of Heegner periods
Here we focus on a novel trace formula of independent interest beyond its application in proving Theorem 2.
Let D be a discriminant, i.e. a non-square integer \(\equiv 0, 1\) (mod 4). For a discriminant \(D < 0\) let \(H_D \subseteq \mathrm{SL}_2(\mathbb {Z}) \backslash \mathbb {H}\) denote the set of all Heegner points; that is, the set of all \(z = (\sqrt{|D|}i - B)/(2A)\) where \(AX^2 + BXY + CY^2\) is a \(\Gamma \)-equivalence class of integral quadratic forms of discriminant \(D = B^2 - 4AC\). For a function \(f : \mathrm{SL}_2(\mathbb {Z}) \backslash \mathbb {H} \rightarrow \mathbb {C}\) define the period (“trace”)
where \(\epsilon (z) \in \{1, 2, 3\}\) is the order of the stabilizer of z in \(\mathrm{PSL}_2(\mathbb {Z})\). Its counterparts for positive discriminants D are periods over geodesic cycles. These periods are classical objects with myriad interwoven connections to half-integral weight modular forms, base change L-functions, quadratic fields and quadratic forms. The key point is that the corresponding generating series has (metaplectic) modular properties. An interesting special case is the constant function \(f = 1\) in which case \(P(D; 1) = H(D)\) is, by definition, the Hurwitz class number.
With applications to the above mentioned symplectic restriction problem in mind, we are interested in pairs of Heegner periods in the spectral average
for a suitable test function h, two discriminants \(D_1, D_2 < 0\) and \(\Lambda _{\text {ev}}\) as in Proposition 1. While pairs of geodesics have been studied in a few situations [63,64,65] (but only for compact Riemann surfaces and not from an arithmetic point of view), to the best of our knowledge nothing seems to be known about spectral averages of pairs of Heegner periods. Opening the sums in the definition of \(P(D_1; \mathtt{u})\) and \(P(D_2; \mathtt{u})\), this can be expressed as a double sum of an automorphic kernel
in the usual notation which resembles the set-up of a relative trace formula. However, the standard methods in this situation (e.g. [32]) do not easily apply here as the stabilizers of \(z_1\) and \(z_2\) are essentially trivial. We thus take a different approach to establish the following relative trace formula for which we need some notation. For \(n >0\) and \(t \in \mathbb {R}\) let
For \(t\in \mathbb {R}\), \(x > 0\) and \(\kappa \in \mathbb {R}\) let
where \(J_{it}(x)\) is the Bessel function. Finally, for \(\kappa \in \mathbb {Z} + 1/2\), \(n, m \in \mathbb {Z}\) and \(c \in \mathbb {N}\) define the modified Kloosterman sums
where
Note that \(K^+_{\kappa }(n, m, c)\) is symmetric in m and n and 2-periodic in \(\kappa \). They satisfy the Weil-type bound
see e.g. [73, Lemma 4] in the case \(n=m\), the general case being analogous. In order to simplify the notation we assume that \(D_1, D_2\) are fundamental discriminants. In Section 7 we state the general version for arbitrary negative discriminants.
Theorem 3
Let \(\Delta _1, \Delta _2\) be negative fundamental discriminants and let h be an even function, holomorphic in \(|\mathfrak {I}t | < 2/3\) with \(h(t) \ll (1+|t|)^{-10}\). Then
The experienced reader will spot the strategy of the proof from the shape of the formula: A Katok-Sarnak-type formula translates \(P(\Delta ; \mathtt{u})\) into a product of a first and a \(\Delta \)-th half-integral weight Fourier coefficient. In this way, a pair of two Heegner periods becomes a product of four half-integral weight Fourier coefficients. A quadrilinear form of half-integral weight Fourier coefficients is not directly amenable to any known spectral summation formula, so this looks like a dead end, but we can use a Waldspurger-type formula a second time, now in the other direction, to translate the two first coefficients into a central L-value. This L-value can be written explicitly as a sum of Hecke eigenvalues by an approximate functional equation. We can now use the correspondence between half-integral and integral weight forms a third time, namely by combining the Hecke eigenvalues into the half-integral weight coefficients by means of metaplectic Hecke relations. Finally, the Kuznetsov formula for the Kohnen plus space provides the desired geometric evaluation of the relative trace. This particular version of the Kuznetsov formula is also new and will be stated and proved in Sect. 5. The same strategy works also with minor modifications for pairs of (traces of) geodesic cycle integrals.
1.4 Mean values of L-functions
We highlight another ingredient of independent interest, a special case of which (Corollary 5) is a hybrid Lindelöf-on-average bound for central values of twisted L-functions. Its proof is deferred to Sect. 8.
Proposition 4
Let \(\mathcal {D}, \mathcal {T} \geqslant 1\) and \(\varepsilon > 0\). For a fundamental discriminant \(\Delta \) let \(\chi _{\Delta } = (\frac{\Delta }{.})\) be the Jacobi–Kronecker symbol.
(a) We have
where the sum is over an orthonormal basis of Hecke–Maaß cusp forms u with spectral parameter \(t_{u}\), and \(\alpha (u)\) is any sequence of complex numbers, indexed by Maaß forms.
(b) We have
for an arbitrary function \(\alpha : [-\mathcal {T}, \mathcal {T}] \rightarrow \mathbb {C}\).
The proof uses, among other things, the spectral large sieve of Deshouillers-Iwaniec [21] and Heath-Brown’s large sieve for quadratic characters [34]. Note that \(L({ u} \times \chi _{\Delta }, 1/2) \geqslant 0\) is non-negative [47, Corollary 1]. The key point here is that there is complete uniformity in \(\mathcal {T}\) and \(\mathcal {D}\).
We give an immediate application. Let us choose \(\alpha (u) = L(u, 1/2)\) and note that
where the right hand side is the base change L-function to \(K = \mathbb {Q}(\sqrt{\Delta })\). We can now use a standard mean value bound for L(u, 1/2), e.g. [41, Theorem 3], to conclude
Corollary 5
For \(\mathcal {T}, \mathcal {D} \geqslant 1\) and \(\varepsilon > 0\) we have
where the first sum runs over a basis of Hecke–Maaß cusp forms u with spectral parameter \(t_u \leqslant \mathcal {T}\).
Again this bound is completely uniform and best-possible in the \(\mathcal {D}\) and \(\mathcal {T}\) aspect. We give another interpretation of Proposition 4(a). For odd u, the root number of \(L(u \times \chi _{\Delta }, s)\) is \(-1\) (see [10, Lemma 2.1]), so the central value vanishes. For even u the central L-values \(L(u \times \chi _{\Delta }, 1/2)\), as in (4.9) below, are proportional to squares of Fourier coefficients \(b_v(\Delta )\) of weight 1/2 Maaß forms v in Kohnen’s subspace for \(\Gamma _0(4)\), normalized as in (3.7). We refer to Sect. 3 for the relevant definitions. In particular, for the usual choice of the Whittaker function the normalized Fourier coefficient \(\tilde{b}_v(\Delta ) = e^{-\pi |t_v|/2} |t_v|^{ \text {sgn}(\Delta )/4} |\Delta |^{1/2} b_v(\Delta ) \) is of size one on average. In this way we conclude bounds for linear forms in half-integral weight Rankin–Selberg coefficients:
where the v-sum runs over an \(L^2\)-normalized Hecke eigenbasis of non-exceptional weight 1/2 Maaß forms in Kohnen’s subspace for \(\Gamma _0(4)\) with spectral parameter \(t_v\). We refer to the remark after the proof of Lemma 8 for more details.
1.5 Organization of the paper
Sections 2–5 prepare the stage and compile all necessary automorphic information. New results include versions of the half-integral Kuznetsov formula and a Voronoi formula for Hurwitz class numbers in Sect. 5.
Proposition 1, Theorem 3, and Proposition 4 are proved in Sects. 6–8 respectively. This is followed by an interlude on the analysis of special functions and oscillatory integrals in Sect. 9.
In the remainder we complete the proof of Theorem 2. In Sect 10 we first prove an upper bound \(\mathcal {N}_{\text {av}}(K) \ll K^{\varepsilon }\) by a preliminary argument. This will be useful and necessary to control certain auxiliary variables and error terms later. The full asymptotic formula is derived in Sects. 11–14. Due to several applications of certain spectral summation formulae, we have various diagonal and off-diagonal terms. Section 12 treats the total diagonal term that extracts the leading term \(4\log K\) in Theorem 2. Sections 13 and 14 deal with the diagonal off-diagonal and the off-off-diagonal term.
1.6 Common notation
For \( c\not = 0\) we extend the Jacobi-Symbol \(\chi _c(d)= (\frac{c}{d})\) for positive odd integers d to all integers \(d\not = 0\) as the completely multiplicative function defined by \(\chi _c(-1) = \text {sign}( c)\) and \(\chi _c(2) = 1\) if \(c \equiv 1 \, (\text {mod }8)\), \(\chi _c(2) = -1\) if \(c \equiv 5 \, (\text {mod }8)\), \(\chi _c(2) = 0\) if c is even. The value of \(\chi _c(2)\) remains undefined only if \(c \equiv 3 \, (\text {mod }4)\).
We call an integer \(D \in \mathbb {Z} \setminus \{0\}\) a discriminant if \(D \equiv 0\) or 1 (mod 4). Every discriminant D can uniquely be written as \(D = \Delta f^2\) for some \(f\in \mathbb {N}\) and some fundamental discriminant \(\Delta \) (possibly \(\Delta = 1\)). For each discriminant D, the map \(\chi _D\) is a quadratic character of modulus |D| that is induced by the character \(\chi _{\Delta }\) corresponding to the field \(\mathbb {Q}(\sqrt{\Delta })\). (If \(\Delta = 1\), then \(\chi _{\Delta }\) is the trivial character.) Throughout, the letters D and \(\Delta \) are always reserved for discriminants resp. fundamental discriminants, usually negative.
The letter \(\Gamma \) is used for the gamma function and also for the group \(\Gamma = \mathrm{SL}_2(\mathbb {Z})\); confusion will not arise. We write \({\overline{\Gamma }} = \mathrm{PSL}_2(\mathbb {Z})\).
For \(a, b \in \mathbb {N}\) we write \(a \mid b^{\infty }\) to mean that all prime divisors of a divide b. We also write \((a, b^{\infty }) = a/a_1\) where \(a_1\) is the largest divisor of a that is coprime to b. We use the usual exponential notation \(e(z):=e^{2\pi i z}\) for \(z\in \mathbb {C}\). The letter \(\varepsilon \) denotes an arbitrarily small positive constant, not necessarily the same at every occurrence. The Kronecker symbol \(\delta _{S}\) takes the value 1 if the statement S is true and 0 otherwise. The notation \(\int _{(\sigma )}\) denotes a complex contour integral over the vertical line with real part \(\sigma \). We use the usual Vinogradov symbols \(\ll \) and \(\gg \), and we use \(\asymp \) to mean both \(\ll \) and \(\gg \). We always assume that the number K in Theorem 2 is sufficiently large.
2 Holomorphic forms of degree one and two
For a positive integer k let \(S^+_{k-1/2}(4)\) denote Kohnen’s plus [51] space of holomorphic cusp forms of weight \(k-1/2\) and level 4. These have a Fourier expansion of the form
and form a finite-dimensional Hilbert space with the inner product
This space is isomorphic (as a module of the Hecke algebra) to the space \(S_{2k-2}\) of holomorphic cusp forms of weight \(2k-2\) and level 1 [51, Theorem 1]. We denote by \(f_h \in S_{2k-2}\) the (unique up to scaling) image of a newform \(h \in S^+_{k-1/2}(4)\). The Hecke algebra on \(S^+_{k - 1/2}\) is generated by the operators \(T(p^2)\), p prime, and for \(p=2\) we follow Kohnen’s definition [51, p. 250] of T(4) that allows a uniform treatment of all primes including \(p=2\). If \(\lambda (p)\) are the Hecke eigenvalues of \(f_h\) (normalized so that the Deligne’s bound reads \(|\lambda (p)| \leqslant 2\)), then
for all primes p with the convention \(c_h(x) = 0\) for \(x \not \in \{n \in \mathbb {N} \mid (-1)^k n \equiv 0, 3\, (\text {mod 4}) \}\). Iterating this formula gives
for squarefree \(m \in \mathbb {N}\).
The space \(S^{+}_{k-1/2}(4)\) can be characterized as an eigenspace of a certain operator acting on the space \(S_{k-1/2}(4)\) of all holomorphic cusp forms of weight \(k-1/2\) and level 4 [51, Proposition 2]. It possesses Poincaré series \(P^+_n \in S^+_{k-1/2}(4)\) satisfying the usual relationFootnote 1 [52, (4)]
for all \((-1)^k n \equiv 0, 3\, (\text {mod 4})\) and all \(h \in S^+_{k-1/2}(4)\) with Fourier expansion (2.1). These Poincaré series are the orthogonal projections of the Poincaré series \(P_n \in S_{k-1/2}(4)\) onto \(S^{+}_{k-1/2}(4)\) and their Fourier coefficients are computed explicitly in [52, Proposition 4]. This gives us the following Petersson formula for Kohnen’s plus space.
Lemma 6
Let \(k \geqslant 3\) be an integer, \(\kappa = k-1/2\). Let \(\{h_j\}\) be an orthogonal basis of \(S^+_{\kappa }(4)\) with Fourier coefficients \(c_j(n)\) as in (2.1). Let n, m be positive integers with \((-1)^kn, (-1)^km \equiv 0, 3\, (\mathrm{mod } \,\,4)\). Then
For a positive integer k we denote by \(S^{(2)}_k\) the space of Siegel cusp forms of degree 2 of weight k for the symplectic group \(\mathrm{Sp}_4(\mathbb {Z})\) with Fourier expansion
for \(Z = X + iY \in \mathbb {H}^{(2)}\) on the Siegel upper half plane, where \(\mathcal {P}(\mathbb {Z})\) is the set of symmetric, positive definite 2-by-2 matrices with integral diagonal elements and half-integral off-diagonal elements. This is a finite-dimensional Hilbert space with respect to the inner product
Remark: In the generic case, the above norm may be used to determine the associated adjoint L-function at \(s=1\) by a recent work of Chen and Ichino [17]. However Siegel cusp forms are not globally generic, albeit that they are equivalent to a generic form everywhere but the archimedean place. Nevertheless, their Petersson norm may be expressed in terms of its adjoint L-function via Böcherer’s conjecture (now a theorem due to the breakthrough result of Furusawa and Morimoto [29] alongside the explicit formulation in [23]).
There is a special family of Siegel cusp forms that are derived from elliptic cusp forms (Saito–Kurokawa lifts or Maaß Spezialschar). Let k be an even positive integer. Let \(h \in S^+_{k-1/2}(4)\) be a Hecke eigenform of weight \(k-1/2\) with Fourier expansion as in (2.1), and let \(f_h \in S_{2k-2}\) denote the corresponding Shimura lift. The Saito–Kurokawa lift associates to h (or \(f_h\)) a Siegel cusp form \(F_h\) of weight k for \(\mathrm{Sp}_4(\mathbb {Z})\) with Fourier expansion (2.5), where
see e.g. [27, §6]. If \(L (f_h, s)\) is the standard L-function of \(f_h\) (normalized so that the functional equation sends s to \(1-s\)), then the norms of \(F_h\) and h are related by ( [53, p. 551], [14, Lemma 4.2 & 5.2 with \(M=1\)])
Remarks: (1) This inner product relation was generalized to Ikeda lifts in [48]. Their case \(n=2\) is consistent with (2.7) taking into account the different normalization of the inner product (2.2) with an additional factor 1/6 imported from [54].
(2) Note that the formula three lines after (4) in [53] is off by a factor of 2 (and also based on a different inner product). The proof of [14, Lemma 5.2] contains a number of typos (the powers of \(\pi \) in (10) and the preceding display seem to be incorrect as well as the normalizing factor after (10)), but the final result for \(M=1\) is independent of this and consistent with (2.7).
(3) For future reference we note that for \(\mathfrak {R}s > 1\) we have
if \(\lambda (n)\) denotes the n-th Hecke eigenvalue of \(f_h\).
3 Non-holomorphic automorphic forms
We recall the spectral decomposition of \(L^2(\Gamma \backslash \mathbb {H})\) with \(\Gamma = \mathrm{SL}_2(\mathbb {Z})\), consisting of the constant function \(u_0 = \sqrt{3/\pi }\), a countable orthonormal basis \(\{u_j, j = 1, 2, \ldots \}\) of Hecke–Maaß cusp forms and Eisenstein series \(E(., 1/2 + it)\), \(t \in \mathbb {R}\). As in the introduction we call the collection of these functions \(\Lambda \) and the subset of even members \(\Lambda _{\text {ev}}\), and we write \(\int _{\Lambda }\) resp. \(\int _{\Lambda _{\text {ev}}}\) for the corresponding spectral averages. We also introduce the notation \(\int ^{*}_{\Lambda _{\text {ev}}} d\mathtt{u}\) for a spectral average without the residual spectrum which in this case consists only of the constant function. We use the general notational convention that an element in \(\Lambda \) or \(\Lambda _{\text {ev}}\) is denoted by \(\mathtt{u}\) while a cusp form is usually denoted by u.
For \(t \in \mathbb {R}\) let \(U_t^{\text {ev}}\) denote the space of even weight zero Maaß cusp forms for \(\Gamma \) with Laplacian eigenvalue \(1/4 + t^2\). It is equipped with the inner product
We write the Fourier expansion as
with \(a(-n) = a(n)\), where \(W_{0, it}(4\pi y) = 2 y^{1/2} K_{it}(2\pi y)\) is the Whittaker function. The Hecke operators T(n), normalized as in [47, (1.1)], act on \(U_t^{\text {ev}}\) as a commutative family of normal operators. We call \(t = t_u\) the spectral parameter of u. The Eisenstein series
for \({\overline{\Gamma }} = \mathrm{PSL}_2(\mathbb {Z})\) are eigenfunctions of all T(n) with eigenvalue
and an eigenfunction of the Laplacian with eigenvalue \(s(1-s)\). We call \((s - 1/2)/i\) the spectral parameter of E(., s). If \(\mathtt{u}\) is an Eisenstein series or a Hecke–Maaß cusp form with Hecke eigenvalues \(\lambda _\mathtt{u}(n)\) we define the corresponding L-function \(L(\mathtt{u}, s) = \sum _n \lambda _\mathtt{u}(n) n^{-s}\). In particular
If \(u \in U_t^{\text {ev}}\) is a cuspidal Hecke eigenform with eigenvalues \(\lambda (n)\), then \(|n|^{1/2}a(n) = a(1) \lambda (|n|)\), and by Rankin–Selberg theory (and [33, 6.576.4] with \(a = b = 4\pi \), \(\nu = \mu = it\))
We recall the Kuznetsov formula [55] and combine directly the “same sign” and the “opposite sign” formula to obtain a version for the even part of the spectrum. The conversion between Hecke eigenvalues and Fourier coefficients in the cuspidal case follows from (3.4).
Lemma 7
Let \(n, m \in \mathbb {N}\). Let h be an even holomorphic function in \(|\mathfrak {I}t| \leqslant 2/3\) with \(h(t) \ll (1 + |t|)^{-3}\). For non-constant \(\mathtt{u} \in \Lambda \) let \(\mathcal {L}(\mathtt{u}) = L(\text {{ sym}}^2 \mathtt{u}, 1)\) if \(\mathtt{u}\) is cuspidal andFootnote 2\(\mathcal {L}(\mathtt{u}) = \frac{1}{2}|\zeta (1 + 2it)|^2\) if \(\mathtt{u} = E(., 1/2 + it)\) is Eisenstein.Footnote 3 Then
where
We turn to half-integral weight forms. Let \(V_t^+(4)\) denote the (“Kohnen”) space of weight 1/2 Maaß cusp forms for \(\Gamma _0(4)\) with eigenvalue \(1/4 + t^2\) with respect to the weight 1/2 Laplacian and Fourier expansion
The congruence condition on the indices can be encoded in an eigenvalue equation: the functions \(v \in V_t^+(4)\) are invariant under the operator L defined in [47, (0.7), (0.8)], cf. also [5, (A.1)]. The space \(V_t^{+}(4)\) is a finite-dimensional Hilbert space with respect to the inner product
The Hecke operators \(T(p^2)\), p prime, act on \(V^+_t(4)\) as a commutative family of normal operators that commute with the weight 1/2 Laplacian (again we use Kohnen’s modification for T(4) in order to treat all primes uniformly). Explicitly, if \(T(p^2) v = \lambda (p) v\), then (see [47, (1.3)]) the Fourier coefficients of v satisfy
for all primes p and all \(n \in \mathbb {Z} \setminus \{0\}\) with \(n \equiv 0, 1\) (mod 4) with the convention \(b(x) = 0\) for \(x \not \in \mathbb {Z}\). If \(v \in V_t^+(4)\) is an eigenfunction of all Hecke operators \(T(p^2)\) with eigenvalues \(\lambda (p)\), the relation (3.9) can be captured in the identity
for a fundamental discriminant \(\Delta \). Extending \(\lambda (p)\) to all n by the usual Hecke relations, we see that the denominator is just \(\sum _{\nu } \lambda (p^{\nu })p^{-\nu s}\), so that
for a fundamental discriminant \(\Delta \) and \(f \in \mathbb {N}\).
4 Period formulae
Let \(v \in V_t^+(4)\). Katok and Sarnak proved in [47, Proposition 4.1] that there is a linear map (a theta lift) \({\mathscr {S}}\) sending v to a non-zero element in \(U_{2t}^{\text {ev}}\) if \(b(1)\not = 0\) and to 0 otherwise. A calculation [47, pp. 221–223] shows that if v is an eigenform of \(T(p^2)\), then \({\mathscr {S}} v\) is an eigenform of T(p) with the same eigenvalue, and this computation works verbatim for \(p=2\), too. Conversely, given an eigenform \(u\in U_{2t}^{\text {ev}}\) with Hecke eigenvalues \(\lambda (p)\), by [4, Theorem 1.2]Footnote 4 there is a unique (up to scaling) \(v \in V_t^+(4)\) having eigenvalues \(\lambda (p)\) for \(T(p^2)\), \(p > 2\), (and then automatically also for T(4), since T(4) commutes with the other operators) which may or may not satisfy \(b(1) \not = 0\). In particular, for a given eigenform \(u\in U_{2t}^{\text {ev}}\) there is at most one eigenform \(v \in V_t^+\) (up to scaling) with \({\mathscr {S}}v = u \in U_{2t}^{\text {ev}}\). If it exists, we normalize it to be \(L^2\)-normalized and denote its Fourier coefficients b(n) as inFootnote 5 (3.7). If no such v exists, we define b(n) to be 0.
If \(\mathtt{u}\) is a Hecke–Maaß cusp form or an Eisenstein series, the absolute square of the periods \(P(D; \mathtt{u})\) can be expressed in terms of central L-functions. We introduce the relevant notation. For a discriminant \(D = \Delta f^2\) with a fundamental discriminant \(\Delta \) let
With \(\rho _s\) as in (3.2) we can re-write this as
Since \(\rho _s = \rho _{1-s}\), we see that L(D, s) satisfies the same type of functional equation as \(L(\Delta , s)\) namely
with \(\mathfrak {a} = 1\) if \(D < 0\) and \(\mathfrak {a} = 0\) if \(D > 0\). For a Hecke–Maaß cusp form or an Eisenstein series \(\mathtt{u} \) define
The key point is that
as one can check by a formal computation with Euler products using the Hecke relation for the eigenvalues \(\lambda _\mathtt{u}\) (which are identical for Maaß forms and Eisenstein series), see [54, p. 188-189]. For later purposes we record the simple bound
uniformly in \(\Delta \) and \(\mathtt{u}\), which follows from the Kim-Sarnak bound with \( 2 \cdot 7/64 < 1/3\).
From (4.2), the fact that \(|L(\chi _{\Delta }, 1/2 + it)|^2 = L(E(., 1/2 + it), \Delta , 1/2)\) and (4.5) we have
The next key lemma expresses the periods P(D; u) defined in (1.5) for cusp forms u as half-integral weight Fourier coefficients, and then their squares as L-functions. The first formula (4.8) is essentially a formula of Katok-Sarnak [47, (0.16) & (0.19)], the passage from squares of metaplectic Fourier coefficients to L-functions in (4.9) is a Kohnen-Zagier type formula of Baruch-Mao [4, Theorem 1.4]. The combination (4.10) of these two is a special case of a formula of Zhang [76, Theorem 1.3.2] or [77, Theorem 7.1], derived independently by a different method.
Lemma 8
If \({ u} \in U_{2t}^{\text {{ ev}}}\) is an even Hecke–Maaß cusp form and \(D_1, D_2 <0\) are two discriminants, then
For a discriminant D of either sign we have
For \(D < 0\) we have
Remark: The exact shape of these formulas is an unexpectedly subtle matter, and the attentive reader might well be confused by the various and slightly contradictory versions in the literature. There are at least four sources of possible conflict:
-
the Whittaker functions can be normalized in different ways;
-
the inner products can be normalized in different ways;
-
the translation from adelic language to classical can cause problems;
-
there can be ambiguities related to the groups \(\mathrm{GL}(2)\) vs. \(\mathrm{SL}(2)\) vs. \(\mathrm{PSL}(2)\).
The Katok-Sarnak formula exists in the literature with proofs given in at least in three different versions: [47, (0.16)], [6, Theorem A1] and [25, Theorem 4]. The original version of Katok-Sarnak was carefully revised by Biró, but the latter seems to be still off by a factor 2 compared to the version in Duke-Imamoğlu-Toth, which was checked numerically.
The Baruch-Mao formula [4, Theorem 1.4] is quoted in [25, (5.17)] with an additional factor 2. Zhang’s result [77, Theorem 7.1] (and also the remark after [76, Theorem 1.3.2], the theorem itself being correct) is missing the stabilizer \(\epsilon (z)\) in the period P(D; u). This formula is slightly incorrectly reproduced in [58] and several follow-up papers, based on a different normalization of the Whittaker function. Finally, neither combination of one of the three Katok-Sarnak formulae with the Baruch-Mao formula in [4, Theorem 1.4] coincides with Zhang’s formula.
We therefore feel that these beautiful and important results should be stated with correct constants and normalizations. For the proof of Theorem 2 and its connection to the mass equidistribution conjecture this is absolutely crucial. As [25, Theorem 4] was checked numerically by the authors, we follow their version of the Katok-Sarnak formula. This gives (4.8). We verified and confirmed the constant in Zhang’s formula independently by proving an averaged version in Appendix A. This gives (4.10). By backwards engineering, we established the numerical constant in the Baruch-Mao formula, which gives (4.9) and coincides with [25, (5.17)].
Note that (4.10) is essentially universal: the right hand side of (4.10) is independent of any normalization, the left hand side depends only on the normalization of the inner product (3.1) which is standard.
Proof
We start with the formula [47, (0.16)] for a general discriminant \(D < 0\), but use the numerical constants as in [25, Theorem 4] (proved only for fundamental discriminants there). This formula expresses P(D; u) for an arbitrary discriminant \(D < 0\) as a sum over Fourier coefficients of all v with \({\mathscr {S}}v = { u}\). By the above remarks, there is at most one such v. If there is none, then both sides of (4.8) and (4.10) vanish by [47, (0.16), (0.19)] and our convention that \(b(n) = 0\) in this case, and there is nothing to prove. Also note that the left hand side of (4.8) and both sides of (4.10) are independent of the normalization of u, so without loss of generality we may assume that u is Hecke-normalized as in [47]. We obtain
Next we insert [47, (0.19)] (again keeping in mind the different normalization of (3.8) and observing that this is coincides with the numerically checked version of [25, Theorem 4]) getting (4.8).
If D is a fundamental discriminant, then (4.9) follows from [25, (5.17)] together with (3.4) with \(a(1) = 1\) and 2t in place of t. By (3.10) and (4.5) this remains true for arbitrary discriminants. The formula (4.10) is a direct consequence of (4.8), (4.9) and well-known properties of the gamma function, and was proved independently by Zhang [76, Theorem 1.3.2]. \(\square \)
Remark: The argument at the beginning of this proof shows that the u-sum in Proposition 4(a), up to terms of size 0, can be replaced with the v-sum in (1.10), using (4.9). This completes the proof of (1.10).
A similar result holds for Eisenstein series. If \(aX^2 + bXY + cY^2\) is an integral quadratic form of discriminant \(D = b^2 -4ac < 0\) with Heegner point \(z = (\sqrt{|D|}i - b)/(2a)\), then
Hence
where \(\zeta (D, s)\) is definedFootnote 6 in [75, (6)]. By [75, Proposition 3 iii] (or [25, Theorem 3]) we obtain the following lemma in analogy to Lemma 8.
Lemma 9
If \(D < 0\) is a discriminant, then
and hence
Remarks: (1) The second formula follows from the first by (3.3) and (4.7).
(2) Here the verification of the numerical constants is much easier than in the cuspidal case. Taking residues at \(s=1\) in (4.11) for a fundamental discriminant \(\Delta < 0\) returns the class number formula for \(\mathbb {Q}(\sqrt{\Delta })\), which confirms the numerical constants.
(3) For future reference we recall the standard bounds
for \(\varepsilon > 0\). This is in particular relevant to obtain upper bounds in (4.12) and (4.10).
We close this section by stating a standard approximate functional equation [44, Theorem 5.3] for the L-functions occurring in the previous period formulae. For \(u \in U_t^{\text {ev}}\) and a fundamental discriminant \(\Delta \) (possibly \(\Delta = 1\)) we have
where
with \(\mathfrak {a} = 1\) if \(\Delta < 0\) and \(\mathfrak {a} = 0\) if \(\Delta > 0\). Note that \(W_t\) depends on \(\Delta \) only in terms of its sign. If we want to emphasize this we write \(W_t^+\) and \(W_t^-\) with \(\pm = \text {sgn}(\Delta )\).
A similar expression holds for \(E(., 1/2 + it)\) in place of u except that in the case \(\Delta = 1\) we have \(L(E(., 1/2 + it), 1, s) = \zeta (s+it)\zeta (s- it)\) and there is an additional polar termFootnote 7. We have
with \(\rho _{1/2 + it}\) as in (3.2).
5 Half-integral weight summation formulae
In this section we compile the Voronoi summation and the Kuznetsov formula for half-integral weight forms in the plus space.
5.1 Voronoi summation
As before let
be a Maaß form in the plus space of weight \(k \in \{1/2, 3/2\}\) and spectral parameter t for \(\Gamma _0(4)\) with respect to the usual theta multiplier. We start with the Voronoi summation formula [16, Theorem 3].
Lemma 10
Let \(c\in \mathbb {N}\), \(4 \mid c\), \((a, c) = 1\). Let \(\phi \) be a smooth function with compact support in \((-\infty , 0) \cup (0, \infty )\) and for \(y > 0\) define
where
with F as in (1.6). Then
with \(\epsilon _a\) as in (1.8).
The proof of the Voronoi formula (Lemma 10) follows from a certain vector-valued functional equation satisfied by the L-functions with coefficients \(b(\pm n) e(\pm an/c)\). The same functional equation holds if v is not cuspidal (this is clear from general principles and worked out explicitly in [22] along the same lines), but in this case the L-functions are not entire; they have various poles. We use this observation for two non-cuspidal modular forms. The first is a half-integral weight Eisenstein series
which transforms under \(\Gamma _0(4)\) as a weight 1/2 automorphic form with theta multiplier; see [25, p. 964]. As usual, D runs over all discriminants. The other is Zagier’s weight 3/2 Eisenstein seriesFootnote 8
which transforms under \(\Gamma _0(4)\) as a weight 3/2 automorphic form with theta multiplier. As before H(D) is the Hurwitz class number. For \(\epsilon \in \{\pm 1\}\), \((a, c) = 1\), \(4\mid c\), the Dirichlet series
has possible poles at \(w = 1/2 \pm it\) with certain residues \(R_{\epsilon }(\pm t, a/c)\), say (cf. [7, Lemma 2] in the case \(c=1\), the general case being analogous). Consequently we obtain the following analogue of Lemma 10.
Lemma 11
Let \(c\in \mathbb {N}\), \(4 \mid c\), \((a, c) = 1\), \(t \in \mathbb {R} \setminus \{0\}\). Let \(\phi \) be a smooth function with compact support in \((-\infty , 0) \cup (0, \infty )\) and for \(y > 0\) define \(\Phi \) as in (5.2). Then
For \(t=0\) one combines the ±-terms and takes the limit as \(t \rightarrow 0\). In our application, the values \(R_{\epsilon }(t, a/c)\) are irrelevant (as long as they are polynomial in c and t) since we apply the formula with a function \(\phi \) that oscillates much more strongly that \(x^{\pm it}\) so that the integral is negligible.
We obtain a similar summation formula for Hurwitz class numbers. Although we do not need it for the present result, we compute in Appendix B the residues explicitly and get the following handsome formula.
Lemma 12
Let \(c\in \mathbb {N}\), \(4 \mid c\), \((a, c) = 1\). Let \(\phi \) be a smooth function with compact support in \((0, \infty ) \). Then
where
Remark: Observing that for negative \(D \equiv 0, 1\) (mod 4) we have
it is a straightforward exercise to conclude
for \(\delta = 0, 1\). Further congruence conditions on D can be imposed, and the error term can be improved by a more careful treatment of the dual term in the Voronoi formula. See [72] for the corresponding result for the ordinary class number h(d). The following table provides some numerical results (here we combined the cases \(\delta = 0\) and \(\delta = 1\)).
X | 1000 | 2000 | 4000 | 6000 | 8000 | 10,000 |
---|---|---|---|---|---|---|
\(\displaystyle \sum ^{\begin{array}{c} \quad \\ \quad \end{array}}_{D \leqslant X} H(D)\) | 5280.5 | 15,131.3 | 43,189.5 | 79,685.7 | 122,967 | 172,106 |
\(\displaystyle \frac{\pi }{18} \overset{\begin{array}{c} \quad \\ \quad \end{array}}{X^{3/2}} - \frac{1}{4}X\quad \) | 5269.22 | 15,110.7 | 43,153.7 | 79,615.6 | 122,885.6 | \(\underset{\begin{array}{c} \quad \\ \quad \\ \quad \end{array}}{172,032.9}\) |
5.2 The Kuznetsov formula
The Kuznetsov formula was generalized by Proskurin [66] to arbitrary weights and by Andersen–Duke [2] to Kohnen’s subspace. Interestingly, only the direction from Kloosterman sums to spectral sums appears to be in the literature, but no complete version in the other direction. Biró [5, p. 151] has a version only valid for test functions on the spectral side whose spectral mean value is 0, Ahlgren–Andersen [1, Sect. 3] use an approximate version.
We take this opportunity to state and prove the relevant Kuznetsov formula both for the full space of half-integral weight forms and for Kohnen’s subspace. Starting with the former, for \(\kappa \in \{1/2, 3/2\}\) the relevant Kloosterman sums are
for \(4 \mid c\). The Eisenstein series belong to the two essential cusps \(\mathfrak {a} = \infty , 0\). We normalize and denote their Fourier coefficients by \(\phi _{\mathfrak {a}m}(1/2 + it) = \phi ^{(\kappa )}_{\mathfrak {a}m}(1/2 + it)\) as in [66, (12)–(14)]. We denote by \(\left. \sum \right. ^{(\kappa )}\) a sum over an orthonormal basis of the space of cusp forms of weight \(\kappa \) and label the members by \(v_j\), \(j = 1, 2, \ldots \) with Fourier coefficients \(b_j(n)\) as in (5.1) and spectral parameters by \(t_j\). We recall the definition of F from (1.6).
Proposition 13
Let \(\kappa \in \{1/2, 3/2\}\), \(m, n > 0\). Let h be an even function, holomorphic in \(|\mathfrak {I}t| < 2/3\) with \(h(t) \ll (1 + |t|)^{-4}\). Then
if in addition \(h(\pm i/4) = 0\). Moreover, regardless of the value of \(h(\pm i/4)\) we have
Remark: Note that the space of weight 1/2 Maaß forms v with spectral parameter i/4 are in the kernel of the Maaß lowering operator. Hence \(y^{-1/4}v\) is holomorphic, so that v has no non-vanishing negative Fourier coefficients. This is consistent with the fact that the Eisenstein contribution vanishes in this case because of the gamma factors in the denominator.
Proof
By [66, Lemma 3] with \(\sigma = 1\), \(t = 2\tau \in \mathbb {R}\) and the first formula in [66, Lemma 6] we have the “pre-Kuznetsov” formula
where
Note that taking \(\sigma = 1\) is admissible in the present situation because we have the same Weil-type bounds for the Kloosterman sums \(K_{\kappa }(n, m, c)\) as for \(K^+_{\kappa }(n, m, c)\) in (1.9). Also note that there is a typo in [66, Lemma 6] in the upper limit of the integral.
For h as in the lemma and \(t \in \mathbb {R}\) we have the following inversion formula
This is the lemma on p. 327 of [55]Footnote 9 which is readily proved by residue calculus. We now integrate the pre-Kuznetsov formula against
Our assumptions on h ensure absolute convergence and the possibility to shift the contour up and down to \(\mathfrak {I}t = \pm 1/2\) without crossing poles. In this way the \(\delta \)-term becomes
For the Kloosterman term we insert the definition in (1.6) getting
We note that
where the last equality follows from the recurrence relations [33, 8.471.1&2]. Substituting this, we can evaluate the y-integral by the fundamental theorem of calculus, arriving at
after some elementary manipulations. We multiply the resulting expression by \(\sqrt{mn}\) to obtain the first formula. Note that the c-sum is absolutely convergent by the power series expansion of the Bessel function contained in \(F(x, 2t, -\kappa )\) and the fact that \(h(\pm i/4) = 0\), as we can shift the t-contour up and down to \(|\mathfrak {I}t| = 1/2 - \varepsilon \).
There are two ways to derive the second formula from the first. We can either observe that in Proskurin’s notation we can compute \(\langle \overline{\mathcal {U}_m(., s_1)}, \overline{\mathcal {U}_n(., {\bar{s}}_2)}\rangle \) instead of \(\langle \mathcal {U}_m(., s_1), \mathcal {U}_n(., {\bar{s}}_2)\rangle \). This changes the signs of the coefficients in the spectral expansion and has the effect of changing \(\kappa \) into \(-\kappa \). Note that in this case we do not need the extra condition \(h(\pm i/4) = 0\) to shift contours. Alternatively, we use the following fact (cf. [26, (4.17),(4.18),(4.27),(4.28),(4.64), p. 507, p. 509]): the map
with \(X : f(z) \mapsto f(-{\bar{z}})\) and \(\Lambda _{\kappa } = \kappa /2 + y(i\partial _x - \partial _y)\) the weight lowering operator is a bijective isometry between weight \(\kappa \) and weight \(2-\kappa \) that exchanges positive and negative Fourier coefficients:
This yields again the second formula from the first. \(\square \)
In order to get a corresponding formula for the Kohnen space, we apply the L operator
to the formula. As in the case of the Petersson formula (Lemma 6), this has the effect that
-
the cuspidal term is restricted to forms in the Kohnen space;
-
the \(\delta \)-term is multiplied by 2/3; the reason for the number 2/3 is that the dimension of the Kohnen space is 1/3 of the full space, but only half of the coefficients appear;
-
the Kloosterman sums (5.6) are replaced with the Kloosterman sums (1.7) and the Kloosterman term is also multiplied by 2/3.
The hardest part is to compute the Eisenstein coefficients. All of this has been worked out in detail in [2, Sect. 5]. The corresponding formula [2, Theorem 5.3] can be inverted in the same way. In the following lemma let \(\sum ^+\) denote a sum over an orthonormal basis of weight 1/2 Maaß cusp forms in Kohnen’s space. We recall the definition (4.1) and (4.2) of L(D, s) for a discriminant D.
Lemma 14
Let \(\kappa = 1/2\) and let \(n, m \in \mathbb {Z}\) that are congruent 0 or 1 modulo 4. Let h be an even function, holomorphic in \(|\mathfrak {I}t| < 2/3\) with \(h(t) \ll (1 + |t|)^{-4}\). Then
for \(n, m > 0\) if in addition \(h(\pm i/4) = 0\). Moreover, regardless of the value of \(h(\pm i/4)\) we have
if \(n, m < 0\).
6 Harmonic analysis on positive definite matrices
Here we prove Proposition 1. We identify the Hilbert spaces
via
Note that for \( Y = \iota (x+iy, r)\) with \( r = \det Y\) we have
so that the measures coincide. The group \({\overline{\Gamma }} = \mathrm{PSL}_2(\mathbb {Z})\) acts faithfully on \(\mathcal {P}(\mathbb {R})\) and \(\mathcal {P}(\mathbb {Z})\) (as defined after (2.5)) by \(T \mapsto U^{\top } T U\) for \(U \in {\overline{\Gamma }}\). This is compatible with the action of \({\overline{\Gamma }}\) on \(\mathbb {H}\) by Möbius transforms. Every smooth function \(f \in L^2(\Gamma \backslash \mathbb {H})\) has a spectral decomposition
We recall the notion of the spectral parameter \(t_\mathtt{u}\) for \(\mathtt{u} \in \Lambda \); the constant function has spectral parameter i/2. Combining the spectral decomposition on \(\Gamma \backslash \mathbb {H}\) with Mellin inversion, we conclude that, for a smooth function \(\Phi \in L^2(\Gamma \backslash \mathcal {P}(\mathbb {R}))\), the spectral decomposition
holds provided \(\Phi \) is sufficiently rapidly decaying as \(r\rightarrow 0\) and \(r \rightarrow \infty \). Here,
is the Mellin transform with respect to the r-variable. This gives the Parseval formula
For an automorphic form \(\mathtt{u} \in \Lambda \) and a Siegel cusp form \(F \in S^{(2)}_k\) with Fourier expansion (2.5) we define the twisted Koecher–Maaß series by
where \(\epsilon (T) = \{U \in {\overline{\Gamma }} \mid U^{\top } T U = T\}\) is the stabilizer and \(\mathfrak {R}s\) is (initially) sufficiently large. This series is often defined in terms of \(\mathrm{GL}_2(\mathbb {Z})\)-equivalence instead of \({\overline{\Gamma }}\)-equivalence, but for us the present version is more convenient. This function has no Euler product, but it does have a functional equation. Let
Let us assume that k is even. Then for all even \(\mathtt{u}\) (including Eisenstein series and the constant function), the function \(L(F \times \mathtt{u}, s)\) has an analytic continuation to \(\mathbb {C}\) that is bounded in vertical strips and satisfies the functional equation [39, Theorem 3.5]
The functional equation is a consequence of the following period formula. For \(\mathfrak {R}s\) sufficiently large we have (cf. [39, pp. 927–928])
Now splitting the T-sum into equivalence classes modulo \({\overline{\Gamma }}\) and unfolding the integral, this equals
Note that it is important that the action of \({\overline{\Gamma }}\) is faithful. The last integral over \(\mathcal {P}(\mathbb {R})\) was evaluated by Maaß [60, p. 85 and p. 94]:
for any \(\mathtt{u} \in \Lambda \). Thus we obtain
initially for \(\mathfrak {R}s\) sufficiently large, but then by analytic continuation for all \(s \in \mathbb {C}\). For odd \(\mathtt{u}\), the left hand side vanishes, since \(\langle F(\iota (., r)), \mathtt{u}\rangle = 0\) for every r. The Parseval formula (6.2) now implies Proposition 1 in the introduction.
In the special case where F is a Saito–Kurokawa lift, the Koecher–Maaß series simplifies.
Lemma 15
If k is even and \(F = F_h \in S_k^{(2)}\) is a Saito–Kurokawa lift with Fourier expansion as in (2.5) and (2.6) and \(\mathtt{u} \in \Lambda \), then
for \(\mathfrak {R}s\) sufficiently large and \(P(D; \mathtt{u})\) as in (1.5).
Remark: One would expect that \(c_h(|D|)\) is roughly of size \(|D|^{\frac{k}{2} - \frac{3}{4}}\) and that \(P(D; \mathtt{u})\) is roughly of size \(|D|^{\frac{1}{4}}\) (for non-constant \(\mathtt{u}\)) so that typically \(c_h(|D|)P(D; \mathtt{u})|D|^{-\frac{k-1}{2}}\) is roughly of constant size (with respect to D). Using trivial bounds for \(P(D; \mathtt{u})\) and \(c_h(|D|)\), the quantity \(c_h(|D|)P(D; \mathtt{u})|D|^{-\frac{k-1}{2}}\) is certainly \(\ll |D|^{3/4+\varepsilon }\).
Proof
We copy the argument from [12, p. 22]. Let \(\Delta \) be a negative fundamental discriminant, \(D = \Delta f^2\) a negative discriminant and \(T \in \mathcal {P}(\mathbb {Z})/{\overline{\Gamma }}\). For such \(T = \left( {\begin{matrix} n &{} r/2\\ r/2 &{} m\end{matrix}}\right) \) we write \(e(T) = (n, r, m)\) for the greatest common divisor of n, m, r. It follows from (2.6) that
Writing \(t = t'd\) with \(t' \mid f/d\), we can evaluate the \(t'\)-sum getting
as desired. \(\square \)
We combine Proposition 1 and Lemma 15 with (2.7) and use the notation of these formulas to derive the following basic spectral formula for the restricted norm \(\mathcal {N}(F_h)\) of a Saito–Kurokawa lift \(F_h \in S_k^{(2)}\).
Proposition 16
Let k be even and let \(F = F_h \in S_k^{(2)}\) be a Saito–Kurokawa lift with Fourier expansion as in (2.5) and (2.6). Let \(f_h \in S_{2k-2}\) denote the corresponding Shimura lift of h. Then
where
and \(\mathcal {L}(h, \mathtt{u}, s) \) is the analytic continuation of
for \(\mathfrak {R}s\) sufficiently large.
The renormalized functions still satisfy the functional equation
The inclusion of the gamma factor in (6.6) is motivated by the formula in Lemma 6.
From the Dirichlet series expansion and the functional equation we obtain an approximate functional equation, cf. [44, Theorem 5.3].
Lemma 17
Let \(F = F_h\in S_k^{(2)}\) be a Saito–Kurokawa lift and \(\mathtt{u}\in \Lambda \) with spectral parameter \(t_\mathtt{u}\). For \(t, x \in \mathbb {R}\) let
Then
Combining Proposition 16, Lemma 17 and (2.8) we obtain the following basic formula:
7 A relative trace formula
In this section we prove a slightly more general formula than that stated in Theorem 3. Let \(D_1 = \Delta _1f_1^2, D_2 = \Delta _2f_2^2 < 0\) be two arbitrary negative discriminants and h as specified in Theorem 3. Combining (4.8) (with t/2 in place of t) and (4.11) we have
By the argument of the proof of Lemma 8 we can re-write the sum over even cusp forms u as a sum over weight 1/2 cusp forms v in Kohnen’s space. Thus the last two terms of the preceding display become
Here, as in Lemma 14, \(\sum ^{(+)}\) indicates a sum over an orthonormal basis of weight 1/2 cusp forms \(v_j\) in Kohnen’s space with spectral parameter \(t_j\) and Fourier coefficients \(b_j(D)\), and \(u_j\) is the corresponding Shimura lift with spectral parameter \(2t_j = t_u\). If \(\lambda _j(n)\) are the Hecke eigenvalues of \(u_j\), then by the approximate functional equations (4.14) and (4.16) with \(\Delta = 1\) we can write
and
with \(W_t\) as in (4.15) with \(\mathfrak {a} = 0\). Note that \(W_{2t}(x)\) is even and holomorphic in \(|\mathfrak {I}t | <2/3\), satisfying the uniform bound \(W_{2t}(x)\ll (1+|t|^2)/x^2 \) in this region (by trivial bounds). Moreover \(W_{2t}(x)\) vanishes at \(t = \pm i/4\). We first deal with residue term in the formula for \(|\zeta (1/2 + 2it)|^2\) and substitute this back into the Eisenstein term. This gives
We can slightly simplify this by applying in the plus-term the functional equation for \(L(D_1, 1/2 + 2it)\) and changing t to \(-t\), and in the minus-term the functional equation (4.3) for \(L(D_2, 1/2 - 2 i t)\). In this way we see that the two terms are equal and after some simplification we obtain
Our next goal is to use the half-integral weight Hecke relations to combine \(\lambda _j(n) b_j(D_1)\). For notational simplicity let us define \(\tilde{b}_j(D_1) = \sqrt{|D_1|}b_j(D_1)\). From (3.10) we obtain
In the last step we used (3.10) again together with Möbius inversion. This yields
Comparing (3.10) and (4.2), we see that the same Hecke relations hold for Eisenstein series, and we therefore have
We are now in a position to apply the second Kohnen-Kuznetsov formula in Lemma 14 with
in place of h(t). This function satisfies the hypotheses of that formula (and decays rapidly enough in n and m), recall that \(W_{2t}(n)\) vanishes at \(t = \pm i/4\). The diagonal exists only if \(\Delta _1 = \Delta _2\) and \(vwn = f_2\). For a function H we introduce the integral transform
with F as in (1.6).
Theorem 18
Let \(D_1 = \Delta _1 f_1^2, D_2 = \Delta _2f^2_2\) be negative discriminants and let h be an even function, holomorphic in \(|\mathfrak {I}t | < 2/3\) with \(h(t) \ll (1+|t|)^{-10}\). Define \(W_t\) as in (4.15), L(D, s) as in (4.1) and \(K^+_{3/2}(a, b, c)\) as in (1.7). Then
where \(H_{b}(t) = h(t)W_{t}(b)\) and \(H^{\dagger }\) is given by (7.1).
Remarks:
(1) Specializing \(f_1 = f_2 = 1\) we obtain Theorem 3.
(2) Recall again that \(W_t(x) = 0\) for \(t = \pm i/2\), so that contour shifts in the t-integral ensure that the c, n-sum is absolutely convergent.
(3) The first term on the right hand side corresponds to the constant function and is obviously indispensable. In all practical applications, the t-integral in the second term is of bounded length due to the factor \(\exp ((1/2 - it)^2)\), so that by subconvexity bounds for \(L(D, 1/2 + it)\) this term is dominated by the class number term. The last term can be analyzed in the same way as a typical off-diagonal term in the Kuznetsov formula except that it contains an extra n-sum of length \(\approx |t|\) from the approximate functional equation, cf. (9.17) below. Contrary to its appearance, the diagonal term is symmetric in \(f_1, f_2\) (as it should be) and can be written as
where
with \(\alpha _p = \max (v_p(f_1), v_p(f_2))\) and \(\beta _p = \min (v_p(f_1), v_p(f_2))\) for the usual p-adic valuation \(v_p\).
8 Mean values of L-functions
This section is devoted to the proof of Proposition 4. To begin with, we recall Heath-Brown’s large sieve in two variations [34, Corollaries 3 & 4]Footnote 10
Lemma 19
(a) Let \(N, Q \geqslant 1\), let S(Q) denote the set of real primitive characters of conductor up to Q and let \((a_n)\) be a sequence of complex numbers with \(|a_n| \leqslant 1\). Then
for every \(\varepsilon > 0\).
(b) Let \(D, N \geqslant 1\), \((a_n), (b_{\Delta })\) be two sequences of complex numbers with \(|a_m|, |b_{\Delta }| \leqslant 1\), where \(b_{\Delta }\) is supported on the set of fundamental discriminants \(\Delta \). Then
We start with part (a) of the proposition. As mentioned in the introduction, \(L(u \times \chi , 1/2) = 0\) if u is odd for root number reasons. For even u we use the approximate functional equation (4.14) and write
with
where \(\mathfrak {a} = 1\) if \(\Delta < 0\) and \(\mathfrak {a} = 0\) if \(\Delta > 0\). We can treat positive and negative discriminants separately, so that we may assume that \(G(s, t_{ u} ) \) is independent of \(\Delta \). Eventually we would like to apply the Cauchy-Schwarz inequality, the spectral large sieve inequality and Heath-Brown’s large sieve for quadratic characters. The latter requires that the n-sum is restricted to odd squarefree integers. Therefore we uniquely factorise \(n = 2^{\alpha } n_1n_2^2\) with \(n_1, n_2\) odd, \(n_1\) squarefree and use the Hecke relations to write
We use Möbius inversion to detect the condition \((n_2, \Delta ) = 1\) and we observe that the \(\alpha \)-sum depends only on \(\Delta \) modulo 8. Therefore
where \(\Delta ' f\) is restricted to negative fundamental discriminants and
uniformly in \(\mathfrak {R}s \geqslant \varepsilon \) for \(\theta = 7/64\) by the Kim-Sarnak bound. In the above formula we define \(\chi (\delta , 2^{\alpha }) := \chi _{\Delta }(2^{\alpha })\) for any \(\Delta \equiv \delta \) (mod 8).
We substitute this back into (8.1). Shifting the contour to the far right, we can truncate the \(n_1\)-sum at \(n_1 \leqslant ( \mathcal {D}\mathcal {T})^{1+\varepsilon } \) for \(|t_u| \leqslant \mathcal {T}\) at the cost of a negligible error. Having done this, we shift the contour back to \(\mathfrak {R}s = \varepsilon \), truncate the integral at \(|\mathfrak {I}s| \leqslant ( \mathcal {D}\mathcal {T})^{\varepsilon }\), again with a negligible error, so that
A priori, the right hand side is restricted to even u, but by positivity we can extend it to all u. Next we apply the Cauchy-Schwarz inequality. In the second factor we artificially insert \(1/L(\text {sym}^2{ u}, 1)\) at the cost of a factor of \(\mathcal {T}^{\varepsilon }\) (by (4.13)) to convert the Hecke eigenvalues into Fourier coefficients and apply the spectral large sieve inequality [21, Theorem 2]. This leaves us with bounding
for \(N \leqslant ( \mathcal {D}\mathcal {T})^{1+\varepsilon }\).
For a given odd, squarefree \(d \in \mathbb {N}\), the \(n_1\)-sum equals
For odd squarefree \(n \not = 1\), the map \(\Delta ' \mapsto (\frac{\Delta '}{n})\) is a primitive quadratic character of conductor n, so that by Heath-Brown’s large sieve for quadratic characters (Lemma 19a) this expression is bounded by
Putting everything together, we complete the proof of Proposition 4(a).
The proof of part (b) is almost identical except that the spectral large sieve is replaced with the standard bound [44, Theorem 9.1] for Dirichlet polynomials. Here we use the approximate functional equation
where \(|\epsilon (t)| = 1\) and
We included the polynomial in order to counteract the pole at \(s=1/2 - it\) of \(L(\chi _{\Delta }, s + 1/2 + it)^2\) for \(\Delta = 1\), so that no residual term arises in the approximate functional equation. The function \(\tilde{G}\) has similar analytic properties as G above, and the divisor function \(\tau \) satisfies the same Hecke relations as \(\lambda _u\). The proof is now almost literally the same, except that the factor \((\mathcal {T}^2 + \mathcal {N})/\mathcal {N}\) in (8.2) is \((\mathcal {T} + \mathcal {N})/\mathcal {N}\). Thus the proof of Proposition 4 is concluded.
9 Interlude: special functions and oscillatory integrals
In this rather technical section we compile various sums and integrals over Bessel functions and other oscillatory integrals that we need as a preparation for the proof of Theorem 2. To start with, the following lemma is a half-integral weight version of [42, Lemma 5.8], but with a somewhat different proof.
Lemma 20
Let \(x > 0\) \( A \geqslant 0\), \(K > 1\). Let w be a smooth function with support in [1, 2] satisfying \(w^{(j)}(x) \ll _{\varepsilon , j} K^{j\varepsilon }\) for \(j \in \mathbb {N}_0\). Then there exist smooth functions \(w_0, w_+, w_-\) such that for every \(j \in \mathbb {N}_0\) we have
and
The implied constants in (9.1) depend on the B-th Sobolev norm of w for a suitable \(B= B(A, j)\). The functions \(w_{\pm }\) are explicitly given in (9.3) and (9.5).
Proof
We denote the left hand side of (9.2) by W(x). For \(k \in \mathbb {N}\), by [33, 8.411.13] we have
We put
Applying Poisson summation modulo 4, we have
The \(h = 0\) term vanishes (as do all even h), and by partial integration the other terms are bounded by \(O_n( K |h|^{-n}( K^{-(1-\varepsilon )n} + \theta ^n) e^{-K\theta })\) for any \(n \in \mathbb {N}\), so that
Again by Poisson summation we have
Since \(-1/4 < \theta \leqslant 3/4\), we see by partial integration in the y-integral that the contribution of \(h \not = 0\) is \(O_A(K^{-A})\). Let v be a smooth function with compact support in \([-2, 2]\), identically equal to 1 on \([-1, 1]\). Then for \(h = 0\) we can smoothly truncate the \(\theta \)-integral by inserting the function \(v(\theta K^{9/10})\), the error being again \(O_A(K^{-A})\) by partial integration. We obtain \(W_1^+(x) = W_2^+(x) + W_2(x)\), where \(W_2(x) \ll _A K^{-A}\) and after changing variables
Then for \(j \in \mathbb {N}_0\) we have
with \(\phi (\theta ; x, y) = -3\pi \theta /K+ 2\pi \theta y + x ( \cos (2\pi \theta /K)-1)\) satisfying
In the following we frequently use the Taylor expansions \(\sin (t) = t + O(t^3)\) and \(\cos (t) = 1 - t^2/2 + O(t^4)\).
We first extract smoothly the range \(|\theta | \leqslant \frac{1}{100} K^2/x\). Here we observe that the derivative \(\frac{d}{d\theta } \phi (\theta ; x, y)\) cannot be too small (it is important that w is supported on [1, 2], not on [0, 1]), and we apply [11, Lemma 8.1] with
In this way we obtain a contribution of
to (9.4) for every \(n \geqslant 0\). This is easily seen to be
for every \(A > 0\). For the portion \(|\theta | \gg K^2/x\) we integrate by parts in the y integral and apply trivial estimates to obtain a bound
which is easily seen to be
The same analysis works for \(W_1^-(x) = W_2^-(x) + \tilde{W}_2(x)\) where
We put \(w_{\pm } = W_2^{\pm }\) and \(w_0 = W_1 + W_2 + \tilde{W}_2\), and the lemma follows on noting that \(\frac{1}{2}(i^k + (-i)^k) = i^k \delta _{2\mid k}\). \(\square \)
Remarks: (1) It is clear from the proof that if w depends on other parameters in a real- or complex-analytic way with control on derivatives, then \(w_{\pm } = W_2^{\pm }\), defined in (9.3), depends on these parameters in the same way. We will use this observation in Sects. 10.2 and 14.
(2) The bound (9.1) remains true for \(A \geqslant -1/2\). In the case, the claim follows for \(x \geqslant K^2\) from the asymptotic formula [33, 8.451.1 & 7 & 8]. We state this for completeness, but we do not need it here.
We need a similar formula for the transforms occurring in (3.6) and (7.1).
Lemma 21
Let \(A, T \geqslant 2\) and let \(\mathtt{h}\) be a smooth function with support in [T, 2T] satisfying \(\mathtt{h}^{(j)}(t) \ll T^{-j}\) for \(j \in \mathbb {N}_0\).
(a) We have
where \(K^{\pm }_A(x) \ll _A (T+x)^{-A}\) and \(x^j \partial _x^j H^{\pm }_A(x) \ll _{A, j} 1\). An analogous asymptotic formula holds for \(\mathtt{h}^{\dagger }(x)\).
(b) We have
where \(\tilde{K}_A(x) \ll _A (T+ x)^{-A}\) and \(x^j \partial _x^j H_A(x) \ll _{A, j} 1\).
Proof
(a) For the first part we recall the uniform asymptotic formula [28, 7.13.2(17)].
where \(\omega (x, t) = |t| \cdot \text {arcsinh} (|t|/x) - \sqrt{t^2 + x^2} + x\) and
for every \(M \geqslant 0\). The error term in [28] is \(O(x^{M })\), but for small x the error term \(O(|t|^{-M} )\) follows from the power series expansion [33, 8.440]. Partial integration in the form of [11, Lemma 8.1] with \(U = T\), \(Y = Q = T+x\), \(R= \text {arcsinh}(T/x) \gg T/x\) shows that
and the claim follows.
(b) For the proof of the second part we distinguish 3 ranges. For \(x > 10 T\) the claim follows easily from the rapid decay of the Bessel K-function and its derivatives. For \(x < T/10\) we use the uniform asymptotic expansion [28, 7.13.2(19)] (along with the power series expansion [33, 8.485, 8.445] for very small x)
where \(\tilde{f}^{\pm }_M\) satisfies the analogous bounds in (9.6). Again integration by parts ( [11, Lemma 8.1] with \(U = Y = Q = T\), \(R= \text {arccosh}(T/x) \geqslant 1\)) confirms the claim in this range. Finally, for \(x \asymp T\) we use the integral representation [33, 8.432.4]
This integral is not absolutely convergent, but partial integration shows that the tail is very small, and we can in fact truncate the integral at \(|u| \leqslant \varepsilon \log T\) at the cost of an admissible error \(O(T^{-A})\). Thus we are left with bounding
if B is chosen sufficiently large with respect to j. \(\square \)
For large arguments, the Bessel function \(J_{ir}(y)\) behaves like an exponential. More precisely, by [33, 8.451.1 & 7 & 8] we have an asymptotic expansion which we will need later:
for \(r \in \mathbb {R}\), \(x \geqslant 1\) and fixed \(n \in \mathbb {N}\). This is useful as soon as \(x \geqslant r^2\).
The following lemma is essentially an application of Stirling’s formula.
Lemma 22
Let \(k \geqslant 1\), \(s = \sigma + it \in \mathbb {C}\) with \(k + \sigma \geqslant 1/2\), \(M \in \mathbb {N}\). Then
where
for \(i, j \in \mathbb {N}_0\). Moreover,
Proof
This is a standard application of Stirling’s formula. First of all, since
with
it suffices for both statements to treat the two cases \(s= \sigma \in \mathbb {R}\) fixed and \(s = it \in i\mathbb {R}\). The first case is very simple, so we display the details for the second case. We have
where \(\tilde{G}_{M }\) satisfies
and
It is not hard to see that
for some absolute constant c (in fact, \(c = (\pi - \log 4)/4 = 0.438\ldots \) is the optimal constant). In particular, \(\Gamma (k+it)/\Gamma (k)\) is exponentially decreasing as soon as \(|t| \geqslant k^{1/2}\). Moreover, by a Taylor argument we have
This proves (9.9). To prove (9.8), we need to bound the derivatives of \(\alpha \) and \(\beta \) which is most quickly done by using Cauchy’s integral formula. Note that both \(\alpha \) and \(\beta \) have a branch cut at the two rays \(\pm t/k \in [ i, i \infty )\). We assume that k is sufficiently large (otherwise there is noting to prove) and we choose a circle \(C_1\) about k of radius k/100 and a circle \(C_2\) about t of radius \(\sqrt{k}/10\). Then w/z is away from the branch cuts for \(z \in C_1\), \(w \in C_2\), and we have
for \(z \in C_1\), \(w\in C_2\). From Cauchy’s integral formula we conclude
for \(i, j \in \mathbb {N}_0\). Combining (9.10), (9.11), (9.12) completes the proof of (9.8). \(\square \)
We apply this to the function \(\mathcal {G}(k, t_\mathtt{u}, s)\) defined in (6.5).
Corollary 23
Let \(A \geqslant 0\), \( \sigma \geqslant -1/4\) and let \(t \in \mathbb {R}\), \(t_\mathtt{u} \in \mathbb {R} \cup [-i/2, i/2]\), \(k \in 2\mathbb {N}\). Then
Moreover, for \(v \in \mathbb {C}\) we have
with
for \(\mathbf{j } \in \mathbb {N}_0^3\). Finally, for \(t, \tau \ll k^{2/3}\) we have
Recalling the definition (6.7) of \(V_t(x; k, t_\mathtt{u})\) we conclude from (9.13) and appropriate contour shifts the uniform bounds
for \(A > 0\), \(\mathbf{j } \in \mathbb {N}_0^4\).
In a similar, but simpler fashion we also apply this to the weight function \(W_t\) defined in (4.15) and state the bound
for \(A \geqslant 0\), \(j_1, j_2 \in \mathbb {N}_0\).
10 A weak version of Theorem 2
In this section we present a relatively soft argument that provides the upper bound \(\mathcal {N}_{\text {av}}(K) \ll K^{\varepsilon }\). This will useful later in order to estimate certain error terms. By (1.3) and (6.8) we have
By (9.16) we have \(t, t_\mathtt{u} \ll K^{1/2 + \varepsilon }\) (up to a negligible error). We insert a smooth partition of unity into the \(t_u\)-integral and attach a factor \(w(|t_\mathtt{u}|/\mathcal {T}_{\text {spec}})\) where w has support in [1, 2] unless \(\mathcal {T}_{\text {spec}} = 1\), in which case w has support in [0, 2]. Let
for \(\nu = 0, 1, 2, \ldots \) and \( \mathcal {T}_{\text {spec}} = 2^{\nu }\ll K^{1/2 + \varepsilon }\). This section is then devoted to the proof of the bound
for \(\varepsilon >0\). The constant function will need special treatment, and to this end we prove the following simple lemma.
Lemma 24
For \(x \geqslant 1\) we have
Proof
Let h(D) denote the usual class number. Since
we have
Since \(h(\Delta n^2) = h(\Delta ) n \prod _{p \mid n} (1 - \chi _{\Delta }(p)/p) \leqslant h(\delta ) n\tau (n)\) for a fundamental discriminant \(\Delta \), we get from bounds for moments of the ordinary class number [3] that
\(\square \)
Having this out of the way, we proceed to estimate \(\mathcal {N}_{\text {av}}(K; \mathcal {T}_{\text {spec}})\). We first sum over h using Lemma 6. This yields a diagonal term and an off-diagonal term that we treat separately in the following two subsections. Throughout, the letter D, with or without subscripts, shall always denote a negative discriminant unless stated otherwise. The letter A shall denote an arbitrarily large fixed constant, not necessarily the same on every occurrence.
10.1 The diagonal term
The contribution to \(\mathcal {N}_{\text {av}}(K; \mathcal {T}_{\text {spec}})\) of the diagonal term from Lemma 6 is bounded by
This can be bounded without too much effort by Corollary 5 for the cuspidal part, Proposition 4b) for the Eisenstein part and Lemma 24 for the constant function.
Indeed, for the constant function \(\mathtt{u} = \sqrt{3/\pi }\) we have \(P(D; \sqrt{3/\pi }) \ll H(D)\). Recalling (9.16) and Lemma 24, this gives a total contribution of
if \(\mathcal {T}_{\text {spec}} = 1\) and otherwise the contribution vanishes.
Similarly, by (4.12), (3.3), (4.7) and (4.13), the Eisenstein spectrum contributes
We recall our convention that \(\Delta \) denotes a negative fundamental discriminant, \(D = \Delta f^2\) an arbitrary negative discriminant (where f here has nothing to do with \(f_1, f_2\) above). In the above bound we have already executed the sum over f. By Proposition 4(b) and a standard bound for the fourth moment of the Riemann zeta-function this is \(O(K^{-1/2 + \varepsilon })\).
By (4.10), (4.5), (4.6) and (4.13), the contribution of the cuspidal spectrum is at most
by Corollary 5. All of these bounds are consistent with (10.1).
10.2 The off-diagonal term: generalities
We now consider the off-diagonal term in Lemma 6 from the sum over h. Here we need to bound
By (9.16) and the rapid decay of the Bessel function near 0 for large k, up to a negligible error all sums are effectively bounded by a power of K. We first sum over k using Lemma 20. Up to a negligible error, we obtain
where
for any \(A \geqslant 0\), \(\mathbf{j } \in \mathbb {N}_0^4\), cf. (9.16) and the remark after Lemma 20. In order to apply Voronoi summation, we open the Kloosterman sum and are left with bounding
where
Integration by parts shows that
for any \(A \geqslant 0\), \(\mathbf{j } \in \mathbb {N}_0^4\). The first and third factor on the right hand side of (10.5) imply \(|D_1 D_2| = K^{4+o(1)}\) and \(c, f_1, f_2 = K^{o(1)}\) up to a negligible error. On the other hand, the last factor implies \(D_1f_1^2 =D_2f_2^2(1 + O(1/K^{1/2}))\), so that in effect \(D_1, D_2 = K^{2+o(1)}\).
In the following we treat the cuspidal part, the Eisenstein part and the constant function separately. In principle we could treat them on equal footing and we will do this in Sect. 14, but for now we keep the prerequisites as simple as possible.
10.3 The Eisenstein contribution
We start with the contribution of the Eisenstein spectrum to (10.3). As this is much smaller in size than the cuspidal spectrum, very simple bounds suffice. By (4.11), (4.13) and (10.5) we obtain a contribution of
We use the basic inequality \(|L(D_1, 1/2 + it)L(D_2, 1/2 + it)| \leqslant \frac{1}{2}( |L(D_1, 1/2 + it)|^2 + |L(D_2, 1/2 + it)|^2) \). For fixed \(f_1, f_2, D_1\) there are \(O( K^{3/2+\varepsilon } )\) values of \(D_2\) satisfying the summation condition. Thus we obtain the bound
By (4.1) and Proposition 4(b) along with a bound for the fourth moment of the Riemann zeta function we obtain the desired bound
10.4 The cuspidal contribution
Next we consider the cuspidal contribution to (10.3). The key step is an application of the metaplectic Voronoi formula (Lemma 10). With this in mind, we consider the following portion
of (10.3), i.e. we freeze \(c, f_1, f_2\) and u for the moment. For notational simplicity we consider only the plus case, the minus case may be treated similarly. We insert (4.8) with \(t = t_u/2\) and re-write (10.6) as
To the \(D_1\)-sum we apply the Voronoi formula (Lemma 10) with weight function
for \(x < 0\) and \(\phi (x) = 0\) for \(x > 0\). We recall that \(c, f_1, f_2 \ll K^{\varepsilon }\) are essentially fixed from the decay conditions of \(V^{*}\), but we need to be uniform in \(|D_2| = K^{2+o(1)}\). We define
as in (5.2) with \(r = t_u/2\) and obtain that (10.6) is equal to
where only in the above sum do we allow D to be either positive or negative. If \(D > 0\), then the integral transform \(\Phi ((2\pi )^2 D/c^2 )\) contains a factor
and we recall that \(|x| = K^{2+o(1)}\), \(c \ll K^{\varepsilon }\) up to a negligible error. Thus the argument of the Bessel function is \(\gg K^{2-\varepsilon }\), while the index is \(\ll K^{1/2+\varepsilon }\). By the rapid decay of the Bessel K-function this contribution is easily seen to be negligible (we use (4.9) and bound b(D) trivially), and we may restrict from now on to \(D< 0\). In this case
Using (9.7), up to a negligible error we can write
with
for any \( j \in \mathbb {N}_0\). We substitute this back into (10.8) which equals (10.6). We substitute this back into (10.3). In this way we see that the cuspidal contribution to (10.3) is at most
with
By (10.5) and (10.10), each integration by parts with respect to x introduces an additional factor
and we conclude that
for every \(A \geqslant 0\). Here only the negative part in the ± sign is relevant, since otherwise the expression is trivially negligible. The limiting factor for the size of the x-integral is the first factor in the second line of (10.13), so that we obtain
for every \(A \geqslant 0\). It is not hard to see that this can be simplified as
With this bound we return to (10.11), apply the simple bound \(|b(D)b(D_2)| \leqslant |b(D)|^2 + |b(D_2)|^2\) together with (4.9), getting an upper bound of the shape
plus a similar expression that with L(u, D, 1/2)/|D| in place of \(L(u, D_2, 1/2)/|D_2|\) which can be treated in the same way. We sum over \(D, f_1, f_2, c\) and end up with (after changing the value of A)
We write \(D_2 = \Delta f^2\) with a fundamental discriminant \(\Delta \) and use (4.5), (4.6). Summing over f, we obtain
We estimate the denominator \(L(\text {sym}^2 u, 1)\) by (4.13) and apply Corollary 5 to finally obtain the upper bound \(\mathcal {T}_{\text {spec}}^2 K^{\varepsilon -1}\) in agreement with (10.1).
10.5 The constant function
This is very similar to the preceding subsection, so we can be brief. In short, we win a factor \(\mathcal {T}_{\text {spec}} \ll K^{1+\varepsilon }\) from the fact that the spectrum is reduced to one element, and we lose a factor \(|D|^{1/2} \ll K^{1+\varepsilon }\) since each class number is a factor \(|D|^{1/4}\) bigger than the generic period P(D; u). The key point is that we end up with a pure bound without a \(K^{\varepsilon }\)-power. This \(K^{\varepsilon }\)-power is unavoidable when we apply Corollary 5, but for a sum over class numbers H(D) alone we can apply Lemma 24 below which avoids a \(K^{\varepsilon }\)-power. The analogue of (10.6) is
We apply the Voronoi formula (Lemma 12) to the \(D_1\)-sum as before. Due to the oscillatory behaviour of \(\phi \) in (10.7) the main terms are easily seen to be negligible, and as in the previous argument also one of the osciallatory terms is negligible due to the exponential decay of \(\mathcal {J}^-\) in (5.5). The behaviour of \(\mathcal {J}^+\) similar, but much simpler, as no asymptotic formula of a Bessel function is necessary. The analogue of (10.11) then becomes
In the same way as above this leads to
The last step is justified by Lemma 24. The bound (10.14) is in agreement with, and completes the proof of, (10.1).
We conclude this section with a brief discussion. The bound (10.1) along with \(\mathcal {T}_{\text {spec}} \ll K^{1/2 + \varepsilon }\) (from the decay of \(V_t\)) implies immediately the upper bound \(\mathcal {N}_{\text {av}}(K) \ll K^{\varepsilon }\). The \(\varepsilon \)-power is unavoidable at this point because of the use of Heath-Brown’s large sieve in the proof of Proposition 4. Except for the spectral large sieve implicit in proof of Proposition 4, we have not touched the spectral \(\mathtt{u}\)-sum, so any further improvement must involve a treatment of this sum. This is precisely the purpose of the relative trace formula for Heegner periods given in Theorem 3 resp. Theorem 18. The weaker result (10.1) is nevertheless useful: it allows us to discard small eigenvalues \(t_\mathtt{u} \ll K^{1/2 - \varepsilon }\), most importantly the constant function, and it allows us to estimate efficiently some error terms later. The following sections are devoted to the proof of Theorem 2.
11 Proof of Theorem 2: the preliminary argument
Throughout we agree on the convention that D (with or without indices) denotes a negative discriminant and \(\Delta \) denotes a negative fundamental discriminant. We make two immediate manipulations. Fix some \(0< \eta < 1/100\) such that \(10^6/\eta \in 2\mathbb {Z}\). By the concluding remark of the preceding section we can insert the function
(not to be confused with the constant \(\omega \) in the previous display) into the \(\mathtt{u}\)-integral, and we can truncate the n, m-sum at
Both transformations induce an admissible error, the former due to that \(\omega (t_\mathtt{u}) - 1 \ll _A K^{-A}\) for every \(A > 0\) and \(t_\mathtt{u} \gg K^{\frac{1}{2} - \frac{\eta }{2}}\). Note that \(\omega \) is even, holomorphic, within [0, 1] for \(t_\mathtt{u} \in \mathbb {R} \cup \{i/2, -i/2\}\) and satisfies
for \(j \in \mathbb {N}_0\). In particular, up to a negligible error we may ignore the constant function \(\mathtt{u} = \sqrt{3/\pi }\) with \(t_\mathtt{u} = i/2\). As before we denote by \(\int _{\Lambda _{\text {ev}}}^{*}\) a spectral sum/integral over the non-residual spectrum, i.e. everything except the constant function.
Note that \(\lambda (n)\) depends on h, as it is a Hecke eigenvalue of \(f_h\). Before we can sum over h using Lemma 6, we must first combine \(\lambda (n)\) with \(c(|D_2|)\). To this end we recall (2.4) and recast \(\mathcal {N}_{\text {av}}(K)\), up to a small error coming from the truncation of the n, m-sum and the \(\mathtt{u}\)-integral, as
We can now sum over h using Lemma 6, and we start with an analysis of the diagonal term which is given by
While the upper bound \(K^{\varepsilon }\) for the diagonal term is fairly easy to obtain, cf. Sect. 10.1, the asymptotic analysis of \( \mathcal {N}^{\text {diag}}(K)\) is extremely involved and occupies this and the following two sections. We will eventually show that \( \mathcal {N}^{\text {diag}}(K) = 4\log K + O(1)\). The discussion of the off-diagonal term is postponed to Sect. 14.
We write \(d_2 = d_1\delta \), \(n = d_1\delta \nu \), so that \(d_1^2 \mid \nu ^2 D\). Since n is squarefree, this implies \(d_1^2 \mid D\). We write \(d_1 = d\) and \(D d^2\) in place of D. With this notation we recast \( \mathcal {N}^{\text {diag}}(K)\) as
From the Katok-Sarnak formula in combination with (3.10) in the cuspidal case and from (4.11) in combination with (4.2) in the Eisenstein case we concludeFootnote 11
for a fundamental discriminant \(\Delta \) where \(\alpha _\mathtt{u}(f)\) depends also on \(\Delta \), which we suppress from the notation. Using this notation along with (4.10) in the cuspidal case and (4.12) in the Eisenstein case we obtain
where \(\mathcal {L}(\mathtt{u}) = L(\text {sym}^2 \mathtt{u}, 1)\) if \(\mathtt{u}\) is cuspidal and \(\mathcal {L}(\mathtt{u}) = \frac{1}{2}|\zeta (1 + 2 i t)|^2\) if \(\mathtt{u} = E(., 1/2 + it)\) is Eisenstein (with the obvious interpretation in the case \(t = 0\)). With later transformations in mind, we also restrict the f-sum to \(f \leqslant K^{\eta }\). By trivial estimates along with Corollary 5 and (9.16), this induces an error of \(O(K^{\varepsilon - \eta })\). By the usual Hecke relations we have
We summarize the previous discussion as
where
with
The expression \(\mathcal {I}\) depends also on \(f^2 \nu d f_1f_2\), but we suppress this from the notation. We insert the approximate functional equations (4.14) and (4.16) getting
Since the \(\tau \)-integral is rapidly converging, it is easy to see that the polar term contributes at most \(O(K^{\varepsilon - 1})\) to (11.4), so from now on we focus on the first term of the preceding display. By the Hecke relations we can recast it as
This is now in shape to apply the Kuznetsov formula for the even spectrum, Lemma 7. We treat the three terms on the right hand side of (3.5) separately and start with the diagonal term to which the next section is devoted. The two off-diagonal terms are treated in Sect. 13. (In the simpler analysis of Sect. 10.1 the Kuznetsov formula is hidden in the spectral large sieve in the proof of Proposition 4. The precise asymptotic analysis requires us to deal with the two off-diagonal terms in Lemma 7 explicitly.)
12 The diagonal diagonal term
The diagonal contribution to (11.6) equals
Opening up the Mellin transform in the definition (4.15), this equals
We shift the \(s_1, s_2\)-contours to \(\mathfrak {R}s_1 = \mathfrak {R}s_2 = -1/4\) getting
We first deal with the error term and substitute it back into (11.4). Roughly speaking the factor \(|\tau |^{3/4}\) saves a factor \(K^{1/8}\) from the trivial bound, while on average over \(\Delta \) the L-values on the lines 1/2 and 3/4 are still bounded. More precisely, recalling (9.16) the error term gives a total contribution of at most
By a standard mean value bound for the \(\Delta \)-sum (e.g. [34, Theorem 2]), the previous display can be bounded by \(\ll K^{-1/8 + \varepsilon }\). We substitute the main term in (12.1) into (11.4) getting
where we recall that \(h(\tau )\) is given by (11.5) and depends in particular on t, k and \(\Delta \) (as well as on \(f, f_1, f_2, d, \nu \)). A trivial estimate at this point using (9.16) shows
(by standard mean value results for \(L(\chi _{\Delta }, 1)\)), but eventually we want an asymptotic formula, not an upper bound.
Our next goal is to show that the t-integral forces \(f_1\nu = f_2d\), up to a negligible error. To make this precise, we first observe that by the same computation as in (12.2) the portion \(|t| \leqslant K^{2/5}\) contributes at most \(O(K^{-1/10 + \varepsilon })\) to \( \mathcal {N}^{\text {diag}, \text {diag}}(K) \). We can therefore insert a smooth weight function that vanishes on \(|t| \leqslant \frac{1}{2} K^{2/5}\) and is one on \(|t| \geqslant K^{2/5}\). Integrating by parts sufficiently often using (9.16), we can then restrict to
up to a negligible error. It is then easy to see that the terms \(f_1 d \not = f_2 \nu \) contribute \(O(K^{\varepsilon - 2/5})\) to \( \mathcal {N}^{\text {diag}, \text {diag}}(K) \). Having excluded these, we re-insert the portion \(|t| \leqslant K^{2/5}\) to the t-integral, again at the cost of an error \(O(K^{\varepsilon -1/10})\). Finally we complete the \(d, \delta , \nu , m, f\)-sum at the cost of an error \(O(K^{\varepsilon -\eta })\). Since \((\nu , d) = 1\), the equation \(f_1d = f_2 \nu \) implies \(f_2 = dg\), \(f_1 = \nu g\) for some \(g\in \mathbb {N}\). Substituting all this, we recast \(\mathcal {N}^{\text {diag}, \text {diag}}(K) \) as
up to an error of \(O(K^{\varepsilon -\eta })\), where in the definition (11.5) of h we replace \(f_2 = dg\), \(f_1 = \nu g\). We substitute this modified version of (11.5) and open the Mellin transform (6.7), so that the previous display becomes
where
Recall our general assumption that \(\Delta \) runs over negative fundamental discriminants. The main term will come from the residue at \(v = 0\) and we need to analytically continue \({\mathscr {L}}\) to \(\mathfrak {R}v < 0\) and compute the residue at \(v = 0\). The critical portion is the \(\Delta \)-sum which we analyze in the following lemma.
Lemma 25
Let \(\alpha \in \mathbb {N}\) and uniquely write \(\alpha = \alpha _1^2 \alpha _2\) with \(\mu ^2(\alpha _2) = 1\). Then
In particular, the Dirichlet series
has analytic continuation to \(\mathfrak {R}v > -5/18\) except for a simple pole at \(v=0\) with residue
and is bounded by \(O_{\varepsilon }(|v| \alpha ^{1/4+\varepsilon })\) in the region \(\mathfrak {R}v \geqslant -5/18 + \varepsilon \), \(|v| \geqslant \varepsilon \).
Remark: The computation of the leading constant in Lemma 25 seems to be a new result even in the case \(\alpha = 1\) and features an interesting Euler product. See [46] for similar Euler products for averages at the point 1/2. The case \(\alpha = 1\) can be read off from [31].
Proof
Let w be a fixed smooth function that is equal to 1 on [0, 1] and vanishes on \([2, \infty )\). Let \(Y \geqslant 1\). We have
where \(\widehat{w}\) in the present case denotes the Mellin transform and the left hand side comes from a contour shift to \(\mathfrak {R}s = -1/2\) and the Conrey-Iwaniec [20] subconvexity bound for real characters. We obtain
We decompose the main term as \(S_{\square } + S_{\not = \square }\) depending on whether \(\alpha n\) is a square or not. We first consider the portion \(S_{\square }\) where n is restricted to \(n = \alpha _2k^2\) with \(k \in \mathbb {N}\). This gives
We decompose the main term as \( S_{\square }^{\text {odd}} + S_{\square }^{\text {even, 4}} + S_{\square }^{\text {even, 8}}\) depending on whether \(\Delta \) is odd, exactly divisible by 4 or exactly divisible by 8. We have
where \(\chi _0\) is the trivial character modulo 4 and \(\chi _{-4}\) the non-trivial character modulo 4. For \(\chi \in \{\chi _0, \chi _{-4}\}\) we consider the Dirichlet series
A standard application of Perron’s formula (e.g. [71, Corollary II.2.4]) shows now that
We have \(S_{\square }^{\text {even, 4}} \not = 0\) and \(S_{\square }^{\text {even, 8}} \not = 0\) only if \(\alpha \) is odd, which we assume from now on. Then
and by the same computation we obtain
Finally,
where now \(\chi _0\) is the unique character modulo 2, and we get the same main term for \(S_{\square }^{\text {even, 8}} \). Putting everything together, we obtain for \(\alpha _2\) odd that
and for \(\alpha _2\) even that
Note how beautifully the constants fit together.
We return to (12.4) and study \(S_{\not =\square }\) where the n-sum is restricted to \(\alpha n \not = \square \). The function \(D \mapsto \chi _D(\alpha n) = (D/\alpha n)\) is a 4n-periodic, non-trivial, completely multiplicative function, defined on non-zero integers \(D \equiv 0, 1\) (mod 4). If n is odd, this function is naturally defined on all D and in fact n-periodic. If n is even, it can be extended (using the supplementary laws of quadratic reciprocity) to a not necessarily primitive, but non-trivial Dirichlet character modulo 4n. Thus it suffices to bound
for a character \(\psi \) of conductor \(\ll \alpha Y\) and \(X' \leqslant X\). Writing \(\mu ^2 (m) = \sum _{d^2 \mid m} \mu (d)\) and using the Pólya-Vinogradov inequality, we bound the previous display by
Collecting error terms in (12.4), (12.5), (12.6), (12.7), the total error becomes
We choose \(Y = X^{8/9}\alpha ^{-1/3} + 1\) to recover a total error of \((X^{13/18} \alpha ^{1/6} + X^{1/9} \alpha ^{1/3} + X^{1/2} \alpha ^{1/4})(X\alpha )^{1+\varepsilon }\). This completes the proof of the asymptotic formula. The claim on the Dirichlet series \({\mathscr {K}}(v; \alpha )\) follows by partial summation. \(\square \)
With the notation of the previous lemma we can write
and we conclude that \({\mathscr {L}}\) has analytic continuation to \(\mathfrak {R}v > -13/18\) except for a simple pole at \(v = 0\) and polynomial (in fact linear) bounds on vertical lines. If \(\text {rad}(n)\) denotes the squarefree kernel of n, then
where we have implicitly computed the g-sum in the definition of \({\mathscr {L}}(v)\) as \(\zeta (2)\). Now a massive computation with Euler products, best performed with a computer algebra system, shows gigantic cancellation, and we obtain the beautiful formula
With this we return to (12.3) and shift the v-contour to \(\mathfrak {R}v = -1/10\). By (9.13) and trivial bounds the remaining integral expression is bounded by
It remains to deal with the double pole at \(v = 0\) whose residue is given by
We can remove the factor \(\omega (\tau )\tanh (\pi \tau )\) at the cost of an error \(O(K^{-\eta /2})\), cf. the definition (11.1). Using the definition (6.5), at \(v=0\) we have the following Taylor expansion
We have \(\frac{\Gamma '}{\Gamma }(z) = \log z +O(|z|^{-1})\) (for \(\mathfrak {R}z \geqslant 1\), say) and so
for \(t, \tau \ll k^{2/3}\), say (the t and \(\tau \) integrals are negligible outside this region). In this range we can insert (9.15) to conclude that
We evaluate the two integrals, sum over k (by Poisson, for instance) and recall the definition of \(\omega \) just before (1.3) getting
We have now detected the main term and we conclude this section by stating that
13 The diagonal off-diagonal term
13.1 Preparing the stage
We return to (11.6) and consider the off-diagonal terms on the right hand side of (3.5). We treat the first off-diagonal term in detail in the following three subsections. In Sect. 13.4 we show the minor modifications to treat the second off-diagonal term. The first off-diagonal term is given by
where h was defined in (11.5) and depends in particular on t, k and \(\Delta \). This needs to be inserted into (11.4) with \(r = f^2d\nu /(d_1 d_2 d_3^2) \leqslant K^{4\eta }\). In (11.4) we apply a smooth partition of unity to the \(\Delta \)-sum and consider a typical portion
for a smooth function w with compact support in [1, 2]. Our aim in this section is to prove the bound
uniformly in \(k \asymp K\), \(t \leqslant K^{1/2 + \varepsilon }\),
and \(f, f_1, f_2, d, \nu \in \mathbb {N}\) which are implicit in the definition (11.5). Taking (13.1) for granted, we estimate (11.4) trivially to obtain a contribution of \(O(K^{\varepsilon - \eta })\), which is admissible. So it remains to show (13.1), and to this end we start with some initial discussion.
The c-sum in \(\mathcal {I}^{\text {off-diag,1}}(\Delta , t,k, r)\) is absolutely convergent, as can be seen by using the Weil bound for the Kloosterman sum and shifting the t-contour to \(\mathfrak {R}i\tau = 1/3\), say, without crossing any poles. By the power series expansion for the Bessel function [33, 8.440] we have
and we can therefore truncate the c-sum at \(c \leqslant K^{10^6}\), say, at the cost of a very small error. Having done this, we can sacrifice holomorphicity of the integrand in the \(\tau \)-integral, and we insert a smooth partition of unity into the \(\tau \)-integral restricting to \(\tau \asymp T\), say, with
otherwise \(h(\tau )\) is negligible by (11.5), (9.16) and (11.2). We insert smooth partitions of unity into the \(\mathtt{n}, \mathtt{m}, c\)-sums and thereby restrict to \(\mathtt{n} \asymp N\), \(\mathtt{m} \asymp M\), \(c \asymp C\), say, where
by (9.17) and initially \(C \leqslant K^{10^6}\). Next we want to evaluate asymptotically the \(\tau \)-integral. To this end we use Lemma 21 with the weight function \(\mathtt{h} = \mathtt{h}_{\mathtt{n}, \mathtt{m}, \Delta }\) given by
where w is the weight function occurring in the smooth partition of unity of the \(\tau \)-integral. By (9.17), (11.2) and (9.16), the function \(\mathtt{h}\) is “flat” in all variables, i.e.
for \(k \asymp K\), \(|\Delta | \asymp X\), \(\mathtt{n} \asymp N\), \(\mathtt{m} \asymp M\) and \(\mathbf{j } \in \mathbb {N}_0^4\), uniformly in all other variables. Lemma 21a implies that we can restrict, up to a negligible error, to
where \(r_2 = r/d\), and up to a negligible error we are left with bounding
for a smooth function W with compact support in \([1, 2]^4\) and bounded Sobolev norms with variables X, N, M, C, T satisfying (13.2), (13.4), (13.5), (13.3), respectively.
The basic strategy is now to apply Poisson summation first in the \(\mathtt{n}\)-sum and then in the \(\mathtt{m}\)-sum in Sect. 13.2. This shortens the variables in generic ranges, so that a trivial bound turns out to be of size \(XK^{1/2 + 2\eta +\varepsilon }\). This is very close to our target (13.1). In Sect. 13.3 we will extract a character sum from this expression where the Pólya-Vinogradov inequality produces the final saving, at least in generic ranges of the variables. In order to also treat non-generic ranges where some of the variables are relatively short (and hence Poisson summation is less effective), at each step we also apply trivial bounds along with Heath-Brown’s large sieve. In particular, by Lemma 19b) we can bound (13.6) by
13.2 Poisson summation
Now we open the Kloosterman sum in (13.6) and apply Poisson summation in \(\mathtt{n}\) in residue classes modulo c. In this way we re-write \({\mathscr {J}}^{(1)}_r(X, N, M, C, T) \) as
We consider the character sum
This is non-zero only if \((n, c) = (r_2, c)\). We write \((r_2, c) = \delta \), \(r_2 = \delta r_2'\), \(c = \delta c'\), \(n = \delta n'\) with \((n'r_2', c') = 1\) and recast (13.9) as
We decompose \(\delta = \delta _1 \delta _2\) with \(\delta _2\) maximal so that \((\delta _2, c') = 1\). Note that \(r_2' \bar{n'}\) is coprime to \(c'\), so the condition \((c, \gamma ) = 1\) is equivalent to \((\delta _2, \gamma ) = 1\). We factor \(c = c'\delta _1 \cdot \delta _2\) with \((c'\delta _1, \delta _2) = 1\) and apply the Chinese Remainder Theorem to see that the previous display vanishes unless \(\delta _1 \mid \mathtt{m}\), say \(\mathtt{m} = \delta _1\mathtt{m}'\), in which case it equals
where R denotes the Ramanujan sum.
Next we consider the x-integral in (13.8). The phase has a unique stationary point at \(x = r_2\mathtt{m}/n^2\) if \(\text {sgn}(n) = \pm \) and no stationary point if \(\text {sgn}(n) = \mp \). If \(N \leqslant r_2\mathtt{m}/n^2 \leqslant 2N\) and \(\text {sgn}(n) = \pm \), we can apply the stationary phase lemma [11, Proposition 8.2] with \(X=1\), \(V = V_1 = Q = N\), \(Y = \sqrt{NMr_2}/C \geqslant T^2K^{-\varepsilon } \geqslant K^{1/2 - \varepsilon }\) to see that the integral is given by
for a smooth function \(W_1\) with compact support in \([1, 2]^4\) and bounded Sobolev norms, up to a negligible error from truncating the series in [11, (8.9)]. Otherwise we apply integration by parts in the form of [11, Lemma 8.1] with \(X = 1\), \(U = Q = N\), \(Y = R = \sqrt{NMr_2}/C\) to conclude that the integral is negligible.
Noting that with our previous notation
for \(\text {sgn}(n) = \pm \), we can now apply the additive reciprocity formula \(e(1/ab) = e({\bar{a}}/b)e({\bar{b}}/a)\) for \((a, b) = 1\) to conclude that \({\mathscr {J}}^{(1)}_r(X, N, M, C, T) \) equals, up to a negligible error term,
Here we recall the notation conventions from Sect. 1.5 regarding expressions \(\delta \mid c^{\infty }\) etc. For easier readability we remove all the dashes at the variables, and we define
We also open the Ramanujan sum \(R_{\delta _2}(\mathtt{m}) = \sum _{\delta _3 \mid (\delta _2, \mathtt{m})} \delta _3\mu (\delta _2/\delta _3)\), write \(\delta _2 = \delta _3\delta _4\) and replace m with \(m\delta _3\). Finally we drop the ±-sign in the summation condition on n (both cases are identical). With this notation we can re-write (13.10) as
Before we continue to transform this expression, we estimate trivially with the large sieve (Lemma 19b)
Our next goal is to apply Poisson summation in \(\mathtt{m}\) restricted to residue classes \(\delta _4 n |\Delta |\). For \(m \in \mathbb {Z}\), this leads to the character sum
We decompose both n and \(\delta _4\) according to their common divisor with \(\Delta \). We write \(n = n_1 n_2\) and \(\delta _4 = \delta _5 \delta _6\) where \(n_1\delta _5 \mid \Delta ^{\infty }\), \((n_2\delta _6, \Delta ) = 1\). We obtain by the Chinese Remainder Theorem that the above character sum equals
The first sum vanishes unless \(n_2\delta _6 \mid r_2|\Delta | + m c\) in which case it equals \(n_2\delta _6\). Since \(\Delta \) is a negative fundamental discriminant, the second sum vanishes unless \(n_1\delta _5 \mid r_2|\Delta | + m c\) in which case it equals
Having this evaluation available, the Poisson summation formula transforms (13.11) into
up to a factor i. The above integral is just the Fourier transform of \(W_2\) with respect to the third variable which we write as
Defining \(W_4(x, y, z, w) = W_3(x, y, z/(xy), w)\) we recast the previous display as
The function \(W_4\) is compactly supported in the first, second and fourth variable and rapidly decaying in the third. We first estimate the \(m=0\) contribution to be
by (13.5), (13.4), (13.2), (13.3) and a divisor estimate for \(n_1n_2\). This is clearly admissible for (13.1), so from now on we assume \(m\not = 0\). By the same argument, we obtain for the \(m \not = 0\) contribution the bound
By (13.5) and (13.2), this is only a factor \(K^{3\eta + \varepsilon }\) away from our target (13.1), so a very small additional saving suffices to win. For easier readability we consider only the case \(m > 0\), the other case being entirely analogous.
13.3 The endgame
Up until now we have not touched the long \(\Delta \)-sum, which we will now use to obtain some additional saving. Before we do this, we must exclude the case that C is very small. To this end we combine (13.14) and (13.12) to obtain
Similarly we can combine (13.14) and (13.7) to obtain
using (13.5). Combining the previous two bounds we finally obtain
by (13.3) and (13.2) which meets our target (13.1) unless
which we assume from now on. We recall that c is automatically coprime to \(n_1n_2 r_2\delta _3\delta _5\delta _6\). For fixed \(k \in \mathbb {N}\) let \({\mathscr {S}}(k) = \{k_1 x^2 : k_1 \mid k, x \in \mathbb {N}\}\) be the set of square classes of all divisors of k. From (13.13) we remove all \(c\in {\mathscr {S}}(2r_1\delta _1m)\). Since C is large, the \(O(K^{\varepsilon })\) square classes are only a thin subset of all c and by the same computation as in (13.14), they contribute no more than
to\( {\mathscr {J}}^{(1)}_r(X, N, M, C, T) \) which for \(\eta < 1/40\) is admissible.
With this we return to (13.13) and explain the idea how to obtain additional savings. Ignoring (for the purpose of these heuristic remarks) the secondary variables \(\delta _1, \delta _3, \delta _5, \delta _6, r_2, n_1\), we have to sum
We \(|\Delta | + m c = n_2 s\) for \(s \in \mathbb {N}\) and rewrite the sum to be taken over \(m, s, c, n_2\) where now \(|\Delta |\) is determined by the other variables. Assuming also for simplicity that \(n_2, m\) are odd, we obtain a sum over
where \(-n_2s +mc\) is essentially restricted to squarefree numbers. By quadratic reciprocity, the first two factors are essentially constant. Since c is not a square, the map \(n_2 \mapsto (\frac{c}{n_2})\) is a non-trivial character, and since typically the length of \(n_2\) is much longer than the length of c, the \(n_2\)-sum has some saving from the Pólya-Vinogradov inequality (here we also need to deal with the squarefree condition). We now make this precise. For clarity, we repeat (13.13) with the small amendments we have made so far:
where \(W_5(x, y, z, w) = x^{-1/2} W_4(x, y, z, w)\). We define s through the equation
Note that, up to a negligible error,
by (13.5), so that by (13.3) we conclude that mc is substantially smaller than \(| r_2\Delta |\). In particular, \(n_1n_2 \delta _5\delta _6 s \asymp r_2 X\), so that
We first argue that we can truncate the \(n_1\)-sum in (13.13) at \(n_1 \leqslant K^{4\eta }\). Indeed, since \((n_1, c) = 1\), but \(n_1 \mid \Delta ^{\infty }\), the squarefree kernel \(\text {rad}(n_1)\) must divide m. Summing trivially over \(n_1, n_2, s, c, m\) in (13.13) as in (13.14), the portion \(n_1 \geqslant Y\) contributes at most
by applying Rankin’s trick and using that \(\sum _n \text {rad}(n)^{-1} n^{-\sigma }\) is absolutely convergent for \(\sigma > 0\). If \(Y \geqslant K^{4\eta }\), this is \(\ll X K^{1/2-2\eta + \varepsilon }\) by (13.5) and (13.2), hence admissible for (13.1). Having truncated the \(n_1\)-sum, we decompose \(\Delta = \Delta _1\Delta _2\) into two fundamental discriminants of suitable signs where \((\Delta _2, 2n_1\delta _5) = 1\) and \(\Delta _1 \mid 8n_1 \delta _5 \), in particular \(|\Delta _1| \leqslant 8K^{8\eta }\) and \((\Delta _1, cn_2\delta _6) = 1\). With this notation and recalling (13.17) we can write
Next we make \(n_2m\) coprime to \(2r_2\Delta _1\) by factoring \(n_2 = n_2'n_2''\) and \(m=m'm''\) with \((n_2'm', 2r_2\Delta _1) = 1\), \(n_2''m'' \mid (2r_2\Delta _1)^{\infty } \mid (2r_2n_1\delta _5)^{\infty }\) so that the previous display equals
By a computation similar to (13.20), this time using that
for \(X \geqslant 1\), \(b \in \mathbb {N}\) (which follows in the same way by Rankin’s trick), we can assume \(n_2'', m'' \leqslant K^{4\eta }\), the remaining portion to (13.13) being \(\ll XK^{1/2 - 2\eta + \varepsilon }\). We are left with short (i.e. \(\ll K^{4\eta }\)) variables
and potentially long variables
subject to \(c \not \in {\mathscr {S}}(2r_1\delta _1m'm'')\) as well as
We can eliminate \(\Delta _2\) from the last equation, so that a congruence
remains. Then the conditions \((\Delta _2, 2n_1\delta _5) = (n_2'n_2''\delta _6, \Delta _2) = 1\) are re-phrased as
which is equivalent to
The condition \(n_1\delta _5 \mid (\Delta _1\Delta _2)^{\infty }\) reads
All of these conditions on \(n_2'\) can be detected by congruences modulo “short” variables in (13.22) (and some powers of 2) as well as \((n_2', m) = 1\). Finally we need to remember that \(\Delta _1\Delta _2\) is a fundamental discriminant. To this end we split into residue classes \(\Delta _1\Delta _2 \equiv 1, 5, 8, 9, 12, 13\) (mod 16) and insert a factor \(\mu ^2((n_1n_2'n_2'' \delta _5\delta _6s - mc)/(\alpha r_2))\) with \(\alpha \in \{1, 4\}\). We use the convolution formula
and insert all of this back into (13.16). We claim that we can restrict \(y \leqslant K^{A\eta }\) for some constant A. Indeed, summing over all short variables, as well as \( c, m'\), we get a congruence for \(n_2's\) modulo \(y^2\), so that the portion \(y > Y\) contributes at most
which is acceptable if \(Y = K^{A\eta }\) for A sufficiently large. Note that (13.18) implies that \(n_1n_2'n_2'' \delta _5\delta _6s - mc \) never vanishes, so there is no “\(1+\)” in the congruence count. In addition, for fixed y we also remove all \(c \in {\mathscr {S}}(y^2)\) at the cost of an error \(XK^{1/3+ O(\eta )}\) as in (13.15).
We are finally ready to return to (13.21) and split the sum over \(n_2'\) into residue classes modulo \(\nu \) modulo \(H := 32 r_1\delta _1\delta _3\delta _5\delta _6r_2n_1n_2''m''\Delta _1 y^2 = K^{O(\eta )}\). By assumption, c is not in a square class of any divisor of H. Thus we consider
for \(\nu \) odd, \(c \ll C\) and \(R \ll \sqrt{M/N} K^{O(\eta )}\). We can detect the congruence condition by characters, none of which conspires with \(n_2' \mapsto (\frac{c}{n_2'})\) to become the trivial character. By the Pólya-Vinogradov inequality, we can bound the previous display by \(C^{1/2} K^{O(\eta )}\), and by trivial estimates over \(c, m', s\) and the present estimate for the sum over \(n_2'\) we obtain the final bound
by (13.5), (13.4), (13.2) and (13.3). For sufficiently small \(\eta \) this is in agreement with (13.1) and completes the analysis of the first off-diagonal term in (11.6).
13.4 The second off-diagonal term
The analysis of the second off-diagonal term in (3.5) is structurally very similar (although the numerology is a little different), so we can be brief. Here we need to consider
with h as in (11.5). This needs to be inserted into (11.4) with \(r = f^2d\nu /(d_1 d_2 d_3^2) \leqslant K^{4\eta }\). Under the same size restrictions (13.2) as before we want to show that
As before we first use holomorphicity to ensure absolute convergence of the c-sum and obtain a very coarse truncation. Then we apply smooth partitions of unity restricting to \(\tau \asymp T\) satisfying (13.3), \(n \asymp N\), \(m \asymp M\) satisfying (13.4) and \(c\asymp C\). This time we use Lemma 21b to conclude that
and by an analogue of (13.6) we need to bound the quantity
where W satisfies the same properties as in (13.6). The trivial bound using the large sieve (Lemma 19) is now
Next we apply Poisson summation in \(\mathtt{n}\) in residue classes modulo c which is simpler than before because there is no exponential \(e(\pm 2\sqrt{\mathtt{n}{} \mathtt{m}r_2}/c)\). This transforms \( {\mathscr {J}}^{(2)}_r(X, N, M, C, T)\) into
for a weight function \(W_1\) that is compactly supported in the first, third and fourth variable and rapidly decaying in the second. This term contains the same character sum as in (13.9). By the same manipulation we obtain
This is arithmetically analogous to (13.11) except that the roles of \(\delta _2n\) and c are reversed in the exponential. This makes good sense since in generic ranges we have \(c \asymp K^{1/2}\), \( n \asymp K\) in (13.11), but \(c \asymp K\), \(n \asymp K^{1/2}\) in (13.25). The large sieve now gives the bound
Next we apply Poisson summation in \(\mathtt{m}\) in residue classes modulo \(|\Delta | c\). In the character sum
we decompose \(c = c_1 c_2\) where \(c_1 \mid \Delta ^{\infty }\), \((c_2, \Delta ) = 1\). The character sum vanishes unless \(c_1c_2 \mid mn\delta _2 - r_2 |\Delta |\) in which case it equals
Estimating trivially at this point yields
by (13.23) and (13.2), matching the bound in (13.14). With the roles of c and n reversed, we now want to make sure that \(n \ll C(N\delta _1\delta _2)^{-1}\) is large enough. By the trivial bound (13.27) we can assume that \(n \geqslant C(N\delta _1\delta _2)^{-1} K^{-4\eta }\). Now combining (13.27) and (13.24) we obtain
Combining (13.27) and (13.26) we obtain
Combining the previous two bounds, we obtain
by (13.4) and (13.3). This is acceptable unless
which we assume from now on, so that in particular \(n \gg C(N\delta _1\delta _2)K^{-4\eta } \gg N^{1/3 -10\eta }\). By the same argument as in the previous subsection we can now extract certain square classes in the n-sum and then save from the Pólya-Vinogradov inequality. In effect, we replace the factor C from a trivial bound of the \(c_2\)-sum by a factor \( K^{O(\eta )}\sqrt{N/C}\) of the square root of the conductor of \(c_2 \mapsto (\frac{n}{c_2})\). This leads to the final bound
by (13.2), (13.3) and our assumption \(C \geqslant K^{2/3 - 2\eta }\). This is in agreement with (13.1) and completes the analysis of the the second diagonal off-diagonal term, hence the analysis of the complete diagonal term.
14 The off-diagonal term
14.1 Initial steps
We return to (11.3) and analyze the off-diagonal term in Lemma 6 applied to the h-sum. Here we are only interested in upper bounds, so dropping all numerical constants it suffices to estimate
and our target bound is \(K^{-\eta }\). We recall that \(V_t(x, k, \tau )\) was defined in (6.7) and besides the decay properties it is important to note that \(V_t(x, k, \tau )\) is holomorphic in, say, \(|\mathfrak {I}\tau | \leqslant 1\). Since we want to apply the trace formula (Theorem 18) to the spectral sum later, we must not destroy holomorphicity in the third variable.
As in Sect. 11 we write \(d_2 = d_1\delta \), \(n = d_1\delta \nu \). Then \(d_1^2 \mid \nu ^2 D_2\) and \(d_1^2 \mid D_2\) since n is squarefree. Again we write \(d_1 = d\) and \(D_2 d^2\) in place of \(D_2\) and bound the preceding display as
We sum over k using Lemma 20 and open the Kloosterman sum. As in Sect. 10.2, up to a negligible error we obtain the upper bound
where \(\mathcal {I}^{\text {off}}(K) = \mathcal {I}_{d, \delta , \nu , m, c, f_1, f_2, \gamma }^{\text {off}}(K)\) is given by
here \(\tilde{V}\) satisfies (10.2) and is holomorphic in \(|\mathfrak {I}t_\mathtt{u}| \leqslant 1\). The bounds contained in (10.2) imply in particular \(c, f_1, f_2 \leqslant K^{\eta }\) up to a negligible error. For notation simplicity we consider only the plus-case, the minus case being entirely similar.
As in Sect. 11 we start with a Voronoi step, but in the present situation (since we have already excluded the constant function that requires a very careful treatment) we can afford to lose small powers of K on the way. We introduce the notation
for some constant c, not necessarily the same on every occasion. We always assume that \(\eta \) is sufficiently small.
To begin with, we integrate over t which by the properties of (10.2) induces the condition \(|D_1| f_1^2 - |D_2|(df_2)^2 \preccurlyeq K^{1/2}\) up to a negligible error. This now implies \(K^2 \preccurlyeq D_1, D_2 \preccurlyeq K^{2 }\), up to a negligible error. In \(\tilde{V}\) we can separate the variables \(D_1, D_2\) from \(t_\mathtt{u}\) by Mellin inversion (keeping holomorphicity in \(t_\mathtt{u}\)). Since \(\tilde{V}\) is of size \(K^{-1/2}\) and we also get a factor \(K^{1/2}\) from the t-integration, we are left with bounding
where \(V_1, V_2\) have support in \([K^{-O(\eta )}, K^{O(\eta )}]\) with Sobolev norms bounded by \( \preccurlyeq 1\), \(V_3\) is rapidly decaying, and \(\Omega (\tau )\) is holomorphic in \(|\mathfrak {I}\tau | \leqslant 1\), satisfies the conditions (11.2) and is non-negligible only in the range \(K^{1/2} \preccurlyeq |\tau | \preccurlyeq K^{1/2}\).
14.2 Metaplectic Voronoi summation
We now consider the \(D_1\)-sum
and insert (4.8) with \(t = t_\mathtt{u}/2\) if \(\mathtt{u}\) is cuspidal and (4.11) if \(\mathtt{u} = E(., 1/2 + it_\mathtt{u})\) is Eisenstein. For clarity we recall that
where
Even though the normalization in the cuspidal and the Eisenstein case is quite different, the Voronoi formulae in Lemma 10 (with \(r = t_\mathtt{u}/2)\) and 11 (with \(t = t_\mathtt{u}/2\)) can deal in the same fashion with the sums
with
for \(x < 0\) and \(\phi (x) = 0\) for \(x > 0\). It is easy to see that with this choice of \(\phi \) and for \(t \ll K^{1/2 + \varepsilon }\) the two polar terms in Lemma 11 are negligible, due to the strong oscillatory behaviour of the exponential \(e(\pm 2\sqrt{|xD_2|}\nu /c)\) for \(K^2 \preccurlyeq x, D_2 \preccurlyeq K^{2 }\), \(c \preccurlyeq 1\). By the same argument as in Sect. 10.4, see in particular (10.9), the terms with \(D > 0\) on the right hand side of the Voronoi summation formula are negligible. Hence, up to a negligible error, (14.2) becomes
Using (9.7) we approximate the Bessel J-function for \(t_\mathtt{u} \ll K^{1/2 + \varepsilon }\) by an exponential and replace up to a negligible error the preceding display by
with
for n sufficiently large, but fixed. In particular \(f^{\pm }\) satisfies (10.10), and we see from the asymptotic expansion (9.7) that \(f^{\pm }\) is holomorphic in the second variable in, say, \(|\mathfrak {I}\tau | < 1\). For notational simplicity we consider only the leading term \(k=0\), the lower order terms being completely analogous (but easier).
We restore the periods \(P(D; \mathtt{u})\), so that the leading term of (14.1) has the shape
with lower order terms of similar form. Recall that \(K^2 \preccurlyeq D_2 \preccurlyeq K^{2 }\). Integrating by parts, we see as in (10.12)–(10.13) that only the minus-term in the exponential is relevant (the plus-term is negligible) and the x-integral restricts to \(\sqrt{|D_2|}\nu - \sqrt{|D_1|} \preccurlyeq K^{-1/2}\). We therefore introduce a new variable \(h \in \mathbb {Z}\) by
with \(h \preccurlyeq K^{1/2 }\). Changing variables in the x-integral, we obtain
where h is restricted to numbers such that \(-(|D_2|\nu ^2 + h)\) is a negative discriminant. The weight function \(V_3\) forces \(x \preccurlyeq 1\) and more precisely
By a Taylor expansion we can write
where
Again we only keep the leading term (the lower order terms being similar, but easier). We substitute all of this back into \(\tilde{\mathcal {I}}^{\text {off}}(K)\) and pull the x-integration outside which is subject to (14.3), in particular \(x \preccurlyeq 1\). In this way we see that it suffices to bound
where \(V_x(z) = V_2(z) V_1(xz)\), uniformly in
A trivial bound using Cauchy-Schwarz and Proposition 4 gives \( \mathcal {I}_1^{\text {off}}(K) \preccurlyeq 1\), as in Section 10. In order to make progress and get additional savings, we must now treat the \(\mathtt{u}\)-integral non-trivially. This is where the trace formula has its appearance.
14.3 Application of the trace formula
We now apply Theorem 18 to the spectral expression
We discuss the four terms on the right hand side of the trace formula.
(1) The class number term gets immediately cancelled.
(2) The t-integral in the polar term is rapidly decaying and by a Burgess-type subconvexity bound \(L(D, 1/2 + it) \ll |D|^{3/16+\varepsilon }(|1+|t|)^{10}\), say, its contribution to (14.4) is at mostFootnote 12
(3) For the diagonal term we observe that \(\sum _m m^{-1} \int |\Omega (t)W_{t}(rnvm)t| dt \preccurlyeq K\) uniformly in n, r, v by (9.17). If the fundamental discriminants underlying \(D_2d^2\) and \(-|D_2|\nu ^2 - h\) coincide, we have at most \(\preccurlyeq K^{1/4}\) choices for h (in most cases much fewer), so the diagonal term contributes at most
to (14.4).
(4) It remains to deal with the fourth term, and to this end we write \(D_2 = \Delta _2 f_2\) with a fundamental discriminant \(\Delta _2\). Then the Kloosterman term can be bounded by
(Here we exchanged the roles of \(D_1\) and \(D_2\) in Theorem 18.) The general strategy is now as follows: We evaluate the t-integral by Lemma 21. The application of the trace formula was a gambit in the sense that the trivial bound is now only \(\preccurlyeq K^{1/2}\). It is not immediately clear how to make further progress. By using suitable Taylor expansions we observe the lucky coincidence that we obtain rational phases in the exponentials. We are then ready to apply Poisson summation in the long \(\Delta _2\)-sum which will eventually give enough savings. We now make these ideas precise.
We recall that \(\Omega \) satisfies the conditions stated in (11.2) and is negligible for \(|t| \gg K^{1/2+\varepsilon }\). The n, m-sum is absolutely convergent by (9.17), and we can truncate it at \(rnvm \preccurlyeq K^{1/2}\) at the cost of a negligible error. We split the n, m-sum into dyadic ranges \(N \leqslant n \leqslant 2N\), \(M \leqslant m \leqslant 2M\) where
By the remark after Theorem 18 and the properties of \(\Omega \), the integrand of the t-integral is holomorphic in, say, \(|\mathfrak {I}t| < 2/3\), so by contour shifts, Weil’s bound (1.9) and the power series expansion of the Bessel J-function we see that the c-sum is absolutely convergent and can be truncated at \(c \ll K^{10^6}\), say, at the cost of a negligible error. Having truncated the c-sum in this very coarse way, we can sacrifice holomorphicity and include a smooth partition of unity into the t-integral, where a typical portion is weighted by w(|t|/T) with a smooth compactly function w localizing \(|t| \asymp T\) with \(K^{1/2-2\eta } \ll T \ll K^{1/2+\varepsilon }\). We apply Lemma 21a) to evaluate asymptotically the t-integral which in particular restricts the size of c. Splitting also the c-sum into dyadic ranges \(c \asymp C\), we can assume, up to a negligible error,
Having recorded these conditions, we can write the t-integral (as usual, up to a negligible error) as
for a flat function H, i.e. \(x^j\frac{d^j}{dx^j} H^{\pm }(x) \ll _j 1\) for \(j \in \mathbb {N}_0\). Substituting back, it remains to bound
where for given \(r, v, d_1, \tau \) the parameters N, M, C are subject to (14.5) and (14.6) and \(D_2 = \Delta _2 (d_1r\tau vw )^2\). Estimating trivially at this point using the Weil bound (1.9), we obtain the bound
We see that applying the trace formula was a gambit in the sense that the trivial bound is now roughly a factor \(K^{1/2}\) off our target. On the other hand, all automorphic information is now gone, and we may hope to get enough saving from the long character sums. In particular, we can assume that \(C \geqslant K^{1 - a\eta }\) for some sufficiently large constant a, otherwise the trivial bound (14.8) suffices. For such C, we can use a Taylor expansion
with
in the current range of variables. The error term contributes \(\preccurlyeq K^{-1}\) to (14.7). Similarly, we also have
and again the error term contributes \(\preccurlyeq K^{-1}\) to (14.7). Defining \(\tilde{V}_x(z) = z^{-1} V_x(z)\), we are left with bounding
Note the very fortunate fact that the algebraic phase \( e(\pm 2 vwn\sqrt{|\Delta _2|(|D_2|\nu ^2 + h)}/c) \) in (14.7) has become a rational phase. As usual, the \(\Delta _2\)-sum runs over negative fundamental discriminants, and we split the sum into residue classes \(\Delta _2 \equiv 1, 5, 8, 9, 12, 13\) (mod 16) and insert a factor \(\mu ^2(\Delta _2/\alpha )\) with \(\alpha \in \{1, 4\}\) to detect squarefreeness. For notational simplicity let us treat the case of odd \(\Delta \), the case of even \(\Delta \) being similar. Using the well-known convolution formula for \(\mu ^2\), this leaves us with bounding
for a character \(\psi \) modulo 4. Recall that the Kloosterman sum is nonzero only if \(4 \mid c\). Estimating trivially at this point (using (1.9)), we can assume that
by (14.6) and (14.5), the remaining portion being \(O(K^{-\eta })\) if the \(K^{O(\eta )}\) factor in the previous \( \preccurlyeq \) sign is sufficiently large. We now open the Kloosterman sum and apply Poisson summation in \(\Delta _2\) in residue classes modulo \( c \delta d_1vm\tau \). If \(\mathtt{D}\) denotes the dual variable, this yields the character sum
We write \(c = c_1c_2\) where \((c_1, 2\delta d_1vm\tau ) = 1\) and \(c_2 \mid (2\delta d_1vm\tau )^{\infty }\). Then both sums split off a sum modulo \(c_1\) given by
cf. [40, Lemma 2] for the treatment of the theta-multiplier. Summing over \(\Delta _2\) bounds this double sum modulo \(c_1\) by
We estimate the remaining part of the character sum (14.10) trivially by \(c_2^2\delta d_1vm\tau \). By the properties of the (essentially non-oscillating) weight functions \(\tilde{V}_x\) and \(H^{\pm }\), the dual variable \(\mathtt{D}\) can be truncated at
and so the \(\Delta _2\)-sum in (14.9) can be bounded by
using the notation explained in Sect. 1.5. The first term in the first parenthesis accounts for the zero frequency in the Poisson summation formula. We substitute this back into (14.9) getting the (generous) upper bound
Here we dropped the variable \(\delta \preccurlyeq 1\). The rest is book-keeping. By Rankin’s trick we have
for every \(\varepsilon > 0\) and \(x \in \mathbb {N}\). Using (14.6) and (14.5), the bound (14.11) becomes
This is the desired power saving and completes the proof of Theorem 2.
Notes
Note that Kohnen normalizes the inner product (2.2) differently.
Note that the measure \(d\mathtt{u}\) is \(dt/4\pi \) in the Eisenstein case which explains the factor 1/2 in the definition of \(\mathcal {L}(E(., 1/2 + it))\).
with the obvious interpretation in (3.5) for \(t=0\)
Hecke operators and inner products are normalized differently in [4], but this plays no role in the present discussion.
This determines b(n) only up to a constant of absolute value 1, but we will only encounter products of the type \(b(n_1)\overline{b(n_2)}\), so that this constant is irrelevant.
Note that there is a sign error in [44, Theorem 5.3]: the residue R should be subtracted.
This corresponds to \((4\pi y)^{3/4} {\mathscr {F}}(z)\) in [35, Theorem 2, p. 91]. We have multiplied Zagier’s definition by a factor \((4\pi y)^{3/4}\) in order to make our \(|\mathcal {H}(z)|\) invariant under \(\Gamma _0(4)\).
In the original version of [34, Corollary 4], the n-sum is restricted to odd numbers n, but in the case of fundamental discriminants \(\Delta \), the symbol \((\frac{\Delta }{n})\) is also defined for even n, and the proof works in the same way.
This remains true in the excluded case \(\mathtt{u} = \text {const}\) if we define \(\lambda _\mathtt{u} = \rho _1\).
Of course, instead of subconvexity, we could also used mean value bounds on average over \(D_2\) to get an even stronger saving.
References
Ahlgren, S., Andersen, N.: Kloosterman sums and Maass cusp forms of half integral weight for the modular group. IMRN, pp. 492–570 (2018)
Andersen, N., Duke, W.: Modular invariants for real quadratic fields and Kloosterman sums. Algebra Numb Theory 14, 1537–1575 (2020)
Barban, M.B.: Linnik’s “large sieve” and a limit theorem for the class number of ideals of an imaginary quadratic field. Izv. Akad. Nauk. SSSR Ser. Mat. 26, 573–580 (1962)
Baruch, M., Mao, Z.: A generalized Kohnen-Zagier formula for Maass forms. J. Lond. Math. Soc. 82, 1–16 (2010)
Biró, A.: Cycle integrals of Maass forms of weight 0 and Fourier coefficients of Maass forms of weight 1/2. Acta Arith. 94, 103–152 (2000)
Biró, A.: A relation between triple products of weight 0 and weight 1/2 cusp forms. Israel J. Math. 182, 61–101 (2011)
Blomer, V.: Subconvexity for a double Dirichlet series. Compos. Math. 147, 355–374 (2011)
Blomer, V.: On the size of Ikeda lifts. Manuscr. Math. 148, 341–349 (2015)
Blomer, V.: Spectral summation formulae for GSp(4) and moments of spinor L-functions. J. Eur. Math. Soc. 21, 1751–1774 (2019)
Blomer, V., Fouvry, E., Kowalski, E., Michel, P., Milićević, D., Sawin, W.: The second moment theory of families of \(L\)-functions. arxiv:1804.01450
Blomer, V., Khan, R., Young, M.: Mass distribution of holomorphic cusp forms. Duke Math. J. 162, 2609–2644 (2013)
Böcherer, S.: Bemerkungen über die Dirichletreihen von Koecher und Maaß. Math. Gottingensis 68, 36 (1986)
Bourgain, J.: Geodesic restrictions and \(L^p\)-estimates for eigenfunctions of Riemannian surfaces. In: Linear and Complex Analysis. Amer. Math. Soc. Transl. Ser., vol. 227. Amer. Math. Soc., Providence, RI, pp 27-35 (2009)
Brown, J.: An inner product relation on Saito-Kurokawa lifts. Ramanujan J. 14, 89–105 (2007)
Burq, N., Gérard, P., Tzvetkov, N.: Restrictions of the Laplace-Beltrami eigenfunctions to submanifolds. Duke Math. J. 138, 445–486 (2007)
Bykovskii, V.A.: Functional equations for Hecke-Maass series. Funct. Anal. Appl. 34, 98–105 (2000)
Chen, S.-Y., Ichino, A.: Petersson norms of generic cusp forms and special values of adjoint \(L\)-functions for \({\rm GSp}_4\). arXiv:1902.06429
Chen, X., Sogge, C.: A few endpoint geodesic restriction estimates for eigenfunctions. Commun. Math. Phys. 329, 435–459 (2014)
Cogdell, J., Luo, W.: The Bergman kernel and mass equidistribution on the Siegel modular variety \({\rm Sp}_{2n}(\mathbb{Z}) \backslash \mathfrak{H}_n\). Forum Math. 23, 141–159 (2011)
Conrey, B., Iwaniec, H.: The cubic moment of central values of automorphic L-functions. Ann. Math. 151, 1175–1216 (2000)
Deshouillers, J.-M., Iwaniec, H.: Kloosterman sums and Fourier coefficients of cusp forms. Invent. Math. 70, 219–288 (1982)
Diamantis, N., Goldfeld, D.: A converse theorem for double Dirichlet series and Shintani zeta functions. J. Math. Soc. Jpn. 66, 449–477 (2014)
Dickson, M., Pitale, A., Saha, A., Schmidt, R.: Explicit refinements of Böcherer’s conjecture for Siegel modular forms of squarefree level. J. Math. Soc. Jpn. 72, 251–301 (2020)
Duke, W., Imamoğlu, Ö.: A converse theorem and the Saito-Kurokawa lift. Int. Math. Res. Not. 7, 347–355 (1996)
Duke, W., Imamoğlu, Ö., Toth, A.: Geometric invariants for real quadratic fields. Ann. Math. 184, 949–990 (2016)
Duke, W., Friedlander, J., Iwaniec, H.: The subconvexity problem for Artin \(L\)-functions. Invent. Math. 149, 489–507 (2002)
Eichler, M., Zagier, D.: The theory of Jacobi forms. In: Progress in Mathematics, vol. 55. Birkhäuser, Boston (1985)
Erdélyi, A., Magnus, W., Oberhettinger, F., Tricomi, F.: Higher Transcendental Functions II. McGraw-Hill, New York (1953)
Furusawa, M., Moromoto, K.: Refined global Gross-Prasad conjecture on special Bessel periods and Böcherer’s conjecture. J. Eur. Math. Soc. 23, 1295–1331 (2021)
Ghosh, A., Reznikov, A., Sarnak, P.: Nodal domains of Maass forms I. Geom. Funct. Anal. 23, 1515–1568 (2013)
Goldfeld, D., Hoffstein, J.: Eisenstein series of \(\frac{1}{2}\)-integral weight and the mean value of real Dirichlet \(L\)-series. Invent. Math. 80, 185–208 (1985)
Good, A.: Local analysis of Selberg’s trace formula. In: Lecture Notes in Mathematics, vol. 1040 (1983)
Gradshteyn, I.S., Ryzhik, I.M.: Table of Integrals, Series and Products, 7th edn. Academic Press, New York (2007)
Heath-Brown, D.R.: A mean value estimate for real character sums. Acta Arith. 72, 235–275 (1995)
Hirzebruch, F., Zagier, D.: Intersection numbers of curves on Hilbert modular surfaces and modular forms of nebentypus. Invent. Math. 36, 57–113 (1976)
Holowinksy, R., Soundararajan, K.: Mass equidistribution for Hecke eigenforms. Ann. Math. 172, 1517–1528 (2010)
Ichino, A.: Pullbacks of Saito-Kurokawa lifts. Invent. Math. 162, 551–647 (2005)
Ichino, A., Ikeda, T.: On the periods of automorphic forms on special orthogonal groups and the gross-Prasad conjecture. GAFA 19, 1378–1524 (2010)
Imai, K.: Generalization of Hecke’s correspondence to Siegel modular forms. Am. J. Math. 102, 903–936 (1980)
Iwaniec, H.: Fourier coefficients of modular forms of half-integral weight. Invent. Math. 87, 385–401 (1987)
Iwaniec, H.: The spectral growth of automorphic \(L\)-functions. J. Reine Angew. Math. 428, 139–159 (1992)
Iwaniec, H.: Topics in classical automorphic forms. In: Graduate Studies in Mathematics, vol. 17. AMS (1997)
Iwaniec, H.: Spectral methods of automorphic forms. In: Graduate Studies in Mathematics, vol. 53 , 2nd edn. AMS (2002)
Iwaniec, H., Kowalski, E.: Analytic number theory. AMS Colloquium Publications 53. Providence (2004)
Iwaniec, H., Sarnak, P.: \(L^{\infty }\) norms of eigenfunctions of arithmetic surfaces. Ann. Math. 141, 301–320 (1995)
Jutila, M.: On the mean value of \(L(1/2, \chi )\) for real characters. Analysis 1, 149–161 (1981)
Katok, S., Sarnak, P.: Heegner points, cycles and Maass forms. Israel J. Math. 84, 193–227 (1993)
Katsurada, H., Kawamura, H.: Ikeda’s conjecture on the period of the Duke-Imamoğlu-Ikeda lift. Proc. Lond. Math. Soc. 111, 445–483 (2015)
Kim, H.: Second moments of twisted Koecher-Maass series. Abh. Math. Semin. Univ. Hamb. 82, 153–172 (2012)
Kitaoka, Y.: Fourier coefficients of Siegel cusp forms of degree 2. Nagoya Math. J. 93, 149–171 (1984)
Kohnen, W.: Modular forms of half-integral weight on \(\Gamma _0(4)\). Math. Ann. 248, 249–266 (1980)
Kohnen, W.: Fourier coefficients of modular forms of half-integral weight. Math. Ann. 271, 237–268 (1985)
Kohnen, W., Skoruppa, N.-P.: A certain Dirichlet series attached to Siegel modular forms of degree two. Invent. Math. 95, 541–558 (1989)
Kohnen, W., Zagier, D.: Values of \(L\)-series of modular forms at the center of the critical strip. Invent. Math. 64, 175–198 (1981)
Kuznetsov, N.: The Petersson conjecture for cusp forms of weight zero and the Linnik conjecture. Math. USSR-Sb 39, 299–342 (1981)
Li, X., Young, M.: The \(L^2\) restriction norm of a \({\rm GL}_3\) Maass form. Compos. Math. 148, 675–717 (2012)
Li, X., Liu, S.-C., Young, M.: The \(L^2\) restriction norm of a Maass form on \({\rm SL}_{n+1}(\mathbb{Z})\). Math. Ann. 371, 1301–1335 (2018)
Liu, S.-C., Masri, R., Young, M.: Subconvexity and equidistribution of Heegner points in the level aspect. Compos. Math. 149, 1150–1174 (2013)
Liu, S.-C., Young, M.: Growth and nonvanishing of restricted Siegel modular forms arising as Saito-Kurokawa lifts. Am. J. Math. 136, 165–201 (2014)
Maaß, H.: Siegel’s modular forms and Dirichlet series. In: Lecture Notes in Mathematics, vol. 216. Springer-Verlag (1971)
Marshall, S.: Restrictions of \({\rm SL}_3\) Maass forms to maximal flat subspaces. IMRN, pp. 6988–7015 (2015)
Marshall, S.: Geodesic restrictions of arithmetic eigenfunctions. Duke Math. J. 165, 463–508 (2016)
Martin, K., McKee, M., Wambach, E.: A relative trace formula for a compact Riemann surface. Int. J. Numb. Theory 7, 389–429 (2011)
Pitt, N.: On pairs of closed geodesics on hyperbolic surfaces. Ann. Inst. Fourier 49, 1–25 (1999)
Pitt, N.: A sum formula for a pair of closed geodesics on a hyperbolic surface. Duke Math. J. 143, 407–435 (2008)
Proskurin, N.: On the general Kloosterman sums. J. Math. Sci. 129, 3874–89 (2005)
Reznikov, A.: Geodesic restrictions for the Casimir operator. J. Funct. Anal. 261, 2437–2460 (2011)
Reznikov, A.: A uniform bound for the geodesic period. Forum Math. 27, 1569–1590 (2015)
Sarnak, P.: Letter to Andrei Reznikov (June 2008)
Silberman, L., Venkatesh, A.: Entropy bounds and quantum unique ergodicity for Hecke eigenfunctions on division algebras, In: Probabilistic Methods in Geometry, Topology and Spectral Theory. Contemp. Math., vol. 739, pp. 171–197 (2019)
Tenenbaum, G., Introduction to analytic and probabilistic number theory. In: Graduate Studies in Mathematics, vol. 163, 3rd edn. Am. Math. Soc, Providence, RI (2015)
Vinogradov, I.M.: On the number of integral points in a three-dimensional region. Izv. Akad. Nauk. SSSR Ser. Math. 27 (1963), 3–8, translated in Am. Math. Soc. Transl., II. Ser. 58 (1966), 104–109
Waibel, F.: Fourier coefficients of half-integral weight cusp forms and Waring’s problem. Ramanujan J. 47, 185–200 (2018)
Young, M.: The quantum unique ergodicity conjecture for thin sets. Adv. Math. 286, 958–1016 (2016)
Zagier, D.: Modular forms whose Fourier coefficients involve zeta-functions of quadratic fields. In: Modular functions of one variable. Lecture Notes in Math., vol. 627. Springer, Berlin (1977)
Zhang, S.-W.: Gross-Zagier formula for \({\rm GL}_2\). Asian J. Math. 5, 183–290 (2001)
Zhang, S.-W.: Gross-Zagier formula for \({\rm GL}(2)\). II. Heegner points and Rankin \(L\)-series. Math. Sci. Res. Inst. Publ., vol. 49, pp. 191–214. Cambridge University Press (2004)
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
There is no conflict of interest and no associated data.
Additional information
Communicated by Kannan Soundararajan.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The first author was supported in part by the DFG-SNF lead agency program grant BL 915/2-2.
Appendices
Appendix A: A period formula on average
In order to verify the constant 1/4 in the period formula (4.10), we take a large parameter T, a very small \(\varepsilon > 0\) and consider the two averages
and
for a (fixed) negative fundamental discriminant \(\Delta \) with class number 1 (for simplicity) and
The computation is relatively standard, so we can be brief. Using the same approximate functional equation as in (4.14) we have
for even u with \(W_t\) as in (4.15). For odd u, each summand in \(A_1(T)\) vanishes. This gives
To make this spectrally complete, we artificially add the corresponding Eisenstein contribution
Using (4.16), this can be written in terms of moments of the Riemann zeta function and Dirichlet L-functions. By standard mean value bounds the contribution is \(O(T^{1+\varepsilon })\) (recall that \(\Delta \) is fixed). We apply the Kuznetsov formula for the even spectrum given in Lemma 7. In this way we get a main term
We shift the \(s_1, s_2\)-contour to real part \(-1/4\), obtaining
with main term \(\gg T^{2-\varepsilon }\). It remains to deal with the off-diagonal terms in (3.5) and we briefly sketch why both of them are negligible. By (9.17) we can restrict \(n, m \ll T^{1+\varepsilon }\) at the cost of a negligible error. The first off-diagonal term contributes a term of the shape
By Lemma 21, the t-integral is negligible unless \(c \leqslant \sqrt{nm} T^{-2+\varepsilon }\) which does not happen for \(n, m \ll T^{1+\varepsilon }\). The second off-diagonal term contributes a term of the shape
Again by Lemma 21, up to a small error the t-integral is negligible unless \(c \leqslant \sqrt{nm} T^{-1+\varepsilon }\) in which case it is essentially non-oscillating in n, m, so we can restrict to \(n, m = T^{1+o(1)}\), \(c \ll T^{\varepsilon }\). Poisson summation in the m-variable now shows that the entire expression is negligible, since \(\chi _{\Delta }\) is a non-trivial character.
We continue with the analysis of \(A_2(T)\). Since we assume class number 1, we have
where \(z_{\Delta }\) is the unique Heegner point (modulo \(\Gamma \)) and \(\epsilon _{\Delta } \in \{1, 2, 3\}\) is half the number of roots of unity in \(\mathbb {Q}(\sqrt{\Delta })\). We artificially add the constant function and the Eisenstein series at a cost of O(T) and apply the pre-trace formula for the entire spectrum–for odd u we have \(u(z_{\Delta }) = 0\) automatically. This gives
where
The stabilizer of \(z_{\Delta }\) contributes
It is easy to see that \(u(\gamma z, z) \leqslant \delta \) implies \(\Vert \gamma \Vert \ll \sqrt{\delta + 1}\), cf. e.g. [45, (A.7) with \(n=1\)]. From [43, (1.64)] we see that k(z, w) is negligible as soon as \(u \gg T^{\varepsilon - 2}\), so that the contribution of all matrices not in the stabilizer is negligible. Combining (A.1), (A.2) and the class number formula in the case \(h_{\Delta } = 1\), we obtain
in accordance with (4.10).
Appendix B: A Dirichlet series with Hurwitz class numbers
The aim of this section is an analysis of the L-function
for \(4 \mid c\), \((a, c) = 1\). As before, H(D) denotes the Hurwitz class number, and the series converges absolutely in \(\mathfrak {R}s > 5/4\). The results may be known to specialists, but do not seem to be in the literature and may be of independent interest. We recall the notation (1.8).
Lemma 26
Let \(c > 0\), \(4 \mid c\), \((a, c) = 1\). The Dirichlet series \(L_+(s, a/c)\) has meromorphic continuation to all \(\mathbb {C}\). It has two simple poles at \(s = 5/4\), \(s = 3/4\) (and no other poles) with residues
Proof
We recall the definition of \(\mathcal {H}(z)\) in (5.4) and compute
with \(c_1 = - (4\pi )^{3/4}/12\), \(c_2 = 1/(\sqrt{8} \pi ^{1/4})\) in two ways. Let
This L-function is obviously holomorphic in \(\mathfrak {R}s > 3/4\). It has a simple pole at \(s=3/4\) with residue
and it has a functional equation
This follows from the corresponding properties of Hurwitz zeta function and standard computations with Gauß sums (or the transformation behaviour of one-dimensional theta series). In particular we obtain the analytic continuation of \(L_-(s, a/c)\) to all of \(\mathbb {C}\) with only a simple pole at \(s=3/4\). Moreover, \(L_-(s, a/c) = 0\) for \(s = 1/4 - n\), \(n = 1, 2, 3 \ldots \).
Returning to (B.1), we have
where
Since \(W_{3/4, 1/4}(x) = e^{-x/2} x^{3/4}\), we can explicitly compute
For the analysis of \(G_{-3/4}\), we combine [33, 7.621.3, 9.131.1, 9.111] to obtain
In particular, by repeated partial integration in the t-integral we see that \(G_{-3/4}(s)\) is meromorphic with simple poles at most at \(s= (2n+1)/4\), for integers \(n \leqslant 0\). From [33, 9.121.24] we get
On the other hand, we may complete the pair (a, c) to a matrix \(({\begin{matrix} a &{} b\\ c &{} d\end{matrix}})\in \Gamma _0(4)\). Then
in particular
Splitting the integral in (B.1) into \(\int _0^{1/c}\) and \(\int _{1/c}^{\infty }\) and applying the functional equation in the former, we obtain
This establishes the analytic continuation of \(\mathcal {I}(s, a/c)\) to all of \(\mathbb {C}\) except for poles at \(s = 5/4, 3/4, 1/4, -1/4\). From the preceding analysis we conclude the meromorphic continuation of \(L^+(s, a/c)\) as a function of finite order with possible poles at most at 5/4, 3/4, 1/4. Since \(L_{-}(s, a/c) G_{-3/4}(s)\) is holomorphic at \(s = 5/4\), we obtain the formula for the residue at \(s = 5/4\) from (B.4), (B.5) and (B.7). Using in addition (B.2), (B.6), we obtain
which confirms the residue formula at \(s = 3/4\), since \({\bar{\epsilon }}_{-a}(\frac{c}{-a}) = (-i) (\frac{-c}{a}) {\bar{\epsilon }}_a\). Similarly, using also (B.3) and the simple formula \(F(1/2, 0, 3/2; 1/2) = 1\), we get
with \(d \equiv -{\bar{a}}\) (mod c), which vanishes. \(\square \)
Remark: One can show that away from the two poles, the function \(L^+(s, a/c)\) satisfies the growth condition \(L^+(s, a/c) \ll _{\mathfrak {R}s} ((1 + |s|) c)^{\max (0, 5/4 - \mathfrak {R}s, 1 - 2\mathfrak {R}s) + \varepsilon }.\)
Appendix C: A volume computation
In this appendix we justify (1.4) for a Saito–Kurokawa lift \(F \in S_k^{(2)}\) associated with a Hecke eigenform \(f \in S_{2k-2}\). Following [8, Sect. 2], we write its Fourier expansion at \(Z = iY\) as
normalized such that \(\alpha (T)^2 = L(f \times \chi _{-\det 2T}, 1/2)\) if \(-\det 2T\) is a fundamental discriminant. In this case, \(\Vert F \Vert = (2\pi )^{-k}\Gamma (k) k^{-1/4 + o(1)}\) by [8, (2.8)]. We conclude that
The function
is invariant under conjugation, and for a diagonal matrix \(X = \text {diag}(x_1, x_2)\) it is negligible unless \(x_1, x_2 = k/4\pi + O(\sqrt{k}\log k)\), cf. [8, Sect. 4]. For large k, we conclude that \(\mathcal {F}(Y)\) is negligible unless there exists \(T \in \mathcal {P}(\mathbb {Z})\) such that the two eigenvalues of TY are \(k/4\pi + O(\sqrt{k}\log k)\). In particular, the essential support of \(\mathcal {F}\) can be restricted to matrices Y whose maximal eigenvalue \(\lambda _{\max }(Y)\) satisfies \(\lambda _{\max }(Y) \ll k\). Since \(|\mathcal {F}|\) is invariant under \(Y \mapsto Y^{-1}\), its minimal eigenvalue \(\lambda _{\min }(Y)\) must satisfy \(\lambda _{\min }(Y) \gg 1/k\). The implied constants will not be relevant. As in (6.1), we write
with \(x + iy \in \Gamma \backslash \mathbb {H}\). Let \(\mathcal {Y}\) be the set of such matrices Y with \(1/k \ll \lambda _{\min }(Y) \leqslant \lambda _{\max }(Y) \ll k\). Without loss of generality we may assume that Y is Minkowski-reduced, equivalently \(x+iy\) is in the standard fundamental domain \(|x|\leqslant 1/2\), \(x^2 + y^2 \geqslant 1\). The two eigenvalues of Y are given by
Thus we have
as desired.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Blomer, V., Corbett, A. A symplectic restriction problem. Math. Ann. 382, 1323–1424 (2022). https://doi.org/10.1007/s00208-021-02268-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00208-021-02268-6