Abstract
We prove power-saving bounds for general Kloosterman sums on \({\text {Sp}}(4)\) associated to all Weyl elements via a stratification argument coupled with p-adic stationary phase methods. We relate these Kloosterman sums to the Fourier coefficients of \({\text {Sp}}(4)\) Poincare series.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The classical Kloosterman sum is given by
where \({\text {e}}\left( {x}\right) = e^{2\pi i x}\). Kloosterman sums naturally appear in the Fourier expansion of \({\text {GL}}(2)\) Poincaré series
which play an important role in number theory. In [2], Bump, Friedberg and Goldfeld introduced \({\text {GL}}(r)\) Poincaré series for \(r\ge 2\), and gave a generalisation of Kloosterman sums to \({\text {GL}}(3)\). The notion of Kloosterman sums was then generalised to \({\text {GL}}(r)\) for \(r\ge 2\) by Friedberg [9], and then to arbitrary simply connected Chevalley groups by Dąbrowski [5].
By methods of algebraic geometry, Weil [16] obtained a bound for \({\text {GL}}(2)\) Kloosterman sums
where \(\tau \) denotes the divisor function. However, it remains a major open problem to give non-trivial bounds for Kloosterman sums in general, and currently only a small set of examples can be treated. Bounds for \({\text {GL}}(3)\) Kloosterman sums were first obtained by Larsen [2, Appendix] and Stevens [14], and were improved by Dąbrowski and Fisher [7]. Bounds for \({\text {GL}}(4)\) Kloosterman sums were given by Huang [10, Appendix]. Friedberg [9] generalised the results to \({\text {GL}}(r)\) Kloosterman sums attached to certain Weyl elements. On reductive groups, Dąbrowski and Reeder [8] gave the size of Kloosterman sets, establishing a trivial bound for Kloosterman sums on reductive groups.
Other than Poincaré series, another application of Kloosterman sums is found in the relative trace formula, which integrates an automorphic kernel over two subgroups with their respective characters. In particular, a prime application for bounds of Kloosterman sums is the analysis of the arithmetic side of the Petersson/Kuznetsov spectral summation formula. A more detailed description of this can be found in [4].
Now we introduce the main results. Let
be the standard symplectic group. When k is a field, G(k) is the group of linear transformations of \(k^{2r}\) preserving a symplectic bilinear form on \(k^{2r}\). The standard torus and the standard unipotent subgroup of G are given by
respectively. We denote by \(N = N_G(T)\) the normaliser of T in G. The Weyl group is given by \(W := N_G(T)/T\). Let \(w: N \rightarrow W\) be the canonical projection map with respect to this decomposition. For \(n\in N\), we also define \(U_n := U \cap n^{-1} U^T n\), and \(\overline{U}_n := U \cap n^{-1} U n\). Note that \(U_n, \overline{U}_n\) depend only on the image w(n) of the canonical projection.
Here we follow the notations in Stevens [14]. Let p be a rational prime. We have a Bruhat decomposition
For \(n\in N\left( {\mathbb {Q}_p}\right) \), we define
and projection maps
by the relation \(x = u(x) n u'(x)\) for \(x \in X(n)\).
Remark
In [14], the notions above are defined for \({\text {GL}}(r)\), but it is straightforward to check that the construction also works for \({\text {Sp}}(2r)\), with essentially the same proofs. In particular, X(n) is finite, and the projection maps \(u,u'\) are well-defined.
Let \(n\in N\left( {\mathbb {Q}_p}\right) \), \(\psi _p\) a character of \(U\left( {\mathbb {Q}_p}\right) \) which is trivial on \(U\left( {\mathbb {Z}_p}\right) \), and \(\psi '_p\) a character of \(U_n\left( {\mathbb {Q}_p}\right) \) trivial on \(U_n\left( {\mathbb {Z}_p}\right) \), such that \(\psi '_p\) is the restriction of some character of \(U\left( {\mathbb {Q}_p}\right) \) trivial on \(U\left( {\mathbb {Z}_p}\right) \). Then the local Kloosterman sum is given by
If \(\psi '_p\) is given as a character of \(U\left( {\mathbb {Q}_p}\right) \) which is trivial on \(U\left( {\mathbb {Z}_p}\right) \), we write \({\text {Kl}}_p\left( {n, \psi _p, \psi '_p}\right) \) to mean \({\text {Kl}}_p\left( {n, \psi _p, \psi '_p|_{U_n\left( {\mathbb {Q}_p}\right) }}\right) \).
To define a global Kloosterman sum, let \(n \in N(\mathbb {Q})\), \(\psi = \prod \limits _p \psi _p\) a character of \(U(\mathbb {A})\) which is trivial on \(\prod \limits _p U\left( {\mathbb {Z}_p}\right) \), and \(\psi '\) a character of \(U_n\left( {\mathbb {A}}\right) \) trivial on \(\prod \limits _p U_n\left( {\mathbb {Z}_p}\right) \), such that \(\psi '\) is the restriction of some character of \(U\left( {\mathbb {A}}\right) \) trivial on \(\prod \limits _p U\left( {\mathbb {Z}_p}\right) \). Then the global Kloosterman sum is given by
Remark
This definition of Kloosterman sums is different from the symplectic Kloosterman sums introduced by Kitaoka [11], which are more relevant for classical \({\text {Sp}}(4)\) Fourier expansions with respect to the upper right 2-by-2 block, which however is not a full parabolic subgroup. Tóth [15] proved some properties and estimates of such Kloosterman sums. The Kloosterman sums introduced here fit into the general framework of Kloosterman sums defined on reductive groups, see e.g. Dąbrowski [5].
For \(G = {\text {Sp}}\left( {4, \mathbb {Q}_p}\right) \), a set of simple roots of G with respect to the maximal torus T is given by \(\Delta = \left\{ {\alpha , \beta }\right\} \), where
Then \(\Psi ^+ = \left\{ {\alpha , \beta , \alpha +\beta , 2\alpha +\beta }\right\} \) is a set of positive roots. We denote by \(s_\alpha \) and \(s_\beta \) the simple reflections in the hyperplane orthogonal to \(\alpha \) and \(\beta \) respectively. Then the Weyl group of G with respect to T is given by
We fix once and for all representatives for \(s_\alpha \) and \(s_\beta \):
We also denote the long Weyl element \(s_\alpha s_\beta s_\alpha s_\beta \) by \(w_0\). Characters of \(U\left( {\mathbb {Q}_p}\right) \) trivial on \(U\left( {\mathbb {Z}_p}\right) \) are given by \(\psi _{m_1, m_2}\) for \(m_1, m_2\in \mathbb {Z}\), where
where \({\text {e}}:\mathbb {Q}_p/\mathbb {Z}_p \rightarrow \mathbb {C}^\times \) is the standard additive character satisfying \({\text {e}}(p^{-r}) = e^{2\pi i/p^r}\). For \(w\in W\), \(r,s\in \mathbb {Z}\), we set
The exact conditions r, s have to satisfy for \(X(n_{w,r,s})\) to be nonempty are given in Sect. 3, but in general we require \(r,s\ge 0\). By counting the number of terms in the Kloosterman sum [8, Theorem 0.3], we obtain a trivial bound
Now we state the main results of the paper. The first result concerns non-trivial bounds for local \({\text {Sp}}(4)\) Kloosterman sums.
Theorem 1.1
Let \(\psi = \psi _{m_1,m_2}\), \(\psi ' = \psi _{n_1,n_2}\) be characters of \(U(\mathbb {Q}_p)/U(\mathbb {Z}_p)\), and r, s be non-negative integers. Then we have
Moreover, the Kloosterman sum \({\text {Kl}}_p(n_{w,r,s},\psi ,\psi ')\) vanishes if the conditions on the right are not satisfied.
Now we state the bounds for global \({\text {Sp}}(4)\) Kloosterman sums. Recall that characters of \(U(\mathbb {Q})/U(\mathbb {Z})\) are given by \(\psi _{m_1,m_2}\) for \(m_1,m_2\in \mathbb {Z}\), where
For \(w\in W\), \(c_1,c_2\in \mathbb {N}\), let
Theorem 1.2
Let \(\psi = \psi _{m_1,m_2}\), \(\psi ' = \psi _{n_1,n_2}\) be characters of \(U(\mathbb {Q})/U(\mathbb {Z})\). For every \(\varepsilon >0\) and \(c_1,c_2\in \mathbb {N}\), the following bounds hold:
Moreover, the Kloosterman sum \({\text {Kl}}(n_w(c_1,c_2),\psi ,\psi ')\) vanishes if the conditions on the right are not satisfied.
Theorems 1.1, 1.2 are proved in Sect. 4. We develop a stratification of \({\text {Sp}}(2r)\) Kloosterman sums in Sect. 2, generalising the stratification of \({\text {GL}}(r)\) Kloosterman sums introduced by Stevens [14]. Let
be a subgroup of diagonal matrices. We will show that for \(n\in N\left( {\mathbb {Q}_p}\right) \), there is a group action \(\mathcal T\times X(n) \rightarrow X(n)\) sending (t, x) to \(txs^{-1}\), where \(s = n^{-1}tn\). The Kloosterman sum, as a sum over X(n), can then be partitioned into sums over \(\mathcal T\)-orbits, in Theorem 2.4. The summands are then evaluated using results of Adolphson and Sperber [1] for multi-dimensional exponential sums of Laurent polynomials, as well as the p-adic stationary phase method for higher prime powers.
We give some brief comments on the bounds obtained here. As we shall see in Sect. 3, the Kloosterman sums corresponding \(w=s_\alpha , s_\beta \) are just classical Kloosterman sums, and the bounds given here are just the optimal bounds for classical Kloosterman sums. The Kloosterman sums corresponding to \(w=s_\alpha s_\beta , s_\beta s_\alpha \) can be expressed in terms of exponential sums of Laurent polynomials, whose optimal bounds are also well understood. Meanwhile, the bounds for \(w=s_\alpha s_\beta s_\alpha , s_\beta s_\alpha s_\beta , w_0\), obtained using the stratification technique, are believed to be not optimal, since only the cancellations within individual \(\mathcal T\)-orbits are considered. In particular, we expect square-root cancellation for Kloosterman sums \({\text {Kl}}_p(n_{w_0,r,s},\psi _{m_1,m_2},\psi _{n_1,n_2})\) when \(m_1,m_2,n_1,n_2\) are coprime to p, but this is beyond reach of current methods.
Let \(F: T\left( {\mathbb {R}^+}\right) \rightarrow \mathbb {C}\) be a smooth function with rapid decay. Let \(\psi , \psi '\) be characters of \(U(\mathbb {R})\) trivial on \(U(\mathbb {Z})\). For \(g = uy\in G/K\), where \(u \in U(\mathbb {R})\), \(y \in T\left( {\mathbb {R}^+}\right) \), define \(\mathcal F_\psi (g) := \psi \left( {\eta }\right) F\left( {y}\right) \). The symplectic Poincaré series associated to F is given by
where \(\Gamma = {\text {Sp}}(2r, \mathbb {Z})\), and \(P_0\) is the standard minimal parabolic subgroup of G. The \(\psi '\)-th Fourier coefficient of \(P_\psi (g)\) is given by
We compute in Sect. 5 the Fourier coefficients \(P_{\psi , \psi '} (g)\) of the Poincaré series \(P_\psi (g)\), in terms of auxiliary Kloosterman sums, which are also defined in Sect. 5. The bounds given in Theorem 1.1 also apply to these auxiliary Kloosterman sums, via Proposition 5.2.
2 Stratification of symplectic Kloosterman sums
Consider the subgroup of diagonal matrices
Note that in general elements of \(\mathcal T\) are not symplectic.
Lemma 2.1
Let \(u \in U\left( {\mathbb {Q}_p}\right) \), and \(t\in \mathcal T\). Then \(tut^{-1}\in U\left( {\mathbb {Q}_p}\right) \).
Proof
Trivial.
Lemma 2.2
Let \(n\in N\left( {\mathbb {Q}_p}\right) \), and \(t \in \mathcal T\). Then \(n^{-1} t n \in \mathcal T\).
Proof
Suppose \(t = {\text {diag}}\left( {a_1, \ldots , a_r, ca_1^{-1}, \ldots , ca_r^{-1}}\right) \). Then in general \(n^{-1} t n\) has the form
where \(\sigma \) is a permutation of \(\left\{ {1,\ldots , r}\right\} \), and \(\tau _i: \left\{ {a_i, ca_i^{-1}}\right\} \rightarrow \left\{ {a_i, ca_i^{-1}}\right\} \) are permutations for \(i = 1,\ldots , r\). Since \(a_i = c\left( {ca_i^{-1}}\right) ^{-1}\), we see that
Let \(x = unu' \in C(n)\), and \(t \in \mathcal T\). By Lemma 2.2, \(s := n^{-1}tn \in \mathcal T\). By Lemma 2.1, we see that
As conjugation by t and s preserves \(U\left( {\mathbb {Z}_p}\right) \) and \(U_n\left( {\mathbb {Z}_p}\right) \), this induces an action on \(X(n)= U(\mathbb {Z}_p) \backslash C(n) / U_n(\mathbb {Z}_p)\):
For characters \(\psi : U\left( {\mathbb {Q}_p}\right) /U\left( {\mathbb {Z}_p}\right) \rightarrow \mathbb {C}^\times \), \(\psi ': U_n\left( {\mathbb {Q}_p}\right) /U_n\left( {\mathbb {Z}_p}\right) \rightarrow \mathbb {C}^\times \), decomposition of X(n) into \(\mathcal T\)-orbits gives a decomposition of Kloosterman sums:
Let \(\alpha _i = e_i - e_{i+1}\), \(1 \le i \le r-1\), and \(\alpha _r = 2e_r\) be the simple roots of T in G. Denote \(\Delta = \left\{ {\alpha _1, \ldots , \alpha _r}\right\} \), and \(\Delta _w = \left\{ {\alpha \in \Delta }\;\big |\;{w(\alpha )<0}\right\} \). Let \(U_{\alpha _i}(\mathbb {Q}_p) \subseteq U(\mathbb {Q}_p)\) be the root subgroup corresponding to \(\alpha _i\). For \(u\in U(\mathbb {Q}_p)\), we also use \(\alpha _i\) to denote the canonical projection map
Explicitly, for \(u = (u_{ij}) \in U(\mathbb {Q}_p)\), the projection maps are given by
Characters of \(U\left( {\mathbb {Q}_p}\right) /U\left( {\mathbb {Z}_p}\right) \) have the form
where \({\text {e}}: \mathbb {Q}_p/\mathbb {Z}_p\rightarrow \mathbb {C}^\times \) is the standard additive character. For \(x = u(x) n u'(x)\), define projections
Note that \(\kappa '_i(x) = 0\) unless \(\alpha _i \in \Delta _w(n)\). For \(u\in U(\mathbb {Q}_p)\), and \(t = {\text {diag}}\left( {a_1, \ldots , a_r, ca_1^{-1}, \ldots , ca_r^{-1}}\right) \in \mathcal T\), we compute that
Suppose \(t = {\text {diag}}(a_1, \ldots , a_r, ca_1^{-1}, \ldots , ca_r^{-1}) \in \mathcal T\), and \(s = n^{-1}tn = {\text {diag}}(a'_1, \ldots , a'_r, {ca'_1}^{-1}, \ldots {ca'_r}^{-1}) \in \mathcal T\). Note from the proof of Lemma 2.2 that we have the same c. Then we have
and
For \(\ell \in \mathbb {N}\), we define
and
Lemma 2.3
We have \(\left| {V_w(\ell )} \right| = \left( {p^\ell \left( {1-p^{-1}}\right) }\right) ^r\).
Proof
For every \(\lambda \in \left( {(\mathbb {Z}/p^\ell \mathbb {Z})^\times }\right) ^\Delta \), we can find \(t\in \mathcal T\) such that \(\kappa _i(t*x) = \lambda _i\kappa _i(x)\) for \(1\le i\le r\). Using (2.2), we find a unique \(\lambda ' \in \left( {(\mathbb {Z}/p^\ell \mathbb {Z})^\times }\right) ^{\Delta _w}\) such that \(\kappa '_i(t*x) = \lambda '_i\kappa _i(x)\) for \(1\le j\le r\) with \(\alpha _j\in \Delta _w\), and it is straightforward to check that \(\lambda '\) is independent of the choice of \(t\in \mathcal T\). Therefore
as claimed.\(\square \)
For a character \(\theta : A_w(\ell ) \rightarrow \mathbb {C}^\times \), we define
Theorem 2.4
Let \(n\in N\left( {\mathbb {Q}_p}\right) \), and suppose \(\ell \) is large enough such that the matrix entries of \(u(x), u'(x)\) lie in \(p^{-\ell }\mathbb {Z}_p / \mathbb {Z}_p\) for every \(x\in X(n)\). Let \(\psi = \psi _{n_1, \ldots , n_r}: U\left( {\mathbb {Q}_p}\right) /U\left( {\mathbb {Z}_p}\right) \rightarrow \mathbb {C}^\times \) and \(\psi ' = \psi _{n'_1, \ldots , n'_r}|_{U_n\left( {\mathbb {Q}_p}\right) }: U_n\left( {\mathbb {Q}_p}\right) / U_n\left( {\mathbb {Z}_p}\right) \rightarrow \mathbb {C}^\times \) be characters. Define the character \(\theta _x: A_w(\ell ) \rightarrow \mathbb {C}^\times \) by
Then
where \(\mathfrak N(x) = \left| {\mathcal T*x} \right| \) is the size of \(\mathcal T\)-orbit of \(x\in X(n)\).
Proof
We rewrite the Kloosterman sum
For \((\lambda ,\lambda ')\in V_w(\ell )\), we can find \(t\in \mathcal T\) such that \(\kappa _i(t*x) = \lambda _i\kappa _i(x)\), \(\kappa '_j(t*x) = \lambda '_i\kappa '_j(x)\) for \(x\in X(n)\), \(1\le i,j\le r\), \(w(\alpha _j)<0\). Hence
Summing over \(V_w(\ell )\), we have
Dividing both sides by \(\left| {V_w(\ell )} \right| \) yields the statement.
3 \({\text {Sp}}(4)\) Kloosterman sums
Now we give explicit formulations for Kloosterman sums for \(G = {\text {Sp}}\left( {4, \mathbb {Q}_p}\right) \), classified by the image w(n) of the projection onto W. Fix \(\psi = \psi _{m_1, m_2}\), \(\psi ' = \psi _{n_1, n_2}\), with \(\psi _{m_1, m_2}\), \(\psi _{n_1, n_2}\) as in (1.1).
Proposition 3.1
[14, Theorem 3.2] Let \(n\in N\left( {\mathbb {Q}_p}\right) \), and \(\psi : U\left( {\mathbb {Q}_p}\right) /U\left( {\mathbb {Z}_p}\right) \rightarrow \mathbb {C}^\times \), \(\psi ': U_n\left( {\mathbb {Q}_p}\right) / U_n\left( {\mathbb {Z}_p}\right) \rightarrow \mathbb {C}^\times \) be characters. If \(t\in T\left( {\mathbb {Z}_p^\times }\right) \), then
where \(\psi _t (u) = \psi (tut^{-1})\).
By Proposition 3.1, it suffices to consider Kloosterman sums \({\text {Kl}}_p\left( {n, \psi , \psi '}\right) \) for n such that entries of n are powers of p, and X(n) is nonempty. To express the Kloosterman sums, we express the coset representatives for X(n) in terms of Plücker coordinates, which were introduced in [3, 12]. For \(g = (g_{ij}) \in G = {\text {Sp}}(4,\mathbb {Q}_p)\), we define Plücker coordinates
The Plücker coordinates satisfy the following relations:
Hence, we can associate to every \(g\in G(\mathbb {Q}_p)\) its Plücker coordinates
It follows from the definition that if \(g\in G(\mathbb {Z}_p)\), then the corresponding Plücker coordinates \(v_g\) are integral, and satisfy the coprimality conditions
It is proved in [12] that there is a bijection
In particular, this means the coset representatives for X(n) can be described using Plücker coordinates. For notational convenience, for \(n\in N(\mathbb {Q}_p)\) we write \(X^v(n)\) for a complete system of coset representatives of X(n), in terms of Plücker coordinates.
Now we give explicit formulations for Kloosterman sums \({\text {Kl}}_p(n,\psi ,\psi ')\). Note that by Proposition 3.1, it suffices to consider the case \(n = n_{w,r,s}\), where \(n_{w,r,s}\) is given as in (1.2). By looking at the Plücker coordinates, one deduces that \(X(n_{w,r,s})\) is nonempty only if \(r,s\ge 0\). Explicit formulations for \(X^v(n_{w,r,s})\) are obtained by unfolding the conditions (3.1) and (3.2), and are given in [12], and we shall use the results from there directly.
-
(i)
\(n = n_{{\text {id}},r,s}\). Then \(X(n_{{\text {id}},r,s})\) is empty unless \(r=s=0\), where \(X(n_{{\text {id}},0,0}) = \left\{ {I_4}\right\} \) is a singleton. So the Kloosterman sum is trivial:
$$\begin{aligned}\begin{aligned} {\text {Kl}}_p\left( {n_{{\text {id}},0,0}, \psi , \psi '}\right) = 1. \end{aligned}\end{aligned}$$ -
(ii)
\(n = n_{s_\alpha ,r,s}\). Then \(X(n_{s_\alpha ,r,s})\) is nonempty when \(s=0\). In this case we have
$$\begin{aligned}\begin{aligned} X^v(n_{s_\alpha ,r,0}) = \left\{ {(0,0,v_3,p^r;0,0,0,0,0,1)}\right\} , \end{aligned}\end{aligned}$$where \(v_3 \pmod {p^r}\) such that \((v_3,p^r) = 1\). The corresponding Kloosterman sum is actually a \({\text {GL}}(2)\) Kloosterman sum:
$$\begin{aligned}\begin{aligned} {\text {Kl}}_p\left( {n_{s_\alpha ,r,0}, \psi , \psi '}\right) = S\left( {m_1, n_1; p^r}\right) . \end{aligned}\end{aligned}$$ -
(iii)
\(n = n_{s_\beta ,r,s}\). Then \(X(n_{s_\beta ,r,s})\) is nonempty when \(r=0\). In this case we have
$$\begin{aligned}\begin{aligned} X^v(n_{s_\beta ,0,s}) = \left\{ {(0,0,1,0;0,0,0,p^s,0,v_{34})}\right\} , \end{aligned}\end{aligned}$$where \(v_{34}\pmod {p^s}\) such that \((v_{34},p^s) = 1\). The corresponding Kloosterman sum is actually a \({\text {GL}}(2)\) Kloosterman sum:
$$\begin{aligned}\begin{aligned} {\text {Kl}}_p\left( {n_{s_\beta ,0,s}, \psi , \psi '}\right) = S\left( {m_2, n_2; p^s}\right) . \end{aligned}\end{aligned}$$ -
(iv)
\(n = n_{s_\alpha s_\beta , r,s}\). Then \(X(n_{s_\alpha s_\beta ,r,s})\) is nonempty when \(r\ge s\). Unfolding the conditions (3.1) and (3.2), we compute
$$\begin{aligned}\begin{aligned} X^v(n_{s_\alpha s_\beta ,r,s}) = \left\{ {(0,p^r,v_3,v_4;0,0,0,p^s,0,-v_4p^{s-r})}\right\} , \end{aligned}\end{aligned}$$where \(v_3,v_4\pmod {p^r}\) such that \((v_4,p^r) = p^{r-s}\) and \((v_3,p^{r-s}) = 1\). We write \(v_4 = v'_4 p^{r-s}\), so \((v'_4,p^s) = 1\). Bruhat decomposition gives
$$\begin{aligned}\begin{aligned} x =&\begin{pmatrix} 1 &{} \beta _1 &{} \beta _2 &{} \beta _3\\ &{} 1 &{} \beta _4 &{} \beta _5\\ &{}&{}1\\ &{}&{}-\beta _1 &{} 1\end{pmatrix} \begin{pmatrix} &{}&{}&{}-p^{-r}\\ p^{r-s}\\ &{}p^r\\ &{}&{}p^{s-r}\end{pmatrix} \begin{pmatrix} 1 &{}&{}&{} v_3p^{-r}\\ &{}1&{}v_3p^{-r}&{}v'_4 p^{-s}\\ &{}&{}1\\ &{}&{}&{}1\end{pmatrix}\\ =&\begin{pmatrix} \beta _1 p^{r-s} &{} \beta _2p^r &{} \beta _2v_3 + \beta _3 p^{s-r} &{} \beta _2 v'_4 p^{r-s} + \beta _1 v_3 p^{-s} - p^{-r}\\ p^{r-s} &{} \beta _4 p^r &{} \beta _4 v_3 + \beta _5 p^{s-r} &{} \beta _4 v'_4 p^{r-s} + v_3 p^{-s}\\ 0 &{} p^r &{} v_3 &{} v'_4 p^{r-s}\\ 0 &{} -\beta _1 p^r &{} -\beta _1 v_3 + p^{s-r} &{} -\beta _1 v'_4 p^{r-s}\end{pmatrix}. \end{aligned}\end{aligned}$$The entry \(-\beta _1 v_3 + p^{s-r}\) being an integer says \(\beta _1 \equiv \overline{v_3} p^{s-r} \pmod {1}\). The entry \(\beta _4 v'_4 p^{r-s} + v_3 p^{-s}\) being an integer says \(\beta _4 \equiv -\overline{v'_4} v_3 p^{-r} \pmod {p^{s-r}}\). Write \(\beta _4 = -\overline{v'_4} v_3 p^{-r} + \gamma _4 p^{s-r}\) for some \(\gamma _4\in \mathbb {Z}\). The entry \(\beta _4 v_3 + \beta _5 p^{s-r}\) being an integer says \(\gamma _4 v_3 + \beta _5 \equiv \overline{v'_4} v_3^2 p^{-s} \pmod {p^{r-s}}\), hence \(\beta _5 \equiv \overline{v'_4} v_3^2 p^{-s} \pmod {1}\). After writing \(v_4\) for \(v'_4\), the Kloosterman sum is given by
$$\begin{aligned}\begin{aligned} {\text {Kl}}_p \left( {n_{s_\alpha s_\beta , r, s}, \psi , \psi '}\right) = \sum \limits _{\begin{array}{c} v_4 \pmod {p^s}\\ (v_4, p^s) = 1 \end{array}} \sum \limits _{\begin{array}{c} v_3 \pmod {p^r}\\ (v_3, p^{r-s}) = 1 \end{array}}{\text {e}}\left( {\frac{m_1\overline{v_3}p^s}{p^r}}\right) {\text {e}}\left( {\frac{m_2 \overline{v_4} v_3^2 + n_2 v_4}{p^s}}\right) . \end{aligned}\end{aligned}$$ -
(v)
\(n = n_{s_\beta s_\alpha ,r,s}\). Then \(X(n_{s_\beta s_\alpha ,r,s})\) is nonempty when \(s\ge 2r\). Unfolding the conditions (3.1) and (3.2), we compute
$$\begin{aligned}\begin{aligned} X^v(n_{s_\beta s_\alpha ,r,s}) = \left\{ {(0,0,-v_{24}p^{r-s},p^r;0,-v_{24},p^s,-v_{24}p^{-s},v_{24},v_{34})}\right\} , \end{aligned}\end{aligned}$$where \(v_{24}, v_{34}\pmod {p^s}\) such that \((v_{24},p^s) = p^{s-r}\) and \((v_{34}, p^{s-2r}) = 1\). We write \(v_{24} = v'_{24} p^{s-r}\), so \((v'_{24},p^r) = 1\). Bruhat decomposition gives
$$\begin{aligned}\begin{aligned} x =&\begin{pmatrix} 1 &{} \beta _1 &{} \beta _2 &{} \beta _3\\ &{} 1 &{} \beta _4 &{} \beta _5\\ &{}&{}1\\ &{}&{}-\beta _1 &{} 1\end{pmatrix} \begin{pmatrix} &{} p^{-r}\\ &{}&{} p^{r-s}\\ &{}&{}&{} p^r\\ -p^{s-r}\end{pmatrix} \begin{pmatrix} 1 &{} v'_{24} p^{-r} &{} v_{34} p^{-s}\\ &{}1\\ &{}&{}1\\ &{}&{}-v'_{24} p^{-r} &{} 1\end{pmatrix}\\ =&\begin{pmatrix} -\beta _3 p^{s-r} &{} -\beta _3 v'_{24} p^{s-2r} + p^{-r} &{} -\beta _2 v'_{24} - \beta _3 v_{34} p^{-r} + \beta _1 p^{r-s} &{} \beta _2 p^r\\ -\beta _5 p^{s-r} &{} -\beta _5 v'_{24} p^{s-2r} &{} -\beta _4 v'_{24} - \beta _5 v_{34} p^{-r} + p^{r-s} &{} \beta _4 p^r\\ 0 &{} 0 &{} -v'_{24} &{} p^r\\ -p^{s-r} &{} -v'_{24} p^{s-2r} &{} \beta _1 v'_{24} - v_{34} p^{-r} &{} -\beta _1 p^r\end{pmatrix}. \end{aligned}\end{aligned}$$The entry \(\beta _1 v'_{24} - v_{34} p^{-r}\) being an integer says \(\beta _1 \equiv \overline{v'_{24}} v_{34} p^{-r} \pmod {1}\). The entry \(\beta _4 p^r\) being an integer says \(\beta _4 = \beta '_4 p^{-r}\) for some \(\beta '_4\in \mathbb {Z}\). The entry \(-\beta _4 v'_{24} - \beta _5 v_{34} p^{-r} + p^{r-s}\) being an integer says \(\beta '_4 v'_{24} + \beta _5 v_{34} \equiv p^{2r-s} \pmod {p^r}\), hence \(\beta _5 \equiv \overline{v_{34}} p^{2r-s} \pmod {1}\). After writing \(v_{24}\) for \(v'_{24}\), the Kloosterman sum is given by
$$\begin{aligned}\begin{aligned}&{\text {Kl}}_p \left( {n_{s_\beta s_\alpha , r, s}, \psi , \psi '}\right) \\&\qquad = \sum \limits _{\begin{array}{c} v_{24} \pmod {p^r}\\ (v_{24}, p^r) = 1 \end{array}} \sum \limits _{\begin{array}{c} v_{34}\pmod {p^s}\\ (v_{34}, p^{s-2r}) = 1 \end{array}}{\text {e}}\left( {\frac{m_1 \overline{v_{24}} v_{34} + n_1 v_{24}}{p^r}}\right) {\text {e}}\left( {\frac{m_2 \overline{v_{34}} p^{2r}}{p^s}}\right) . \end{aligned} \end{aligned}$$Remark. This Kloosterman sum is related to a \({\text {GL}}(3)\) Kloosterman sum. Precisely, following the notation in [2, (4.3)], we have
$$\begin{aligned}\begin{aligned} {\text {Kl}}_p \left( {n_{s_\beta s_\alpha , r, s}, \psi , \psi '}\right) = p^r S\left( {n_1, m_1, m_2; p^r, p^{s-r}}\right) . \end{aligned}\end{aligned}$$A non-trivial bound for \({\text {Kl}}_p \left( {n_{s_\beta s_\alpha , r, s}, \psi , \psi '}\right) \) then follows from Larsen [2, Appendix].
-
(vi)
\(n=n_{s_\alpha s_\beta s_\alpha ,r,s}\). Then \(X(n_{s_\alpha s_\beta s_\alpha ,r,s})\) is nonempty when \(2r\ge s\). Unfolding the conditions (3.1) and (3.2), we compute
$$\begin{aligned}\begin{aligned}&X(n_{s_\alpha s_\beta s_\alpha , r,s}) \\&\qquad = \left\{ {\left( {p^r,v_2,v_3,v_4; 0, -v_2 p^{s-r}, p^s, -v_2^2 p^{s-2r}, v_2 p^{s-r}, (p^rv_3+v_2v_4)p^{s-2r}}\right) }\right\} , \end{aligned}\end{aligned}$$where \(v_2, v_3, v_4 \pmod {p^r}\), such that \((v_2,v_3,v_4,p^r) = 1\), and if \(d := (v_2, p^r)\), then \((d^2, p^rv_3+v_2v_4) = p^{2r-s}\). Let \(d = p^{r-a}\). Then a satisfies \(s-r\le a \le s/2\). We write \(v_2 = v'_2 p^{r-a}\), so \((v'_2, p^a) = 1\). Bruhat decomposition gives
$$\begin{aligned}\begin{aligned} x =&\begin{pmatrix} 1 &{} \beta _1 &{} \beta _2 &{} \beta _3\\ &{} 1 &{} \beta _4 &{} \beta _5\\ &{}&{}1\\ &{}&{}-\beta _1 &{} 1\end{pmatrix} \begin{pmatrix} &{}&{}-p^{-r}\\ {} &{}p^{r-s}\\ p^r\\ &{}&{}&{}p^{s-r}\end{pmatrix} \begin{pmatrix} 1 &{} v'_2 p^{-a} &{} v_3 p^{-r} &{} v_4 p^{-r}\\ &{} 1 &{} v_4 p^{-r} \\ &{}&{}1\\ &{}&{}-v'_2 p^{-a} &{} 1\end{pmatrix}\\ =&\begin{pmatrix} \beta _2 p^r &{} \beta _2 v'_2 p^{r-a} + \beta _1 p^{r-s} &{} \beta _2 v_3 - \beta _3 v'_2 p^{s-a-r} + \beta _1 v_4 p^{-s} - p^{-r} &{} \beta _2 v_4 + \beta _3 p^{s-r}\\ \beta _4 p^r &{} \beta _4 v'_2 p^{r-a} + p^{r-s} &{} \beta _4 v_3 - \beta _5 v'_2 p^{s-a-r} + v_4 p^{-s} &{} \beta _4 v_4 + \beta _5 p^{s-r}\\ p^r &{} v'_2 p^{r-a} &{} v_3 &{} v_4\\ -\beta _1 p^r &{} -\beta _1 v'_2 p^{r-a} &{} -\beta _1 v_3 - v'_2 p^{s-a-r} &{} -\beta _1 v_4 + p^{s-r}\end{pmatrix}. \end{aligned}\end{aligned}$$The entry \(-\beta _1 v'_2 p^{r-a}\) being an integer says \(\beta _1 = \beta '_1 p^{a-r}\) for some \(\beta '_1 \in \mathbb {Z}\). Entries \(-\beta _1 v_3 - v'_2 p^{s-a-r}\) and \(-\beta _1 v_4 + p^{s-r}\) being integers says
$$\begin{aligned} \beta '_1 v_3&\equiv -v'_2 p^{s-2a} \pmod {p^{r-a}},&\beta '_1 v_4&\equiv p^{s-a} \pmod {p^{r-a}}. \end{aligned}$$(3.3)As \(\left( {v_3, v_4, p^{r-a}}\right) = 1\), these equations determine \(\beta _1\) uniquely modulo 1. The entry \(\beta _4 v'_2 p^{r-a} + p^{r-s}\) being an integer says \(\beta _4 \equiv -\overline{v'_2} p^{a-s} \pmod {p^{a-r}}\). Write \(\beta _4 = -\overline{v'_2} p^{a-s} + \gamma _4 p^{a-r}\) for some \(\gamma _4 \in \mathbb {Z}\). Then \(\beta _4 v_3 - \beta _5 v'_2 p^{s-a-r} + v_4 p^{-s}\) being an integer says
$$\begin{aligned} -\overline{v'_2} v_3 p^a + \gamma _4 v_3 p^{s+a-r} - \beta _5 v'_2 p^{2s-a-r} + v_4 \equiv 0 \pmod {p^s}. \end{aligned}$$(3.4)Write \(\beta _5 = \beta '_5 p^{a+r-2s}\) for some \(\beta '_5 \in \mathbb {Z}\). Then we solve
$$\begin{aligned} \beta '_5 \equiv -\overline{v'_2}^2 v_3 p^a + \gamma _4 \overline{v'_2} v_3 p^{s+a-r} + \overline{v'_2} v_4 \pmod {p^s}. \end{aligned}$$(3.5)Then \(\beta _4 v_4 + \beta _5 p^{s-r}\) being an integer says
$$\begin{aligned} \gamma _4 \left( {p^a v_3 + v'_2 v_4}\right) p^{s+a-r} \equiv v_3 p^{2a} \pmod {p^s}. \end{aligned}$$(3.6)Recall that \(\left( {p^{r-a}, p^a v_3 + v'_2 v_4}\right) = p^{r+a-s}\). Hence, unless \(a = \frac{s}{2}\), we can write \(p^a v_3 + v'_2 v_4 = V' p^{r+a-s}\), with \((V',p) = 1\). Then we solve (3.6):
$$\begin{aligned}\begin{aligned} \gamma _4 \equiv \overline{V'} v_3 \pmod {p^{s-2a}}. \end{aligned}\end{aligned}$$Putting back to (3.5) gives
$$\begin{aligned}\begin{aligned} \beta '_5 \equiv -\overline{v'_2}^2 v_3 p^a + \overline{V' v'_2} v_3^2 p^{s+a-r} + \overline{v'_2} v_4 \pmod {p^{2s-a-r}}, \end{aligned}\end{aligned}$$hence \(\beta _5\) is uniquely determined modulo 1. When \(a=\frac{s}{2}\), \(\gamma _4\) can be arbitrary, and we have
$$\begin{aligned}\begin{aligned} \beta '_5 \equiv -\overline{v'_2}^2 v_3 p^a + \overline{v'_2} v_4 \pmod {p^{2s-a-r}}, \end{aligned}\end{aligned}$$hence \(\beta _5\) is also uniquely determined modulo 1 in this case. So, after writing u for \(\beta _5 p^s\), the Kloosterman sum is given by
$$\begin{aligned}\begin{aligned}&{\text {Kl}}_p\left( {n_{s_\alpha s_\beta s_\alpha , r, s}, \psi , \psi '}\right) \\&\qquad = \sum \limits _{s-r \le a \le s/2}\sum \limits _{\begin{array}{c} v_2, v_3, v_4 \pmod {p^r}\\ v_2 = v'_2 p^{r-a}, \; (v'_2, p^a) = 1\\ (v_3, v_4, p^{r-a}) = 1\\ \left( {p^{r-a}, p^a v_3 + v'_2 v_4}\right) = p^{r+a-s} \end{array}}{\text {e}}\left( {\frac{m_1\hat{v}_2 + n_1v_2}{p^r}}\right) {\text {e}}\left( {\frac{m_2 u}{p^s}}\right) , \end{aligned}\end{aligned}$$where \(\hat{v}_2\) is chosen modulo \(p^r\) such that
$$\begin{aligned} \hat{v}_2 v_3&\equiv -v'_2 p^{s-a} \pmod {p^r},&\hat{v}_2 v_4 \equiv p^s \pmod {p^r}, \end{aligned}$$(3.7)and
$$\begin{aligned} u \equiv {\left\{ \begin{array}{ll} -\overline{v'_2}^2 v_3 p^{2a+r-s} + \overline{V' v'_2} v_3^2 p^{2a} + \overline{v'_2} v_4 p^{a+r-s} \pmod {p^s} &{} \text {if } a<\frac{s}{2},\\ -\overline{v'_2}^2 v_3 p^{2a+r-s} + \overline{v'_2} v_4p^{a+r-s} \pmod {p^s} &{} \text {if } a=\frac{s}{2},\end{array}\right. } \end{aligned}$$(3.8)where \(V' = p^{s-r-a} \left( {p^a v_3 + v'_2 v_4}\right) \).
-
(vii)
\(n = n_{s_\beta s_\alpha s_\beta ,r,s}\). Then \(X(n_{s_\beta s_\alpha s_\beta ,r,s})\) is nonempty when \(s\ge r\). Unfolding the conditions (3.1) and (3.2), we compute
$$\begin{aligned}\begin{aligned}&X(n_{s_\beta s_\alpha s_\beta , r,s}) \\&\qquad = \left\{ {\left( {0,p^r, v_{13} p^{r-s}, v_{14} p^{r-s}; p^s, v_{13}, v_{14}, v_{23}, -v_{13}, -(v_{13}^2+v_{14}v_{23}) p^{-s}}\right) }\right\} , \end{aligned}\end{aligned}$$where \(v_{13}, v_{14}, v_{23} \pmod {p^s}\), such that \((v_{13}, v_{14}, p^s) = p^{s-r}\), \((v_{14}, p^s) \mid v_{13}^2\), and \((p^{s-r}, v_{23}, v_{34}) = 1\). Recall that \(v_{34} = -(v_{13}^2+v_{14}v_{23})p^{-s}\). We write \(v_{13} = v'_{13} p^{s-r}\), \(v_{14} = v'_{14} p^{s-r}\), so \((v'_{13}, v'_{14}, p^r) = 1\). Bruhat decomposition gives
$$\begin{aligned} \begin{aligned} x =&\begin{pmatrix} 1 &{} \beta _1 &{} \beta _2 &{} \beta _3\\ &{} 1 &{} \beta _4 &{} \beta _5\\ &{}&{}1\\ &{}&{}-\beta _1 &{} 1\end{pmatrix} \begin{pmatrix} &{}&{}&{} -p^{-r}\\ &{}&{}p^{r-s}\\ &{}p^r\\ -p^{s-r}\end{pmatrix} \begin{pmatrix} 1 &{}&{} -v_{23} p^{-s} &{} v'_{13} p^{-r}\\ &{}1&{} v'_{13} p^{-r} &{} v'_{14} p^{-r}\\ &{}&{}1\\ &{}&{}&{}1\end{pmatrix}\\ =&\begin{pmatrix} -\beta _3 p^{s-r} &{} \beta _2 p^r &{} \beta _2 v'_{13} + \beta _1 p^{r-s} + \beta _3 v_{23} p^{-r} &{} \beta _2 v'_{14} - \beta _3 v'_{13} p^{s-2r} - p^{-r}\\ -\beta _5 p^{s-r} &{} \beta _4 p^r &{} \beta _4 v'_{13} + \beta _5 v_{23} p^{-r} + p^{r-s} &{} \beta _4 v'_{14} - \beta _5 v'_{13} p^{s-2r}\\ 0 &{} p^r &{} v'_{13} &{} v'_{14}\\ -p^{s-r} &{} -\beta _1 p^r &{} -\beta _1 v'_{13} + v_{23} p^{-r} &{} -\beta _1 v'_{14} - v'_{13} p^{s-2r}\end{pmatrix}. \end{aligned}\end{aligned}$$The entry \(-\beta _1 p^r\) being an integer says \(\beta _1 = \beta '_1 p^{-r}\) for \(\beta '_1 \in \mathbb {Z}\). Entries \(-\beta _1 v'_{13} + v_{23} p^{-r}\) and \(-\beta _1 v'_{14} - v'_{13} p^{s-2r}\) being integers says
$$\begin{aligned} \beta '_1 v'_{13}&\equiv v_{23} \pmod {p^r},&\beta '_1 v'_{14}&\equiv - v'_{13} p^{s-r} \pmod {p^r}. \end{aligned}$$(3.9)As \((v'_{13}, v'_{14},p^r)=1\), this determines \(\beta _1\) uniquely modulo 1. Entries \(\beta _4 p^r\) and \(-\beta _5 p^{s-r}\) being integers says \(\beta _4 = \beta '_4 p^{-r}\) and \(\beta _5 = \beta '_5 p^{r-s}\) for some \(\beta '_4, \beta '_5 \in \mathbb {Z}\). The entry \(\beta _4 v'_{13} + \beta _5 v_{23} p^{-r} + p^{r-s}\) being an integer says
$$\begin{aligned} \beta '_4 v'_{13} p^{s-r} + \beta '_5 v_{23} + p^r \equiv 0 \pmod {p^s}, \end{aligned}$$(3.10)which implies
$$\begin{aligned} \beta '_5 v_{23} \equiv -p^r \pmod {p^{s-r}}. \end{aligned}$$(3.11)The entry \(\beta _4 v'_{14} - \beta _5 v'_{13} p^{s-2r}\) being an integer says
$$\begin{aligned} \beta '_4 v'_{14} p^{s-r} - \beta '_5 v'_{13} p^{s-r} \equiv 0 \pmod {p^s}. \end{aligned}$$(3.12)Then, \(v'_{13}\) times (3.12) minus \(v'_{14}\) times (3.10) gives
$$\begin{aligned} \beta '_5 (- {v'_{13}}^2 p^{s-r} - v'_{14} v_{23})&\equiv p^r v'_{14} \pmod {p^s},\nonumber \\ \beta '_5 p^r v_{34}&\equiv p^r v'_{14} \pmod {p^s},\nonumber \\ \beta '_5 v_{34}&\equiv v'_{14} \pmod {p^{s-r}}. \end{aligned}$$(3.13)As \(\left( {p^{s-r}, v_{23}, v_{34}}\right) = 1\), (3.11) and (3.13) determine \(\beta _5\) uniquely modulo 1. So, after writing u for \(\beta _1 p^r\) and \(\hat{v}_{14}\) for \(\beta _5 p^s\), the Kloosterman sum is given by
$$\begin{aligned}\begin{aligned} {\text {Kl}}_p\left( {n_{s_\beta s_\alpha s_\beta , r, s}, \psi , \psi '}\right) = \sum \limits _{\begin{array}{c} v_{13}, v_{14}, v_{23} \pmod {p^s}\\ \left( {p^s, v_{13}, v_{14}}\right) = p^{s-r}\\ \left( {p^s, v_{14}}\right) \mid v_{13}^2\\ \left( {p^{s-r}, v_{23}, v_{34}}\right) = 1 \end{array}}{\text {e}}\left( {\frac{m_1 u}{p^r}}\right) {\text {e}}\left( {\frac{m_2 \hat{v}_{14} + n_2 v_{14}}{p^s}}\right) , \end{aligned}\end{aligned}$$where u is chosen modulo \(p^r\) such that
$$\begin{aligned} u v_{13} p^{r-s}&\equiv v_{23} \pmod {p^r},&u v_{14} p^{r-s}&\equiv - v_{13} \pmod {p^r}, \end{aligned}$$(3.14)and \(\hat{v}_{14}\) is chosen modulo \(p^s\) such that
$$\begin{aligned} \hat{v}_{14} v_{23}&\equiv -p^{2r} \pmod {p^s},&\hat{v}_{14} v_{34}&\equiv v_{14} p^{2r-s} \pmod {p^s}. \end{aligned}$$(3.15) -
(viii)
\(n=n_{w_0,r,s}\). Then \(X(n_{w_0,r,s})\) is nonempty whenever \(r,s\ge 0\). Unfolding the conditions (3.1) and (3.2), we compute
$$\begin{aligned}\begin{aligned} X(n_{w_0,r,s}) = \left\{ {\left( {p^r,v_2,v_3,v_4;p^s,v_{13},v_{14},(v_2v_{13}-v_3p^s)p^{-r},-v_{13},(v_3v_{14}-v_4v_{13})p^{-r}}\right) }\right\} , \end{aligned}\end{aligned}$$where \(v_2,v_3,v_4\pmod {p^r}\), \(v_{13},v_{14}\pmod {p^s}\), such that \(v_{13}p^r+v_2v_{14}-v_4p^s = 0\), \((v_2,v_3,v_4,p^r)=1\), and \((v_{13},v_{14},v_{23},v_{34},p^s)=1\). Recall that
$$\begin{aligned}\begin{aligned} v_{23} = (v_2v_{13}-v_3p^s)p^{-r},&\quad v_{34} = (v_3v_{14}-v_4v_{13})p^{-r}. \end{aligned}\end{aligned}$$Bruhat decomposition gives
$$\begin{aligned} \begin{aligned} x =&\begin{pmatrix} 1 &{} \beta _1 &{} \beta _2 &{} \beta _3\\ &{} 1 &{} \beta _4 &{} \beta _5\\ &{}&{}1\\ &{}&{}-\beta _1 &{} 1\end{pmatrix} \begin{pmatrix} &{}&{} -p^{-r}\\ &{}&{}&{}-p^{r-s}\\ p^r\\ &{}p^{s-r}\end{pmatrix} \begin{pmatrix} 1 &{} v_2 p^{-r} &{} v_3 p^{-r} &{} v_4 p^{-r}\\ &{} 1 &{} v_{13} p^{-s} &{} v_{14} p^{-s}\\ &{}&{}1\\ &{}&{}-v_2 p^{-r} &{} 1\end{pmatrix}\\ =&\displaystyle \begin{pmatrix} \beta _2 p^r &{} \beta _2 v_2 + \beta _3 p^{s-r} &{} \beta _2 v_3 + \beta _3 v_{13} p^{-r} + \beta _1 v_2 p^{-s} - p^{-r} &{} \beta _2 v_4 - \beta _1 p^{r-s} + \beta _3 v_{14} p^{-r}\\ \beta _4 p^r &{} \beta _4 v_2 + \beta _5 p^{s-r} &{} \beta _4 v_3 + \beta _5 v_{13} p^{-r} + v_2 p^{-s} &{} \beta _4 v_4 + \beta _5 v_{14} p^{-r} - p^{r-s}\\ p^r &{} v_2 &{} v_3 &{} v_4\\ -\beta _1 p^r &{} -\beta _1 v_2 + p^{s-r} &{} -\beta _1 v_3 + v_{13} p^{-r} &{} -\beta _1 v_4 + v_{14} p^{-r}\end{pmatrix}. \end{aligned}\end{aligned}$$The entry \(-\beta _1 p^r\) being an integer says \(\beta _1 = \beta '_1 p^{-r}\) for some \(\beta '_1\in \mathbb {Z}\). The last row of \(\gamma \) being integral gives
$$\begin{aligned} \beta '_1 v_2&\equiv p^s \pmod {p^r},&\beta '_1 v_3&\equiv v_{13} \pmod {p^r},&\beta '_1 v_4&\equiv v_{14} \pmod {p^r}. \end{aligned}$$(3.16)As \(\left( {p^r, v_2, v_3, v_4}\right) = 1\), these equations determine \(\beta _1\) uniquely modulo 1. The entry \(\beta _4 p^r\) being an integer says \(\beta _4 = \beta '_4 p^{-r}\) for some \(\beta '_4 \in \mathbb {Z}\). Then \(\beta _4 v_2 + \beta _5 p^{s-r}\) being an integer says
$$\begin{aligned} \beta '_4 v_2 + \beta _5 p^s \equiv 0 \pmod {p^r}. \end{aligned}$$(3.17)In particular, this means \(\beta _5 = \beta '_5 p^{-s}\) for some \(\beta '_5 \in \mathbb {Z}\). The entries \(\beta _4 v_3 + \beta _5 v_{13} p^{-r} + v_2 p^{-s}\) and \(\beta _4 v_4 + \beta _5 v_{14} p^{-r} - p^{r-s}\) being integers says
$$\begin{aligned} \beta '_4 v_3 p^s + \beta '_5 v_{13} + v_2 p^r&\equiv 0 \pmod {p^{r+s}}, \end{aligned}$$(3.18)$$\begin{aligned} \beta '_4 v_4 p^s + \beta '_5 v_{14} - p^{2r}&\equiv 0 \pmod {p^{r+s}} . \end{aligned}$$(3.19)In particular we deduce
$$\begin{aligned} \beta '_5 v_{13} + v_2 p^r&\equiv 0 \pmod {p^s}, \end{aligned}$$(3.20)$$\begin{aligned} \beta '_5 v_{14} - p^{2r}&\equiv 0 \pmod {p^s}. \end{aligned}$$(3.21)Then, \(v_2\) times (3.18) minus \(v_3 p^s\) times (3.17) gives
$$\begin{aligned} \beta '_5 \left( {v_2v_{13} - v_3p^s}\right) + v_2^2 p^r \equiv 0 \pmod {p^{r+s}}, \end{aligned}$$which implies
$$\begin{aligned} \beta '_5 v_{23} + v_2^2 \equiv 0 \pmod {p^s}. \end{aligned}$$(3.22)Similarly, \(v_3\) times (3.18) minus \(v_4\) times (3.19) gives
$$\begin{aligned}&\beta '_5 \left( {v_3v_{14}-v_4v_{13}}\right) - p^r\left( {v_3 p^r + v_2v_4}\right) \equiv 0 \pmod {p^{r+s}}, \end{aligned}$$which implies
$$\begin{aligned} \beta '_5 v_{34} \equiv \left( {v_3 p^r + v_2 v_4}\right) \pmod {p^s}. \end{aligned}$$(3.23)In summary, \(\beta '_5\) satisfies the following equations:
$$\begin{aligned}\begin{aligned} \beta '_5 v_{13}&\equiv - v_2 p^r \pmod {p^s},&\beta '_5 v_{14}&\equiv p^{2r} \pmod {p^s},\\ \beta '_5 v_{23}&\equiv -v_2^2 \pmod {p^s},&\beta '_5 v_{34}&\equiv v_3 p^r + v_2v_4 \pmod {p^s}. \end{aligned}\end{aligned}$$As \(\left( {p^s, v_{13}, v_{14}, v_{23}, v_{34}}\right) = 1\), these equations determine \(\beta _5\) uniquely modulo 1. So, after writing \(\hat{v}_2\) for \(\beta _1 p^r\) and \(\hat{v}_{14}\) for \(\beta _2 p^s\), the Kloosterman sum is given by
$$\begin{aligned}\begin{aligned} {\text {Kl}}_p \left( {n_{w_0, r, s}, \psi , \psi '}\right) = \sum \limits _{\begin{array}{c} v_2, v_3, v_4\pmod {p^r}\\ v_{13}, v_{14}\pmod {p^s}\\ v_{13}p^r + v_2v_{14} - v_4p^s = 0\\ (p^r, v_2, v_3, v_4) = 1\\ (p^s, v_{13}, v_{14}, v_{23}, v_{34}) = 1 \end{array}}{\text {e}}\left( {\frac{m_1\hat{v}_2 + n_1 v_2}{p^r}}\right) {\text {e}}\left( {\frac{m_2\hat{v}_{14} + n_2 v_{14}}{p^s}}\right) , \end{aligned}\end{aligned}$$where \(\hat{v}_2\) is chosen modulo \(p^r\) such that
$$\begin{aligned} \hat{v}_2 v_2 \equiv p^s \pmod {p^r}, \quad \hat{v}_2 v_3 \equiv v_{13} \pmod {p^r}, \quad \hat{v}_2 v_4 \equiv v_{14} \pmod {p^r}; \end{aligned}$$(3.24)and \(\hat{v}_{14}\) chosen modulo \(p^s\) such that
$$\begin{aligned} \begin{aligned} \hat{v}_{14}v_{13}&\equiv -v_2p^r \pmod {p^s},&\hat{v}_{14}v_{14}&\equiv p^{2r} \pmod {p^s},\\ \hat{v}_{14}v_{23}&\equiv -v_2^2 \pmod {p^s},&\hat{v}_{14} v_{34}&\equiv v_3p^r+v_2v_4\pmod {p^s}. \end{aligned} \end{aligned}$$(3.25)
Now we give a few reduction formulae for Kloosterman sums, which are straightforward to prove.
Proposition 3.2
Let \(\psi = \psi _{m_1, m_2}\), \(\psi ' = \psi _{n_1, n_2}\). Then
We end the section by proving that the Kloosterman sum attached to the long Weyl element \(w_0\) is symmetric with respect to characters \(\psi , \psi '\). Note that this holds for \(G = {\text {Sp}}(2r)\) in general.
Proposition 3.3
Let \(G = {\text {Sp}}(2r,\mathbb {Q}_p)\), and \(n\in N\left( {\mathbb {Q}_p}\right) \), such that \(w(n) = w_0\) is the long Weyl element. Let \(\psi , \psi ': U\left( {\mathbb {Q}_p}\right) / U\left( {\mathbb {Z}_p}\right) \rightarrow \mathbb {C}^\times \) be characters. Then
Proof
The definition of Kloosterman sums reads
The key idea of the proof is to find a bijection \(X(n) \rightarrow X(n)\), \(x\mapsto \tilde{x}\) such that \(\psi (u(\tilde{x})) = \psi (u'(x))\) and \(\psi '(u'(\tilde{x})) = \psi '(u(x))\). Since \(w(n) = w_0\) is the long Weyl element, \(n\in N(\mathbb {Q}_p)\) is of the form
Let \(x\in X(n)\), and suppose
Then we have
Now set
where
It is straightforward to verify that \(\tilde{u}, \tilde{u}' \in U(\mathbb {Q}_p)\). Now set
Now observe
and similarly
Hence \(\tilde{x} \in G(\mathbb {Z}_p)\). Moreover, we may directly verify that \(\alpha _i(\tilde{u}) = \alpha _i(u')\), \(\alpha _i(\tilde{u}') = \alpha _i(u)\) for \(1\le i\le r\). So \(\psi (u(\tilde{x})) = \psi (u'(x))\) and \(\psi '(u'(\tilde{x})) = \psi '(u(x))\). Finally, using the bijection \(X(n) \rightarrow X(n)\), \(x\mapsto \tilde{x}\), we deduce that
\(\square \)
4 Bounds for \({\text {Sp}}(4)\) Kloosterman sums
Fix \(\psi = \psi _{m_1,m_2}\), \(\psi ' = \psi _{n_1,n_2}\) as in (1.1). We first establish non-trivial bounds for local Kloosterman sums \({\text {Kl}}_p\left( {n_{w,r,s}, \psi , \psi '}\right) \), that is, prove Theorem 1.1.
We start with the local bounds. For \({\text {Kl}}_p(n_{{\text {id}},0,0},\psi ,\psi ')\), there is nothing to prove. Meanwhile, \({\text {Kl}}_p\left( {n_{s_\alpha ,r,0}, \psi , \psi '}\right) \) and \({\text {Kl}}_p\left( {n_{s_\beta ,0,s}, \psi , \psi '}\right) \) are just \({\text {GL}}(2)\) Kloosterman sums. A well-known bound for \({\text {GL}}(2)\) Kloosterman sums is given by [13]
So
as claimed.
4.1 Bound for \({\text {Kl}}_p\left( {n_{s_\alpha s_\beta , r, s}, \psi , \psi '}\right) \)
We recall
Without loss of generality, we assume \({\text {ord}}_p(m_1) \le r-s\), and \({\text {ord}}_p(m_2), {\text {ord}}_p(n_2) \le s\). Observe that
whenever \(p^k \mid \left( {m_1, p^{r-s}}\right) \) and \(p^l \mid \left( {m_2, n_2, p^s}\right) \). So we may assume \(s=0\), \(r=s\), or \(p\not \mid m_1\left( {m_2, n_2}\right) \).
If \(s=0\), then
If \(r=s\), then
is just a summation of quadratic Gauss sums, and is easily evaluated.
Now suppose \(p \not \mid m_1 \left( {m_2, n_2}\right) \). If \(p \mid m_2\) and \(s>1\), then
Now suppose \(p \mid m_2\) and \(s=1\). We may also assume \(r\ge 2\). Then
The same argument shows that the bound also holds when \(p\mid n_2\). Now assume \(p\not \mid m_1m_2n_2\). When p is odd, the same argument shows that the sum is zero unless \(r=2s\); when \(p=2\), the sum is zero unless \(r=2s\) or \(r=2s-1\).
We first consider the case \(r=2s\). When \(s=1\), we have
We apply a theorem of Adolphson and Sperber on exponential sums of Laurent polynomials. Let \(k = \mathbb {F}_q\) be a finite field of characteristic p. Let
be a Laurent polynomial in n variables. We assume that \(a_j \ne 0\) for all \(j\in J\). Let \(\Psi \) be a nontrivial additive character of k, we set
The Newton polyhedron of f, denoted by \(\Delta (f)\), is the convex hull in \(\mathbb {R}^n\) of the set \(J \cup \left\{ {(0,\ldots , 0)}\right\} \). We denote by V(f) the volume of \(\Delta (f)\) with respect to the Lebesgue measure on \(\mathbb {R}^n\). For a face \(\sigma \) (of any dimension) of \(\Delta (f)\), we set
We say that f is non-degenerate with respect to \(\Delta (f)\) if for every face \(\sigma \) of \(\Delta (f)\) that does not contain the origin, the polynomials
have no common zeroes in \((\overline{k}^\times )^n\), where \(\overline{k}\) denotes an algebraic closure of k. Then we have the following estimate.
Theorem 4.1
[1, Corollary 4.3] Given an n-dimensional integral polyhedron \(\Delta \) in \(\mathbb {R}^n\), there is a set \(\mathscr {S}_\Delta \) consisting of all but finitely many prime numbers, such that if \({\text {char}}(k) \in \mathscr {S}_\Delta \), and
is a non-degenerate Laurent polynomial with \(\Delta (f) = \Delta \), then \(\left| {S^*(f)} \right| \le n! V(f) q^{n/2}\). Moreover, when \(n=2\), the restriction on \({\text {char}}(k)\) can be removed.
Now set \(f(x,y) = \frac{m_1}{x}+\frac{m_2x^2}{y}+n_2y \in \mathbb {F}_p[x,y,(xy)^{-1}]\), and \(\Psi :\mathbb {F}_p\rightarrow \mathbb {C}^\times \) the standard additive character on \(\mathbb {F}_p\). Then \(p S^*(f) = {\text {Kl}}_p(n_{s_\alpha s_\beta ,2,1},\psi ,\psi ')\). We claim that f is non-degenerate whenever \(p\ne 2\). The Newton polyhedron \(\Delta (f)\) is the triangle with vertices \((x,y) = (-1,0), (2,-1), (0,1)\), and we evaluate \(V(f) = 2\). We list the faces \(\sigma \) that do not contain the origin, and compute the derivatives \(\frac{\partial {f_\sigma }}{\partial {x}}, \frac{\partial {f_\sigma }}{\partial {y}}\). We denote by \(\left\langle {a_0, \ldots , a_j}\right\rangle \) the j-dimensional face of \(\Delta (f)\) containing \(a_0,\ldots , a_j\). We compute:
Observe that when \(p\ne 2\), the terms \(-\frac{m_1}{x^2}\), \(\frac{2m_2x}{y}\), \(-\frac{m_2x^2}{y^2}\), \(n_2\) have no zeroes in \((\overline{\mathbb {F}_p}^\times )^2\). Hence we conclude that f is non-degenerate when \(p\ne 2\). Now we apply Theorem 4.1 and conclude that
for \(p\ne 2\). However, by direct computation, the bound (4.2) also holds for \(p=2\). Therefore, for all primes p, we have
So the bound holds in this case.
If \(s>1\), we apply the stationary phase method, following [7]. Let V be a smooth scheme of dimension n, and \(f:V\rightarrow \mathbb {A}^1 = \mathbb {A}_{\mathbb {Z}_p}^1\) a \(\mathbb {Z}_p\)-morphism. We consider the exponential sum
Let \(j\le m\) be a positive integer. We write
to denote the “approximate critical points” of f. For \(\overline{x} \in (\mathbb {Z}/p^j\mathbb {Z})^n\), we define
Clearly we have
Theorem 4.2
[7, Theorem 1.8(a)] If \(2j\le m\), then \(S_{\overline{x}} = 0\) unless \(\overline{x} \in D(\mathbb {Z}/p^j\mathbb {Z})\). Now suppose \(m=2j\) or \(2j+1\), and let \(x\in (\mathbb {Z}/p^m\mathbb {Z})^n\) map to \(\overline{x} \in D(\mathbb {Z}/p^j\mathbb {Z})\). If \(m=2j\), then we have
If \(m=2j+1\), then we have
where \(H_x\) is the Hessian matrix of f at x. In particular, if we let t denote the maximum value of \(n - {\text {rank}}_{\mathbb {F}_p} H_{\overline{x}}\) for \(\overline{x}\in D(\mathbb {Z}/p^j\mathbb {Z})\), then \(\left| {S} \right| \le \left| {D(\mathbb {Z}/p^j\mathbb {Z})} \right| p^{(mn+t)/2}\).
Now we apply the stationary phase method. Let \(f(x,y) = \frac{m_1}{x} + \frac{m_2 x^2}{y} + n_2 y\). Consider the sum
Let \(j\ge 1\) be such that \(2j\le s\). Define as in (4.4)
It is straightforward to check that \(\left| {D\left( {\mathbb {Z}/p^j\mathbb {Z}}\right) } \right| \le 4\), and \(H_{x,y}\) is invertible over \(\mathbb {F}_p\) for all \((x,y)\in D\left( {\mathbb {Z}/p^j\mathbb {Z}}\right) \), so \({\text {rank}}_{\mathbb {F}_p} H_{x,y} = 2\). So we deduce from Theorem 4.2 that
Now it remains to tackle the case \(p=2\), \(r=2s-1\). As p is fixed, it suffices to prove the bound for sufficiently large s, so we can always use the stationary phase method. Let \(f(x,y) = \frac{2m_1}{x} + \frac{m_2x^2}{y} + n_2 y\). Consider the sum
Let \(j\ge 1\) be such that \(2j\le s\). Define as in (4.4)
Then we have \(\left| {D\left( {\mathbb {Z}/p^j\mathbb {Z}}\right) } \right| \le 16\). The Hessian \(H_{x,y}\) is not invertible, but nevertheless we have from Theorem 4.2 that
This finishes the proof of the bound for \({\text {Kl}}_p\left( {n_{s_\alpha s_\beta , r,s}, \psi , \psi '}\right) \).
4.2 Bound for \({\text {Kl}}_p\left( {n_{s_\beta s_\alpha , r, s}, \psi , \psi '}\right) \)
As we have mentioned in Sect. 3, the Kloosterman sum \({\text {Kl}}_p\left( {n_{s_\beta s_\alpha , r, s}, \psi , \psi '}\right) \) differs from the \({\text {GL}}(3)\) Kloosterman sum \(S\left( {n_1, m_1, m_2; p^r, p^{s-r}}\right) \) just by a factor of \(p^r\). So our bound immediately follows from the estimate given by Larsen [2, Appendix], whose proof we omit here.
4.3 Bound for \({\text {Kl}}_p\left( {n_{s_\alpha s_\beta s_\alpha , r, s}, \psi , \psi '}\right) \)
We make use of the decomposition for Kloosterman sums in Sect. 2 to obtain a non-trivial bound for \({\text {Kl}}_p\left( {n_{s_\alpha s_\beta s_\alpha , r, s}, \psi , \psi '}\right) \).
Let \(w = s_\alpha s_\beta s_\alpha \), and \(n = n_{s_\alpha s_\beta s_\alpha , r, s}\). Note that we have \(s\le 2r\). Then \(\Delta _w = \left\{ {\alpha }\right\} \), and
Let \(t = {\text {diag}}\left( {a_1, a_2, ca_1^{-1}, ca_2^{-1}}\right) \in \mathcal T\). Then \(s = n^{-1}tn = {\text {diag}}\left( {ca_1^{-1}, a_2, a_1, ca_2^{-1}}\right) \). We compute
So
If \(\theta :A_w(\ell ) \rightarrow \mathbb {C}^\times \) is given by
then
Suppose \(x_{a,b}^{v_3} \in X(n)\) has Plücker coordinates
Let \(\delta = \left( {p^{r-a}, p^a v_3 + p^{r-b}}\right) \). Then \(v_{14} = \frac{p^{r+a}}{\delta }\). This says \(s-r \le a \le \frac{s}{2}\), \(b\le r\). Then \(\delta = p^{r+a-s}\). Then
Let \(X_{a,b}^{v_3}(n) = \mathcal T * x_{a,b}^{v_3}\), and define
We also let
and
Let \(x\in X(n)\), with Plücker coordinates \(v_2 = v_{2,x}\), \(v_4 = v_{4,x}\). Then \({\text {ord}}_p(v_{2,x}) = r-a\), \({\text {ord}}_p(v_{4,x}) = r-b\) for some \(s-r \le a \le s/2\), \(0\le b\le r\). So x lies in the \(\mathcal T\)-orbit of \(x_{a,b}^{v_3}\) for some \(v_3\pmod {p^r}\), and hence \(x \in X_{a,b}(n)\). This gives a partition
As \(r\ge \frac{s}{2} \ge a\), \(r\ge b\), we see that \(u(x), u'(x)\) have entries in \(p^{-r}\mathbb {Z}_p/\mathbb {Z}_p\) for all \(x \in X(n)\). Let \(\mathcal S_{a,b}\) be a finite subset of \(\mathbb {Z}_p\) such that
By Theorem 2.4, we have
where
with \(\hat{v}_2\) and u given as in (3.7) and (3.8). By (4.5), we have
Since the size of the \(\mathcal T\)-orbit of \(x_{a,b}^{v_3}\) is bounded by \(p^{a+b}\), we have
We estimate the size of \(S_w \left( {\theta _{a,b}^{v_3}; 2r}\right) \) below. We start by computing the order of \(\hat{v}_2\) and u in (4.6). From (3.7), it is clear that \({\text {ord}}_p\left( {\hat{v}_2}\right) = s-a\). Now we consider \({\text {ord}}_p(u)\). If \(a\ne \frac{s}{2}\), then we have (after putting \(v'_2 = \overline{v'_2} = 1\))
So \({\text {ord}}_p(u) = 2\left( {r-b}\right) \). If \(a=\frac{s}{2}\), then (again we set \(v'_2 = \overline{v'_2} = 1\))
This form will be useful in computing \({\text {ord}}_p(u)\), when more conditions are given.
Case I: Suppose \(s<r\). We deduce from (3.7) that \({\text {ord}}_p(v_3) = 0, {\text {ord}}_p(v_4) = a\), so only terms with \(r=a+b\) contribute. When \(a \ne \frac{s}{2}\), we have \({\text {ord}}_p(u) = 2\left( {r-b}\right) = 2a\). When \(a = \frac{s}{2}\), we can still take \({\text {ord}}_p(u) = s = 2a\). So \({\text {ord}}_p(u) = 2a\) always holds.
-
(i)
Suppose \(a\le \frac{2s-r}{3}\). Write \(u = p^{2a} u'\). Let
$$\begin{aligned}\begin{aligned} t = \min \left\{ {{\text {ord}}_p(m_2), {\text {ord}}_p(m_1)+2s-r-3a, {\text {ord}}_p(n_1) + s-3a}\right\} , \end{aligned}\end{aligned}$$and
$$\begin{aligned}\begin{aligned} f(x,y) = p^{-t} \left( {m_2 u' y + \frac{m_1 \hat{v}_2 p^{s-r-2a} x}{y} + \frac{n_1 p^{s-3a}}{x}}\right) = m'_2 y + \frac{m'_1 x}{y} + \frac{n'_1}{x}, \end{aligned}\end{aligned}$$where \(m'_1 = m_1 \hat{v}_2 p^{s-r-2a-t}\), \(m'_2 = m_2 u' p^{-t}\), \(n'_1 = n_1p^{s-3a-t}\). Consider the sum
$$\begin{aligned}\begin{aligned} S = \sum \limits _{x,y\in (\mathbb {Z}/p^{s-2a-t}\mathbb {Z})^\times } e\left( {\frac{f(x,y)}{p^{s-2a-t}}}\right) = p^{2s-4a-4r-2t} S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) . \end{aligned}\end{aligned}$$When \(s-2a-t>1\), let \(j\ge 1\) be such that \(2j\le s-2a-t\). Define as in (4.4)
$$\begin{aligned} D\left( {\mathbb {Z}/p^j\mathbb {Z}}\right)&= \left\{ {(x,y) \in \left( {\mathbb {Z}/p^j\mathbb {Z}}\right) ^\times \times \left( {\mathbb {Z}/p^j\mathbb {Z}}\right) ^\times }\;\big |\;{\nabla f(x,y) \equiv 0\pmod {p^j}}\right\} \\&= \left\{ {\left( {x,y}\right) \in \left( {\mathbb {Z}/p^j\mathbb {Z}}\right) ^\times \times \left( {\mathbb {Z}/p^j\mathbb {Z}}\right) ^\times }\;\big |\;{\begin{array}{l} m'_1 x^2 \equiv n'_1y \pmod {p^j},\\ m'_2 y^2 \equiv m'_1 x \pmod {p^j}\end{array}}\right\} . \end{aligned}$$Note that at least one of \(m'_1\), \(m'_2\) and \(n'_1\) is not divisible by p. It then follows that \(D\left( {\mathbb {Z}/p^j\mathbb {Z}}\right) \) is empty unless \({\text {ord}}_p(m_2) = {\text {ord}}_p(m_1)+2s-r-3a = {\text {ord}}_p(n_1) + s-3a = t\). But then
$$\begin{aligned}\begin{aligned} S = p^{4a+2t-2s} {\text {Kl}}_p(n_{s_\beta s_\alpha , s-2a-t, 3s-6a-3t}, \psi _{m'_1,m'_2}, \psi _{n'_1,0}), \end{aligned}\end{aligned}$$with \(p\not \mid m'_1m'_2n'_1\). So it follows from the bound for \({\text {Kl}}_p(n_{s_\beta s_\alpha ,r,s},\psi ,\psi ')\) that
$$\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) } \right| \ll p^{4r+2a-s+t}. \end{aligned}$$(4.9)Now suppose \(s-2a-t=1\). If \(p \not \mid m'_1m'_2n'_1\), then it follows by the theorem of Deligne [6, Sommes. trig., 7.1.3] that \(S\ll p\). When p divides some (but not all) of \(m'_1\), \(m'_2\), \(n'_1\), then the sum reduces to a Ramanujan sum, and is easily evaluated that \(S\ll p\) as well. So the bound (4.9) also holds for this case. Remark. Theorem 4.1, itself a generalisation of Deligne’s theorem, also applies to give the same bound. The bounds for \(S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) \) in other cases are obtained analogously, and we shall omit the repetitive computations thereafter.
-
(ii)
Suppose \(a>\frac{2s-r}{3}\). Write \(\hat{v}_2 = p^{s-a} \hat{v}'_2\). Let
$$\begin{aligned}\begin{aligned} t = \min \left\{ {{\text {ord}}_p(m_2)+r+3a-2s, {\text {ord}}_p(m_1), {\text {ord}}_p(n_1)+r-s}\right\} , \end{aligned}\end{aligned}$$and
$$\begin{aligned}\begin{aligned} f(x,y) = p^{-t} \left( {m_2 u p^{r+a-2s} y+\frac{m_1 \hat{v}'_2 x}{y} + \frac{n_1 p^{r-s}}{x}}\right) = m'_2 y + \frac{m'_1 x}{y} + \frac{n'_1}{x}, \end{aligned}\end{aligned}$$where \(m'_1 \hat{v}'_2 p^{-t}\), \(m'_2 = m_2 u p^{r+a-2s-t}\), \(n'_1 = n_1 p^{r-s-t}\). Then we have
$$\begin{aligned}\begin{aligned} S = \sum \limits _{x,y\in (\mathbb {Z}/p^{r+a-s-t}\mathbb {Z})^\times } e\left( {\frac{f(x,y)}{p^{r+a-s-t}}}\right) = p^{2a-2r-2s-2t} S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) . \end{aligned}\end{aligned}$$Then we obtain analogously
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) } \right| \ll p^{3r-a+s+t}. \end{aligned}\end{aligned}$$
Note that we have \(\left( {p^{r-a}, p^a \left( {v_3+1}\right) }\right) = p^{r+a-s}\). A necessary condition for this to hold is that \(p^{r-s} \mid v_3+1\). So \(\left| {\mathcal S_{a,b}} \right| \le p^s\). So, from (4.7) we actually have
Hence
Case II: Suppose \(s=r\). We deduce from (3.7) that when \(a\ne 0\), then \({\text {ord}}_p(v_3) = 0, {\text {ord}}_p(v_4) \ge a\). So, only terms with \(r\ge a+b\) contribute. When \(a\ne \frac{s}{2}\), we have \({\text {ord}}_p(u) = 2\left( {r-b}\right) \). When \(a=\frac{s}{2}\), we still have \({\text {ord}}_p(u) \le s = 2\left( {r-b}\right) \). So \({\text {ord}}_p(u) \le 2\left( {r-b}\right) \) always holds. We compute
Hence
Case III: \(2r>s>r\). We consider the following subcases:
-
(a)
Suppose \(a=s-r\). Then the condition \(\left( {p^{r-a}, p^a v_3 + p^{r-b}}\right) = 1\) implies \(b=r\). So \({\text {ord}}_p(u) = 0\). We deduce from (3.7) that \(\hat{v}_2 = 0\). So
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) } \right| \ll p^{3r-s} \min \left\{ {p^{r+{\text {ord}}_p(m_2)}, p^{2r+{\text {ord}}_p(n_1)}}\right\} . \end{aligned}\end{aligned}$$ -
(b)
Suppose \(s-r< a < \frac{s}{2}\). Then we deduce from (3.7) that \({\text {ord}}_p(v_3) = 0\), \({\text {ord}}_p(v_4) \ge a\). So \(a+b\le r\). Meanwhile, as \(r+a-s < a\), the condition \(\left( {p^{r-a}, p^a v_3 + p^{r-b}}\right) = p^{r+a-s}\) says \(r-b = r+a-s\), which implies \(a+b = s >r\), a contradiction. So there is no contribution from this case.
-
(c)
Suppose \(a = \frac{s}{2}\). Again, we deduce from (3.7) that \({\text {ord}}_p(v_3) = 0\), \({\text {ord}}_p(v_4) \ge a\). So, only terms with \(r\ge a+b\) contribute. In this case, we don’t have a good bound for \({\text {ord}}_p(u)\). So
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) } \right| \ll p^{3r+\min \left\{ {\frac{s}{2}+{\text {ord}}_p(m_1), r-\frac{s}{2}+{\text {ord}}_p(n_1)}\right\} }. \end{aligned}\end{aligned}$$
Hence
Case IV: \(s = 2r\). In this case, we have \(a = r\), and \(v_3, v_4 = p^{r-b}\) is arbitrary. We deduce from (3.7) that \(\hat{v}_2 = 0\). We consider the following subcases:
-
(a)
Suppose \(b=0\). We may assume \(v_4=0\). Then \({\text {ord}}_p(u) = r+{\text {ord}}_p(v_3)\). We compute
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) } \right| \ll p^r \min \left\{ {p^{2r+{\text {ord}}_p(v_3)+{\text {ord}}_p(m_2)}, p^{2r+{\text {ord}}_p(n_1)}}\right\} . \end{aligned}\end{aligned}$$Fix \(c\le r\). Then
$$\begin{aligned}\begin{aligned} \left| {\left\{ {v_3\in \mathcal S_{a,b}}\;\big |\;{{\text {ord}}_p(v_3) = c}\right\} } \right| \le p^{r-c}. \end{aligned}\end{aligned}$$ -
(b)
Suppose \(b>0\). Then \({\text {ord}}_p(u) = r-b\). We compute
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_3}; 2r}\right) } \right| \ll p^r \min \left\{ {p^{2r-b+{\text {ord}}_p(m_2)}, p^{2r+{\text {ord}}_p(n_1)}}\right\} . \end{aligned}\end{aligned}$$
Hence
This finishes the proof of the bound for \({\text {Kl}}_p\left( {n_{s_\alpha s_\beta s_\alpha ,r,s}, \psi , \psi '}\right) \).
4.4 Bound for \({\text {Kl}}_p\left( {n_{s_\beta s_\alpha s_\beta , r, s}, \psi , \psi '}\right) \)
We make use of the decomposition for Kloosterman sums in Sect. 2 to obtain a non-trivial bound for \({\text {Kl}}_p\left( {n_{s_\beta s_\alpha s_\beta , r, s}, \psi , \psi '}\right) \).
Let \(w = s_\beta s_\alpha s_\beta \), and \(n = n_{s_\beta s_\alpha s_\beta , r, s}\). Note that we have \(r\le s\). Then \(\Delta _w = \left\{ {\beta }\right\} \), and
Let \(t = {\text {diag}}\left( {a_1, a_2, ca_1^{-1}, ca_2^{-1}}\right) \in \mathcal T\). Then \(s = n^{-1}tn = {\text {diag}}\left( {ca_2^{-1}, ca_1^{-1}, a_2, a_1}\right) \). We compute
So
If \(\theta : A_w(\ell ) \rightarrow \mathbb {C}^\times \) is given by
then
Suppose \(x_{a,b}^{v_{23}} \in X(n)\) has Plücker coordinates
The condition \((v_{12}, v_{14}) \mid v_{13}^2\) says \(s-b \le 2\left( {s-a}\right) \), that is, \(2a-b\le s\). We also have \(\max \left\{ {a,b}\right\} = r\). Then
Let \(X_{a,b}^{v_{23}} (n) = \mathcal T * x_{a,b}^{v_{23}}\), and define
We also let
and
Again we have a partition
It is clear that \(u(x), u'(x)\) have entries in \(p^{-s}\mathbb {Z}_p/\mathbb {Z}_p\) for all \(x\in X(n)\). Let \(\mathcal S_{a,b}\) be a finite subset of \(\mathbb {Z}_p\) such that
By Theorem 2.4, we have
where
with \(\hat{v}_{14}\) and u given as in (3.14) and (3.15). By (4.10), we have
Since the size of the \(\mathcal T\)-orbit of \(x_{a,b}^{v_{23}}\) is bounded by \(p^{a+b}\), we have
We estimate the size of \(S_w \left( {\theta _{a,b}^{v_{23}}; s}\right) \). We start by computing the order of \(\hat{v}_{14}\) and u in (4.11). From (3.14), we see that
So, if \(a=r\), then \(u\equiv v_{23}\pmod {p^r}\), and if \(b=r\), then \(u\equiv -p^{s-a}\pmod {p^r}\). (Recall that \(\max \left\{ {a,b}\right\} =r\).) Also, we know that
for some \(\beta \in \mathbb {Z}\) such that \(\left( {\beta , p^{s-2r+b}}\right) = 1\) (see [12, Sect. 3.2]). Meanwhile, from (3.15), we see that unless \(r=s\), we have \({\text {ord}}_p\left( {\hat{v}_{14}}\right) = 2r-b\).
Case I: Suppose \(r<\frac{s}{2}\). We deduce from (4.14) that \({\text {ord}}_p(v_{23}) = b\). From (4.13), we deduce \(a\ge b\). So we actually have \(a=r\), and then \({\text {ord}}_p(u) = b\).
-
(i)
Suppose \(b\le \frac{3r-s}{2}\). Write \(u = p^b u'\). Let
$$\begin{aligned}\begin{aligned} t = \min \left\{ {{\text {ord}}_p(m_1), {\text {ord}}_p(m_2)+3r-2b-s, {\text {ord}}_p(n_2)+r-2b}\right\} \end{aligned}\end{aligned}$$and
$$\begin{aligned}\begin{aligned} f(x,y) = p^{-t} \left( {\frac{m_1u'}{x} + \frac{m_2 \hat{v}_{14} p^{r-b-s} x^2}{y} + n_2 p^{r-2b}y}\right) = \frac{m'_1}{x} + \frac{m'_2 x^2}{y} + n'_2 y, \end{aligned}\end{aligned}$$where \(m'_1 = m_1 u' p^{-t}\), \(m'_2 = m_2 \hat{v}_{14} p^{r-b-s-t}\), \(n'_2 = n_2 p^{r-2b-t}\). Consider the sum
$$\begin{aligned}\begin{aligned} S = \sum \limits _{x,y\in (\mathbb {Z}/p^{r-b-t}\mathbb {Z})^\times } e\left( {\frac{f(x,y)}{p^{r-b-t}}}\right) = p^{2r-2s-2b-2t} S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) . \end{aligned}\end{aligned}$$When \(r-b-t>1\), let \(j\ge 1\) be such that \(2j\le r-b-t\). Define as in (4.4)
$$\begin{aligned}\begin{aligned} D(\mathbb {Z}/p^j\mathbb {Z}) =&\left\{ {(x,y)\in (\mathbb {Z}/p^j\mathbb {Z})^\times \times (\mathbb {Z}/p^j\mathbb {Z})^\times }\;\big |\;{\nabla f(x,y) \equiv 0\pmod {p^j}}\right\} \\ =&\left\{ {(x,y)\in (\mathbb {Z}/p^j\mathbb {Z})^\times \times (\mathbb {Z}/p^j\mathbb {Z})^\times }\;\big |\;{\begin{array}{l} 2m'_2 x^3 \equiv m'_1 y \pmod {p^j}\\ m'_2 x^2 \equiv n'_2 y^2 \pmod {p^j}\end{array}}\right\} . \end{aligned}\end{aligned}$$Note that at least one of \(m'_1\), \(m'_2\) and \(n'_2\) is not divisible by p. It then follows that when p is odd, \(D(\mathbb {Z}/p^j\mathbb {Z})\) is empty unless \({\text {ord}}_p(m_1) = {\text {ord}}_p(m_2)+3r-2b-s = {\text {ord}}_p(n_2)+r-2b = t\). But then
$$\begin{aligned}\begin{aligned} S = p^{t+b-r} {\text {Kl}}_p(n_{s_\alpha s_\beta , 2r-2b-2t, r-b-t}, \psi _{m'_1,m'_2}, \psi _{0,n'_2}), \end{aligned}\end{aligned}$$with \(p\not \mid m'_1m'_2n'_2\). So it follows from the bound for \({\text {Kl}}_p(n_{s_\alpha s_\beta ,r,s},\psi ,\psi ')\) that
$$\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_{23}};s}\right) } \right| \ll p^{2s-r+b+t}. \end{aligned}$$(4.15)When \(p=2\), \(D(\mathbb {Z}/p^j\mathbb {Z})\) is empty unless \({\text {ord}}_p(m_1)-1 = {\text {ord}}_p(m_2)+3r-2b-s = {\text {ord}}_p(n_2)+r-2b = t\). Then
$$\begin{aligned}\begin{aligned} S = p^{t+b-r+1} {\text {Kl}}_p(n_{s_\alpha s_\beta , 2r-2b-2t-1, r-b-t}, \psi _{m'_1/2,m'_2}, \psi _{0,n'_2}), \end{aligned}\end{aligned}$$with \(p \not \mid (m'_1/2)m'_2n'_2\). Again, from the bound for \({\text {Kl}}_p(n_{s_\alpha s_\beta ,r,s},\psi ,\psi ')\), we see that (4.15) also holds for this case. Now suppose \(r-b-t=1\). If \(p\not \mid m'_1 m'_2 n'_1\), then it again follows from Theorem 4.1 that \(\left| {S} \right| \ll p\). When p divides some (but not all) of \(m'_1, m'_2, n'_1\), then the sum reduces to Gauß sums or Ramanujan sums, and is easily evaluated that \(\left| {S} \right| \ll p\) as well. So the bound (4.15) also holds for this case. The bounds for \(S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) \) in other cases are obtained analogously, and we shall omit the repetitive computations thereafter.
-
(ii)
Suppose \(b> \frac{3r-s}{2}\). Write \(\hat{v}_{14} = p^{2r-b} \hat{v}'_{14}\). Let
$$\begin{aligned}\begin{aligned} t = \min \left\{ {{\text {ord}}_p(m_1)+s+2b-3r, {\text {ord}}_p(m_2), {\text {ord}}_p(n_2)+s-2r}\right\} , \end{aligned}\end{aligned}$$and
$$\begin{aligned}\begin{aligned} f(x,y) = p^{-t} \left( {\frac{m_1up^{s+b-3r}}{x} + \frac{m_2 \hat{v}'_{14} x^2}{y} + n_2 p^{s-2r} y}\right) = \frac{m'_1}{x} + \frac{m'_2 x^2}{y} + n'_2 y, \end{aligned}\end{aligned}$$where \(m'_1 = m_1 u p^{s+b-3r-t}\), \(m'_2 = m_2 \hat{v}'_{14} p^{-t}\), \(n'_2 = n_2 p^{s-2r-t}\). Then we have
$$\begin{aligned}\begin{aligned} S = \sum \limits _{x,y\in (\mathbb {Z}/p^{s+b-2r-t}\mathbb {Z})^\times } e\left( {\frac{f(x,y)}{p^{s+b-2r-t}}}\right) = p^{2b-4r-2t} S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) . \end{aligned}\end{aligned}$$Then we obtain analogously
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) } \right| \ll p^{s+2r-b+t}. \end{aligned}\end{aligned}$$
Hence
Case II: Suppose \(r=\frac{s}{2}\). We consider the following subcases:
-
(a)
Suppose \(b=r\). From (4.13), we may assume \(u=0\). We compute
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) } \right| \ll p^{\frac{3s}{2}+\min \left\{ {{\text {ord}}_p(m_2), {\text {ord}}_p(n_2)}\right\} }. \end{aligned}\end{aligned}$$ -
(b)
Suppose \(b<r\). Then \(a=r\). From (4.14), we see that \(v_{23} = \left( {\beta -1}\right) p^b\) for some \(\beta \in \mathbb {Z}\) such that \(\left( {\beta , p^b}\right) = 1\). So \({\text {ord}}_p(v_{23}) \ge b\). And from (4.13), we deduce that \({\text {ord}}_p(u) = {\text {ord}}_p(v_{23})\). We compute
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) } \right| \ll p^{s/2} \min \left\{ {p^{s+{\text {ord}}_p(v_{23}) + {\text {ord}}_p(m_1)}, p^{\frac{3s}{2}-b+\min \left\{ {{\text {ord}}_p(m_2), {\text {ord}}_p(n_2)}\right\} }}\right\} . \end{aligned}\end{aligned}$$
Fix \(c\ge b\). Then
Hence
Case III: Suppose \(s>r>\frac{s}{2}\). We consider the following subcases:
-
(a)
Suppose \(b=r\). Then \({\text {ord}}_p(u) = s-a\), and \({\text {ord}}_p(\hat{v}_{14}) = r\). We compute
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) } \right| \ll p^{s-r} \min \left\{ {p^{2s-a+{\text {ord}}_p(m_1)}, p^{r+\min \left\{ {r+{\text {ord}}_p(m_2)}\right\} , s-r+{\text {ord}}_p(n_2)}}\right\} . \end{aligned}\end{aligned}$$ -
(b)
Suppose \(b<r\). Then \(a=r\). Then from (4.14) we deduce that \({\text {ord}}_p(v_{23}) = p^{s-2r+b}\), and hence \({\text {ord}}_p(u) = p^{s-2r+b}\). We compute
$$\begin{aligned}\begin{aligned} \left| {S_w\left( {\theta _{a,b}^{v_{23}}; s}\right) } \right| \ll p^{s-r} \min \left\{ {p^{2s-2r+b+{\text {ord}}_p(m_1)}, p^{r-b+\min \left\{ {2r+{\text {ord}}_p(m_2), s+{\text {ord}}_p(n_2)}\right\} }}\right\} . \end{aligned}\end{aligned}$$
Hence
Case IV: \(r=s\). In this case we only have to consider terms with \(b=r\). Indeed, if \(b<r\), then \(a=r\), and then by (4.13), we see that \(u p^{r-b} \equiv -1\pmod {p^r}\), which says \(b=r\), a contradiction. When \(b=r\), we have \({\text {ord}}_p(u) = s-a\), and from (3.15) we may assume \(\hat{v}_{14} = 0\). We compute
Hence
This finishes the proof of the bound for \({\text {Kl}}_p\left( {n_{s_\beta s_\alpha s_\beta ,r,s}, \psi , \psi '}\right) \).
4.5 Bounds for \({\text {Kl}}_p\left( {n_{w_0, r, s}, \psi , \psi '}\right) \)
We show that under the stratification introduced in Sect. 2, \({\text {Kl}}_p\left( {n_{w_0, r, s}, \psi , \psi '}\right) \) decomposes into a sum of products of \({\text {GL}}(2)\) Kloosterman sums. So the Kloosterman sum can be bounded using (4.1).
Let \(w = w_0\), and \(n = n_{w_0, s, r}\). Then \(\Delta _{w_0} = \Delta \), and
Let \(t = {\text {diag}}\left( {a_1, a_2, ca_1^{-1}, ca_2^{-1}}\right) \in \mathcal T\). Then \(s = n^{-1}tn = {\text {diag}}\left( {ca_1^{-1}, ca_2^{-1}, a_1, a_2}\right) \). We compute
So
If \(\theta : A_{w_0}(\ell ) \rightarrow \mathbb {C}^\times \) is given by
then
Suppose \(x_{a,b}^{v_3, v_4, v_{13}}\in X(n)\) has Plücker coordinates
Note that this also says \(r\ge a, s\ge b\). Then
Let \(X_{a,b}^{v_3, v_4, v_{13}}(n) = \mathcal T * x_{a,b}^{v_3, v_4, v_{13}}\), and define
We also let
and
We have a partition
Now we consider cases \(r\ge s\) and \(r<s\) separately.
-
(i)
Suppose \(r> s\). As \(r\ge a, r\ge s\ge b\), we see that \(u(x), u'(x)\) have entries in \(p^{-r}\mathbb {Z}_p/\mathbb {Z}_p\) for all \(x\in X(n)\). Let \(\mathcal S_{a,b}\) be a finite subset of \(\mathbb {Z}_p^3\) such that
$$\begin{aligned}\begin{aligned} X_{a,b}(n) = \coprod \limits _{(v_3, v_4, v_{13}) \in \mathcal S_{a,b}} X_{a,b}^{v_3, v_4, v_{13}} (n). \end{aligned}\end{aligned}$$By Theorem 2.4, we have
$$\begin{aligned}\begin{aligned} S_{a,b} \left( {n, \psi , \psi '}\right) = p^{-2r} \left( {1-p^{-1}}\right) ^{-2} \sum \limits _{(v_3, v_4, v_{13}) \in \mathcal S_{a,b}} \left| {X_{a,b}^{v_3, v_4, v_{13}} (n)} \right| S_{w_0} \left( {\theta _{a,b}^{v_3, v_4, v_{13}}; r}\right) , \end{aligned}\end{aligned}$$where
$$\begin{aligned}\begin{aligned} \theta _{a,b}^{v_3, v_4, v_{13}} (\lambda ,\lambda ') = {\text {e}}\left( {\frac{m_1\hat{v}_2\lambda _1 + n_1p^{r-a}\lambda '_1}{p^r}}\right) {\text {e}}\left( {\frac{m_2\hat{v}_{14} + n_2 p^{s-b}}{p^s}}\right) . \end{aligned}\end{aligned}$$By (4.16), we have
$$\begin{aligned}\begin{aligned} S_{w_0} \left( {\theta _{a,b}^{v_3, v_4, v_{13}}; r}\right) = S\left( {m_1\hat{v}_2, n_1\hat{p}^{r-a}; p^r}\right) S\left( {m_2\hat{v}_{14} p^{r-s}, n_2p^{r-b}; p^r}\right) . \end{aligned}\end{aligned}$$And we obtain a bound by applying (4.1):
$$\begin{aligned}\begin{aligned}&\left| {S_{w_0}\left( {\theta _{a,b}^{v_3, v_4, v_{13}}; r}\right) } \right| \\&\qquad \le 4 p^r \left( {\gcd \left( {m_1\hat{v}_2, n_1p^{r-a}, p^r}\right) \gcd \left( {m_2\hat{v}_{14} p^{r-s}, n_2p^{r-b}, p^r}\right) }\right) ^{1/2}. \end{aligned}\end{aligned}$$ -
(ii)
Suppose \(s\ge r\). Then \(u(x), u'(x)\) has entries in \(p^{-s}\mathbb {Z}_p/\mathbb {Z}_p\) for all \(x\in X(n)\). Again, by Theorem 2.4 we have
$$\begin{aligned}\begin{aligned} S_{a,b}\left( {n,\psi ,\psi '}\right)&= p^{-2s} \left( {1-p^{-1}}\right) ^{-2} \\&\quad \times \sum \limits _{(v_3, v_4, v_{13}) \in \mathcal S_{a,b}} \left| {X_{a,b}^{v_3, v_4, v_{13}} (n)} \right| S_{w_0} \left( {\theta _{a,b}^{v_3, v_4, v_{13}}; s}\right) , \end{aligned}\end{aligned}$$where
$$\begin{aligned}\begin{aligned}&\theta _{a,b}^{v_3, v_4, v_{13}} (\lambda ,\lambda ') \\&\qquad = {\text {e}}\left( {\frac{\left( {m_1\hat{v}_2 p^{s-r}}\right) \lambda _1 + \left( {m_2\hat{v}_{14}}\right) \lambda _2 + \left( {n_1p^{s-a}}\right) \lambda '_1 + \left( {n_2p^{s-b}}\right) \lambda '_2}{p^s}}\right) . \end{aligned}\end{aligned}$$By (4.16), we have
$$\begin{aligned}\begin{aligned} S_{w_0} \left( {\theta _{a,b}^{v_3, v_4, v_{13}}; s}\right) = S\left( {m_1\hat{v}_2 p^{s-r}, n_1p^{s-a}; p^s}\right) S\left( {m_2\hat{v}_{14}, n_2p^{s-b}; p^s}\right) . \end{aligned}\end{aligned}$$Applying (4.1) gives
$$\begin{aligned}\begin{aligned}&\left| {S_{w_0} \left( {\theta _{a,b}^{v_3, v_4, v_{13}}; s}\right) } \right| \\&\qquad \le 4 p^s \left( {\gcd \left( {m_1\hat{v}_2p^{s-r}, n_1p^{s-a}, p^s}\right) , \gcd \left( {m_2\hat{v}_{14}, n_2p^{s-b}, p^s}\right) }\right) ^{1/2}. \end{aligned}\end{aligned}$$
Now we give a bound to the size of \({\text {Kl}}_p\left( {n, \psi , \psi '}\right) \). To ease computation, we consider a relaxed bound by ignoring \(\hat{v}_2\) and \(\hat{v}_{14}\).
Suppose \(r>s\). Then the bound says
Note that
Hence
So it suffices to give an upper bound to \(\left| {\mathcal S_{a,b}} \right| \). Such bounds were computed in [12, Sect. 5]. Note that we require \(r\ge a+b\) in order to have \(\mathcal S_{a,b}\) nonempty.
Case I: Suppose \(s-r+a\ge 0\).
-
(a)
If \(s-2r+2a+b \ge 0\), then \(\left| {\mathcal S_{a,b}} \right| \le p^{r+s-a-b}\).
-
(b)
If \(s-2r+2a+b < 0\), then \(\left| {\mathcal S_{a,b}} \right| \le p^{2s-b-\lceil \frac{s-b}{2}\rceil } \le p^{3s/2-b/2}\).
Case II: Suppose \(s-r+a<0\). Then \(\left| {\mathcal S_{a,b}} \right| \le p^{2s-b-\lceil \frac{s-b}{2}\rceil } \le p^{3s/2-b/2}\).
Combining the cases, we obtain
Hence, we have for \(r>s\)
For \(r\le s\), applying the same argument gives
Combining (4.17) and (4.18), we get
By Proposition 3.3, we can swap the characters, so
as well. Combining (4.19) and (4.20) yields the bound for \({\text {Kl}}_p(n_{w_0,r,s},\psi ,\psi ')\).
4.6 Bounds for global Kloosterman sums
By combining the bounds for local Kloosterman sums \({\text {Kl}}_p(n_{w,r,s}, \psi , \psi ')\), we obtain bounds for global Kloosterman sums, and prove Theorem 1.2.
Proof of Theorem 1.2
The statement for \({\text {Kl}}(n_{{\text {id}}}(c_1,c_2),\psi ,\psi ')\) follows because
Meanwhile, \({\text {Kl}}(n_{s_\alpha }(c_1,1),\psi ,\psi ')\) and \({\text {Kl}}(n_{s_\beta }(1,c_2),\psi ,\psi ')\) are just classical Kloosterman sums. Combining local bounds for classical Kloosterman sums gives the global bounds, which read
For \({\text {Kl}}(n_{s_\alpha s_\beta }(c_1,c_2),\psi ,\psi ')\) and \({\text {Kl}}(n_{s_\beta s_\alpha }(c_1,c_2),\psi ,\psi ')\), we again combine the local bounds given in Theorem 1.1 yields the global bounds.
For \({\text {Kl}}(n_{s_\alpha s_\beta s_\alpha }(c_1,c_2),\psi ,\psi ')\) and \({\text {Kl}}(n_{s_\beta s_\alpha s_\beta }(c_1,c_2),\psi ,\psi ')\), the situation is more complicated, since the shapes of the local bounds depend on the relative size of r, s. Therefore, in order to obtain a global bound, we have to find an expression for the local bound that works for all values of r, s.
We start with \({\text {Kl}}(n_{s_\alpha s_\beta s_\alpha }(c_1,c_2),\psi ,\psi ')\). For \(s\le r\), we have
For \(r<s<2r\), we have
and we have inequalities
For \(s=2r\), we have
So we can conclude for \(0\le s \le 2r\) that
Since we may assume from (4.6) that \({\text {ord}}_p(m_1), {\text {ord}}_p(n_1)\le r\), and \({\text {ord}}_p(m_2) \le s\), we have
for every \(\varepsilon >0\).
Now we consider \({\text {Kl}}(n_{s_\beta s_\alpha s_\beta }(c_1,c_2),\psi ,\psi ')\). For \(r\le s/2\), we have
For \(s/2<r<s\), we have
For \(s=r\), we have
So we can conclude for \(0\le r\le s\) that
Since we may assume from (4.11) that \({\text {ord}}_p(m_1)\le r\), and \({\text {ord}}_p(m_2), {\text {ord}}_p(n_2) \le s\), we have
for every \(\varepsilon >0\).
For \({\text {Kl}}(n_{w_0}(c_1,c_2),\psi ,\psi ')\), the local bound again consists of a single expression, so the local bounds given in Theorem 1.1 can be combined directly to give the stated global bound.
5 Symplectic Poincaré series
In this section, we compute the Fourier coefficients of symplectic Poincaré series, in terms of auxiliary Kloosterman sums.
Definition
-
(a)
Let \(n\in N\left( {\mathbb {Q}_p}\right) \), and \(\psi _p, \psi '_p\) be characters of \(U\left( {\mathbb {Q}_p}\right) \) which are trivial on \(U\left( {\mathbb {Z}_p}\right) \). Then the local auxiliary Kloosterman sum is defined to be
$$\begin{aligned}\begin{aligned} \underline{{\text {Kl}}}_p\left( {n, \psi _p, \psi '_p}\right) = \sum \limits _{\begin{array}{c} x\in X(n)\\ x = b_1 n b_2 \end{array}} \psi _p\left( {b_1}\right) \psi '_p\left( {b_2}\right) \end{aligned}\end{aligned}$$if \(\psi _p\left( {nun^{-1}}\right) = \psi '_p\left( {u}\right) \) for \(u \in \overline{U}_n\left( {\mathbb {Q}_p}\right) \), and zero otherwise. We say \(\underline{{\text {Kl}}}_p\left( {n, \psi _p, \psi '_p}\right) \) is well-defined if \(\psi _p\left( {nun^{-1}}\right) = \psi '_p\left( {u}\right) \) for \(u \in \overline{U}_n\left( {\mathbb {Q}_p}\right) \).
-
(b)
Let \(n\in N\left( {\mathbb {Q}}\right) \), and \(\psi = \prod \limits _p \psi _p\), \(\psi ' = \prod \limits _p \psi '_p\) be characters of \(U\left( {\mathbb {A}}\right) \) which are trivial on \(\prod \limits _p U\left( {\mathbb {Z}_p}\right) \). Then the global auxiliary Kloosterman sum is defined to be
$$\begin{aligned}\begin{aligned} \underline{{\text {Kl}}}\left( {n, \psi , \psi '}\right) = \prod \limits _p \underline{{\text {Kl}}}_p\left( {n, \psi _p, \psi '_p}\right) . \end{aligned}\end{aligned}$$
We first show that the auxiliary Kloosterman sums are well-defined.
Proposition 5.1
[9, Proposition 1.3] Let \(G = {\text {Sp}}\left( {2r, \mathbb {Q}_p}\right) \), \(n \in N\left( {\mathbb {Q}_p}\right) \), and \(x \in X(n)\), with Bruhat decomposition \(x = b_1 n b_2\), with \(b_1, b_2 \in U\left( {\mathbb {Q}_p}\right) \). Let \(\psi , \psi '\) be characters of \(U\left( {\mathbb {Q}_p}\right) \) which are trivial on \(U\left( {\mathbb {Z}_p}\right) \). Then the quantity \(\psi \left( {b_1}\right) \psi '\left( {b_2}\right) \) is well-defined as a function on X(n) if \(\psi \left( {nun^{-1}}\right) = \psi '\left( {u}\right) \) for \(u \in \overline{U}_n\left( {\mathbb {Q}_p}\right) \).
Proof
Suppose \(\psi \left( {nun^{-1}}\right) = \psi '\left( {u}\right) \) for all \(u \in \overline{U}_n\left( {\mathbb {Q}_p}\right) \). Let \(x = b_1 n b_2 = b'_1 n b'_2\) be two Bruhat decompositions. This says \(b'_1 = \gamma b_1\) for some \(\gamma \in U(\mathbb {Z}_p)\), and \(b'_2 = b_2 \delta \) for some \(\delta \in U_n\left( {\mathbb {Z}_p}\right) \). Then we have
which implies \(b_2 {b'_2}^{-1} = b_2 \delta ^{-1} b_2^{-1} \in \overline{U}_n\left( {\mathbb {Q}_p}\right) \). Now, from the equivalence of Bruhat decompositions, we deduce that
which implies \(\psi ' \left( {b_2{b'_2}^{-1}}\right) = \psi \left( {nb_2{b'_2}^{-1} n^{-1}}\right) = \psi \left( {b_1^{-1} b'_1}\right) \).
Proposition 5.2
If \(\underline{{\text {Kl}}}_p\left( {n, \psi _p, \psi '_p}\right) \) is well-defined, then \(\underline{{\text {Kl}}}_p\left( {n, \psi _p, \psi '_p}\right) = {\text {Kl}}_p\left( {n, \psi _p, \psi '_p}\right) \).
Proof
Trivial.
The Fourier coefficients \(P_{\psi , \psi '} (g)\) can be evaluated using the following theorem of Friedberg:
Theorem 5.3
[9, Theorem A] The Fourier coefficient \(P_{\psi , \psi '} (g)\) of \({\text {Sp}}(2r)\) Poincaré series is given by
Remark
In [9], the statement concerns \({\text {GL}}(r)\) Poincaré series, but the proof also works for \({\text {Sp}}(2r)\) Poincaré series.
5.1 \({\text {Sp}}(4)\) Poincaré series
Let \(G = {\text {Sp}}\left( {4, \mathbb {Q}_p}\right) \), and \(\psi = \psi _{m_1, m_2}\), \(\psi ' = \psi _{n_1, n_2}\). We give a table of conditions for auxiliary \({\text {Sp}}(4)\) Kloosterman sums \(\underline{{\text {Kl}}}_p\left( {n_{w, r, s}, \psi , \psi '}\right) \) to be well-defined.
w | Well-definedness conditions | w | Well-definedness conditions |
---|---|---|---|
\({\text {id}}\) | \(m_1=n_1, m_2=n_2\) | \(s_\beta s_\alpha \) | \(m_1=n_2=0\) |
\(s_\alpha \) | \(m_2=n_2=0\) | \(s_\alpha s_\beta s_\alpha \) | \(n_2 = m_2 p^{2r-2s}\) |
\(s_\beta \) | \(m_1=n_1=0\) | \(s_\beta s_\alpha s_\beta \) | \(n_1 = m_1 p^{s-2r}\) |
\(s_\alpha s_\beta \) | \(m_2=n_1=0\) | \(w_0\) | - |
Remark
From this table, we see that not all Kloosterman sums \({\text {Kl}}_p\left( {n, \psi , \psi '}\right) \) correspond to a well-defined auxiliary Kloosterman sum \(\underline{{\text {Kl}}}_p\left( {n, \psi , \psi '}\right) \).
From the well-definedness conditions for \(\underline{{\text {Kl}}}_p(n_{w,r,s},\psi ,\psi ')\), we see that when \(\psi = \psi _{m_1,m_2}\), \(\psi ' = \psi _{n_1,n_2}\) are non-degenerate, i.e. \(m_1m_2,n_1n_2\ne 0\), then
are the only Weyl elements that contribute to the Fourier coefficient \(P_{\psi ,\psi '}(g)\).
References
Adolphson, A., Sperber, S.: Expoonential sums and Newton polyhedra: cohomology and estimates. Ann. Math. 130(2), 367–406 (1989)
Bump, D., Friedberg, S., Goldfeld, D.: Poincaré series and Kloosterman sums for \(\operatorname{SL}(3, \mathbb{Z})\). Acta Arith. 50, 31–89 (1988)
Bump, D., Friedberg, S., Hoffstein, J.: Eisenstein series on the metaplectic group and nonvanishing theorems for automorphic L-functions and their derivatives. Ann. Math. 131, 53–127 (1990)
Blomer, V.: The relative trace formula in analytic number theory. To appear in Simons Symp. https://arxiv.org/abs/1912.08137 (2019)
Dąbrowski, R.: Kloosterman sums for Chevalley groups. Trans. Am. Math. Soc. 337, 757–769 (1993)
Deligne, P.: Cohomologie étale (SGA 4 1/2). Lecture Notes in Mathematics, vol. 569. Springer, Berlin (1977)
Dąbrowski, R., Fisher, B.: A stationary phase formula for exponential sums over \(\mathbb{Z}/p^m\mathbb{Z}\) and applications to \(\operatorname{GL}(3)\)-Kloosterman sums. Acta Arith. 80, 1–48 (1997)
Dąbrowski, R., Reeder, M.: Kloosterman sets in reductive groups. J. Number Theory 73, 228–255 (1998)
Friedberg, S.: Poincaré series for \(\operatorname{GL}(n)\): Fourier expansion, Kloosterman sums, and algebreo-geometric estimates. Math. Z. 196, 165–188 (1987)
Goldfeld, D., Stade, E., Woodbury, M.: An orthogonality relation for \(\text{GL}(4,R)\). In preprint. https://arxiv.org/abs/1910.13586 (2019)
Kitaoka, Y.: Fourier coefficients of Siegel cusp forms of degree two. Nagoya Math. J. 93, 149–171 (1984)
Man, S.H.: Symplectic Eisenstein series. In preprint. https://arxiv.org/abs/2003.06890 (2020)
Smith, R.A.: Estimates of exponential sums. Proc. Am. Math. Soc. 79(3), 365–368 (1980)
Stevens, G.: Poincaré series on \(\operatorname{GL}(r)\) and Kloostermann sums. Math. Ann. 277, 25–51 (1987)
Tóth, Á.: Symplectic Kloosterman sums. Studia Sci. Math. Hungar. 50, 143–158 (2013)
Weil, A.: On some exponential sums. Proc. Natl. Acad. Sci. 34, 204–207 (1948)
Acknowledgements
The author would like to thank Valentin Blomer for his guidance on the project.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The author was supported in part by the DAAD Graduate School Scholarship Programme.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Man, S.H. Symplectic Kloosterman sums and Poincaré series. Ramanujan J 57, 707–753 (2022). https://doi.org/10.1007/s11139-021-00498-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11139-021-00498-5