Abstract
We study the density of rational points on a higher-dimensional orbifold \((\mathbb {P}^{n-1},\Delta )\) when \(\Delta \) is a \(\mathbb {Q}\)-divisor involving hyperplanes. This allows us to address a question of Tanimoto about whether the set of rational points on such an orbifold constitutes a thin set. Our approach relies on the Hardy–Littlewood circle method to first study an asymptotic version of Waring’s problem for mixed powers. In doing so we make crucial use of the recent resolution of the main conjecture in Vinogradov’s mean value theorem, due to Bourgain–Demeter–Guth and Wooley.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
This paper is about the arithmetic of rational points on higher-dimensional orbifolds, in the spirit of Campana [4]. We shall be concerned with orbifolds \((\mathbb {P}^{n-1},\Delta )\), where \(\Delta \) is a \(\mathbb {Q}\)-divisor that takes the shape
for irreducible divisors \(D_0,\dots , D_r\) on \(\mathbb {P}^{n-1}\) and integers \(m_0,\dots ,m_r\geqslant 2\). The arithmetic of Campana-points on orbifolds interpolates between the theory of rational and integral points on classical algebraic varieties, thereby opening up a new field of enquiry.
The orbifold \((\mathbb {P}^{n-1},\Delta )\) is said to be smooth if the divisor \(\sum _{i=0}^r D_i\) is strict normal crossings and it is said to be log-Fano if \(-K_{\mathbb {P}^{n-1},\Delta }\) is ample, where \(K_{\mathbb {P}^{n-1},\Delta }=K_{\mathbb {P}^{n-1}}+\Delta \). Very recent work by Pieropan, Smeets, Tanimoto and Várilly-Alvarado [11] builds on the programme of Campana [4], by studying the distribution of Campana-points on vector group compactifications. Inspired by the Manin conjecture for rational points on Fano varieties [7], they formulate in [11, Conj. 1.1] a new prediction for the density of Campana-points of bounded height on arbitrary smooth log-Fano orbifolds.
We shall address this conjecture in the special case \((\mathbb {P}^{n-1},\Delta )\), when the divisors \(D_0,\dots ,D_r\) form a set of distinct hyperplanes in \(\mathbb {P}^{n-1}\), all defined over \(\mathbb {Q}\). Then \((\mathbb {P}^{n-1},\Delta )\) is log-Fano precisely when
Since \(m_i\geqslant 2\) this forces us to have \(r\leqslant 2(n-1).\) It turns out that the analysis is rather easy when \(r\leqslant n-1\), a case that is covered as a special case of concurrent work by Pieropan and Schindler [10] on toric varieties. The first really challenging case is when \(r=n\), in which case the condition for being log-Fano is
We shall take
for a fixed choice of non-zero integers \(c_0,\dots ,c_{n-1}\). We let
for given integers \(m_i\geqslant 2\). The Campana-points in \((\mathbb {P}^{n-1},\Delta )\) are defined to be the rational points \((x_0:\dots :x_{n-1})\in \mathbb {P}^{n-1}(\mathbb {Q})\), represented by primitive integer vectors \((x_0,\dots ,x_{n-1})\in \mathbb {Z}_{\ne 0}^{n}\) for which \(x_i\) is \(m_i\)-full for \(0\leqslant i\leqslant n-1\) and \(c_0x_0+\cdots +c_{n-1}x_{n-1}\) is \(m_n\)-full. Here, we recall that a non-zero integer x is said to be m-full if \(p^m\mid x\) whenever there is a prime p such that \(p\mid x\).
We employ the height function \(H: \mathbb {P}^{n-1}(\mathbb {Q})\rightarrow \mathbb {R}\), given by
if \((x_0,\dots ,x_{n-1})\in \mathbb {Z}^{n}\) is primitive. This is the standard exponential height associated to the line bundle \(L=\mathscr {O}_{\mathbb {P}^{n-1}}(1)\). The counting function of interest to us here is then
where \(\mathbf {x}=(x_0,\dots ,x_n)\) and \(|\mathbf {x}|=\max _{0\leqslant i\leqslant n}|x_i|.\) If [L] denotes the divisor class of a hyperplane section in \(\mathbb {P}^{n-1}\) and \(\Lambda _{\text {eff}}\) is the effective cone of \(\mathbb {P}^{n-1}\), then we define
Moreover, we define \(b=b(L,\Delta )\) to be the codimension of the minimal supported face of \(\Lambda _\text {eff}\) that contains the class \(a [L]+[K_{\mathbb {P}^{n-1}}+\Delta ]\). With these definitions in mind, the conjecture [11, Conj. 1.1] predicts that there exists a constant c such that
as \(B\rightarrow \infty \). (As a matter of fact, [11, Conj. 1.1] allows for the removal of a thin set of rational points from \(\mathbb {P}^{n-1}(\mathbb {Q})\), a topic that we shall return to in our discussion of Theorem 1.2 below.) In our example, the degree function gives an isomorphism \({{\,\mathrm{Pic}\,}}(\mathbb {P}^{n-1}) \cong \mathbb {Z}\). Under this isomorphism \(\Lambda _{\text {eff}}\cong \mathbb {R}_{\geqslant 0}\) and the line bundle L has degree 1. Moreover, \(K_{\mathbb {P}^{n-1}}\) has degree \(-n\) and \(\Delta \) has degree \(\sum _{i=0}^n (1-\frac{1}{m_i})\). Hence
Furthermore, the minimal supported face of \(\Lambda _{\text {eff}}\) which contains 0 is \(\{0\}\), which has codimension 1 in \(\Lambda _{\text {eff}}\), whence \(b = 1\).
In the special case \(m_0=\cdots =m_n=2\), work of Van Valckenborgh [14] establishes an asymptotic formula for \(N(\mathbb {P}^{n-1},\Delta ;B)\) for all \(n\geqslant 4\), which agrees with (1.2). Drawing inspiration from this, we have the following generalisation, which is also in accordance with (1.2).
Theorem 1.1
Assume that \(m_0,\dots ,m_{n}\geqslant 2\) and
for some \(j\in \{0,\dots ,n\}\). Then there exist constants \(c\geqslant 0\) and \(\eta >0\) such that
The implied constant in this estimate is allowed to depend on \(m_0,\dots ,m_n, n\) and \(c_0,\dots , c_{n-1}\), a convention that we shall adopt for all of the implied constants in this paper. There is an explicit expression for the leading constant c in (3.14) and (3.15), as a convergent sum of local densities. It can be shown that \(c>0\) if the underlying equations admit suitable non-singular solutions everywhere locally.
Let us return briefly to the case \(m_0=\cdots =m_n=2\), so that \(a=\frac{n-1}{2}\) and \(b=1\). When \(n=3\) we have the following lower bound, in which the exponent of \(\log B\) is at odds with the asymptotic formula (1.2).
Theorem 1.2
Let \(n=3\) and \(m_0=m_1=m_2=m_3=2\). Then
As explained by Pieropan, Smeets, Tanimoto and Várilly-Alvarado [11, § 3.5], the points contributing to the lower bound for \(N(\mathbb {P}^{2},\Delta ;B)\) in this result actually correspond to rational points contained in a thin set in \(\mathbb {P}^{2}(\mathbb {Q}).\) Let X be an integral variety over \(\mathbb {Q}\). Recall from Serre [13, §3.1] that a thin set is a set contained in a finite union of thin subsets of \(X(\mathbb {Q})\) of type I and II. Here, a type I thin subset is a set of the form \(Z(\mathbb {Q}) \subset X(\mathbb {Q})\), where Z is a proper closed subvariety, and a type II thin subset is a set of the form \(f(Y(\mathbb {Q}))\), where \(f : Y \rightarrow X\) is a generically finite dominant morphism with \(\dim Y=\dim X\), \(\deg f \geqslant 2\) and Y geometrically integral. Theorem 1.2 illustrates that it is important to allow the possibility of removing thin sets of rational points from the statement of [11, Conj. 1.1].
On the other hand, when \(m_0=m_1=m_2=2\) and \(n=2\) we expect the counting function to satisfy an asymptotic formula of the form (1.2) with associated constants \(a=\frac{1}{2}\) and \(b=1\). In fact, Browning and Van Valckenborgh [2] have produced an explicit constant \(c>0\) such that
as \(B\rightarrow \infty \).
A well-known result of Cohen [5], as expounded by Serre [12, Thm. 13.3], states that the set \(\mathbb {P}^{n-1}(\mathbb {Q})\) is not thin. At the workshop “Rational and integral points via analytic and geometric methods” in Oaxaca (May 27th–June 1st, 2018), Sho Tanimoto raised the question of whether the same is true for the set of Campana-points. Our next goal is to provide some partial evidence in favour of this.
Associated to any type II thin subset \(\Omega \subset \mathbb {P}^{n-1}(\mathbb {Q})\) coming from a morphism \(Y \rightarrow \mathbb {P}^{n-1}\) of degree \(d\geqslant 2\) is a degree d extension of function fields \(\mathbb {Q}(Y)/\mathbb {Q}(t_1,\dots ,t_{n-1})\). We let \(\mathbb {Q}(Y )^{\mathrm {Gal}}\) be the Galois closure of \(\mathbb {Q}(Y)\) over the function field \(\mathbb {Q}(t_1,\dots ,t_{n-1})\) of \(\mathbb {P}^{n-1}\) and we let \(\mathbb {Q}_\Omega \subset \mathbb {Q}(Y )^{\mathrm {Gal}}\) be the largest subfield that is algebraic over \(\mathbb {Q}\). Finally we let \({\mathsf {P}}_\Omega \) be the set of rational primes that split completely in \(\mathbb {Q}_\Omega \). It follows from the Chebotarev density theorem that \({\mathsf {P}}_\Omega \) has density \([\mathbb {Q}_\Omega :\mathbb {Q}]^{-1}\) in the set of primes, since \(\mathbb {Q}_\Omega /\mathbb {Q}\) is Galois. Next, let
for any \(\mathbf {m}=(m_0,\dots ,m_n)\in \mathbb {Z}_{\geqslant 2}^{n+1}\). The following result provides an explicit condition on the possible thin sets in \(\mathbb {P}^{n-1}(\mathbb {Q})\) that the Campana-points in \((\mathbb {P}^{n-1},\Delta )\) are allowed to lie in.
Theorem 1.3
Assume that \(m_0,\dots ,m_n\geqslant 2\) and (1.3) holds. Let \(\Omega \) be a thin set contained in a finite union \(\bigcup _{i}\Omega _i\), where each \(\Omega _i\subset \mathbb {P}^{n-1}(\mathbb {Q})\) is a thin subset of type I or II. Assume that
whenever \(\Omega _i\) is type II. Then
where \(N_{\Omega }(\mathbb {P}^{n-1},\Delta ;B)\) is defined as in (1.1), but with the additional constraint that the point \((x_0:\dots :x_{n-1})\in \Omega \).
Assuming that (1.3) holds, we may combine this result with Theorem 1.1 to deduce that the Campana-points in \((\mathbb {P}^{n-1},\Delta )\) are not contained in any thin subset of \(\mathbb {P}^{n-1}(\mathbb {Q})\) satisfying the hypotheses of the theorem.
The statement of this result is rather disappointing at first glance, but in fact the conclusion is false when the condition (1.5) is dropped. To see this, take \(m_0=\cdots =m_n=3\) and \(n\geqslant 12\). Then \(\sum _{i=0}^n \frac{1}{m_i}-1=\frac{n-2}{3}\) and (1.3) holds in Theorem 1.1. Consider the thin set \(\Omega _0\subset \mathbb {P}^{n-1}(\mathbb {Q})\) that arises from the morphism
where \(Z\subset \mathbb {P}^{n}\) is the cubic hypersurface \(x_0^3+\cdots +x_{n-1}^3=x_n^3\). Then the counting function \(N_{\Omega _0}(\mathbb {P}^{n-1},\Delta ;B)\) has exact order \(B^{\frac{n-2}{3}}\) for sufficiently large n. However, (1.5) fails in this case. Indeed, \({\mathsf {Q}}_{\mathbf {m}}\) is the set of primes \(p\not \equiv 1 \bmod {3}\), whereas \({\mathsf {P}}_{\Omega _0}\) is the set of primes \(p \equiv 1 \bmod {3}\), since \(\mathbb {Q}_{\Omega _0}=\mathbb {Q}(\sqrt{-3})\). This shows that it is hard to approach Tanimoto’s question in full generality through counting arguments alone.
The hypothesis (1.5) is a little awkward to work with. If one restricts attention to \(\mathbf {m}\) such that
then \({\mathsf {Q}}_\mathbf {m}\) is equal to the full set of rational primes. Moreover, it follows from the Chebotarev density theorem that \({\mathsf {P}}_{\Omega }\) has density \([\mathbb {Q}_{\Omega }:\mathbb {Q}]^{-1}\), for any type II thin subset \(\Omega \). Thus the conditions of Theorem 1.3 are met for any thin set. However, the assumption (1.3) is too stringent to cope with a sequence of integers \(\geqslant 2\) that satisfies (1.6).
Our proof of Theorems 1.1–1.3 relies on an explicit description of m-full integers x. For such integers every exponent of a prime appearing in the prime factorisation of x can be written \(k m + (m + r)\), for integers \(k \geqslant 0\) and \(0 \leqslant r < m\). Thus any non-zero m-full integer x can be written uniquely in the form
for \(u, v_1, \ldots , v_{m - 1} \in \mathbb {N}\), such that \(\mu ^2(v_r) =1\) for \(1 \leqslant r \leqslant m-1\) and \(\gcd (v_r, v_{r'}) = 1\) for \(1 \leqslant r < r' \leqslant m-1\).
It may be instructive to illustrate this notation by discussing the special case \(m_0=\cdots =m_n=2\), in which case Campana-points in \((\mathbb {P}^{n-1},\Delta )\) correspond to vectors \(\mathbf {u},\mathbf {v}\in \mathbb {N}^{n+1}\) and \(\varvec{\epsilon }\in \{\pm 1\}^{n+1}\) with each \(v_j\) square-free, for which
When \(n=3\) we can clearly find vectors \(\mathbf {v}\in \mathbb {N}^4\) with square-free components and \(\varvec{\epsilon }\in \{\pm 1\}^4\) in such a way that
Fixing such a choice and applying [8, Thm. 7] to estimate the residual number of \(\mathbf {u}\in \mathbb {N}^{4}\) that lie on the split quadric, with \(u_j\leqslant \sqrt{B/v_j^3}\) for \(0\leqslant j\leqslant 3\), we readily deduce that \(N(\mathbb {P}^{2},\Delta ;B)\gg B\log B\), as claimed in Theorem 1.2
Returning now to the case of general \(m_0,\dots ,m_n\geqslant 2\), we summarise the structure of the paper. Under the representation (1.7) it follows that Campana-points on \((\mathbb {P}^{n-1},\Delta )\) can be viewed through the lens of Waring’s problem for mixed exponents. Given its proximity to Vinogradov’s mean value theorem, this is an area that has received a radical new injection of ideas at the hands of Wooley [16, 18, 19] and Bourgain, Demeter and Guth [1]. Based on this, in Sect. 2 we shall give a completely general treatment of the counting function associated to suitably constrained integer solutions to the Diophantine equation
for given \(N\in \mathbb {Z}\) and non-zero \(c_j,\gamma _j\in \mathbb {Z}\), in which the vectors \(\mathbf {u}\) are asked to lie in a congruence class modulo H. In this part of the argument we shall need to retain uniformity in the coefficients \(\gamma _j\) and in the modulus H. It is here that the condition (1.3) arises. The resulting asymptotic formula is recorded in Theorem 2.7. In Sect. 3 we shall use Theorem 2.7 to establish the version of orbifold Manin that we have presented in Theorem 1.1. One of the chief difficulties in this part of the argument comes from dealing with the coprimality conditions implicit in the counting function \(N(\mathbb {P}^{n-1},\Delta ;B)\). Next, in Sect. 4 we shall combine Theorem 2.7 with information about the size of thin sets modulo p (for many primes p) to tackle Theorem 1.3.
Finally, when \(H=1\) and \(c_j=\gamma _j=1\) for all \(0\leqslant j\leqslant n\), it is easy to derive from Theorem 2.7 an asymptotic formula for the mixed Waring problem. The following result may be of independent interest.
Theorem 1.4
Assume that \(m_0,\dots ,m_n\geqslant 2\) and (1.3) holds. Let R(N) denote the number of representations of a positive integer N as
Then there exists \(\eta >0\) such that
where \(\mathfrak {S}(N)\) is given by (2.19).
There is relatively little in the literature concerning asymptotic formulae for R(N) for mixed exponents. The best result is due to Brüdern [3] who obtains an asymptotic formula for R(N) when \(m_0=m_1=2\), under some further conditions on the exponents, the most demanding of which is that
Theorem 1.4 is not competitive with this, although it does not suffer from the defect that 2 must appear twice among the list of exponents. It remains an interesting open challenge to prove an asymptotic formula for R(N) for any value of n, when \(m_i=2+i\) for \(0\leqslant i\leqslant n\).
When \(m = m_0 = \cdots = m_n\), which is the traditional setting of Waring’s problem, the condition in (1.3) reduces to \( n \geqslant m^2+m. \) This shows that our approach is not completely optimal in the equal exponent situation, since as explained in [19, Cor. 14.7], we know that \(n\geqslant m^2-m+O(\sqrt{m})\) variables suffice to get an asymptotic formula in Waring’s problem. It seems likely that by combining methods developed by Wooley in [17] and [19, §14], one can recover this loss. (The authors are grateful to Professor Wooley for this remark.)
2 The Hardy–Littlewood circle method
We shall assume without loss of generality that \(2 \leqslant m_0\leqslant m_1 \leqslant \cdots \leqslant m_n\). Our assumption (1.3) translates into
In what follows it will be convenient to set
Let \(N\in \mathbb {Z}\) and let \(\mathbf {c}=(c_0,\dots ,c_n)\in (\mathbb {Z} \backslash \{ 0 \})^{n+1}\). Let \(H \in \mathbb {N}\), \(\varvec{\gamma } \in \mathbb {N}^{n+1}\) and let \(\mathbf {h} \in \{ 0, 1, \ldots , H-1 \}^{n+1}\). The main results in this paper are founded on an analysis of the counting function
We shall view \(\mathbf {c}\) as being fixed, once and for all, but \(\varvec{\gamma }\) can grow and so we will need all of our estimates to depend explicitly on it. In Theorems 1.1 and 1.3 we shall take \(N=0\) and \(c_n=-1\), whereas in Theorem 1.4 we take \(H=1\), \(c_j=\gamma _j=1\) and \(B =N\).
We let \(e(z) = \exp (2 \pi i z)\) for any \(z \in \mathbb {R}\). Let
and
for \(0\leqslant j \leqslant n\). Then we may write
where
Note that we may freely assume that \(\gamma _j \leqslant B\) for \(0 \leqslant j \leqslant n\), since otherwise \(M_{\varvec{\mathbf {c} ; \gamma } }(B ; \mathbf {h},{H};N ) = 0\). Let \(\delta \) be such that
We define the major arcs \(\mathfrak {M}\) to be
where
We define the minor arcs to be \(\mathfrak {m} = [0,1) \backslash \mathfrak {M}\).
2.1 Contribution from the major arcs
In the standard way we shall need to show that on the major arcs our exponential sums can be approximated by integrals, with acceptable error. The following result is a straightforward adaptation of familiar facts.
Lemma 2.1
Let \(h,H \in \mathbb {N} \cup \{0\}\) with \(0 \leqslant h < H\). Let \(X \geqslant 1\). Let \(a \in \mathbb {Z}\), \(q \in \mathbb {N}\), \(\beta \in \mathbb {R}\) and \(\alpha = a/q + \beta \). Then
Proof
Let \(X' = (X-h)/H\). If \(X'<q\) then the absolute value of the left hand side is trivially bounded by \(q+1\), and so we may proceed under the assumption that \(X' \geqslant q\). We write
The inner sum is
An application of the Euler–Maclaurin summation formula to this sum now yields the result. \(\square \)
Now let \(\alpha = a/q + \beta \in \mathfrak {M}(a,q)\). We apply Lemma 2.1 with \(X=B_{j}\), and \(\alpha \) (resp. a) replaced by \(\alpha c_{j} \gamma _{j}\) (resp. \(a c_{j} \gamma _{j}\)). Thus \(\alpha c_{j} \gamma _{j} - a c_{j} \gamma _{j}/q = \beta c_{j} \gamma _{j}\) and
Put
and set \(\mathfrak {S}_{ \mathbf {c} ; {\varvec{\gamma }} }(L ; \mathbf {h},{H};N)\) to be
for any \(L>1\). Then it follows from Lemma 2.1 that
where
Taking \(H\geqslant 1\) and observing that \(B_j\geqslant 1\) for all \(0\leqslant j\leqslant n\) we see that
On executing the sum over q we therefore conclude that
It remains to analyse the terms \(\mathfrak {S}_{\mathbf {c};\varvec{\gamma } }(B^{\delta } ; \mathbf {h},{H};N)\) and \(\mathfrak {J}_{\mathbf {c}}(B^{\delta })\) .
Beginning with the singular series, it follows from [15, Theorem 7.1] that
for any \(\varepsilon > 0\). Therefore
where
Put
Clearly \(E_2(\varvec{\gamma }; H ; B^{\delta }, \infty )\leqslant B^{-\delta } E_2(\varvec{\gamma }; H )\) and \(E_2(\varvec{\gamma }; H ; 0, \infty )\leqslant E_2(\varvec{\gamma }; H )\).
In view of (2.1), we have
Let us define
This is absolutely convergent, since (2.10) yields
Moreover,
Turning to the singular integral, it follows from [15, Lemma 2.8] that
Thus, in view of (2.10), we deduce that
Hence
is well-defined, and we have
We are now ready to conclude our treatment of the major arcs. Note that
On combining (2.6), (2.12), (2.13) and (2.14), we therefore obtain the following result.
Lemma 2.2
Assume that (2.10) holds. Then
2.2 Contribution from the minor arcs
According to work of Wooley [19, Eq. (1.8)], the main conjecture in Vinogradov’s mean value theorem asserts that for each \(\varepsilon > 0\) and \(t, k \in \mathbb {N}\), one has
This result was established recently by Bourgain, Demeter and Guth [1] using \(\ell ^2\)-decoupling and also by Wooley [18, 19] using efficient congruencing. The following mean value estimate is a straightforward consequence of their work.
Lemma 2.3
Let \(k \in \mathbb {N}\) and let s be a real number satisfying \(s \geqslant k(k+1)\). Let \(A, H \in \mathbb {Z} \backslash \{0\}\) and \(h \in \mathbb {Z}\). Then we have
where the implied constant does not depend on A, H or h.
Proof
Let 2t be the largest even integer such that \(2t\leqslant s\). Then it follows that \(t \geqslant k(k+1)/2\). Hence
On considering the underlying equations of the integral on the right hand side using the orthogonality relation, we deduce that the integral on the right hand side is the number of
such that
It therefore follows that
where \(I(\mathbf {n})\) is the number of (2.16) satisfying (2.17), for which
for \(1 \leqslant j < k\). We may clearly write
where \( \varvec{\alpha }=(\alpha _k,\dots ,\alpha _1)\) and \( \varvec{\alpha }'=(\alpha _{k-1},\dots ,\alpha _1)\). Summing trivially over \(\mathbf {n}\), the right hand side of our estimate is
where the last equality is an immediate consequence of the fact that
An application of (2.15) now yields our result. \(\square \)
We also require the following Weyl type estimate, which is another consequence of the recent work on Vinogradov’s mean value theorem. We omit the proof since it is obtained by invoking the main conjecture (2.15) in the proof of [16, Theorem 1.5].
Lemma 2.4
Let \(k \geqslant 2\) and let \(\alpha _k, \ldots , \alpha _1 \in \mathbb {R}\). Suppose there exist \(a \in \mathbb {Z}\) and \(q \in \mathbb {N}\) with \(\gcd (a,q)=1\) satisfying \(| \alpha _k - a/q | \leqslant q^{-2}\) and \(q \leqslant X^k\). Let
Then
for any \(\varepsilon >0\).
Using this result we obtain the following bound for the exponential sum on the minor arcs.
Lemma 2.5
Let \(\varepsilon >0\). Then
Proof
It will be convenient throughout the proof to write
Let \(\alpha \in \mathfrak {m}\) and let \(\beta = \alpha c_{n} \gamma _{n} H^{m_n}\). We put
When \({\widetilde{B}} \leqslant 1\) it is clear that \(S_{n} (\alpha ) \ll 1\). Since \(\gamma _{n} \leqslant B\) we have
in this case. Thus we may suppose that \({\widetilde{B}} > 1\).
By Dirichlet’s theorem on Diophantine approximation we know there exist \(b \in \mathbb {Z}\) and \(1 \leqslant r \leqslant {\widetilde{B}}\) such that \(\gcd (b,r)=1\) and
Note that \(b\ne 0\) since \(\alpha \in \mathfrak {m}\). For simplicity let us write \(A = c_{n} \gamma _{n} H^{m_n}\). We claim that \(b A > 0\). But if \(b A < 0\) then
since \(\alpha >0\), which is a contradiction. This establishes the claim. Let \(A' = A / \gcd (A, b)\) and \(b' = b / \gcd (A, b)\).
Let \(X = (B_{n} - h_{n})/H\). First suppose \(B_{n}/2H > X\). Then \(B_{n}< 2 h_{n} < 2 H.\) In this case we clearly have \( S_{n} (\alpha ) \ll 1\), which is satisfactory. Thus we suppose \(B_{n}/2H \leqslant X\). In this case \(r \leqslant {\widetilde{B}} \leqslant X^{m_n}\) and Lemma 2.4 yields
for any \(\varepsilon >0\). Next, we note that
If \(2B^{1 - \delta } \leqslant B/|c_{n} \gamma _{n} H^{m_n} 2^{m_n} |\) it follows that
On the other hand, if \(2B^{1 - \delta } > B/|c_{n} \gamma _{n} H^{m_n} 2^{m_n} |\) then
We now verify that \(1 \leqslant |b'| \leqslant r |A'|\). We’ve already seen that \(|b'|\geqslant 1\), so we suppose that \(|b'| > r |A'|\). Since \(\alpha \in [0,1)\) we have
whence \( 1 < {\widetilde{B}} \leqslant |A'|/|A| \leqslant 1. \) This is a contradiction, so that we do indeed have \(1 \leqslant |b'| \leqslant r |A'|\). We also have \(\gcd (r|A'|, |b'|) = 1\). Finally, \(\alpha \in \mathfrak {M}\) if \(r|A'| \leqslant B^{\delta }\) and B exceeds an appropriate constant. But this is a contradiction, so that \(r |A'| > B^{\delta }\) and (2.18) becomes
This completes the proof of the lemma, since \(\sigma (m_n)-1/m_n=-1/(m_n+1)\). \(\square \)
We now have the tools in place to establish the following bound for the minor arc contribiution.
Lemma 2.6
Assume that (2.1) holds and let \(\varepsilon >0\). Then
Proof
Let \(\ell _n=m_n(m_n+1)\) and let \(\ell _{0},\cdots ,\ell _{n-1}>0\) be such that
In the light of (2.1) we can assume that \(\ell _{j} \geqslant m_j(m_j+1)\) for all \(0\leqslant j\leqslant n-1\). It now follows from Hölder’s inequality and Lemma 2.3 that
since \(H\geqslant 1\) and \(\gamma _j\leqslant B\) for all \(0\leqslant j\leqslant n-1\). We apply Lemma 2.5 to estimate \(S_n(\alpha )\). Finally, observing that
for all \(0\leqslant j\leqslant n-1\), we obtain
as required. \(\square \)
2.3 Final estimate
We may now bring together Lemmas 2.2 and 2.6 in (2.4), in order to record the following estimate for the counting function (2.3).
Theorem 2.7
Assume that \(2\leqslant m_0\leqslant \cdots \leqslant m_n\) and (2.1) holds. Let \(\delta \) satisfy (2.5) and let \(\varepsilon >0\). Then
where \(E_1\) and \(E_2\) are given by (2.7) and (2.9), respectively.
We end this section by indicating how this implies Theorem 1.4, for which we observe that \(M_{\mathbf {c};\varvec{\gamma } } (B; \mathbf {h},{H};N )=R(N)\) when \(H=1\), \(B=N\) and \(c_j=\gamma _j=1\) for \(0\leqslant j\leqslant n\). The error term is clearly in the desired shape and recourse to (2.11) shows that \(\mathfrak {S}_{ \mathbf {c};\varvec{\gamma } } ( \mathbf {h},{H};N)=\mathfrak {S}(N)\), with
Finally, the standard arguments described in Chapter 4 of Davenport [6] readily yield
This therefore completes the proof of Theorem 1.4.
3 Orbifold Manin: proof of Theorem 1.1
We now turn to the task of proving an asymptotic formula for the counting function \(N(\mathbb {P}^{n-1},\Delta ;B)\) in Theorem 1.1. We shall assume without loss of generality that \(2\leqslant m_0\leqslant \cdots \leqslant m_{n}\), so that (1.3) implies (2.1). The counting function can be written
where we henceforth follow the convention that \(c_n=-1\). In view of (1.7), we may write
Suppose that we are given vectors \(\mathbf {s}\) and \(\mathbf {t}\) with coordinates \(s_j\in \mathbb {N}\) and \(t_{j,r}\in \mathbb {N}\) for \(0\leqslant j\leqslant n\) and \( 1 \leqslant r \leqslant m_j-1\). It will be convenient to introduce the set
Given \(\varvec{\epsilon } \in \{ \pm 1 \}^{n+1}\) let \(\varvec{\epsilon } \mathbf {c}\) denote the vector with coordinates \(\epsilon _{j} c_{j}\). Then
where \(\mathbf {1}\) is the vector with all coordinates equal to 1.
We need to develop an inclusion-exclusion argument to cope with the coprimality condition in this expression. To ease notation we replace \(\varvec{\epsilon } \mathbf {c}\) by \( \mathbf {c}\). Let \(\mathbf {x} \in \mathscr {N}_{\mathbf {c}}(B; \mathbf {1}, \mathbf {1})\). It is clear that \(\gcd (x_0, \dots , x_n) > 1\) if and only if there exists a prime p and a subset \(I \subseteq \{0,\dots ,n\}\) for which \(p\mid u_{j}\) for all \(j \in I\) and also \(p \mid \prod _{r=1}^{m_j-1} v_{j,r}\) for all \(j \not \in I\). (Note that I is allowed to be the empty set here.)
Let \(\mathscr {G}\) denote the set of all possible vectors \(\mathbf {g}\in \mathbb {N}^{n+1}\) with \(1\leqslant g_{j}\leqslant m_j-1\) for \(0\leqslant j\leqslant n\). Let \({\mathsf {P}} = \{ 2, 3, 5, \ldots \}\) denote the set of primes and let \(\mathscr {R}\) be a non-empty finite collection of triples \((\mathbf {g}; p; I)\) where \(\mathbf {g} \in \mathscr {G}\), \(p \in {\mathsf {P}}\) and (possibly empty) \(I \subseteq \{0,\dots ,n\}\). Let \(\mathscr {R}(p)\) be the subset of \(\mathscr {R}\) containing all the triples in \(\mathscr {R}\) with prime p. In what follows we adhere to common convention and stipulate that a union over the empty set is the empty set and a product over the empty set is 1. We let
Next, we define \(\mathbf {a}(\mathscr {R})\) to be the vector in \(\mathbb {N}^{n+1}\) with coordinates
and we define \(\mathbf {b}(\mathscr {R})\) to be the vector in \(\mathbb {N}^{\sum _{j=0}^n (m_j - 1)}\) with coordinates
It is easy to see that \((\mathbf {a}(\mathscr {R}), \mathbf {b}(\mathscr {R})) \not = (\mathbf {1}, \mathbf {1})\) as soon as \(\mathscr {R} \not = \emptyset \). Moreover, when \(\mathscr {R} = \{ ( \mathbf {g}; {p};I) \}\) we see that \(\mathscr {N}_{\mathbf {c}}(B ; \mathbf {a}( \mathbf {g}; {p};I), \mathbf {b}( \mathbf {g}; {p};{I} ) )\) is precisely the set of \(\mathbf {x} \in \mathscr {N}_{\mathbf {c}} (B; \mathbf {1}, \mathbf {1})\) satisfying \(p\mid u_j\) for all \(j \in I\) and \(p\mid v_{j, g_j}\) for all \(j \not \in I\). In particular, it is now clear that
We proceed by establishing the following result.
Lemma 3.1
Given any \(\mathscr {R} \not = \emptyset \), we have
Proof
Let \(\mathbf {x}\) belong to the intersection on the right hand side. Then, given any \((\mathbf {g}; p; I) \in \mathscr {R}\), we have \(p \mid u_{j}\) for all \(j \in I\) and \(p \mid v_{j,r}\) if \(j \not \in I\) and \(r = g_{j}\), where \(x_j = u_j^{m_j} \prod _{r=1}^{m_j-1} v^{m_j + r}_{j,r}\). Therefore, \(p \mid u_{j}\) for all p such that \(j \in I(\mathscr {R}(p))\) and \(p \mid v_{j,r}\) for all p such that
Thus (3.2) and (3.3) imply that \(a_{j} \mid u_{j}\) and \(b_{j,r} \mid v_{j,r}\), whence it follows that \(\mathbf {x} \in \mathscr {N}_{\mathbf {c}}(B ; \mathbf {a}(\mathscr {R}), \mathbf {b}(\mathscr {R}) )\). On the other hand, if \(\mathbf {x} \in \mathscr {N}_{\mathbf {c}}(B ; \mathbf {a}(\mathscr {R}), \mathbf {b}(\mathscr {R}) )\) then we may reverse the argument to deduce that \(\mathbf {x}\) also belongs to the intersection of all the sets \(\mathscr {N}_{\mathbf {c}}(B ; \mathbf {a}( \mathbf {g}; {p};I), \mathbf {b}( \mathbf {g}; {p};{I} ) )\) for \((\mathbf {g}; p; I) \in \mathscr {R}\). This completes the proof of the lemma. \(\square \)
Given vectors \(\mathbf {s}\) and \(\mathbf {t}\) composed from positive integers, let
We henceforth set
Then, on combining the inclusion-exclusion principle with Lemma 3.1, we obtain
Note that \(\# \mathscr {N}_{\mathbf {c}}(B; \mathbf {a}(\mathscr {R}), \mathbf {b}(\mathscr {R}) ) = 0\) when \(\# \mathscr {R}\) is sufficiently large with respect to B. Bringing this together with (3.4), we conclude that
It remains to asymptotically estimate these quantities.
We collect together some properties of the function \(\varpi (\mathbf {s}, \mathbf {t})\).
Lemma 3.2
Let \((\mathbf {s}, \mathbf {t}) \not = (\mathbf {1}, \mathbf {1})\) and let \(p \in {\mathsf {P}}\). We let \(\mathbf {s}^{[p]}\) be the vector whose jth coordinate is \(s_{j}^{[p]} = p^{\mathrm{val}_p(s_{j})}\) and \(\mathbf {t}^{[p]}\) be the vector whose (j, r)th coordinate is \(t_{j,r}^{[p]} = p^{\mathrm{val}_p(t_{j,r})}\). Then the following are true:
-
(i)
\(\varpi (\mathbf {s}, \mathbf {t}) = \prod _{p \in {\mathsf {P}} } \varpi ( \mathbf {s}^{[p]}, \mathbf {t}^{[p]} )\);
-
(ii)
\(\varpi (\mathbf {s}, \mathbf {t} ) = 0\) if one of the coordinates of \(\mathbf {s}\) or \(\mathbf {t}\) is divisible by \(p^2\);
-
(iii)
\(\varpi (\mathbf {s}, \mathbf {t} ) = 0\) if one of the coordinates of \(\mathbf {s}\) or \(\mathbf {t}\) is divisible by p, but there exists \(0 \leqslant j \leqslant n\) with \(s^{[p]}_{j} = t^{[p]}_{j,1} = \cdots = t^{[p]}_{j,m_j-1} = 1\); and
-
(iv)
\( \varpi ( \mathbf {s}^{[p]}, \mathbf {t}^{[p]} ) \ll 1.\)
Proof
It follows from the definitions of \(\mathbf {a}(\mathscr {R})\) and \( \mathbf {b}(\mathscr {R})\) that
where we define multiplication of vectors by multiplying the corresponding coordinates. We clearly have \((\mathbf {s}, \mathbf {t}) = \prod _{p \in {\mathsf {P}}} (\mathbf {s}^{[p]}, \mathbf {t}^{[p]})\) and \(\# \mathscr {R} = \sum _{p \in {\mathsf {P}} } \# \mathscr {R}(p)\). Thus
where
It follows that
which thereby establishes (i).
To prove (ii) we note that it is not possible for \(p^2\) to divide any coordinate of \(\mathbf {a}(\mathscr {R}(p) )\) or \(\mathbf {b}(\mathscr {R}(p))\) for any prime p and \(\mathscr {R} \not = \emptyset \). Thus \(\varpi (\mathbf {s}^{[p]}, \mathbf {t}^{[p]}) = 0\) if one of the coordinates of \(\mathbf {s}^{[p]}\) or \(\mathbf {t}^{[p]}\) is divisible by \(p^2\).
Next, to prove (iii) let \((\mathbf {s}, \mathbf {t}) \not = (\mathbf {1}, \mathbf {1})\) and assume without loss of generality that \(p \mid s_1 t_{1,1} \dots t_{1, m_1-1}\) and \(s^{[p]}_{2} = t^{[p]}_{2,1} = \cdots = t^{[p]}_{2,m_{2}-1} =1\). Suppose there exists \(\mathscr {R}\) such that \((\mathbf {s}, \mathbf {t}) = (\mathbf {a}(\mathscr {R}), \mathbf {b}(\mathscr {R}) )\). Then we have \(\mathscr {R}(p) \not = \emptyset \), and also
If \(2 \in I(\mathscr {R}(p))\) then \(p\mid s_{2}\). On the other hand, if \(2 \in J(\mathbf {g}; \mathscr {R}(p))\) then \(p\mid t_{2, g_{2}}\). In either case we have a contradiction, whence \(\varpi (\mathbf {s}, \mathbf {t} ) = 0\).
Finally, to prove (iv) we note there are only O(1) options for \(\mathscr {R}(p)\) for any fixed \(p \in {\mathsf {P}}\). It now follows from the definition that
as required. \(\square \)
Given \(\varepsilon > 0\), it follows from Lemma 3.2 that
We now proceed by studying
Let
Then
where the indices run over \(0 \leqslant j \leqslant n\) and \(1 \leqslant r < r' \leqslant m_j-1\). For each \(\mathbf {s}\) and \(\mathbf {t}\) we let
denote the sum over all \(\mathbf {v}\) satisfying \(\gamma _{j} \leqslant B\) , \(\gcd \left( {v}_{j,r} t_{j,r}, {v}_{j,r'} t_{j,r'} \right) = 1\) and \( \mu ^2 \left( {v}_{j,r} t_{j,r} \right) = 1\). (If there is no \(\mathbf {v}\) which satisfies the above conditions then the sum is considered to be 0.) We may now write
where \(M_{\mathbf {c}; \varvec{\gamma } }(B )=M_{\mathbf {c}; \varvec{\gamma } }(B ; \mathbf {0},{1};0)\), in the notation (2.3). Guided by Lemma 3.2, we let
denote the sum over \(\mathbf {s},\mathbf {t}\) satisfying \(s_{j}^{m_j} \prod _{r=1}^{m_j-1} t_{j,r}^{m_j + r}\leqslant B\) and \(\gcd \left( t_{j,r}, t_{j,r'} \right) = 1\), together with the condition that none of the coordinates of \(\mathbf {s}\) or \(\mathbf {t}\) is divisible by \(p^2\) for any prime p and if one of the coordinates of \(\mathbf {s}\) or \(\mathbf {t}\) is divisible by p then \(p \mid s_j t_{j,1} \dots t_{j,m_j-1}\) for all \(0 \leqslant j \leqslant n\).
We want to apply Theorem 2.7 with \(H=1\) and \(N=0\). Let \(\delta >0\) satisfy (2.5) and let \(\mathfrak {S}_{ \mathbf {c};\varvec{\gamma } } = \mathfrak {S}_{ \mathbf {c};\varvec{\gamma } } ( { \mathbf {0}},{1};0)\). Then, on appealing to Lemma 3.2 and (3.6), we deduce that
for any \(\varepsilon >0\), where
Moreover, in view of (2.7) and (2.9), the error terms are given by
We now need to estimate these three error terms. In doing so it will be convenient to set \(w_{j} = v_{j,1}^{m_j+1} \dots v_{j, m_j-1}^{2m_j - 1}\) and \(\tau _{j} = s_{j}^{m_j} \prod _{r=1}^{m_j-1} t_{j,r}^{m_j + r}\).
Now for any \(\tau \geqslant 1\), we have
Similarly,
Using these estimates it follows that
where we recall that \(\tau _{j} = s_{j}^{m_j} \prod _{r=1}^{m_j-1} t_{j,r}^{m_j + r}\). Lemma 3.2 now yields
since \(\sum _{j=0}^n m_j/(m_j + 1)>1\). (Note that the factor \(2m_j-1\) on the right hand side comes from taking into account the \(2m_j-1\) possibilities where the factor p appears in \(s_j\) or \(t_j\).) We have therefore shown that
Turning to the estimation of \(F_2(B)\), we may write
where
We first show that
if \(r\geqslant 1\). To see this we note that the left hand side is at most
When \(\mu ^2(x)=1\), any \(d\mid x^{m+r}\) admits a factorisation \(d = d_1d_2^2 \dots d_{m+r}^{m+r}\) such that \(d_1\dots d_{m+r}\mid x\), where \(\mu ^2(d_i)=1\) and \(\gcd (d_i, d_j) = 1\) for \(i \not = j\). If we write \(x = x' d_1 \dots d_{m+r}\), then this sum is
The inner \(x'\)-sum is absolutely convergent since \(r\geqslant 1\). The remaining sum over \(d\mid q\) is \(O(q^\varepsilon )\) for any \(\varepsilon >0\), by the standard estimate for the divisor function. This therefore establishes (3.9).
An application of (3.9) immediately yields
for any \(\varepsilon >0\). Next, let
for any \(\varepsilon > 0\) and \(T\geqslant 1\), where \(\tau _{j} = s_{j}^{m_j} \prod _{r=1}^{m_j-1} t_{j,r}^{m_j + r}\). In particular we have \(f_1(q)\leqslant f_{1,1}(q)\). We claim that
for any sufficiently small \(\varepsilon >0\). Once achieved, it will follow that
since (2.10) implies that \(\Gamma -1>1\).
To check the claim we let \(\mathscr {T}\) denote the set of vectors \((\tau _0,\dots ,\tau _n)\in \mathbb {N}^{n+1}\) with the property that for any prime p we have \(\mathrm{val}_p(\tau _j)\in \{0,m_j,\dots ,3m_j-1\}\) and, furthermore, if \(p\mid \tau _0\dots \tau _n\) then \(\mathrm{val}_p(\tau _j)>0\) for all \(0\leqslant j\leqslant n\). Associated to any \((\tau _0,\dots ,\tau _n)\in \mathscr {T}\) is a unique choice for \(\mathbf {s}, \mathbf {t}\). Thus we find that
When \(p\not \mid q\) the corresponding local factor takes the shape
Alternatively, when \(p\mid q\) the factor is \(O(p^{6\varepsilon (m_0+\dots +m_n)})\) Assuming that \(\varepsilon \) is sufficiently small this therefore concludes the proof of (3.11).
Finally we must analyse
We note that
Applying (3.8) to handle the resulting sum over \(\mathbf {s}\) and \(\mathbf {t}\) it easily follows that
for any \(\varepsilon >0\).
We substitute our bounds for the error terms back into (3.7). This yields
Remark 3.3
Let us rerun the above argument, with the special choice \((\mathbf {s},\mathbf {t}) = (\mathbf {1}, \mathbf {1})\), since it will be used again in the proof of Theorem 1.3. The starting point is to write
where now \(\varvec{\gamma }\) has components \( \gamma _{j} = \prod _{r=1}^{m_j - 1} {v}_{j,r}^{m_j + r}. \) Tracing through the argument, this ultimately leads to the conclusion
for any \(\varepsilon >0\), where
We are now ready to complete the proof of Theorem 1.1. Repeating the arguments used in (3.11) during our analysis of \(F_2(B)\), it is easy to remove the constraint \(s_{j}^{m_j} \prod _{r=1}^{m_j-1} t_{j,r}^{m_j + r}\leqslant B\) from the summation over \(\mathbf {s}, \mathbf {t}\) in the main term M(B). The total error in doing this is \(O(B^{\Gamma -\eta _1})\), for some \(\eta _1>0\). We proceed under the assumption that (2.1) holds and \(\delta \) satisfies (2.5). We may combine (3.1) and (3.5) with (3.12) in order to conclude that \(N(\mathbb {P}^{n-1},\Delta ;B)\) is
for any \(\varepsilon >0\), where
The error term is of the shape claimed in Theorem 1.1 and so it remains to analyse the quantity \(c_B\).
The dependence on B in the factor \(c_B\) arises from the definition of the sum \(\sum ^{(1)}\). A straightforward repetition of our arguments above suffice to show that
for some \(\eta _2>0\), where c is the constant that is defined as in \(c_B\), but with the summation conditions \(\gamma _j\leqslant B\) removed from \(\sum ^{(1)}\), for \(0\leqslant j\leqslant n\). This shows that \(N(\mathbb {P}^{n-1},\Delta ;B)= c B^{\Gamma }+O(B^{\Gamma -\eta })\) for an appropriate \(\eta >0\), as claimed in Theorem 1.1. To go further, we adopt the notation \(\mathbf {s}^{\mathbf {m}} \mathbf {w}= (s_0^{m_0} w_0, \dots , s_n^{m_n} w_n)\), where we recall that \(w_{j} = v_{j,1}^{m_j+1} \dots v_{j, m_j-1}^{2m_j - 1}\) for \(0\leqslant j\leqslant n\). Changing the order of summation, we may write
with the understanding that \(\mathbf {t}\mid \mathbf {v}\) means \(t_{j,r} \mid v_{j,r}\) for all j and r. We claim that
where
This will complete our analysis of the leading constant c in Theorem 1.1.
To check the claim we put \(c'_j = \epsilon _j c_j\) for \(0\leqslant j\leqslant n\). It follows from (2.11) and multiplicativity that
where
Letting
we deduce that
Next, we put
for any \((\mathbf {s}, \mathbf {t})\) such that \((\mathbf {s}, \mathbf {t}) = (\mathbf {s}^{[p]}, \mathbf {t}^{[p]})\). It is clear that \(N(p^T)=\#\mathscr {X}_{p, T}(\mathbf {s}, \mathbf {t}) \) when \(p\not \mid s_0\dots s_n\) and that \( N(p^t)/ \prod _{j=0}^n s_j^{[p]} = \#\mathscr {X}_{p, T}(\mathbf {s}, \mathbf {t}) \) when \(p\not \mid s_0\dots s_n\). It follows that
where
Using the fact that \(\mathbf {t}\mid \mathbf {v}\) if and only if \(\mathbf {t}^{[p]} \mid \mathbf {v}^{[p]}\) for all p, it follows from part (i) of Lemma 3.2 that
On the other hand, on appealing to the inclusion-exclusion principle and the definition of \(\varpi \), for any prime p we return to (3.16) and see that
Dividing by \(p^{nT}\) and taking the limit \(T\rightarrow \infty \), we are now easily led to the proof of the claim (3.15).
4 Thin sets: proof of Theorem 1.3
Let \(\Gamma = \sum _{j=0}^n \frac{1}{m_j} - 1\), as in (2.2). In this section we assume that (1.3) holds and we let \(\Omega \subset \mathbb {P}^{n-1}(\mathbb {Q})\) be a thin set. Theorem 1.3 is concerned with an upper bound for the quantity
under the conditions on \(\Omega \) that are stated in the theorem. Let us write \(N_{\Omega }(B)= N_{\Omega }(\mathbb {P}^{n-1},\Delta ;B)\) to ease notation. All of the implied constants in this section are allowed to depend on the thin set \(\Omega \).
We shall proceed by using information about the size of thin sets modulo p on a set of primes p of positive density. Our thin set \(\Omega \) is contained in a finite union \(\bigcup _{i=1}^t \Omega _i\) of thin subsets of type I and type II. We shall abuse notation and write \(\Omega _i(\mathbb {F}_p)\) for the image of \(\Omega _i\) in \(\mathbb {P}^{n-1}(\mathbb {F}_p)\) under reduction modulo p. Similarly, we shall write \({\widehat{\Omega }}_i(\mathbb {F}_p)\) for the set of \(\mathbb {F}_p\)-points on the affine cone over this set of points.
Let \(\Omega _i \subset \mathbb {P}^{n-1}(\mathbb {Q})\) be a thin subset of type I. Then it follows from the Lang-Weil estimates [9] that there exits \(C_1 > 0\) such that
for every sufficiently large prime p. If \(\Omega _i \subset \mathbb {P}^{n-1}(\mathbb {Q})\) is a thin subset of type II, then according to Serre [13, Thm. 3.6.2] there exists a constant \(\kappa \in (0,1)\) such that
for every sufficiently large prime \(p\in {\mathsf {P}}_{\Omega _i}\), in the notation introduced before the statement of Theorem 1.3.
We take advantage of this information by noticing that
for any finite subset of primes \(\mathscr {S}_i\). We stipulate that \(\min _{p \in \mathscr {S}_i} p\) is greater than some absolute constant depending only on \(\prod _{i=0}^n |c_i| m_i\) and the thin subset \(\Omega _i\). Let
and put
Given \(\mathbf {b}' = (b_0, \ldots , b_{n-1})\) we let
and we put \(\mathbf {b} = (b_0, \ldots , b_n)\). Appealing to (1.7) and putting \(c_n=-1\), we deduce that \(N_{\Omega }(B) \) is
where \(w_j=\prod _{r=1}^{m_j-1} {v}_{j,r}^{m_j+r}\) and \( \sum ^{(1')}_{\mathbf {v}} \) denotes a sum over \(\mathbf {v}=(v_0,\dots ,v_n)\in \mathbb {N}^{n+1}\) satisfying \(w_{j} \leqslant B\) and the coprimality conditions
(This should not be confused with the notation \(\sum ^{(1)}_{\mathbf {v}}\) in Sect. 3, in which the condition \(\gcd (w_0, \ldots , w_n) = 1 \) does not appear.)
Let us define
where
In view of the coprimality conditions we are only interested in \(\mathbf {b}\not \equiv \mathbf {0} \bmod {p}\) for all \(p \in \mathscr {S}_i\). Thus, for each \(p \in \mathscr {S}_i\) and \(\mathbf {h}\in \Omega _{\mathbf {w}; p}^{(i)}\) we have
With this notation, we may write
Note that \(\#\Omega _{\mathbf {w};H_i}^{(i)}\leqslant H_i^{n+1}\). We now seek to apply Theorem 2.7 to the inner sum, much as in (3.13). Let \(\eta >0\) be sufficiently small and assume that \(\delta \) is chosen so that \((2n+5)\delta =\frac{1}{m_*(m_*+1)}-3\eta \), where we have found it convenient to set \(m_* = \max _{0 \leqslant j \leqslant n} m_j\). This is plainly satisfactory for (2.5). We take \(\varepsilon =\eta \) in the statement of Theorem 2.7 and we assume that \(H_i\) satisfies
where the second equality is true provided that \(\eta \) is small enough in terms of \(m_*\) and n. Under this assumption it can be verified that the overall contribution from the error term in Theorem 2.7 is \(O(B^{\Gamma - \eta })\). It follows that
since \(\mathfrak {J}_{\varvec{\epsilon } \mathbf {c} }\ll 1\).
Before proceeding with an analysis of the singular series, we first record some estimates for the size of \(\Omega _{\mathbf {w};p}^{(i)}\), for any \(i\in \{1,\dots ,t\}\).
Lemma 4.1
We have \(\#\Omega _{\mathbf {w}; p}^{(i)}\leqslant m_* p^n\) for any \(p\in \mathscr {S}_i\).
Proof
Suppose without loss of generality that \(p\not \mid w_0\) and let \(h_1,\dots ,h_n\in \mathbb {F}_p\) be such that \(h_j^{m_j}w_j\equiv b_j \bmod {p}\) for \(1\leqslant j\leqslant n\), for some \(\mathbf {b}'\), where \(b_0=c_0^{-1}(b_n-c_1b_1-\dots -c_{n-1}b_{n-1})\). Then there are at most \(m_0\leqslant m_*\) choices for \(h_0\). This confirms the lemma. \(\square \)
Lemma 4.2
Assume that \(p\in \mathscr {S}_i\) and \(p\not \mid w_j \) for \(0 \leqslant j \leqslant n\). Then we have
where \({\mathsf {Q}}_{\mathbf {m}}\) is defined in (1.4).
Proof
Let \( z \in \Omega _i(\mathbb {F}_p)\). Either there are no points in \(\Omega _{\mathbf {w};p}^{(i)}\) corresponding to z, or else we may assume that there exists \(\mathbf {h}\in \mathbb {F}_p^{n+1}\setminus \{\mathbf {0}\}\) such that
for \(0 \leqslant j \leqslant n\), for some \((b_0, \ldots , b_{n}) \in \mathbb {F}_p^{n+1}\) such that \((b_0: \cdots : b_{n-1}) = z\), in which \(b_n\) satisfies (4.3). Then the number of points in \(\Omega _{\mathbf {w};p}^{(i)}\) associated to z is at most the number of \(a \in \mathbb {F}_p^{*}\) and \(\mathbf {k}\in \mathbb {F}_p^{n+1}\) such that \(a b_j \equiv k_j^{m_j} w_j \bmod {p}\) for \(0 \leqslant j \leqslant n\). For fixed \(a\in \mathbb {F}_p^*\), since \(w_j \not \equiv 0 \bmod {p}\) for \(0 \leqslant j \leqslant n\), it follows that the number of \(\mathbf {k}\) is precisely the number of solutions to the set of congruences
for \(0 \leqslant j \leqslant n\).
If \(b_j=0\) then it forces \(h_j= 0\), and so \(k_j=0\). Suppose without loss of generality that \(b_j\ne 0\) for \(0 \leqslant j \leqslant R\) and \(b_{R+1}=\dots =b_n=0\). Let us fix a choice of a primitive element \(g \in \mathbb {F}_p^{*}\) and put \(a = g^u\), where \(1 \leqslant u \leqslant p-1\). Then
In this way we see that
The factor in front of \((p-1)\) is 1 when \(p\in {\mathsf {Q}}_{\mathbf {m}}\) and at most \(m_*^{n}\) in general. The statement of the lemma now follows. \(\square \)
We are now ready to analyse the singular series in (4.7). Let us put \(c'_j = \varepsilon _j c_j\) for indices \(0 \leqslant j \leqslant n\). We recall from (2.11) that
Put
so that
If \(p \not \mid H_i\) then
It now follows from (2.8) that
for any \(\varepsilon >0\). Moreover, in the usual way, for any prime p we have
where
In order to deal with primes \(p \mid H_i\), we require the following simple form of Hensel’s lemma.
Lemma 4.3
Let \(m, y, T \in \mathbb {N}\) and let \(A, B \in \mathbb {Z}\). Assume that p is prime such that \(p \not \mid A m y\) and \(A y^m + B \equiv 0 \bmod p\). Then
Let \(p\mid H_i\). Then \(H_i=pH_i'\) for some \(H_i'\in \mathbb {N}\) that is coprime to p. It readily follows that
If \(\mathbf {h} \bmod p\) is a solution to the congruence
then necessarily it is a non-singular solution by (4.5), since each prime \(p\mid H_i\) is large enough that \(p\not \mid \prod _j c_j'm_j\). Hence for \(T > 1\) it follows from Lemma 4.3 that \( N(p^T)= p^{n+1} p^{n (T - 1)}=p^{nT+1}. \) Bringing this together with (4.8) and (4.9) we conclude that
Inserting this into (4.7), our work so far has shown that
for any \(\eta >0\), where
Let \(1\leqslant i\leqslant t\) and recall that \(H_i=\prod _{p\in \mathscr {S}_i}p\). Appealing to (4.1), (4.2), together with Lemmas 4.1 and 4.2, we deduce that
for some \(\kappa \in (0,1).\)
Suppose first that \(\Omega _i\) is type I and let \(\omega (H_i)=\#\mathscr {S}_i\). Then
for any \(\varepsilon >0\). But then it follows that
Observe that \(\mathrm{val}_p(w_j) \geqslant m_j + 1\) whenever \(p \mid w_j\). In this way we can see that
for every prime p such that \(p\mid H_i\) and \(p \mid w_0 \dots w_n\). Thus, on removing common factors of \(w_j\) with \(H_i\), one easily concludes that
where \( g(H_i) =\prod _{p \mid H_i} ( 1 + O(p^{-1/m_*}) ). \) It is an elementary exercise to show that
for an absolute constant \(C_2>0\). Taking \(g(H_i)\ll H_i^{\varepsilon }\) for any \(\varepsilon >0\), it now follows from (2.10) and (3.10) that \( U(B,H_i)\ll _\varepsilon H_i^{-1+ 2\varepsilon }. \) Once inserted into (4.10) and choosing \(\mathscr {S}_i\) in such a way that \(H_i\) is a small enough power of B for (4.6), this shows that thin subsets of type I make a satisfactory overall contribution.
Suppose next that \(\Omega _i\) is type II. We may assume that \(p\geqslant m_*/\kappa \) for each \(p\mid H_i\). Then
We choose \(\mathscr {S}_i\) to be set of primes \(1\ll p\leqslant \log B/\log \log B\) drawn from the set \({\mathsf {P}}_{\Omega _i}\cap {\mathsf {Q}}_{\mathbf {m}}\). In particular \(H_i\) satisfies (4.6). Moreover, it follows from our assumption (1.5) that this set of primes has positive lower density \(\varrho \), say. But then
Feeding this into the argument that we have just given yields
from which it follows that the thin subsets of type II make a satisfactory overall contribution to (4.10) under the assumption (1.5). This completes the proof of Theorem 1.3.
References
Bourgain, J., Demeter, C., Guth, L.: Proof of the main conjecture in Vinogradov’s mean value theorem for degrees higher than three. Ann. Math. 184, 633–682 (2016)
Browning, T.D., Van Valckenborgh, K.: Sums of three squareful numbers. Experiment. Math. 21, 204–211 (2012)
Brüdern, J.: Sums of squares and higher powers. J. Lond. Math. Soc. 35, 233–243 (1987)
Campana, F.: Fibres multiples sur les surfaces: aspects geométriques, hyperboliques et arithmétiques. Manuscr. Math. 117, 429–461 (2005)
Cohen, S.D.: The distribution of Galois groups and Hilbert’s irreducibility theorem. Proc. Lond. Math. Soc. 3, 227–250 (1981)
Davenport, H.: Analytic Methods for Diophantine Equations and Diophantine Inequalities, 2nd edn. Cambridge University Press, Cambridge (2005)
Franke, J., Manin, Y.I., Tschinkel, Y.: Rational points of bounded height on Fano varieties. Invent. Math. 95, 421–435 (1989)
Heath-Brown, D.R.: A new form of the circle method and its application to quadratic forms. J. Reine Angew. Math. 481, 149–206 (1996)
Lang, S., Weil, A.: Number of points of varieties in finite fields. Am. J. Math. 76, 819–827 (1954)
Pieropan, M., Schindler, D.: Hyperbola method on toric varieties (2020). arXiv:2001.09815
Pieropan, M., Smeets, A., Tanimoto, S., Várilly-Alvarado, A.: Campana points of bounded height on vector group compactifications. Proc. Lond. Math. Soc
Serre, J.-P.: Lectures on the Mordell–Weil Theorem. Aspects of Mathematics, 3rd edn. Friedr. Vieweg & Sohn, Braunschweig (1997)
Serre, J.-P.: Topics in Galois theory. 2nd edition. Research Notes in Mathematics 1, A. K. Peters, Ltd., Wellesley, MA (2008)
Van Valckenborgh, K.: Squareful numbers in hyperplanes. Algebra Number Theory 6, 1019–1041 (2012)
Vaughan, R. C.: The Hardy–Littlewood Method, 2nd ed., Cambridge Tracts in Math. 125, Cambridge Univ. Press, Cambridge (1997)
Wooley, T.D.: Vinogradov’s mean value theorem via efficient congruencing. Ann. Math. 175, 1575–1627 (2012)
Wooley, T.D.: The asymptotic formula in Waring’s problem. Int. Math. Res. Not. 2012(7), 1485–1504 (2012)
Wooley, T.D.: The cubic case of the main conjecture in Vinogradov’s mean value theorem. Adv. Math. 294, 532–561 (2016)
Wooley, T.D.: Nested efficient congruencing and relatives of Vinogradov’s mean value theorem. Proc. Lond. Math. Soc. 118, 942–1016 (2019)
Acknowledgements
While working on this paper the authors were both supported by EPSRC grant EP/P026710/1, and the second author received additional support from the NWO Veni Grant 016.Veni.192.047. Thanks are due to Marta Pieropan, Arne Smeets and Sho Tanimoto for useful conversations related to this topic, and to the anonymous referee for numerous helpful suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Browning, T., Yamagishi, S. Arithmetic of higher-dimensional orbifolds and a mixed Waring problem. Math. Z. 299, 1071–1101 (2021). https://doi.org/10.1007/s00209-021-02695-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00209-021-02695-w