Abstract
This paper is motivated by two problems in the theory of Diophantine approximation, namely, Davenport’s problem regarding badly approximable points on submanifolds of a Euclidean space and Schmidt’s problem regarding the intersections of the sets of weighted badly approximable points. The problems have been recently settled in dimension two but remain open in higher dimensions. In this paper we develop new techniques that allow us to tackle them in full generality. The techniques rest on lattice points counting and a powerful quantitative result of Bernik, Kleinbock and Margulis. The main theorem of this paper implies that any finite intersection of the sets of weighted badly approximable points on any analytic nondegenerate submanifold of \(\mathbb {R}^n\) has full dimension. One of the consequences of this result is the existence of transcendental real numbers badly approximable by algebraic numbers of any bounded degree.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The notion of badly approximable numbers, as much of the classical and modern theory of Diophantine approximation, is underpinned by Dirichlet’s fundamental result. It states that for every \(\alpha \in \mathbb {R}\) and any \(Q>1\) there exists \(q\in \mathbb {N}\) and \(p\in \mathbb {Z}\) such that \(|q\alpha -p|<Q^{-1}\) and \(q\le Q\). In particular, it implies that for every real irrational number \(\alpha \) the inequality
holds for infinitely many rational numbers \(p/q\) written as reduced fractions of integers \(p\) and \(q\). A real number \(\alpha \) is then called badly approximable if there exists a constant \(c=c(\alpha )>0\) such that
for all \((q,p)\in \mathbb {N}\times \mathbb {Z}\). In what follows, the set of badly approximable real numbers will be denoted by \(\mathbf {Bad}\).
It is well known that a real irrational number \(\alpha \) is badly approximable if and only if the partial quotients of its continued fraction expansion are uniformly bounded. For instance, any real quadratic irrational number is in \(\mathbf {Bad}\), since its continued fraction expansion is eventually periodic.Footnote 1 Using continued fractions one can easily produce continuum many examples of badly approximable real numbers. Beyond the cardinality, Jarník [27] established that \( \dim \mathbf {Bad}\) (the Hausdorff dimension of \(\mathbf {Bad}\)) is \(1\). However, the Lebesgue measure of \(\mathbf {Bad}\) is known to be zero. This is a trivial consequence of the divergence case of Khintchine’s theorem [37], and can also be relatively easily proved using the Lebesgue density theorem, see [18] or [15, Corollary 2].
1.1 Higher dimensions: Schmidt’s conjecture
Higher dimensions offer various ways of generalising the notion of badly approximable numbers. For now, we restrict ourselves to considering simultaneous Diophantine approximations by rationals. The point \(\mathbf{{y}}=(y_1,\dots ,y_n)\in \mathbb {R}^n\) is called badly approximable if there exists a constant \(c=c(\mathbf{{y}})>0\) such that
for all \(q\in \mathbb {N}\), where \(\Vert x\Vert \) denotes the distance of \(x\) from the nearest integer. The quantities \(\Vert qy_i\Vert \) are equal to \(|qy_i-p_i|\) for some \(p_i\in \mathbb {Z}\) and thus give rise to ‘approximating’ rationals \(p_1/q,\dots ,p_n/q\). Once again, the notion of badly approximable points is underpinned by Dirichlet’s theorem, this time for \(\mathbb {R}^n\), which implies that the inequality \(\max _{1\le i\le n}\Vert qy_i\Vert <q^{-1/n}\) holds for infinitely many \(q\in \mathbb {N}\). The set of badly approximable points in \(\mathbb {R}^n\) will be denoted by \(\mathbf {Bad}(n)\). Observe that \(\mathbf {Bad}(1)=\mathbf {Bad}\).
The first examples of badly approximable points in \(\mathbb {R}^n\) were given by Perron [40] who used an algebraic construction and produced infinitely yet countably many elements of \(\mathbf {Bad}(n)\). For instance, \((\alpha ,\dots ,\alpha ^n)\in \mathbf {Bad}(n)\) whenever \(\alpha \) is a real algebraic number of degree \(n+1\). However, it was not until 1954 when first Davenport [21] for \(n=2\) and then Cassels [19] for \(n\ge 2\) showed that \(\mathbf {Bad}(n)\) was uncountable. The fact that \(\mathbf {Bad}(n)\) has full Hausdorff dimension was proved by Schmidt [43] who introduced powerful ideas based on a specific type of games. The dimension result for \(\mathbf {Bad}(n)\) comes about as a consequence of the fact that \(\mathbf {Bad}(n)\) is winning for Schmidt’s game. Furthermore, Schmidt proved that affine transformations of \(\mathbf {Bad}(n)\) are winning and that the collection of winning sets in \(\mathbb {R}^n\) is closed under countable intersections.
In his 1983 paper [46] Schmidt formulated a conjecture that later became the catalysis for some remarkable developments. Schmidt’s conjecture rests on the modified notion of badly approximable points in which approximations in each coordinate are given some weights, say \(r_1,\dots ,r_n\). In short, he conjectured that there exist points in \(\mathbb {R}^2\) that are simultaneously badly approximable with respect to two different collections of weights. The weights of approximation are required to satisfy the following conditions:
Throughout this paper the set of all \(n\)-tuples \(\mathbf{{r}}=(r_1,\dots ,r_n)\) subject to (3) will be denoted by \(\mathcal {R}_n\). Formally, given \(\mathbf{{r}}\in \mathcal {R}_n\), the point \(\mathbf{{y}}=(y_1,\dots ,y_n)\in \mathbb {R}^n\) will be called \(\mathbf{{r}}\)-badly approximable if there exists \(c=c(\mathbf{{y}})>0\) such that
for all \(q\in \mathbb {N}\). Here, by definition, \(\Vert qy_i\Vert ^{1/0}=0\). Again, a version of Dirichlet’s theorem tells us that when \(c=1\) inequality (4) fails infinitely often.
The set of \(\mathbf{{r}}\)-badly approximable points in \(\mathbb {R}^n\) will be denoted by \(\mathbf {Bad}(\mathbf{{r}})\). As is readily seen, the classical set of badly approximable points \(\mathbf {Bad}(n)\) is simply \(\mathbf {Bad}(\tfrac{1}{n},\dots ,\tfrac{1}{n})\). Using this notation we can now specify the following concrete statement conjectured by Schmidt:
It is worth mentioning that the sets \(\mathbf {Bad}(\mathbf{{r}})\) have been studied at length in all dimensions and for arbitrary collections of weights, see [22, 34–36, 41]. Partly the interest was fueled by natural links with homogeneous dynamics and Littlewood’s conjecture in multiplicative Diophantine approximation, another long standing problem—see [13] for further details. Schmidt’s conjecture withstood attacks for nearly 30 years. However, the recent progress has been dramatic.
In 2011 Badziahin et al. [13] made a breakthrough by proving that for any sequence \(\mathbf{{r}}_k=(i_k,j_k)\in \mathcal {R}_2\) such that
and any vertical line \(L_\theta =\{(\theta ,y):y\in \mathbb {R}\}\subset \mathbb {R}^2\) with \(\theta \in \mathbf {Bad}\) one has that
This readily gives that \(\dim \bigcap _k\mathbf {Bad}(\mathbf{{r}}_k)=2\) and proves Schmidt’s conjecture in a much stronger sense. Shortly thereafter, An [1] proves that for any \(\mathbf{{r}}\in \mathcal {R}_2\) and any \(\theta \in \mathbf {Bad}\) the set \(\mathbf {Bad}(\mathbf{{r}})\cap L_\theta \) is winning for a Schmidt game in \(L_\theta \). This immediately leads him to removing condition (5) from the theorem of Badziahin et al., since the collection of Schmidt’s winning sets is closed under arbitrary countable intersections. In a related paper An [2] establishes that \(\mathbf {Bad}(i,j)\) is winning for the \(2\)-dimensional Schmidt game, thus giving another proof of Schmidt’s conjecture. Generalising the techniques of [13] in yet another direction Nesharim [39], independently from An, proves that the set in the left hand side of (6) intersected with naturally occurring fractals embedded in \(L_\theta \) is uncountable for any sequence \((\mathbf{{r}}_k)_{k\in \mathbb {N}}\). Subsequently, Nesharim jointly with Weiss establishes the winning property of these intersections—see Appendix B in [39].
As already mentioned, the sets \(\mathbf {Bad}(\mathbf{{r}})\) and even their restrictions to naturally occurring fractals have been investigated in higher dimensions, see [26, 34–36]. In particular, the sets \(\mathbf {Bad}(\mathbf{{r}})\) were shown to have full Hausdorff dimension for any \(\mathbf{{r}}\in \mathcal {R}_n\). However, the theory of their mutual intersections is a different story. In an apparent attempt to prove Schmidt’s conjecture, Kleinbock and Weiss [36] introduced a modified version of Schmidt’s games. As they have shown, winning sets for the same modified Schmidt game inherit the properties of classical winning sets. Namely, they have full Hausdorff dimension and their countable intersections are winning with respect to the same game. Also Kleinbock and Weiss have proved that \(\mathbf {Bad}(\mathbf{{r}})\) is winning for a relevant modified Schmidt game. However, it was not possible to prove that the intersection \(\mathbf {Bad}(\mathbf{{r}}_1)\cap \mathbf {Bad}(\mathbf{{r}}_2)\) was a winning set for some modified Schmidt game as, with very few exceptions, the corresponding modified Schmidt games were not ‘compatible’. As a result the following key problem that generalises Schmidt’s original conjecture has remained open in dimensions \(n\ge 3\):
Problem 1
Let \(n\in \mathbb {N}\). Prove that for any finite or countable subset \(W\) of \(\mathcal {R}_n\) one has that
The main result of this paper implies (7) in arbitrary dimensions \(n\) and for arbitrary countable subsets \(W\) of weights satisfying a condition similar to (5). For instance, the result is applicable to arbitrary finite collections of weights \(W\). The proof will be given by restricting the sets of interest to a suitable family of curves in \(\mathbb {R}^n\). Interestingly, this approach, which was innovated in [13] in the case \(n=2\), turns out to face another intricate problem that was first communicated by Davenport.
1.2 \(\mathbf {Bad}(\mathbf{{r}})\) on manifolds and Davenport’s problem
In 1964 Davenport [22] established that, given a finite collection \(\mathbf{{f}}_i:\mathbb {R}^m\rightarrow \mathbb {R}^{n_i}\) \((1\le i\le N)\) of \(C^1\) maps, if for some \(\mathbf{{x}}_0\in \mathbb {R}^m\) and every \(i=1,\dots ,N\) the Jacobian of \(\mathbf{{f}}_i\) at \(\mathbf{{x}}_0\) has rank \(n_i\), then the set
has the power of continuum. For instance, taking \(f_1(x,y)=x\), \(f_2(x,y)=y\) and \(\mathbf{{f}}_3(x,y)=(x,y)\) shows that \(\mathbf {Bad}(1,0)\cap \mathbf {Bad}\left( \tfrac{1}{2},\tfrac{1}{2}\right) \cap \mathbf {Bad}(0,1)\) has the power of continuum. Another natural example obtained by taking \(f_i(x)=x^i\) for \(i=1,\dots ,k\) shows that there are continuum many \(\alpha \in \mathbb {R}\) such that \(\alpha ,\alpha ^2,\dots ,\alpha ^k\) are all in \(\mathbf {Bad}\).
Clearly, the Jacobian condition above implies that \(m\ge n_i\) for every \(i\). Commenting on this, Davenport writes [22, p. 52] “Problems of a much more difficult character arise when the number of independent parameters is less than the dimension of simultaneous approximation. I do not know whether there is a set of \(\alpha \) with the cardinal of the continuum such that the pair \((\alpha ,\alpha ^2)\) is badly approximable for simultaneous approximation”. Essentially, if \(m<n_i\) then \(\mathbf{{f}}_i(\mathbf{{x}})\) lies on a submanifold of \(\mathbb {R}^{n_i}\). Hence, Davenport’s problem boils down to investigating badly approximable points restricted to submanifolds of Euclidean spaces.
In the theory of Diophantine approximation on manifolds, see for instance, [7–9, 31, 33], there are already well established classes of manifolds of interest. These include non-degenerate manifolds and affine subspaces and should likely be of primary interest when resolving Davenport’s problem.
It is worth pointing out that the result of Perron [40] mentioned in § 1.1 implies the existence of algebraic badly approximable points on the Veronese curves \(\mathcal {V}_n=\{(x,\dots ,x^n):x\in \mathbb {R}\}\). However, there are only countably many of them. Khintchine [28] proved that \(\mathbf {Bad}(n)\cap \mathcal {V}_n\) had zero 1-dimensional Lebesgue measure. Baker [4] generalised this to arbitrary \(C^1\) submanifold of \(\mathbb {R}^n\). Apparently, \(\mathbf {Bad}(n)\) can be relatively easily replaced with \(\mathbf {Bad}(\mathbf{{r}})\) in Baker’s result, though, to the best of author’s knowledge, this has never been formally addressed. To make a long story short, until recently there has been no success in relation to Davenport’s problem even for planar curves, let alone manifolds in higher dimension. The aforementioned work of Badziahin et al. [13] was the first step forward. Very recently, assuming (5), Badziahin and Velani [17] have proved (6) with \(L_\theta \) replaced by any \(C^2\) planar curve which is not a straight line. In particular, this shows that there exist uncountably many real numbers \(\alpha \) such that \((\alpha ,\alpha ^2)\) is in \(\mathbf {Bad}(2)\). Also they have dealt with a family of lines in \(\mathbb {R}^2\) satisfying a natural Diophantine condition. The most recent results established in [3] by An, Velani and the author of this paper remove condition (5) from the findings of [17] and at the same time settle Davenport’s problem for a larger class of lines in \(\mathbb {R}^2\) defined by a near optimal condition. As a result, the following general version of Davenport’s problem is essentially settled in the case \(n=2\):
Problem 2
Let \(n,m\in \mathbb {N}\), \(B\) be a ball in \(\mathbb {R}^m\), \(W\) be a finite or countable subset of \(\mathcal {R}_n\) and \(\mathcal {F}_n(B)\) be a finite or countable collection of maps \(\mathbf{{f}}:B\rightarrow \mathbb {R}^n\). Determine sufficient (and possibly necessary) conditions on \(W\) and/or \(\mathcal {F}_n(B)\) so that
Despite the success in resolving Problem 2 for planar curves, no progress has been made on Davenport’s problem for \(n\ge 3\). The results of this paper imply (8) in arbitrary dimensions \(n\) and for arbitrary countable subsets \(W\) of weights satisfying a condition similar to (5) and arbitrary finite collection \(\mathcal {F}_n(B)\) of analytic non-degenerate maps. The proof introduces new ideas based on lattice points counting and a powerful quantitative result of Bernik, Kleinbock and Margulis. Indeed, the arguments presented should be of independent interest even for \(n=2\).
2 Main results and corollaries
In what follows, an analytic map \(\mathbf{{f}}:B\rightarrow \mathbb {R}^n\) defined on a ball \(B\subset \mathbb {R}^m\) will be called nondegenerate if the functions \(1,f_1,\dots ,f_n\) are linearly independent over \(\mathbb {R}\). The more general notion of nondegeneracy that does not require analyticity can be found in [33]. Given an integer \(n\ge 2\), \(\mathcal {F}_n(B)\) will denote a family of maps \(\mathbf{{f}}:B\rightarrow \mathbb {R}^n\) with a common domain \(B\). To avoid ambiguity, let us agree from the beginning that all the intervals and balls mentioned in this paper are of positive and finite diameter. Recall that \(\mathcal {R}_n\) denotes the collection of weights of approximation and is defined by (3). Given \(\mathbf{{r}}=(r_1,\dots ,r_n)\in \mathcal {R}_n\), let
that is \(\tau (\mathbf{{r}})\) is the smallest strictly positive weight within \(\mathbf{{r}}\). The following result regarding Problem 2 represents the main finding of this paper.
Theorem 1
Let \(m,n\in \mathbb {N}\), \(1\le m\le n\), \(B\) be an open ball in \(\mathbb {R}^m\) and \(\mathcal {F}_n(B)\) be a finite family of analytic nondegenerate maps. Let \(W\) be a finite or countable subset of \(\mathcal {R}_n\) such that
Then (8) is satisfied.
Condition (10) matches (5) and is satisfied whenever \(W\) is finite. Now we consider the following basic corollary regarding badly approximable points on manifolds.
Corollary 1
Let \(\mathcal {M}\) be a manifold immersed into \(\mathbb {R}^n\) by an analytic nondegenerate map. Let \(W\subset \mathcal {R}_n\) be a finite or countable set of weights. Assume that (10) is satisfied. Then \( \textstyle \dim \bigcap _{\mathbf{{r}}\in W}\mathbf {Bad}(\mathbf{{r}})\cap \mathcal {M}=\dim \mathcal {M}\,. \) In particular, for any finite collection \(\mathbf{{r}}_1,\dots ,\mathbf{{r}}_N\in \mathcal {R}_n\) we have that
Note that the corollary is applicable to \(\mathcal {M}=\mathbb {R}^n\), which is clearly analytic and nondegenerate. In this case Corollary 1 establishes an analogue of Schmidt’s conjecture in arbitrary dimensions \(n\ge 2\) by settling Problem 1 subject to condition (10).
2.1 Reduction to curves
When \(m=1\) the nondegeneracy of an analytic map \(\mathbf{{f}}=(f_1,\dots ,f_n)\) is equivalent to the Wronskian of \(f'_1,\dots ,f'_n\) being not identically zero. More generally, the map \(\mathbf{{f}}\) (not necessarily analytic) defined on an interval \(I\subset \mathbb {R}\) will be called nondegenerate at \(x_0\in I\) if \(\mathbf{{f}}\) is \(C^n\) on a neighborhood of \(x_0\) and the Wronskian of \(f'_1,\dots ,f'_n\) does not vanish at \(x_0\). This definition of nondegeneracy at a single point is adopted within the following more general result for curves. Note that if \(\mathbf{{f}}\) is nondegenerate at least at one point, then the functions \(1,f_1,\dots ,f_n\) are linearly independent over \(\mathbb {R}\).
Theorem 2
Let \(n\in \mathbb {N}\), \(n\ge 2\), \(I\subset \mathbb {R}\) be an open interval and \(\mathcal {F}_n(I)\) be a finite family of maps defined on \(I\) nondegenerate at the same point \(x_0\in I\). Let \(W\) be a finite or countable subset of \(\mathcal {R}_n\) satisfying (10). Then
Our immediate goal is to show that Theorem 1 is a consequence of Theorem 2. In metric Diophantine approximation the idea of reducing the case of manifolds to curves is not new. For instance, Badziahin et al. [13] use fibering of \(\mathbb {R}^2\) into vertical lines in their proof of Schmidt’s conjecture. Underpinning our reduction of Theorem 1 to Theorem 2 is the following version of Marstrand’s slicing lemma, see [24, Corollary 7.12] or [25, Theorem 10.11].
Marstrand’s slicing Lemma Let \(m>1\) and \(S\) be a subset of \(\mathbb {R}^m\). Let \(s>0\) and let \(U\) be a subset of \(\mathbb {R}^{m-1}\) such that \(\dim \{(t,u_2,\dots ,u_m)\in S\}\ge s\) for each \((u_2,\dots ,u_m)\in U\). Then
We will also need the following formal statement which is a slightly modified extract from Sprindžuk’s survey [48, pp. 9–10].
The Fibering Lemma Let \(f_0,\ldots ,f_n\) be analytic functions in \(m\) real variables defined on an open neighborhood of \(\mathbf{{0}}\). Assume that \(f_0,\ldots ,f_n\) are linearly independent over \(\mathbb {R}\). Then there is a sufficiently large integer \(d_0>1\) such that for every \(d> d_0\) and every \(\mathbf{{u}}=(u_1,u_2,\ldots ,u_m)\in \mathbb {R}^{m}\) with \(u_1\ldots u_m\ne 0\) the following functions of one real variable
given by
where \(E_{\mathbf{{u}}}\subset \mathbb {R}\) is a neighbourhood of \(0\), are linearly independent over \(\mathbb {R}\).
Although the proof of the Fibering Lemma mostly follows the argument of [48, pp. 9–10], for completeness full details are given in Appendix C. Note that Sprindžuk’s version of fibering involves the parametrisation \(\widetilde{\phi }_{\mathbf{{u}},i}(t)=f_i(u_1t,u_2t^{d},\ldots ,u_mt^{d^{m-1}})\).
Proof of Theorem 1 modulo Theorem 2
Let \(\mathcal {F}_n(B)\) be as in Theorem 1 and let \(\mathbf{{f}}=(f_1,\ldots ,f_n)\in \mathcal {F}_n(B)\). Without loss of generality we will assume that \(B\) is centred at \(\mathbf{{0}}\). Also assume that \(m\ge 2\) as otherwise there is nothing to prove. Let \(u_1=1\), \(t_0>0\) and \(\delta _2, \ldots , \delta _m>0\) be sufficiently small numbers such that
whenever
The existence of \(t_0,\delta _2,\ldots ,\delta _m\) is guaranteed by the fact that \(\mathbf{{0}}\) is an interior point of \(B\). Let \(U\) be the set of \(\mathbf{{u}}=(u_2,\ldots ,u_m)\) satisfying the right hand side inequalities of (12) and \(D\) be the set of \((t,u_2,\ldots ,u_m)\) satisfying (12).
By the nondegeneracy of \(\mathbf{{f}}\), the functions \(1,f_1,\dots ,f_n\) are linearly independent. Since they are also analytic, by the Fibering Lemma, there exists \(d_0(\mathbf{{f}})>0\) such that for every \(d>d_0(\mathbf{{f}})\) and every \(\mathbf{{u}}\in U\) the coordinate functions of the map
defined on the interval \(I=\left( \tfrac{1}{2}t_0,t_0\right) \) together with \(1\) are linearly independent over \(\mathbb {R}\). Since \(\mathcal {F}_n(B)\) is finite,
is well defined. Let \(d>d_0\). Then for every \(\mathbf{{f}}\in \mathcal {F}_n(B)\) and every \(\mathbf{{u}}\in U\) the coordinate functions of the map (13) together with \(1\) are linearly independent over \(\mathbb {R}\). By the well known criterion of linear independence, their Wronskian is not identically zero. Hence, the Wronskian of \(\mathbf{{f}}'_{\mathbf{{u}}}= \frac{d}{dt}\mathbf{{f}}_{\mathbf{{u}}}\) is not identically zero. As an analytic function, it has isolated zeros. Hence, for a fixed \(\mathbf{{u}}\), there are at most countably many points in \(I\) where the Wronskian of \(\mathbf{{f}}'_{\mathbf{{u}}}\) vanishes for some \(\mathbf{{f}}\in \mathcal {F}_n(B)\). Hence, there exists a point \(x_0\in I\), which may depend on \(\mathbf{{u}}\), such that for every \(\mathbf{{f}}\in \mathcal {F}_n(B)\) the Wronskian of \(\mathbf{{f}}'_{\mathbf{{u}}}\) is not zero, that is \(\mathbf{{f}}_{\mathbf{{u}}}\) is non-degenerate at \(x_0\). Thus, Theorem 2 is applicable and we conclude that the following subset of \(I\)
has Hausdorff dimension \(1\). Here, by definition, \(\mathbf{{f}}_{\mathbf{{u}}}^{-1}(\mathbf {Bad}(\mathbf{{r}}))\) is the set of \(t\in I=\left( \tfrac{1}{2}t_0,t_0\right) \) such that \(\mathbf{{f}}_{\mathbf{{u}}}(t)\in \ \mathbf {Bad}(\mathbf{{r}})\). Then, by Marstrand’s slicing lemma, the set
has Hausdorff dimension \(\ge \dim U+1=m\). Let \(S'\subset B\) be the image of \(S\) under the map
Then, in view of the definitions of \(S\), \(S_{\mathbf{{u}}}\) and \(\mathbf{{f}}_{\mathbf{{u}}}\), we have that
Further, note that (14) maps \(D\) into \(B\) injectively and is bi-Lipschitz on \(D\), since the map itself and its inverse (defined on the image of \(D\)) have continuous bounded derivatives. It is well known that bi-Lipschitz maps preserves Hausdorff dimension, see for example [24, Corollary 2.4]. Therefore, \(\dim S'=\dim S\ge m\). By (15), and the fact that any subset of \(\mathbb {R}^m\) is of dimension \(\le m\), we obtain (8) and thus complete the proof of Theorem 1 modulo Theorem 2.\(\square \)
2.2 The dual form of approximation
So far we have been dealing with simultaneous rational approximations. Here we introduce the dual definition of badly approximable points—see part (iii) of Lemma 1 below. This has two purposes. Firstly, it is the dual form that will be used in the proof of the results. Secondly, the dual form provides a natural environment for considering Diophantine approximation by algebraic numbers and will allow us to deduce further corollaries of our main results.
Lemma 1
(Equivalent definitions of \(\mathbf {Bad}(\mathbf{{r}})\) ) Let \(\mathbf{{r}}=(r_1,\ldots ,r_n)\in \mathcal {R}_n\) and \(\mathbf{{y}}=(y_1,\ldots ,y_n)\in \mathbb {R}^n\). Then the following three statements are equivalent:
-
(i)
\(\mathbf{{y}}\in \mathbf {Bad}(\mathbf{{r}}).\)
-
(ii)
There exists \(c>0\) such that for any \(Q\ge 1\) the only integer solution \((q,p_1,\ldots ,p_n)\) to the system
$$\begin{aligned} |q|< Q, \quad |qy_i-p_i| < \left( c\,Q^{-1}\right) ^{r_i}\quad (1\le i\le n) \end{aligned}$$(16)is \(q=p_1=\cdots =p_n=0\).
-
(iii)
There exists \(c>0\) such that for any \(H\ge 1\) the only integer solution \((a_0,a_1,\ldots ,a_n)\) to the system
$$\begin{aligned} |a_0+a_1y_1+\cdots +a_ny_n|< c H^{-1},\quad |a_i|< H^{r_i}\quad (1\le i\le n) \end{aligned}$$(17)is \(a_0=\cdots =a_n=0\).
The equivalence of (i) and (ii) is a straightforward consequence of the definition of \(\mathbf {Bad}(\mathbf{{r}})\). The equivalence of (ii) and (iii) is relatively well known, see Appendix in [13] for a similar statement. Indeed, this equivalence is essentially a special case of Mahler’s version of Khintchine’s transference Principle appearing in [38]. To make this paper self-contained we provide further details in Appendix A.
2.3 Approximation by algebraic numbers of bounded degree
There are two classical interrelated settings in the theory of approximation by algebraic numbers of bounded degree. One of them boils down to investigating small values of integral polynomials \(P\) with \(\deg P\le n\) at a given number \(\xi \). The other deals with the proximity of algebraic numbers \(\alpha \) of degree \(\le n\) to a given number \(\xi \), see [14] for further background. In particular, the long standing Wirsing–Schmidt conjecture [45, p. 258], which was motivated by Wirsing’s theorem [49], states that for any \(n\in \mathbb {N}\) and any real transcendental number \(\xi \) there is a constant \(C=C(\xi ,n)>0\) such that
holds for infinitely many algebraic numbers \(\alpha \) of degree \(\le n\), where \(H(\alpha )\) denotes the height of \(\alpha \) (to be recalled a few lines below). The \(n=1\) case of the conjecture is a trivial consequence of the theory of continued fractions. For \(n=2\) it was proved by Davenport and Schmidt [23]. However, there are only partial results for \(n>2\). Note, however, that using Dirichlet’s theorem it is easily shown that for any \(\xi \in \mathbb {R}\) there exists \(c_0=c_0(\xi ,n)>0\) such that \(|P(\xi )|<c_0H(P)^{-n}\) for infinitely many \(P\in \mathbb {Z}[x]\) with \(\deg P\le n\).
In this section we will deal with real numbers badly approximable by algebraic numbers. Given a polynomial \(P\) with integer coefficients, \(H(P)\) will denote the height of \(P\), which, by definition, is the maximum of the absolute values of the coefficients of \(P\). Given an algebraic number \(\alpha \in \mathbb {C}\), \(H(\alpha )\) will denote the (naive) height of \(\alpha \), which, by definition, is the height of the minimal defining polynomial \(P\) of \(\alpha \) over \(\mathbb {Z}\). It is also convenient to introduce the following three sets:
The sets \(\mathcal {B}_n\) and \(\mathcal {B}_n^*\) are the natural generalisations of badly approximable numbers to the context of approximation by algebraic numbers. They are known to have Lebesgue measure zero, e.g., by a Khintchine type theorem proved in [10]. Within this paper we will deal with the following two conjectures that Bugeaud formulated as Problems 24 and 25 in his Cambridge Tract [14, §10.2]:
Conjecture B1 \(\mathcal {B}_n\) contains a real transcendental number.
Conjecture B2 \(\mathcal {W}_n^*\cap \mathcal {B}_n^*\) contains a real transcendental number.
Note that Conjecture B1 is stronger than Conjecture B2 since we have that
The proof of (18) is rather standard. Indeed, it rests on the Mean Value Theorem and Minkowski’s theorem for convex bodies, see Appendix B for details. Here we establish the following Hausdorff dimension result that easily settles the above conjectures.
Theorem 3
For any natural number \(n\) and any interval \(I\) in \(\mathbb {R}\)
Proof
Without loss of generality we will assume that \(n\ge 2\). Let
\(1\le k\le n\) be an integer and \(\mathbf{{r}}_k=\left( \frac{1}{k},\ldots ,\tfrac{1}{k},0,\ldots ,0\right) \in \mathcal {R}_n\), where the number of zeros is \(n-k\). Let \(\xi \in \mathbb {R}\) be such that \(\mathbf{{f}}(\xi )\in \mathbf {Bad}(\mathbf{{r}}_k)\). By Property (iii) of Lemma 1, there exists \(c(\xi ,n,k)>0\) such that for any \(H\ge 1\) the only integer solution \((a_0,a_1,\ldots ,a_n)\) to the system
is \(a_0=\cdots =a_n=0\). Hence, for any non-zero polynomial \(P(x)=a_kx^k+\cdots +a_0\in \mathbb {Z}[x]\) with \(H(P)<H^{1/k}\) we must have that \(|P(\xi )|\ge c(\xi ,n,k) H^{-1}>c(\xi ,n,k) H(P)^{-k}\). By definition, this means that \(\xi \in \mathcal {B}_k\). To sum up, we have just shown that \(\mathbf{{f}}^{-1}(\mathbf {Bad}(\mathbf{{r}}_k))\subset \mathcal {B}_k\). Hence
By Theorem 2, for any interval \(I\subset \mathbb {R}\) we have that \(\dim \bigcap _{k=1}^n\mathbf{{f}}^{-1}(\mathbf {Bad}(\mathbf{{r}}_k))\cap I=1\). In view of the above inclusions the statement of Theorem 3 now readily follows. \(\square \)
Remark
An interesting problem is to show that Theorem 3 holds when \(n=\infty \).
3 Lattice points counting
The rest of the paper will be concerned with the proof of Theorem 2, which will rely heavily on efficient counting of lattice points in convex bodies. The lattices will arise upon reformulating \(\mathbf {Bad}(\mathbf{{r}})\) in the spirit of Dani [20] and Kleinbock [30]. This will require the following notation. Given a subset \(\Lambda \) of \(\mathbb {R}^{n+1}\), let
where \(\Vert \mathbf{{a}}\Vert _\infty =\max \{|a_0|,\ldots ,|a_n|\}\) for \(\mathbf{{a}}=(a_0,\ldots ,a_n)\). Given \(0<\kappa <1\), let
where \(\mathbf{{y}}\in \mathbb {R}^n\) is regarded as a row and \(I_n\) is the \(n\times n\) identity matrix. Finally, given \(\mathbf{{r}}\in \mathcal {R}_n\), \(b>1\) and \(t\in \mathbb {R}\), define the \((n+1)\times (n+1)\) unimodular diagonal matrix
Lemma 2
Let \(\mathbf{{y}}\in \mathbb {R}^n\), \(\mathbf{{r}}\in \mathcal {R}_n\). Then \(\mathbf{{y}}\in \mathbf {Bad}(\mathbf{{r}})\) if and only if there exists \(\kappa \in (0,1)\) and \(b>1\) such that for all \(t\in \mathbb {N}\)
Proof
The necessity is straightforward as all one has to do is to take \(H=b^t\) and divide each inequality in (17) by its right hand side. Then, assuming that \(\mathbf{{y}}\in \mathbf {Bad}(\mathbf{{r}})\), the non-existence of integer solutions to (17) would imply (22) with \(\kappa =c\). The sufficiency is only slightly harder. Assume that for some \(\kappa \) and \(b\) inequality (22) holds for all \(t\in \mathbb {N}\), while \(\mathbf{{y}}\not \in \mathbf {Bad}(\mathbf{{r}})\). Take \(c=\kappa /b\). By definition, there is an \(H>1\) such that (17) has a non-zero integer solution \((a_0,\ldots ,a_n)\). Take \(t=[\log H/\log b]+1\), where \([\cdot ]\) denotes the integer part. Note that \(Hb^{-t}<1\) and \(H^{-1}b^t\le b\). Then (17) implies that \(\delta (g^t_{\mathbf{{r}},b}G(\kappa ;\mathbf{{y}})\mathbb {Z}^{n+1})<1\), contrary to (22). The proof is thus complete. \(\square \)
Remark
Lemma 2 can be regarded as a variation of the Dani–Kleinbock correspondence between badly approximable points in \(\mathbb {R}^n\) and bounded orbits of certain lattices under the actions by the diagonal semigroup \(\left\{ g^t_{\mathbf{{r}},b}:t>0\right\} \), where \(b>1\). It is easily seen that this semigroup is independent of the choice of \(b>1\), which is usually taken to be \(e = \exp (1)\). The correspondence was first established by Dani [20] in the case \(\mathbf{{r}}=\left( \tfrac{1}{n},\ldots ,\tfrac{1}{n}\right) \) and then extended by Kleinbock [30] to the case of arbitrary positive weights and can be stated as follows. The point \(\mathbf{{y}}\in \mathbb {R}^n\) is \(\mathbf{{r}}\)-badly approximable if and only if the orbit of the lattice \(G(1;\mathbf{{y}})\mathbb {Z}^{n+1}\) under the action by \(\left\{ g^t_{\mathbf{{r}},e}:t>0\right\} \) is bounded.
We proceed by recalling two classical results from the geometry of numbers. In what follows, \(\mathrm {vol}_\ell (X)\) denotes the \(\ell \)-dimensional volume of \(X\subset \mathbb {R}^\ell \) and \(\#X\) denotes the cardinality of \(X\). Also \(\det \Lambda \) will denote the determinant or covolume of a lattice \(\Lambda \).
Minkowski’s Convex Body Theorem (see [45, Theorem 2B]) Let \(K\subset \mathbb {R}^\ell \) be a convex body symmetric about the origin and let \(\Lambda \) be a lattice in \(\mathbb {R}^\ell \). Suppose that \(\mathrm {vol}_\ell (K)>2^\ell \det \Lambda \). Then \(K\) contains a non-zero point of \(\Lambda \).
Theorem (Blichfeldt [12]) Let \(K\subset \mathbb {R}^\ell \) be a convex bounded body and let \(\Lambda \) be a lattice in \(\mathbb {R}^\ell \) such that \(\mathrm{rank\,}(K\cap \Lambda )=\ell \). Then
The following lemma is a straightforward consequence of Blichfeldt’s theorem.
Lemma 3
(cf. Lemma 4 in [34]) Let \(K\) be a convex bounded body in \(\mathbb {R}^\ell \) with \(\mathbf{{0}}\in K\) and \(\mathrm {vol}_\ell (K)<1/\ell !\). Then \(\mathrm{rank\,}(K\cap \mathbb {Z}^\ell )\le \ell -1\).
Proof
Assume the contrary, that is assume that \(\mathrm{rank\,}(K\cap \mathbb {Z}^\ell )=\ell \) (note that the rank cannot be bigger than \(\ell \)). It means that \(K\) contains at least \(\ell \) non-zero integer points. Since \(\mathbf{{0}}\in K\), we then have that \(\#\left( K\cap \mathbb {Z}^\ell \right) \ge \ell +1\). However, since \(\det \mathbb {Z}^\ell =1\) and \(\mathrm {vol}_\ell (K)<1/\ell !\), by Blichfeldt’s theorem, we conclude that
contrary to the above lower bound. \(\square \)
The bodies K of interest will arise as the intersection of parallelepipeds
with \(\ell \)-dimensional subspaces of \(\mathbb {R}^{n+1}\), where \(\varvec{\theta }=(\theta _0,\ldots ,\theta _n)\) is an \((n+1)\)-tuple of positive numbers. In view of this, we now obtain an estimate for the volume of the bodies that arise this way (Lemma 4 below) and then verify what Blichfeldt’s theorem means for such bodies (Lemma 5 below).
Lemma 4
Let \(\ell \in \mathbb {N}\), \(\ell \le n+1\), \(\varvec{\theta }=(\theta _0,\ldots ,\theta _n)\) with \(\theta _0,\ldots ,\theta _n>0\). Then for any linear subspace \(V\) of \(\mathbb {R}^{n+1}\) of dimension \(\ell \) we have that
Proof
Since \(V\) is a linear subspace of \(\mathbb {R}^{n+1}\) of dimension \(\ell \), it is given by \(n+1-\ell \) linear equations. Using Gaussian elimination, we can rewrite these equations to parametrise \(V\) with a linear map \(\mathbf{{f}}:\mathbb {R}^\ell \rightarrow \mathbb {R}^{n+1}\) of \(x_{i_1},\ldots ,x_{i_\ell }\) such that
where \(M=(m_{i,j})\) is an \(\ell \times (n+1)\) matrix with \(|m_{i,j}|\le 1\) for all \(i\) and \(j\). Then note that \(\mathrm {vol}_\ell (\Pi _{\varvec{\theta }}\cap V)\) is bounded by the area of the intersection of \(V\) with the cylinder \(|x_{i_j}|\le \theta _{i_j}\) for \(j=1,\ldots , \ell \). This area is equal to
where \(\Vert \cdot \Vert _e\) is the Euclidean norm on \(\bigwedge ^\ell \left( \mathbb {R}^{n+1}\right) \). Since \(|m_{i,j}|\le 1\), every coordinate of every partial derivative of \(\mathbf{{f}}\) is bounded by \(1\) in absolute value. Hence \(\Vert \partial \mathbf{{f}}/\partial x_{i_j}\Vert _e\le \sqrt{n+1}\) and the integrand in (24) is bounded above by \((\sqrt{n+1})^{\ell }\). This readily implies that the area given by (24) is bounded above by \(2^\ell (n+1)^{\ell /2}\theta _{i_1}\ldots \theta _{i_\ell }\le 2^\ell (n+1)^{\ell /2}\Theta _\ell \), whence the result follows. \(\square \)
Lemma 5
Let \(c(n)=4^{n+1}(n+1)^{(n+1)/2}(n+1)!\) and let \(\varvec{\theta }\) and \(\Theta _\ell \) be as in Lemma 4. Then for any discrete subgroup \(\Gamma \) of \(\mathbb {R}^{n+1}\) with \(\ell =\mathrm{rank\,}\big (\Gamma \cap \Pi _{\varvec{\theta }}\big )>0\) we have that
Proof
Let \(V={\text {span}}(\Gamma \cap \Pi _{\varvec{\theta }})\) and \(\Lambda =V\cap \Gamma \). Clearly, \(\mathrm{rank\,}(\Lambda )=\ell \) and furthermore \(\Lambda \) is a lattice in \(V\). Also note that \(\Gamma \cap \Pi _{\varvec{\theta }}=\Lambda \cap \Pi _{\varvec{\theta }}\). Since \(\Lambda \subseteq \Gamma \), we have that \(\delta (\Gamma )\le \delta (\Lambda )\). Let \(B(r)\) denote the open ball in \(V\) of radius \(r\) centred at the origin. Note that the length of any non-zero point in \(\Lambda \) is bigger than or equal to \(\delta (\Lambda )\ge \delta (\Gamma )\). Hence, by Minkowski’s convex bodies theorem, we must have that \(\mathrm {vol}_\ell \big (B(\delta (\Gamma )\big )\le 2^\ell \det \Lambda \), whence we obtain \(\det \Lambda \ge \mathrm {vol}_\ell (B(\delta (\Lambda )))2^{-\ell }\ge (\delta (\Lambda )/2)^{\ell }\). Now using this inequality, Blichfeldt’s theorem, Lemma 4 and the fact that \(\ell \le n+1\) readily gives (25).
We are now approaching the key counting result of this section. Let
where \(u>0\), \(b>1\) and \(\Pi _{\varvec{\theta }}\) is given by (23). Given \(\mathbf{{r}}\in \mathcal {R}_n\), let
Recall that \(\tau (\mathbf{{r}})\), \(\delta (\cdot )\), \(g^t_{\mathbf{{r}},b}\) and \(\Pi (b,u)\) are given by (9), (19), (21) and (26) respectively, and \([x]\) denotes the integer part of \(x\).
Lemma 6
Let \(b>1\), \(\mathbf{{r}}\in \mathcal {R}_n\), \(\lambda =\lambda (\mathbf{{r}})\), \(z=z(\mathbf{{r}})\), \(t\in \mathbb {N}\), \(u\in \mathbb {R}\), \(1\le \lambda u\le t\) and \(c(n)\) be as in Lemma 5. Let \(g^t=g^t_{\mathbf{{r}},b}\). Let \(\Lambda \) be a discrete subgroup of \(\mathbb {R}^{n+1}\) such that \(\mathrm{rank\,}\Lambda \le n-z\) and
Then
Proof
Let \(\mathbf{{x}}=(x_0,\ldots ,x_n)\in \Lambda \) be such that \(g^t\mathbf{{x}}\in \Pi (b,u)\). By the definitions of \(g^t=g^t_{\mathbf{{r}},b}\) and \(\Pi (b,u)\), we have that \(b^{t}|x_0|<b^{u}\) and \(b^{-r_it}|x_i|<1\) for \(i=1,\ldots ,n\). Equivalently, for \(s\in \mathbb {Z}\), \(1\le s\le u-1\), we have that
This can be written as \(g^{t-s}\mathbf{{x}}\in \Pi _{\varvec{\theta }}\), where \(\varvec{\theta }=(b^{u-s},b^{r_1s},\ldots ,b^{r_ns})\). Therefore,
where \(\Gamma =g^{t-s}\Lambda \). Now take \(s=[\lambda u]\). Recall that \(\lambda <1\) and that, by the conditions of Lemma 6, \([\lambda u]\le t\). Then, by the left hand side of (28), we have that \(\delta (\Gamma )\ge 1\). Hence, by Lemma 5 and (30), we get
where \(\ell =\mathrm{rank\,}\Gamma =\mathrm{rank\,}\Lambda \le n-z\). Note that all the components of \(\varvec{\theta }\) are \(\ge 1\) and exactly \(z\) of them equal \(1\). Then, since \(\ell \le n-z\) and \(s=[\lambda u]\), we get that
Combining this estimate with (31) and the obvious fact that \(n+1<c(n)b^{\tau }b^{\lambda u}\) gives (29). \(\square \)
4 ‘Dangerous’ intervals
In view of Lemma 2, when proving Theorem 2 we will aim to avoid the solutions of the inequalities \(\delta \left( g^t_{\mathbf{{r}},b}G_x\mathbb {Z}^{n+1}\right) <1\), where \(G_x=G(\kappa ;\mathbf{{y}})\) with \(\mathbf{{y}}=\mathbf{{f}}(x)\) and \(\kappa \) is a sufficiently small constant. For fixed \(\mathbf{{r}},b,t,\mathbf{{f}}\) and \(\kappa \) the above inequality is equivalent to the existence of \((a_0,\mathbf{{a}})\in \mathbb {Z}^{n+1}\) with \(\mathbf{{a}}\ne \mathbf{{0}}\) satisfying
Here the dot means the usual inner product. That is \(\mathbf{{a}}.\mathbf{{b}}=a_1b_1+\cdots +a_nb_n\) for any given \(\mathbf{{a}}=(a_1,\ldots ,a_n)\) and \(\mathbf{{b}}=(b_1,\ldots ,b_n)\). In this section we study intervals arising from (32) that, for obvious reasons, are referred to as dangerous (see [45] for similar terminology). We will consider several cases that are tied up with the magnitude of \(\mathbf{{a}}.\mathbf{{f}}'(x)\); i.e., the derivative of \(a_0+\mathbf{{a}}.\mathbf{{f}}(x)\)—see Propositions 1 and 2 below.
Throughout \(\mathcal {F}_n(I)\) and \(x_0\) are as in Theorem 2. First we discuss some conditions that arise from the nondegeneracy assumption on maps in \(\mathcal {F}_n(I)\). Let \(\mathbf{{f}}=(f_1,\ldots ,f_n)\in \mathcal {F}_n(I)\). Since \(\mathbf{{f}}\) is nondegenerate at \(x_0\in I\), there is a sufficiently small neighborhood \(I_{\mathbf{{f}}}\) of \(x_0\) such that the Wronskian of \(f'_1,\ldots ,f'_n\), which, by definition, is the determinant \(\det \big (f^{(i)}_{j}\big )_{1\le i,j\le n}\), is non zero everywhere in \(I_{\mathbf{{f}}}\). Then every coordinate function \(f_{j}\) is non-vanishing at all but countably many points of \(I_{\mathbf{{f}}}\subset I\)—see, e.g., [5, Lemma 3]. Since \(\mathbf{{f}}\in C^n\) and \(\mathcal {F}_n(I)\) is finite, we can choose a compact interval \(I_0\subset \bigcap _{\mathbf{{f}}\in \mathcal {F}_n(I)}I_{\mathbf{{f}}}\subset I\) satisfying
Property F There are constants \(0<c_0<1<c_1\) such that for every map \(\mathbf{{f}}=(f_1,\ldots ,f_n)\in \mathcal {F}_n(I)\), for all \(x\in I_0\), \(1\le i\le n\) and \(0\le j\le n\) one has that
Next, we prove two auxiliary lemmas that are well known in a related context.
Lemma 7
(cf. Lemma 5 in [5]) Let \(I_0\subset I\) be a compact interval satisfying Property F. Let \(2c_2=c_0c_1^{-n+1}n!^{-1}\), where \(c_0\) and \(c_1\) arise from (33). Then for any \(\mathbf{{f}}\in \mathcal {F}_n(I)\), any \(\mathbf{{a}}=(a_1,\dots ,a_n)\in \mathbb {Z}^n\!\setminus \!\{0\}\) and any \(x\in I_0\) there exists \(i\in \{1,\dots ,n\}\) such that \( |\mathbf{{a}}.\mathbf{{f}}^{(i)}(x)| \ge 2c_2\max _{1\le j\le n}|a_j|. \)
Proof
Solving the system \(a_1f^{(i)}_{1}(x)+\cdots +a_nf^{(i)}_{n}(x)=\mathbf{{a}}.\mathbf{{f}}^{(i)}(x)\), where \(1\le i\le n\), by Cramer’s rule with respect to \(a_i\) and using (33) to estimate the determinants involved in the rule we obtain
for each \(j=1,\ldots ,n\), whence the statement of lemma readily follows. \(\square \)
Lemma 8
(cf. Lemma 6 in [5]) Let \(I_0\subset I\) and \(c_2\) be as in Lemma 7. Then there is \(\delta _0>0\) such that for any interval \(J\subset I_0\) of length \(|J|\le \delta _0\), any \(\mathbf{{f}}\in \mathcal {F}_n(I)\) and \(\mathbf{{a}}=(a_1,\cdots ,a_n)\in \mathbb {Z}^n\!\setminus \!\{0\}\), there is an \(i\in \{1,\ldots ,n\}\) satisfying
Proof
Since \(I_0\) is compact, for each \(\mathbf{{f}}\in \mathcal {F}_n(I)\) and \(1\le i\le n\), the map \(\mathbf{{f}}^{(i)}\) is uniformly continuous on \(I_0\). Hence, there is a \(\delta _{i,\mathbf{{f}}}>0\) such that for any \(x,y\in I_0\) with \(|x-y|\le \delta _{i,\mathbf{{f}}}\) we have \(\left| \mathbf{{f}}^{(i)}(x)-\mathbf{{f}}^{(i)}(y)\right| <c_2/n\). Let \(J\subset I_0\) be an interval of length \(|J|\le \delta _{i,\mathbf{{f}}}\) and \(x,y\in J\). By Lemma 7, there is \(i\in \{1,\dots ,n\}\) such that \(|\mathbf{{a}}.\mathbf{{f}}^{(i)}(x)|\ \ge \ 2c_2h\), where \(h=\max _{1\le j\le n}|a_j|\). Then
Since \(\mathcal {F}_n(I)\) is finite, \(\delta _0=\inf _{i,\mathbf{{f}}}\delta _{i,\mathbf{{f}}}>0\). Hence (35) implies (34) provided that \(|J|\le \delta _0\). \(\square \)
Proposition 1
Let \(I_0\subset I\) be a compact interval satisfying Property F and \(\mathbf{{f}}\in \mathcal {F}_n(I)\). Further, let \(\delta _0\) be as in Lemma 8, \(\mathbf{{r}}\in \mathcal {R}_n\) and
Finally, let \(t\in \mathbb {N}\), \(\ell \in \mathbb {Z}_{\ge 0}\), \(b>1\), \(\mathbf{{a}}\in \mathbb {Z}^n\!\setminus \!\{\mathbf{{0}}\}\), \(a_0\in \mathbb {Z}\), \(0<\kappa <1\) and
Then, there is a constant \(c_3>0\) depending on \(n\), \(|I_0|\), \(c_1\), \(c_2\) and \(\delta _0\) only such that the set \(D^1_{t,\ell ,\mathbf{{r}},b,\kappa ,\mathbf{{f}}}(a_0,\mathbf{{a}})\) can be covered by a collection \(\mathcal {D}^1_{t,\ell ,\mathbf{{r}},b,\kappa ,\mathbf{{f}}}(a_0,\mathbf{{a}})\) of at most \(c_3\) intervals \(\Delta \) of length \( |\Delta |\le \kappa b^{-(1+\gamma )(t-\ell )}. \)
Proof
We will abbreviate \(D^1_{t,\ell ,\mathbf{{r}},b,\kappa ,\mathbf{{f}}}(a_0,\mathbf{{a}})\) as \(D^1_{}\) and naturally assume that \(D^1_{}\ne \emptyset \) as otherwise there is nothing to prove. Since \(I_0\) can be covered by at most \(\left[ \delta _0^{-1}|I_0|\right] +1\) intervals \(J\) of length \(|J|\le \delta _0\), it suffices to prove the proposition under the assumption that \(|I_0|\le \delta _0\). Let \(f(x)=a_0+\mathbf{{a}}.\mathbf{{f}}(x)\). Then, by Lemma 8, we have that \(|f^{(i)}(x)|>0\) for a fixed \(i\in \{1,\cdots ,n\}\) and all \(x\in I_0\). First consider the case \(i>1\). Then, using Rolle’s theorem, one finds that the function \(f^{(j)}(x)\) vanishes on \(I_0\) at \(\le i-j\) points \((0\le j\le i-1)\). Assuming that \(I_0=[a,b]\), let \(x_0=a<x_1<\cdots <x_{s-1}<x_s=b\) be the collection consisting of the points \(a\) and \(b\) and all the zeros of \(\prod _{j=0}^{i-1}f^{(j)}(x)\). Then, as we have just seen \(s\le 1+\sum _{j=0}^{i-1}(i-j)= i(i+1)/2+1\). By the choice of the points \(x_i\), we have that for \(1\le q\le s\) and \(0\le j\le i-1\) the function \(f^{(j)}(x)\) is monotonic and does not change sign on the interval \([x_{q-1},x_q]\). Therefore, in view of the definition of \(D^1_{}\) we must have that \(\Delta _q=D^1_{}\cap [x_{q-1},x_q]\) is an interval. Hence, \(D^1_{}=\bigcup _{q=1}^s\Delta _q\), a union of at most \((i+1)i/2+1\le (n+1)n/2+1\) intervals.
It remains to estimate the length of each \(\Delta _q\). To this end, take any \(x_1,x_2\in \Delta _q\). By the construction of \(\Delta _q\), the numbers \(f(x_1)\) and \(f(x_2)\) have the same sign and satisfy the inequality \(|f(x_i)|<\kappa b^{-t}\). Hence, \(|f(x_1)-f(x_2)|<\kappa b^{-t}\). By the Mean Value Theorem, \(|f(x_1)-f(x_2)|=|f'(\theta )(x_1-x_2)|\). Hence \(|x_1-x_2|\le \kappa b^{-t}/|f'(\theta )|\). Since \(\Delta _q\subset D^1_{}\) is an interval, \(\theta \in D^1_{}\). Hence, \(|f'(\theta )| \ge b^{\gamma t-(1+\gamma )\ell }\) and we obtain that \(|x_1-x_2|\le \kappa b^{-t} b^{-\gamma t+(1+\gamma )\ell }=\kappa b^{-(1+\gamma )(t-\ell )}\). This estimate together with the obvious equality \(|\Delta _q|=\sup _{x_1,x_2\in \Delta _q}|x_1-x_2|\) implies that \(|\Delta _q|\le \kappa b^{-(1+\gamma )(t-\ell )}\). Thus, if \(i > 1\), the set \(D^1\) can be covered by at most \(n(n+1)/2+1\) intervals of length \(\kappa b^{-(1+\gamma )(t-\ell )}\).
Now consider the case \(i=1\). Recall that \(f(x)=a_0+\mathbf{{a}}.\mathbf{{f}}(x)\). Then, by the definition of \(D^1_{}\) and (33), for \(x\in D^1\) we get
Further, (34)\({}_{i=1}\) implies that \(\inf _{x\in I_0}|f'(x)|\ge c_2\max _{1\le j\le n}|a_j|\). Therefore, \(f\) is monotonic on \(I_0\) and \(D^1_{}\) is covered by a single interval \(\Delta \) defined by the inequality \(|f(x)|<\kappa b^{-t}\). Arguing as above and using (37) we get
Thus, by splitting \(\Delta \) into smaller intervals if necessary, \(D^1_{}\) can be covered by at most \(\left[ \frac{2c_1n}{c_2}\right] +1\) intervals of length \(\kappa b^{-(1+\gamma )(t-\ell )}\). \(\square \)
Proposition 2
Let \(I_0\subset I\) be a compact interval satisfying Property F and \(\gamma =\gamma (\mathbf{{r}})\) be given by (36). Then there are constants \(K_0>0\) and \(0<\kappa _0<1\) such that for any \(\mathbf{{f}}\in \mathcal {F}_n(I)\), any \(\mathbf{{r}}\in \mathcal {R}_n\), \(t\in \mathbb {N}\), \(0\le \varepsilon <\gamma \), \(b>1\) and \(0<\kappa <\kappa _0\) the set
can be covered by a collection \(\mathcal {D}^2_{t,\varepsilon ,\mathbf{{r}},b,\kappa ,\mathbf{{f}}}\) of intervals such that
and
where \(\delta _t=\kappa \,b^{-t(1+\gamma -\varepsilon )}\) and \(\alpha =\tfrac{1}{(n+1)(2n-1)}\).
Proposition 2 will be derived from a theorem due to Bernik, Kleinbock and Margulis using the ideas of [6]. In what follows \(|X|\) denotes the Lebesgue measure of a set \(X\subset \mathbb {R}\). The following is a simplified version of Theorem 1.4 from [11] that refines the results of [33].
Theorem 4
(Theorem 1.4 in [11]) Let \(I\subset \mathbb {R}\) be an open interval, \(x_0\in I\) and \(\mathbf{{f}}\,:\,I\rightarrow \mathbb {R}^n\) be nondegenerate at \(x_0\). Then there is an open interval \(J\subset I\) centred at \(x_0\) and \(E_J>0\) such that for any real \(\omega ,K,T_1,\ldots ,T_n\) satisfying
the set
satisfies
We will also use the following elementary consequence of Taylor’s formula.
Lemma 9
Let \(f:J\rightarrow \mathbb {R}\) be a \(C^2\) function on an interval \(J\). Let \(\omega ,K>0\) and \(y\in J\) be such that \(|f''(x)|<K^2/\omega \) for all \(x\in J\) and
Then \(|f(x)|<\omega \) and \(|f'(x)|<K\) for all \(x\in J\) with \(|x-y|<\omega /2K\).
Proof of Proposition 2
Fix any \(\mathbf{{f}}\in \mathcal {F}_n(I)\). We will abbreviate \(D^2_{t,\varepsilon ,\mathbf{{r}},b,\kappa ,\mathbf{{f}}}\) as \(D^2_{}\) and naturally assume that it is non-empty as otherwise there is nothing to prove. By (33), \(\mathbf{{f}}\) is nondegenerate at any \(x\in I_0\) and therefore Theorem 4 is applicable. Let \(J=J(x)\) be the interval centred at \(x\) that arises from Theorem 4. Since \(I_0\) is compact there is a finite cover of \(I_0\) by intervals \(J(x_1),\ldots , J(x_s)\), where \(s=s_{\mathbf{{f}}}\) depends on \(\mathbf{{f}}\). Let \(0<\kappa _0<1\) and \(\kappa _0\le \min _{1\le i\le s}|I_0\cap J(x_i)|\). The existence of \(\kappa _0\) is obvious because \(|I_0\cap J(x)|>0\) for each \(x\in I_0\).
Let \(0<\kappa <\kappa _0\), \(\mathbf{{r}}\in \mathcal {R}_n\), \(t\in \mathbb {N}\), \(0\le \varepsilon <\gamma \), \(b>1\) and let
Note that since \(\varepsilon <\gamma \) and \(c_1>1\) we have that \(K>2\). Also note that \(\omega <2\kappa \). For each \(i\in \{1,\ldots ,s\}\) define the interval \(J_i=(a_i+\omega /2K,b_i-\omega /2K)\), where \([a_i,b_i]\) is the intersection of \(I_0\) and the closure of \(J(x_i)\). Since \(\kappa <\kappa _0\le |I_0\cap J(x_i)|\), \(\omega <2\kappa \) and \(K>2\), we have that \(J_i\ne \emptyset \) for each \(i\). Let
Our goal now is to use Lemma 9 with \(f(x)=a_0+\mathbf{{a}}.\mathbf{{f}}(x)\) in order to show that
In view of the definitions of \(D^2_{}\) and \(S(\omega ,K,T_1,\ldots ,T_n)\) and the choice of parameters (42), inequalities (41) hold for every \(y\in D^2\). Further, by (33), the inequalities \(|a_i|< b^{r_it}\) and the fact that \(r_i\le \gamma \) for all \(i\) implied by (36), we get that
Next, \(K^2/\omega =\frac{1}{2}n^2c_1^2\kappa ^{-1}b^{2(\gamma -\varepsilon )t}b^t>nc_1b^{\gamma t}\) because \(\varepsilon \le \gamma \le 1\), \(c_1>1\) and \(\kappa <1\). Therefore, by (45), we have that \(|f''(x)|\le K^2/\omega \) for all \(x\in I_0\). Thus, Lemma 9 is applicable and for \(1\le i\le s\) we have that \(\{x:|x-y|<\omega /2K\}\subset S(\omega ,K,T_1,\ldots ,T_n)\cap J(x_i)\) each \(y\in D^2_{}\cap J_i\). This proves (44).
Next, by Theorem 4, condition \(r_1+\cdots +r_n=1\) and (44) we conclude that
where \(E_{\mathbf{{f}}}=s\,\max _{1\le i\le s} E_{J(x_i)}\). By (43), \(\tilde{D}^2_{}\) can be written as a union of disjoint intervals of length \( \ge \omega /K=(nc_1)^{-1}\kappa \,b^{-t(1+\gamma -\varepsilon )}=(nc_1)^{-1}\delta _t. \) By splitting some of these intervals if necessary, we get a collection \(\tilde{\mathcal {D}}^2_{}\) of disjoint intervals \(\Delta \) such that \( \tfrac{1}{c_1n}\delta _t\le |\Delta |\le \delta _t. \) Let
Then, by (46) and the above inequality, we get
Let \(\mathcal {D}^2_{}\) be the collection of all the intervals in \(\tilde{\mathcal {D}}^2_{}\) together with the \(2s\) intervals \([a_i,a_i+\omega /2K]\) and \([b_i-\omega /2K,b_i]\) \((1\le i\le s)\). It is easily seen that \(2s\) is less than or equal to the right hand side of (47). Then, by (47) and the definition of \(\mathcal {D}^2_{}\), we get (38) and (39). Also, by construction, we see that \(\mathcal {D}^2_{}\) is a cover of \(D^2_{}\). The proof is thus complete. \(\square \)
5 A Cantor sets framework
Let \(R\ge 2\) be an integer. Given a collection \(\mathcal {I}\) of compact intervals in \(\mathbb {R}\), let \(\tfrac{1}{R}\mathcal {I}\) denote the collection of intervals obtained by dividing each interval in \(\mathcal {I}\) into \(R\) equal closed subintervals. For example, for \(R=3\) and \(\mathcal {I}=\{[0,1]\}\) we have that \(\frac{1}{R}\mathcal {I}=\left\{ [0,\tfrac{1}{3}],[\tfrac{1}{3},\tfrac{2}{3}],[\tfrac{2}{3},1]\right\} \). Let \(I_0\subset \mathbb {R}\) be a compact interval. The sequence \((\mathcal {I}_q)_{q\ge 0}\) will be called an \(R\)-sequence in \(I_0\) if
The intervals lying in \(\mathcal {I}_q\) will be called to be of level \(q\). Thus, the intervals of level \(q\) are obtained from intervals of level \(q-1\) by, firstly, splitting the intervals of \(\mathcal {I}_{q-1}\) into \(R\) equal parts to form \(\tfrac{1}{R}\mathcal {I}_{q-1}\), and, secondly, removing some of the intervals from \(\tfrac{1}{R}\mathcal {I}_{q-1}\) to form \(\mathcal {I}_q\). Given \(q\in \mathbb {N}\), the intervals that are being removed in this procedure will be denoted by
Naturally, \(I_q\) will denote any interval from the collection \(\mathcal {I}_q\), that is any interval of level \(q\). Observe that
By definition, given \(I_q\in \mathcal {I}_q\) with \(q\ge 1\), there is a unique interval \(I_{q-1}\in \mathcal {I}_{q-1}\) such that \(I_{q}\subset I_{q-1}\); this interval \(I_{q-1}\) will be called the precursor of \(I_q\). Obviously it is independent of the choice of the \(R\)-sequence \((\mathcal {I}_q)_{q\ge 0}\) with \(I_q\in \mathcal {I}_q\).
We also define the limit set of \((\mathcal {I}_q)_{q\ge 0}\) as
This is a Cantor type set. The classical middle third Cantor set can be constructed this way in an obvious manner with \(R=3\) and \(I_0=[0,1]\). Theorem 2 will be proved by finding suitable Cantor type sets \(\mathcal {K}((\mathcal {I}_q)_{q\ge 0})\). The construction of the corresponding \(R\)-sequences will be based on removing the intervals that intersect dangerous intervals—see Sect. 4.
Note that if \(\mathcal {I}_q\ne \emptyset \) for all \(q\) so that \((\mathcal {I}_q)_{q\ge 0}\) is genuinely an infinite sequence, then \(\mathcal {K}((\mathcal {I}_q)_{q\ge 0})\ne \emptyset \). However, ensuring that \(\mathcal {K}((\mathcal {I}_q)_{q\ge 0})\) is large requires better understanding of the sets \(\mathcal {I}_q\). There are various techniques in fractal geometry that are geared towards this task—see [24]. We shall use a recent powerful result of Badziahin and Velani [16] restated below using our notation. Naturally, if we expect that the Cantor set \(\mathcal {K}((\mathcal {I}_q)_{q\ge 0})\) is large, then the number of removed intervals at level \(q\), that is the cardinality of \(\widehat{\mathcal {I}}_q\), should be relatively small. In what follows, given \(q\in \mathbb {N}\) and an interval \(J\), let
This denotes the subcollection of removed intervals (when going from level \(q-1\) to level \(q\)) that lie over a given interval \(J\). The key characteristic that is ‘assessing’ the proportion of removed intervals at a particular level is given by
where the minimum is taken over all partitions \(\{\widehat{\mathcal {I}}_{q,p}\}_{p=0}^{q-1}\) of \(\widehat{\mathcal {I}}_q\), that is \(\widehat{\mathcal {I}}_q=\bigcup _{p=0}^{q-1}\widehat{\mathcal {I}}_{q,p}\). Also define the corresponding global characteristic as
The goal is to ensure that \(d((\mathcal {I}_q)_{q\ge 0})\) is small. Then as we shall shortly see the corresponding Cantor set is large. Note that when estimating \(d_q(\mathcal {I}_q)\) the key is to arrange the removed intervals into a partition \(\bigcup _{p=0}^{q-1}\widehat{\mathcal {I}}_{q,p}\) which makes the sum on the right of (51) small.
Theorem 5
(Theorem 4 in [16]) Let \(R\ge 4\) be an integer, \(I_0\) be a compact interval in \(\mathbb {R}\) and \((\mathcal {I}_q)_{q\ge 0}\) be an \(R\)-sequence in \(I_0\). If \(d((\mathcal {I}_q)_{q\ge 0})\le 1\) then
In order to facilitate the comparison of Theorem 5 to [16, Theorem 4] we summarise the correspondence between the notation and objects used in this paper and in [16]:
Given the above correspondence table, it is readily verified that our condition \(d((\mathcal {I}_q)_{q\ge 0})\le 1\) corresponds to condition (16) within [16, Theorem 4]. Hence Theorem 5 above is an immediate consequence of Theorem 4 from [16].
Let \(M>1\), \(X\subset \mathbb {R}\) and \(I_0\) be a compact interval. We will say that \(X\) is \(M\)-Cantor rich in \(I_0\) if for any \(\varepsilon >0\) and any integer \(R\ge M\) there exists an \(R\)-sequence \((\mathcal {I}_q)_{q\ge 0}\) in \(I_0\) such that \(\mathcal {K}((\mathcal {I}_q)_{q\ge 0})\subset X\) and \(d((\mathcal {I}_q)_{q\ge 0})\le \varepsilon \). We will say that \(X\) is Cantor rich in \(I_0\) if it is \(M\)-Cantor rich in \(I_0\) for some \(M\). We will say that \(X\) is Cantor rich if it is Cantor rich in \(I_0\) for some compact interval \(I_0\). The following statement readily follows from Theorem 5 and our definitions.
Theorem 6
Any Cantor rich set \(X\) satisfies \(\dim X=1\).
We now proceed with a discussion of the intersections of Cantor rich sets. To some extent this already appears in [16, Theorem 5] and in [13]. First we prove the following auxiliary statement.
Lemma 10
Let \(\left( \mathcal {I}^j_q\right) _{q\ge 0}\) be a family of \(R\)-sequences in \(I_0\) indexed by \(j\). Given \(q\in \mathbb {Z}_{\ge 0}\), let \(\mathcal {J}_q=\bigcap _{j}\mathcal {I}^j_q\). Then \((\mathcal {J}_q)_{q\ge 0}\) is an \(R\)-sequence in \(I_0\) such that
and
Proof
The validity of (48) for \((\mathcal {J}_q)_{q\ge 0}\) follows from the uniqueness of the precursor of an interval in any \(R\)-sequence from that sequence and the fact that \(\mathcal {I}^j_0=\{I_0\}\) for all \(j\), which means that \(\mathcal {J}_0=\bigcap _j\mathcal {I}^j_0=\{I_0\}\). Thus, \((\mathcal {J}_q)_{q\ge 0}\) is truly an \(R\)-sequence. The inclusion (53) is obvious for \(q=0\) for both sides of the inclusion are empty sets in this case. To see (53) for \(q>0\), observe that \(\mathcal {J}_{q-1}\subset \mathcal {I}^j_{q-1}\) and this implies that \(\tfrac{1}{R}\mathcal {J}_{q-1}\subset \tfrac{1}{R}\mathcal {I}^j_{q-1}\) for each \(j\). Then we have
Finally, by the inclusion \(\mathcal {J}_q\subset \mathcal {I}^j_q\), we have that \(\bigcup J_q\subset \bigcup I^j_q\) for each pair of \(j\) and \(q\), where the union is taken over \(J_q\in \mathcal {J}_q\) and \(I^j_q\in \mathcal {I}^j_q\) respectively. Hence, by (50), we have that \(\mathcal {K}((\mathcal {J}_q)_{q\ge 0})\subset \mathcal {K}((\mathcal {I}^j_q)_{q\ge 0})\) for all \(j\), whence (54) now follows. \(\square \)
Theorem 7
Let \(I_0\) be a compact interval. Then any countable intersection of \(M\)-Cantor rich sets in \(I_0\) is \(M\)-Cantor rich in \(I_0\). In particular, any finite intersection of Cantor rich sets in \(I_0\) is Cantor rich in \(I_0\).
Proof
Let \(\{X_j\}_{j\in \mathbb {N}}\) be a collection of \(M\)-Cantor rich sets in \(I_0\). Let \(\varepsilon >0\). Then, by definition, for each \(j\in \mathbb {N}\) and \(R\ge M\) there is an \(R\)-sequence \((\mathcal {I}^j_q)_{q\ge 0}\) in \(I_0\) such that \(\mathcal {K}((\mathcal {I}^j_q)_{q\ge 0})\subset X_j\) and \(d_q(\mathcal {I}^j_q)\le \varepsilon 2^{-j}\) for all \(q>0\). By (51), for each \(j\) and \(q>0\) there exists a partition \(\{\widehat{\mathcal {I}}^j_{q,p}\}_{p=0}^{q-1}\) of \(\widehat{\mathcal {I}}^j_q\) such that
For \(q\in \mathbb {Z}_{\ge 0}\) define \(\mathcal {J}_q=\bigcap _{j\in \mathbb {N}}\mathcal {I}^j_q\) and \(\widehat{\mathcal {J}}_{q,p} = \widehat{\mathcal {J}}_q\cap \bigcup _{j\in \mathbb {N}} \widehat{\mathcal {I}}^j_{q,p}\). Since \(\widehat{\mathcal {I}}^j_q=\bigcup _{p=0}^{q-1}\widehat{\mathcal {I}}^j_{q,p}\) for each \(j\), by (53), we have that \(\widehat{\mathcal {J}}_q=\bigcup _{p=0}^{q-1}\widehat{\mathcal {J}}_{q,p}\), where \(q>0\). Then, for each \(q>0\) we get that
This inequality together with (55) and the definition of \(d((\mathcal {J}_q)_{q\ge 0})\) implies that \(d((\mathcal {J}_q)_{q\ge 0})\le \varepsilon \). By (54) and the fact that \(\mathcal {K}\left( \left( \mathcal {I}^j_q\right) _{q\ge 0}\right) \subset X_j\) for each \(j\), we have that \(\mathcal {K}((\mathcal {J}_q)_{q\ge 0})\subset \bigcap _jX_j\). Thus the intersection \(\bigcap _jX_j\) meets the definition of \(M\)-Cantor rich sets and the proof is complete. \(\square \)
The winning sets in the sense of Schmidt have been used a lot to investigate various sets of badly approximable points. Hence we suggest the following
Problem 3
Verify if an \(\alpha \)-winning set in \(\mathbb {R}\) as defined by Schmidt [45] is \(M\)-Cantor rich for some \(M\) and, if this so, find an explicit relation between \(M\) and \(\alpha \).
6 Proof of Theorem 2
The following proposition is a key step to establishing Theorem 2. We will use the Vinogradov symbol \(\ll \) to simplify the calculations. The expression \(X\ll Y\) will mean that \(X\le CY\) for some \(C>0\), which only depends on \(n\), the family of maps \(\mathcal {F}_n(I)\) from Theorem 2 and the interval \(I_0\) occurring in Property F.
Proposition 3
Let \(\mathcal {F}_n(I)\) be as in Theorem 2, \(I_0\subset I\) be a compact interval satisfying Property F, \(c_0,c_1\) be the same as in (33), \(\sigma =1-(2n)^{-4}\) and \(\kappa _0\) be as in Proposition 2. Further, let
and let
and
Then for any \(\mathbf{{f}}\in \mathcal {F}_n(I)\), \(\mathbf{{r}}\in \mathcal {R}_n\) and any integers \(m\ge m_0\) and \(R\ge R_0\), there exists an \(R\)-sequence \((\mathcal {I}_q)_{q\ge 0}\) in \(I_0\) such that
-
(i)
for any \(t\in \mathbb {N}\) and any \(I_{t+m}\in \mathcal {I}_{t+m}\) we have that
$$\begin{aligned} \delta \big (g^tG_x\mathbb {Z}^{n+1}\big )\ge 1 \quad \text {for all } \quad x\in I_{t+m}; \end{aligned}$$(59)where \(g^t=g^t_{\mathbf{{r}},b}\) is given by (21) with \(b^{1+\gamma }=R\), \(\gamma =\gamma (\mathbf{{r}})\) and \(G_x=G(\kappa ;\mathbf{{f}}(x))\) is given by (20) with \(\kappa =R^{-m};\)
-
(ii)
if \(q\le m\) then \(\#\widehat{\mathcal {I}}_q=0;\)
-
(iii)
if \(q=t+m\) for some \(t\in \mathbb {N}\) then \(\widehat{\mathcal {I}}_{q}\) can be written as the union \(\widehat{\mathcal {I}}_q=\bigcup _{p=0}^{q-1}\widehat{\mathcal {I}}_{q,p}\) such that for integers \(p=t+3-2\ell \) with \(0\le \ell \le \ell _t=[t/2n]+1\) and \(I_{p}\in \mathcal {I}_{p}\) we have that
$$\begin{aligned} \displaystyle \#(\widehat{\mathcal {I}}_{q,p}\sqcap I_{p})\ \ll \ R^{\frac{1+\lambda }{2}(q-p)-\frac{1-\lambda }{2}m+3}\,, \end{aligned}$$(60)$$\begin{aligned} \#\widehat{\mathcal {I}}_{q,0}\ll R^{\sigma q} \end{aligned}$$(61)and \(\widehat{\mathcal {I}}_{q,p}=\emptyset \) for all other \(p<q\), where \(\lambda =\lambda (\mathbf{{r}})\) is given by (27).
Proof
Note that since \(\varrho _0,\varrho _1>1\) and \(c_0<1\), we have that \(R_0>4\). Let \(m \ge m_0\) and \(R\ge R_0\) be any integers. Define \(\mathcal {I}_0=\{I_0\}\) and then for \(q=1,\ldots ,m\) let \(\mathcal {I}_q=\tfrac{1}{R}\mathcal {I}_{q-1}\). In this case conditions (i) and (iii) are irrelevant, while (ii) is obvious. Continuing by induction, let \(q=t+m\) with \(t\ge 1\) and let us assume that \(\mathcal {I}_{q'}\) with \(q'<q\) are given and satisfy conditions (i)–(iii). Define \(\mathcal {I}_q\) to be the collection of intervals from \(\tfrac{1}{R}\mathcal {I}_{q-1}\) that satisfy (59). By construction, (i) holds, (ii) is irrelevant and we only need to verify condition (iii). We shall assume that \(\widehat{\mathcal {I}}_q\ne \emptyset \) as otherwise (iii) is obvious. By construction, \(\widehat{\mathcal {I}}_q\) consists of intervals \(I_q\) such that \(\delta \left( g^tG_x\mathbb {Z}^{n+1}\right) <1\) for some \(x\in I_q\). Recall that this is equivalent to the existence of \((a_0,\mathbf{{a}})\in \mathbb {Z}^{n+1}\) with \(\mathbf{{a}}\ne \mathbf{{0}}\) satisfying the system (32). We shall use Propositions 1 and 2 and Lemma 6 to estimate the number of these intervals \(I_q\). Before we proceed with the estimates note that, by (33) and (36), the validity of (32) implies that \(|\mathbf{{a}}.\mathbf{{f}}'(x)|\le nc_1 \max _{1\le j\le n}|a_j|\le nc_1\max _{1\le j\le n}b^{r_jt}=nc_1b^{\gamma t}\). Thus,
The arguments split into two cases depending on the size of \(t\) as follows. Note that in view of our choice of \(m_0\) we have that
and so Proposition 2 is applicable as appropriate.
Case 1: \(t\le 2nm\). In this case let \(\widehat{\mathcal {I}}_{q,0}=\widehat{\mathcal {I}}_q\) and \(\widehat{\mathcal {I}}_{q,p}=\emptyset \) for \(0<p<q\). Then, the only thing we need to verify is (61). Let \(\varepsilon =0\). Then, by (62), we have that
where \(D^2_{0}=D^2_{t,\varepsilon ,\mathbf{{r}},b,\kappa ,\mathbf{{f}}}\) (with \(\varepsilon =0\)) as defined in Proposition 2. Hence, \(\#\widehat{\mathcal {I}}_{q,0}\) is bounded by the number of intervals in \(\frac{1}{R}\mathcal {I}_{q-1}\) that intersect an interval from the corresponding collection \(\mathcal {D}^2_{0}\) of intervals arising from Proposition 2. By (49), the intervals in \(\frac{1}{R}\mathcal {I}_{q-1}\) are of length \(R^{-q}|I_0|\). By (38), the intervals from \(\mathcal {D}^2_{0}\) have length \(\le \delta _t=\kappa \,b^{-t(1+\gamma )}\). Hence, each interval from \(\mathcal {D}^2_{0}\) can intersect at most \(\delta _t/(R^{-q}|I_0|)+2=\kappa \,b^{-t(1+\gamma )}R^{q}|I_0|^{-1}+2\) intervals from \(\tfrac{1}{R}\mathcal {I}_{q-1}\). Since \(b^{1+\gamma }=R\), \(\kappa =R^{-m}\) and \(q=t+m\), we have that \(\delta _t/(R^{-q}|I_0|)= |I_0|^{-1}\). Hence each interval from \(\mathcal {D}^2_0\) can intersect \(\ll \delta _tR^{q}\) intervals from \(\tfrac{1}{R}\mathcal {I}_{q-1}\). Then, by (39), we get
Using \(q=t+m\), \(\kappa =R^{-m}\) and \(t\le 2nm\) we obtain from (64) that
Recall from Proposition 2 that \(\alpha =\frac{1}{(n+1)(2n-1)}\). Consequently, \(\sigma \ge 1-\frac{\alpha }{2n+1}\) and (65) implies (61).
Case 2: \(t>2nm\). Let \(\varepsilon =(2n)^{-1}\). Since \(\sum _ir_i=1\) and \(\gamma =\max \{r_1,\ldots ,r_n\}\), we have that \(\gamma \ge 1/n\). Hence \(\varepsilon <\gamma \). Recall that \(R>nc_1\). Then, by (62) and the choice of \(\varepsilon \), for any \(x\in I_0\) such that \(\delta (g^tG_x\mathbb {Z}^{n+1})<1\) we have that either
or for some \(\ell \in \mathbb {Z}\) with \(0\le \ell \le \ell _t= [t/2n]+1\)
Then, once again using the equivalence of \(\delta \left( g^tG_x\mathbb {Z}^{n+1}\right) <1\) to the existence of \((a_0,\mathbf{{a}})\in \mathbb {Z}^{n+1}\) with \(\mathbf{{a}}\ne \mathbf{{0}}\) satisfying (32), we write that
where
as defined in Propositions 1 and 2 respectively.
By definition, intervals in \(\widehat{\mathcal {I}}_q\) are characterised by having a non-empty intersection with the left hand side of (66). We now use the right hand side of (66) to define the subcollections \(\widehat{\mathcal {I}}_{q,p}\) of \(\widehat{\mathcal {I}}_q\). More precisely, for \(p=t+3-2\ell \) with \(0\le \ell \le \ell _t\) let \(\widehat{\mathcal {I}}_{q,p}\) consist of the intervals \(I_q\in \widehat{\mathcal {I}}_q\) that intersect \(D^1_{\ell }(a_0,\mathbf{{a}})\) for some \(\mathbf{{a}}\in \mathbb {Z}^n\!\setminus \!\{\mathbf{{0}}\}\) and \(a_0\in \mathbb {Z}\). Next, let \(\widehat{\mathcal {I}}_{q,0}\) consist of the intervals \(I_q\in \widehat{\mathcal {I}}_q\) that intersect \(D^2_{}\). Finally, define \(\widehat{\mathcal {I}}_{q,p}=\emptyset \) for all other \(p<q\). By (66), it is easily seen that \(\widehat{\mathcal {I}}_q=\bigcup _{p=0}^{q-1}\widehat{\mathcal {I}}_{q,p}\). It remains to verify (60) and (61).
\(\bullet \) Verifying (61) This is very much in line with Case 1. The goal is to count the number intervals in \(\frac{1}{R}\mathcal {I}_{q-1}\) that intersect some interval from the collection \(\mathcal {D}^2_{}\) arising from Proposition 2. By (49), the intervals in \(\frac{1}{R}\mathcal {I}_{q-1}\) are of length \(R^{-q}|I_0|\). By (38), the intervals from \(\mathcal {D}^2_{}\) have length \(\le \delta _t=\kappa \,b^{-t(1+\gamma -\varepsilon )}\). Hence, each interval from \(\mathcal {D}^2_{}\) can intersect at most \(\delta _t/(R^{-q}|I_0|)+2\ll \delta _tR^{q}\) intervals from \(\tfrac{1}{R}\mathcal {I}_{q-1}\). Then, by (39), we get
Using \(\kappa =R^{-m}\), \(b^{1+\gamma }=R\), \(q=t+m\) and \(0<\gamma \le 1\), we obtain that
Once again using the value of \(\alpha \) from Proposition 2 we verify that \(\sigma \ge 1-\frac{1}{2}\varepsilon \alpha \) and so (67) implies (61) as required.
\(\bullet \) Verifying (60). Let \(p=t+3-2\ell \) with \(0\le \ell \le \ell _t\) and \(I_{p}\in \mathcal {I}_{p}\). Let \(S(I_p)\) be the set of points \((a_0,\mathbf{{a}})\in \mathbb {Z}^{n+1}\) with \(\mathbf{{a}}\ne \mathbf{{0}}\) such that \(D^1_{\ell }(a_0,\mathbf{{a}})\cap I_{p}\not =\emptyset \). By Proposition 1, for every \((a_0,\mathbf{{a}})\in S(I_p)\) any interval in \(\mathcal {D}^1_{\ell }(a_0,\mathbf{{a}})\) is of length
as \(\kappa =R^{-m}\), \(b^{1+\gamma }=R\) and \(q=t+m\). Then, by (49), any interval from \(\mathcal {D}^1_{\ell }(a_0,\mathbf{{a}})\) intersects \(\ll R^{\ell }\) intervals from \(\tfrac{1}{R}\mathcal {I}_{q-1}\). By Proposition 1, \(\#\mathcal {D}^1_{\ell }(a_0,\mathbf{{a}})\,\ll \, 1\). Hence,
and our main concern becomes to obtain a bound for \(\#S(I_p)\). We shall prove that
Armed with this estimate establishing (60) and thus completing our task becomes simple. Indeed, using (68) and (69) gives
which implies (60) upon observing that \(2\ell +m-3=q-p\) and \(\frac{3+\lambda }{2}+\frac{\tau }{1+\gamma }<3\).
Proof of (69) We assume that \(S(I_p)\ne \emptyset \) as otherwise (69) is trivial. The proof will be split into several relatively simple steps.
Step 1: We show that for any \((a_0,\mathbf{{a}})\in S(I_{p})\) and any \(x\in I_p\) we have
where \(\varrho _0\) and \(\varrho _1\) are given by (56).
First we prove the right hand side of (70). To this end, fix any \((a_0,\mathbf{{a}})\in S(I_{p})\) and let \(x_0\in D^1_{\ell }(a_0,\mathbf{{a}})\cap I_{p}\). To simplify notation define \(f(x)=a_0+\mathbf{{a}}.\mathbf{{f}}(x)\). By the Mean Value Theorem, for any \(x\in I_{p}\) we have
where \(\tilde{x}_0\) is a point between \(x\) and \(x_0\). By the definition of \(D^1_{\ell }(a_0,\mathbf{{a}})\), we have that \(|f'(x_0)|\le b^{\gamma t-(1+\gamma )(\ell -1)}\). Proceeding as in (45), we get that \(|f''(\tilde{x}_0)|< nc_1b^{\gamma t}\). Substituting the estimates for \(|f'(x_0)|\) and \(|f''(\tilde{x}_0)|\) into (71) and using the inequity
implied by (49), we get
Since \(b^{1+\gamma }=R\), we have that
Since \(\ell \le \ell _t<t/4+1\), one easily verifies that \((t+3-2\ell )>(\ell -1)\). Therefore (73) implies the right hand side of (70).
Now we prove the left hand side of (70). Again fix any \((a_0,\mathbf{{a}})\in S(I_{p})\), \(x_0\in D^1_{\ell }(a_0,\mathbf{{a}})\cap I_{p}\) and let \(f(x)=a_0+\mathbf{{a}}.\mathbf{{f}}(x)\). By the Mean Value Theorem, for any \(x\in I_{p}\) we have that
where \(\widehat{x}_0\) is a point between \(x\) and \(x_0\). In particular, \(\widehat{x}_0\in I_p\) and therefore, by the right hand side of (70), which we have already established, \(|f'(\widehat{x}_0)|< \varrho _1b^{\gamma t-(1+\gamma )(\ell -1)}\). By the definition of \(D^1_{\ell }(a_0,\mathbf{{a}})\), we have that \(|f(x_0)|\le \kappa b^{-t}=b^{-t-m(1+\gamma )}\). Hence, using these estimates together with inequality (72) and equation \(b^{1+\gamma }=R\), we get from (74) that
Since \(m\ge m_0\ge 4\) we have that \(-m(1+\gamma )\le (1+\gamma )(\ell -2)\) for all \(\ell \ge 0\). Therefore (75) implies the left hand side of (70).
Step 2: Now we utilize (70) to show that \(\mathrm{rank\,}S(I_p)\le n-z\). First of all, observe that if \((a_0,\mathbf{{a}})\in S(I_{p})\), where \(\mathbf{{a}}=(a_1,\ldots ,a_n)\), then \(|a_j|<b^{r_jt}=1\) whenever \(r_j=0\). Since \(a_j\in \mathbb {Z}\) in this case, we have that
Let \(J=\{j:r_j\ne 0\}\) and \(\overline{J}=\{1,\ldots ,n\}\!\setminus \!J\). Note that \(J\) contains exactly \(n-z>0\) elements, where \(z=z(\mathbf{{r}})\) is the number of zeros in \(\mathbf{{r}}\). Let \(J_0\) be the subset of \(J\) obtained by removing the smallest index \(j_0\) such that \(r_{j_0}=\gamma (\mathbf{{r}})\). Note that if \(\mathbf{{r}}\) has only one non-zero component then \(J_0=\emptyset \). Let \(x\in I_p\). Then, using (76) and (70) we obtain that every \((a_0,\mathbf{{a}})\in S(I_{p})\) satisfies the system
where \(\varrho _0\) and \(\varrho _1\) ar given by (56). Let \(\mathbf{{B}}_{p,x}\) denote the set of \((a_0,a_{1},\ldots ,a_{n})\in \mathbb {R}^{n+1}\) satisfying (77). Then, \(S(I_p)\subset \mathbf{{B}}_{p,x}\). Clearly, \(\mathbf{{B}}_{p,x}\) is a convex body lying over the \(n-z+1\) dimensional linear subspace of \(\mathbb {R}^{n+1}\) given by the equations \(a_j=0\) for \(j\in \overline{J}\). As is well known the \(n-z+1\)-dimensional volume of \(\mathbf{{B}}_{p,x}\) is equal to
where \(\Omega \) is the determinant of the system of linear forms in the variables \(a_j\), \(j\in J\cup \{0\}\), staying in the first three lines of (77). Note that \(|\Omega \left| =|f'_{j_0}(x)\right| \). Hence, using (33) and the fact that \(b^{1+\gamma }=R\ge R_0\), we conclude that the volume of \(\mathbf{{B}}_{p,x}\) is
In this case, Lemma 3 is applicable and we have that \(\mathrm{rank\,}S(I_p)\le n-z\) as claimed at the start of Step 2.
Step 3 : Finally, we obtain (69). To this end, let \(\Gamma \) denote the \(\mathbb {Z}\)-span of \(S(I_p)\). Since \(\mathrm{rank\,}S(I_p)\le n-z\), we have that \(\mathrm{rank\,}\Gamma \le n-z\). Discarding the second inequality from (77) and using the fact that \(\varrho _0\le R=b^{1+\gamma }\), which is implied by (57), we obtain that the points \((a_0,\mathbf{{a}})\in S(I_p)\) satisfy the system
On applying \(g^t\) to both sides of the system and dividing its first inequality by \(\kappa =R^{-m}=b^{-m(1+\gamma )}\), (78) becomes
Hence, in view of the definitions of \(\Pi (b,u)\), \(g^t=g^t_{\mathbf{{r}},b}\) and \(G_x=G(\kappa ;\mathbf{{f}}(x))\) given in Sect. 3, namely (20), (21) and (26), we obtain that
Note that \(0\le \tau (\mathbf{{r}})\le 1/n\le \gamma (\mathbf{{r}})\le 1\). Therefore \(\lambda (1+\gamma )=(1+\gamma )/(1+\tau )\in [1,2]\). Hence \((m+\ell -1)\le \lambda u\le 2(m+\ell -1)\). Since \(t>2nm\), \(m\ge 4\) and \(\ell -1\le t/2n\), one can easily see that \(1<\lambda u<t\). Hence \(1\le t-[\lambda u]<t\). Take \(x\in I_{t-[\lambda u]}\cap I_p\) for an appropriate interval \(I_{t-[\lambda u]}\). By induction, (59) holds when \(t\) is replaced by \(t-[\lambda u]\). This verifies (28) with \(\Lambda =G_x\Gamma \). Clearly, \(\mathrm{rank\,}\Lambda =\mathrm{rank\,}\Gamma \le n-z\). Hence, by Lemma 6 and (79), we obtain that \( \#S(I_p)\,=\,\#g^tG_xS(I_p)\,\ll \, b^{\tau }b^{\lambda u}. \) Now (69) readily follows upon substituting \(b=R^{1/(1+\gamma )}\) and \(u=(1+\gamma )(m+\ell -1)\). \(\square \)
The following key statement is essentially a corollary of Proposition 3.
Theorem 8
Let \(\mathcal {F}_n(I)\) be as in Theorem 2, \(I_0\subset I\) be a compact interval satisfying Property F. Then there is a constant \(M_0\ge 4\) such that for any \(\mathbf{{r}}\in \mathcal {R}_n\) and any \(\mathbf{{f}}\in \mathcal {F}_n(I)\) the set \(\mathbf{{f}}^{-1}(\mathbf {Bad}(\mathbf{{r}}))\) is \(M\)-Cantor rich in \(I_0\) for any \(M> \max \big \{M_0,16^{1+1/\tau }\big \}\), where \(\tau =\tau (\mathbf{{r}})\) is defined by (9).
Proof
Let \(R_0\) and \(m_0\) be as in Proposition 3 and \(M_0=\max \left\{ R_0,4^{(2n)^4}\right\} \). Let \(M> \max \big \{M_0,16^{1+1/\tau }\big \}\), \(R\ge M\) and \(m\ge m_0\). Take any \(\mathbf{{f}}\in \mathcal {F}_n(I)\) and \(\mathbf{{r}}\in \mathcal {R}_n\). Let \((\mathcal {I}_q)_{q\ge 0}\) denote the \(R\)-sequence in \(I_0\) that arises from Proposition 3. By (59) and Lemma 2, we have that \(\mathcal {K}((\mathcal {I}_q)_{q\ge 0})\subset \mathbf{{f}}^{-1}(\mathbf {Bad}(\mathbf{{r}}))\). Thus, by definition, the fact that \(\mathbf{{f}}^{-1}(\mathbf {Bad}(\mathbf{{r}}))\) is \(M\)-Cantor rich in \(I_0\) will follow on showing that \(d((\mathcal {I}_q)_{q\ge 0})\) can be made \(\le \varepsilon \) for any \(\varepsilon >0\).
Observe that \((1-\lambda )/2=\tau /(2+2\tau )\). Then, since \(R\ge M>16^{1+1/\tau }\), we have that \(4R^{-\frac{1-\lambda }{2}}<1\). By conditions (ii) and (iii) of Proposition 3, for \(q>0\)
as \(m\rightarrow \infty \). Further, since \(R\ge M> M_0\ge 4^{(2n)^4}\), we have that \(4R^{-(1-\sigma )}=4R^{-(2n)^{-4}}<1\). Once again, by conditions (ii) and (iii) of Proposition 3, for \(q\le m\) we have that \(\#\left( \widehat{\mathcal {I}}_{q,0}\sqcap I_0\right) =0\), while for \(q>m\)
as \(m\rightarrow \infty \). By (51), combining (80) and (81) gives \(d_q(\mathcal {I}_q)\le \varepsilon \) for all \(q>0\) provided that \(m\) is sufficiently large. This completes the proof. \(\square \)
Proof of Theorem 2
Let \(I_0\) and \(M_0\) be the same as in Theorem 8 and \(M=\max \big \{M_0,16^{1+1/\tau _0}\big \}+1\), where \(\tau _0=\inf \{\tau (\mathbf{{r}}):\mathbf{{r}}\in W\}\). By (10), \(\tau _0>0\) and so \(M<\infty \). By Theorem 8, \(\mathbf{{f}}^{-1}(\mathbf {Bad}(\mathbf{{r}}))\) is \(M\)-Cantor rich in \(I_0\) for each \(\mathbf{{f}}\in \mathcal {F}_n(I)\) and each \(\mathbf{{r}}\in W\). By Theorem 7, so is \(S=\bigcap _{\mathbf{{f}}\in \mathcal {F}_n(I)}\bigcap _{\mathbf{{r}}\in W}\mathbf{{f}}^{-1}(\mathbf {Bad}(\mathbf{{r}}))\). By Theorem 6, \(\dim S=1\). The proof is thus complete. \(\square \)
7 Final remarks
In this section we discuss possible generalisations of our main results and further problems. First of all, the analyticity assumption within Theorem 1 can be relaxed by making use of more general fibering techniques such as that of [42]. This however leaves the question of whether Theorem 1 holds for arbitrary nondegenerate submanifold of \(\mathbb {R}^n\) as defined in [33] open. Beyond nondegenerate manifolds, it would be interesting to obtain generalisations of Theorems 1 and 2 for friendly measures as defined in [32] as well as for affine subspaces of \(\mathbb {R}^n\) and their submanifolds—see [31] for a related context. In another direction, it would be interesting to develop the theory of badly approximable systems of linear forms. Removing condition (10) is another appealing problem that would be settled if the sets of interest were shown to be winning in the sense of Schmidt (see [1, 3], [17, §1.3] and [45]). However, the techniques of this paper could also help accomplishing this task: the key is to make the lower bound on \(M\) appearing in Theorem 8 independent of \(\tau (\mathbf{{r}})\). Finally, all of the above questions make sense and are of course interesting in the case of Diophantine approximation over \(\mathbb {Q}_p\) and in positive characteristic.
Notes
It is not known whether there are any real algebraic numbers of degree \(\ge 3\) that are badly approximable.
References
An, J.: Badziahin–Pollington–Velani’s theorem and Schmidt’s game. Bull. Lond. Math. Soc. 45(4), 721–733 (2013)
An, J.: Two dimensional badly approximable vectors and Schmidt’s game. Preprint, arXiv:1204.3610, p. 12
An, J., Beresnevich, V., Velani, S.: Badly approximable points on planar curves and winning. Preprint, arXiv:1409.0064, p. 49
Baker, R.C.: Metric Diophantine approximation on manifolds. J. Lond. Math. Soc. 14, 43–48 (1976)
Beresnevich, V., Bernik, V.I.: On a metrical theorem of W. Schmidt. Acta Arith. 75(3), 219–233 (1996)
Beresnevich, V., Bernik, V., Dodson, M.: On the Hausdorff dimension of sets of well-approximable points on nondegenerate curves. Dokl. Nats. Akad. Nauk Belarusi 46(6), 18–20 (2002). (In Russian)
Bernik, V.I., Dodson, M.M.: Metric Diophantine approximation on manifolds. In: Cambridge Tracts in Mathematics, vol. 137. Cambridge University Press, Cambridge (1999)
Beresnevich, V.: A Groshev type theorem for convergence on manifolds. Acta Math. Hung. 94(1–2), 99–130 (2002)
Beresnevich, V.: Rational points near manifolds and metric Diophantine approximation. Ann. Math. (2) 175(1), 187–235 (2012)
Beresnevich, V.: On approximation of real numbers by real algebraic numbers. Acta Arith. 90(2), 97–112 (1999)
Bernik, V., Kleinbock, D., Margulis, G.A.: Khintchine-type theorems on manifolds: the convergence case for standard and multiplicative versions. Intern. Math. Res. Not. 2001(9), 453–486 (2001)
Blichfeldt, H.F.: Notes on geometry of numbers. Bull. Am. Math. Soc. 27, 150–153 (1921)
Badziahin, D., Pollington, A., Velani, S.: On a problem in simultaneous diophantine approximation: Schmidt’s conjecture. Ann. Math. (2) 174(3), 1837–1883 (2011)
Bugeaud, Y.: Approximation by algebraic numbers. In: Cambridge Tracts in Mathematics, vol. 160. CUP (2004)
Beresnevich, V., Velani, S.: A note on zero-one laws in metrical Diophantine approximation. Acta Arith. 133(4), 363–374 (2008)
Badziahin, D., Velani, S.: Multiplicatively badly approximable numbers and generalised Cantor sets. Adv. Math. 225, 2766–2796 (2011)
Badziahin, D., Velani, S.: Badly approximable points on planar curves and a problem of Davenport. Math. Ann. 359(3–4), 969–1023 (2014)
Cassels, J.W.S.: Some metrical theorems in Diophantine approximation. I. Proc. Camb. Philos. Soc. 46, 209–218 (1950)
Cassels, J.E.S.: Simultaneous diophantine approximation. II. Proc. Lond. Math. Soc. (3) 5, 435–448 (1955)
Dani, S.G.: Divergent trajectories of flows on homogeneous spaces and Diophantine approximation. J. Reine Angew. Math. 359, 55–89 (1985)
Davenport, H.: Simultaneous Diophantine approximation. Mathematika 1, 51–72 (1954)
Davenport, H.: A note on Diophantine approximation. II. Mathematika 11, 50–58 (1964)
Davenport, H., Schmidt, W.M.: Approximation to real numbers by quadratic irrationals. Acta Arith. 13, 169–176 (1967/1968)
Falconer, K.: Fractal geometry. In: Mathematical Foundations and Applications, 2nd ed. John Wiley & Sons Inc., Hoboken (2003)
Mattila, P.: Geometry of sets and measures in Euclidean spaces. In: Fractals and rectifiability, Cambridge Studies in Advanced Mathematics, vol. 44. Cambridge University Press, Cambridge (1995)
Fishman, L.: Schmidt’s game on fractals. Isr. J. Math. 171, 77–92 (2009)
Jarník, V.: Zur metrischen theorie der diophantischen approximationen. Prace mar. fiz. 36, 91–106 (1928)
Khintchine, A.J.: Zwei Bemerkungen zu einer Arbeit des Herrn Perron. Math. Zeitschr. 22, 274–284 (1925)
Khintchine, A.J.: Zur metrischen Theorie der diophantischen Approximationen. Math. Zeitschr. 24, 706–714 (1926)
Kleinbock, D.: Flows on homogeneous spaces and Diophantine properties of matrices. Duke Math. J. 95(1), 107–124 (1998)
Kleinbock, D.: Extremal subspaces and their submanifolds. Geom. Funct. Anal. 13(2), 437–466 (2003)
Kleinbock, D., Lindenstrauss, E., Weiss, B.: On fractal measures and Diophantine approximation. Selecta Math. (N.S.) 10(4), 479–523 (2004)
Kleinbock, D.Y., Margulis, G.A.: Flows on homogeneous spaces and Diophantine approximation on manifolds. Ann. Math. (2) 148(1), 339–360 (1998)
Kristensen, S., Thorn, R., Velani, S.: Diophantine approximation and badly approximable sets. Adv. Math. 203(1), 132–169 (2006)
Kleinbock, D., Weiss, B.: Badly approximable vectors on fractals. Isr. J. Math. 149, 137–170 (2005)
Kleinbock, D., Weiss, B.: Modified Schmidt games and Diophantine approximation with weights. Adv. Math. 223(4), 1276–1298 (2010)
Khintchine, A.: Einige Sätze über Kettenbrüche, mit Anwendungen auf die Theorie der Diophantischen Approximationen. Math. Ann. 92, 115–125 (1924)
Mahler, K.: Ein Übertragungsprinzip für lineare Ungleichungen. Časopis Pěst. Mat. Fys. 68, 85–92 (1939)
Nesharim, E.: Badly approximable vectors on a vertical Cantor set. Preprint, arXiv:1204.0110, p. 19
Perron, O.: Über diophantische Approximationen. Math. Ann. 83(1–2), 77–84 (1921)
Pollington, A., Velani, S.: On simultaneously badly approximable numbers. J. Lond. Math. Soc. (2) 66(1), 29–40 (2002)
Pyartli, A.: Diophantine approximation on submanifolds of euclidean space. Funkts. Anal. Prilosz. 3, 59–62 (1969). (In Russian)
Schmidt, W.M.: On badly approximable numbers and certain games. Trans. Am. Math. Soc. 123, 178–199 (1966)
Schmidt, W.M.: Badly approximable systems of linear forms. J. Number Theory 1, 139–154 (1969)
Schmidt, W.M.: Diophantine Approximation. Lecture notes in mathematics, 785. Springer, New York (1980)
Schmidt, W.M.: Open problems in Diophantine approximation. In: Diophantine Approximations and Transcendental Numbers (Luminy, 1982), Progr. Math. vol. 31, pp. 271–287. Birkh auser, Boston (1983)
Sprindžuk, V.G.: Mahler’s problem in metric number theory. Translated from the Russian by B. Volkmann. In: Translations of Mathematical Monographs, vol. 25. American Mathematical Society, Providence (1969)
Sprindžuk, V.G.: Achievements and problems in Diophantine approximation theory. Russ. Math. Surv. 35, 1–80 (1980)
Wirsing, E.: Approximation mit algebraischen Zahlen beschränkten Grades. J. Reine Angew. Math. 206, 67–77 (1960)
Acknowledgments
The author is grateful to Maurice Dodson, Sanju Velani and Dmitry Kleinbock for their valuable comments on an earlier version of this paper and to Evgeniy Zorin whose suggestion of the modification of Sprindžuk’s fibering technique (specifically Lemma 12) helped filling a gap in Sprindžuk’s argument. The author is also really grateful to the anonymous reviewer of this paper for the very detailed report providing many helpful suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Supported by EPSRC Grant EP/J018260/1.
Appendices
Appendix A: Proof of Lemma 1
As was mentioned, the equivalence of (i) and (ii) is straightforward and thus left to the reader. The proof of the equivalence of (ii) and (iii) will make use of the following
Lemma 11
(Mahler [38]) Let \(L_0,\ldots ,L_n\) be a system of linear forms in variables \(u_0,\ldots ,u_n\) with real coefficients and determinant \(d\ne 0\), and let \(L'_0,\ldots ,L'_n\) be the transposed system of linear forms in variables \(v_0,\ldots ,v_n\), so that \(\sum _{i=0}^n L_iL'_i=\sum _{i=0}^n u_iv_i\). Let \(\lambda =T_0\cdots T_n/|d|\). Suppose there exists an integer point \((u_0,\ldots ,u_n)\ne (0,\ldots ,0)\) such that
Then there exists an integer point \((v_0,\ldots ,v_n)\ne (0,\ldots ,0)\) such that
Proof of the equivalence of (ii) and (iii) in Lemma 1 First consider the case when \(r_i>0\) for all \(i\). If \(n=1\) then there is nothing to prove because (16) and (17) coincide when \(c=c'\) and \(Q=H\). Thus we will assume that \(n\ge 2\). Define the linear forms \(L_0=u_0\) and \(L_i=u_0y_i-u_i\) (\(1\le i\le n\)). Then the transposed forms are \(L_0'=v_0+v_1y_1+\cdots +v_ny_n\) and \(L'_i=-v_i\) (\(1\le i\le n\)). It is easily verified that Mahler’s lemma is applicable with \(d=1\). Let \(0<c<1\). Then, the existence of a non-zero integer solution \((q,p_1,\ldots ,p_n)\) to (16) implies the existence of a non-zero integer solution \((u_0,\ldots ,u_n)\) to (82) with \(T_0=Q\) and \(T_i=\delta Q^{-r_i}\) \((1\le i\le n)\), where \(\delta =c^{\tau }<1\) and \(\tau =\min r_i>0\). By Mahler’s lemma, there is a non-zero integer solution \((v_0,\ldots ,v_n)\) to (83), where \(\lambda =\delta ^n\). This implies (17) with \(H=Q\) and \(c'=n\delta \). Note that \(c'\rightarrow 0\) as \(c\rightarrow 0\). Thus if there is \(c'>0\) such that the only integer solution to (17) is \(a_0=a_1=\cdots =a_n=0\), then there must exist a \(c>0\) such that the only integer solution to (16) is \(q=p_1=\cdots =p_n=0\). The converse is proved in exactly the same way by swapping the roles of \(L_i\) and \(L'_i\) and taking \(T_0=c'H^{-1}\), \(T_i=H^{r_i}\) and \(Q=(n+1)H\).
The case when \(\mathbf{{r}}\) contains a zero is treated by induction. The case \(n=1\) meaning \(\mathbf{{r}}=(r_1)\) with \(r_1\ne 0\) has already been done. Assume that \(n>1\) and our desired statement holds for smaller dimensions. Assume that \(\mathbf{{r}}\) contains a zero component. Without loss of generality assume that \(r_n=0\). Since \(\Vert x\Vert ^{1/0}=0\), we have that \(\max _{1\le i\le n}\Vert qy_i\Vert ^{1/r_i}=\max _{1\le i\le n-1}\Vert qy_i\Vert ^{1/r_i}\). Therefore, \(\mathbf{{y}}\in \mathbf {Bad}(\mathbf{{r}})\) if and only if \(\mathbf{{y}}'=(y_1,\ldots ,y_{n-1})\in \mathbf {Bad}(\mathbf{{r}}')\), where \(\mathbf{{r}}'=(r_1,\ldots ,r_{n-1})\). By induction, this is equivalent to the existence of \(c>0\) such that for any \(H\ge 1\) the only integer solution \((a_0,a_1,\ldots ,a_{n-1})\) to the system
is \(a_0=\cdots =a_{n-1}=0\). In turn, the latter statement is equivalent to (iii), since, by \(r_n=0\), the inequality \(|a_n|< H^{r_n}\) implies that \(a_n=0\) whenever \(a_n\in \mathbb {Z}\). \(\square \)
Appendix B: Proof of (18)
Recall that (18) is the following inclusion
Since for \(n=1\) (18) becomes trivial, we will assume that \(n\ge 2\). Fix any \(\xi \in \mathcal {B}_n\). Define
Note that, by Dirichlet’s theorem, \(c_1(\xi ,n)\le 1\) and, by the assumption that \(\xi \in \mathcal {B}_n\), we have that
Also note that \(\xi \) is not algebraic of degree \(\le n\), since otherwise \(c_1(\xi ,n)=0\).
Assume for a moment that \(\xi \not \in \mathcal {B}_n^*\). Then there exists a sequence \((\alpha _i)_{i\in \mathbb {N}}\) of algebraic numbers of degree \(\le n\) such that \(H(\alpha _i)^{n+1}|\xi -\alpha _i|\rightarrow 0\) as \(i\rightarrow \infty \). Let \(P_i\in \mathbb {Z}[x]\) be the minimal polynomial of \(\alpha _i\) over \(\mathbb {Z}\). In particular, \(P_i(\alpha _i)=0\), \(1\le \deg P_i=\deg \alpha _i\le n\) and \(H(P_i)=H(\alpha _i)\). Using Taylor’s Theorem, we get that
as \(i\rightarrow \infty \), contrary to (84). Hence, \(\xi \) must be in \(\mathcal {B}_n^*\).
In order to show that \(\xi \in \mathcal {W}_n^*\) take \(\varepsilon _0 = (1+n^2\max \{1,|\xi |^{n}\})^{-n}c_1(\xi ,n)\), any integer \(Q>1\) and consider the following system of inequalities:
By Minkowski’s theorem for convex bodies, there exists a non-zero integer vector \((a_0,\ldots ,a_n)\) satisfying this system. Define the polynomial \(P=a_nx^n+\cdots +a_1x+a_0\). Assume for a moment that \(|P'(\xi )|\le Q\). Then, using the above system we get that
and
Thus \(H(P)\le \left( 1+n^2\max \left\{ 1,|\xi |^{n}\right\} \right) Q\) and we obtain
This contradicts the definition of \(c_1(\xi ,n)\). Therefore, we must have that \(|P'(\xi )|> Q\). By (85), we have that \(|P(\xi )|<\varepsilon _0 Q^{-n}\). Hence, by Taylor’s formula and the fact that \(|P(\xi )|<\varepsilon _0 Q^{-n}<\tfrac{1}{2}Q^{-n}\), the expression
has the same sign as \(P'(\xi )\) for \(x=\xi \pm Q^{-n-1}\) provided that \(Q\) is sufficiently large. Hence \(P(x)\) must have opposite signs at \(\xi -Q^{-n-1}\) and \(\xi +Q^{-n-1}\). By continuity, this means that there is a root of \(P\), say \(\alpha \) in the interval \(|x-\xi |\le Q^{-n-1}\). Once again using (85) we obtain that \(H(P)\le c_2 Q\) with \(c_2=(1+n^2\max \{1,|\xi |^{n}\})\varepsilon _0^{-1}\). This means that
Let \(P_\alpha \) denote the minimal polynomial of \(\alpha \) over \(\mathbb {Z}\). Since \(P(\alpha )=0\), by Gauss’s lemma, \(P_\alpha \) divides \(P\), that is \(P=P_\alpha R\) for some \(R\in \mathbb {Z}[x]\). Then, by [47, §2, Lemma 8], we have that \(H(P)=H(P_\alpha R)\gg H(P_\alpha )H(R)\ge H(P_\alpha )=H(\alpha )\), where the constant in the Vinogradov symbol depends on \(n\) only. Thus, \(H(P)\gg H(\alpha )\) and (86) implies that
Note that if the same \(\alpha \) turned up in the above construction for infinitely many \(Q\), then \(\xi \) would be equal to this \(\alpha \). However, this is impossible, since, as we noted just after (84), \(\xi \) cannot be algebraic of degree \(\le n\). Therefore, there must be infinitely many different real algebraic numbers \(\alpha \) of degree \(\le n\) satisfying (87). This means that \(\xi \in \mathcal {W}_n^*\). The proof is thus complete. \(\square \)
Appendix C: Proof of the Fibering Lemma
Here we give a proof of the Fibering Lemma stated in Sect. 2.1. We will need the following technical statement.
Lemma 12
Let \(0<d_0<d\) be integers and let \(e_d:\mathbb {Z}^m_{\ge 0}\rightarrow \mathbb {Z}_{\ge 0}\) be given by
Let
Then
-
(i)
\(e_d\) maps \(S_{d_0}\) into \(\mathbb {Z}_{\ge 0}\) injectively, and
-
(ii)
\(e_d(S_{d_0})\cap e_d(\mathbb {Z}_{\ge 0}^m\!\setminus \!S_{d_0})=\emptyset \).
Proof
Let \((\alpha _1,\ldots ,\alpha _m)\) and \((\alpha '_1,\ldots ,\alpha '_m)\) be two different elements of \(S_{d_0}\) and let \(k\) be the largest index such that \(\alpha _k\ne \alpha '_k\). Note that
If \(\alpha _1+\cdots +\alpha _m\ne \alpha '_1+\cdots +\alpha '_m\) then
Thus, \(e_d(\alpha _1,\ldots ,\alpha _m)\ne e_d(\alpha '_1,\ldots ,\alpha '_m)\) in this case. Now if \(\alpha _1+\cdots +\alpha _m=\alpha '_1+\cdots +\alpha '_m\) then
Again we obtain that \(e_d(\alpha _1,\ldots ,\alpha _m)\ne e_d(\alpha '_1,\ldots ,\alpha '_m)\) and thus prove part (i) of the lemma. Finally, observe that
whence (ii) readily follows. \(\square \)
Proof of the Fibering Lemma Since \(f_0,\ldots ,f_n\) are analytic we can write them as the following absolutely convergent power series
Since they are linearly independent over \(\mathbb {R}\) for every \((c_0,\ldots ,c_n)\in \mathbb {R}^{n+1}\!\setminus \!\{\mathbf{{0}}\}\) the function
is not identically zero. Hence, there exist a multiindex \((\alpha _1,\ldots ,\alpha _m)\in \mathbb {Z}_{\ge 0}^m\) such that
Therefore, the collection of the sets
taken over \((\alpha _1,\ldots ,\alpha _m)\in \mathbb {Z}^m_{\ge 0}\) is an open cover of the unit sphere in \(\mathbb {R}^{n+1}\). Since the sphere is compact, there exists a finite subcover, say, \(\mathcal {C}(\alpha ^{(1)}_1, \ldots ,\alpha ^{(1)}_m)\), ..., \(\mathcal {C}(\alpha ^{(N)}_1,\ldots ,\alpha ^{(N)}_m)\). Let
Then, for every non-zero collection \(c_0,\ldots ,c_n\) there exists a multiindex \((\alpha _1,\ldots ,\alpha _m)\in S_{d_0}\), where \(S_{d_0}\) is given by (89), such that
Take any integer \(d>d_0\) and any \(\mathbf{{u}}=(u_1,u_2,\ldots ,u_m)\in \mathbb {R}^{m}\) with \(u_1\ldots u_m\ne 0\). Then, by what we have just shown,
Note that
where \(e_d\) is given by (88). Consider the linear the combination of functions (93) with coefficients \(c_0,\ldots ,c_n\):
By Lemma 12 and (92), the above series in \(t\) is not identically zero. Since \((c_0,\dots ,c_n)\ne 0\) is arbitrary, the functions (93) are linearly independent over \(\mathbb {R}\). \(\square \)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.