Abstract
We prove limit laws for infinite horizon planar periodic Lorentz gases when, as time n tends to infinity, the scatterer size \(\rho \) may also tend to zero simultaneously at a sufficiently slow pace. In particular we obtain a non-standard Central Limit Theorem as well as a Local Limit Theorem for the displacement function. To the best of our knowledge, these are the first results on an intermediate case between the two well-studied regimes with superdiffusive \(\sqrt{n\log n}\) scaling (i) for fixed infinite horizon configurations—letting first \(n\rightarrow \infty \) and then \(\rho \rightarrow 0\)—studied e.g. by Szász and Varjú (J Stat Phys 129(1):59–80, 2007) and (ii) Boltzmann–Grad type situations—letting first \(\rho \rightarrow 0\) and then \(n\rightarrow \infty \)—studied by Marklof and Tóth (Commun Math Phys 347(3):933–981, 2016) .
1 Introduction
In this paper we are interested in limit laws for infinite horizon planar periodic Lorentz gases with small scatterers. The Lorentz gas, a popular model of mathematical physics introduced by Lorentz in 1905 [23], is a dynamical system on the infinite billiard table obtained by removing strictly convex scatterers from \({{\mathbb {R}}}^2\). We study the periodic model when these scatterers are round disks of radius \(\rho \in (0,1/2)\) positioned at the points of the Euclidean lattice \({{\mathbb {Z}}}^2\). This table can be split up into countably many compact cells, each congruent to the unit square, which can be also regarded as the 2-dimensional flat torus. As usual, a point particle on the table moves with a unit velocity vector along straight lines inside the table, and collides elastically—angle of incidence equals angle of reflection—at the scatterers. This billiard flow produces a billiard map for the Poincaré section of outgoing collisions. The phase space of the billiard map in a single cell is \({{\mathcal {M}}}= \partial O \times [-\frac{\pi }{2}, \frac{\pi }{2}]\), where O is a round disk at the origin with radius \(\rho \). The phase space representing all cells together is \({\widehat{{{\mathcal {M}}}}} = {{\mathcal {M}}}\times {{\mathbb {Z}}}^2\) and the displacement function \(\kappa _\rho :{{\mathcal {M}}}\rightarrow {{\mathbb {Z}}}^2\) indicates the difference in cell numbers going from one collision to the next. As O is strictly convex, the billiard is dispersing, and the dynamics has good hyperbolicity properties.
For any \(\rho \in (0,1/2)\), the horizon is infinite which means that the time between two consecutive collisions—and accordingly, \(\kappa _\rho :{{\mathcal {M}}}\rightarrow {{\mathbb {Z}}}^2\)—is unbounded. This corresponds to corridors, that is, infinite strips on \({{\mathbb {R}}}^2\) parallel to some direction \(\xi \in {{\mathbb {Z}}}^2\setminus \{0\}\) which do not intersect any of the scatterers of the infinite billiard table. Both the number and the geometry of these corridors depend on \(\rho \). Hence, the value of the parameter \(\rho \) strongly affects the asymptotic properties of the dynamics, and in particular, plays a central role in our exposition.
1.1 Recalling limit laws for fixed \(\rho \in (0,1/2)\) as time \(n \rightarrow \infty \)
A consequence of the infinite horizon is the superdiffusive behaviour of \(\kappa _\rho \) with \(\rho \in (0,1/2)\) fixed, captured in the first place in the Central Limit Theorem (CLT) with non-standard normalization. To recall this result along with its refinements, we introduce some notation to be used throughout this paper. Let \(T_{\rho }:{{\mathcal {M}}}\rightarrow {{\mathcal {M}}}\) be the billiard map, recall that it preserves the canonical invariant probability measure \(\mu \). Set
Choosing the initial point on \({{\mathcal {M}}}\) according to \(\mu \), we can regard \(\kappa _{n,\rho }\) as a family of random variables. Throughout we let \(\implies \) stand for convergence in distribution. We recall the CLT with non-standard normalization: for every \(\rho \in (0,\frac{1}{2})\) there exists a positive definite matrix \(\Sigma _{\rho }\) such that:
This result was conjectured by Bleher [5] and proved rigorously via two different methods: Szász and Varjú [31], and Chernov and Dolgopyat [11]. In the setting above, the requirement of having two non-parallel corridors (present in [11, 31]) is automatically satisfied because the scatterers are positioned at the lattice points.
It is important to note that there is an explicit formula for \(\Sigma _{\rho }\), which involves the scatterer geometry for fixed \(\rho \), see for example [11, Formula (2.1)]. A computation (similar to our proof of Lemma A.4) shows that
where \(\Sigma \) is the diagonal matrix given in (1). To compare these results with our Theorem A below, we point out the following direct consequence of (2) and (3):
The method of proof in [31] relies on the existence of a Young tower for \(T_\rho \) as in [9, 34] and an abstract result of Bálint and Gouëzel [4] along with several additional properties of \((\kappa _\rho , T_\rho )\) established in [31]. One notable feature of this method is that it provides a refinement of the CLT (2), namely the Local Limit Theorem (LLT):
where \(\Phi _{\Sigma _{\rho }}\) is the density of the Gaussian random variable on the r.h.s of (2).
The method of proof in [11] exploits exponential mixing for the sequence \(\{\kappa _\rho \circ T_\rho ^n\}_{n\ge 1}\). The authors of that work develop an argument based on standard pairs to establish a bound on the correlations for \(\kappa _\rho \):
Using (6), the CLT (2) is proved in [11, Proof of Theorem 8 a)] via blocking type arguments; we refer to Denker [17] for a classical reference. Furthermore, as shown in [11, Proof of Theorem 8], the limit law (2) together with a tightness argument for a truncated version of \(\kappa _\rho \) provides another refinement of the CLT, namely, the Weak Invariance Principle (WIP):
Similar versions of the CLT (2) and the WIP (7) hold for the flight time function taking values in \({{\mathbb {R}}}^2\), see [11].
In a different direction, a further important consequence of the LLT (5) established in [31] is that it allows one to study mixing of the infinite measure preserving billiard dynamics on the entire lattice \({\widehat{{{\mathcal {M}}}}} = {{\mathcal {M}}}\times {{\mathbb {Z}}}^2\). This can be modelled by a \({{\mathbb {Z}}}^2\) extension
The dynamics \({\hat{T}}_\rho \) preserves the measure \({\hat{\mu }}=\mu \times \text{ Leb}_{{{\mathbb {Z}}}^2}\), where \(\text{ Leb}_{{{\mathbb {Z}}}^2}\) denotes the counting measure. Let us introduce the notation \({{\mathcal {M}}}_{\xi }:={{\mathcal {M}}}\times \{\xi \} \subset {\widehat{{{\mathcal {M}}}}}\) for \(\xi \in {{\mathbb {Z}}}^2\), and furthermore, for brevity, let \({{\mathcal {M}}}_0:={{\mathcal {M}}}\times \{(0,0)\}\), where the label 0 refers here to the origin in \({{\mathbb {Z}}}^2\). An immediate consequence of (5) is:
A first refinement of the LLT (5) and of the mixing statement (8) was obtained by Pène [28] who proved the analogue of these statements for the class of dynamically Hölder observables. Later on, Pène and Terhesiu [29], building on the results in [4], obtained sharp error rates in the LLT and the mixing for dynamically Hölder observables, including observables supported on compact sets. Furthermore, [29] establish optimal error rates for mean zero observables.
1.2 Recalling results as first \(\rho \rightarrow 0\) and then \(n \rightarrow \infty \) (Boltzmann–Grad limit)
In [24, 25], Marklof and Strömbergsson studied the Boltzmann–Grad limit of the periodic Lorentz gas. This corresponds to letting the scatterer size \(\rho \rightarrow 0\) and investigating the displacement in the rescaled continuous time \(T=\rho t\) (so that the mean free path remains bounded). In particular, [24] proves that, in this Boltzmann–Grad limit, the displacement of the particle converges, on any finite time interval, to an explicitly given Markov process. Marklof and Tóth [26] then studied the large time asymptotic of this Markov process, and obtained the CLT and the WIP with non-standard normalization \(\sqrt{T\log T}\).
These results on the Boltzmann–Grad limit scenario hold in any dimension, not just in \(d=1, 2\) as the results mentioned in the previous subsection. For more details, we refer to the original references. What is most relevant for us is that [26, Theorem 1.1] and [26, Theorem 1.3] are reduced to discrete time statements that can be formulated in terms of the behavior of \(\kappa _{n,\rho }\) in the limits \(\rho \rightarrow 0\) first and then \(n\rightarrow \infty \). In particular, [26, Theorem 1.2] states for \(d=2\) that:
where \(\kappa _{n,\rho }\) and \(\Sigma \) are as in (1),Footnote 1 while [26, Theorem 1.4] is the corresponding WIP which, when \(d=2\), reads as (7) with the main difference of the limit paths: \(\rho \rightarrow 0\) followed by \( n\rightarrow \infty \), as opposed to fixed \(\rho \).
In [26], the authors state that an open problem is to consider the joint limit \(\rho \rightarrow 0\) and \(n\rightarrow \infty \). In the Boltzmann–Grad limit scenario with diffusive behaviour, this type of question is answered by Lutsko and Tóth in [22] for random Lorentz gases, in dimension \(d=3\), where, on top of the initial condition, additional randomness comes from the random placement of the scatterers. However, their model is very different from the model considered in [26] and it is characterized by diffusion (Brownian motion with standard normalization).
1.3 Main results as \(\rho \rightarrow 0\) and \(n \rightarrow \infty \) in the joint limit
Our main result takes a step in answering the open question in [26] for the planar periodic Lorentz gas. It reads as follows.
Theorem A
Let \(\kappa _{n,\rho }\) and \(\Sigma \) be as in (1), and let
There exists a function \(M(\rho )\) with \(M(\rho )\rightarrow \infty \) as \(\rho \rightarrow 0\) such that,
A precise expression of \(M(\rho )\) is given in Theorem 7.1 in Sect. 7. At this stage we mention that \(M(\rho )\) depends on the rate of correlation decay for Hölder observables as \(\rho \rightarrow 0\). How this decay rate depends on \(\rho \) is not known and we do not attempt to study this in the present paper. However, we comment on some relevant aspects of correlation decay below.
In the remainder of this section, we make some further comments on how our results compare to various other works, and on some key ingredients of our argument.
Comments on the rate of correlation decay Statistical limit laws in dynamical settings in general, and our results in particular, are strongly related to effective bounds on time correlations. For several decades, it has been a major problem to prove exponential decay of correlations for Hölder observables in dispersing billiards, that is, bounds of the form:
where \(\psi _1:{{\mathcal {M}}}\rightarrow {{\mathbb {R}}}\) and \(\psi _2:{{\mathcal {M}}}\rightarrow {{\mathbb {R}}}\) are centered, Hölder continuous observables, and \(\hat{\theta }_{\rho }<1\) may depend on the Hölder exponent, while \(C_\rho (\psi _1,\psi _2)>0\) on the Hölder norm of these functions, and both constants depend also on \(\rho \) (i.e. on the billiard table). Several powerful methods have been designed to prove bounds of the form (10), in particular using quasi-compactness of the transfer operator on Young towers [34] or anisotropic Banach spaces [14], coupling of standard pairs [10, Chapter 7] or most recently, Birkhoff cones [13]. However, each of these methods involve some non-constructive compactness argument which is the reason why there is no explicit information available on how the rate of decay (i.e. \(C_\rho \) and \(\hat{\theta }_{\rho }\)) depends on \(\rho \). For instance, in the framework of quasi-compact transfer operators, this corresponds to having effective bounds on the essential spectral radius, but not on the spectral gap.
In fact, depending on the method, \(\psi _1\) and \(\psi _2\) may belong to a larger space (that contains Hölder functions), however, these spaces do not contain the unbounded observable \(\kappa _{\rho }\). Hence, even for fixed \(\rho \), it requires additional effort to obtain correlation bounds for unbounded observables, in particular, to derive (6). As mentioned above, in our context of the infinite horizon Lorentz gas, (6) was proved by Chernov and Dolgopyat in [11, Proposition 9.1], which is an important reference for our work. Let us also mention [3, Lemma 3.2] on a similar bound for the induced return time arising in dispersing billiards with cusps, and the more recent paper [32] where correlation bounds for unbounded observables are studied in an axiomatic framework that includes further billiard models. Nonetheless, all these works consider the large time asymptotics of a fixed billiard system. To treat the simultaneous scaling of \(\rho \rightarrow 0\) and \(n\rightarrow \infty \), in Appendix C of the present paper we extend [11, Proposition 9.1] in two directions. On the one hand, on top of the mere existence of some \(\hat{C}_\rho > 0\) and \(\hat{\vartheta }_{\rho }<1\) in (6), we explicitly relate these constants to \(C_\rho \) and \(\hat{\theta }_{\rho }\) of (10), as expressed in (66).Footnote 2 On the other hand, to exploit correlation bounds of the type (6) when taking the joint limit, these have to be combined with the action of the perturbed transfer operator \(R_{\rho }(t)\) (introduced below) as stated in our Proposition C.1.
Comparison with results on the random Lorentz gas To compare our Theorem A with the results of Lutsko and Tóth on the random Lorentz gas, it is important to emphasize that although both [22] and our paper consider a joint limit of scatterer radius tending to 0 and time tending to infinity simultaneously, the settings of these two papers are quite different. In particular, the starting point of Lutsko and Tóth is the Boltzmann–Grad limit of the random Lorentz gas, and accordingly, [22] can handle situations when time tends to infinity at a sufficiently slow pace in relation to the scatterer size tending to 0. In contrast, the starting point of our work is the superdiffusive limit in the infinite horizon periodic Lorentz gas with fixed scatterer size (see Sect. 1.1 for a summary of previous results), and accordingly we can handle situations when time tends to infinity at a sufficiently fast pace in relation to the scatterer size tending to 0.
It is also important to note that under the condition \(M(\rho )=o(\log n)\) we have
which shows that our Theorem A is indeed a direct analogue of both (4) and (9). To simplify the exposition, we omit the case \(d=1\) (i.e., the Lorentz tube), but believe that similar results can be obtained by the same arguments.
Further comments on some corollaries of our result and some elements of our proofs A main advantage of the current method of proof via spectral methods is that it allows us to obtain (with no additional effort) the LLT (5) and the mixing statement (8) with appropriate limit paths \(\rho \rightarrow 0\) simultaneously with \(n\rightarrow \infty \), as opposed to fixed \(\rho \). For the LLT we refer to Theorem 7.3 and for the mixing result we refer to Corollary 7.5.
We mention up front that unlike in the fixed \(\rho \) scenario with main results recalled in Sect. 1.1, we cannot exploit the existence of a Young tower because it seems undoable to build such a tower in a fashion that it depends continuously on \(\rho \). Instead, we prove Theorem A via the Nagaev method on Banach spaces of distributions introduced by Demers and Zhang [14,15,16] in the spirit of the spaces constructed in Demers and Liverani [12]. See Aaronson and Denker [1, 2] for a classical reference on the Nagaev method in (Gibbs Markov) dynamics beyond the CLT with standard normalization (that is \(\sqrt{n}\)). However, as we shall explain below, the standard pairs argument in [11] plays a crucial role in our proof.
We end this introduction summarizing the main steps and challenges of our proofs. A main difficulty comes from the fact that as \(\rho \rightarrow 0\), more and more corridors open up and controlling their number and geometry is a non-trivial task. Another challenge for the proofs of Theorem A and the LLT in Theorem 7.3 comes from the fact that the spaces in [14,15,16] cannot be used in a straightforward way even in the infinite horizon case with fixed \(\rho \).
The Nagaev method requires: (1) the existence of a Banach space \(({{\mathcal {B}}}, \Vert \,\Vert _{{{\mathcal {B}}}})\) on which the transfer operator \(R_\rho \) of \(T_\rho \) has a spectral gap; (2) that the perturbed transfer operator (\(R_\rho (t) \psi =R(e^{it\kappa _\rho }\cdot \psi )\) for \(\psi \in {{\mathcal {B}}}\)) has sufficiently good continuity estimates \(\Vert R_\rho (t)-R_\rho (0)\Vert _{{{\mathcal {B}}}}\le C |t|^\nu \); the larger \(\nu > 0\), the better.
Regarding 1), using a Lasota–Yorke inequality on a strong space \({{\mathcal {B}}}\) and a weak space \({{\mathcal {B}}}_w\), Demers and Zhang [14,15,16] established the spectral gap for every fixed \(\rho \), see Sect. 4. This is the main reason why we resorted to the use of such Banach spaces.
Regarding 2), as in Keller and Liverani [20], one could work with the weak space. For infinite horizon billiards, continuity estimates in the strong or weak Banach spaces in [14,15,16] have not been obtained previously. In Sect. 5.2, we give continuity estimates in such Banach spaces (strong or weak); the estimates there rely heavily on a version of the growth lemma, namely Proposition 3.1. These continuity estimates are \(O(|t|^\nu )\) for \(\nu <1/2\) with explicit dependence on \(\rho \), in both the strong and the weak spaces. This exponent \(\nu \) is too small to obtain the asymptotics of the leading eigenvalue of \(R_\rho (t)\) directly. Therefore we resort to a decomposition of the eigenvalue in several pieces (see the proof of Proposition 6.3) and exploit the standard pairs arguments in [11] to deal with some parts of the estimate, see Appendix C. Along the way, we give a new proof of the LLT (5) for fixed \(\rho \) which is new at an abstract level as well, namely by working on the Banach spaces [14,15,16] in the absence of good continuity estimates but in the presence of exponential decay of correlations.
The paper is organised as follows: In Sect. 2 we recall some basic properties of hyperbolic billiards and also estimate widths of corridors that open up as \(\rho \rightarrow 0\). Section 3 gives the Growth Lemmas, following [14,15,16] but with estimates made explicit in terms of \(\rho \), and including sums over unbounded number of corridors (this is the reason why the estimates are worse than for the usual Growth Lemmas). Section 4 introduces the Banach spaces and recalls the proof of the spectral gap property for the unperturbed transfer operator \(R_\rho \), showing that the \(\rho \)-dependence can be controlled. Section 5 is devoted to the continuity estimates of the perturbed transfer operator \(R_\rho (t)\) and Sect. 6 gives the asymptotics of the corresponding leading eigenvalue. The precise statements and proofs of the limit theorems are gathered in Sect. 7.
The appendices give further technical details on corridor sums (Appendix A), distortion (Appendix B) and decay of correlations by a combination of tower and standard pair arguments (Appendix C).
2 Preliminaries on Lorentz gas on \({{\mathbb {Z}}}^2\)
Our general reference on hyperbolic billiards is Chernov and Markarian [10], the conventions of which are followed in our exposition, except for some minor differences. In particular, we use coordinates \((\theta ,\phi ) \in {{\mathbb {S}}}^1 \times [-\frac{\pi }{2}, \frac{\pi }{2}]\) on \({{\mathcal {M}}}\), where
-
\(\theta \in {{\mathbb {S}}}^1\) in clockwise orientation describes the collision point on the scatterer (so the corresponding point on \(\partial O\) is \((\rho \sin \theta , \rho \cos \theta )\));
-
\(\phi \in [-\frac{\pi }{2}, \frac{\pi }{2}]\) denotes the outgoing angle that the billiard trajectory makes after a collision at a point with coordinate \(\theta \) with the outward normal vector \(\textbf{N}_\theta \) at this point (so \(\phi = \frac{\pi }{2}\) corresponds to an outgoing trajectory tangent to O in the positive \(\theta \)-direction).
In these coordinates \((\theta ,\phi )\), the measure \(\mu \) has the same form \(d\mu = \frac{1}{4\pi } \cos \phi \, d\phi \, d\theta \) for all values of the radius \(\rho > 0\). Integrals involving the displacement function \(\kappa _\rho \), however, do depend on \(\rho \). If the flight between \((x,\ell )\) and \(( T_\rho (x), \ell + \kappa _\rho (x))\) goes through a corridor for a long time before hitting a scatterer at the boundary of this corridor, then the angle at which the second scatterer is hit is close to \(\pm \frac{\pi }{2}\). This sparks another long flight in the same corridor, i.e., \(\Vert \kappa _\rho (T_\rho x)\Vert \) is large, too.
In the remainder of this section, we record some properties of \(T_\rho \) and \(\kappa _\rho \). In Sect. 2.1 the geometry of corridors is described, with special emphasis on the asymptotics of small \(\rho \). In Sect. 2.2 we focus on the singularities, which, in addition to strong hyperbolicity, are the other main feature of the map \(T_{\rho }:{{\mathcal {M}}}\rightarrow {{\mathcal {M}}}\). In Sect. 2.3, the hyperbolic properties of \(T_{\rho }:{{\mathcal {M}}}\rightarrow {{\mathcal {M}}}\) are discussed. Some lemmas of technical character are moved to Appendices A and B.
Notation: For functions (or sequences) f and g, we use the Vinogradov notation \(f \ll g\) and the Landau big O notation interchangeably: there is a constant \(C > 0\) such that \(f \le C g\). Similarly \(f \asymp g\) means that there exists \(C>1\) such that \(C^{-1} g \le f \le Cg\).
2.1 Corridors and their widths
Let \(O_\ell \) denote the circular scatterer of radius \(\rho \) placed at lattice point \(\ell \in {{\mathbb {Z}}}^2\). The computation of \(\mu (x \in \partial O_0 \times [-\frac{\pi }{2}, \frac{\pi }{2}]: \kappa _\rho (x) = (p,q))\) is based on the division of the phase space in corridors. These are infinite strips in rational directions given by \(\xi \in {{\mathbb {Z}}}^2 {\setminus } \{ 0 \}\) for \(\rho \) sufficiently small, that are disjoint from all scatterers (but maximal with respect to this property), and they are periodically repeated under integer translations. As soon as \(\rho < \frac{1}{2}\), there are infinite corridors parallel to the coordinate axes. If \(\rho < \frac{1}{4} \sqrt{2}\), then corridors at angles of \(\pm 45^\circ \) open up, and the smaller \(\rho \) becomes, the more corridors open up at rational angles.
Given \(0 \ne \xi \in {{\mathbb {Z}}}^2\) and \(\rho > 0\) sufficiently small, there are two corridors simultaneous tangent to \(O_0\) and \(O_\xi \), one corridor on either side of the arc connecting 0 and \(\xi \). The widths of the corridors are denoted by \(d_{\rho }(\xi )\) and \(\tilde{d}_{\rho }(\xi )\), see Fig. 1.
Lemma 2.1
If \(\rho = 0\) and \(\xi = (p,q) \in {{\mathbb {Z}}}^2\) is expressed in lowest terms, then
For \(\rho > 0\), the actual width of the corridor is then \(d_{\rho }(\xi ) = {\tilde{d}}_{\rho }(\xi ) = \max \{0, |\xi |^{-1} - 2\rho \}\).
Remark 2.2
Let us call these two corridors in the direction \(\xi \) the \(\xi \)-corridors. They open up only when \(\rho < d_0(\xi )/2 = {\tilde{d}}_0(\xi )/2\). For \(\rho = 0\), the common boundary (called \(\xi \)-boundary) of the two \(\xi \)-corridors is the line through 0 and \(\xi \). The other boundaries are lines parallel to the \(\xi \)-boundary, going through lattice points that are called \(\xi '\) and \(\xi ''\) in the below proof. For \(\xi = (p,q)\) (with \(\gcd (p,q) = 1\)), these points \(\xi ' = (p',q')\), \(\xi '' = (p'',q'')\) are uniquely determined by \(\xi \) in the sense that \(p'/q'\) and \(p''/q''\) are convergents preceding p/q in the continued fraction expansion of p/q. In particular \(|\xi '|, |\xi ''| \le |\xi |\). In the sequel, we usually only need one of these two \(\xi \)-corridors, and we take the one with \(\xi '\) in its other boundary.
Proof
If \((p,q) = (0, \pm 1)\) or \((\pm 1, 0)\), then clearly \(d_0(\xi ) = {\tilde{d}}_0(\xi ) = 1\), so we can assume without loss of generality that \(p \ge q > 0\). Let L be the arc connecting (0, 0) to (p, q). The corridors associated to \(\xi \) intersect \([0,p] \times [0,q]\) in diagonal strips on either side of L.
Let \(\frac{q}{p} = [0;a_1, \dots , a_n=a]\) be the standard continued fraction expansion with \(a \ge 1\), and the previous two convergents are denoted by \(q'/p'\) and \(q''/p''\), say \(q''/p''< q/p <q'/p'\) (the other inequality goes analogously). Therefore \(q'p-qp' = 1\) and \(q''p'-q'p'' = -1\). Also
are the best rational approximations of q/p, belonging to lattice points \(\xi '\) above L and \(\xi ''\) below L. The vertical distance between \(\xi '\) and the arc L is \(|q'-p'\frac{q}{p}| = \frac{1}{p}|q'p-p'q| = \frac{1}{p}\). The vertical distance between L and \(\xi ''\) is
The corridor’s diameter is perpendicular to \(\xi \), so \(d_0(\xi )\) is computed from this vertical distance as the inner product of the vector \((0,1/p)^T\) and the vector \(\xi = (p,q)^T\) rotated over \(90^\circ \):
The computation for \({\tilde{d}}_0(\xi ) = |\xi |^{-1}\) is the same. \(\square \)
2.2 Singularities of the billiard map
In the coordinates \((\theta ,\phi ,\ell ) \in {{\mathbb {S}}}^1 \times [-\frac{\pi }{2}, \frac{\pi }{2}] \times {{\mathbb {Z}}}^2\) (or \(\times {{\mathbb {Z}}}\) if it is a Lorentz tube), the size of the scatterers \(\rho \) doesn’t appear, but it comes back in the formula of the billiard map \(T_{\rho }\) and in its hyperbolicity. Also the curvature of the scatterers is \({{\mathcal {K}}}\equiv 1/\rho \). We recall some notation from the Chernov and Markarian book [10] (going back to the work of Sinaĭ), bearing in mind that we have to redo several of their estimates to track the precise dependence on \(\rho \). The phase space is \({\widehat{{{\mathcal {M}}}}} = {{\mathcal {M}}}\times {{\mathbb {Z}}}^2 = \bigcup _{\ell \in {{\mathbb {Z}}}^2} {{\mathcal {M}}}_\ell \), where each \({{\mathcal {M}}}_\ell \) is a copy of the cylinder \({{\mathbb {S}}}^1 \times [-\frac{\pi }{2}, \frac{\pi }{2}]\), see Fig. 2.
Let \({{\mathcal {S}}}_0 = \{ \phi = \pm \frac{\pi }{2} \}\) be the discontinuity of the billiard map corresponding to grazing collisions. The forward and backward discontinuities are
so that \(T_{\rho }^n:{{\mathcal {M}}}\setminus {{\mathcal {S}}}_n \rightarrow {{\mathcal {M}}}\setminus {{\mathcal {S}}}_{-n}\) is a diffeomorphism. We line the curve \({{\mathcal {S}}}_0\) with homogeneity strips \({{\mathbb {H}}}_k\) bounded by curves \(|\pm \frac{\pi }{2}-\phi | = k^{-r_0}\) and \(|\pm \frac{\pi }{2}-\phi | = (k+1)^{-r_0}\), \(k \ge k_0\), for a fixed number \(r_0 > 1\). The standard value is \(r_0 = 2\), but as distortion results and some other estimates improve when \(r_0\) is larger, we choose the optimal value of \(r_0\) later.
The set \({{\mathcal {S}}}_{-1}\) consists of multiple curves inside \({{\mathcal {M}}}_0\), one for each scatterer from which a particle can reach \(O_0\) in the next collision. In Fig. 2 we consider the corridor in the direction of \(\xi \in {{\mathbb {Z}}}^2\), and drew the parts of \({{\mathcal {S}}}_{-1}\) coming from scatterers \(O_\xi \), \(O_{-\xi }\) and \(O_{-\kappa _\rho }\) for some scatterer on the other side of this corridor.
Lemma 2.3
For the \(\xi \)-corridor, let \((\theta _{-\xi }, \frac{\pi }{2}) \in {{\mathcal {M}}}_0\) be the point of intersection of \({{\mathcal {S}}}_0\) and the part of \({{\mathcal {S}}}_{-1}\) associated to the scatterer \(O_{-\xi }\), and \((\theta _{\kappa _\rho }, \frac{\pi }{2}) \in {{\mathcal {M}}}_0\), \(\kappa _\rho = \xi '-M\xi \), be the point of intersection of \({{\mathcal {S}}}_0\) and the part of \({{\mathcal {S}}}_{-1}\) associated to the scatterer \(O_{\kappa _\rho } = O_{\xi '-M\xi }\) at the other side (i.e., the \(\xi '\)-boundary) of the \(\xi \)-corridor, see Fig. 3. Let \((\theta '_{\kappa _\rho }, \phi '_{\kappa _\rho })\) be the intersection of the parts of \({{\mathcal {S}}}_{-1}\) associated to the scatterers \(O_{-\xi }\) and the scatterer \(O_{\kappa _\rho }\), see Fig. 2. Then
and
Proof
The angle \(\theta _{-\xi }\) refers to the point where the common tangent line of \(O_0\) and \(O_{-\xi }\) touches \(O_0\). For the value \(\theta _{\kappa _\rho }\), \(\kappa _\rho = \xi '-M\xi \), we take the common tangent line to \(O_0\) and \(O_{\kappa _\rho }\) which has slope \(\frac{d_{\rho }(\xi )}{M |\xi |} \left( 1+{{\mathcal {O}}}(\frac{\rho }{|\xi | M} ) \right) \). This is then also \(|\theta _{-\xi }-\theta _{\kappa _\rho }|\).
Illustration of the proof of Lemma 2.3
Now for the other endpoint of this piece of \({{\mathcal {S}}}_{-1}\), consider the common tangent line to \(O_{-\xi }\) and \(O_{\kappa _\rho }\) which has slope \(\tan \alpha := \frac{d_{\rho }(\xi )}{(M-1) |\xi |} (1+{{\mathcal {O}}}(\frac{\rho }{|\xi |(M-1)}))\), hitting the scatterer \(O_0\) in point P and when extended inside \(O_0\) hits the vertical line through the center \(O_0\) in point Q. Let also R be the tangent point of \(O_0\) to the corridor, and \(Q'\) is the point on \(O_0R\) at the same horizontal height as P, see Fig. 4. Then \(|RQ| = |\xi |\sin \alpha \) whereas \(|O_0Q'| = \rho - (|\xi | - \rho \sin \theta '_{\kappa _\rho }) \sin \alpha = \rho \cos \theta '_{\kappa _\rho }\). The latter gives
The triangle \(\triangle PO_0Q\) has angles \(\phi '_{\kappa _\rho }\), \(\alpha +\frac{\pi }{2}\) and \(\theta '_{\kappa _\rho }\), which add up to \(\pi \). Hence
as claimed. \(\square \)
2.3 Hyperbolicity of the Lorentz gas with small scatterers
The derivative \(DT_{\rho }: {{\mathcal {T}}}{{\mathcal {M}}}\rightarrow {{\mathcal {T}}}{{\mathcal {M}}}\) preserves the unstable cone field
This is [10, p. 74] in the coordinates \(\theta = r/2\pi \rho \), and we can sharpen this cone by replacing \(\tau _{\min }\) by \(\tau (x)\), the flight time at x before the next collision. The derivative of the inverse of the billiard map preserves the stable cone field
Clearly, these cone-fields are transversal uniformly over \({{\mathcal {M}}}\), and \({{\mathcal {S}}}_n\) is a unstable (or stable) curve if \(n > 0\) (or \(n < 0\)).
In the billiard literature it is common to use a pseudo-norm, the p-norm for unstable vectors, defined as \(\Vert dx \Vert _p = \cos \phi \, dr\). When restricted to the unstable cone, the p-norm is non-degenerate. With the notation \({{\mathcal {R}}}(x) = \frac{2}{\rho \cos \phi }\), the expansion/contraction factor \(\Lambda \) on unstable vectors in the p-norm satisfies
This proves uniform hyperbolicity of the billiard map.
In our coordinates the p-norm can be also expressed as \(\Vert dx \Vert _p = 2\pi \, \rho \, \cos \phi \, d\theta \), and it is related to the standard Euclidean norm as
The expansion of \(DT_{\rho }\) of unstable vectors is uniform in the p-norm, see [10, Formula (3.40)]:
Expressed in Euclidean norm, this gives, for \(DT_{\rho }(dx) = (d\theta _1, d\phi _1)\),
For later use, if \(T_{\rho }(x)\) is in the homogeneity strip \({{\mathbb {H}}}_k\), then \(\cos \phi _1 \approx k^{-r_0}\).
3 Growth lemmas
As already mentioned in the introduction, the main line of our argument uses perturbed transfer operators acting on the Banach spaces constructed in [14] and [16]. These works, as essentially all other methods studying statistical properties of hyperbolic billiards, rely on appropriately formulated growth lemmas, which quantify the competition of the two main dynamical effects, singularities and expansion, in these systems. The constructions of [14] and [16] involve several exponents, which thus are present in our setting, too. Additionally, we have to introduce some further exponents as we study perturbed transfer operators. Before stating the growth lemmas, here we include a table summarizing the role and the interrelation of these exponents. Essentially, we use the same notation as in [14] except for some subscripts \({}_0\), and in fact some of the constants reduce to their value in [14] if \(r_0=2\).
We use a class \({{\mathcal {W}}}^s\) of admissible stable leaves defined as \(C^2\) leaves W in the phase space such that all its tangent lines are in the stable cone bundle, their second derivative is uniformly bounded, W is contained in a single homogeneity strip, \(\kappa _\rho (x)\) is constant on W and there is a \(\rho \)-dependent upper bound on |W|, namely
where the small \(c > 0\), to be fixed below, is independent of \(\rho \).
Let \(W \in {{\mathcal {W}}}^s\) be an admissible stable leaf. The preimage \(T_{\rho }^{-1}(W)\) is cut by the discontinuity lines \({{\mathcal {S}}}_1\) and boundaries of homogeneity strips into at most countably many pieces \(V_i\). Note that we may have to cut the pieces \(V_i\) further into curves \(W_i\) of length \(\le \delta _0\).
3.1 The growth lemma in terms of \(V_i\)
The particle can reach the scatterer \(O_0\) at the origin from corridors in all directions, indexed by \((\xi , \xi ') \in {\Psi }\), see Fig. 3. If the previous scatterer is \(\pm \xi \) itself, we call this a trajectory from the \(\xi \)-boundary; if the previous scatterer is at lattice point \(\xi '-M\xi \), the trajectory comes in from the \(\xi '\)-boundary, see Remark 2.2. To each such scatterer and homogeneity strip \({{\mathbb {H}}}_k\) belongs at most one \(V_i\), and the contraction \(|T_{\rho }V_i|/|V_i|\) is governed by (14), where the distortion \(T_{\rho }:V_i \rightarrow T_{\rho }V_i\) is uniformly bounded, see Appendix B.
Proposition 3.1
Assume \(0 \le \nu < \frac{1}{2}-\frac{1}{2r_0}\). Then there is a constant \(C > 0\), uniform in \(\rho ,\nu \) and \(r_0\) such that
for every stable leaf \(W \in {{\mathcal {W}}}^s\).
Remark 3.2
(i) Since \(|W| \le \delta _0 \le c \rho ^{\nu }\), there is \(\theta _* < 1\) such that
for \(\rho \) sufficiently small, and c chosen appropriately small. In addition, we assume that
for distortion constant \(C_d\) from Lemma B.2;
(ii) As later we will need \(\nu > \frac{1}{3}\), we can take \(r_0=5\) and \(\nu = \frac{3}{8}\).
Proof
The homogeneous admissible preimage curves \(T_{\rho }^{-1}W=\cup _i V_i\) are obtained by partitioning according to
-
incoming corridors \(\xi \);
-
for a fixed corridor \(\xi \), the scatterer on which \(V_i\) is located. Accordingly, \(\kappa _\rho (V_i)=M\xi -\xi '\) for some \(M \in {{\mathbb {N}}}\), and the summation is over M;
-
for a fixed scatterer, the homogeneity strip containing \(V_i\), that is, \(V_i\subset {{\mathbb {H}}}_k\) for some k.
If W is on the scatterer \(O_{0}\) and \(V_i\) is on the scatterer \(O_{\xi '-M\xi }\), then both of these scatterers are tangent to the same corridor. The trajectory makes and angle \(\sim \frac{d_{\rho }(\xi )}{M |\xi |}\) with the corridor and there is a lower bound on the collision angle given by (11). This puts restrictions on how M is related to k; as reflected by allowed intersections of homogeneity strips and M-cells on Fig. 2. In particular
which determines the range of k for M fixed.
We sum over the homogeneity strips for \(\xi \) and M fixed on the \(\xi '\) boundary.
where we used that the exponent \(\frac{1}{2}-\frac{1}{2r_0}\) of \(d_{\rho }(\xi )\) is non-negative. By our assumption that \(\nu <\frac{1}{2}-\frac{1}{2r_0}\), this expression is summable over M, and therefore the sum over the \(\xi '\)-boundary of the entire \(\xi \)-corridor is
The sum over homogeneity strips for \(\xi \) fixed on the \(\xi \)-boundary is no different:
Next we sum over all opened-up corridors, indexed by all the “visible” lattice points inside a sector of angle \(|W|/\sqrt{1+4\pi ^2}\), because only trajectories from scatterers within such a narrow sector can hit \(O_0\) at coordinates in W. The “visible” corridors will be denoted by \({\Psi }_W\). It can happen that a single corridor, or even a single scatterer in a corridor blocks the entire sector, and we reserve one term for \(|\xi | \ge 1\) (which is the worst case because the contraction of \(T_{\rho }\) is the weakest). Apart from this corridor, and since we need an upper bound, we can replace we replace |W| by a stable curve of length \(\delta _0\), and apply Lemma A.6 for \(a = 1-\nu \) and \(a = \frac{3}{2} - \nu - \frac{1}{2r_0}\). This gives
Since \(\delta _0 = c\rho ^{\nu }\) and \(\nu < \frac{1}{2}\), this completes the proof. \(\square \)
3.2 The growth lemma in terms of \(W_i\)
The pieces of preimage leaf \(V_i \subset T_{\rho }^{-1}(W)\) emerge by natural cutting at the discontinuity set \({{\mathcal {S}}}_1\) and the homogeneity strips, but even so, their lengths can be larger than \(\delta _0\), the bound of admissible stable leaves. We therefore need to cut them into shorter pieces, denoted as \(W_i\). In the worst case, each \(V_i\) needs to be cut into \(\delta _0^{-1}\) pieces, which gives the estimate
Although this estimate suffices for some purposes, it is not always good enough for larger iterates \(T_{\rho }^n\). The next lemma (which follows [14, Lemma 3.2] or [16, Lemma 3.3]) achieves an estimate, uniform in n, for \(\nu =0\).
For the next lemma we recall some notation used in [16]. For \(W \in {{\mathcal {W}}}^s\), we construct the components \({{\mathcal {G}}}_k(W)\) of \(T_{\rho }^{-k}W\) inductively on \(k=0,\dots ,n\). That is \({{\mathcal {G}}}_0(W) = \{ W \}\), and to obtain \({{\mathcal {G}}}_{k+1}(W)\) first we apply Proposition 3.1 to each curve in \({{\mathcal {G}}}_k(W)\), and then we partition curves that are longer then \(\delta _0\) into pieces of length between \(\delta _0\) and \(\delta _0/2\). We enumerate the leaves of the k-th generation \({{\mathcal {G}}}_k(W)\) as \(W_i^k\).
Lemma 3.3
There is a constant \(C_s>0\), independent of \(\rho \), such that
and
for all \(\varsigma \in [0,1)\).
Proof
Define \({{\mathcal {L}}}_k\) as the collection of indices such that \(W_i^k \in {{\mathcal {G}}}_k(W)\) is long, i.e., \(|W_i^k| \ge \delta _1\) for \(i \in {{\mathcal {L}}}_k\), and \({{\mathcal {I}}}_n(W_j^k)\) as the collection indices of \(W_i^n\) such that their most recent long ancestor is \(W_j^k \in {{\mathcal {G}}}_k(W)\). If for some \(W^n_{i_1}\) no such long ancestor exists, then set \(k(i_1)=0\) and \(W_{i_1}^n\) belongs to \({{\mathcal {I}}}_n(W)\); if \(W^n_{i_2}\) is itself long, then set \(k(i_2)=n\). Fix some \(j \in {{\mathcal {L}}}_k\). As for \(W^i_n\in {{\mathcal {I}}}_n(W_j^k)\) the preimages under \(T_{\rho }^{n-k}\) of \(T_{\rho }^{n-k}W^i_n\) need not be cut artificially (they are already short), and due to the distortion bound from Lemma B.2,
Recall that by our assumption \(\delta _1\) is so small that \(\theta _1 < 1\). In the estimate below, we group \(W_i^n\in {{\mathcal {G}}}_n(W)\) according to their most recent long ancestors.
where we have used that for fixed k and \({W_j^k \in {{\mathcal {L}}}_k(W)}\), (i) \(|W_j^k|\ge \delta _1\), (ii) the \(T_{\rho }^kW_j^k\) are pairwise disjoint subcurves of W, and (iii) \(|W|\le \delta _1\). By Jensen’s inequality and (23),
which proves the second statement. \(\square \)
It is worth including the following bound, which follows from (22) by Jensen inequality:
Remark 3.4
For further reference, we state a version of (21) for \(\nu >0\), \(n=1\). Let \(\varsigma _0=1-\frac{2r_0\nu }{r_0-1}\).
This follows by Jensen’s inequality from (19), applied with \(\frac{\nu }{1-\varsigma }\) in place of \(\nu \). The condition \(\varsigma <\varsigma _0\) ensures that \(\frac{\nu }{1-\varsigma }<\frac{1}{2}-\frac{1}{2r_0}\). For the choices \(r_0=5\), \(\nu =\frac{3}{8}\) we have \(\varsigma _0=\frac{1}{16}\).
4 Banach spaces and spectral gap
For the exponents \(p_0\) and \(q_0\) defined in (15) we define the Banach spaces (of distributions) \(C^{p_0}, {{\mathcal {B}}}, {{\mathcal {B}}}_w, (C^{q_0})'\) in analogy to [16],Footnote 3 We recall that \((C^{q_0})'\) is the topological dual of \(C^{q_0}\).
Given \(W\in {{\mathcal {W}}}^s\), let \(m_W\) be the Lebesgue measure on W, and define
for \(\alpha \ge 0\), \(\cos W = |W|^{-1} \int _W \cos \phi \, dm_W\) (note that \(\cos W \ll k^{-r_0}\) if \(W \subset {{\mathbb {H}}}_{\pm k}\)), and \(H^{p_0}_W(\psi )\) the Hölder constant of \(\psi \) along W. Also let \(d_W(W_1, W_2)\) stand for the distance between leaves as in [14, Sect. 3.1] or [16, Sect. 3.1]; in particular, if \(W_1\) and \(W_2\) belong to the same homogeneity strip, \(d_W(W_1, W_2)\) is the \(C^1\) distance of their graphs in the \((\theta ,\phi )\) coordinates, and otherwise infinite.
Given \(W\in {{\mathcal {W}}}^s\) and \(h\in C^1(W)\), define the weak norm.Footnote 4
With \(q_0 < p_0\) fixed we define the distance between functions \(d(\psi _1,\psi _2)\) in the same way as in [14, Sect. 3.1]. We define the strong stable norm by
Choosing \(\varepsilon _0 \in (0,\delta _0)\) and \(\beta _0 \in (0, \min \{ \alpha _0, p_0-q_0\})\), we define the strong unstable norm by
The strong norm is defined by \(\Vert h\Vert _{{{\mathcal {B}}}}=\Vert h\Vert _s+c_u\Vert h\Vert _u\), where we will fix \(c_u\ll 1\) (but independent of \(\rho \)) at the beginning of Sect. 5.2.
Since \(C^{p_0}\subset {{\mathcal {B}}}\subset {{\mathcal {B}}}_w\subset (C^{q_0})'\) (see Sect. 4.1), we have \(\Vert h\Vert _{{{\mathcal {B}}}_w}+\Vert h\Vert _{{{\mathcal {B}}}}\le C\Vert h\Vert _{C^1}\). As in [16], we define \({{\mathcal {B}}}\) to be the completion of \(C^1\) in the strong norm and \({{\mathcal {B}}}_w\) to be the completion in the weak norm.
4.1 Transfer operator on \({{\mathcal {B}}}\)
Throughout we let \(R_\rho : L^1(m)\rightarrow L^1(m)\) be the transfer operator of the billiard map \(T_\rho \). We recall that [14, Lemmas 3.7\(-\)3.10] ensure that: i) \(R_\rho (C^1)\subset {{\mathcal {B}}}\) and as a consequence R is well defined on \({{\mathcal {B}}}\); \({{\mathcal {B}}}_w\); ii) the unit ball of \({{\mathcal {B}}}\) is compactly embedded in \({{\mathcal {B}}}_w\), and iii) \(C^{p_0}\subset {{\mathcal {B}}}\subset {{\mathcal {B}}}_w\subset (C^{q_0})'\).
It follows that \(R_\rho \) is well defined on \({{\mathcal {B}}}\) and \({{\mathcal {B}}}_w\), and we also let \(R_\rho \) denote the extension of this transfer operator to \({{\mathcal {B}}}_w\).
4.2 Lasota–Yorke inequalities
Using Proposition 3.1 with \(\nu =0\) and Lemma 3.3 we obtain the analogue of the Lasota–Yorke inequality [16, Proposition 2.3]. As our set-up fits [16], our only concern is the dependence on \(\rho \). It is important to point out that our all estimates in Sect. 3 and Appendix B are independent of \(\rho \), except that \(\delta _1<\delta _0\ll \rho ^{\nu }\).
Lemma 4.1
(Weak norm) There exists a uniform constant \(C>0\) so that for all \(h\in {{\mathcal {B}}}\) and for all \(n\ge 0\),
where \(C_s\) is given by (20).
Proof
For \(W\in {{\mathcal {W}}}^s\), \(h\in C^1({{\mathcal {M}}}_0)\), \(\psi \in C^{p_0}(W)\) with \(|\psi |_{W,\alpha _0,p_0}\le 1\),
Using the present definition of the weak norm,
From here on the argument goes almost word for word as the argument in [16, Sect. 4.1], except for the use of equation (20) (the analogue of [16, Lemma 3.3(a)] with \(\varsigma =0\)). \(\square \)
Lemma 4.2
(Strong stable norm) Take \(\delta _1\) as in (17) and \(\theta _1\) as in (22). There exists a uniform constant \(C>0\) so that for all \(h\in {{\mathcal {B}}}\) and all \(n\ge 0\),
Remark 4.3
The compact term \(C\delta _1^{-\alpha _0}\Vert h\Vert _{{{\mathcal {B}}}_w}\) in Lemma 4.2 is the only point in the Lasota–Yorke inequalities where a \(\rho \)-dependence arises, via \(\delta _1 = c \rho ^{\nu }\).
Proof
The argument goes almost word for word as the [16, Argument in Sect. 4.2], except for the differences:
i) We use of equation (21) with \(\varsigma =\alpha _0\) instead of [16, Lemma 3.3 (b)] (also with \(\varsigma =\alpha _0\)) in [16, Equation (4.5)]. In particular, using the present definition of the stable norm, with the same notation as in [16, Sect. 4.2], we have the following analogue of [16, Equation (4.5)]:
where we have used the distortion bounds of Appendix B and Formula (21) (with \(\varsigma =\alpha _0\)).
ii) To obtain the analogue of [16, Equation (4.6)], as in [16, Sect. 4.2], we split the sum
into a term for \(k=0\) and further terms for \(k=1,\dots , n\). For \(k=0\), we use the strong stable norm and (24) (the analogue of [16, Lemma 3.3(a)]) with \(\varsigma =\alpha _0\), giving a contribution \(\ll \Vert h\Vert _{s} \theta _1^{n(1-\alpha _0)}\). For the terms \(k=1,\dots n\), we use the weak norm, (21) (the analogue of [16, Lemma 3.3(b)]) with \(\varsigma =\alpha _0\), and the fact that \(|W^k_j|\ge \delta _1\) for \(j\in {{\mathcal {L}}}_k(W)\), resulting in a contribution of \(O(\Vert h\Vert _{{{\mathcal {B}}}_w} \delta _1^{-\alpha _0})\). \(\square \)
As in [16], dealing with the strong unstable norm is the most delicate part of the Lasota–Yorke inequality. The only difference from [16, Argument in Sect. 4.3] is that we apply (20) (instead of [16, Lemma 3.3 (b)]) multiple times. Note that our bound in (20) is independent of \(\rho \), so no \(\rho \)-dependence arises here.
Lemma 4.4
(Strong unstable norm) There exists a uniform constant \(C>0\) so that for all \(h\in {{\mathcal {B}}}\) and for all \(n\ge 0\),
Proof
Given \(W_1,W_2 \in {{\mathcal {W}}}^s\) with \(d(W_1,W_2)\le \varepsilon \), we may identify matched and unmatched pieces in \(T_\rho ^{-n}W_{\ell }\), \(\ell =1,2\). The estimates of [16] on the length of the unmatched pieces apply, thus we may estimate their contribution by the strong stable norm using (20) (instead of [16, Lemma 3.3 (b)]). As the length estimates give \(\varepsilon ^{\alpha _0/2}\), \(\beta _0<\alpha _0/2\) is essential here (cf. [16, Formulas (4.10) and (4.11)], noting that \(\gamma =0\) in our case).
To bound the contribution of the matched pieces we use, on the one hand, the strong unstable norm (as in [16, Formula (4.14)]) and, on the other hand, the strong stable norm (as in [16, Formula (4.17)]). Here again we rely on equation (20) which plays the role of [16, Lemma 3.3 (b)]. \(\beta _0<p_0-q_0\) ensures that after division by \(\varepsilon ^{\beta _0}\) the proof of Lemma 4.4 can be completed. \(\square \)
5 Perturbed transfer operators
A standard way of obtaining limit theorems for dynamical systems is via the perturbed transfer operator method. In Sect. 7 we will use the spectral properties of the family of perturbed transfer operators \({\hat{R}}_\rho (t), t\in {{\mathbb {R}}}\) with \({\hat{R}}_\rho (t) h =R(e^{it\kappa _\rho } h)\), \(h\in L^1(m)\).
5.1 Continuity properties
By definition, \({\hat{R}}_\rho (0)=R_\rho \). Take \(0 \le \nu < \frac{1}{2} - \frac{1}{2r_0}\) as in Proposition 3.1. In this subsection we show the following continuity estimate:
for some uniform constant C.
The argument goes parallel to Sect. 4.2, except that this time we need the estimates (i) for \(\nu >0\) and (ii) only for \(n=1\), we rely on (19) and (25) instead of Lemma 3.3.
Lemma 5.1
Assume (16). Then there exists a uniform constant \(C>0\) so that for all \(h\in {{\mathcal {B}}}\),
Proof
The argument goes similarly to the argument in [16, Sect. 4.1] restricted to the case \(n=1\). More precisely, for \(W\in {{\mathcal {W}}}^s\), \(h\in C^1({{\mathcal {M}}}_0)\), \(\psi \in C^{p_0}(W)\) with \(|\psi |_{W,\alpha _0,p_0}\le 1\),
Using the definition of the weak norm and the inequality \(|e^{ix}-1|\le x^\nu \),
From here on the proof goes the same as the argument in [16, Sect. 4.1] except for the use of equation (19) instead of [16, Lemma 3.3 (b)]. \(\square \)
Lemma 5.2
There exists a uniform constant \(C>0\) so that for all \(h\in {{\mathcal {B}}}\) and for all \(n\ge 0\),
Proof
This time we are only concerned with \(n=1\), and do not need a contraction of the strong stable norm. Hence, an argument analogous to the proof of Lemma 5.1 suffices, with the weak norm replaced by the strong stable norm. Accordingly, we use (25) with \(\varsigma =\alpha _0\) instead of [16, Lemma 3.3 (b)]. \(\square \)
Lemma 5.3
There exists a uniform constant \(C>0\) so that for all \(h\in {{\mathcal {B}}}\),
Proof
As with the proof of Lemma 4.4, the argument goes similar to [16, Argument in Sect. 4.3], restricted to the case \(n=1\). The matched and unmatched pieces can be again identified, this time for \(T_{\rho }^{-1}W_{\ell }\), \(\ell =1,2\). Then, as in the proof of Lemma 5.1, the factors \(|t|^{\nu }\) and \(|\kappa _\rho |^{\nu }\) arise. Clearly \(\kappa _\rho \) is constant on each of the (matched or unmatched) pieces, and takes the same value on any two pieces that are matched. Accordingly, the various contributions can be estimated in the same way as in proof of Lemma 4.4, with the only difference that, by the presence of the factor \(|\kappa _\rho |^{\nu }\), throughout the argument (19) is used instead of (20). \(\square \)
Equation (29) follows from the definition of the norm in \({{\mathcal {B}}}\) together with Lemmas 5.1, 5.2 and 5.3.
5.2 Peripheral spectrum and spectral gap
Choose \(1>\sigma >\max \{\Lambda ^{-\beta _0}, \theta _1^{(1-\alpha _0)}, \Lambda ^{-q_0}\}\). By Lemmas 4.1, 4.2 and 4.4 and arguing as in [16, Eq. (2.14)], we obtain the traditional Lasota–Yorke inequality for some \(N\ge 1\), provided \(c_u\) in the definition of \(\Vert \ \Vert _{{{\mathcal {B}}}}\) (below (28)) is chosen small enough in terms of N. That is,
Combined with the properties collected in Sect. 4.1 (that is, the relative compactness of the unit ball of \({{\mathcal {B}}}\) in \({{\mathcal {B}}}_w\)), equation (30) shows that the essential spectral radius of \(R_\rho \) is bounded by \(\sigma \) and that the spectral radius is 1.
Let \(\Pi _\rho \) be the eigenprojection (that is, the projection on the eigenspace of \(R_\rho \)) corresponding to the eigenvalue 1. In particular, \(\Pi _{\rho } \mu =\mu \) is the invariant measure for \(T_{\rho }\). Since for every \(\rho \), \(T_\rho \) is mixing, the peripheral spectrum of \(R_\rho \) consists of just the simple eigenvalue at 1. Thus, for every \(\rho >0\), the eigenprojection \(\Pi _{\rho }\) corresponding to the eigenvalue 1 of \(R_{\rho }\) can be also characterized by
for all \(h\in {{\mathcal {B}}}\).
Let \(Q_{\rho }\) be complementary spectral projection. From here onwards, we exploit that for every \(\rho >0\), there exist \(\gamma _{\rho }\in (0,1)\) and \(C_\rho >0\) so that
for every \(m\ge 1\). Altogether, \(R_{\rho }^m=\Pi _{\rho } +Q_{\rho }^m\), where \(Q_{\rho }\) satisfies (32).
6 Asymptotics of the dominant eigenvalue
To establish limit theorems (such as Theorem 7.1 below) we study the asymptotics of \( {{\mathbb {E}}}_\mu (e^{it\kappa _{m,\rho } } 1)={{\mathbb {E}}}_\mu ({\hat{R}}_{\rho }(t)^m 1), \) as \(t\rightarrow 0\) and \(m\rightarrow \infty \) via the properties of \({\hat{R}}_{\rho }(t) h= R_{\rho }(e^{it\kappa _\rho } h)\), \(h\in {{\mathcal {B}}}\).
We already know that for every \(\rho \in (\frac{1}{3},\frac{1}{2})\), 1 is a simple eigenvalue of \({\hat{R}}_{\rho }(0)=R_{\rho }\) when viewed as an operator from \({{\mathcal {B}}}\) to \({{\mathcal {B}}}\). Due to (29), \({\hat{R}}_{\rho }(t)\) is \(C^\nu \) (in t) from \({{\mathcal {B}}}\) to \({{\mathcal {B}}}\). It follows that for t in a neighbourhood of 0, \({\hat{R}}_{\rho }(t)\) has a dominant eigenvalue \(\lambda _{\rho }(t)\) (with \(\lambda _{\rho }(0)=1\)).
Let \(\gamma _{\rho }\) be as in equation (32). The continuity properties together with (32) ensure that for any \(\delta \in (0,\gamma _{\rho })\) and \(t\in B_\delta (0)\),
for some \(C_\rho >0\) and \(\Pi _{\rho }(t)^2=\Pi _{\rho }(t)\), \(\Pi _{\rho }(t)Q_{\rho }(t)=Q_{\rho }(t)\Pi _{\rho }(t)=0\). Further, for all \(t\in B_\delta (0)\),
for all t small enough. A standard consequence of (29) and (32) is that for every \(\delta \in (0,\gamma _{\rho })\) and for all u so that \(|u-1|=\delta \),
Hence, \(\Vert \Pi _{\rho }(t)-\Pi _{\rho }(0)\Vert _{{{\mathcal {B}}}}\le C\rho ^{-\nu } |t|^{\nu }\rho ^{-\nu }\gamma _{\rho }^{-2}|t|^{\nu }\).
The rest of this section is allocated to the study the asymptotics of \(\lambda _{\rho }(t)\) as \(t\rightarrow 0\).
The following property was used in [6, 7, 21] (see [7, assumption (H2)]) for the study of eigenvalues of perturbed transfer operators in the Banach spaces introduced in [12]. Here we use it to obtain an adequate analogue for the present set-up.
Lemma 6.1
Take \(s_0 = \frac{1-\alpha _0(r_0+1)}{2r_0}\) as in (15). Let \(h\in {{\mathcal {B}}}\) and \(v\in C^{p_0}\). For every corridor with boundaries determined by \(O_\xi \) and \(O_{\xi '}\), there exists a constant \(C > 0\) independent of \(\rho \) and \(\xi \) so that
Proof
Let \(\{ W_\ell \}_{\ell \in L}\) be the foliation of the set \(\{\kappa _\rho = \xi '+\xi N\}\) into stable leaves. We can parametrise these leaves by their endpoints \((\ell , \frac{\pi }{2})\) in \({{\mathcal {S}}}_0\), then L is an interval of length \(c \ll \frac{d_{\rho }(\xi )}{N^2 |\xi |}\) according to Lemma 2.3. The lengths of these stable leaves \(|W_\ell | \le c'\) for another constant \(c' \ll \sqrt{\frac{2 d_{\rho }(\xi )}{\rho N}}\), again by Lemma 2.3. The measure \(dm_{W_\ell }\) is Lebesgue on the \(C^1\) stable leaf \(W_\ell \), and it can be parametrised as \((w_\ell (\phi ), \phi )\) where w is \(C^1\) with \(-\frac{1}{2\pi } \frac{\rho + \tau _{\min }}{\tau _{\min }}< w'(\phi ) <- \frac{1}{2\pi }\) because of the direction of the stable cones, see (13).
Let \(\nu \) be a measure on L that produces the decomposition of Lebesgue measure m on \(\{ \kappa _\rho = \xi '+\xi N\}\) along stable leaves. We have \(\nu \ll m_L\) (and \(d\nu /dm_L\) is bounded above). Since we need to partition stable leaves \(W_\ell \) by the homogeneity strips \({{\mathbb {H}}}_k\) near \({{\mathcal {S}}}_0\) into pieces \(W_{\ell ,k}:= W_\ell \cap {{\mathbb {H}}}_k\), we get an extra sum over \(k \ge k(c'):= \lfloor (c')^{-1/r_0} \rfloor \). Then
for \(s_0 = \frac{1-\alpha _0(r_0+1)}{2r_0}\), as claimed. \(\square \)
Using (35), Lemmas A.2 and 6.1 we obtain the asymptotics of the eigenvalue in Proposition 6.3 below.
Lemma 6.2
For \(t\in {{\mathbb {R}}}^2\), let \({\bar{A}}(t,\rho )=\sum _{|\xi | \le 1/(2\rho )} \frac{d_{\rho }(\xi )^2 \langle t, \xi \rangle ^2}{|\xi |}\). Then
Proof
The coordinate axes \(p=0\) and \(q=0\), and the two diagonals \(p=q\) and \(p=-q\) divide the plane into eight sectors. Here we count counter-clockwise with the first sector \({\Psi }_1\) directly above the positive p-axis. Let \(\gamma = \gamma (t,\xi )\) be the angle between the vectors t and \(\xi \). Let \(\alpha = \arctan q/p\) and \(\theta \) be the polar angles of \(\xi \) and \(t \in {{\mathbb {R}}}^2\) respectively, so \(\gamma = \theta -\alpha \). For the first sector \({\Psi }_1\), taking into account that for every \(\xi \) there are two \(\xi '\), we have
The eighth sector \({\Psi }_8\) directly below the positive p-axis gives the same result with \(-q\) instead of q, and sectors \({\Psi }_4\) and \({\Psi }_5\) above and below the negative p-axis give the same results as sectors \({\Psi }_8\) and \({\Psi }_1\). Therefore
The same result holds the remaining sectors with \(\cos \theta \) replaced by \(\sin \theta \) and vice versa. Putting the results on all eight sectors together, we get by Lemma A.4
Hence \(\langle \Sigma t, t\rangle = \lim _{\rho \rightarrow 0} \frac{\rho }{2} {\bar{A}}(t,\rho ) = \frac{|t|^2}{\pi }\), as required. \(\square \)
For the result on the asymptotics of the eigenvalue in Proposition 6.3, we will also assume some correlation decay type results. Namely, we assume that there exist \(\rho \)-dependent constants \({\hat{\gamma }}_{\rho }\in (0,1)\) and \({\hat{C}}_\rho > 0\) so that for every \(j\ge 1\),
More generally, we assume that that there exist \(\rho \)-dependent constants \(\bar{\gamma }_{\rho }\in (0,1)\) and \(\bar{C}_{\rho }> 0\) so that for every \(j\ge 1\) and every \(m\ge 0\)
where \(C=0\) if \(m=0\) and \(C=1\) if \(m\ge 1\). As justified in Proposition C.1 in Appendix C via the argument used in [11, Proof of Proposition 9.1], assumptions (36) and (37) are natural.
Proposition 6.3
Assume (32), (36) and (37), and let \({\bar{A}}(t,\rho ) \) be as defined in Lemma 6.2. Let \(\nu \in (\frac{1}{3},\frac{1}{2})\) and \(\delta \in (0,\frac{1}{2}\min \{\gamma _{\rho },\hat{\gamma }_{\rho }\})\), ensuring that (33) holds. Then for any \(\delta _0\le \delta ^{4/(3\nu -1)}\) and \(t\in B_{\delta _0}(0)\),
where \(|E(t,\rho )|\le \bar{C}_{\rho }\,\bar{\gamma }_{\rho }^{-2}\,|t|^2 + C|t|^2 \rho ^{-2}\) for \(\bar{C}_{\rho }\) and \(\bar{\gamma }_{\rho }\) as in (37) and some uniform constant C.
Remark 6.4
It is possible to shrink \(\delta _0\) further to \(\delta _0<e^{-\max \{\bar{C}_{\rho }\bar{\gamma }_{\rho }^{-2},\rho ^{-2}\}}\) leading to \(E(t,\rho )=o(|t|^2\log |1/t|)\). This would mean that in the proof of main results in Sect. 7 we would work on this very small neighborhood and obtain the same range of n and \(\rho \) in the final statements. We find it more convenient to work on the neighborhood \(B_{\delta _0}(0)\) as in the statement above.
Remark 6.5
Let \(q_{\rho }\) be the flight function taking values in \({{\mathbb {R}}}^2\) as opposed to the displacement function \(\kappa _\rho \) taking values in \({{\mathbb {Z}}}^2\). A similar statement holds for the dominant eigenvalue of the perturbed operator \(R_\rho (e^{it q_{\rho }})\). The proof is similar to the one below using that \(|q_{\rho }-\kappa _\rho | \le 1\).
Proposition 6.3
In the notation of Banach spaces of distributions (see, for instance, [21]) for \(h\in C^{q_0}\) we write \(\langle h, \textbf{1}\rangle =\langle \textbf{1}, h\rangle =\int h \textbf{1}\, d m\) and \(\langle m, h\rangle =\int h\, d m\), where \(\textbf{1}\) is both an element of \({{\mathcal {B}}}\) and of \((C^{q_0})'\). Let \(v_{\rho }(t)=\frac{ \Pi _{\rho }(t) 1}{\langle \Pi _{\rho }(t) 1, 1\rangle }\) and recall that \(v_{\rho }(0)=\textbf{1}\). Recall also that for every \(\rho \), \(\lambda _{\rho }(t) v_\rho (t)={\hat{R}}_{\rho }(t) v_{\rho }(t)\) for t small enough, and that \(\lambda _{\rho }(0)=1\). Since \(\langle v_{\rho }(t),\textbf{1}\rangle =1\),
With the meaning of inner product clarified, for ease of notation from here on we will write \(V(t,\rho )=\int _{{{\mathcal {M}}}}(e^{it\kappa _\rho }-1)(v_{\rho }(t)-\textbf{1}) \, d m\). We recall the terminology in Remark 2.2. For \(\xi = (p,q)\) with \(\gcd (p,q) = 1\), we let \(\xi ' = (p',q')\) be the point uniquely determined by \(\xi \) in the sense that \(p'/q'\) is convergent preceding p/q in the continued fraction expansion of p/q; in particular \(|\xi '| \le |\xi |\). Recall that \({\Psi }\) is the set of all such pairs \((\xi ,\xi ')\) with \(|\xi | \le 1/(2\rho )\). With this specified, we write
Using the fact that \(\int \kappa _\rho \, d\mu =0\), we compute that
where the involved constants in the last big O are independent of \(\rho \). Further, using Lemma A.4,
Hence, with \({\bar{A}}(t,\rho )\) as in Lemma 6.2,
Thus, \(1-\lambda _{\rho }(t)={\bar{A}}(t,\rho ) \frac{\log (1/|t|)}{4\pi \rho }+E(t,\rho )\), where \(E(t,\rho )=O\left( |t|^2\rho ^{-1}\log (1/\rho )\right) + V(t,\rho )\). It remains to estimate \(V(t,\rho )\). Note that
Hence,
Estimating \( I_1(t,\rho )\). Since \(\int _{{{\mathcal {M}}}_0} \kappa _\rho \, d\mu =0\), we have
for some uniform C. Using also that \(|e^{ix}-1-ix|\le x^y\), for any \(y\in (0,2]\),
where in the last inequality we have used Lemma A.5. Note that for \(|t|\in B_{\delta _0}(0)\) with \(\delta _0\le \gamma _{\rho }^{4/(3\nu -1)}\), as in the statement, \(|t|^{\nu /2}<\gamma _{\rho }^{2\nu /(3\nu -1)}< \gamma _{\rho }^{2}\) for all \(\nu \in (\frac{1}{3},\frac{1}{2})\). Thus, \(|I_1(t,\rho )|\le C \rho ^{-\nu -1} |t|^{2}\).
Estimating \( I_2(t,\rho )\) Recall that (32) holds and that \(\delta \) is chosen so that (34) holds. Using the definition of \(\Pi _{\rho }(t)\) and noting that for every \(\rho \), \((u-{\hat{R}}_{\rho }(0))^{-1}\textbf{1}=(1-u)^{-1}\),
Thus,
for
and
where
and
We first treat \(K_2(t,\rho )\). Note that for u in the chosen contour, \(\Vert (u-R_\rho (t))^{-1}\Vert _{{{\mathcal {B}}}}\le \gamma _{\rho }^{-1}\). Using (35), for all such u,
This together with Lemma 6.1 gives that
Hence, \(|K_2(t,\rho )| \le C \rho ^{-1}\gamma _{\rho }^{-3}\,|t|^{2}\, t^{3\nu -1}=C \rho ^{-1}\gamma _{\rho }^{-3}\,|t|^{2}\, \gamma _{\rho }^4\) for all \(|t|\in B_{\delta _0}\) with \(\delta _0<\gamma _{\rho }^{4/(3\nu -1)}\). It follows that \(|K_2(t,\rho )| \le C \rho ^{-1}|t|^{2}\).
Estimating \(J_1(t,\rho )\) in (39) and \(K_1(t,\rho )\) in (40) These terms are in, some sense, independent of the Banach space \({{\mathcal {B}}}\) (see the explanation below) and can be analysed either via the correlation function (36) or its generalization (37). The rest of the proof is allocated to this type of analysis.
We start with \(J_1(t,\rho )\) defined in (39), which is easier using (36). Recall that \({\hat{R}}_{\rho }(0)=R_{\rho }\) and \(\int _{|u-1|=\delta } (1-u)^{-2} \, du=0\) due to Cauchy’s theorem. This gives
Swapping the order of the integrals is allowed due to (36). The quantity
decays exponentially fast. Hence, we can write
Using Lemma A.5 to control the dependence on \(\rho \), \(\left( \int _{{{\mathcal {M}}}_0}(e^{it\kappa _\rho }-1)\, d \mu \right) ^2\le C |t|^2\rho ^{-2}\). Next, recall that (32) holds and that \(\delta < \frac{1}{2} \min \{ \gamma _{\rho }, {\hat{\gamma }}_{\rho } \}\). Note that for \(|u-1|=\delta \), we have \(|u|^{-(j+1)}\ll (1-{\hat{\gamma }}_{\rho }/2)^{-(j+1)}\). This together with (36) gives
An argument similar to the one above used in estimating \( J_1(t,\rho )\) with (37) instead of (36) allows us to deal with \(K_1(t,\rho )\) defined in (40). Compute that
Let
Using (37), we obtain
where in the last inequality we proceeded as in estimating \(J_1\) above.
Finally, we need to argue that E is bounded by \(|t|^2\). First,
Using (36), we have that \(|E_1^1(t,\rho )|\le 2{\hat{C}}_\rho \,|t|^2\, {\hat{\gamma }}_{\rho }^{-1}\).
Also, \( E_2(t,\rho )=\int _{|u-1|=\delta }(1-u)^{-2}\sum _{j\ge 1} u^{-j}\int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1)\, (e^{it\kappa _\rho }-1)\circ T_{\rho }^{j}\, d\mu \) and again by (36) and Cauchy’s theorem, \(| E_2(t,\rho )|\le 4{\hat{C}}_\rho \,|t|^2\, {\hat{\gamma }}_{\rho }^{-2}\). Finally, \(E_3(t,\rho )=0\). Altogether, \(| K_1(t,\rho )|\le 8\bar{C}_{\rho }\,|t|^2\, \bar{\gamma }_{\rho }^{-2}\). \(\square \)
7 Limit theorems and mixing as \(\rho \rightarrow 0\)
The first result below is the non-standard Gaussian limit law, known to hold when the horizon is infinite. It is a precise version of Theorem A stated in Sect. 1.3.
Our main contribution lies in characterizing the limit paths allowed as \(\rho \rightarrow 0\); this is done up to the unknown \(\gamma _{\rho }\), \(C_\rho \) in (32) and \(\bar{C}_{\rho }\), \(\bar{\gamma }_{\rho }\) as in (37).
Throughout this section, the notation is the same in Sect. 1.1. In particular, \(b_{n,\rho }=\frac{\sqrt{n\log (n/\rho ^2)}}{\sqrt{4\pi }\ \rho }\), and the variance matrix \(\Sigma \) are defined as in (1), in agreement with Lemma 6.2. We recall that \(\implies \) stands for convergence in distribution with respect to the invariant measure \(\mu \).
Theorem 7.1
Let \(\gamma _{\rho }\), \(C_\rho \) be as in (32), let \(\bar{\gamma }_{\rho }\), \(\bar{C}_{\rho }\) be as in (37) and let C be as in Proposition 6.3.
Set \(M(\rho )=\max \{ C_\rho \gamma _{\rho }^{-2},\rho ^2 \bar{C}_{\rho }\bar{\gamma }_{\rho }^{-2}\} + C\). Let \(\rho \rightarrow 0\) and simultaneously \(n\rightarrow \infty \) in such a way that \(M(\rho )= o(\log n)\). Then
Remark 7.2
A similar statement holds for the flight function \(q_{\rho }\). The only change in the proof is the use of Remark 6.5 instead of Proposition 6.3.
Proof
Throughout we let \(\delta <\frac{1}{2} \min \{\gamma _{\rho },{\hat{\gamma }}_{\rho }\}\), so that we can use Proposition 6.3 with \(\delta _0=\delta ^{4/(3\nu -1)}\). By (33), for \(t\in B_{\delta _0}(0)\),
Note that the assumption \(M(\rho )=o(\log n)\) ensures that, for \(\rho \) small enough, \(\frac{t}{b_{n,\rho }}\in B_{\delta _0}(0)\) for all \(t\in {{\mathbb {R}}}^2\). Hence, as \(n\rightarrow \infty \) and given the range of n, equivalently as \(\rho \rightarrow 0\),
for all \(t\in {{\mathbb {R}}}^2\).
Also, it follows from (35) that \(\Vert \Pi _{\rho }\left( \frac{t}{b_{n,\rho }}\right) -\Pi _{\rho }(0) \Vert _{{{\mathcal {B}}}}\rightarrow 0\), as \(n\rightarrow \infty \) and given the range of n, equivalently as \(\rho \rightarrow 0\). Thus, a standard argument based on the dominated convergence theorem shows that as \(n\rightarrow \infty \), equivalently as \(\rho \rightarrow 0\),
It remains to understand \(\lambda _{\rho } \left( \frac{t}{b_{n,\rho }}\right) ^n\) as \(\rho \rightarrow 0\). Since \(\delta _0=\delta ^{4/(3\nu -1)}\), we can apply Proposition 6.3 to obtain
By assumption, \(M(\rho )=o(\log n)\). Hence, as \(\rho \rightarrow 0\),
Now, given that \({\bar{A}}\) is as in Lemma 6.2,
Also, using Lemma 6.2 and recalling the range of n,
where in the last equality we have used Lemma 6.2 and the uniform convergence theorem for slowly varying functions. Putting the above together,
for any \(t\in {{\mathbb {R}}}^2\). This completes the proof of Theorem 7.1 by Levy’s continuity theorem. \(\square \)
The next result gives a local limit theorem as \(\rho \rightarrow 0\), again up to the unknown \(\gamma _{\rho }\), \(C_\rho \) and \(\bar{\gamma }_{\rho }\), \(\bar{C}_{\rho }\). This is possible due to the present proof based on spectral methods which produces the fine control of the eigenvalue in Proposition 6.3. The present proof of local limit theorem for the infinite horizon is new even for \(\rho \) fixed. We recall that the only proof of such a local limit is given in [31] via the abstract results in [4] for Young towers. Our proof relies on Proposition 6.3, which is new in the set-up of the Banach spaces considered here and it relies heavily on Appendix C and on Proposition 3.1 (which provides useful continuity estimates).
In the notation of Theorem 7.1 we let \(\Phi _\Sigma \) be the density of a Gaussian random variable distributed according to \({{\mathcal {N}}}(0,\Sigma )\) and recall from Sect. 4.1 that \(C^{p_0}\subset {{\mathcal {B}}}\).
Theorem 7.3
Assume the assumptions and notation of Theorem 7.1; in particular \(M(\rho )\) is defined in the same way. Let \(v\in C^{p_0}({{\mathcal {M}}})\) and \(w\in L^a({{\mathcal {M}}})\), for \(a>1\).
Let \(\rho \rightarrow 0\) and simultaneously \(n\rightarrow \infty \) in such a way that \(M(\rho )=o(\log n)\). Then
uniformly in \(N\in {{\mathbb {Z}}}^2\).
Remark 7.4
A similar statement holds for the flight function \(q_{\rho }\). By a similar argument, using Remark 6.5 instead of Proposition 6.3, we obtain \((b_{n,\rho })^2\mu (\{q_{n,\rho }\in V\})\rightarrow \Phi _\Sigma (0)Leb_{{{\mathbb {R}}}^2}(V)\), for any compact neighborhood \(V\in {{\mathbb {R}}}^2\) with \(Leb_{{{\mathbb {R}}}^2}(\partial V)=0\). A uniform LLT for \(q_{\rho }\) can be obtained by, for instance, a straightforward adaptation of the argument used in [27, Proof of Theorem 2.7].
It is known that for every \(\rho >0\), \(\kappa _\rho \) is aperiodic, i.e., there exists no non-trivial solution to the equation \(e^{it\kappa _\rho }g\circ T_\rho =g\). The aperiodicity of \(\kappa _\rho \) has been used in [31] (in fact, in [30]) to provide LLT for fixed \(\rho \). Given Proposition 6.3 and the aperiodicity of \(\kappa _\rho \), the proof of Theorem 7.3 is classic, see [1] and for a variation of it that provides the uniformity in N, see, for instance, [28, First part of Proof of Theorem 2]. The proof below recalls the main elements needed to obtain the range of n in the statement.
Theorem 7.3
Let \(\delta _0=\delta ^{4/(3\nu -1)}\) be so that (34), (32) and Proposition 6.3 hold for all \(|t|\in B_{\delta _0}(0)\). Since \(\kappa _\rho \) is aperiodic, a known argument (see [Lemma 4.3 and Theorem 4.1] [1]) shows that \(\Vert {\hat{R}}_{\rho }(t)^n\Vert _{{{\mathcal {B}}}}\le C_\rho (1-\gamma _{\rho })^n\), for all \(|t|\ge \delta _0\). It follows that \(|{{\mathbb {E}}}_\mu ({\hat{R}}_{\rho }(t)^n 1)|\le \Vert {\hat{R}}_{\rho }(t)^n \Vert _{{{\mathcal {B}}}}\le C_\rho \,(1-\gamma _{\rho })^n\) for every \(|t| \in (\delta _0,\pi )\). Thus, using that \(v\in C^{p_0}\subset {{\mathcal {B}}}\),
Recall that \(w\in L^a\), \(a>1\) and set \(b=a/(a-1)\). Using the Hölder inequality,
Recall \(v\in {{\mathcal {B}}}\). Using (35), (29) and Lemma 6.1 and proceeding as in equation (38),
for some uniform C and some \(\varepsilon >0\). In the last inequality we have used that \(|t|<\delta _0\). Thus,
With a change of variables,
Given the range of n in the statement, we use (41) to obtain
To deal with the big O term in (43), we use that by (41) there exists a uniform constant C so that
Since \(M(\rho )=o(\log n)\), we have \(n\gg \exp \left( C\rho ^{-2}\right) \). Thus, \(\frac{\rho ^{-2}}{(b_{n,\rho })^{2+\varepsilon }}\ll \frac{\log n}{(b_{n,\rho })^{2+\varepsilon }}=o\left( \frac{1}{(b_{n,\rho })^2}\right) \) as \(\rho \rightarrow 0\). Putting these together and using (43),
This together with (42) gives that as \(\rho \rightarrow 0\),
where in the last equation we used that \(M(\rho )=o(\log n)\). This concludes the proof. \(\square \)
It is known that the local limit theorem for \(\kappa _\rho \) and the billiard map \(T_{\rho }\) (with \(\rho \) fixed) implies mixing for the planar Lorentz map \({\hat{T}}_{\rho }\) (again \(\rho \) fixed), see [28]. In fact, sharp error rates in local limit theorems and mixing are also known, see [28] for the finite horizon case and [29] for the infinite horizon case.
We recall from Sect. 1 that the Lorentz map \({\hat{T}}_{\rho }\) defined on \({\widehat{{{\mathcal {M}}}}}={{\mathcal {M}}}\times {{\mathbb {Z}}}^2\) is given by \({\hat{T}}_{\rho }(\theta ,\phi ,\ell ) = ( T_{\rho }(\theta ,\phi ), \ell +\kappa _\rho (\theta ,\phi ))\) for \( (\theta ,\phi ) \in {{\mathcal {M}}},\ \ell \in {{\mathbb {Z}}}^2\). Let \({\hat{\mu }}=\mu \times \text{ Leb}_{{{\mathbb {Z}}}^2}\), where \(\text{ Leb}_{{{\mathbb {Z}}}^2}\) is the counting measure on \({{\mathbb {Z}}}^2\). An immediate consequence of Theorem 7.3 is
Corollary 7.5
Assume the assumptions and notation of Theorem 7.3. Let \(\rho \rightarrow 0\) and simultaneously \(n\rightarrow \infty \) in such a way that \(M(\rho )=o(\log n)\). Then
Remark 7.6
The class of functions in Corollary 7.5 is rather restrictive as the functions v, w are supported on the cell \({{\mathcal {M}}}\). Given the work [28] (see also [29, Sect. 6]), it is very plausible that the present mixing result can be generalized to a suitable class of dynamically Hölder functions supported on the whole of \({\widehat{{{\mathcal {M}}}}}\). Since the involved argument is rather delicate and not a main concern of the present work, we omit this.
Data Availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Notes
We will also use the notations \(\gamma _{\rho }=1-\hat{\theta }_{\rho }\) and \(\hat{\gamma }_{\rho }=1-\hat{\vartheta }_{\rho }\).
Note that our set-up fits the conditions (H1)-(H5) in [16, Sect. 2.1] with \(f(x)=f(\theta ,\phi )=\cos \phi \) and \(\kappa _\rho =1\) in (H1), \(r_h=r_0+1\) in (H2), \(\xi =\frac{1}{2}\) and \(t_0=1\) in (H3), \(p_0=\frac{1}{r_0+1}\) in (H4) and \(\gamma _0=0\) in (H5).
In the definition of the weak norm [16] uses test functions with \(|\psi |_{W,\gamma ,p}\le 1\) for some \(\gamma >0\), and requires \(p<\gamma \). However, this is needed only to ensure that the inclusion \({{\mathcal {B}}}_w\hookrightarrow (C^p)'\) is injective, cf. [16, Lemma 3.8] Since we do not use this property, we can take \(\gamma =0\) in the definition of the weak norm, and avoid additional restrictions on \(p_0\).
Usually called \({\mathcal {B}}\) in billiard literature such as [10], but we write \(\Omega \) to avoid confusion with Banach spaces \({{\mathcal {B}}}\).
References
Aaronson, J., Denker, M.: Local limit theorems for partial sums of stationary sequences generated by Gibbs–Markov maps. Stoch. Dyn. 1, 193–237 (2001)
Aaronson, J., Denker, M.: A local limit theorem for stationary processes in the domain of attraction of a normal distribution. In: Balakrishnan, N., Ibragimov, I.A., Nevzorov, V.B. (eds.) Asymptotic Methods in Probability and Statistics with Applications. International Conference, St. Petersburg, Russia, 1998, Basel: Birkhäuser, 215–224 (2001)
Bálint, P., Chernov, N., Dolgopyat, D.: Limit theorems in dispersing billiards with cusps. Commun. Math. Phys. 308, 479–510 (2011)
Bálint, P., Gouëzel, S.: Limit theorems in the stadium billiard. Commun. Math. Phys. 263, 461–512 (2006)
Bleher, P.: Statistical properties of two-dimensional periodic Lorentz gas with infinite horizon. J. Stat. Phys. 66(1), 315–373 (1992)
Bruin, H., Terhesiu, D.: Regular variation and rates of mixing for infinite measure preserving almost Anosov diffeomorphisms. Ergod. Theory Dyn. Syst. 40, 663–698 (2020)
Bruin, H., Terhesiu, D., Todd, M.: Pressure function and limit theorems for almost Anosov flows. Commun. Math. Phys. 382(1), 1–47 (2021)
Calkin, N., Wilf, H.: Recounting the rationals. Amer. Math. Monthly 107, 360–363 (2000)
Chernov, N.: Decay of correlations and dispersing billiards. J. Stat. Phys. 94, 513–556 (1999)
Chernov, N., Markarian, R.: Chaotic billiards. Mathematical Surveys and Monographs, American Mathematical Society 127 (2006)
Chernov, N., Dolgopyat, D.: Anomalous current in periodic Lorentz gases with infinite horizon. Russian Math. Surveys 64–4, 651–699 (2009)
Demers, M., Liverani, C.: Stability of statistical properties in two dimensional piecewise hyperbolic maps. Trans. Am. Math. Soc. 360, 4777–4814 (2008)
Demers, M., Liverani, C.: Projective cones for sequential dispersing billiards. Commun. Math. Phys. (2023). https://doi.org/10.1007/s00220-023-04657-1
Demers, M., Zhang, H.-K.: Spectral analysis of the transfer operator for the Lorentz gas. J. Mod. Dyn. 5(4), 665–709 (2011)
Demers, M., Zhang, H.-K.: A functional analytic approach to perturbations of the Lorentz gas. Commun. Math. Phys. 324, 767–830 (2013)
Demers, M., Zhang, H.-K.: Spectral analysis of hyperbolic systems with singularities. Nonlinearity 27, 379–433 (2014)
Denker, M.: Uniform integrability and the central limit theorem for strongly mixing processes. In: Eberlein, E., Taqqu, M. S., (eds.) Dependence in Probability and Statistics. A Survey of Recent Results. Birkhäuser, Boston (1986)
Gouëzel, S.: Central limit theorems and stable laws for intermittent maps. Prob. Theory Rel. Fields 1, 82–122 (2004)
Hardy, G., Wright, E.: An Introduction to the Theory of Numbers, 5th edn. Oxford University Press, Oxford (1979)
Keller, G., Liverani, C.: Stability of the spectrum for transfer operators. Ann. della Scuola Normale Superiore di Pisa Classe di Sci. 19, 141–152 (1999)
Liverani, C., Terhesiu, D.: Mixing for some non-uniformly hyperbolic systems. Ann. Henri Poincaré 17(1), 179–226 (2016)
Lutsko, C., Tóth, B.: Invariance principle for the random Lorentz gas: beyond the Boltzmann–Grad limit. Commun. Math. Phys. 379, 1–44 (2020)
Lorentz, H.A.: The motion of electrons in metallic bodies. Proc. Amsterdam Acad. 7, 438–453 (1905)
Marklof, J., Strömbergsson, A.: The Boltzmann–Grad limit of the periodic Lorentz gas. Ann. Math. 174, 225–298 (2011)
Marklof, J., Strömbergsson, A.: The periodic Lorentz gas in the Boltzmann–Grad limit: asymptotic estimates. Geometric Funct. Anal. 21, 560–647 (2011)
Marklof, J., Tóth, B.: Superdiffusion in the periodic Lorentz gas. Commun. Math. Phys. 347(3), 933–981 (2016)
Melbourne, I., Terhesiu, D.: Renewal theorems and mixing for non Markov flows with infinite measure. Ann Inst. H. Poincaré (B) Probab Stat. 56, 449–476 (2020)
Pène, F.: Mixing and decorrelation in infinite measure: the case of the periodic Sinaĭ Billiard. Ann Inst. H. Poincaré (B) Probab. Stat. 55(1), 378–411 (2019)
Pène, F., Terhesiu, D.: Sharp error term in local limit theorems and mixing for Lorentz gases with infinite horizon. Commun. Math. Phys. 382(3), 1625–1689 (2021)
Szász, D., Varjú, T.: Local limit theorem for the Lorentz process and its recurrence in the plane. Ergod. Theory Dyn. Syst. 24, 254–278 (2004)
Szász, D., Varjú, T.: Limit laws and recurrence for the planar Lorentz process with infinite horizon. J. Stat. Phys. 129(1), 59–80 (2007)
Wang, F., Zhang, H., Zhang, P.: Decay of correlations for unbounded observables. Nonlinearity 34(4), 2402–2429 (2021)
Weber, M.: On Farey sequence and quadratic Farey sums. Res. Number Theory 8 (2022), no. 1, Paper No. 14, 22 pp
Young, L.-S.: Statistical properties of dynamical systems with some hyperbolicity. Ann. Math. 147, 585–650 (1998)
Acknowledgements
We thank the anonymous referees for useful comments. HB was supported by the FWF Project P31950-N35. PB was supported by National Research, Development and Innovation Office—NKFIH, Projects K123782, K142169 and KKP144059. PB and HB acknowledge the support of Stiftung AÖU Project 103öu6. DT was partially supported by EPSRC grant EP/S019286/1.
We would like to thank M. Weber for his helpful correspondence on the results established in [33].
Funding
Open access funding provided by Budapest University of Technology and Economics.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
A Estimates on corridors
1.1 A.1 Estimating \({{\mathbb {P}}}(\kappa _\rho = \xi '+N\xi )\)
Given a corridor associated to \(\xi \), there a neighborhood \(U_0\) of \(x_0 = x_0(\xi )\) in \(\partial O_0 \times [-\frac{\pi }{2},\frac{\pi }{2}]\) of initial conditions x such that the next collision occurs at a scatterer on the opposite side of the corridor. For this situation, [31] use the coordinates \((\alpha , z)\), where \(\alpha \) is the angle the trajectory of some \(x \in \partial O_0\) makes with the tangent line at \(x_0\), and the intersection point is \(y = x_0+z \xi \), see Fig. 5.
Lemma A.1
In coordinates \((z,\alpha )\) the volume form in a neighborhood of \(x_0 = x_0(\xi )\) is
Proof
The part \(\sin \alpha \, d\alpha \, dz\) can be understood because the Liouville measure of the billiard flow projects to a form \(\cos \varphi \, d\varphi \, dr\) for any transversal section parametrised by arc-length r and with \(\varphi \) the angle of the trajectory to the normal vector at the collision point. When this section is the line \(y = x_0 + x \xi \), we have \(\alpha = \frac{\pi }{2}-\varphi \), so \(\cos \varphi = \sin \alpha \). But to get the correct normalizing constant, we give a more extensive argument. From Fig. 5 we have
After making \(\alpha \) and z subject of these equations, we see that the change of coordinates involved is
The Jacobian determinant is
Thus, using (44) and some trigonometric formulas,
as claimed. \(\square \)
The following is [31, Proposition 6] in more detail:
Lemma A.2
Suppose that the scatterers have radius \(\rho > 0\) and the width of the corridor given by \(\xi \) is \(d_\rho (\xi )\). Then
where \(\xi '\) as in Remark 2.2 is the integer vector on the boundary of the corridor opposite to the \(\xi \)-boundary.
Proof
We take the region in \((z,\alpha )\)-coordinates where \(\kappa _\rho = N \xi + \xi '\). In the z-direction this is an interval \([z_0, z_1]\), where for \(z = z_0\), there is only one line connecting \(O_0\) and \(O_{\kappa _\rho }\), namely the common tangent line of \(O_0\) and \(O_{\kappa _\rho -\xi }\). For \(z = z_1\) there is also is only one line, namely the common tangent line of \(O_\xi \) and \(O_{\kappa _\rho }\), see Fig. 6. These two lines are obtained from each other by translation over one unit \(\xi \), so \(z_1\xi - z_0\xi = |\xi |\). However, if \(\rho \) is small compared to N, these two tangent lines are the common tangent lines at the upper sides of \(O_0\) and \(O_{\kappa _\rho }\) and at the lower sides of \(O_0\) and \(O_{\kappa _\rho }\). In this case
This also shows that the transition between the two cases is when \(2\rho = \frac{d_\rho (\xi )}{N}\).
For each \(z \in [z_0, z_1]\), the range of possible values of \(\alpha \) is again bounded by the \(\alpha \)’s obtained at the tangent lines to \(O_{\kappa _\rho -\xi }\) and \(O_{\kappa _\rho }\). Therefore, see Fig. 7,
Since \(|\xi '| \le |\xi |\) (see Remark 2.2) and \(z \le |\xi |\) as well, each \(\alpha \) in this interval satisfies \(\alpha = \frac{d_\rho (\xi )}{N\, |\xi |}(1 + {{\mathcal {O}}}(N^{-1}))\) and
Integrating the density given in Lemma A.1 for the case \(2\rho \ge \frac{d_\rho (\xi )}{N}\) (so \(|z_1-z_0| = |\xi |\)) and using \(|z_1-z_0|=|\xi |\) and the approximation \(\cos \alpha _0 - \cos \alpha _1 \sim \frac{1}{2} (\alpha _1+\alpha _0) (\alpha _1-\alpha _0)\) gives:
Now for the case \(2\rho < \frac{d_\rho (\xi )}{N}\), see Fig. 7 with small version of \(O_{\kappa _\rho }\), we have
so still \(\alpha = \frac{d_\rho (\xi )}{N\, |\xi |} + {{\mathcal {O}}}(N^{-2})\) and \(\alpha _1(z) - \alpha _0(z) = \frac{2\rho }{N|\xi |} (1+ {{\mathcal {O}}}(N^{-1})\).
Integrating as before gives, using (45) and the fact that \(d_{\rho }(\xi )+2\rho =|\xi |^{-1}\) from Lemma 2.1:
as required. \(\square \)
1.2 A.2 Corridors sums
Let \(\varphi \) be Euler’s totient function, i.e., the number of integers \(1 \le q \le p\) coprime with p. The following lemma is classical number theory, but we couldn’t locate a proof of the full statement.
Lemma A.3
For every \(a > -2\), we have
where \(\zeta \) is the Riemann \(\zeta \)-function, so \(\zeta (2) = \frac{\pi ^2}{6}\).
Proof
Let \(\mu \) be the Möbius function. A standard equality is \(\varphi (n) = \sum _{d|n} \mu (d) \frac{n}{d}\). Therefore
where we used the Dirichlet series identity \(\sum _{d=1}^\infty \frac{\mu (d)}{d^s} = \frac{1}{\zeta (s)}\) for \(s=2\).
As an aside, there are asymptotic formulas for \(s > 2\)
see [19, Theorem 288]. \(\square \)
In the course of this paper we denote, for a fixed value of \(\rho \), the set of corridors that are “visible” from the origin by \({\Psi }\). As described in Lemma 2.1, these can be characterized by pairs \((\xi ,\xi ')\in {{\mathbb {Z}}}^2 \times {{\mathbb {Z}}}^2\) where \(\xi = (p,q)\), \(\gcd (p,q) = 1\) and \(|\xi | \le (2\rho )^{-1}\), while \(\xi '\) may denote either the first or the second convergent preceding \(\xi \) in the continued fraction expansion of p/q, see Remark 2.2. Sums of the type in the following lemma are used throughout the paper.
Lemma A.4
We have
Proof
Using the two coordinate axes and their bisectrices, we divide the plane into eight sectors and for each sector, we sum the scatterers in \({{\mathcal {S}}}\). Circular sections of radius R have asymptotically \(\frac{\pi }{4}\) as many points as triangular sectors with base R. Also, every corridor direction in this sector comes with two corridors \((\xi ,\xi ')\) and \((\xi ,\xi '')\). By Lemma A.3, their sum is, for \(a > -2\),
If \(a = -2\), then a similar computation gives \(\asymp |\log \rho |\), and for \(a < -2\), the series is summable: \(4\pi \sum _{1 \le p \le (2\rho )^{-1}} \phi (p) p^a \le 4\pi \int _1^\infty x^a \, dx = -\frac{4\pi }{2+a}\). \(\square \)
Lemma A.5
For \(p \in [1,2)\), the p-norm of the displacement function satisfies
Proof
Take \(p \in [1,2)\). We estimate over all \(\xi \)-corridors similarly as in Lemma A.4:
Taking the p-th root gives the result. \(\square \)
Lemma A.6
Let \(W \in {{\mathcal {W}}}^s\) be a stable leaf, and let \({\Psi }_W\) stand for all lattice points \(\xi = (p,q) \in {\Psi }\) that can be reached from \(O_0\) with coordinates in W. Then for every \(a \in (\frac{1}{2},1)\),
Proof
There is an arc \({\tilde{W}} \in {{\mathbb {S}}}^1\) of length \(|{\tilde{W}}| \ll |W|\) such that every lattice point that can be reached from \(O_0\) with coordinates in W has its polar angle in \({\tilde{W}}\). Due to the symmetries in the \({{\mathbb {Z}}}^2\), it suffices to study \({\tilde{W}} \subset [0, \pi /2]\), so the lattice point \(\xi = (p,q)\) in this sector satisfy \(0 \le q \le p\) and \(\tan ({\tilde{W}}) \subset [0,1]\). In fact, we will start by assuming that \(\tan ({\tilde{W}}) \in [\frac{1}{10}, \frac{9}{10}]\).
Because \(p^2+q^2 \ge 2pq\) for all \((p,q) = \xi \), we have \(\sum _{(\xi ,\xi ') \in {\Psi }_W} |\xi |^{-a} \ll 2^{-a/2} \sum _{(\xi ,\xi ') \in {\Psi }} \frac{1}{(pq)^{a/2}} 1_{{\tilde{W}}}(\frac{q}{p})\). We will apply an estimate from [33, Theorem 2.2], which, in our terminology, reduces to
where \(C_a\) is a constant depending only on a, and \(c_{\psi }(\ell )\) is the \(\ell \)-th Fourier coefficient of \(x \mapsto \psi (x) x^{-a}\).
If \(\psi = 1_{{\tilde{W}}}\), then these Fourier coefficients are not summable, so we first smoothen \(1_{{\tilde{W}}}\) to a function \(\psi \) with \(\mathop {\text{ supp }}(\psi )\) concentric to \({\tilde{W}}\) and \(|\mathop {\text{ supp }}(\psi )| = |{\tilde{W}}|= 3|W|\). On \({\tilde{W}}\) itself, \(\psi \equiv 1\) and on the two interval components \(\psi \) is a translated copy of the function \(f_W:[-\frac{|W|}{2}, \frac{|W|}{2}] \rightarrow {{\mathbb {R}}}\) defined by
Then \(\int \psi \, dx = 2|W|\) and integrating by parts twice gives an estimate of the Fourier coefficients of \(x \mapsto \psi (x) x^{-a}\).
because \(\mathop {\text{ supp }}(\psi )\) is bounded away from \(\{ 0,1\}\) (so \(x^{-a}\) doesn’t blow up) and \((\psi (x) x^{-a})'' = 0\) outside \(\mathop {\text{ supp }}(\psi )\).
The Dirichlet series of the Möbius function can be estimated as \(\left| \sum _{k=1}^{\lfloor 1/(2\rho d)\rfloor } \mu (k) k^{-a}\right| \le (2\rho d)^{1-a}\). We use this and the fact that \(\ell \in {{\mathbb {N}}}\) has \(O(\ell ^{1/2})\) divisors to estimate the last big O-term in (48).
Hence (48) becomes
as required.
It remains to consider the cases that \(\tan ({\tilde{W}}) \not \subset [\frac{1}{10}, \frac{9}{10}]\). Suppose instead that \(\tan ({\tilde{W}}) \subset (0, \frac{1}{10}]\) (we ignore \(\xi = (0,1)\) and \(\xi = (1,0)\)). In this case, we give an injection between the lattice points in the \({\tilde{W}}\)-sector with coprime coordinates to the set of lattice points (with coprime coordinates and comparable norm) in a sector of comparable width, but near polar angle \(\frac{1}{2}\). Indeed, set \({{\mathbb {Q}}}_{cp} = \{ q/p: 0 \ne p, q \in {{\mathbb {Z}}}, \gcd (p,q) = 1\} \cup \{0\}\) and \({{\mathbb {Z}}}_{cp}:= \{ (p, q) \in {{\mathbb {Z}}}^2: \gcd (p,q) = 1\}\), and define the Calkin-Wilf map \(f:{{\mathbb {Q}}}_{cp} \rightarrow {{\mathbb {Q}}}_{cp}\) as well as \(g:{{\mathbb {Z}}}_{cp} \rightarrow {{\mathbb {Z}}}_{cp}\) by
The f-orbit of 0 enumerates all non-negative lowest-term rationals, see [8], and g is the same map expressed on the collection of lattice points. Since \(f^2( (0,\frac{1}{10}] ) \subset (\frac{1}{2},\frac{10}{21}]\) and \(|g(\xi )| \le 4|\xi |\), the second iterate \(g^2\) provides the required injection. In case \(\tan (\tilde{W}) \subset [\frac{9}{10},1)\) we use \(g^3\). \(\square \)
B Distortion properties
Throughout, a uniform constant is a constant that is independent of \(\rho \).
Let us recall some terminology and notations from [10, Chapter 4]. Unstable curves generate dispersing wavefronts, which are evolved by the free flight, and then leave traces of unstable curves on the scatterer at the next collision. For wavefronts it is convenient to use the Jacobi coordinates \((d\xi ,d\omega )\), and an important quantityFootnote 5\(\Omega =\frac{d\omega }{d\xi }\), the curvature of the wavefront. Let \(\Omega ^-\) and \(\Omega ^+\) denote its value immediately before and after a particular collision, respectively.
On the scatterer, the traditional coordinates are \((r,\phi )\) yet, we prefer to use the \(\rho \)-independent \((\theta ,\phi )\) and take advantage of
First we relate \(\Omega ^-\) to the slope of the unstable curve: \(\frac{1}{2\pi } \frac{d\phi }{d\theta } =\rho \Omega ^- \cos \phi +1\). Differentiating with respect to \(\theta \) gives
Lemma B.1
There exists a uniform constant \(C>0\) such that for any \(C^2\) smooth unstable curve W there exists \(n_W\) such that for \(n\ge n_W\) on all components of \(T_{\rho }^nW\) we have
Thus we may restrict to the class of regular unstable curves for which (50) holds. Also, this shows that as \(\rho \rightarrow 0\), the unstable curves limit in a \(C^2\) sense to straight lines of slope \(2\pi \).
Proof
The properties of the free flight are not effected by shrinking the scatterers or using the \(\theta \)-coordinate. Thus
and, by (49), it is enough to show
to prove the lemma. Now \(\frac{d\Omega ^-}{d\theta }=(2\pi \rho ) \frac{d\Omega ^-}{dr}\), and the evolution of \(\frac{d\Omega ^-}{dr}\) is discussed in [10, section 4.6]. Following the notation there, introduce
and use superscripts − and \(+\) to denote pre- and post-collision values of these quantities, respectively. [10, Formula (4.37)] states
where
and by the analysis of [10, page 81]:
-
\(F_1\) remains constant between collisions
-
there exists a uniform constant \(\Theta <1\) such that \(\frac{\Omega ^-}{\Omega ^+}\le \Theta \),
-
there exists a uniform constant \(C_1>0\) such that \(|H_1|\le C_1\). This remains valid for shrinking \(\rho \) as the denominator scales with \(\rho ^{-3}\) while the numerator scales with \(\rho ^{-2}\).
Hence it follows that \(|F_1(n+1)|\le \Theta ^3 |F_1(n)| +C\), where \(F_1(n)\) is the value of \(F_1\) between the n-th and the \((n+1)\)st collision. This implies that there exists \(C_2>0\) and \(n_W\) (depending on the curve W) such that for any \(n\ge n_W\) we have \(|F_1(n)|\le C_2\).
Now \(|{{\mathcal {E}}}_1^-|=|F_1^-|\cdot (\Omega ^-)^3\le C_3\) for some uniform \(C_3>0\), and finally [10, Formula (4.24)] states
which thus implies that \(\left| \frac{d\Omega ^-}{dr}\right| \le C_4\) for some uniform constant \(C_4>0\). This bound completes the proof of the lemma. \(\square \)
It follows that regular unstable curves can be parametrised by the coordinate \(\theta \), and for any smooth function \(f:W\rightarrow {{\mathbb {R}}}\), \(\frac{df}{d\theta }\asymp \frac{df}{dx}\), where x is (Euclidean) arc-length along the curve—\(dx^2=d\theta ^2+d\phi ^2\) (not to be confused with the arc-length r along the scatterer).
Let us also recall that an unstable curve is homogeneous if it is regular and contained in one of the homogeneity strips \({{\mathbb {H}}}_k=\{(\theta ,\phi ): \frac{\pi }{2}-k^{-r_0}< \phi < \frac{\pi }{2}-(k+1)^{-r_0} \}\). For such curves, analogous to [10, Formula (5.13)], we have
for some uniform constant \(C>0\), where \(\phi \) corresponds to any point of W. (This follows as the slope of the curve is uniformly bounded away from 0 and \(\infty \).)
Distortion bounds are stated as follows. Let W be a homogeneous unstable curve, and assume that for some \(N\ge 1\), \(W_n=T_{\rho }^{-n}W\) is a homogeneous unstable curve for \(n=0,1,\dots ,N\). For \(x\in W\), let \(x_n=T_{\rho }^{-n}x\in W_n\). Let \(J_WT_{\rho }^{-n}(x)\) and \(J_{W_n} T_{\rho }^{-1}(x_n)\) denote the respective Jacobians.
Lemma B.2
Consider W and N as above and \(y,z\in W\) arbitrary. There exists a uniform constant \(C_d>0\) such that
Proof
The lemma relies on the inequality
for some uniform \(C>0\), cf. [10, Formula (5.8)].
Using this formula the argument in the proof of [10, Lemma 5.27] can be repeated literally:
where we have used the chain rule, (52), and (51) and the uniform hyperbolicity.
It remains to prove (52). Here we essentially follow [10, pp. 106–107]. We have
We consider the derivatives of these terms separately. As noted above, differentiation with respect to \(\theta _n\) and \(x_n\) can be interchanged. By Lemma B.1, the derivative of the second term w.r.t. \(\theta _n\) is uniformly bounded. The same applies to the derivative of the third term with respect to \(\theta _{n+1}\), while
is uniformly bounded from above. The first term gives the main contribution: as \(\cos \phi _n\) is not bounded away from 0, the derivative of its logarithm is
The fourth term is the logarithm of the quantity
which is bounded from below, but not from above. It is thus (more than) enough to show that, when taking the derivative, all contributions to the numerator are uniformly bounded. This holds immediately by the previous discussion for all the terms except \(\frac{2}{\rho } \frac{d\tau _{n+1}}{dx_n}\) which requires further investigation. Note that
where \(P(x_n)\) and \(P(x_{n+1})\) are points on the billiard table (and thus on \({{\mathbb {R}}}^2\)) associated to the points \(x_n\in W_n\) and \(x_{n+1}\in W_{n+1}\) on the two scatterers, respectively. In an appropriate reference frame \(P(x_n)=(\rho \cos \theta _n,\rho \sin \theta _n)\) hence the \(\theta _n\)-derivatives of both coordinates are \(\ll \rho \), and the same holds for the \(\theta _{n+1}\)-derivatives of the coordinates of \(P(x_{n+1})\). Thus
which is sufficient for our purposes. \(\square \)
C Decay of correlation for \(\kappa _\rho \).
The main result of this section is the justification of (37), that is
Proposition C.1
There exist \({\hat{C}}_\rho >0\) and \({\hat{\vartheta }}_\rho <1\) such that
-
for any \(j \ge 1\) we have
$$\begin{aligned}&\Big | \int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1)\, (e^{it\kappa _\rho }-1)\circ T_{\rho }^{j}\, d\mu - \int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1) \,d\mu \int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1)\,d\mu \Big | \nonumber \\&\quad \le {\hat{C}}_\rho |t|^2 {\hat{\vartheta }}_\rho ^j, \quad \end{aligned}$$(54) -
furthermore, there exist \(\bar{C}_{\rho }>0\) and \(\bar{\vartheta }_{\rho }<1\) such that for any \(j, \ell \ge 1\) we have
$$\begin{aligned} \Big | \int _{{{\mathcal {M}}}_0}&(e^{it\kappa _\rho }-1) R_\rho ^{\ell }(e^{it\kappa _\rho }-1)\, (e^{it\kappa _\rho }-1)\circ T_{\rho }^{j}\, d\mu \nonumber \\&-\int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1) R_\rho ^{\ell }(e^{it\kappa _\rho }-1)\,d\mu \int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1)\,d\mu \nonumber \\&-\,\Big (\int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1)\,d\mu \Big ) \int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1)\, (e^{it\kappa _\rho }-1)\circ T_{\rho }^{j}\, d\mu \nonumber \\&+\,\Big (\int _{{{\mathcal {M}}}_0} (e^{it\kappa _\rho }-1)\,d\mu \Big )^3\Big |\ \le \ \bar{C}_{\rho }|t|^2 \bar{\vartheta }_{\rho }^{\ell +j}. \end{aligned}$$(55)
The \(\rho \)-dependence of this exponential rate gives the main source of unknown dependence on \(\rho \) in the main results of our paper. During the proof we will point out the exact sources of unknown dependence of \(\bar{C}_{\rho }>0\) and \(\bar{\vartheta }_{\rho }<1\) on \(\rho \).
Let us make some comments on the relations of the two estimates of Proposition C.1. We will first prove (54) with some \({\hat{C}}_\rho >0\) and \({\hat{\vartheta }}_\rho <1\) that we can explicitly relate to the correlation decay rates of the map \(T_{\rho }\) on Hölder functions, as expressed in (66) below. Then we extend our argument to obtain (55) for some \(\bar{C}_{\rho }>{\hat{C}}_\rho \) and \(\bar{\vartheta }_{\rho }\in ({\hat{\vartheta }}_\rho ,1)\). Obtaining relations similar to (66) for the constants \(\bar{C}_{\rho }\) and \(\bar{\vartheta }_{\rho }\) seems quite difficult and we do not push this point.
The proof of Proposition C.1 consists in: a) reconsider [11, Proposition 9.1]; b) only for (55), work with a version of \(R_\rho \) with spectral gap in a Banach space embedded in some \(L^p\) space with \(p>1\). Item a) is needed in order to obtain the bound \(|t|^2\) and the decay of correlation in j. Item b) is needed to obtain the joint decay in j and \(\ell \). Item b) is possible because for every \(\rho >0\), there exists a Young tower \(\Delta _\rho \) and a tower map \(T_{\Delta _\rho }\) associated with the billiard map \(T_\rho \); this is ensured by the construction in [9, 34]. We emphasize that we will not exploit any fine dependence on \(\rho \) of \(T_{\Delta _\rho }\) (the mere existence is enough), which is why this part of our arguments can be worked on the Young tower \(\Delta _\rho \).
1.1 C.1 Standard pair argument
In this section we reconsider [11, Proposition 9.1]. Let us introduce truncation levels \(H,{\hat{H}} > 0\) to be fixed later and
As \(|\kappa _\rho | \asymp |\xi | m\) on \(D_{\xi ,m}\), the truncation \(\kappa _\rho '\) restricts \(\kappa _\rho \) to the cells \(D_{\xi ,m}\) with \(m\le H |\xi |^{-1}\).
The result we will use in the proof of Proposition C.1 below is
Lemma C.2
For any \(c_0>2\) we have
-
(i)’
\(\int |\kappa _\rho '| \cdot |\kappa _\rho ''''| \circ T_{\rho }^j \, d\mu \le CH^2 {\hat{H}}^{-1} \rho ^{-3}\),
-
(ii)’
\(\int |\kappa _\rho '' | \cdot |\kappa _\rho | \circ T_{\rho }^j \, d\mu \le C |\log \rho |\cdot \left( H^{-\frac{1}{2} +\frac{1}{2r_0}} \, \log H \, \rho ^{-3-\nu }+ H^{2-c_0} \rho ^{-2-c_0}\right) \).
Furthermore, for any \(q\in \left( 1,\frac{8}{7}-\frac{6}{7(7r_0-1)}\right) \) and \(c \in (\frac{q+1}{2-q}\,, \, \frac{1-1/r_0}{2q-2}-1)\),
-
(i)
\(\int |\kappa _\rho '|^q \cdot |\kappa _\rho ''''|^q \circ T_{\rho }^j \, d\mu \le CH^{q+1} {\hat{H}}^{q-2} \rho ^{-3}\),
-
(ii)
\(\int |\kappa _\rho '' |^q \cdot |\kappa _\rho |^q \circ T_{\rho }^j \, d\mu \le C \Big (H^{-\frac{3}{2}+q+c(q-1)+\frac{1}{2r_0}} \rho ^{c(q-1)-q-2-\nu } + H^{c(q-2)+q+1} \rho ^{-1-q-c(2-q)}\Big )\).
Remark C.3
Let \(q(r_0)=\frac{8}{7}-\frac{6}{7(7r_0-1)}\), the upper bound on q for \(r_0\) fixed. Furthermore, let \(c_1(q)=\frac{q+1}{2-q}\) and \(c_2(q)=\frac{1-1/r_0}{2q-2}-1\), the lower and upper bounds on c for q fixed. Note that \(c_1(q)\) is increasing in q, while \(c_2(q)\) is decreasing in q, and \(c_1(q(r_0))=c_2(q(r_0))\). Also \(c_1(1)=2\) and \(c_2(1)=\infty \), which is in accordance with the conditions on \(c_0\). Note also that:
-
The condition \(c<c_2(q)=\frac{1-\frac{1}{r_0}}{2q-2}-1\) is equivalent to \(q+c(q-1)<\frac{3}{2}-\frac{1}{2r_0}\). This ensures that the power of H in the first term of (ii) is negative.
-
Since \(c>c_1(q)=\frac{q+1}{2-q}\), the power of H in the second term of (ii) is negative.
-
Choosing \({\hat{H}}=H^c\), the power of H in (i) is also negative, again for \(c>c_1(q)=\frac{q+1}{2-q}\).
Standard pairs and families Let us recall some terminology related to standard pairs, see also [11, page 29]. A standard pair \(\ell =(W,h_W)\) is a regular unstable curve W that supports a dynamically log-Hölder continuous probability density \(h_W\). As such, it can be regarded as a probability measure on the phase space \({{\mathcal {M}}}\), which will be denoted by \(\ell \), too.
A standard family is a collection of standard pairs \({{\mathcal {G}}}=\{\ell _a\}\), \(a\in {{\mathcal {A}}}\) equipped with a probability factor measure \(\lambda _{{{\mathcal {G}}}}\) on \({{\mathcal {A}}}\). This induces a probability measure \({{\mathbb {P}}}_{{{\mathcal {G}}}}\) on \({{\mathcal {M}}}\).
For a standard pair \(\ell =(W,h_W)\) any \(x\in W\) splits W into two subcurves, let \(r_W(x)\) denote the length of the shorter, and let \({{\mathcal {Z}}}_{\ell }=\sup _{\varepsilon >0} \varepsilon ^{-1} \ell (r_W\le \varepsilon )\). By Hölder continuity of \(\log h_W\), \(\ell \) is equivalent to the normalized Lebesgue measure on W and thus \({{\mathcal {Z}}}_{\ell } \asymp |W|^{-1}\). This generalizes for the Z-function of a standard family \({{\mathcal {Z}}}_{{{\mathcal {G}}}} \asymp \int \frac{\lambda _{{{\mathcal {G}}}}(a)}{|W_a|} dm_W\).
The \(T_{\rho }\)-image of a standard pair is a countable collection of standard pairs. Hence, the image of a standard family is a standard family. Given a standard family \({{\mathcal {G}}}\), for \(n\ge 1\), \({{\mathcal {G}}}_n\) denotes the \(T_{\rho }^n\)-image of \({{\mathcal {G}}}\). It follows from the growth lemma (Proposition 3.1) that there exists \(\vartheta <1\) and \(C_1,C_2>0\) such that
where \(\delta _0 \asymp \rho ^{\nu }\) (see (16) and Remark 3.2, part (i)). As consequence, for any standard pair and \(n\ge 1\)
Cells For \(\xi \in {{\mathbb {Z}}}^2\) such that the corridor is opened up, and for \(m\in {{\mathbb {Z}}}\) let \(D_{\xi ,m}\subset {{\mathcal {M}}}\) denote the set of points for which \(\kappa _\rho =m\xi +\xi '\). The geometric properties of \(D_{\xi ,m}\) and its image \(T_{\rho }D_{\xi ,m}\) will play an important role in the argument. \(T_{\rho }D_{\xi ,m}\) is depicted in Fig. 2. A similar description applies to \(D_{\xi ,m}\); it is delimited by a long singularity curve, decreasing in the \((\theta ,\varphi )\) coordinates, which is connected to the boundary of \({{\mathcal {M}}}\) by two shorter decreasing singularity curves, of length \(\asymp (|\xi |\rho m)^{-1/2}\), running at a distance \(\asymp (|\xi | m)^{-2}\) from each other. Further properties:
-
\(\mu (D_{\xi ,m})=\mu (T_{\rho }D_{\xi ,m})\asymp \rho ^{-1} |\xi |^{-3}m^{-3}\) (due to the factor \(\cos \phi \) in the measure);
-
an unstable curve may intersect \(D_{\xi ,m}\) in a subcurve of length \(\le C (|\xi | m)^{-2}\);
-
\(T_{\rho }D_{\xi ,m}\) intersects homogeneity strips of index \(k\ge C(\rho |\xi | m)^{\frac{1}{2r_0}}\)
If \(\ell =(W,h_W)\) is a standard pair, then it can intersect \(D_{\xi ,m}\) in a subcurve of length \(\le C (|\xi | m)^{-2}\), thus the intersection has probability bounded above by \(C (|\xi | m)^{-2}|W|^{-1} \asymp {{\mathcal {Z}}}_{\ell } (|\xi | m)^{-2}\). It follows that for a standard family \({{\mathcal {G}}}\) we have
Our argument below follows the proof of [11, Proposition 9.1] taking into account that the corridor structure depends on \(\rho \).
Proof of Lemma C.2
For item (i), using \(\mu (T_{\rho }^{-j}D_{\hat{\xi },{\hat{m}}}) \ll \rho ^{-1} {\hat{m}}^{-3} |\hat{\xi }|^{-3}\) as well as Lemma A.4 several times, we get
We will take \({\hat{H}}=H^c\) for \(c>0\) to be determined. To get a negative power of H, we need \(q<2\) and \(c>\frac{q+1}{2-q}\).
For the proof of (ii), we need to estimate
For different ranges of the indices, we will use two different estimates to bound \(\mu (D_{\xi ,m}\cap T_{\rho }^{-j}D_{\hat{\xi },\hat{m}})\). On the one hand, as before, we have
For the other estimate, foliate \(D_{\xi ,m}\) with unstable curves |W| of length \(\asymp (|\xi |m)^{-2}\). The image of any such curve stretches along \(T_{\rho }D_{\xi ,m}\), crossing homogeneity strips with indices \(k\ge C(\rho |\xi | m)^{\frac{1}{2r_0}}\). The piece of \(T_{\rho }W\) in the k-th homogeneity strip will be denoted by \(T_{\rho }W_k\), it has length \(\asymp k^{-r_0-1}\), and its preimage has length
as the expansion factor of \(T_{\rho }\) on \(W_k\) is \(\asymp \rho ^{-1} |\xi | m k^{r_0}\). Equipped with the conditional measure induced by \(\mu \), W is a standard pair \(\ell =(W,h_W)\), and its image is a standard family \(T_{\rho }\ell \) associated to the curves \(T_{\rho }W_k\). To obtain the Z function, we use that the weight of \(|T_{\rho }W_k|\) within this family is \(\frac{|W_k|}{|W|}\), thus
This analysis applies to all the curves in the foliation. Accordingly, \(\mu \) conditioned on \(D_{\xi ,m}\) can be regarded as a standard family \({{\mathcal {G}}}\), and the \({{\mathcal {Z}}}\)-function of its \(T_{\rho }\)-image satisfies
For further iterates, it follows form (56) that
Now we apply (57) to get
We split (58) into two parts. If \(\hat{m}\le m^c\) (for some \(c > 0\) to be determined), we use (60) and get
where we have used that because \(q+c(q-1)<\frac{3}{2}-\frac{1}{2r_0}\), the contribution of m is summable (this condition is equivalent to \(c<c_2(q)=\frac{1-\frac{1}{r_0}}{2q-2}\), cf. Remark C.3). Note that if \(q=1\) then this contribution is independent of c; however, there is an additional factor of \(|\log \rho |\cdot \log H\).
For \(m > m^c\) we use (59) and get
and in case \(q=1\) we still have an additional \(|\log \rho |\) factor. The condition of summability \(c(q-2)+q<-1\) is satisfied because \(c>\frac{q+1}{2-q}\). Summarizing, we need
First we may fix q such that
and then we can fix c slightly larger than \(\frac{q+1}{2-q}\), such that the conditions are still met. The range of allowed q depends on \(r_0\), it can never exceed \(\frac{8}{7}\); for the traditional \(r_0=2\) the upper bound is \(\frac{14}{13}\), while for \(r_0=5\) the upper bound is \(\frac{19}{17}\). \(\square \)
1.2 C.2 Exploiting the existence of a Young tower for \(T_\rho \)
Let \(({\bar{\Delta }}_\rho , T_{{\bar{\Delta }}_\rho }, \mu _{{\bar{\Delta }}_\rho })\) be the corresponding one-sided Young tower (i.e., with stable leaves quotiented out) and let \(R_{{\bar{\Delta }}_\rho }\) be the transfer operator of \(T_{{\bar{\Delta }}_\rho }\). Let \({\hat{\kappa }}_\rho \) be the version of \(\kappa _\rho \) on \({\bar{\Delta }}_\rho \). We will also use the notations \({\hat{\kappa }}_\rho ', {\hat{\kappa }}_\rho '', {\hat{\kappa }}_\rho ''', {\hat{\kappa }}_\rho ''''\) for the Young tower versions of the truncations \(\kappa _\rho ', \kappa _\rho '', \kappa _\rho ''', \kappa _\rho ''''\), respectively, hence for example \({\hat{\kappa }}_\rho '={\hat{\kappa }}_\rho \cdot {\textbf {1}}_{|{\hat{\kappa }}_\rho |\le H}\). Since \(\kappa _\rho \) is constant on stable leaves, we have for any \(j, \ell \ge 0\),
Let r be the roof function of the tower \(({\bar{\Delta }}_{\rho },\mu _{{\bar{\Delta }}_{\rho }})\). We recall that if \(d:= \gcd (r) > 1\), then for every \(\rho >0\), \(R_{{\bar{\Delta }}_\rho }\), when viewed as an operator acting on the Young Banach space \({{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\subset L^p(\mu _{{\bar{\Delta }}_\rho })\), \(p>1\), has a spectral gap (see [9, 34]). As clarified in Remarks C.4 and C.5, the decomposition of \(R_{{\bar{\Delta }}_\rho }\) we shall need in the proof below holds when \(d>1\).
Before proceeding to the proof of Proposition C.1, we recall one property of the norm the Young space \({{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\subset L^p(\mu _{{\bar{\Delta }}_\rho })\), \(p>1\) that we shall need in the proof below. (The precise definition of \({{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\) is not important in the proof below, and we omit it.) For any function that is constant on the partition elements of the Young tower, the involved seminorm is zero. This is the case for \({\hat{\kappa }}_\rho '\) and thus, \(\Vert {\hat{\kappa }}_\rho '\Vert _{{{\mathcal {B}}}_{{\bar{\Delta }}_\rho }}\le H\) (and a similar version holds for \({\hat{\kappa }}_\rho '''\)).
Proof of Proposition C.1
We first prove the statement for the case when \(\ell =0\) and point out the required modifications when \(\ell \ge 1\).
Case \({\varvec{\ell }}=\textbf{0}\). Given (61), in this case we need to show that
for some \(\rho \)-dependent constants \({\hat{\vartheta }}_\rho <1\) and \({\hat{C}}_\rho > 0\).
Throughout this proof, we let \(\kappa _\rho ', \kappa _\rho '', \kappa _\rho ''', \kappa _\rho ''''\) also denote their corresponding versions on the tower \(\Delta _\rho \) and the context in which they appear will make it clear which version we are referring to.
Write
For \(I_3(t,\rho )\) we use the exponential decay of correlation (see Remark C.4 below for the case that the roof function r of the tower has \(\gcd (r) > 1\)). This gives the only source of unknown dependence on \(\rho \) in the case \(m=0\). More precisely, for every \(\rho >0\), there exists \({\hat{\theta }}_\rho <1\) and \(C_\rho >0\) so that
Thus,
By definition,
and we note that \(J(t,\rho )\) is bounded by the sum of
and a similar term with \({\hat{H}}\) instead of H. Using the Hölder inequality (with exponents \(\frac{2}{1+\delta }\) and \(\frac{2}{1-\delta }\)), the tail behaviour of \(\kappa _\rho \) and Lemma A.5, we obtain that
Also \(\int _{{{\mathcal {M}}}_0} |\kappa _\rho '''| d\mu \le \Vert \kappa _\rho \Vert _{L^1(\mu )} \ll \rho ^{-1}\). Hence,
Finally, note that
For this \(\ell =0\) case, if we fix any \(r_0\ge 2\) (taking into account that \(\hat{H}=H^{c_0}\)), then we may bound the coefficients of \(|t|^2\) in \(|J(t,\rho )|\) from (64), \(|I_1(t,\rho )|\) and \(|I_2(t,\rho )|\) from (65), respectively by
where in the bound for \(|I_1(t,\rho )|\) the exponents of H and \(\rho \) have been slightly decreased to bound the logarithmic factors. Fixing \(c_0=\frac{11}{5}\) and \(\delta =\frac{4}{5}\), all these are dominated by \(H^{-\frac{1}{5}}\rho ^{-\frac{22}{5}}\). On the other hand the coefficient of \(|t|^2\) in \(|I_3(t,\rho )|\) is \(C_{\rho }\, {\hat{\theta }}^j_{\rho } H^{c_0+1}=C_{\rho }\, {\hat{\theta }}^j_{\rho } H^{\frac{16}{5}}\). Thus letting \(H=\left( C_{\rho }^{-1}\, {\hat{\theta }}^{-j}_{\rho } \rho ^{-\frac{22}{5}}\right) ^{\frac{5}{17}}\) we conclude that all terms are dominated by
Case \(\ell \ge 1\). The main differences in this case come down to dealing with integrals containing unbounded terms \(\kappa _\rho ''\) and \(\kappa _\rho ''''\) in such a way that can gain exponential decay in \(\ell \) and then proceed as in the case \(\ell =0\) treated above. To do this, we exploit that \({{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\subset L^p(\mu _{{\bar{\Delta }}_\rho })\).
Using (61), we need to estimate
By Remark C.4, for every \(\rho >0\) and for every \(\ell \ge 1\),
for some \(\rho \)-dependent \(C_\rho \) and \({\hat{\theta }}_{\rho }<1\). This is the first source of unknown dependence on \(\rho \). Since \({{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\subset L^p(\mu _{{\bar{\Delta }}_\rho })\),
for some \(\rho \)-dependent \(C_\rho ^0\). This is the second source of unknown dependence on \(\rho \).
With these specified, we can write
Rearranging as in the case \(\ell =0\),
Let \(q\in (1,\frac{8}{7}-\frac{6}{7r_0-1})\) so that Lemma C.2 holds. By the Hölder inequality with \(\frac{1}{p}+\frac{1}{q}=1\) and (68),
Similar to estimating (65), using Lemma C.2 and Remark C.3 and without trying for optimal bounds, we can pick q close to 1 and \(c_0 < \frac{5}{2}\) such that \(c_0(q-2)+q+1 = -\frac{1}{5}\). For these values,
Next, let
and note that
By the exponential decay of correlations as in (63) as well as (68):
where as before \(c_0 < \frac{5}{2}\). Finally, by the equation before (64), we have
A similar argument applies to \(L_3(t,\rho )\).
The conclusion follows with a similar choice of H as in the case \(\ell =0\) treated above. \(\square \)
Remark C.4
Let r be the roof function of the one-sided tower map \(({\bar{\Delta }}_{\rho },\mu _{{\bar{\Delta }}_{\rho }})\). If \(d:= \gcd (r) > 1\), then \(T_{{\bar{\Delta }}}\) is not mixing on the Banach space \({{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\). However, the underlying billiard map \(T_{\rho }\) is mixing and thus,
for \(\phi , \psi \in {{\mathcal {B}}}\) with \(\int _{{{\mathcal {M}}}} \phi \, d\mu = 0\). If \(\gcd (r) = d > 1\), then the eigenvalues on the unit circle are the d-th roots of unity. Hence,
where \(\Pi _\lambda \) denotes the projection on the (generalised) eigenspace \( {{\mathcal {B}}}_{{\bar{\Delta }}_\rho ,\lambda }\) of eigenvalue \(\lambda \), and \(Q_{{\bar{\Delta }}_\rho }\) is the complementary projection. The Banach space \({{\mathcal {B}}}_{{\bar{\Delta }}_{\rho }}\) on \({\bar{\Delta }}_{\rho }\) can be written as the direct sum
As the kernels of projections, \({{\mathcal {B}}}_{\Pi }\) and \({{\mathcal {B}}}_Q\) are closed \(R_{{\bar{\Delta }}_{\rho }}\)-invariant subspaces of \({{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\), and hence Banach spaces themselves. Also, as clarified below, for every \(\rho >0\), the restriction \(R_{{\bar{\Delta }}_{\rho }}\) to \({{\mathcal {B}}}_{Q}\) has spectral radius less than 1. That is, for every \(\rho >0\), there exists \({\hat{\theta }}_\rho < 1\) so that
Consider the lifted version of \(\phi \): \(\phi _{{\bar{\Delta }}_\rho }(x) = \int _{\ell (x)} \phi \circ \pi \, d\mu _{\Delta _\rho ,\ell (x)}\) where \(\ell (x)\) is the stable leaf through \(x \in {\bar{\Delta }}\) and \(\mu _{\Delta _\rho ,\ell (x)}\) the measure on this leaf emerging from the disintegration of the measure \(\mu _{\Delta _\rho }\) of the two-sided tower. The transfer operator \(R_{{\bar{\Delta }}_\rho }\) on the one-sided tower satisfies
If \(\Pi _{{\bar{\Delta }}_\rho } \phi _{{\bar{\Delta }}_\rho } \ne 0\), then there exists \(\psi _{{\bar{\Delta }}_\rho } \in {{\mathcal {B}}}_{{\bar{\Delta }}_\rho }\) such that \(\int _{{{\mathcal {M}}}} R^n_{{\bar{\Delta }}_\rho } \phi _{{\bar{\Delta }}_\rho } \cdot \psi _{\Delta _\rho } \, d\mu _{\Delta _\rho } \not \rightarrow 0\). (In fact, taking \(\psi _{{\bar{\Delta }}_\rho } = \Pi _{{\bar{\Delta }}_\rho } \phi _{{\bar{\Delta }}_\rho }\), we get \(\int _{{{\mathcal {M}}}} R^{dn}_{{\bar{\Delta }}_\rho } \phi _{{\bar{\Delta }}_\rho } \cdot \psi _{{\bar{\Delta }}_\rho } \, d\mu _{{\bar{\Delta }}_\rho } \rightarrow \int _{{{\mathcal {M}}}} \Pi _{{\bar{\Delta }}_\rho } \phi _{{\bar{\Delta }}_\rho } \overline{\Pi _{{\bar{\Delta }}_\rho } \phi _{{\bar{\Delta }}_\rho }} \, d\mu _{{\bar{\Delta }}_\rho } \ne 0\).) This contradicts (70) and/or (73). Hence \(\phi _{{\bar{\Delta }}_\rho } \in {{\mathcal {B}}}_{Q}\) and, for the operator norm \(\Vert R^n_{{\bar{\Delta }}_\rho } \phi _{{\bar{\Delta }}_\rho } \Vert _{{{\mathcal {B}}}_Q} \le \Vert \, R_{{\bar{\Delta }}_{\rho }}|_{{{\mathcal {B}}}_Q}\, \Vert ^n \, \Vert \phi _{{\bar{\Delta }}_\rho }\Vert _{{{\mathcal {B}}}_Q} \ll {\hat{\theta }}_\rho ^n\, \Vert \phi _{{\bar{\Delta }}_\rho }\Vert _{{{\mathcal {B}}}_Q}\). Property (72) follows.
Remark C.5
We note that mixing of the underlying map \(T_\rho \) is not required for a useful version of (67) to hold. Indeed the property of \(Q_{{\bar{\Delta }}_\rho }\) in (67) holds independently of mixing and for this we just need to work with (72), which holds for \(d>1\). The downside of using (72) directly is that in assumption (37) we would have to extract \(\sum _{\lambda ^d=1} \lambda \Pi _\lambda (e^{it\kappa _\rho }-1)\) instead of \(\int _{{{\mathcal {M}}}} (e^{it\kappa _\rho }-1)\, d\mu \). We found it more convenient to work with the ’clean’ assumption (37).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bálint, P., Bruin, H. & Terhesiu, D. Periodic Lorentz gas with small scatterers. Probab. Theory Relat. Fields 186, 159–219 (2023). https://doi.org/10.1007/s00440-023-01197-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00440-023-01197-6
Keywords
- Lorentz gas
- Small scatterers
- Billiards
- Limit theorems
- Nagaev–Guivarc’h method
Mathematics Subject Classification
- Primary: 37D50
- Secondary 37A60
- 60F05
- 60F17
- 82C05
- 82C40