Abstract
We study the macroscopic transport properties of the quantum Lorentz gas in a crystal with short-range potentials, and show that in the Boltzmann–Grad limit the quantum dynamics converges to a random flight process which is not compatible with the linear Boltzmann equation. Our derivation relies on a hypothesis concerning the statistical distribution of lattice points in thin domains, which is closely related to the Berry–Tabor conjecture in quantum chaos.
Similar content being viewed by others
1 Introduction
In 1905, Lorentz [33] introduced a kinetic model for electron transport in metals, which he argued should in the limit of low scatterer density be described by the linear Boltzmann equation. Although Lorentz’ paper predates the discovery of quantum mechanics, the Lorentz gas has since served as a fundamental model for chaotic transport in both the classical and quantum setting, with applications to radiative transfer, neutron transport, semiconductor physics, and other models of transport in low-density matter. There has been significant progress in the derivation of the linear Boltzmann equation from first principles in the case of classical transport, starting from the pioneering works [12, 25, 46] for random scatterer configurations, to the more recent derivation of new, generalised kinetic transport equations that highlight the limited validity of the Boltzmann equation for periodic [13, 39] and other aperiodic scatterer configurations [40].
In the quantum setting, the only complete derivation of the linear Boltzmann equation in the low-density limit is for random scatterer configurations [19], which followed analogous results in the weak-coupling limit [20, 45]. The theory of quantum transport in periodic potentials on the other hand is a well developed theory of condensed matter physics. The general consensus is transport in periodic potentials without the presence of any disorder is ballistic, that is particles move almost freely, with minimal interaction with the scatterers; there is no diffusion. In this paper we propose that this picture changes in the low-density limit, where under suitable rescaling of space and time units the quantum dynamics is asymptotically described by a random flight process with strong scattering, similar to the setting of random potentials in the work of Eng and Erdös [19]. Our work is motivated by Castella’s important studies [14,15,16] of both the weak-coupling and low-density limits for periodic potentials in the case of zero Bloch vector. Castella shows that the weak-coupling limit gives rise to a linear Boltzmann equation with memory [14]. The low-density limit on the other hand diverges [17], and only the introduction of physically motivated off-diagonal damping terms leads to a limit, which for small damping is compatible with the linear Boltzmann equation. As we will show here, the case of random or generic Bloch vector does not diverge in the Boltzmann–Grad limit, without any requirement for damping, but the limit process differs significantly from that described by the linear Boltzmann equation. Our results complement our recent paper [28] which establishes convergence rigorously up to second order in perturbation theory. The aim of the present paper is thus to give a derivation of all higher order terms and identify the full random flight process, conditional on an assumption on the distribution of lattice points in a particular scaling limit. A rigorous verification of this hypotheses seems currently out of reach.
We will here focus on the case when the particle wavelength h is comparable to the potential range r, and much smaller than the fundamental cell of the lattice. This choice of scaling means that a wave-packet will evolve semiclassically far away from the scatterers, but that any interaction with the potential is truly quantum. This is a scaling not traditionally discussed in homogenisation theory in which one usually assumes the characteristic wavelength is either much larger than the period (low-frequency homogenisation) or of the same or smaller order (high-frequency homogenisation); see for example [4,5,6, 8, 18, 26, 27, 30, 42, 43]. Our scaling is also different from that leading to the classic point scatterer (or s-wave scatterer, Fermi pseudo-potential), where the potential scale r is taken to zero with an appropriate renormalisation of the potential strength. In contrast to the setting of smooth finite-range potentials discussed in the present paper, periodic (and other) superpositions of point scatterers are exactly solvable [1,2,3, 24, 29, 31].
Our set-up is as follows. We assume throughout that the space dimension d is three or higher. Consider the Schrödinger equation
where
\(\Delta \) is the standard d-dimensional Laplacian and \({\text {Op}}(V)\) denotes multiplication by the \({{\mathcal {L}}}\)-periodic potential
with W the single-site potential, scaled by \(r>0\), and \({{\mathcal {L}}}\) a full-rank Euclidean lattice in \({{\mathbb {R}}}^d\). We re-scale space units by a constant factor so that the co-volume of \({{\mathcal {L}}}\) (i.e. the volume of its fundamental cell) is one. (One example to keep in mind is the cubic lattice \({{\mathcal {L}}}= {{\mathbb {Z}}}^d\).) The coupling constant \(\lambda >0\) will remain fixed throughout, and h is a scaling parameter which measures the characteristic wave lengths of the quantum particle. We will assume throughout that h is comparable with the potential scaling r.
We assume that W is in the Schwartz class \({{\mathcal {S}}}({{\mathbb {R}}}^d)\) and real-valued. This short-range assumption on the potential is key—a small wavepacket moving through this potential should experience long stretches of almost free evolution, followed by occasional interactions with localised scatterers. One could also consider adding an external potential living on the macroscopic scale although we will not pursue that idea here.
We denote by
the single-scatterer Hamiltonian for the unscaled potential W with \(h=1\). Its resolvent is denoted by \(G_\lambda (E)=(E-H_\lambda ^{\text {loc}})^{-1}\), and the corresponding T-operator is defined as
Rather than consider solutions of the Schrödinger equation directly, we instead consider the time evolution of a quantum observable A given by the Heisenberg evolution
Here \(U_{h,\lambda }(t)= \mathrm {e}^{-\frac{2\pi \mathrm {i}}{h} H_{h,\lambda } t}\) is the propagator corresponding to the Hamiltonian \(H_{h,\lambda }\).
Let us now take an observable \(A={\text {Op}}(a)\) given by the quantisation of a classical phase-space density \(a=a({{\varvec{x}}},{{\varvec{y}}})\), with \({{\varvec{x}}}\) denoting particle position and \({{\varvec{y}}}\) momentum. The question now is whether, in the low density limit \(r= h\rightarrow 0\) and with the appropriate rescaling of length and time units, the phase-space density of the time-evolved quantum observable A(t) (i.e. its principal symbol) can be described asymptotically by a function \(f(t,{{\varvec{x}}},{{\varvec{y}}})\) governed by a random flight process. Eng and Erdös [19] confirmed this in the case of random scatterer configurations, and established that the density \(f(t,{{\varvec{x}}},{{\varvec{y}}}\)) is a solution of the linear Boltzmann equation,
where \(\Sigma ({{\varvec{y}}},{{\varvec{y}}}')\) is the collision kernel of the single site potential \(W({{\varvec{x}}})\) and \({{\varvec{y}}}'\) and \({{\varvec{y}}}\) denote the incoming and outgoing momenta, respectively. The collision kernel is given by the formula
where the T-matrix \(T({{\varvec{y}}},{{\varvec{y}}}')\) is the kernel of T(E) in momentum representation, with \(E=\tfrac{1}{2} \Vert {{\varvec{y}}}\Vert ^2\) (“on-shell”). The total scattering cross section is defined as
Solutions of the linear Boltzmann equation can be written in terms of the collision series
with the zero-collision term
and the \((k-1)\)-collision term
with
The product form of the density \(\rho _{\mathrm {LB}}^{(k)}\) shows that the corresponding random flight process is Markovian, and describes a particle moving along a random piecewise linear curve with momenta \({{\varvec{y}}}_i\) and exponentially distributed flight times \(u_i\).
The principal result of this paper is that, for periodic potentials of the form (1.3) and using the same scaling as in the random setting [19], there exists a limiting random flight process describing macroscopic transport. The derivation requires a hypothesis on the fine-scale distribution of lattice points which is discussed in detail as Assumption 1 in Sect. 6.
Theorem 1
(Main result) Under Assumption 1 on the distribution of lattice points, there exists an evolution operator L(t), distinct from that of the linear Boltzmann equation, such that for any \(a\in {{\mathcal {S}}}({{\mathbb {R}}}^d\times {{\mathbb {R}}}^d)\) we have
where \({\text {Op}}_{r,h}(a)\) is the Weyl quantisation of the phase-space symbol a in the Boltzmann–Grad scaling and \(\Vert \cdot \Vert _{{{\text {HS}}}}\) is the Hilbert–Schmidt norm.
The precise scaling of the quantum observable \({\text {Op}}_{r,h}(a)\) is explained in Sect. 2. The limiting evolution operator L(t) is given by the series
where \(f^{(k)}\) coincides with \(f_{\mathrm {LB}}^{(k)}\) for \(k=1\),
but deviates significantly at higher order. For \(k\ge 2\), the \((k-1)\)-collision term is given by
with the collision densities
Here
and \(g_{\ell m}^{(k)}\) are the coefficients of the matrix valued function
where \({{\mathbb {D}}}({{\varvec{z}}})={\text {diag}}(z_1,\ldots ,z_k)\) and \({{\mathbb {W}}}= {{\mathbb {W}}}({{\varvec{y}}}_1,\dots ,{{\varvec{y}}}_k)\) with entries
The paths of integration in (1.20) are circles around the origin with radius strictly greater than \(r_0= k \max |w_{ij}|\). The matrix \({{\mathbb {G}}}^{(k)}({{\varvec{u}}},{{\varvec{y}}}_1,\dots ,{{\varvec{y}}}_k)\) is in fact the derivative \(\partial _{u_1}\cdots \partial _{u_k}\) of the Borel transform of the function \(F({{\varvec{u}}})=({{\mathbb {D}}}({{\varvec{u}}})^{-1} - {{\mathbb {W}}})^{-1}\). We furthermore note that the above formulas are independent of the choice of scatterer configuration \({{\mathcal {L}}}\), as in the classical setting. For the one-collision terms we will furthermore derive the following explicit representation in terms of the Lorentz–Boltzmann density (1.13) and J-Bessel functions,
and
The remaining matrix elements can be computed via the identities
A notable difference with the solution (1.12) to the linear Boltzmann equation is that in (1.17) there is a non-zero probability that the final momentum \({{\varvec{y}}}_\ell \) is equal to the initial momentum \({{\varvec{y}}}_m\).
The paper is organised as follows. We will first explain in Sect. 2 the precise scaling needed to observe our limiting process and state the main result. In Sect. 3 we recall the well-known Floquet–Bloch decomposition for periodic potentials and in Sect. 4 we recall an explicit formula for the T-operator in our specific setting. Section 5 explains the perturbative approach to calculate the series expansion for the time evolution of A(t). This is followed by a discussion of the main hypothesis in this study in Sect. 6 which in brief can be viewed as a phase-space generalisation of the Berry–Tabor conjecture for the statistics of quantum energy levels for integrable systems [7, 37]. In Sect. 7 we provide an explicit computation of terms appearing in the formal series, and in Sect. 8 we prove that the series is absolutely convergent provided \(\lambda \) is small enough. In Sect. 9 we take the low-density limit using the formulas from Sect. 7 and show how the limiting object can be written in terms of the T-operator described in Sect. 4. In Sect. 11 we establish positivity of this limiting expansion and derive the formulas for (1.17). A key observation is that the one-collision term is distinctly different from the corresponding term for the linear Boltzmann equation. We conclude the paper with a discussion and outlook in Sect. 12. The appendix provides detailed background of the combinatorial structures used in this paper.
2 Microlocal Boltzmann–Grad Scaling
The phase space of the underlying classical Hamiltonian dynamics is \({\text {T}}({{\mathbb {R}}}^d)={{\mathbb {R}}}^d\times {{\mathbb {R}}}^d\), where the first component parametrises the position \({{\varvec{x}}}\) and the second the momentum \({{\varvec{y}}}\) of the particle. Given a function \(a:{\text {T}}({{\mathbb {R}}}^d)\rightarrow {{\mathbb {R}}}\), we associate with it the observable \(A={\text {Op}}(a)\) acting on functions \(f\in {{\mathcal {S}}}({{\mathbb {R}}}^d)\) through the Weyl quantisation
where we have used the shorthand \(\mathrm {e}(z)=\mathrm {e}^{2\pi \mathrm {i}z}\). The Weyl quantisation is useful to capture the phase space distribution of quantum states. In the case of free quantum dynamics, with \(\lambda =0\) and \(h=1\), we have for example the well-known quantum-classical correspondence principle
with the classical free evolution \([L_0(t) a]({{\varvec{x}}},{{\varvec{y}}})=a({{\varvec{x}}}-t{{\varvec{y}}},{{\varvec{y}}})\). It is convenient to incorporate the scaling parameter \(h>0\) in (1.2) by setting
with \(\mathrm {e}_h(z)=\mathrm {e}^{\tfrac{2\pi \mathrm {i}}{h} z}\). Note that we have \({\text {Op}}_h(a)={\text {Op}}(D_{1,h} a)\) for \(D_{1,h} a({{\varvec{x}}},{{\varvec{y}}}) = h^{d/2} \, a({{\varvec{x}}}, h {{\varvec{y}}})\). We refer to \(D_{1,h}\) as the microlocal scaling. In particular, (2.2) becomes
The mean free path length of a particle travelling in a potential of the form (1.3) is asymptotic (for r small) to the inverse total scattering cross section of the single-site potential W [35, 40]; the total scattering cross section in turn equals \(r^{d-1}\), up to constants. In the low-density it is natural to measure length units in terms of the mean free path lengths or, equivalently, in units of \(r^{1-d}\). We refer to the corresponding scaling \(D_{r,1}\) defined by \(D_{r,1} a({{\varvec{x}}},{{\varvec{y}}}) = r^{d(d-1)/2} \, a( r^{d-1} {{\varvec{x}}}, {{\varvec{y}}})\) as the Boltzmann–Grad scaling, and the combined scaling
as the microlocal Boltzmann–Grad scaling. We define the corresponding scaled Weyl quantisation by \({\text {Op}}_{r,h}={\text {Op}}\circ D_{r,h}\). The quantum-classical correspondence (2.2) for the free dynamics reads in this scaling
The key point here is that we require an extra scaling in time relative to the mean free path. The challenge for the present study is thus to understand the asymptotics of \(A(t r^{1-d})\) as in (1.6), for every fixed \(t>0\), with initial data \(A={\text {Op}}_{r,h}(a)\) and a in the Schwartz class \({{\mathcal {S}}}({\text {T}}({{\mathbb {R}}}^d))\) (i.e. a is infinitely differentiable and all its derivatives decay rapidly as \(\Vert {{\varvec{x}}}\Vert ,\Vert {{\varvec{y}}}\Vert \rightarrow \infty \)). The question is, more precisely, whether there is a family of linear operators L(t) so that
in the Hilbert–Schmidt norm, defined as
To understand (2.7), it is sufficient to establish the convergence of
with A(t) as in (1.6), \(A={\text {Op}}_{r,h}(a)\), \(B={\text {Op}}_{r,h}(b)\), and \(a,b\in {{\mathcal {S}}}({\text {T}}({{\mathbb {R}}}^d))\). The inner product on the right hand side of (2.9) is defined by
We direct the reader towards [28, Appendix A] for an explanation of how (2.9) can be reformulated as a statement about solutions of the Schrödinger equation. As mentioned previously, we will here restrict our attention to the case when r is of the same order of magnitude as h, i.e. \(r=h\, c_0\) for fixed effective scattering radius \(c_0\). By adjusting W, we may in fact assume without loss of generality that \(c_0=1\). This is precisely the scaling used in [19] for the case of random potentials, although in a slightly different formulation in terms of Husimi functions for the phase-space presentation of quantum states.
3 Floquet–Bloch Decomposition
Floquet–Bloch theory allows us to reduce the quantum evolution in periodic potentials to invariant Hilbert spaces \({{\mathcal {H}}}_{{\varvec{\alpha }}}\) of quasiperiodic functions \(\psi \), satisfying
for all \({{\varvec{b}}}\in {{\mathcal {L}}}\) where \({{\varvec{\alpha }}}\in {{\mathbb {T}}}^*={{\mathbb {R}}}^d/{{\mathcal {L}}}^*\) is the quasimomentum and
is the dual (or reciprocal) lattice of \({{\mathcal {L}}}\). We denote by \({{\mathcal {H}}}_{{\varvec{\alpha }}}\) the Hilbert space of such functions that have finite \({\text {L}}^2\)-norm with respect to the inner product
with \({{\mathbb {T}}}={{\mathbb {R}}}^d/{{\mathcal {L}}}\). We define the corresponding Hilbert–Schmidt product for linear operators on \({{\mathcal {H}}}_{{\varvec{\alpha }}}\) by
For a given quasi-momentum \({{\varvec{\alpha }}}\in {{\mathbb {T}}}^*\), consider the Bloch functions
and define the Bloch projection \(\Pi _{{\varvec{\alpha }}}: {{\mathcal {S}}}({{\mathbb {R}}}^d) \rightarrow {{\mathcal {H}}}_{{\varvec{\alpha }}}\) by
with inner product
Note that, by Poisson summation,
and hence that by integrating over \({{\varvec{\alpha }}}\in {{\mathbb {T}}}^*\) one regains \(f({{\varvec{x}}})\). The kernel of \(\Pi _{{\varvec{\alpha }}}\) is thus
Instead of (2.9) the plan is now to consider the convergence
for typical \({{\varvec{\alpha }}}\). The advantage is that we are working in a Hilbert space with discrete basis. One can then obtain information on (2.9) by integrating over \({{\varvec{\alpha }}}\). In fact we will argue that the right hand side of (3.10) is independent of \({{\varvec{\alpha }}}\) for almost every \({{\varvec{\alpha }}}\).
4 The T-operator for a Single Scatterer
Recall from (1.5) that the T-operator for the single scatterer potential W is defined by
in the half-plane \({\text {Im}}E>0\) where the resolvent \(G_\lambda (E)\) is resonance-free, and then extended by analytic continuation. The Born series for \(G_\lambda (E)\) leads to the formal series expansion
Using \(\varphi _{{\varvec{y}}}({{\varvec{x}}})=\mathrm {e}({{\varvec{y}}}\cdot {{\varvec{x}}})\) as a basis for the momentum representation, the free resolvent \(G_0(E)\) has the kernel
and similarly \({\text {Op}}(W)\) has kernel \(\langle \varphi _{{\varvec{y}}}, {\text {Op}}(W) \varphi _{{{\varvec{y}}}'}\rangle ={\hat{W}}({{\varvec{y}}}-{{\varvec{y}}}')\). The T-matrix is defined as the kernel of T(E) in momentum representation, i.e.,
It will be convenient to set \(E=\tfrac{1}{2}\Vert {{\varvec{y}}}\Vert ^2+\mathrm {i}\gamma \), with \({\text {Re}}\gamma \ge 0\), and define
The corresponding perturbation series is
where the \(T_n^\gamma \) are defined by
and for \(n\ge 2\)
The analytic continuation of \(T^\gamma \) from \({\text {Re}}\,\gamma >0\) to the boundary \({\text {Re}}\,\gamma =0\) is obtained via the integral representation
The choice \(\gamma =0\) is referred to as “on-shell”. We drop the superscript if \(\gamma =0\), i.e., \(T({{\varvec{y}}},{{\varvec{y}}}')=T^0({{\varvec{y}}},{{\varvec{y}}}')\), \(T_n({{\varvec{y}}},{{\varvec{y}}}')=T_n^0({{\varvec{y}}},{{\varvec{y}}}')\). The T-matrix \(T({{\varvec{y}}},{{\varvec{y}}}')\) is then related to the on-shell scattering matrix via
The unitarity of the S-matrix is equivalent to the relation, for \(\Vert {{\varvec{y}}}\Vert =\Vert {{\varvec{y}}}'\Vert \),
This in particular implies the optical theorem
We will now prove that the integrals defining (4.9) converge uniformly in \({\text {Re}}\gamma \ge 0\) in dimensions \(d > 2\). For \(f \in {{\mathcal {S}}}({{\mathbb {R}}}^{dk})\) and \(S \subset \{1,\dots ,k\}\) we denote by \(f_S \in {{\mathcal {S}}}({{\mathbb {R}}}^{dk})\) the partial inverse Fourier transform of f in the variables \({{\varvec{y}}}_i\) for \(i \in S\):
We use the notation \(\langle x \rangle = \sqrt{1+x^2}\) and \(\langle {{\varvec{x}}}\rangle = \sqrt{1+\Vert {{\varvec{x}}}\Vert ^2}\).
Lemma 1
Let \(f \in {{\mathcal {S}}}({{\mathbb {R}}}^{dk})\). Then,
Proof
We partition \({{\mathbb {R}}}^k\) into \(2^k\) regions according to whether \(|\theta _i| \le 1\) or \(|\theta _i|>1\). Take \(S \subset \{1,\dots ,k\}\) and assume that for \(i \in S\), \(|\theta _i|>1\) and for \(i \notin S\), \(|\theta _i| \le 1\). We have that
Therefore, using the identity
we obtain
Taking a supremum over all \(2^k\) regions we see
The result then follows since
and \(\max \{1,x\} \ge \tfrac{1}{\sqrt{2}} \langle x \rangle \). \(\square \)
Let us apply this Lemma in our situation, in particular to the inner integral in (4.9). For multi-indices \({{\varvec{\alpha }}}, {{\varvec{\beta }}}\) we define
and the norm
Proposition 1
There exists a constant \(C_d\) depending only on the dimension d such that for all \({{\varvec{y}}}_0,{{\varvec{y}}}_n \in {{\mathbb {R}}}^d\), \(W\in {{\mathcal {S}}}({{\mathbb {R}}}^d)\), \({{\varvec{\theta }}}\in {{\mathbb {R}}}^{n-1}\), \({\text {Re}}\,\gamma \ge 0\),
Proof
The exponentially decaying factors can be pulled outside immediately. We then want to apply Lemma 1. Let \(S = \{s_1,\dots ,s_k\} \subset \{1,\dots ,n-1\}\) and consider the norm \( \left\| f_S\right\| _{L^1} \) where \(f({{\varvec{y}}}_1,\dots ,{{\varvec{y}}}_{n-1}):= {\hat{W}}({{\varvec{y}}}_0-{{\varvec{y}}}_1) \cdots {\hat{W}}({{\varvec{y}}}_{n-1}-{{\varvec{y}}}_n)\) with \({{\varvec{y}}}_0\) and \({{\varvec{y}}}_n\) constant. By definition we have that
Let \({{\varvec{m}}}= (m_1,\dots ,m_d) \in {{\mathbb {Z}}}_{\ge 0}^{d}\) and put \(|{{\varvec{m}}}| = m_1 +\cdots +m_d\). We define the multinomial coefficient
and use that
to bound (4.22) above by
Integrating by parts with respect to \({{\varvec{y}}}_{i}\) for \(i \in S\) and pulling the absolute value inside the integral yields the upper bound
Using the triangle inequality, the \({{\varvec{y}}}_i\) integral can be bounded above by a sum of \(\prod _{i\in S} 2^{|{{\varvec{m}}}_i|}\) terms of the form
where each \(\varphi _i\) is a derivative of \({\hat{W}}\) of order \(\le 2d+2\). Pulling absolute values inside tells us that (4.26) is bounded above by
We have that
For the \(L^1\) norms we similarly have
Note that \(D_{{\varvec{x}}}^{{\varvec{m}}}{{\varvec{x}}}^{{\varvec{\alpha }}}W({{\varvec{x}}})\) yields a sum of up to \(2^{|{{\varvec{m}}}|}\) terms each of which is of the form \({{\varvec{x}}}^{{\tilde{{{\varvec{\alpha }}}}}} D_{{{\varvec{x}}}}^{{\tilde{{{\varvec{m}}}}}} W({{\varvec{x}}})\) where \(|{\tilde{{{\varvec{\alpha }}}}}|<|{{\varvec{\alpha }}}|\) and \(|{\tilde{{{\varvec{m}}}}}| < |{{\varvec{m}}}|\). Hence, using the identity
we obtain
Finally then, combining these bounds with (4.25) and applying (4.30) we obtain
\(\square \)
This proposition shows that
for all \({\text {Re}}\gamma \ge 0\), provided \(d >2 \). Therefore, the series (4.6) converges absolutely, uniformly for all \({\text {Re}}\gamma \ge 0\), if
5 The Perturbation Series
We set \(H_\lambda =H_{1,\lambda }\), \(U_\lambda (t)=U_{1,\lambda }(t)\), and note that
The first term in (2.7) thus can be written
which is the quantity of interest for this work. To simplify notation, we will write in the upcoming discussion \(\lambda \) for \(\lambda /h^2\) and t for \(t h r^{1-d}\) and later re-substitute when taking limits. Furthermore, we can declutter our expressions by passing to the so-called interaction picture,
After the relevant calculations we then simply replace a by \(L_0(t) a\) due to (2.4). Because of the gauge invariance of A(t) in (1.6) under the substitution \(H_{h,\lambda }\mapsto H_{h,\lambda }+E\) for any \(E\in {{\mathbb {R}}}\), we may replace the potential V by \(V-\int _{{\mathcal {L}}}V({{\varvec{x}}})\mathrm {d}{{\varvec{x}}}\) in the following. This means that the potential now has the Fourier series
with \({\hat{W}}({{\varvec{y}}}) = \int W({{\varvec{x}}}) e(-{{\varvec{x}}}\cdot {{\varvec{y}}})\,\mathrm {d}{{\varvec{y}}}\). Thus we may ignore in the following expansions all terms with \({\hat{W}}({{\varvec{0}}})\); but note that we have not assumed here that \({\hat{W}}({{\varvec{0}}})=0\).
We proceed using Duhamel’s principle. In particular one has that
Iterating this expression yields a formal perturbative expansion for \(U_\lambda (t) U_0(-t)\) and \(U_0(t) U_{\lambda }(-t)\). After multiplying these two series together one obtains as in [28, §5] the formal perturbative expansion
with \({{\mathcal {I}}}_0(t)= {\text {Tr}}[ \Pi _{{\varvec{\alpha }}}{\text {Op}}_{r,h}( a) {\text {Op}}_{r,h}(b) ]\) and for \(n\ge 1\)
For \(1\le \ell \le n-1\) we have, with the shorthand \({{\mathcal {P}}}={{\mathcal {L}}}^*+{{\varvec{\alpha }}}\),
where
and the summation “non-consec” is restricted to terms with \({{\varvec{p}}}_j\ne {{\varvec{p}}}_{j+1}\); recall the comment after (5.2). We make the variable substitutions \(u_0= s_1\), \(u_j = s_{j+1}-s_j\) for \(j = 1,\ldots , \ell -1\), and \(u_j = s_j - s_{j+1}\) for \(j = \ell +1,\ldots , n-1\) and \(u_n = s_n\). Let \(\Box _{\ell ,n}(t)\) denote the simplex
and let \(\mathrm {d}^\perp {{\varvec{u}}}=\prod _{j\ne \ell } \mathrm {d}u_j\). Then
The terms \({{\mathcal {I}}}_{0,n}(t)\) and \({{\mathcal {I}}}_{n,n}(t)\) have an analogous representation.
As in the case of the T-matrix, it will be useful to embed these quantities in an analytic family by extending to complex energy. To this end, we define for \({\text {Re}}\gamma \ge 0\),
In the following, we will drop the superscript \(\gamma \) in the case \(\gamma =0\).
We wish to consider the limit of this quantity as \(h=r \rightarrow 0\), uniformly for \({\text {Re}}\gamma \ge 0\). The first simplification we make is to replace the second argument of \({\tilde{a}}\) and \({\tilde{b}}\) by \(h {{\varvec{p}}}_\ell \) and \(h {{\varvec{p}}}_n\) respectively which incurs an error of order \(r^d\). Recall now that, in view of (2.9) and (5.1), we are interested in the quantity \(h^{-2n}{{\mathcal {I}}}_{\ell ,n}^\gamma (t h r^{1-d})\). Since \(h^{-2n} r^{nd} h^d (h r^{1-d})^n = h^{d-n} r^n = r^d\) we see for \(h=r\)
where
Using the definition of \({\tilde{a}}\) and \({\tilde{b}}\) and integrating over \({{\varvec{\eta }}}\) yields
where
We recall here that we are working in the interaction picture. To return to the original lab frame, we replace a by the evolved symbol (i.e., replacing \({{\varvec{x}}}\) by \({{\varvec{x}}}-t{{\varvec{y}}}_\ell \)), so that
becomes
One can interpret this as corresponding to a classical trajectory in which the particle initially has momentum \({{\varvec{y}}}_\ell \), undergoes straight line motion for time \((t-u_{\ell +1}-\cdots -u_n)\), and then experiences \(n-\ell \) collisions separated by straight line motion for times \(u_j\) with momenta \({{\varvec{y}}}_{j}\) for \(j= \ell +1,\dots ,n\).
6 The Poisson Model
We note that the momenta \({{\varvec{p}}}_j\) in the summation (5.11) are of order \(r^{-1}\), and that for \(r\rightarrow 0\)
where \({{\mathcal {B}}}_1^d\) is the d-dimensional unit ball. This means that the average spacing between consecutive values of the set \(\{ \Vert {{\varvec{p}}}\Vert ^2 \le r^{-2}:{{\varvec{p}}}\in {{\mathcal {P}}}\}\) is of the order \(r^{d-2}\). Thus (5.11) measures correlations between the \(\Vert {{\varvec{p}}}_j\Vert ^2\) precisely on the scale of the mean spacing. Starting with the influential work of Berry and Tabor in the context of quantum chaos [7], it has been conjectured that the statistics on this correlation scale should be governed by a one-dimensional Poisson process. Rigorous results towards a proof of this conjecture are mostly limited to two-point statistics, where the problem reduces to a variant of quantitative versions of the Oppenheim conjecture [9, 23, 34, 36, 38, 44]; results on higher correlation functions are obtained in [47].
Assumption 1
We assume in the following that in the \(r=h\rightarrow 0\) asymptotics of (5.11) the lattice \({{\mathcal {P}}}= {{\mathcal {L}}}^* + {{\varvec{\alpha }}}\) with \({{\mathcal {L}}}\) fixed (arbitrary) and \({{\varvec{\alpha }}}\) random can be replaced by a Poisson process in \({{\mathbb {R}}}^d\) with unit intensity.
This assumption should be thought of as a generalisation of the Berry–Tabor conjecture on the Poisson distribution of energy levels of quantum systems with integrable classical Hamiltonian. We assume both that the lengths of lattice vectors (which represent the energy levels) behave as if they belonged to a Poisson process, and that the angular distribution of the lattice vectors is uniform on the \((d-1)\)-sphere and independent of the length (on the correct scale).
Assumptions of this kind have previously been used in modeling spectral correlations of diffractive systems, see for instance [10, 11, 32]. To formulate Assumption 1 in precise terms, define \({{\mathcal {J}}}_{\ell ,n}^\gamma (t)\) via the relation
where \({{\mathcal {P}}}\) is a Poisson point process in \({{\mathbb {R}}}^d\) with intensity one, and \({{\mathbb {E}}}\) denotes expectation. Then Assumption 1 should be understood as
for all \({\text {Re}}\,\gamma \ge 0\), \(t>0\). Similarly, it is a conjecture that
for Lebesgue almost every \({{\varvec{\alpha }}}\), and indeed for \({{\varvec{\alpha }}}\) satisfying a mild diophantine condition as in [36, 38]. Statement (6.4) is more subtle than (6.3), though the implication (6.4) \(\Rightarrow \) (6.3) would require uniform upper bounds for dominated convergence; cf. [28, Sect. 12]. Statement (6.3) is the only heuristic assumption made in this study.
As an illustrative example, fix \({{\varvec{\alpha }}}=(\sqrt{2},\sqrt{3})\) and consider the sequence \((\lambda _i, \theta _i)_{i \in {{\mathbb {N}}}}\) of elements of the set \(\{( \pi \Vert {{\varvec{n}}}+{{\varvec{\alpha }}}\Vert ^2, \frac{1}{2\pi } \arg ({{\varvec{n}}}+{{\varvec{\alpha }}}) ) \in {{\mathbb {R}}}_{\ge 0} \times [0,1) \mid {{\varvec{n}}}\in {{\mathbb {Z}}}^2 \} \) arranged in increasing order according to the first component where \(0\le \arg {{\varvec{z}}}< 2 \pi \) is the polar angle of \({{\varvec{z}}}\). Our assumption is concerned with the distribution of points \((\lambda _i,\theta _i)\) restricted to a strip \([R- \Delta R,R) \times [0,1)\) for \(\Delta R > 0\) fixed and \(R \rightarrow \infty \). Due to the choice of normalisation, a strip of this form contains roughly \(\Delta R\) points. Broadly speaking, the points contained in the strip should behave more and more randomly as R increases—see Fig. 1.
The Berry–Tabor conjecture, and by extension our assumption, is more readily expressed in terms of the gap distribution. Consider the sequence \((\lambda _{i+1}-\lambda _i,\theta _i)\) for all points in the window \([R-\Delta R,R) \times [0,1)\). The Berry–Tabor conjecture states that in the limit \(R \rightarrow \infty \), the sequence of gaps \(\lambda _{i+1}-\lambda _i\) has an exponential distribution with mean 1. Our Assumption 1 then implies that in the limit \(R\rightarrow \infty \), the sequence of pairs \((\lambda _{i+1}-\lambda _i,\theta _i)\) is distributed according to the product of an exponential distribution with mean 1 and the uniform distribution on [0, 1)—see Fig. 2.
Let \({{\mathcal {J}}}_n^\gamma = \sum _{\ell =0}^n (-1)^\ell {{\mathcal {J}}}_{\ell ,n}^\gamma \). The principal objective of this paper is to now prove that the limit
exists for \(\lambda >0\) sufficiently small but fixed, every \(t>0\) and \({\text {Re}}\gamma >0\), and to evaluate the limit at \(\gamma =0\). The convergence is stated in Proposition 5 and explicit formulas for the limit are discussed in Sect. 11.
Now, in order to take the expectation value of the sum over the momenta in in (5.11), with \({{\mathcal {P}}}\) a Poisson point process, one needs to keep track of terms where the various \({{\varvec{p}}}_i\) and \({{\varvec{p}}}_j\) are equal or distinct. This is best done through the notion of set partitions, which are presented in detail in Appendix A. We denote by \(\Pi (n,k)\) the collection of set partitions \(\underline{F}=[F_1,\ldots ,F_k]\) of the set \(\{0,\ldots ,n\}\) into k blocks \(F_1,\ldots ,F_k\). Let us then define, for a given set partition \(\underline{F}\in \Pi (n,k)\), \({{\varvec{\xi }}}\in {{\mathbb {R}}}^{k}\), \({{\varvec{y}}}_1,\ldots ,{{\varvec{y}}}_k\in {{\mathbb {R}}}^d\)
where \(\iota _{\underline{F}}({{\varvec{\xi }}})\) is the embedding \(\iota _{\underline{F}}: {{\mathbb {R}}}^k\rightarrow {{\mathbb {R}}}^{n+1}\) defined by \({{\varvec{\xi }}}=(\xi _1,\ldots ,\xi _k)\mapsto (x_0,\ldots ,x_n)\) where \((x_j=\xi _i \iff j\in F_i)\), and \(\iota _{\underline{F}}({{\varvec{y}}}_1,\ldots ,{{\varvec{y}}}_k)\) is the vector analogue \(\iota _{\underline{F}}: ({{\mathbb {R}}}^d)^k\rightarrow ({{\mathbb {R}}}^d)^{n+1}\). See Sect. A.3 for details.
With this we can write
Note that for \({{\varvec{e}}}=(1,1,\ldots ,1)\in {{\mathbb {R}}}^k\) and any \(\omega \in {{\mathbb {R}}}\)
This decomposition allows us to compute the expectation over \({{\mathcal {P}}}\). Specifically, Campbell’s theorem yields, for \({{\mathcal {P}}}\) a Poisson process with intensity one,
Due to its translation invariance, \(H_{t,\ell ,\underline{F}}^\gamma \) is determined by its values in the \(l^\text {th}\) coordinate plane \(\Sigma ^\perp =\{{{\varvec{x}}}\in {{\mathbb {R}}}^k: x_l=0\}\). It will be convenient in our calculations below to fix l so that \(\ell \in F_l\). We define the corresponding Fourier transform of \(H_{t,\ell ,\underline{F}}^\gamma \) restricted to \(\Sigma ^\perp \) by
where \({{\varvec{\theta }}}\in \Sigma ^\perp \) and \(\mathrm {d}^\perp {{\varvec{\xi }}}= \prod _{\begin{array}{c} j = 1\\ j\ne l \end{array}}^k \mathrm {d}{{\varvec{\xi }}}_j\) denotes the standard Lebesgue measure on \(\Sigma ^\perp \). For \({{\varvec{\xi }}}\in \Sigma ^\perp \) we then have
Define the \((k\times (n+1))\)-matrix \(\Delta _{\ell ,\underline{F}}\) by
In particular \((\Delta _{\ell ,\underline{F}})_{i\ell } = 0\) for all \(i=1,\dots ,k\). In view of (5.13) and (6.10) we find in the case \(\gamma =0\) that
with
and
Thus if \(\ell \in F_l\)
More generally, for \({\text {Re}}\,\gamma \ge 0\), we have the formula
Our task is now to determine the convergence of
as \(r=h\rightarrow 0\) and calculate the limit.
7 Explicit Formulas
In this section we provide more explicit formulas for \({\widehat{{{\mathcal {A}}}}}_{t,\ell ,\underline{F}}({{\varvec{\theta }}},{{\varvec{y}}}_1,\ldots ,{{\varvec{y}}}_{k})\). The main results are the expression (7.12) for general \({{\varvec{\theta }}}\) and (7.15) for \({{\varvec{\theta }}}= {{\varvec{0}}}\). We assume that \(\underline{F}\) is some set partition into k blocks with \(\ell \in F_l\). We define
and similarly
We also write
Note that \(J_-\) (resp. \(J_+\)) contains indices corresponding to one-sided blocks of the partition, i.e. blocks, all of whose elements are less than or equal to (resp. greater than or equal to) \(\ell \). The set J contains indices corresponding to blocks which are not one-sided, i.e. they contain elements both less than and greater than \(\ell \). This provides a complete categorisation of blocks \(F_i\):
Let us furthermore define
Combining (5.14) and (6.14) we have
First we can freely integrate over all \(u_j\) for \(j \in F_l\). This yields a factor of
where \((x)_+ = \max \{0 , x \}\). If we use the convention \((x)_+^0 = \mathbb {1}[x > 0]\) then, writing \(r_i = \sum _{j \in F_i \cap [0,\ell -1]} u_j\) and \(s_i = \sum _{j \in F_i \cap [\ell +1,n]} u_j\), we obtain
with
and
In other words
Integrating over \(r_i\) for \(i \in I_- = J_- \cup J\) and \(s_i\) for \(i \in J_+\) yields
When \({{\varvec{\theta }}}\rightarrow {{\varvec{0}}}\) note that the integrand vanishes unless \(\mu _i = 0\) for all \(i \in J_-\) and \(\nu _i = 0\) for all \(i \in J_+\), that is to say that a partition \(\underline{F}\) only contributes in the limit if the only one-sided blocks are singletons. This motivates the definition of \(\Omega (n,k) \subset \{0,\ldots , n\}\times \Pi _\circ (n,k)\) as the set of marked partitions \((\ell ,\underline{F})\) such that every block \(F_i\) not containing \(\ell \) either (i) is a singleton or (ii) contains at least one number strictly less than \(\ell \) and at least one number strictly greater than \(\ell \) (cf. Appendix A.2).
The support of \({\widehat{{{\mathcal {A}}}}}_{t,\ell ,\underline{F}}({{\varvec{\theta }}},{{\varvec{y}}}_1,\ldots ,{{\varvec{y}}}_{k})\) as a function of \({{\varvec{\theta }}}\) is contained in the domain
and is continuous in this domain in a sufficiently small neighbourhood of the origin. Putting \({{\varvec{\theta }}}={{\varvec{0}}}\) in (7.12) yields
In other words, for \((\ell ,\underline{F})\in \Omega (n,k)\) we have
whereas for \((\ell ,\underline{F})\notin \Omega (n,k)\) we have that \({\widehat{{{\mathcal {A}}}}}_{t,\ell ,\underline{F}}({{\varvec{0}}},{{\varvec{y}}}_1,\ldots ,{{\varvec{y}}}_{k})=0\).
8 Decay Estimates
In this section we prove Proposition 3 which establishes the absolute and uniform convergence of the series defining (6.18). The proof will be similar in spirit to that of Proposition 1, and the key ingredient is the following decay estimate.
Proposition 2
Let \(a,b\in {{\mathcal {S}}}({\text {T}}({{\mathbb {R}}}^d))\). There exists a constant \(C_{a,b,d}\) such that for all \(W\in {{\mathcal {S}}}({{\mathbb {R}}}^d)\), \(t>0\), \({\text {Re}}\gamma \ge 0\), \(r>0\),
Proof
We first write
where we have used the shorthand \(\theta _l := -\sum _{i=1,i \ne l}^k \theta _i\). The inner integral can be treated using Lemma 1, so all we must do is compute the relevant norm appearing inside the Lemma. Let \(K=\{ 1,\dots ,k \}\) and \(S =\{s_1,\dots ,s_p\} \subset K\). We need to compute the norm \(\Vert f_S\Vert _{L^1}\) where
We have
and, more explicitly,
where we write
We pull the integral over \({{\varvec{\eta }}}\) and \({{\varvec{s}}}\) outside the absolute value and bound \(\Vert f_S\Vert _{L^1}\) above by
This can be bounded above by
where we use the convention \({\tilde{\mu }}_i = (\mu _i)_+ = \max \{0,\mu _i\}\). As in the proof of Proposition 1 we bound this above by
Integrating by parts with respect to \({{\varvec{y}}}_i\) for \(i\in S\) and pulling the absolute value inside the integral gives the upper bound
If \(l \ne 1\), each \({{\varvec{y}}}_i\) appears \(2|F_i|\) times in the product of \({{\mathcal {W}}}\), \({\tilde{a}}\) and \({\tilde{b}}\), except for \({{\varvec{y}}}_1\) and \({{\varvec{y}}}_l\) which appear \(2|F_i| + 1\) times (due to their appearance in \({\tilde{a}}\) and \({\tilde{b}}\)). If \(l=1\), then each \({{\varvec{y}}}_i\) appears \(2|F_i|\) times, except for \({{\varvec{y}}}_1\) which appears \(2|F_i|+2\) times. Either way we can find a constant \(c_d\) such that the number of terms inside the absolute value can be bounded above by
Applying the triangle inequality yields a sum of terms of the form
where
each \(\varphi _i\) is a derivative of \({\hat{W}}\) of order \(\le 2d+2\) and \({\mathfrak {a}}\) and \({\mathfrak {b}}\) are derivatives of \({\tilde{a}}\) and \({\tilde{b}}\) with respect to the second argument of order \(\le d+1\). Define the map \(\kappa : \{0,\dots ,n\} \rightarrow \{1,\dots ,k\}\) implicitly dependent on \(\underline{F}\) by \(\kappa (i) = j\) if \(i \in F_j\). We then have that
By the definition of \(\Omega (n,k)\) we have that \(\kappa (0)= \kappa (n) = 1\). For \(i = 2,\dots ,k\), we define the partial inverse
For \(i =2,\dots ,k\), the first factor of \(\Phi (\iota _{\underline{F}}({{\varvec{y}}}_1,\dots ,{{\varvec{y}}}_k))\) in which \({{\varvec{y}}}_i\) appears is then by definition
We define \({{\mathcal {K}}}\subset \{1,\dots ,n\}\) to be the image of \(\kappa ^{-1}(\{2,\dots , k\}\setminus \{l\})\). Equation (8.11) can thus be bounded above by
Make the variable substitutions \({{\varvec{y}}}_{\kappa (i)} \rightarrow {{\varvec{y}}}_{\kappa (i-1)}-{{\varvec{y}}}_{\kappa (i)}\) to bound this above by
As in (4.28) we have that
and as in (4.31) we have that
Combining with the fact that the functions \({\tilde{a}}\) and \({\tilde{b}}\) are Schwartz class implies that there exists a constant \( c_{a,b,d}\) such that
We then use the fact that
so in particular for \(i \in J_+\) or \(i \in J_-\)
respectively. For \(i \in J\) we have
and for \(i = l\) we have
For simplicity we bound all of these uniformly by
Using the fact that \(|J| + \sum _{i=1}^k ({\tilde{\mu }}_i+{\tilde{\nu }}_i) = n-k+1\) thus yields
Finally, observe that, since \(\sum _{i=1}^k |F_i| = n+1\), we have that
This completes the proof. \(\square \)
This upper bound allows us to ensure convergence of the series (6.18).
Proposition 3
Let \(a,b\in {{\mathcal {S}}}({\text {T}}({{\mathbb {R}}}^d))\), \(W\in {{\mathcal {S}}}({{\mathbb {R}}}^d)\), \(t>0\) and set
Then the series
in (6.18) converges absolutely for all \(|\lambda | < R\), uniformly in \({\text {Re}}\gamma \ge 0\), \(0< r\le 1\).
Proof
First of all, by integrating over \({{\varvec{\theta }}}\) we can obtain from Proposition 2
where
We can replace the set \(\Pi _\circ (n,k)\) by the set of all partitions of \(\{1,\dots ,n\}\) into k blocks to obtain the upper bound
Inserting this into our upper bound yields
which converges for \(2 \pi A |\lambda | \langle t \rangle \Vert W\Vert _{2d+2,d+1,1} < \mathrm {e}^{-1}\) by Stirling’s formula. \(\square \)
9 The Microlocal Boltzmann–Grad Limit
In this section we combine the results of Sects. 7 and 8 to prove Proposition 4 which establishes the limit of the full perturbation series. Given \((\ell ,\underline{F})\in \Omega (n,k)\), let \({{\mathcal {C}}}^\perp _{\ell ,\underline{F}}\) be the set of \({{\varvec{\theta }}}\in \Sigma ^\perp \) with ith coordinate \(\theta _i\) ranging over
For \((\ell ,\underline{F})\in \Omega (n,k)\) we define
which converges for \({\text {Re}}\gamma >0\) and can be extended (in the distributional sense) by analytic continuation to \({\text {Re}}\gamma \ge 0\). In other words, we have that
with
with \(g^\gamma \) as in (4.5). Note furthermore that
as \(\gamma \rightarrow 0\).
Proposition 4
Let \(a,b\in {{\mathcal {S}}}({\text {T}}({{\mathbb {R}}}^d))\), \(W\in {{\mathcal {S}}}({{\mathbb {R}}}^d)\), \(t>0\), \(|\lambda | < R\) with R as in (8.26), and \({\text {Re}}\gamma \ge 0\). Then
Proof
In view of the uniform convergence of the series in n (Proposition 3), it is sufficient to establish convergence term by term. Now
Due to the uniform decay \(\prod _{i\ne l} \langle \theta _i \rangle ^{-d/2}\) guaranteed by Proposition 2, the outer integral converges uniformly in \(r>0\) (and \({\text {Re}}\gamma \ge 0\)), and we can therefore take the limit \(r\rightarrow 0\) inside. Relations (7.12) and (7.15) tell us that the only-non zero terms come from the marked partitions \((\ell ,\underline{F})\in \Omega (n,k)\) and for \({{\varvec{\theta }}}\in {{\mathcal {C}}}^\perp _{\ell ,\underline{F}}\). \(\square \)
10 The Collision Series
The main result of this section is Proposition 5 which specialises Proposition 4 to the case of \(\gamma =0\). Let us define
and for \({{\varvec{m}}}\in {{\mathbb {Z}}}_{> 0}^{n}\),
Note that
Define furthermore
Note that in (10.5) for \(k>1\) only terms with \(n\ge k\) contribute.
Proposition 5
Let \(a,b\in {{\mathcal {S}}}({\text {T}}({{\mathbb {R}}}^d))\), \(W\in {{\mathcal {S}}}({{\mathbb {R}}}^d)\), \(t>0\), \(|\lambda | < R\) with R as in (8.26). Then
Proof
We begin from the result of Proposition 4. Let \((\ell ,\underline{F}) \in \Omega (n,k)\) and let \((\ell ',\underline{F}') \in {\widehat{\Omega }}(n',k')\) be the corresponding reduced marked partition. Order the blocks of \((\ell ,\underline{F})\) such that the following three conditions hold:
-
(1)
\(|F_i|>1\) for \(i=1,\dots ,k'-1\),
-
(2)
\(\ell \in F_{k'}\),
-
(3)
\(|F_i|=1\) for \(i=k'+1,\dots ,k\).
Define \(i_{k'+1},\dots ,i_n\) so that \(F_j = \{i_j\}\) for \(j=k'+1,\dots ,n\). We can then write
By first integrating over \(u_{i_{k'+1}},\dots ,u_{i_{n}}\), and then relabelling the remaining \(u_i\) variables with the indices \(0,\dots ,n'\) (preserving their order) we obtain
or in other words
Furthermore, by (9.4) every non-singleton in \(\underline{F}\) contains indices both to the left and right of \(\ell \), so every such term yields a delta function and we see that the distribution \(D_{\ell ,\underline{F}}\) is given by
This allows us to replace instances of \(\Vert {{\varvec{y}}}_{k'}\Vert ^2\) with \(\Vert {{\varvec{y}}}_j\Vert ^2\) for any \(j = 1,\ldots ,k'-1\). Let
be the list of elements of \(\{0,\ldots ,n\}\) that lie in non-singleton blocks. For \(i=1,\dots ,\mu +\nu \) we define \(m_i = j_{i+1} - j_i -1\) as the number of singletons between \(j_{i+1}\) and \(j_i\), and set \(M_i=m_1+\ldots +m_i\). Thus \(M_{\mu +\nu }=M\) is the total number of singletons in \(\underline{F}\). From the definition (9.4) one can see that
Combining the above with the definition of the T-matrix allows us to obtain
Due to the Dirac delta functions appearing in \(\omega _{k'}\), for \(i \in [k'+1, k'+M_\mu ]\) we can replace \(g({{\varvec{y}}}_{k'},{{\varvec{y}}}_i)\) with \(g({{\varvec{y}}}_{i_-},{{\varvec{y}}}_i)\) where \(i_-\) is defined to be the largest non-singleton element smaller than i. Similarly, for \(i \in [k'+M_\mu + 1, k'+M]\) we replace \(\overline{g({{\varvec{y}}}_{k'},{{\varvec{y}}}_i)}\) with \(\overline{g({{\varvec{y}}}_{i_+},{{\varvec{y}}}_i)}\) where \(i_+\) is defined to be the smallest non-singleton element larger than i. This allows us to conclude that (10.12) is equal to
Now use the fact that \(M+n' = n\) and \(M_\mu +\ell ' = \ell \) we can write
and so
where on the right hand side we now write k, \(\ell \), n instead of \(k'\), \(\ell '\), \(n'\). The result then follows by changing the summation variables \(m_j \rightarrow m_j -1\) and using (10.4). \(\square \)
11 The Limit Process
In this section we derive explicit formulas for \({{\mathcal {R}}}^{(k)}(t)\), assuming throughout that \(k\ge 1\). These show in particular that \({{\mathcal {R}}}^{(k)}(t)\) can be expressed as the \((k-1)\)-collision term with a real and non-negative kernel. The main results are equations (11.15) and (11.26) which together yield the formula (1.17), as well as equations (11.17) and (11.28) which respectively yield the expressions (1.22) and (1.23) for \(\rho _{11}^{(2)}\) and \(\rho _{12}^{(2)}\) in terms of Bessel functions.
Let us write \({{\mathcal {R}}}^{(k)}(t) = {{\mathcal {R}}}_{\mathrm {d}}^{(k)}(t) + {{\mathcal {R}}}_{\mathrm {off}}^{(k)}(t)\), where \({{\mathcal {R}}}_{\mathrm {d}}^{(k)}(t)\), \({{\mathcal {R}}}_{\mathrm {off}}^{(k)}(t)\) are as in the definition of \({{\mathcal {R}}}^{(k)}(t)\) (10.5), with \({\widehat{\Omega }}(n,k)\) replaced by \({\widehat{\Omega }}_\mathrm {d}(n,k)\) and \({\widehat{\Omega }}_\mathrm {off}(n,k)\), respectively.
11.1 Diagonal Terms
This is the case \(0,\ell \in F_1\) and so
with the function
We have here used the symmetry of the integrand under permutation of the indices of the \({{\varvec{y}}}_i\) with \(i\ge 2\), and taken an average over all ordered partitions in \(\underrightarrow{{\widehat{\Omega }}}_{\mathrm {d}}(n,k)\) rather than the original sum over the unordered partitions in \({\widehat{\Omega }}_{\mathrm {d}}(n,k)\).
We now apply the bijection \((\ell ,\underrightarrow{F})\mapsto (\underrightarrow{F}^+,\underrightarrow{F}^-)\) in (A.9) (Lemma 4), which, together with the relation
yields
Next we use the bijection \(\underrightarrow{F}\mapsto (\underrightarrow{F}',{{\varvec{m}}})\) (A.7) to non-consecutive ordered partitions , which yields
with \(|F_i|_{{{\varvec{m}}}} = \sum _{j\in F_i} m_j\).
We identify \({{\varvec{m}}}\in {{\mathbb {Z}}}_{\ge 0}^n\) with \(({{\varvec{m}}}^{(1)},\ldots ,{{\varvec{m}}}^{(k)})\in {{\mathbb {Z}}}_{\ge 0}^{|F_1|}\times \cdots \times {{\mathbb {Z}}}_{\ge 0}^{|F_k|}\). For each i we then use the identity
with \(p=|F_i|\) to write
This yields in view of the optical theorem (4.12),
When \(k=1\) we are summing over partitions into 1 block. Note that is empty unless \(n=0\), in which case contains only the partition \([\{0\}]\). Formula (11.9) therefore yields
When \(k\ge 2\), using the results in Appendix A.4, we can write (11.9) as
Here \(g_{11}^{(k)}\) is the 11 coefficient of the \(k\times k\) matrix-valued function (recall (1.20)),
with the diagonal matrix \({{\mathbb {D}}}({{\varvec{z}}})={\text {diag}}(z_1,\ldots ,z_k)\) and the matrix \({{\mathbb {W}}}={{\mathbb {W}}}({{\varvec{y}}}_1,\dots ,{{\varvec{y}}}_k)\) with coefficients
If we extend the definition of (11.11) to
we see that, by symmetry under the permutation of indices, the function (11.1) can be written as
which yields the diagonal part of the expression (1.17).
In the case \(k=2\), Eq. (A.28) yields an explicit formula in terms of the \(J_1\)-Bessel function (assuming here \(u_1,u_2>0\)),
We conclude
This formula can also be obtained directly from the combinatorial expression (11.9): note that for n even the only is
and for n odd is empty. Hence
By shifting the summation index this can be written
Equation (11.17) then follows from the series representation of the Bessel function
11.2 Off-Diagonal Terms
Here \(0\in F_1\) and \(\ell \in F_k\), and thus
with the function
The argument is identical to the diagonal case, and we obtain
In the case \(k=1\), there is no off-diagonal term as is empty for every n.
Furthermore, for \(k\ge 2\) (again using the results in Appendix A.4),
with \({{\mathbb {G}}}^{(k)}({{\varvec{u}}},{{\varvec{y}}}_1,\dots ,{{\varvec{y}}}_k)\) as in (11.12). Again, by symmetry under the permutation of indices, (11.22) can be expressed as
which yields the off-diagonal part of (1.17).
For \(k=2\), Eq. (A.29) yields
We conclude
Let us derive this formula also directly from the combinatorial expression (11.24): for n odd the only element of is
and for n even is empty. Hence
This can be written
Identifying the summation as a \(J_0\) Bessel function yields the result.
12 Discussion
The main conclusion of this work is that quantum transport in a periodic potential converges, in the microlocal Boltzmann–Grad limit, to a limiting random flight process. Unlike in the random setting, there is a positive probability that a path of the limit process revisits the same momentum several times. This is ultimately a consequence of the Floquet–Bloch reduction to discrete Hilbert spaces. The only hypothesis, Assumption 1, in our derivation is that Bloch momenta have asymptotically the same fine-scale distribution as a Poisson point process. This assumption can be viewed as a phase-space extension of the Berry–Tabor conjecture in quantum chaos [7, 37], which to-date has been confirmed only in special cases [9, 23, 34, 36, 38, 44, 47]. In the setting discussed in this paper, present techniques permit a rigorous analysis up to second order perturbation theory which, perhaps surprisingly, is consistent with the linear Boltzmann equation as well as our limit process. Thus extending the perturbative analysis to higher order terms unconditionally is an important open challenge. This would require the rigorous understanding of higher-order correlation functions for lattice point statistics, and we refer the reader to [47] for the best current results in this direction.
It follows from standard invariance principles for Markov processes that for large times the solution of the linear Boltzmann equation is governed by Brownian motion with the standard diffusive mean-square displacement (i.e., linear in time) [22, 46]. Therefore, the work of Eng and Erdös [19] for random potentials implies convergence to Brownian motion, if we first take the Boltzmann–Grad and then the diffusive limit. (Note that Erdös, Salmhofer and Yau [21, 22] have established convergence to Brownian motion in long-time/weak-coupling scaling limits directly, i.e., without first taking the weak-coupling limit to obtain the linear Boltzmann equation as in [20].) An immediate challenge is thus to understand the diffusive nature of the random flight process derived in the present paper. Recall that in the classical setting the Boltzmann–Grad limit of the periodic Lorentz gas does not satisfy the linear Boltzmann equation [13, 39], and we have superdiffusion with a \(t\log t\) mean-square displacement [41]. A further challenge is to expand our current understanding to more singular single-site potentials (such as hard core and/or long-range potentials) and to include background electromagnetic fields.
References
Albeverio, S., Gesztesy, F., Hoegh-Krohn, R.: The low energy expansion in nonrelativistic scattering theory. Annales de l’IHP Physique theorique 37(1), 1–28 (1982)
Albeverio, S., Hoegh-Krohn, R.: Point interactions as limits of short range interactions. J. Oper. Theory 6, 313–339 (1981)
Albeverio, S., Gesztesy, F., Hoegh-Krohn, R., Holden, H.: Solvable Models in Quantum Mechanics. Springer, Berlin (2012)
Allaire, G., Piatnitski, A.: Homogenization of the Schrödinger equation and effective mass theorems. Commun. Math. Phys. 258(1), 1–22 (2005)
Benoit, A., Gloria, A.: Long-time homogenization and asymptotic ballistic transport of classical waves. Ann. Sci. Éc. Norm. Supér. (4) 52(3), 703–759 (2019)
Bensoussan, A., Lions, J.-L., Papanicolaou, G.: Asymptotic Analysis for Periodic Structures. AMS Chelsea Publishing, Providence (2011)
Berry, M.V., Tabor, M.: Level clustering in the regular spectrum. Proc. R. Soc. Lond. A 356, 375–394 (1977)
Birman, M. Sh., Suslina, T.A.: Periodic second-order differential operators. Threshold properties and averaging. (Russian) Algebra i Analiz 15(5), 1–108 (2003); translation in St. Petersburg Math. J. 15(5), 639–714 (2004)
Bleher, P.M., Lebowitz, J.L.: Variance of number of lattice points in random narrow elliptic strip. Annales de l‘I.H.P., section B, tome 31(1), 27–58 (1995)
Bogomolny, E., Giraud, O.: Semiclassical calculations of the two-point correlation form factor for diffractive systems. Nonlinearity 15(4), 993–1018 (2002)
Bogomolny, E., Gerland, U., Schmit, C.: Singular statistics. Phys. Rev. E 63, 036206 (2001)
Boldrighini, C., Bunimovich, L.A., Sinai, Y.G.: On the Boltzmann equation for the Lorentz gas. J. Stat. Phys. 32, 477–501 (1983)
Caglioti, E., Golse, F.: On the Boltzmann–Grad limit for the two dimensional periodic Lorentz gas. J. Stat. Phys. 141, 264–317 (2010)
Castella, F.: On the derivation of a quantum Boltzmann equation from the periodic von Neumann equation. ESAIM: M2AN 33(2), 329–349 (1999)
Castella, F.: From the von Neumann equation to the quantum Boltzmann equation in a deterministic framework. J. Stat. Phys. 104, 387–447 (2001)
Castella, F.: From the von Neumann equation to the quantum Boltzmann equation. II. Identifying the Born series. J. Stat. Phys. 106, 1197–1220 (2002)
Castella, F., Plagne, A.: Non derivation of the quantum Boltzmann equation from the periodic von Neumann equation. Indiana Univ. Math. J. 51(4), 963–1016 (2001)
Craster, R.V., Kaplunov, J., Pichugin, A.V.: High-frequency homogenization for periodic media. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 466(2120), 2341–2362 (2010)
Eng, D., Erdös, L.: The linear Boltzmann equation as the low density limit of a random Schrödinger equation. Rev. Math. Phys. 17(06), 669–743 (2005)
Erdös, L., Yau, H.-T.: Linear Boltzmann equation as the weak coupling limit of the random Schrödinger equation. Commun. Pure Appl. Math. LII I, 667–735 (2000)
Erdös, L., Salmhofer, M., Yau, H.-T.: Quantum diffusion of the random Schrödinger evolution in the scaling limit. II. The recollision diagrams. Commun. Math. Phys. 271(1), 1–53 (2007)
Erdös, L., Salmhofer, M., Yau, H.-T.: Quantum diffusion of the random Schrödinger evolution in the scaling limit. Acta Math. 200(2), 211–277 (2008)
Eskin, A., Margulis, G., Mozes, S.: Quadratic forms of signature \((2,2)\) and eigenvalue spacings on rectangular \(2\)-tori. Ann. Math. 161(2), 679–725 (2005)
Exner, P., Seba, P.: Point interactions in two and three dimensions as models of small scatterers. Phys. Lett. A 222(1–2), 1–4 (1996)
Gallavotti, G.: Divergences and approach to equilibrium in the Lorentz and the Wind-tree-models. Phys. Rev. 185, 308–322 (1969)
Gérard, P.: Mesures semi-classiques et ondes de Bloch. Séminaire sur les Équations aux Dérivées Partielles, 1990–1991, Exp. No. XVI, École Polytech, Palaiseau (1991)
Gérard, P., Markowich, P.A., Mauser, N.J., Poupaud, F.: Homogenization limits and Wigner transforms. Commun. Pure Appl. Math. 50(4), 323–379 (1997)
Griffin, J., Marklof, J.: Quantum transport in a low-density periodic potential: homogenisation via homogeneous flows. Pure Appl. Anal. 1(4), 571–614 (2019)
Grossmann, A., Hoegh-Krohn, R., Mebkhout, M.: A class of explicitly soluble, local, many-center Hamiltonians for one-particle quantum mechanics in two and three dimensions. I. J. Math. Phys. 21(9), 2376–2385 (1980)
Harutyunyan, D., Milton, G., Craster, R.V.: High-frequency homogenization for travelling waves in periodic media. Proc. R. Soc. A. 472(2191), 20160066 (2016)
Holden, H., Hoegh-Krohn, R., Johannesen, S.: The short-range expansion in solid state physics. Annales de l’IHP Physique theorique 41(4), 335 (1984)
Letendre, T., Ueberschär, H.: Random moments for the new eigenfunctions of point scatterers on rectangular flat tori. Ann. H. Poincare, to appear
Lorentz, H.: Le mouvement des électrons dans les métaux. Arch. Néerl. 10, 336–371 (1905)
Margulis, G., Mohammadi, A.: Quantitative version of the Oppenheim conjecture for inhomogeneous quadratic forms. Duke Math. J. 158(1), 121–160 (2011)
Marklof, J.: Kinetic limits of dynamical systems. In: Hyperbolic Dynamics, Fluctuations and Large Deviations, Proceedings of Symposia in Pure Mathematics. American Mathematical Society, pp. 195–223 (2015)
Marklof, J.: Pair correlation densities of inhomogeneous quadratic forms II. Duke. Math. J. 115, 409-434 (2002), Correction, ibid. 120, 227–228 (2003)
Marklof, J.: The Berry-Tabor conjecture. In: Proceedings of the 3rd European Congress of Mathematics, Barcelona 2000, Progress in Mathematics 202, pp. 421–427. Basel, Birkhäuser (2001)
Marklof, J.: Pair correlation densities of inhomogeneous quadratic forms. Ann. Math. 158, 419–471 (2003)
Marklof, J., Strömbergsson, A.: The Boltzmann–Grad limit of the periodic Lorentz gas. Ann. Math. 174, 225–298 (2011)
Marklof, J., Strömbergsson, A.: Kinetic theory for the low-density Lorentz gas. Mem. Amer. Math. Soc., to appear
Marklof, J., Tóth, B.: Superdiffusion in the periodic Lorentz gas. Commun. Math. Phys. 347(3), 933–981 (2016)
Markowich, P.A., Mauser, N.J., Poupaud, F.A.: Wigner-function approach to (semi)classical limits: electrons in a periodic potential. J. Math. Phys. 35(3), 1066–1094 (1994)
Panati, G., Spohn, H., Teufel, S.: Effective dynamics for Bloch electrons: Peierls substitution and beyond. Commun. Math. Phys. 242, 547–578 (2003)
Sarnak, P.: Values at integers of binary quadratic forms. In: Harmonic Analysis and Number Theory (Montreal, PQ, 1996), pp. 181–203, CMS Conference Proceedings 21, American Mathematical Society, Providence (1997)
Spohn, H.: Derivation of the transport equation for electrons moving through random impurities. J. Stat. Phys. 17, 385–412 (1977)
Spohn, H.: The Lorentz process converges to a random flight process. Commun Math. Phys. 60, 277–290 (1978)
VanderKam, J.M.: Correlations of eigenvalues on multi-dimensional flat tori. Commun. Math. Phys. 210, 203–223 (2000)
Acknowledgements
We thank Søren Mikkelsen for valuable comments on the first draft of this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Peter Balint.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Research supported by EPSRC Grant EP/S024948/1.
Appendix A: Partitions, Diagrams and Graphs
Appendix A: Partitions, Diagrams and Graphs
1.1 A.1: Set Partitions
A set partition \(\underline{F}=[F_1,\ldots ,F_k]\) of the finite set \(\{0,\ldots ,n\}\) is a decomposition into disjoint and non-empty subsets \(F_1,\ldots ,F_k\). The order in which we list the \(F_i\) is not relevant (we will discuss ordered partitions further down). We call \(F_i\) a block of \(\underline{F}\), and denote by \(k=|\underline{F}|\le n+1\) the number of blocks. We furthermore define \(\nu (\underline{F})=|F_1\cup \cdots \cup F_k|-1\). We denote by \(\Pi (n)\) the collection of all set partitions \(\underline{F}\) with \(\nu (\underline{F})=n\), and by \(\Pi (n,k)\) the collection of \(\underline{F}\in \Pi (n)\) with \(|\underline{F}|=k\). We write \(\underline{F}\preceq {\underline{G}}\), if every subset of \({\underline{G}}\) is a subset of unions of subsets of \(\underline{F}\). This defines a partial ordering on \(\Pi (n)\). The minimal and maximal elements of \(\Pi (n)\) are \({\underline{O}}=[\{0\},\ldots ,\{n\}]\) and \({\underline{N}}=[\{0,\ldots ,n\}]\), respectively.
We further denote by \(\Pi _\circ (n,k)\subset \Pi (n,k)\) the sub-collection of all set partitions where 0 and n are in the same block (\(F_1\), say) and by \(\Pi _\circ ^{\mathrm {nc}}(n,k)\subset \Pi _\circ (n,k)\) the sub-collection of non-consecutive partitions where each subset \(F_i\) does not contain consecutive indices; that is \(|j_1-j_2|\ne 1\) for all \(j_1,j_2\in F_i\).
1.2 A.2: Marked and Reduced Set Partitions
Given a set partition \(\underline{F}\in \Pi (n)\) and integer \(\ell \in \{0,\ldots , n\}\) we call \((\ell ,\underline{F})\) the corresponding marked partition. Let \(\Omega (n) \subset \{0,\ldots , n\}\times \Pi _\circ (n)\) denote the set of marked partitions \((\ell ,\underline{F})\) such that every block \(F_i\) not containing \(\ell \) either (i) is a singleton or (ii) contains at least one number strictly less than \(\ell \) and at least one number strictly greater than \(\ell \). Let furthermore \(\Omega (n,k) \subset \Omega (n) \) denote the subset where \(\underline{F}\) has k blocks.
These marked partitions are more easily understood diagrammatically. Given \((\ell ,\underline{F})\in \Omega (n,k)\) first draw \(n+1\) circles in a horizontal line representing the indices \(0,1,\ldots ,n\); then fill in the circle corresponding to the index \(\ell \); finally, connect indices with lines beneath if and only if they lie in the same block. An example diagram for a typical partition can be seen in Fig. 3.
We say that \((\ell ,\underline{F}) \in \Omega (n,k)\) is reduced if for every block \(F_i\) we have either \(|F_i|>1\) or \(F_i = \{\ell \}\), i.e. the partition contains no singleton blocks except possibly \(\{\ell \}\). We denote by \({\widehat{\Omega }}(n,k) \subset \Omega (n,k)\) the collection of reduced marked partitions. From this point forward when we say ‘singleton’ we will mean a block of the form \(\{j\}\) and \(j\ne \ell \).
Given \((\ell ,\underline{F}) \in \Omega (n,k)\) with M singleton blocks we construct the corresponding reduced marked partition \((\ell ',\underline{F}') \in {\widehat{\Omega }}(n-M,k-M)\) by removing all singletons and relabelling the remaining elements with the labels \(\{0,\dots ,n-M\}\) such that the order is preserved. This process is described explicitly below.
Let
be the list of numbers in \(\{0,\ldots ,n\}\) that lie in non-singleton blocks. For \(i=1,\dots ,\mu +\nu \) we define \(m_i = j_{i+1} - j_i -1\) as the number of singletons between \(j_{i+1}\) and \(j_i\), and set \(M_i=m_1+\ldots +m_i\). Thus \(M_{\mu +\nu }=M\) is the total number of singletons in \(\underline{F}\). For \(1\le M_{\mu +\nu }\le n\) define the map \(\kappa _{\underline{F}}:\{0,\ldots ,n\}\rightarrow \{0,\ldots ,n'\}\) with \(n'=n-M_{\mu +\nu }\) by
If there are no singletons, i.e. \((\ell ,\underline{F})\) is already reduced, then \(\kappa _{\underline{F}}={\text {id}}\). Given a pair \((\ell ,\underline{F}) \) we obtain the reduced marked partition \((\ell ', \underline{F}')\) by setting \(\ell ' = \kappa _{\underline{F}}(\ell ) = \mu \), \(\underline{F}'= [F'_1,\ldots ,F'_{k'}]\) with \(k' = k-M_{\mu +\nu }\) and
Given \((\ell ,\underline{F})\in \Omega (n,k)\) the above provides a unique \((\ell ',\underline{F}') \in {\widehat{\Omega }}(n',k')\). We thus have the following Lemma.
Lemma 2
The map
is bijective.
Furthermore note that every block \(F_i\ne \{\ell \}\) in \({\widehat{\Omega }}(n)\) contains at least one number strictly less than \(\ell \) and at least one number strictly greater than \(\ell \). Diagrammatically, the reduction of a marked partition described above then simply corresponds to removing all isolated, unfilled circles—see Fig. 4.
Assume \(k\ge 2\) in the following. Define \({\widehat{\Omega }}_{\mathrm {d}}(n,k) \subset {\widehat{\Omega }}(n,k)\) to be the set of reduced marked partitions \((\ell ,\underline{F})\) such that 0 and \(\ell \) (and therefore also n) lie in the same block i.e.,
We call this the set of diagonal reduced marked partitions. The corresponding set of off-diagonal reduced marked partitions is defined as
No order of blocks is specified here, so indeed any marked partition in \({\widehat{\Omega }}_{\mathrm {d}}(n,k)\) is either in \({\widehat{\Omega }}_{\mathrm {d}}(n,k)\) or in \({\widehat{\Omega }}_{\mathrm {off}}(n,k)\).
1.3 A.3: Ordered Partitions
We introduce an ordering of a partition \(\underline{F}\) by specifying an order in which the blocks appear. That is, for a partition \(\underline{F}=[F_1,\ldots ,F_k]\) we have k! corresponding ordered partitions, which we write as \(\underrightarrow{F}=\langle F_{\sigma (1)},\ldots ,F_{\sigma (k)}\rangle \); here \(\sigma \in S_k\) (the symmetric group of k elements). We denote the corresponding set of ordered partitions by \(\underrightarrow{\Pi }(n,k)\). We call an ordering canonical if each \(F_i\) contains the smallest of all elements in the blocks \(F_j\) with \(j\ge i\); in particular this means that \(0\in F_1\). This yields a one-to-one correspondence between partitions and canonically ordered partitions.
Given \(\underrightarrow{F}\in \underrightarrow{\Pi }(n,k)\) we define the embedding \(\iota _{\underrightarrow{F}}: {{\mathbb {R}}}^{k}\rightarrow {{\mathbb {R}}}^{n+1}\),
By abuse of notation, we also define the vector analogue \(\iota _{\underrightarrow{F}}: ({{\mathbb {R}}}^d)^{k}\rightarrow ({{\mathbb {R}}}^d)^{n+1}\),
For the unordered partition \(\underline{F}\in \Pi (n,k)\) we define the corresponding embedding \(\iota _{\underline{F}}=\iota _{\underrightarrow{F}}\) where \(\underrightarrow{F}\) has the canonical order.
Let us define
That is, we specify that the first block contains 0 (and thus also n); with this convention there are \((k-1)!\) ordered partitions in for every given \(\underline{F}\in \Pi _\circ (n,k)\). Let denote the set of partitions of \(\{0,\dots ,n\}\) into k blocks where 0 and n lie in different blocks. Define to be the set of ordered partitions \(\underrightarrow{F}=\langle F_1,\ldots ,F_k\rangle \) where \(0 \in F_1\) and \(n \in F_k\).
Let \(\underrightarrow{\Pi }^\mathrm {nc}(n,k) \subset \underrightarrow{\Pi }(n,k)\) (and equally and ) be the subset of ordered partitions such that all consecutive elements lie in separate blocks, that is \(j\in F_i\) implies \(j+1\notin F_i\). Let us take and construct the corresponding non-consecutive partition; the construction is similar to that of removing singletons. Let \(j_1<\ldots <j_{n'}\) be the list of elements \(j\in \{1,\ldots ,n\}\) for which \(j_s\in F_i\) implies \(j_s-1\notin F_i\). Let \(m_s\in {{\mathbb {Z}}}_{\ge 0}\) be the largest integer so that \(j_s+1,\ldots ,j_s+m_s\in F_i\). Then \(n=n'+m_1+\ldots +m_{n'}\). We map to the pair \((\underrightarrow{F}',{{\varvec{m}}}) \in \underrightarrow{\Pi }^\mathrm {nc}(n',k) \times {{\mathbb {Z}}}_{\ge 0}^{n'}\), where \(F_i'=\{ s \mid j_s\in F_i\}\) and \({{\varvec{m}}}=(m_1,\ldots ,m_{n'})\) as defined as above. In particular note that if then \((\underrightarrow{F}',{{\varvec{m}}})=(\underrightarrow{F},{{\varvec{0}}})\). The map \(\underrightarrow{F}\mapsto (\underrightarrow{F}',{{\varvec{m}}})\) is clearly invertible, and we have the following.
Lemma 3
Let \(k\ge 1\). The maps
are bijective.
The notion of a marked partition \((\ell ,\underline{F})\) naturally extends to an ordered marked partition \((\ell ,\underrightarrow{F})\), and we construct a reduced ordered partition from an ordered partition by preserving the order of surviving blocks in the construction. The ordered partitions corresponding to \(\Omega (n,k)\) and \({\widehat{\Omega }}(n,k)\) are denoted by \(\underrightarrow{\Omega }(n,k)\) and \(\underrightarrow{{\widehat{\Omega }}}(n,k)\), respectively. We define the ordered partitions corresponding to \({\widehat{\Omega }}_{\mathrm {d}}(n,k)\) and \({\widehat{\Omega }}_{\mathrm {off}}(n,k)\) by
and
Whereas there are \((k-1)!\) ordered partitions in \(\underrightarrow{{\widehat{\Omega }}}_{\mathrm {d}}(n,k)\) for every fixed partition in \({\widehat{\Omega }}_{\mathrm {d}}(n,k)\), there are only \((k-2)!\) in \(\underrightarrow{{\widehat{\Omega }}}_{\mathrm {off}}(n,k)\) for every fixed partition in \({\widehat{\Omega }}_{\mathrm {off}}(n,k)\). The notion of ordering can also be represented diagrammatically, namely, we insist that if \(i < j\) then the line connecting elements in \(F_i\) is lower than the line connecting elements in \(F_j\)—see Fig. 5. In the event that the marked partition contains a singleton, we can attach a vertical line below to the desired height—see Fig. 6
Given \((\ell ,\underrightarrow{F})\in \underrightarrow{{\widehat{\Omega }}}\), with \(\underrightarrow{F}=\langle F_1,\ldots ,F_k\rangle \), we define \(\underrightarrow{F}^\pm =\langle F_1^\pm ,\ldots ,F_k^\pm \rangle \) with \(F_i^+=F_i\cap [0,\ell ]\) and \(F_i^-=n-(F_i \cap [\ell ,n])\subset [0,n-\ell ]\). Recall that \(F_i\) is either the singleton \(\{\ell \}\) or contains at least one number strictly less than \(\ell \) and at least one number strictly greater than \(\ell \). This implies in particular that \(F_i^\pm \ne \emptyset \) for all i, and thus \(\underrightarrow{F}^+\in \Pi (\ell ,k)\) and \(\underrightarrow{F}^-\in \Pi (n-\ell ,k)\). We in fact have the following bijection.
Lemma 4
The maps
are bijective.
Proof
For \((\ell ,\underrightarrow{F})\in \underrightarrow{{\widehat{\Omega }}}(n,k)\), we have by construction \(0,n\in F_1\); therefore \(0\in F_1^+\) and \(n\in n-F_1^-\); this in turn implies \(0\in F_1^-\).
If furthermore \((\ell ,\underrightarrow{F})\in \underrightarrow{{\widehat{\Omega }}}_{\mathrm {d}}(n,k)\), then \(\ell \in F_1\) and thus \(\ell \in F_1^+\) and \(\ell \in n-F_1^-\); the latter can be written as \(n-\ell \in F_1^-\). This shows \(\underrightarrow{F}^+\in \Pi _\circ (\ell ,k)\) and \(\underrightarrow{F}^-\in \Pi _\circ (n-\ell ,k)\). If on the other hand \((\ell ,\underrightarrow{F})\in \underrightarrow{{\widehat{\Omega }}}_{\mathrm {off}}(n,k)\), then \(\ell \in F_k\). So \(\ell \in F_k^+\) and \(\ell \in n-F_k^-\); that is \(n-\ell \in F_k^-\). This means that and . We conclude that (A.9), (A.10) have the correct range (Figs. 7, 8).
Now \((\underrightarrow{F}^+,\underrightarrow{F}^-)\) are uniquely determined by \(\underrightarrow{F}\) and hence (A.9), (A.10) are injective. The inverse maps are given by
from which we infer that (A.9), (A.10) are surjective. \(\square \)
1.4 A.4: Graphs and Paths
Let \({{\mathbb {K}}}_k\) be the complete graph with \(k\ge 2\) vertices which we label as \(1,2,\ldots ,k\), and denote by \(P=i_0 i_1\cdots i_n\) (with \(i_{s}\ne i_{s+1}\)) the path of length n which visits the listed vertices in the given order. This means that consecutive indices \(i_{s}i_{s+1}\) correspond to an edge. Backtracking is allowed, e.g. \(P=121\) is an admissible path. We denote by \(\Sigma (n,k)\) the set of all paths of length n, and by \({\overline{\Sigma }}(n,k)\) the set of such paths that visit every vertex at least once. \({\overline{\Sigma }}(n,k)\) is non-empty for \(n\ge k-1\). We denote by \(\Sigma _{ij}(n,k)\subset \Sigma (n,k)\) resp. \({\overline{\Sigma }}_{ij}(n,k)\subset {\overline{\Sigma }}(n,k)\) the subset of paths that start at vertex i and end at vertex j. For \(i\ne j\), \({\overline{\Sigma }}_{ij}(n,k)\) is non-empty if \(n\ge k-1\), and \({\overline{\Sigma }}_{ii}(n,k)\) is non-empty if \(n\ge k\).
With an ordered partition \(\underrightarrow{F}\in \underrightarrow{\Pi }^\mathrm {nc}(n,k)\) we associate a path \(P(\underrightarrow{F})=i_0 i_1\cdots i_n\in {\overline{\Sigma }}(n,k)\) by identifying each block \(F_i\) with the ith vertex of \(K_n\). That is, \(i_s=i\) if and only if \(s\in F_i\). This yields the following.
Lemma 5
The maps
are bijective.
We assign a \((k\times k)\) edge matrix \({{\mathbb {W}}}=(w_{ij})\) to the graph \({{\mathbb {K}}}_k\), by assigning a weight \(w_{ij}\) to each edge ij, and \(w_{ii}=0\) to the coefficients on the diagonal. We also assign the diagonal vertex matrix \({{\mathbb {D}}}({{\varvec{u}}})={\text {diag}}(u_1,\ldots ,u_k)\) which assigns weight \(u_i\) to vertex i. The edge weight of a path \(P=i_0\ldots i_n\) is then defined as \(w_P=w_{i_0i_1} w_{i_1i_2}\cdots w_{i_{n-1}i_n}\), and the vertex weight as \(u_P=u_{i_0} u_{i_1} \cdots u_{i_{n-1}} u_{i_n}\); the total weight is thus
The combinatorics of a path of length n can be understood by means of the matrix \([{{\mathbb {D}}}({{\varvec{u}}}) {{\mathbb {W}}}]^n {{\mathbb {D}}}({{\varvec{u}}})\), as we will explain now. We define the linear operator \({{\mathcal {L}}}\) acting on functions F of \({{\mathbb {C}}}^k\) by
More explicitly, if F has the Taylor series
then
Thus \({{\mathcal {L}}}F({{\varvec{u}}})\) is the derivative \(\partial _{u_1}\cdots \partial _{u_k}\) of the Borel transform of \(F({{\varvec{u}}})\).
Lemma 6
For \(n\ge 1\),
Proof
Note that
since the terms in \(\sum _{P} u_P w_P\) that are constant in \(u_l\) correspond exactly to those paths that do not visit vertex l. Equation (A.17) then follows from simple matrix algebra. \(\square \)
Consider the \((k\times k)\) matrix valued function
We have the series expansion
with the Taylor coefficients given by \((k\times k)\) matrices \(C_{\nu _1,\ldots ,\nu _k}\). The first series in (A.20) (and hence also the above Taylor series) converges absolutely for \(|u_i|<r_0^{-1}\) with \(r_0= k \max |w_{ij}|\). Summing over n and using the geometric series (note that for the \(n=0\) term \({{\mathcal {L}}}{{\mathbb {D}}}({{\varvec{u}}})=0\)), we then have
After the variable substitution \(z_i\mapsto 1/z_i\), (A.14) becomes
Let us work out the example \(k=2\) in some more detail. In this case
and hence
We contract the path of integration in \(z_2\) to zero, and pick up the residue at \(z_2 = z_1^{-1} w_{12}w_{21}\). This yields
For \(u_1,u_2\ne 0\) set \(z_1 = \mathrm {i}u_1^{-1/2} u_2^{1/2} (-w_{12} w_{21})^{1/2} \mathrm {e}^{\mathrm {i}\theta }\) with any fixed choice for the branch of the square-root so that \(\mathrm {d}{{\varvec{z}}}_1/{{\varvec{z}}}_1 = \mathrm {i}\mathrm {d}\theta \) . Then
Compare this with the classical integral representation for the J-Bessel function,
Thus
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Griffin, J., Marklof, J. Quantum Transport in a Crystal with Short-Range Interactions: The Boltzmann–Grad Limit. J Stat Phys 184, 16 (2021). https://doi.org/10.1007/s10955-021-02797-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10955-021-02797-z