Abstract
We introduce a Kac’s type walk whose rate of binary collisions preserves the total momentum but not the kinetic energy. In the limit of large number of particles we describe the dynamics in terms of empirical measure and flow, proving the corresponding large deviation principle. The associated rate function has an explicit expression. As a byproduct of this analysis, we provide a gradient flow formulation of the Boltzmann-Kac equation.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The statistics of rarefied gas is described, at the kinetic level, by the Boltzmann equation. It has become paradigmatic since it encodes most of the conceptual and technical issues in the description of the statistical properties for out of equilibrium systems. In the spatially homogeneous case the Boltzmann equation reads
where \(f(v)\mathop {}\!\mathrm {d}v\) is the one-particle velocity distribution, \(v, v_*\) (resp. \(v', v'_*\)) are the incoming (resp. outgoing) velocities in the binary collision and r is the collision rate. In the classical case the collision rate is concentrated on the set of velocities satisfying the constraints of momentum and energy conservation \(v+v_*= v' + v'_*\) and \(|v|^2+|v_*|^2= |v'|^2 + |v'_*|^2\), and it satisfied the detailed balance condition \(M(\mathop {}\!\mathrm {d}v) M(\mathop {}\!\mathrm {d}v_*) r(v,v_*;\mathop {}\!\mathrm {d}v',\mathop {}\!\mathrm {d}v_*')=M(\mathop {}\!\mathrm {d}v') M(\mathop {}\!\mathrm {d}v_*')r(v',v_*';\mathop {}\!\mathrm {d}v,\mathop {}\!\mathrm {d}v_*)\), where \(M(\mathop {}\!\mathrm {d}v)\) is the Maxwellian.
In the pioneering work [13], Kac derived (1.1) from a stochastic model of N particles, interacting via binary collisions satisfying the conservation of the energy. This result can be seen as the law of large number for the empirical measure of the microscopic dynamics. In particular, Kac’s result established the validity of the Stosszahlansatz. We refer to [19] for further developments and references. The analysis of the corresponding fluctuations, in the central limit regime, has been carried out in [20,21,22]. Regarding the large deviation asymptotics, we point out that while the law of large numbers depends on the validity of the Stosszahlansatz with probability converging to one as \(N\rightarrow \infty \), the large deviation principle requires that it holds with probability super-exponentially close to one for N large. It is therefore a non trivial improvement of the Kac’s result. The first statement in this direction has been obtained in [15], where a large deviation upper bound is proven. In [18] a large deviation result has been derived for a stochastic model in the setting of one dimensional spatially dependent Boltzmann equation with discrete velocities. A main issue behind the proof of large deviation lower bound is to establish a law of large number for a perturbed dynamics, proving in particular the uniqueness of the perturbed Boltzmann equation, which in general fails.
Regarding the Newtonian dynamics, in view of the previous discussion, the validity of a large deviation principle is a most challenging issue. The general structure of the rate function associated to the Boltzmann equation for hard sphere is discussed in [7]. A derivation from Newtonian dynamics is presented in [6], see also [5] for a comparison of the rate function derived in [6] with the one proposed in [7].
We focus on the large deviation principle for Kac-type spatially homogeneous models. Beside the empirical density, it is convenient to introduce another observable, the empirical flow, which records the incoming and outgoing velocities in the collisions, i.e., letting N the number of particles, \(N q_t(\mathop {}\!\mathrm {d}v,\mathop {}\!\mathrm {d}v_*, \mathop {}\!\mathrm {d}v', \mathop {}\!\mathrm {d}v'_*)\mathop {}\!\mathrm {d}t\) is the number of collisions in the time window \([t, t+\mathop {}\!\mathrm {d}t]\) with incoming velocities in \(\mathop {}\!\mathrm {d}v\,\mathop {}\!\mathrm {d}v_*\) and outgoing velocities in \(\mathop {}\!\mathrm {d}v'\,\mathop {}\!\mathrm {d}v'_*\). In particular, the mass of the empirical flow is the normalized total number of collisions. The empirical measure and flow are linked by the balance equation
which expresses the conservation of the mass. The idea of considering the pair of observables empirical measure and flow has been exploited in the context of Markov processes in [3, 4, 12]. In this setting the rate function relative to the pair empirical measure and flow has a closed form, and it is equal to \(I(f, q)=I_0(f_0) + J(f,q)\), where \(I_0(f_0)\) takes into account the fluctuations of the initial datum and the dynamical term J is given by
where \(q^f(\mathop {}\!\mathrm {d}v, \mathop {}\!\mathrm {d}v_*, \mathop {}\!\mathrm {d}v', \mathop {}\!\mathrm {d}v'_*)= \frac{1}{2} f(v)f(v_*)\mathop {}\!\mathrm {d}v\,\mathop {}\!\mathrm {d}v_* r(v, v_*; \mathop {}\!\mathrm {d}v', \mathop {}\!\mathrm {d}v'_*) \). By projecting J on the empirical density f(v)dv one recovers the variational expression for the rate function obtained for the empirical measure in [7, 15, 18].
The rate function (1.3) has a simple interpretation in terms of Poisson point processes. Let \(\{Z^N\}\) be a sequence of Poisson random variables with parameters \(N\lambda \). By Stirling’s formula, for N large
Therefore, for N large the statistics of the collisions in the Kac’s walk can be thought as sampled from a Poisson point process with intensity \(N q^f\), where now f and q are related by the balance equation (1.2).
Here we implement this program for a model of N particles, interacting via stochastic binary collisions satisfying the conservation of the momentum, but not of the kinetic energy. Such a model is relevant, as example, in the case of a molecular gas when the internal degrees of freedom are disregarded. We do not assume a detailed balance condition and the corresponding Boltzmann-Kac equation is of the form (1.1). We prove the large deviation upper bound with the rate function introduced above. The proof of the matching lower bound is achieved when q has bounded second moment. From a technical viewpoint, the advantage of momentum conservation with respect to energy conservation is the linearity of the constraint, which allows to use convolution in the approximation argument for the lower bound. We also derive the variational formula for the projection of the rate function on the empirical measure.
In the context of i.i.d. Brownians, the connection between the large deviation rate function and the gradient flow formulation of the heat equation is discussed in [1], see also [2, 17] for the case of i.i.d. reversible Markov chains. Here we derive a gradient flow formulation for the Boltzmann-Kac equation from the large deviation rate function (1.3). On general grounds, a gradient flow formulation of evolution equations is based on the choice of a pair of functions in Legendre duality. In [10] it is shown how this pair can be chosen so that the Boltzmann-Kac equation is the gradient flow of the entropy with respect to a suitable distance on the set of probability measures. As we here show, the choice in [10] is not the one associated to the large deviation rate function. Instead, analogously to [2], in the formulation here presented the non-linear Dirichlet form associated to the Boltzmann-Kac equation plays the role of the slope of the entropy.
2 Notation and Main Result
2.1 Kac Walk
For a Polish space X we denote by \({\mathcal M}(X)\) the set of positive Radon measures on X with finite mass; we consider \({\mathcal M}(X)\) endowed with the weak* topology and the associated Borel \(\sigma \)-algebra.
Given \(N\ge 2\), a configuration is defined by N velocities in \({\mathbb R}^d\). The configuration space is therefore given by \(\Sigma _N:=({\mathbb R}^d)^N\). Elements of \(\Sigma _N\) are denoted by \({\varvec{v}} =(v_k)_{k=1,\ldots ,N}\), with \(v_k\in {\mathbb R}^d\). The Kac walk that we here consider is the Markov processes on \(\Sigma _N\) whose generator acts on continuous and bounded functions \(f:\Sigma _N\rightarrow {\mathbb R}\) as
where the sum is carried over the unordered pairs \(\{i,j\}\subset \{1,..,N\}\), \(i\ne j\), and
Here
and the collision rate r is a continuous function from \({\mathbb R}^d\times {\mathbb R}^d\) to \({\mathcal M}({\mathbb R}^d\times {\mathbb R}^d)\). Finally, the scattering rate \(\lambda :{\mathbb R}^d \times {\mathbb R}^d \rightarrow {\mathbb R}_+\) is
We assume that r satisfies the following conditions in which we set \({\mathscr {V}} :=\{v+v_*=v'+v_*'\}\subset ({\mathbb R}^d)^2\times ({\mathbb R}^d)^2\).
Assumption 2.1
-
(i)
Conservation of momentum. For each \((v,v_*) \in {\mathbb R}^d\times {\mathbb R}^d\) the measure \(r(v,v_*;\cdot )\) is supported on the hyperplane \(\{(v',v_*') \in {\mathbb R}^d \times {\mathbb R}^d :v'+v_*' = v+v_*\}\).
-
(ii)
Collisional symmetry. For each \((v,v_*) \in {\mathbb R}^d\times {\mathbb R}^d\) the scattering kernel satisfies \(r(v,v_*;\mathop {}\!\mathrm {d}v',\mathop {}\!\mathrm {d}v_*')=r(v_*, v; \mathop {}\!\mathrm {d}v', \mathop {}\!\mathrm {d}v'_*)=r(v,v_*;\mathop {}\!\mathrm {d}v'_*,\mathop {}\!\mathrm {d}v')\).
-
(iii)
Non degeneracy of the scattering kernel. There exists a density \(B :{\mathscr {V}}\rightarrow {\mathbb R}_+\) such that \(r(v, v_*, \mathop {}\!\mathrm {d}v', \mathop {}\!\mathrm {d}v'_*)= B(v, v_*, w') \mathop {}\!\mathrm {d}w'\), where \(w'=(v' -v'_*)/\sqrt{2}\). Moreover, there exists \(c_0>0\) such that \(B(v, v_*, w')\ge c_0\exp \{- c_0 \, |w'|^2\}\).
-
(iv)
Gaussian tails. There exists \(C<+\infty \), \(\eta >0\), \(\gamma \in [0,2)\) such that for any v, \(v_* \in {\mathbb R}^d\)
$$\begin{aligned} \int \!dw'\, B(v,v_*,w') e^{\eta |w'|^2} \le C (1+ |v-v_*|^\gamma ). \end{aligned}$$ -
(v)
Point-wise bound. There exists a constant \(C<+\infty \) such that \( B(v,v_*,w')\le Ce^{C(|v|^2+|v_*|^2 + |w'|^2)}\).
We remark that we do not assume balance conditions. Observe that item (iv) implies
An example of a scattering kernel meeting the above conditions is \(B(v, v_*, w')= (1 + |v-v_*|) e^{-|w'|^2}\).
We denote by \(({\varvec{v}}(t))_{t\ge 0}\) the Markov process generated by \({\mathscr {L}}_N\). Let \(\Sigma _{N,0} := \big \{ {\varvec{v}} \in \Sigma _N :N^{-1}\sum _{k} v_k=0 \big \}\) be the subset of configurations with zero average velocity, that it is invariant by the dynamics in view of the conservation of the momentum. By the positivity of the collision rate (see Assumption 2.1, item (iii)), the Kac walk is ergodic when restricted to \(\Sigma _{N,0}\). We shall consider the Kac’s walk restricted to \(\Sigma _{N,0}\).
Fix hereafter \(T>0\). Given a probability \(\nu \) on \(\Sigma _{N,0}\) we denote by \({\mathbb P}_\nu ^N\) the law of the Kac walk on the time interval [0, T]. Observe that \({\mathbb P}_\nu ^N\) is a probability on the Skorokhod space \(D([0,T];\Sigma _{N,0})\). As usual if \(\nu =\delta _{{\varvec{v}}}\) for some \({\varvec{v}} \in \Sigma _{N,0}\), the corresponding law is simply denoted by \({\mathbb P}_{{\varvec{v}}}^N\).
2.2 Empirical Measure and Flow
We denote by \({\mathscr {P}}_0({\mathbb R}^d)\) the set of probability measures on \({\mathbb R}^d\) with zero mean. We consider \({\mathscr {P}}_0({\mathbb R}^d)\) as a closed subset of the space of probability measure with finite mean equipped with the \(W_1\) Wasserstein distance. Then \({\mathscr {P}}_0({\mathbb R}^d)\) endowed with the relative topology is a Polish space. The empirical measure is the map \(\pi ^N:\Sigma _{N,0} \rightarrow {\mathscr {P}}_0({\mathbb R}^d)\) defined by
Let \(D\big ([0,T]; {\mathscr {P}}_0({\mathbb R}^d)\big )\) the set of \({\mathscr {P}}_0({\mathbb R}^d)\)-valued cádlág paths endowed with the Skorokhod topology and the corresponding Borel \(\sigma \)-algebra. With a slight abuse of notation we denote also by \(\pi ^N\) the map from \(D\big ([0,T]; \Sigma _{N,0} \big )\) to \(D\big ([0,T]; {\mathscr {P}}_0({\mathbb R}^d)\big )\) defined by \(\pi ^N_t({\varvec{v}}(\cdot )):=\pi ^N({\varvec{v}}(t))\), \(t\in [0,T]\).
We denote by \({\mathscr {M}}\) the (closed) subset of \({\mathcal M}([0,T]\times ({\mathbb R}^d)^2\times ({\mathbb R}^d)^2\big )\) given by the measures Q that satisfy
The empirical flow is the map \(Q^N:D\big ([0,T]; \Sigma _{N,0} \big ) \rightarrow {\mathscr {M}}\) defined by
where \(F:[0,T]\times ({\mathbb R}^d)^2\times ({\mathbb R}^d)^2\rightarrow {\mathbb R}\) satisfies \(F(t;v,v_*,v',v'_*) =F(t;v_*,v,v',v'_*) = F(t;v,v_*,v'_*,v')\), and F is continuous and bounded, while \((\tau ^{i,j}_k)_{k\ge 1}\) are the jump times of the pair \((v_i,v_j)\). Finally, \(v_i(t-) = \lim _{s\uparrow t} v_i(s)\).
For each \({\varvec{v}} \in \Sigma _{N,0}\) with \({\mathbb P}^N_{{\varvec{v}}}\) probability one the pair \((\pi ^N,Q^N)\) satisfies the following balance equation that express the conservation of probability. For each \(\phi :[0,T]\times {\mathbb R}^d\rightarrow {\mathbb R}\) bounded, continuous, and continuously differentiable with respect to time
In view of the conservation of the momentum, the measure \(Q^N(\mathop {}\!\mathrm {d}t;\cdot )\) is supported on the hyperplane \({\mathscr {V}} \).
2.3 The Rate Function
Let \({\mathscr {S}}\) be the (closed) subset of \(D\big ([0,T]; {\mathscr {P}}_0({\mathbb R}^d)\big )\times {\mathscr {M}}\) given by elements \((\pi ,Q)\) that satisfies the balance equation
for each \(\phi :[0,T]\times {\mathbb R}^d\rightarrow {\mathbb R}\) continuous, bounded and continuously differentiable in t, with bounded derivative. We consider \({\mathscr {S}}\) endowed with the relative topology and the corresponding Borel \(\sigma \)-algebra.
For \(\pi \in D\big ([0,T]; {\mathscr {P}}_0({\mathbb R}^d)\big )\) let \(Q^\pi \) be the measure defined by
and observe that \(Q^\pi (\mathop {}\!\mathrm {d}t,\cdot )\) is supported on \({\mathscr {V}}\).
Definition 2.2
Let \({\mathscr {S}}_\mathrm {ac}\) be the subset of \({\mathscr {S}}\) given by the elements \((\pi ,Q)\) that satisfy the following conditions:
-
(i)
\(\pi \in C\big ([0,T];{\mathscr {P}}_0({\mathbb R}^d)\big )\);
-
(ii)
\(\sup _{t\in [0,T]} \pi _t (\zeta ) <+\infty \), where \(\zeta (v) = |v|^2\);
-
(iii)
\(Q\ll Q^\pi \).
Observe that by item (iv) of Assumption 2.1, condition (ii) implies that if \((\pi ,Q)\in {\mathscr {S}}_\mathrm {ac}\) then \(Q^\pi \) is a finite measure. Moreover, by choosing positive functions \(\phi \) not depending on t in the balance equation (2.6) and neglecting the loss term we obtain
Since \(Q\ll Q^\pi \) and, by Assumption 2.1, item (iii), the marginal on \(v'\) of \(Q^\pi \) is absolutely continuous with respect to the Lebesgue measure, we deduce that \(\pi _0\ll \mathop {}\!\mathrm {d}v\) implies \(\pi _t\ll \mathop {}\!\mathrm {d}v\), for any \(t\ge 0\). As a consequence, also Q is absolutely continuous with respect to the Lebesgue measure on \([0,T]\times {\mathscr {V}}\).
The dynamical rate function \(J:{\mathscr {S}} \rightarrow [0,+\infty ]\) is defined by
In order to obtain a large deviation principle, chaotic initial conditions are not sufficient but we need that the empirical measure at time zero satisfies a large deviation principle. Referring to [8] for a discussion on entropically chaotic initial conditions, we next provide an example of a class of allowed initial data.
Assumption 2.3
Given \(m\in {\mathscr {P}}_0({\mathbb R}^d)\) set \(\mu ^N=m^{\otimes N}\) and choose as initial distribution of the Kac’s walk the probability on \(\Sigma _{N,0}\) given by \(\nu ^N = \mu ^N (\ \cdot \ \vert \sum _{i} v_i = 0)\). We assume that m is absolutely continuous with respect to the Lebesgue measure and still denote by m its density. We furthermore assume that there exists \(\gamma >0\) such that
-
(i)
\(\int \mathop {}\!\mathrm {d}v\, m(v) \exp \{\gamma |v|^2\} < +\infty \);
-
(ii)
the Fourier transform of \(m(v)\exp \{\gamma |v|^2\}\) is in \(L^1({\mathbb R}^d)\);
-
(iii)
\(m(v)\ge \gamma \exp \{-\frac{1}{\gamma }|v|^2\}\).
Given two probabilities \(\mu _1, \mu _2 \in {\mathscr {P}}_0({\mathbb R}^d)\), the relative entropy \(H(\mu _2\vert \mu _1)\) is defined as \(H(\mu _2\vert \mu _1)=\int \mathop {}\!\mathrm {d}\mu _1 \rho \log \rho \), where \(\mathop {}\!\mathrm {d}\mu _2=\rho \, \mathop {}\!\mathrm {d}\mu _1\), understanding that \(H(\mu _2|\mu _1)=+\infty \) if \(\mu _2\) is not absolutely continuous with respect to \(\mu _1\).
Denoting by \(\hat{{\mathscr {S}}}\) the set of paths \((\pi , Q)\in {\mathscr {S}}\) such that
and letting \(I:{\mathscr {S}}\rightarrow [0,+\infty ]\) be the functional defined by
the large deviation principle for the Kac’s walk is stated as follows.
Theorem 2.4
Let \(\nu ^N\) as in Assumption 2.3. Then for each closed \(C \subset {\mathscr {S}}\), respectively each open \(A\subset {\mathscr {S}}\),
The proof of the upper bound does not rely on item (iii) of Assumption 2.1. In particular it holds also when the collision rate conserves the energy. Likewise, item (iii) in Assumption 2.3 is used only in the proof of the lower bound. We also remark that, if we replace item (iii) in Assumption 2.1 by the condition \(\int \mathop {}\!\mathrm {d}w' B(v, v_*, w') e^{\eta [|v'|^2 + |v'_*|^2]}\le C(1 + |v-v_*|^2)\), or the collision r has non degenerate density on \(({\mathbb R}^d)^4\), the lower bound holds in the whole \({\mathscr {S}}\). As we show in Proposition 5.1, the projection of I on the empirical measure coincides with the variational expression in [15, 18].
3 Upper Bound
The upper bound is achieved by an established pattern in large deviation theory. We first prove the exponential tightness, which allows us to reduce to compacts. By an exponential tilting of the measure, we prove an upper bounds for open balls and finally we use a mini-max argument to conclude.
Proposition 3.1
(Exponential tightness) There exists a sequence of compacts \(K_\ell \subset {\mathscr {S}}\) such that for any N
By standard compactness criteria (Banach-Alaoglu, Prokhorov and Ascoli-Arzelà theorems), the proof follows from the bounds in the next three lemmata.
Lemma 3.2
Let \(\zeta :{\mathbb R}^d \rightarrow [0,\, +\infty )\) be the function \(\zeta (v) = |v|^2\). Then
Lemma 3.3
Lemma 3.4
For each \(\varepsilon >0\) and \(\phi :{\mathbb R}^d\rightarrow {\mathbb R}\) continuous and bounded
Lemmata 3.2 and 3.4 imply that if the large deviation upper bound rate function is finite then \(\sup _{t\in [0,T]}\pi _t(\zeta )<+\infty \) and \(\pi \in C([0,T], {\mathscr {P}}_0({\mathbb R}^d))\), i.e. \(\pi \) meets the conditions in items (i), (ii) in Definition 2.2.
To deal with the initial conditions, as detailed in Assumption 2.3, we need the following elementary statement whose proof is omitted.
Lemma 3.5
Pick \(\phi \in C({\mathbb R}^d)\) such that \(m(e^{\phi })<+\infty \), and let \(m_\phi \) be the probability on \({\mathbb R}^d\) defined by \(m_\phi (\mathop {}\!\mathrm {d}v) = m(\mathop {}\!\mathrm {d}v) e^{\phi (v)} /m(e^{\phi })\). Then
where \(f_N^\phi \) and \(f_N\) are the densities of \(N^{-1/2}\sum _i v_i\) with \(v_i\) i.i.d. with law \(m_\phi \) and m respectively.
Proof of Lemma 3.2
Given \(\gamma >0\) to be chosen later, let \(\Psi ({\varvec{v}})=\gamma \sum _{k} |v_k|^2\) and set
By e.g. [14, App. 1, Prop. 7.3], \({\mathbb M}^\Psi \) is a positive super martingale, in particular for any bounded stopping time \(\tau \) and any \({\varvec{v}}_0\in \Sigma _{N,0}\) \({\mathbb E}_{{\varvec{v}}_0}^N \big [{\mathbb M}^\Psi _\tau \big ]\le 1\). By simple computations, in view of Assumption 2.1, item (iv), we can choose \(\gamma >0\) such that there exists a constant c such that for any \(N > 1\)
Set \(\tau _\ell :=\inf \{t>0 : \pi _t^N(\zeta )> \ell \}\wedge T\), then
To complete the proof we show that Assumption 2.3 implies that, possibly by redefining \(\gamma >0\), there exists a constant c such that for any N
In order to prove this bound, we apply Lemma 3.5 with \(\phi (v)=\gamma |v|^2 +\alpha \cdot v\), where \(\alpha \in {\mathbb R}^d\) is chosen so that \(m_\phi \) is centered. Observing that with \(\nu ^N\)-probability one \(\pi ^N(\gamma |v|^2 +\alpha \cdot v)=\gamma \pi ^N(|v|^2)\), we get
By Assumption 2.3, the densities \(f_N^\phi \) and \(f_N\) satisfy the local central limit theorem, see e.g. [11, Ch. XV.5, Thm. 2]). In particular \(\frac{1}{N} \log f^\phi _N(0)\) and \(\frac{1}{N} \log f_N(0)\) vanish as \(N\rightarrow +\infty \). The proof or (3.5) is thus achieved.\(\square \)
Proof of Lemma 3.3
Given \(\ell \), \(h>0\), set \(B_{\ell ,h}:=\{(\pi ,Q):\, \sup _{t}\pi _t(\zeta )\le h, Q(1)>\ell \}\), with \(\zeta (v)= |v|^2\). In view of the previous lemma, it is enough to show that for each \(h>0\)
Recall that the scattering rate \(\lambda \) has been defined in (2.1). Given a bounded measurable function \(F:[0,T] \times {\mathbb R}^{4d}\rightarrow {\mathbb R}\) such that \(F(t; v,v_*, v',v_*')=F(t; v_*,v, v',v_*') =F(t; v,v_*, v'_*,v')\), let
Denoting by \(Q^N_{[0,t]}\) the restriction of the measure \(Q^N\) on [0, t], and setting \(\Lambda (v)=\lambda (v,v)\), \(\Lambda ^F(t,v)=\lambda ^F(t;v,v)\), \(v\in {\mathbb R}^d\), the process
is a \({\mathbb P}^N_{{\varvec{v}}}\) positive super-martingale for each \({\varvec{v}}\in \Sigma _{N,0}\), see e.g. [14, App. 1, Prop. 2.6]. Choosing \(F=\gamma \), with \(\gamma \) a positive constant, for each \(\ell >0\)
where in last inequality we have used (2.2).\(\square \)
Proof of Lemma 3.4
In view of the balance equation (2.6) and Lemma 3.2, it is enough to show that for any \(h>0\) there exists a function \(c:(0,1)\rightarrow {\mathbb R}_+\) with \(c(\delta )\uparrow +\infty \) as \(\delta \downarrow 0\) such that, for any \(\varepsilon >0\)
with \(\zeta (v)=|v|^2\). By a straightforward inclusion of events, the previous bound follows from
Consider the super-martingale (3.7) with \(F=\gamma \,{1 \mathrm{I}}_{[t, t+\delta ]}\), \(\gamma >0\). Using the same argument of the previous lemma and (2.2) we deduce
The proof is concluded by choosing \(\gamma = \log (1/\delta )\).\(\square \)
3.1 Upper Bound on Compacts
Given a bounded continuous function \(\phi \) on \({\mathbb {R}}^d\) we define the probability measure \(m_\phi \) on \({\mathbb {R}}^d\) by \(m_\phi (\mathop {}\!\mathrm {d}v) = m(\mathop {}\!\mathrm {d}v) e^{\phi }/m(e^\phi )\) and we set \(\mu _\phi ^N=m_\phi ^{\otimes N}\). Moreover, recall the definition (3.6) of \(\lambda ^F\).
Lemma 3.6
For any \((\phi ,F)\in C_b({\mathbb {R}}^d)\times C_b({\mathbb {R}}^+\times {({\mathbb {R}}^{d})}^2\times {({\mathbb {R}}^{d})}^2)\) such that \(m_\phi \) is centered and \(F(t;v,v_*,v',v'_*)=F(t;v_*,v,v',v'_*)=F(t;v,v_*,v'_*,v')\), and any measurable subset \(B\subset {\mathscr {S}}\)
where
Proof
Consider the perturbed initial distribution \({{{\tilde{\nu }}}}^N = \mu _\phi ^N (\ \cdot \ \vert \sum _{i} v_i = 0)\). Recalling the definition of the super-martingale \({\mathbb N}^F_t\) in (3.7), we write
Recalling that \(\Lambda (v):=\lambda (v,v)\), and using Lemma 3.5, we get
where in the last inequality we used (2.2). The statement is achieved by observing that \({\mathbb E}_{{{{\tilde{\nu }}}}^N}^N ( {\mathbb N}_T^F)\le 1\), and noting that by the local central limit theorem both \(\frac{1}{N} \log f^\phi _N(0)\) and \(\frac{1}{N} \log f_N(0)\) vanish as \(N\rightarrow +\infty \).
Recall that \(H(\cdot |m)\) denotes the relative entropy and let J be the functional defined in (2.8).
Proposition 3.7
(Variational characterization of the rate functional) For any pair \((\pi ,Q)\in {\mathscr {S}}\) satisfying (i) and (ii) in Definition 2.2
In the first formula the supremum is carried out over the continuous and bounded \(\phi :{\mathbb R}^d\rightarrow {\mathbb R}\) such that the probability \(m_\phi \) (as defined in Lemma 3.5) is centered. In the second formula the supremum is carried out over all continuous and bounded \(F:[0,T]\times ({\mathbb R}^d)^2\times ({\mathbb R}^d)^2 \rightarrow {\mathbb R}\) such that \(F(t;v,v_*,v',v'_*)=F(t;v_*,v,v',v'_*)=F(t;v,v_*,v'_*,v')\).
Since the set of \(\pi \) satisfying the condition in Definition 2.2, items (i), (ii), is a closed subset of \(D([0,T]; {\mathscr {P}}_0({\mathbb R}^d))\), the previous characterization of the rate functional readily implies the lower semicontinuity of I; moreover, if \(\sup _F\{ Q(F)-\frac{1}{2} \int _0^T \mathop {}\!\mathrm {d}t \, \pi _t\otimes \pi _t(\lambda ^F-\lambda ) \} < + \infty \) then \(Q\ll Q^\pi \), i.e., \((\pi ,Q)\in {\mathscr {S}}_\mathrm {ac}\).
Proof
The first statement follows from the variational characterization of the relative entropy and the observation that since \(\pi _0\) is centered it is enough to consider \(\phi \) satisfying the stated constraint.
To prove the second statement, recall the definition of \(Q^\pi \) in (2.7) and observe that
This implies that if \(\sup _F \Big [Q(F)-\frac{1}{2} \int _0^T \mathop {}\!\mathrm {d}t \, \pi _t\otimes \pi _t(\lambda ^F-\lambda )\Big ]\) is finite, then Q is absolutely continuous with respect to \(Q^\pi \).
The proof is now completed by a direct computation, see Lemma 4.4 in [3]. \(\square \)
Proof of Theorem 2.4: upper bound
By the exponential tightness in Proposition 3.1, to prove the upper bound it is enough to show the statement for compacts. By Lemma 3.6 and a mini-max argument, see e.g. [14, App.2, Lemma 3.2], the upper bound holds with the functional
Finally, by Proposition 3.7, \({\hat{I}} = I\). \(\square \)
4 Lower Bound
In order to obtain the large deviation lower bound, given \((\pi , Q)\) we need to produce a perturbation of the dynamics such that the law of large number for \((\pi ^N, Q^N)\) is \((\pi , Q)\). While the compactness of \((\pi ^N, Q^N)\) follows from the arguments of the previous section, in order to identify the limit point we need uniqueness of the perturbed Boltzmann-Kac equation that we are able to prove only if the perturbed scattering rate is bounded. Therefore, we shall first prove the lower bound for open neighborhoods of “nice” \((\pi , Q)\), and then use a density argument, that will be completed with the restriction that Q has bounded second moment.
4.1 Perturbed Kac Walks
We start by the following law of large numbers for a class of perturbed Kac’s walks. Consider perturbed time-dependent collision rates \({{\tilde{r}}}_t\), with density \({{\tilde{B}}}_t\), i.e.
which we assume to meet condition (iv) in Assumption 2.1 uniformly for \(t\in [0,T]\), and to satisfy the following extra condition. There exists \(C<+\infty \) such that for any \(t, v, v_* \in [0,T] \times ({\mathbb {R}}^d)^2\)
Given a probability \(\nu \) on \(\Sigma _{N,0}\) we denote by \(\tilde{{\mathbb P}}_\nu ^N\) the law of the perturbed Kac walk with initial datum \(\nu \).
Lemma 4.1
Fix a sequence of initial conditions \(\nu ^N\) as in Assumption 2.3. As \(N\rightarrow \infty \), the pair \((\pi ^N,Q^N)\) converges, in \(\tilde{{\mathbb P}}_{\nu ^N}^N\) probability, to \((f \mathop {}\!\mathrm {d}v\,, q\, \mathop {}\!\mathrm {d}t \mathop {}\!\mathrm {d}v\mathop {}\!\mathrm {d}v_* \mathop {}\!\mathrm {d}w')\), where \(q_t(v,v_*,w') =\frac{1}{2} f_t(v) f_t(v_*) {{\tilde{B}}}_t(v,v_*,w')\) and \(f\in C\big ([0,T]; L^1({\mathbb R}^d)\big )\) is the unique solution to the perturbed Kac’s equation
where \(v'= \frac{v+v_*}{2} + \frac{w'}{\sqrt{2}}\), \(v_*'= \frac{v+v_*}{2} - \frac{w'}{\sqrt{2}}\), and \(w = \frac{v-v_*}{\sqrt{2}}\). Here we understand that (4.3) holds by integrating against continuous, bounded test functions which are continuous differentiable in time.
Proof
Observe that the large deviation upper bound proven in the previous section holds also for the perturbed Kac’s walk. The exponential tightness implies that the sequence \(\{\widetilde{{\mathbb P}}_{\nu ^N}^N\circ (\pi ^N,Q^N)^{-1}\}\) is precompact in \({\mathscr {P}}_0 ({\mathscr {S}})\). Moreover, by the large deviation upper bound, any cluster point \(\tilde{{\mathcal P}}\) of this sequence satisfies \(\tilde{{\mathcal P}}\big ( \big \{ (\pi ,Q) :{{\tilde{I}}}(\pi ,Q) =0 \} \big )=1\) where the rate function \({{\tilde{I}}}\) is defined as I in (2.8), (2.10), with the rate r replaced by the perturbed rate \({{\tilde{r}}}\). Since \({{\tilde{I}}} (\pi , Q)< +\infty \) we get \((\pi , Q)\in {\mathscr {S}} _{\mathrm {ac}}\), in particular \(\pi _t(\mathop {}\!\mathrm {d}v) = f_t(v) \mathop {}\!\mathrm {d}v\) and \(Q(\mathop {}\!\mathrm {d}t;\mathop {}\!\mathrm {d}v,\mathop {}\!\mathrm {d}v_*,\mathop {}\!\mathrm {d}v',\mathop {}\!\mathrm {d}v_*') = q_t(v,v_*;w') \mathop {}\!\mathrm {d}t \mathop {}\!\mathrm {d}v \mathop {}\!\mathrm {d}v_* \mathop {}\!\mathrm {d}w'\) for some densities f and q. Then \({{\tilde{I}}} (\pi , Q)=0\) implies \(Q={{\tilde{Q}}} ^\pi \), where \({{\tilde{Q}}} ^\pi \) is defined as in (2.7) with r replaced by \({{\tilde{r}}}\). The balance equation (2.6) thus amounts to the weak formulation of (4.3).
It remains to show that \(f\in C([0,T]; L^1({\mathbb R}^d))\) and that the solution to (4.3) is unique. Choosing test functions independent of time and integrating (4.3) we deduce that for each \(t\in [0,T]\) and Lebesgue almost every v it actually holds
Since \({{\tilde{\lambda }}}\) is bounded, it is now straightforward to show that \(f\in C([0,T]; L^1({\mathbb R}^d))\). Indeed, letting C be the constant in (4.2), from (4.4) we get \(\Vert f_t-f_s \Vert _{L_1}\le 2C (t-s)\) for \(0\le s\le t\le T\). Finally, using again (4.2), uniqueness is achieved by applying Gronwall’s lemma to (4.4) . \(\square \)
The collection of “nice” \((\pi , Q)\) is specified as follows.
Definition 4.2
Let \(\tilde{{\mathscr {S}}}\) be the collection of elements \((\pi , Q)\in {\mathscr {S}}_{\mathrm {ac}}\) whose densities (f, q) are such that
and
for some \(\eta >0\).
Given \((\pi , Q)\in \tilde{{\mathscr {S}}}\), denote by \({{\tilde{r}}}_t\) the time dependent perturbed rate whose density is defined by
that meets the condition (iv) in Assumption 2.1 uniformly for \(t\in [0,T]\) and the extra assumption (4.2).
The next statement provides the large deviation lower bound for neighborhood of elements in \(\tilde{{\mathscr {S}}}\).
Proposition 4.3
Let \((\pi , Q)\in \tilde{{\mathscr {S}}}\). Assume that \(\pi _0\) satisfies items (i), (ii) in Assumption 2.3, and suppose \(\pi _0(\mathop {}\!\mathrm {d}v)=e^\phi m(\mathop {}\!\mathrm {d}v)/m(e^\phi )\) for some \(\phi \) bounded and continuous. Moreover, denote by \({{\tilde{\nu }}}^N=\pi _0^{\otimes N}(\cdot \vert \sum _i v_i=0)\) the corresponding probability on \(\Sigma _{N,0}\). Then
We premise the following lemma.
Lemma 4.4
Set \(\zeta (v) = |v|^2\) and \({\bar{F}} (t,v,v_*,w')={\bar{F}}(w')=1+ |w'|^2\), then
Proof
We write
The first bound in the statement is achieved by observing that Lemma 3.2 holds also for the perturbed chain.
In order to prove the second bound, let
that it is a \(\tilde{{\mathbb P}}_{{{\tilde{\nu }}}^N}\) martingale, with predictable quadratic variation
In view of (4.7) and (4.6) in Definition 4.2, the random variable \(\langle {\tilde{M}}^{N} \rangle _T \) is uniformly bounded in N. This completes the proof. \(\square \)
Proof of Proposition 4.3
We first prove that
By Lemma 3.5,
where, by the local central limit theorem, the last term on the right hand side vanishes as \(N\rightarrow +\infty \). As a corollary of Lemma 4.1 we deduce that \(\pi ^N\) converges in \({{\tilde{\nu }}}^N\)-probability to \(\pi _0\). Hence, in view of assumptions on \(\phi \), we deduce
We now show that
In view of the assumptions on \({{\tilde{r}}}_t\) the super-martingale defined in (3.7) with \(F_t=\log (\mathop {}\!\mathrm {d}{{\tilde{r}}}_t/ \mathop {}\!\mathrm {d}r)\) is actually a martingale and its value at time T is the Radon-Nykodim derivative of \(\tilde{{\mathbb P}} ^N_{{{\tilde{\nu }}}^N}\) with respect to \({\mathbb P} ^N_{{{\tilde{\nu }}}^N}\). Since \(\lambda _t^F={{\tilde{\lambda }}}_t\) and \(\Lambda ^F_t(v)={{\tilde{\lambda }}}_t(v,v)=:{{\tilde{\Lambda }}}_t\),
By definition of \(\tilde{{\mathscr {S}}}\)
where we used Assumption 2.1, item (iii).
Now observe that, by Lemma 4.1, \((\pi ^N,Q^N)\) converges to \((\pi ,Q)\) in \(\tilde{{\mathbb P}}^N_{{{\tilde{\nu }}}^N}\) probability. Moreover, Lemma 4.4 provides sufficient conditions for the uniform integrability of \(Q^N(F)\) and of \(\int _0^T\!\! \mathop {}\!\mathrm {d}t\, \pi _t^N\otimes \pi _t^N(\lambda )\). Finally, by the boundedness of \({{\tilde{\lambda }}}_t\) and the absolutely continuity of \(\pi _t\) we obtain
Recalling (2.10), the statement follows from (4.8) and (4.9). \(\square \)
By general results, see e.g. [16], the previous proposition implies the following lower bound statement.
Corollary 4.5
Let \(\nu ^N\) be as in Assumption 2.3 and set
Then the sequence \(\big \{{\mathbb P}^N_{\nu ^N}\circ (\pi ^N,Q^N)^{-1}\big \}\) satisfies a large deviations lower bound with rate function \({{\tilde{I}}} \).
4.2 Approximating Paths with Bounded Rate Function
Recall that the set \(\hat{{\mathscr {S}}}\) has been defined in (2.9).
Theorem 4.6
For each \((\pi , Q)\in \hat{{\mathscr {S}}}\) such that \(I(\pi ,Q) < +\infty \) there exists a sequence \(\{(\pi _n,Q_n)\}\subset \tilde{{\mathscr {S}}}\cap \hat{{\mathscr {S}}}\) satisfying \((\pi _n,Q_n)\rightarrow (\pi , Q)\) and \(I(\pi _n,Q_n)\rightarrow I(\pi , Q)\).
The lower bound in Theorem 2.4 follows directly from Corollary 4.5 and the above theorem. In order to prove it, we premise the following lemma.
Lemma 4.7
Let \((\pi , Q)\in \hat{{\mathscr {S}}}\) be such that \(I(\pi , Q)<+\infty \). Denote by (f, q) the densities of \((\pi , Q)\). Moreover, set \(\sigma :=B(v,v_*,w')/g(w')\) and \(\Phi := \log \big ( \frac{2\, q(v,v_*,w')}{f(v) f(v_*) g(w')}\big )\), where g is the standard Gaussian density on \({\mathbb R}^d\). Then
-
(i)
\(\displaystyle \int Q(\mathop {}\!\mathrm {d}t; \mathop {}\!\mathrm {d}v,\mathop {}\!\mathrm {d}v_*,\mathop {}\!\mathrm {d}v',\mathop {}\!\mathrm {d}v'_*)\big (|v|^2+|v_*|^2+|v'|^2+|v_*'|^2\big ) < +\infty \).
-
(ii)
\(\displaystyle I(\pi , Q)= H(\pi _0\vert m) + Q(\Phi ) - Q(\log \sigma ) -Q(1) + Q^\pi (1)\).
Proof
We start by proving
By Assumption 2.1, item (iv), there exists \(\eta >0\) such that
Recall that \(I(\pi , Q)= H(\pi _0\vert m) + J(\pi , Q)\), where J is defined in (2.8). By the variational representation of J in Proposition 3.7, for any F bounded and continuous
By a truncation argument, we can choose \(F=\eta |v'-v'_*|^2\), and thus deduce (4.10).
Let \(\zeta _n(v)=|v|^2\wedge n\). By using the continuity equation we get
By item (ii) in Definition 2.2, (2.9), and (4.10), taking the limit \(n\rightarrow \infty \) we deduce
which, together with the conservation of the momentum, implies the statement (i).
In view of Assumption 2.1, items (iii) and (v), the statement (ii) follows from (i). \(\square \)
Proof of Theorem 4.6
Observe that by the lower semicontinuity of I, for any sequence \((\pi _n, Q_n)\rightarrow (\pi , Q)\) we have \(\varliminf _n I(\pi _n, Q_n)\ge I(\pi , Q)\). The converse inequality is achieved by combining steps 1 and 2 below and a standard diagonal argument.
Step 1 - Convolution. Since \(I(\pi , Q)<+\infty \), there exist (f, q) such that \(d\pi _t=f_t(v)\mathop {}\!\mathrm {d}v\) and \(dQ= q_t(v,v_*, w')\mathop {}\!\mathrm {d}t \mathop {}\!\mathrm {d}v \mathop {}\!\mathrm {d}v_* \mathop {}\!\mathrm {d}w'\) where \(w=(v-v_*)/\sqrt{2}\).
Given \(0<\delta <1\), let \(g_\delta \) be the Gaussian kernel on \({\mathbb R}^d\) with variance \(\delta \) and define
We now show that the pair \((f^\delta , q^\delta )\) satisfies the balance equation. Given a test function \(\phi \) and denoting by \(*\) the convolution
One can repeat the same argument with \(\phi (v)\) replaced by \(\phi (v_*)\). Moreover, since \(g_\delta (v)g_\delta (v_*)g_\delta (w')=g_\delta (v')g_\delta (v'_*)g_\delta (w)\), where \(w=(v-v_*)/\sqrt{2} \), it holds also when \(\phi (v)\) is replaced by \(\phi (v')\), \(\phi (v'_*)\). Using the balance equation for the pair (f, q) with the test function \(g_\delta *\phi \) we deduce that \((f^\delta , q^\delta )\) satisfies the balance equation.
Now we show that
To this end, we use the decomposition provided by item (ii) of Lemma (4.7). We start by observing that, in view of (2.2) and \(\sup _{t}\pi _t(\zeta )< +\infty \), \(\zeta (v)= |v|^2\), by dominated convergence
Analogously, since \(|\log \sigma |\le C(|v|^2 + |v_*|^2 + |w'|^2)\), in view of Lemma 4.7, item (i)
By the convexity of the map \([ 0, +\infty )^2 \ni (a,b)\mapsto a\log (a/b)\) and Jensen’s inequality
By Lemma 4.7, item (i),
Gathering the above statements we deduce
To conclude the proof of (4.11) it remains to show that
By item (iii) in Assumption 2.3, since \(f_0\) has bounded second moment, we have
The bound (4.12) follows by using item (iii) in Assumption 2.3 and Jensen’s inequality.
Step 2 - Truncation of the flux. Given a pair (f, q) that satisfies the balance equation, we denote by \(q_t^{(i)}\), \(i=1,\ldots ,4\) the marginal of \(q_t\) respectively on \(v,v_*,v',v_*'\). Then \(q_t^{(1)}= q_t^{(2)}\), \(q_t^{(3)}= q_t^{(4)}\), and the balance equation reads
In the sequel we assume (f, q) such that \(I(f,q)<+\infty \), f strictly positive on compacts uniformly in time, and \(q_t^{(3)}\in L^1([0,T]; L^2({\mathbb R}^d))\). Observe that pair \((f^\delta , q^\delta )\) constructed in Step 1 meets the above conditions. Indeed, by item (ii) in Definition 2.2, for each compact subset \(K\in {\mathbb R}^d\) and \(\delta \in (0,1)\), there exists \(c_{K,\delta } >0\) such that, for any \(t\in [0,T]\) we have \(\inf _{v\in K} g_\delta * f_t \ge c_{K, \delta }\). Moreover, by Young inequality, \(\int _0^T \mathop {}\!\mathrm {d}t\,\Vert q_t^{\delta , (3)}\Vert _{L^2} \le c_\delta Q(1)\).
Given \(\ell >0\), set \(\Omega _\ell \) the subset of \({\mathscr {V}}\) given by
and define \(({{\tilde{f}}}^\ell , {{\tilde{q}}}^\ell )\) by
Set \((f^\ell , q^\ell ) = c_\ell ({{\tilde{f}}}^\ell , {{\tilde{q}}}^\ell )\) where
Observe that
The previous identity implies that if \(|v|\le \ell \), then \(f^\ell _t \ge c_\ell f_t \), while if \(|v|> \ell \), then \(f^\ell _t = c_\ell f_0\). In particular, for any t, \(f_t^\ell >0\), and \(\int f_t^\ell = 1\). Observe that by construction the pair \((f^\ell , q^\ell )\) satisfies the balance equation and it is an element of \(\tilde{{\mathcal {S}}}\) (see definition 4.2), since \(q^\ell \) is bounded and compactly supported and f is strictly positive on compacts, uniformly in time. Moreover, \((f^\ell , q^\ell )\) converges to (f, q).
Now we prove that
We start by proving that
By definition
where \({\bar{h}}^\ell = \frac{ h^\ell }{\int h^\ell }\) where
Since \(c_\ell \rightarrow 1\), by the convexity of \(H(\cdot | m)\) it is enough to show
Observe that
Since, by assumption on \(q^{\ell , (3)}\), \(h^\ell \in L^2\) and it converges to zero point-wise, using item (iii) of Assumption 2.3, we deduce (4.15) by dominated convergence.
It remains to show that
Recalling the scattering rate \(\lambda \) defined in (2.1), we rewrite
Since \( \int q|\log (2\, q/ff_*B)| <+\infty \), using the bound \(f^\ell \ge c_\ell f\) for \(|v|\le \ell \), by dominated convergence
Recalling (2.2), item (i) in Lemma 4.7 implies the uniform integrability of \(\lambda \) with respect to \(f^\ell f^\ell _*\), hence
which concludes the proof of (4.16). \(\square \)
4.3 Example of \((\pi , Q)\in {\mathscr {S}}\setminus \hat{{\mathscr {S}}}\) with Finite Rate Function
In Theorem 2.4 the upper and lower bounds match only for \((\pi , Q)\in \hat{{\mathscr {S}}}\). Here we provide an example of \((\pi ,Q)\in {\mathscr {S}}\setminus \hat{{\mathscr {S}}}\) with finite cost.
Fix \(T>1\). Recalling that g is the standard Gaussian density on \({\mathbb R}^d\), consider the pair (f, q) given by
where \(A^{-1}=\int \mathop {}\!\mathrm {d}w'\frac{1}{1+|w'|^{d+3}} \), and
where
For \(t\in [1,T]\), \(q_t\) is invariant with respect to \((v, v_*, v', v_*')\mapsto (v', v'_*, v, v_*)\). Hence, by construction, the pair \((\pi , Q)\) whose densities are (f, q) satisfies the balance equation (2.6). Moreover, \((\pi , Q)\in {\mathscr {S}}_{\text {ac}}\) and \((\pi , Q)\notin \hat{{\mathscr {S}}}\). We next show that \(I(\pi , Q)<+\infty \). By item (iii) in Assumption 2.3, \(H(g|m)< +\infty \) and, by construction, Q(1) and \(Q^{\pi }(1)\) are both finite.
We observe that, since \(f_t(v)\ge (1-t) g(v)\), \(t\in [0,1]\), by item (iii) in Assumption 2.1
which is finite. Moreover, for \(t\in [1,T]\)
Since there exists C such that \( h(v)\ge C \frac{1}{1+ |v|^{d+3}}, \) we deduce
The previous bounds imply \(I(\pi , Q)<+\infty \).
5 Projection on the Empirical Measure
In this section we analyze the large deviation asymptotics of the empirical measure only. By contraction principle, the corresponding rate function is obtained by projecting the joint rate function I. Regarding the upper bound, we prove that this projection corresponds to the rate function in [7, 15, 18]. For the lower bound, we identify the projection of I only for suitable \(\pi \).
Let \(I_1:D([0,T]; {\mathscr {P}}_0({\mathbb R}^d))\rightarrow [0, +\infty ]\) be defined by \(I_1(\pi )= H(\pi _0|m)+ J_1(\pi )\), when \(\pi \) meets conditions (i) and (ii) in Definition 2.2, and \(I_1(\pi )=+\infty \) otherwise. Here
where \({\bar{\nabla }} \phi =\phi (v')+\phi (v'_*)-\phi (v)-\phi (v_*)\) and the supremum is carried over the functions \(\phi :[0,T]\times {\mathbb R}^d\rightarrow {\mathbb R}\) continuous, bounded and continuously differentiable in t with bounded derivative.
Proposition 5.1
For any \(\pi \in D([0,T]; {\mathscr {P}}_0({\mathbb R}^d))\)
Moreover, if \(\pi \) is such that the supremum in (5.1) is achieved, then
Proof
Recalling Proposition 3.7, the proof of (5.2) is achieved by choosing \(F={\bar{\nabla }} \phi \) in (3.10). To prove the second statement, we first note that if the supremum is achieved at some \(\phi \), then for any \(\psi \) continuous, bounded and continuously differentiable in t with bounded derivative
Recalling (2.8), by choosing \(Q=Q_\phi \), a direct computation implies \(\inf _Q J(\pi , Q)\le J(\pi , Q_\phi )=J_1(\pi )\). \(\square \)
6 Gradient Flow Formulation of the Boltzmann-Kac Equation
Assuming the detailed balance condition, here we derive the gradient flow formulation of the Boltzmann-Kac equation (1.1) associated to the large deviation rate function (1.3). We remark that such formulation is logically independent of the validity of the large deviation principle.
Let M be the standard Maxwellian on \({\mathbb R}^d\). In this section we assume that the collision rate r satisfies the following detailed balance condition
This implies that the Kac walk on \(\Sigma _{N}\) is reversible with respect to the product measure \(\prod _{k=1}^NM(\mathop {}\!\mathrm {d}v_k)\). We still consider the Kac walk restricted to \(\Sigma _{N,0}\), then the corresponding reversible measure is the product measure \(\prod _{k=1}^NM(\mathop {}\!\mathrm {d}v_k)\) conditioned to \(N^{-1}\sum _k v_k =0\), that is a Gaussian measure on \(\Sigma _{N,0}\).
In this section we express the empirical measure and flow in terms of their densities with respect to Maxwellians.
Let \({\mathcal {H}} :{\mathscr {P}}_0({\mathbb R}^d)\rightarrow [0, +\infty ]\) be the relative entropy with respect to M, i.e. \({\mathcal {H}}(\pi ):= H(\pi \vert M)\). For \(\pi \in {\mathscr {P}}_0({\mathbb R}^d)\) with bounded second moment, define the non linear Dirichlet form \({\mathcal D}:{\mathscr {P}}_0({\mathbb R}^d)\rightarrow [0, +\infty ]\) as the lower semicontinuous map defined by
where the supremum is carried out over the continuous and bounded functions \(\phi :{\mathbb R}^d\rightarrow {\mathbb R}\). Note that \({\mathcal {D}}(\pi )\) is well defined in view of (2.2). To illustrate this definition, consider the Markov generator \({\mathcal L}\) acting on functions \(\xi :{\mathbb R}^d \times {\mathbb R}^d\rightarrow {\mathbb R}\) as
By the detailed balance condition, \({\mathcal L}\) is reversible with respect to the product measure \(M(\mathop {}\!\mathrm {d}v)\, M(\mathop {}\!\mathrm {d}v_*)\). The variational representation (6.2) thus corresponds to the Donsker-Varadhan functional \({\mathcal E}\), that it is defined on the probabilities on \({\mathbb R}^d \times {\mathbb R}^d\) by
where the supremum is carried out over the continuous and bounded functions \(\xi :{\mathbb R}^d\times {\mathbb R}^d\rightarrow {\mathbb R}\). Then \({\mathcal D} (\pi ) = {\mathcal E}(\pi \times \pi )\), observe indeed, as proven in Lemma 6.2 below, that for product measures \(\Pi \) we can restrict the class test functions \(\xi \) to functions of the form \(\xi (v,v_*) = \phi (v)+\phi (v_*)\).
On the set of functions \(G:{\mathbb R}^d \times {\mathbb R}^d \times {\mathbb R}^d \times {\mathbb R}^d \rightarrow {\mathbb R}\), let \(\Upsilon \) be the involution defined by \((\Upsilon G) \,(v,v_*,v',v_*') := G (v',v_*',v,v_*)\). Recalling the definition of \(Q^\pi \) in (2.7), we define the kinematic term as the lower semicontinuous functional \({\mathcal R} \) on the pairs \((\pi , Q)\) satisfying conditions (i) and (ii) in Definition 2.2
where the supremum is carried out over the bounded and continuous \(F,\alpha :[0,T] \times \big ({\mathbb R}^d \big )^2 \times \big ({\mathbb R}^d \big )^2 \rightarrow {\mathbb R}\) satisfying \(F(t,v,v_*,v', v'_*)=F(t,v_*,v,v', v'_*)=F(t,v,v_*,v'_*, v')\), \(\alpha (t,v,v_*,v', v'_*)=\alpha (t,v_*,v,v', v'_*)=\alpha (t,v,v_*,v'_*, v')\), and \(\inf \alpha >0\).
The main result of this section provides, when the detailed balance condition (6.1) holds, a gradient flow formulation of the Boltzmann-Kac equation. Recall that the functionals J and I have been introduced in (2.8) and (2.10) and that \(\hat{{\mathscr {S}}}\) is the set of paths \((\pi , Q)\in {\mathscr {S}}_{\mathrm {ac}}\) that satisfy (2.9).
Theorem 6.1
Assume that \({\mathcal H}(\pi _0) < +\infty \). For each \((\pi ,Q)\in \hat{{\mathscr {S}}}\)
In particular, when the scattering rate \(\lambda \) is bounded, \(I(\pi ,Q)=0\) if and only if \(\pi _0=m\) and
We start by the following characterization of the Dirichlet form \({\mathcal D}\) and the kinematic term \({\mathcal R}\) in which we recall that \({\mathcal V}\) is the hyperplane of \(({\mathbb R}^d)^2\times ({\mathbb R}^d)^2\) defined by \(v+v_*= v'+v_*'\) and \(r(v,v_*;\mathop {}\!\mathrm {d}v_*,\mathop {}\!\mathrm {d}v_*')= \sigma (v,v_*,w') M(\mathop {}\!\mathrm {d}w')\) where \(w'=(v'-v_*')/\sqrt{2}\).
Lemma 6.2
Let \(\pi \in {\mathscr {P}}_0({\mathbb R}^d)\) be such that \(\pi (\mathop {}\!\mathrm {d}v) = h(v) M(\mathop {}\!\mathrm {d}v)\), \(\pi (\zeta ) <+\infty \), \(\zeta (v) = |v|^2\), and \({\mathcal D}(\pi ) <+\infty \). Then
and
Moreover, set \(d\pi _t= h_t \mathop {}\!\mathrm {d}M\), \(\mathop {}\!\mathrm {d}Q=\mathop {}\!\mathrm {d}t\,p_t(V,w,w') M(\mathop {}\!\mathrm {d}V) M(\mathop {}\!\mathrm {d}w) M(\mathop {}\!\mathrm {d}w')\), and \( r(v,v_*;\mathop {}\!\mathrm {d}v',\mathop {}\!\mathrm {d}v_*') = {\bar{\sigma }}(V;w,w') M(\mathop {}\!\mathrm {d}w')\) where \(V=(v+v_*)/\sqrt{2}\), \(w=(v-v_*)/\sqrt{2}\), and \(w=(v'-v_*')/\sqrt{2}\). Then, if \({\mathcal R}(\pi , Q) <+\infty \),
Proof
We first note that
Indeed, by standard arguments, see e.g. [14, App. 1, Thm. 10.2], \({\mathcal D}(\pi )\) is bounded above by the right hand side in the previous displayed formula. The converse inequality is obtained by choosing as test function a sequence of continuous and bounded \(\phi ^n\) that converges to \(\frac{1}{2} \log h\).
Recalling (2.2), the proof of (6.6) is achieved by expanding the square on the right hand side of (6.9) and using the detailed balance condition. The representation (6.7) now follows directly by (6.9). Finally, by (6.6) and using that \(Q(1)<+\infty \), the representation (6.8) is achieved by a direct computation. \(\square \)
Proof of Theorem 6.1
Recalling that \(\tilde{{\mathscr {S}}}\) has been introduced in (4.2), we show that (6.4) holds for \((\pi ,Q)\in \tilde{{\mathscr {S}}}\). We write \(\pi _t(dv)= h_t(v) M(\mathop {}\!\mathrm {d}vv)\), \(Q(\mathop {}\!\mathrm {d}t;\mathop {}\!\mathrm {d}v,\mathop {}\!\mathrm {d}v_*,\mathop {}\!\mathrm {d}v',\mathop {}\!\mathrm {d}v_*')=p_t(V,w,w')\, dt M(\mathop {}\!\mathrm {d}V) M(\mathop {}\!\mathrm {d}w) M(\mathop {}\!\mathrm {d}w')\), \( r(v,v_*;\mathop {}\!\mathrm {d}v',\mathop {}\!\mathrm {d}v_*') = {\bar{\sigma }}(V;w,w') M(\mathop {}\!\mathrm {d}w')\) where \(V=(v+v_*)/\sqrt{2}\), \(w=(v-v_*)/\sqrt{2}\), and \(w=(v'-v_*')/\sqrt{2}\). Setting \(\Psi (a,b) := a\log (a/b) -(a-b)\), by (2.8)
We observe that for each \({\bar{a}}>0\),
and use this decomposition with \({\bar{a}} = \frac{1}{2} \sqrt{h_t(v)h_t(v_*)h_t(v')h_t(v_*')} \, {\bar{\sigma }}(V;w,w')\). We deduce
where, by using the detailed balance condition \({\bar{\sigma }}(V,w,w')={\bar{\sigma }}(V,w',w)\),
Again by the detained balance condition and the balance equation (2.6),
Finally,
that concludes the proof of (6.4) when \((\pi ,Q)\in {\tilde{S}}\).
In particular, we have shown that for \((\pi ,Q)\in {\tilde{S}}\) it holds
By Theorem 4.6 and the lower-semicontinuity of the relative entropy we then get the previous bound for any path \((\pi ,Q)\in \hat{S}\); hence \({\mathcal H}(\pi _T) <+\infty \) when \( {\mathcal H}(\pi _0)\) and \(J(\pi ,Q)\) are bounded.
Fix \((\pi ,Q)\in \hat{{\mathscr {S}}}\). To prove that (6.4) holds for \((\pi ,Q)\), we will construct a sequence \(\tilde{{\mathscr {S}}}\cap \hat{{\mathscr {S}}}\ni (\pi ^n, Q^n)\rightarrow (\pi , Q)\) such that
Observe in fact that the converse inequalities follows from the lower-semicontinuity of J, \({\mathcal H}\), \({\mathcal D}\), and \({\mathcal R}\).
Let \((\pi ^n, Q^n)\) be the sequence constructed in the proof of Theorem 4.6, so that the first inequality in (6.11) holds. The proof of the others is achieved in two steps.
Step 1 - Convolution. Let \((f^{\delta },q^{\delta })\) be the sequence constructed in Step 1 in the proof of Theorem 4.6, and let \((h_\delta , p_\delta )\) such that \(\pi ^\delta (\mathop {}\!\mathrm {d}v)=f^\delta (v)\mathop {}\!\mathrm {d}v=h^\delta (v)M(\mathop {}\!\mathrm {d}v)\) and \(dQ^\delta = \mathop {}\!\mathrm {d}t q_t^\delta (v,v_*,w')\mathop {}\!\mathrm {d}v \mathop {}\!\mathrm {d}v_* \mathop {}\!\mathrm {d}w'= \mathop {}\!\mathrm {d}t\, p_t^\delta (V,w,w')M(\mathop {}\!\mathrm {d}V)M(\mathop {}\!\mathrm {d}w)M(\mathop {}\!\mathrm {d}w')\).
We observe that proof of the second line in (6.11) is achieved by the same argument in Step 1 of Theorem 4.6. We now show that
We use the representation of \({\mathcal D}\) provided by Lemma 6.2. By (2.2) and item (ii) in Definition 2.2 we deduce
On the other hand, since \(h_t^\delta \rightarrow h_t\) a.e., by Fatou’s lemma
which concludes the proof of (6.12). Observe that the lower semicontinuity of \({\mathcal D}\) actually implies \(\int _0^T \mathop {}\!\mathrm {d}t \, {\mathcal D}(\pi _t^\delta )\rightarrow \int _0^T \mathop {}\!\mathrm {d}t \,{\mathcal D}(\pi _t)\), so that by Lemma 6.2
We conclude the step by showing that
By the representation provided by (6.8) and Lemma 4.7
where \(\Phi =\log \frac{2q(v,v_*,w')}{f(v) f(v_*) g_1(w')}\) , \(\Phi '=\log \frac{2q(v,v_*,w')}{f(v') f(v'_*) g_1(w)}\) ,
We start by observing that \(Q^\delta (1)\rightarrow Q(1)\), then in view of (6.13) it is enough to show that
Observe that
The proof is achieved by the same argument in Step 1 of Theorem 4.6.
Step 2 - Truncation of the flux. As in the Theorem 4.6 we now assume f strictly positive on compacts uniformly in time and \(q_t^{(3)}\in L^1([0,T]; L^2({\mathbb R}^d))\). We denote by \((f^{\ell },q^{\ell })\) the sequence constructed in Step 2 in the proof of Theorem 4.6.
By the argument in Step 2 of Theorem 4.6, \(\lim _{\ell } {\mathcal H}(\pi ^\ell _0) ={\mathcal H}(\pi _0)\), we now show that
Recalling (4.13) we write
Set
where \(1/a_\ell = c_\ell + 1/c_\ell - 1\). Observe that \(\hat{f}^\ell _T\) is a probability density. Since \(0< c_\ell a_\ell < 1\), by using (6.10) and applying the argument leading to (4.15), we deduce that \(\varlimsup _{\ell } {\mathcal H}(\hat{f}^\ell _T )\le {\mathcal H}({f}_T )\) and \(\lim _{\ell } \big | {\mathcal H}({f}^\ell _T ) - {\mathcal H}(\hat{f}^\ell _T ) \big | = 0\). This completes the proof of (6.15).
By the representation of \({\mathcal D}\) provided by Lemma 6.2, using (4.17) and Fatou’s lemma we conclude that
It remains to show that
This is achieved by using the representation (6.8) and the argument in Step 2 of Theorem 4.6.
To prove the second statement of the theorem, observe that if \(I(\pi , Q)=0\) then \(\pi \) has bounded second moment, \(Q=Q^\pi \), and \(H(\pi \vert m)=0\). Therefore \(\pi _0=m\) and, when \(\lambda \) is bounded, \((\pi , Q^\pi )\in \hat{{\mathscr {S}}}\). By (6.4), the inequality (6.5) amounts to \(J(\pi , Q^\pi )\le 0\). \(\square \)
References
Adams, S., Dirr, N., Peletier, M.A., Zimmer, J.: From a large-deviations principle to the Wasserstein gradient flow: a new micro-macro passage. Commun. Math. Phys. 307(3), 791–815 (2011)
Basile G., Benedetto D., Bertini, L.; A gradient flow approach to linear Boltzmann equation, Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) XXI, 955–987 (2020)
Basile, G., Bertini, L.: Donsker-Varadhan asymptotics for degenerate jump Markov processes, ALEA. Lat. Am. J. Probab. Math. Stat. 12(1), 1–34 (2015)
Bertini, L., Faggionato, A., Gabrielli, D.: Large deviations of the empirical flow for continuous time Markov chains. Ann. Inst. H. Poincaré Probab. Statist. 51(3), 867–900 (2015)
Bodineau, T., Gallagher, I., Saint-Raymond, L., Simonella, S.: Fluctuation theory in the Boltzmann-grad limit. J. Stat. Phys. 180, 873–895 (2020)
Bodineau T., Gallagher I., Saint-Raymond L., Simonella S.: Statistical dynamics ofa hard sphere gas: fluctuating Boltzmann equation and large deviations, preprint, arXiv:2008.10403
Bouchet, F.: Is the Boltzmann Equation Reversible? J. Stat. Phys, A Large Deviation Perspective on the Irreversibility Paradox (2020)
Carlen, E.A., Carvalho, M.C., Le-Roux, J., Loss, M., Villani, C.: Entropy and chaos in the Kac model Kinet. Relat. Models 3(1), 85–122 (2010)
Donsker M.D., Varadhan S.R.S.; Asymptotic evaluation of certain Markov process expectations for large time. I. II., Commun. Pure Appl. Math. 28, 1–47, ibid. 28, 279–301 (1975)
Erbar M.: A gradient flow approach to the Boltzmann equationt, preprint arXiv:1603.00540v2 (2017)
Feller, W.: An Introduction to Probability Theory and Its Applications, Vol. II. Wiley, New York (1971)
Gabrielli, D., Renger, D.R.M.: Dynamical Phase Transitions for Flows on Finite Graphs, preprint arXiv:2005.03262, (2020)
Kac M.: Foundations of Kinetic Theory. In: Proc. 3rd Berkeley Symp. Math. Stat. Prob., J. Neyman, ed. Univ. of California, vol. 3, 171–197 (1956)
Kipnis C., Landim C.: Scaling Limits of Interacting Particle Systems, volume 320 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin (1999)
Léonard, C.: On large deviations for particle systems associated with spatially homogeneous Boltzmann type equations. Probab. Theor. Relat. Fields 101, 1–44 (1995)
Mariani, M.: A \(\Gamma \)-convergence approach to large deviations. Ann. Sc. Norm. Super. Pisa Cl. Sci. 18, 951–976 (2018)
Mielke, A., Peletier, M.A., Renger, D.R.M.: On the relation between gradient flows and the large-deviation principle, with applications to Markov chains and diffusion. Potential Anal. 41(4), 1293–1327 (2014)
Rezakhanlou, F.: Large deviations from a kinetic limit. Ann. Probab. 26(3), 1259–1340 (1998)
Sznitman, A.S.: Topics in propagation of chaos. In: Hennequin PL (Eds.), Ecole d’Eté de Probabilités de Saint-Flour XIX - 1989. Lect. Notes Math., Vol. 1464, Springer, Berlin (1991)
Tanaka, H.: Fluctuation theory for Kac’s one-dimensional model of Maxwellian molecules. Indian J. Statist. Ser. A 44(1), 23–46 (1982)
Uchiyama, K.: A fluctuation problem associated with the Boltzmann equation for a gas of molecules with a cutoff potential. Japan. J. Math. 9, 27–53 (1983)
Uchiyama, K.: Fluctuations in a Markovian system of pairwise interacting particles. Probab. Theory Relat. Fields 79, 289–302 (1988)
Acknowledgements
We are grateful to R. Di Leonardo for suggesting us the example of molecular gases. We also thank M. Erbar for useful discussions.
Funding
Open access funding provided by University degli Studi di Roma La Sapienza within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Isabelle Gallagher.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Basile, G., Benedetto, D., Bertini, L. et al. Large Deviations for Kac-Like Walks. J Stat Phys 184, 10 (2021). https://doi.org/10.1007/s10955-021-02794-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10955-021-02794-2