Abstract
We show that the effective mass of the Fröhlich Polaron is bounded below by \(c \alpha ^{2/5}\) for some constant \(c>0\) and for all coupling constants \(\alpha \). The proof uses the point process representation of the path measure of the Fröhlich Polaron.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction and Results
The Polaron models the slow movement of an electron in a polar crystal. The electron drags a cloud of polarization along and thus appears heavier, it has an “effective mass” larger then its mass without the interaction. In the Fröhlich model of the Polaron, the Hamiltonian describing the interaction of the electron with the lattice vibrations is the operator given by
that acts on \(L^2(\mathbb {R}^3) \otimes \mathcal F\) (where \(\mathcal F\) is the bosonic Fock space over \(L^2(\mathbb {R}^3)\)). Here x and p are the position and momentum operators of the electron, \(\textbf{N}\equiv \int _{{\mathbb {R}}^3} a^*_k a_k\, \mathrm d k\) is the number operator, the creation and annihilation operators \(a_k^*\) and \(a_k\) satisfy the canonical commutation relations \([a^*_k, a_{k'}] = \delta (k-k')\) and \(\alpha >0\) is the coupling constant. The Hamiltonian commutes with the total momentum operator \(p\otimes \mathbbm {1} + \mathbbm {1} \otimes P_f\), where \(P_f \equiv \int _{\mathbb {R}^3} k a_k^* a_k \mathrm dk\) is the momentum operator of the field, and has a fiber decomposition in terms of the Hamiltonians
at fixed total momentum P (which act on \(\mathcal F\)). Of particular interest is the energy-momentum relation \(E_\alpha (P) :=\inf {\text {spec}}(H_\alpha (P))\). It is known that \(E_\alpha \) is rotationally symmetric and has a global minimum at \(P=0\), which is believed to be unique [DS20]. The effective mass \(m_{\text {eff}}(\alpha )\) is defined as the inverse curvature at \(P=0\) i.e.
While the asympotics of the ground state energy \(E_\alpha (0)\) in the strong coupling limit \(\alpha \rightarrow \infty \) are known, the asymptotics of the effective mass \(m_\text {eff}(\alpha )\) still remain an important open problem. For the ground state energy, Donsker and Varadhan [DV83] gave a probabilistic proof (using the path integral formulation) of Pekars conjecture [Pek49] that states
There also exists a functional analytic proof by Lieb and Thomas [LT97], that additionally contains explicit error bounds. Formal computations by Landau and Pekar [LP48] suggest that
and Feynman [Fey55] obtains the same \(\alpha ^4\)-asymptotics (but not the prefactor) by completely different methods. Rigorous results on the effective mass have been obtained only much later and they are significantly weaker: Lieb and Seiringer [LS20] show that \( \lim _{\alpha \rightarrow \infty } m_\text {eff}(\alpha ) = \infty \) by estimating
with suitably chosen trial states \(\phi _{\alpha , P}\). For models with stronger regularity assumptions (excluding the Fröhlich Polaron), quantitative estimates on the effective mass were recently obtained in [MS21]. Our goal is to show that there exists some constant \(c>0\) such that for all \(\alpha >0\)
and thereby giving a first quantitative growth bound for the effective mass of the Fröhlich Polaron.
We give a short introduction into the probabilistic representation of the effective mass. We refer the reader to [DS20] for details in this representation, and to [Moe06] for a review covering functional analytic properties of the Polaron. An application of the Feynman-Kac formula to the semigroup \((e^{-TH_\alpha })_{T \;\geqslant \;0}\) leads to the path measure
in finite volume \(T>0\), where \(\mathcal W\) is the distribution of three dimensional Brownian motion and the partition function \(Z_{\alpha , T}\) is a normalization constant. In particular, the partition function can be expressed as the matrix element \(Z_{\alpha , T} = \langle \Omega , e^{-T H_\alpha (0)} \Omega \rangle \), where \(\Omega \) is the Fock vacuum. This leads to \(E_\alpha (0) = - \lim _{T\rightarrow \infty } \log (Z_{\alpha , T})/T\), which was applied (up to boundary conditions) in the aforementioned proof of Pekars conjecture by Donsker and Varadhan. Spohn conjectured [Spo87] that the path measure (in infinite volume) converges under diffusive rescaling to Brownian motion with some diffusion constant \(\sigma ^2(\alpha )\), and showed that the effective mass then has the representation
The existence of the infinite volume measure and the validity of the central limit theorem were shown by Mukherjee and Varadhan [MV19] for a restricted range of coupling parameters. The proof relies on a representation of the measures (1.4) as a mixture of Gaussian measures, the mixing measure being the distribution of a point process on \(\{(s, t)\in \mathbb {R}^2: s<t\} \times (0, \infty )\) which has a renewal structure. This approach was extended in [BP21] to a broader class of path measures and a functional central limit theorem, and, for the Fröhlich Polaron, to all coupling parameters (relying on known spectral properties of \(H_\alpha (0)\)). We also refer the reader to [MV21, BMPV22], where proofs are given that do not need the spectral properties of \(H_\alpha (0)\). We use this point process representation and derive a “variational like” formula for the effective mass (see Proposition 3.2). We obtain the estimate (1.3) by minimizing over sub-processes of the full point process.
Without additional effort, we can allow for a bit more generality and look at the probability measures \(\mathbb P_{\alpha , T}\) on \(C([0, \infty ), \mathbb {R}^d)\) defined by
where \(d\;\geqslant \;2\), \(v(x) = 1/|x|^\gamma \) with \(\gamma \in [1, 2)\) and \(g:[0, \infty ) \rightarrow (0, \infty )\) is a probability density with finite first moment satisfying \(\sup _{t\;\geqslant \;0} (1+t)g(t) <\infty \) and \(\textbf{x}_{s, t} :=\textbf{x}_t - \textbf{x}_s\) for \(\textbf{x}\in C([0, \infty ), \mathbb {R}^d)\) and \(s, t\;\geqslant \;0\). By the results in [BMPV22], these conditions are sufficient for the existence of an infinite volume measure and the validity of a functional central limit theorem in infinite volume. By the proof of the central limit theorem given in [MV19]Footnote 1 the respective diffusion constant \(\sigma ^2(\alpha )\) can also be obtained by taking the “diagonal limit”, that is
Here \(X_t(\textbf{x}) :=\textbf{x}_t\) for any \(\textbf{x}\in C([0, \infty ), \mathbb {R}^d)\) and \(t\;\geqslant \;0\), that is we denote by X the random variable that is given by the identity on \(C([0, \infty ) , {\mathbb {R}}^d)\).
Theorem 1.1
There exists a constant \(C>0\) (depending on \(\gamma \), g and d) such that
for all \(\alpha >0\). Consequently, there exists a constant \(c>0\) such that the effective mass of the Fröhlich Polaron satisfies \(m_{{\text {eff}}}(\alpha ) \;\geqslant \;c \alpha ^{2/5}\) for all \(\alpha >0\).
As we will point out in Remark 2.1, the results obtained in [BMPV22] imply for the Fröhlich Polaron with minor additional effort the existence of some \(\tilde{c}>0\) such that \(\sigma ^2(\alpha ) \;\geqslant \;e^{-\tilde{c}\alpha ^2}\) for reasonably large \(\alpha \).
2 Point Process Representation
By using the identity \( \frac{1}{|x|} = \sqrt{\frac{2}{\pi }} \int _0^\infty \mathrm du \, e^{-u^2 |x|^2/2} \) and expanding the exponential into a series, Mukherjee and Varadhan [MV19] represented the path measure of the Fröhlich Polaron as a mixture of Gaussian measures \(\textbf{P}_{\xi , u}\), the mixing measure \(\widehat{\Theta }_{\alpha , T}\) being the distribution of a suitable point process on \(\{(s, t)\in \mathbb {R}^2:\, s<t\} \times [0, \infty )\). In the following, we give an introduction to the point process representation in the form given in [BP21]. We will additionally use the invariance of (1.5) under replacing v by \(v_\varepsilon (x) :=v(x) + \varepsilon \) for some \(\varepsilon \;\geqslant \;0\). While this transformation does not change the measure \(\mathbb P_{\alpha , T}\), we will obtain different point process representations depending on the choice of \(\varepsilon \).
Let \(\Gamma _{\alpha , T}\) be the distribution of a Poisson point process on \(\triangle :=\{(s, t)\in \mathbb {R}^2:\, s<t\}\) with intensity measure
By expanding the exponential into a series and interchanging the order of summation and integration, we obtain for any \(A\in \mathcal B(C([0, \infty ), \mathbb {R}^d))\) and any \(\varepsilon \;\geqslant \;0\)
where \(\textbf{N}_f(\triangle )\) is the set of finite integer valued measures on \(\triangle \). We will view \(\xi = \sum _{i=1}^n \delta _{(s_i, t_i)}\in \textbf{N}_f(\triangle )\) as a collection of intervals \(\{[s_i, t_i]: 1\;\leqslant \;i \;\leqslant \;n\}\).
The measure \(\Gamma _{\alpha , T}\) can be interpreted in the following way: Consider a \(M/G/\infty \)-queue started empty at time 0 where the arrival process \(\sum _{n=0}^\infty \delta _{s_n}\) is a homogeneous Poisson process with rate \(\alpha \), and the service times \((\tau _n)_n\) are iid with density g and are independent of the arrival process. Consider the process \(\eta = \sum _{n=1}^\infty \delta _{(s_n, t_n)}\) where \(t_n :=s_n + \tau _n\) is for \(n\in {\mathbb N}\) the departure of customer n. Then the process off all customers in \(\eta \) that arrive and depart before T has distribution \(\Gamma _{\alpha , T}\).
For \(\xi \in \textbf{N}_f(\triangle )\) we define
and the perturbed point process \(\widehat{\Gamma }_{\alpha , T}^\varepsilon \) by
Then Eq. (2.1) yields the representation \(\mathbb P_{\alpha , T}(\cdot ) = \int \widehat{\Gamma }_{\alpha , T}^\varepsilon (\mathrm d \xi ) \textbf{P}^\varepsilon _\xi (\cdot )\) of \(\mathbb P_{\alpha , T}\) as a mixture of the perturbed path measures
The function \(F_\varepsilon \) depends in an intricate manner on the configuration \(\xi \), depending on number, length and relative position of the intervals. Nevertheless, the perturbed measure \(\widehat{\Gamma }^\varepsilon _{\alpha , T}\) can still be interpreted as a queuing process by identifying \((s,t) \in {\text {supp}}(\xi )\) with a service interval [s, t] as indicated above. Under the given assumptions on g and v, \(\widehat{\Gamma }^\varepsilon _{\alpha , T}\) can then be expressed in terms of an iid sequence of “clusters” of overlapping intervals, separated by exponentially distributed dormant periods not covered by any interval. The processes of increments can be drawn independently along these dormant periods and clusters (according to the kernel \((\xi , A) \mapsto \textbf{P}^\varepsilon _\xi (A)\)). This yields the existence of the infinite volume limit as well as the functional central limit theorem [BP21, BMPV22].
In order to make the connection to the representation used in [MV19], we notice that for all \(x\in \mathbb {R}^d \setminus \{0\}\)
where
It will turn out to be useful to make the atoms of out point process distinguishable. We will abuse notation and identify probability measures on \(\textbf{N}_f(\triangle )\) with symmetric probability measures on \(\triangle ^\cup :=\bigcup _{n=0}^\infty \triangle ^n\). With Eq. (2.1), we obtain
where \(N :=\sum _{n=0}^\infty n\cdot \mathbbm {1}_{\triangle ^n}\). We define for \(\xi = ((s_i, t_i))_{1\;\leqslant \;i \;\leqslant \;N(\xi )} \in \triangle ^\cup \) and \(u \in [0, \infty )^{N(\xi )}\) the Gaussian measures
where \(\phi (\xi , u) :=\mathbb E_\mathcal W\big [e^{-\sum _{i=1}^{N(\xi )} u_i^2 |X_{s_i, t_i}|^2/2}\big ]\) and, for \(\xi \in \triangle ^\cup \) withFootnote 2\(F_\varepsilon (\xi ) < \infty \) the measures
and obtain
We will view \((s, t, u) \in \triangle \times [0, \infty )\) as an interval [s, t] equipped with the mark u. In contrast to the representation in [MV19], we draw a sample of the mixing measure \(\widehat{\Theta }_{\alpha , T}\) (which is the distribution of a point process on \(\triangle \times [0, \infty )\)) in two steps: First, draw a sample \(\xi \) according to \(\widehat{\Gamma }_{\alpha , T}^0\) and then draw marks u according to the kernel \(\kappa _0(\xi , \cdot )\). This added flexibility will be useful later. If we define
we get with the previous considerations
Taking the limit \(T\rightarrow \infty \) in (2.4) then yields the diffusion constant i.e. the inverse of the effective mass.
One can also express the perturbed measure \(\widehat{\Gamma }_{\alpha , T}^\varepsilon \) in terms of the path measures \(\mathbb P_{\alpha , T}\) (a similar calculation was already made in [MV21, (4.18)]). The Laplace transform of \(\widehat{\Gamma }^\varepsilon _{\alpha , T}\) evaluated at the measurable function \(f: \triangle \rightarrow [0, \infty )\) is
That is, \(\widehat{\Gamma }_{\alpha , T}^\varepsilon \) is the distribution of a Cox processFootnote 3 with driving measure \(\mu _{\alpha , T}(\mathrm ds \mathrm dt) v_\varepsilon (X_{s, t})\) with \((X_t)_{t} \sim \mathbb P_{\alpha , T}\).
The introduction of \(v_\varepsilon \) with \(\varepsilon > 0\), which in the context of (1.5) is a simple shift of energy, thus creates a component of \(\widehat{\Gamma }_{\alpha , T}^\varepsilon \) that is quite easy to understand. Indeed, \(\widehat{\Gamma }_{\alpha , T}^\varepsilon \) is the distribution of the sum \(\eta _{\alpha , T} + \eta _{\alpha , T}^\varepsilon \) where \(\eta _{\alpha , T} \sim \widehat{\Gamma }_{\alpha , T}^0\) and where the Poisson point process \(\eta _{\alpha , T}^\varepsilon \sim \Gamma _{\varepsilon \alpha , T}\) is independent of \(\eta _{\alpha , T}\). \(\eta _{\alpha , T}^\varepsilon \) leads to additional intervals, but on the other hand, for given \(\xi \), under the measure \(\kappa _\varepsilon (\xi , \cdot )\) now u has components that are equal to zero with positive probability. Intervals of \(\xi \) corresponding to these components will be invisible to the measure \(\textbf{P}_{\xi ,u}\) as can be seen directly from its definition (2.2). These two effects balance since \(\mathbb P_{\alpha ,T}\) is independent of \(\varepsilon \). However, given a realization \(\xi \) of \(\eta _{\alpha , T} + \eta _{\alpha , T}^\varepsilon \) it is not the case that \(\kappa _\varepsilon (\xi , \cdot )\) just produces zero components of u precisely for those intervals coming from \(\eta _{\alpha , T}^\varepsilon \); on the contrary, \(\kappa _\varepsilon (\xi ,\cdot )\) does not depend on whether any given interval in \(\xi \) was produced by \(\eta _{\alpha , T}\) or by \(\eta _{\alpha , T}^\varepsilon \). The proof of Theorem 1.1 now uses the following strategy:
-
(1)
We show in Lemma 3.1 that \(u \mapsto \sigma ^2_T(\xi ,u)\) is decreasing along each coordinate axis. This in particular implies that \(\sigma ^2_T(\xi ,u)\) increases if intervals are deleted from \(\xi \), which corresponds to setting the suitable components of u to zero. While not necessaryFootnote 4 for the proof of Theorem 1.1, we also give a characterization of \(\sigma ^2_T(\xi ,u)\) in terms of a \(L^2\)-distance which emphasizes the variational type flavor of our approach and might be useful for further refining the method.
-
(2)
The dependence of the distribution \(\kappa _\varepsilon (\xi , \cdot )\) on the whole configuration \(\xi \) prevents us from simply deleting intervals from \(\xi \) in order to obtain an estimate on the variance in (2.4). To get around this, we fix some \(C(\alpha )\) (to be determined later), and show that \(\kappa _\varepsilon (\xi ,\cdot )\) stochastically dominates a kernel that assigns the marks 0 and \(C(\alpha )\) independently to each interval in \(\xi \) with suitable probabilities. This is done in Sect. 4.
-
(3)
The two previous items mean that we obtain an upper bound on the value of (2.4) when we first replace \(\kappa _\varepsilon \) by our new kernel and then delete intervals from \(\xi \). On the other hand, for interval configurations \(\xi \) that consist only of non-overlapping intervals, \(\sigma ^2_T(\xi ,u)\) can be computed explicitly, see Lemma 3.3. By deleting a sufficient number of intervals (including all those coming from \(\eta _{\alpha ,T}\)), and by optimizing over the parameter C, we obtain our result.
Remark 2.1
For the Fröhlich Polaron, i.e. for \(d=3\), \(\gamma =1\) and \(g(t) = e^{-t}\) for all \(t\;\geqslant \;0\), the point process representation implies the existence of some constant \(\tilde{c}>0\) such that \(\sigma ^2(\alpha ) \;\geqslant \;e^{-\tilde{c} \alpha ^2}\) for all reasonably large \(\alpha >0\). Given \(\xi \in \textbf{N}_f(\triangle )\) and \(r\in \mathbb {R}\) let \(N_r(\xi ) :=\xi (\{(s,t): s\;\leqslant \;r \;\leqslant \;t\})\) be the number of intervals that contain r. We call t dormant under \(\xi \), if \(N_t(\xi ) = 0\), else we call t active under \(\xi \). Under \(\xi \), the interval [0, T] can be decomposed into alternating dormant and active periods. Let
(where \(\lambda \) is the Lebesgue measure) be the sum of lengths of all dormant periods in [0, T]. One can easily convince oneself that \( \sigma ^2_T(\xi , u) \;\geqslant \;\frac{1}{T}\mathcal D_T(\xi ) \) for all \(u\in [0, \infty )^n\) (this follows from Proposition 3.2 later in the text as well). By applying Jensens inequality to the measure \( \frac{1}{T} \mathrm dt \, \mathbb P_{\alpha , T}(\mathrm d \textbf{x})\) and using Brownian scaling in order to interpret the resulting expression in terms of the derivative of \(\psi (\alpha ) :=\lim _{T\rightarrow \infty } \log (Z_{\alpha , T})/T = -E_\alpha (0)\), it was shown in [BMPV22] that
By Fubini’s Theorem
and thus \(\sigma ^2(\alpha ) \;\geqslant \;e^{-\tfrac{3}{2} \alpha \psi '(\alpha )}\) by (2.4). By convexity of \(\psi \) and since \(\lim _{\alpha \rightarrow \infty } \psi (\alpha )/\alpha ^2 \in (0, \infty )\) we have
and hence there exists some \(\tilde{c}>0\), \(\alpha _0 >0\) such that \(\sigma ^2(\alpha ) \;\geqslant \;e^{-\tilde{c}\alpha ^2}\) for all \(\alpha \;\geqslant \;\alpha _0\).
3 Properties of \(\sigma _T^2(\xi , u)\)
We derive a few properties of \(\sigma _T^2(\xi , u)\). First, we will show that \(\sigma _T^2(\xi , \cdot )\) is decreasing with respect to the partial order on \([0, \infty )^{N(\xi )}\) defined by \(u\;\leqslant \;\tilde{u}\) iff \(u_i\;\leqslant \;\tilde{u}_i\) for all \(1\;\leqslant \;i \;\leqslant \;N(\xi )\). We will then express \(\sigma _T^2(\xi , u)\) in terms of a \(L^2\)-distance and calculate \(\sigma _T^2(\xi , u)\) in case that \(\xi \) consists of pairwise disjoint intervals.
Lemma 3.1
For fixed \(\xi \in \triangle ^\cup \), the function \(\sigma ^2_T(\xi , \cdot )\) is decreasing on \([0, \infty )^{N(\xi )}\).
Proof
Let \( \xi = ((s_i, t_i))_{1\;\leqslant \;i \;\leqslant \;N(\xi )} \in \triangle ^\cup \). Then
By Isserlis’ theorem (and as the coordinate processes are iid under the centered Gaussian measure \(\textbf{P}_{\xi , u}\))
\(\square \)
In particular, \(\sigma ^2_{T}(\xi , u)\) is increasing under deletion of a marked interval \((s_i, t_i, u_i)\) (set \(u_i = 0\)).
As already used in [MV19], the normalization constant \(\phi (\xi , u)\) can also be expressed in terms of a determinant of the covariance matrix of a suitable Gaussian vector. To see this, let \((B_t)_{t\;\geqslant \;0}\) be an one dimensional Brownian motion and \((Z_n)_n\) be an iid sequence of \(\mathcal N(0, 1)\) distributed random variables, independent of \((B_t)_{t\;\geqslant \;0}\). Then, by Lemma 6.1 of the appendix, we have
where \(C(\xi , u)\) is the covariance matrix of the Gaussian vector
Proposition 3.2
The diffusion constant has the representation
where for \(\xi = ((s_i, t_i))_{1\;\leqslant \;i \;\leqslant \;N(\xi )} \in \triangle ^\cup \) and \(u\in [0, \infty )^{N(\xi )}\)
Proof
It is left to show Equality (3.1). We have
For \(v, w \in \mathbb {R}\) let \(C_T(\xi , u, v, w) \in \mathbb {R}^{({N(\xi )}+1)\times ({N(\xi )}+1)}\) be the covariance matrix of the Gaussian vector . Then, with Lemma 6.1, Eq. (3.2) becomes
since \(\det (C_T(\xi , u, 0, 1)) = \det (C(\xi , u))\). By using that the determinant is multilinear in the rows and columns, we have
and hence
Finally, by Lemma 6.2
which concludes the proof.
Lemma 3.3
Let \(\xi = ((s_i, t_i))_{1\;\leqslant \;i \;\leqslant \;N(\xi )} \in \triangle ^\cup \) with \([s_i, t_i] \subseteq [0, T]\) for all \(1\;\leqslant \;i \;\leqslant \;N(\xi )\) and \([s_i, t_i] \cap [s_j, t_j] = \emptyset \) for \(i\ne j\). Then, for \(u \in [0, \infty )^{N(\xi )}\)
where \(\tau _i :=t_i - s_i\) for \(1\;\leqslant \;i \;\leqslant \;{N(\xi )}\).
Proof
We project \(B_{0, T}\) onto \({\text {span}} \{u_iB_{s_i, t_i} + Z_i:\, 1 \;\leqslant \;i \;\leqslant \;{N(\xi )}\}\) and obtain
Hence
\(\square \)
In the situation above, \(1 - \frac{1}{T}\sum _{i=1}^{N(\xi )} \tau _i = \frac{1}{T} \lambda \big ([0, T]\setminus \bigcup _{i=1}^{N(\xi )}[s_i, t_i]\big )\), where \(\lambda \) denotes the Lebesgue measure. Assume that \(\tau _i \;\geqslant \;1\) and \(u_i \;\geqslant \;C\) for all \(1\;\leqslant \;i \;\leqslant \;{N(\xi )}\) where \(C>0\). Then
For a general configuration \((\xi , u)\), the previous considerations allow us to obtain estimates on \(\sigma _{T}^2(\xi , u)\) by considering subconfigurations \((\xi ', u')\) of pairwise disjoint marked intervals with intervals lengths exceeding 1 and marks exceeding C. In combination with an application of renewal theory, this implies the following technical Lemma:
Lemma 3.4
Let \(f:[0, \infty ) \rightarrow [0, \infty )\) be measurable with \(\int _0^\infty (1+t)f(t) \, \mathrm dt <\infty \) and \(f(t) = 0\) for almost all \(t\in [0, 1]\). For \(T>0\) let \(\Xi _T\) be the law of a Poisson point process with intensity measure \( f(t-s) \mathbbm {1}_{\{0< s<t<T\}} \mathrm ds \mathrm dt \) and let \(C>0\). Then
Proof
In case that \(f=0\) a.e. the statement is trivial, so assume \(\int _0^\infty f(t) \, \mathrm dt >0\). Let
for \(t\;\geqslant \;0\). We consider the Poisson point process \(\eta = \sum _{n=1}^\infty \delta _{(s_n, t_n)}\) of a \(M/G/\infty \)-queue where the arrival process \(\sum _{n=1}^\infty \delta _{s_n}\) is a homogeous Poisson point process with intensity \(\beta \), the service times \((\tau _n)_n\) are iid, have density \(\rho \) and are independent of the arrival process and \(t_n :=s_n + \tau _n\) is for \(n\in {\mathbb N}\) the departure of customer n. Then the restriction of \(\eta \) to the process of all customers that arrive and depart before T has distribution \(\Xi _T\). We inductively define \(i_0 :=0\), \(t_0 :=0\) and
for \(n\;\geqslant \;0\), i.e. customer \(i_{n+1}\) is for \(n\;\geqslant \;1\) the first customer that arrives after the departure of customer \(i_n\). The waiting times \((s_{i_{n}} - t_{i_{n-1}})_{n\;\geqslant \;1}\) are iid \({\text {Exp}}(\beta )\) distributed and are independent of the iid service times \((t_{i_n} - s_{i_n})_{n\;\geqslant \;1}\) which have density \(\rho \). Notice that \((t_{i_n})_{n\;\geqslant \;0}\) defines a renewal process. Let \(N_T :=\inf \{n: t_{i_n} \;\geqslant \;T\}\). By the considerations after Lemma 3.3, we have
where \(t_0 :=0\) and \(B_T :=T - t_{i_{N_T-1}}\) is the backward recurrence time of the renewal process \((t_{i_n})_{n}\) at time T. By renewal theory, \((B_T)_{T\;\geqslant \;0}\) converges in distribution as \(T\rightarrow \infty \). Let \((T_k)_k\) be a sequence in \([0, \infty )\) that converges to infinity. Then \(B_{T_k}/T_k \rightarrow 0\) in probability as \(k\rightarrow \infty \). Hence, there exists a subsequence \((T_{k_j})_j\) such that \(B_{T_{k_j}}/T_{k_j} \rightarrow 0\) almost surely as \(j\rightarrow \infty \). By dominated convergence, \(\mathbb E[B_{T_{k_j}}/T_{k_j}] \rightarrow 0\) as \(j\rightarrow \infty \). As \((T_k)_k\) was arbitrary, we have \(\mathbb E[B_{T}/T] \rightarrow 0\) as \(T\rightarrow \infty \). By renewal theory
almost surely as \(T\rightarrow \infty \) and by the law of large numbers we thus have
almost surely as \(T\rightarrow \infty \). By the dominated convergence theorem, we have convergence in \(L^1\) as well and the statement follows.
4 Estimation of the Kernel \(\kappa _\varepsilon \)
While the component \(\eta _{\alpha , T}^\varepsilon \sim \Gamma _{\varepsilon \alpha , T}\) is far easier to understand than the whole process, the distribution of marks of \(\eta _{\alpha , T}^\varepsilon \) under \(\kappa _\varepsilon \) still depends on the whole process. We get rid of this problem by estimating \(\kappa _\varepsilon \) by a suitable product kernel that marks the intervals independent of each other. For \(t, C>0\) we define
and
Notice that there exists some constant \(c_{\gamma , d}\) such that \(h_\varepsilon (t) = c_{\gamma , d} t^{-\gamma /2} + \varepsilon \) for all \(t>0\). In particular, \(\varepsilon /h_\varepsilon (2)<1\) and
defines for \(t>0\) a probability measure. Fix \(\xi = ((s_i, t_i))_{1\;\leqslant \;i\;\leqslant \;N(\xi )} \in \triangle ^\cup \) with \(F_\varepsilon (\xi )< \infty \). We will show that \(\kappa _\varepsilon (\xi , \cdot )\) stochastically dominates
that is
for any decreasing function \(f: [0, \infty )^{N(\xi )} \rightarrow \mathbb {R}\). By Proposition 3.2 and the monotonicity of \(\sigma _T^2(\xi , \cdot )\), we then obtain
Since \(\sigma _{T}^2(\xi , u)\) is increasing under deletion of marked intervals we can further estimate
Notice that intervals with zero mark do not contribute to \(\sigma ^2_T(\xi , u)\). Estimate (4.1) in combination with Lemma 3.4 will yield Theorem 1.1.
Lemma 4.1
Let \(C>0\) and \(\xi = ((s_i, t_i))_{1 \;\leqslant \;i \;\leqslant \;N(\xi )} \in \triangle ^\cup \) with \(F_\varepsilon (\xi )<\infty \). Then \(\kappa _\varepsilon (\xi ,\, \cdot \,)\) stochastically dominates \(\tilde{\kappa }_{\varepsilon , C}(\xi , \cdot )\).
Proof
Let \( J(\xi ) :=\{1\;\leqslant \;i \;\leqslant \;N(\xi ):\, t_i - s_i \;\leqslant \;2\} \). The statement is trivial for \(J(\xi ) = \emptyset \), so assume \(J(\xi ) \ne \emptyset \). We define
for \(i \in J(\xi )\). Let \(\pi : [0, \infty )^{N(\xi )} \rightarrow [0, \infty )^{J(\xi )}\) be the restriction map \(u \mapsto (u_i)_{i \in J(\xi )}\) and let \(\phi : [0, \infty )^{N(\xi )} \rightarrow [0, \infty )^{N(\xi )}\) be defined by
for \(u \in [0, \infty )^{N(\xi )}\). Let \(f: [0, \infty )^{N(\xi )} \rightarrow \mathbb {R}\) be decreasing and let \(\tilde{f}:[0, \infty )^{J(\xi )} \rightarrow \mathbb {R}\) be such that \(\tilde{f} \circ \pi = f \circ \phi \). Then
Hence, it is sufficient to show that the distribution of \(\chi \) under \(\mu :=\kappa _\varepsilon (\xi , \cdot )\) stochastically dominates the distribution of \(\chi \) under \(\tilde{\mu }:=\tilde{\kappa }_{\varepsilon , C}(\xi , \cdot )\). In order to show this, it is sufficient (see e.g. [FV17, pp. 137–138]) to show that for any \(\omega \in \{0, C\}^{J(\xi )}\) and any \(i \in J(\xi )\)
To simplify notation, we assume w.l.o.g. that \(i=1 \in J(\xi )\) and set \(\tau _1 :=t_1-s_1\). Let \(\omega \in \{0, C\}^{J(\xi )}\) be fixed and let \(A \subseteq [0, \infty )^{{N(\xi )}-1}\) be such that \( \{\chi _j = \omega _j \, \forall j \ne 1\} = [0, \infty ) \times A. \) For \(u_1\in [0, \infty )\) and we have by Corollary 6.3 of the appendix and since \((C+u_1)^2/2 \;\leqslant \;C^2 + u_1^2\)
since \(\tau _1 \;\leqslant \;2\) as \(1\in J(\xi )\). We get (by using the fact that the density of \(\nu _\varepsilon |_{(0, \infty )}\) with respect to the Lebesgue measure is increasing)
where . By Corollary 6.3, for any \(u_1\in [0, \infty ), u\in [0, \infty )^{{N(\xi )}-1}\)
which yields
Hence, we get
which yields the claim. \(\square \)
5 Finishing the Proof of Theorem 1.1
Proof of Theorem 1.1
For \(C>0\) let \(\zeta _{\alpha , T}^{\varepsilon , C}\) be a Poisson point process on \(\triangle \times \{0, C\}\) with intensity measure
Then Estimate (4.1) becomesFootnote 5
where \(\tilde{\zeta }_{\alpha , T}^{\varepsilon , C}\) is the restriction of \(\zeta _{\alpha , T}^{\varepsilon , C}\) to \(\{(s, t): 1 \;\leqslant \;t-s \;\leqslant \;2\} \times \{C\}\) and thus a Poisson point process with intensity measure
By Lemma 3.4 we have
Choosing \(C = \alpha ^{1/(2+d)}\) yields the claim.
Remark 5.1
Replacing the arbitrarily chosen cut-off parameters 1 and 2 for the lifespan by general \(A<B\) and optimizing over \(A, B, C, \varepsilon \) leads to
which leads to the same exponent \(-2/(2+d)\) for the decay in \(\alpha \). The non-dependency of this exponent on \(\gamma \) is presumably the result from the estimate \((C+ r)^{\gamma -1} \;\geqslant \;r^{\gamma - 1}\) that has been used in the proof of Lemma 4.1.
6 Appendix: Facts About Gaussian Measures
In the following, we collect a few facts about Gaussian measures (which were in a similar form already applied in [MV19]) in order to be self contained.
Lemma 6.1
Let be a centered Gaussian vector with covariance matrix C. Then
Proof
Write \(C = O \Lambda O^T\) with \(O \in \mathbb {R}^{n\times n}\) orthogonal and \(\Lambda \in \mathbb {R}^{n\times n}\) diagonal with non-negative entries. Then \(C = A A^T\) with \(A :=O \Lambda ^{1/2}\). Let . Then \(AZ \overset{d}{=}\ X\) and thus
Since \(\prod _{i=1}^n (1+ \lambda _i) = \det (I + \Lambda )= \det (I + C)\) the statement follows. \(\square \)
Lemma 6.2
Let be a centered Gaussian vector with covariance matrix C. Then
Proof
Write \(X_n = Y_n + Z_n\) with and . Define \(Y_i :=X_i\) for \(1\;\leqslant \;i \;\leqslant \;n-1\). Let \(C'\) and \(C''\) be the covariance matrices of and respectively. Then, by the Leibniz formula for the determinant
Since we have \(\det (C'') = 0\) and the statement follows inductively.
Corollary 6.3
For any centered Gaussian vector we have
Proof
Let be independent of X. Then the covariance matrix of \(Z+X\) is \(I+C\), where C is the covariance matrix of X. By the previous two lemmas,
The statement follows from
\(\square \)
Notes
It is generalizable to the measures above, see Remark 4.14 in [BP21].
By finiteness of the partition function, we have \(F_\varepsilon (\xi ) < \infty \) for \(\Gamma _{\alpha , T}\) almost all \(\xi \).
I.e. \(\widehat{\Gamma }^\varepsilon _{\alpha , T}\) is the distribution of a point process that is conditionally on \((X_t)_t \sim \mathbb P_{\alpha , T}\) a Poisson point process with intensity measure \(\mu _{\alpha , T}(\mathrm ds \mathrm dt) v_\varepsilon (X_{s, t})\).
Lemma 3.3 later in the text can also be shown directly by using independence of Brownian increments.
Here .
References
Betz, V., Polzer, S.: A functional central limit theorem for polaron path measures. arXiv:2106.06447 (2021)
Betz, V., Mukherjee, C., Polzer, S., Varadhan, S.R.S.: Polaron path measures with attractive interaction. In preparation
Dybalski, W., Spohn, H.: Effective mass of the polaron-revisited. Annales Henri Poincaré 21(5), 1573–1594 (2020)
Donsker, M.D., Varadhan, S.R.S.: Asymptotics for the polaron. Commun. Pure Appl. Math. 36(4), 505–528 (1983)
Feynman, R.P.: Slow electrons in a polar crystal. Phys. Rev. 97(3), 660–665 (1955)
Friedli, S., Velenik, Y.: Statistical Mechanics of Lattice Systems: A Concrete Mathematical Introduction. Cambridge University Press, Cambridge (2017)
Landau, L.D., Pekar, S.I.: Effective mass of a polaron. Zh. Eksp. Teor. Fiz. 18, 419–423 (1948)
Lieb, E.H., Thomas, L.E.: Exact ground state energy of the strong-coupling polaron. Commun. Math. Phys. 183(3), 511–519 (1997)
Lieb, E.H., Seiringer, R.: Divergence of the effective mass of a polaron in the strong coupling limit. J. Stat. Phys. 180, 23–33 (2020)
Müller, J.S.: The polaron revisited. Rev. Math. Phys. 18(5), 485–517 (2006)
Mukherjee, C., Varadhan, S.R.S.: Identification of the polaron measure I: fixed coupling regime and the central limit theorem for large times. Commun. Pure Appl. Math. 73(2), 350–383 (2019)
Mukherjee, C., Varadhan, S.R.S.: Identification of the polaron measure I: fixed coupling regime and the central limit theorem for large times. arXiv:1802.05696, September 2021
Myśliwy, K., Seiringer, R.: Polaron models with regular interactions at strong coupling. J. Stat. Phys. 186, 1–24 (2021)
Pekar, S.I.: Theory of polarons. Zh. Eksperimen. i Teor. Fiz. 19 (1949)
Spohn, H.: Effective mass of the polaron: a functional integral approach. Ann. Phys. 175(2), 278–318 (1987)
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by H. Spohn.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Betz, V., Polzer, S. Effective Mass of the Polaron: A Lower Bound. Commun. Math. Phys. 399, 173–188 (2023). https://doi.org/10.1007/s00220-022-04553-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00220-022-04553-0