Abstract
We consider a discrete Schrödinger operator \( H_\varepsilon = -\varepsilon ^2\Delta _\varepsilon + V_\varepsilon \) on \(\ell ^2(\varepsilon \mathbb {Z}^d)\), where \(\varepsilon >0\) is a small parameter and the potential \(V_\varepsilon \) is defined in terms of a multiwell energy landscape f on \(\mathbb {R}^d\). This operator can be seen as a discrete analog of the semiclassical Witten Laplacian of \(\mathbb {R}^d\). It is unitarily equivalent to the generator of a diffusion on \(\varepsilon \mathbb {Z}^d\), satisfying the detailed balance condition with respect to the Boltzmann weight \(\exp {(-f/\varepsilon )}\). These type of diffusions exhibit metastable behavior and arise in the context of disordered mean field models in Statistical Mechanics. We analyze the bottom of the spectrum of \(H_\varepsilon \) in the semiclassical regime \(\varepsilon \ll 1\) and show that there is a one-to-one correspondence between exponentially small eigenvalues and local minima of f. Then we analyze in more detail the bistable case and compute the precise asymptotic splitting between the two exponentially small eigenvalues. Through this purely spectral-theoretical analysis of the discrete Witten Laplacian we recover in a self-contained way the Eyring–Kramers formula for the metastable tunneling time of the underlying stochastic process.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
This paper derives sharp semiclassical spectral asymptotics for Schrödinger operators acting on \(\ell ^2(\varepsilon \mathbb {Z}^d)\) of the form
where \(\Delta _\varepsilon \) is the discrete nearest-neighbor Laplacian of \(\varepsilon \mathbb {Z}^d\) and \(V_\varepsilon \) is a possibly unbounded multiplication operator, defined in terms of a multiwell energy landscape f. More precisely, given \(f\in C^2(\mathbb {R}^d)\), we identify \(V_\varepsilon \) with the function
We shall dub \(H_\varepsilon \) the discrete semiclassical Witten Laplacian associated with f. This is motivated by the following observation: the continuous space version of \(H_\varepsilon \), i.e. the Schrödinger operator \(\mathcal {H}_\varepsilon \) on \(L^2(\mathbb {R}^d)\) obtained from (1),(2) by substituting \(\Delta _\varepsilon \) with the Laplacian \(\Delta \) of \(\mathbb {R}^d\), reads
and thus coincides with the restriction on functions of the Witten Laplacian of \(\mathbb {R}^d\) [27, 30, 40, 46]. It is well known that the latter has deep connections to problems in Statistical Mechanics [24]. In some situations, e.g. when considering lattice models of Statistical Mechanics as discussed below, one is led in a natural way to its discrete version (1),(2). The continuous space operator (3) is then rather a simplifying idealization of (1),(2): it is indeed easier to analyze \(\mathcal {H}_\varepsilon \) by exploiting the standard machinery of differential and semiclassical calculus, but the results might be a priori less accurate in making predictions. This paper shows a general strategy which permits to obtain sharp semiclassical estimates directly in the discrete setting.
We are mainly inspired by the analysis [26] on the continuous space Witten Laplacian and by the series of papers [33,34,35,36] by M. Klein and E. Rosenberger, who develop an approach to the semiclassical spectral analysis of discrete Schrödinger operators of the form (1) via microlocalization techniques. We refer also to the earlier work [31] and to [13, 14, 39] for semiclassical investigations in discrete settings.
1.1 Brief description of the main results
Following in particular the approach of [34] we show that under mild regularity assumptions on f there is a low-lying spectrum of exponentially small eigenvalues which is well separated from the rest of the spectrum. Moreover the number of exponentially small eigenvalues equals the number of local minima of f, see Theorem 2.2 below.
Then we analyze in more detail the case of two local minima of f and compute the precise asymptotic splitting between the two small eigenvalues. From a general point of view, this corresponds to a subtle tunneling calculation through other, non-resonant wells [29] of the Schrödinger potential \(V_\varepsilon \), corresponding to saddle points of f.
As opposed to [26] we work again under mild regularity assumptions on f and proceed with a streamlined, direct strategy that avoids WKB expansions, a priori Agmon estimates and also the underlying complex structure of the Witten Laplacian. Much of the simplification is obtained via a suitable choice of global quasimodes. We show that the leading asymptotic of the exponentially small eigenvalue gap is given by an Eyring-Kramers formula:
where \(A,E>0\) are explicit constants depending on f (see Theorem 2.3 for a precise statement) that turn out to coincide with the one obtained in the continuous case for \(\mathcal {H}_\varepsilon \) in [26] (see also [10, 21]). In other terms, the geometric constraint imposed by the lattice turns out to be negligible in first order approximation. The vanishing rate of the remainder term depends on the regularity of f around its critical points. We show that \(f\in C^3(\mathbb {R}^d)\) leads to an error of order \(O(\sqrt{\varepsilon })\), while for \(f\in C^4(\mathbb {R}^d)\) this error improves to \(O(\varepsilon )\).
The spectral Eyring-Kramers formula in the discrete setting considered here is not new. Indeed, up to some minor variants, this type of result has been derived in the framework of discrete metastable diffusions, by analyzing mean transition times of Markov processes via potential theory [8]. We shall discuss below more in detail the probabilistic interpretation of our results. The present paper shows that, as in the continuous setting, also in the discrete setting the Eyring-Kramers formula can be obtained by a direct and self-contained spectral approach, without relying at all on probabilistic potential theory.
We remark that the method we use to analyze the exponentially small eigenvalues can be extended also to the general case with more than two local minima. The extension is based on an iterative finite-dimensional matrix procedure, very similar to the one considered in [26] (see also [16] and references therein). This procedure is independent of the rest and not related to the peculiar analytical difficulties arising from the discrete character of the setting. To not obscure the exposition of the main ideas of this paper, the general case will be discussed somewhere else.
1.2 Connection to discrete metastable diffusions
Our main motivation for investigating the spectral properties of \(H_\varepsilon \) stems from its close connection to certain metastable diffusions with state space \(\varepsilon \mathbb {Z}^d\). These have been extensively studied in the probabilistic literature, mainly due to their paradigmatic properties and their applications to problems in Statistical Mechanics [2, 5, 8, 12, 20, 38, 42]. The general, continuous time version might be described in terms of a Markovian generator \(L_\varepsilon \) of the form
with \(r_\varepsilon (x, x+\varepsilon v)\) being the rate of a jump from x to \(x+\varepsilon v\). The jump rates are assumed to satisfy the detailed balance condition with respect to the Boltzmann weight \(\rho _\varepsilon = e^{-f/\varepsilon }\) on \(\varepsilon \mathbb {Z}^d\), so that \(L_{\varepsilon }\) may be realized as a selfadjoint operator acting on the weighted space \(\ell ^2(\varepsilon \mathbb {Z}^d; \rho _\varepsilon )\). Moreover the scaling is chosen so that \( L_\varepsilon \) formally converges for \(\varepsilon \rightarrow 0\) to a first order differential operator on \(\mathbb {R}^d\), corresponding to a deterministic transport along a vector field. One might thus think of the dynamics as a small stochastic perturbation of a deterministic motion. A standard choice of jump rates satisfying the above requirements is given by
where \((e_1, \ldots , e_d)\) is the standard basis of \(\mathbb {R}^d\).
There is a direct link between the discrete Witten Laplacian and discrete diffusions as described above: up to a change of sign and multiplicative factor \(\varepsilon \), the Markovian generator \(L_\varepsilon \) given by (4),(5) and the discrete Witten Laplacian given by (1),(2) are formally unitarily equivalent. This can be seen by the well-known ground state transformation, which turns a Schrödinger operator into a diffusion operator [32], see Proposition 2.5 below for the precise statement. As a consequence, our spectral analysis of \(H_\varepsilon \) can be immediately translated into analogous results on \(L_\varepsilon \), see Corollary 2.6. The advantage of working with \(H_\varepsilon \) is that in the flat space \(\ell ^2(\varepsilon \mathbb {Z}^d)\) one can exploit Fourier analysis and related microlocalization techniques.
We remark that discrete diffusions as described above naturally arise in the context of disordered mean field models in Statistical Mechanics. A prominent example is the dynamical random field Curie-Weiss model [5, 8, 22, 42], which is well described by a discrete diffusion on \(\varepsilon \mathbb {Z}^d\) after a suitable reduction in terms of order parameters. The limit \(\varepsilon \rightarrow 0 \) then corresponds to the thermodynamic limit of infinite volume.
A characteristic feature of the dynamics \(\partial _t \psi = L_\varepsilon \psi \) for small \(\varepsilon \) is metastability: if f admits several local minima the system remains trapped for exponentially large times in neighborhoods of local minima of f before exploring the whole state space. This is due to the fact that the local minima of f turn out to be exactly the stable equilibrium points of the limiting deterministic motion. We refer to [7, 23, 41] for comprehensive introductions to metastability of Markov processes and e.g. to [3, 17, 37] for shorter surveys.
A key issue in the understanding of metastability is to quantify the time scales at which metastable transitions between local minima occur. For discrete diffusions of type (4) sharp asymptotic estimates have been obtained in [8, 9] in terms of average hitting times. The formula for the leading asymptotics is called Eyring-Kramers formula. In [8] it is also shown that there is a very clean relationshp between the metastable transition times and the low-lying spectrum of \(-L_\varepsilon \). Indeed, there is a cluster of exponentially small eigenvalues, each one being asymptotically equivalent to the inverse of a metastable transition time.
The problem of determining the asymptotic behavior of metastable transition times can therefore be equivalently phrased as a problem of spectral asymptotics of the generator \(L_\varepsilon \) and thus of \(H_\varepsilon \). Due to these facts, one can view the method presented in this paper as a spectral approach to the computation of metastable transition times in discrete setting.
1.2.1 Plan of the paper
In Sect. 2 we introduce the setting, provide precise definitions and basic properties for the discrete Witten Laplacian \(H_\varepsilon \), the diffusion generator \(L_\varepsilon \) and state our main results: Theorem 2.2, saying that there are as many exponentially small eigenvalues of \(H_\varepsilon \) as minima of f and that there is a large gap of order \(\varepsilon \) between them and the rest of the spectrum; Theorem 2.3, giving the precise splitting between exponentially small eigenvalues due to the tunnel effect (Eyring-Kramers formula). In Sect. 3 we collect some preliminary tools which can be seen as general means for a semiclassical analysis on the lattice: the IMS formula for the discrete Laplacian which permits to localize quadratic forms on the lattice; estimates on the discrete semiclassical Harmonic oscillator based on microlocalization techniques; and results on sharp Laplace asymptotics on the lattice \(\varepsilon \mathbb {Z}^d\) based on the Poisson summation formula. In Sects. 4 and 5 we provide the proofs of Theorems 2.2 and 2.3 respectively.
2 Precise Setting and Main Results
Throughout the paper we shall use the following notation. We consider the symmetric set
where \((e_1, \ldots , e_d)\) is the standard basis of \(\mathbb {R}^d\). For \(\varepsilon >0\) the symbols \(\nabla _\varepsilon \) and \(\Delta _\varepsilon \) denote respectively the rescaled discrete gradient and the rescaled discrete Laplacian of the lattice \(\varepsilon \mathbb {Z}^d\), with graph structure induced by \(\varepsilon \mathcal {N}\). More precisely, for every \(\psi : \varepsilon \mathbb {Z}^d \rightarrow \mathbb {R}\) we define
We shall work on the Hilbert space \(\ell ^2(\varepsilon \mathbb {Z}^d) =\{ \psi \in \mathbb {R}^{\varepsilon \mathbb {Z}^d}: \Vert \psi \Vert _{\ell ^2(\varepsilon \mathbb {Z}^d)} < \infty \}\), where \(\Vert \cdot \Vert _{\ell ^2(\varepsilon \mathbb {Z}^d)}\) is the norm corresponding to the scalar product
The discrete Laplacian \(\Delta _\varepsilon \) is a bounded linear operator on \(\ell ^2(\varepsilon \mathbb {Z}^d)\). It is also selfadjoint and \(-\Delta _\varepsilon \) is nonnegative. More precisely, for \(\psi , \psi '\in \ell ^2(\varepsilon \mathbb {Z}^d)\), once can check that
and in particular
Here \(\Vert \cdot \Vert _{\ell ^2(\varepsilon \mathbb {Z}^d;\mathbb {R}^\mathcal {N})}\) is the norm induced by the scalar product
defined for \(\alpha , \alpha '\in \ell ^2(\varepsilon \mathbb {Z}^d;\mathbb {R}^\mathcal {N}):=\{ \alpha \in \mathbb {R}^{\varepsilon \mathbb {Z}^d \times \mathcal {N}}: \Vert \alpha (\cdot , v) \Vert _{\ell ^2(\varepsilon \mathbb {Z}^d)} < \infty \text { forall } v\in \mathcal {N}\} \) (the space of square integrable 1-forms on the graph \(\varepsilon \mathbb {Z}^d\)).
2.1 Definition and basic properties of \(H_\varepsilon \)
Given a function \(f:\mathbb {R}^d\rightarrow \mathbb {R}\) and a parameter \(\varepsilon >0\), we define a new function \( V_\varepsilon :\mathbb {R}^d\rightarrow \mathbb {R}\) by setting
Note that the expression (6) for \(V_\varepsilon \) and the one given in the introduction in (2) are equal by definition of \(\Delta _\varepsilon \) and \(\nabla _\varepsilon \). We shall identify in the sequel \(V_\varepsilon \) with the corresponding multiplication operator in \(\ell ^2(\varepsilon \mathbb {Z}^d)\) having dense domain \(\textrm{Dom}(V_\varepsilon ) = \{ \psi \in \ell ^2(\varepsilon \mathbb {Z}^d): V_\varepsilon \psi \in \ell ^2(\varepsilon \mathbb {Z}^d) \}\). The restriction of \( V_\varepsilon \) to \(C_c(\varepsilon \mathbb {Z}^d)\) (i.e. the set of \(\psi \in \mathbb {R}^{\varepsilon \mathbb {Z}^d}\) such that \(\psi (x)=0\) for all but finitely many x) is essentially selfadjoint.
We are interested in the Schrödinger-type operator \(H_\varepsilon : \textrm{Dom}(V_\varepsilon ) \rightarrow \ell ^2(\varepsilon \mathbb {Z}^d) \) given by
Note that \(H_\varepsilon \) is a selfadjoint operator in \( \ell ^2(\varepsilon \mathbb {Z}^d) \) and its restriction to \(C_c(\varepsilon \mathbb {Z}^d)\) is essentially selfadjoint. This follows e.g. from the Kato-Rellich Theorem [45, Theorem 6.4], using the analogous properties of \(V_\varepsilon \) and the fact that \(\Delta _\varepsilon \) is bounded and selfadjoint.
Moreover, from the pointwise bound \(V_\varepsilon \geqslant - 2d\) and the nonnegativity of \(-\Delta _\varepsilon \) it follows immediately that \(H_{\varepsilon }\) is bounded from below. An important observation is that the quadratic form associated with \(H_{\varepsilon }\) is not only bounded from below, but even nonnegative. This is due to the special form of the potential \(V_\varepsilon \). Indeed, a straightforward computation yields
where \(\nabla _{f, \varepsilon }\) denotes a suitably weighted discrete gradient:
It follows in particular that the spectrum of \(H_\varepsilon \) is contained in \([0,\infty )\).
Remark 2.1
The property (7) states that \(H_\varepsilon \) is the Laplacian associated to the distorted gradient \(\nabla _{f,\varepsilon }\). As it is done for the continuous space Witten Laplacian [30, 46], it is possible to give an extension of \(H_\varepsilon \) in the sense of Hodge theory. The extended operator is then defined on a suitable algebra of discrete differential forms and satisfies the usual intertwining relations. We shall not use this fact and refer to [16] for details.
2.2 Assumptions and main results
We shall consider the following two sets of hypotheses on the function f. Here and in the following \(|\cdot |\) denotes the standard euclidean norm on \(\mathbb {R}^d\). The gradient and Hessian of a function on \(\mathbb {R}^d\) are denoted by \(\nabla \) and \({{\,\textrm{Hess}\,}}\).
H1
\(f\in C^3(\mathbb {R}^d)\) and all its critical points are nondegenerate. Moreover
-
(i)
\(\liminf _{|x| \rightarrow \infty } |\nabla f (x)| >0 \).
-
(ii)
\({{\,\textrm{Hess}\,}}f\) is bounded on \(\mathbb {R}^d\).
Note that H1 implies that the set of critical points of f is finite. Indeed, nondegenerate critical points are necessarily isolated and by (i) the critical points of f must be contained in a compact subset of \(\mathbb {R}^d\).
To analyze the exponential splitting between small eigenvalues we will assume for simplicity the following more restrictive hypothesis.
H2
Hyptohesis H1 holds true. Moreover
-
(i)
\(\liminf _{|x| \rightarrow \infty } \frac{f(x)}{|x|} > 0 \).
-
(ii)
The function f has exactly two local minimum points \(m_0, m_1\in \mathbb {R}^d\).
The first result we present shows that under Assumption H1 the essential spectrum of \(H_\varepsilon \), denoted by \({{\,\textrm{Spec}\,}}_{\text {ess}} (H_{\varepsilon })\), is uniformly bounded away from zero and that its discrete spectrum, denoted by \({{\,\textrm{Spec}\,}}_{\text {disc}} (H_{\varepsilon })\), is well separated into two parts: one consists of exponentially small eigenvalues, the other of eigenvalues which are at least at distance of order \(\varepsilon \) from zero. Moreover, counting multiplicity, the number of exponentially small eigenvalues equals exactly the number of local minima of f:
Theorem 2.2
Assume H1 and denote by \(N_0\in \mathbb {N}_0\) the number of local minima of f. There exist constants \(\varepsilon _0 \in (0, 1)\) and \(C>0\) such that for each \(\varepsilon \in (0, \varepsilon _0]\) the following properties hold true.
-
(i)
\( {{\,\textrm{Spec}\,}}_{\text {ess}} (H_{\varepsilon }) \subset [C, \infty ) \).
-
(ii)
\( | {{\,\textrm{Spec}\,}}_{\text {disc}} (H_{\varepsilon }) \cap [0, C\varepsilon ] | \leqslant N_0\).
-
(iii)
\(H_\varepsilon \) admits at least \(N_0\) eigenvalues counting multiplicity. In the nontrivial case that \(N_0\ne 0\), the \(N_0\)-th eigenvalue \(\lambda _{N_0}(\varepsilon )\) (according to increasing order and counting multiplicity) satisfies the bounds
$$\begin{aligned} 0\leqslant \lambda _{N_0}(\varepsilon ) \leqslant e^{-C/\varepsilon }. \end{aligned}$$
The properties stated in Theorem 2.2 are well-known in the continous space setting [28, 43] and have also been recently extended to certain infinite-dimensional situations [11]. In the finite-dimensional continuous space setting the standard proof consists in approximating the Schrödinger operator with harmonic oscillators around the critical points of f. The error is then estimated using the IMS localization formula, which permits to connect the local estimates around the critical points to global estimates. The discrete case is analytically more difficult, due to the nonlocal character of the discrete Laplacian. The main idea to overcome these difficulties is taken from [34] and consists in localizing not only the potential \(V_\varepsilon \) but the full operator \(H_\varepsilon \). This amounts in localizing the symbol in phase space and is also referred to as micolocalization. The setting in [34] is very general and requires the machinery of pseudodifferential operators, which makes the proof rather involved and requires strong regularity assumptions on the potential \(V_\varepsilon \) which are not assumed here. Here we give a more elementary proof which is adapted to our special case and works well under Hypthesis H1.
We now assume the stronger Hypothesis H2. Then, thanks to the superlinear growth condition H2 (i), it holds
This implies that \(e^{- \frac{f}{2\varepsilon }} \) is in the domain of \(H_{\varepsilon }\) and therefore, since \(H_{ \varepsilon } e^{- \frac{f}{2\varepsilon }} =0\) by direct computation, that 0 is an eigenvalue of \(H_{\varepsilon }\). Moreover, due to the fact that \(\mathcal {N}\) generates the group \(\mathbb {Z}^d\), it follows for example from (7) that only multiples of \(\Psi _\varepsilon \) can be eigenfunctions corresponding to the eigenvalue 0. Thus we conclude that 0 is an eigenvalue with multiplicity 1 for every \(\varepsilon >0\).
Since, by assumption, there are \(N_0=2\) local minima of f, it follows from Theorem 2.2 that, for \(\varepsilon >0\) sufficiently small, there is exactly one eigenvalue \(\lambda _\varepsilon \) of \(H_{\varepsilon }\), which is different from 0 and is exponentially small in \(\varepsilon \). Moreover, by the same theorem, \(\lambda _\varepsilon \) must have multiplicity 1. Our second main result provides the precise leading asymptotic behavior of \(\lambda (\varepsilon )\). This behavior is expressed in terms of two constants \(A,E>0\), giving respectively the prefactor and the exponential rate. More precisely one defines
where \(h_*:= \max \{f(m_0), f(m_1)\}\in \mathbb {R}\) and where \(h^*\in \mathbb {R}\) is given by the height of the barrier which separates the two minima. More precisely, \(h^* \) can be defined as follows [26]. For \(h\in \mathbb {R}\) we denote by \(\mathcal {S}_f(h):= f^{-1} \left( (-\infty , h) \right) \) the (open) sublevel set of f corresponding to the height h and by \( N_{f}(h)\) the number of connected components of \(\mathcal {S}_f(h) \). Then \(h^*(f)\in \mathbb {R}\) is defined as the maximal height which disconnects \(\mathcal {S}_f(h)\) into two components (Fig. 1):
By simple topological arguments, on the level set \(f^{-1}(h^*)\) there must be at least one critical point of f of index 1 and at most a finite number n of them, which we label in an arbitrary order as \(s_1, \ldots , s_{n}\). We denote by \(\mu (s_k)\) the only negative eigenvalue of \({{\,\textrm{Hess}\,}}f(s_k)\). The constant A is then defined in terms of the quadratic curvature of f around the two minima and the relevant saddle points. More precisely, one defines
Our second main theorem is the following.
Theorem 2.3
Assume H2 and take \(\varepsilon _0>0\) as in Theorem 2.2. Let A, E be given respectively by (8), (10) and let, for \(\varepsilon \in (0, \varepsilon _0)\), \(\lambda (\varepsilon )\) be the smallest non-zero eigenvalue of \(H_{\varepsilon }\). Then the error term \(\mathcal {R}(\varepsilon )\), defined for \(\varepsilon \in (0, \varepsilon _0)\) by
satisfies the following: there exists a constant \(C>0\) such that \(| \mathcal {R}(\varepsilon ) | \leqslant C \varepsilon ^\gamma \) for every \(\varepsilon \in (0, \varepsilon _0)\), with \(\gamma =\frac{1}{2}\). The same holds true with \(\gamma =1\) under the additional asumption \(f\in C^4(\mathbb {R}^d)\).
Remark 2.4
As shown in the author’s PhD thesis [16], using the underlying Witten complex structure, the additional assumption \(f\in C^\infty (\mathbb {R}^d)\) implies that \(\mathcal {R}_\varepsilon \) admits full asymptotic expansions in powers of \(\varepsilon \). The latter result is analogous to the result of [26] in continuous space setting. However the proof of complete expansions given in [16] is substantially more involved, since it requires a construction and detailed analysis of discrete WKB expansions on the level of 1-forms. We remark also that the geometric constraint imposed by the lattice becomes appreciable at the level of higher order corrections since the asymptotic expansion of \(\mathcal {R}_\varepsilon \) differs in general from the one given in the continous setting.
As anticipated in the introduction, our main results can be easily translated into results on spectral properties of the class of metastable discrete diffusions with generator (4), (5). Since this might be a particularly interesting application of our results, we shall spell out precisely their consequences from the stochastic point of view.
2.3 Results on the diffusion operator \(L_{\varepsilon }\)
Given a function \(f:\mathbb {R}^d\rightarrow \mathbb {R}\) and a parameter \(\varepsilon >0\), we consider the weight functions
Note that \(\rho _\varepsilon \) and \(r_\varepsilon \) are related by the identity
We work now in the weighted Hilbert space \(\ell ^2(\rho _\varepsilon )\) obtained as subspace of \(\mathbb {R}^{\varepsilon \mathbb {Z}^d}\) by introducing the weighted scalar product
and the corresponding induced norm \(\Vert \cdot \Vert _{\ell ^2(\rho _\varepsilon )}\). We shall denote by \(L_{\varepsilon }\) the Laplacian of the weighted graph \(\varepsilon \mathbb {Z}^d\), whose vertices are weighted by \(\rho _\varepsilon \) and whose edges (determined by \(\mathcal {N}\)) are weighted by \(\rho _\varepsilon r_\varepsilon \). More precisely we define \(L_{\varepsilon }: \textrm{Dom} (L_{\varepsilon }) \rightarrow \ell ^2(\rho _\varepsilon ) \) by setting
and, for each \(x\in \varepsilon \mathbb {Z}^d\),
This provides a Hilbert space realization of the formal operator (4), (5).
Proposition 2.5
For each \(\varepsilon >0\) the operators \(- \varepsilon L_\varepsilon \) and \(H_\varepsilon \) are unitarily equivalent.
Proof
Let \(\varepsilon >0\). We consider the unitary operator
Then a direct computation shows that
and that \(\Phi _\varepsilon [ \textrm{Dom}(L_{\varepsilon }) ] = \mathrm{Dom(V_\varepsilon )} \). \(\square \)
From the unitarily equivalence it follows that \(L_{\varepsilon }\) is not only symmetric and nonnegative (this can be checked by summation by parts and using the detailed balance condition (11)), but also selfadjoint. We remark also that \(C_c(\varepsilon \mathbb {Z}^d)\), which is a core for \(H_{\varepsilon }\) and is invariant under \(\Phi _\varepsilon \), is also a core of \(L_{\varepsilon }\).
Combining Proposition 2.5 with Theorems 2.2 and 2.3 yields then the following result.
Corollary 2.6
Assume H1 and denote by \(N_0\in \mathbb {N}_0\) the number of local minima of f. There exist constants \(\varepsilon _0 \in (0, 1)\), \(C>0\) such that for each \(\varepsilon \in (0, \varepsilon _0]\) the following properties hold true.
-
(i)
\( {{\,\textrm{Spec}\,}}_{\text {ess}} (-L_{\varepsilon }) \subset [\varepsilon ^{-1}C, \infty ) \) and \( | {{\,\textrm{Spec}\,}}_{\text {disc}} (-L_{\varepsilon }) \cap [0, C] | \leqslant N_0\).
-
(ii)
\(-L_\varepsilon \) admits at least \(N_0\) eigenvalues \(\lambda _1, \ldots , \lambda _{N_0}(\varepsilon )\) counting multiplicity. In the nontrivial case that \(N_0\ne 0\), the \(N_0\)-th eigenvalue \(\lambda _{N_0}(\varepsilon )\) (according to increasing order and counting multiplicity) satisfies the bounds
$$\begin{aligned} 0\leqslant \lambda _{N_0}(\varepsilon ) \leqslant e^{-\frac{C}{\varepsilon }}. \end{aligned}$$
Moreover, assuming in addition H2, it holds \(\lambda _1(\varepsilon )=0\) and, taking A, E as in (8), (10), the error term \(\mathcal {R}(\varepsilon )\), defined for \(\varepsilon \in (0, \varepsilon _0)\) by
satisfies the following: there exists a constant \(C>0\) such that \(| \mathcal {R}(\varepsilon ) | \leqslant C \varepsilon ^\gamma \) for every \(\varepsilon \in (0, \varepsilon _0)\), with \(\gamma =\frac{1}{2}\). The same holds true with \(\gamma =1\) under the additional asumption \(f\in C^4(\mathbb {R}^d)\).
We stress that (i) implies a quantitative scale separation between the \(N_0\) slow modes, corresponding to the metastable tunneling times, and all the other modes, corresponding to fast relaxations to local equilibria. In principle it is also possible to refine the analysis of the fast modes revealing the full hierarchy of scales governing the dynamics in the small \(\varepsilon \) regime, see [19] for a \(\Gamma \)-convergence formulation in continuous space setting and the recent [4].
As already mentioned, the rigorous derivation of an Eyring-Kramers formula of type (13) in the setting of discrete metastable diffusions had already been derived by a different approach based on capacity estimates [7, 9]. Compared to these previous results the formula given in (13) differs in two aspects:
-
(1)
The estimate on the error term \( \mathcal {R}(\varepsilon )\) is improved by our approach, since in [7, Theorems 10.9 and 10.10], under the same regularity assumptions as considered here (\(f\in C^3(\mathbb {R}^d)\)) a logarithmic correction appears. More precisely our result improves the error estimate from \(\mathcal {R}(\varepsilon )= \mathcal {O}(\sqrt{\varepsilon [\log {1/\varepsilon }]^3})\) to \(\mathcal {R}(\varepsilon )= \mathcal {O}(\sqrt{\varepsilon })\). Further, we show that \(\mathcal {R}(\varepsilon )= \mathcal {O}(\varepsilon )\) under the stronger assumption \(f\in C^4(\mathbb {R}^d)\).
-
(2)
The prefactor A given in (13) differs from the one given in [7, 9]. This is due to our slightly different choice of jump rates, compare (5) with [7, (10.1.2.), p. 248]. Indeed it is clear that the prefactor is sensible to the particular choice of jump rates among the infinitely many possible jump rates satisfying the detailed balance condition with respect to the Boltzmann weight \( e^{-f/\varepsilon }\). This sensitivity of the prefactor is opposed to the robustness of the exponential rate E, which is universal as can be seen e.g. via a Large Deviations analysis. We remark that, while the rates chosen in [7] correspond to a Metropolis algorithm, our choice (5) corresponds, in the context of the Statistical Mechanics models mentioned above, to a heat bath algorithm. This is a very natural choice and is considered for example in [38]. As observed in the introduction, it is the choice which in first order approximation gives the same prefactor as the continuous space model (3). Furthermore, [7, 9] concerns discrete time processes, which means that the rates are normalized and thus bounded over \(\mathbb {R}^d\). Our setting includes also the case of possibly unbounded rates which requires some additional technical work for the analysis outside compact sets.
3 General Tools for a Semiclassical Analysis on the Lattice
This section is devoted to some preliminary tools for a semiclassical analysis on the lattice.
Section 3.1 concerns a discrete IMS localization formula, see [45, Lemma 11.3] or [15, Theorem 3.2], where also an explanation of the name can be found, for the standard continuous space setting and [34]. The IMS formula is a simple observation based on a computation of commutators. It will be used repeteadly for decomposing the quadratic form induced by a Schrödinger operator into localized parts.
Section 3.2 provides estimates on the first two eigenvalues of the discrete semiclassical Harmonic oscillator. These estimates follow from more general results proven in [34]. Nevertheless we shall include a relatively short and completely selfcontained proof, which focuses on the estimates needed to prove the separation between exponentially small eigenvalues of \(H_\varepsilon \) and the rest of its spectrum, as provided by Theorem 2.2. The proof is based on a microlocalization which permits to separate high and low frequency actions of the operator.
Section 3.3 provides sharp asymptotic results for Laplace-type sums. These are instrumental in almost all the computations necessary for deriving the Eyring-Kramers formula for the eigenvalue splitting and for tunneling calculations in general. Our proofs are again based on Fourier analysis. In particular, following [16], we shall use the Poisson summation formula: shifting a function by an integer vector and summing over all shifts produces the same periodization as taking the Fourier series of the Fourier transform. Compared to [16], where it is shown how to get complete asymptotic expansions in the smooth setting, here we shall relax the regularity assumptions on the phase function to cover the applications we have in mind.
3.1 The discrete IMS formula
We say that the set \(\{\chi _j\}_{j\in J}\) is a smooth quadratic partition of unity of \(\mathbb {R}^d\) if J is a finite set, \(\chi _{j}\in C^\infty (\mathbb {R}^d)\) for every \(j\in J\) and \(\sum _{j\in J} \chi ^2_{j} \equiv 1\).
Proposition 3.1
There exists a constant \(C>0\) such that for every \(\varepsilon >0\), every \(\psi \in \ell ^2(\varepsilon \mathbb {Z}^d)\) and every smooth quadratic partition of unity \(\{\chi _j\}_{j\in J}\) it holds
Proof
We have
thus
Differentiating the relation \(\sum _j\chi ^2_j \equiv 1\) yields \(\sum _j \chi _j \nabla \chi _j \cdot v \equiv 0 \) for every v and therefore, by Taylor expansion, for every \(x\in \mathbb {R}^d\), \(v\in \mathcal {N}\) and \(\varepsilon >0\),
The claim follows now from (14) and (15) by noting that the assumption \(\sum _j\chi ^2_j \equiv 1\) also implies \(\sup _{j, x} |\chi _{j}(x)|\leqslant 1\), that \( \Vert \psi (\cdot +\varepsilon v) \Vert _{\ell ^2(\varepsilon \mathbb {Z}^d)} = \Vert \psi \Vert _{\ell ^2(\varepsilon \mathbb {Z}^d)} \) for every v and recalling that \(\mathcal {N}\) is bounded. \(\square \)
3.2 Estimates on the discrete semiclassical Harmonic oscillator
We provide lower bounds for the first and the second eigenvalue of the semiclassical discrete Harmonic oscillator.
Proposition 3.2
For every \(x\in \mathbb {R}^d\) let \(U(x) = \langle x-{\bar{x}}, M (x-{\bar{x}}) \rangle \), where \({\bar{x}}\in \mathbb {R}^d\) and M is a symmetric \(d\times d\) real matrix with strictly positive eigenvalues denoted by \(\kappa _1, \ldots , \kappa _d\). Moreover let \(\lambda _0 = \sum _j \sqrt{\kappa _j} \) and \(\lambda _ 1 = \sum _j \sqrt{\kappa _j} + 2 \min _j \sqrt{\kappa _j} \). Then there exist for every \(\varepsilon >0\) a function \(\Psi _{\varepsilon }\in \ell ^2(\varepsilon \mathbb {Z}^d)\) and constants \(\varepsilon _0, C>0\) such that for every \(\varepsilon \in (0, \varepsilon _0]\) and \(\psi \in C_c(\varepsilon \mathbb {Z}^d)\) the following hold:
-
(i)
\( \langle \left( -\varepsilon ^2\Delta _\varepsilon + U \right) \psi , \psi \rangle _{\ell ^2(\varepsilon \mathbb {Z}^d) } \geqslant \varepsilon \left( \lambda _0 - C\varepsilon ^{\frac{1}{5}} \right) \Vert \psi \Vert ^2_{\ell ^2(\varepsilon \mathbb {Z}^d) }. \)
-
(ii)
\( \langle \left( -\varepsilon ^2\Delta _\varepsilon + U \right) \psi , \psi \rangle _{\ell ^2(\varepsilon \mathbb {Z}^d) } \geqslant \varepsilon \left( \lambda _1 - C\varepsilon ^{\frac{1}{5}} \right) \Vert \psi \Vert ^2_{\ell ^2(\varepsilon \mathbb {Z}^d) } - \langle \psi , \Psi _\varepsilon \rangle ^2_{\ell ^2(\varepsilon \mathbb {Z}^d) }. \)
The proof is by localization around low frequencies in Fourier space and comparison with the corresponding continuous Harmonic oscillator on \(\mathbb {R}^d\), whose first and second eigenvalue are given respectively by \(\varepsilon \lambda _0 \) and \(\varepsilon \lambda _1 \). At low frequencies, discrete and continuous Harmonic oscillators are close, while the high frequencies do not contribute to the bottom of the spectrum.
In the proof we shall use the following notation: for \(\varepsilon >0\) and \(\psi \in \ell ^1(\varepsilon \mathbb {Z}^d)\) we define
and for \(\varepsilon >0\) and \(\phi \in L^1(\mathbb {R}^d)\) we define
Then by Parseval’s theorem
and by Plancherel’s theorem
We recall also the inversion theorem for the Fourier transform and Fourier series, which in our notation reads as follows. Let \(\phi \in S(\mathbb {R}^d)\), the Schwartz space on \(\mathbb {R}^d\), and let \({\tilde{\phi }} (x) = {\check{\phi }} (-x)\) for every \(x\in \mathbb {R}^d\). Then
Moreover, for every \(\phi \in C^\infty (\mathbb {R}^d)\) with \({{\,\textrm{supp}\,}}( \phi ) \subset (-\pi , \pi )^d\) it holds \({\check{\phi }}\in \ell ^1(\varepsilon \mathbb {Z}^d)\) and
Proof of Proposition 3.2
Let \(\varepsilon \in (0,1]\), \(\psi \in C_c(\varepsilon \mathbb {Z}^d)\) and let \(\varphi := \left( -\varepsilon ^2\Delta _\varepsilon + U \right) \psi \). Then \(\phi \in C_c(\varepsilon \mathbb {Z}^d)\) and \({\hat{\varphi }} = \left( W - A_\varepsilon \right) {\hat{\psi }}\), where \(W:\mathbb {R}^d\rightarrow \mathbb {R}\) is a multiplication operator given by
and \(A_\varepsilon \) is a second order differential operator given by
It follows then by Parseval’s theorem (16) that
We now consider a cut-off function \(\theta \in C^\infty (\mathbb {R}^d;[0,1])\) which equals 1 on \(\{\xi : |\xi | \leqslant 1\}\) and vanishes on \(\{\xi : |\xi | \geqslant 2\}\). For \(j=1, \ldots , N\) we define with \(s=\tfrac{2}{5}\) the \(\varepsilon \)-dependent smooth quadratic partition of unity \(\{ \theta _{0,\varepsilon }, \theta _{1,\varepsilon }\}\) by setting
Moreover we denote by \(W_0\) the leading term in the \(\xi \)-expansion of the function W around the origin, i.e.
A simple rearrangement of terms gives
where the localization errors \(\mathcal {E}_1(\varepsilon ), \mathcal {E}_2 (\varepsilon )\) are given by
The four terms in the right hand side of (21) are analyzed separately in the following.
(1) Analysis of the first term in the right hand side of (21).
Using that \({{\,\textrm{supp}\,}}\theta _{0,\varepsilon }\subset (-\pi , \pi )^d\) for \(\varepsilon \in (0,1]\), Plancherel’s theorem (17) and that the smallest eigenvalue of the Harmonic Oscillator \(- \varepsilon ^2 \Delta + U\) on \(\mathbb {R} ^d\) is \(\lambda _0\), gives
Moreover, considering for \(\varepsilon >0\) the ground state
an analogous computation gives for \(\varepsilon \in (0,1]\) the estimate
where for the last equality the Fourier inversion theorem (18) is used for \(g_\varepsilon \). Moreover using (17), (19) and (16) we get
Thus, setting for shortness
we can conclude that
(2) Analysis of the second term in the right hand side of (21).
Using the inequality \(\sin \tfrac{t}{2} \geqslant \tfrac{t}{4}\) for \(t\in [0,\pi ]\) gives
Since \({{\,\textrm{supp}\,}}\theta _{1,\varepsilon } \subset \{ \xi \in \mathbb {R}^d: |\xi | \geqslant 2 \varepsilon ^{\tfrac{2}{5}} \}\) for \(\varepsilon \in (0,1]\), the bound (24) implies
Moreover, since \({\hat{\psi }}\) is periodic and \(\theta _{1,\varepsilon }\) equals 1 around the boundary of \([-\pi , \pi ]^d\) for \(\varepsilon \in (0,1]\), integration by parts gives
In particular, it follows from (25) and (26) that there exists an \(\varepsilon '_0\in (0,1]\) such that for all \(\varepsilon \in (0,\varepsilon '_0]\)
(3) Analysis of the localization error \(\mathcal {E}_1\).
Since by Taylor expansion there exists a \(C'>0\) such that \( |W(\xi )-W_0(\xi )| \leqslant C' |\xi |^3\) for \(|\xi | \leqslant 2\), one gets
In particular, since \( \Vert \theta _{0,\varepsilon } {\hat{\psi }} \Vert ^2_{L^2([-\pi , \pi ]^d)} \leqslant \Vert {\hat{\psi }} \Vert ^2_{L^2([-\pi , \pi ]^d)} = \Vert \psi \Vert ^2_{\ell ^2(\varepsilon \mathbb {Z}^d)}\), it follows that
(4) Analysis of the localization error \(\mathcal {E}_2\).
A straightforward computation (see also [45, Lemma 11.3]) gives the IMS localization formula
Thus there exists a constant \(C''>0\) such that
Recalling that \(s=\tfrac{2}{5}\) and the Parseval theorem (16) we conclude that
Final step.
Statement (i) in the theorem follows by putting together (20), (21), (22), (27), (28) and (29), chosing \(C= 8C' + C''\) and \(\varepsilon _0= \varepsilon _0'\) and observing that
Statement (ii) follows similarly, but using (23) instead of (22) and chosing . \(\square \)
3.3 Laplace asymptotics on \(\varepsilon \mathbb {Z}^d\)
Given \(x_0\in \mathbb {R}^d\) and \(\delta >0\) we denote by \(B_\delta (x_0) = \{ x\in \mathbb {R}^d: |x-x_0| < \delta \}\) the open ball of radius \(\delta \) around \(x_0\) and, for each \(\varepsilon > 0 \), by \(B_\delta ^\varepsilon (x_0) = B_\delta (x_0) \cap \varepsilon \mathbb {Z}^d\) its intersection with \(\varepsilon \mathbb {Z}^d\) and by \([B^\varepsilon _\delta ( x_0)]^c = \varepsilon \mathbb {Z}^d{\setminus } B^\varepsilon _\delta ( x_0)\) the complementary of \(B^\varepsilon _\delta ( x_0)\).
Proposition 3.3
Let \(q(x) = \tfrac{1}{2} x \cdot Q x \), where Q is a symmetric, positive definite \(d\times d\) matrix and let \(x_0\in \mathbb {R}^d\) and \(m \in \mathbb {N}_0\). Then there exists a \(\gamma >0\) such that for every \(\varepsilon \in (0,1]\)
Moreover for every \(\delta >0\) there exists \(\gamma (\delta )>0\) such that for every \(\varepsilon \in (0,1]\)
Remark 3.4
The Gaussian integrals appearing on the right hand side of (30) can be computed explicitly. We shall use in the sequel the explicit value only for \(m=0\), in which case (30) becomes
We shall also use the following estimate for odd moments:
The latter follows from Proposition 3.3 and the Cauchy–Schwarz inequality
Proof of Proposition 3.3
The function \(x\mapsto u(x):= |x|^{2m} e^{-q(x-x_0)}\) is in the Schwartz space \(\mathcal {S}(\mathbb {R}^d)\) and its Fourier transform \({\hat{u}} (x):=\int _{\mathbb {R}^d} u(y)\ e^{-2\pi \text {i}x\cdot y} \ dy\) satisfies the Poisson summation formula (see e.g. [44, Corollary 2.6, p. 252])
It follows that
with
Since \({\hat{u}}\) is a linear combination of derivatives of Gaussian functions, there exist constants \(C, \gamma >0\) such that
It follows that for every \(\varepsilon \in (0,1]\)
with \(C':= C \sum _{x\in \mathbb {Z}^d {\setminus }\{0\}} e^{-\gamma (2|x|^2 - 1)} \) which concludes the proof of (30).
In order to prove (31), fix \(\delta >0\) and note that, due to the positive definiteness of Q, there exists a constant \(C>0\) such that \(q(x) > C\delta ^2\) for every \(x\in [B^\varepsilon _\delta ( x_0)]^c \). Thus, for \(\varepsilon \in (0, 1]\),
with \(K = e^{ C\delta ^2} \left( \int _{\mathbb {R}^d} e^{ - q(x-x_0) } dx +1 \right) \). To see the last inequality one can use e.g. the Poisson summation formula for \( \varepsilon ^d \sum _{x\in \varepsilon \mathbb {Z}^d} e^{ - q(x-x_0) } \). From (33), chosing \(\gamma >0\) sufficiently small and \(C'>0\) sufficiently large we obtain
The estimate (31) for \(m=0\) follows then using (30) with \(m=0\). The case of positive m can be proven in the same way. \(\square \)
The following proposition concerns more general, not necessarily quadratic phase functions.
Proposition 3.5
Let \(x_0\in \mathbb {R}^d\), \(\delta >0\), \(k\in \{3,4\}\) and \(\varphi \in C^k(\overline{B_\delta (x_0)})\) s.t.
Moreover let \(m \in \mathbb {N}_0\). Then for \(\varepsilon \in (0,1]\) it holds
where \(q(x) = \tfrac{1}{2} {{\,\textrm{Hess}\,}}\varphi (x_0) x\cdot x\) for all \(x\in \mathbb {R}^d\).
Remark 3.6
Finer asymptotic expansions for Laplace-type sums have been proven in [16, Appendix C] under the stronger regularity assumption \(\varphi \in C^\infty (B_\delta (x_0))\).
Proof
We reduce the problem to the quadratic case of Proposition 3.3. For \(x\in \overline{B_\delta (x_0) } \) let for short \(r(x)= \varphi (x) - q(x-x_0)\) and note that there exist \(\alpha , \delta '>0\) such that \({\tilde{q}}(x):= \alpha |x|^2 \) satisfies
and also \( \varphi (x) \geqslant {\tilde{q}}(x-x_0) \) for all \(x\in \overline{ B_\delta (x_0)}\). Indeed, the assumption \(\varphi \in C^3(\overline{B_\delta (x_0)})\) implies the existence of a constant \(C>0\) such that \(|r(x)| \leqslant C |x-x_0|^3\) for all \(x\in \overline{B_\delta (x_0)}\). It follows that, denoting by \(\lambda >0\) the smallest eigenvalue of \({{\,\textrm{Hess}\,}}\varphi (x_0)\) and taking e.g. \(\delta ' = \tfrac{\lambda }{4C}\) and \(\alpha ' = \tfrac{\lambda }{4} \),
Note that, a fortiori, also \( \varphi (x)\geqslant {\tilde{q}}(x)\) for every \(x \in B_{\delta '}(x_0)\). Moreover, since \(\tfrac{\varphi (x)}{|x|^2}\) is continuous and stricly positive on the compact set \(\overline{B_{\delta }(x_0)} \setminus B_{\delta '}(x_0)\) we can take e.g.
It will be enough to prove (35) with the sum on the left hand side restricted to \(B_{\delta '}^\varepsilon (x_0)\), since by Proposition 3.3 there exists a \(\gamma >0\) s.t. for \(\varepsilon \in (0,1]\)
We shall consider the decomposition
with, setting for short \(u_\varepsilon (x) = |x-x_0|^{2\,m} e^{ -\frac{ q(x) }{\varepsilon }} \),
and
It follows from Proposition 3.3 that there exists a \(\gamma >0\) s.t. for \(\varepsilon \in (0,1]\)
Moreover, using \(|r(x)| \leqslant C |x-x_0|^3\) for all \(x\in \overline{B_\delta (x_0)}\) and (36) gives
with the last estimate being a consequence of Proposition 3.3. Finally, in order to analyze the term \(I_1(\varepsilon )\), we consider first the case \(k=3\). We then have by (32)
which together with (37), (38) and (39) finishes the proof for \(k=3\). For the case \(k=4\) we write \(r(x) = t_3(x) + \rho (x)\), where \(t_3: \overline{B_{\delta '}( x_0)} \rightarrow \mathbb {R}\) is the cubic term in the Taylor expansion of \(\varphi \) around \(x_0\), thus satisfying \(t_3(x_0+x) = t_3(x_0-x)\), and \(\rho : \overline{B_{\delta '}( x_0)} \rightarrow \mathbb {R}\) satisfies \(|\rho (x)| \leqslant C' |x-x_0|^4\) for some \(C'>0\). We then have
and therefore by Proposition 3.3
which finishes the proof in the case \(k=4\). \(\square \)
4 Proof of Theorem 2.2
Recall the definition of \(V_\varepsilon \) given in (6). To prove Theorem 2.2 we shall reduce to suitable localized problems and then exploit basic pointwise estimates on \(V_\varepsilon \) as stated in the following two complementary lemmata. The first one gives a uniform strictly positive lower bound on \(V_\varepsilon \) away from critical points. The second one concerns the local behavior of \(V_\varepsilon \) around critical points. Note that these bounds are almost immediate to obtain, even under weaker assumptions, if instead of \(V_\varepsilon \) one considers the corresponding continuous space potential \( \tfrac{1}{4} |\nabla f|^2 - \tfrac{\varepsilon }{2} \Delta f\) appearing in (3). The discrete case follows from straightforward Taylor expansions and elementary estimates. We shall give the details of the arguments at the end of this section for completeness.
Lemma 4.1
Assume \(f\in C^2(\mathbb {R}^d)\) and that \({{\,\textrm{Hess}\,}}f\) is bounded on \(\mathbb {R}^d\). Let \(S\subset \mathbb {R}^d\) and \(a>0\) such that \(|\nabla f(x)|> a\) for every \(x\in S\). Then there exist constants \(\varepsilon _0,C>0\) such that
Lemma 4.2
Assume \(f\in C^3(\mathbb {R}^d)\). Let \(z\in \mathbb {R}^d\) such that \(\nabla f(z)=0\), \(R>0\) and
Then there exists a constant \(C>0\) such that for all \(x\in B_R(z)\) and \( \varepsilon >0\)
After these preliminary estimates on \(V_\varepsilon \) we turn to the proof of Theorem 2.2. We first show that the essential spectrum of \(H_{\varepsilon }\) is bounded from below by a constant, as claimed in Theorem 2.2 (i).
Proposition 4.3
(Localization of the essential spectrum). Under Assumption H1 there exist constants \(\varepsilon _0, C>0\) such that
Remark 4.4
The proof given below shows that the claim of Proposition 4.3 still holds without assuming that the critical points of f are nondegenerate. Also the regularity assumption on f can be relaxed by assuming \(f\in C^2(\mathbb {R}^d)\) instead of \(f\in C^3(\mathbb {R}^d)\).
Proof
Let \(\chi := \alpha \textbf{1}_{K}\), where \(\textbf{1}_{K}\) is the indicator function of a bounded set \(K\subset \mathbb {R}^d\) and \(\alpha \in \mathbb {R}\). Then \(\chi \), seen as a multiplication operator in \(\ell ^2(\varepsilon \mathbb {Z}^d)\), is of finite rank (in particular compact) for every \(\varepsilon >0\). It follows from Weyl’s theorem that for fixed \(\varepsilon >0\),
Moreover
The claim follows by chosing \(\alpha \) and K large enough so that for some constants \(\varepsilon _0,C>0\) the inequality \(V_\varepsilon (x)+\chi (x)\geqslant C\) holds for every \(x\in \mathbb {R}^d\) and \(\varepsilon \in (0,\varepsilon _0]\). To see that this choice is possible recall the uniform bound \(V_\varepsilon \geqslant -2d\) and note that by Assumption H1 (i) there exist \(a>0, R>0\) such that \(|\nabla f(x)| >a\) for \(|x| > R\). It follows then by Lemma 4.1 that for suitable \(C, \varepsilon _0>0\) it holds \(V_\varepsilon (x) \geqslant C\) for \(|x|>R\) and \(\varepsilon \in (0, \varepsilon _0)\).
\(\square \)
The next proposition provides the crucial estimate for the proof of statement (ii) in Theorem 2.2.
Proposition 4.5
Assume H1 and denote by \(N_0\in \mathbb {N}_0\) the number of local minima of f. Then there exist constants \(\varepsilon _0, C>0\) and, for every \(\varepsilon >0\), functions \(\Psi _{1,\varepsilon },\ldots , \Psi _{N_0, \varepsilon }\in \ell ^2(\varepsilon \mathbb {Z}^d)\) such that for every \( \psi \in \textrm{Dom}(V_\varepsilon ) \) it holds
As shown next, Statement (ii) in Theorem 2.2 is then a simple consequence of the Max-Min principle (see e.g. [25, Theorem 11.7]). Recall that the latter implies the following: considering increasing order and counting multiplicity, if it exists, the N-th eigenvalue of \( H_{\varepsilon }\) below the bottom of the essential spectrum equals \(\sup _{\mathcal {V}} \inf _{\psi } \langle H_{\varepsilon } \psi , \psi \rangle _{\ell ^2(\varepsilon \mathbb {Z}^d)} \), with the supremum taken over all N-dimensional subspaces \(\mathcal {V}\) of \(\ell ^2 (\varepsilon \mathbb {Z}^d)\) and the infimum taken over all normalized \(\psi \in \mathcal {V}^\perp \cap \textrm{Dom} (V_\varepsilon )\).
Corollary 4.6
Assume H1 and denote by \(N_0\in \mathbb {N}_0\) the number of local minima of f. Then there exist constants \(\varepsilon _0, C>0\) such that
Proof of Corollary 4.6
By Propositions 4.3 and 4.5 we can find \(\varepsilon _0,C>0\) such that
and such that (41) holds. If for every \(\varepsilon \in (0, \varepsilon _0)\) it happens that
the claim is proven. Thus, we only have to check the case in which there exists \(\varepsilon _*\in (0,\varepsilon _0)\) such that
But this case is impossible. Indeed (43) implies that there exist at least \(N_0+1\) distinct eigenvalues of \(H_{\varepsilon _*}\) in \([0,\tfrac{C \varepsilon _*}{2}]\) and thus in particular the \(N_0+1\)-th eigenvalue \(\lambda _{N_0+1}(\varepsilon _*)\)(in increasing order and counting multiplicity) exists and satisfies
In particular \(\lambda _{N_0+1}(\varepsilon _*) \leqslant \tfrac{C \varepsilon _0}{2} \) and therefore, by (42), \(\lambda _{N_0+1}(\varepsilon _*)\) is smaller than the bottom of the essential spectrum. From this, the Max-Min principle and (41) it follows that
where the infimum is taken over all normalized \(\psi \in \mathcal {V}_\varepsilon ^{\perp } \cap \mathrm{Dom(V_\varepsilon )}\), with \(\mathcal {V}_\varepsilon \) being the linear span of the set \(\{\Psi _{1,\varepsilon },\ldots , \Psi _{N_0, \varepsilon }\}\subset \ell ^2(\varepsilon \mathbb {Z}^d)\) appearing in (41). But (44) and (45) are in contradiction. \(\square \)
Proof of Proposition 4.5
We label by \(z_1,\ldots , z_N\) the critical points of f, the ordering being chosen such that \(z_1, \ldots , z_{N_0}\) are the local minima. Then we take a function \(\chi \in C^\infty (\mathbb {R}^d;[0,1])\) which equals 1 on \(\{x: |x| \leqslant 1\}\) and vanishes on \(\{x: |x| \geqslant 2\}\). We shall consider a smooth quadratic partition of unity by defining with \(s=\tfrac{2}{5}\)
for \(j=1, \ldots , N\) and \(\varepsilon \in (0, {\overline{\varepsilon }}]\), where \(\overline{\varepsilon }\in (0,1]\) is sufficiently small so that \( \chi _{0, \varepsilon } \in C^\infty (\mathbb {R}^d) \). We set moreover for \(x\in \mathbb {R}^d\) and \(j=1, \ldots , N\)
Let \(\psi \in \textrm{Dom}(V_\varepsilon )\). It follows from \(\sum _{j=0}^N \chi ^2_{ j,\varepsilon } \equiv 1\) that we can write
with the localization errors given by
The four terms in the right hand side of (46) are now analyzed separately.
(1) Analysis of the first term in the right hand side of (46).
We apply Proposition 3.2: let \(\kappa _1(z_j)\ldots , \kappa _d(z_j)\) be the eigenvalues of \(\tfrac{1}{2}{{\,\textrm{Hess}\,}}f(z_j)\), so that in particular \(\tfrac{1}{2} \Delta f(z_j)= \sum _{i}\kappa _i (z_j)\) and \(\kappa ^2_1(z_j)\ldots , \kappa ^2_d(z_j)\) are the eigenvalues of \(\tfrac{1}{4}\left[ {{\,\textrm{Hess}\,}}f(z_j) \right] ^2\).
Case 1: \(j=1, \ldots , N_0\) (i.e. \(z_j\) is a local minimum of f)
In this case \(\sum _i \left( |\kappa _i(z_j)| - \kappa _i(z_j)\right) =0\) and according to Prop. 3.2 (ii) there exist for every \(\varepsilon >0, j=1,\ldots , N_0\) a function \(\Phi _{j,\varepsilon }\in \ell ^2(\varepsilon \mathbb {Z}^d)\) and constants \(\varepsilon '_{0}, C'>0\) such that for every \(\varepsilon \in (0, \varepsilon '_{0}]\), \(j=1, \ldots , N_0\) and \(\psi \in C_c(\varepsilon \mathbb {Z}^d)\)
Case 2: \(j=N_0+1, \ldots , N\) (i.e. \(z_j\) is not a local minimum of f)
In this case \(\sum _i \left( |\kappa _i(z_j)| - \kappa _i(z_j)\right) > 0\) and according to Prop. 3.2 (i), possibly taking the constants \(\varepsilon '_{0}, C'>0\) smaller, the following holds: for every \(\varepsilon \in (0, \varepsilon '_{0}]\), \(j=N_0+1, \ldots , N\) and \(\psi \in C_c(\varepsilon \mathbb {Z}^d)\)
(2) Analysis of the second term in the right hand side of (46).
According to Lemma 4.2 there exist constants \(r, C''>0\) and \(\varepsilon ''_0\in (0,{\overline{\varepsilon }}]\) such that for every \(j=1, \ldots , N\)
Moreover, according to Lemma 4.1, possibly taking the constants \(\varepsilon _0'', C''>0\) smaller, it holds also
Since \({{\,\textrm{supp}\,}}\chi _{0,\varepsilon } \subset \{ x\in \mathbb {R}^d: |x-z_j| \geqslant 2 \varepsilon ^{\tfrac{2}{5}} \text { for all } j=1, \ldots , N \}\), the lower bounds (49), (50) imply (with possibly reducing further the constant \(\varepsilon _0''>0\))
Using \(-\Delta _\varepsilon \geqslant 0\) we conclude that
(3) Analysis of the localization error \(\mathcal {E}_1\).
Let \(R_{j,\varepsilon }(x):= V_\varepsilon (x) - U_j(x) +\tfrac{\varepsilon }{2} \Delta f(z_j)\). By Lemma 4.2 there exist constants \(C''', \varepsilon _0'''>0\) such that
Thus, since \({{\,\textrm{supp}\,}}\chi _{j,\varepsilon } \subset \{ x\in \mathbb {R}^d: |x-z_j| \leqslant 2 \varepsilon ^{\tfrac{2}{5}}\}\) for all \(j=1, \ldots , N\),
and we conclude that
(4) Analysis of the localization error \(\mathcal {E}_2\).
Using \(\sum _{j=0}^N \chi _{j, \varepsilon }^2 \equiv 1\) gives
Since there is a constant \(K>0\) such that \(\sup _{x\in \mathbb {R}^d} |{{\,\textrm{Hess}\,}}\chi _{j,\varepsilon }(x)| \leqslant K \varepsilon ^{-2\,s}\) for every \(\varepsilon \in (0, {\overline{\varepsilon }}]\) and \(j=0, \ldots , N\), it follows from Lemma 3.1 that there exists a constant \(C''''>0\) such that
In particular we shall use that
Final step.
Taking \(\Psi _{j,\varepsilon }:= \chi _{j,\varepsilon } \Phi _{j, \varepsilon }\), \({\tilde{\varepsilon }}_0:=\min \{\varepsilon _0', \varepsilon _0'', \varepsilon _0'''\}\) and \(\tilde{C}:=\min \{C', C''\}\) gives, according to (46), (47), (48), (51), (52), (53) the lower bound
which implies the desired estimate (41), by taking \(C ={\tilde{C}}/2\) and a sufficiently small \(\varepsilon _0\in (0,\tilde{\varepsilon }_0)\). \(\square \)
It remains to show part (iii) of Theorem 2.2 to complete the proof. In order to do so, we can assume \(N_0\ne 0\), since otherwise there is nothing to prove. By the Max–Min principle [25, Theorem 11.7 and Proposition 11.9] together with the bound on the essential spectrum given by Proposition 4.3 it is sufficient to show that for each \(\varepsilon >0\) there exist \(N_0\) orthonormal functions in the domain \(\textrm{Dom}(V_\varepsilon )\) of \(H_\varepsilon \) such that the quadratic form associated with \(H_\varepsilon \) is exponentially small for each of these functions. We shall now exhibit such a family of orthonormal functions.
Let \(\{z_1,\ldots , z_{N_0}\}\) be the set of local minima of f. We fix \(\delta >0\) such that \(B_{3\delta }(z_k)\cap B_{3\delta }(z_j)\) is the empty set for \(k\ne j\) and such that \(f> f(z_k)\) on \(B_{3\delta }(z_k){\setminus }\{z_k\}\). Moreover we fix for each \(k=1, \ldots , N_0\) a cutoff function \(\chi _k\in C^\infty (\mathbb {R}^d;[0,1])\), satisfying \(\chi \equiv 1\) on \(B_\delta (z_k)\) \(\chi \equiv 0\) on \(\mathbb {R}^d{\setminus } B_{2\delta }(z_k)\). We consider then for each \(\varepsilon >0\) and for each \(k=1, \ldots , N_0\) the functions \(\psi _{k,\varepsilon }:\mathbb {R}^d \rightarrow \mathbb {R}\) given by
Then for each \(\varepsilon >0\) the (restrictions to \(\varepsilon \mathbb {Z}^d\) of the) functions \(\psi _{1,\varepsilon }, \ldots , \psi _{N_0, \varepsilon }\) are in the domain of \(H_\varepsilon \) and orthonormal in \(\ell ^2(\varepsilon \mathbb {Z}^d)\). Moreover the following proposition shows that the quadratic form associated with \(H_\varepsilon \) is exponentially small for each of these functions and thus concludes the proof of Theorem 2.2.
Proposition 4.7
Assume H1 and that the set \(\{z_1,\ldots , z_{N_0}\}\) of local minima of f is not empty. Then there exist \(C, \varepsilon _0>0\) such that for each \(\varepsilon \in (0,\varepsilon _0]\) the functions \(\psi _{1,\varepsilon }, \ldots , \psi _{N_0, \varepsilon }\) defined in (54) satisfy the estimate
Proof
Fix \(k=1, \ldots , N_0\). Then, applying Proposition 3.5 with \(\varphi =f- f(z_k)\), \(k=3\) and \(m=0\), gives for a suitable constant \(K>0\) and for every \(\varepsilon \in (0,1]\)
Further, using (7) and the notation \(F_\varepsilon (x,v) = \frac{1}{2} [ f(x) + f(x+\varepsilon v) ] \),
We take \(\varepsilon '_0\in (0,1]\) small enough such that for all \(\varepsilon \in (0, \varepsilon _0]\) it holds
Moreover we take \(\gamma >0\) small enough such that for all \(\varepsilon \in (0, \varepsilon _0]\) and for all \(k=1, \ldots , N_0\) it holds
It follows then from (56), the uniform bound \(\varepsilon ^2|\nabla _\varepsilon \chi _k|\leqslant 2\) and the existence of a \({\tilde{K}}>0\) with \(\varepsilon ^d |\Omega _\varepsilon | \leqslant {\tilde{K}}\) that
Putting together (55) and (57) gives the claim with e.g. \(C=\gamma /2\) and \(\varepsilon _0\in ( 0, \varepsilon _0']\) sufficiently small. \(\square \)
In the remainder of this section we provide the proofs of the basic estimates on \(V_\varepsilon \) given in Lemmas 4.1 and 4.2.
Proof of Lemma 4.1
A Taylor expansion gives for every \(x\in \mathbb {R}^d\) the representation
where, thanks to the boundedness of \({{\,\textrm{Hess}\,}}f\),
In fact, one may write
and, using \(\cosh 2t - 1 = 2\sinh ^2 t\),
Moreover, for
using \(|e^{t}-1| \leqslant |t|e^{|t|}\) with
and noting that, due to the boundedness of \({{\,\textrm{Hess}\,}}f\), there exists a constant \(A>0\) such that
one gets for \(\varepsilon \in (0,1]\) and every \(x\in \mathbb {R}^d\)
It follows from (58) and (59) that for \(\varepsilon \in (0,1]\) and every \(x\in \mathbb {R}^d\)
Using that \(\cosh t -1 \geqslant t^2\) with \(t= \tfrac{\nabla f(x) \cdot v}{2}\) and \( \sum _{v\in \mathcal {N}} |\nabla f(x) \cdot v|^2 = 2 |\nabla f(x)|^2 \) we get for \(\varepsilon \in (0, \min \{1,\tfrac{1}{R}\})\) and every \(x\in \mathbb {R}^d\) the lower bound
In particular
The claim follows by chosing an \(\varepsilon _0 \in (0, \min \{1, \tfrac{1}{R}, \frac{a^2}{Ra^2 + 2R} \})\) and \(C = \frac{a^2}{2} \ - \ \varepsilon _0 \, \left( \frac{ R a^2}{2} + R \right) \). \(\square \)
Proof of Lemma 4.2
This follows from a straightforward Taylor expansion. Indeed, fixing \(z\in \mathbb {R}^d\) such that \(\nabla f(z)=0\) and \(R>0\), we have on \(B_R(z)\) the uniform estimate
Using the inequality \(|e^t-1-t| \leqslant \tfrac{1}{2} t^2 e^{|t|}\) with \(t = \tfrac{\varepsilon }{4} {{\,\textrm{Hess}\,}}f \, v \cdot v + \mathcal {O}(\varepsilon ^2) \) then gives
The expansion \(\cosh x = 1 + \frac{1}{2} x^2 + \mathcal {O}(x^4)\) and the equalities \(\sum _{v} |\nabla f\cdot v|^2 = 2|\nabla f|^2\) and \(\sum _{v} {{\,\textrm{Hess}\,}}f \, v \cdot v = 2\Delta f\) give
Expanding all terms in x around z, which gives in particular \(|\nabla f (x)|^2 = [{{\,\textrm{Hess}\,}}f(z)]^2 (x-z)\cdot ( x-z) + \mathcal {O}(|x-z|^3) \) and \(\Delta f(x) = \Delta f (z) + \mathcal {O}(|x-z|)\), finishes the proof. \(\square \)
5 Proof of Theorem 2.3
5.1 General strategy
In order to compute the precise asymptotics of the smallest non-zero eigenvalue \(\lambda (\varepsilon )\) of \(H_\varepsilon \) we shall consider a suitable choice of an \(\varepsilon \)-dependent test function \(\psi _\varepsilon \). The latter will be referred to as quasimode and its precise construction will be given in Sect. 5.2. Since \(\psi _\varepsilon \) will be chosen orthogonal to the ground state \(e^{-f/(2\varepsilon )}\) for every \(\varepsilon \), the upper bound on \(\lambda (\varepsilon )\) given in Theorem 2.3 will follow immediately from the Max–Min principle, giving
and from the precise computation of the right hand side in the above formula by using the Laplace asymptotics on \(\varepsilon \mathbb {Z}^d\) given in Sect. 3.3. The result of these computations is the content of Propositions 5.2 and 5.3 (see also Remark 5.5 for the improved error bound under the assumption \(f\in C^4(\mathbb {R}^d)\)).
The proof of the lower bound on \(\lambda (\varepsilon )\) given in Theorem 2.3 is more subtle. We shall derive it as a corollary of Theorem 2.2 and the following abstract Kato–Temple type estimate.
Proposition 5.1
Let \((T, \mathcal {D}(T))\) be a nonnegative selfadjoint operator on a Hilbert space with scalar product denoted by \(\langle \cdot , \cdot \rangle \). Further let \(\tau >0\) such that \((0,\tau )\cap {{\,\textrm{Spec}\,}}(T) \ne \emptyset \) and set \(\lambda = \sup ((0,\tau )\cap {{\,\textrm{Spec}\,}}(T))\). Then for every normalized \(u\in \mathcal {D}(T)\) it holds
Proof
We fix a \(u\in \mathcal {D}(T)\) and denote by \(P = \textbf{1}_{[0,\tau )}(T)\) the spectral projector of T corresponding to the interval \([0,\tau )\). Then
with \(\Vert \cdot \Vert \) the Hilbert space norm. The claim follows now from the Chebyshev-type estimate
which is a simple consequence of the spectral theorem:
\(\square \)
We shall apply Proposition 5.1 to the case \(T = H_\varepsilon \), \(\tau = C\varepsilon \), where C is the constant appearing in Theorem 2.2 and \(u= (\Vert \psi _\varepsilon \Vert _{\ell ^2(\varepsilon \mathbb {Z}^d)})^{-1} \psi _\varepsilon \), where \(\psi _\varepsilon \) is the same quasimode used for the upper bound on \(\lambda (\varepsilon )\). By Theorem 2.2 (ii) we thus obtain a lower bound on \(\lambda (\varepsilon )\). The fact that this lower bound coincides with the lower bound given in Theorem 2.3 is a consequence of the precise computation of the right hand side of (60), which we already mentioned (see Propositions 5.2 and 5.3), and the estimate
The latter estimate will be a consequence of Propositions 5.3 and 5.4, which is proven again by analyzing the Laplace asymptotics of a sum over \(\varepsilon \mathbb {Z}^d\).
We shall assume throughout the rest of this section that the Assumption H2 is satisfied.
5.2 Definition of the quasimode \(\psi _\varepsilon \)
In the semiclassical approach of [26] the eigenfunction corresponding to \(\lambda (\varepsilon )\) is approximated by two distinct quasimodes: the first quasimode is a generic smoothed version of the function equal to one on the basin of attraction of \(m_0\) and equal to \(-1\) on the basin of attraction of \(m_1\). This quasimode turns out to be a good global approximation, except possibly in vicinity of the relevant saddle points, where the transition from 1 to \(-1\) takes place. Around the relevant saddle points, a second quasimode is constructed: it is actually a vector field and used as approximation of the gradient of the eigenfunction. This second quasimode is obtained via a WKB Ansatz and fails to be an exact vector field in general. The relevant estimates are then obtained exploiting the Witten complex structure and computing interaction integrals over the regions where the two quasimodes overlap.
In this paper we use a single global quasimode obtained by a suitable gluing procedure: instead of the mentioned WKB solution, we consider just its quadratic approximation around each saddle point and only in the direction of the unstable manifold. In this way, the vector field becomes exact, and still retains sufficient information to precisely compute leading asymptotics. We then consider a primitive of this exact vector field and use it to prescribe the transition from 1 to \(-1\) around the saddle points, see the definition of \(\kappa _{\varepsilon }\) in (62) below.
Before giving the details, it might be useful to observe that a similar need of constructing good test functions arises also in the approach based on potential theory [7]: in that approach test functions approximate the so-called equilibrium potential which can be characterized as minimizer of the Dirichlet form (or capacity) under suitable boundary conditions. There is however a difference, which is worth to emphasize: in dimension one the variational problem for the equilibrium potential can be solved explicitly by a simple integration of the Euler–Lagrange equations. In higher dimensions this is still very useful, because the test function can be modelled on the 1-d case, exploiting the fact that the metastable transition occurs on the one-dimensional instanton path joining the minima. This is no longer possible for the eigenfunctions, where even in one dimension no explicit representation is available for the solution of the variational problem.
We provide now the details for the definition of the quasimode used in this paper: let \(s_1,\ldots , s_n\) be the relevant saddle points of f, i.e. the critical points of index one of f appearing in formula (10) defining the prefactor A. Given \(x\in \mathbb {R}^d\) we associate to it a linear “reaction coordinate” \(\xi _k= \xi _k(x)\) around the saddle point \(s_k\), which parametrizes the unstable direction of \({{\,\textrm{Hess}\,}}f(s_k)\). More precisely, we choose one of the two normalized eigenvectors corresponding to the only negative eigenvalue \(\mu (s_k)\) of \({{\,\textrm{Hess}\,}}f(s_k)\), denote it by \(\tau _k\), and set
Recalling our notation \(\mathcal {S}_f(h) = f^{-1} \left( (-\infty , h) \right) \) for the open sublevel set of f corresponding to the height \(h\in \mathbb {R}\) and the definition of the height \(h^*\) given in (9), we consider for \(\rho >0\) and \(k=1, \ldots , n\) the closed set
and the open set \(\mathcal {B} = \mathcal {S}_f(h^*+\rho ) {\setminus } \left( \bigcup _k \mathcal {R}_{k} \right) \).
Henceforth the parameter \(\rho >0\) appearing in the definition of \(\mathcal {R}_k\) and \(\mathcal {B}\) is fixed sufficiently small such that the following properties hold (Fig. 2):
-
the set \(\mathcal {B}\) has exactly two connected components \(\mathcal {B}^{(0)}\) and \(\mathcal {B}^{(1)} \), containing respectively \(m_0\) and \(m_1\). Moreover \(f > h_*\) on \( \bigcup _k \mathcal {R}_k\), where we recall that we have set \(h_* = \max \{f(m_0), f(m_1)\}\).
-
\(\mathcal {R}_{k}\) is disjoint from \(\mathcal {R}_{k'}\) for \(k\ne k'\).
-
For each \(k=1, \ldots , n\) the function \(\varphi _k \, = \ f \ + \ |\mu (s_k)| \xi _k^2 \) satisfies \(\varphi _k(x) > f(s_k) \) for every \(x\in \mathcal {R}_{k} {\setminus } \{s_k\}\).
Note that \({{\,\textrm{Hess}\,}}\varphi _k (s_k) = |{{\,\textrm{Hess}\,}}f(s_k)|\). In other terms the quadratic approximation of \(\varphi _k\) around \(s_k\) is obtained from that of f by flipping the sign of the only negative eigenvalue of \({{\,\textrm{Hess}\,}}f(s_k)\).
Let \(\varepsilon \in (0, 1]\). The quasimode \(\psi _\varepsilon \) for the spectral gap is defined as follows. We define first on the sublevel set \(\mathcal {S}_f( h^* + \rho )\)
The constant \(C_{k,\varepsilon }\) appearing above is defined as
and \(\chi \in C^\infty (\mathbb {R};[0,1])\) satisfies \(\chi \equiv 1\) on \([- \frac{\rho }{3},\frac{\rho }{3}]\), \(\chi (\eta )= 0\) for \(|\eta |\geqslant \frac{2}{3} \rho \) and \(\chi (\eta ) =\chi (-\eta )\). Note that
Note also that for each \(k=1, \ldots n\) the sign of the vector \(\tau _k\) defining \(\xi _k\) (see (61)) can be chosen such that \(\kappa _\varepsilon \) is \(C^\infty \) on \(\mathcal {S}_f( h^* + \rho )\), which we shall assume in the sequel. In order to extend \( \kappa \) to a smooth function defined on the whole \(\mathbb {R}^d\) we introduce another cutoff function \(\theta \in C^\infty (\mathbb {R}^d;[0,1])\) by setting for \(x\in \mathbb {R}^d\)
Finally we define the quasimode \(\psi _\varepsilon \) by setting for \(x\in \mathbb {R}^d\)
Note that \(\psi _\varepsilon \in C^\infty (\mathbb {R}^d)\) with compact support. In particular its restiction to \(\varepsilon \mathbb {Z}^d\), which we still denote by \(\psi _\varepsilon \), is in \(C_c(\varepsilon \mathbb {Z}^d) \subset \textrm{Dom}(V_\varepsilon ) \). Moreover, it follows from its very definition that \(\psi _\varepsilon \) is orthogonal to the ground state \(e^{-f/(2\varepsilon )}\) with respect to the scalar product \(\langle \cdot , \cdot \rangle _{\ell ^2(\varepsilon \mathbb {Z}^d)}\).
5.3 Quasimode estimates
We now state the crucial estimates concerning the quasimode \(\psi _\varepsilon \). The proofs follow from straightforward computations exploiting the results of Sect. 3.3 on the Laplace asymptotics for sums over \(\varepsilon \mathbb {Z}^d\). We shall give the details in Sect. 5.4.
Proposition 5.2
Assume H2 and let \(\varepsilon \in (0,1]\). The function \(\psi _\varepsilon \) defined in (65) satisfies
where \(h_*=\max \{f(m_0),f(m_1)\}\) and
Proposition 5.3
Assume H2 and let \(\varepsilon \in (0,1]\). The function \(\psi _\varepsilon \) defined in (65) satisfies
where \(\mu (s_k)\) is the only negative eigenvalue of \({{\,\textrm{Hess}\,}}f(s_k)\) and \(h^*\) is defined in (9).
Proposition 5.4
Assume H2 and let \(\varepsilon \in (0,1]\). The function \(\psi _\varepsilon \) defined in (65) satisfies
where \(h^*\) is defined in (9).
Remark 5.5
With the stronger assumption \(f\in C^4(\mathbb {R}^d)\) the \(\mathcal {O}(\sqrt{\varepsilon })\) error terms appearing in Propositions 5.2 and 5.3 can be shown to be actually \(\mathcal {O}(\varepsilon )\). Indeed it is enough to apply Proposition 3.5 with \(k=4\) instead of \(k=3\) each time it is used in the proofs given below.
5.4 Proofs of the quasimode estimates
Proof of Proposition 5.2
Let \(\varepsilon \in (0,1]\). We first consider the case \(f(m_0) < f(m_1)\) for which \(h_*= f(m_1)\). It is then convenient to write
Recalling the definitions (62), (64) of \(\kappa _\varepsilon \) and \(\theta \), we observe that there exist \(\alpha , \delta >0\) such that \(\theta \kappa _\varepsilon \equiv - 1\) on \(B_\delta (m_0)\), \(\theta \kappa _\varepsilon \equiv +1\) on \( B_\delta (m_1) \) and
-
(i)
\(f \geqslant f(m_0)+\alpha \) on \([B_\delta (m_0)]^c\),
-
(ii)
\(f \geqslant f(m_1)+\alpha \) on \([B_\delta (m_1)]^c \bigcap {{\,\textrm{supp}\,}}(1 + \theta \kappa _\varepsilon )\).
As a preliminary step we claim that
Indeed, from (i) it follows that
where for the last estimate we have used that, by Hypothesis H2 (i), the sum \(\varepsilon ^d\sum _{x\in \varepsilon \mathbb {Z}^d} e^{- f(x) } \) is uniformly bounded in \(\varepsilon \). Proposition 3.5 gives then
Following the same reasoning, but using (ii) instead of (i), yields
The claim (67) is thus obtained by taking the quotient between (68) and (69).
Next we set \(\mathcal {B}_\varepsilon ^{(1)} = \mathcal {B}^{(1)} \cap \varepsilon \mathbb {Z}^d\) and write
In order to compute the first sum we consider \(\alpha , \delta >0\) as above and write
Since \(\kappa _\varepsilon \equiv 1\) on \(B_\delta ^\varepsilon (m_1)\), recalling (66) and (67), another application of Proposition 3.5 yields
For the second sum in (71) it is enough to note that \(\mathcal {B}_\varepsilon ^{(1)} \subset {{\,\textrm{supp}\,}}(1+ \theta \kappa _\varepsilon )\) and use (ii), which, arguing as above, gives
In order to estimate the second sum in (70), note that
Using (67) and (68) and setting \(\Omega _\varepsilon = [\mathcal {B}_\varepsilon ^{(1)}]^c \cap {{\,\textrm{supp}\,}}(1 +\theta \kappa _\varepsilon )\) gives then
The claim of the proposition in the case \(f(m_0) < f(m_1)\) thus follows by observing that there exists \(\alpha '>0\) such that \(f > f(m_1)+\alpha '\) on \(\Omega _\varepsilon \), yielding
We now consider the case \(h_*= f(m_0)= f(m_1)\). It follows from the definition of \(\psi _\varepsilon \) that
Let \(\alpha , \delta >0\) such that \(f \geqslant f(m_0)+\alpha \) on \([B_\delta (m_0) \cup B_\delta (m_1) ]^c\) and \(\theta \kappa _\varepsilon \equiv -1\) on \(B_\delta (m_0)\), \(\theta \kappa _\varepsilon \equiv 1\) on \(B_\delta (m_1)\). With arguments as above one gets
Putting these expressions into (72), the desired result (5.2) follows after some algebraic manipulations. \(\square \)
Proof of Proposition 5.3
Let \(\varepsilon \in (0,1]\). Using (7) and the notation \(F_\varepsilon (x,v) = \frac{1}{2} [ f(x) + f(x+\varepsilon v) ] \) gives
Since the function \(\theta \) has support in the closure of \(\mathcal {S}_f( h^* + \frac{3}{4} \rho )\), we can restrict (for \(\varepsilon \) sufficiently small) the sum running over \( \varepsilon \mathbb {Z}^d\) to the bounded set \(\varepsilon \mathbb {Z}^d \cap \mathcal {S}_f( h^* + \rho )\). Note that \(\mathcal {S}_f( h^* + \rho )\) is the union of the disjoint sets \( \mathcal {B}\) and \(\bigcup _k (\mathcal {R}_k {\setminus } [\mathcal {S}_f( h^* + \rho )]^c)\). We write in the sequel for short \( \mathcal {R}_{k,\varepsilon }:= \varepsilon \mathbb {Z}^d \cap (\mathcal {R}_{ k} {\setminus } [\mathcal {S}_f( h^* + \rho )]^c )\) and \(\mathcal {B}_\varepsilon := \varepsilon \mathbb {Z}^d \cap \mathcal {B} \) and discuss below separately the sum over \(\bigcup _{k=1}^n \mathcal {R}_{k, \varepsilon } \), which will give the main contribution, and the sum over \(\mathcal {B}_\varepsilon \), which will give a negligible contribution.
Below we shall use the Taylor expansion
(1) Analysis on \(\bigcup _{k=1}^n \mathcal {R}_{k, \varepsilon } \).
In order to get rid of \(\theta \) we take \(\delta >0\) small enough such that for each k it holds \(B_\delta (s_k) \subset \mathcal {R}'_k:= \mathcal {R}_{k} \cap \mathcal {S}_f( h^* + \frac{1}{4} \rho ) \). Since \(\theta \) and \(\kappa _\varepsilon \) are uniformly bounded in \(\varepsilon \) and \(f\geqslant h^* + \frac{1}{4} \rho \) on \(\mathcal {R}_{k} {\setminus } B_\delta (s_k) \), we get using (73) that, for \(\varepsilon >0\) sufficiently small (and thus also for \(\varepsilon \in (0,1]\)), it holds
where we have used also that for \(\varepsilon \) sufficiently small \(\theta (x)=\theta (x+\varepsilon v)=1\) for \(x\in B_\delta (s_k)\).
We discuss now in detail the behavior of \(x \mapsto \kappa _\varepsilon \, (x+\varepsilon v) - \kappa _\varepsilon (x) \) near \(s_k\). For \(k =1, \ldots , n\) and \(x\in \mathcal {R}'_{k}\), \(v\in \mathcal {N}\) and \(\varepsilon \in (0,1]\) consider the function \(G=G_{k,x,v,\varepsilon }:[0,1] \rightarrow \mathbb {R}\) defined by
Note that \(G(0)=0\), \( G'(0) = e^{- |\mu (s_k)| \xi _k^2(x)/(2\varepsilon )} \chi (\xi _k(x)) \tau _k \cdot v \),
and for every \(\delta \in [0,1]\)
where R is not depending on \(\varepsilon \) and bounded in k, x, v. By Taylor expansion it follows that
It follows from (75), (76), (73), (63) and the two identities
that for \(k =1, \ldots , n\) and \(x\in \mathcal {R}'_{k}\) and \(\varepsilon >0\) small enough
where for shortness we have set \( \varphi _k(x) = f(x) - f(s_k) + |\mu (s_k)| \xi _k^2(x) \) and
Putting together (74), (78), using Proposition 3.5, summing over k and using the fact that \(f(s_k) = h^*\) for every k finally gives
(2) Analysis on \(\mathcal {B}_\varepsilon \).
As in Step 1) we get rid of \(\theta \) by considering the set \(\mathcal {B}' = \mathcal {B} \cap \mathcal {S}_f( h^* + \frac{1}{4} \rho )\). Arguing as before and now using that \(\kappa _\varepsilon (x)= \kappa _\varepsilon (x+\varepsilon v)\) for every \(x\in \mathcal {B}'\), \(v\in \mathcal {N}\) and \(\varepsilon \) sufficiently small, gives then, with the notation \( \mathcal {B}'_{\varepsilon } = \mathcal {B}' \cap \varepsilon \mathbb {Z}^d\),
\(\square \)
Proof of Proposition 5.4
The isomporphism (12) gives the identity
Since the function \(\theta \) has support in the closure of \(\mathcal {S}_f( h^* + \frac{3}{4} \rho )\), we can restrict (for \(\varepsilon \) sufficiently small) the sum over \( \varepsilon \mathbb {Z}^d\) to the bounded set \(\varepsilon \mathbb {Z}^d \cap \mathcal {S}_f( h^* + \rho )\). As in the proof of Proposition 5.3 we shall split the latter into the disjoint sets \(\bigcup _k \mathcal {R}_{k,\varepsilon }\),with \( \mathcal {R}_{k,\varepsilon }:= \varepsilon \mathbb {Z}^d \cap (\mathcal {R}_{ k} {\setminus } [\mathcal {S}_f( h^* + \rho )]^c )\), and \(\mathcal {B}_\varepsilon := \varepsilon \mathbb {Z}^d \cap \mathcal {B} \).
We discuss here in detail only the contribution coming from the sets \( \mathcal {R}_{k,\varepsilon }\). Indeed the sum over \(\mathcal {B}_\varepsilon \) can be neglected arguing exactly as in Step 2) of Proposition 5.3 and using, instead of (73), that by Taylor expansion
Analysis on \(\bigcup _{k=1}^n \mathcal {R}_{k, \varepsilon } \).
As in the proof of Proposition 5.3 we first get rid of \(\theta \) by taking a \(\delta >0\) small enough such that for each k it holds \(B_\delta (s_k) \subset \mathcal {R}_{k} \cap \mathcal {S}_f( h^* + \frac{1}{4} \rho ) \). Since \(\theta \) and \(\kappa _\varepsilon \) are uniformly bounded in \(\varepsilon \) and \(f\geqslant h^* + \frac{1}{4} \rho \) on \(\mathcal {R}_{k} \setminus B_\delta (s_k) \), we get using (79) that, for \(\varepsilon >0\) sufficiently small (and thus also for \(\varepsilon \in (0,1]\)), it holds
A computation already used in the proof of Proposition 5.3 (see (76)) yields
Hence, using (63), (79), the identity (77) and the identity
one obtains
with
Observing that
and that
shows that the first order terms in (82) cancel out and thus \( \alpha _k(x) = \mathcal {O}(|x-s_k|^2)\). It follows then from (80), (81) that there exists a constant \(C>0\) such that for every \(\varepsilon \in (0,1]\) and every \(k=1, \ldots , n\)
with \( \varphi _k(x) = f(x) + |\mu (s_k)| \xi _k^2(x) - f(s_k) \) and with the last estimate following from Proposition 3.5 by taking \(m=2\). \(\square \)
Data availability
Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.
References
Becker, S., Menegaki, A.: Spectral gap in mean-field \({\cal{O} }(n)\)-model. Commun. Math. Phys. 380, 1361–1400 (2020)
Beltrán, J., Landim, C.: Tunneling and metastability of continuous time Markov chains. J. Stat. Phys. 140(6), 1–50 (2010)
Berglund, N.: Kramers’ law: validity, derivations and generalisations. Markov Process. Relat. Fields 19, 459–490 (2013)
Bertini, L., Gabrielli, D., Landim, C.: Metastable \(\Gamma \)-expansion of finite state Markov chains level two large deviations rate functions. arXiv:2207.02588 (2022)
Bianchi, A., Bovier, A., Ioffe, D.: Sharp asymptotics for metastability in the random field Curie–Weiss model. Electr. J. Probab. 14, 1541–1603 (2008)
Bianchi, A., Gaudillière, A.: Metastable states, quasi-stationary distributions and soft measures. Stoch. Proc. Appl. 126(6), 1622–1680 (2016)
Bovier, A., den Hollander, F.: Metastability: A Potential Theoretic Approach. Springer, Berlin (2015)
Bovier, A., Eckhoff, M., Gayrard, V., Klein, M.: Metastability and low lying spectra in reversible Markov chains. Commun. Math. Phys. 228, 219–255 (2002)
Bovier, A., Eckhoff, M., Gayrard, V., Klein, M.: Metastability in stochastic dynamics of disordered mean-field models. Probab. Theor. Relat. Fields 119, 99–161 (2001)
Bovier, A., Gayrard, V., Klein, M.: Metastability in reversible diffusion processes. 2. Precise estimates for small eigenvalues. J. Eur. Math. Soc. 7, 69–99 (2005)
Brooks, M., Di Gesù, G.: Sharp tunneling estimates for a double-well model in infinite dimension. J. Funct. Anal. 281 (2021)
Cassandro, M., Galves, A., Olivieri, E., Vares, M.E.: Metastable behavior of stochastic dynamics: a pathwise approach. J. Stat. Phys. 35(5/6), 603–634 (1984)
Colin de Verdière, Y.: Tunneling on graphs: an approach “a la Helffer-Sjoestrand”. arXiv:1809.04973 (2018)
Colin de Verdière, Y., Pan, Y., Ycart, B.: Singular limits of Schrödinger operators and Markov Processes. J. Oper. Theory 41, 151–173 (1999)
Cycon, H. L., Froese, R. G., Kirsch, W., Simon, B.: Schrödinger operators with application to quantum mechanics and global geometry. In: Text and Monographs in Physics. Springer (1987)
Di Gesù, G.: Semiclassical spectral analysis of discrete Witten Laplacians. Ph.D Thesis, https://publishup.unipotsdam.de/opus4-ubp/frontdoor/index/index/docId/6287 (2013)
Di Gesù, G., Lelièvre, T., Le Peutrec, D., Nectoux, B.: Jump Markov models and transition state theory: the quasi-stationary distribution approach. Faraday Discuss. 195, 469–495 (2017)
Di Gesù, G., Le Peutrec, D.: Small noise spectral gap asymptotics for a large system of nonlinear diffusions. J. Spectr. Theory 7, 939–984 (2017)
Di Gesù, G., Mariani, M.: Full metastable asymptotic of the Fisher Information. SIAM J. Math. Anal. 49(4), 3048–3072 (2017)
Ding, J., Lubetzky, E., Peres, Y.: The mixing time evolution of Glauber dynamics for the mean-field Ising model. Commun. Math. Phys. 289(2), 725–764 (2009)
Eckhoff, M.: Precise asymptotics of small eigenvalues of reversible diffusions in the metastable regime. Ann. Probab. 33(1), 244–299 (2005)
Fontes, R.L., Mathieu, P., Picco, P.: On the averaged dynamics of the random field Curie–Weiss model. Ann. Appl. Probab. 10(4), 1212–1245 (2000)
Freidlin, M.I., Wentzell, A.D.: Random Perturbations of Dynamical Systems, 2nd edn. Springer, Berlin (1998)
Helffer, B.: Semiclassical Analysis, Witten Laplacians, and Statistical Mechanics. World Scientific, Singapore (2002)
Helffer, B.: Spectral Theory and Its Applications, vol. 139. Cambridge University Press, Cambridge (2013)
Helffer, B., Klein, M., Nier, F.: Quantitative analysis of metastability in reversible diffusion processes via a Witten complex approach. Matematica Contemporanea 26, 41–85 (2004)
Helffer, B., Nier, F.: Hypoelliptic estimates and spectral theory for Fokker–Planck operators and Witten Laplacians. In: Lecture Notes in Mathematics, vol. 1862. Springer (2005)
Helffer, B., Sjöstrand, J.: Multiple wells in the semiclassical limit I. Commun. Partial Difffer. Equ. 9(4), 337–408 (1984)
Helffer, B., Sjöstrand, J.: Multiple wells in the semiclassical limit III: interaction through non-resonant wells. Math. Nachr. 124, 263–313 (1985)
Helffer, B., Sjöstrand, J.: Puits multiples en mécanique semi-classique IV. Commun. Partial Difffer. Equ. 10(3), 245–340 (1985)
Helffer, B., Sjöstrand, J.: Analyse semi-classique pour l’équation de Harper (avec application à l’équation de Schrödinger avec champ magnétique. In: Mém. Soc. Math. France 2. série, tome vol. 34, pp. 1–113 (1988)
Jona-Lasinio, G., Martinelli, F., Scoppola, E.: New approach to the semiclassical limit of quantum mechanics I. Commun. Math. Phys. 80, 223–254 (1981)
Klein, M., Rosenberger, E.: Agmon-type estimates for a class of difference operators. Ann. H. Poincaré 9, 1177–1215 (2008)
Klein, M., Rosenberger, E.: Harmonic approximation of difference operators. J. Funct. Anal. 257, 3409–3453 (2009)
Klein, M., Rosenberger, E.: Asymptotic eigenfunctions for a class of difference operators. Asymptot. Anal. 73(1–2), 1–36 (2011)
Klein, M., Rosenberger, E.: Tunneling for a class of difference operators: complete asymptotics. Ann. Henri Poincaré 19, 3511–3559 (2018)
Landim, C.: Metastable Markov chains. Probab. Surv. 16, 143–227 (2019)
Landim, C., Misturini, R., Tsunoda, K.: Metastability of reversible random walks in potential fields. J. Stat. Phys. 160, 1449–1482 (2015)
Lin, Y., Lippner, G., Yau, S.-T.: Quantum tunneling on graphs. Commun. Math. Phys. 311, 113–132 (2012)
Miclo, L.: On hyperboundedness and spectrum of Markov operators. Invent. Math. 200(1), 311–343 (2015)
Olivieri, E., Vares, M.E.: Large Deviations and Metastability. Cambridge University Press, Cambridge (2005)
Schlichting, A., Slowik, M.: Poincaré and logarithmic Sobolev constants for metastable Markov chains via capacitary inequalities. Ann. Appl. Probab. 29(6), 3438–3488 (2019)
Simon, B.: Semi-classical analysis of low lying eigenvalues I. Non-degenerate minima: asymptotic expansion. Annales de l’I.H.P. Physique théorique 38(3), 295–308 (1983)
Stein, E., Weiss, G.: Introduction to Fourier Analysis on Euclidean Spaces. Princeton University Press, Princeton (1971)
Teschl, G.: Mathematical methods in quantum mechanics. In: With Applications to Schrödinger Operators, 2nd edn. Graduate Studies in Mathematics, vol. 157. American Mathematical Society, Providence, RI (2014)
Witten, E.: Supersymmetry and Morse theory. J. Differ. Geom. 17, 661–692 (1982)
Acknowledgements
The author gratefully acknowledges the financial support of HIM Bonn in the framework of the 2019 Junior Trimester Programs “Kinetic Theory” and “Randomness, PDEs and Nonlinear Fluctuations”.
Funding
Open access funding provided by Università degli Studi di Roma La Sapienza within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Di Gesù, G. Spectral Analysis of Discrete Metastable Diffusions. Commun. Math. Phys. 402, 543–580 (2023). https://doi.org/10.1007/s00220-023-04729-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00220-023-04729-2