Abstract
Consider the complete graph on n vertices. To each vertex assign an Ising spin that can take the values \(-1\) or \(+1\). Each spin \(i \in [n]=\{1,2,\dots , n\}\) interacts with a magnetic field \(h \in [0,\infty )\), while each pair of spins \(i,j \in [n]\) interact with each other at coupling strength \(n^{-1} J(i)J(j)\), where \(J=(J(i))_{i \in [n]}\) are i.i.d. non-negative random variables drawn from a probability distribution with finite support. Spins flip according to a Metropolis dynamics at inverse temperature \(\beta \in (0,\infty )\). We show that there are critical thresholds \(\beta _c\) and \(h_c(\beta )\) such that, in the limit as \(n\rightarrow \infty \), the system exhibits metastable behaviour if and only if \(\beta \in (\beta _c, \infty )\) and \(h \in [0,h_c(\beta ))\). Our main result is a sharp asymptotics, up to a multiplicative error \(1+o_n(1)\), of the average crossover time from any metastable state to the set of states with lower free energy. We use standard techniques of the potential-theoretic approach to metastability. The leading order term in the asymptotics does not depend on the realisation of J, while the correction terms do. The leading order of the correction term is \(\sqrt{n}\) times a centred Gaussian random variable with a complicated variance depending on \(\beta ,h\), on the law of J and on the metastable state. The critical thresholds \(\beta _c\) and \(h_c(\beta )\) depend on the law of J, and so does the number of metastable states. We derive an explicit formula for \(\beta _c\) and identify some properties of \(\beta \mapsto h_c(\beta )\). Interestingly, the latter is not necessarily monotone, meaning that the metastable crossover may be re-entrant.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction and Main Results
1.1 Background
Interacting particle systems evolving according to a Metropolis dynamics associated with an energy functional called the Hamiltonian, may be trapped for a long time near a state that is a local minimum of the free energy, but not a global minimum. The deepest local minima are called metastable states, the global minimum is called the stable state. The transition from a metastable state to the stable state marks the relaxation of the system to equilibrium. To describe this relaxation, one needs to identify the set of critical configurations the system must attain in order to achieve this transition and to compute the crossover time. These critical configurations correspond to saddle points in the free energy landscape.
Metastability for interacting particle systems on lattices has been studied intensively in the past. For a summary, we refer the reader to the monographs by Olivieri and Vares [13], and Bovier and den Hollander [6]. Successful attempts towards understanding metastable behaviour in random environments were made for the random field Curie–Weiss model, by Mathieu and Picco [12], Bovier et al. [3] and Bianchi et al. [1, 2]. Recently, there has been interest in metastability for interacting particle systems on random graphs. This is challenging, because the crossover times typically depend on the realisation of the graph. In den Hollander and Jovanovksi [11] and Bovier et al. [7], Glauber dynamics on dense Erdős-Rényi random graphs was analysed. Earlier work on metastability for Glauber dynamics on sparse random graphs can be found in Dommers [8] (random regular graph) and Dommers et al. [10] (configuration model). The present paper is a first step towards the study of metastability for Glauber dynamics on Chung-Lu-like random graphs.
To the best of our knowledge, Tindemans and Capel [14] and Dommers et al. [9] are the only references where the model with the interaction Hamiltonian in (1.2) below has been studied in detail. Both focus on equilibrium properties only.
1.2 Glauber dynamics on the complete graph with coupling disorder
Let \({\mathscr {K}}_n\) be the complete graph on n vertices. Each vertex carries an Ising spin that can take the values \(-1\) or \(+1\). Let \(S_n = \{-1,+1\}^{[n]}\) denote the set of spin configurations on \({\mathscr {K}}_n\), where \([n]=\{1,2,\dots , n\}\). Let \((\varOmega ,{\mathscr {F}},\mathscr {P})\) be an abstract probability space, and let \(J=(J(i))_{i \in [n]}\) be a sequence of i.i.d. random variables on this probability space taking values in a finite set \(\{a_1,\dots ,a_k\} \subset [0,\infty )\) of cardinality \(k \in \mathbb {N}\). The distribution of these random variables is given by
with \(\sum _{\ell \in [k]}\omega _{\ell }=1\).
Let \(H_n:S_n \rightarrow \mathbb {R}\) be the interaction Hamiltonian defined by
where \(h \in [0,\infty )\) is the magnetic field. We consider Glauber dynamics on \(S_n\), defined as the continuous-time Markov process with transition rates
where \(\beta \in (0,\infty )\) is the inverse temperature, \(\sigma '\sim \sigma \) means that \(\sigma '\) differs from \(\sigma \) by a single spin-flip and \([\cdot ]_+\) is the positive part. This dynamics is reversible with respect to the Gibbs measure
where the normalising constant \(Z_n\) is called the partition sum. Note that the reference measure for (1.4) is the counting measure on \(S_n\). We write
to denote a path of the Glauber dynamics on \(S_n\), and \({\mathbb {P}}_\sigma \) and \({\mathbb {E}}_\sigma \) to denote probability and expectation on path space given \(\sigma _0=\sigma \) (we suppress \(J,h, \beta \) and n from the notation).
For fixed n, if \(h=0\) the Hamiltonian in (1.2) has two global minima, at \(\sigma \equiv +1\) and \(\sigma \equiv -1\), while if \(h>0\) it achieves a global minimum at \(\sigma \equiv +1\) and a local minimum at \(\sigma \equiv -1\). The latter is the deepest local minimum not equal to the global minimum (at least for h small enough). However, in the limit as \(n\rightarrow \infty \), these do not form a metastable pair of configurations because entropy comes into play.
1.3 Metastability on the complete graph with coupling disorder
In this section we state our main results.
1.3.1 Empirical magnetisations
The relevant quantity to monitor in order to characterise the metastable behaviour is the disorder weighted magnetisation
The following quantities will be essential for coarse-graining. Define the level sets
and the level magnetisations
Put
and note that \(K_n(\sigma ) = \frac{1}{n}\sum _{\ell \in [k]} a_\ell \,\left| {A_{\ell ,n}}\right| \,m_{\ell ,n}(\sigma )\) depends on \(\sigma \) only through \(m_{n}(\sigma )\). Thus, with abuse of notation, we may define
so that \(K_n(\sigma )=K_n(m_{n}(\sigma ))\).
1.3.2 Thermodynamic limit
As \(n\rightarrow \infty \), by the law of large numbers the random function \(K_n\) converges uniformly in probability to a deterministic function K given by
Similarly, the random free energy function \(F_n\) converges uniformly in probability to a deterministic function \(F_{\beta ,h}\) (see (2.15) and (2.26) below for explicit formulas). In Sect. 3, we show that the stationary points of \(F_{\beta ,h}\) are given by \({\mathbf {m}} = ({\mathbf {m}}_\ell )_{\ell \in [k]}\), where
Note that, via (1.12), the k-dimensional vector \({\mathbf {m}}\) is fully determined by the real number \(K({\mathbf {m}})\). Therefore, finding the stationary points of \(F_{\beta ,h}\) reduces to finding the solutions of the equation
1.3.3 Metastable regime
It turns out that the critical inverse temperature \(\beta _c\) is given by
Namely, if \(\beta \in (0, \beta _c]\), then the system is not in the metastable regime for any \(h \in [0,\infty )\), while if \(\beta \in (\beta _c,\infty )\), then, for \(h \in [0,\infty )\) small enough, it is in the metastable regime (i.e., (1.13) has more than one solution at which \(T_{\beta ,h}\) is not tangent to the diagonal). Given \(\beta \in (\beta _c,\infty )\), the critical magnetic field \(h_c(\beta )\) is the minimal value of h for which the system is not metastable. The metastable regime is thus
In Sect. 3, we show that \(\beta \mapsto h_c(\beta )\) is continuous on \((\beta _c,\infty )\), with
where the explicit value of C is given in (3.12) below. Interestingly, \(\beta \mapsto h_c(\beta )\) is not necessarily monotone, i.e., the metastable crossover may be re-entrant.
It turns out that there exists an \(\ell \in [k]\) (depending on \(\beta ,h\) and on the law of the components of J), such that \(F_{\beta ,h}\) has \(2\ell +1\) stationary points.
1.3.4 Metastable crossover
Let \({\mathscr {M}}_n\) be the set of minima of \(F_n\). Given \({\mathbf {m}} \in {\mathscr {M}}_n\), define
Let \({\mathscr {G}}(A,B)\) be the gate between two disjoint subsets A and B of \({\mathscr {M}}_n\). We refer to [6, Section 10.1] for a precise definition of the gate.
Fix \({\mathbf {m}}_n \in {\mathscr {M}}_n\) as the initial magnetisation. Throughout the paper we assume that the following hypotheses hold for \({\mathbf {m}}_n\).
Hypothesis 1
-
1.
\({\mathscr {M}}_n({\mathbf {m}}_n)\) is non-empty.
-
2.
The Hessian of \(F_n\) has only non-zero eigenvalues at \({\mathbf {m}}_n\) and at all the points in \({\mathscr {G}}({\mathbf {m}}_n,{\mathscr {M}}_n({\mathbf {m}}_n))\).
-
3.
There is a unique point \({\mathbf {t}}_n\) in \({\mathscr {G}}({\mathbf {m}}_n,{\mathscr {M}}_n({\mathbf {m}}_n))\), which will often be called simply saddle point.
-
4.
The saddle point \({\mathbf {t}}_n\) is such that \(r_{\ell }\big [\left| {A_{\ell ,n}}\right| (1-{\mathbf {t}}_{\ell ,n}^2)\big ]^{-1}\) takes distinct values for different \(\ell \in [k]\), where \(r_{\ell }\) is defined in (4.9) below.
Hypothesis 1(2) and (3) are made to avoid complications. Hypothesis 1(4) is needed in the proof of Lemma 4.1 below (as in [6, Lemma 14.9]). Neither is very restrictive: if for some parameter choice they fail, then after an infinitesimal parameter change they hold. Moreover, if Hypothesis 1(3) fails, it is sufficient to compute separately the contribution to the crossover time of the various saddle points in the gate.
Let \(S_n[{\mathbf {m}}_n]\) and \(S_n[{\mathscr {M}}_n({\mathbf {m}}_n)]\) denote the sets of configurations in \(S_n\) for which the level magnetisations are \({\mathbf {m}}_n\) and are contained in \({\mathscr {M}}_n({\mathbf {m}}_n)\), respectively. For \(A \subset S_n\), write
to denote the first hitting time or return time of A.
We next state our main results for the crossover time. Theorem 1.1 provides a sharp asymptotics for the average crossover time from any metastable state to the set of states with lower free energy. Theorem 1.2 shows that asymptotically the crossover time is exponential on the scale of its mean, a property that is standard for metastable behaviour.
Theorem 1.1
(Average crossover time with coupling disorder). Let \({\mathbb {A}}_n(\cdot )\) be the \(k \times k\) Hessian matrix defined in (4.2) below, and \(\gamma _{n}\) the unique negative solution of the equation in (4.20) below. For every \(\mathbf {m_n} \in {\mathscr {M}}_n\) satisfying Hypothesis 1 and within the metastable regime (1.15), uniformly in \(\sigma \in S_n[\mathbf {m_n}]\), and with \(\mathscr {P}\)-probability tending to 1,
Theorem 1.2
(Exponential law with coupling disorder). For every \(\mathbf {m_n} \in {\mathscr {M}}_n\) satisfying Hypothesis 1 and within the metastable regime (1.15), uniformly in \(\sigma \in S_n[{\mathbf {m}}_n]\) and with \(\mathscr {P}\)-probability tending to 1,
As the average crossover time estimated in Theorem 1.1 is a random variable, we next provide more information on the randomness of the quantity in the right-hand side of (1.19), which depends on the realisation of the random variable J. The prefactor in (1.19) converges with \(\mathscr {P}\)-probability tending to 1 to a deterministic limit, which depends on the law of J but not on the realisation of J. However, the exponent does not converge to a deterministic limit. In Theorem 1.3 we compute the exponent up to order O(1). Recall that \(F_n \rightarrow F_{\beta ,h}\), \({\mathbf {m}}_n\rightarrow {\mathbf {m}}\) and \({\mathbf {t}}_n\rightarrow {\mathbf {t}}\) as \(n\rightarrow \infty \).
Theorem 1.3
(Randomness of the exponent) For every \(\mathbf {m_n} \in {\mathscr {M}}_n\) satisfying Hypothesis 1 and within the metastable regime (1.15), in distribution,
where Z is a normal random variable with mean zero and variance in \((0,\infty )\), defined on \((\varOmega ,{\mathscr {F}}, \mathscr {P})\) and independent of J.
The variance of Z turns out to be a complicated function of \(\beta \), h and the distribution of J. We refer to Sect. 6.3 for further details. Computing the exponent up to order 1 is in principle possible, but the formulas become rather complicated. Without this precision the prefactor in (1.19) is asymptotically negligible. Still, knowing this prefactor allows us to determine what the leading order behaviour of the randomness is.
1.4 Discussion on the continuous case
Bianchi et al. [1, 2] study the Curie–Weiss model with a random magnetic field whose distribution is continuous. Lumping techniques work for discrete distributions but not for continuous distributions. The latter require coarse-graining techniques to approximate the continuous distribution by a sequence of discrete distributions. In the present paper we consider pair interaction random variables with a discrete distribution only. It seems hard to obtain results with a similar precision for continuous distributions. The techniques employed in [1, 2] do not carry over, because the error introduced by the coarse-graining turns out to be quadratic rather than linear.
1.5 Techniques and outline
In order to prove Theorems 1.1–1.3 we use the potential-theoretic approach to metastability developed in Bovier et al. [4, 5]. More specifically, we first find a sharp approximation of the Dirichlet form associated with the coarse-grained dynamics. We use these results, together with lumpability properties and well-known variational principles, to obtain sharp capacity estimates that are key quantities in the proof. For a more detailed overview of the methods, we refer the reader to the monograph by Bovier and den Hollander [6].
The remainder of the paper is organised as follows. Section 2 provides quantities and notations that are needed throughout the paper. Section 3 identifies the metastable regime. Section 4 provides a sharp approximation of the Dirichlet form associated with the Glauber dynamics in the presence of the disorder. Section 5 provides estimates on capacity and on the metastable valley measure. Section 6 proves Theorems 1.1–1.3. Appendix A contains a brief overview on known results for the standard CW model, which corresponds to the setting without disorder. Appendix B gives numerical evidence for the presence of multiple metastable states for suitable choices of \(\beta \), h and of the law of the components of J. Appendix C contains an example in which \(\beta \mapsto h_c(\beta )\) is not increasing, implying the possibility of a re-entrant metastable crossover. Appendix D provides the limit as \(n \rightarrow \infty \) of the prefactor in (1.19).
2 Preparations
Section 2.1 introduces further notation and writes the Hamiltonian in terms of the level magnetisations. Section 2.2 introduces the Dirichlet form associated with the Glauber dynamics and rewrites this in terms of the level magnetisations. Section 2.3 computes gradients and Hessians of the free energy as a function of the level magnetisations. Section 2.4 closes with an approximation of the free energy that will be needed later on.
2.1 Hamiltonian
Recall (1.7). Abbreviate
Since, by the law of large numbers, \((\omega _{\ell ,n})_{\ell \in [k]} \rightarrow (\omega _{\ell })_{\ell \in [k]} \in (0,\infty )^k\) as \(n\rightarrow \infty \) with \(\mathscr {P}\)-probability tending to 1, we may and will assume that \(A_{\ell ,n} \ne \emptyset \) for all \(\ell \in [k]\) and all n large enough. Recall (1.8)–(1.9). Note that \(m_{\ell ,n}(\sigma )\) takes values in the set
Hence \(m_n(\sigma )\) takes values in the set
The configurations corresponding to \(M \subseteq \varGamma _n\) are denoted by
For singletons \(M=\{m\}\) we write \(S_n[m]\) instead of \(S_n[\{m\}]\).
Let
which is the Hamiltonian in (1.2), except for the diagonal term \(-\frac{1}{2n} \sum _{i \in [n]} J^2(i)\), which is a constant shift. Using the notation above, we can write the Hamiltonian in (2.5) as
where we abbreviate
2.2 Dirichlet form and mesoscopic dynamics
By (1.3)–(1.4), the Dirichlet form associated with the Glauber dynamics equals
where h is a test function on \(S_n\) taking values in [0, 1]. Because of (2.6), for any h such that \(h(\sigma ) = {{\bar{h}}}(m_n(\sigma ))\), with \({{\bar{h}}}\) a test function on \(\varGamma _n\), we have
where \(m=(m_\ell )_{\ell \in [k]}\). If \(\sigma ' \sim \sigma \), then \(\sigma ' = \sigma ^i\) for some \(i \in [n]\), with \(\sigma ^i\) obtained from \(\sigma \) by flipping the spin with label i. Let \(\ell ' \in [k]\) be such that \(i \in A_{\ell ',n}\). If \(\sigma (i)=\pm 1=-\sigma ^{i}(i)\), then
For \(m, m' \in \varGamma _n\), we write \(m \sim m'\) when there exists an \(\ell ' \in [k]\) such that \(m'=m^{\ell ',+}\) or \(m'=m^{\ell ',-}\), where
Moreover, for \(\ell \in [k]\) and \(\sigma \in S_n\) with \(m_n(\sigma )=m\), the cardinality of the set \(\{\sigma '\in S_n:\,\sigma ' \sim \sigma ,\,m_n(\sigma ')=m^{\ell ,\pm }\}\) equals \(\frac{1\mp m_{\ell } }{2} |A_{\ell ,n}|\), namely, the number of \((\mp 1)\)-spins in \(\sigma \) with index in \(A_{\ell ,n}\). Furthermore,
as is seen by counting the number of \((-1)\)-spins with label in \(A_{\ell ,n}\) of a configuration with \(\ell \)-th level magnetisation \(m_{\ell }\). Using these observations, we can rewrite (2.9) as
Next, abbreviate
and put
where \(E_n(m)\) is defined in (2.7). Moreover, define
With this notation, we can write the mesoscopic measure \(Q_n(\cdot )= \mu _n \circ m_n^{-1}(\cdot )\) on \(\varGamma _n\), with \(\mu _n\) defined in (1.4), as
and so the Dirichlet form in (2.13) becomes
2.3 Gradients and Hessians
Denote the Cramér entropy by
Define
Since \(\left| {A_{\ell ,n}}\right| = [1+o_n(1)]\,\omega _\ell n\), we can use Stirling’s formula \(N! = [1+o_N(1)]\,N^N\mathrm {e}^{-N} \sqrt{2\pi N}\) to obtain
where the error term is uniform in \(m \in \varGamma _n\). For \(\ell , {\bar{\ell }} \in [k]\), we compute
and
Recalling (2.7), we compute
Define
Remark 2.1
By (2.21), \(F_n(m)= {\bar{F}}_n(m) + O(n^{-1}\log n)\), where \(F_n\) is defined in (2.15).\(\spadesuit \)
For \(m \in [-1,1]^k\), define
which corresponds to the uniform limit in probability of \(F_n\) as \(n \rightarrow \infty \). Compute
and
The same formulas apply for \(I_n,F_n\), with an error term \(O(n^{-1})\).
2.4 Additional computation
We conclude with a computation that will be useful later on. Recalling (2.11), we write
where
The same formula applies for \(I_n\) with an error term of order \(O(n^{-1})\), and hence
Note that \(\varDelta _{\ell ,n}^\pm =O(1)\). Therefore, using (2.15), we get
3 Metastable Regime
Section 3.1 identifies the stationary points of \({\bar{F}}_n\). Section 3.2 identifies the metastable regime. Section 3.3 provides details on the 1-dimensional metastable landscape.
3.1 Stationary points of \({\bar{F}}_n\) and \(F_{\beta ,h}\)
By (2.27), the critical points \(m=(m_{\ell })_{\ell \in [k]}\) of \({\bar{F}}_n\) solve the system of equations (with \(\omega _{\ell ,n}\ne 0\))
Hence
Since \(\mathrm {arctanh\,}x = \tfrac{1}{2} \log \frac{1+x}{1-x}\), \(x \in (-1,+1)\), (3.2) can be rewritten as
Similarly, the critical points \(m=(m_{\ell })_{\ell \in [k]}\) of \(F_{\beta ,h}\) solve the deterministic equation
Note that this can also be obtained directly from (3.3) after replacing \(\omega _{\ell ,n}\) by its mean value \(\omega _{\ell }\).
3.2 Metastable regime
We are interested in identifying the metastable regime, i.e., the set of pairs \((\beta ,h)\) for which \(F_{\beta ,h}\) has more than one minimum. Put
From the characterisation of the critical points of \(F_{\beta ,h}\) in (3.4) it follows that
Note that any critical point \(m=(m_{\ell })_{\ell \in [k]} \in [-1,1]^k\) of \(F_{\beta ,h}\) is uniquely determined by \(K(m )\in \mathbb {R}\). Consequently, the problem of solving the k-dimensional system in (3.4) can be reduced to solving the 1-dimensional Eq. (3.6). Recalling Hypothesis 1(2), the system is in the metastable regime if and only if (3.6) has more than one solution that is not tangent to the diagonal.
Compute
For \(h=0\), the system is metastable when
in which case \(T_{\beta ,h}\) has a unique inflection point at \(K=0\), implying that (3.6) has three solutions \(K \in \{-K^*, 0,+K^*\}\) with \(K^*>0\). Otherwise (3.6) has only one solution \(K = 0\).
We proceed with the more interesting case \(h>0\).
3.2.1 Number of solutions
Lemma 3.1
(Number of solutions). For \(h>0\), the number of critical points of \(F_{\beta ,h}\), i.e., solutions of (3.6), varies in \(\{1,3, \dots , 2\ell +1\}\), where \(\ell \in [k]\) and \(2\ell -1\) is the number of inflection points of \(T_{\beta ,h}\).
Proof
For \(h>0\) and K positive and large enough, \(T_{\beta ,h}''(K)<0\). Moreover, for \(h>0\) and K negative with \(\left| {K}\right| \) large enough, \(T_{\beta ,h}''(K)>0\). Therefore, \(T_{\beta ,h}\) has at least one inflection point and the number of inflection points of \(T_{\beta ,h}\) cannot be even: it takes values in \(\{1,3, \dots , 2k-1\}\) depending on \(\beta ,h\) and on the law of the components of J. Consequently, if \(2\ell -1\) (\(\ell \in [k]\)) is the number of inflection points, then the cardinality of the solutions of (3.6) takes values in \(\{1,3, \dots , 2\ell +1\}\) depending on \(\beta ,h\) and on the law of the components of J. \(\square \)
We conjecture that for any finite k there exist \(\beta ,h\) and a law of the components of J such that (3.6) has any number of solutions in the set \(\{1, 3, \dots , 2k+1\}\). We found numerical evidence for this fact for \(k \in \{2,3,4\}\). See Appendix B.
Lemma 3.2
(Unique strictly positive solution). For every \(\beta >0\) and \(h>0\), (3.6) has exactly one strictly positive solution.
Proof
Put \(W(K)=T_{\beta ,h}(K)-K\). The solutions of (3.6) are the roots of W. Clearly, \(W(0)>0\). Moreover, \(\lim _{K\rightarrow \infty } W(K) = -\infty \) because \(\lim _{K\rightarrow \infty } T_{\beta ,h}(K) = \sum _{\ell \in [k]} a_{\ell }\,\omega _{\ell }>0\) is finite. Therefore, by continuity, a root of W(K) exists in \((0,\infty )\).
Let \({\tilde{K}}\) be the smallest positive root of W. Next we will prove that this root is unique. Indeed, \(W(K)''<0\) when \(K \in [0, \infty )\), meaning that \(K \mapsto W(K)'\) is strictly decreasing. By continuity, since \(W(K)>0\) for all \(K \in [0, {\tilde{K}})\), we have \(W({\tilde{K}})'\le 0\) and \(\lim _{K\rightarrow \infty }W(K)'=-1\). Therefore, \(W(K)'< 0\) for all \(K \in ({\tilde{K}}, \infty )\), and so W is strictly decreasing. Moreover, \(W(K)<W({\tilde{K}})=0\) for all \(K \in ({\tilde{K}},\infty )\). Thus, \({\tilde{K}}\) is the only positive root of W. \(\square \)
3.2.2 Metastable regime
Lemma 3.3
(Characterisation of the metastable regime). Equation (3.6) has at least three solutions not tangent to the diagonal if and only if there exists \({\bar{K}}<0\) such that \({\bar{K}}>T_{\beta ,h}({\bar{K}})\), i.e.,
Proof
Using Lemma 3.2, we see that (3.6) has at least three solutions if and only if it has at least two strictly negative solutions. As above, we define \(W(K)=T_{\beta ,h}(K)-K\). The solutions of (3.6) are the roots of W. Now, assume that there exists a \({\bar{K}}<0\) such that \({\bar{K}}>T_{\beta ,h}({\bar{K}})\). Since \(W({\bar{K}})<0\) and \(W(0)>0\), W(K) has a root in \(({\bar{K}},0)\), implying that (3.6) has at least one solution in \(({\bar{K}},0)\). Moreover, since \(\lim _{K\rightarrow - \infty } T_{\beta ,h}(K)= -\sum _{\ell \in [k]} a_{\ell }\,\omega _{\ell }\) is finite, we have \(\lim _{K\rightarrow - \infty } W(K)= \infty \). Because \(W({\bar{K}})<0\), it follows that W has at least one root in \((-\infty ,{\bar{K}})\). With the same argument it can be shown that the negative roots of W are always even. The opposite implication is trivial. \(\square \)
Remark 3.1
Applying the intermediate value theorem to the derivative of \(W(K)=T_{\beta ,h}(K)-K\), we get that if the condition in Lemma 3.3 is satisfied, then there exists a \({\bar{K}}<0\) such that \(T_{\beta ,h}'({\bar{K}})= 1\) and \({\bar{K}}>T_{\beta ,h}({\bar{K}})\). \(\spadesuit \)
Theorem 3.1
(Metastable regime). Define, as in (1.14),
The metastable regime is
with \(\beta \mapsto \beta h_c(\beta )\) non-decreasing on \([\beta _c,\infty )\). Furthermore, if the support of the law of the components of J is put into increasing order, i.e., \(a_1<a_2< \dots <a_k\), then
where the minimum is over all \(\ell \in [k]\) such that the quantity between brackets is positive.
Proof
Recalling Lemma 3.3, we look for conditions for the existence of a \(K<0\) satisfying (3.9). If such a K exists, then by Remark 3.1 there exists a \({\bar{K}}<0\) satisfying (3.9) such that \(T_{\beta ,h}'({\bar{K}})= 1\), which reads
Since the left-hand side of (3.13) is positive, it admits solutions only if
Therefore, (3.14) is a necessary condition for the metastable regime.
Now assume (3.14). Since \(\tanh x \sim x\), \(x \rightarrow 0\), for \(\left| {K}\right| \ll \beta (\max _{\ell \in [k]}a_{\ell })^{-1}\) and \(h\downarrow 0\), we have
which reads
and proves the existence of a negative solution. A positive solution is guaranteed by Lemma 3.2. The existence of a third (strictly negative) solution of (3.4), for every \(\beta >\beta _c\) and for \(h\downarrow 0\), follows as in the proof of Lemma 3.3. Therefore, the lower bound on \(\beta _c\) is sharp.
Since \(h \mapsto T_{\beta ,h}(K)\) is strictly increasing for every fixed \(\beta >0\) and \(K\in \mathbb {R}\), there exists a unique critical curve \(\beta \mapsto h_c(\beta )\) such that the system is metastable for \(0 \le h < h_c(\beta )\) and not metastable for \(h \ge h_c(\beta )\). We know that \(h_c(\beta ) > 0\) for \(\beta > \beta _c\). By passing to the parametrisation \(g = h\beta \), we get that \(\beta \mapsto T_{\beta ,g}(K)\) is strictly decreasing for every g and for every \(K<0\), from which it follows that \(\beta \mapsto g_c(\beta )=\beta h_c(\beta )\) is non-decreasing.
We next focus on the limit of \(h_c(\beta )\) as \(\beta \rightarrow \infty \). By Lemma 3.3, we may focus on the existence of \({\bar{K}}\) satisfying (3.9). In the limit as \(\beta \rightarrow \infty \), \(\tanh (\beta [a_{\ell }{\bar{K}}+h])\rightarrow 2\varTheta _{-h/a_{\ell }}({\bar{K}})-1\), where \(\varTheta _{x}(\cdot )\) is the Heaviside function centred in x. Thus, for all \(\ell \in [k+1]\),
and, for all \(\ell \in [k]\),
where we set \(-\frac{h}{a_{0}}=-\infty \) and \(-\frac{h}{a_{k+1}}=\infty \). Thus, for \({\bar{K}} \in \left( -\frac{h}{a_{\ell -1}}, -\frac{h}{a_{\ell }}\right) \), (3.9) can be written as
Therefore, (3.9) has a solution if and only if there exists an \(\ell \in [k]\) such that
in which case a solution \({\bar{K}}\) of (3.9) exists in
Note that the quantity between brackets in (3.12) is always positive for \(\ell =1\). Thus, the minimum is always finite.
The proof is complete after we show why we may drop the case where \({\bar{K}} = -\frac{h}{a_{\ell }}\) for some \(\ell \in [k]\). In this case the condition for \({\bar{K}}\) to satisfy (3.9) is
which implies (3.20). Thus, if \({\bar{K}} = \frac{-h}{a_{\ell }}\) satisfies (3.9), then also some other K in (3.21) satisfies (3.9). Therefore, the condition in (3.20) is equivalent to having metastability. \(\square \)
Lemma 3.4
(Re-entrant crossover). The function \(\beta \mapsto h_c(\beta )\) is not necessarily non-decreasing.
Proof
In Appendix C we provide an example of \(\beta \mapsto h_c(\beta )\) that is not increasing. \(\square \)
3.2.3 Bounds on the inflection points and on the critical curve
Lemma 3.5
(Bounds on inflection points). All solutions of \(T_{\beta ,h}''(K)= 0\) are contained in the interval
In particular, they are all strictly negative.
Proof
If \(K>-\frac{h}{\max _{\ell \in [k]}a_{\ell }}\), then \(\tanh (\beta [a_{\ell }K+h])>0\) for all \(\ell \in [k]\), which implies \(T_{\beta ,h}''(K)<0\). If \(K<-\frac{h}{\min _{\ell \in [k]}a_{\ell }}\), then \(\tanh (\beta [a_{\ell }K+h])<0\) for all \(\ell \in [k]\), which implies \(T_{\beta ,h}''(K)>0\). \(\square \)
Lemma 3.6
(Upper bound on \(h_c)\). \(\sup _{\beta \in (\beta _c,\infty )} h_c(\beta ) < \left( \max _{\ell \in [k]} a_{\ell }\right) \sum _{\ell \in [k]} a_{\ell }\,\omega _{\ell }\).
Proof
Use Lemma 3.3 to characterise the metastable regime and Remark 3.1. We claim that if a solution \({\bar{K}}\) of (3.9) with \(T_{\beta ,h}'({\bar{K}})=1\) exists, then it must be negative and strictly less than an inflection point. Using this fact, together with Lemma 3.5 and the inequality in (3.9), we obtain a necessary upper bound on h:
Using that \(\tanh (\beta [a_{\ell }{\bar{K}}+h])>-1\), we conclude the proof.
We are left to prove the claim. By Lemma 3.5, all inflection points are negative, and \(T_{\beta ,h}''(K)<0\) for \(K\ge 0\). Assume, by contradiction, that \(T_{\beta ,h}''(K)<0\) for all \(K \in ({\bar{K}}, \infty )\). Then \(T_{\beta ,h}'\) is strictly decreasing. Therefore, \(T_{\beta ,h}'(K)<1\) for all \(K \in ({\bar{K}}, \infty )\), which implies \(T_{\beta ,h}(K)-T_{\beta ,h}(0)<K\). Since \(T_{\beta ,h}(0)>0 \), there exists a \({\tilde{K}} \in ({\bar{K}},0)\) such that \(T_{\beta ,h}({\tilde{K}} )>0>{\tilde{K}}\). Thus, \(T_{\beta ,h}({\tilde{K}})-T_{\beta ,h}(0)>{\tilde{K}}\), which contradicts what we have proved for all \(K \in ({\bar{K}}, \infty )\). \(\square \)
3.3 Quasi 1-dimensional landscape
Given \(K \in \mathbb {R}\), by standard saddle point approximation, the leading order of
turns out to be the function \(G_n :\mathbb {R}\rightarrow \mathbb {R}\) defined by
Recalling definitions (2.25) and (3.5), using Lagrange multipliers and integrating the condition \(K_n(m)=K\), we obtain
Lemma 3.7
(Alternative characterisation for the critical points).
-
1.
If \(m^*\) is a (not maximal) critical point for \(F_{n}\), then \(K_n(m^*)\) is a critical point for \(G_n\).
-
2.
If K is a critical point for \(G_{n}\), then \(m^*=(m^*_{\ell })_{\ell \in [k]}\) with \(m^*_{\ell } = \tanh \left( \beta \left[ a_{\ell }K+h\right] \right) \) (recall (3.3)) is a critical point for \(F_n\).
-
3.
\(F_n(m^*)=G_n(K_n(m^*))\) for any (not maximal) critical point \(m^*\).
Proof
Similar to [3, Lemma 7.4]. \(\square \)
We have already seen that \(K_n(m)\) fully determines any critical value m of \(F_n\), and is useful to order them. Lemma 3.7 exhibits the one-dimensional structure underlying the metastable landscape and provides a tool to describe the nature of the critical points of \(F_n\).
Remark 3.2
The above results extend to the limit \(n \rightarrow \infty \): replace \(F_n\) by \(F_{\beta ,h}\) and \(G_n\) by \(G_{\beta ,\ell }\), obtained after replacing \(\omega _{\ell ,n}\) by \(\omega _{\ell }\) in (3.27), and \(K_n(\cdot )\) by \(K(\cdot )\).\(\spadesuit \)
4 Approximation of the Dirichlet form Near the Saddle Point
In this section we approximate the Dirichlet form associated with the coarse-grained dynamics near the saddle point. This is a key step to obtain capacity estimates in the following section. Further details and examples on the techniques we use here can be found in [6, Chapters 9, 10 and 14].
Section 4.1 introduces some key quantities that are needed to express the mesoscopic measure. Section 4.2 introduces an approximate mesoscopic measure that leads to an approximate dynamics. Section 4.3 approximates the harmonic functions associated with this dynamics. Section 4.4 computes an approximate Dirichlet form. Section 4.5 uses the latter to approximate the full Dirichlet form.
4.1 Key quantities
Let \({\mathbf {m}}_n = ({\mathbf {m}}_{\ell ,n})_{\ell \in [k]}\) and \({\mathbf {t}}_n = ({\mathbf {t}}_{\ell ,n})_{\ell \in [k]}\) in \(\varGamma _n\) be a local minimum of \(F_n\) and the correspondent saddle point, respectively, as defined in Sect. 1.3.4. Note that both \({\mathbf {m}}_n\) and \({\mathbf {t}}_n\) satisfy (3.3). Consider the neighbourhood of \({\mathbf {t}}_n\) defined by
where d is the Euclidean distance and \(C' \in (0,\infty )\) is a constant. Abbreviate the Hessian of \(F_n\)
and put
By (2.28),
Note that \({\mathbb {A}}_n(m)\) is a diagonal matrix minus a rank one matrix. Compute
4.2 Approximate dynamics and Dirichlet form
For any two vectors \({\mathbf {v}},{\mathbf {w}} \in \mathbb {R}^k\), let \(\left\langle {{\mathbf {v}}}, {{\mathbf {w}}}\right\rangle \) denote their scalar product. For any \(k\times k\) matrix \({\mathbf {M}}\) and any \({\mathbf {v}} \in \mathbb {R}^k\), let \({\mathbf {M}} \cdot {\mathbf {v}}\) denote their matrix product, as \({\mathbf {v}}\) was in \(\mathbb {R}^k\times 1\).
For \(m \in \mathscr {D}_n\), define
and
where \({\bar{r}}_n\) is defined in (2.16). The transition rates \({\tilde{r}}_n\) define a random dynamics on \(\mathscr {D}_n\) that is reversible with respect to the mesoscopic measure \({\tilde{Q}}_n\). The corresponding Dirichlet form is
where u is a test function on \(\mathscr {D}_n\). Put
Using (2.7) and (2.16), we get
4.2.1 Approximation estimates
Next we estimate how close the pairs \(({\bar{r}}_n,{\tilde{r}}_n)\) and \((Q_n,{\tilde{Q}}_n)\) are. By Taylor expansion around \({\mathbf {t}}_n\), we have
In particular,
where the second equality uses (4.4). Moreover, for \(m \in \mathscr {D}_n\) (\(\mathrm {e}_{\ell }\) is the unitary vector in \(\mathbb {R}^k\) whose \(\ell \)-th component is non-zero),
where the third equality uses (4.4). For \(m \in \mathscr {D}_n\), we have \( d(m,{\mathbf {t}}_n)^3 = O(n^{-3/2}\log ^{3/2}n)\). Therefore, combining (2.17), (4.6) and (4.11), we have
for some \(C'' \in (0,\infty )\) constant. Using (2.16) and (2.32), we can write
where \(\varDelta _{\ell ,n}^\pm \) is defined in (2.30).
Using (4.7), (4.12), (4.13) and (4.15), we find that, for all \(m \in \mathscr {D}_n\),
where \(C''' \in (0,\infty )\) is a constant and we abbreviate
Equations (4.14) and (4.16) are relevant for the following approximation.
4.3 Approximate harmonic function
Let \({\mathbb {B}}_n\) be the \(k \times k\) matrix defined by
where \({\mathbb {A}}_n\) is defined in (4.3). Note that
Let \(\gamma _n^{(\ell )}\), \(\ell \in [k]\), be the eigenvalues of \({\mathbb {B}}_n\), ordered in increasing order. Let \(\gamma _n=\gamma _n^{(1)}\) denote the unique negative eigenvalue of \({\mathbb {B}}_n\), and \({\hat{v}}\) the corresponding unitary eigenvector. Define \(v=(v_{\ell })_{\ell \in [k]}\) by \(v_{\ell }={\hat{v}}_{\ell }\frac{\omega _{\ell ,n}\sqrt{n}}{\sqrt{r_{\ell }}}\).
Remark 4.1
As in [6, Remark 10.4], it follows by Hypothesis 1 that \({\mathbb {A}}_n\) has all strictly positive eigenvalues but one strictly negative. It can be seen that the same property holds for the eigenvalues of \({\mathbb {B}}_n\). \(\spadesuit \)
Lemma 4.1
(Eigenvalue). The eigenvalue \(\gamma _n\) is the unique solution of the equation
Proof
We follow the line of proof of [6, Lemma 14.9], using the last point in Hypothesis 1. In our case, [6, Eq. (14.7.12)] reads
\(\square \)
Remark 4.2
As in [6, Lemma 14.9], since the left-hand side of (4.20) is increasing in \(\gamma _n\) for \(\gamma _n\ge 0\), a negative solution of (4.20) exists if and only if
Using (4.5), (4.22) holds if and only if \(\det {\mathbb {A}}_n<0\). By Remark 4.1 the latter holds true. \(\spadesuit \)
Define \(f:\,\mathbb {R}\rightarrow [0,1]\) as
and \(g:\,\mathbb {R}^k \rightarrow [0,1]\) as
Recall the definition of \({\mathscr {M}}_n({\mathbf {m}}_n)\) given in (1.17).
Let \(W_0\) be a strip in \(\varGamma _n\) of width \(Cn^{-1/2} \log ^{1/2} n\) such that \({\mathbf {t}}_n \!\in \! W_0\), \({\mathscr {M}}_n({\mathbf {m}}_n) \cap W_0\) is empty and \(W_0^c\) consists in two non-neighbouring parts: \(W_1\) containing \({\mathbf {m}}_n\) and \(W_2\) containing \({\mathscr {M}}_n({\mathbf {m}}_n)\). Moreover, we require that, for some fixed constant \(c>1\), \(W_0 \cap \mathscr {D}_n^c \subseteq \{m \in \varGamma _n :F_n(m)> F_n({\mathbf {t}}_n) + c n^{-1} \log n \}\). Define
By choosing \(W_0\) and \(\mathscr {D}_n\) suitably we have, for \(m \sim m'\) (i.e., \({\bar{r}}_n(m,m')>0\)) and \(c \in (0,\infty )\) large enough (coming from the definition of \(W_0\)),
4.4 Computation of the approximate Dirichlet form
In this section we follow [6, Sections 10.2.2–10.2.3] to approximate \({\tilde{\mathscr {E}}}_{\mathscr {D}_n}(g,g)\) defined in (4.8). As in [6, Eq. (10.2.24)], for \(m \in {\mathscr {D}}_n\) and \(\ell \in [k]\) such that \(m^{\ell ,+} \in D_n\), compute
Recalling (4.8)–(4.9), we have
where we use [6, Eq. (10.2.33)] with \(\varepsilon =\frac{1}{\beta n}\) and \(d=k\). Here \(\tfrac{1}{2} \left| {A_{\ell ,n}}\right| \) is the inverse of the step in the \(\ell \)-direction, while in [6, Eq. (10.2.33)] the step is \(\varepsilon \).
Remark 4.3
Note that
because \({\tilde{g}}(m)=g(m)\,[1+o(1)]\) for all \(m \in W_0^c \cap \mathscr {D}_n\). The latter can be proved by approximating the Gaussian integral by 0 or 1 when \(\left\langle {v}, { m-{\mathbf {t}}_n}\right\rangle \) is proportional to \(- n^{-1/2} \log ^{1/2} n\) or \(n^{-1/2} \log ^{1/2} n\), respectively.\(\spadesuit \)
4.5 Final Dirichlet form approximation
We are now ready to compare \({\mathscr {E}}_{S_n} \) with \({\tilde{\mathscr {E}}}_{\mathscr {D}_n}\). Let \(h:S_n \rightarrow [0,1]\) be such that \(h(\sigma )={{\tilde{g}}}(m_n(\sigma ))\), \(\sigma \in S_n\). We split the sum in (2.18) into four subsets of \(\varGamma _n\times \varGamma _n\): \(m \in W_0 \cap \mathscr {D}_n^c\), \(m' \in \varGamma _n\); \(m \in W_0 \cap \mathscr {D}_n\), \(m'\in W_1\); \(m \in W_0 \cap \mathscr {D}_n\), \(m'\in W_2\); \(m \in W_0 \cap \mathscr {D}_n\), \(m'\in W_0 \cap \mathscr {D}_n\). Then, using (4.25)–(4.27), we obtain
Using (4.14) and (4.16), we obtain
where the third equality follows from (4.25)–(4.27) together with (4.14), and the last equality follows from (4.29)–(4.30).
5 Capacity and Valley Estimates
Section 5.1 provides sharp asymptotic upper bounds and lower bounds on the capacity of the metastable pair between which the crossover is being considered. These estimates use the results of the Sect. 4 together with the Dirichlet principle and the Berman–Konsowa principle, which are variational representations of capacity. Section 5.2 provides a sharp asymptotic estimate for the mesoscopic measure of the valleys of the minima of \(F_n\), which leads to a sharp asymptotic estimate for \(F_n\) inside this valley.
5.1 Capacity estimates
Given a Markov process \((x_t)_{t\ge 0}\) with state space S, a key quantity in the potential-theoretic approach to metastability is the capacity \(\hbox {cap}(A,B)\) of two disjoint subsets A, B of S. This is defined by (see [6, Eq. (7.1.39)])
where \(\mu \) is the invariant measure and \({\mathbb {P}}_{x}\) is the probability distribution of the Markov process starting in x.
Recall that \({\mathscr {M}}_n\) is the set of local minima of \(F_n\).
Proposition 5.1
(Asymptotics of the capacity). Let \({\mathbf {m}}_n=({\mathbf {m}}_{\ell ,n})_{\ell \in [k]} \in {\mathscr {M}}_n\) and \(M_n \subset {\mathscr {M}}_n \backslash {\mathbf {m}}_n\), such that the gate \({\mathscr {G}}({\mathbf {m}}_n,M_n)\) consists of a unique point \({\mathbf {t}}_n=({\mathbf {t}}_{\ell ,n})_{\ell \in [k]}\). Suppose that \(\beta \in (\beta _c,\infty )\) and \(h \in [0,h_c(\beta ))\). Then, as \(n\rightarrow \infty \),
Remark 5.1
Proposition 5.1 holds for any subset \(M_n \subseteq {\mathscr {M}}_n \backslash {\mathbf {m}}_n\), separated from \({\mathbf {m}}_n\) by \({\mathbf {t}}_n\), independently on the values of \(F_n\) on \(M_n\). \(\spadesuit \)
5.1.1 Upper bound: Dirichlet principle
An important characterisation of the capacity between two disjoint sets is given by the Dirichlet principle. For our quantity of interest this states that
where \(\tilde{{\mathscr {H}}}\) is the set of functions from \(S_n\) to [0, 1] that are equal to 1 on \(S_n[{\mathbf {m}}_n]\) and 0 on \(S_n[M_n]\).
Given that, by assumption, \({\mathscr {G}}({\mathbf {m}}_n,M_n)=\{{\mathbf {t}}_n\}\), we use the Dirichlet principle in (5.3) to obtain an upper bound on the capacity. We take as test function \(h \in \tilde{{\mathscr {H}}}\) defined in Sect. 4.5 and, using (4.32), we obtain
5.1.2 Lower bound: Berman–Konsowa principle
We first note that the process \((\sigma _t)_{t \ge 0}\) is lumpable. Indeed, the process \((m_n(\sigma _t))_{t \ge 0}\) is Markovian because the Hamiltonian \(H_n(\sigma )\) depends on \(m_n(\sigma )\) only (see (2.6)). Therefore, for \({\mathbf {A}}=S_n[{A}]\) and \({\mathbf {B}}=S_n[{B}]\) with A and B disjoint subsets of \(\varGamma _n\),
where \(\hbox {cap}_{\varGamma }\) denotes the capacity for the process \((m_n(\sigma _t))_{t \ge 0}\), i.e., the projection of the process \((\sigma _t)_{t\ge 0}\) on the magnetisation space \(\varGamma _n\). We write \({\mathbb {P}}^{\varGamma ^{}}\) and \({\mathbb {E}}^{\varGamma }\) to denote the law of \((m_n(\sigma _t))_{t \ge 0}\) induced by the law \({\mathbb {P}}\) of \((\sigma _t)_{t\ge 0}\), and its expectation, respectively. By the lumpability, we can focus on the dynamics on \(\varGamma _n\).
Following the line of argument in [6, Section 10.3] (with \(\varepsilon =\frac{2}{n}\) and \(d=k\)), we obtain the lower bound
where we use (4.29) and (4.30).
We sketch the proof. The main idea is to use the Berman–Konsowa principle for a suitable defective flow. More precisely, given disjoint subsets A, B of the state space, for any defective loop-free unit flow \(f_{A,B}\) from A to B with defect function \(\delta \) (as defined in [6, Definition 9.2]), we can estimate (see [6, Lemma 9.4], and notation therein)
where \([\cdot ]_+\) denotes the positive part and \(\gamma \) is a self-avoiding path from A to B. It turns out that, with a suitable choice of the flow f, the product in the right-hand side of (5.7) is bounded from below by \(1+O(\,n^{-1/2} \log ^{1/2} n) \), and the sum over \(\gamma \) from below by \({\tilde{\mathscr {E}}}_{\mathscr {D}_n}({\tilde{g}},{\tilde{g}}) [1+o_n(1)]\). This proves (5.6).
We give a sketch of the test flow definition in our setting. Here \(A=\{{\mathbf {m}}_n\}\) and \(B=M_n\). Let \(v^*\) be the eigenvector corresponding to the unique negative eigenvalue of the Hessian of \(F_n\) at the saddle point \({\mathbf {t}}_n\) (unique gate point in \({\mathscr {G}}(\{{\mathbf {m}}_n\},M_n)\)). Let \(G_n\) be the cylinder in \(\mathbb {R}^k\) intersected with \(\varGamma _n\), centred at \({\mathbf {t}}_n\), with axis \(v^*\), radius \(\rho =C\,n^{-1/2} \log ^{1/2} n\) and length \(\rho '=C'\,n^{-1/2} \log ^{1/2} n\). We will denote by \(\partial _B G_n\) the base facing B and by \(\partial _A G_n\) the central part of radius \(C''\,n^{-1/2} \log ^{1/2} n\) of the base facing A, with \(C''<C\). Choose the constants so that \(G_n\) is contained in \(\mathscr {D}_n\) defined in (4.1).
We define a defective flow \(f_{A,B}\) from A to B consisting of three parts: \(f_A\), a unitary flow from A to \(\partial _A G_n\); f, a defective loop-free unit flow from \(\partial _A G_n\) to \(\partial _B G_n\) inside \(G_n\); \(f_B\), a unitary flow from \(\partial _B G_n\) to B. This choice implies that the sum over \(\gamma \) in (5.7) is relevant only on the paths entering \(G_n\) in \(\partial _A G_n\), exiting \(G_n\) in \(\partial _B G_n\), and afterwards reaching B without going back to \(G_n\). For this purpose we choose \(f_A\) and \(f_B\) such that \(f_A((x,y))\) and \(f_B((x,y))\) are proportional to \(Q_n(x)\). For \(m \in G_n\) such that \(m^{\ell ,+} \in G_n\), define
where g is defined in (4.24), \({\tilde{Q}}_n\) in (4.6), \(r_{\ell }\) in (4.9) and
The contribution to the sum in brackets in (5.7) turns out to be negligible outside \(G_n\). Therefore, no further conditions on the flows \(f_A\) and \(f_B\) are necessary, provided the total flow out of A is 1 and the total flow \(f_{A,B}\) is defective and loop-free.
5.2 Measure of the valley
In order to prove Theorem 1.1, we need the following estimate on the measure of the valley of the minima of \(F_n\). For \({\mathbf {m}}_n \in {\mathscr {M}}_n\), let \(A({\mathbf {m}}_n)\subset \varGamma _n\) be the valley of \({\mathbf {m}}_n\) as defined in [6, Eq. (8.2.10)].
Lemma 5.1
(Gibbs weight of the valley). Given \({\mathbf {m}}_n \in {\mathscr {M}}_n\),
where \(Q_n\) is the mesoscopic measure defined in (2.17), and \({\mathbb {A}}_n({\mathbf {m}}_n)\) is the \(k \times k\) Hessian matrix defined in (4.2).
Proof
The proof follows that of [6, Lemma 10.12 and (10.2.33)]. The relevant contribution to \(Q_n(A({\mathbf {m}}_n))\) is given by the measure of a ball \(B_{\rho }\) of radius \(\rho =C\,n^{-1/2} \log ^{1/2} n\) centred in \({\mathbf {m}}_n\), with C constant, contained in \(A({\mathbf {m}}_n)\). Indeed, if \(y \in A({\mathbf {m}}_n)\) and \(d({\mathbf {m}}_n,y)> \rho \), then by Taylor expansion of \(F_n\) around \({\mathbf {m}}_n\) we have
where c is a constant. The condition \(y \in A({\mathbf {m}}_n)\) is needed to ensure that \(F_n(y)>F_n({\mathbf {m}}_n)\), implying that c is positive. Therefore, we obtain the rough estimate
where we use that \(|\varGamma _n| \le n^k\). The bound in (5.12) is sufficient to show that \(Q_n(A({\mathbf {m}}_n)\backslash B_{\rho }) \) is negligible in \(Q_n(A({\mathbf {m}}_n))\).
Compute
where we use the Taylor expansion
and the approximation of the sum by an integral is correct up to an error \(1+O(\rho )\). In the last lines we approximated the Gaussian integral on intervals \([-\rho , \rho ]\) by the Gaussian integral on \(\mathbb {R}\), with an error \(1+O(n^{-c})\). We conclude by looking at (5.12) and (5.13), and noting that for C large enough \(Q_n(A({\mathbf {m}}_n)\backslash B_{\rho })\) is negligible compared to \( Q_n(A({\mathbf {m}}_n) \cap B_{\rho })\). \(\square \)
6 Proof of the Theorems
In this section we prove Theorems 1.1–1.3. Section 6.1 uses the asymptotics for the capacity of the metastable pair from Sect. 5.1 and the asymptotics for the mesoscopic measure from Sect. 5.2 to prove Theorem 1.1. Section 6.2 proves Theorem 1.2. Section 6.3 proves Theorem 1.3.
6.1 Average crossover time
Let us return to the notation of Theorem 1.1, where \({\mathbf {m}}_n \in {\mathscr {M}}_n\) and \({\mathscr {M}}_n({\mathbf {m}}_n)=\{m \in {\mathscr {M}}_n \backslash {\mathbf {m}}_n :\, F_n(m) \le F_n({\mathbf {m}}_n)\}\). To prove Theorem 1.1 we use the relation
Recall notation introduced in Sect. 5.1.2. Because \(F_n(m) \le F_n({\mathbf {m}}_n)\) for all \(m \in {\mathscr {M}}_n({\mathbf {m}}_n)\), (6.1) follows from [6, Theorem 8.15] after proving that \({\mathscr {M}}_n\) is a set of metastable points in the sense of [6, Definition 8.2]. The latter follows along the lines of the proof of [6, Theorem 10.6], where similar values of capacities and invariant measures occur.
Using (6.1) in combination with Proposition 5.1 and Lemma 5.1, we obtain that, for all \(\sigma \in S_n[{\mathbf {m}}_n]\),
where we use that the dynamics depends on the starting configuration \(\sigma \in S_n[{\mathbf {m}}_n]\) only, through its level magnetisations \(m_n(\sigma )={\mathbf {m}}_n\) (see (2.6)), and also use the lumpability.
6.2 Exponential law
In this section we prove Theorem 1.2. Since the dynamics depends on the starting configuration \(\sigma \in S_n[{\mathbf {m}}_n]\) through its level magnetisation \(m_n(\sigma )={\mathbf {m}}_n\) only (see (2.6)), we have
where \({\bar{\tau }}\) is the hitting time of the process projected on \(\varGamma _n\). Given the non-degeneracy hypothesis (Hypothesis 1 in Sect. 1.3.4) and the one-dimensional landscape analysis (in Sect. 3.3), we can apply [6, Theorem 8.45] to the right-hand side of (6.3) and conclude the proof.
6.3 Randomness of the exponent
In this section we prove Theorem 1.3. In particular, we compute \(F_n({\mathbf {t}}_n)-F_n({\mathbf {m}}_n) - [F_{\beta ,h}({\mathbf {t}})-F_{\beta ,h}({\mathbf {m}})]\) to leading order.
Recalling definitions (2.26) and (3.5), we have
Let \({\mathbf {m}}=({\mathbf {m}}_{\ell })_{\ell \in [k]}, {\mathbf {t}}=({\mathbf {t}}_{\ell })_{\ell \in [k]} \in [-1,1]^k\) be the critical points of \(F_{\beta ,h}\) closest to \({\mathbf {m}}_n, {\mathbf {t}}_n\) (i.e., the critical points of \(F_n\) defined above), respectively. Note that \({\mathbf {m}} \) and \({\mathbf {t}}\) satisfy (3.4), while \({\mathbf {m}}_n \) and \({\mathbf {t}}_n\) satisfy (3.3). Using (2.21), we get
and
By (3.2), we have
Thus,
Moreover,
and
Similar equalities hold after we replace \({\mathbf {t}}\) by \({\mathbf {m}}\) and \({\mathbf {t}}_n\) by \({\mathbf {m}}_n\). Using the previous computations, we obtain
Using (6.8), we find
Since
we focus on estimating \({\mathbf {t}}_{\ell ,n} -{\mathbf {t}}_{\ell }\).
From Taylor expansion, we get
Since
we have
Suppose that \({\mathbf {t}}_{\ell ,n} -{\mathbf {t}}_{\ell } \sim \frac{Y_{\ell }^{{\mathbf {t}}}}{\sqrt{n}}\). By the Central Limit Theorem, \(\omega _{\ell ,n} -\omega _{\ell } \sim \frac{Z_{\ell }}{\sqrt{n}}\), where \(Z_{\ell }\) is the normal random variable \(N(0,\omega _{\ell }(1-\omega _{\ell }))\). Hence
and so
where the denominator does not vanish because of Remark 4.2. Thus, up to a factor \(O(n^{-\frac{1}{2}})\), \(Y_{\ell }^{{\mathbf {t}}}\) is a normal random variable with mean 0 and variance
Similar results hold after we replace \({\mathbf {t}}\) by \({\mathbf {m}}\).
Going back to (6.12), using (6.13) and (6.18), and inserting \({\mathbf {t}}_{\ell ,n} -{\mathbf {t}}_{\ell } \sim \frac{Y_{\ell }^{{\mathbf {t}}}}{\sqrt{n}}\) and \({\mathbf {m}}_{\ell ,n} -{\mathbf {m}}_{\ell } \sim \frac{Y_{\ell }^{{\mathbf {m}}}}{\sqrt{n}}\) and \(\omega _{\ell ,n} -\omega _{\ell } \sim \frac{Z_{\ell }}{\sqrt{n}}\), we obtain
Thus,
Since the random variables \(Y_{\ell }^{{\mathbf {t}}}\), \(Y_{\ell }^{{\mathbf {m}}}\), \(Z_{\ell }\) are centred normal, this concludes the proof of Theorem 1.3.
From (6.21) it is possible to compute explicitly the variance of Z defined in Theorem 1.3, because the variances of all the random variables involved are known (at least to leading order).
References
Bianchi, A., Bovier, A., Ioffe, D.: Sharp asymptotics for metastability in the random field Curie–Weiss model. Electron. J. Probab. 14, 1541–1603 (2009). https://doi.org/10.1214/EJP.v14-673
Bianchi, A., Bovier, A., Ioffe, D.: Pointwise estimates and exponential laws in metastable systems via coupling methods. Ann. Probab. 40(1), 339–371 (2012). https://doi.org/10.1214/10-AOP622
Bovier, A., Eckhoff, M., Gayrard, V., Klein, M.: Metastability in stochastic dynamics of disordered mean-field models. Probab. Theory Relat. Fields 119(1), 99–161 (2001). https://doi.org/10.1007/PL00012740
Bovier, A., Eckhoff, M., Gayrard, V., Klein, M.: Metastability and low lying spectra in reversible Markov chains. Commun. Math. Phys. 228(2), 219–255 (2002). https://doi.org/10.1007/s002200200609
Bovier, A., Eckhoff, M., Gayrard, V., Klein, M.: Metastability in reversible diffusion processes. I. Sharp asymptotics for capacities and exit times. J. Eur. Math. Soc. 6(4), 399–424 (2004). https://doi.org/10.4171/JEMS/14
Bovier, A., den Hollander, F.: Metastability—A Potential-Theoretic Approach, Grundlehren der mathematischen Wissenschaften, vol. 351. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24777-9
Bovier, A., Marello, S., Pulvirenti, E.: Metastability for the dilute Curie–Weiss model with Glauber dynamics. Electron. J. Probab. 26, 1–38 (2021). https://doi.org/10.1214/21-ejp610
Dommers, S.: Metastability of the Ising model on random regular graphs at zero temperature. Probab. Theory Relat. Fields 167, 305–324 (2017). https://doi.org/10.1007/s00440-015-0682-0
Dommers, S., Giardinà, C., Giberti, C., van der Hofstad, R., Prioriello, M.L.: Ising critical behavior of inhomogeneous Curie–Weiss models and annealed random graphs. Commun. Math. Phys. 348(1), 221–263 (2016). https://doi.org/10.1007/s00220-016-2752-2
Dommers, S., den Hollander, F., Jovanovski, O., Nardi, F.R.: Metastability for Glauber dynamics on random graphs. Ann. Appl. Probab. 27, 2130–2158 (2017). https://doi.org/10.1214/16-AAP1251
den Hollander, F., Jovanovski, O.: Glauber dynamics on the Erdős-Rényi random graph. In: In and Out of Equilibrium. 3. Celebrating Vladas Sidoravicius. Progress in Probability, vol. 77, pp. 519–589. Birkhäuser/Springer, Cham (2021). https://doi.org/10.1007/978-3-030-60754-8_24
Mathieu, P., Picco, P.: Metastability and convergence to equilibrium for the random field Curie–Weiss model. J. Stat. Phys. 91(3–4), 679–732 (1998). https://doi.org/10.1023/A:1023085829152
Olivieri, E., Vares, M.E.: Large Deviations and Metastability. Encyclopedia of Mathematics and its Applications, vol. 100. Cambridge University Press, Cambridge (2005). https://doi.org/10.1017/CBO9780511543272
Tindemans, P.A.J., Capel, H.W.: An exact calculation of the free energy in systems with separable interactions. Physica 72(3), 433–464 (1974). https://doi.org/10.1016/0031-8914(74)90209-2
Funding
Open Access funding enabled and organized by Projekt DEAL. Open Access funding enabled and organized by Projekt DEAL. AB and SM were supported through DFG Excellence Cluster GZ 2047/1, Projekt-id 390685813, and DFG SFB 1060, Projektnummer 211504053. FdH was supported through NWO Gravitation Grant 024.002.003-NETWORKS, and by the Alexander von Humboldt Foundation during sabbatical leaves in Bonn and Erlangen in the Fall of 2019 and 2020.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Communicated by H. Spohn.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Metastability on the Complete Graph Without Disorder
We give a brief overview of well-known results for the standard Curie–Weiss model. We refer to [6, Chapter 13] for more details.
The Glauber dynamics is defined as in Sect. 1.2, but with \(J \equiv 1\). For convenience we write the Curie–Weiss Hamiltonian as
which is as (2.5) when \(J\equiv 1\). What makes this case easier than the one with disorder is that the interaction is mean-field. Indeed, we may write
with
the magnetisation. In this case the magnetisation process \((m_n(t))_{t \ge 0}\), defined by
is Markovian. More specifically, it is a nearest-neighbour random walk on the grid
In the limit as \(n\rightarrow \infty \), (A.4) converges to a Brownian motion on \([-1,+1]\) in the potential \(F_{\beta ,h}\) given by
with
the relative entropy of the Bernoulli measure on \(\{-1,+1\}\) with parameter m with respect to the counting measure on \(\{-1,+1\}\). \(F_{\beta ,h}(m)\) is the free energy at magnetisation m, consisting of an energy term \(- \frac{1}{2} m^2 - h m\) and an entropy term \(\frac{1}{\beta }I(m)\). See [6, Chapter 13] for more details.
Since
the stationary points of \(F_{\beta ,h}\) are the solutions to the equation
Since
\(T_{\beta ,h}\) is strictly increasing and has a unique inflection point at \(m=-h\). Consequently, (A.9) has either one or three solutions. The latter occurs if and only if
where \({\bar{\beta }}_c = 1\) is the critical inverse temperature and \({\bar{h}}_c(\beta )\) is the critical magnetic field, i.e., the unique value of h for which \(T_{\beta ,h}\) touches the diagonal at a unique value of the magnetisation, say \(-m(\beta )\). Clearly, \(1=\beta (1-m^2(\beta ))\), i.e.
and so \({\bar{h}}_c(\beta )\) solves the equation \(T_{\beta ,{\bar{h}}_c(\beta )}(-m(\beta ))=-m(\beta )\). Hence (see Fig. 1)
The range of parameters in (A.11) represents the metastable regime in which \(F_{\beta ,h}\) has a double-well shape and, in the limit as \(n\rightarrow \infty \), the Gibbs measure \(\mu _n\) in (1.4) has two phases given by the two minima of \(F_{\beta ,h}\): the metastable phase with magnetisation \({\mathbf {m}}<0\) and the stable phase with magnetisation \({\mathbf {s}}>0\). The unique saddle point in the gate \({\mathscr {G}}({\mathbf {m}},{\mathbf {s}})\) has magnetisation \({\mathbf {t}}<0\) (see Fig. 2).
Theorems A.1–A.2 can be found in Bovier and den Hollander [6, Chapter 13]. Here the notation is the same as the one in Sect. 1. Let \(S_n[{\mathbf {m}}]\), \(S_n[{\mathbf {s}}]\) denote the sets of configurations in \(S_n\) for which the magnetisation is closest to \({\mathbf {m}}\), \({\mathbf {s}}\), respectively.
Theorem A.1
(Average crossover time). Subject to (A.11), uniformly in \(\sigma \in S_n[{\mathbf {m}}]\),
Theorem A.2
(Exponential law). Subject to (A.11), uniformly in \(\sigma \in S_n[{\mathbf {m}}]\),
Figure 2 illustrates the setting: the average crossover time from \(S_n[{\mathbf {m}}]\) to \(S_n[{\mathbf {s}}]\) depends on the energy barrier \(F_{\beta ,h}({\mathbf {t}})-F_{\beta ,h}({\mathbf {m}})\) and on the curvature of \(F_{\beta ,h}\) at \({\mathbf {m}}\) and \({\mathbf {t}}\). The crossover time is exponential on the scale of its average.
Examples with Multiple Metastable States
We provide examples of distributions and parameter choices (in the metastable regime) for which the model with disorder has multiple critical points. More specifically, we provide numerical evidence that, for \(k \in \{2,3,4\}\), (3.6) can have any number of solutions in the set \(\{3,5\ldots ,2k+1\}\). The cases with strictly more than 3 solutions present multiple minimal critical points, i.e. multiple metastable states.
1.1 Case \(k=2\)
-
Figure 3a: 3 critical points, parameters \(a_1 = 77\), \(a_2 = 45\), \(\omega _1 = 0.688\), \(h = 1740\), \(\beta = 113 \,\beta _c\).
-
Figure 3b: 5 critical points, parameters \(a_1 = 774\), \(a_2 = 36.84\), \(\omega _1 = 0.59\), \(h = 1740\), \(\beta = 131 \,\beta _c\).
1.2 Case \(k=3\)
-
Figure 4a: 3 critical points, parameters \(a_1 = 77\), \(a_2 = 45\), \(a_3 = 33.5\), \(\omega _1 = 0.688\), \(\omega _2 = 0.15\), \(h = 1740\), \(\beta = 113\,\beta _c\).
-
Figure 4b: 5 critical points, parameters \(a_1 = 77\), \(a_2 = 45\), \(a_3 = 27\), \(\omega _1 = 0.59\), \(\omega _2 = 0.15\), \(h = 1740\), \(\beta = 113\,\beta _c\).
-
Figure 4c: 7 critical points, parameters \(a_1 = 77\), \(a_2 = 45\), \(a_3 = 33.5\), \(\omega _1 = 0.59\), \(\omega _2 = 0.15\), \(h = 1740\), \(\beta = 113\,\beta _c\).
1.3 Case \(k=4\)
-
Figure 5a: 3 critical points, parameters \(a_1 = 12\), \(a_2 = 16\), \(a_3 = 139.5\), \(a_4 = 24.5\), \(\omega _1 = 0.474\), \(\omega _2 = 0.22\), \(\omega _3 = 0.111\), \(h = 178\), \(\beta = 3.8\,\beta _c\).
-
Figure 5b: 5 critical points, parameters \(a_1 = 14\), \(a_2 = 27\), \(a_3 = 57\), \(a_4 = 24.5\), \(\omega _1 = 0.366\), \(\omega _2 = 0.1\), \(\omega _3 = 0.13\), \(h = 262\), \(\beta = 38.4\,\beta _c\).
-
Figure 5c: 7 critical points, parameters \(a_1 = 2.32\), \(a_2 = 4.92\), \(a_3 = 5\), \(a_4 = 11.32\), \(\omega _1 = 0.6\), \(\omega _2 = 0.096\), \(\omega _3 = 0.033\), \(h = 7.6\), \(\beta = 95.2 \,\beta _c\).
-
Figure 5d: 9 critical points, parameters \(a_1 = 12\), \(a_2 = 16\), \(a_3 = 50.5\), \(a_4 = 24.5\), \(\omega _1 =0 .474\), \(\omega _2 = 0.22\), \(\omega _3 = 0.111\), \(h = 178\), \(\beta = 63.2 \,\beta _c\).
Example of \(h_c(\beta )\) Not Increasing
We provide here an example of choice of the law of J for which the critical threshold \(\beta \mapsto h_c(\beta )\) is not monotone increasing. This implies the possibility of a re-entrant metastable crossover.
For \(k=4\), pick \(a_1 = 12\), \(a_2 = 16\), \(a_3 = 50.5\), \(a_4 = 24.5\) and \(\omega _1 = 0.474\), \(\omega _2 = 0.22\), \(\omega _3 = 0.111\). Take \(h = 100\), and plot the function \(K \mapsto T_{\beta ,h}(K)\) varying \(\beta \). For \(\beta _1 = 4 \,\beta _c = 0.00762336\) the system is metastable: \(T_{\beta ,h}\) intersects the diagonal three times (see Fig. 6a), which implies that \(h< h_c(\beta _1)\). For \(\beta _2 = 21 \,\beta _c =0.04002264 >\beta _1\) the system is not metastable: \(T_{\beta ,h}\) intersects the diagonal only once (see Fig. 6b), which implies that \(h> h_c(\beta _2)\). This shows that \(h_c(\beta )\) is not necessarily an increasing function of \(\beta \).
Limit of the Prefactor
Below Theorem 1.2 we stated that the prefactor in (1.19) converges. For completeness, in this Appendix we compute its limit, although, as we mentioned after Theorem 1.3, it is negligible because of the order of approximation of the exponent.
We focus first on \(\gamma _n\). Recall notation in (1.10), (1.11) and (2.1). Then (4.20) can be written as
In the first equality we use (3.3) for \({\mathbf {t}}_n\), i.e., the approximation of the stationary points of \(F_n\) by the stationary points of \({\bar{F}}_n\). This makes \({\mathbf {t}}_{\ell ,n}\) independent of \(\ell \), so that we can use the law of large numbers in the limit as \(n \rightarrow \infty \). Thus, we obtain that \(\gamma _n\) converges to \(\gamma \), the solution of the equation
where \({\mathfrak {E}}\) denotes expectation with respect to \(\mathscr {P}\) and \(U = -\beta [J(1)\,K({\mathbf {t}})+h]\), with \({\mathbf {t}}\) solving (3.4). Note that (D.2) is similar to [6, Eq. (14.4.14)].
We are left to find the limit of the determinants ratio. By (4.5),
Using (3.3) for \(m \in \{{\mathbf {t}}_n, {\mathbf {m}}_n\}\), we have
Using the law of large numbers as above and with the same notation, we find
where \(U({\mathbf {x}})=-\beta (J(1) K({\mathbf {x}})+ h)\).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bovier, A., den Hollander, F. & Marello, S. Metastability for Glauber Dynamics on the Complete Graph with Coupling Disorder. Commun. Math. Phys. 392, 307–345 (2022). https://doi.org/10.1007/s00220-022-04351-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00220-022-04351-8