Dedicated to the memory of Harry Kesten, who for over half a century generously showed the way in probability theory
Abstract
We consider a directed polymer of length N interacting with a linear interface. The monomers carry i.i.d. random charges \((\omega _i)_{i=1}^N\) taking values in \({\mathbb {R}}\) with mean zero and variance one. Each monomer i contributes an energy \((\beta \omega _i-h)\varphi (S_i)\) to the interaction Hamiltonian, where \(S_i \in {\mathbb {Z}}\) is the height of monomer i with respect to the interface, \(\varphi :\,{\mathbb {Z}}\rightarrow [0,\infty )\) is the interaction potential, \(\beta \in [0,\infty )\) is the inverse temperature, and \(h \in {\mathbb {R}}\) is the charge bias parameter. The configurations of the polymer are weighted according to the Gibbs measure associated with the interaction Hamiltonian, where the reference measure is given by a Markov chain on \({\mathbb {Z}}\). We study both the quenched and the annealed free energy per monomer in the limit as \(N\rightarrow \infty \). We show that each exhibits a phase transition along a critical curve in the \((\beta ,h)\)-plane, separating a localized phase (where the polymer stays close to the interface) from a delocalized phase (where the polymer wanders away from the interface). We derive variational formulas for the critical curves and we obtain upper and lower bounds on the quenched critical curve in terms of the annealed critical curve. In addition, for the special case where the reference measure is given by a Bessel random walk, we derive the scaling limit of the annealed free energy as \(\beta ,h \downarrow 0\) in three different regimes for the tail exponent of \(\varphi \).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
1.1 Motivation
Homogeneous pinning models, where a directed polymer receives a reward for every monomer that hits an interface, have been the object of intense study. Both discrete and continuous models have been analysed in detail, and a full understanding is available of the free energy, the phase diagram and the typical polymer configurations as a function of the underlying model parameters. Disordered pinning models, where the reward depends on random weights attached to the interface or where the shape of the interface is random itself, are much harder to analyse. Still, a lot of progress has been made in past years, in particular, the effect of the disorder on the scaling properties of the polymer has been elucidated to considerable depth. For an overview the reader is referred to the monographs by Giacomin [21, 22] and den Hollander [24], the review paper by Caravenna et al. [13], and references therein.
Spatially extended pinning, where the interaction of the monomers depends on their distance to the interface, remains largely unexplored. For a continuum model with an interaction potential that decays polynomially with the distance, pinning-like results have been obtained in Lacoin [25]. A continuum model for which the interaction potential is non-zero only in a finite window around the interface was analysed in Cranston et al. [18]. The goal of the present paper is to investigate what happens for more general interaction potentials, both for discrete and for continuous models with disorder.
The remainder of this section is organised as follows. In Sect. 1.2 we define our model, which consists of a directed polymer carrying random charges that interact with a linear interface at a strength that depends on their distance. In Sects. 1.3 and 1.4 we look at the quenched, respectively, the annealed free energy, and discuss the qualitative properties of the phase diagram. In Sect. 1.5 we recall certain scaling properties of the Bessel random walk and its relation to the Bessel process, both of which play an important role in our analysis. In Sect. 1.6 we state three theorems when the underlying reference measure (describing the polymer without interaction) is a Markov chain. In Sect. 1.7 we state three theorems for the limit of weak interaction when the reference measure is the Bessel random walk, and show that this limit is related to the continuum version of our model when the reference measure is the Bessel process. In Sect. 1.8 we place the theorems in their proper perspective. In Sect. 1.9 we list some open problems and explain how the proofs of the theorems are organised.
1.2 The model
Let \({\mathbb {N}}_0={\mathbb {N}}\cup \{0\}\). Our model has three ingredients:
-
(1)
An irreducible nearest-neighbour recurrent Markov chain \(S:=(S_n)_{n\in {\mathbb {N}}_0}\) on \({\mathbb {Z}}\) starting at \(S_0=0\), with law \({{\,\mathrm{\mathrm{P}}\,}}={{\,\mathrm{\mathrm{P}}\,}}_0\).
-
(2)
An i.i.d. sequences of random charges \(\omega :=(\omega _n)_{n\in {\mathbb {N}}}\) on \({\mathbb {R}}\), with law \({\mathbb {P}}\).
-
(3)
A non-negative function \(\varphi :{\mathbb {Z}}\rightarrow [0,\infty )\), playing the role of an interaction potential, such that
$$\begin{aligned} 0< \Vert \varphi \Vert _\infty < \infty , \quad \lim _{x\rightarrow \infty } \varphi (x) = 0, \quad \lim _{x\rightarrow -\infty } \varphi (x) \text { exists}. \end{aligned}$$(1.1)
Our model is defined through the quenched partition function
which describes a directed polymer chain \(n \mapsto (n,S_n)\) of length N carrying charges \(n \mapsto \omega _n\) that interact with a linear interface according to the interaction potential \(x \mapsto \varphi (x)\) at inverse temperature \(\beta \in [0,\infty )\). Without loss of generality we may replace \(\beta \omega _n\) by \(\beta \omega _n - h\), with \(h \in {\mathbb {R}}\) the charge bias parameter, and assume that \(\omega \) is standardized, i.e.,
after which (1.2) becomes
Throughout the sequel we assume that
Moreover, defining \(\tau _1 := \inf \{n\in {\mathbb {N}}:\,S_n = 0\}\) to be the first return time of S to 0, we assume that there exists an \(\alpha \in [0,\infty )\) such that
Note that \({{\,\mathrm{\mathrm{E}}\,}}(\tau _1)=\infty \) for all \(\alpha \in (0,1)\). If S has period 2, then the last asymptotics is assumed to run along \(2{\mathbb {N}}\).
Remark 1.1
(Bessel random walk). An example of a Markov chain S satisfying (1.6) that will receive special attention in this paper is the one with transition probabilities
where
for some \(\alpha \in (0,1)\) and \(\varepsilon >0\). This choice, which is referred to as the Bessel random walk, has a drift away from the origin (\(\alpha <\tfrac{1}{2}\)) or towards the origin (\(\alpha >\tfrac{1}{2}\)) that decays inversely proportional to the distance. The case \(d(x) \equiv 0\) (\(\alpha =\tfrac{1}{2}\)) corresponds to simple random walk. The Bessel random walk was studied by Lamperti [26] and, more recently, by Alexander [2] (who actually considered the one-sided version \((|S_n|)_{n\in {\mathbb {N}}_0}\)). It is known that (1.6) holds in a sharp form [2, Theorem 2.1], namely,
along \(2{\mathbb {N}}\) for some \(c \in (0,\infty )\). More refined asymptotics are available as well (see Sect. 1.5 below).
Remark 1.2
The model defined in (1.4) provides a natural interpolation between the pinning model and the copolymer model, which correspond to the choices
See Giacomin [21, 22] and den Hollander [24] for details. Actually, in the copolymer model the interaction is via the bonds rather than the sites of the path, i.e., \(\varphi ^\mathrm {cop}((x,y)) = \mathsf{1}_{\{x+y \le 0\}}\), but we will ignore such refinements. Moreover, the standard parametrisation of the disorder in the copolymer model is \(-2\beta (\omega _n + h)\) rather than \(\beta \omega _n-h\). Again, this is the same after a change of parameters. Our choice has the advantage that the free energy is jointly convex in \((\beta , h)\) and that the critical curve is non-negative (see Fig. 1).
1.3 The quenched free energy
The quenched free energy is defined by
For the constrained partition function
the existence of the limit
follows by standard super-additivity arguments. Since \(\varphi \) is bounded and \({\mathbb {P}}\) has finite exponential moments (recall (1.5)), the limit is finite. We will show in “Appendix A.2” that
so that (1.11) follows.
By (1.1), for every \(\varepsilon > 0\) there is an \(M\in {\mathbb {N}}_0\) such that \(0 \le \varphi (x) \le \varepsilon \) for \(x \ge M\). Therefore
We will show in “Appendix A.1” that, by (1.6),
and so it follows that \(\textsc {f}^\mathrm {que}(\beta ,h) \ge - \varepsilon (\beta {\mathbb {E}}[|\omega _1|]+|h|)\). Since \(\varepsilon > 0\) is arbitrary, we obtain the important inequality
It is therefore natural to define the two phases
which we refer to as the quenched localized phase, respectively, the quenched delocalized phase.
By (1.11), \((\beta ,h) \mapsto \textsc {f}^\mathrm {que}(\beta ,h)\) is the pointwise limit of jointly convex functions. Moreover, \(h \mapsto Z_{N,\beta ,h}^\omega \) is non-increasing, so that \(h \mapsto \textsc {f}^\mathrm {que}(\beta ,h)\) is non-increasing as well. Furthermore, \(\beta \mapsto {\mathbb {E}}[\log Z_{N,\beta ,h}^\omega ]\) is convex and (by direct computation) has zero derivative at \(\beta = 0\), so that \(\beta \mapsto \textsc {f}^\mathrm {que}(\beta ,h)\) is non-decreasing on \([0,\infty )\).
From the monotonicity of \(h \mapsto \textsc {f}^\mathrm {que}(\beta ,h)\) it follows that \({\mathcal {L}}^\mathrm {que}\) and \({\mathcal {D}}^\mathrm {que}\) are separated by a quenched critical curve \(h_c^\mathrm {que}:\, [0,\infty ) \rightarrow [0,\infty )\) whose graph is \(\partial {\mathcal {D}}^\mathrm {que}\) (see Fig. 1):
From the convexity of \((\beta ,h) \mapsto \textsc {f}^\mathrm {que}(\beta ,h)\) it follows that the lower level set \({\mathcal {D}}^\mathrm {que}= \{(\beta ,h):\, \textsc {f}^\mathrm {que}(\beta ,h) \le 0\}\) is convex. Since \({\mathcal {D}}^\mathrm {que}\) is the upper graph of \(h_c^\mathrm {que}\), it follows that \(h_c^\mathrm {que}\) is convex and hence continuous. In Sect. 1.6 we will see that \(h_c^\mathrm {que}\) is finite everywhere. Since S is recurrent, it follows from the theory of the homogeneous pinning model (Giacomin [21, 22], den Hollander [24]) that \(\textsc {f}^\mathrm {que}(0,h) > 0\) for \(h < 0\) and \(\textsc {f}^\mathrm {que}(0,h)= 0\) for \(h \ge 0\). Hence \(h_c^\mathrm {que}(0) = 0\). (Note that \(\varphi (x) \ge \varphi (x_*) \mathsf{1}_{\{x_*\}}(x)\) for any \(x_*\in {\mathbb {Z}}\) with \(\varphi (x_*) > 0\). Therefore we can dominate the quenched free energy for \(\beta = 0\) by the free energy of the homogeneous pinning model with a strictly positive pinning reward.)
Finally, from the monotonicity of \(\beta \mapsto \textsc {f}^\mathrm {que}(\beta ,h)\) on \([0,\infty )\) it follows that \(\textsc {f}^\mathrm {que}(\beta ,h) \ge \textsc {f}^\mathrm {que}(0,h) > 0\) for \(h < 0\), so that \(h_c^\mathrm {que}(\beta ) \ge h_c^\mathrm {que}(0) = 0\) for \(\beta \ge 0\). Since \(h_c^\mathrm {que}\) is convex, this implies that \(h_c^\mathrm {que}\) is non-decreasing, and is strictly increasing as soon as it leaves zero. In Sect. 1.6 we will see that \(h_c^\mathrm {que}(\beta ) > 0\) for all \(\beta >0\) (see Fig. 1).
1.4 The annealed free energy
The annealed partition function associated with (1.4) is
where
This is the partition function of the homogeneous pinning model with potential \(\psi _{\beta ,h}\). A delicate point is that \(\psi _{\beta ,h}\) does not have a sign: it may be a mixture of attractive and repulsive interactions. This comes from the fact that both the charge distribution and the interaction potential are general.
The annealed free energy is defined by
For the constrained partition function
the existence of the limit
again follows by standard super-additivity arguments. Since \(\psi _{\beta ,h}\) is bounded, the limit is finite. The analogue of (1.14), which will be proved in Appendix A.3, reads
so that (1.22) follows.
The annealed localized phase, annealed delocalized phase and annealed critical curve are defined as
As is clear from (1.21) and the fact that \(\varphi \) is non-negative, \(\textsc {f}^\mathrm {ann}(\beta ,h)\) is non-increasing and convex as a function of h, and non-decreasing as a function of \(\beta \) but not necessarily convex. Later we will see that nonetheless \(\beta \mapsto h_c^\mathrm {ann}(\beta )\) has a shape that is qualitatively similar to that of \(\beta \mapsto h_c^\mathrm {que}(\beta )\) (see Fig. 2).
An important property of the annealed free energy is that it provides an upper bound for the quenched free energy: by Jensen’s inequality we have \(\textsc {f}^\mathrm {que}(\beta ,h) \le \textsc {f}^\mathrm {ann}(\beta ,h)\) for all \(\beta \in [0,\infty )\) and \(h\in {\mathbb {R}}\). Recalling (1.17), we therefore see that
Unlike for the pinning model and the copolymer model, for general potentials \(\varphi \) the annealed free energy and the annealed critical curve are not known explicitly.
1.5 Scaling properties of the Bessel random walk
Part of our results below involve the annealed free energy and the annealed critical curve associated with a Brownian version of the model, where the reference measure is based on the Bessel process \(X := (X_t)_{t \ge 0}\) of dimension \(2(1-\alpha )\) (see [27, Chapter XI]) defined by
where \(\alpha \in (0,1)\) and \((B_t)_{t \ge 0}\) is standard Brownian motion on \({\mathbb {R}}\).Footnote 1 Informally, X makes infrequent visits to 0 when \(\alpha <\tfrac{1}{2}\) and frequent visits to 0 when \(\alpha >\tfrac{1}{2}\). The choice \(\alpha =\tfrac{1}{2}\) corresponds to reflected Brownian motion, i.e., \(X_t = |B_t|\) with \((B_t)_{t \ge 0}\) standard Brownian motion. We write \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_x\) to denote the law of X given \(X_0 = x\). When \(x=0\), we simply write \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}={\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_0\).
For the semigroup \(g_t(x,y) := {\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_x(X_t \in \text {d}y )/\text {d}y\) of X there is an explicit formula, namely,
where \(J_{-\alpha }(z) = \sum _{m\in {\mathbb {N}}_0} \frac{(-1)^m}{m! \, \Gamma (m+1-\alpha )} (\frac{z}{2})^{2m-\alpha }\) is the Bessel function of index \(-\alpha \). Since
we also have the explicit formula
It follows that
and this inequality becomes sharp as \(\varepsilon \downarrow 0\), namely,
where
The local time of X at 0 up to time \(T \ge 0\) is defined as the following limit in probability:
We will informally write
Note that \({\hat{{{\,\mathrm{\mathrm{E}}\,}}}}[{\hat{L}}_T(0)] = \int _0^T \frac{\text {d}t}{t^{1-\alpha }}= \int _0^T {{\hat{g}}}_t(0,0) \, \text {d}t = \frac{1}{\alpha } \, T^{\alpha }\).
The relation of the Bessel process with the Bessel random walk defined in Remark 1.1 is that the latter satisfies the invariance principle (see Lamperti [26])
Write \({{\,\mathrm{\mathrm{P}}\,}}_{k'}(\,\cdot \,) := {{\,\mathrm{\mathrm{P}}\,}}(\,\cdot \,|\,S_0 = k')\) to denote the law of the Bessel random walk started at \(k'\in {\mathbb {Z}}\), so that \({{\,\mathrm{\mathrm{P}}\,}}= {{\,\mathrm{\mathrm{P}}\,}}_0\). Local limit theorems for the transition probabilities \({{\,\mathrm{\mathrm{P}}\,}}_{k'}(|S_n|=k)\) of the Bessel random walk have been established in [2, Theorem 2.4]. The following formulas hold in the limit as \(n\rightarrow \infty \), uniformly in a specified range of \(k, k' \in {\mathbb {N}}_0\). We assume that \(k-k'\) has the same parity as n, i.e., \(k-k'-n\) is even, because otherwise \({{\,\mathrm{\mathrm{P}}\,}}_{k'}(|S_n|=k)\)=0.
-
Low ending heights: For any \(\varepsilon > 0\) and \({{\bar{k}}}_n = o(\sqrt{n}\,)\), uniformly in \(0 \le k' \le \sqrt{n}/\varepsilon \) and \(0 \le k \le {{\bar{k}}}_n\),
$$\begin{aligned} {{\,\mathrm{\mathrm{P}}\,}}_{k'}(|S_n| = k) \sim \frac{2\,c(k)}{n^{1-\alpha }}\, {{\hat{g}}}_1\bigg (\frac{k'}{\sqrt{n}},0\bigg ) \, \mathsf{1}_{\{k-k'-n \text {~is even}\}}, \quad n \rightarrow \infty , \end{aligned}$$(1.38)where \({{\hat{g}}}_1\) is defined in (1.34) and \(c:\, {\mathbb {N}}_0 \rightarrow (0,\infty )\) is an explicit function (which depends on the function \(d:\,{\mathbb {Z}}\rightarrow {\mathbb {R}}\) in (1.7)) such that
$$\begin{aligned} c(k) \sim \frac{2^{\alpha }}{\Gamma (1-\alpha )}\,k^{1-2\alpha }, \quad k \rightarrow \infty . \end{aligned}$$(1.39)In case of a low starting height \(k' = o(\sqrt{n}\,)\), (1.38) simplifies because \({{\hat{g}}}_1(\frac{k'}{\sqrt{n}},0) \sim {{\hat{g}}}_1(0,0) = 1\).
-
Intermediate ending height: For any \(\varepsilon > 0\), uniformly in \(0 \le k' \le \sqrt{n}/\varepsilon \) and \(\varepsilon \sqrt{n} \le k \le \sqrt{n}/\varepsilon \),
$$\begin{aligned} {{\,\mathrm{\mathrm{P}}\,}}_{k'}(|S_n| = k) \sim \frac{2}{\sqrt{n}} \, g_1\bigg (\frac{k'}{\sqrt{n}, }\frac{k}{\sqrt{n}}\bigg )\, \mathsf{1}_{\{k-k'-n \text {~is even}\}}, \quad n \rightarrow \infty , \end{aligned}$$(1.40)where \(g_1\) is the density in (1.29) at time 1. In case of a low starting height \(k' = o(\sqrt{n}\,)\), (1.38) simplifies because \(g_1(\frac{k'}{\sqrt{n}},\frac{k}{\sqrt{n}}) \sim g_1(0,\frac{k}{\sqrt{n}}) = g_1(\frac{k}{\sqrt{n}})\) reduces to (1.31).
-
High ending heights: There exists a \(C = C(\alpha ) < \infty \) such that, for all \(k \ge \sqrt{n}\),
$$\begin{aligned} {{\,\mathrm{\mathrm{P}}\,}}(|S_n| = k) \le \frac{C}{\sqrt{n}} \, \mathrm {e}^{-\frac{k^2}{8n}}\,\mathsf{1}_{\{k-n \text {~is even}\}}. \end{aligned}$$(1.41)
It follows from (1.38)–(1.41) that, for some \(C < \infty \),
This uniform bound will be needed to control scaling computations.
Remark 1.3
Equations (1.38) and (1.40) for \(k' = 0\) are proved in [2, Theorem 2.4], while the case \(k' \ne 0\) follows via the relation \({{\,\mathrm{\mathrm{P}}\,}}_{k'}(|S_n| = k) = \sum _{m=1}^n {{\,\mathrm{\mathrm{P}}\,}}_{k'}(\tau _1 = m) \, {{\,\mathrm{\mathrm{P}}\,}}_0(|S_{n-m}|=k) + O({{\,\mathrm{\mathrm{P}}\,}}_{k'}(\tau _1 > n))\) (see also the estimates on \({{\,\mathrm{\mathrm{P}}\,}}_{k'}(\tau _1 = m)\) provided in [2, Theorem 2.2]). We further point out the duality relation \({{\,\mathrm{\mathrm{P}}\,}}_{k'}(|S_n| = k) = \frac{1+d(k')}{1+d(k)} \frac{\lambda _{k'}}{\lambda _k} \, {{\,\mathrm{\mathrm{P}}\,}}_k(|S_n| = k')\), where \(\lambda _k := \prod _{x=1}^k \frac{1-d(x)}{1+d(x)}\).
1.6 General properties
Our first set of theorems concerns the quenched and the annealed critical curve.
Theorem 1.4
\(\beta \mapsto h_c^\mathrm {que}(\beta )\) and \(\beta \mapsto h_c^\mathrm {ann}(\beta )\) can be characterized in terms of variational formulas (see Theorems 2.4–2.5 below).
Theorem 1.5
For every \(\beta \ge 0\),
As already noted in (1.27), the second inequality in (1.43) is an immediate consequence of Jensen’s inequality applied to (1.12) and (1.23). The first inequality in (1.43), which is known as the Monthus–Bodineau–Giacomin bound, was previously shown to hold for the copolymer model [9], [11]. We show that it holds for the general class of potentials satisfying (1.1).
In Sect. 3.3 we will show that \(F^\mathrm {ann}(\beta ,0)>0\) for all \(\beta >0\). This implies that \(h_c^\mathrm {ann}(\beta )>0\) for all \(\beta >0\), which via (1.43) settles the claim made at the end of Sect. 1.3 that \(h_c^\mathrm {que}(\beta )>0\) for all \(\beta >0\).
1.7 Scaling for weak interaction
Our second set of theorems looks at the scaling of the annealed free energy in the limit of weak interaction, for the special case where S is the Bessel random walk with parameter \(\alpha \in (0,1)\) defined in Remark 1.1 and the interaction potential \(\varphi \) is symmetric: \(\varphi (-x) = \varphi (x)\) for all \(x\in {\mathbb {Z}}\). We consider three different regimes for the tail behaviour of \(\varphi \), namely,
with
Theorem 1.6
Suppose that \(\alpha \in (0,1)\) and \(\vartheta \in (0,1-\alpha )\). For every \({{\hat{\beta }}} \in (0,\infty )\) and \({\hat{h}} \in (0,\infty )\),
where
with c the constant in (1.44).
Theorem 1.7
Suppose that \(\alpha \in (0,1)\) and \(\vartheta \in (1-\alpha ,2(1-\alpha ))\). For every \({{\hat{\beta }}} \in (0,\infty )\) and \({\hat{h}} \in (0,\infty )\),
where
with c the constant in (1.44) and \(c^*[\varphi ^2]=\sum _{x\in {\mathbb {Z}}} \varphi ^2(x)\,c(x)\), where \(x \mapsto c(x) := c(|x|)\) is the function in (1.38)–(1.39).
Theorem 1.8
Suppose that \(\alpha \in (0,1)\) and \(\vartheta \in (2(1-\alpha ),\infty )\). For every \({{\hat{\beta }}} \in (0,\infty )\) and \({\hat{h}} \in (0,\infty )\),
where
with \(c^*[\varphi ] = \sum _{x\in {\mathbb {Z}}} \varphi (x) c(x)\), where \(x \mapsto c(x) := c(|x|)\) is the function in (1.38)–(1.39).
Note that, because of (1.39) and (1.44), \(c^*[\varphi ^2]<\infty \) when \(\vartheta >1-\alpha \) and \(c^*[\varphi ]<\infty \) when \(\vartheta >2(1-\alpha )\). In Appendix B.1 we will show that the annealed partition functions associated with the Bessel process appearing in Theorems 1.6–1.8 are finite, and so are the corresponding annealed free energies.
The annealed free energy \({\hat{F}}^\mathrm {ann}({{\hat{\beta }}},{\hat{h}})\) appearing in Theorems 1.6–1.8 has its own phase diagram, with phases
and with a critical curve that is a perfect power law (see Fig. 3), namely,
where
plays the role of a critical exponent (see Fig. 4). The scaling of the annealed critical curve in Theorems 1.6–1.8 can be summarised as saying that \(h_c^\mathrm {ann}(\beta ) \sim {\hat{h}}_c^\mathrm {ann}(\beta )\), \(\beta \downarrow 0\).
The constant \({\hat{C}}\) depends on \(\alpha ,\varphi \) and can be characterized as the unique solution \({\hat{C}} \in (0,\infty )\) of the equation \({\hat{F}}^\mathrm {ann}(1,{\hat{C}})=0\). This constant is hard to identify in the first two regimes. In the third regime \(\vartheta \in (2(1-\alpha ),\infty )\) it is found by inserting (1.53) into the equation
which gives
We show in “Appendix B.2” that, for the the third regime \(\vartheta \in (2(1-\alpha ),\infty )\), the annealed free energy \({\hat{F}}^\mathrm {ann}({\hat{\beta }},{\hat{h}})\) can be computed explicitly, namely,
Remark 1.9
In view of the scaling limit for the annealed free energies described in Theorems 1.6–1.8, it is natural to expect a scaling limit for the corresponding annealed critical curves as well. Indeed, the continuum critical curve is the perfect power law in (1.53), where \({\hat{C}}\) and E depend on \(\alpha \) and \(\varphi \) (and hence on \(\vartheta \)). We conjecture that (1.53) captures the asymptotic behaviour for weak interaction of the discrete critical curve \(h_c^\mathrm {ann}(\beta )\) as well, in the sense that in all three regimes we should have
This scaling relation cannot be simply deduced from Theorems 1.6–1.8, because pointwise convergence of the free energies does not imply convergence of their zero-level sets, of which the critical curves are the boundaries. However, half of (1.58) follows because if the continuum free energy is strictly positive, then the rescaled discrete free energy eventually becomes strictly positive too in the weak interaction limit, which leads to
In order to prove (1.58) extra work is needed: the scaling of the free energies in (1.46), (1.48) and (1.50) must be strengthened to a perturbative scaling, as shown in [10, 12] for the copolymer model and in [16] for the pinning model.
1.8 Discussion
We comment on the results in Sects. 1.6–1.7.
-
1.
The results in Theorems 1.4–1.5 are known for the special case where the interaction potential is that of the pinning model or the copolymer model defined in (1.10). However, the techniques used for these two cases do not carry over to the general class of potentials considered in (1.1). Intuitively, the reason why extension is possible is that the conditions stated in (1.1) say that, outside a large interval around the origin in \({\mathbb {Z}}\), the interaction potential is controlled by a multiple of that of the copolymer model.
-
2.
As we will see in Sect. 2, the variational formula for \(h_c^\mathrm {que}(\beta )\) mentioned in Theorem 1.4 involves a supremum over the space of all shift-invariant probability distributions on the set of infinite sequences of words of arbitrary length drawn from an infinite sequence of letters taking values in \({\mathbb {R}}\times {\mathbb {Z}}\). The supremum involves a quenched rate function that captures the complexity of the interplay between the disorder of the charges and the excursions of the polymer away from the interface. This variational formula is hard to manipulate, but it is the starting point for the proof of Theorems 1.5. The variational formula for \(h_c^\mathrm {ann}(\beta )\) mentioned in Theorem 1.4 is simpler, but still not easy to manipulate (see (1.62) below).
-
3.
Note that for \(h = 0\) and \(\beta > 0\) the annealed partition function \(Z_{N, \beta , h}^\mathrm {ann}\) is bounded from below by the partition function of a homogenous pinning model with a strictly positive reward, which is localized. The lower bound in Theorem 1.5 therefore shows that \(h^\mathrm {que}_c(\beta ) > 0\) for every \(\beta > 0\). Since \(\beta \mapsto h_c^\mathrm {que}(\beta )\) is convex, it must therefore be strictly increasing (see Fig. 1).
-
4.
Disorder has a tendency to smoothen the phase transition. In Caravenna and den Hollander [14] a general smoothing inequality is derived that reads as follows:
-
For every \(\beta > 0\) there exist \(C(\beta ) < \infty \) and \({\delta }(\beta ) > 0\) such that
$$\begin{aligned} 0 \le \textsc {f}^\mathrm {que}(\beta , h_c^\mathrm {que}(\beta ) - {\delta }) \le C(\beta ) {\delta }^2 \quad \forall \, 0 \le {\delta } \le {\delta }(\beta ), \end{aligned}$$(1.60)i.e., the quenched phase transition is at least of second order.
Unfortunately, the key assumption under which this smoothing inequality is derived is not obviously met by spatially extended pinning: it does when the tail exponent of \(\varphi \) (recall (1.44)) satisfies \(\vartheta \in (2,\infty )\), but it is unclear whether it also does when \(\vartheta \in (0,2]\). Indeed, the key assumption in [14] requires that \(Z_{N,\beta ,h}^{\omega ,\mathrm {c}} \ge N^{-\gamma } c^\omega _{\beta ,h}\) for some \(\gamma >0\) with \({\mathbb {E}}[\log c^\omega _{\beta ,h}]>-\infty \). Applying Jensen’s inequality to (1.12), we get
$$\begin{aligned} Z_{N,\beta ,h}^{\omega ,\mathrm {c}} \ge \mathrm {e}^{\sum _{n=1}^N (\beta \omega _n - h) {{\,\mathrm{\mathrm{E}}\,}}[\varphi (S_n) \mid S_N=0]}\, {{\,\mathrm{\mathrm{P}}\,}}(S_N=0), \quad N\in {\mathbb {N}}. \end{aligned}$$(1.61)In general, if \({{\,\mathrm{\mathrm{P}}\,}}(S_N=0) \ge CN^{-\gamma }\) for some \(\gamma ,C>0\) and all \(N\in {\mathbb {N}}\) and, furthermore, \(\sup _{N\in {\mathbb {N}}} \sum _{n=1}^N |{{\,\mathrm{\mathrm{E}}\,}}[\varphi (S_n) \mid S_N=0]| < \infty \), then the assumption is met (recall (1.3)). For the Bessel random walk, the former holds by (1.38) for \(\gamma =1-\alpha \), while the latter holds by (1.41) when \(\varphi (x) \sim c \, |x|^{-\vartheta }\) with \(\vartheta \in (2,\infty )\), because in that case, for \(n \le N/2\) (by symmetry), \(\sup _{N\in {\mathbb {N}}}{{\,\mathrm{\mathrm{E}}\,}}[\varphi (S_n) \mid S_N=0] \lesssim \varphi (\sqrt{n}\,) \asymp n^{-\vartheta /2}\) is summable.
-
-
5.
Theorems 1.6–1.8 give detailed information about the scaling of the annealed free energy and the annealed critical curve in the limit of weak interaction. The scaling limits correspond to annealed free energies and annealed critical curves for Brownian versions of the model involving the Bessel process \(X^\alpha \), which are interesting in their own right. The result is only valid for the Bessel random walk, and shows a trichotomy depending on the parameters \(\alpha \) and \(\vartheta \).
-
The regime \(\vartheta \in (0,1-\alpha )\) corresponds to a long-range interaction potential and is not pinning-like. When localized, the continuum polymer spends a positive fraction of the time near any height \(x \in {\mathbb {R}}\), and this fraction tends to zero as \(|x| \downarrow 0\) or \(|x| \rightarrow \infty \). Away from 0 it does not behave like the Bessel process conditioned to return to 0.
-
The regime \(\vartheta \in (1-\alpha ,2(1-\alpha ))\) corresponds to an intermediate-range interaction potential and exhibits some pinning-like features. When localized, the continuum polymer visits 0 a positive fraction of the time. Away from 0 it does not behave like the Bessel process conditioned to return to 0.
-
The regime \(\vartheta \in (2(1-\alpha ),\infty )\) corresponds to a short-range interaction potential and is pinning-like. When localized, the continuum polymer visits 0 a positive fraction of the time. Away from 0 it behaves like the Bessel process conditioned to return to 0.
In the last regime the behaviour is similar to that of the homogeneous pinning model with \(\varphi (x) = c\mathsf{1}_{\{x=0\}}\), for which it is known that \(h^\mathrm {ann}_c(\beta ) \sim \tfrac{1}{2} c \beta ^2\), \(\beta \downarrow 0\) (see Giacomin [21], [22], den Hollander [24]). In fact, the proof of Theorem 1.8 will show that the scaling in the last regime is valid for any \(\varphi \) such that \(c^*[\varphi ^2]\) and \(c^*[\varphi ]\) are finite, i.e., (1.44) may be replaced by the weaker condition \(\varphi (x) = O(|x|^{-2(1-\alpha )-\varepsilon })\) for some \(\varepsilon >0\).
-
-
6.
The three regimes for \(\vartheta \) represent three universality classes. The critical cases \(\vartheta = 1-\alpha \) and \(\vartheta = 2(1-\alpha )\) are more delicate and we have skipped them. Also, we have not investigated what happens when the scaling of the interaction potential in (1.44) is modulated by a slowly varying function. For the same reason we have assumed that the error term in (1.8) is \(O(|x|^{-(1+\varepsilon )})\) with \(\varepsilon >0\) rather than \(o(|x|^{-1})\), since the latter may give rise to modulation by slowly varying functions in (1.38) and (1.39) (see Alexander [2]).
-
7.
Theorems 1.6–1.8 are deduced from scaling properties of the annealed partition function. They are not specific to the annealed model. In fact, analogous results hold for every homogeneous pinning potential that is a linear combination of potentials with a polynomial tail. See Sect. 4 for more details.
-
8.
By the considerations made in Sect. 3, the annealed model defined in (1.20)–(1.21) is localized (i.e., \(\textsc {f}^\mathrm {ann}(\beta ,h) > 0\), \(h < h_c^\mathrm {ann}(\beta )\)) if and only if
$$\begin{aligned} \sum _{m\in {\mathbb {N}}} {{\,\mathrm{\mathrm{E}}\,}}\bigg [ \mathrm {e}^{\sum _{n=1}^{m} \psi _{\beta ,h}(S_n)} \, \mathsf{1}_{\{\tau _1 = m\}} \bigg ] > 1, \end{aligned}$$(1.62)where we recall that \(\tau _1\) denotes the first return time of S to 0. Although the starting point 0 seems to play a special role in (1.62), it can be shown that the criterion in (1.62) is invariant under spatial shifts of \(\psi _{\beta ,h}\) (see Appendix C).
A natural question is what happens when the random walk S is transient, i.e., \({{\,\mathrm{\mathrm{P}}\,}}(\tau _1< \infty ) = \sum _{n\in {\mathbb {N}}} K(n) =: r < 1\). For the constrained partition function \(Z_{N,\beta ,h}^{\omega ,\mathrm {c}}\), working with a transient renewal process with law K is equivalent to working with a recurrent renewal process with law K/r and adding a depinning term \(\sum _{n=1}^N (\log r) \mathsf{1}_{\{S_n = 0\}}\) in the exponential in (1.12). This amounts to replacing \(\psi _{\beta ,h}(x)\) by \(\psi _{\beta ,h}(x) + (\log r)\mathsf{1}_{\{x=0\}}\), and so instead of (1.62) the localization condition for the annealed model becomes
-
(I)
For the copolymer model we have \(\varphi (x) = \varphi ^\mathrm {cop}(x) = \mathsf{1}_{\{x \le 0\}}\) (recall (1.10)) and \(\psi _{\beta ,h}(x) = (\tfrac{1}{2}\beta ^2-h) \mathsf{1}_{(-\infty ,0]}(x)\). Therefore \(h_c^\mathrm {ann}(\beta ) = \tfrac{1}{2}\beta ^2\) and, in fact, the left-hand side of (1.62) is \(\le 1\) for \(h \le h_c^\mathrm {ann}(\beta )\) and is \(=\infty \) for \(h > h_c^\mathrm {ann}(\beta )\). This means that the annealed critical curve \(h_c^\mathrm {ann}\) does not depend on r, and hence neither do the bounds in (1.43). In other words, making the underlying renewal process transient or, equivalently, adding a homogeneous depinning term at zero, does not modify the annealed critical curve of the copolymer model. In essence this is due to the fact that the copolymer potential is long range (i.e., \(\psi _{\beta ,h}(x)\) does not vanish as \(x \rightarrow -\infty \)).
-
(II)
For the pinning model, adding a depinning term at zero amounts to shifting h and this may have an effect. In essence this is due to the fact that the pinning potential \(\varphi (x) = \varphi ^\mathrm {pin}(x) = \mathsf{1}_{\{x=0\}}\) is short range.
1.9 Open problems and outline
-
1.
Are the inequalities in (1.43) strict for all \(\beta >0\)? For the copolymer model (\(\varphi (x)=1_{\{x \le 0\}}\)) the answer is yes for all \(\alpha >0\) (Bolthausen et al. [11]). Moreover, it is known that (Bolthausen and den Hollander [10], Caravenna and Giacomin [12])
$$\begin{aligned} \lim _{\beta \downarrow 0} \beta ^{-1} h^\mathrm {ann}_c(\beta ) = 1, \quad \lim _{\beta \downarrow 0} \beta ^{-1} h^\mathrm {que}_c(\beta ) = C(\alpha ), \end{aligned}$$(1.64)with \(C(\alpha ) \in (1/(1+\alpha ),1)\) for \(0<\alpha <1\) (Bolthausen et al. [11]) and \(C(\alpha )=1/(1+\alpha )\) for \(\alpha \ge 1\) (Berger et al. [4]). For the pinning model (\(\varphi (x)=1_{\{x=0\}}\)) the answer depends on \(\alpha \): no for \(0<\alpha <\tfrac{1}{2}\) (the upper bound is an equality), yes for \(\alpha >\tfrac{1}{2}\). Moreover, it is known that
$$\begin{aligned} \lim _{\beta \downarrow 0} \beta ^{-2} h_c^\mathrm {ann}(\beta ) = \tfrac{1}{2}, \quad \lim _{\beta \downarrow 0} \beta ^{-2} h_c^\mathrm {que}(\beta ) = C'(\alpha ), \end{aligned}$$(1.65)with \(C'(\alpha )=\tfrac{1}{2}\) for \(0<\alpha <1\) and \(C'(\alpha ) \in (0,\tfrac{1}{2})\) for \(\alpha >1\). As a matter of fact, refined estimates are available:
-
\(\alpha \in (0,\tfrac{1}{2})\): \(h^\mathrm {ann}_c(\beta ) = h^\mathrm {que}_c(\beta )\) for \(\beta > 0\) small enough (Alexander [1], see also Cheliotis and den Hollander [17]);
-
\(\alpha \in (\tfrac{1}{2},1)\): \(h^\mathrm {ann}_c(\beta )-h^\mathrm {que}_c(\beta ) \sim c_\alpha \, \beta ^{2\alpha /(2\alpha -1)}\) as \(\beta \downarrow 0\) for a universal constant \(c_\alpha \in (0,\infty )\) (Caravenna et al. [16], and previously Alexander and Zygouras [3], Derrida et al. [20]).
-
\(\alpha = \frac{1}{2}\): \(h^\mathrm {ann}_c(\beta )-h^\mathrm {que}_c(\beta ) = \exp (-\frac{\pi }{2\beta ^2}[1+o(1)])\) (Berger and Lacoin [6], and previously Giacomin et al. [23]).
For an overview, we refer the reader to Giacomin [22].
-
-
2.
Determine the order of the quenched phase transition. For the copolymer model it is known that the phase transition is of infinite order when \(\alpha =0\) (Berger et al. [5]). The same is conjectured to be true for \(\alpha \in (0,1)\).
-
3.
Identify the scaling for weak interaction of the annealed model in the critical cases \(\vartheta = 1-\alpha \) and \(\vartheta = 2(1-\alpha )\).
-
4.
Identify the scaling for weak interaction of the quenched model. Because of Theorem 1.5, the same exponent E as in (1.54) applies.
-
5.
The qualitative shape of the critical curve in Fig. 1 depends on our assumption in (1.1) that \(\varphi \ge 0\). A reflected picture holds when \(\varphi \le 0\). It appears that for \(\varphi \) with mixed signs there are two critical curves \(\beta \mapsto h_ {c,1}^\mathrm {que}(\beta )\) and \(\beta \mapsto h_{c,2}^\mathrm {que}(\beta )\), separating a single quenched delocalized phase \({\mathcal {D}}^\mathrm {que}\) from two quenched localized phases \({\mathcal {L}}^\mathrm {que}_1\) and \({\mathcal {L}}^\mathrm {que}_2\) that lie above \({\mathcal {D}}^\mathrm {que}\), respectively, below \({\mathcal {D}}^\mathrm {que}\). What are the properties of these critical curves?
-
6.
What happens when \(\beta =\beta _N\) and \(h=h_N\) with \(\beta _N,h_N \downarrow 0\) as \(N\rightarrow \infty \).
-
7.
Is it possible to include non-nearest-neighbour random walks?
Outline The remainder of this paper is organized as follows. Theorem 1.4 is proved in Sect. 2, Theorem 1.5 in Sect. 3 and Theorems 1.6–1.8 in Sect. 4. “Appendices A and B” collect a few technical facts that are needed along the way.
2 Proof of Theorem 1.4
In Sect. 2.1 we formulate annealed and quenched large deviation principles (LDPs) that are an adaptation to our model of the LDPs developed in Birkner [7] and Birkner et al. [8]. The latter concern LDPs for random sequences of words cut out from random sequences of letters according to a renewal process. In Sect. 2.2 we formulate variational characterizations of the annealed and quenched critical curves that are an adaptation of the characterizations derived for the pinning model in Cheliotis and den Hollander [17] and for the copolymer model in Bolthausen et al. [11]. In Sect. 2.3 we explain how the variational characterizations follow from the LDPs via Varadhan’s lemma.
2.1 Annealed and quenched LDP
Our starting observation is that the partition function in (1.4) depends on the sequence of words \(Y=(Y_i)_{i\in {\mathbb {N}}}\) determined by the disorder and by the excursions of the polymer, namely,
where \(\tau =(\tau _i)_{i\in {\mathbb {N}}}\) is the sequence of epochs of the successive visits of the polymer to zero (\(\tau _0=0\)). Note that the random variables \(Y_i\) take their values in the space \({\tilde{\Gamma }} := \bigcup _{n\in {\mathbb {N}}} \Gamma ^n\) with \(\Gamma := {\mathbb {R}}\times {\mathbb {Z}}\).
To capture the role of Y, we introduce its empirical process,
where \({\mathcal {P}}^\mathrm {inv}({{\tilde{\Gamma }}}^{\mathbb {N}})\) denotes the set of probability measures on \({{\tilde{\Gamma }}}^{\mathbb {N}}\) that are invariant under the left-shift \({{\tilde{\vartheta }}}\) acting on \({{\tilde{\Gamma }}}^{\mathbb {N}}\). The superscript \(\omega \) reminds us that the random variables \(Y_i\) are functions of \(\omega \). We must average over S while keeping \(\omega \) fixed. Note that, under the annealed law \({\mathbb {P}}\otimes {{\,\mathrm{\mathrm{P}}\,}}\), Y is i.i.d. with the following marginal law \(q_0\) on \({{\tilde{\Gamma }}}\):
where \(K(n) := {{\,\mathrm{\mathrm{P}}\,}}(\tau _1 = n)\) and \(\nu (\text {d}x) := {\mathbb {P}}(\omega _1 \in \text {d}x)\).
The specific relative entropy of Q w.r.t. \(q_0^{\otimes {\mathbb {N}}}\) is defined by
where \({{\tilde{\pi }}}_N Q \in {\mathcal {P}}({{\tilde{\Gamma }}}^N)\) denotes the projection of Q onto the first N words, \(h(\,\cdot \mid \cdot \,)\) denotes relative entropy, and the limit is non-decreasing. The following annealed LDP is standard (see Dembo and Zeitouni [19, Section 6.5]).
Proposition 2.1
(Annealed LDP). The family \(({\mathbb {P}}\otimes {{\,\mathrm{\mathrm{P}}\,}})(R_M^\omega \in \cdot \,)\), \(M\in {\mathbb {N}}\), satisfies the LDP on \({\mathcal {P}}^{\mathrm {inv}}({{\tilde{\Gamma }}}^{\mathbb {N}})\) with rate M and with rate function \(I^{\mathrm {ann}}\) given by
This rate function is lower semi-continuous, has compact level sets, has a unique zero at \(q_0^{\otimes {\mathbb {N}}}\), and is affine.
The quenched LDP is more delicate and requires extra notation. The reverse operation of cutting words out of a sequence of letters is glueing words together into a sequence of letters. Formally, this is done by defining a concatenation map \(\kappa \) from \({{\tilde{\Gamma }}}^{\mathbb {N}}\) to \(\Gamma ^{\mathbb {N}}\). This map induces in a natural way a map from \({\mathcal {P}}({{\tilde{\Gamma }}}^{\mathbb {N}})\) to \({\mathcal {P}}(\Gamma ^{\mathbb {N}})\), the sets of probability measures on \({{\tilde{\Gamma }}}^{\mathbb {N}}\) and \(\Gamma ^{\mathbb {N}}\) (endowed with the topology of weak convergence). The concatenation \(q_0^{\otimes {\mathbb {N}}} \circ \kappa ^{-1}\) of \(q_0^{\otimes {\mathbb {N}}}\) equals \(\nu ^{\otimes {\mathbb {N}}}\), as is evident from (2.3).
For \(Q\in {\mathcal {P}}^\mathrm {inv}({{\tilde{\Gamma }}}^{\mathbb {N}})\), let \(m_Q:=E_Q(\tau _1) \in [1,\infty ]\) be the average word length under Q (\(E_Q\) denotes expectation under the law Q and \(\tau _1\) is the length of the first word). Let
For \(Q\in {\mathcal {P}}^{\mathrm {inv,fin}}({{\tilde{\Gamma }}}^{\mathbb {N}})\), define
Think of \(\Psi _Q\) as the shift-invariant version of \(Q\circ \kappa ^{-1}\) obtained after randomizing the location of the origin. This randomization is necessary because a shift-invariant Q in general does not give rise to a shift-invariant \(Q\circ \kappa ^{-1}\).
The following quenched LDP is a straight adaptation of the one derived in Birkner, Greven and den Hollander [8].
Proposition 2.2
(Quenched LDP). For \({\mathbb {P}}\)-a.e. \(\omega \) the family \({{\,\mathrm{\mathrm{P}}\,}}(R_M^\omega \in \cdot \,)\), \(M\in {\mathbb {N}}\), satisfies the LDP on \({\mathcal {P}}^\mathrm {inv}({{\tilde{\Gamma }}}^{\mathbb {N}})\) with rate M and with rate function given by
where \(\alpha \) is the exponent in (1.6) and \(B_{\delta }(Q)\) is the \({\delta }\)-ball around Q (in any appropriate metric). This rate function is lower semi-continuous, has compact level sets, has a unique zero at \(q_0^{\otimes {\mathbb {N}}}\), and is affine.
Remark 2.3
In [8] a formula was claimed for \(I^\mathrm {que}\) on \({\mathcal {P}}^{\mathrm {inv}}({{\tilde{\Gamma }}}^{\mathbb {N}}) \setminus {\mathcal {P}}^{\mathrm {inv},\mathrm {fin}}({{\tilde{\Gamma }}}^{\mathbb {N}})\) based on a truncation approximation for the average word length. As pointed out by Jean-Christophe Mourrat (private communication), the proof of this formula in [8] is flawed. The formula itself may still be correct, but no proof is currently available. In the present paper we will only need to know \(I^\mathrm {que}\) on \({\mathcal {P}}^{\mathrm {inv},\mathrm {fin}} ({{\tilde{\Gamma }}}^{\mathbb {N}})\).
2.2 Variational criterion for localization
For \(N\in {\mathbb {N}}\), let \(\ell _N\) be the number of returns to zero of the polymer before epoch N, i.e.,
Let \(\Phi :\,{{\tilde{\Gamma }}} \rightarrow {\mathbb {R}}\) be defined by
Then the constrained quenched partition function defined in (1.12) can be written as
while the constrained annealed partition function defined in (1.23) can be written as
where \({{\tilde{\pi }}}_1 Q\) denotes the projection of Q onto the space \({{\tilde{\Gamma }}}\) of the first word.
With the help of Propositions 2.1–2.2 we can derive the following variational characterization of the annealed and the quenched critical curve. Note that \(\textsc {f}^\mathrm {ann}(\beta ,h) > 0\) if and only if \(h<h_c^\mathrm {ann}(\beta )\) and \(\textsc {f}^\mathrm {que}(\beta ,h) > 0\) if and only if \(h<h_c^\mathrm {que}(\beta )\).
Theorem 2.4
[Annealed localization]. For every \(\beta ,h>0\),
Theorem 2.5
[Quenched localization]. For every \(\beta ,h>0\),
As we will see below, the role of the conditions \(m_Q<\infty \) and \(I^\mathrm {ann}(Q)< \infty \) under the two suprema is to ensure that \(\int _{{{\tilde{\Gamma }}}} \Phi _{\beta ,h}\,\text {d}({{\tilde{\pi }}}_1 Q) < \infty \), so that the suprema are well defined. The condition \(m_Q<\infty \) under the second supremum allows us to use the representation in (2.8). We will see in Sect. 3 how the variational formulas in (2.13)–(2.14) can be exploited.
2.3 Proof of Theorems 2.4–2.5
The proof uses arguments developed in Bolthausen et al. [11]. Theorems 2.4–2.5 follow from Propositions 2.1–2.2 with the help of Varadhan’s lemma applied to (2.11)–(2.12). The only difficulty we need to deal with is the fact that both \(Q \mapsto m_Q\) and \(Q \mapsto \Phi ^*_{\beta ,h}(Q) = \int _{{{\tilde{\Gamma }}}} \Phi _{\beta ,h}\, \text {d}({{\tilde{\pi }}}_1Q)\) are neither bounded nor continuous in the weak topology. Therefore an approximation argument is required, which is worked out in detail in [11, Appendices A–D] for the case of the copolymer interaction potential in (1.10). This approximation argument shows why the restriction to \(m_Q<\infty \) and \(I^\mathrm {ann}(Q)<\infty \) may be imposed, a key ingredient being that \(I^\mathrm {ann}(Q)<\infty \) implies \(\Phi ^*_{\beta ,h}(Q)<\infty \). The proof in [11, Appendices A–D] readily carries over because our condition on the interaction potential in (1.1) reflects the properties of the copolymer interaction potential. We sketch the main line of thought. Throughout the sequel \(\beta ,h>0\) are fixed.
Proof of Theorem 2.5
Following the argument in [11, Appendix A], we show that
-
(1)
For every \(g>0\), \(M \mapsto \Psi _{\beta ,h}(R_M^\omega )-gm_{R_M^\omega }\) is bounded \(\omega \)-a.s.
-
(2)
For every \(\varrho \in {\mathcal {P}}({\mathbb {N}})\), \(\nu \in {\mathcal {P}}({\mathbb {R}})\) and \(p=(p_n)_{n\in {\mathbb {N}}}\) with \(p_n \in {\mathcal {P}}({\mathbb {Z}}^n)\), there exist \(\gamma >0\) and \(K=K(\varrho ,\nu ,p;\gamma )>0\) such that \(\Phi ^*_{\beta ,h}(Q) \le \gamma h({\tilde{\pi }}_1Q \mid q_{\varrho ,\nu ,p}) + K\) for all \(Q\in {\mathcal {P}}^{\mathrm {inv}}({{\tilde{\Gamma }}}^{\mathbb {N}})\) with \(h({\tilde{\pi }}_1Q \mid q_{\varrho ,\nu ,p})<\infty \), where (compare with (2.3))
$$\begin{aligned}&q_{\varrho ,\nu ,p}\big ( (\text {d}x_1, \ldots , \text {d}x_n) \times \{(s_1, \ldots , s_n)\} \big ) = \varrho (n) \, \nu (\text {d}x_1) \cdots \nu (\text {d}x_n) \, p_n(s_1,\ldots ,s_n),\nonumber \\&n\in {\mathbb {N}},\,x_1,\ldots ,x_n \in {\mathbb {R}},\,s_1,\ldots ,s_n \in {\mathbb {Z}}. \end{aligned}$$(2.15)
The proof uses the fact that the conditions in (1.1) allow us to approximate \(\varphi \) by a multiple of \(\varphi ^{\mathrm {cop}}\) (recall (1.10)) uniformly on \({\mathbb {Z}}\setminus [-L,L]\) at arbitrary precision as \(L\rightarrow \infty \). The proof also uses a concentration of measure estimate for the disorder, which is proved in [11, Appendix D].
For \(g>0\), define the quenched free energy
where
is the quenched partition function in which every letter gets an energetic penalty \(-g\). Following the argument in [11, Appendix B], we use (1) and (2) to show that, for every \(g>0\),
where \({\mathcal {R}}\) is the set of shift-invariant probability measures under which the concatenation of words produces a letter sequence that has the same asymptotic statistics as a typical realisation of Y, i.e.,
(\(w-\lim \) means weak limit). The proof of (2.18) carries over verbatim. What (2.18) says is that Varadhan’s lemma applies to (2.16)–(2.17) because of the control enforced by (1) and (2).
Following the argument in [11, Appendix C], we show that
with
Here, in the passage from (2.18) to (2.21), the constraint in \({\mathcal {R}}\) disappears from the variational characterization, while \(I^\mathrm {ann}(Q)\) is replaced by \(I^\mathrm {que}(Q)\). The reason is that for every \(Q \in {\mathcal {P}}^\mathrm {inv}({{\tilde{\Gamma }}}^{\mathbb {N}})\) there exists a sequence \((Q_n)_{n\in {\mathbb {N}}}\) in \({\mathcal {R}}\) such that \(Q = w-\lim _{n\rightarrow \infty } Q_n = Q\) and \(\lim _{n\rightarrow \infty } I^\mathrm {ann}(Q_n) = I^\mathrm {que}(Q)\). The proof of (2.21) carries over verbatim. The supremum in the right-hand side of (2.21) is the same as the supremum in the right-hand side of (2.14). \(\square \)
Proof of Theorem 2.4
Varadhan’s lemma also applies to
and yields
with
Again, this works because of the control enforced by (2). The supremum in the right-hand side of (2.24) is the same as the supremum in the right-hand side of (2.13). \(\square \)
3 Proof of Theorem 1.5
The upper bound in (1.43) is immediate from Theorems 2.4–2.5 and the inequality \(I^\mathrm {que}\ge I^\mathrm {ann}\) (see also (1.27)). In Sects. 3.1–3.3 we prove the lower bound in (1.43). This lower bound is the analogue of what for the copolymer model is called the Monthus–Bodineau–Giacomin lower bound (see Giacomin [21], den Hollander [24]).
3.1 A sufficient criterion for quenched localization
The quenched rate function can be written as
with
It can be shown that \(R(Q) \ge 0\) for all \(Q\in {\mathcal {P}}^\mathrm {inv}({{\tilde{\Gamma }}}^{\mathbb {N}})\): R(Q) has the meaning of a concatenation entropy (see Birkner et al. [8]). Therefore, dropping this term in (2.14) we obtain the following sufficient criterion for quenched localization:
The right-hand side resembles the necessary and sufficient criterion for annealed localization in (2.13), the only difference being the extra factor \(1+\alpha \).
3.2 Reduction
Among the laws \(Q \in {\mathcal {P}}^\mathrm {inv}({{\tilde{\Gamma }}}^{\mathbb {N}})\) with a given marginal law \(q \in {\mathcal {P}}({{\tilde{\Gamma }}})\), the product law \(Q = q^{\otimes {\mathbb {N}}}\) is the unique minimizer of the specific relative entropy \(H(Q \mid q_0^{\otimes {\mathbb {N}}})\). Therefore the right-hand side of (3.3) reduces to
where \(h(\cdot \mid \cdot )\) denotes relative entropy.
We next show that (3.3) reduces to an even simpler criterion. To that end, let \(C_N := \bigcup _{n=1}^N \Gamma ^n\) be the subset of words of length at most N. Consider the law \({{\hat{q}}}_N \in {\mathcal {P}}({{\tilde{\Gamma }}})\) defined by
where
is the normalizing constant. The latter is finite because, by (2.10), \(\Phi \) restricted to \(C_N\) is the sum of at most N random variables with finite exponential moments. Note that also
which yields \(h({{\hat{q}}}_N \mid q_0) < \infty \). Trivially, \(m_{{{\hat{q}}}_N} \le N < \infty \). Therefore we are allowed to pick \(q = {{\hat{q}}}_N\) in (3.4), so that (3.4) is satisfied when
Since N is arbitrary, this in turn is satisfied when
where \({\mathcal {N}}_\alpha = \infty \) is allowed. Conversely, if \({\mathcal {N}}_\alpha \le 1\), then (3.4) is not satisfied. Indeed, as soon as \({\mathcal {N}}_\alpha < \infty \) we may introduce the law \({{\hat{q}}} \in {\mathcal {P}}({{\tilde{\Gamma }}})\) defined by
and rewrite \(h(q \mid q_0) = h(q \mid {{\hat{q}}}) - \frac{1}{1+\alpha } \int _{{{\tilde{\Gamma }}}} \Phi \, \text {d}q + \log {\mathcal {N}}_\alpha \), so that
where the last inequality holds for any q because \({\mathcal {N}}_\alpha \le 1\), and so (3.4) fails. Thus, (3.3) reduces to
3.3 Application
As we remarked below (3.3), (2.13) resembles (3.3), the only difference being the factor \(1+\alpha \) instead of 1 in front of \(I^\mathrm {ann}(Q) = H(Q \mid q_0^{\otimes {\mathbb {N}}})\). Therefore, repeating the above steps and recalling (2.14), we conclude that
It follows from (2.10), (2.12) and (3.9) that the condition \({\mathcal {N}}_\alpha > 1\) is equivalent to
i.e., \(\frac{1}{1+\alpha }h < h_c^\mathrm {ann}(\frac{1}{1+\alpha }\beta )\), which by (3.12) implies \(\textsc {f}^\mathrm {que}(\beta ,h) > 0\), i.e., \(h < h_c^\mathrm {que}(\beta )\). This completes the proof of the lower bound in (1.43).
Recalling (2.3), (2.10) and the function \(\psi _{\beta ,h}\) defined in (1.21), we may write \({\mathcal {N}}_0\) as
We will analyse this expression in Sect. 4 in the weak interaction limit \(\beta ,h\downarrow 0\) for the Bessel random walk.
Note that for \(h=0\) the expression in (3.15) reduces to
By Jensen and the fact that \(\varphi \not \equiv 0\), this sum is \(>1\) for all \(\beta >0\). Hence \(F^\mathrm {ann}(\beta ,0)>0\) for all \(\beta >0\), which settles the claim made at the end of Sect. 1.6.
4 Proof of Theorems 1.6–1.8
To prove our scaling results for weak interaction, we will exploit the invariance principle in (1.37). Recall (1.20)–(1.21). Since \(\Vert \psi _{\beta ,h}\Vert _\infty \) tends to zero as \(\beta ,h \downarrow 0\), we can do a weak coupling expansion in the spirit of Caravenna et al. [15].
The proof is long and technical. In Sect. 4.1 we outline the general strategy, which leads to three tasks. These tasks are carried out in Sects. 4.2–4.4, respectively.
4.1 General strategy
Fix \({{\hat{\beta }}}, {{\hat{h}}} \in (0,\infty )\) and \(\beta _N, h_N\) such that, as \(N\rightarrow \infty \),
and
We will prove that, for any \(T \in (0,\infty )\),
(for ease of notation we pretend that TN is integer), where we set
We recall that the renormalized Dirac-function \({\hat{{\delta }}}_0(\cdot )\) is the notation introduced in (1.36) to make the time integral \(\int _0^T {\hat{\psi }}_{{\hat{\beta }},{\hat{h}}}(X_t)\,\text {d}t\) in (4.3) well-defined.
Once the convergence in (4.3) is established, Theorems 1.6–1.8 follow. Indeed, recalling the definitions of \({\hat{F}}^\mathrm {ann}\) in (1.47), (1.49) and (1.51), respectively, we can write
where the interchange of the limits \(T\rightarrow \infty \) and \(N\rightarrow \infty \) is justified in Appendix A.4.
To prove (4.3), we write \(Z^\mathrm {ann}_{TN,\beta _N,h_N}\) as a series, recall (1.20). Since the potential \(\varphi \) is assumed to be symmetric, i.e., \(\varphi (-x) = \varphi (x)\) for all \(x\in {\mathbb {Z}}\), we can write \(\psi _{\beta ,h}(S_n) = \psi _{\beta ,h}(|S_n|)\), and hence
with
We can write a similar decomposition for \({\hat{Z}}^\mathrm {ann}_{T,{\hat{\beta }},{\hat{h}}}\), namely,
with
Formally, if we rewrite the k-th power of the integral as a k-fold integral and afterwards switch the integral and the expectation, then we get the expression
with \(t_0 := x_0 := 0\). This is justified by Fubini’s theorem in the first regime in (4.4), because \({\hat{\psi }}_{{\hat{\beta }},{\hat{h}}}(x)\) is a genuine function (see (4.4)). However, the second and third regime in (4.4) are more delicate because \({\hat{\psi }}_{{\hat{\beta }},{\hat{h}}}(x)\) contains the formal term \({\hat{{\delta }}}_0(x)\). We claim that (4.10) holds in these regimes as well, provided we use the interpretation
where \({{\hat{g}}}\) is the function defined in (1.34) and \({\delta }_0\) is the usual Dirac measure at zero, both of which are proper. We prove (4.10) in Sect. 4.2 below.
Remark 4.1
In the third regime, both terms in \({\hat{\psi }}_{{\hat{\beta }},{\hat{h}}}(x)\) contain \({\hat{{\delta }}}_0(x)\) (see (4.4)). Therefore (4.10) can be simplified, by the recipe in (4.11), to give
where the last equality can be seen to hold by the normalisation constant of the Dirichlet distribution (see also (B.9)). Therefore (4.10) is delicate only in the second regime.
We will prove in Sect. 4.3 that the series in (4.6) and (4.8) have negligible tails:
We will show in Sect. 4.4 that
The last two equations combine to yield (4.3) and complete the proof. \(\square \)
4.2 Proof of (4.10)
For \(\varepsilon > 0\) we define a genuine function \({\hat{{\delta }}}_0^\varepsilon (x)\) that is meant to approximate \({\hat{{\delta }}}_0(x)\) as \(\varepsilon \downarrow 0\) (recall (1.35)–(1.36)):
We also define an approximate version \({\hat{\psi ^\varepsilon }}(x)\) of \({{\hat{\psi }}}(x)\) in (4.4) by setting (suppressing the dependence on \({\hat{\beta }},{\hat{h}}\))
so that
Below we prove that this convergence also holds in \(L^k\), \(k\in {\mathbb {N}}\). Then, via (4.9), we can write
which coincides precisely with our target Eq. (4.10) with the recipe in (4.11) via the characterization of \({{\hat{g}}}\) in (1.33)–(1.34) (note that \(x \mapsto g_t(x,y)\) is continuous on \([0,\infty )\)).
To prove that (4.17) holds in \(L^k\), \(k\in {\mathbb {N}}\), it is enough to show that all moments of \(\int _0^T {\hat{\psi ^\varepsilon }}(X_t) \, \text {d}t\) are uniformly bounded:
Since \({\hat{\psi ^\varepsilon }}(x)\) is a genuine function, Fubini’s theorem tells us that, for every \(\varepsilon > 0\),
Combining (4.4) abd (4.15)–(4.16), we can bound \(|{\hat{\psi ^\varepsilon }}(x)| \le {\bar{\psi }}^\varepsilon (x) := C_1 \, {\hat{{\delta }}}_0^\varepsilon (x) + C_2 \, |x|^{-\gamma }\) for suitable constants \(C_1, C_2\) and \(\gamma \in \{\vartheta ,2\vartheta \} < 2(1-\alpha )\). Since \({\bar{\psi }}^\varepsilon \) is decreasing, and since \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_0(X_t \in \cdot )\) is stochastically dominated by \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_x(X_t \in \cdot )\) for any \(x \ge 0\), we can bound
where the last inequality holds by (1.31)–(1.32), for a suitable (and explicit) \(C'_2 < \infty \) (note that \({\hat{{{\,\mathrm{\mathrm{E}}\,}}}}_0[|X_{t_\ell - t_{\ell -1}}|^{-\gamma }] < \infty \) because \(\gamma < 2(1-\alpha )\)). The last expression is a finite constant, and so we have completed the proof of (4.10). \(\square \)
4.3 Proof of (4.13)
The second inequality in (4.13) follows from the bounds in (4.18)–(4.21), namely, for some constant \(C = C(T) < \infty \) (recall that \(\gamma < 2(1-\alpha )\))
(see also Appendix B.1 ).
To prove the first inequality in (4.13), we recall that, by (4.7),
In the sequel, \(C, C' < \infty \) denote absolute constants, possibly depending on \(T, {{\hat{\beta }}},{{\hat{h}}}\), that may change from line to line. Since \(\beta _N, h_N \downarrow 0\), it follows from (1.21) that for \(N\rightarrow \infty \) (recall that \(\varphi \) is bounded):
Since \(\lim _{|x|\rightarrow \infty } |x|^\vartheta \varphi (x) = c \in (0,\infty )\) by (1.44), we can bound
Next, recall the uniform upper bound in (1.42). For any \(\gamma \ne 2(1-\alpha )\) we can bound
(where \(a \wedge b := \min \{a,b\}\)). Indeed,
-
for \(\gamma > 2(1-\alpha )\) we drop the exponential and note that \(\sum _{k\in {\mathbb {Z}}} (1+|k|)^{1-2\alpha -\gamma } < \infty \);
-
for \(\gamma < 2(1-\alpha )\) a Riemann sum approxmation shows that the sum can be bounded by a multiple of \(n^{1-\alpha -\gamma /2}\) (note that \(\int _0^\infty x^{1-2\alpha -\gamma } \, \mathrm {e}^{-x^2/8} \, \text {d}x < \infty \)).
It follows from (4.25)–(4.26) that
Setting \(n = Nt\) with \(t \in (0,T]\), and recalling that \(\beta _N\), \(h_N\) are given in (4.1)–(4.2), we obtain
Next, by (4.23) we can bound
because the function \({\bar{\chi }}_{\beta _N,h_N}(|x|)\) is decreasing in |x| and the law \({{\,\mathrm{\mathrm{P}}\,}}(|S_n| \in \cdot \,|\, |S_{n'}| = m)\) stochastically dominates \({{\,\mathrm{\mathrm{P}}\,}}(|S_n| \in \cdot \,|\, S_{n'} = 0) = {{\,\mathrm{\mathrm{P}}\,}}(|S_{n-n'}| \in \cdot )\) for all \(m \ge 0\) (as can be seen via a coupling argument). Then, setting \(n_\ell = N t_\ell \), via (4.28) we get
This is the same bound as in (4.22), and hence the first inequaltiy in (4.13) is proved.\(\square \)
4.4 Proof of (4.14)
Intuitively, we can get (4.14) by substituting the asymptotic relation (4.24) into the definition (4.23) of \(C_{TN,k}\) and performing a Riemann sum approximation, recalling (1.44). However, some care is required to properly implement this strategy.
1. Recall from (4.24) that \(\chi _{\beta _N,h_N}(x) \sim \frac{\beta _N^2}{2} \, \varphi (x)^2 - h_N \, \varphi (x)\) as \(N\rightarrow \infty \), and by (1.44) the terms \(\varphi (x)^2\) and \(\varphi (x)\) decay polynomially as \(|x|^{-\gamma }\) as \(|x|\rightarrow \infty \), with \(\gamma = 2\vartheta \) and \(\gamma = \vartheta \), respectively. Fix \(\varepsilon > 0\) small and introduce an approximation \(\chi ^\varepsilon _{\beta _N,h_N}(x)\) of \(\chi _{\beta _N,h_N}(x)\) in which each term is restricted to a relevant range: either \(|x| \approx 1\) (more precisely, \(|x| \le \frac{1}{\varepsilon }\)) or \(|x| \approx \sqrt{N}\) (more precisely, \(\varepsilon \sqrt{N} \le |x| \le \frac{1}{\varepsilon } \sqrt{N}\)), depending on the decay exponent \(\gamma \). Indeed, the bound in (4.26) on \({{\,\mathrm{\mathrm{E}}\,}}\big [ (1+ |S_N|)^{-\gamma } \big ] \le \) tells us that
-
for \(\gamma > 2(1-\alpha )\) the relevant contribution comes from \(|x| = |S_N| \approx 1\);
-
for \(\gamma < 2(1-\alpha )\) the relevant contribution comes from \(|x| = |S_N| \approx \sqrt{N}\).
This motivates the following definition:
We correspondingly define an approximation \(C^\varepsilon _{TN,k}\) of \(C_{TN,k}\) (see (4.23)):
This allows us to split the proof of (4.14) in two parts: For fixed \(k\in {\mathbb {N}}\),
2. The first claim in (4.33) is a consequence of the bounds that we have derived in Sect. 4.3 for the proof of (4.13). Indeed, we can use a telescopic argument based on the bound
to replace \(a_\ell := \chi _{\beta _N,h_N}(S_{n_\ell })\) by \(b_\ell := \chi ^\varepsilon _{\beta _N,h_N}(S_{n_\ell })\) in the definition (4.23) of \(C_{TN,k}\). Note that both \(|a_\ell |\) and \(|b_\ell |\) are bounded by \({\bar{\chi }}_{\beta _N,h_N}(|S_{n_\ell }|)\) defined in (4.25). Similarly, we can bound \(|a_i - b_i| = |(\chi _{\beta _N,h_N} - \chi ^\varepsilon _{\beta _N,h_N})(S_{n_i})| \le \Delta {\bar{\chi }}^\varepsilon _{\beta _N,h_N} (|S_{n_i}|)\), where we define \(\Delta {\bar{\chi }}^\varepsilon _{\beta _N,h_N}(|x|)\) to be \({\bar{\chi }}_{\beta _N,h_N}(|x|)\) in (4.25) with the terms \((1+ |x|)^{-2\vartheta }\) and \((1+ |x|)^{-\vartheta }\) replaced as follows:
This leads to
3. Recall the bound (4.27) for \({{\,\mathrm{\mathrm{E}}\,}}[{{\bar{\chi }}}_{\beta _N,h_N}(|S_{n}|)]\). A similar but improved bound holds for \({{\,\mathrm{\mathrm{E}}\,}}\big [\Delta {\bar{\chi }}^\varepsilon _{\beta _N,h_N}(|x|)\big ]\), namely,
Indeed, by (4.26) and the lines following it, we can choose
Consequently, arguing as in (4.28)–(4.29), we get the following improvement of (4.30):
(set \(n_0 = 0\), \(x_0 = 0\)), from which the first equation in (4.33) readily follows. With similar arguments we can show that in (4.32) we can further restrict the sum to \(n_\ell - n_{\ell - 1} \ge \varepsilon N\) for all \(\ell =1,\ldots , k\), with a negligible error as \(N\rightarrow \infty \) followed by \(\varepsilon \downarrow 0\) (we omit the details). We can thus update the definition (4.32) of \(C^\varepsilon _{TN,k}\), where we also sum over space variables:
4. We are left with proving the second claim in (4.33). To do so, we distinguish between the three regimes in (4.31).
\(\bullet \) First regime. In the regime \(\vartheta \in (0,1-\alpha )\), the function \(\chi ^\varepsilon _{\beta _N,h_N}(x_\ell )\) restricts all variables \(x_\ell \) in the diffusive range \(\{\varepsilon \sqrt{N} \le x \le \frac{1}{\varepsilon } \sqrt{N}\}\) (“intermediate heights”). This allows us to apply the local limit theorem in (1.40), which we may rewrite as follows: Uniformly in \(t \in [\varepsilon , T]\) and \(\varepsilon \le z', z \le 1/\varepsilon \),
where we recall that \(g_t(z',z) = g_1(z'/\sqrt{t}, z/\sqrt{t})/\sqrt{t}\). (For notational simplicity, we will ignore the parity constraint and remove the compensating factor 2.) Note that \(\varphi (x) \sim c / x^\vartheta \) as \(x \rightarrow \infty \) (see (1.44)). Therefore, by (4.1)–(4.4) and (4.31), uniformly in \(\varepsilon \le z \le 1/\varepsilon \),
Introducing macroscopic space-time variables \(n_\ell = N t_\ell \), \(x_\ell = \sqrt{N} z_\ell \), we get from (4.40) that
When we let \(\varepsilon \downarrow 0\), the last integral converges to \({{\hat{C}}}_{TN,k}\) (see (4.10)). This proves the second claim in (4.33).
\(\bullet \) Third regime. In the regime \(\vartheta \in (2(1-\alpha ),\infty )\), the function \(\chi ^\varepsilon _{\beta _N,h_N}(x_\ell )\) restricts all variables \(x_\ell \) to the O(1) range \(\{x \le \frac{1}{\varepsilon } \}\) (“low heights”). This allows us to apply the local limit theorem in (1.38), which we may rewrite as follows: Uniformly in \(t \in [\varepsilon , T]\) and \(0 \le x', x \le 1/\varepsilon \),
where we recall that \({{\hat{g}}}_1(x'/\sqrt{n},0) \sim {{\hat{g}}}_1(0,0) = 1\) (see (1.34)). (We again ignore the parity constraint and remove the compensating factor 2.) Moreover, it follows from (4.1)–(4.2) and (4.31) that uniformly for \(x \le 1/\varepsilon \),
Introducing the macroscopic time variables \(n_\ell = N t_\ell \), while keeping the microscopic space variables \(x_\ell \), we get by (4.40) that
When we let \(\varepsilon \downarrow 0\), the term in brackets converges to
Hence \(C^\varepsilon _{TN,k} \) converges to \({{\hat{C}}}_{TN,k}\) (see (4.12)). This proves the second claim in (4.33).
\(\bullet \) Second regime. In the regime \(\vartheta \in (1-\alpha , 2(1-\alpha ))\), the function \(\chi ^\varepsilon _{\beta _N,h_N}(x_\ell )\) is the sum of two terms with different restrictions, namely, \(\{x \le \frac{1}{\varepsilon } \}\) and \(\{\varepsilon \sqrt{N} \le x \le \frac{1}{\varepsilon } \sqrt{N}\}\). Expanding the product over \(\chi ^\varepsilon _{\beta _N,h_N}( x_\ell )\) in (4.40), we obtain \(2^k\) different terms, where every variable \(x_\ell \) is subject to one of the two restrictions. Each of these terms can be analysed by arguing precisely as in the previous regimes:
-
for \(\{\varepsilon \sqrt{N} \le x_\ell \le \frac{1}{\varepsilon } \sqrt{N}\}\) we apply the local limit theorem in (4.41), introducing the macroscopic space variable \(z_\ell \in [0,\infty )\) defined by \(x_\ell = \sqrt{N} z_\ell \);
-
for \(\{x_\ell \le \frac{1}{\varepsilon } \}\) we apply the local limit theorem in (4.44), keeping the microscopic variable \(x_\ell \in {\mathbb {N}}_0\).
The choice of \(\beta _N\), \(h_N\) in (4.1)–(4.2) ensure that the appropriate Riemann factor appears, so that the sum defining \(C^\varepsilon _{NT,k}\) converges to the integral in (4.10) as \(N\rightarrow \infty \) followed by \(\varepsilon \downarrow 0\). This integral defines \({{\hat{C}}}_{NT,k}\), with the recipe in (4.11). We omit the details, which are notationally tedious because of the need to specify the restriction for each variable \(x_\ell \), but are ultimately straightforward.
Notes
Formally, the squared Bessel process \((Y_t)_{t \ge 0}\) is defined by \(\text {d}Y_t = 2\sqrt{Y_t}\,\text {d}B_t + 2(1-\alpha )\,\text {d}t\) on \([0,\infty )\) with reflection at 0, and \((X_t)_{t \ge 0}\) is defined by setting \(X_t=\sqrt{Y_t}\).
References
Alexander, K.S.: The effect of disorder on polymer depinning transitions. Commun. Math. Phys. 279, 117–146 (2008)
Alexander, K.S.: Excursions and local limit theorems for Bessel-like random walks. Electron. J. Probab. 16, 1–44 (2011)
Alexander, K.S., Zygouras, N.: Quenched and annealed critical points in polymer pinning models. Commun. Math. Phys. 291, 659–689 (2009)
Berger, Q., Caravenna, F., Poisat, J., Sun, R., Zygouras, N.: The critical curve of the random pinning and copolymer models at weak coupling. Commun. Math. Phys. 326, 507–530 (2014)
Berger, Q., Giacomin, G., Lacoin, H.: Disorder and critical phenomena: the \(\alpha =0\) copolymer model. To appear in Probab. Theory Relat. Fields, arXiv:1712.02261
Berger, Q., Lacoin, H.: Pinning on a defect line: characterization of marginal disorder relevance and sharp asymptotics for the critical point shift. J. Inst. Math. Jussieu 17, 305–346 (2018)
Birkner, M.: Conditional large deviations for a sequence of words. Stoch. Proc. Appl. 118, 703–729 (2008)
Birkner, M., Greven, A., den Hollander, F.: Quenched LDP for words in a letter sequence. Probab. Theory Relat. Fields 148, 403–456 (2010)
Bodineau, T., Giacomin, G.: On the localization transition of random copolymers near selective interfaces. J. Stat. Phys. 117, 17–34 (2004)
Bolthausen, E., den Hollander, F.: Localization transition for a polymer near an interface. Ann. Probab. 25, 1334–1366 (1997)
Bolthausen, E., den Hollander, F., Opoku, A.A.: A copolymer near a selective interface: variational characterization of the free energy. Ann. Probab. 43, 875–933 (2015)
Caravenna, F., Giacomin, G.: The weak coupling limit of disordered copolymer models. Ann. Probab. 38, 2322–2378 (2010)
Caravenna, F., Giacomin, G., Toninelli, F.L.: Copolymers at selective interfaces: settled issues and open problems. In: Probability in Complex Physical Systems, vol. 11, Springer Proceedings in Mathematics, pp. 289–311 (2012)
Caravenna, F., den Hollander, F.: A generalized smoothing inequality for disordered polymers. Electr. Commun. Probab. 18(76), 1–15 (2013)
Caravenna, F., Sun, R., Zygouras, N.: Polynomial chaos and scaling limits of disordered systems. J. Eur. Math. Soc. 19, 1–65 (2017)
Caravenna, F., Toninelli, F.L., Torri, N.: Universality for the pinning model in the weak coupling regime. Ann. Probab. 45, 2154–2209 (2017)
Cheliotis, D., den Hollander, F.: Variational characterization of the critical curve for pinning of random polymers. Ann. Probab. 41, 1767–1805 (2013)
Koralov, M., Cranston, L., Molchanov, S., Vainberg, B.: Continuous model for homopolymers. J. Funct. Anal. 256, 2656–2696 (2009)
Dembo, A., Zeitouni, O.: Large Deviations Techniques and Applications, 2nd edn. Springer, Berlin (1998)
Derrida, B., Giacomin, G., Lacoin, H., Toninelli, F.L.: Fractional moment bounds and disorder relevance for pinning models. Commun. Math. Phys. 287, 867–887 (2009)
Giacomin, G.: Random Polymer Models. Imperial College Press, London (2007)
Giacomin, G.: Disorder and critical phenomena through basic probability models. École d’Été de Probabilités de Saint-Flour XL-2010. Lecture Notes in Mathematics, vol. 2025. Springer (2011)
Giacomin, G., Lacoin, H., Toninelli, F.L.: Marginal relevance of disorder for pinning models. Commun. Pure Appl. Math. 63, 233–265 (2010)
den Hollander, F.: Random polymers. École d’Été de Probabilités de Saint-Flour XXXVII-2007. Lecture Notes in Mathematics, vol. 1974. Springer (2009)
Lacoin, H.: Influence of spatial correlation for directed polymers. Ann. Probab. 39, 139–175 (2011)
Lamperti, J.: A new class of probability limit theorems. J. Math. Mech. 11, 749–772 (1962)
Revuz, D., Yor, M.: Continuous Martingales and Brownian Motion, 3rd edn. Springer, Berlin (1999)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The research in this paper was supported by ERC Advanced Grant 267356 VARIS and NWO Gravitation Grant 024.002.003 NETWORKS.
Appendices
Appendix A: Equivalence
In “Appendices A.1–A.3” we prove (1.16), (1.14) and (1.25), respectively. In “Appendix A.4” we show that the limits \(N\rightarrow \infty \) and \(T\rightarrow \infty \) in (4.5) may be interchanged.
1.1 A.1. Proof of (1.16)
For \(N\in {\mathbb {N}}_0\) and \(x \in {\mathbb {N}}_0\), define
(If S has period 2, then replace N by 2N.) By superadditivity,
For \(z \in [0,\infty )\) and \(x \in {\mathbb {N}}_0\), define
Then, clearly,
By (1.6), we have \(C_0=0\). Below we show that this implies \(C_x=0\,\,\forall \, x\in {\mathbb {N}}_0\). The proof is by induction on x.
Any path that starts at x, ends at x and does not go below x can be cut into pieces that zig–zag between x and \(x+1\) and pieces that start at \(x+1\), end at \(x+1\) and do not go below \(x+1\). Hence we have
where \(p_{x,x+1}:={{\,\mathrm{\mathrm{P}}\,}}(S_1=x+1 \mid S_0=x)\). We know that \(F_0(1+\varepsilon ) = \infty \,\,\forall \, \varepsilon >0\). We show that this implies \(F_x(1+\varepsilon ) = \infty \,\,\forall \,\varepsilon >0\) for every \(x\in {\mathbb {N}}\). The proof is by induction on x.
Fix \(x\in {\mathbb {N}}_0\) and suppose that \(F_x(1+\varepsilon ) = \infty \,\,\forall \,\varepsilon >0\). We argue by contradiction. Suppose that \(F_{x+1}(1+\varepsilon )<\infty \) for \(\varepsilon \) small enough. Because \(r_x>0\) and \(Q_x(1+\varepsilon )<\infty \) for \(\varepsilon \) small enough, it follows that \(F_{x+1}(1+\varepsilon ) Q_x(1+\varepsilon ) \ge 1\,\,\forall \,\varepsilon >0\), and by continuity that \(F_{x+1}(1)Q_x(1) \ge 1\). To get the contradiction it therefore suffices to show that \(F_{x+1}(1)Q_x(1)<1\). Now, because \((S_n)_{n\in {\mathbb {N}}_0}\) is recurrent, we have \(F_{x+1}(1) = p_{x+1,x+2}\). Because \(Q_x(1) = 1/(1-r_x)\), it follows that
where for both the last equality and the inequality we again use recurrence.
To prove (1.16), note that for any \(0 \le M \le N\),
Hence
Since \(C_M = 0\) for all \(M\in {\mathbb {N}}_0\), this implies (1.16).
1.2 A.2. Proof of (1.14)
Note that \(Z_{N,\beta ,h}^{\omega ,\mathrm {c}} \le Z_{N,\beta ,h}^\omega \) for all \(N\in {\mathbb {N}}_0\). Hence we only need to show that \(Z_{N,\beta ,h}^{\omega } \le \mathrm {e}^{o(N)}\,Z_{N,\beta ,h}^{\omega ,\mathrm {c}}\) \({\mathbb {P}}\)-a.s. Define the constrained partition function
Then
By (1.1), for every \(\varepsilon >0\) there is an \(M \in {\mathbb {N}}_0\) such that
where we abbreviate \(\gamma ^\pm = \lim _{x\rightarrow \pm \infty } \varphi (x)\) and recall that \(\gamma ^+=0\), \(\gamma ^- \in [0,\infty )\) according to (1.1). For \(N \ge M\), split
with
We will show that all three terms are bounded by \(\mathrm {e}^{o(N)}Z_{N,\beta ,h}^{\omega ,0,0}\). For ease of notation we will pretend that the Markov chain has period 1. This is easily fixed when the period is 2.
Consider \(\mathrm {I}\). Let \(\sigma _N = \max \{0 \le n \le N:\,S_n = M\}\), and split
Replace \((S_m,\ldots ,S_N)\) by an \((N-M-m)\)-step path from M to M that is everywhere \(\ge M\), followed by an M-step downward path \((M,\ldots ,0)\). The cost of this replacement is at most
for the weight factor, with \(\Omega _N(\omega ) = \max _{1 \le n \le N} \omega _n\), and at most
for the path probability, with \(\Pi _M := \prod _{x=M}^1 p_{x,x-1}\). The probability in the denominator equals \(f_M(N-M-m)\), and so we get
Since M is fixed, \(\Pi _M>0\), \(\Vert \varphi \Vert _\infty <\infty \), \(\Omega _N = O(\log N)\) \({\mathbb {P}}\)-a.s. and \(\max _{1 \le n \le N} 1/f_M(n) = \mathrm {e}^{o(N)}\), we get \(\mathrm {I} = \mathrm {e}^{o(N)} Z_{N,\beta ,h}^{\omega ,0,0}\) \({\mathbb {P}}\)-a.s.
The argument for \(\mathrm {II}\) is similar. For \(\mathrm {III}\) we replace \((S_{N-M},\ldots ,S_N)\) by an M-step path from \(S_{N-M}\) to 0 at a finite cost (depending on M and \(\Omega _N\)).
1.3 A.3. Proof of (1.25)
Copy the argument in “Appendix A.2” starting from the analogue of (A.9):
Use that \(\Vert \psi _{\beta ,h}\Vert _\infty < \infty \), and that for every \(\varepsilon >0\) there is an \(M \in {\mathbb {N}}_0\) such that
with \(\eta ^\pm = \psi _{\beta ,h}(\gamma ^\pm ) = \log M(\beta \gamma ^\pm )-h\gamma ^\pm \) (recall (1.21)).
1.4 A.4. Interchange of limits
The following two lemmas, which give us a sandwich for the annealed free energies in the discrete and in the continuous model, are the key to showing that the limits in (4.5) may be interchanged.
Lemma 4.2
For \(M,N\in {\mathbb {N}}_0\), \(\beta \in (0,\infty )\) and \(h \in {\mathbb {R}}\), define
Assume that \(\lim _{|x| \rightarrow \infty } \varphi (x) = 0\) (recall (1.44)). Then, for all \(\beta \in (0,\infty )\) and \(h \in {\mathbb {R}}\),
Proof
Note that \(Z^{\mathrm {ann},-}_{M,N,\beta ,h} \le Z^{\mathrm {ann},+}_{N,\beta ,h}\) for all \(M,N \in {\mathbb {N}}_0\). Hence we only need to show that \(Z^{\mathrm {ann},+}_{M,N,\beta ,h} \le \mathrm {e}^{o(N)} Z^{\mathrm {ann},-}_{N,\beta ,h}\) as \(N\rightarrow \infty \) for all \(M\in {\mathbb {N}}_0\). Indeed, this will imply (A.21) because \(F^\mathrm {ann}(\beta ,h) = \lim _{N\rightarrow \infty } \frac{1}{N} \log Z^{\mathrm {ann},0,0}_{N,\beta ,h}\), as proved in Appendix A.3. In what follows we fix \(\beta ,h\) and abbreviate \({\mathcal {H}}_N(S):=\sum _{n=1}^N \psi _{\beta ,h}(S_n)\).
Recall (A.19), where \(\eta ^\pm = 0\) because \(\gamma ^\pm = \lim _{x \rightarrow \pm \infty } \varphi (x) = 0\) by assumption. Define \(\tau _M:= \min \{n\in {\mathbb {N}}_0:\,|S_n| \le M\}\) and split
with
For \(x \ge M\) and \(x \le -M\),
Combining (A.22) and (A.24), we obtain
For every \(|x|, |y| \le M\) we have
with \(\Pi _M = \prod _{x=-M}^{M-1} p_{x,x+1} \wedge \prod _{x=M}^{-M+1} p_{x,x-1}\). Since \(\Pi _M> 0\), it follows that
As shown in Appendix A.3,
Since \(N \mapsto \log Z^{\mathrm {ann},0,0}_{N,\beta ,h}\) is super-additive, it follows that
Combining (A.27) and (A.29), we obtain
But
Appealing once more to (A.26), we arrive at
which proves the claim because \(\varepsilon >0\) is arbitrary. \(\square \)
Lemma 4.3
For \(S,T \ge 0\), \({\hat{\beta }} \in (0,\infty )\) and \({\hat{h}} \in {\mathbb {R}}\), define
with \({\hat{\psi }}_{{\hat{\beta }},{\hat{h}}}\) defined in (4.4). Then, for all \({\hat{\beta }} \in (0,\infty )\) and \({\hat{h}} \in {\mathbb {R}}\),
Proof
The proof is similar to that of Lemma 4.2. \(\square \)
What makes Lemma 4.2 useful is that \(N \mapsto \log Z^{\mathrm {ann},-}_{M,N,\beta ,h}\) is superadditive for every \(M \in {\mathbb {N}}_0\), while \(N \mapsto \log Z^{\mathrm {ann},+}_{N,\beta ,h}\) is subadditive. Consequently,
Let A, B be the pair of exponents appearing in part (a) of Theorems 1.6–1.8, i.e.,
Fix \(T,S \in (0,\infty )\), in (A.35) replace N by TN, and pick \(M=S\sqrt{TN}\), \(\beta ={\hat{\beta }}N^{-A}\), \(h={\hat{h}}N^{-B}\), to obtain
Let \(N\rightarrow \infty \) to obtain
Here, the fact that
follows from the same argument as used in Section 4 to prove that
In particular, the scaling \(M=S\sqrt{TN}\) fits well with the invariance principle in (1.37). Finally, let \(T\rightarrow \infty \) in (A.38) and use Lemma 4.3, to obtain
Appendix B: Properties of the annealed scaling limit
In this appendix we show that the annealed partition functions and the corresponding annealed free energies encountered in Theorems 1.6–1.8 are finite in each of the three regimes. We also give an explicit characterization of the annealed free energy and the annealed critical curve in the regime where \(\vartheta \in (2(1-\alpha ),\infty )\).
1.1 B.1. Finite free energies for the Bessel process
For \(\mu ,T,\gamma \in [0,\infty )\), define
We show that, for \(0<\gamma < 2(1-\alpha )\), these quantities grow at most exponentially as \(T \rightarrow \infty \):
By Cauchy-Schwarz, this implies that all the free energies in Theorems 1.6–1.8 are finite.
We first focus on \({\hat{Z}}_{\mu ,T}\), which we rewrite as
with
We use the Markov property at times \(t_1,\ldots ,t_k\) to estimate
where the inequality holds because \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_0(X_t \in \cdot )\) stochastically dominates \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_x(X_t \in \cdot )\) for any \(x \ge 0\) (by a standard coupling argument and the fact that X is a Markov process with continuous paths) and because \(x \mapsto x^{-\gamma }\) is non-increasing. We thus obtain
with \(t_0=0\). By diffusive scaling we have \({\hat{{{\,\mathrm{\mathrm{E}}\,}}}}_0[X_t^{-\gamma }] = t^{-\gamma /2}C\), with \(C={\hat{{{\,\mathrm{\mathrm{E}}\,}}}}_0[X_1^{-\gamma }]<\infty \) for \(0<\gamma <2(1-\alpha )\) (recall (1.31)). The change of variables \(t_k = T s_k\) yields
where, for \(\vartheta \in (0, 1)\),
with \(s_0=0\). Since
with \(u_0=1\) and \(u_k=1\) is the normalization of the Dirichlet distribution, after setting \(s_i = s_k \, u_i\) for \(i=1,\ldots , k-1\) in (B.8) we obtain
where we have used Stirling’s bound \(\Gamma (x+1) \ge \mathrm {e}^{x (\log x-1)}\). Substitute \(\vartheta = \gamma /2\) and set
to obtain
We now set \(x := k/T\) and note by direct computation that, with \({{\bar{x}}} := A^{-1} \mathrm {e}^{B/A}\),
Therefore the leading contribution to the sum in (B.12) is given by \(k \approx {{\bar{x}}} T\) (more precisely, the sum restricted to \(0 \le k < 2 {{\bar{x}}} T\) is at most \( 2 {{\bar{x}}} \, T \, \exp (T \, \mathrm {e}^{B/A})\), while the contribution of the remaining terms with \(k \ge 2{{\bar{x}}} T\) is negligible). It follows that
We next focus on \({\tilde{Z}}_{\mu ,T}\), which equals
with
We recall from (1.35) that we can write \({\hat{L}}_T(0) = \lim _{\varepsilon \downarrow 0} {\hat{L}}_T^\varepsilon (0)\) in probability, where
Let us focus on \({\hat{L}}_T^\varepsilon (0)\) for a moment. By explicit computation, for any \(k\in {\mathbb {N}}\) (with \(t_0 := 0\))
where the first inequality holds because \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_0 (X_t \in \cdot )\) stochastically dominates \({\hat{{{\,\mathrm{\mathrm{P}}\,}}}}_x(X_t \in \cdot )\) for any \(x \ge 0\), while the last inequality holds by (1.32) and the definition (B.8) of \(I_k(\cdot )\). This shows that, as \(\varepsilon \downarrow 0\), \({\hat{L}}_T^\varepsilon (0)\) is uniformly bounded in \(L^k\), for any \(k\in {\mathbb {N}}\). By uniform integrability, we can therefore exchange \(\lim _{\varepsilon \downarrow 0}\) and \({\hat{{{\,\mathrm{\mathrm{E}}\,}}}}\) to get, recalling (B.16),
where the last equality holds because the inequality in (B.18) becomes sharp as \(\varepsilon \downarrow 0\). Hence
The steps in (B.10)–(B.14) (with \(\vartheta = 1-\alpha \) instead of \(\vartheta =\gamma /2\)) show that not only \({\tilde{Z}}_{\mu ,T}<\infty \) for all \(\mu ,T\), but also \(\lim _{T\rightarrow \infty } \frac{1}{T} \log {\tilde{Z}}_{\mu ,T} < \infty \) for all \(\mu \).
1.2 B.2. Formula for the annealed free energy
To compute the annealed free energy \({\hat{F}}({\hat{\beta }},{\hat{h}})\) in the regime \(\vartheta \in (2(1-\alpha ),\infty )\), we use the first line in (B.20) to compute the Laplace transform of \({\tilde{Z}}_{\mu ,T}\). Writing \(\mathrm {e}^{-\lambda T} = (\prod _{\ell =1}^k \mathrm {e}^{-\lambda (t_\ell - t_{\ell -1})})\, \mathrm {e}^{-\lambda (T-t_k)}\) for \(\lambda \ge 0\), we get
Hence
which proves (1.57).
Appendix C: Localization criterion for the annealed model
In this appendix we take a closer look at the criterion in (1.62) and show that it does not depend on the starting point of the walk. For the purpose of this appendix, let \(S = (S_n)_{n\in {\mathbb {N}}_0}\) be any recurrent Markov chain on a countable space E, and let \(\psi :\, E \rightarrow {\mathbb {R}}\) be an arbitrary function. Denote by \(\tau ^x := \min \{n\in {\mathbb {N}}:\, S_n = x\}\) the first return time of S to x. Define
We will prove the following property:
It is convenient to introduce the following shorthand notation, for (possibly random) \(\sigma \in {\mathbb {N}}\):
so that we may simply write \(A_x = {{\,\mathrm{\mathrm{E}}\,}}_x [\mathrm {e}^{{\mathcal {H}}(\tau ^x)}]\). Given an arbitrary y, we can split this expected value according to the two complementary events \(\{\tau ^x < \tau ^y\}\) and \(\{\tau ^y < \tau ^x\}\):
The second term can be expanded by summing over all visits of S to y that precede the first return to x. If we define
then by the strong Markov property we get
with the convention that \(A_x = \infty \) if \(B_{yx} \ge 1\). Exchanging the roles of x and y, we get
We are now ready to prove (C.2). Fix x, y. We show that if \(A_x \le 1\), then also \(A_y \le 1\). To simplify the notation, we abbreviate \(b := B_{xy}\), \(b' := B_{yx}\) and \(c := C_{xy} C_{yx}\), so that
with the convention that the ratios equal \(\infty \) if \(b' \ge 1\), respectively, \(b \ge 1\). Assume that \(A_x \le 1\). Then we must have \(b' < 1\) and the formula \(A_x = b + \frac{c}{1-b'}\) applies, which shows that also \(b < 1\) (because \(c > 0\)). Hence we can write
i.e., \(A_y \le 1\).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Caravenna, F., den Hollander, F. Phase transitions for spatially extended pinning. Probab. Theory Relat. Fields 181, 329–375 (2021). https://doi.org/10.1007/s00440-021-01068-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00440-021-01068-y
Keywords
- Charged polymers
- Spatially extended pinning
- Free energy
- Critical curve
- Phase transition
- Large deviation principle
- Variational representation