Abstract
Preferential attachment networks with power law exponent \(\tau >3\) are known to exhibit a phase transition. There is a value \(\rho _{\mathrm{c}}>0\) such that, for small edge densities \(\rho \le \rho _{\mathrm{c}}\) every component of the graph comprises an asymptotically vanishing proportion of vertices, while for large edge densities \(\rho >\rho _{\mathrm{c}}\) there is a unique giant component comprising an asymptotically positive proportion of vertices. In this paper we study the decay in the size of the giant component as the critical edge density is approached from above. We show that the size decays very rapidly, like \(\exp (-c/ \sqrt{\rho -\rho _{\mathrm{c}}})\) for an explicit constant \(c>0\) depending on the model implementation. This result is in contrast to the behaviour of the class of rank-one models of scale-free networks, including the configuration model, where the decay is polynomial. Our proofs rely on the local neighbourhood approximations of Dereich and Mörters (Ann Probab 41(1):329–384, 2013) and recent progress in the theory of branching random walks (Gantert et al. in Ann Inst Henri Poincaré Probab Stat 47(1):111–129, 2011).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction and Main Results
1.1 Introduction
Sparse random graph models typically undergo a phase transition in their connectivity behaviour depending on the mean number of edges per vertex. A typical case are the Erdős–Rényi graphs with n vertices and \(m=m(n)\) edges and asymptotic edge density . There exists a critical density \(\rho _{\mathrm{c}}=1\) such that if the edge density satisfies \(\rho \le \rho _{\mathrm{c}}\) the largest component in the graph comprises a vanishing proportion of vertices, whereas for \(\rho >\rho _{\mathrm{c}}\) this proportion \(\theta (\rho )\) is strictly positive. The behaviour of \(\theta (\rho )\) as \(\rho \downarrow \rho _{\mathrm{c}}\) is characterised by an exponent \(\beta \) defined by
which is \(\beta =1\) in the Erdős–Rényi case. A natural extension of the Erdős–Rényi model is the configuration model, which allows to construct random graphs with n vertices and a given degree sequence \(m_1,\ldots , m_n\). Of particular interest is the case when the degree sequence is heavy-tailed,
where the parameter \(\tau >2\) is the power-law exponent. The connectivity behaviour of the Erdős–Rényi model persists with \(\beta =1\) for the configuration model with \(\tau >4\), and with \(\beta =1/(\tau -3)\) if \(3<\tau <4\), see Cohen et al. [5]. The paper by Cohen is based on an informal approximation of local neighbourhoods in the graph by Galton–Watson trees and thereby extends to a wide range of scale-free network models, where similar approximations hold.
Cohen et al. [5] claim their result for scale-free networks in general without specifying a model. This reflects the belief that the behaviour observed in the configuration model extends to all natural scale-free network models including the class of preferential attachment networks. In the present paper however we show for the first time that preferential attachment networks have a qualitatively completely different behaviour than predicted in [5]. In fact, for all \(\tau >3\) the size of the giant component is decaying exponentially as one approaches the critical edge density. More precisely, we show that the relative size of the giant component in preferential attachment networks is
where c is an explicit constant depending on the way in which the edge density is controlled. This demonstrates once again that preferential attachment networks belong to a different universality class than the configuration model and other models based on rank-one connection probabilities.
The underlying phenomenon of the ‘small giant component’ or ‘slow emergence of the giant’ has first been discovered and discussed by Oliver Riordan in the seminal paper [17], and in collaboration with Bollobás and Janson in [3, 4]. Riordan [17] finds that for the original Barabási–Albert model subject to Bernoulli percolation with retention parameter p, one has
where \(m\ge 2\) is the outdegree of all vertices. As this model corresponds to the critical case \(\tau =3\) this is not at odds with the results of Cohen et al. [5]. The merit of our work is to extend the phenomenon of slow emergence to a regime where it is most surprising because it defies the predictions in [5].
In his proof, Riordan exploits the fact that there are local approximations of the network by multitype branching processes whose survival probability can be studied analytically by looking at the associated Laplace operators. These branching processes are more complex than those used in [5].
In the present paper local approximation is by a branching random walk with killing. Instead of the analytical techniques used by Riordan, which we could not apply effectively in our case due to the higher complexity of the approximating process, we use geometric properties of killed branching random walks. Studying their survival probabilities requires sophisticated techniques from the theory of branching random walks, which became available only in the past few years, see [1, 9]. Our results are based on the techniques developed in [9] and are therefore pleasingly probabilistic and highly timely.
1.2 Statement of Results
We start by describing the preferential attachment network introduced in [6], which gives scale-free networks with arbitrary power law exponent \(\tau >2\) by variation of a parameter.
A concave function \(f:{\mathbb {N}}_0 \rightarrow (0,\infty )\) is called an attachment rule if \(f(0)\le 1\) and
The maximal increment is denoted by \(\gamma ^+:=\sup \{\triangle f(k):k\in {\mathbb {N}}_0\}\). By concavity, f is non-decreasing, \(\gamma ^+=f(1)-f(0)\) and the limit \(\gamma :=\lim _{k \rightarrow \infty } {f(k)}/{k}\) exists and equals \(\gamma =\inf \{\triangle f(k):k\in {\mathbb {N}}_0\}\). Given an attachment rule f, we define a growing sequence \((\mathsf{{G}}_n :n\in {\mathbb {N}})\) of random graphs as follows
-
Start with the graph \(\mathsf{{G}}_1\) given by one vertex labelled 1 and no edges;
-
Given the graph \(\mathsf{{G}}_n\), we construct \(\mathsf{{G}}_{n+1}\) from \(\mathsf{{G}}_n\) by adding a new vertex labelled \(n+1\) and, for each \(m \le n\) independently, inserting the directed edge \((n+1,m)\) with probability
$$\begin{aligned} \frac{f(\text {indegree of } m \text { at time }n)}{n}. \end{aligned}$$
Formally, we are dealing with a sequence of directed graphs but all edges point from the younger to the older vertex. Hence, directions can be recreated from the undirected, labelled graph. For all structural questions, particularly regarding connectivity and the length of shortest paths, we regard \((\mathsf{{G}}_n :n\in {\mathbb {N}})\) as an undirected network. It is shown in [6] that when \(\gamma >0\) the networks have a degree distribution which is a power law with exponent We are also interested in the percolated version of the network \((\mathsf{{G}}_n:n \in {\mathbb {N}})\). For \(p \in [0,1]\), we write \(\mathsf{{G}}_n(p)\) for the graph obtained from \(\mathsf{{G}}_n\) by deleting each edge with probability \(1-p\) independent of all other edges.
Let \((\mathsf{{G}}_n:n \in {\mathbb {N}})\) be a sequence of (random or deterministic) graphs, where \(\mathsf{{G}}_n\) has n vertices. For \(n \in {\mathbb {N}}\), we denote by \(|\mathsf{{C}}_n|\) the size of the largest component in \(\mathsf{{G}}_n\). The network \((\mathsf{{G}}_n:n \in {\mathbb {N}})\) has a giant component if there exists a constant \(\theta >0\) such that
where the convergence holds in probability. The limit \(\theta \) is called the size of the giant component.
Dereich and Mörters showed in [7, Theorem 1.6] that when \(\gamma \ge \frac{1}{2}\), or equivalently \(2<\tau \le 3\), then \((\mathsf{{G}}_n(p) :n \in {\mathbb {N}})\) has a giant component for all \(p \in (0,1]\). When \(\gamma <\frac{1}{2}\), or equivalently \(\tau >3\), then there exists a critical percolation parameter \(p_{\mathrm{c}}>0\) such that \((\mathsf{{G}}_n(p) :n \in {\mathbb {N}})\) has giant component if and only if \(p>p_{\mathrm{c}}\). We denote the relative size of the giant component in \((\mathsf{{G}}_n(p) :n \in {\mathbb {N}})\) by \(\theta (p,f)\) and omit p or f from the notation when the percolation parameter or the attachment rule are fixed.
We are interested in the decay in the size of the giant component as we approach \(p_{\mathrm{c}}\) from above in the case \(p_{\mathrm{c}}>0\), or equivalently \(\gamma <\frac{1}{2}\). It was shown in Lemma 3.3 of [7] that the critical retention probability for the network \((\mathsf{{G}}_n :n \in {\mathbb {N}})\) is given by
where \(\rho (\cdot )\) is the spectral radius of the score operator, a function on a nonempty open interval, which we describe explicitly in Sect. 2, and \(\alpha ^*\) is the minimizer of \(\rho \).
Our first result shows the exponential decay of the size of the giant component of the percolated network, when the retention parameter approaches the critical value.
Theorem 1.1
Let f be an attachment rule with \(\gamma <\frac{1}{2}\) and \(\theta (1,f)>0\). Then
An alternative way of reducing the edge density and thereby destroying the giant component is to alter the attachment rule instead of percolating the network. For a linear attachment rule \(f(k)=\gamma k+\beta \) with \(\gamma <\frac{1}{2}\), Dereich and Mörters [7] show that there exists a giant component if and only if
Therefore, one could fix \(\gamma \) and decrease \(\beta \) to \(\beta _{\mathrm{c}}\). Another idea would be, for a given \(\beta \), to decrease \(\gamma \) until \(\beta =\beta _{\mathrm{c}}(\gamma )\). To analyse the behaviour of the size of the giant component under this procedure, let \((f_t)_{t \ge 0}\) be a sequence of attachment rules with \(\gamma _t:=\inf \{\triangle f_t(k):k \in {\mathbb {N}}_0\}<\frac{1}{2}\) for all \(t \ge 0\). We denote by \(\rho _t\) and \(\alpha _t^*\) the spectral radius and its minimizer corresponding to the score operator for the unpercolated branching random walk derived from attachment rule \(f_t\).
Theorem 1.2
Let \((f_t)_{t\ge 0}\) be a pointwise decreasing sequence of attachment rules with \(\gamma _t<\frac{1}{2}\) for all \(t\ge 0\) and pointwise limit f. Suppose that \(\theta (1,f_t)>0\) for all t and \(\theta (1,f)=0\). Then
where \(\alpha ^*\) and \(\rho ''(\alpha ^*)\) are derived from f.
The existence of \(\rho \) and \(\alpha ^*\) corresponding to f is proved in Proposition 2.3. There we will also see that \(\lim _{t \rightarrow \infty }\rho _t(\alpha _t^*)=1\). The following corollary exemplifies Theorem 1.2 for linear attachment rules, see also Fig. 1. We denote
Corollary 1.3
(Linear attachment rule) Let \(\gamma \in [0,\frac{1}{2})\), \(\beta \in (0,1]\). Then
If \(\beta \in (0,1/4]\), then
Remark 1.4
Two cases in our phase diagram are covered in the work of Riordan [17]. The first of these cases corresponds to an approximation from the right of the point \(\beta =0, \gamma =\frac{1}{2}\) which is equivalent to the original Barabási–Albert, or LCD, model.Footnote 1 Note that our results refer to the subcritical case \(\gamma <\frac{1}{2}\) and the critical case \(\gamma =\frac{1}{2}\) is not included. The second is the case \(\beta =\frac{1}{4}, \gamma =0\), the Dubins model, in which there is no preferential attachment and our results are consistent with those of Riordan [17].
Corollary 1.3 allows a quantitative comparison of the decay of the giant component for different models. The smaller \(\gamma \) (or the larger \(\tau \)), the slower is the decay. The LCD model, or equivalent models with \(\gamma =\frac{1}{2}\), have faster decay of the size of the giant component than preferential attachment networks with attachment rules satisfying \(\gamma <\frac{1}{2}\).
Throughout, we will use the following notation: For every integer \(k\in {\mathbb {N}}\), we write \([k]=\{1,\dots ,k\}\), \({\mathbb {R}}_+:=[0,\infty )\), \({\mathbb {N}}_0:={\mathbb {N}}\cup \{0\}\).
The remaining paper is structured as follows: We prove Theorems 1.1 and 1.2 simultaneously. In Sect. 2 we collect several auxiliary results that we need later on. In particular, in Sect. 2.1 we recall the relevant results from [7] who relate the size of the giant component to the survival probability of a multitype branching random walk with killing. In Sect. 2.2 we derive the main tool to analyse these branching random walks, a version of the well-known many-to-one lemma. In Sect. 2.3 we collect various moment estimates, while in Sect. 2.4 we state a large deviation result originally due to Mogulskii in a suitable adaptation to our setting.
The actual proofs of Theorems 1.1 and 1.2 are split into an upper bound carried out in Sect. 3 and a lower bound in Sect. 4. Our proof shows that the survival probability of the branching random walk is well approximated by the probability that the branching random walk follows a carefully chosen strategy up to a large generation N. Here, the choice of N and the particular strategy depend on how close to criticality the model is. With the help of the many-to-one lemma, these events for the branching random walk can be translated into events expressed in terms of a Markov chain. Then, the latter can be analysed using large deviation techniques. The upper bound uses this strategy together with a first moment method, while the lower bound is slightly more involved: we use a coarse graining strategy, whereby we first identify a suitable subset of the multitype branching random walk by grouping together particles across every N generations following a particular strategy. Then, we treat each of these groups as one generation of a standard Galton–Watson process and analyse its survival probability by estimating the probability that one generation survives.
In Sect. 5 we show how to derive Corollary 1.3 from Theorem 1.2. Finally, in Appendix we prove the large deviation result, Theorem 2.10.
As mentioned above our proof follows a similar strategy as in [9]. However, we are considering a very different mechanism: first of all we are dealing with a multi-type branching random walk that additionally has infinitely many offspring in each step. Furthermore, in our case we do not shift the killing boundary, but we allow the parameters of the model to vary such that the models approaches criticality which requires very different moment estimates.
2 Proofs: General Preparations
2.1 The Approximating Branching Process
We introduce a pure jump Markov process with generator
This defines an increasing, integer-valued process, which jumps from k to \(k+1\) after an exponential waiting time with mean 1 / f(k), independently of the previous jumps. Under the probability P we denote by \((Z_t:t \ge 0)\) the process started in zero, by \(({\hat{Z}}_t:t\ge 0)\) the process started in one, and by \((Z_t^{{\scriptscriptstyle {({\tau })}}} :t \ge 0)\) the process started in zero conditioned to have a point at \(\tau \ge 0\).
This process is used to define a multitype branching random walk with type space \({\mathcal {T}}:=[0,\infty )\cup \{\ell \}\), where \(\ell \) is a non-numerical symbol for ‘left’. A particle in location \(x \in {\mathbb {R}}\) and of type \(\tau \in {\mathcal {T}}\), produces offspring to its left whose displacements have the same distribution as the points of the Poisson point process with intensity measure
The type of an offspring on the left equals the distance to its parent.
The distribution of the offspring to the right depends on the type of the particle. When the particle is of type \(\ell \), then the relative positions of its right offspring follow the same distribution as the jump times of \((Z_t:t\ge 0)\). When the particle is of type \(\tau \ge 0\), then the displacements follow the same distribution as the jump times of \((Z_t^{\scriptscriptstyle (\tau )}-\mathbb {1}_{[\tau ,\infty )}(t):t\ge 0)\). All offspring on the right are of type \(\ell \).
The offspring to the right do not form a Poisson point process. The more particles are born, the higher the rate of new particles arriving. Moreover, the total number of particles produced is infinite without accumulation point. The expected distance between a particle and its kth offspring on the right equals \(\sum _{j=0}^{k-1}\frac{1}{f(j)}= \sum _{j=0}^{k-1} \frac{1}{f(j)/j} \frac{1}{j}\). Since \(\lim _{j \rightarrow \infty } \frac{f(j)}{j} = \gamma \), this distance behaves asymptotically like \(\gamma ^{-1} \log (k)\) when \(\gamma \not =0\) and like k when \(\gamma =0\).
We call the described process idealized branching random walk (\({\mathrm {IBRW}}\)) in accordance with [7]. Dereich and Mörters [7] show that the genealogical tree of the \({\mathrm {IBRW}}\) is related to the local neighbourhood of a vertex in \(\mathsf{{G}}_n\). To obtain a branching process approximation to \((\mathsf{{G}}_n(p):n \in {\mathbb {N}})\), we define the percolated \({\mathrm {IBRW}}\) by associating to every offspring in the \({\mathrm {IBRW}}\) an independent Bernoulli(p) random variable. If the random variable is zero, we delete the offspring together with its descendants. Otherwise, the offspring is retained in the percolated \({\mathrm {IBRW}}\). When the percolated \({\mathrm {IBRW}}\) is started with one particle in location x and type \(\tau \), then we write \(P_{(x,\tau )}^p\) for its distribution and \(E_{(x,\tau )}^p\) for the corresponding integral operator;
The percolated \({\mathrm {IBRW}}\) can be interpreted as a labelled tree \(\Gamma \) where every node represents a particle and is connected to its children and (apart from the root) to its parent. The vertices are identified as finite sequences of natural numbers \(x=j_1\ldots j_k\), including the empty sequence \(\emptyset \) which denotes the root. We concatenate sequences \(x=i_1\ldots i_k\) and \(y=j_1\ldots j_m\) to form the sequence \(xy=i_1\ldots i_kj_1\ldots j_m\). When, for \(j \in {\mathbb {N}}\), xj is a vertex in \(\Gamma \), then \(x,x1,\ldots ,x(j-1)\) are also vertices in the tree and \(x=:\mathrm{p}(xj)\) is the parent of xj. The length \(|x|=k\) is the generation of x. For \(|x|\ge k\), we abbreviate \(\mathrm{p}^k(x)\) for the k-fold composition of \(\mathrm{p}(\cdot )\). The ancestor of x in generation k is denoted by \(x_k\), i.e. \(x_k:=\mathrm{p}^{n-k}(x)\) when \(|x|=n\). In particular, \(x_0\) always denotes the root. To every vertex we associate two functions, \(S\) and \(\tau \). Here \(S(x)\) is the location of the particle on the real line and \(\tau (x)\) denotes its type.
To obtain a branching process approximation to the local neighbourhood of a vertex in \((\mathsf{{G}}_n(p):n \in {\mathbb {N}})\), we consider the percolated \({\mathrm {IBRW}}\) with a killing barrier at zero. That is, every particle with location on the nonnegative half-line is deleted together with its descendants. Dereich and Mörters prove the following identification.
Theorem 2.1
(Dereich and Mörters [7]) For all \(p\in [0,1]\) and attachment rules f, \(\theta (p,f)\) equals the survival probability of the percolated \({\mathrm {IBRW}}\) with a killing barrier at zero, started with one particle of type \(\ell \) whose location is given by \(-E\), where E is an exponential random variable with mean one.
Next we collect some spectral properties that will be used in the analysis of the IBRW.
Denote by \(C({\mathcal {T}})\) the Banach space of bounded, continuous functions on \({\mathcal {T}}\) equipped with the supremum norm \(\Vert \cdot \Vert \). For \(\alpha \in (0,1)\), we consider the score operator
on \(C({\mathcal {T}})\), \(A_{\alpha }:=A_{\alpha }^1\). The spectral radius of \(A_{\alpha }^p\) is denoted by \(\rho ^p(\alpha )\). The dependence of \(A_{\alpha }^p\) on the attachment rule f is suppressed in notation but it will always be clear from the context which f is considered. Since by definition,
it suffices to analyse \(A_{\alpha }\). We write \(\mathbf {1}\) for the constant function with value 1 and let \({\mathcal {I}}:=(\gamma ,1-\gamma )\) for \(\gamma <\frac{1}{2}\) and \({\mathcal {I}}=\emptyset \) for \(\gamma \ge \frac{1}{2}\).
Lemma 2.2
Let \(p \in (0,1]\). If \(\gamma \ge \frac{1}{2}\), then \(A_{\alpha }^p\mathbf {1}(0)=\infty \). If \(\gamma <\frac{1}{2}\), then the following holds:
-
(i)
\(\rho ^p(\alpha )\) is finite for all \(\alpha \in {\mathcal {I}}\), \(\rho ^p(\alpha )=p\rho (\alpha )\) and \(\rho (\alpha )\rightarrow \infty \) for \(\alpha \rightarrow \partial {\mathcal {I}}\).
-
(ii)
There exists a unique positive eigenfunction \(v_{\alpha }\) of \(A_{\alpha }^p\) corresponding to \(\rho ^p(\alpha )\) with \(\Vert v_{\alpha }\Vert =1\). Moreover, \(v_{\alpha }\) does not depend on the retention probability p and \(\min _{\tau \in {\mathcal {T}}} v_\alpha (\tau ) > 0\).
-
(iii)
The function \(\rho \) is twice differentiable on \({\mathcal {I}}\) with
$$\begin{aligned} \rho ^{(i)}(\alpha )=E_{(0,\tau _0)}\left[ \sum _{|x|=1} \frac{v_{\alpha }(\tau (x))}{v_{\alpha }(\tau _0)} e^{-\alpha S(x)} (-S(x))^{i}\right] \quad \text {for all } \tau _0 \in {\mathcal {T}}, i \in \{1,2\}.\nonumber \\ \end{aligned}$$(2.3) -
(iv)
The function \(\rho \) is strictly convex on \({\mathcal {I}}\) and there exists a unique minimizer \(\alpha ^* \in {\mathcal {I}}\).
-
(v)
For any \(\tau \in [0,\infty )\)
$$\begin{aligned} v_\alpha (\ell ) \le v_\alpha (\tau ) \le v_\alpha (0). \end{aligned}$$
Proof
By (2.2), it suffices to prove Lemma 2.2 in the case \(p=1\). For that case, it was shown in [7, Lemma 3.1] that \(A_{\alpha }1(0)<\infty \) is equivalent to \(A_{\alpha }\) being a strongly positive, compact operator with \(A_{\alpha }g \in C({\mathcal {T}})\) for all \(g \in C({\mathcal {T}})\). Moreover, it is proved that for all \(g \in C({\mathcal {T}})\), \(g\ge 0\), we have \({\underline{A}}_{\alpha }g \le A_{\alpha }g \le {\overline{A}}_{\alpha }g\), where
and
Here, for example, \({\overline{A}}_{\alpha }g\) means that for all \(\tau \ge 0\),
We have, \(\rho (\alpha )\ge \rho ({\underline{A}}_{\alpha })={\texttt {a}}(\alpha )+{\texttt {a}}(1-\alpha )\). The values of \({\texttt {a}}\) and \({\texttt {c}}\) are identified as (cf. proof of Proposition 1.10 in [7])
We analyse the convergence properties of \({\texttt {a}}(\alpha )\). Using \(\log (1+x) \le x\) for \(x \ge 0\) and \(f(j)=\sum _{i=0}^{j-1}\triangle f(i)+f(0) \ge j\gamma \), we estimate
for some \(C>0\) and a null sequence \((\delta _k)_{k \in {\mathbb {N}}}\). In particular, there exists a \(C'>0\) such that for all \(k \in {\mathbb {N}}\)
Hence, \({\texttt {a}}(\alpha )=\infty \) for all \(\alpha \le \gamma \). On the other hand, by Cauchy’s condensation test, \({\texttt {a}}(\alpha )<\infty \) for all \(\alpha > \gamma \). Hence, \(A_{\alpha }1(0) < \infty \) if and only if \(\alpha \in (\gamma ,1-\gamma )=:{\mathcal {I}}\). From now on we assume that \({\mathcal {I}}\not =\emptyset \), i.e. \(\gamma <\frac{1}{2}\). For \(\alpha \in {\mathcal {I}}\), \(\rho (\alpha )\) is finite and since \(\rho (\alpha ) \ge {\texttt {a}}(\alpha )+{\texttt {a}}(1-\alpha )\), we see that \(\rho (\alpha ) \rightarrow \infty \) for \(\alpha \rightarrow \partial {\mathcal {I}}\). The existence and uniqueness of eigenfunction \(v_{\alpha }:{\mathcal {T}}\rightarrow (0,\infty )\) follows from the Krein–Rutman theorem (see Theorem 3.1.3 in [16]). The fact that \(A_{\alpha }\) is a strictly positive operator, implies \(\min _{\tau \in {\mathcal {T}}}v_{\alpha }(\tau )>0\). Since \(\rho (\alpha )\) is an isolated eigenvalue with one-dimensional eigenspace, one can argue along the lines of Chapter II \(\S \)3 (in particular Remark 2.4) and Theorem II.\(\S \)5.5.4 of [12] that \(\rho \) is twice differentiable and the derivative can be represented as in (2.3). In particular, \(\rho ''(\alpha )>0\) for all \(\alpha \in {\mathcal {I}}\), hence, \(\rho \) is strictly convex on \({\mathcal {I}}\) and there exists a unique minimizer \(\alpha ^* \in {\mathcal {I}}\).
To prove (v), consider \(\tau \ge 0\). Let \({{\hat{Z}}}^{\scriptscriptstyle {({\tau })}}_t= Z^{\scriptscriptstyle {({\tau })}} - \mathbb {1}_{[\tau , \infty )}\) for any \(\tau \in [0,\infty )\). Then, by the definition of the eigenfunction,
where we used in the inequality that the distribution of the positions to the left of the origin do not depend on the initial type and for the second expectation we used the monotonicity in types proved in [7]. The upper bound holds by a similar argument. \(\square \)
The next proposition collects some of the consequences for the spectral radius if we consider converging attachment rules.
Proposition 2.3
Let \((f_t)_{t \ge 0}\) be pointwise decreasing attachment rules with \(\gamma _t = \inf _{k \ge 0} \frac{f_t(k)}{k} < 1/2\). Define
Then, f is concave, \(f(0) \le 1\) and \(f(k+1) - f(k) \le 1\) for all \(k \in {\mathbb {N}}_0\). Let \(\gamma = \inf _{k \ge 0 } \frac{f(k)}{k}\) and let \(\rho (\alpha )\) be the spectral radius of the operator associated to the branching process with attachment rule f, \(\alpha ^*\) be its unique minimizers and let \(v_\alpha \) be the corresponding eigenfunction with \(\Vert v_\alpha \Vert = 1\). Define the same quantities with index t when referring to the branching process associated to \(f_t\), where we set \(\rho _t(\alpha ) = \infty \) if \(t \notin (\gamma _t, 1- \gamma _t)\).
-
(i)
$$\begin{aligned} \rho (\alpha ) = \lim _{ t \rightarrow \infty } \rho _t(\alpha ) \text{ for } \text{ all } t \in (\gamma , 1 -\gamma ). \end{aligned}$$
-
(ii)
Suppose that \(\lim _{t \rightarrow \infty } \rho _t(\alpha _t^*) = 1\), then \(\alpha _t^* \rightarrow \alpha ^*\) as \(t \rightarrow \infty \) and \(\rho (\alpha ^*) = 1\);
-
(iii)
The quotient
$$\begin{aligned} \frac{v_{t,\alpha _t^*}(0)}{v_{t,\alpha _t^*}(\ell )} \end{aligned}$$is uniformly bounded in t away from zero and infinity.
-
(iv)
Moreover, as \(t \rightarrow \infty \),
$$\begin{aligned} \rho _t''(\alpha _t^*) \rightarrow \rho ''(\alpha ^*) . \end{aligned}$$
Proof
Since \((f_t)_{t \ge 0}\) is a pointwise decreasing sequence of positive functions
exists. As an infimum of concave functions, \(f:{\mathbb {N}}_0\rightarrow [0,\infty )\) is concave. The property \(f(0)\le 1\) is inherited from \(f_t\). The increments might in general only satisfy \(f(k+1)-f(k) \le 1\), but the strict inequality is not needed for the analysis of the branching process and the corresponding operators.
The assumption \(\rho _t(\alpha _t^*)>1\) implies that there exists a giant component for all \(t\ge 0\). Hence,
by Proposition 1.10 in [7]. Here \(a_{f_t}\) and \(c_{f_t}\) are the functions given in (2.4). By monotone convergence, \({\texttt {a}}_{f}(1/2)+\sqrt{{\texttt {a}}_{f}(1/2){\texttt {c}}_{f}(1/2)}\ge 1\) and, in particular, \(f(0)>0\). Hence, f is an attachment rule. From now on we add a subscript t to all quantities corresponding to \(f_t\) and no subscript for quantities corresponding to f.
The assumption \(\gamma _t <\frac{1}{2}\) is needed to make the operator \(A_{t,\alpha }\) exist for some \(\alpha \). We have for all \(t \le s\)
In particular, \((\gamma _t)_{t \ge 0}\) is a non-increasing sequence. Let \({\mathcal {I}}=(\gamma ,1-\gamma )\) and \({\mathcal {I}}_t=(\gamma _t,1-\gamma _t)\) for \(t \ge 0\). Then \({\mathcal {I}}_t \subseteq {\mathcal {I}}\) for all \(t\ge 0\) and we write \(\rho _t(\alpha )=\infty \) whenever \(\alpha \not \in {\mathcal {I}}_t\).
Let \(\alpha \in {\mathcal {I}}\). We use the monotonicity of the branching process in the attachment rule to derive
In particular, for every \(\alpha \in {\mathcal {I}}\) there exists \(t\ge 0\) such that \(\rho _t(\alpha )<\infty \). Hence, \(\bigcup _{t \ge 0}{\mathcal {I}}_t ={\mathcal {I}}\). Since \(\rho (\alpha ) \rightarrow \infty \) for \(\alpha \rightarrow \partial {\mathcal {I}}\), there exists an \(\epsilon >0\) such that \(\alpha ^*,\alpha _t^* \in [\gamma +\epsilon ,1-\gamma -\epsilon ]=:{\hat{{\mathcal {I}}}}\) for all \(t \ge 0\).
Convergence \(\gamma _t \rightarrow \gamma \) implies the existence of a \(t_0>0\) such that \(\hat{{\mathcal {I}}}\subseteq {\mathcal {I}}_t\) for all \(t\ge t_0\). In particular, we can consider the family \(\rho ,(\rho _t)_{t\ge t_0}\) of uniformly continuous functions on \(\hat{{\mathcal {I}}}\).
In the next step we argue that \(\alpha _t^* \overset{t \rightarrow \infty }{\longrightarrow } \alpha ^*\). Notice that by assumption \(\rho (\alpha ) =\lim _{t \rightarrow \infty }\rho _t(\alpha )\ge \lim _{t \rightarrow \infty } \rho _t(\alpha _t^*) \ge 1\) and
Hence, \(\rho (\alpha ^*)=1\). Suppose that \(\alpha _t^* \overset{t \rightarrow \infty }{\longrightarrow } \alpha ^*\) does not hold. Then there exists a \(\delta >0\) and a subsequence \(t_n\uparrow \infty \) such that \(|\alpha _{t_n}^*-\alpha ^*| \ge \delta \) for all \(n \in {\mathbb {N}}\). Since \(\rho \) is strictly convex with unique minimizer \(\alpha ^*\), we have
and \(\rho (\alpha )\ge 1+\delta '\) for all \(\alpha \not \in [\alpha ^*-\delta ,\alpha ^*+\delta ]\). In particular,
Since the term on the left-hand side converges to 1, this is a contradiction and the convergence \(\alpha _t^* \overset{t \rightarrow \infty }{\longrightarrow } \alpha ^*\) is established.
The fact that \(\alpha _t^*\) converges to \(\alpha ^*\) and \(f_t\) converges to f implies that \(A_{t,\alpha _t^*}\) converges to \(A_{\alpha ^*}\) because of the uniform continuity of the operator in \(\alpha \in \hat{{\mathcal {I}}}\). Since the eigenspaces of \(\rho (\alpha ^*)\) and \(\rho _t(\alpha _t^*)\) are one dimensional, one can argue along the lines of Note 3 on Chapter II in [12, pp. 568–569] to see that \(v_{t,\alpha _t^*}\) converges to \(v_{\alpha ^*}\). Since the functions are bounded,
is uniformly bounded in t from zero and infinity. With the observed convergences and this uniform bound (2.3) now implies that also
as required. \(\square \)
2.2 The Many-to-One Lemma
We first continue the analysis of the \({\mathrm {IBRW}}\). The following lemma is based on a spine construction which is known as Lyons’ change of measure [14]. Recall the Ulam–Harris notation from Sect. 2.1.
Lemma 2.4
( Many-to-one) For all \(\alpha \in {\mathcal {I}}\) there exists a probability measure \({\mathbb {P}}^\alpha \) on some measurable space, and a Markov process \(((S_n,\tau _n) :n \in {\mathbb {N}}_0;{\mathbb {P}}^{\alpha })\) with state space \({\mathbb {R}}\times {\mathcal {T}}\), such that for all \(n \in {\mathbb {N}}\), \((s_0,\tau _0) \in {\mathbb {R}}\times {\mathcal {T}}\) and \(F:({\mathbb {R}}\times {\mathcal {T}})^n \rightarrow {\mathbb {R}}_+\) measurable
Moreover, \(\sigma ((S_i,\tau _i):i\in [n])=\sigma (S_i:i\in [n])\).
Note that it is easy to check that the distribution of \(((S_n, \tau _n) \, : \, n \in {\mathbb {N}}_0; {\mathbb {P}}^\alpha )\) does not depend on the percolation parameter p.
Proof
Given a labelled tree \((\Gamma ,L)\), with \(L(x)=(S(x),\tau (x))\), we can distinguish an ancestral line \(\xi =(\xi _1,\xi _2,\ldots )\) which we call spine. In the space of labelled trees, we denote by \({\mathcal {F}}_n\) the \(\sigma \)-field generated by the first n generations, \({\mathcal {F}}_n=\sigma ((x,L(x)):|x|\le n)\). The analogue in the space of trees with spines is denoted by \({\mathcal {F}}_n^*\).
For every \((s_0,\tau _0)\in {\mathbb {R}}\times {\mathcal {T}}\), the distribution of the \({\mathrm {IBRW}}\) started in \((s_0,\tau _0)\) can be interpreted as a distribution \(P_{(s_0,\tau _0)}^{p}\) on the set of labelled trees. We extend this measure to the space of labelled trees with spines. Since \((s_0,\tau _0)\) and p will remain fixed throughout the proof, we omit it from the notation and write \(P=P_{(s_0,\tau _0)}^p\) for brevity.
Note that every \({\mathcal {F}}_n^*\)-measurable function g can be written as
for \({\mathcal {F}}_n\)-measurable functions \(g_x\) (see page 24 in [18]). We define \(P_n^*\) to be the (non-probability) measure on \({\mathcal {F}}_n^*\) such that for all nonnegative \({\mathcal {F}}_n^*\)-measurable functions g,
We now construct a new branching random walk under a new probability measure \({\mathbb {P}}^\alpha \). The root has again label \(L(\emptyset )=(S(\emptyset ),\tau (\emptyset ))=(s_0,\tau _0)\). A particle \(\xi _n\) on the spine in generation n with label \((S(\xi _n),\tau (\xi _n))\) produces new offspring with distribution
for all atomic measures \(\mu \) on \({\mathbb {R}}\times {\mathcal {T}}\). Here \({\mathcal {L}}_{\sigma }\) denotes the offspring distribution for a particle of type \(\sigma \in {\mathcal {T}}\) in the original process. The new spine particle \(\xi _{n+1}\) in generation \(n+1\) is chosen from the offspring of \(\xi _n\) by choosing an offspring x with probability proportional to
Off the spine the new branching random walk behaves exactly as the original one. Then
In particular, for all \(F :({\mathbb {R}}\times {\mathcal {T}})^n \rightarrow [0,\infty )\) measurable
We define \((S_n,\tau _n):=(S(\xi _n),\tau (\xi _n))\) and \({\mathbb {P}}_{(s_0,\tau _0)}^{\alpha }={\mathbb {P}}^{\alpha }\). The Markov property follows from the definition of the process. Since the offspring distribution of the spine is absolutely continuous with respect to the offspring distribution of the original process and the type of the original process is a function of the locations of its ancestors and the particle itself, the proof is complete. \(\square \)
Next we also need a higher-dimensional version of the many-to-one lemma that includes the number of offspring of the particles on the spine. For any \(x = (x_0, \ldots , x_n)\) in the branching process, define the point measure on \({\mathbb {R}}\)
with \(\delta _s\) a Dirac mass in s, which describes the positions of offspring of x relative to the position of x. Denote by \(M_p({\mathbb {R}})\) the point measures on \({\mathbb {R}}\).
Lemma 2.5
For all \(\alpha \in {\mathcal {I}}\), \((s_0, \tau _0) \in {\mathbb {R}}\times {\mathcal {T}}\) there exists a probability measure \( {\mathbb {P}}^\alpha _{(s_0,\tau _0)} = {\mathbb {P}}^{\alpha ,p}_{(s_0,\tau _0)}\) and a Markov process \(((S_i, \tau _i, \nu _{i-1})_{i=0, \ldots , n}, : \, n \in {\mathbb {N}}_0; {\mathbb {P}}_{(s_0,\tau _0)}^\alpha )\) started in \((s_0, \tau _0, \delta _{s_0})\) with state space \({\mathbb {R}}\times {\mathcal {T}}\times M_p({\mathbb {R}})\) such that for all measurable F
Moreover, for any measurable \(A \subset {\mathbb {R}}\) and any measurable F,
Note that unlike in Lemma 2.4 the distribution of \((S_k,\tau _k, \nu _{k-1})_{k \in {\mathbb {N}}_0}\) does depend on p, since we are considering a non-linear function of the point process describing the position of the offspring.
Proof
Consider the IBRW with spine \(\xi \) under the measure \({\mathbb {P}}^\alpha \) as constructed in the proof of Lemma 2.4. Then, define \((S_n, \tau _n, \nu _{n-1}) := ( S(\xi _n), \tau (\xi _n), \nu _{\xi _{n-1}})\) and the first statement follows since we know the explicit Radon–Nikodym density of \({\mathbb {P}}^\alpha \) with respect to \(P^*_n\). The second statement is a consequence of the Markov property of the IBRW with spine combined with a suitable choice of test function F. \(\square \)
Lemma 2.6
(Moments) Let \(\alpha \in {\mathcal {I}}\) and \(((S_n,\tau _n):n \in {\mathbb {N}}_0)\) be the Markov process from Lemma 2.4. For all \(\tau _0 \in {\mathcal {T}}\)
Proof
Follows immediately from (2.5) and (2.3). \(\square \)
2.3 Asymptotic Moment Estimates
In the proofs of Theorems 1.1 and 1.2 we will need estimates for moments of the Markov chains defined in Lemmas 2.4 and 2.5. Suppose that \(f_{t_n}\) is a sequence of decreasing attachment rules such that \(f_{t_n} \downarrow f\) pointwise for an attachment rule f. Further, suppose that \(\gamma _n := \lim _{k \rightarrow \infty } f_{t_n}(k)/k < \frac{1}{2}\) for all \(n \in {\mathbb {N}}\), so that by Proposition 2.3 also \(\gamma = \lim _{k \rightarrow \infty } \frac{f(k)}{k} < \frac{1}{2}\). Let \(\alpha ^*_n\), resp. \(\alpha ^*\) be the unique minimizer of the spectral radius \(\rho _n\), resp. \(\rho \), of the operator corresponding to the attachment rule \(f_n\), resp. f. In the setting of Theorem 1.1 take \(f_{t_n} = f\) and write \(\rho _n = \rho ^{p_n} = p_n \rho (\cdot )\).
The Markov chain from Lemma 2.4 corresponding to attachment rule \(f_{t_n}\) is denoted by \(((S_i^{\scriptscriptstyle {({n})}},\tau _i^{\scriptscriptstyle {({n})}}):i \in {\mathbb {N}};{\mathbb {P}}={\mathbb {P}}_{(s_0,\ell )}^{\alpha _n^*}\)).
Lemma 2.7
There exists \(\eta > 0\) such that
Proof
We first consider the case \(S_1^{\scriptscriptstyle {({n})}} \ge 0\). Note that
where we used that \(\rho _n(\alpha _n)>1\), the monotonicity in types for \(v_{\alpha _n}\) by Lemma 2.2 and the uniform boundedness of the quotient \(\frac{v_{\alpha _n}(0)}{v_{\alpha _n}(\ell )}\) in n by Proposition 2.3. Hence, it suffices to show that for the right choice of \(\eta \) the expectation on the right hand side remains bounded in n.
Now, by Lemma 2.2, we have that \(\eta := \frac{1}{4} (\alpha ^*- \gamma ) > 0\). By Proposition 2.3, we have that \(\gamma _n \downarrow \gamma \) and \(\alpha _n^*\rightarrow \alpha ^*\). Then, we can choose \(n_0\) sufficiently large such that for all \(n \ge n_0\) we have \(\gamma _{n} < \gamma + \eta \) and \(\alpha _n^\star \ge \alpha ^* - \eta \).
Furthermore, we denote by \({\tilde{Z}}^{\scriptscriptstyle {({n_0})}}\) the jump process that jumps from k to \(k+1\) with rate \(f_{t_{n_0}}(k)\) started in 1. Then by comparison with a Yule process with constant branching rate, we can find a constant \(C(n_0) > 0\) such that
Finally, we obtain from the construction of IBRW and using (2.6) that for \(n \ge n_0\)
which is finite by choice of \(\eta \).
The fact that the supremum over all n is finite follows from the same argument if we redefine \(\eta \) as \(\frac{1}{4} \min \{\alpha ^* - \gamma , \alpha ^*_k - \gamma _k , k \le n_0 \}\).
For the case that \(S_1^{\scriptscriptstyle {({n})}} \le 0\), it suffices to prove in a second step, that there exists \(\eta > 0\) such that
Since the children to the left of a particle in the IBRW form a Poisson process and their distribution is not depending on the type of their ancestor we have by construction
where \(X^{\scriptscriptstyle {({f})}}\) is the pure birth process with jump rates given by f.
Let \(\epsilon _n = \frac{1}{8} ( 1 - \gamma _n - \alpha ^*_n)\) and let \(\epsilon = \frac{1}{4} ( 1 - \gamma - \alpha ^*)\), where \(\gamma = \inf _{k \ge 1} \frac{f(k)}{k}\). By Lemma 2.2, \(\epsilon _n, \epsilon > 0\).
Using that \(f_{t_k} (k) \le k+1\) and a comparison to a Yule process, we have that there exists \(C_n > 0\) such that
By Proposition 2.3, we can find \(n_0\) such that for all \(n \ge n_0\),
Define \(\eta := \min \{ \frac{3}{8} \epsilon , \epsilon _n, n \le n_0 \}\). Then, we have for \(n \le n_0\),
Furthermore, for \(n \ge n_0\), we can use the monotonicity of \(f_{t_n}\) to deduce that
which completes the second step and thus the proof of the lemma. \(\square \)
Lemma 2.8
Let \((S_k^{\scriptscriptstyle {({n})}}, \tau ^{\scriptscriptstyle {({n})}}_k, \nu _{k-1}^{\scriptscriptstyle {({n})}})\) be the Markov chain defined in Lemma 2.5 either for attachment rule \(f_{t_n}\) and percolation parameter 1 or for fixed attachment rule f (with \(\gamma < \frac{1}{2}\)) and percolation parameter \(p_n\), where \(p_n \downarrow \rho (\alpha ^*)^{-1}\). For any sequence \((M_n)_{n \in {\mathbb {N}}}\) such that \(M_n \rightarrow \infty \), there exist constants \(C > 0\) and \({\hat{\gamma }} > 0\) such that, for all n,
Proof
To unify notation define \(p_n=1\) in the varying f case and \(f_{t_n} = f\) in the percolation case. By the extended many-to-one formula, Lemma 2.5, we have that
We can use that \(\rho _n(\alpha _n^*) > 1\), the monotonicity in types and in \(p_n\), Lemma 2.2, and that \(C :=\sup _{n\in {\mathbb {N}}} \frac{v_{\alpha _n}(0)}{v_{\alpha _n}(\ell ) } < \infty \) by Proposition 2.3, in order to bound the above by
For the first term in (2.7), we note that \(f(k) \le f_{t_n}(k) \le f_{t_1}(k) \le \gamma _1^+ k + f_{t_1}(0)\) (for \(\gamma _1^+ = \sup \{ \Delta f_{t_1} (k), k \in {\mathbb {N}}_0\}\). Therefore, if we let \(({\hat{Z}}^{f_{t_1}})_{t \ge 0} \) be the jump process jumping from k to \(k+1\) at rate \(f_{t_1}(k)\) started in 1, we can conclude that by construction of the IBRW
for some constants \(C(f_{t_1})\) and \({\hat{\gamma }} > 0\), where the latter bound follows by comparison with a Yule process, whose second moments grow at most exponentially.
For the second term on the right hand side in (2.7), the first expectation is bounded uniformly in n by (the second part of) Lemma 2.7 and the second expectation can be bounded by the second moment, so that the first part of the argument applies.
For the final term in (2.7), we use that the particles to the left form a Poisson process, so that we can use a standard identity for Poisson processes, see e.g. [13, Eq. (4.26)], to deduce that
However, as in the proof of Lemma 2.7 the right hand side is bounded uniformly in n. \(\square \)
Corollary 2.9
In the setting of Lemma 2.8, we have for any sequence \(N_n \rightarrow \infty \) and with \(R_n = e^{N_n^{1/4}}, M_n = N_n^{1/5}\) that there exist \({\tilde{C}}, {\tilde{\gamma }} > 0\) such that
Proof
By Lemma 2.7, there exists \(\eta > 0\) such that \(C := \sup _{n\in {\mathbb {N}}_0, \tau } {\mathbb {E}}_{(0,\tau )}^{\alpha _n^*}[ e^{\eta S_1^{\scriptscriptstyle {({n})}}}] < \infty \). Moreover, by Lemma 2.8, there exist \( C'> 0, \gamma ' > 0\) such that \(\sup _{\tau } {\mathbb {E}}^\alpha _{(0,\tau )}[ {\bar{\nu }}_{0}, S_1 \le M_n] \le C' e^{ \gamma ' M_n}\). Hence, we can estimate by Chebyshev’s inequality
by our choice of \(M_n\) and \(R_n\). Therefore, the statement of the corollary follows by choosing \({\tilde{\gamma }}\) and \({\tilde{C}}\) appropriately. \(\square \)
2.4 Mogulskii’s Theorem
The main technical tool in the proof of our main result is the following large deviation result due to Mogulskii in its original form. We state it here in a version adapted to Markov chains as a generalisation of the version for random walks found in [9].
Theorem 2.10
([9, 15]) Let \({\mathcal {T}}\) be a nonempty set. We assume the following:
-
(i)
For each \(n \in {\mathbb {N}}\), \(((S_i^{\scriptscriptstyle (n)},\tau _i^{\scriptscriptstyle (n)}):i \in {\mathbb {N}})\) is a Markov chain with values in \({\mathbb {R}}\times {\mathcal {T}}\).
-
(ii)
\((a_n)_{n \in {\mathbb {N}}}\) and \(((k_n)_{n \in {\mathbb {N}}})\) are positive sequences with \(a_n, k_n \rightarrow \infty \) and \(a_n^2/k_n\rightarrow 0\) as \(n \rightarrow \infty \).
-
(iii)
For all \(c_1,c_2,c_3,c_4 \in {\mathbb {R}}\), \(A>0\) and for \(r_n:=\lfloor Aa_n^2\rfloor \),
$$\begin{aligned} \begin{aligned}&P_{(0,\tau )}\left( c_1 \le \frac{S_i^{\scriptscriptstyle (n)}}{a_n} \le c_2 \, \forall \, i \in [r_n];c_3 \le \frac{S_{r_n}^{\scriptscriptstyle (n)}}{a_n} \le c_4\right) \\&\quad \xrightarrow {n\rightarrow \infty } P(c_1 \le \sqrt{\sigma ^2 A}W_t \le c_2\, \forall \, t \in [0,1]; c_3 \le \sqrt{\sigma ^2 A}W_1 \le c_4) \end{aligned} \end{aligned}$$(2.8)uniformly in \(\tau \in {\mathcal {T}}\), where \(\sigma ^2>0\) is a constant independent of A and \((W_t:t \ge 0;P)\) is a standard Brownian motion.
Let \(g_1 < g_2\) be two continuous functions on [0, 1] with \(g_1(0)\le 0\le g_2(0)\) and denote
Then, for all \(\tau _0 \in {\mathcal {T}}\),
Moreover, for any \(b > 0\), \(\tau _0 \in {\mathcal {T}}\),
The proof of Theorem 2.10 is postponed to Appendix.
We will now show that we can apply Mogulskii’s results to the Markov chains from Lemmas 2.4 and 2.5. We will treat both the setting of Theorems 1.1 and 1.2 at the same time and so continue using the notation introduced at the beginning of Sect. 2.3.
To this end, we first recall Donsker’s theorem for martingale difference arrays (see for example Theorem 7.7.3 in [8] or Theorem 18.2 in [2]). The theorem is usually stated for \(r_n=n\), but it is straightforward to generalize the statement to the following:
Proposition 2.11
Let \((r_n)_{n \in {\mathbb {N}}} \in {\mathbb {N}}^{{\mathbb {N}}}\) be a sequence with \(r_n \uparrow \infty \) as \(n \rightarrow \infty \). For every \(n\in {\mathbb {N}}\), let \((\xi _i^n:1 \le i \le r_n)\) be a family of random variables and denote \({\mathcal {F}}_i^n=\sigma (\xi _1^n,\dots ,\xi _i^n)\) for all \(i \le r_n\). Assume that
-
(i)
\(E[\xi _i^n|{\mathcal {F}}_{i-1}^n]=0\) for all \(i \le r_n\), \(n \in {\mathbb {N}}\).
-
(ii)
For all \(t \in [0,1]\), \(\sum _{i \le \lfloor tr_n\rfloor } E[(\xi _i^n)^2|{\mathcal {F}}_{i-1}^n] \rightarrow t\) in probability as \(n \rightarrow \infty \).
-
(iii)
For all \(\epsilon >0\), \(\sum _{i \le r_n} E[(\xi _i^n)^2 \mathbb {1}_{|\xi _i^n|>\epsilon }|{\mathcal {F}}_{i-1}^n] \rightarrow 0\) in probability as \(n \rightarrow \infty \).
Then the linear interpolation of \((\sum _{i \le m} \xi _i^n:m \le r_n)\) converges weakly to a standard Brownian motion on [0, 1].
We use Donsker’s theorem as follows.
Lemma 2.12
Let \(A>0\), \((a_n)_{n \in {\mathbb {N}}}\) be a positive sequence with \(\lim _{n \rightarrow \infty } a_n =\infty \) and write \(r_n=\lfloor Aa_n^2\rfloor \) for each \(n \in {\mathbb {N}}\). Moreover, let \((S_i^{\scriptscriptstyle {({n})}}, \tau _i^{\scriptscriptstyle {({n})}})_{i \in {\mathbb {N}}_0}\) be the Markov chain introduced in Lemma 2.4 for attachment rule \(f_{t_n}\). For all \(c_1,c_2, c_3, c_4 \in {\mathbb {R}}\),
as \(n \rightarrow \infty \), uniformly in \(\tau \in {\mathcal {T}}\), where \((W_t:t \ge 0;P)\) is a standard Brownian motion.
Proof
The first step is to show that the conditions of Proposition 2.11 are satisfied by the random variables
Let \({\mathcal {F}}_i^{n}=\sigma (S_1^{\scriptscriptstyle {({n})}},\ldots ,S_i^{\scriptscriptstyle {({n})}})\). For all \(i \in [r_n]\),
by Lemma 2.6 and \(\rho _n'(\alpha _n^*)=0\) as \(\alpha _n^*\) minimizes \(\rho _n\). Moreover, as \(n \rightarrow \infty \),
since \(r_n \rightarrow \infty \) and \(\rho _n(\alpha _n^*)/\rho _n''(\alpha _n^*)\rightarrow \sigma ^2\) by definition. Thus, Condition (ii) of Proposition 2.11 is satisfied. Finally, let \(\eta \) be as in Lemma 2.7, then
where C is a constant such that \(x^2 \le C e^{\eta |x|}\) for all \( x\in {\mathbb {R}}\). Then, by Lemma 2.7 the exponential moment is bounded uniformly in n, so that the right hand side converges to zero as required.
To apply Theorem 2.10 we have to check that the convergence of the distribution functions is uniform in the start type. This is guaranteed by the monotonicity of the \({\mathrm {IBRW}}\) in the start type (which was proven in [7, Remark 2.6]) which entails a monotonicity of \((S_i^{\scriptscriptstyle {({n})}})\) by the many-to-one lemma, and by the fact that the limit is independent of the start type. \(\square \)
Lemma 2.13
Let \(N_n\) be a positive sequence with \(\lim _{n \rightarrow \infty } N_n = \infty \) and set \(a_n = N_n^{1/3}, M_n = N_n^{1/5}, R_n = e^{N_n^{1/4}}\). Consider the Markov chain \(({\tilde{S}}^{\scriptscriptstyle {({n})}}_k, {\tilde{\tau }}^{\scriptscriptstyle {({n})}}_k, {\tilde{\nu }}^{\scriptscriptstyle {({n})}}_{k-1})_{k \in {\mathbb {N}}_0}\) with filtration \(({\mathcal {F}}_k^{\scriptscriptstyle {({n})}})_{k \in {\mathbb {N}}_0}\) and transitions given for any measurable F by
where \((S_1^{\scriptscriptstyle {({n})}}, \tau ^{\scriptscriptstyle {({n})}}_1, \nu ^{\scriptscriptstyle {({n})}}_0)\) is the first step of the Markov chain defined in Lemma 2.5 associated either to the IBRW with attachment rule \(f_{t_n}\) and \(p=1\) or with attachment rule f and percolation parameter \(p_n\). Let \(A>0\), write \(r_n=\lfloor Aa_n^2\rfloor \). For all \(c_1,c_2, c_3,c_4 \in {\mathbb {R}}\),
as \(n \rightarrow \infty \), uniformly in \(\tau \in {\mathcal {T}}\), where \((W_t:t \ge 0;P)\) is a standard Brownian motion.
Proof
We show that we can replace \({\tilde{S}}_i^{\scriptscriptstyle {({n})}}\) by \(S_i^{\scriptscriptstyle {({n})}}\) in the above probability up to an error that converges to 0 uniformly in \(\tau \) and then invoke Lemma 2.12. Define the event
and note that by Corollary 2.9, there exist constants \( {\tilde{C}}, {\tilde{\gamma }} > 0\) such that
We estimate for any \(c^i_1, c^i_2 \in {\mathbb {R}}\)
where we assume that n is sufficiently large and we used (2.11) together with \(\frac{1}{1-x} \le 1 +2x\) for \(x \in [0,\frac{1}{2}]\). Iterating this estimate yields
and we note that by the choice of \(r_n\) the error converges to 0.
For a lower bound, we estimate
Iterating and using the bound (2.11) gives
where the error converges to 0 by choice of \(r_n\). Then, combining (2.12) and (2.13) together with Lemma 2.12 we can deduce the statement of the lemma. \(\square \)
3 Proofs: Upper Bound
In this section, we fix the start type \(\ell \) of the \({\mathrm {IBRW}}\). Recall that in the killed \({\mathrm {IBRW}}\) every particle x with \(S(x)>0\) is deleted together with its descendants. We denote its survival probability by \(\zeta \), that is, for \(s_0 \le 0\),
Lemma 3.1
For all \(\alpha \in {\mathcal {I}}\), \(s_0 \le 0\), \(n \in {\mathbb {N}}\) and \(b_1,\dots , b_n \ge 0\),
where \(I(j)={\mathbb {P}}_{(s_0,\ell )}^{\alpha }(-b_i \le S_i \le 0 \;\forall \,i\in [j])\) for \(j=0,\dots ,n\), where \(((S_i, \tau _i)_{i \ge 0}, {\mathbb {P}}_{(s_0,\ell )})\) is the Markov chain from Lemma 2.4 started in \((s_0,\ell )\).
Proof
By definition,
For the first summand we use the Markov inequality and Lemma 2.4 to derive
where we used in the last step that by Lemma 2.2\(\inf _{\tau \in {\mathcal {T}}} v^p_{\alpha }(\tau ) = v_{\alpha }(\ell )\). Similarly,
Combining (3.1)–(3.3), now concludes the proof. \(\square \)
Lemma 3.2
For all \(\alpha \in {\mathcal {I}}\) with \(\rho (\alpha ) \ge 1\) and for all \(s_0 \le 0, k,N \in {\mathbb {N}}\) and \((b_j :j \in [kN])\) nonnegative and decreasing,
Proof
Using Lemma 3.1, that \(j \mapsto I(j)\) is decreasing, \(\rho (\alpha )\ge 1\) and that \(j \mapsto e^{-\alpha b_j}\) is increasing to obtain
\(\square \)
For the proof of Theorem 1.1, let \((p_n)_{n \in {\mathbb {N}}}\) be a sequence of retention probabilities with \(p_n \downarrow p_{\mathrm{c}}\). For Theorem 1.2, let \((t_n)_{n \in {\mathbb {N}}}\) be a sequence of parameters with \(t_n \uparrow \infty \). We write,
Moreover, we denote by \(v_n\) the eigenfunction for \(\rho _n(\alpha _n^*)\) from Lemma 2.2. The Markov chain from Lemma 2.4 corresponding to \(\alpha =\alpha _n^*\) and retention parameter \(p_n\) or attachment rule \(f_{t_n}\) is denoted by \(((S_i^{\scriptscriptstyle {({n})}},\tau _i^{\scriptscriptstyle {({n})}}):i \in {\mathbb {N}};{\mathbb {P}})\), i.e. \({\mathbb {P}}={\mathbb {P}}_{(s_0,\ell )}^{\alpha _n^*}\). One easily checks that in the setup of Theorem 1.1 the distribution of the Markov chain does not depend on n.
Finally, we introduce for all \(n \in {\mathbb {N}}\),
Notice, that in the situation of Theorem 1.1, \(\sigma ^2=\rho ''(\alpha ^*)p_{\mathrm{c}}\). The choice \(\alpha =\alpha _n^*\) guarantees that for both theorems, \(\epsilon _n\downarrow 0\) as \(n \rightarrow \infty \).
Lemma 3.3
Let \(N\in {\mathbb {N}}, s_0\le 0\) and \(a,b>0\). Let \(k_n=\lfloor (a/\epsilon _n)^{\frac{3}{2}} N^{-1}\rfloor \) and \(b_i^n(s_0)=a^{-1/2} b(Nk_n-i)^{1/3}-s_0\), \(i \in [Nk_n]\). Then, for all \(C>0\), \(l \in [N]\),
where \(I_n^s(j)={\mathbb {P}}_{(s,\ell )}(-b_i^n(s) \le S_i^{\scriptscriptstyle {({n})}} \le 0\, \forall \, i\in [j])\).
Notice that the specific choice of parameters implies that
Proof
Denote \(g_1(t)=-a^{-1/2}b(\frac{N}{l}-t)^{\frac{1}{3}}\) and let \(\delta >0\). For n sufficiently large, (3.4) yields
By Lemma 2.12 we can apply Theorem 2.10 with \(a_n=(lk_n)^{1/3}\), \(n=lk_n\) and \(g_2(t)=a^{-1/2} (C+\delta ) \frac{l}{N}\), and then take \(\delta \searrow 0\) to derive
Moreover, as \(n \rightarrow \infty \),
Proof of the upper bounds in Theorems 1.1 and 1.2
Since \(\theta (p,f)\) is non-decreasing in retention probability p and attachment rule f, it suffices to consider the asymptotic behaviour along a discrete subsequence. As before, for Theorem 1.1 we take any discrete sequence of retention probabilities \(p_n \downarrow p_{\mathrm{c}}\) and for Theorem 1.2 we take any discrete sequence \(t_n \uparrow \infty \). We make use of the notation introduced in and before Lemma 3.3. In particular, we fix \(N \in {\mathbb {N}}, a>0\) and \(b>0\) and let \(k_n :=\lfloor (a/\epsilon _n)^{\frac{3}{2}} N^{-1}\rfloor \) and \(b_i=b_i (s_0) = b_i^n=a^{-1/2}b(Nk_n-i)^{\frac{1}{3}}-s_0\). Write
Applying Lemma 3.2 with \(\alpha =\alpha _n^*\), we obtain for every \(N \in {\mathbb {N}}\), \(a>0\), \(b>0\), \(C>0\)
Recall that the choice of parameters implies that
Hence, by Lemma 3.3,
Taking N to infinity, we deduce
Now, we consider any \(C < \sqrt{\frac{\pi ^2 \sigma ^2}{2}}\) and we choose b such that
For these choices, we can take a sufficiently large to see that the first term in the above maximisation can be ignored, while in the second one the supremum is achieved at \(x = 0\). This gives the bound
Now, we can let \(b \uparrow \sqrt{\frac{\pi ^2 \sigma ^2}{2}} - C\) to see that for any \(0<C< \sqrt{\frac{\pi ^2\sigma ^2}{2}}\),
To complete the proof take \(\delta = \frac{1}{2} (1 - \alpha ^*) > 0\). Choose \(\ell = \lfloor \alpha ^* \frac{\pi \sigma }{\sqrt{2}\delta } \rfloor \) and note that \(\ell \delta \le \alpha ^* \frac{\pi \sigma }{\sqrt{2}} < \frac{\pi \sigma }{\sqrt{2}}\). Then, we obtain from (3.5) using the monotonicity of \(s \mapsto \zeta _s^n\),
where we used that \(\alpha ^* < 1\). Thus, we can deduce by first taking the limit \(n\rightarrow \infty \) and then \(\delta \downarrow 0\) that
This immediately implies Theorem 1.2 since \(\sigma ^2=\rho ''(\alpha ^*)\). For Theorem 1.1, \(p_{\mathrm{c}}=1/\rho (\alpha ^*)\) implies
so that Theorem 1.1 follows since \(\sigma ^2=\rho ''(\alpha ^*)p_{\mathrm{c}}\). \(\square \)
4 Proofs: Lower Bound
The general strategy of the lower bound is to identify a subtree of the IBRW that has the same distribution as a Galton–Watson tree. For this carefully chosen subtree we then lower bound the survival probability, which in return gives the required lower bound on the survival probability of the IBRW. In Sect. 4.1 we collect some general facts about Galton–Watson trees, which we will then use in Sect. 4.2 to carry out the proof of the lower bound.
4.1 Galton–Watson Lemmas
In order to show the lower bound we construct a Galton–Watson tree, whose particles are a subfamily of the killed branching random walk. To estimate the survival probability of this Galton–Watson tree, we will use the following general lemma due to [9] and we also recall the proof for completeness.
Lemma 4.1
Let \(\mathrm{GW}\) be a Galton–Watson tree and denote by X the number of children in the first generation and by \(q := {\mathbb {P}}( |\mathrm {GW}| < \infty )\) its extinction probability. Then, for all \(r \le \min \{ \frac{1}{8}, q\}\),
Proof
Denote by q the extinction probability of \(\mathrm{GW}\) and by \(s\mapsto g(s)={\mathbb {E}}[s^{X}]\) the generating function of \(\mathrm{GW}\). Then, for every \(r \in [0,q]\),
where we used that \(g'(s)\) is increasing, and \(g'(s)\le 1\) for all \(s\in [0,q_{n,\epsilon }]\). We continue by estimating
where we used for the second summand that \(u\mapsto u e^{-ru}\) is decreasing on \([r^{-1},\infty )\) and \([r^{-2}, \infty ) \subseteq [r^{-1},\infty )\). Then, for all \(r \in (0,\frac{1}{8}]\),
Combining (4.2) and (4.3), we deduce
Rearranging (4.1) and (4.4), we conclude that for all \(r \le \min \{ \frac{1}{8}, q\}\),
as required. \(\square \)
We will also need an estimate which guarantees that a supercritical Galton–Watson process grows exponentially fast on survival with large probability.
Lemma 4.2
For all \(\theta _1>1>\theta _2>0\) there exists \(\delta >0\) such that for any Galton–Watson process \((X_n:n\in {\mathbb {N}})\) with \(X_0=1\) where mean offspring m, offspring distribution \((p_k)_{k \in {\mathbb {N}}_0}\), and extinction probability q satisfy \(q, p_1<\delta \) and \(m>1/\delta \), we have, for sufficiently large n,
Proof
Denote by g the generating function of \(X_1\). By pruning the tree, i.e. removing all finite subtrees, we obtain a tree which on survival of the original process equals a Galton–Watson process \(({\tilde{X}}_n:n\in {\mathbb {N}})\) with \({\tilde{p}}_0=0\), \({\tilde{p}}_1=g'(q) \le p_1+ \frac{2q}{(1-q)^2}\), and the same mean as the original process. Hence \(\delta >0\) can be chosen such that the pruned process has arbitrarily small \({\tilde{p}}_1\) and arbitrarily large mean.
We first show the statement for the pruned process. For an individual v, we denote by \({\tilde{X}}_n(v)\) the number of its offspring in generation n of the pruned Galton–Watson process. Let \(v_0, v_1, \ldots , v_n\) be the first individuals according to the Ulam–Harris labelling in generations 0 up to n. Then, we can bound
where \({\tilde{X}}_n^{\scriptscriptstyle {({i})}}\) is defined as follows: if \(v_{i-1}\) has two or more offspring we set \({\tilde{X}}_n^{\scriptscriptstyle {({i})}} = {\tilde{X}}_n({\tilde{v}}_i)\), where \({\tilde{v}}_i\) is the second offspring of \(v_{i-1}\) and if \(v_{i-1}\) has only one offspring, we set \({\tilde{X}}_n^{\scriptscriptstyle {({i})}} = 0\). In particular, the \({\tilde{X}}_n^{\scriptscriptstyle {({i})}}\) are independent and have distribution
Then, we can calculate
We now choose \(\delta >0\) so small that \({\mathbb {E}}{\tilde{X}}_1>\theta _1^2\) and \({\tilde{p}}_1<\theta _2^2 / 8\). In particular, for sufficiently large n we have \({\mathbb {P}}({\tilde{X}}_{\lfloor n/2 \rfloor } \le \theta _1^n) \le \theta _2^2/8\). Hence, we obtain for n sufficiently large
Note that if we condition \(X_n\) on extinction in distribution it is equal to a Galton–Watson process \(X_n^*\) with mean \(g'(q) = {\tilde{p}}_1\). Therefore, by Markov’s inequality
by the same assumptions on \(\delta \) as above for n large. We can also assume that \(\delta \) is sufficiently small, so that the extinction probability q is less than 1 / 2. Hence, combining the above estimates,
which completes the proof. \(\square \)
4.2 The Lower Bound
Throughout, we use the same notation as in the upper bound: For the proof of Theorem 1.1, let \((p_n)_{n \in {\mathbb {N}}}\) be a sequence of retention probabilities with \(p_n \downarrow p_{\mathrm{c}}\). For Theorem 1.2, let \((t_n)_{n \in {\mathbb {N}}}\) be a sequence of parameters with \(t_n \uparrow \infty \). We denote by \(S^{\scriptscriptstyle {({n})}}\) the positions either in the percolated IBRW or in the IBRW with attachment rule \(f_{t_n}\). If the context is clear, we will omit the superscript. Also, we write,
Moreover, we denote by \(v_n\) the eigenfunction for \(\rho _n(\alpha _n^*)\) from Lemma 2.2.
Finally, we introduce for all \(n \in {\mathbb {N}}\),
Notice, that in the situation of Theorem 1.1, \(\sigma ^2=\rho ''(\alpha ^*)p_{\mathrm{c}}\). The choice \(\alpha =\alpha _n^*\) guarantees by Proposition 2.3 that for both theorems, \(\epsilon _n\downarrow 0\) as \(n \rightarrow \infty \).
Given any starting point \(s\ge 0\) and initial type \(\tau \), we will write \(P = P_{(s,\tau )} = P^{p_n}_{(s,\tau )}\) in the percolation case and \(P = P_{(s,\tau )} = P^{1}_{(s,\tau )}\) in the case of Theorem 1.2.
In view of Lemma 4.1, we will now choose a Galton–Watson tree \(\mathrm{GW}_{n}\) as a subtree of the killed IBRW in the following way, where we denote by \(X^{\scriptscriptstyle {({n})}}\) the number of children in the first generation of \(\mathrm{GW}_{n}\):
-
(a)
\(P(X^{\scriptscriptstyle {({n})}} \ne 0) \approx \) the survival probability of the Galton–Watson process. That is, when there are offspring, then the process usually survives.
-
(b)
\(P(X^{\scriptscriptstyle {({n})}} \ne 0)\) is close to the survival probability of the killed IBRW. That means that we choose the subpopulation as a good approximation of the BRW and that the first inequality in (4.5) is a good estimate.
-
(c)
\(P(1\le X^{\scriptscriptstyle {({n})}} \le r^{-2})\) has to be small to beat \(r^{2}\).
The Galton–Watson tree is obtained by a coarse-graining procedure, which we now describe. It involves positive parameters \(b, \lambda , \theta , M\) which will be chosen carefully at a later stage of the proof. We group together the first \(N+o(N)\) generations in the IBRW to form the first generation in \(\mathrm{GW}_{n}\). It turns out that we have to choose \(N = N_n\) such that
and for the first N steps we only choose particles whose positions are in the interval
To be precise, let \(L(N) = N + \lfloor N^{1/3} \rfloor \) and
Then we define \({\mathcal {C}}_{n}\) to be the particles in the first generation of \(\mathrm{GW}_{n}\). We include the last \(\lfloor N^{1/3} \rfloor \) generations to make sure that if we survive until time N, then we will have many particles by time L(N).
We iterate the procedure, i.e. the children of \(y \in {\mathcal {C}}_n\) will be
and \(\bigcup _{y \in {\mathcal {C}}_{n}} {\mathcal {C}}_{n}(y)\) will form the individuals of the second generation of \(\mathrm{GW}_{n}\) and we continue in a similar way. Note that by the construction of \(I_{i,n}\), we only include children in the second generation of the original IBRW that are to the left of the position of their ancestor. In particular, their distribution does not depend on the type of the parent. Moreover, all the conditions on the spatial positions are relative to \(S(\emptyset )\). Therefore, the distribution of \({\mathcal {C}}_{n}\) does not depend on either the type of the root \(\emptyset \) nor the initial position \(S(\emptyset )\). Similarly, the distribution of \({\mathcal {C}}_{n}(y)\) does not depend on either type nor position of y. Hence, the number of individuals in the different generations really do form a Galton–Watson process.
Moreover, if we assume that
then we have that all particle positions satisfy \(S(x_i) -S(x_0)\le 0 \) for all \(i \le |x|\) and \( x\in {\mathcal {C}}_{n}\) and \(\mathrm{GW}_{n}\) is really a subset of the killed branching random walk.
Coupling with a Galton–Watson process
To control the contribution of the last \(N^{1/3}\) steps of the branching random walk, we use the following coupling: We can couple the IBRW with a modified IBRW, where in generations \(k L(N) + N,\ldots ,k L(N) + L(N)-1\), for \(k \in {\mathbb {N}}_0\), the particles place their offspring according to the following rules relative to their own position:
-
(i)
to the right, the positions of the offspring follow the jump times of the birth process \(Z_t^{\scriptscriptstyle {({f})}}\) started in 0, which jumps from k to \(k+1\) at rate f(k).
-
(ii)
to the left, the positions of the offspring are given by a Poisson point process with intensity
$$\begin{aligned} e^t E[f(Z_{-t}^{\scriptscriptstyle {({f})}})] \mathbb {1}_{(-\infty ,0]}(t) \, dt, \end{aligned}$$where \((Z_t^{\scriptscriptstyle {({f})}})\) is the birth process with jump rate f started in 0.
-
(iii)
Also, in these generations types do not play a role, so we define all particles to have the same type as \(S(\emptyset )\) in the original process.
Now, we define the Galton–Watson \(\underline{\mathrm{GW}}_n\) similarly to above in terms of the modified IBRW, where we again denote by \({\underline{{\mathcal {C}}}}_n\) the individuals in the first generation. Since \((f_{t_k})\) is decreasing, we can couple the processes such that if \(\underline{\mathrm{GW}}_n\) survives then \(\mathrm{GW}_{n}\) survives and also such that \(|{\underline{{\mathcal {C}}}}_n| \le |{\mathcal {C}}_n|\).
For the lower bound on \(P ( {\mathcal {C}}_{n}\ne \emptyset )\) it turns out that it is enough to control the probability of the set
being non-empty. Note that for both, the original process and the modification, the set \({\mathcal {C}}_{n}^{\scriptscriptstyle (\mathrm{asym})}\) is the same, however in the modified process, the next generations \(N+1, \ldots , L(N)\) have the distribution of a single-type branching random walk. In particular, the number of particles in each generation form a Galton–Watson process, which we will denote by \(({\underline{X}}_k)_{k \in {\mathbb {N}}_0} = ({\underline{X}}_k(M))_{k \in {\mathbb {N}}_0}\). Moreover, we will denote by \({\underline{q}} = {\underline{q}}_M\) its extinction probability when started with a single particle. Note that \({\underline{q}}\) does not depend on n and we will use that by increasing M we have that \(\lim _{M \rightarrow \infty } {\underline{q}}_M = 0\) and also \(\lim _{M \rightarrow \infty } E {\underline{X}}_1(M) = \infty \).
By the Markov property, the survival of the two subsets of the killed branching branching walk are related by
The next result is the key step in the overall lower bound, where we will bound the probability that \({\mathcal {C}}_{n}^{\scriptscriptstyle (\mathrm{asym})}\) is non-empty.
Proposition 4.3
For any \(\alpha ^* \theta < 1\) and any \(\lambda ^2 > \frac{\pi ^2 \sigma ^2}{2 b (1- \alpha ^*\theta )}\), we have for any initial position \(s \ge 0\) and initial type \(\tau \in {\mathcal {T}}\),
Proof
By construction the probability of the event \({\mathcal {C}}_{n}^{\scriptscriptstyle (\mathrm{asym})}\ne \emptyset \) does not depend on the starting point of the initial point nor its type, so we can assume that \(S^{\scriptscriptstyle {({n})}}(\emptyset ) = 0\) and \(\tau (\emptyset ) = 0\).
For the first part of the proof, we will omit the indices n, whenever the context is clear and we are not dealing with asymptotic statements. In particular, we will write \(N = N_n, S = S^{\scriptscriptstyle {({n})}}\), \(\alpha = \alpha _n\), etc. Also, in this proof \(\rho = \rho ^{p_n}\) in the percolation case and \(\rho = \rho _{t_n}\) if the attachment rule is varying.
The first step is to carefully, select the relevant particles in \({\mathcal {C}}_{n}^{\scriptscriptstyle (\mathrm{asym})}\). For \(R_n = e^{N^{1/4}}\) and \(M_n = N^{1/5}\) and an individual x, we write
Recall that
For any \(|x| = N\), let
Define
The Paley–Zygmund inequality yields
The remaining proof proceeds as follows: in the first step we find an easier upper bound on \(E[Y_n^2]\), which we can estimate using the many-to-one lemma in the second step. In Step 3, we find a lower bound on \(E[Y_n]\), which we will then combine with the other steps to obtain our claim.
Step 1: Upper bound on \(E[Y_n^2]\). First write
We split the sum over y according to the last time \(j \in \{ 0, \ldots , N\}\) that the ancestors of y agree with x to obtain
Conditioning on the \(j+1\) first generations and using the independence of the branching process and the fact that on \(\{ {\bar{\nu }}_{x_j} \le R_n\}\), we only have to consider at most \(R_n\) relevant siblings of \(x_{j+1}\), we obtain the upper bound
where \(h_{N,n} = 1\) and for \(j \le N-2\),
In particular, we have shown that
leading to
Step 2: Upper bound on \(h_{j,n}\). First, we note since the left, resp. right, end point of \(I_{k,n}\) are to the left of the left, resp. right, end point of \(I_{j,n}\) for \(k \ge j\), we can apply the monotonicity in the initial types to deduce that
Thus, we obtain by the many-to-one Lemma 2.4
By the monotonicity in types, Lemma 2.2, we have that \(v_\alpha (\tau _0) / v_\alpha (\tau _N) \le v_\alpha (0) / v_\alpha (\ell )\) and further by Proposition 2.3, \(C := \sup _{n} \frac{v^{\scriptscriptstyle {({n})}}_{\alpha _n}(0)}{v^{\scriptscriptstyle {({n})}}_{\alpha _n}(\ell )}< \infty \), so that
We now would like to apply Mogulskii’s theorem to estimate the probability, but we need a bound that works uniformly for all j (since we will be summing over j) and uniformly in u. We thus approximate the sum over j by a finite sum and also split up the interval \([-\lambda N^{1/3},0]\) into smaller subintervals. So fix \(\kappa \in {\mathbb {N}}\) and define
For the next estimate, suppose that \(j \in [ (j' -1) K , j' K - 1]\) for some \(j' \in \{1, \ldots , \kappa -1 \}\) and assume \(u \in [- q \lambda N^{1/3}/ \kappa , - (q-1) \lambda N^{1/3}/\kappa ]\) for some \(q \in \{1, \ldots , \kappa \}\). Then,
where we assumed that n is sufficiently large so that
By Lemma 2.12 we can apply Mogulskii’s theorem, Theorem 2.10 with \(g_1(t) = \frac{(q-1 - \kappa )\lambda }{\kappa } - \theta b \frac{\kappa - j'}{\kappa } t\) and \(g_2(t) = - \theta b \frac{\kappa - j' - 1}{\kappa } t + \frac{q \lambda }{\kappa }\).
where \(\sigma \) is defined in (4.6) and we noticed that the integral does not depend on q nor \(j'\).
Hence, applying the display (4.11) , where we use the definition of \(\epsilon _n= \rho (\alpha )\), and also that \(\alpha _n \theta < 1\) (since \(\alpha ^*\theta < 1\), for all n sufficiently large, uniformly in \(j \in [(j'-1)K, j'K-1]\), we find that
Note that \(C(\kappa ) \rightarrow \frac{\pi ^2\sigma ^2}{2\lambda ^2}\) as \(\kappa \rightarrow \infty \). However, by assumption we have that \(\frac{\pi ^2\sigma ^2}{2\lambda ^2} < b(1-\alpha ^* \theta )\), so we can assume that \(\kappa \) is large enough such that \(C(\kappa ) < b (1 - \alpha ^* \theta )\). In this case, we can deduce that
Moreover, if \(j \in [(\kappa - 1)K+1, N]\), then we can use the trivial bound
Consequently, if we combine (4.12) and (4.13) we obtain
Finally, letting \(\kappa \rightarrow \infty \) gives
Step 3: Lower bounding \(E [ Y_n]\). Let \(\theta _1 > \theta \), then if we set \(\Delta S_i = S_i - S_{i-1}\) and \({\bar{\nu }}_i = \nu _i((-\infty ,M_n))\), we have by Lemma 2.5 that
where we also used that by the monotonicity of types, see Lemma 2.2, \(v_\alpha (0) \ge v_\alpha (s)\) for any \(s \in {\mathcal {T}}\).
Define the Markov chain \(({\tilde{S}}_k, {\tilde{\tau }}_k, {\tilde{\nu }}_{k-1})_{k \in {\mathbb {N}}}\) with associated filtration \(( {\mathcal {F}}_k)\) and transitions given for any measurable F by
Then, we can continue the previous display as
Note that
By Lemma 2.13 we can apply Mogulskii’s Theorem 2.10 also to \(({\tilde{S}}_k)_{k \in {\mathbb {N}}_0}\) with \(g_1(t) = -\theta b t - \lambda \), \(g_2(t) = -\theta b t\), so that we get
We estimate the last term in (4.14) by deducing from Corollary 2.9 that there exist \({\tilde{C}}, {\tilde{\gamma }} > 0\) such that
so that in particular,
Combining (4.15) and (4.16), we get from (4.14) if we recall the definition of \(\epsilon _n\) that
Step 4: Combining the estimates.
Combining the upper bound on \(\sum _{j=1}^N h_{j,n}\) with the lower bound on \(E[Y_n]\) as well as the fact that \(R_n = e^{N^{1/4}}\), we obtain from the second moment bound (4.10),
Finally, we can let \(\theta _1 \downarrow \theta \) to obtain the claim of the proposition. \(\square \)
Proof of Theorems 1.1 and 1.2—lower bounds
By the same argument as at the end of the proof of the upper bound, we have completed the proofs of Theorems 1.1 and 1.2 if we can show that if \(\zeta _n\) is either \(\theta (p_n,f)\) or \(\theta (1,f_{t_n})\), then
In our above construction, we first of all choose the constant M large enough such that the Galton–Watson process \(({\underline{X}}_k(M))_{k \in {\mathbb {N}}_0}\) satisfies the assumptions of Lemma 4.2 for \(\theta _1 = 2\) and \(\theta _2 = 1/2\). Also, we can assume that its survival probability satisfies \(1- {\underline{q}}_M \ge 1/2\). Let \(\theta \) be such that \(\alpha ^* \theta < 1\). Then, choose b large enough such that \(\theta b > M\), so that in particular \(\mathrm{GW}_{n}\) is a subset of the killed IBRW, cf. (4.7). Additionally, we require that
By Proposition 4.3 we obtain that for any \(\lambda > \sqrt{ \frac{\pi ^2 \sigma ^2}{2 b (1- \alpha ^*\theta )}} \)
Now, by (4.18) we have that
so we can additionally assume that \(\lambda \) is small enough such that
Then, if we define
we obtain by (4.19) that
We will use the following general fact (see [9, Fact 4.2], but also [10, Lemma 5.2]): let \(X_1, \ldots , X_k\) be independent non-negative random variables and suppose \(F: (0,\infty ) \rightarrow [0,\infty )\) is non-increasing, then
We will eventually apply Lemma 4.1 to \(\underline{\mathrm{GW}}_n\) and thus first estimate \(P( 1 \le |{\underline{{\mathcal {C}}}}_n | \le r^{-2} )\). We write \({\underline{S}}\) for the positions in the modified branching random walk used in the definition of \(\underline{\mathrm{GW}}_n\). For all sufficiently large n, by using (4.20) in the second step, we obtain
where we used Lemma 4.2 for the last inequality.
Finally, we can lower bound the survival probability of the killed IBRW by the survival probability of \(\underline{\mathrm{GW}}_n\), where we recall that the distribution of \(\underline{\mathrm{GW}}\) does not depend on the initial position and the initial type. Also, we deduce from the upper bound shown in Sect. 3 that necessarily \(P ( |\underline{\mathrm{GW}}_n| = \infty ) \rightarrow 0\), so that the assumption \(r \le \min \{ \frac{1}{8}, P (|\underline{\mathrm{GW}}_n|< \infty ) \}\) is satisfied for large n and we can apply Lemma 4.1 to obtain
Hence, we get from (4.19) that
Finally, letting \(\lambda \downarrow \sqrt{ \frac{\pi ^2 \sigma ^2}{2 b (1- \alpha \theta )}} \) and noting that \(\epsilon _n = b N^{-2/3}(1+o(1))\), we obtain
Hence, (4.17) follows by letting \(\theta \downarrow 0\). \(\square \)
5 Proofs in the Linear Case
In this section, we show how to deduce Corollary 1.3 from our general result, Theorem 1.2.
Proof of Corollary 1.3
In the proof of Proposition 1.3 in [7] it was shown that for linear attachment functions, the spectral radius of \(A_{\alpha }\) equals the largest eigenvalue of
This eigenvalue is given by
In particular, \(\alpha ^*=\frac{1}{2}\) regardless of the choice of \(\gamma \in [0,\frac{1}{2})\) and \(\beta \in (0,1]\), and
In order to apply Theorem 1.2, we need to determine \(\rho ''(\alpha ^*)\). To this end, we write \(\Theta (\alpha )\) for the large squared bracket in (5.1) and
Then
We need the second derivative only for \(\alpha =\alpha ^*=\frac{1}{2}\). Since after applying the product rule, any term multiplied by \((2\alpha -1)\) vanishes, we obtain
The critical values \(\beta _{\mathrm{c}}(\gamma )\) and \(\gamma _{\mathrm{c}}(\beta )\) are chosen such that \(\rho (\alpha ^*)>1\) if and only if \(\beta >\beta _{\mathrm{c}}(\gamma )\) or \(\gamma >\gamma _{\mathrm{c}}(\beta )\). This implies
Clearly,
One easily checks that
i.e. \(\beta _{\mathrm{c}}(\gamma _{\mathrm{c}}(\beta ))=\beta \). We write \(\rho (\alpha ;\beta )\) or \(\rho (\alpha ;\gamma )\) to emphasize the dependence on \(\beta \) or \(\gamma \), respectively. By (5.3), (5.4) and (5.5),
By the continuity of \(\rho \) and its derivatives in \(\beta \) and \(\gamma \), we obtain \(\sigma ^2=2/\beta _{\mathrm{c}}(\gamma )\) for the convergence \(\beta \downarrow \beta _{\mathrm{c}}(\gamma )\) and \(\sigma ^2=2/\beta \) for \(\gamma \downarrow \gamma _{\mathrm{c}}(\beta )\). Moreover, two Taylor expansions yield for \(\beta \downarrow \beta _{\mathrm{c}}\)
The derivative is given by
For the corresponding derivative with respect to \(\gamma \) we use (5.6) to derive
Repeating the argument of (5.7) for \(\gamma \) instead of \(\beta \), Theorem 1.2 yields
Notes
There is a difference in the set-up of the LCD model and our model, as the former uses a fixed number of connections for every new vertex. This technical difference does not affect the results beyond the form of the constants involved.
References
Bérard, J., Gouéré, J.-B.: Survival probability of the branching random walk killed below a linear boundary. Electron. J. Probab. 16(14), 396–418 (2011)
Billingsley, P.: Convergence of Probability Measures. Wiley Series in Probability and Statistics: Probability and Statistics, 2nd edn. Wiley, New York (1999)
Bollobás, B., Riordan, O.: Slow emergence of the giant component in the growing $m$-out graph. Rand. Struct. Algorithms 27(1), 1–24 (2005)
Bollobás, B., Janson, S., Riordan, O.: The phase transition in the uniformly grown random graph has infinite order. Rand. Struct. Algorithms 26(1–2), 1–36 (2005)
Cohen, R., ben-Avraham, D., Havlin, S.: Percolation critical exponents in scale-free networks. Phys. Rev. E 66, 036113 (2002)
Dereich, S., Mörters, P.: Random networks with sublinear preferential attachment: degree evolutions. Electron. J. Probab. 14, 1222–1267 (2009)
Dereich, S., Mörters, P.: Random networks with sublinear preferential attachment: the giant component. Ann. Probab. 41(1), 329–384 (2013)
Durrett, R.: Probability: Theory and Examples. Wadsworth, Pacific Grove (1991)
Gantert, N., Hu, Y., Shi, Z.: Asymptotics for the survival probability in a killed branching random walk. Ann. Inst. Henri Poincaré Probab. Stat. 47(1), 111–129 (2011)
Hu, Y., Shi, Z.: Minimal position and critical martingale convergence in branching random walks, and directed polymers on disordered trees. Ann. Probab. 37(2), 742–789 (2009)
Itô, K., McKean Jr., H.P.: Diffusion Processes and Their Sample Paths. Springer, Berlin (1974)
Kato, T.: Perturbation Theory for Linear Operators. Classics in Mathematics. Springer, Berlin (1995)
Last, G., Penrose, M.: Lectures on the Poisson Process. IMS Textbook. Cambridge University Press, Cambridge (2017)
Lyons, R.: A simple path to Biggins’ martingale convergence for branching random walk. In: Classical and Modern Branching Processes. The IMA Volumes in Mathematics and its Applications, vol. 84, pp. 217–221. Springer, New York (1997)
Mogulskii, A.A.: Small deviations in the space of trajectories. Theory Probab. Appl. 19, 726–736 (1974)
Pinsky, R.G.: Positive Harmonic Functions and Diffusion. Cambridge Studies in Advanced Mathematics, vol. 45. Cambridge University Press, Cambridge (1995)
Riordan, O.: The small giant component in scale-free random graphs. Comb. Probab. Comput. 14(5–6), 897–938 (2005)
Roberts, M.I.: Spine changes of measure and branching diffusions. PhD thesis, University of Bath (2009). http://people.bath.ac.uk/mir20/research.html
Author information
Authors and Affiliations
Corresponding author
Appendix: Proof of Mogulskii’s Theorem
Appendix: Proof of Mogulskii’s Theorem
This section is devoted to the proof of Theorem 2.10.
Proof
The proof is an adaptation of the proof presented in [9]. We give a detailed proof of the upper bound. The changes for the lower bound are similar.
Let \(N=\lfloor \frac{k_n}{r_n}\rfloor \) and \(m_N=k_n\), \(m_k=k r_n\) for \(0 \le k \le N-1\). The Markov property implies that
Since \(g_1\) and \(g_2\) are bounded, for every \(\delta >0\) there exists \(K \in {\mathbb {N}}\) such that \([g_1(\frac{m_{k-1}}{k_n}),g_2(\frac{m_{k-1}}{k_n})] \subseteq [-K\delta , (K-1)\delta ] =\bigcup _{j=-K}^{K-1} [j\delta ,(j+1)\delta ]\). Continuity of \(g_1\) and \(g_2\) further guarantees the existence of \(A=A(\delta )>0\) such that .
Let \(J_{l}=(\frac{(l-1)(N-2)}{A}+1,\frac{l(N-2)}{A}+1] \cap {\mathbb {N}}\) for \(j\in [A]\). We show that for sufficiently large n, for all \(i \in [r_n]\) and \(l \in [A]\)
Since \(\frac{k_n}{r_n} - (N-1) \in [1,2]\), we have
Since \(\lim _{n \rightarrow \infty }r_n/k_n = 0\), this is satisfied for large n. For the other direction, we use that \(\frac{r_n}{k_n} \le \frac{1}{N}\) to see that
The last inequality holds since \(\frac{r_n}{k_n} \rightarrow 0\). The small k-values are needed later for the proof of the lower bound. For \(k \in J_l\), (A.1) implies that for all \(x \in [j \delta , (j+1)\delta ]\)
Since \(\bigcup _{l=1}^A J_l =(1,N-1]\cap {\mathbb {N}}\), we derive
The assumed uniform convergence implies that for \(n \rightarrow \infty \)
The right-hand side can be estimated by (see for example [11, p. 31] or [9, Eq. (5.4)])
Since there are only finitely many \((j,l) \in [-K,K) \cap {\mathbb {N}}\times \{1,\ldots ,A\}\), the convergence and the bound hold uniformly in (j, l). Moreover,
Now we collected everything needed to derive
Since \(g_1\) and \(g_2\) are continuous functions, taking \(A\rightarrow \infty \) yields
Now we can take \(\delta \rightarrow 0\) to establish the claim.
Sketch of the lower bound: Choose a continuous function \(g:[0,1] \rightarrow {\mathbb {R}}\) such that \(g_1(t)< g(t) <g_2(t)\) for all \(t \in [0,1]\). Since it suffices to prove the lower bound for b small, we can assume that \(g(1) \ge g_2(1) -b\). Then, let \(\delta >0\) be such that \(g(t)-g_1(t)>3\delta \) and \(g_2(t)-g(t) >9\delta \) for all \(t\in [0,1]\). Moreover, A is chosen large enough that
Choose \(N=\lfloor \frac{k_n}{r_n}\rfloor \), \(m_N=k_n\) and \(m_k=k r_n\) for \(0 \le k \le N-1\). Writing \(y_k=g(\frac{m_k}{k_n})\) for \(1\le k \le N\), the Markov property implies
where for \(1 \le k \le N\), \(y \in {\mathbb {R}}\), and \(\tau \in {\mathcal {T}}\)
Now the choice of parameters implies that for n sufficiently large
where \(p_{N,n}^1\) is the infimum over all \(\tau \in {\mathcal {T}}\) of \(p_{N,n}(y_{k-1},\tau )\) where \(\beta _{i,N,n}\) is replaced by \(\beta _{i,N,n}-3 \delta \) and the same for \(p_{N,n}^2\) with \(\alpha _{i,N,n}\) replaced by \(\alpha _{i,N,n}-3 \delta \) and \(\beta _{i,N,n}\) by \(\beta _{i,N,n}-6\delta \). Moreover,
The claim now follows with the same arguments as in [9] because of the assumed convergence of the probabilities. \(\square \)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Eckhoff, M., Mörters, P. & Ortgiese, M. Near Critical Preferential Attachment Networks have Small Giant Components. J Stat Phys 173, 663–703 (2018). https://doi.org/10.1007/s10955-018-2054-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10955-018-2054-5
Keywords
- Scale-free network
- Preferential attachment
- Barabási-Albert model
- Percolation
- Killed branching random walk
- Survival probability