Abstract
Let \(X_1, X_2, \ldots \) be i.i.d. copies of some real random variable X. For any deterministic \(\varepsilon _2, \varepsilon _3, \ldots \) in \(\{0,1\}\), a basic algorithm introduced by H.A. Simon yields a reinforced sequence \(\hat{X}_1, \hat{X}_2 , \ldots \) as follows. If \(\varepsilon _n=0\), then \( \hat{X}_n\) is a uniform random sample from \(\hat{X}_1, \ldots , \hat{X}_{n1}\); otherwise \( \hat{X}_n\) is a new independent copy of X. The purpose of this work is to compare the scaling exponent of the usual random walk \(S(n)=X_1+\cdots + X_n\) with that of its step reinforced version \(\hat{S}(n)=\hat{X}_1+\cdots + \hat{X}_n\). Depending on the tail of X and on asymptotic behavior of the sequence \((\varepsilon _n)\), we show that step reinforcement may speed up the walk, or at the contrary slow it down, or also does not affect the scaling exponent at all. Our motivation partly stems from the study of random walks with memory, notably the socalled elephant random walk and its variations.
Introduction
In 1955, Herbert A. Simon [24] introduced a simple reinforcement algorithm that runs as follows. Consider a deterministic sequence \((\varepsilon _n)\) in \(\{0,1\}\) with \(\varepsilon _1=1\). The nth step of the algorithm corresponds to an innovation if \(\varepsilon _n=1\), and to a repetition if \(\varepsilon _n=0\). Specifically, denote the number of innovations after n steps by
and let also \(X_1, X_2, \ldots \) denote a sequence of different items (in [24], these items are words). One constructs recursively a random sequence of items \(\hat{X}_1, \hat{X}_2, \ldots \) by deciding that \(\hat{X}_n= X_{\sigma (n)}\) if \(\varepsilon _n=1\), and that \(\hat{X}_n=\hat{X}_{U(n)}\) if \(\varepsilon _n=0\), where U(n) is random with the uniform distribution on \([n1]=\{1, \ldots , n1\}\) and \(U(2), U(3), \ldots \) are independent.
Simon was especially interested in regimes where either \(\varepsilon _n\) converges in Césaro’s mean to some limit \(q\in (0,1)\), which we will refer to as steady innovation with rate q, or \(\sigma (n)\) grows like^{Footnote 1}\(n^\rho \) for some \(\rho \in (0,1)\) as \(n\rightarrow \infty \), which we will call slow innovation with exponent \(\rho \). By analyzing the frequencies of words with some fixed number of occurrences, he pointed out that these regimes yield a remarkable oneparameter family of power tail distributions, that are known nowadays as the Yule–Simon laws and arise in a variety of empirical data. This is also closely related to preferential attachment dynamics, see e.g. [10] for an application to the World Wide Web. Clearly, repetitions in Simon’s algorithm should be viewed as a linear reinforcement, the probability that a given item is repeated being proportional to the number of its previous occurrences.
In the present work, the items \(X_1, X_2, \ldots \) are i.i.d. copies of some real random variable X, which we further assume to be independent of the uniform variables \(U(2), U(3), \ldots \). Note that although the variable \(\hat{X}_n\) has the same distribution as X for every \(n\ge 0\), the reinforced sequence \((\hat{X}_n)\) is not stationary; it can also be seen that its tail sigmafield is not even independent of \(\hat{X}_1\). Picking up on a key question in the general area of reinforced processes (see notably the survey [21] by Pemantle, and also some more recent works [2, 14, 15, 18, 22] and references therein), our purpose is to analyze how reinforcement affects the growth of partial sums. Specifically, we write
for the usual random walk with step distribution X, and
for its reinforced version, and we would like to compare \(\hat{S}(n)\) and S(n) when \(n\gg 1\). The main situation of interest is when S has a scaling exponent \(\alpha \in (0,2]\), in the sense that
where Y denotes an \(\alpha \)stable variable. Recall that this holds if and only if the typical step X belongs to the domain of normal attraction (without centering) of a stable distribution, in the terminology of Gnedenko and Kolmogorov [16]. We shall refer to (1) as an instance of \(\alpha \)diffusive asymptotic behavior, the usual diffusive situation corresponding to \(\alpha = 2\).
The asymptotic behavior of the stepreinforced random walk \(\hat{S}\) has been considered previously in the literature when \(\varepsilon _2, \varepsilon _3, \ldots \) are random and given by i.i.d. samples of the Bernoulli law with parameter \(q\in (0,1)\). This is of course a most important case of a steady regime with innovation rate q a.s. It has been shown recently in [8] that when \(X\in L^2(\mathbb {P})\), the asymptotic growth of \(\hat{S}\) exhibits a phase transition at \(q_c=1/2\). Specifically, assuming for simplicity that X is centered, then on the one hand for \(q<1/2\), there is some nondegenerate random variable V such that
In other words, (2) shows that for \(q<1/2\), \(\hat{S}\) has scaling exponent \(\hat{\alpha }=1/(1q)\), or equivalently, grows with exponent \(1/\hat{\alpha }\), and in particular is superdiffusive since \(1/\hat{\alpha }> 1/2\). On the other hand, the stepreinforced random walk remains diffusive for \(q>1/2\), in the sense that \( n^{1/2} \hat{S}(n)\) converges in law to some Gaussian variable. This phase transition was first established when X is a Rademacher variable, i.e. \(\mathbb {P}(X=1)=\mathbb {P}(X=1)=1/2\). Indeed, Kürsten [20] observed that \(\hat{S}\) is then a version of the socalled elephant random walk, a nearest neighbor process with memory which was introduced by Schütz and Trimper [23] and has then raised much interest. The description of the asymptotic behavior of the elephant random walk has motivated many works, see notably [3, 5, 6, 12, 13, 19].
Further, when the typical step X has a symmetric stable distribution with index \(\alpha \in (0,2]\), \(\hat{S}\) is the socalled shark random swim, which has been studied in depth by Businger [11]. Its large time asymptotic behavior exhibits a similar phase transition for \(\alpha >1\), now for the critical parameter \(q_c=11/\alpha \). When \(\alpha \le 1\), there is no such phase transition and \(\hat{S}\) has the same scaling exponent \(\alpha \) as S. See also [7] for related results in the setting of Lévy processes.
The results that we just recalled suggest that, more generally, for any steady innovation regime and any typical step X belonging to the domain of normal attraction of an \(\alpha \)stable distribution (i.e. such that (1) is fulfilled), then the following should hold. First, for \(\alpha \in (0,1)\), the random walk S and its stepreinforced version \(\hat{S}\) should have the same scaling exponent \(\hat{\alpha }=\alpha \), independently of the innovation rate. Second, for \(\alpha \in (1,2]\), if the innovation rate q is larger than \(q_c=11/\alpha \), then again the scaling exponent of S and \(\hat{S}\) should coincide, whereas if \(q<11/\alpha \), then the super\(\alpha \)diffusive behavior (2) should hold and \(\hat{S}\) should thus have scaling exponent \(\hat{\alpha }=1/(1q)<\alpha \). We shall see in Theorems 2 and 4 that this guess is indeed correct. In particular, the weaker the innovation (or equivalently the stronger the reinforcement), the faster the stepreinforced random walk \(\hat{S}\) grows.
An informal explanation for this phase transition is as follows. When \(\alpha \in (1,2]\), the \(\alpha \)diffusive behavior (1) of S relies on some kind of balance between its positive and negative steps (recall that X must be centered, i.e. \(\mathbb {E}(X)=0\)). The reinforcement effect of Simon’s algorithm for sufficiently small innovation rates q yields certain steps to be repeated much more often than the others, up to the point that this balance is disrupted. More precisely, we shall see that in a steady regime with innovation rate q, the maximal number of repetitions of a same item up to the nth step of Simon’s algorithm grows with exponent \(1q\). For \(q>q_c=11/\alpha \), this is smaller than the growth exponent \(1/\alpha \) of S and repetitions have only a rather limited impact on the asymptotic behavior of \(\hat{S}\). At the opposite, for \(q<q_c\), some increments have been repeated much more often and the growth of \(\hat{S}\) is then rather governed by the latter, yielding (3).
We now turn our attention to regimes with slow innovation. Extrapolating from the steady regime, we might expect that reducing the innovation should again speed up the stepreinforced random walk. This intuition turns out to be wrong, and we will see that at the opposite, in slow regimes, diminishing the innovation actually slows down the walk. More precisely, there is another phase transition when \(\alpha \in (0,1)\), occurring now for the critical innovation exponent \(\rho _c=\alpha \). Specifically, if \(\rho <\alpha \), then we shall see in Theorem 1 that \(\hat{S}\) has always scaling exponent \(\hat{\alpha }=1\) (i.e. a ballistic asymptotic behavior which contrasts with the growth with exponent \(1/\alpha >1\) for S), whereas for \(\rho >\alpha \), we will see in Theorem 3 that \(\hat{S}\) has rather scaling exponent \(\hat{\alpha }=\alpha /\rho >\alpha \), that it grows now with exponent \(1/\hat{\alpha }=\rho /\alpha >1\), but nonetheless still significantly slower than S. On the other hand, when \(\alpha \ge 1\), there is no phase transition for slow innovation regimes and \(\hat{S}\) has always scaling exponent \(\hat{\alpha }=1\).
This apparently surprising feature can be explained informally as follows. As it was argued above, for \(\alpha \in (1,2]\), \(\mathbb {E}(X)=0\) and the superdiffusive regime (2) results from the disruption of the balance between positive and negative steps when certain steps are repeated much more than others. At the opposite, for \(\alpha \in (0,1)\), the typical step X has a heavy tail distribution with \(\mathbb {E}(X)=\infty \). In this situation, it is wellknown that for \(n\gg 1\), S(n) has roughly the same size as its largest step up to time n, \(\max \{X_i: 1\le i \le n\}\). Regimes with slow innovation delay the occurence of rare events at which steps are exceptionally large. Therefore they induce a slow down effect for the stepreinforced random walk, up to the point that when the innovation exponent drops below a critical value, \(\hat{S}\) has merely a ballistic growth. This aspect will be further discussed quantitatively in Sect. 5.
A somewhat simpler version of the main results of our work are summarized in Fig. 1. It expresses the scaling exponent \(\hat{\alpha }\) of \(\hat{S}\) in terms of the scaling exponent \(\alpha \in (0,2]\) of S and the innovation parameter \(\rho >0\). The slow regime corresponds to \(\rho \in (0,1)\) and \(\rho \) is then the innovation exponent as usual. The steady regime corresponds to \(\rho >1\), and then the rate of innovation is given by \(q=11/\rho \). This new parametrization for steady regimes of innovation may seem artificial; nonetheless we stress that the same is actually used for the definition of the oneparameter family of Yule–Simon distributions; see Lemma 3.
The cornerstone of our approach is provided by Lemma 2, where we observe that the process that counts the number of occurrences of a given item in Simon’s algorithm can be turned into a square integrable martingale. The latter is a close relative to another martingale that occurs naturally in the setting of the elephant random walk; see [6, 12, 13, 19], among others. The upshot of Lemma 2 is that this yield useful estimates for these numbers of occurrences and their asymptotic behaviors, which hold uniformly for all items.
The plan for the rest of this article is as follows. Section 2 is devoted to preliminaries on the stable central limit theorem, on martingales induced by occurrence counting processes in Simon’s algorithm, and on the Yule–Simon distributions. We state and prove our main results in Sects. 3 and 4. Finally, several comments are given in Sect. 5.
Preliminaries
Given two sequences a(n) and b(n) of positive real numbers, it will be convenient to use the following notation throughout this work:
Background on the stable central limit theorem
We assume in this section that the step distribution belongs to the domain of normal attraction (without centering) of some stable distribution, i.e. that (1) holds for some \(\alpha \in (0,2]\). The Cauchy case \(\alpha =1\) has some peculiarities and for the sake of simplicity, it will be ruled out from time to time. We present some classical results in this framework that will be useful later on.
We start by recalling that for \(\alpha =2\), (1) holds if and only if X is centered with finite variance; see Theorem 4 on p. 181 in [16]. For \(\alpha \in (0,1)\), (1) is equivalent to
for some nonnegative constants \(c_+\) and \(c_\) with \(c_++c_>0\). Finally, for \(\alpha \in (1,2)\), (1) holds if and only if the same as above is fulfilled and furthermore X is centered. See Theorem 5 on p. 1812 in [16].
We denote the characteristic function of X by
and the characteristic exponent of the stable variable Y by \(\varphi _{\alpha }\), that is \(\varphi _{\alpha }: \mathbb {R}\rightarrow \mathbb {C}\) is the unique continuous function with \(\varphi _{\alpha }(0)=0\) such that
In particular, \(\varphi _{\alpha }\) is homogeneous with degree \(\alpha \) in the sense that
In this setting, (1) can be expressed classically as
but we shall rather use a logarithmic version of (3).
Pick \(r>0\) sufficiently small so that \(1\Phi (\theta )<1\) whenever \(\theta \le r\), and then define \(\varphi :[r,r]\rightarrow \mathbb {C}\) as the continuous determination of the logarithm of \(\Phi \) on \([r,r]\), i.e. the unique continuous function with \(\varphi (0)=0\) and such that \(\Phi (\theta )= \exp (\varphi (\theta ))\) for all \(\theta \in [r,r]\). Theorem 2.6.5 in Ibragimov and Linnik [17] entails that (3) can be rewritten in the form
We stress that the parameter t in (4) is real, whereas n in (3) is an integer, and as a consequence, we have also that
Martingales in Simon’s algorithm
Recall Simon’s algorithm from the Introduction, and in particular that \(\sigma (n)\) stands for the number of innovations up to the nth step. In this work, we will be mostly concerned with the cases where either the sequence \(\sigma (\cdot )\) is regularly varying with exponent \(\rho \in (0,1)\), that is
or
It is easily checked that (6) implies \(\sigma (n)\sim qn\), and conversely, (6) holds whenever \(\sigma (n)/n=q+ O(\log ^{\beta } n)\) for some \(\beta >1\). We refer to (5) as the slow regime with innovation exponent \(\rho \in (0,1)\), and to (6) as the steady regime with innovation rate \(q\in (0,1)\). Often, it is convenient to set \(\rho =1/(1q)\) for \(q\in (0,1)\) and then view \(\rho \in (0,1)\cup (1,\infty )\) as a parameter for the innovation, with \(\rho >1\) corresponding to steady regimes.
Several of our results however rely on much weaker assumptions; in any case we shall always assume at least that the total number of innovations is infinite and that the number of repetitions is not sublinear, i.e.
Simon’s algorithms induces a natural partition of the set of indices \(\mathbb {N}=\{1,2,\ldots \}\) into a sequence of blocks \(B_1, B_2, \ldots \), where
In words, \(B_j\) is the set of steps of Simon’s algorithm at which the jth item \(X_j\) is repeated. We consider for every \(n\in \mathbb {N}\) the restriction of the preceding partition to \([n]=\{1, \ldots , n\}\) and write
plainly \(B_j(n)\) is nonempty if and only if \(j\le \sigma (n)\). Last, we set
for the number of elements of \(B_j(n)\), and arrive at the following basic expression for the stepreinforced random walk:
where the \(X_j\) are i.i.d. copies of X, and further independent of the random coefficients \(B_j(n)\).
The identity (8) incites us to investigate the asymptotic behavior of the coefficients \(B_j(n)\). In this direction, we introduce the quantities
and the times of innovation
We stress that these quantities are deterministic, since the sequence \((\varepsilon _n)\) is deterministic.
We start with a simple lemma:
Lemma 1
The following assertions hold:

(i)
Assume that \(\sigma (n) = O(n^{\rho })\) for some \(\rho <1\). Then \(\pi (n)\approx n\).

(ii)
Assume (6); then \(\pi (n)\approx n^{1q}\).

(iii)
Assume (7); then the series \(\sum _{n=1}^{\infty } 1/(n\pi (n))\) converges.
Proof
We have from the definition of \(\pi (n)\) that
Next we observe by summation by parts that
Assume first \(\sigma (n) =O( n^{\rho })\) for some \(\rho <1\). Then \(\sum _{j=2}^{\infty }\sigma (j)j^{2}<\infty \), which yields
and (i) follows.
Next, when (6) holds, we write
The second series in the righthand side converges absolutely; as a consequence,
and (ii) follows.
Finally, assume (7). There is \(a<1\) such that \(\sigma (k)\le a k\) for all k sufficiently large. It follows that there is some \(b>0\) such that for all n,
We conclude that \(1/(n\pi (n))=O(n^{a2})\), which entails the last claim. \(\square \)
The next result determines the asymptotic behavior of the sequences \(B_j(\cdot )\) for all \(j\in \mathbb {N}\), and will play therefore a key role in our analysis.
Lemma 2
Assume (7). For every \(j\in \mathbb {N}\), the process started at time \(\tau (j)\),
is a square integrable martingale. We denote its terminal value by
and have
Proof
The martingale property is immediate from Simon’s algorithm. More precisely, for any \(n\ge \tau (j)\), we have \(\pi (n+1)=\pi (n)\) and \(B_j(n+1)=B_j(n)\) when \(\varepsilon _{n+1}=1\) (by innovation), whereas when \(\varepsilon _{n+1}=0\), we have \(\pi (n+1)=\pi (n)(1+1/n)\) and further (by reinforcement)
and
where \(({\mathcal F}_n)_{n\ge 1}\) denotes the natural filtration of Simon’s algorithm. The claimed martingale property follows, and as a consequence, there is the identity
We next have to check that the mean of the quadratic variation of the martingale \( B_j(\cdot )/ \pi (\cdot )\) satisfies
thanks to Lemma 1, the remaining assertions are then immediate.
In this direction, we first note that the terms in the sum on the lefthand side above that correspond to an innovation (i.e. \(\varepsilon _{n+1}=1\)) are zero and can thus be discarded. Let \(\varepsilon _{n+1}=0\), so that \(\pi (n+1)=\pi (n)(1+1/n)\). We then have
On the one hand, since
we deduce from (10) the bound
On the other hand, since
using again (10), we get
The proof of the statement is now complete. \(\square \)
As an immediate consequence, we point at the following handier estimate for the second moment of \(\Gamma _j\).
Corollary 1
Assume (7) and further that \(\pi (n) \asymp n^a\) for some \(a>0\). Then
Proof
On the one hand, there is the lower bound \(\mathbb {E}(\Gamma _j^2)\ge \mathbb {E}(\Gamma _j)^2\). On the other hand, our assumption also entails that for some \(b, b'>0\), we have
and we conclude with Lemma 2. \(\square \)
Yule–Simon distributions
Recall that the slow and the steady regimes have been defined by (5) and (6), respectively. Simon [24] observed that in each regime, the empirical measure of the sizes of the blocks \(B_j(n)\) converges to a deterministic distribution.
Lemma 3
(Simon [24]) Let \(\rho >0\). For \(0< \rho < 1\), consider the regime (5) of slow innovation with exponent \(\rho \), whereas for \(\rho >1\), set \(q=11/\rho \in (0,1)\) and consider the regime (6) of steady innovation with rate q. In both regimes, for every \(k\in \mathbb {N}\), we have
where \({\mathrm B}\) is the Beta function and the convergence holds in \(L^p\) for any \(p\ge 1\).
The limiting distribution in the statement is called the Yule–Simon distribution with parameter \(\rho \). Strictly speaking, Simon only established the stated converge in expectation. A classical argument of propagation of chaos yields the stronger convergence in probability; see e.g. Section 5 in [4], and since the random variables in the statement are obviously bounded by 1, convergence in \(L^p\) also holds for any \(p\ge 1\).
The next lemma will be needed to check some uniform integrability properties.
Lemma 4
Let \(0<\beta \le \rho \) and assume either (i) or (ii) is fulfilled, where:

(i)
\(\rho \in (0,1)\) and the slow regime (5) holds with exponent \(\rho \),

(ii)
\(\rho >1\) and the steady regime (6) holds with innovation rate \(q=11/\rho \).
Then
Remark 1
Since \({\mathrm B}(k, \rho +1) \sim \Gamma (\rho +1) k^{(\rho +1)}\) as \(k\rightarrow \infty \), we have that
for any \(\beta < \rho \), in agreement with Fatou’s lemma and Lemmas 3 and 4.
Proof
(i) Recall from Lemma 1 that in the slow regime, there are the bounds \(n/c\le \pi (n)\le cn\) for all \(n\in \mathbb {N}\), where \(c>1\) is some constant. Since, from Lemma 2,
we get by Jensen’s inequality that
where for the O upperbound, we used the fact that the inverse function \(\tau \) of \(\sigma \) is regularly varying with exponent \(1/\rho \) (Theorem 1.5.12 in [9]), and Proposition 1.5.8 in [9] since \(\beta /\rho >1\). On the other hand, since \(\tau \) is the rightinverse of \(\sigma \), we have \(\tau (\sigma (n))\le n \le \tau (\sigma (n)+1)\), so again by regular variation, \(\tau (\sigma (n))\sim n\). Finally
as we wanted to verify.
(ii) The proof is similar to (i), using now that there exists \(c>0\) such that
as it is readily seen from Corollary 1. \(\square \)
Strong limit theorems
In this section, we will establish two strong limit theorems for stepreinforced random walks, the first concerns slow innovation regimes, and the second steady ones.
Ballistic behavior
Theorem 1
Suppose that
for some \(\rho \in (0,1)\), and that
for some \(\beta >\rho \). Then
where \(V'\) is some nondegenerate random variable.
We will deduce Theorem 1 by specializing the following more general result.
Lemma 5
Assume (7) and set
Provided that
we have
with
Proof
Thanks to (11), the claim follows from (8) and Lemma 2 by dominated convergence. \(\square \)
Proof of Theorem 1
Recall from Lemma 1(i) that \(\pi (n) \approx n\). From Lemma 5, it thus suffices to check that
since then, the condition (11) follows.
Without loss of generality, we may assume that \(\beta <1\). Pick \(a>0\) sufficiently large so that
and
Since \(X_j\) is a copy of X which is independent of \(\Gamma _j^*\), we have
Recall from Lemma 2 that \(B_j(\cdot )/\pi (\cdot )\) is a closed martingale with terminal value \(\Gamma _j\). Then by Doob’s maximal inequality, there is some numerical constant \(c_{\beta }>0\) such that \(\mathbb {E}((\Gamma _j^*)^{\beta }))\le c_{\beta } \mathbb {E}(\Gamma _j)^{\beta }\), and hence again from Lemma 2,
Finally, since \(\tau (j)\ge (j/a)^{1/\rho }\), we conclude that
which ensures (12) since \(\beta >\rho \). \(\square \)
Super\(\alpha \)diffusive behavior
We next turn our attention to the steady regime.
Theorem 2
Suppose (6) holds with \(q<1/2\) and that
for some \(\beta >1/(1q)\). Then
where \(V'\) is some nondegenerate random variable.
The proof of Theorem 2 relies on the following martingale convergence result.
Lemma 6
Assume (7) and let \(\beta \in (1,2]\). Suppose that \(X\in L^{\beta }(\mathbb {P})\) with \(\mathbb {E}(X)=0\), and further that
The process
is then a martingale bounded in \(L^{\beta }(\mathbb {P})\); we write \(V_{\infty }\) for its terminal value. We have
Proof
The assertion that the process \(V_n\) is a martingale is straightforward since the variables \(X_j\) are i.i.d., centered, and independent of the \(\Gamma _j\). The assertion of boundedness in \(L^{\beta }(\mathbb {P})\) then follows from the assumption (13), the Burkholder–Davis–Gundy inequality, and the fact that, for any sequence \((y_j)_{j\in \mathbb {N}}\) of nonnegative real numbers, since \(\beta \le 2\),
The convergence of \(\hat{S}(n)/\pi (n)\) in \(L^{\beta }(\mathbb {P})\) is proven similarly. Specifically, we observe from (8) that
and recall that the variables \(X_j\) are independent of those appearing in Simon’s algorithm. By the Burkholder–Davis–Gundy inequality, there exists a constant \(c_{\beta }\in (0,\infty )\) such that
We know from Lemma 2 that for each \(j\ge 1\),
and further by Jensen’s inequality, that
The assumption (13) enables us to complete the proof of convergence of the sequence \((\hat{S}(n)/\pi (n))\) in \(L^{\beta }(\mathbb {P})\) by dominated convergence.
The almost sure convergence then follows from the observation that the process \(\hat{S}(n)/\pi (n)\) is a martingale (in the setting of the elephant random walk, a similar property has been pointed at in [6, 12, 13, 19]). Indeed, we see from Simon’s algorithm and the assumption \(\mathbb {E}(X)=0\) that
This immediately entails our assertion. \(\square \)
Proof of Theorem 2
Recall that we assume that \(\mathbb {E}(X^{\beta })<\infty \) for some \(\beta >1/(1q)\). Since \(q<1/2\), we can further suppose without loss of generality that \(\beta \le 2\). Then, by Jensen’s inequality, we have
and we just need to check that the righthand side is finite, as then an appeal to Lemma 6 completes the proof.
It follows from (6) and Lemma 1(ii) that
and then from Corollary 1 that \(\mathbb {E}(\Gamma _j^2)\asymp j^{2+2q}\). Since \(\beta q\beta >1\), the series \(\sum _{j\ge 1}j^{\beta +q\beta }\) converges, and the proof is finished. \(\square \)
Weak limit theorems
In this section, we will establish two weak limit theorems for stepreinforced random walks, depending on the innovation regimes.
Superballistic behavior
Theorem 3
Suppose that X belongs to the domain of normal attraction of a stable law (i.e. (1) holds) with index \(\alpha \in (0,1)\), and that (5) holds for some \(\rho \in (\alpha , 1)\). Then
where \(Y'\) is an \(\alpha \)stable random variable.
Under the assumptions of Theorem 3, the stepreinforced random walk grows roughly like \(n^{\rho /\alpha }\), and since \(1<\rho /\alpha < 1/\alpha \), its asymptotic behavior is both superballistic and sub\(\alpha \)diffusive.
Proof
Note first that, since \(\rho >\alpha \), \(n \sigma (n)^{1/\alpha }\) goes to 0 as \(n\rightarrow \infty \), and a fortiori so does \(B_j(n)\sigma (n)^{1/\alpha }\) uniformly for all \(j\in \mathbb {N}\). We fix \(\theta \in \mathbb {R}\) and get from (8) that for n sufficiently large
We focus on the sum in the righthand side, and first consider the terms with \(B_j(n) \le k\) for some fixed \(k\in \mathbb {N}\). Write
where \(N_\ell (n)=\mathrm {Card}\{ j\le \sigma (n): B_j(n)=\ell \}\). Next, recall from (4) that as \(n\rightarrow \infty \),
We now deduce from Lemma 4 that for any fixed \(k\in \mathbb {N}\), there is the convergence
for every \(p\ge 1\).
We can next complete the proof by an argument of uniform integrability. Recall that \(\varphi (\lambda )=O(\lambda ^{\alpha })\) as \(\lambda \rightarrow 0\) and pick \(\beta \in (\alpha ,\rho )\). There exists \(a>0\) such that for all n sufficiently large and all \(k\ge 1\), there is the upper bound
and the same inequality holds with \(\varphi _{\alpha }\) replacing \(\varphi \). We can then deduce from the preceding paragraph in combination with Lemma 4 that actually
It now suffices to recall that \(\mathfrak {R}\varphi \ge 0\), so by dominated convergence,
which completes the proof. \(\square \)
\(\alpha \)Diffusive behavior
Theorem 4
Suppose that X belongs to the domain of normal attraction without centering of a stable law (i.e. (1) holds) with index \(\alpha \in (0,2]\), and that (6) holds for some \(q\in (0,1)\). Suppose further that \(q>11/\alpha \) when \(\alpha >1\). Then
where \(Y'\) is an \(\alpha \)stable random variable.
The proof of Theorem 4 requires the following uniform bounds
Lemma 7
Suppose (6) holds for some \(q\in (0,1)\) and take any \(\beta \in ( 0, 1/(1q))\). Then
Proof
The claim is obvious when \(\beta <1\), so we focus on the case \(\beta \ge 1\). In this direction, recall from Lemma 2 that \(B_j(n)/\pi (n)\) is a square integrable martingale with terminal value \(\Gamma _j\). Recall also from Lemma 1(ii) and Corollary 1, that in the regime (6), \(\pi (n)\approx n^{1q}\) and \(\mathbb {E}(\Gamma _j^2)\asymp j^{2q2}\). There is thus some constant \(a>0\), such that for any \(\eta >0\) arbitrarily small, we have
Suppose first that \(q<1/2\), so \(\sum _{j\ge 1} j^{2q2}<\infty \) and therefore
Since \(1q<1/\beta \), our claim follows.
Then suppose that \(q=1/2\); using \(\sum _{j\le n} j^{1}\sim \log n\) and \(B_j(n)=0\) for \(j>n\), we get
Since \(1/\beta >1/2\), our assertion is verified.
Finally, suppose that \(q>1/2\); using \(\sum _{j\le n} j^{2q2}\approx n^{2q1}\) and \(B_j(n)=0\) for \(j>n\), we get
Since again \(1/\beta >1/2\), the proof is complete. \(\square \)
Lemma 7 enables us to duplicate the argument for the proof of Theorem 3, as the reader will readily check.
Miscellaneous remarks

Technically, the fact that the indices of the steps at which innovations occur are deterministic eases our approach by pointing right from the start at the relevant quantities. Although our statements are only given for deterministic sequences \((\varepsilon _n)\), they also apply to random sequences \((\varepsilon _n)\) independent of \((X_n)\), provided of course that we can check that the requirements hold a.s. A basic example, which has been chiefly dealt with in the literature, is when the \(\varepsilon _j\) are i.i.d. samples of the Bernoulli distribution with parameter \(q\in (0,1)\), as then (6) obviously holds a.s. Plainly independence of the \(\varepsilon _j\) is not a necessary assumption, and much less restrictive correlation structures suffice. For instance, if we merely suppose that each \(\varepsilon _j\) has the Bernoulli law with parameter \(q_j\) such that \(\sum _{n\ge 2} n^{2}\sum _{j=2}^n(q_jq) <\infty \), and that \(\mathrm {Cov}(\varepsilon _j,\varepsilon _{\ell })\le j\ell ^{a}\) for some \(a>0\), then one readily verifies that (6) is fulfilled a.s. Similar examples can be developed to get slow innovation regimes, for instance assuming that each variable \(\varepsilon _j\) has a Bernoulli law with \(q(j)\approx j^{\rho 1}\) and again a mild condition on the correlation.

Dwelling on an informal comment made in the Introduction, it may be interesting to compare the stepreinforced random walk \(\hat{S}(n)\) with its maximal step \(\hat{X}^*_n=\max _{1\le j \le n} \hat{X}_j\). Assume \(\alpha \in (0,2)\), and that \(\mathbb {P}(X>x) \approx x^{\alpha }\) (recall Sect. 2.1 about characterization of stable domaines of normal attraction). Plainly, there is the identity \(\hat{X}^*_n=X^*_{\sigma (n)}\), where \(X^*_n=\max _{1\le j \le n} X_j\), from which we deduce that \(\sigma (n)^{1/\alpha }\hat{X}^*_n\) converges in distribution as \(n\rightarrow \infty \) to some Frechet variable. Comparing with the results in Sects. 3 and 4, we now see that in the slow regime with innovation exponent \(\rho \in (0,1)\), \(\hat{S}\) grows with the same exponent as \(\hat{X}^*\) when \(\alpha >\rho \), and with a strictly larger exponent if \(\alpha <\rho \). Similarly, in the steady regime with innovation rate \(q\in (0,1)\), \(\hat{S}\) grows with the same exponent as \(\hat{X}^*\) when \(\alpha >\rho =1/(1q)\) and with a strictly larger exponent if \(\alpha <\rho \). In other words, the maximal step \(\hat{X}^*\) has a sensible impact in the strong limit theorems of Sect. 3, but its role is negligible for the weak limit theorems of Sect. 4.

We have worked in the real setting for the sake of simplicity only; the arguments work as well for random walks in \(\mathbb {R}^d\) with \(d\ge 2\). In this direction, one notably needs a multidimensional version of (4), which can be found in Section 2 of Aaronson and Denker [1]. The same sake of simplicity (possibly combined with the author’s lazyness) motivated our choice of working with domains of normal attraction rather than with domains of attraction. Most likely, dealing with this more general setting would only require very minor modifications of the present arguments and results.

It would be interesting to complete the strong limit results (Theorems 1 and 2) and investigate the fluctuations \(n^{1/\hat{\alpha }} \hat{S}(n)V'\) as \(n\rightarrow \infty \). In the setting of the elephant random walk, Kubota and Takei [19] have recently established that these fluctuations are Gaussian.

The case where the generic step X has the standard Cauchy distribution is remarkable, due to the feature that for any \(a,b>0\), \(aX_1+bX_2\) has the same distribution as \((a+b)X\), where \(X_1\) and \(X_2\) are two independent copies of X. It follows that \(n^{1}\hat{S}(n)\) has the standard Cauchy distribution for all n, independently of the choice of the sequence \((\varepsilon _n)\). This agrees of course with Theorems 1 and 4.
Notes
 1.
The precise definition of these regimes will be given later on.
References
 1.
Aaronson, J., Denker, M.: Characteristic functions of random variables attracted to \(1\)stable laws. Ann. Probab. 26(1), 399–415 (1998)
 2.
Angel, O., Crawford, N., Kozma, G.: Localization for linearly edge reinforced random walks. Duke Math. J. 163(5), 889–921 (2014)
 3.
Baur, E.: On a class of random walks with reinforced memory. J. Stat. Phys. (2020). https://doi.org/10.1007/s10955020026023
 4.
Baur, E., Bertoin, J.: On a twoparameter Yule–Simon distribution. arXiv:2001.01486, to appear in Progress in Probability: Festschrift R.A. Doney; Birkhäuser
 5.
Baur, E., Bertoin, J.: Elephant random walks and their connection to Pólyatype urns. Phys. Rev. E 94, 052134 (2016)
 6.
Bercu, B.: A martingale approach for the elephant random walk. J. Phys. A 51(1), 015201 (2018)
 7.
Bertoin, J.: Noise reinforcement for Lévy processes. Ann. Inst. H. Poincaré Probab. Stat. 56(3), 2236–2252 (2020)
 8.
Bertoin, J.: Universality of noise reinforced Brownian motions. In: In and Out of Equilibrium 3, celebrating Vladas Sidoravicius (to appear), Progress in Probability, Birkhäuser. arXiv:2002.09166
 9.
Bingham, N.H., Goldie, C.M., Teugels, J.L.: Regular Variation. Encyclopedia of Mathematics and Its Applications, vol. 27. Cambridge University Press, Cambridge (1987)
 10.
Bornholdt, S., Ebel, H.: World Wide Web scaling exponent from Simon’s 1955 model. Phys. Rev. E 64, 035104 (2001)
 11.
Businger, S.: The shark random swim (Lévy flight with memory). J. Stat. Phys. 172(3), 701–717 (2018)
 12.
Coletti, C.F., Gava, R., Schütz, G.M.: Central limit theorem and related results for the elephant random walk. J. Math. Phys. 58(5), 053303 (2017)
 13.
Coletti, C.F., Gava, R., Schütz, G.M.: A strong invariance principle for the elephant random walk. J. Stat. Mech. Theory Exp. 12, 123207 (2017)
 14.
Cotar, C., Thacker, D.: Edge and vertexreinforced random walks with superlinear reinforcement on infinite graphs. Ann. Probab. 45(4), 2655–2706 (2017)
 15.
Disertori, M., Sabot, C., Tarrès, P.: Transience of edgereinforced random walk. Commun. Math. Phys. 339(1), 121–148 (2015)
 16.
Gnedenko, B.V., Kolmogorov, A.N.: Limit distributions for sums of independent random variables. Translated from the Russian, annotated, and revised by K. L. Chung. With appendices by J. L. Doob and P. L. Hsu. Revised edition. AddisonWesley Publishing Co., Reading, Mass.LondonDon Mills., Ont. (1968)
 17.
Ibragimov, I.A., Linnik, Y.V.: Independent and Stationary Sequences of Random Variables. WoltersNoordhoff Publishing, Groningen (1971). With a supplementary chapter by I. A. Ibragimov and V. V. Petrov, Translation from the Russian edited by J. F. C. Kingman
 18.
Kious, D., Sidoravicius, V.: Phase transition for the oncereinforced random walk on \(\mathbb{Z}^{d}\)like trees. Ann. Probab. 46(4), 2121–2133 (2018)
 19.
Kubota, N., Takei, M.: Gaussian fluctuation for superdiffusive elephant random walks. J. Stat. Phys. 177(6), 1157–1171 (2019)
 20.
Kürsten, R.: Random recursive trees and the elephant random walk. Phys. Rev. E 93(3), 032111 (2016)
 21.
Pemantle, R.: A survey of random processes with reinforcement. Probab. Surv. 4, 1–79 (2007)
 22.
Sabot, C., Tarrès, P.: Edgereinforced random walk, vertexreinforced jump process and the supersymmetric hyperbolic sigma model. JEMS 17(9), 2353–2378 (2015)
 23.
Schütz, G.M., Trimper, S.: Elephants can always remember: Exact longrange memory effects in a nonMarkovian random walk. Phys. Rev. E 70, 045101 (2004)
 24.
Simon, H.A.: On a class of skew distribution functions. Biometrika 42(3/4), 425–440 (1955)
Funding
Open access funding provided by University of Zurich.
Author information
Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bertoin, J. Scaling exponents of stepreinforced random walks. Probab. Theory Relat. Fields 179, 295–315 (2021). https://doi.org/10.1007/s00440020010082
Received:
Revised:
Accepted:
Published:
Issue Date:
Keywords
 Reinforcement
 Random walk
 Scaling exponent
 Heavy tail distribution
Mathematics Subject Classification
 60G50
 60G51
 60K35