Abstract
In this paper we consider a stochastic system with two connected nodes, whose unidirectional connection is variable and depends on point processes associated to each node. The input node is represented by an homogeneous Poisson process, whereas the output node jumps with an intensity that depends on the jumps of the input nodes and the connection intensity. We study a scaling regime when the rate of both point processes is large compared to the dynamics of the connection. In neuroscience, this system corresponds to a neural network composed by two neurons, connected by a single synapse. The strength of this synapse depends on the past activity of both neurons, the notion of synaptic plasticity refers to the associated mechanism. A general class of such stochastic models has been introduced in Robert and Vignoud (Stochastic models of synaptic plasticity in neural networks, 2020, arxiv: 2010.08195) to describe most of the models of long-term synaptic plasticity investigated in the literature. The scaling regime corresponds to a classical assumption in computational neuroscience that cellular processes evolve much more rapidly than the synaptic strength. The central result of the paper is an averaging principle for the time evolution of the connection intensity. Mathematically, the key variable is the point process, associated to the output node, whose intensity depends on the past activity of the system. The proof of the result involves a detailed analysis of several of its unbounded additive functionals in the slow-fast limit, and technical results on interacting shot-noise processes.
Similar content being viewed by others
References
Asmussen, S.: Applied Probability and Queues: Stochastic Modelling and Applied Probability, 2nd edn. Springer, New York (2003)
Beznea, L., Cîmpean, I., Röckner, M.: A new approach to the existence of invariant measures for Markovian semigroups. Ann. Inst. Henri Poincaré Probab. Stat. 55(2), 977–1000 (2019)
Billingsley, P.: Convergence of Probability Measures. Wiley, New York (1999)
Cáceres, M.J., Carrillo, J.A., Perthame, B.: Analysis of nonlinear noisy integrate & fire neuron models: blow-up and steady states. J. Math. Neurosci. 1(1), 7 (2011)
Crane, R., Sornette, D.: Robust dynamic classes revealed by measuring the response function of a social system. Proc. Natl. Acad. Sci. 105(41), 15649–15653 (2008)
Embrechts, P., Liniger, T., Lin, L.: Multivariate Hawkes processes: an application to financial data. J. Appl. Probab. 48(A), 367–378 (2011)
Ethier, S.N., Kurtz, T.G.: Markov Processes: Characterization and Convergence. Wiley, New York (2009)
Freidlin, M.I., Wentzell, A.D.: Random Perturbations of Dynamical Systems, 2nd edn. Springer, New York (1998)
Gilbert, E.N., Pollak, H.O.: Amplitude distribution of shot noise. Bell Syst. Tech. J. 39(2), 333–350 (1960)
Gust, G., Schbath, S.: FADO: a statistical method to detect favored or avoided distances between occurrences of motifs using the Hawkes’ model. In: Statistical Applications in Genetics and Molecular Biology 4 (2005)
Has’minskiĭ, R.Z.: Stochastic stability of differential equations. Alphen aan den Rijn: Sijthoff & Noordhoff, pp. xvi+344 (1980)
Hawkes, A.G.: Spectra of some self-exciting and mutually exciting point processes. Biometrika 58(1), 83–90 (1971)
Hawkes, A.G., Oakes, D.: A cluster process representation of a self-exciting process. J. Appl. Probab. 11(3), 493–503 (1974)
Helson, P.: A new stochastic STDP rule in a neural network model (2018). arXiv:1706.00364 [math]
Hirsch, M.W., Smale, S.: Differential equations, dynamical systems, and linear algebra. Academic Press [A subsidiary of Harcourt Brace Jovanovich, Publishers], New York, pp. xi+358 (1974)
Jacod, J., Shiryaev, A.N.: Limit Theorems for Stochastic Processes. Grundlehren der Mathematischen Wissenschaften, vol. 288, p. xviii+601. Springer, Berlin (1987)
Karatzas, I., Shreve, S.: Brownian Motion and Stochastic Calculus. Graduate Texts in Mathematics, vol. 113, 2nd edn. Springer, New York (1998)
Kempter, R., Gerstner, W., van Hemmen, J.L.: Hebbian learning and spiking neurons. Phys. Rev. E 59(4), 4498–4514 (1999)
Kerstan, J.: Teilprozesse Poissonscher Prozesse. In: Trans. Third Prague Conf. Information Theory, Statist. Decision Functions, Random Processes (Liblice, 1962). Publ. House Czech. Acad. Sci., Prague, pp. 377–403 (1964)
Kingman, J.F.C.: Poisson Processes. Clarendon Press, Oxford (1992)
Kurtz, T.G.: Averaging for martingale problems and stochastic approximation. In: Karatzas, I., Ocone, D. (eds.) Applied Stochastic Analysis, vol. 177, pp. 186–209. Springer, Berlin (1992)
Meyn, S., Tweedie, R.: Markov Chains and Stochastic Stability. Communications and Control Engineering Series. Springer, New York (1993)
Nummelin, E.: General Irreducible Markov Chains and Non-Negative Operators. Cambridge University Press, Cambridge (2004)
Papanicolalou, G., Stroock, D.W., Varadhan, S.R.S.: Martingale approach to some limit theorems. In: Proc. 1976. Duke Conf. On Turbulence. III. Duke University Math (1977)
Reynaud-Bouret, P., Schbath, S.: Adaptive estimation for Hawkes processes; application to genome analysis. Ann. Stat. 38(5), 2781–2822 (2010)
Rice, S.O.: Mathematical analysis of random noise. Bell Syst. Tech. J. 23(3), 282–332 (1944)
Robert, P.: Stochastic Networks and Queues. Stochastic Modelling and Applied Probability. Springer, Berlin (2003)
Robert, P., Vignoud, G.: Stochastic models of synaptic plasticity in neural networks (2020). arXiv:2010.08195
Rogers, L.C.G., Williams, D.: Diffusions, Markov Processes and Martingales: Itô Calculus, vol. 2. Cambridge University Press, Cambridge (2000)
Schottky, W.: Über spontane Stromschwankungen in verschiedenen Elektrizitätsleitern. Annalen der physik 362(23), 541–567 (1918)
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Pablo A Ferrari.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supported by PhD grant of École Normale Supérieure, ENS-PSL.
Appendices
Appendix A: Proofs of Technical Results for Occupation Times
1.1 Proof of Lemma 15
Denote, for \(t{\ge }0\), a, b, \(c{\in }{\mathbb {R}}_+\), and \(\varepsilon {>}0\),
Let G be a continuous bounded Borelian function. From Proposition 12, we can extract a sub-sequence \((\varepsilon _n)\) such that, for the convergence in distribution
where (L(t)) is a continuous càdlàgprocess.
We will now prove that the process (L(t)) is such that,
holds almost-surely for \(t{\in }[0,T]\).
For \(A{>}0\), the convergence of \((\overline{\nu }^K_{\varepsilon _n})\) to \(\overline{\nu }^K\) gives the convergence in distribution,
Using again the upper-bound, Relation (29), for \((\overline{X}^K_\varepsilon (t))\) with Relation (23), and Proposition 24 for \(R_{\varepsilon }\), we obtain that
hence, for \(\eta {>}0\),
Since G is bounded, with the elementary relation \(x{=}x{\wedge }A{+}(x{-}A)^+\), \(x{\ge }0\), then, for \(n{\ge }1\),
For any \(A{>}0\) and \(n{\ge }1\), Cauchy-Schwartz’s inequality gives the relation
With Relation (38) and the fact that the left-hand side of (38) has a bounded second moment, by letting n go to infinity, we get the inequality
By letting A go to infinity, the monotone convergence theorem shows that
With Relation (39) and the integrability properties proven just above, we obtain that, for \(\varepsilon {>}0\), there exists \(n_0\) such that if \(n{\ge }n_0\), then the relation
holds. Lemma 15 is proved.
1.2 Proof of Lemma 16
Following Papanicolalou et al. [24] and Kurtz [21], we first show that there exists an optional process \((\overline{\Gamma }^{K}_s)\), with values in the set of probability distributions on \({\mathbb {R}}_+^2\) such that, almost surely, for any bounded Borelian function G on \({\mathbb {R}}_+{\times }{\mathbb {R}}_+^2\),
Recall that the optional \(\sigma \)-algebra is the smallest \(\sigma \)-algebra containing adapted càdlàgprocesses. See Sect. VI.4 of Rogers and Williams [29] for example. This is a simple consequence of Lemma 1.4 of Kurtz [21] and the fact that, due to Relation (19), the measure \(\overline{\nu }^{K}({\mathrm {d}}s,{\mathbb {R}}^2)\) is the Lebesgue measure on [0, T].
Let \(f{\in }{\mathcal {C}}_b^1({\mathbb {R}}_+^2)\) be a bounded \({\mathcal {C}}^1\)-function on \({\mathbb {R}}_+^2\) with bounded partial derivatives, we have the relation
where, for \(t{\ge }0\), if \((\overline{V}^K_\varepsilon (s)){{\mathop {=}\limits ^{{\mathrm{def.}}}}}(\overline{X}^{K}_\varepsilon (s),\overline{Z}^{K}_\varepsilon (s))\),
Proposition 12 shows that the martingale \((\varepsilon \overline{M}_{\varepsilon }^f(t))\) is converging in distribution to 0 as \(\varepsilon \) goes to 0.
Relation (41) gives therefore the convergence in distribution
The convergence in distribution of \((\overline{\Omega }^{K}_{\varepsilon _n}(t), \overline{W}^{K}_{\varepsilon _n}(t), \overline{\nu }_{\varepsilon _n}^{K})\), Proposition 15 and Relation (40) give that, for any f in \({\mathcal {C}}_b^1({\mathbb {R}}_+^2)\), the relation
holds with probability 1.
Let \((f_n)\) be a dense countable sequence in \({\mathcal {C}}_b^1({\mathbb {R}}_+^2)\) and \({\mathcal {E}}_1\) be the event, where Relation (42) holds for all \(f{=}f_n\), \(n{\ge }1\). Note that that \({\mathbb {P}}\left( {\mathcal {E}}_1\right) {=}1\). On \({\mathcal {E}}_1\), there exists a (random) subset \(S_1\) of [0, T] with Lebesgue measure T such that
and, consequently,
By Proposition 25, for \(s{\in }S_1\), the probability distribution \(\overline{\Gamma }_s^{K}\) is the invariant distribution \(\Pi _{K{\wedge }\overline{w}^{K}(s)}\). Lemma 16 is proved.
Appendix B: Shot-Noise Processes
This section presents several technical results on shot-noise processes which are crucial for the proof of Theorem 4. See Schottky [30], Rice [26] and Gilbert and Pollak [9] for an introduction.
1.1 A Scaled Shot-Noise Process
Recall that \((S^x_{\varepsilon }(t))\), with initial point \(x{\ge }0\), has been introduced by Definition 6. We will have the following conventions,
The process (S(t)) is in fact the standard shot-noise process of Lemma 5 associated to the Poisson process \({\mathcal {N}}_\lambda \), for \(t{\ge }0\),
In particular (S(t)) is a stochastically non-decreasing process, i.e. for \(y{\ge }0\) and \(s{\le }t\),
A classical formula for Poisson processes, see Proposition 1.5 of Robert [27] for example, gives the relation, for \(\xi {\in }{\mathbb {R}}\),
in particular \({\mathbb {E}}\left[ S^x(t)\right] {=}x\exp (-t){+}\lambda \left( 1{-}\exp ({-} t)\right) \). It also shows that \((S^x(t))\) is converging in distribution to \(S(\infty )\) such that,
It is easily seen that \((S^x_{\varepsilon }(t)){{\mathop {=}\limits ^{{\mathrm{dist.}}}}} (S^x(t/\varepsilon ))\)and thus with Relation (22), \((S^x_{\varepsilon }(t))\) can be represented as, for \(t{\ge }0\),
We remind here the results of Proposition 7, that will be proved in the following paragraph. For \(\xi {\in }{\mathbb {R}}\) and \(x{\ge }0\), the convergence in distribution of the processes
holds, and
Proof of Proposition 7
Let \(T_1\) and \(T_2\) be two stopping times bounded by N, \(\theta {>}0\), and verifying \(0{\le }T_2{-}T_1{\le }\theta \). Using Relation (46) and the strong Markov property of Poisson processes, we have that
holds, by stochastic monotonicity of (S(t)) of Relation (44).
Aldous’ Criterion, see Theorem VI.4.5 of Jacod and Shiryaev [16] gives that the family of processes
is tight when \(\varepsilon \) goes to 0. For \(p{\ge }1\) and a fixed vector \((t_i){\in }{\mathbb {R}}_+^p\), the ergodic theorem for the Markov process (S(t)) gives the almost-sure convergence of
Hence, due to the tightness property, the convergence also holds in distribution for the processes
The last part is a direct consequence of the identity \((S^x_{\varepsilon }(t)){{\mathop {=}\limits ^{{\mathrm{dist.}}}}} (S^x(t/\varepsilon ))\), and of Relation (45), which gives
The proposition is proved. \(\square \)
1.2 Interacting Shot-Noise Processes
Recall that \((R_{\varepsilon }(t))\) defined by Relation (24) is a shot-noise process with intensity equal to the shot-noise process \((S_{\varepsilon }(t))\).
We start with a simple result on moments of functionals of Poisson processes.
Lemma 23
If \({\mathcal {Q}}\) is a Poisson point process on \({\mathbb {R}}_+\) with a positive Radon intensity measure \(\mu \) and f is a Borelian function such that
then
Proof
It is enough to prove the inequality for non-negative bounded Borelian functions f with compact support on \({\mathbb {R}}_+\).
The formula for the Laplace transform of Poisson point processes, see Proposition 1.5 of Robert [27], gives for \(\xi {\ge }0\),
The proof is done in a straightforward way by differentiating the last identity with respect to \(\xi \) four times and then set \(\xi {=}0\). \(\square \)
Proposition 24
The inequality
holds.
Proof
Denote, for \(t{\ge }0\),
the identity \((S_{\varepsilon }(t)){{\mathop {=}\limits ^{{\mathrm{dist.}}}}} (S(t/\varepsilon ))\) and Relation (43) coupled with Fubini’s Theorem give the relations
with
Relation (22) applied to \(R_\varepsilon (t)\) gives
The quantity \(S_{\varepsilon }(u)\) is a functional of the point process \({\mathcal {P}}_1\) and is therefore independent of the Poisson point process \({\mathcal {P}}_2\). Lemma 23 gives therefore that
hence,
Again with Proposition 7 we obtain that, for \(k{\ge }1\), the variable \(\overline{J}_{k}\) has finite moments of all orders, therefore by Cauchy-Shwartz’ Inequality, the right-hand side of the last inequality is finite. The proposition is proved. \(\square \)
Appendix C: Equilibrium of Fast Processes
For \(w{\in }K_W\), recall that the Markov process \((X^w(t),Z^w(t))\) of Definition 2 is such that
Proposition 25
Under the conditions of Sections 2.2.2 and 2.2.3 , the Markov process \((X^w(t),Z^w(t))\) solution of the SDEs (47) and (48) has a unique invariant distribution \(\Pi _w\), i.e. the unique probability distribution \(\mu \) on \({\mathbb {R}}{\times }{\mathbb {R}}_+^\ell \) such that
for any \(f{\in }{\mathcal {C}}_b^1({\mathbb {R}}{\times }{\mathbb {R}}_+^\ell )\), where \(B^F_w\) is the operator defined by Relation (18).
Proof
We denote by \((X^w_n,Z^w_n)\) the embedded Markov chain of the Markov process \((X^w(t),Z^w(t))\), i.e. the sequence of states visited by \((X^w(t),Z^w(t))\) after each jump, of either \({\mathcal {N}}_{\lambda }\) or \({\mathcal {N}}_{\beta ,X}\).
The proof of the proposition is done in three steps. We first show that the return time of \((X^w(t),Z^w(t))\) to a compact set of \({\mathbb {R}}{\times }{\mathbb {R}}_+^\ell \) is integrable. Then we prove that the Markov chain \((X^w_n,Z^w_n)\) is Harris ergodic, and consequently that it has a unique invariant measure. For a general introduction on Harris Markov chains, see Nummelin [23] and Meyn and Tweedie [22]. Finally, the proof of the proposition uses the classical framework of stationary point processes.
1.1 Integrability of Return Times to a Compact Subset
Suppose that \(w{\ge }0\). The conditions of Sect. 2.2.2 on the functions \(\beta \) and g, and Relation (47) show that \(X^w(t){\ge }{-}c_0\), for all \(t{\ge }0\), if \(X^w(0){\ge }{-}c_0\), with \(c_0{=}c_\beta {+}c_g\). The state space of the Markov process \((X^w(t),Z^w(t))\) can be taken as \({\mathcal {S}}{{\mathop {=}\limits ^{{\mathrm{def.}}}}}[{-}c_0,{+}\infty ){\times }{\mathbb {R}}_+^\ell \).
Define, for \((x,z){\in }{\mathcal {S}}\) and \(0{<}a{\le }1\),
we get that
hence, with the assumptions of Sect. 2.2.3 and 2.2.2 on the function \(k_.\) and \(\beta \), and \(a\le 1\),
where \({\underline{\gamma }}{>}0\) is the minimum of the coordinates of \(\gamma \) and C is a constant independent of x, z and a. We fix \(0{<}a{\le }1\) sufficiently small so that \(\ell aC_\beta C_k{<}1\) and \(K{>}c_0\) such that
If \(H(x,z){>}K\) then \(\max (x,a\Vert z\Vert ){>}K/2\) and therefore \(B_w^F(H)(x,z){\le }{-}1\), H is therefore a Lyapounov function for \(B^F_w\). One deduces that the same result holds for the return time of Markov chain, \((X^w_n,Z^w_n)\) in the set \(I_K=\{(x,z):H(x,z){\le }K\}\).
1.2 Harris Ergodicity of \((X^w_n,Z^w_n)\)
Proposition 5.10 of Nummelin [23] is used to show that \(I_K\) is a recurrent set. A regeneration property would be sufficient to conclude. In particular, we can prove that \(I_K\) is a small set, that is, there exists some positive, non-trivial, Radon measure \(\nu \) on \({\mathcal {S}}\) such that,
for any Borelian subset S of \({\mathcal {S}}\) and all \((x_0, z_0){\in }I_K\).
We denote by \(s_1\), resp. \(t_1\), the first instant of \({\mathcal {N}}_\lambda \), resp. of \({\mathcal {N}}_{\beta ,X^w}\), then, for \((X_0^w,Z_0^w){=}(x_0, z_0){\in }I_K\), by using the deterministic differential equations between jumps, we get
since \(\beta \) is bounded by some constant \(c_\beta ^1\) on the interval \([{-}c_0,K]\).
In the following argument, we restrict X to be non-negative, the extension to \([{-}c_0,+\infty ]\) is straightforward. For \({\mathcal {A}}{=}[0,A]{\in }{\mathcal {B}}({\mathbb {R}}_+)\) and \({\mathcal {B}}{=}[0,B]{\in }{\mathcal {B}}({\mathbb {R}}_+^\ell )\), from Equations (47) and (48), we obtain the relation
where \(H(z){=} z{+}k_1(z)\), \({\bar{s}}_1{{\mathop {=}\limits ^{{\mathrm{dist.}}}}}(s_1{\mid }s_1{\le }t_1)\). By using the fact that \(k_1\) is in \({\mathcal {C}}_b^1({\mathbb {R}}_+^\ell ,{\mathbb {R}}_+^\ell )\) by the conditions of Sect. 2.2.3 and in the same way as Example of Sect. 4.3.3 page 98 of Meyn and Tweedie [22], we can prove that the random variable
has a density, uniformly bounded below by a positive function h on \({\mathbb {R}}_+{\times }{\mathbb {R}}_+^{\ell }\), so that
for all \((x_0,z_0){\in }I_K)\). This relation is then extended to all Borelian subsets S of \({\mathcal {S}}\), so that Relation (50) holds. Proposition 5.10 of Nummelin [23] gives therefore that \((X^w_n,Z^w_n)\) is Harris ergodic.
If \(w{<}0\), the last two steps can be done in a similar way. In this case, the process \((-X^w(t))\) satisfies an analogous equation with the difference that the process \({\mathcal {N}}_{\beta ,X^w}\) does not jump when \({-}X^w(t){>}c_\beta \) since \(\beta (x){=}0\) for \(x{\le }{-}c_\beta \).
1.3 Characterization of \(\Pi _w\)
Let \({\widehat{\Pi }}_w\) be the invariant probability distribution of \((X^w_n,Z^w_n)\). With the above notations,
the probability defined by the classical cycle formula,
for any bounded Borelian function on \({\mathbb {R}}{\times }{\mathbb {R}}_+^\ell \) is an invariant distribution for the process \((X^w(t),Z^w(t))\).
Proposition 9.2 of Ethier and Kurtz [7] shows that any distribution is invariant for \((X^w(t),Z^w(t))\) if and only if it satisfies Relation (49). It remains to prove the uniqueness of the invariant distribution, using the fact that the embedded Markov chain has a unique invariant distribution.
Although this is a natural result, we have not been able to find a reference in the literature. Most results are stated for discrete time, the continuous time is usually treated by looking at the process on a “discrete skeleton”, i.e. at instants multiple of some positive constant. See Proposition 3.8 of Asmussen [1] for example. As this technique is not adapted to our system, we derive a different proof using the Palm measure of the associated stationary point process.
If \(\mu \) is some invariant distribution of the Markov process \((X^w(t),Z^w(t))\), we build a stationary version \(((X^w(t),Z^w(t)), t{\in }{\mathbb {R}})\) of it on the whole real line. In particular, we have that \((X^w(t),Z^w(t)){{\mathop {=}\limits ^{{\mathrm{dist.}}}}}\mu \), for all \(t{\in }{\mathbb {R}}\).
We denote by \((S_n,n{\in }{\mathbb {Z}})\) the non-decreasing sequence of the jumps (due to \({\mathcal {N}}_\lambda \) and \({\mathcal {N}}_{\beta ,X^w}\)), with the convention \(S_0{\le }0{<}S_1\) The sequence \(((X^w(S_n),Z^w(S_n)), n{\ge }0)\) has the same distribution as the process \(((X^w_n,Z^w_n), n{\ge }0)\), the Markov chain with initial state \((X^w(S_0),Z^w(S_0))\). Since, for any \(t{\in }{\mathbb {R}}\),
the marked point process \({\mathcal {T}}{{\mathop {=}\limits ^{{\mathrm{def.}}}}}\left( S_n,(X^w(S_n),Z^w(S_n)\right) , n{\in }{\mathbb {Z}})\) is a stationary point process, i.e.
The Palm measure of \({\mathcal {T}}\) is a probability distribution \({\widehat{Q}}\) such that the sequence \(((S_n{-}S_{n-1},X^w(S_n),Z^w(S_n), n{\in }{\mathbb {Z}})\) is stationary. See Chapter 11 of Robert [27] for a quick presentation of stationary point processes and Palm measures.
Under \({\widehat{Q}}\), the Markov chain \(((X^w(S_n),Z^w(S_n)), n{\ge }0)\) is at equilibrium. Using Harris ergodicity, we have proved in the previous section that the Markov chain \(((X^w_n,Z^w_n), n{\ge }0)\) has a unique invariant measure. Considering that both sequences \(((X^w_n,Z^w_n), n{\ge }0)\) and \(((X^w(S_n),Z^w(S_n)), n{\ge }0)\) have the same distribution, we have that \({\widehat{Q}}\left( {\mathbb {R}}_+^{{\mathbb {Z}}}, \cdot \right) \) is uniquely determined.
Moreover, remembering that,
We have that \({\widehat{Q}}\) is entirely determined by the ergodic distribution of the embedded Markov chain and consequently that the Palm measure \({\widehat{Q}}\) is unique. By Proposition 11.5 of Robert [27], the distribution of \({\mathcal {T}}\) is expressed with \({\widehat{Q}}\).
We have, for every bounded function f,
which uniquely determines the invariant distribution \(\mu \).
The proposition is proved. \(\square \)
Appendix D: Averaging Principles for Discrete Models of Plasticity
In this section, we present a general discrete model of plasticity, state the associated averaging principle theorem and give a sketch of its proof. We will only point out the differences with the proof of the main result of this paper, Theorem 4.
For this model of plasticity, the membrane potential X, the plasticity processes Z and the synaptic weight W are integer-valued variables. This system is illustrated in Sect. 7 of Robert and Vignoud [28] for calcium-based models. It amounts to represent these three quantities X, Z and W as multiple of a “quantum” instead of a continuous variable. The leaking mechanism in particular, the term corresponding to \({-}\gamma Y(t){\mathrm {d}}t\) in the continuous model, \(Y{\in }\{X,Z,W\}\) and \(\gamma {>}0\), in the SDEs, is represented by the fact that each quantum leaves the system at a fixed rate \(\gamma \).
The main advantage of this model is that simple analytical expressions of the invariant distribution are available.
Definition 26
The SDEs for the discrete model are
where \(\beta ,\gamma ,\mu \) are non-negative real numbers, \(B_1\), \(B_2{\in }{\mathbb {N}}^{\ell }\) and, for \(a{\in }\{p,d\}\), \(A_a{\in }{\mathbb {N}}\). The functions \(n_{a,i}\) are assumed to be bounded by \(C_n\).
For \(a{\in }\{p,d\}\), the function I of \({\mathcal {N}}_{I,G}\) for \(G{\in }\{X,\beta X, \gamma Z, \mu W, \Omega _{p}, \Omega _{d}\}\), defined by relation (6), is the identity function \(I(x){=}x\), \(x{\in }{\mathbb {R}}\) and \({\mathcal {N}}_\lambda \) is a Poisson process on \({\mathbb {R}}_+\) with rate \(\lambda \). All associated Poisson processes are assumed to be independent.
Definition 27
For a fixed w, the process of the fast variables \((X^w(t),Z^w(t))\) on \({\mathbb {N}}{\times }{\mathbb {N}}^{\ell }\) of the SDEs is the Markov process whose transition rates are given by, for \((x,z){\in }{\mathbb {N}}{\times }{\mathbb {N}}^{\ell }\),
Theorem 28
(Averaging Principle for a Discrete Model) If the assumptions of Definition 26 are verified, the family of scaled processes \((W_{\varepsilon }(t))\) associated to Relations (51) is converging in distribution, as \(\varepsilon \) goes to 0, to the càdlàginteger-valued process (w(t)) satisfying the ODE
and, for \(a{\in }\{p,d\}\),
where \(\Pi _{w}\) is the invariant distribution of the Markov process of Definition 27.
Proof
Again, we have to show that, on a fixed finite interval, the process (W(t)) is bounded with high probability. A coupled process that stochastically bounds from above the discrete process is also defined.
Definition 29
The process \((\overline{X}(t), \overline{Z}(t), \overline{\Omega }(t), \overline{W}(t) )\) satisfies the following SDEs
where \(B_1\), \(B_2{\in }{\mathbb {N}}^{\ell }\) and, for \(a{\in }\{p,d\}\), \(A_p{\in }{\mathbb {N}}\).
It is not difficult to prove that this process is indeed a coupling that verifies the relation \(W(t){\le }\overline{W}(t)\), for all \(t{\ge }0\) and that the process \((\overline{W}(t))\) is non-decreasing.
From the SDEs governing the scaled version of the coupled system, we obtain
for all \(t{\le }T\), for some constant \(D{\ge }0\). Gronwall’s Lemma gives a uniform bound, with respect to \(\varepsilon \),
Using Markov inequality, we have then that, for any \(\eta {>}0\), the existence of \(K_0\) and \(n_0\) such that \(n{\ge }n_0\), the inequality
holds. We can then finish the proof in the same way as in Sect. 7.2. The tightness property of the family of càdlàgprocesses \((\overline{W}_{\varepsilon }(t))\), \(\varepsilon {\in }(0,1)\) are proved with Aldous’ criterion, see Theorem VI.4.5 of Jacod and Shiryaev [16].
We have to prove the uniqueness of the solution of Relation (52) and the convergence in distribution of the scaled process to the process w(t). For this, we need to have some Lipschitz property on the limiting system, and finite first moments for the invariant distribution of \((X^w(t),Z^w(t))\). This is proved in Sect. 7 of Robert and Vignoud [28] for the case where Z is a one-dimensional process, the extension to multi-dimensional Z is straightforward. \(\square \)
Rights and permissions
About this article
Cite this article
Robert, P., Vignoud, G. Averaging Principles for Markovian Models of Plasticity. J Stat Phys 183, 47 (2021). https://doi.org/10.1007/s10955-021-02785-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10955-021-02785-3