Abstract
Branching processes form an important family of stochastic processes that have been successfully applied in many fields. In this paper, we focus our attention on controlled multi-type branching processes (CMBPs). A Feller-type diffusion approximation is derived for some critical CMBPs. Namely, we consider a sequence of appropriately scaled random step functions formed from a critical CMBP with control distributions having expectations that satisfy a kind of linearity assumption. It is proved that such a sequence converges weakly toward a squared Bessel process supported by a ray determined by an eigenvector of a matrix related to the offspring mean matrix and the control distributions of the branching process in question. As applications, among others, we derive Feller-type diffusion approximations of critical, primitive multi-type branching processes with immigration and some two-sex branching processes. We also describe the asymptotic behaviour of the relative frequencies of distinct types of individuals for critical CMBPs.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Branching processes can be well-applied to describe evolutionary systems, where elements reproduce according to certain probability laws. These processes are commonly used in population dynamics, and, in this context, systems are referred to as populations and elements as individuals or cells. In this framework, multi-type branching processes, that are appropriate for modelling the evolution of populations in which different types of individuals coexist, have been successfully applied. For instance, these processes are used for modelling polymerase chain reaction (see, e.g., Sagitov and Ståhlberg [30]), for modelling cell proliferation kinetics (see, e.g., González et al. [15], Kimmel and Axelrod [26, Chapter 5] or Yanev [37]) or for studying extinction of outbreak of diseases (see, e.g., Mwasunda et al. [27]). We will focus our attention on the class of controlled multi-type branching processes (CMBPs). This class was first introduced in González et al. [14]. The key feature of CMBPs is that the number of parents of each type at a given generation is determined by a random control mechanism that depends on the number of individuals of different types in the previous generation.
CMBPs form a wide family of branching processes that include, as particular cases, the well-known classical branching processes, namely, multi-type branching processes without or with immigration (MBPs or MBPIs), or two-sex branching processes (see details in Example 2.1). Although, for all these processes, individuals give rise to offspring independently of each others, in general, a CMBP no longer satisfies the additive property (see, e.g., Athreya and Ney [4, Chapter 1, page 3]). This is an important property that is satisfied, for instance, by MBPs and MBPIs. The lack of the additive property for a general CMBP makes it difficult to study this kind of processes and it comes from the following fact. Provided that we know the number of different types of individuals in a generation, in case of a general CMBP, the number of different types of individuals in the next generation is a random sum of some independent random variables, while, in case of a MBP or a MBPI, it is a non-random sum (in the sense that the number of summands is deterministic).
This paper aims to obtain a Feller-type diffusion approximation for some critical CMBPs (details of the model are given in Sect. 2) and to study the asymptotic behaviour of the relative frequencies of distinct types of individuals. The problems under consideration have a dual motivation. On the one hand, it has interest in itself from a theoretical point of view. Our theoretical results (see Theorem 3.3 and its corollaries) may allow us to study relevant applied problems as well. For example, our result on the asymptotic behaviour of relative frequencies may find applications in cell kinetics. On the other hand, the obtained scaling limit theorem can enable us to carry out further research on statistical inference. For example, one may complete the study on weighted conditional least square estimators for some critical CMBPs, started in González et al. [11], or one might start to investigate more general quasi-likelihood estimators. This intended research could contribute to the modeling of real data using CMBPs.
From a mathematical point of view, achieving functional limit theorems for critical branching processes has attracted the interest of many researchers since 1951, when Feller [10] was able to provide the first formulation for Galton–Watson processes. He proved that the sequence of appropriately scaled critical Galton–Watson processes converges in distribution to a non-negative diffusion process without drift (for a detailed proof based on infinitesimal generators, see also Ethier and Kurtz [9, Theorem 9.1.3]). The extension to branching processes with immigration started with the pioneering paper of Wei and Winnicki [33], where the limit process is a squared Bessel process that can be characterized as the pathwise unique strong solution of a certain stochastic differential equation (SDE). Focusing on controlled branching models, a further extension to the critical, single-type case was first carried out by Sriram et al. [32], and, more recently, by González et al. [13]. Furthermore, for some critical CMBPs, under quite involved technical conditions, González et al. [16, Corollary 4.1] proved a conditional weak limit theorem for the one-dimensional distributions provided that the explosion set has a positive probability. However, according to our knowledge, functional limit theorems for critical CMBPs are not available in the literature. Our present paper fills this gap, it is a natural extension of the result for critical single-type controlled branching processes (CBPs) in González et al. [13] to the multi-type case.
Assuming that the expectations of the control distributions satisfy a linear relationship with the population size additively perturbed by a function also depending on the population size (see (2.10)), we introduce a classification for such CMBPs based on the spectral radius of a matrix related to the offspring mean matrix and to the control distributions. Under some additional hypotheses, we prove that a suitably scaled and normalized critical CMBP converges weakly toward a squared Bessel process supported by a ray determined by an eigenvector of the aforementioned matrix, see Theorem 3.3. As corollaries, we are able to rediscover the known results on Feller-type diffusion approximations for critical, primitive MBPIs (see Corollaries 3.8 and 3.9). What is even more interesting is that we can apply our main theorem to get Feller-type diffusion approximations for some two-sex branching processes. We emphasize that no such results are available in the literature. Very recently, Bansaye et al. [5] have also proved a scaling limit theorem for a class of two-sex branching processes that combine classical asexual Galton–Watson processes and two-sex Galton–Watson branching processes introduced by Daley [8]. For a comparison of our results and theirs, see Remark 3.12. Finally, a result on the asymptotic behaviour of the relative frequencies for critical CMBPs is also derived from Theorem 3.3 (see Corollary 3.13). This kind of result has potential applications, for instance, in the field of cell kinetics, where it is more usual to measure relative frequencies instead of the absolute cell counts.
The proof of our main result (Theorem 3.3) follows the proof scheme of Theorem 3.1 in Ispány and Pap [21] for MBPIs, which is based on a weak convergence result for random step processes due to Ispány and Pap [20, Corollary 2.2] (see also Theorem A.4). This latter result has been applied in other papers to prove several scaling limit theorems, see, e.g., Ispány and Pap [21] and Ráth [28]. The lack of the additive property for a general CMBP makes the proof of our Theorem 3.3 more involved. Next, we outline the course of the proof, and we also point out the new ingredients in it. We start with determining the conditional moments of the branching process (see Proposition 2.3), which are essential to find out the asymptotic behaviour of some moments of the branching process (see Lemma A.3). The heart of the proof of Theorem 3.3 is an application of Theorem A.4 for a sequence of martingale differences formed from the CMBP in question (see (4.3) in Step 1). Then, in case of MBPIs, the Feller-type diffusion approximation follows straightforwardly from a continuous mapping theorem (see Theorem A.5), as Ispány and Pap [21] showed. However, this is not the case for general CMBPs, an extra additional work (see Step 3) is required. We also emphasize that the additive perturbation of the expectations of the control distributions mentioned above (see also (2.10)) results a new difficulty in the proofs compared to MBPIs, and it is addressed in the proofs of Steps 1 and 3.
The paper is structured as follows. In Sect. 2, CMBPs are defined, and, under the linearity assumption (2.11) on the conditional expectations, we introduce their classification by distinguishing subcritical, critical and supercritical CMBPs. By giving examples, we also point out that different types of classical branching processes can be viewed as particular cases of our model, thus illustrating its wide scope of applicability. In Sect. 3, we collect all the hypotheses that are assumed and we present all of our results obtained. Section 4 is devoted to the proof of Theorem 3.3, which is structured in four steps for an easy reading. We close the paper with an Appendix which contains some auxiliary results such as the asymptotic behaviour of the first and second moments of CMBPs and the second and fourth moments of a corresponding martingale difference (see Lemma A.3).
2 Controlled multi-type branching processes
Let \((\Omega ,{\mathcal {F}},\textrm{P})\) be a fixed probability space on which all the random variables will be defined, and let \({\mathbb {N}}\), \({\mathbb {Z}}_+\), \({\mathbb {R}}\), \({\mathbb {R}}_+\) and \({\mathbb {R}}_{++}\) be the set of positive integers, non-negative integers, real numbers, non-negative real numbers, and positive real numbers, respectively. For all \(\varvec{x},\varvec{y}\in {\mathbb {R}}^p,\) let us denote by \(\varvec{x} \preceq \varvec{y}\) if each coordinate of \(\varvec{x}\) is less than or equal to the corresponding coordinate of \(\varvec{y}\). For \({\varvec{z}}=(z_1,\ldots ,z_p)^\top \in {\mathbb {R}}^p\), let \(|\varvec{z}|:=(\vert z_1\vert ,\ldots ,\vert z_p\vert )^\top \in {\mathbb {R}}_+^p\), and \({\varvec{z}}^+ :=(z_1^+,\ldots ,z_p^+)^\top \in {\mathbb {R}}_+^p\), where \(x^+\) stands for the positive part of \(x\in {\mathbb {R}}\) and \(^\top \) for the transpose. The natural basis in \({\mathbb {R}}^p\) is denoted by \(\varvec{e}_1,\ldots ,\varvec{e}_p\). The null vector in \({\mathbb {R}}^p\) is denoted by \(\varvec{0}_p\), and \(\varvec{1}_p\) is for the vector in \({\mathbb {R}}^p\) with all the coordinates 1. The Borel sigma-algebra on \({\mathbb {R}}^p\) is denoted by \({\mathcal {B}}({\mathbb {R}}^p)\). The trace of a matrix \({\textsf{A}}\in {\mathbb {R}}^{p\times p}\) is denoted by \({\text {tr}} \left( {\textsf{A}} \right) \). The \(p\times p\) identity matrix is denoted by \({\textsf{I}}_p\). For a matrix \({\textsf{A}}\in {\mathbb {R}}^{l\times p}\), let \({\text {Null}}({\textsf{A}}) :=\{\varvec{x}\in {\mathbb {R}}^p: {\textsf{A}}\varvec{x} = \varvec{0}_l\}\). Along the paper, we will not distinguish between the notations of the norm of a vector in \({\mathbb {R}}^p\) and that of a matrix in \({\mathbb {R}}^{p\times p}\). For \({\varvec{z}}\in {\mathbb {R}}^p\), let \(\Vert {\varvec{z}}\Vert \) denote the Euclidean norm of \({\varvec{z}}\), and for a matrix \({\textsf{A}}\in {\mathbb {R}}^{p\times p}\), let \(\Vert {\textsf{A}}\Vert :=\max \{\Vert {\textsf{A}}{\varvec{z}}\Vert : \Vert {\varvec{z}}\Vert =1, \, {\varvec{z}}\in {\mathbb {R}}^p\}\). For a positive semi-definite matrix \({\textsf{A}}\in {\mathbb {R}}^{p\times p}\), let \(\sqrt{{\textsf{A}}}\) denote the unique symmetric positive semi-definite square root of \({\textsf{A}}\). For a function \(\varvec{h}:{\mathbb {R}}^p\rightarrow {\mathbb {R}}^p\), by the notation \(\varvec{h}({\varvec{z}}) = {\text {O}}(f({\varvec{z}}))\) as \(\Vert {\varvec{z}}\Vert \rightarrow \infty \), where \(f:{\mathbb {R}}^p\rightarrow {\mathbb {R}}_+\), we mean that there exist \(C>0\) and \(R>0\) such that \(\Vert \varvec{h}({\varvec{z}})\Vert \le C f({\varvec{z}})\) for all \({\varvec{z}}\in {\mathbb {R}}^p\) with \(\Vert {\varvec{z}}\Vert >R\). Further, if for all \(\epsilon >0\) there exists \(R>0\) such that \(\Vert \varvec{h}({\varvec{z}})\Vert \le \epsilon f({\varvec{z}})\) for all \({\varvec{z}}\in {\mathbb {R}}^p\) with \(\Vert {\varvec{z}}\Vert >R\), then we write \(\varvec{h}({\varvec{z}}) = {\text {o}}(f({\varvec{z}}))\) as \(\Vert {\varvec{z}}\Vert \rightarrow \infty \). Convergence in probability is denoted by \({\mathop {\longrightarrow }\limits ^{\textrm{P}}}\). Some further notations for weak convergence of stochastic processes with càdlàg sample paths are recalled in Sect. 3.
For a fixed \(p\in {\mathbb {N}}\) and a \({\mathbb {Z}}_+^p\)–valued random vector \(\varvec{Z}_0\), let us consider a controlled p–type branching process \((\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+}\), defined recursively as
where \(\varvec{Z}_{k}\), \(\varvec{\phi }_{k} (\varvec{z})\), \(\varvec{z}\in {\mathbb {Z}}_+^p\), and \(\varvec{X}_{k,j,i}\) are \({\mathbb {Z}}_+^p\)–valued random vectors:
The intuitive interpretation of the process \((\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+}\) is as follows:
-
\(Z_{k,i}\) is the number of i–type individuals in the k–th generation,
-
\(\phi _{k,i}(\varvec{Z}_{k})\) is the number of i–type progenitors in the k–th generation,
-
\(X_{k,j,i,l}\) is the number of l–type offsprings of the j–th i–type progenitor in the k–th generation.
Assume that \(\{\varvec{Z}_{0}, \varvec{\phi }_{k} (\varvec{z}),\varvec{X}_{k,j,i}: k\in {\mathbb {Z}}_+, \, j\in {\mathbb {N}},\, \varvec{z}\in {\mathbb {Z}}_+^p,\,i\in \{1,\ldots ,p\}\}\) are independent, \(\{\varvec{\phi }_{k}(\varvec{z}):k\in {\mathbb {Z}}_+\}\) are identically distributed for each \(\varvec{z}\in {\mathbb {Z}}_+^p\) and \(\{\varvec{X}_{k,j,i}: k\in {\mathbb {Z}}_+,\,j\in {\mathbb {N}}\}\) are also identically distributed for each \(i\in \{1,\ldots ,p\}\). The distributions of \(\varvec{\phi }_0(\varvec{z})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), are called the control distributions.
Moreover, for clarity of the readers, let us write out (2.1) for controlled two-type branching processes. The number of i–type individuals in the initial generation is \(Z_{0,i}\), \(i=1,2\), and the recursive definition in (2.1) takes the form
Part (iv) of Example 2.1 serves as a good illustration of a two-type case.
It is easy to check that the process defined in (2.1) is a \({\mathbb {Z}}_+^p\)-valued Markov chain. This model is very general, several other popular branching processes can be seen as particular cases of the CMBPs, see Example 2.1.
Example 2.1
-
(i)
Multi-type branching processes. We get the subclass of p–type branching processes without immigration by defining the deterministic control function as \(\varvec{\phi }_{k} (\varvec{z}) :=\varvec{z}\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), \(k\in {\mathbb {Z}}_+\), in (2.1).
-
(ii)
Multi-type branching processes with immigration. Let us consider such a MBPI \({(\varvec{Y}_{k})_{k\in {\mathbb {Z}}_+}}\) given by
$$\begin{aligned} \varvec{Y}_{k+1} :=\sum _{i=1}^{p} \sum _{j=1}^{Y_{k,i}} \varvec{\xi }_{k,j,i} + \varvec{I}_{k+1}, \quad \quad k\in {\mathbb {Z}}_+, \end{aligned}$$(2.2)where \(\{ \varvec{Y}_0, \, \varvec{\xi }_{k,j,i}, \varvec{I}_k: k\in {\mathbb {Z}}_+, \, j\in {\mathbb {N}},\, i\in \{1,\ldots ,p\}\}\) are independent \({\mathbb {Z}}_+^p\)–valued random vectors, \(\{ \varvec{\xi }_{k,j,i}: k\in {\mathbb {Z}}_+,\, j\in {\mathbb {N}}\}\) are identically distributed for each \(i\in \{1,\ldots ,p\}\) (offspring distributions) and \(\{ \varvec{I}_k: k\in {\mathbb {Z}}_+\}\) are also identically distributed (immigration distribution). Note that \((\varvec{Y}_k)_{k\in {\mathbb {Z}}_+}\) can be written as a \((p+1)\)–type branching process with
$$\begin{aligned} \varvec{Z}_{k} :=\begin{pmatrix} \varvec{Y}_{k} \\ {1} \end{pmatrix}, \qquad \varvec{X}_{k,j,i} :=\begin{pmatrix} \varvec{\xi }_{k,j,i} \\ 0 \end{pmatrix}, \qquad {\varvec{X}_{k,1,p+1} } :=\begin{pmatrix} \varvec{I}_{k+1} \\ 1 \end{pmatrix} \end{aligned}$$(2.3)for \(k\in {\mathbb {Z}}_+\), \(j\in {\mathbb {N}}\), \(i\in \{1,\ldots ,p\}\), when \(\varvec{\phi }_{k} (\varvec{z}) :=\varvec{z}\), \({\varvec{z}}\in {\mathbb {Z}}_+^{p+1}\), \(k\in {\mathbb {Z}}_+\) (see (i)). Moreover, \((\varvec{Y}_k)_{k\in {\mathbb {Z}}_+}\) can also be written as a controlled \((p+1)\)–type branching process with the choices given in (2.3) and the control functions \( \varvec{\phi }_{k} (\varvec{z}) = (z_1, \ldots , z_p, 1)^\top \), \({\varvec{z}}\in {\mathbb {Z}}_+^{p+1}\), \(k\in {\mathbb {Z}}_+.\)
-
(iii)
Multi-type branching processes with migration. Let us consider a stochastic process \({(\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+}}\) given by
$$\begin{aligned} \varvec{Z}_{k+1} :=\sum _{i=1}^{p} \sum _{j=1}^{Z_{k,i}+M_{k,i}(Z_{k,i})} \varvec{X}_{k,j,i}, \quad \quad k\in {\mathbb {Z}}_+, \end{aligned}$$where \(\{\varvec{Z}_{0}, \varvec{M}_{k}({\varvec{z}}),\varvec{X}_{k,j,i}: k\in {\mathbb {Z}}_+, \, j\in {\mathbb {N}},\, \varvec{z}\in {\mathbb {Z}}_+^p,\,i\in \{1,\ldots ,p\}\}\) are independent \({\mathbb {Z}}^p\)–valued random vectors such that \(\varvec{Z}_{0}\) and \(\varvec{X}_{k,j,i}\) have non-negative coordinates, \(\{\varvec{M}_{k}({\varvec{z}}) :=(M_{k,1}(z_1),\ldots ,M_{k,p}(z_p))^\top : k\in {\mathbb {Z}}_+\}\) are identically distributed for each \({\varvec{z}}\in {\mathbb {Z}}_+^p\) with range contained in \([-z_1,\infty ) \times \cdots \times [-z_p,\infty )\), and \(\{\varvec{X}_{k,j,i}: k\in {\mathbb {Z}}_+,\,j\in {\mathbb {N}}\}\) are also identically distributed for each \(i\in \{1,\ldots ,p\}\). For each \(i\in \{1,\ldots ,p\}\), \(M_{k,i}(z_i)\) can be interpreted as a migration component for the i–type individuals in the k–th generation. Depending on the sign of \(M_{k,i}(z_i)\), there is emigration (negative value), immigration (positive value) or no migration (zero value). In fact, this is an equivalent way of writing a CMBP, because every \(\varvec{\phi }_{k} (\varvec{z})\) can be written as \(\varvec{z} + \varvec{M}_{k}({\varvec{z}})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), \(k\in {\mathbb {N}}\), with appropriate choices of \(\varvec{M}_{k}({\varvec{z}})\).
-
(iv)
Two-sex Galton–Watson branching processes with immigration (2SBPIs). Let us consider such a process \((F_k,M_k)_{k\in {\mathbb {Z}}_+}\) with offspring and immigration distribution defined as follows. Let \((F_0,M_0)\) be an \({\mathbb {Z}}_+^2\)–valued random variable (random initial generation) and
$$\begin{aligned} (F_{k+1},M_{k+1}) :=\sum _{j=1}^{U_k} (f_{k,j},m_{k,j}) + (F_{k+1}^{I},M_{k+1}^{I}),\qquad U_k :=L(F_k,M_k),\qquad k\in {\mathbb {Z}}_+, \nonumber \\ \end{aligned}$$(2.4)where \(\{(F_0,M_0),(f_{k,j},m_{k,j}), (F_k^I,M_k^I): k\in {\mathbb {Z}}_+,\, j\in {\mathbb {N}}\}\) are independent \({\mathbb {Z}}_+^2\)–valued random vectors, \(\{ (f_{k,j},m_{k,j}): k\in {\mathbb {Z}}_+,\, j\in {\mathbb {N}}\}\) are identically distributed (offspring distribution), and \(\{ (F_k^I,M_k^I): k\in {\mathbb {Z}}_+\}\) are also identically distributed (immigration distribution). Further, \((U_k)_{k\in {\mathbb {Z}}_+}\) is a sequence of mating units corresponding to the mating function \(L:{\mathbb {Z}}_+\times {\mathbb {Z}}_+ \rightarrow {\mathbb {Z}}_+\), assumed to be non-decreasing in each argument. In the terminology of Asmussen [3], L can be called a marriage function as well. Then the 2SBPI \((F_k,M_k)_{k\in {\mathbb {Z}}_+}\) can be considered a CMBP \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\) given by
$$\begin{aligned} \varvec{Z}_{k} :=\begin{pmatrix} F_{k} \\ M_{k} \end{pmatrix}, \qquad \varvec{\phi }_{k} (\varvec{z}) :=\begin{pmatrix} L(\varvec{z}) \\ 1 \end{pmatrix}, \qquad \varvec{X}_{k,j,1} :=\begin{pmatrix} f_{k,j} \\ m_{k,j} \end{pmatrix}, \qquad \varvec{X}_{k,j,2} :=\begin{pmatrix} F_{k+1}^{I} \\ M_{k+1}^{I} \end{pmatrix} \end{aligned}$$for \(k\in {\mathbb {Z}}_+\), \(j\in {\mathbb {N}}\), and \({\varvec{z}}\in {\mathbb {Z}}_+^2\). In this case, we emphasize that the coordinates of the control can be interpreted in a different way compared to what is written after (2.1). Namely, the first coordinate of the control denotes the number of couples, while the second one denotes an extra couple, which brings in female and male immigrants. \(\blacksquare \)
Remark 2.2
As happens for the 2SBPI, the intuitive interpretation of the branching process that appears in part (ii) of Example 2.1 is lost when we see it as a \((p+1)\)–type branching process or as a controlled \((p+1)\)–type branching process. Indeed, we add the possibility that there is an extra type of individuals in the population, but the number of them in all generations is 1. The only one individual of extra type always gives birth one individual of extra type and possibly some individuals of other types, which correspond to the immigrants arriving in the population. Despite the fact that both cases lack practical interpretability, this mathematical description is perfectly valid and useful:
-
(i)
It will enable us to recover the known result on the asymptotic behaviour of critical, primitive MBPIs due to Ispány and Pap [21, Theorem 3.1] (see Corollary 3.8). Furthermore, if we do not want to lose the intuitive interpretation, we could consider the controlled p–type branching process \((\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+}\) with \({\varvec{Z}}_0 :=\varvec{Y}_0\), \(\varvec{\phi }_{k} (\varvec{z}) :=\varvec{z} + \varvec{I}_{k+1}\), and \(\varvec{X}_{k,j,i} :=\varvec{\xi }_{k,j,i}\) for \(k\in {\mathbb {Z}}_+\), \(j\in {\mathbb {N}}\), and \({\varvec{z}}\in {\mathbb {Z}}_+^p\), where \(\{ \varvec{Y}_0, \, \varvec{\xi }_{k,j,i}, \varvec{I}_k: k\in {\mathbb {Z}}_+, \, j\in {\mathbb {N}},\, i\in \{1,\ldots ,p\}\}\) are given in part (ii) of Example 2.1. This is the third representation of a MBPI (for the other two ones, see part (ii) of Example 2.1). Note also that \((\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+}\) is a particular case of the multi-type branching process with migration in part (iii) in Example 2.1. In the single-type critical case, González et al. [13, Remark 3.2.2] proved that (under some moment assumptions) \((n^{-1} Z_{\lfloor nt\rfloor })_{t\in {\mathbb {R}}_+}\) converges weakly as \(n\rightarrow \infty \) toward a limit process, which coincides with the limit process for a critical (usual) single-type branching processes with immigration scaled and normalized in the same way. Corollary 3.9 will imply that a version of this statement remains true in the p–type critical case.
-
(ii)
It is the first time that a 2SBPI is written as a CMBP (see part (iv) of Example 2.1), such a rewriting has not been considered until now in the literature. In fact, since the control distributions are deterministic in case of a 2SBPI, it suggests a possible way to generalize the notion of a 2SBPI by allowing random mating functions. Earlier, only two-sex Galton–Watson branching processes without immigration, introduced by Daley [8], were viewed as special controlled branching processes (see Sevast’yanov and Zubkov [31, model 3]). Furthermore, Theorem 3.3 together with part (iv) of Example 2.1 will allow us to obtain scaling limit theorems for some critical 2SBPIs previously not considered in the literature (see Corollary 3.10). \(\blacksquare \)
Let us introduce notations for some moments. In all what follows, we suppose that \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,i}\Vert ^4 \right] <\infty \) for \(i\in \{1,\ldots ,p\}\) and \({\text {E}}\! \left[ \Vert \varvec{\phi }_{0}(\varvec{z})\Vert ^4 \right] <\infty \) for \(\varvec{z}\in {\mathbb {Z}}_+^p\), and we denote
where \(i,l\in \{1,\ldots ,p\}\) and \({\varvec{z}}\in {\mathbb {Z}}_+^p\).
Note that the moments defined in (2.5) and (2.6) are of course well-defined under weaker assumptions, namely, under the existence of first and second order moments of the offspring and control distributions, respectively. We also remark that \(\varvec{m}_i\), \(\mathsf {\Sigma }_i\) and \(\zeta _{i,l}\), \(i,l\in \{1,\ldots ,p\}\), do not depend on the control distributions, while \(\varvec{\varepsilon }(\varvec{z})\), \(\mathsf {\Gamma }(\varvec{z})\), and \(\kappa _i(\varvec{z})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), \(i\in \{1,\ldots ,p\}\), do depend.
Let us consider the canonical filtration of the process \({\mathcal {F}}_k :=\sigma (\varvec{Z}_{0},\ldots ,\varvec{Z}_{k}),\) \(k\in {\mathbb {Z}}_+,\) introduce the matrix \({\textsf{m}} :=(\varvec{m}_1,\ldots , \varvec{m}_p) \in {\mathbb {R}}_+^{p\times p},\) and the operator \(\odot :{\mathbb {R}}^p \times ({\mathbb {R}}^{p\times p})^p \rightarrow {\mathbb {R}}^{p\times p}\), \(\varvec{z}\odot \varvec{{\textsf{A}}} :=\sum _{i=1}^p z_i{\textsf{A}}_i\) for \(\varvec{z}=(z_1,\ldots ,z_p)^\top \in {\mathbb {R}}^p\) and \(\varvec{{\textsf{A}}}=({\textsf{A}}_1,\ldots ,{\textsf{A}}_p) \in ({\mathbb {R}}^{p\times p})^p\).
The following proposition is a multi-type counterpart of Proposition 3.5 in González et al. [12].
Proposition 2.3
For each \(k\in {\mathbb {N}}\), we have
where \(\varvec{\mathsf {\Sigma }} :=(\mathsf {\Sigma }_1, \ldots ,\mathsf {\Sigma }_p)\in ({\mathbb {R}}^{p\times p})^p\).
Proof
Let \(k\in {\mathbb {N}}\) be fixed arbitrarily. By the Markov property, we get \({\text {E}}\! \left[ {\varvec{Z}}_k \mid {\mathcal {F}}_{k-1} \right] = {\text {E}}\! \left[ {\varvec{Z}}_k \mid {\varvec{Z}}_{k-1} \right] \). Using that \(\varvec{\phi }_{k-1}(\varvec{z})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), and \({{\textbf {X}}}_{k-1,j,i}\), \(j\in {\mathbb {N}}\), \(i\in \{1,\ldots ,p\}\), are independent of each other and of \({\varvec{Z}}_{k-1}\), we have
for \({\varvec{z}}\in {\mathbb {Z}}_+^p\), which implies (2.8). Now we turn to prove (2.9). Similarly as before, we get
for \({\varvec{z}}\in {\mathbb {Z}}_+^p\). This yields (2.9). \(\square \)
From now on, we assume that there exist a matrix \(\mathsf {\Lambda }\in {\mathbb {R}}^{p\times p}\) and a function \(\varvec{h}: {\mathbb {Z}}_+^p \rightarrow {\mathbb {R}}^p\) with \(\Vert \varvec{h}(\varvec{z})\Vert = {\text {o}}(\Vert {\varvec{z}}\Vert )\) as \(\Vert \varvec{z}\Vert \rightarrow \infty \) such that
Roughly speaking, the assumption (2.10) means that the average quantity of ancestors can be expressed as a linear map of the number of individuals affected by a minor perturbation, which becomes insignificant (negligible) in comparison with the population size when this latter is large enough. It is worth noting that migration of parents, both emigration and immigration, is permitted under (2.10) (see, for instance, part (iii) of Example 2.1 assuming that \({ \Vert {\text {E}}\! \left[ \varvec{M}_0({\varvec{z}}) \right] \Vert } = {\text {o}}(\Vert {\varvec{z}}\Vert )\) as \(\Vert {\varvec{z}}\Vert \rightarrow \infty \)). In the single-type case, i.e., in case of \(p=1\), a corresponding condition has already been assumed (see, e.g., the condition (i) in Theorems 1 and 2 in González et al. [17]). In the multi-type case, the assumption (2.10) was also considered in González and del Puerto [11, condition (11.2)] and in González et al. [16, condition (4.2)], but only with a diagonal matrix \(\mathsf {\Lambda }\) having non-negative entries.
Using (2.8) and (2.10), we get
We can introduce a classification for a subclass of CMBPs satisfying (2.11) based on the asymptotic behaviour of \({\text {E}}\! \left[ \varvec{Z}_k \right] \) as \(k\rightarrow \infty \). Taking expectations of both sides of (2.11), by a recursive argument, one can derive a formula for \({\text {E}}\! \left[ \varvec{Z}_k \right] \), \(k\in {\mathbb {N}}\) (see (A.5) in Appendix). From this, one can see that in the description of the asymptotic behaviour of \({\text {E}}\! \left[ \varvec{Z}_k \right] \) as \(k\rightarrow \infty \), the powers of the matrix \(\tilde{{\textsf{m}}} :={\textsf{m}}\mathsf {\Lambda }\) play a crucial role. We assume that \(\tilde{{\textsf{m}}}\in {\mathbb {R}}_+^{p\times p}\) and it has only one eigenvalue of maximum modulus \({\tilde{\rho }}\) with algebraic and geometric multiplicities equal one. By the spectral theory of matrices, the asymptotic behaviour of \(({\textsf{m}}\mathsf {\Lambda })^k\) as \(k\rightarrow \infty \) is determined by \({\tilde{\rho }}\), the other eigenvalues do not come into play. Motivated by this, by definition, we say that such a CMBP is subcritical if \({\tilde{\rho }}<1\), critical if \({\tilde{\rho }}=1\), and supercritical if \({\tilde{\rho }}>1\). It is important to highlight that whenever the matrix \(\tilde{{\textsf{m}}}\) is primitive (i.e., there exists \(n\in {\mathbb {N}}\) such that all the entries of \(\tilde{{\textsf{m}}}^n\) are positive), the Perron–Frobenius theorem (see also Lemma A.1) guarantees the unique existence of \({\tilde{\rho }}\). Moreover, the usual classification of multi-type branching processes without and with immigration coincides with the one proposed here (see, e.g., Athreya and Ney [4, Chapter V, page 186] and Kaplan [25, page 948]).
3 Results
Let \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\) be a controlled p–type branching process given in (2.1). Let us introduce the following hypotheses.
Hypothesis 1
\({\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert ^2 \right] \), \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,i}\Vert ^4 \right] \) and \({\text {E}}\! \left[ \Vert \varvec{\phi }_{0}(\varvec{z})\Vert ^4 \right] \) are finite for each \(i\in \{1,\ldots ,p\}\) and \({\varvec{z}}\in {\mathbb {Z}}_+^p\).
Hypothesis 2
There exist \(\mathsf {\Lambda }\in {\mathbb {R}}^{p\times p}\), \(\varvec{\alpha }\in {\mathbb {R}}^p\) and a function \(\varvec{g}:{\mathbb {Z}}_+^p\rightarrow {\mathbb {R}}^p\) with \(\Vert \varvec{g}(\varvec{z})\Vert = {\text {o}}(1)\) as \(\Vert \varvec{z}\Vert \rightarrow \infty \) such that
Hypothesis 3
\(\Vert \mathsf {\Gamma }(\varvec{z})\Vert ={\text {o}}(\Vert \varvec{z}\Vert )\) as \(\Vert \varvec{z}\Vert \rightarrow \infty .\)
Hypothesis 4
\(\kappa _i(\varvec{z}) ={\text {O}}(\Vert \varvec{z}\Vert ^2)\) as \(\Vert \varvec{z}\Vert \rightarrow \infty \) for \(i=1,\ldots ,p.\)
Hypothesis 5
The matrix \(\tilde{{\textsf{m}}} :={\textsf{m}}\mathsf {\Lambda }\) belongs to \({\mathbb {R}}_+^{p\times p}\) and
-
(a)
\({\tilde{\rho }}:=1\) is an eigenvalue of \(\tilde{{\textsf{m}}}\) having algebraic and geometric multiplicities 1, and the absolute values of the other eigenvalues of \(\tilde{{\textsf{m}}}\) are less than 1,
-
(b)
there exist a unique right eigenvector \(\tilde{\varvec{u}}\in {\mathbb {R}}_{+}^p\) and a unique left eigenvector \(\tilde{\varvec{v}}\in {\mathbb {R}}_{+}^p\) corresponding to \({\tilde{\rho }} = 1\) such that \(\tilde{u}_1 + \ldots + \tilde{u}_p = 1\), \(\mathsf {\Lambda }\tilde{\varvec{u}}\in {\mathbb {R}}_{+}^p\) and \(\tilde{\varvec{v}}^\top \tilde{\varvec{u}} = 1\),
-
(c)
\(\lim _{k\rightarrow \infty } \tilde{{\textsf{m}}}^k = \tilde{\mathsf {\Pi }} \) and there exist \(\tilde{c}\in {\mathbb {R}}_{++}\) and \(\tilde{r}\in (0,1)\) such that \(\Vert \tilde{{\textsf{m}}}^k - \tilde{\mathsf {\Pi }}\Vert \le \tilde{c}\tilde{r}^k\) for each \(k\in {\mathbb {N}}\), where \(\tilde{\mathsf {\Pi }} :=\tilde{\varvec{u}}\tilde{\varvec{v}}^\top \).
Hypothesis 6
For all \(\epsilon >0\) and \(B>0\), there exists \(k_0(\epsilon ,B)\in {\mathbb {N}}\) such that \({\text {P}} \left[ \Vert {\varvec{Z}}_k \Vert \le B \right] \le \epsilon \) for each \(k\ge k_0(\epsilon ,B)\), \(k\in {\mathbb {N}}\).
Remark 3.1
-
(i)
In Hypotheses 1 and 4, the fourth order moment assumptions for the offspring and control distributions are used in the proofs only for checking the conditional Lindeberg condition, namely, condition (iii) of Theorem A.4, in order to prove convergence of some random step processes toward a diffusion process. For critical single-type CBPs, assuming that the explosion set has probability one, a detailed exposition of a proof of the conditional Lindeberg condition in question under second order moment assumptions can be found in González et al. [13].
-
(ii)
If Hypothesis 2 holds for \(\varvec{\phi }_0(\varvec{z})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), then it also holds for any of its linear transformations \({\textsf{B}}\varvec{\phi }_0(\varvec{z}) + \varvec{\beta }\), \({\varvec{z}}\in {\varvec{Z}}_+^p\), with \({\textsf{B}}\in {\mathbb {R}}^{p\times p}\) and \(\varvec{\beta }\in {\mathbb {R}}^p\), by replacing \(\mathsf {\Lambda }\), \(\varvec{\alpha }\) and \(\varvec{g}\) with \(\mathsf {B\Lambda }\), \({\textsf{B}}\varvec{\alpha } + \varvec{\beta }\) and \({\textsf{B}}\varvec{g}\), respectively. Note that, in Hypothesis 2, \(\varvec{\alpha }\in {\mathbb {R}}^p\) can have negative coordinates as well, for an example, see part (i) of Remark 3.6.
-
(iii)
Taking into account Lemma A.1, we have that Hypothesis 5 holds if the matrix \(\tilde{{\textsf{m}}}\) is primitive, its Perron–Frobenius eigenvalue \(\tilde{\rho }\) equals 1, and \(\mathsf {\Lambda } \tilde{\varvec{u}}\in {\mathbb {R}}_{+}^p\), where \(\tilde{\varvec{u}}\) is the right Perron–Frobenius eigenvector of \(\tilde{{\textsf{m}}}\) corresponding to \(\tilde{\rho }\). Moreover, notice that if \({\textsf{m}}\) is primitive, then a set of sufficient conditions for \(\tilde{{\textsf{m}}}\) to be primitive is that \(\mathsf {\Lambda }\in {\mathbb {R}}_{++}^{p\times p}\) and that \({\textsf{m}}\) and \(\mathsf {\Lambda }\) commute. Note also that if \(\mathsf {\Lambda }\in {\mathbb {R}}_+^{p\times p}\), then \(\tilde{{\textsf{m}}}\in {\mathbb {R}}_+^{p\times p}\) and \(\mathsf {\Lambda }\varvec{x}\in {\mathbb {R}}_+^p\) for all \(\varvec{x}\in {\mathbb {R}}_+^p\).
-
(iv)
A sufficient condition for Hypothesis 6 is the almost sure explosion of the process, i.e. \({\text {P}} \left[ \Vert \varvec{Z}_{k}\Vert \rightarrow \infty \text { as } k\rightarrow \infty \right] = 1\). Indeed, \(\{ \Vert \varvec{Z}_{k}\Vert \rightarrow \infty \text { as } k\rightarrow \infty \} \subset \bigcup _{k=0}^\infty \bigcap _{n = k}^{\infty } \{\Vert \varvec{Z}_{n}\Vert > B \}\) for all \(B>0\). Therefore, by the continuity of probability, we have for all \(B>0\),
$$\begin{aligned} 1 = {\text {P}} \left[ \bigcup _{k=0}^\infty \bigcap _{n = k}^{\infty } \{\Vert \varvec{Z}_{n}\Vert> B \} \right] = \lim _{k\rightarrow \infty } {\text {P}} \left[ \bigcap _{n = k}^{\infty } \{\Vert \varvec{Z}_{n}\Vert> B \} \right] \le \liminf _{k\rightarrow \infty } {\text {P}} \left[ \Vert \varvec{Z}_k\Vert > B \right] , \end{aligned}$$yielding that \(\lim _{k\rightarrow \infty } {\text {P}} \left[ \Vert \varvec{Z}_k\Vert > B \right] =1\) for all \(B>0\), or equivalently \(\lim _{k\rightarrow \infty } {\text {P}} \left[ \Vert \varvec{Z}_k\Vert \right] {\le B} =0\) for all \(B>0\). Consequently, Hypothesis 6 holds. It is also interesting to notice that Hypothesis 6 can not be derived from Hypotheses 1–5, see part (ii) of Remark 3.6 for a nontrivial example, where Hypotheses 1–5 hold, but Hypothesis 6 does not.
-
(v)
In case of \(\varvec{g}\equiv \varvec{0}_p\), Hypothesis 6 is not needed for our forthcoming main Theorem 3.3. In fact, in the proof of Theorem 3.3 it will be seen that Hypothesis 6 is only used for deriving (4.34) and (4.37), which are trivially satisfied for \(\varvec{g}\equiv \varvec{0}_p\). \(\blacksquare \)
Next, we give an example for a controlled p–type branching process for which the Hypotheses 1–5 hold.
Example 3.2
Let us consider a controlled p–type branching process with \({\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert ^2 \right] <\infty \), \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,i}\Vert ^4 \right] <\infty \), \(i\in \{1,\ldots ,p\}\), and
where \(\{U_{k,i}: k\in {\mathbb {Z}}_+, i\in \{1,\ldots ,p\}\}\) are independent and identically distributed \({\mathbb {Z}}_+^p\)–valued random variables such that their common distribution is the uniform distribution on the set \(\{-1,0,1\}\), and \({\mathbbm {1}}_A\) denotes the indicator function of a set A. The interpretation is as follows. Whenever there are i–type individuals, there is a migratory component \(U_{k,i}\) that allows the emigration/immigration of one i–type progenitor or there could be no migration with the same probabilities \(\frac{1}{3}\). This is a particular case of model (iii) in Example 2.1. Hypothesis 1 is trivially fulfilled and Hypotheses 2, 3 and 4 hold with \(\mathsf {\Lambda }={\textsf{I}}_p,\) \(\varvec{\alpha }= \varvec{0}_p\) and \(\varvec{g}\equiv \varvec{0}_p\), since, for each \(\varvec{z}\in {\mathbb {Z}}_+^p\), we have
yielding that \(\Vert \mathsf {\Gamma }(\varvec{z})\Vert \le \frac{2}{3}\) and \(\kappa _i(\varvec{z})\le \frac{2}{3}\) for \({\varvec{z}}\in {\mathbb {Z}}_+^p\). Moreover, taking into account parts (iii) and (iv) of Remark 3.1, if \({\textsf{m}}\) is primitive with Perron–Frobenius eigenvalue 1, then Hypothesis 5 is also satisfied, and Hypothesis 6 is not needed. \(\blacksquare \)
Let \({{\textbf {C}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\) be the space of \({\mathbb {R}}^p\)–valued continuous functions on \({\mathbb {R}}_+\) and let us denote by \({{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\) the space of \({\mathbb {R}}^p\)–valued càdlàg functions on \({\mathbb {R}}_+\) and by \({\mathcal {D}}_\infty ({\mathbb {R}}_+,{\mathbb {R}}^p)\) its Borel \(\sigma \)–algebra corresponding to the metric defined in equation (1.26) in Jacod and Shiryaev [22, Chapter VI]. For \({\mathbb {R}}^p\)–valued stochastic processes with càdlàg paths \((\varvec{{\mathcal {Y}}}_t^{(n)})_{t\in {\mathbb {R}}_+}\), \(n\in {\mathbb {N}}\), and \((\varvec{{\mathcal {Y}}}_t)_{t\in {\mathbb {R}}_+}\), if the distribution of \((\varvec{{\mathcal {Y}}}_t^{(n)})_{t\in {\mathbb {R}}_+}\) on the space \(\left( {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p),{\mathcal {D}}_\infty ({\mathbb {R}}_+,{\mathbb {R}}^p)\right) \) converges weakly as \(n\rightarrow \infty \) to the distribution of \((\varvec{{\mathcal {Y}}}_t)_{t\in {\mathbb {R}}_+}\) on the same space, then we use the notation \((\varvec{{\mathcal {Y}}}_t^{(n)})_{t\in {\mathbb {R}}_+} {\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}} (\varvec{{\mathcal {Y}}}_t)_{t\in {\mathbb {R}}_+}\) as \(n\rightarrow \infty \).
Let us define a sequence of random step processes \((\varvec{{\mathcal {Z}}}_t^{(n)})_{t\in {\mathbb {R}}_+}\), \(n\in {\mathbb {N}},\) using the controlled p–type branching process \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\), as
Recall that, under Hypothesis 5, \(\tilde{\varvec{u}}, \tilde{\varvec{v}}\in {\mathbb {R}}_+^p\) are the right and left eigenvectors of \(\tilde{{\textsf{m}}}\) corresponding to eigenvalue \({\tilde{\rho }}=1\), respectively.
Theorem 3.3
Suppose that Hypotheses 1–6 hold for the CMBP \((\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+}\) given in (2.1). Then
where \(({\mathcal {Z}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \(\mathcal {Z}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process.
Remark 3.4
The coefficient functions \({\mathbb {R}}\ni x\mapsto \tilde{\varvec{v}}^\top {\textsf{m}}\varvec{\alpha } \in {\mathbb {R}}\) and \({\mathbb {R}}\ni x\mapsto \sqrt{\tilde{\varvec{v}}^\top ((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }})\tilde{\varvec{v}}x^+} \in {\mathbb {R}}\) satisfy the conditions of Theorem 1 in Yamada and Watanabe [36], so pathwise uniqueness is guaranteed for SDE (3.2). Indeed, \(\int _0^\epsilon u^{-1} \,\textrm{d}u = \infty \) for all \(\epsilon >0\) and square root is a strictly increasing non-negative function vanishing at zero such that \(|\sqrt{x}-\sqrt{y}|\le \sqrt{|x-y|}\) for all \(x,y\in {\mathbb {R}}_+\). Moreover, if \(\tilde{\varvec{v}}^\top {\textsf{m}}\varvec{\alpha }\in {\mathbb {R}}_+\) (respectively, \(\tilde{\varvec{v}}^\top {\textsf{m}}\varvec{\alpha }>0\)), then \({\mathcal {Z}}_t\) is non-negative (respectively, positive) for all \(t>0\) by the comparison theorem (see, e.g., Revuz and Yor [29, Chapter IX, Theorem 3.7]) and, consequently, \({\mathcal {Z}}_t^+\) in (3.2) may be replaced by \({\mathcal {Z}}_t\). Indeed, in case of \(\tilde{\varvec{v}}^\top ((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }})\tilde{\varvec{v}}=0\), we have \({\mathcal {Z}}_t=\tilde{\varvec{v}}^\top {\textsf{m}}\varvec{\alpha }\,t\), \(t\in {\mathbb {R}}_+\), and in case of \(\tilde{\varvec{v}}^\top ((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }})\tilde{\varvec{v}}>0\) and \(t>0\), we have \({\mathcal {Z}}_t\) has a Gamma distribution with parameters \(2\tilde{\varvec{v}}^\top {\textsf{m}}\varvec{\alpha }/(\tilde{\varvec{v}}^\top ((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }})\tilde{\varvec{v}})\) and \(2/(\tilde{\varvec{v}}^\top ((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }})\tilde{\varvec{v}} \, t)\) (following, e.g., from Ikeda and Watanabe [19, Chapter IV, Example 8.2]). \(\blacksquare \)
Remark 3.5
-
(i)
It turns out that the limit process in (3.1) can also be characterized as the pathwise unique strong solution of the SDE
$$\begin{aligned} \,\textrm{d}\varvec{{\mathcal {Z}}}_t = \tilde{\mathsf {\Pi }}{\textsf{m}}\varvec{\alpha }\,\textrm{d}t + \tilde{\mathsf {\Pi }}\sqrt{(\mathsf {\Lambda }\varvec{{\mathcal {Z}}}_t)^+ \odot \varvec{\mathsf {\Sigma }}} \,\textrm{d}\varvec{{\mathcal {W}}}_t, \qquad t\in {\mathbb {R}}_+, \end{aligned}$$(3.3)with initial value \(\varvec{\mathcal {Z}}_0 = \varvec{0}_p\), where \((\varvec{{\mathcal {W}}}_t)_{t\in {\mathbb {R}}_+}\) is a standard p–dimensional Wiener process (for details, see Step 4 in the proof of Theorem 3.3).
-
(ii)
If all the coordinates of the vector \(\varvec{\alpha }\in {\mathbb {R}}^p\) in Hypothesis 2 are negative, then it is easy to prove that the pathwise unique strong solution of the SDE (3.2) is \({\mathcal {Z}}_t=0\), \(t\in {\mathbb {R}}_+\). \(\blacksquare \)
In the next remark, we point out that Hypothesis 6 basically excludes that \({\text {P}} \left[ \varvec{\phi }_0(\varvec{0}_p)=\varvec{0}_p \right] =1\), this condition appears for the conditional weak limit theorems for one-dimensional distributions of critical CMBPs in González et al. [16, Sect. 4]. Further, we also demonstrate that Hypothesis 6 is independent of Hypotheses 1–5.
Remark 3.6
-
(i)
If \({\text {P}} \left[ \varvec{\phi }_0(\varvec{0}_p)=\varvec{0}_p \right] =1\), then \(\varvec{\alpha }=-\varvec{g}(\varvec{0}_p)\) in Hypothesis 2. Moreover, if \({\text {P}} \left[ \varvec{\phi }_0(\varvec{0}_p)=\varvec{0}_p \right] =1\) and \(\varvec{0}_p\) is a state of \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\), i.e., there exists an \(n_0\in {\mathbb {Z}}_+\) such that \({\text {P}} \left[ {\varvec{Z}}_{n_0}=\varvec{0}_p \right] >0\), then Hypothesis 6 does not hold. Indeed, in this case, we have \(\{{\varvec{Z}}_{n_0}=\varvec{0}_p \}\subseteq \{ {\varvec{Z}}_k=\varvec{0}_p\}\) for each \(k\ge n_0\), \(k\in {\mathbb {Z}}_+\), yielding that \(\{{\varvec{Z}}_{n_0}=\varvec{0}_p\}\subseteq \{\Vert {\varvec{Z}}_k \Vert \le B\}\) for all \(B>0\) and \(k\ge n_0\), \(k\in {\mathbb {Z}}_+\). Consequently, we get \(0<{\text {P}} \left[ {\varvec{Z}}_{n_0}=\varvec{0}_p \right] \le {\text {P}} \left[ \Vert {\varvec{Z}}_k \Vert \le B \right] \) for all \(B>0\) and \(k\ge n_0\), \(k\in {\mathbb {Z}}_+\). By choosing \(\epsilon :=\frac{1}{2}{\text {P}} \left[ {\varvec{Z}}_{n_0}= \varvec{0}_p \right] >0\), one can see that Hypothesis 6 does not hold. If \({\text {P}} \left[ \varvec{\phi }_0(\varvec{0}_p)= \varvec{0}_p \right] =1\) and \(\varvec{0}_p\) is not a state of \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\), then, in principle, Hypothesis 6 can hold. For example, let \(p:=2\),
$$\begin{aligned}&\varvec{\phi }_{k}(\varvec{z}) =:{\left\{ \begin{array}{ll} \varvec{0}_2 &{} \text {if}\,\, {\varvec{z}}=\varvec{0}_2,\\ \begin{pmatrix} z_1+1 \\ z_1+z_2-1 \end{pmatrix}&\text {if}\,\, {\varvec{z}}\ne \varvec{0}_2, {\varvec{z}}\in {\mathbb {Z}}_+^2, \end{array}\right. } \qquad k\in {\mathbb {Z}}_+,\\&\varvec{X}_{k,j,1} =:\begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad \varvec{X}_{k,j,2} =:\begin{pmatrix} 0 \\ 0 \end{pmatrix}, \quad {\varvec{Z}}_0 =:\begin{pmatrix} 1 \\ 0 \end{pmatrix},\qquad k\in {\mathbb {Z}}_+,\,j\in {\mathbb {N}}. \end{aligned}$$Then \({\text {P}} \left[ \varvec{\phi }_0(\varvec{0}_2)=\varvec{0}_2 \right] =1\) and
$$\begin{aligned} {\varvec{Z}}_k=(k+1) \begin{pmatrix} 1 \\ 0 \end{pmatrix},\qquad k\in {\mathbb {N}}. \end{aligned}$$Hence one can easily see that Hypotheses 1–6 hold with
$$\begin{aligned} \mathsf {\Lambda }:=\begin{pmatrix} 1 &{} 0 \\ { 1} &{} { 1} \\ \end{pmatrix}, \qquad \varvec{\alpha }:=\begin{pmatrix} 1 \\ { -1} \end{pmatrix}, \qquad \varvec{g}(\varvec{z}):={\left\{ \begin{array}{ll} \begin{pmatrix} -1 \\ { 1} \end{pmatrix} &{} \text {if}\,\, {\varvec{z}}= \varvec{0}_2,\\ \varvec{0}_2 &{} \text {if} \,\,{\varvec{z}}\ne \varvec{0}_2, {\varvec{z}}\in {\mathbb {Z}}_+^2, \end{array}\right. } \end{aligned}$$and
$$\begin{aligned} {\textsf{m}} = \tilde{{\textsf{m}}} :=\begin{pmatrix} 1 &{} 0 \\ 0 &{} 0 \\ \end{pmatrix}, \qquad \tilde{\varvec{u}}:=\begin{pmatrix} 1 \\ 0 \\ \end{pmatrix},\qquad \tilde{\varvec{v}}:=\begin{pmatrix} 1 \\ 0 \\ \end{pmatrix}. \end{aligned}$$In this case, without the application of Theorem 3.3, we readily have that
$$\begin{aligned} (\varvec{{\mathcal {Z}}}_t^{(n)})_{t\in {\mathbb {R}}_+} = \left( \frac{1}{n}(\lfloor nt\rfloor +1) \begin{pmatrix} 1 \\ 0 \\ \end{pmatrix} \right) _{t\in {\mathbb {R}}_+} {\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}} \left( t \begin{pmatrix} 1 \\ 0 \\ \end{pmatrix} \right) _{t\in {\mathbb {R}}_+} \qquad \text {as } n\rightarrow \infty . \end{aligned}$$Of course, Theorem 3.3 also gives the same result, since in this case the SDE (3.2) takes the form \(\,\textrm{d}{\mathcal {Z}}_t = \,\textrm{d}t\), \(t\in {\mathbb {R}}_+\), with initial value \(\mathcal {Z}_0 = 0\), yielding that \({\mathcal {Z}}_t = t\), \(t\in {\mathbb {R}}_+\).
-
(ii)
We give a nontrivial example, where Hypotheses 1–5 hold, but Hypothesis 6 does not. Fix \(p:=2\) and \(\varvec{Z}_0:=\varvec{1}_2\). For \(k\in {\mathbb {Z}}_+\), let \(\varvec{\phi }_{k}(\varvec{z}) :=\varvec{z}+\varvec{1}_2\) for \({\varvec{z}}\in {\mathbb {Z}}_+^2{\setminus } \{\varvec{0}_2\}\) and \(\varvec{\phi }_{k}(\varvec{0}_2) :=\varvec{0}_2\); and for \(k,j\in {\mathbb {N}}\) and \(i\in \{1,2\}\), let \(\varvec{X}_{k,j,i}\) be a random vector with values in \(\{\varvec{0}_2,\varvec{1}_2\}\) taking both values with probability \(\frac{1}{2}\). Then \( \varvec{m}_i={\text {E}}\! \left[ \varvec{X}_{0,1,i} \right] = \frac{1}{2}\varvec{1}_2\) for \(i\in \{1,2\}\). Further, let \(\mathsf {\Lambda }:={\textsf{I}}_2\), \(\varvec{\alpha }:=\varvec{1}_2\), \(\varvec{g}(\varvec{0}_2) :=- \varvec{1}_2\) and \(\varvec{g}(\varvec{z}) :=\varvec{0}_2\) for \({\varvec{z}}\in {\mathbb {Z}}_+^2{\setminus } \{\varvec{0}_2\}\). Hence \(\tilde{{\textsf{m}}} = {\textsf{m}} \mathsf {\Lambda } = {\textsf{m}} = \frac{1}{2} {\begin{pmatrix} 1 &{} 1 \\ 1 &{} 1 \\ \end{pmatrix}}\), which is a primitive matrix with Perron–Frobenius eigenvalue \({{\tilde{\rho }}} =1\). Consequently, Hypotheses 1–5 hold with the above introduced \(\mathsf {\Lambda }\), \(\varvec{\alpha }\), \(\varvec{g}\), \(\tilde{{\textsf{m}}}\), and \(\tilde{\varvec{u}}:=\frac{1}{\sqrt{2}}\varvec{1}_2 =:\tilde{\varvec{v}}\). Moreover, since \(\varvec{0}_2\) is a state of \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\) and \({\text {P}} \left[ \varvec{\phi }_0(\varvec{0}_2)=\varvec{0}_2 \right] =1\), part (i) of this remark implies that Hypothesis 6 does not hold. \(\blacksquare \)
In the next remark, we point out that Theorem 3.3 with \(p=1\) gives back the result of González et al. [13] for critical single-type CBPs.
Remark 3.7
Theorem 3.3 with \(p=1\), i.e. for critical single-type CBPs, yields that \(({\mathcal {Z}}_t^{(n)})_{t\in {\mathbb {R}}_+}\) converges weakly toward \(({\mathcal {Z}}_t)_{t\in {\mathbb {R}}_+}\) as \(n\rightarrow \infty \), where \(({\mathcal {Z}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \(\mathcal {Z}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process, and \(m>0\) and \(\Sigma \) are the offspring mean and variance, respectively (\(\Lambda \) must be \(m^{-1}\) in order that \((Z_k)_{k\in {\mathbb {Z}}_+}\) be critical). It is worth mentioning that González et al. [13, Theorem 3.1] obtained the same limit process \(({\mathcal {Z}}_t)_{t\in {\mathbb {R}}_+}\) assuming the following three hypotheses, namely, (C0): the explosion set has probability one, (C1): \(\varepsilon (z)=m^{-1}z+m^{-1}\alpha '+{\text {o}}(1)\) as \(z\rightarrow \infty \) with some \(\alpha '>0\), and (C2): \(\Gamma (z)={\text {o}}(z)\) as \(z\rightarrow \infty \). Notice that if (C0), (C1) and (C2) hold, then our Hypotheses 2, 3, 5 and 6 are satisfied. Indeed, Hypotheses 2, 3 and 5 follow trivially from (C1) and (C2), and (C0) implies Hypothesis 6 as it was pointed out in part (iv) in Remark 3.1. Concerning Hypotheses 1 and 4, in part (i) of Remark 3.1, we already mentioned the reason why the fourth moments on the offspring and control distributions are assumed in our paper. \(\blacksquare \)
As a consequence of Theorem 3.3 one can deduce the asymptotic behaviour of a critical primitive MBPI previously proved by Ispány and Pap [21, Theorem 3.1]. Let \((\varvec{Y}_k)_{k\in {\mathbb {Z}}_+}\) be a MBPI defined in (2.2) such that the offspring mean matrix \({\textsf{m}}_{\varvec{\xi }} :=({\text {E}}\! \left[ \varvec{\xi }_{0,1,1} \right] , \ldots , {\text {E}}\! \left[ \varvec{\xi }_{0,1,p} \right] ) \in {\mathbb {R}}_+^{p\times p}\) is primitive with Perron–Frobenius eigenvalue 1, and let \(\varvec{u}\) and \(\varvec{v}\) be its right and left Perron–Frobenius eigenvectors, respectively (see Lemma A.1). Then \((\varvec{Y}_k)_{k\in {\mathbb {Z}}_+}\) is a critical primitive MBPI. Let us denote \(\varvec{m}_{\varvec{I}} :={\text {E}}\! \left[ \varvec{I}_1 \right] \in {\mathbb {R}}_+^p\) and \(\varvec{{\textsf{V}}} :=({\text {Var}}\! \left[ \varvec{\xi }_{0,1,1} \right] ,\ldots ,{\text {Var}}\! \left[ \varvec{\xi }_{0,1,p} \right] )\in ({\mathbb {R}}^{p\times p})^p\).
Corollary 3.8
(Ispány and Pap [21, Theorem 3.1]) Let \((\varvec{Y}_k)_{k\in {\mathbb {Z}}_+}\) be a critical primitive p–type branching process with immigration such that \({\text {E}}\! \left[ \Vert \varvec{Y}_0\Vert ^2 \right] <\infty \), \({\text {E}}\! \left[ \Vert \varvec{\xi }_{0,1,i}\Vert ^4 \right] <\infty \), \(i\in \{1,\ldots ,p\}\), and \({\text {E}}\! \left[ \Vert \varvec{I}_{1}\Vert ^4 \right] <\infty \). Then
where \(({\mathcal {Y}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \({\mathcal {Y}}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process.
Proof
To apply Theorem 3.3, let us rewrite \((\varvec{Y}_k)_{k\in {\mathbb {Z}}_+}\) as a CMBP (see part (ii) of Example 2.1). For \(k\in {\mathbb {Z}}_+\), \(j\in {\mathbb {N}}\) and \(i\in \{1,\ldots ,p\}\), we have
Further, for \(k\in {\mathbb {Z}}_+\) and \({\varvec{z}}\in {\mathbb {Z}}_+^{p+1}\), the deterministic control functions \(\varvec{\phi }_{k} (\varvec{z}) = (z_1,\ldots ,z_p,1)^\top \) can be written in the form \(\varvec{\phi }_{k} (\varvec{z}) = \mathsf {\Lambda }\varvec{z} + \varvec{\alpha } + \varvec{g} (\varvec{z})\), with
Then we have \({\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert ^2 \right] = {\text {E}}\! \left[ { 1+}\Vert \varvec{Y}_0\Vert ^2 \right] < \infty ,\)\({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,i}\Vert ^4 \right] = {\text {E}}\! \left[ \Vert \varvec{\xi }_{0,1,i}\Vert ^4 \right] < \infty \), \(i\in \{1,\ldots ,p\}\), \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,p+1}\Vert ^4 \right] ={\text {E}}\! \left[ (1+\Vert \varvec{I}_{1}\Vert ^2)^2 \right] < \infty \). Thus \(\mathsf {\Gamma }(\varvec{z})={\textsf{0}} \in {\mathbb {R}}^{(p+1)\times (p+1)}\) and \(\kappa _i({\varvec{z}})=0\), \({\varvec{z}}\in {\mathbb {Z}}_+^{p+1}\), \(i\in \{1,\ldots ,p\}\), and hence Hypotheses 1, 2, 3 and 4 hold. Since \(\varvec{g} \equiv \varvec{0}_{p+1}\), Hypothesis 6 is not needed, see part (v) of Remark 3.1. Finally, Hypothesis 5 is satisfied with
where \( \mathsf {\Pi }_{\varvec{\xi }} :=\lim _{k\rightarrow \infty }{\textsf{m}}_{\varvec{\xi }}^k\), and \({\tilde{c}}\) and \({\tilde{r}}\) in part (c) of Hypothesis 5 can be chosen as for \( \mathsf {\Pi }_{\varvec{\xi }}\) (since \({\textsf{m}}_{\varvec{\xi }}\) is primitive with Frobenius–Perron eigenvalue 1, and see part (iii) of Lemma A.1). Further, we get
and, consequently, Theorem 3.3 yields the statement. \(\square \)
Motivated by part (i) of Remark 2.2, we derive another corollary of Theorem 3.3.
Corollary 3.9
Let \((\varvec{Z}_k)_{k\in {\mathbb {Z}}_+}\) be a critical controlled p–type branching process given in part (i) of Remark 2.2 such that \({\text {E}}\! \left[ \Vert \varvec{Y}_0\Vert ^2 \right] <\infty \), \({\text {E}}\! \left[ \Vert \varvec{\xi }_{0,1,i}\Vert ^4 \right] <\infty \), \(i\in \{1,\ldots ,p\}\), and \({\text {E}}\! \left[ \Vert \varvec{I}_{1}\Vert ^4 \right] <\infty \). Then
where \(({\mathcal {Z}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \({\mathcal {Z}}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process.
Proof
We follow the same steps as in the proof of Corollary 3.8. We have \({\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert ^2 \right] = {\text {E}}\! \left[ \Vert \varvec{Y}_0\Vert ^2 \right] < \infty ,\) \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,i}\Vert ^4 \right] = {\text {E}}\! \left[ \Vert \varvec{\xi }_{0,1,i}\Vert ^4 \right] < \infty \), \(i\in \{1,\ldots ,p\}\), \( {\text {E}}\! \left[ \Vert \varvec{\phi }_0({\varvec{z}})\Vert ^4 \right] \le 8 {\text {E}}\! \left[ \Vert {\varvec{z}}\Vert ^4 + \Vert \varvec{I}_{1}\Vert ^4 \right] < \infty \). Therefore, Hypothesis 1 is satisfied. Hypothesis 2 also holds because \(\varvec{\varepsilon }(\varvec{z}) = \mathsf {\Lambda }\varvec{z} + \varvec{\alpha } + \varvec{g}(\varvec{z})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), with \(\mathsf {\Lambda } = {\textsf{I}}_p\), \(\varvec{\alpha } = \varvec{m}_{\varvec{I}}\) and \(\varvec{g}\equiv \varvec{0}_{p}\). Due to \({\text {E}}\! \left[ \Vert \varvec{I}_{1}\Vert ^4 \right] <\infty \), it is clear that \(\mathsf {\Gamma }(\varvec{z}) = {\text {Var}}\! \left[ \varvec{I}_1 \right] \in {\mathbb {R}}^{p\times p}\) and \(\kappa _i ({\varvec{z}}) = {\text {E}}\! \left[ (I_{1,i}-m_{\varvec{I},i})^4 \right] < \infty \), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), \(i\in \{1,\ldots ,p\}\), so Hypotheses 3 and 4 are satisfied. Furthermore, Hypothesis 5 holds with
where \(\mathsf {\Pi }_{\varvec{\xi }} :=\lim _{k\rightarrow \infty }{\textsf{m}}_{\varvec{\xi }}^k\), and \({\tilde{c}}\) and \({\tilde{r}}\) in part (c) of Hypothesis 5 can be chosen as for \(\mathsf {\Pi }_{\varvec{\xi }}\) (see part (iii) of Lemma A.1). Indeed, by our assumption, \({\textsf{m}}_{\varvec{\xi }}\) is primitive with Perron–Frobenius eigenvalue equals 1. Finally, Hypothesis 6 is not needed, since \(\varvec{g}\equiv \varvec{0}_p\) (see part (v) of Remark 3.1). Hence the result follows from Theorem 3.3 using also that \(\varvec{v}^\top {\textsf{m}}_{\varvec{\xi }} \varvec{m}_{\varvec{I}} = \varvec{v}^\top \varvec{m}_{\varvec{I}}\), since \(\varvec{v}\) is the left Perron–Frobenius eigenvector for \({\textsf{m}}_{\varvec{\xi }}\). \(\square \)
Next, we establish a scaling limit theorem for a 2SBPI with the promiscuous mating function \(L(x,y) :=x\min \{1,y\}\), \(x,y\in {\mathbb {Z}}_+\). This type of mating function was first considered by Daley [8], and it was deeply studied by Alsmeyer and Rösler [1, 2]. Up to our knowledge, no such result as the following one is available in the literature (for more details, see Remark 3.12).
Corollary 3.10
Let \((F_k,M_k)_{k\in {\mathbb {Z}}_+}\) be a 2SBPI defined in (2.4) with the promiscuous mating function. Assume that \((F_0,M_0)\) is a \({\mathbb {N}}^2\)–valued random vector, \({\text {E}}\! \left[ \Vert (F_0,M_0)\Vert ^2 \right] <\infty \), \({\text {E}}\! \left[ \Vert (f_{0,1},m_{0,1})\Vert ^4 \right] <\infty \), and \({\text {E}}\! \left[ \Vert (F_1^I,M_1^I)\Vert ^4 \right] <\infty \). If \({\text {P}} \left[ M_1^I=0 \right] =0\) and \({\text {E}}\! \left[ f_{0,1} \right] =1\), then
where \(({\mathcal {X}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \({\mathcal {X}}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process.
Proof
Recall that \(L(x,y) = x\min \{1,y\}\), \(x,y\in {\mathbb {Z}}_+\).
First, using that \({\text {P}} \left[ M_1^I=0 \right] =0\) and \((F_0,M_0)\) is \({\mathbb {N}}^2\)–valued, we check that \(L(F_n,M_n) = F_n\) for each \(n\in {\mathbb {Z}}_+\) almost surely. For this, since \(M_n\) is \({\mathbb {Z}}_+\)–valued, taking into account the form of L, it is enough to verify that \({\text {P}} \left[ M_n=0 \right] =0\), \(n\in {\mathbb {N}}\). Indeed, by a conditioning argument with respect to \(U_n\), we can get for each \(n\in {\mathbb {Z}}_+\),
Consequently, \((F_k,M_k)_{k\in {\mathbb {Z}}_+}\) coincides almost surely with the 2-type controlled branching process \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\) given by
for \(k\in {\mathbb {Z}}_+\), \(j\in {\mathbb {N}}\), and \({\varvec{z}}=(z_1,z_2)\in {\mathbb {Z}}_+^2\).
We want to apply Theorem 3.3 for \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\). The control distributions are deterministic, so Hypothesis 3 and 4 are trivially satisfied. In addition, we get \( {\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert ^2 \right] = {\text {E}}\! \left[ \Vert (F_0,M_0)\Vert ^2 \right] < \infty ,\) \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,1}\Vert ^4 \right] = {\text {E}}\! \left[ \Vert (f_{0,1},m_{0,1})\Vert ^4 \right] < \infty \) and \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,2}\Vert ^4 \right] = {\text {E}}\! \left[ \Vert (F_1^I,M_1^I)\Vert ^4 \right] < \infty \), yielding that Hypothesis 1 holds. For each \({\varvec{z}}\in {\mathbb {Z}}_+^2\), we have
and hence Hypothesis 2 holds with the given \(\mathsf {\Lambda }\), \(\varvec{\alpha }\), and \(\varvec{g}\equiv \varvec{0}_2\). Hypothesis 6 is not needed, see part (v) of Remark 3.1. Further, we get
Since \({\text {E}}\! \left[ f_{0,1} \right] =1\) and \(\tilde{{\textsf{m}}}^k = \tilde{{\textsf{m}}}\), \(k\in {\mathbb {N}}\), the conditions of Hypothesis 5 are satisfied with
and \(\tilde{\mathsf {\Pi }} = \lim _{k\rightarrow \infty } \tilde{{\textsf{m}}}^k = { \tilde{{\textsf{m}}}}.\) Therefore, Theorem 3.3 implies that
where \(({\mathcal {Z}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \({\mathcal {Z}}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process. With the process \(({\mathcal {X}}_t)_{t\in {\mathbb {R}}_+}\) given by \({\mathcal {X}}_{t} = (1+{\text {E}}\! \left[ m_{0,1} \right] )^{-1} {\mathcal {Z}}_{t}\), \(t\in {\mathbb {R}}_+\), we obtain the desired result. \(\square \)
Next, we apply Theorem 3.3 for a 2SBPI defined in (2.4) with a so-called self-fertilization mating function defined by \(L({\varvec{z}}):=z_1+z_2\), \({\varvec{z}}=(z_1,z_2)\in {\mathbb {R}}_+^2\), which allows both females and males have partenogenesis (asexual reproduction). In this model, every individual (regardless whether it is a female or male) corresponds to a mating unit and can have female and male offsprings independently of the other individuals. There is a kind of phenomenon in nature as well, e.g., for aphids and for some reptiles. The following result can be considered as another new contribution in the field of Feller-type diffusion approximations of 2SBPIs.
Corollary 3.11
Let \((F_k,M_k)_{k\in {\mathbb {Z}}_+}\) be a 2SBPI defined in (2.4) with the self-fertilization mating function. Assume that \({\text {E}}\! \left[ \Vert (F_0,M_0)\Vert ^2 \right] <\infty \), \({\text {E}}\! \left[ \Vert (f_{0,1},m_{0,1})\Vert ^4 \right] <\infty \), \({\text {E}}\! \left[ \Vert (F_1^I,M_1^I)\Vert ^4 \right] <\infty \), and \({\text {E}}\! \left[ f_{0,1} \right] ,{\text {E}}\! \left[ m_{0,1} \right] \in (0,1)\) are such that \({\text {E}}\! \left[ f_{0,1} \right] + {\text {E}}\! \left[ m_{0,1} \right] = 1\). Then
where \(({\mathcal {X}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \({\mathcal {X}}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process.
Proof
Let us rewrite \((F_k,M_k)_{k\in {\mathbb {Z}}_+}\) as a CMBP \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\) (see part (ii) of Example 2.1). We want to apply Theorem 3.3 for \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\). The control distributions are deterministic, so Hypothesis 3 and 4 are trivially satisfied. In addition, we get \({\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert ^2 \right] = {\text {E}}\! \left[ \Vert (F_0,M_0)\Vert ^2 \right] < \infty \), \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,1}\Vert ^4 \right] = {\text {E}}\! \left[ \Vert (f_{0,1},m_{0,1})\Vert ^4 \right] < \infty \) and \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,2}\Vert ^4 \right] = {\text {E}}\! \left[ \Vert (F_1^I,M_1^I)\Vert ^4 \right] < \infty \), yielding that Hypothesis 1 holds. For each \({\varvec{z}}\in {\mathbb {Z}}_+^2\), we have
and hence Hypothesis 2 holds with the given \(\mathsf {\Lambda }\), \(\varvec{\alpha }\), and \(\varvec{g}\equiv \varvec{0}_2\). Hypothesis 6 is not needed, see part (v) of Remark 3.1. Further, we get
The two eigenvalues of \(\tilde{{\textsf{m}}}\) are 0 and \({\text {E}}\! \left[ f_{0,1} \right] +{\text {E}}\! \left[ m_{0,1} \right] =1\), and consequently the spectral radius of \(\tilde{{\textsf{m}}}\) is 1. Using also that \({\text {E}}\! \left[ f_{0,1} \right] ,{\text {E}}\! \left[ m_{0,1} \right] \in (0,1)\), we have that \(\tilde{{\textsf{m}}}\) is a primitive matrix with Frobenius–Perron eigenvalue 1. Note also that \(\tilde{{\textsf{m}}}^k = \tilde{{\textsf{m}}}\), \(k\in {\mathbb {N}}\), since \({\text {E}}\! \left[ f_{0,1} \right] +{\text {E}}\! \left[ m_{0,1} \right] =1\). Taking into these considerations, one can check that the conditions of Hypothesis 5 are satisfied with
and the constants \({\tilde{c}}\in {\mathbb {R}}_{++}\) and \({\tilde{r}}\in (0,1)\) in part (c) of Hypothesis 5 can be chosen arbitrarily. Therefore, Theorem 3.3 implies that
where \(({\mathcal {X}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \({\mathcal {X}}_0 = 0\), where \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) is a standard Wiener process. This easily yields the assertion. \(\square \)
Remark 3.12
Scaling limits for a class of processes that combine classical asexual Galton–Watson processes and two-sex Galton–Watson branching processes without immigration (introduced by Daley [8]) have been recently studied in Bansaye et al. [5]. However, this new family of processes does not include the class of two-sex Galton–Watson branching processes with immigration as a particular case, so the results in Bansaye et al. [5] do not imply our Corollaries 3.10 and 3.11. Even if we recover two-sex Galton–Watson branching processes without immigration as particular cases of the model (1)-(2) in Bansaye et al. [5] (using their notations, with the special choices \({\mathcal {E}}_{n,p}^{f,N} :=-1\) and \({\mathcal {E}}_{n,p}^{m,N}:=-1\), \(n,p\in {\mathbb {N}}\), which intuitively mean that the individuals (females or males) do not survive in the next generation), then their Theorem 2.1 cannot be applied, because their assumption (A1) does not hold in this case. Furthermore, note that for two-sex Galton–Watson branching processes without immigration, our Corollary 3.10 cannot be applied as well, since the assumption \({\text {P}} \left[ M_1^I=0 \right] =0\) does not hold in this case, however Corollary 3.11 can be applied and the limit process of the scaled two-sex Galton–Watson branching processes without immigration in question is the identically zero process. \(\blacksquare \)
Next, we formulate another corollary of Theorem 3.3, namely, we derive a limit distribution for the relative frequencies of distinct types of individuals. For different models, one can find similar results in Jagers [23, Corollary 1], in Yakovlev and Yanev [34, Proposition 1], [35, Theorem 2], and in Barczy and Pap [7, Corollary 4.1].
Recall that if \(\tilde{{\textsf{m}}}\) is primitive with Perron–Frobenius eigenvalue 1, then \(\tilde{\varvec{u}}\) and \(\tilde{\varvec{v}}\) denote the right and left Perron–Frobenius eigenvectors corresponding to 1, respectively.
Corollary 3.13
Suppose that Hypotheses 1–4 and 6 hold for the CMBP \((\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+}\) given in (2.1). Assume also that the matrix \(\tilde{{\textsf{m}}}\) is primitive with Perron–Frobenius eigenvalue 1, and \(\mathsf {\Lambda } \tilde{\varvec{u}}\in {\mathbb {R}}_{+}^p\). If, in addition, \(\tilde{\varvec{v}}^\top {\textsf{m}}\varvec{\alpha } > 0\), then for all \(t>0\) and \(i,j\in \{1,\ldots ,p\}\), we get
Remark 3.14
The indicator functions \({\mathbbm {1}}_{\{ \varvec{e}_j^\top {\varvec{Z}}_{\lfloor nt\rfloor }\ne 0\}}\) and \({\mathbbm {1}}_{\{ {\varvec{Z}}_{\lfloor nt\rfloor }\ne \varvec{0}_p\} }\) are needed in Corollary 3.13, since it can happen that \({\text {P}} \left[ {\varvec{Z}}_{\lfloor nt\rfloor } = \varvec{0}_p \right] >0\) for some \(t>0\). \(\blacksquare \)
Proof of Corollary 3.13
For all \(t>0\) and \(i,j\in \{1,\ldots ,p\}\), Theorem 3.3 yields that
The function \(g:{\mathbb {R}}^ 2\rightarrow {\mathbb {R}}\) defined by \(g(x,y) :=\frac{x}{y}{\mathbbm {1}}_{\{y \ne 0\}}\), \((x,y)\in {\mathbb {R}}^2\), is continuous on the set \({\mathbb {R}}\times ({\mathbb {R}}{\setminus }\{0\})\), and the distribution of \( \Big (\varvec{e}_i^\top \tilde{\varvec{u}}\,{\mathcal {Z}}_t, \varvec{e}_j^\top \tilde{\varvec{u}}\,{\mathcal {Z}}_t \Big )\) is concentrated on this set, since, by Remark 3.4, \({\text {P}} \left[ {\mathcal {Z}}_t >0 \right] =1\) for all \(t>0\) and \(\varvec{e}_j^\top \tilde{\varvec{u}}>0\). Hence the continuous mapping theorem yields that
thus we obtain the first convergence (using also that the limit is not random, so convergence in distribution yields convergence in probability in this case).
Similarly, for each \(i\in \{1,\ldots ,p\}\), the function \(h:{\mathbb {R}}^ p\rightarrow {\mathbb {R}}\) defined by \( h(x_1,\ldots ,x_p):=\frac{x_i}{\sum _{i=1}^p x_k} {\mathbbm {1}}_{\{\sum _{k=1}^p x_k\ne 0\}}\), \((x_1,\ldots ,x_p)\in {\mathbb {R}}^p\), is continuous on the set \({\mathbb {R}}^ p {\setminus }\{(x_1,\ldots ,x_p)\in {\mathbb {R}}^p: \sum _{k=1}^p x_k =0\}\), and the distribution of \(\tilde{\varvec{u}}{\mathcal {Z}}_t\) is concentrated on this set, since \({\text {P}} \left[ {\mathcal {Z}}_t >0 \right] =1\), \(t>0\), and \(\tilde{\varvec{u}}\in {\mathbb {R}}_{++}^p\) imply that \({\text {P}} \left[ \sum _{k=1}^p \varvec{e}_k^\top \tilde{\varvec{u}} {\mathcal {Z}}_t =0 \right] =0\). Using that \({\mathbbm {1}}_{\{ {\varvec{Z}}_{\lfloor nt\rfloor } \ne \varvec{0}_p\} } = {\mathbbm {1}}_{\{ \sum _{k=1}^p \varvec{e}_k^\top {\varvec{Z}}_{\lfloor nt\rfloor } \ne 0\}}\), the continuous mapping theorem yields that
as \(n\rightarrow \infty \), where we also used that the sum of coordinates of \(\tilde{\varvec{u}}\) is 1 (see part (ii) of Lemma A.1). Thus we obtain the second convergence as well (using again that the limit is not random). \(\square \)
4 Proof of Theorem 3.3
The proof is divided in four steps. In the first three Steps 1, 2 and 3, we introduce some auxiliary stochastic processes, prove their convergence in distribution, and in Step 4, we discuss that how Steps 1–3 yield the assertion. The detailed proofs for Steps 1–3 can be found after Step 4.
4.1 Step 1
From the process \((\varvec{Z}_{k})_{k\in {\mathbb {Z}}_+},\) we define a martingale difference sequence with respect to the filtration \(({\mathcal {F}}_{k})_{k\in {\mathbb {Z}}_+}\) given by
and we also consider the following sequence of random step processes
We will prove that
where \((\varvec{{\mathcal {M}}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value \(\varvec{\mathcal {M}}_0= \varvec{0}_p\), where \((\varvec{{\mathcal {W}}}_t)_{t\in {\mathbb {R}}_+}\) is a p–dimensional standard Wiener process.
4.2 Step 2
We will show that (4.3) implies
where the mappings \(\varvec{\Psi }:{{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\rightarrow {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\) and \(\varvec{\Psi }_n:{{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\rightarrow {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(n\in {\mathbb {N}}\), are given by
for \(\varvec{f}\in {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(t\in {\mathbb {R}}_+\), \(n\in {\mathbb {N}}\).
4.3 Step 3
We will check that \(\varvec{{\mathcal {Z}}}^{(n)} = \varvec{\Psi }_n(\varvec{{\mathcal {M}}}^{(n)}) + \varvec{{\mathcal {V}}}^{(n)}\), \(n\in {\mathbb {N}}\), and
where \((\varvec{{\mathcal {V}}}_t^{(n)})_{t\in {\mathbb {R}}_+},\) \(n\in {\mathbb {N}},\) is another sequence of random step processes defined by
4.4 Step 4
As a consequence of Steps 1–3, one can easily derive (3.1). Namely, let us define
where we used that \(\tilde{\varvec{v}}^\top \tilde{\mathsf {\Pi }} = \tilde{\varvec{v}}^\top \tilde{\varvec{u}} \tilde{\varvec{v}}^\top = \tilde{\varvec{v}}^\top \) (see Hypothesis 5).
Then \(\varvec{{\mathcal {Z}}} = {\mathcal {Y}} \tilde{\varvec{u}}\), since
Further, by Itô’s formula, we can show that \(({\mathcal {Y}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE (3.2) with initial value 0, thus (3.1) follows and the proof concludes. Indeed, the SDE (4.4) can also be written in the form
since \(\mathsf {\Lambda }\tilde{\varvec{u}}\in {\mathbb {R}}_+^{p}\) (see Hypothesis 5) implies that
Therefore, \(({\mathcal {Y}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of the SDE
with initial value 0, since, by (4.11), \(\,\textrm{d}{\mathcal {Y}}_t = \tilde{\varvec{v}}^\top {\textsf{m}}\varvec{\alpha } \,\textrm{d}t + \tilde{\varvec{v}}^\top \,\textrm{d}\varvec{{\mathcal {M}}}_t\), \(t\in {\mathbb {R}}_+\), and \((\varvec{{\mathcal {M}}}_t)_{t\in {\mathbb {R}}_+}\) is the pathwise unique strong solution of (4.12) with initial value \(\varvec{0}_p\).
Suppose that \(\tilde{\varvec{v}}^\top ((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }})\tilde{\varvec{v}} \ne 0\), then the process \(({\mathcal {W}}_t)_{t\in {\mathbb {R}}_+}\) given by
is a well-defined one-dimensional standard Wiener process, since \(\mathsf {\Lambda }\tilde{\varvec{u}}\in {\mathbb {R}}_+^p\), yielding that \((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }}\) is a positive semi-definite matrix. In such a case, the SDE (4.13) can be written as
which coincides with the SDE (3.2), as desired. Otherwise, it is trivial because if \(\tilde{\varvec{v}}^\top ((\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }})\tilde{\varvec{v}} = 0\), then \(\Vert \tilde{\varvec{v}}^\top \sqrt{(\mathsf {\Lambda }\tilde{\varvec{u}})\odot \varvec{\mathsf {\Sigma }}}\Vert = 0\), yielding that SDEs (3.2) and (4.13) correspond to the ODE \(\,\textrm{d}x(t) = \tilde{\varvec{v}} {\textsf{m}}\varvec{\alpha }\,\textrm{d}t\), \(t\in {\mathbb {R}}_+\), with initial value 0, and this ODE has the identically zero solution.
Finally, notice that the statement in Remark 3.5 can be derived using the equality \(\varvec{{\mathcal {Z}}} = {\mathcal {Y}} \tilde{\varvec{u}}\), (4.8), (4.10), (4.11), and (4.12). Namely, we have
Since \(\mathsf {\Lambda }\tilde{\varvec{u}}\in {\mathbb {R}}_+^p\), we have
yielding (3.3), as desired.
4.5 Proof of Step 1
We need to show that (4.3) holds. To this end, we apply Theorem A.4 with \(\varvec{b}(t,\varvec{x})= \varvec{0}_p\in {\mathbb {R}}^p\), \({\textsf{C}} (t,\varvec{x}) = \sqrt{(\mathsf {\Lambda }{{\tilde{\Pi }}}(\varvec{x}+t{\textsf{m}}\varvec{\alpha }))^+ \odot \varvec{\mathsf {\Sigma }}}\in {\mathbb {R}}^{p\times p}\), \((t,\varvec{x})\in {\mathbb {R}}_+\times {\mathbb {R}}^p\), \(\varvec{\eta }\) is the Dirac measure concentrated at \(\varvec{0}_p\), \( \varvec{{\mathcal {U}}} = \varvec{{\mathcal {M}}}\), \({\mathcal {F}}_k^{(n)} = {\mathcal {F}}_{k}\), \( \varvec{U}_{0}^{(n)} = n^{-1} \varvec{Z}_{0}\) and \( \varvec{U}_{k}^{(n)} = n^{-1} \varvec{M}_{k}\), \(k,n\in {\mathbb {N}}\).
First, we check that the SDE (4.4) has a pathwise unique strong solution for all \({\mathbb {R}}^p\)–valued initial values. As seen in Step 4, we can rewrite the SDE (4.4) in the form (4.12).
If \((\varvec{{\mathcal {M}}}_t^{(\varvec{y}_0)})_{t\in {\mathbb {R}}_+}\) is a strong solution of (4.12) with initial value \(\varvec{{\mathcal {M}}}_0^{(\varvec{y}_0)} = \varvec{y}_0 \in {\mathbb {R}}^p\), then we check that the process \((({\mathcal {P}}_t^{(\varvec{y}_0)},\varvec{{\mathcal {Q}}}_t^{(\varvec{y}_0)}))_{t\in {\mathbb {R}}_+}\), defined by
is a strong solution of the SDE
with initial value
Notice that the first SDE in (4.15) readily follows from (4.12), and the second one can be checked as follows
Conversely, if \((({\mathcal {P}}_t^{(p_0,\varvec{q}_0)},\varvec{{\mathcal {Q}}}_t^{(p_0,\varvec{q}_0)}))_{t\in {\mathbb {R}}_+}\) is a strong solution of the SDE (4.15) with initial value \(({\mathcal {P}}_0^{(p_0,\varvec{q}_0)},\varvec{{\mathcal {Q}}}_0^{(p_0,\varvec{q}_0)}) = (p_0, \varvec{q}_0)\in {\mathbb {R}}\times {\mathbb {R}}^p\), again by Itô’s formula, the process
is a strong solution of (4.12) with initial value \(\varvec{{\mathcal {M}}}_0^{(p_0,\varvec{q}_0)}=p_0 \tilde{\varvec{u}}+\varvec{q}_0\).
Now, let us see that the map \({\mathbb {R}}^p \ni \varvec{y} \mapsto (\tilde{\varvec{v}}^\top \varvec{y},({\textsf{I}}_p - \tilde{\mathsf {\Pi }})\varvec{y} )\) is a bijection between \({\mathbb {R}}^p\) and \({\mathbb {R}}\times {\text {Null}} (\tilde{\varvec{v}}^\top )\). Let \(\varvec{x}_0, \varvec{y}_0 \in {\mathbb {R}}^p\) be two vectors with the same image under the map in question. Then the injectivity (i.e. \(\varvec{x}_0 = \varvec{y}_0\)) follows from
where we have used Hypothesis 5. Further, for all \( (p_0, \varvec{q}_0)\in {\mathbb {R}}\times {\text {Null}} (\tilde{\varvec{v}}^\top )\), the vector \(\varvec{y}_0 :=p_0 \tilde{\varvec{u}} + \varvec{q}_0\) is an element of the preimage of \((p_0, \varvec{q}_0)\), since, using that \(\tilde{\varvec{v}}^\top \tilde{\varvec{u}}=1\), we have \({\varvec{v}}^\top (p_0 \tilde{\varvec{u}} + \varvec{q}_0)= p_0+0=p_0\) and
Hence the map in question is surjective.
All in all, it is enough to prove that SDE (4.15) has a pathwise unique strong solution for all initial values in \({\mathbb {R}}\times {\mathbb {R}}^p\), in particular for all initial values in \({\mathbb {R}}\times {\text {Null}} (\tilde{\varvec{v}}^\top )\), which is shown below. The pathwise uniqueness for the first SDE in (4.15) (see also the SDE (4.13)) is clear by the discussion in Step 4 and Remark 3.4. We now proceed with the second SDE in (4.15). One can easily get that its pathwise unique strong solution with initial value \(\varvec{Q}_t^{(p_0,\varvec{q}_0)} = \varvec{q}_0\) takes the form
and not only for all \((p_0,\varvec{q}_0)\in {\mathbb {R}}\times {\text {Null}}(\tilde{\varvec{v}}^\top )\) but also for all \((p_0,\varvec{q}_0)\in {\mathbb {R}}\times {\mathbb {R}}^p\).
In what follows, we check that the assumptions of Theorem A.4 hold with our previous choices.
4.5.1 Step 1/A
For each \(n,k\in {\mathbb {N}}\), we get that \({\text {E}}\! \left[ \Vert n^{-1}\varvec{M}_k\Vert ^2 \right] <\infty \), since
where we used the variance decomposition formula and, at the last step, equation (2.9). Further, \(n^{-1}{\varvec{Z}}_0{\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}} \varvec{0}_p\) as \(n\rightarrow \infty \).
4.5.2 Step 1/B
For all \(T>0,\) the condition (i) of Theorem A.4 is trivially satisfied, since \((\varvec{M}_{k})_{k\in {\mathbb {N}}}\) is a sequence of martingale differences yielding that \({\text {E}} \left[ \varvec{M}_{k} \;\Big |\; {\mathcal {F}}_{k-1} \right] = \varvec{0}_p\), \(k\in {\mathbb {N}}\).
The conditions (ii) and (iii) of Theorem A.4 can be written in the following forms
where the process \((\varvec{{\mathcal {R}}}_t^{(n)})_{t\in {\mathbb {R}}_+}\) is defined by
4.5.3 Step 1/C
We show (4.16). Taking into account \({\text {E}} \left[ \varvec{M}_{k}(\varvec{M}_{k})^\top \;\Big |\; {\mathcal {F}}_{k-1} \right] = {\text {Var}} \left[ \varvec{Z}_{k} \;\Big |\; {\mathcal {F}}_{k-1} \right] \), \(k\in {\mathbb {N}}\), equation (2.9) and Hypothesis 2, we can obtain
Using again Hypothesis 2 together with equation (2.8), the martingale differences in (4.1) can be written as
With this expression and equation (4.2), we have
because \(\tilde{\mathsf {\Pi }}\tilde{{\textsf{m}}} = (\lim _{l\rightarrow \infty } \tilde{{\textsf{m}}}^l) \tilde{{\textsf{m}}} = \lim _{l\rightarrow \infty } \tilde{{\textsf{m}}}^{l+1} = \tilde{\mathsf {\Pi }}\) by Hypothesis 5.
Since \(\mathsf {\Lambda } \tilde{\varvec{u}}\in {\mathbb {R}}_+^p\), we have \(\mathsf {\Lambda }\tilde{\mathsf {\Pi }} = (\mathsf {\Lambda }\tilde{\varvec{u}})\tilde{\varvec{v}}^\top \in {\mathbb {R}}_+^{p\times p}\) and, consequently, we get that
where the bounds \(\varvec{{\mathcal {B}}}_{-, t}^{(n)}\) and \(\varvec{{\mathcal {B}}}_{+, t}^{(n)}\) are defined by
and recall that \(\vert {\varvec{z}}\vert = (\vert z_1\vert , \ldots , \vert z_p\vert )^\top \in {\mathbb {R}}_+^p\) for any \({\varvec{z}}=(z_1,\ldots ,z_p)^\top \in {\mathbb {R}}^p\). Hence we get
Next, we check that, for all \(t\in {\mathbb {R}}_+\),
To verify (4.21), we split the integral as follows
and we take into account
for \(j\in \{0,\ldots ,\lfloor nt\rfloor -1\}\), and
In what follows, we will use that for all \({\textsf{A}}\in {\mathbb {R}}^{p\times p}\) and \(\varvec{x},\varvec{y}\in {\mathbb {R}}^p\), \(\varvec{y}_0\in {\mathbb {R}}_+^p\) with \(\varvec{y} - \varvec{y}_0 \preceq \varvec{x} \preceq \varvec{y} + \varvec{y}_0\), we have
where we recall \(\Vert \varvec{\mathsf {\Sigma }}\Vert = \sum _{i=1}^p \Vert \mathsf {\Sigma }_i\Vert \). Hence, from equations (4.18), (4.20), (4.21) and (4.22), we obtain
As a consequence of Hypothesis 2, the function \(\varvec{g}\) is bounded, and \(\Vert | {\textsf{A}}{\varvec{z}}| \Vert = \Vert {\textsf{A}}{\varvec{z}}\Vert \le \Vert {\textsf{A}} \Vert \Vert {\varvec{z}}\Vert \) for all \({\textsf{A}}\in {\mathbb {R}}^{p\times p}\) and \({\varvec{z}}\in {\mathbb {R}}^p\). Thus, in order to show (4.16), we can easily see that it is enough to prove that
as \(n\rightarrow \infty \) for all \(T>0\). Indeed, for all \(T>0\), the supremum on [0, T] of the fifth and eight terms on the right hand side of (4.23) tend to 0 as \(n\rightarrow \infty \) in probability, since
as \(n\rightarrow \infty \). We remark that, later, at the end of the proof of Step 3, it will turn out that \(n^{-1} \sum _{k=1}^{\lfloor nT\rfloor } {\text {E}}\! \left[ \Vert \varvec{g} (\varvec{Z}_{k-1})\Vert \right] \rightarrow 0\) as \(n\rightarrow \infty \) for all \(T>0\) holds as well.
Let us start proving (4.24) and (4.25). From equation (4.19), the following expression can be obtained recursively,
and, using it with \(\tilde{\mathsf {\Pi }}\tilde{{\textsf{m}}}^k = \tilde{\mathsf {\Pi }}\), \({k\in {\mathbb {Z}}_+}\), we have
Let \({\tilde{c}}\in {\mathbb {R}}_{++}\) and \({\tilde{r}}\in (0,1)\) be the constants given in part (c) of Hypothesis 5 such that \(\Vert \tilde{{\textsf{m}}}^k-\tilde{\mathsf {\Pi }}\Vert \le {\tilde{c}}{\tilde{r}}^k\) for each \(k\in {\mathbb {N}}\), and let us introduce the constants
Then, using (4.28), the following inequalities hold
and
for all \(n\in N\) and \(t\in R_+\), where at the last inequality we also used that
Hence, to get (4.24) and (4.25) it is enough to show
The first convergence in (4.30) holds trivially, since, in fact, \(n^{-2}\Vert {\varvec{Z}}_0\Vert \rightarrow 0\) as \(n\rightarrow \infty \) almost surely.
The second convergence in (4.30) follows from (A.3), since, by Jensen inequality, \({\text {E}}\! \left[ \Vert \varvec{M}_{k}\Vert \right] \le \sqrt{{\text {E}}\! \left[ \Vert \varvec{M}_{k}\Vert ^2 \right] } = {\text {O}}(k^{1/2})\) as \(k\rightarrow \infty \), and hence, for all \(T>0\), we get that
Consequently, \(n^{-2} \sum _{k=1}^{\lfloor nT \rfloor } \Vert \varvec{M}_{k}\Vert \) converges to 0 in \(L^1\) as \(n\rightarrow \infty \).
Next, we check convergence (4.26). It is enough to prove that for all \(T>0\), we have that
By Hypothesis 3, for all \(\epsilon >0\) there exists \(K(\epsilon )>0\) such that \(\Vert \mathsf {\Gamma }(\varvec{z})\Vert < \epsilon \Vert \varvec{z}\Vert \) for each \(\varvec{z}\in {\mathbb {Z}}_+^p\) with \(\Vert \varvec{z}\Vert >K(\epsilon )\). Since
for all \(T>0\), \(\epsilon >0\) and \(n\in {\mathbb {N}}\), we have
where \( \max _{\{\varvec{z}\in {\mathbb {Z}}_+^p: \Vert \varvec{z}\Vert \le K(\varepsilon ) \}} \Vert \mathsf {\Gamma }(\varvec{z})\Vert < \infty \). Using part (i) of Lemma A.3, we get \(\frac{1}{n^2}\sum _{k=1}^{\lfloor nT \rfloor } {\text {E}}\! \left[ \Vert \varvec{Z}_{k-1}\Vert \right] = {\text {O}}(1)\) as \(n\rightarrow \infty \). Hence for all \(T>0\) and \(\epsilon >0\), we get
which yields (4.31), as desired.
Similarly, we prove (4.27) by checking that for all \(T>0\), we get that
By Hypothesis 2, for all \(\epsilon >0\) there exists \(N(\epsilon )>0\) such that \(\Vert \varvec{g}(\varvec{z})\Vert < \epsilon \) for all \(\varvec{z}\in {\mathbb {Z}}_+^p\) with \(\Vert \varvec{z}\Vert >N(\epsilon )\). Then, using the notation \(C_{\varvec{g}}\) introduced in (4.29), we get
and then using Hypothesis 6 we can derive that for all \(T>0\) and \(\epsilon >0\),
for sufficiently large \(n\in {\mathbb {N}}\), where
As a consequence, for all \(T>0\) and \(\epsilon >0\), we have
Hence, by taking the limit as \(\epsilon \downarrow 0\), we obtain (4.32), as desired.
4.5.4 Step 1/D
We show (4.17). For this, it is enough to verify that for all \(T>0\) and \(\theta >0\), we have
Using that
the convergence (4.35) follows from equation (A.4). Consequently, we finished the proof of (4.3).
4.6 Proof of Step 2
We want to apply Theorem A.5 to prove the convergence (4.5). We need to check that the assumptions of Theorem A.5 are satisfied. The continuity of \(\varvec{\Psi }\) can be straightforwardly deduced (following also from Jacod and Shiryaev [22, Chapter VI, Proposition 1.23]), so the measurability of \(\varvec{\Psi }\) holds. For the sequence \((\varvec{\Psi }_n)_{n\in {\mathbb {N}}}\) and \(N\in {\mathbb {N}}\), let us introduce the localized sequence \((\varvec{\Psi }_n^N)_{n\in {\mathbb {N}}}\) given by \(\varvec{\Psi }_n^N: {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\rightarrow {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(\varvec{\Psi }_n^N (\varvec{f}) (t) :=\varvec{\Psi }_n (\varvec{f}) (t \wedge N)\) for \(\varvec{f}\in {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(t\in {\mathbb {R}}_+\), \(n\in {\mathbb {N}}\). Since, for each \(n\in {\mathbb {N}}\) and \(\varvec{f}\in {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(\varvec{\Psi }_n^N(\varvec{f}) \rightarrow \varvec{\Psi }_n(\varvec{f})\) as \(N\rightarrow \infty \), it is enough to check the measurability of \(\varvec{\Psi }_n^N\), \(n\in {\mathbb {N}}\) (see Barczy et al. [6, page 603] for the details). Briefly, for each \(n\in {\mathbb {N}}\), one can introduce the auxiliary measurable mappings \(\varvec{\Psi }_n^{N,1}:{{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\rightarrow ({\mathbb {R}}^p)^{nN+1}\) and \(\varvec{\Psi }_n^{N,2}:({\mathbb {R}}^p)^{nN+1}\rightarrow {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\) defined by
for \(\varvec{f}\in {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), and
for \((\varvec{x}_0, \varvec{x}_1,\ldots ,\varvec{x}_{nN})\in {({\mathbb {R}}^p)^{nN+1}}\) and \(t\in {\mathbb {R}}_+\). Then we have that \(\varvec{\Psi }_n^N = \varvec{\Psi }_n^{N,2} \circ \varvec{\Psi }_n^{N,1}\), \(n,N\in {\mathbb {N}}\).
Consider the set \({{\textbf {C}}}=\{\varvec{f}\in {{\textbf {C}}}({\mathbb {R}}_+,{\mathbb {R}}^p):\tilde{\mathsf {\Pi }}\varvec{f}(0)=\varvec{f}(0)\}\). Let us check now that \({{\textbf {C}}}\) is measurable and \({\text {P}} \left[ \varvec{{\mathcal {M}}}\in {{\textbf {C}}} \right] =1.\) The projection \({{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\ni \varvec{f} \mapsto \varvec{\pi }_0 (\varvec{f}) :=\varvec{f}(0)\in {\mathbb {R}}^p\) and the mapping \({\mathbb {R}}^p \ni \varvec{x} \mapsto ({\textsf{I}}_p - \tilde{\mathsf {\Pi }})\varvec{x}\in {\mathbb {R}}^p\) are measurable. Since \({{\textbf {C}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\) is a measurable set (see Ethier and Kurtz [9, Problem 3.11.25]), we have \({{\textbf {C}}} = {{\textbf {C}}}({\mathbb {R}}_+,{\mathbb {R}}^p) \cap \varvec{\pi }_0^{-1} ({\text {Null}}({\textsf{I}}_p-\tilde{\mathsf {\Pi }}))\in {\mathcal {D}}_\infty ({\mathbb {R}}_+,{\mathbb {R}}^p)\). Furthermore, in Step 1, we proved that \(\varvec{{\mathcal {M}}}\) is the pathwise unique strong solution of SDE (4.4) with initial value \(\varvec{0}_p\). Hence it has continuous sample paths almost surely, and \(\tilde{\mathsf {\Pi }}\varvec{{\mathcal {M}}}_0= \tilde{\mathsf {\Pi }}\varvec{0}_p=\varvec{0}_p=\varvec{{\mathcal {M}}}_0\). Finally, the procedure to show that \({{\textbf {C}}}\subset {{\textbf {C}}}_{\varvec{\Phi },(\varvec{\Phi }_n)_{n\in {\mathbb {N}}}}\) follows the same steps as on page 736 in Ispány and Pap [21].
4.7 Proof of Step 3
First, we check that \(\varvec{{\mathcal {Z}}}^{(n)} = \varvec{\Psi }_n(\varvec{{\mathcal {M}}}^{(n)}) + \varvec{{\mathcal {V}}}^{(n)}\), \(n\in {\mathbb {N}}\). Using Hypothesis 2 and equations (2.8), (4.1), (4.7), and (4.9), we get
as desired.
Next, we show (4.8). Taking into account (4.5) and Jacod and Shiryaev [22, Chapter VI, Lemma 3.31], in order to prove (4.8), it is enough to see that for all \(T>0\) and \(\delta >0\), we get
This can be checked as follows. For all \(T>0\) and \(\delta >0\), by Markov’s inequality and (4.29), we get
for \(n\in {\mathbb {N}}\), and then we proceed as in the proof of (4.27). For all \(\epsilon >0\), there exists \(N(\epsilon )>0\) such that \(\Vert \varvec{g}(\varvec{z})\Vert < \epsilon \) for each \(\varvec{z}\in {\mathbb {Z}}_+^p\) with \(\Vert \varvec{z}\Vert >N(\epsilon )\). So, using Hypothesis 6 and (4.33), we get for all \(T>0\) and \(\epsilon >0\)
for sufficiently large \(n\in {\mathbb {N}}\). By taking the limit as \(n\rightarrow \infty \), we have that for all \(T>0\) and \(\epsilon >0\),
and then taking the limit as \(\epsilon \downarrow 0\), we obtain that \(n^{-1} \sum _{j=1}^{\lfloor nT\rfloor } {\text {E}}\! \left[ \Vert \varvec{g} (\varvec{Z}_{j-1})\Vert \right] \rightarrow 0\) as \(n\rightarrow \infty \), yielding (4.36), as desired. \(\square \)
Data availability
The authors declare that they did not use any data set in their paper.
References
Alsmeyer, G., Rösler, U.: The bisexual Galton-Watson process with promiscuous mating: extinction probabilities in the supercritical case. Ann. Appl. Probab. 6(3), 922–939 (1996)
Alsmeyer, G., Rösler, U.: Asexual versus promiscuous bisexual Galton-Watson processes: The extinction probability ratio. Ann. Appl. Probab. 12(1), 125–142 (2002)
Asmussen, S.: On some two-sex population models. Ann. Probab. 8(4), 727–744 (1980)
Athreya, K.B., Ney, P.E.: Branching Processes. 1st ed. In: Grundlehren der mathematischen Wissenschaften, 1st edn. Springer, Berlin, Heidelberg 1972
Bansaye, V., Caballero, M., Méléard, S., San Martín, J.: Scaling limits of bisexual Galton-Watson processes. Stochastics 95(5), 749–784 (2023)
Barczy, M., Ispány, M., Pap, G.: Asymptotic behavior of unstable INAR(p) processes. Stoch. Process. Appl. 121(3), 583–608 (2011)
Barczy, M., Pap, G.: Asymptotic behavior of critical, irreducible multi-type continuous state and continuous time branching processes with immigration. Stoch. Dyn. 16, 04 (2016). Article 1650008
Daley, D.J.: Extinction conditions for certain bisexual Galton-Watson branching processes. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 9, 315–322 (1968)
Ethier, S.N., Kurtz, T.G.: Markov Processes: Characterization and Convergence. In: Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. Wiley, New York (1986)
Feller, W.: Diffusion processes in genetics. In: Neyman, K. (ed.) Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability, vol. 2 of Berkeley Symposium on Mathematical Statistics and Probability, pp. 227–246. University of California Press, Berlin, Heidelberg (1951)
González, M., del Puerto, I.: Weighted conditional least squares estimation in controlled multitype branching processes. In: González Velasco, M., del Puerto, I., Martínez, R., Molina, M., Mota, M., Ramos, A. (eds.) Workshop on Branching Processes and Their Applications, pp. 147–155. Springer, Berlin Heidelberg (2010)
González, M., del Puerto, I., Yanev, G.: Controlled branching processes. ISTE, Wiley, London (2018)
González, M., Martín-Chávez, P., del Puerto, I.: Diffusion approximation of controlled branching processes using limit theorems for random step processes. Stoch. Model. 39(1), 232–248 (2023)
González, M., Martínez, R., Mota, M.: On the unlimited growth of a class of homogeneous multitype Markov chains. Bernoulli 11(3), 559–570 (2005)
González, M., Minuesa, C., del Puerto, I., Vidyashankar, A.N.: Robust Estimation in Controlled Branching Processes: Bayesian Estimators via Disparities. Bayesian Anal. 16(3), 1009–1037 (2021)
González, M., Martínez, R., Mota, M.: Rates of growth in a class of homogeneous multidimensional Markov chains. J. Appl. Probab. 43(1), 159–174 (2006)
González, M., Molina, M., del Puerto, I.: Asymptotic behaviour of critical controlled branching processes with random control functions. J. Appl. Probab. 42(2), 463–477 (2005)
Horn, R.A., Johnson, C.R.: Matrix Analysis, 2nd edn. Cambridge University Press (1985)
Ikeda, N., Watanabe, S.: Stochastic Differential Equations and Diffusion Processes, 2nd edn. North-Holland Publishing Co., Amsterdam; Kodansha, Ltd., Tokyo (1989)
Ispány, M., Pap, G.: A note on weak convergence of random step processes. Acta Math. Hung. 126, 381–395 (2010)
Ispány, M., Pap, G.: Asymptotic behavior of critical primitive multi-type branching processes with immigration. Stoch. Anal. Appl. 32(5), 727–741 (2014)
Jacod, J., Shiryaev, A.N.: Limit Theorems for Stochastic Processes, 2nd edn. Grundlehren der mathematischen Wissenschaften. Springer, Berlin, Heidelberg (2003)
Jagers, P.: The proportions of individuals of different kinds in two-type populations. a branching process problem arising in biology. J. Appl. Probab. 2, 249–260 (1969)
Kallenberg, O.: Foundations of modern probability, 1st edn. In: Probability and Its Applications. Springer, New York, NY (1997)
Kaplan, N.: The multitype Galton-Watson process with immigration. Ann. Probab. 1(6), 947–953 (1973)
Kimmel, M., Axelrod, D.: Branching processes in biology, 2nd edn. Springer, New York, NY (2015)
Mwasunda, J.A., Irunde, J.I., Kajunguri, D., Kuznetsov, D.: Outbreak or extinction of bovine cysticercosis and human taeniasis: A stochastic modelling approach. Appl. Math. Model. 106, 73–85 (2022)
Ráth, B.: Time evolution of dense multigraph limits under edge-conservative preferential attachment dynamics. Random Struct. Algorithms 41(3), 365–390 (2012)
Revuz, D., Yor, M.: Continuous Martingales and Brownian Motion, 3rd edn. Grundlehren der mathematischen Wissenschaften. Springer, Berlin, Heidelberg (1998)
Sagitov, S., StÅhlberg, A.: Counting unique molecular identifiers in sequencing using a multi-type branching process with immigration. J. Theor. Biol. 558, 111365 (2023)
Sevast’yanov, B.A., Zubkov, A.M.: Controlled branching processes. Theory Prob. Appl. 19(1), 14–24 (1974)
Sriram, T., Bhattacharya, A., González, M., Martínez, R., del Puerto, I.: Estimation of the offspring mean in a controlled branching process with a random control function. Stoch. Process. Appl. 117(7), 928–946 (2007)
Wei, C.Z., Winnicki, J.: Some asymptotic results for the branching process with immigration. Stoch. Process. Appl. 31(2), 261–282 (1989)
Yakovlev, A.Y., Yanev, N.M.: Relative frequencies in multitype branching processes. Ann. Appl. Probab. 19(1), 1–14 (2009)
Yakovlev, A.Y., Yanev, N.M.: Limiting distributions for multitype branching processes. Stoch. Anal. Appl. 28(6), 1040–1060 (2010)
Yamada, T., Watanabe, S.: On the uniqueness of solutions of stochastic differential equations. J. Math. Kyoto Univ. 11(1), 155–167 (1971)
Yanev, N.M.: Stochastic models of cell proliferation kinetics based on branching processes. In: Almudevar, A., Oakes, D., Hall, J. (eds.) Stat. Model. Biol. Syst. Memory Andrei Yakovlev, pp. 3–20. Springer International Publishing (2020)
Acknowledgements
We would like to thank the referee for her/his comments that helped us improve the paper.
Funding
Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. Mátyás Barczy was supported by the project TKP2021-NVA-09. Project no. TKP2021-NVA-09 has been implemented with the support provided by the Ministry of Innovation and Technology of Hungary from the National Research, Development and Innovation Fund, financed under the TKP2021-NVA funding scheme. Miguel González, Pedro Martín-Chávez and Inés del Puerto are supported by grant PID2019-108211GB-I00 funded by MICIU/AEI/10.13039/501100011033. Pedro Martín-Chávez is also grateful to the Spanish Ministry of Universities for support from a predoctoral fellowship Grant no. FPU20/06588.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no Conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
The reader can consult Horn and Johnson [18] for the following known facts about primitive non-negative matrices (see Definition 8.5.0 and Theorems 8.2.8, 8.5.1 and 8.5.2 in [18]). A primitive matrix \({\textsf{A}}\in {\mathbb {R}}_+^{p\times p}\) is an irreducible matrix with only one eigenvalue of maximum modulus (the so-called Perron–Frobenius eigenvalue), or equivalently, \({\textsf{A}}\in {\mathbb {R}}_{+}^{p\times p}\) is primitive if and only if there exists \(n\in {\mathbb {N}}\) such that \({\textsf{A}}^n\in {\mathbb {R}}_{++}^{p\times p}.\) By the Perron–Frobenius theorem, we have the following lemma.
Lemma A.1
Let \({\textsf{A}}\in {\mathbb {R}}_{+}^{p\times p}\) be a primitive matrix and let \(\rho \) be its Perron–Frobenius eigenvalue. The following statements hold:
-
(i)
\(\rho \in {\mathbb {R}}_{++},\) its algebraic and geometric multiplicity are equal to 1, and the absolute values of the other eigenvalues of \({\textsf{A}}\) are less than \(\rho \).
-
(ii)
There exists a unique right eigenvector \(\varvec{u}\in {\mathbb {R}}_{++}^p\) and a unique left eigenvector \(\varvec{v}\in {\mathbb {R}}_{++}^p\) corresponding to \(\rho \) such that the sum of the coordinates of \(\varvec{u}\) is 1 and \(\varvec{v}^\top \varvec{u} = 1\). One calls \(\varvec{u}\) and \(\varvec{v}\) the right and left Perron–Frobenius eigenvector, respectively.
-
(iii)
If \(\rho =1,\) then \(\lim _{k\rightarrow \infty } {\textsf{A}}^k = \mathsf {\Pi }\) and there exist \(c\in {\mathbb {R}}_{++}\) and \(r\in (0,1)\) such that \(\Vert {\textsf{A}}^k-\mathsf {\Pi }\Vert \le cr^k\) for each \(k\in {\mathbb {N}}\), where \(\mathsf {\Pi } :=\varvec{u}\varvec{v}^\top \in {\mathbb {R}}_{++}^{p\times p}\).
Next, we will investigate the asymptotic behaviour of the first and second moments of the norm \(\Vert {\varvec{Z}}_k\Vert \) for a critical CMBP \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\). We will also study the same question for the second and fourth moments of \(\Vert \varvec{M}_k\Vert \), where \((\varvec{M}_k)_{k\in {\mathbb {N}}}\) is the martingale difference sequence, given in (4.1), built from \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\). But first, we need the following auxiliary result, which is easy to prove.
Lemma A.2
Let A, \(A_n\), \(n\in {\mathbb {N}}\), be independent and identically distributed \({\mathbb {Z}}_+\)–valued random variables with zero mean. If B is a \({\mathbb {Z}}_+\)–valued random variable independent of A and \(A_n\), \(n\in {\mathbb {N}}\), then
where \(\Sigma _A :={\text {Var}}\! \left[ A \right] \), \(\zeta _A :={\text {E}}\! \left[ A^4 \right] \), \(\mu _B :={\text {E}}\! \left[ B \right] \), and \(\Gamma _B :={\text {Var}}\! \left[ B \right] \).
Lemma A.3
Let \((\varvec{Z}_k)_{k\in {\mathbb {Z}}_+}\) be a controlled p–type branching process given in (2.1) such that \({\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert \right] \), \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,i}\Vert \right] \) and \({\text {E}}\! \left[ \Vert \varvec{\phi }_{0}(\varvec{z})\Vert \right] \) are finite for \(i\in \{1,\ldots ,p\}\) and \({\varvec{z}}\in {\mathbb {Z}}_+^p\). Assume that \(\varvec{\varepsilon }(\varvec{z}) = \mathsf {\Lambda }\varvec{z} + \varvec{h}(\varvec{z})\), \( {\varvec{z}}\in {\mathbb {Z}}_+^p\), where \(\mathsf {\Lambda }\in {\mathbb {R}}^{p\times p}\) and \(\varvec{h}:{\mathbb {Z}}_+^p\rightarrow {\mathbb {R}}^p\) satisfies \(\Vert \varvec{h}(\varvec{z})\Vert = {\text {O}}(1)\) as \(\Vert \varvec{z}\Vert \rightarrow \infty \). Further, suppose that Hypothesis 5 holds.
-
(i)
Then we have
$$\begin{aligned} {\text {E}}\! \left[ \Vert \varvec{Z}_{k}\Vert \right] = {\text {O}}(k) \qquad \text {as}\,\, k\rightarrow \infty . \end{aligned}$$(A.1) -
(ii)
If, in addition, \({\text {E}}\! \left[ \Vert {\varvec{Z}}_0\Vert ^2 \right] \), \({\text {E}}\! \left[ \Vert \varvec{X}_{0,1,i}\Vert ^2 \right] \) and \({\text {E}}\! \left[ \Vert \varvec{\phi }_{0}(\varvec{z})\Vert ^2 \right] \) are finite for \(i\in \{1,\ldots ,p\}\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), and \(\Vert \mathsf {\Gamma }(\varvec{z})\Vert = {\text {O}}(\Vert \varvec{z}\Vert )\) as \(\Vert {\varvec{z}}\Vert \rightarrow \infty \), then we have (A.1) and
$$\begin{aligned} {\text {E}}\! \left[ \Vert \varvec{Z}_{k}\Vert ^2 \right]&= {\text {O}}(k^2) \qquad \text {as } k\rightarrow \infty , \end{aligned}$$(A.2)$$\begin{aligned} {{\text {E}}\! \left[ \Vert \varvec{M}_{k}\Vert ^2 \right] }&= {{\text {O}}(k)} \qquad \text {as } k\rightarrow \infty . \end{aligned}$$(A.3) -
(iii)
If, in addition, Hypotheses 1, 4 and \(\Vert \mathsf {\Gamma }(\varvec{z})\Vert = {\text {O}}(\Vert \varvec{z}\Vert )\) as \(\Vert {\varvec{z}}\Vert \rightarrow \infty \) hold, then we have (A.1), (A.2), (A.3) and
$$\begin{aligned} {\text {E}}\! \left[ \Vert \varvec{M}_k\Vert ^4 \right] = {\text {O}}(k^2) \qquad \text {as } k\rightarrow \infty . \end{aligned}$$(A.4)
Proof
(i). By (2.8), for each \(k\in {\mathbb {N}}\), we have
Hence, using the triangle inequality, for each \(k\in {\mathbb {N}}\), we get
We recall the power mean inequality used in the proofs several times: for all \(n\in {\mathbb {N}}\), \(a_i\in {\mathbb {R}}_+\), \(i=1,\ldots ,n\), and \(0<k_1 \le k_2\), we get \(\left( \frac{1}{n}\sum _{i=1}^n a_i^{k_1}\right) ^{\frac{1}{k_1}} \le \left( \frac{1}{n}\sum _{i=1}^n a_i^{k_2}\right) ^{\frac{1}{k_2}}.\) Using \(\Vert \varvec{x}\Vert \le \sqrt{p}\sum _{i=1}^p x_i\) for \(\varvec{x} = (x_1,\ldots ,x_p)\in {\mathbb {R}}_+^p\) and the power mean inequality, for any random vector \(\varvec{\xi }=(\xi _1,\ldots ,\xi _p)\) having non-negative coordinates, we obtain that \({\text {E}}\! \left[ \Vert \varvec{\xi }\Vert \right] \le p \Vert \textrm{E}[\varvec{\xi }]\Vert .\) Consequently, we get
where \({\tilde{C}}=\sup _{j\in {\mathbb {Z}}_+} \Vert \tilde{{\textsf{m}}}^j\Vert <\infty \) due to Hypothesis 5 (see (4.29)) and the constant \(C_{\varvec{h}}\) is defined by
This yields (A.1).
(ii). Since the finiteness of the second moment of the norm of a random vector implies that of the first moment, using part (i) of the present lemma, we have (A.1). Taking into account that the expected value and the trace of a random square matrix commute, we have
where for the last inequality, we used that for any matrix \({\textsf{B}}\in {\mathbb {R}}^{p\times p}\), we have \({\text {tr}} \left( {\textsf{B}} \right) = \sum _{i=1}^p b_{i,i} \le \sum _{i=1}^p \vert b_{i,i}\vert \le p\Vert {\textsf{B}}\Vert \). We know that (A.1) holds, therefore, in order to get (A.2), it is enough to see \(\left\| {\text {Var}}\! \left[ \varvec{Z}_{k} \right] \right\| = {\text {O}}(k^2)\) as \(k\rightarrow \infty \). Using the variance decomposition formula, equations (2.8) and (2.9), and the assumption on \(\varvec{\varepsilon }({\varvec{z}})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), together with the properties of variance, for each \(k\in {\mathbb {N}}\), we obtain
In what follows, we will use that for all \({\varvec{z}}=(z_1,\ldots ,z_p)\in {\mathbb {R}}^p\), we have
where \(\Vert \varvec{\mathsf {\Sigma }}\Vert :=\sum _{i=1}^p \Vert \mathsf {\Sigma }_i\Vert \). Proceeding recursively in (A.7), by (A.8), the triangle inequality and the symmetry of covariance, we get
and now we look for an upper bound for each term on the right hand side of (A.9).
In case of the first term, we easily have
In case of the second term, since \(\Vert \varvec{h}(\varvec{z})\Vert = {\text {O}}(1)\) as \(\Vert {\varvec{z}}\Vert \rightarrow \infty \), by (A.6), we get
In case of the third term, using (A.1) and (A.6), we get
In case of the fourth term, by the assumption \(\varvec{\varepsilon }(\varvec{z}) = \mathsf {\Lambda }\varvec{z} + \varvec{h}(\varvec{z})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), we have
where the constant \(C_{\varvec{h}}\) is defined in (A.6). Using (A.1), this implies that
In case of the fifth term, by the assumption \(\Vert \mathsf {\Gamma }(\varvec{z})\Vert ={\text {O}}(\Vert \varvec{z}\Vert )\) as \(\Vert {\varvec{z}}\Vert \rightarrow \infty \) and the equation (A.1), we get
where
Taking into account the previous estimations and that \(\sup _{j\in {\mathbb {Z}}_+}\Vert \tilde{{\textsf{m}}}^j\Vert <\infty \) due to Hypothesis 5 (see (4.29)), the inequality (A.9) implies that
which concludes the proof of (A.2).
Next, we verify (A.3). Then
where, by the tower rule, we have \({\text {E}}\! \left[ \varvec{M}_{k}(\varvec{M}_{k})^\top \right] = {\text {E}}\! \left[ {\text {Var}} \left[ \varvec{Z}_{k} \;\Big |\; {\mathcal {F}}_{k-1} \right] \right] \). Consequently, using (2.9), (A.8) and the inequality \({\text {tr}} \left( {\textsf{B}} \right) \le p\Vert {\textsf{B}}\Vert \) for any matrix \({\textsf{B}}\in {\mathbb {R}}^{p\times p}\) (justified earlier), we get that
Therefore, (A.3) follows from (A.10), (A.11), (A.12) and (A.13).
(iii). Consider the following reformulation of the martingale difference sequence \((\varvec{M}_k)_{k\in {\mathbb {N}}}\) defined in (4.1):
where we used (2.8).
Applying the power mean inequality twice, and adding and subtracting the random variable \(\sum _{j=1}^{\phi _{k-1,i}(\varvec{Z}_{k-1})} \varvec{m}_{i} = \phi _{k-1,i}(\varvec{Z}_{k-1}) \varvec{m}_{i}\), we get
for \(k\in {\mathbb {N}}\). Using again the power mean inequality and that \(\Vert \varvec{z}\Vert ^4 = \left( z_1^2 + \ldots + z_p^2\right) ^2\), \({\varvec{z}}=(z_1,\ldots ,z_p)\in {\mathbb {Z}}_+^p\), we get
We will compute the previous upper bounds by first determining the corresponding conditional expectations with respect to \({\mathcal {F}}_{k-1}\). Using the notations (2.7) and the Markov property of \(({\varvec{Z}}_k)_{k\in {\mathbb {Z}}_+}\), for each \(k\in {\mathbb {N}}\), \(i,l\in \{1,\ldots ,p\}\), we obtain
and, by (2.6), (2.7) and Lemma A.2 together with the independence of \(\varvec{\phi }_{k-1}(\varvec{z})\), \({\varvec{z}}\in {\mathbb {Z}}_+^p\), \({{\textbf {X}}}_{k-1,j,i}\), \(j\in {\mathbb {N}}\), \(i\in \{1,\ldots ,p\}\) and \({\varvec{Z}}_{k-1}\), we get that
where \(\mathsf {\Sigma }_{i,l,l}\) is the \(l-\)th diagonal element of the variance matrix \(\mathsf {\Sigma }_i\) (see (2.6)).
Consequently, we get
Thus, to get (A.4) it is enough to check that for each \(i\in \{1,\ldots ,p\}\), \({\text {E}}\! \left[ \varepsilon _i (\varvec{Z}_{k}) \right] \), \({\text {E}}\! \left[ \mathsf {\Gamma }_{i,i}(\varvec{Z}_{k}) \right] \), \({\text {E}}\! \left[ \varepsilon _i (\varvec{Z}_{k})^2 \right] \), \({\text {E}}\! \left[ \kappa _i(\varvec{Z}_{k}) \right] \) are \({\text {O}}(k^2)\) as \(k\rightarrow \infty \). Using (A.10) and (A.11), we get
as \(k\rightarrow \infty \) for \(i\in \{1,\ldots ,p\}\). Moreover, by (A.1) and (A.2), we obtain
as \(k\rightarrow \infty \) for \(i\in \{1,\ldots ,p\}\), where \(C_{\varvec{h}}\) is defined in (A.6). Finally, by Hypothesis 4, we get
and hence, by (A.2), we have \({\text {E}}\! \left[ \kappa _i(\varvec{Z}_{k}) \right] \le \kappa _i(\varvec{0}_p) + C_\kappa {\text {E}}\! \left[ \Vert \varvec{Z}_{k}\Vert ^2 \right] = {\text {O}}(k^2)\) as \(k\rightarrow \infty \) for \(i\in \{1,\ldots ,p\}\). \(\square \)
Next, we recall a result on weak convergence of random step processes toward a diffusion process due to Ispány and Pap [20, Corollary 2.2].
Theorem A.4
Let \(\varvec{b}:{\mathbb {R}}_+\times {\mathbb {R}}^p\rightarrow {\mathbb {R}}^p\) and \({\textsf{C}}:{\mathbb {R}}_+\times {\mathbb {R}}^p\rightarrow {\mathbb {R}}^{p\times r}\) be continuous functions. Assume that uniqueness in the sense of probability law holds for the SDE
with initial value \(\varvec{{\mathcal {U}}}_0=\varvec{u}_0\) for all \(\varvec{u}_0\in {\mathbb {R}}^p\), where \((\varvec{{\mathcal {W}}}_t)_{t\in {\mathbb {R}}_+}\) is an r–dimensional standard Wiener process. Let \(\varvec{\eta }\) be a probability measure on \( ({\mathbb {R}}^p,{\mathcal {B}}({\mathbb {R}}^p))\), and let \((\varvec{{\mathcal {U}}}_t)_{t\in {\mathbb {R}}_+}\) be a solution of (A.14) with initial distribution \(\varvec{\eta }\). For each \(n \in {\mathbb {N}},\) let \((\varvec{U}_{k}^{(n)})_{k\in {\mathbb {Z}}_+}\) be a sequence of \({\mathbb {R}}^p\)–valued random vectors adapted to a filtration \(({\mathcal {F}}_{k}^{(n)})_{k\in {\mathbb {Z}}_+}\) (i.e, \(\varvec{U}_{k}^{(n)}\) is \({\mathcal {F}}_{k}^{(n)}\)–measurable) such that \({\text {E}}\! \left[ \Vert \varvec{U}_k^{(n)}\Vert ^2 \right] <\infty \) for each \(n, k \in {\mathbb {N}}\). Let
Suppose \(\varvec{U}_0^{(n)} {\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}} \varvec{\eta }\) as \(n \rightarrow \infty \), and that for all \(T>0,\)
-
(i)
\(\sup _{t\in [0,T]} \left\| \sum _{k=1}^{\lfloor n t\rfloor } {\text {E}} \left[ \varvec{U}_k^{(n)} \;\Big |\; {\mathcal {F}}_{k-1}^{(n)} \right] - \int _0^t \varvec{b}(s,\varvec{{\mathcal {U}}}_s^{(n)})\,\textrm{d}s \right\| {\mathop {\longrightarrow }\limits ^{\textrm{P}}} 0\) as \(n\rightarrow \infty \),
-
(ii)
\(\sup _{t\in [0,T]} \left\| \sum _{k=1}^{\lfloor n t\rfloor } {\text {Var}} \left[ \varvec{U}_k^{(n)} \;\Big |\; {\mathcal {F}}_{k-1}^{(n)} \right] - \int _0^t {\textsf{C}}(s,\varvec{{\mathcal {U}}}_s^{(n)}){\textsf{C}}(s,\varvec{{\mathcal {U}}}_s^{(n)})^\top \,\textrm{d}s \right\| {\mathop {\longrightarrow }\limits ^{\textrm{P}}} 0\) as \(n\rightarrow \infty \),
-
(iii)
\(\sum _{k=1}^{\lfloor n T \rfloor } {\text {E}} \left[ \Vert \varvec{U}_k^{(n)}\Vert ^2 {\mathbbm {1}}_{\{\Vert \varvec{U}_k^{(n)}\Vert >\theta \}} \;\Big |\; {\mathcal {F}}_{k-1}^{(n)} \right] {\mathop {\longrightarrow }\limits ^{\textrm{P}}} 0\) as \(n\rightarrow \infty \) for all \(\theta >0\).
Then \(\varvec{{\mathcal {U}}}^{(n)}{\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}}\varvec{{\mathcal {U}}}\) as \(n \rightarrow \infty \).
For measurable mappings \(\varvec{\Phi }\), \(\varvec{\Phi }_n: {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\rightarrow {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(n\in {\mathbb {N}}\), let \({{\textbf {C}}}_{\varvec{\Phi },(\varvec{\Phi }_n)_{n\in {\mathbb {N}}}}\) be the set of all functions \(\varvec{f}\in {{\textbf {C}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\) for which \(\varvec{\Phi }_n (\varvec{f}_n) \rightarrow \varvec{\Phi }( \varvec{f})\) whenever \(\varvec{f}_n{\mathop {\longrightarrow }\limits ^{\mathcal {\textrm{lu}}}} \varvec{f}\) with \(\varvec{f}_n\in {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(n\in {\mathbb {N}}\). The notation \({\mathop {\longrightarrow }\limits ^{\mathcal {\textrm{lu}}}}\) refers to local uniform convergence, i.e., \(\sup _{t\in [0,T]} \Vert \varvec{f}_n(t)-\varvec{f}(t)\Vert \rightarrow 0\) as \(n\rightarrow \infty \) for all \(T\in {\mathbb {R}}_{++}\). The following result is a kind of continuous mapping theorem, which can be considered as a consequence of Theorem 3.27 in Kallenberg [24], and its proof also appears in Ispány and Pap [20, Lemma 3.1].
Theorem A.5
Let \((\varvec{{\mathcal {U}}}_t)_{t\in {\mathbb {R}}_+}\) and \((\varvec{{\mathcal {U}}}_t^{(n)})_{t\in {\mathbb {R}}_+}\), \(n\in {\mathbb {N}}\), be \({\mathbb {R}}^p\)–valued stochastic processes with càdlàg paths such that \(\varvec{{\mathcal {U}}}^{(n)}{\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}} \varvec{{\mathcal {U}}}\) as \(n \rightarrow \infty \). Let \(\varvec{\Phi }: {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\rightarrow {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\) and \(\varvec{\Phi }_n: {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\rightarrow {{\textbf {D}}}({\mathbb {R}}_+,{\mathbb {R}}^p)\), \(n\in {\mathbb {N}}\), be measurable mappings such that there exists a measurable set \({{\textbf {C}}}\subset {{\textbf {C}}}_{\varvec{\Phi },(\varvec{\Phi }_n)_{n\in {\mathbb {N}}}}\) with \( {\text {P}} \left[ \varvec{{\mathcal {U}}}\in {{\textbf {C}}} \right] =1\). Then \(\varvec{\Phi }_n (\varvec{{\mathcal {U}}}^{(n)}){\mathop {\longrightarrow }\limits ^{{\mathcal {L}}}} \varvec{\Phi }( \varvec{{\mathcal {U}}})\) as \(n\rightarrow \infty \).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Barczy, M., González, M., Martín-Chávez, P. et al. Diffusion approximation of critical controlled multi-type branching processes. Rev. Real Acad. Cienc. Exactas Fis. Nat. Ser. A-Mat. 118, 101 (2024). https://doi.org/10.1007/s13398-024-01593-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13398-024-01593-0
Keywords
- Controlled branching processes
- Multi-type branching processes
- Two-sex branching processes
- Diffusion approximation
- Squared Bessel processes