1 Introduction

Infectious diseases are the public enemy of mankind and have brought great catastrophe to mankind. Authors were committed to finding ways to control infectious diseases from pathology, epidemiology, culture and other aspects. The mathematical modeling method is considered as an effective method to understand the development and evolution of variables [1,2,3,4,5,6,7,8,9,10,11,12,13]. Mathematical models have been used to study the spread and evolution of infectious diseases in the human population. For example, Bernoulli in 1760 proposed the first mathematical model in epidemiology, for studying the spread and inoculation of smallpox. By classifying human populations into three separate categories: the susceptible S, the infected I and the removed R, Kermack and McKendrick [14] in 1927 proposed a well-known compartmental model. It is assumed in the model that the susceptible class can transform into the infective class through the contact with infected persons, and the infectives can be recovered through treatment so that have permanent immunity. Therefore, it is now well known as the SIR model, which has been widely studied by [15,16,17,18,19,20]. However, some research showed that some diseases, such as influenza [21], viral diarrhea [22] and hand, foot and mouth disease [23], the immunity gained after an illness is temporary, then part of the recovered can transfer to the susceptible population again, this model is known as the SIS model [24].

Many researchers pay special attention to the incidence rate of infectious diseases. A nonlinear incidence rate plays an important role in the evolution of infectious diseases, because epidemic models described by nonlinear incidence rates may be more suitable and realistic, which also exhibit much richer dynamics. For example, the standard incidence rate \(\beta \frac{SI}{N}\) or the bilinear incidence rate \(\beta SI\) is proposed and used in reference [25,26,27,28]. And saturation infection rate \(\frac{\beta SI}{1+\alpha I}\) is used in reference [29]. A special non-monotone with the form \(\beta S^{p} I^{q}\) is proposed and investigated by Severo [30], Liu et al. [31], Hethcote et al. [32], and Y. Li and Muldowney [33]. About more general forms of incidence functions, please see Pugliese [34], Thieme [35], Korobeinikov [36], Ruan and Wang [37] and Huang [38].

Motivated by the previous work, we pay special attention to the following model which is an improved case of Liu et al. [31]:

$$ \textstyle\begin{cases} { \dot{S}(t)=\varLambda-\mu S(t)- \beta S^{p}(t)I(t)+\gamma I(t)},\\ {\dot{I}(t)=\beta S^{p}(t)I(t)-(\gamma+\alpha+\mu)I(t),} \end{cases} $$
(1)

where Λ is the recruitment rate of the population including the birth and migration. α is mortality due to illness. p is positive integer. The biological significance of other parameters please see Liu et al. [31].

Generally in the dynamic modeling of infectious diseases, we will first consider a deterministic model, however, considering the real world is filled with random and unpredictable, using stochastic model to model the dynamic of infectious diseases is more practical. Different stochastic disturbance approaches have been introduced into epidemic models. On the whole, there are four common random stochastic approaches. The first one is to introduce the parameters’ disturbance to a deterministic system (see, e.g., [39,40,41,42,43,44,45,46,47]), the second one is to investigate the stochasticity by using the method of time Markov chains (see, e.g., [48,49,50,51,52]). The third one is to consider Lévy jump noise (see, e.g., [53,54,55]). The fourth one is to study stochastic disturbance around the positive equilibria of a deterministic system (see, e.g., [56, 57]). Similar ideas have also been used in other modeling and analysis, for example [58,59,60,61,62].

In the spread progress of infectious diseases, the transmission coefficient is often subject to interference from the environment. Mathematically, this interference from the environment can be described as a standard Brownian motion simply. In this paper, based on model (1), we assume that the nonlinear incidence rate is perturbed by white noise so that

$$ \beta\rightarrow\beta+ \sigma\dot{B}(t), $$
(2)

where \(B(t)\) is a standard Brownian motion with intensity \(\sigma>0\). Then the resultant model takes the following form:

$$ \textstyle\begin{cases} { dS(t)=\varLambda-\mu S(t)- \beta S^{p}(t)I(t)+\gamma I(t)\,dt-\sigma S^{p}(t)I(t)\, {d}B(t)},\\ { dI(t)=\beta S^{p}(t)I(t)-(\mu+\alpha+\gamma)I(t)\, {d}t+\sigma S^{p}(t)I(t)\,dB(t)}. \end{cases} $$
(3)

Our main objective in the rest of present paper is to attempt to establish the threshold dynamics of system (3) similar to the deterministic system.

2 Preliminaries

Throughout this paper, we let \(\mathbb{R}^{d}\): the d-dimensional Euclidean space. \(\mathbb{R}^{d}_{+}:= \{x\in\mathbb{R}^{d}: x_{i} > 0, 1\leq i\leq d\}\), i.e. the positive cone.

Let \((\varOmega,\mathcal{F}, \mathcal{P})\) be the complete probability space adapted to the filtration \(\{\mathcal{F}\}_{t\geq0}\) and \(\{B_{t}\} _{t\geq0}\) is a one-dimensional Brownian motion defined on it. \(\mathcal {L}^{1}(\mathbb{R}_{+};\mathbb{R}^{d})\) is the family of all \(\mathbb {R}^{d}\)-valued measurable \(\{\mathcal{F}_{t}\}\)-adapted processes \(f=\{f(t)\}_{t\geq0}\) and

$$P \biggl[ \int_{0}^{T} \bigl\vert f(t) \bigr\vert \,dt< \infty \text{ for all } T>0 \biggr]=1. $$

Let \(C^{2,1}(\mathbb{R}^{d} \times\mathbb{R}_{+};\mathbb{R})\) denote the family of all real-valued functions \(V(x,t)\) defined on \(\mathbb{R}^{d} \times\mathbb{R}_{+}\) such that they are twice continuously differentiable in x and once in t. We set

V t = V t , V x = ( V x 1 , V x 2 , , V x d ) , V x x = ( 2 V x i x j ) d × d = ( 2 V x 1 x 1 2 V x 1 x d 2 V x d x 1 2 V x d x d ) .

Clearly, when \(V\in C^{2,1}(R\times R_{+}; R)\), we have \(V_{x}=\frac {\partial V}{\partial x}, V_{xx}=\frac{\partial^{2} V}{\partial x^{2}}\).

Lemma 2.1

(The one-dimensional Itô’s formula [63])

Let \(x(t)\) be an Itô’s process on \(t\geq0\) with the stochastic differential

$$dx(t)=f(t)\,dt+g(t)\,dB_{t}, $$

where \(f\in\mathcal{L}^{1}(\mathbb{R}_{+};\mathbb{R})\) and \(g\in\mathcal {L}^{2}(\mathbb{R}_{+};\mathbb{R})\). Let \(V\in C^{2,1}(\mathbb{R}^{d} \times \mathbb{R}_{+};\mathbb{R})\). Then \(V(x(t),t)\) is again an Itô’s process with the stochastic differential given by

$$\begin{aligned} dV\bigl(x(t),t\bigr)={}& \biggl[V_{t}\bigl(x(t),t\bigr)+V_{x} \bigl(x(t),t\bigr)f(t)+\frac {1}{2}V_{xx}\bigl(x(t),t \bigr)g^{2}(t) \biggr]\,dt \\ &{}+V_{x}\bigl(x(t),t\bigr)g(t)\,dB_{t}. \end{aligned}$$

Let f be an integrable function on \([0,+\infty)\), define \(\langle f(t)\rangle=\frac{1}{t}\int_{0}^{t}f(\theta)\,d\theta\). Then we have the following definition [43].

Definition 2.1

For system (3),

  1. (i)

    the diseases \(I(t)\) is said to be extinctive if \(\lim_{t\rightarrow+\infty}I(t)=0\);

  2. (ii)

    the diseases \(I(t)\) is said to be permanent in mean if there exists a positive constant λ such that \(\liminf_{t\rightarrow+\infty}\langle I(t)\rangle\geq\lambda\).

By using the methods from Lahrouz and Omari [42], we can prove the following lemma.

Lemma 2.2

For any initial value \((S_{0}, I_{0})\in R^{2}_{+}\), there exists a unique solution \((S(t), I(t))\) to system (3) on \(t\geq0\), and the solution will remain in \(R^{2}_{+}\) with probability one, namely, \((S(t), I(t))\in R^{2}_{+}\) for all \(t\geq0\) almost surely.

Proof

Firstly, we know that, for any initial value \((S_{0}, I_{0})\in R^{2}_{+}\), because the coefficients of system (3) are locally Lipschitz continuous, then there exists a unique local solution on \([0,\tau _{\epsilon})\) where \(\tau_{\epsilon}\) is the explosion time. To prove this solution is global, we need to show \(\tau_{\epsilon}=\infty\) almost surely. To do it, let \(\epsilon_{0}>0\) such that \(S_{0} > \epsilon_{0}, I_{0} >\epsilon_{0}\). For any positive ϵ, which satisfies \(\epsilon\leq\epsilon_{0}\), define the stopping time \(\tau _{\epsilon}\) by

$$\tau_{\epsilon}=\inf\bigl\{ t \in[0, \tau_{\epsilon}): S(t)\leq\epsilon \text{ or } I(t)\leq\epsilon\bigr\} , $$

with the traditional setting \(\inf\varnothing= \infty\), where ∅ denotes the empty set. Clearly, \(\tau_{\epsilon}\) is increasing as \(\epsilon\rightarrow0\). Set \(\tau_{0} = \lim_{\epsilon\rightarrow0}\), then \(\tau_{0}\leq\tau_{\epsilon}\) a.s, hence we only need to prove \(\tau_{0} =\infty\) a.s. Otherwise, then there exist a pair of constants \(T > 0\) and \(\delta\in(0, 1)\) such that \(P\{\tau_{0}\leq T\} > \delta\). Hence there exists a positive constant \(\epsilon_{1}\leq\epsilon_{0}\) such that \(P\{\tau_{0}\leq T\} > \delta\) for any positive \(\epsilon\leq\epsilon_{1}\).

Define \(C^{2}\) function \(N: R^{2}_{+}\rightarrow R^{2}_{+}\) by \(N(t) = S(t)+I(t)\). Obviously, \(N(t)\) satisfies

$$dN(t) \leq\bigl(\varLambda-\mu N(t)\bigr)\,dt. $$

After a simple calculation, it easy to see that, for all \(t <\tau _{\epsilon}\),

$$N(t) \leq\max\biggl\{ S_{0} + I_{0}, \frac{\varLambda}{\mu} \biggr\} := C_{1}. $$

Define a function \(V: R^{2}_{+}\rightarrow R^{2}_{+}\) by

$$V (S, I) =-\ln\frac{ S}{C_{1}}-\ln\frac{I}{C_{1}}. $$

Obviously, V is positive definite. Using Itô’s formula, we get

$$dV = LV \,dt + \sigma S^{p-1}(I-S)\,dB, $$

where

$$LV=-\frac{\varLambda}{S}+\mu+\beta S^{p-1}I-\gamma\frac{I}{S}+ \frac {1}{2}\sigma^{2}S^{2(p-1)}I^{2}-\beta S^{p}+\mu+\alpha+\gamma+\frac {1}{2}\sigma^{2}S^{2p}. $$

Then we have

$$LV\leq2\mu+\alpha+\gamma+\beta C_{1}^{p}+\frac{1}{2} \sigma^{2}C_{1}^{2p}+\frac {1}{2} \sigma^{2}C_{1}^{2p}=C_{2}. $$

Thus,

$$dV\leq C_{2}\,dt+\sigma S^{p-1}(I-S)\,dB. $$

Integrating both sides from 0 to \(\tau_{\epsilon}\wedge T\), and then taking expectations, yields

$$EV \bigl(S(\tau_{\epsilon}\wedge T), I(\tau_{\epsilon}\wedge T)\bigr) \leq V (S_{0}, I_{0}) + C_{2}T. $$

Set \(\varOmega_{e} = \{\tau_{e}\leq T\}\) for any positive \(\epsilon\leq \epsilon_{1}\) and then \(P(\varOmega_{E}) >\delta\). Note that, for every \(\omega \in\varOmega_{\epsilon}\), there is at least one of \(S(\tau_{\epsilon}, \omega), I(\tau_{\epsilon}, \omega)\) equals ϵ, then

$$V \bigl(S(\tau_{\epsilon}), I(\tau_{\epsilon})\bigr) \geq-\ln \frac{\epsilon}{C_{1}}. $$

Thus,

$$\begin{aligned} V (S_{0}, I_{0}) + C_{2}T &\geq E \bigl[I_{\varOmega_{\epsilon}}V \bigl(S(\tau_{e}\epsilon \wedge T), I( \tau_{\epsilon}\wedge T)\bigr)\bigr] \\ &= P(\varOmega_{\epsilon})V \bigl(S(\tau_{\epsilon}), I( \tau_{\epsilon})\bigr) \\ &>{-}\delta\ln\frac{\epsilon}{C_{1}}, \end{aligned}$$

where \(I_{\varOmega_{\epsilon}}\) is the indicator function of \(\varOmega_{e}\). Letting \(\epsilon\rightarrow0\) leads to the contradiction \(\infty> V (S_{0}, I_{0}) + C_{2}T =\infty\). Thus, we must have \(\tau_{\epsilon}=\infty\) almost surely. The proof of Lemma 2.2 is completed. □

By using the methods from Ji et al. [64], we can prove the following lemma and remark.

Lemma 2.3

For any initial value \((S_{0}, I_{0})\in\overline{R}^{2}_{+}\), there exists a unique solution \((S(t), I(t))\) to system (3) on \(t\geq0\), and the solution will remain in \(\overline{R}^{2}_{+}\) with probability 1, namely, \((S(t), I(t))\in\overline{R}^{2}_{+}\) for all \(t\geq0\) a.s.

Remark 2.1

In fact, from system (3), we have

$$d\bigl(S(t)+I(t)\bigr)\leq\bigl(\varLambda-\mu\bigl(S(t)+I(t)\bigr)\bigr)\,dt. $$

Then we have

$$S(t)+I(t)\leq\frac{\varLambda}{\mu}+\biggl(S_{0}+I_{0}- \frac{\varLambda}{\mu}\biggr)e^{-\mu t}. $$

If \(S_{0}+I_{0}\leq\frac{\varLambda}{\mu}\) then \(S(t)+I(t)\leq\frac{\varLambda }{\mu}\). Therefore, the region

$$\varGamma=\biggl\{ \bigl(S(t),I(t)\bigr)\in R_{+}^{2}:S(t)+I(t)\leq \frac{\varLambda}{\mu}, t\geq0\biggr\} $$

is an invariant set, then, from now on, we always assume the initial value \((S(0),I(0))\in\varGamma\).

By using the methods from Meng et al. [43], we can prove the following lemma.

Lemma 2.4

Let \((S(t), I(t))\) be a solution of system (3) with initial value \((S(0), I(0))\in R^{2}_{+}\). Then

$$\lim_{t\rightarrow+\infty}\frac{\int_{0}^{t}\sigma S^{p}(\tau)\, {d}B(\tau)}{t}=0\quad \textit{a.s.} $$

Proof

Let \(Z(t)=\int_{0}^{t}\sigma S^{p}(\tau)\,dB(\tau)\) and \(\theta>2\). By the Burkholder–Davis–Gundy inequality in [63] and Lemma 2.1, we have

$$E \Bigl[\sup_{0\leq\tau\leq t} \bigl\vert Z(\tau) \bigr\vert ^{\theta} \Bigr]\leq C_{\theta}E \biggl[ \int_{0}^{t}\sigma^{2} S^{2p}( \tau)\,d\tau \biggr]^{\frac{\theta}{2}}\leq C_{\theta}t^{\frac{\theta}{2}}E \Bigl[\sup_{0\leq\tau\leq t}\sigma^{\theta} S^{p\theta}(\tau) \Bigr]\leq M_{\theta }C_{\theta}t^{\frac{\theta}{2}}, $$

where \(M_{\theta}=\sigma^{\theta} (\frac{\varLambda}{\mu} )^{p\theta}\). Then, for any \(0<\varepsilon<\frac{\theta}{2}-1\),

$$\begin{aligned} \mathbb{P} \Bigl\{ \omega:\sup_{k\delta\leq t\leq(k+1)\delta} \bigl\vert Z(t) \bigr\vert ^{\theta}>(k\delta)^{1+\varepsilon+\frac{\theta}{2}} \Bigr\} &\leq\frac{E ( \vert Z((k+1)\delta) \vert ^{\theta} )}{(k\delta )^{1+\varepsilon+\frac{\theta}{2}}} \\ &\leq\frac{M_{\theta}C_{\theta}[(k+1)\delta]^{\frac{\theta }{2}}}{(k\delta)^{1+\varepsilon+\frac{\theta}{2}}} \\ &\leq\frac{2^{\frac{\theta}{2}}M_{\theta}C_{\theta}}{(k\delta )^{1+\varepsilon}}. \end{aligned}$$

By Doob’s martingale inequality and the Borel–Cantelli lemma in [63], for almost all \(\omega\in\varOmega\), we get

$$ \sup_{k\delta\leq t\leq(k+1)\delta} \bigl\vert Z(t) \bigr\vert ^{\theta}\leq (k\delta)^{1+\varepsilon+\frac{\theta}{2}} $$
(4)

holds for all but finitely many k. Thus, there exists a positive \(k_{0}(\omega)\), for almost all \(\omega\in \varOmega\), for which (4) holds when \(k\geq k_{0}(\omega)\). Hence, if \(k\geq k_{0}(\omega)\) and \(k\delta\leq t\leq(k+1)\delta\), then, for almost all \(\omega\in\varOmega\),

$$\frac{\ln \vert Z(t) \vert ^{\theta}}{\ln t}\leq\frac{ (1+\varepsilon +\frac{\theta}{2} )\ln(k\delta)}{\ln(k\delta)}=1+\varepsilon+\frac {\theta}{2}. $$

So, we have

$$\bigl\vert Z(t) \bigr\vert \leq t^{\frac{1}{2}+\frac{1+\varepsilon}{\theta}}. $$

Then, for the above ε, there exist a constant \(T(\omega)\) and a set \(\varOmega_{\epsilon}\), such that \(\mathbb{P}(\varOmega_{\epsilon })\geq1-\epsilon\) and for \(t\geq T(\omega)\), \(\omega\in\varOmega_{\epsilon}\),

$$0\leq\liminf_{t\rightarrow+\infty}\frac{ \vert Z(t) \vert }{t}\leq\limsup _{t\rightarrow+\infty}\frac{ \vert Z(t) \vert }{t}\leq\limsup_{t\rightarrow+\infty}t^{\frac{1+\varepsilon}{\theta}-\frac{1}{2}}=0. $$

Then we have

$$\lim_{t\rightarrow+\infty}\frac{ \vert Z(t) \vert }{t}=0, $$

i.e.

$$\lim_{t\rightarrow+\infty}\frac{Z(t)}{t}=\lim_{t\rightarrow+\infty} \frac {\int_{0}^{t}\sigma S^{p}(\tau)\,dB(\tau)}{t}=0. $$

This completes the proof of Lemma 2.4. □

3 Dynamics of the deterministic system

It is easy to see that the equilibrium of system (1) satisfies

$$ \textstyle\begin{cases}\varLambda-\mu S(t)-\beta S^{p}(t)I(t)+\gamma I(t)=0,\\ \beta S^{p}(t)I(t)-(\mu+\alpha+\gamma)I(t)=0, \end{cases} $$
(5)

resulting at most two equilibria: \(E_{0} (\frac{\varLambda}{\mu}, 0 ), E^{*} (S^{*}, I^{*} )\), where

$$S^{*}= \sqrt[p]{\frac{\mu+\alpha+\gamma}{\beta}},\qquad I^{*}= { \frac {\varLambda-\mu S^{*}}{\mu+\alpha}}. $$

From the expressions of \(I^{*}\), we know if

$$\frac{\varLambda}{\mu}>\sqrt[p]{\frac{\mu+\alpha+\gamma}{\beta}}, $$

system (1) has unique positive equilibrium \(E^{*}\). Regarding the stability of these equilibria, we have the following theorem.

Theorem 3.1

Define

$$ \mathcal{R}=\frac{\beta (\frac{\varLambda}{\mu} )^{p}}{\mu +\alpha+\gamma}. $$
(6)

Then, for system (1), we have

  1. (i)

    if \(\mathcal{R}<1\), it has a unique stable ‘diseases-extinction’ equilibrium point \(E_{0}\), which implies the extinction of the diseases;

  2. (ii)

    if \(\mathcal{R}>1\), it has a stable positive equilibrium \(E^{*}\), which implies the permanence of the disease.

Proof

The Jacobian of the linearization system of the system (1) at \(E_{0}\) gives

J 0 = ( μ r β ( Λ μ ) p 0 β ( Λ μ ) p ( μ + α + γ ) ) ,

which has the following eigenvalues:

$$\lambda_{1}=-\mu, \qquad\lambda_{2}=\beta \biggl( \frac{\varLambda}{\mu} \biggr)^{p}-(\mu+\alpha+\gamma). $$

Since \(\mu>0\), resulting \(\lambda_{1}<0\), according to the stability theory, \(E_{0}\) is stable if and only if \(\lambda_{2}<0\), i.e., \(\mathcal{R}<1\).

At \(E^{*}\) the Jacobian takes the form of

J 1 = ( β p S p 1 I μ γ β S p β p S p 1 I 0 ) ,

and the eigenvalues of matrix \(J_{1}\) satisfy

$$\begin{aligned} &\lambda_{1}+\lambda_{2}=-\beta p {S^{*}}^{p-1}I^{*}- \mu< 0, \\ &\lambda_{1}\lambda_{2}=- \bigl(\gamma- \beta{S^{*}}^{p} \bigr)\beta p {S^{*}}^{p-1}I^{*} \\ &\phantom{\lambda_{1}\lambda_{2}}=(\mu+\alpha)\beta p {S^{*}}^{p-1}I^{*}>0. \end{aligned}$$

This implies \(\lambda_{1}\) and \(\lambda_{2}\) have negative real parts. Thus the equilibrium \(E^{*}\) is stable. □

4 Dynamics of the stochastic system

In this section, we try to explore the conditions leading to the extinction and persistence of the infectious disease.

4.1 Extinction

Let us introduce

$$ \mathcal{R}^{*}=\frac{\beta (\frac{\varLambda}{\mu} )^{p}}{\mu +\alpha+\gamma}-\frac{\sigma^{2} (\frac{\varLambda}{\mu} )^{2p}}{2(\mu+\alpha+\gamma)}=\mathcal{R}- \frac{\sigma^{2} (\frac {\varLambda}{\mu} )^{2p}}{2(\mu+\alpha+\gamma)}, $$

where \(\mathcal{R}\) is given as in (6). Then we have the following.

Theorem 4.1

For system (3),

  1. (i)

    If \(\sigma^{2}>\max\{\beta (\frac{\mu}{\varLambda} )^{p},\frac{\beta^{2}}{2(\mu+\alpha+\gamma)}\}\), then the infectious disease of system (3) goes to extinction almost surely.

  2. (ii)

    If \(\sigma^{2}<\beta (\frac{\mu}{\varLambda} )^{p}\), then the infectious disease of system (3) goes to extinction almost surely for \(\mathcal{R}^{*}<1\).

Moreover, \(\lim_{t\rightarrow+\infty}S(t)=\frac{\varLambda}{\mu}\), almost surely.

Proof

Let \((S(t), I(t))\) be a solution of system (3) with initial value \((S(0), I(0)) \in R^{2}_{+}\). Applying Itô’s formula to system (3) results in

$$ d\ln I(t)= \biggl(\beta S^{p}(t)-(\mu+\alpha+ \gamma)-\frac{\sigma ^{2}}{2}S^{2p}(t) \biggr)\,dt+\sigma S^{p}(t)\,dB(t). $$
(7)

Integrating both sides of (7) from 0 to t gives

$$ \ln I(t)= \int_{0}^{t} \biggl(\beta S^{p}(\tau)- \frac{\sigma ^{2}}{2}S^{2p}(\tau) \biggr)\,d\tau-(\mu+\alpha+ \gamma)t +M(t)+\ln I(0), $$
(8)

where

$$M(t)= \int_{0}^{t}\sigma S^{p}(\tau) \,dB(\tau), $$

known as the local continuous martingale and \(M(0)=0\).

Consider the quadratic function

$$ g(z)=\beta z-\frac{\sigma^{2}}{2}z^{2},\quad z\in \biggl[0, \biggl(\frac{\varLambda }{\mu} \biggr)^{p} \biggr]. $$
(9)

It is easy to verify that when \(\sigma^{2}>\sigma_{1}=\beta (\frac{\mu }{\varLambda} )^{p}\), \(g(z)\) reaches its maximum value \(g_{\max}=\frac {\beta^{2}}{2\sigma^{2}}\) at \(z = \frac{\beta}{\sigma^{2}}\); and when \(\sigma ^{2}<\sigma_{1}\), its maximum value \(g_{\max}= (\frac{\varLambda}{\mu} )^{p} (\beta-\frac{\sigma^{2}}{2} (\frac{\varLambda}{\mu} )^{p} )\) is achieved at \(z = (\frac{\varLambda}{\mu} )^{p}\). Then in (8), noticing \(S^{p}(t)\in [0, (\frac {\varLambda}{\mu} )^{p} ]\), we have two cases to discuss, depending on whether \(\sigma^{2}>\beta (\frac{\mu}{\varLambda} )^{p}\).

Case I: \(\sigma^{2}>\beta (\frac{\mu}{\varLambda} )^{p}\). In this case, we can easily see from (8) that

$$ \ln I(t)\leq\frac{\beta^{2}}{2\sigma^{2}}t-(\mu+\alpha+r)t+M(t)+\ln I(0). $$
(10)

Dividing both sides of (10) by \(t>0\), we have

$$ \frac{\ln I(t)}{t}\leq- \biggl(\mu+\alpha+\gamma - \frac{\beta ^{2}}{2\sigma^{2}} \biggr)+\frac{M(t)}{t}+\frac{\ln I(0)}{t} $$
(11)

and by Lemma 2.4, we have

$$\lim_{t\rightarrow+\infty}\frac{M(t)}{t}=0 $$

almost surely. Then taking the limit superior on both sides of (11) leads to

$$ \limsup_{t\rightarrow+\infty}\frac{\ln I(t)}{t}\leq- \biggl(\mu+\alpha + \gamma-\frac{\beta^{2}}{2\sigma^{2}} \biggr)< 0 $$

almost surely, when \(\sigma^{2}>\max\{\beta (\frac{\mu}{\varLambda} )^{p},\frac{\beta^{2}}{2(\mu+\alpha+\gamma)}\}\), which implies \(\lim_{t\rightarrow+\infty}I(t)=0\).

Case II: \(\sigma^{2}<\beta (\frac{\mu}{\varLambda} )^{p}\). In this case, we can similarly have

$$\begin{aligned} \ln I(t)\leq \biggl(\frac{\varLambda}{\mu} \biggr)^{p} \biggl(\beta-\frac{\sigma ^{2}}{2} \biggl(\frac{\varLambda}{\mu} \biggr)^{p} \biggr)t-(\mu+\alpha+r)t +M(t)+\ln I(0). \end{aligned}$$
(12)

Dividing both sides of (12) by \(t>0\), we have

$$\begin{aligned} \frac{\ln I(t)}{t}&\leq(\mu+\alpha+\gamma) \biggl[ \frac{\beta (\frac {\varLambda}{\mu} )^{p}}{\mu+\alpha+r}-\frac{\sigma^{2} (\frac {\varLambda}{\mu} )^{2p}}{2(\mu+\alpha+\gamma)}-1 \biggr]+\frac {M(t)}{t}+ \frac{\ln I(0)}{t}. \end{aligned}$$
(13)

Taking the superior limit on both sides of (13) leads to

$$ \limsup_{t\rightarrow+\infty}\frac{\ln I(t)}{t}\leq(\mu+\alpha+\gamma ) \bigl( \mathcal{R}^{*}-1\bigr) $$

almost surely. Then when \(\mathcal{R}^{*}<1\), we have

$$ \limsup_{t\rightarrow+\infty}\frac{\ln I(t)}{t}< 0 $$

almost surely, which implies \(\lim_{t\rightarrow+\infty}I(t)=0\).

Next, we prove the last conclusion. Given \(0<\varepsilon\ll1\), since \(\lim_{t\rightarrow+\infty}I(t)=0\), we have \(0< I(t)<\varepsilon\) for t large enough. By the first equation of system (3), we have

$$ \frac{dS(t)}{dt}\geq\varLambda- \biggl(\mu+ \beta \biggl(\frac{\varLambda}{\mu} \biggr)^{p-1}\varepsilon+\sigma \biggl(\frac{\varLambda}{\mu} \biggr)^{p-1}\varepsilon\bigl|\dot{B}(t)\bigr| \biggr)S(t). $$

Then when \(\varepsilon\rightarrow0\) we have

$$ \liminf_{t\rightarrow+\infty}S(t)\geq\frac{\varLambda}{\mu} $$
(14)

almost surely. On the other hand from Remark 2.1, we have

$$\limsup_{t\rightarrow+\infty}S(t)\leq\frac{\varLambda}{\mu}+\varepsilon $$

almost surely. Let \(\varepsilon\rightarrow0\). Then one has

$$ \limsup_{t\rightarrow+\infty}S(t)\leq\frac{\varLambda}{\mu} $$
(15)

almost surely. From (14) and (15), we have

$$ \lim_{t\rightarrow+\infty}S(t)=\frac{\varLambda}{\mu} $$

almost surely. This completes the proof of Theorem 4.1. □

Remark 4.1

Theorem 4.1 shows that when \(\mathcal{R}^{*}<1\), the infectious disease of system (3) dies out almost surely, that is to say, large white noise stochastic disturbance can lead to epidemic extinction.

Remark 4.2

Note that \(\mathcal{R}^{*}=\mathcal{R}-\frac{\sigma^{2} (\frac{\varLambda }{\mu} )^{2p}}{2(\mu+\alpha+\gamma)}\). Obviously, \(\mathcal{R}<1\) leads to \(\mathcal{R}^{*}<1\), while the other side is not true. This implies that the condition for \(I(t)\) going to extinction in the deterministic system is stronger than its stochastic counterpart due to the effect of the white noise disturbance.

4.2 Permanence in mean

Integrating from 0 to t and dividing by t on both sides of system (3) yields

$$\begin{aligned} \varTheta(t)&= \frac{S(t)-S(0)}{t}+\frac{I(t)-I(0)}{t} \\ &=\varLambda-\mu \bigl\langle S(t)\bigr\rangle -(\mu+\alpha)\bigl\langle I(t)\bigr\rangle . \end{aligned}$$

Then one can get

$$ \bigl\langle S(t)\bigr\rangle = \frac{\varLambda}{\mu}-\frac{\mu+\alpha}{\mu }\bigl\langle I(t)\bigr\rangle - \frac{\varTheta(t)}{\mu}. $$

Applying Itô’s formula gives

$$\begin{aligned} d \bigl(\ln I(t) \bigr)&= \biggl[\beta S^{p}(t)-(\mu+\alpha+\gamma )-\frac{\sigma^{2}}{2}S^{2p}(t) \biggr]\,dt+\sigma S^{p}(t)\, {d}B(t) \\ &\geq \biggl[\beta S^{p}(t)-(\mu+\alpha+\gamma)-\frac{\sigma^{2}}{2} \biggl(\frac{\varLambda}{\mu} \biggr)^{2p} \biggr]\,dt+\sigma S^{p}(t)\,dB(t). \end{aligned}$$
(16)

Integrating from 0 to t and dividing by t on both sides of (16) yields

$$\begin{aligned} \frac{\ln I(t)-\ln I(0)}{t}\geq\beta\frac{1}{t} \int_{0}^{t} S^{p}(\theta )\,d \theta- \biggl[(\mu+\alpha+\gamma)+\frac{\sigma^{2}}{2} \biggl(\frac{\varLambda}{\mu} \biggr)^{2p} \biggr]+\frac{M(t)}{t}, \end{aligned}$$
(17)

by using Hölder’s inequality, we have

$$\begin{aligned} \frac{\ln I(t)-\ln I(0)}{t}\geq{}&\beta \bigl(\bigl\langle S(t)\bigr\rangle \bigr)^{p}- \biggl[(\mu+\alpha+\gamma)+\frac{\sigma^{2}}{2} \biggl( \frac{\varLambda }{\mu} \biggr)^{2p} \biggr]+\frac{M(t)}{t} \\ ={}&\beta \biggl(\frac{\varLambda}{\mu}-\frac{\mu+\alpha}{\mu}\bigl\langle I(t)\bigr\rangle - \frac{\varTheta(t)}{\mu} \biggr)^{p} \\ &{}- \biggl[(\mu+\alpha+\gamma )+\frac{\sigma^{2}}{2} \biggl(\frac{\varLambda}{\mu} \biggr)^{2p} \biggr]+\frac{M(t)}{t}. \end{aligned}$$
(18)

In (18), let \(a=\frac{\varLambda}{\mu}-\frac{\varTheta(t)}{\mu}, b=\frac{\mu+\alpha}{\mu}\langle I(t)\rangle\). Then according the number p being odd, there are two cases we should discuss.

Case I. When p is an even number, let \(p=2n, n\in N\). Then we have

$$\begin{aligned} (a-b)^{p} \geq{}& \begin{pmatrix}p\\ 0\end{pmatrix} a^{p}+ \begin{pmatrix} p\\1\end{pmatrix}a^{p-1}(-b)+ \begin{pmatrix}p\\3\end{pmatrix}a^{p-3}(-b)^{3}+\cdots + \begin{pmatrix}p\\{p-1}\end{pmatrix}a(-b)^{p-1} \\ \geq{}& \begin{pmatrix}p\\0\end{pmatrix}a^{p}+ \begin{pmatrix}p\\1\end{pmatrix} a^{p-1}(-b)+ \begin{pmatrix}p\\3\end{pmatrix}a^{p-3}(-b) \biggl(\frac{\varLambda (\mu+\alpha)}{\mu^{2} } \biggr)^{2} \\ &{}+ \begin{pmatrix}p\\5\end{pmatrix} a^{p-5}(-b) \biggl( \frac{\varLambda (\mu+\alpha)}{\mu^{2} } \biggr)^{4} +\cdots+ \begin{pmatrix} p\\{p-1}\end{pmatrix}a(-b) \biggl(\frac{\varLambda(\mu+\alpha)}{\mu^{2} } \biggr)^{p-2} \\ ={}&a^{p}-b \biggl[ \begin{pmatrix}p\\1\end{pmatrix}a^{p-1}+ \begin{pmatrix}p\\3\end{pmatrix}a^{p-3} \biggl(\frac{\varLambda(\mu+\alpha )}{\mu^{2} } \biggr)^{2}+\cdots \\ &{}+ \begin{pmatrix}p\\{p-1}\end{pmatrix}a \biggl(\frac{\varLambda(\mu+\alpha )}{\mu^{2} } \biggr)^{p-2} \biggr] \\ ={}&a^{p}-b\sum_{i=1,3,\ldots}^{p-1} \begin{pmatrix}p\\i\end{pmatrix}a^{p-i} \biggl( \frac{\varLambda(\mu +\alpha)}{\mu^{2} } \biggr)^{i-1} \\ ={}&a^{p}-b\sum_{k=0}^{p-1} \begin{pmatrix}p\\{2k+1}\end{pmatrix} a^{p-(2k+1)} \biggl( \frac{\varLambda(\mu +\alpha)}{\mu^{2} } \biggr)^{2k} \\ ={}&a^{p}-\frac{\mu+\alpha}{\mu}\sum_{k=0}^{p-1} \begin{pmatrix}p\\{2k+1}\end{pmatrix} a^{p-(2k+1)} \biggl( \frac{\varLambda(\mu+\alpha)}{\mu^{2} } \biggr)^{2k}\bigl\langle I(t)\bigr\rangle \\ \triangleq{}& a^{p}-\Delta_{1}\bigl\langle I(t)\bigr\rangle . \end{aligned}$$

Case II. When p is an odd number, let \(p=2n-1,n\in N\). Then

$$\begin{aligned} (a-b)^{p} \geq{}& \begin{pmatrix}p\\0\end{pmatrix}a^{p}+ \begin{pmatrix}p\\1\end{pmatrix}a^{p-1}(-b)+ \begin{pmatrix}p\\3\end{pmatrix} a^{p-3}(-b)^{3}+\cdots\\ &{}+ \begin{pmatrix}p\\{p-2}\end{pmatrix} a^{2}(-b)^{p-2}+C_{p}^{p}(-b)^{p} \\ \geq{}& \begin{pmatrix}p\\0\end{pmatrix} a^{p}+ \begin{pmatrix}p\\1\end{pmatrix} a^{p-1}(-b)+ \begin{pmatrix}p\\3\end{pmatrix} a^{p-3}(-b) \biggl(\frac{\varLambda (\mu+\alpha)}{\mu^{2} } \biggr)^{2} \\ &{}+ \begin{pmatrix}p\\5\end{pmatrix} (-b) \biggl(\frac{\varLambda(\mu +\alpha)}{\mu^{2} } \biggr)^{4}+\cdots+ \begin{pmatrix}p\\{p-2}\end{pmatrix}a^{2}(-b) \biggl(\frac{\varLambda (\mu+\alpha)}{\mu^{2} } \biggr)^{p-3} \\ &{}+ \begin{pmatrix}p\\p\end{pmatrix}(-b) \biggl(\frac{\varLambda(\mu+\alpha)}{\mu^{2} } \biggr)^{p-1} \\ ={}&a^{p}-b \biggl[ \begin{pmatrix}p\\1\end{pmatrix}a^{p-1}+ \begin{pmatrix}p\\3\end{pmatrix}a^{p-3} \biggl(\frac{\varLambda(\mu+\alpha )}{\mu^{2} } \biggr)^{2}+\cdots\\ &{}+ \begin{pmatrix}p\\{p-2}\end{pmatrix}a^{2} \biggl(\frac{\varLambda(\mu+\alpha )}{\mu^{2} } \biggr)^{p-3} + \begin{pmatrix}p\\{p}\end{pmatrix} \biggl(\frac{\varLambda (\mu+\alpha)}{\mu^{2} } \biggr)^{p-1} \biggr] \\ ={}&a^{p}-b\sum_{i=1,3}^{p} \begin{pmatrix}p\\i\end{pmatrix} a^{p-i} \biggl( \frac{\varLambda(\mu+\alpha )}{\mu^{2} } \biggr)^{i-1} \\ ={}&a^{p}-b\sum_{k=0}^{p} \begin{pmatrix}p\\{2k+1}\end{pmatrix} a^{p-(2k+1)} \biggl( \frac{\varLambda(\mu +\alpha)}{\mu^{2} } \biggr)^{2k} \\ ={}&a^{p}-\frac{\mu+\alpha}{\mu}\sum_{k=0}^{p} \begin{pmatrix}p\\{2k+1}\end{pmatrix} a^{p-(2k+1)} \biggl( \frac{\varLambda(\mu+\alpha)}{\mu^{2} } \biggr)^{2k}\bigl\langle I(t)\bigr\rangle \\ \triangleq{}& a^{p}-\Delta_{2}\bigl\langle I(t)\bigr\rangle , \end{aligned}$$

where \(b=\frac{\mu+\alpha}{\mu}\langle I(t)\rangle<\frac{\varLambda(\mu +\alpha)}{\mu^{2} }\) is used. Then we have

$$\begin{aligned} \frac{\ln I(t)-\ln I(0)}{t}\geq{}&\beta \biggl(\frac{\varLambda}{\mu}- \frac {\varTheta(t)}{\mu} \biggr)^{p}- \biggl((\mu+\alpha+\gamma)+ \frac{\sigma ^{2}}{2} \biggl(\frac{\varLambda}{\mu} \biggr)^{2p} \biggr) \\ &{}- \beta\Delta_{i}\bigl\langle I(t)\bigr\rangle +\frac{M(t)}{t}, \end{aligned}$$
(19)

where

$$ i= \textstyle\begin{cases} 1& \text{if } p \text{ is even,} \\ 2& \text{if } p \text{ is odd.} \end{cases} $$

The inequality (19) can be rewritten as

$$\begin{aligned} \bigl\langle I(t)\bigr\rangle \geq{}&\frac{1}{ \beta\Delta_{i}} \biggl[ \beta \biggl(\frac{\varLambda}{\mu}-\frac{\varTheta(t)}{\mu} \biggr)^{p}- \biggl(( \mu+\alpha+\gamma)+\frac{\sigma^{2}}{2} \biggl(\frac{\varLambda}{\mu } \biggr)^{2p} \biggr) \biggr] \\ &{}-\frac{1}{ \beta\Delta_{i}} \biggl[\frac{\ln I(t)-\ln I(0)}{t}-\frac {M(t)}{t} \biggr]. \end{aligned}$$
(20)

By Lemma 2.4, we get \(\lim_{t\rightarrow+\infty}\frac{M(t)}{t}=0\) almost surely. According to Remark 2.1, one sees that \(I(t)\leq\frac{\varLambda }{\mu}\). Thus one has \(\lim_{t\rightarrow+\infty}\frac{I(t)}{t}=0\) and \(\lim_{t\rightarrow +\infty}\frac{\ln I(t)}{t}=0\) almost surely as \(I(t)\geq1\), and \(\lim_{t\rightarrow+\infty}\varTheta(t)=0\) almost surely. Taking the inferior limit of both sides of (20) yields

$$ \liminf_{t\rightarrow+\infty}\bigl\langle I(t)\bigr\rangle \geq { \frac{\mu+\alpha+\gamma}{ \beta\Delta_{i}}\bigl(\mathcal{R}^{*}-1 \bigr). } $$

Let \(\Delta=\max\{\Delta_{i}, i=1,2\}\). We have

$$ \liminf_{t\rightarrow+\infty}\bigl\langle I(t)\bigr\rangle \geq \frac{\mu+\alpha +\gamma}{ \beta\Delta}\bigl(\mathcal{R}^{*}-1\bigr). $$

Thus, we get the permanence theorem as follows.

Theorem 4.2

If \(\mathcal{R}^{*}>1\), then the infectious disease I is permanent in mean, moreover, I satisfies

$$ \liminf_{t\rightarrow+\infty}\bigl\langle I(t)\bigr\rangle \geq \frac{\mu+\alpha +\gamma}{\beta\Delta}\bigl(\mathcal{R}^{*}-1\bigr). $$

Remark 4.3

Theorem 4.1 and Theorem 4.2 show that the condition for the disease to go to extinction or permanence strongly depends on the intensity of white noise disturbances. And small white noise disturbances will be beneficial to long-term prevalence of the disease, conversely, large white noise disturbances may cause the epidemic disease to die out.

5 Numerical simulation

In the following, by employing the Euler Maruyama (EM) method [63, 65], we make some numerical simulations to illustrate the extinction and persistence of the diseases in stochastic system and corresponding deterministic system for comparison.

We set parameters as \(\mu=0.2, \beta=0.5, \gamma=0.79, \alpha=0.4, p=2\), in system (1). Then we obtain

$$ \textstyle\begin{cases} { \dot{S}(t)=\varLambda-0.2 S(t)- 0.5 S^{2}(t)I(t)+0.79 I(t)},\\ {\dot{I}(t)=0.5 S^{2}(t)I(t)-1.39I(t).} \end{cases} $$
(21)

If \(\varLambda=0.25\), by simple calculation, we have \(\mathcal {R}=0.5621<1\), then according to Theorem 3.1, system (21) has a unique stable ‘diseases-extinction’ equilibrium point \(E_{0}(1.25,0)\), which implies the disease goes to extinction (see Fig. 1). If \(\varLambda=0.45\), we have \(\mathcal {R}=1.8210>1\). Then according Theorem 3.1, system (21) has a unique stable positive equilibrium \(E^{*}(1.6673, 0.1942)\), which implies that the disease of system (21) is permanent (see Fig. 2).

Figure 1
figure 1

Time series for \(S(t),I(t)\) of deterministic system, with \(\varLambda=0.25,\mu=0.2, \beta=0.5,\gamma=0.79,\alpha=0.4, p=2\), where \(\mathcal{R}=0.5621<1\)

Figure 2
figure 2

Time series for the paths \(S(t),I(t)\) for deterministic system with \(\varLambda=0.45,\mu=0.2, \beta=0.5,\gamma=0.79,\alpha=0.4, p=2\), where \(\mathcal{R}=1.8210>1\)

Next, to show the effect of stochastic perturbation on the spread of the disease, based on the deterministic system with persistent disease (see Fig. 2), we consider the stochastic system as follows:

$$ \textstyle\begin{cases} { \dot{S}(t)=\varLambda-0.2 S(t)- 0.5 S^{2}(t)I(t)+0.79 I(t)-\sigma S^{2}(t)I(t)\,dB(t)},\\ {\dot{I}(t)=0.5 S^{2}(t)I(t)-1.39I(t)+\sigma S^{2}(t)I(t)\,dB(t).} \end{cases} $$
(22)

First, we let \(\sigma=0.4\), the condition I in Theorem 4.1 is satisfied, the by Theorem 4.1, the disease will die out under a large white noise perturbation (see Fig. 3). If let \(\sigma =0.3\), by a direct calculation, we get \(\mathcal{R}^{*}=0.9913<1\), obviously, the condition II in Theorem 4.1 is satisfied, the by Theorem 4.1, the disease will die out under a large white noise perturbation (see Fig. 4). While if we let \(\sigma=0.1\), by a direct calculation, we have \(\mathcal {R}^{*}=1.7289>1\), then the disease is persistent by Theorem 4.2 (see Fig. 5). Moreover, Fig. 5 shows the solution of the stochastic system oscillate around the positive equilibrium of the deterministic system.

Figure 3
figure 3

Comparison illustration for \(S(t),I(t)\) of deterministic system and stochastic system with \(\sigma=0.4, \varLambda=0.45, \mu=0.2, \beta=0.5,\gamma=0.79,\alpha=0.4, p=2\), where \(\mathcal {R}=1.8210>1\)

Figure 4
figure 4

Comparison illustration for \(S(t),I(t)\) of deterministic system and stochastic system with \(\sigma=0.3, \varLambda=0.45,\mu=0.2, \beta=0.5, \gamma=0.79,\alpha=0.4, p=2\), where \(\mathcal{R}=1.8210>1, \mathcal {R}^{*}=0.9913<1\)

Figure 5
figure 5

Comparison illustration for \(S(t),I(t)\) of deterministic system and stochastic system with \(\sigma=0.1, \varLambda=0.45,\mu=0.2, \beta=0.5,\gamma=0.79, \alpha=0.4, p=2\), where \(\mathcal{R}=1.8210>1, \mathcal{R}^{*}=1.7289>1\)

6 Conclusion

The aim of this paper is to make contributions to understand the dynamics of SIS epidemic models with nonlinear incidence rate. First, we expand a deterministic SIS epidemic model by introducing the extra mortality. For the modified system, by analyzing the stability of equilibria, we define a threshold which determines the extinction and permanence of the epidemic disease. Second, we establish a stochastic system by introducing the white noise disturbance into the deterministic system. For the stochastic system, we define a new threshold associated with its deterministic counterpart and analyze the dynamics of the system based on the new threshold by using the theory of stochastic differential equations. Our results show that there exists a significant difference of threshold of the stochastic system from its deterministic counterpart. The difference caused by the introduction of stochastic white noise makes the extinction conditions of the diseases in the stochastic system are weaker than that of the corresponding deterministic model. However, in the present model, the nonlinear incidence rate takes the form \(\beta S^{p}(t)I(t)\), which is a special case of the nonlinear incidence rate \(\beta S^{p}(t)I^{q}(t)\) with \(q=1, p\in N\), for the more general case \(p, q\in R^{+}\), we do not give an effective analysis method at present. The analysis of this scheme in such case is left to our further study.