1 Introduction

Epidemiology is a part of mathematical biology which mainly associates with the study of infectious and vector-borne disease and its control strategies in a human population. To study the influence and effect of the communicable disease, researchers often used mathematical models which lead a significant role in theory of epidemiology [1, 2]. Mathematical models are used to provide understanding the dynamical behavior of the disease and to improve plans for eradicating the infection. In epidemiology, first known mathematical model is invented and worked by Daniel Bernoulli [3]. In the middle of twentieth century, mathematical epidemiology strode widely around the world [4,5,6]. The father of modern mathematical epidemiology, Ronald Ross, derived a threshold quantity, called as basic reproduction number, which helps the planners to conclude when the infection will eliminate. In 1927, Kermack and McKendrick wielded mathematical epidemiology to a new level by his contribution for infectious disease [7]. Later on, researchers formulate various mathematical models for developing strategies to control disease outbreak and making a trade-offs in choosing a best possible treatment for seizing the infection [8,9,10].

In mathematical modeling, recruitment rate of susceptible population, incidence rate of disease transmission and recovery rate have been playing a crucial role in deterministic approach of mathematical epidemiology. Mainly two types of recruitment rates are considered to formulate mathematical models. Constant birth rate has been taken into consideration for discussing compartmental model by the authors in [11, 12], which is not realistic in many certain cases. Under this circumstances, logistic growth rate of susceptible has been used to make mathematical models more realistic in case of limited resources and space capacities, disease with long lasting disease [13, 14].

Incidence rate has a vital role for discussing nature of a disease in theory of epidemiology. In early, bilinear incidence rate in the form \(\beta x y\) and standard incidence rate in the form \(\dfrac{\beta x y}{N}\) are used in epidemic model where \(\beta \) is the rate of disease transmission per contact, x and y denote number of susceptible population and number of infected population respectively, and N is the total number of population [15, 16]. In such cases, epidemic model has relatively simple dynamical behavior that means model does not have any periodic oscillations and condition for persistence and eradication of the infection can be determined simply by calculating basic reproduction number \(\left( R_{0}\right) \). The authors established that if \(R_{0}\) < 1, then disease will eliminate and otherwise disease will permanent [17]. But these types of incidence rates are not realistic as they do not include social awareness of susceptible population and crowding effect of infected population. To get rich dynamical phenomenon of the infectious disease, many authors studied nonlinear types of incidence rate [18,19,20,21]. To avoid unboundedness of infection, Capasso and Serio [22], Anderson and May [23] introduced saturated types of infection rate in the form \(\dfrac{\beta x y }{1 + \alpha y}\) where \(\beta y\) denotes force of infection and \(\dfrac{1}{1 + \alpha y}\) measures inhibition factors due to crowding effect of infected class and also for social awareness of the population, \(\alpha \) is inhibition coefficient. Later on, to get rich dynamics of infectious disease that means to get periodic oscillations, multiple peaks of infection, Ruan and Xiao [24] introduced a non-monotone type of infection rate in the form \(\dfrac{\beta x y}{1+ \alpha y^{2}}\) to include psychological effect of population. The function \(\dfrac{y}{1 + \alpha y^2}\) has maximum value at \(y = \dfrac{1}{ \sqrt{\alpha }}\), it is increasing first upto \(y = \dfrac{1}{ \sqrt{\alpha }}\), then decreases to zero as y goes to infinity. Here, initially infection rate increases due to lack of knowledge about the disease, but taking appropriate awareness and protection measures, incidence rate gradually diminishes by avoiding unnecessary contacts [10, 25]. Thereafter, many researchers used general types of nonlinear incidence rate for describing critical situation of disease spreading [21, 26].

Spreading of the disease and influence of the infection can be controlled by taking proper treatment and giving suitable vaccine to infected class. Usually, in classical epidemic model, treatment function of infection is taken as proportional to number of infectives, i.e., every infected should take up treatment for a curable disease. But there is a maximum capacity of medical resources for treating against the disease. So, for largely populated countries, this type of recovered rate is unusable for treatment of the disease. Therefore, saturated types of treatment functions are more realistic as they tend to a finite limit when infected populations increase. In order to include limited number of medical resources such as hospital facilities, medicines, doctors etc., Wang and Ruan [27] considered constant treatment function \(T\left( y \right) \) into epidemic model where

$$\begin{aligned} T\left( y \right) = {\left\{ \begin{array}{ll} r,&{}\quad y>0 \\ 0, &{}\quad y=0, \end{array}\right. } \end{aligned}$$

where a constant capacity of medical resources is used to cure from the infection. They studied that model experiences Hopf bifurcation, saddle-node bifurcation, Bogdanov–Takens bifurcation varying two parameters of the system. Later on, Wang [28] introduced a piecewise treatment function which is proportional to the number of infected before reaching its maximum capacity, by

where \(k y_{0}\) is maximum number of medical resources available for curing the disease. In this model, authors established that model undergoes with backward bifurcation and also observed that basic reproduction number \(\left( R_{0}\right) \) is not remain as a threshold quantity. The authors in [14, 27, 29, 30] considered treatment function as

$$\begin{aligned} h(y) = \dfrac{r y}{1+ \alpha y} \end{aligned}$$

for discussing the limited number of available medical facilities. Above function h(y) has maximum available limit \(\dfrac{r}{\alpha }\) as medical treatment. For a large number of population, treatment may be delayed to receive infected class. This function includes the parameter \(\alpha \) as a extent of effect of infective as delaying the treatment [31].

Our objective is to reduce spread of infection and eradicate disease from the community. Therefore, we have to find an optimal control strategy which is suitable for minimizing the invade of infection, i.e., we should find an optimal value of applied controls to reduce the infection. The aim of applying optimal control is to reduce the density of infected population and minimize control cost [32]. Sometimes, researchers used only vaccination to reduce the infection [33, 34], whereas researchers considered only treatment function to control the invade of infection [35]. Further both treatment and vaccination used to minimize the spread of infection [13, 36]. In this paper, we consider both vaccination and treatment to control the spread of the disease.

The motivation behind this paper is the extension of the work [19, 24]. In [19], authors proposed an epidemic model having constant birth rate of susceptible population with saturated incidence rate and saturated treatment rate. They studied biological properties of the considered model and also proved local as well as global stability of different equilibrium points. They showed model system exhibited through backward bifurcation; moreover, they studied optimal control of the proposed model. In [24], authors considered an epidemic model having constant birth rate of susceptible population with non-monotone incidence rate. They showed stability analysis of equilibrium points. In both papers, authors gave less attention to study critical dynamical behavior like Hopf bifurcation, Bogdanov–Takens bifurcation, etc. In this paper, we have considered an epidemic model having constant birth rate of susceptible population with non-monotone incidence rate and saturated treatment rate. After model formulation, we are interested to examine more critical dynamical behavior of the model such as transcritical bifurcation, saddle-node bifurcation, Hopf bifurcation, Bogdanov–Takens bifurcation, etc. Moreover, we try to find an optimal paths of applied controls to reduce the spread of infection by studying optimal control theory.

Organization of this paper: In Sect. 2, we formulate model and discuss basic properties of the model system such as positivity and boundedness of the solution of the model system. In Sect. 3, we examine different types of equilibrium points and also find basic reproduction number \((R_{0})\). In Sect. 4, we examine local as well as global stability of the disease-free equilibrium point \((R_{0})\) and also endemic equilibrium point \((E^{*})\). In Sect. 5, bifurcation of co-dimension 1, namely transcritical bifurcation, backward bifurcation, saddle-node bifurcation, Hopf bifurcation and bifurcation of co-dimension 2, namely Bogdanov–Takens bifurcation of co-dimension 2 are investigated. In Sect. 6, numerical simulations are performed to check the reliability of theoretical findings. Sensitivity analysis is done to identify the most influential model parameters which have most affect on basic reproduction number \(R_0\) in Sect. 7. Optimal control problem and efficiency analysis is performed in Sect. 8 and 9. Finally, critical dynamical behavior of the model system, and its biological significance are discussed in Sect. 10.

2 Model formulation

Let the total populations are classified into three compartments, namely susceptible individuals (x(t)) , infected individuals (y(t)) and recovered individuals (z(t)) at any time t. In this study, we consider an epidemic model having constant birth rate of susceptible class with non-monotone incidence (affected by inhibitory factors, namely social awareness, psychological factors, etc.) in the form \( \dfrac{\beta xy}{1+\alpha y^{2}} \) with vaccination and saturated treatment

$$\begin{aligned} \left( {\text {in the form}}\; \dfrac{au_{2}y}{1+bu_{2}y}\right) . \end{aligned}$$
Table 1 Model parameters and their descriptions

In reality, at first disease spreads rapidly among the population due to their unconsciousness. But for psychological effect as well as awareness of susceptible population and also for crowding effect of infected population, transmission rate of the disease starts to decrease after someday. Therefore, considering non-monotone incidence rate is appropriate for describing the spread of any disease. Also there is a maximum capacity for treatment facility. When the number of infected peoples are very small, then every people gets treatment as saturated treatment function shows linear type behavior for this case. But for huge number of infected peoples, the saturated treatment tends to its maximum capacity. Therefore, in this case, every people does not get treatment, which is realistic. For these reasons, we have considered here non-monotone incidence rate and saturated treatment function. Two types of death are considered, one for normal death and another for disease induced death. Some infected peoples are recovered for natural immunity from infected class at a rate \(\mu \). Considering all of above assumptions, the differential equation of the proposed model can be placed in the following form:

$$\begin{aligned} \begin{aligned} \dfrac{{\mathrm{d}}x}{{\mathrm{d}}t}&= A-\dfrac{\beta xy}{1+\alpha y^{2}}-\left( d+u_{1}\right) x \\ \dfrac{{\mathrm{d}}y}{{\mathrm{d}}t}&= \dfrac{\beta xy}{1+\alpha y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y}\\ \dfrac{{\mathrm{d}}z}{{\mathrm{d}}t}&=\dfrac{au_{2}y}{1+bu_{2}y}+\mu y+u_{1}x-dz \end{aligned} \end{aligned}$$
(1)

satisfying \(x(0)\ge 0, y(0)\ge 0, z(0)\ge 0\) with nonnegative model parameters are enlisted in Table 1, and flow diagram of the proposed model is given in Fig. 1.

Fig. 1
figure 1

Flow diagram of the proposed model

We notice that recover class (z) does not appear in the first two equations of the model (1), thus we can omit third equation for analysis purpose of the system and focus only on the following subsystem of (1) which is proposed below:

$$\begin{aligned} \dfrac{{\mathrm{d}}x}{{\mathrm{d}}t}&= A-\dfrac{\beta xy}{1+\alpha y^{2}}-\left( d+u_{1}\right) x \nonumber \\ \dfrac{{\mathrm{d}}y}{{\mathrm{d}}t}&= \dfrac{\beta xy}{1+\alpha y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y} \end{aligned}$$
(2)

First, we show that all the solutions of model (2) with nonnegative initial condition remain nonnegative for \(t\ge 0\).

Theorem 1

All solutions (x(t), y(t)) of model (2) with initial conditions \(x(0)\ge 0, y(0) \ge 0\) are nonnegative for all \(t\ge 0\).

Proof

From first equation of model (2), we get

$$\begin{aligned} \dfrac{{\mathrm{d}}x}{{\mathrm{d}}t} = A-\dfrac{\beta xy}{1+\alpha y^{2}}-\left( d+u_{1}\right) x \end{aligned}$$
(3)

Using the integrating factor, we obtain the solution of the above differential equation in the form:

$$\begin{aligned} x(t)&= \exp \left( -\int _0^t \dfrac{\beta y(s){\mathrm{d}}s}{1+\alpha y^2(s)}+(d+u_{1})t\right) \nonumber \\&\quad \times \int _0^t \left( x(0)+ A\left( \exp \int _0^v \dfrac{\beta y(s){\mathrm{d}}s}{1+\alpha y^2(s)}\right. \right. \nonumber \\&\quad \left. \left. +(d+u_{1})v\right) {\mathrm{d}}v\right) {\mathrm{d}}t, \end{aligned}$$
(4)

i.e., \(x(t)>0\) for all \(t\ge 0\). Similarly, we can show \(y(t)\ge 0\) \(\forall t \ge 0\).

This completes the proof. \(\square \)

In the next theorem, we shall establish the boundedness of the solutions for all \( t \ge 0\).

Theorem 2

Let, M= \(\min \lbrace (d+u_{1}),(d+\mu +\delta )\rbrace \), then the region \(\prod = \left\{ (x,y)\in R_{+}^{2} : x + y \le \dfrac{A}{M}\right\} \) is a positively invariant and attracting set for the model (2).

Proof

First, we assume that \( N(t) = x(t)+ y(t)\).

Adding both equations in (2), we get

$$\begin{aligned}&\dfrac{{\mathrm{d}}N}{{\mathrm{d}}t} = A - (d+u_{1})x - (d+\mu +\delta )y - \dfrac{a u_{2} y}{1+ b u_{2} y}\\&{\text {i.e.,}}\; \dfrac{{\mathrm{d}}N}{{\mathrm{d}}t}\le A - (d+u_{1})x - (d+\mu +\delta )y\\&{\text {i.e.,}}\; \dfrac{{\mathrm{d}}N}{{\mathrm{d}}t}\le A - M(x+y) \\&{\text {i.e.,}}\; \dfrac{{\mathrm{d}}N}{{\mathrm{d}}t} \le A - M N \end{aligned}$$

where \(M= \min \lbrace (d+u_{1}),(d+\mu +\delta )\rbrace \). Integrating and taking limit as \(t\longrightarrow \infty \), we get \(N(t)\le \dfrac{A}{M}\). Thus, we have \(0\le N(t)\le \dfrac{A}{M}\) for sufficiently large t. So, all solutions of model (2) are bounded and enter the region \(\prod \). \(\square \)

In the next section, we shall describe the conditions for existence of different equilibrium points.

3 Existence of equilibria and basic reproduction number

The equilibria of the model (2) are solutions of the following equations:

$$\begin{aligned}&A-\dfrac{\beta xy}{1+\alpha y^{2}}-\left( d+u_{1}\right) x=0 \nonumber \\&\dfrac{\beta xy}{1+\alpha y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y}=0 \end{aligned}$$
(5)

It is clear from system (5) that the model (2) always has a disease-free equilibrium

$$\begin{aligned} E_{0} = \ \left( \dfrac{A}{d+u_{1}},0\right) . \end{aligned}$$

As the disease-free equilibrium \(E_{0}\) exist, the basic reproduction number \((R_{0})\) of the model (2) exists. In the following theorem, we compute basic reproduction number of the proposed model using next-generation matrix method [37].

Theorem 3

Basic reproduction number of the proposed model system (2) is given by

$$\begin{aligned} R_{0} = \dfrac{A \beta }{(d+u_{1})(d+\mu +\delta +a u_{2})}. \end{aligned}$$

Proof

Basic reproduction number plays a crucial role in the disease spreading dynamics. The number of secondary infections produced by a single infected individual in its entire life span as an infectious host is defined as basic reproduction number. Based on the values of basic reproduction number, epidemiologist can predict when the disease persists or eradicates from the population. Now, we find basic reproduction number of the proposed model using next-generation matrix approach [37].

In next-generation matrix approach, we consider two matrices \({\mathscr {F}}\) and \({\mathscr {V}}\) with containing newly infected terms and the remaining terms, respectively. Therefore, \({\mathscr {F}} = \Bigg ( \dfrac{\beta x y}{1 + \alpha y^2}\Bigg )\) and \({\mathscr {V}} = \Bigg ((d + \mu + \delta )y + \dfrac{a u_2 y}{1 + b u_2 y}\Bigg )\). Differentiating \({\mathscr {F}}\) and \({\mathscr {V}}\) with respect to infected component y at disease-free equilibrium point \(E_0\left( \dfrac{A}{d+u_{1}},0\right) \), we get F = \(\Bigg ( \dfrac{\beta A }{d + u_1}\Bigg )\) and V= \(\Bigg ((d + \mu + \delta + a u_2)\Bigg )\). Now, next-generation matrix \({\mathscr {K}}\) is given by \({\mathscr {K}} = F V^{-1}.\) Basic reproduction number is the spectral radius of the next-generation matrix \({\mathscr {K}}\). Here, \({\mathscr {K}} = F V^{-1} = \Bigg ( \dfrac{A \beta }{(d+u_{1})(d+\mu +\delta +a u_{2})}\Bigg ).\)

Therefore, basic reproduction number of the proposed model system (2) is given by \(R_{0} = \dfrac{A \beta }{(d+u_{1})(d+\mu +\delta +a u_{2})}\). \(\square \)

The endemic equilibria of the considered model system is \(E(x^*,y^*)\) where

$$\begin{aligned} x^{*}=\dfrac{ A - \left\{ (d+\mu +\delta ) + \dfrac{a u_{2} }{1+ b u_{2}y^{*}}\right\} y^{*}}{(d+u_{1})}, \end{aligned}$$
(6)

and \(y^{*}\) is the positive root of the cubic equation

$$\begin{aligned} Q(y)\equiv A_{3} y^{3} + A_{2} y^{2} +A_{1} y+ A_{0} = 0 \end{aligned}$$
(7)

where \( A_{3} = \alpha b u_{2}(d+ u _{1})(d + \mu + \delta ), \ A_{2} = \alpha (d+ u _{1})\) \((d + \mu + \delta + a u_{2})+ b \beta u_{2}(d + \mu + \delta ) A_{1} = \beta (d + \mu + \delta + a u_{2}) + b u_{2}[(d+ u _{1})(d + \mu + \delta ) - A \beta ] ,A_{0} = (d+ u _{1})(d + \mu + \delta + a u_{2})(1 - R_{0})\).

Discriminant [38] of cubic equation (7) is

$$\begin{aligned} \bigtriangleup&= A_{1}^2 A_{2}^2 - 4 A_{0} A_{2}^3 - 4 A_{1}^3 A_{3} +18 A_{0} A_{1} A_{2} A_{3} \nonumber \\&\quad - 27 A_{0}^2 A_{3}^2 \end{aligned}$$
(8)

Letting \(B_{1} = A_{2}^2 - 3 A_{1} A_{3}, B_{2} = A_{1} A_{2} - 9 A_{0} A_{3} \) and \(B_{3} = A_{1}^2 - 3 A_{0} A_{2}\), the expression of (\( \bigtriangleup \)) can be written in the form:

$$\begin{aligned} \bigtriangleup = \left( - \dfrac{1}{3}\right) \bigtriangleup _{0} \end{aligned}$$
(9)

where

$$\begin{aligned} \bigtriangleup _{0} = B_{2}^2 - 4 B_{1} B_{3} \end{aligned}$$
(10)

Here, coefficients \(A_{2}\) and \(A_{3}\) are always positive and sign of \(A_{0}\) depends on \(R_{0}\) in (7). So, we have the following cases for positive roots of Q(y) = 0 :

Case-1 \(R_{0}\) > 1, then \(Q(0) = A_{0}<\) 0; in this case, unique positive solution of \( Q(y) = 0 \) exists, regardless of the sign of \(Q^{'}(0)\) = \(A_{1}\).

Case-2 \(R_{0}\) < 1, in this case, \( Q(0) = A_{0}>\) 0.

  1. (i)

    If \(Q^{'}(0)\) = \(A_{1}>\) 0, the equation \( Q(y) = 0 \) has no positive solution.

  2. (ii)

    If \(Q^{'}(0)\) = \(A_{1}\) < 0, the following cases arise,

    1. (a)

      \(\bigtriangleup _{0} < 0\), then \(Q (y) = 0\) has two positive solutions.

    2. (b)

      \(\bigtriangleup _{0} = 0\), there is unique positive solution of the equation \(Q(y) = 0 \).

    3. (c)

      \(\bigtriangleup _{0} > 0\), we get no positive solution.

Summarizing the above discussions, we get following theorem.

Theorem 4

  1. (1)

    Model system (2) always has a disease-free equilibrium point \(E_{0}\).

  2. (2)

    When \(R_{0}\) > 1, model system (2) has a unique endemic equilibrium.

  3. (3)

    When \(R_{0}\) < 1 and

    1. (a)

      \(A_{1} >0\), model system (2) has no endemic equilibrium.

    2. (b)

      \(A_{1} <0\) and also \(\bigtriangleup _{0} < 0\), there exist two endemic equilibria \(E_{1}(x_{1}^{*} , y_{1}^{*})\) and \(E_{2}(x_{2}^{*} , y_{2}^{*})\).

    3. (c)

      For \(A_{1} <0\) and also \(\bigtriangleup _{0} = 0\), these two endemic equilibria coalesce into the same endemic equilibrium \(E^{*}(x^{*} , y^{*})\).

Otherwise, model system (2) has no endemic equilibrium.

4 Stability analysis of equilibria

For discussing stability of the equilibrium point, we need to compute Jacobian matrix \(J(E(x^*, y^*))\) of model system (2) at any equilibrium point E (\(x^{*}\), \(y^{*}\)). The Jacobian matrix of model (2 ) is given by

$$\begin{aligned}&J(E) \equiv J(E(x^*,y^*))\nonumber \\&\quad = \left( \begin{array}{ccc} - \ \dfrac{\beta y^*}{1+\alpha {y^*}^2}- \left( d+ u_{1}\right) &{} -\dfrac{\beta x^* \left( 1 - \alpha {y^*}^2 \right) }{\left( 1+\alpha {y^*}^2\right) ^2} \\ \dfrac{\beta y^*}{1+\alpha {y^*}^2} &{} \dfrac{\beta x^* \left( 1 - \alpha {y^*}^2 \right) }{\left( 1+\alpha {y^*}^2\right) ^2}-(d+\mu +\delta )-\dfrac{au_{2}}{(1+bu_{2} y^*)^2} \end{array} \right) . \end{aligned}$$
(11)

Next, we shall prove local stability of disease-free equilibrium point for \(R_0 < 1\) in Theorem 5. Then, we shall show global stability of disease-free equilibrium for \(R_0 < 1\) using Lyapunov stability theorem in Theorem 6. Further, we shall discuss local stability of endemic equilibrium for \(R_0 > 1\) in Theorem 7, and next we shall prove global stability of endemic equilibrium using Dulac function criterion for \(R_0 > 1\) in Theorem 8.

First, we discuss local as well as global stability at disease-free equilibrium point \(E_{0}\).

Theorem 5

If \(R_{0}\) < 1, then disease-free equilibrium point \(E_{0}\) is locally asymptotically stable and unstable for \(R_{0}\) > 1. For \(R_0 = 1\), disease-free equilibrium point \(E_0\) is a saddle-node of co-dimension 1 if \(\dfrac{\beta ^2 A}{(d + u_1)^2} \ne ab u_2^2\) and is a semi-hyperbolic attracting node of co-dimension 2 if \(\dfrac{\beta ^2 A}{(d + u_1)^2} = ab u_2^2\), \(\alpha A \ge abu_2^2.\)

Proof

Jacobian matrix of the proposed model (2) at disease-free equilibrium point \(E_0\) is given by

$$\begin{aligned} J(E_0) = \left( \begin{array}{ccc} - \left( d+ u_{1}\right) &{} -\dfrac{\beta A}{d + u_1} \\ 0 &{} (d+ \mu +\delta + au_2)(R_0 - 1) \end{array} \right) . \end{aligned}$$

Therefore, characteristic roots of \(J\left( E_{0}\right) \) are \(- (d+ u_{1})\) and \((d +\mu +\delta +a u_{2})(R_{0} - 1)\). Therefore, for \(R_{0}\) < 1, both eigenvalues are negative, and for \(R_{0}\) > 1, one of them is positive. So, we can conclude that \(E_{0}\) is locally asymptotically stable for \(R_{0}\) < 1 and unstable for \(R_{0}\) > 1.

For \(R_0 = 1\), one eigenvalue of the Jacobian matrix \(J_{E_0}\) is zero; therefore, in this case, disease-free equilibrium point \(E_0\) is non-hyperbolic type, and to examine dynamical behavior of the disease-free equilibrium point \(E_0\), we have to use center manifold theorem [39].

First, we use a perturbation at disease-free equilibrium point \(E_0\left( \dfrac{A}{d +u_1},0 \right) \) using the transformation \(X = x - \dfrac{A}{d + u_1}\), \(Y = y\), thus model system (2) becomes:

$$\begin{aligned} \dfrac{{\mathrm{d}}X}{{\mathrm{d}}t}&= - (d + u_1)X - \dfrac{\beta A}{d + u_1}Y - \beta X Y + \dfrac{\alpha \beta A}{d + u_1}Y^3 \nonumber \\&\quad +\, o(|X,Y|^4) \nonumber \\ \dfrac{{\mathrm{d}}Y}{{\mathrm{d}}t}&= \beta X Y + ab u_2^2 Y^2 - \left( \dfrac{\alpha \beta A}{d + u_1} + ab^2 u_2^3 \right) Y^3 \nonumber \\&\quad +\, o(|X,Y|^4) \end{aligned}$$
(12)

Next, we use the transformation \(X = v - \dfrac{\beta A}{(d +u_1)^2}u\), \(Y = u\); then, system (12) becomes:

$$\begin{aligned} \dfrac{{\mathrm{d}}u}{{\mathrm{d}}t}&= \beta u v + \left( abu^2_2 - \dfrac{\beta ^2 A}{(d + u_1)^2} \right) u^2 \nonumber \\&\quad - \left( \dfrac{\alpha \beta A}{d + u_1} + ab^2 u_2^3 \right) u^3 +\, o(|u,v|^4) \nonumber \\ \dfrac{{\mathrm{d}}v}{{\mathrm{d}}t}&= - (d +u_1)v - \beta u v + \dfrac{\beta ^2 A}{(d+ u_1)^2}u^2 + \dfrac{\beta ^2 A}{(d + u_1)^2} uv \nonumber \\&\quad -\, \dfrac{\beta A}{(d + u_1)^2}\left[ \dfrac{\alpha \beta A}{d + u_1} + a b^2 u_2^3 - \alpha (d+u_1) \right] u^3 \nonumber \\&\quad + o(|u,v|^4) \end{aligned}$$
(13)

From (13), it is clear that if \(\dfrac{\beta ^2 A}{(d + u_1)^2} \ne ab u_2^2\), then disease-free equilibrium point \(E_{0}\) is a saddle-node of co-dimension 1 [39]. Now if \(\dfrac{\beta ^2 A}{(d + u_1)^2} = ab u_2^2\), then we have to use center manifold theorem to find the nature of the disease-free equilibrium point \(E_{0}\).

By definition of local center manifold of the system

$$\begin{aligned} W^c(0)= & {} \left\{ (u, v)\in R^2 : v = h(u) = a_1 u^2 + a_2 u^3 +a_3 u^3 \right. \nonumber \\&\left. + \ldots , |u|< \delta ' \right\} \end{aligned}$$
(14)

satisfying \(h(0) = 0, Dh(0) = 0\) where \(\delta '\) is sufficiently small, Dh represents the derivative of h with respect to u. Using the conditions, we get the value of \(a_1\) as \(a_1 = \dfrac{ab u_2^2}{d +u_1}.\)

Therefore, local manifold of the system is given by \(v = h(u) = \dfrac{ab u_2^2}{d +u_1}u^2\), and thus system (13) becomes

$$\begin{aligned} \dfrac{{\mathrm{d}}u}{{\mathrm{d}}t}&= - \left\{ ab^2 u_2^3 + \beta \dfrac{\alpha A - a b u^2_2}{d +u_1} \right\} u^3 + o(|u|^4) \nonumber \\ \dfrac{\mathrm{d}v}{\mathrm{d}t}&= - (d + u_1)v + o(|u,v|^2) \end{aligned}$$
(15)

From (15), it is clear that if \(\dfrac{\beta ^2 A}{(d + u_1)^2} = ab u_2^2\), then disease-free equilibrium point \(E_{0}\) is a semi-hyperbolic attracting node of co-dimension 2 for \(\alpha A \ge abu_2^2.\)

Thus, proof of the theorem is complete. \(\square \)

Theorem 6

If \(R_{0}\) < \(1 - R^*_0\), then disease-free equilibrium point \(E_{0}\) is globally stable where \(R^*_0\) is defined in the text.

Proof

To show the global asymptotic stability of the disease-free equilibrium point \(E_{0}\), we are using here Lyapunov’s stability theorem [40, 41].

Let \(L(x,y)= y\) is a Lyapunov function as it is positive definite. Then,

$$\begin{aligned} \dfrac{{\mathrm{d}}L}{{\mathrm{d}}t}= & {} \dfrac{{\mathrm{d}}y}{{\mathrm{d}}t} = \dfrac{\beta xy}{1+\alpha y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y} \\ {\text {i.e.,}} \; \dfrac{{\mathrm{d}}L}{{\mathrm{d}}t}= & {} \left\{ \dfrac{\beta x}{1+\alpha y^{2}}-(d+\mu +\delta )-\dfrac{au_{2}}{1+bu_{2}y} \right\} y \\ {\text {i.e.,}}\; \dfrac{{\mathrm{d}}L}{{\mathrm{d}}t}&\le \left[ \dfrac{\beta A}{d + u_1} - (d+ \mu + \delta + au_2) \right] y \\&\quad -\, au_2\left( \dfrac{1}{1 + bu_2 \dfrac{A}{M}} - 1 \right) y\\ {\text {i.e.,}}\; \dfrac{{\mathrm{d}}L}{{\mathrm{d}}t}&\le (d + \mu + \delta + au_2)\left[ R_0 - \left\{ 1 - \dfrac{ab u_2^2 A}{M + Ab u_2} \right\} \right] y \\ {\text {i.e.,}}\; \dfrac{{\mathrm{d}}L}{{\mathrm{d}}t}&\le (d + \mu + \delta + au_2)\left[ R_0 - \left\{ 1 - R^*_0 \right\} \right] \end{aligned}$$

where \(R^*_0 = \dfrac{ab u_2^2 A}{M + Ab u_2}\).

Therefore, \(\dfrac{{\mathrm{d}}L}{{\mathrm{d}}t}\) < 0 if \(R_{0}\) < \(1 - R^*_0\).

So, by Lyapunov’s stability theorem, it follows that \(E_{0}\) is globally stable if \(R_{0}\) < \(1 - R^*_0\). \(\square \)

This result is epidemiologically more significant since if basic reproduction number is less than a threshold quantity, then disease eradicates from the population. Therefore to control or eradicate the emerging disease, our aim will be to maintain the value of the basic reproduction number less than that threshold value.

Next, we study local as well as global stability of endemic equilibrium point \(E(x^*, y^*)\) for \(R_0 > 1\).

Theorem 7

If \(R_{0}\) > 1, then model system (2) has a unique endemic equilibrium which is locally asymptotically stable for \(\beta \) \(\ge \) \(\max \ \left\{ a b u_{2}^2, \ \dfrac{a \alpha }{b}\right\} \)

Proof

To prove the above-mentioned theorem, here we use Jacobian approach of matrix [8, 42]. The characteristic equation of the Jacobian matrix (11) at any endemic equilibrium point E(xy) is given by

$$\begin{aligned} F(\lambda ) \equiv \left| \begin{array}{lllll} -\dfrac{\beta y}{1+\alpha y^2}-(d+u_{1})-\lambda \quad -\dfrac{\beta x \left\{ 1 - \alpha y^2 \right\} }{\left\{ 1+\alpha y^2\right\} ^2} \\ \dfrac{\beta y}{1+\alpha y^2} \quad \dfrac{\beta x \left\{ 1 - \alpha y^2 \right\} }{\lbrace 1+\alpha y^2\rbrace ^2}-(d+\mu +\delta )-\dfrac{au_{2}}{(1+ b u_{2} y)^2}-\lambda \end{array}\right| = 0. \end{aligned}$$
(16)

Therefore,

$$\begin{aligned} F(0)&= \begin{vmatrix} -\dfrac{\beta y}{1+\alpha y^2}-(d+u_{1})&-\dfrac{\beta x \left\{ 1 - \alpha y^2 \right\} }{\left\{ 1+\alpha y^2 \right\} ^2} \\ \dfrac{\beta y}{1+\alpha y^2}&\dfrac{\beta x \left\{ 1 - \alpha y^2 \right\} }{\lbrace 1+\alpha y^2\rbrace ^2}-(d+\mu +\delta )-\dfrac{au_{2}}{(1+bu_{2} y)^2} \end{vmatrix} \\&= \ \begin{vmatrix} \ -\ \dfrac{A}{x}&\ -\dfrac{\beta x \left\{ 1 - \alpha y^2 \right\} }{\left\{ 1+\alpha y^2 \ \right\} ^2} \\ - \, (d+ u_{1})&\ \dfrac{a b u_{2}^2 y}{(1+bu_{2} y \ )^2}- \dfrac{ \beta x }{(1+ \alpha y^2\ )} \ \end{vmatrix} \\&= \left( - \ \dfrac{A}{x}\right) \begin{vmatrix} \ 1&\ \dfrac{\beta A \left\{ 1 - \alpha y^2 \right\} }{\left\{ \beta y + (d + u_{1})(1 + \alpha y^2\ )\right\} ^2 } \\ -\, \ (d+ u_{1})&\ \dfrac{a b u_{2}^2 y}{(1+bu_{2} y \ )^2}- \dfrac{ \beta A }{\left\{ \beta y + (d + u_{1})(1 + \alpha y^2\ )\right\} } \ \end{vmatrix} \end{aligned}$$
$$\begin{aligned} \quad= & {} \beta y \left( - \ \dfrac{A}{x}\right) \left[ \dfrac{a b u_{2}^2}{\beta (1+bu_{2} y \ )^2} - \dfrac{ A \left\{ \beta + 2 \alpha y (d+ u_{1})\right\} }{\left\{ \beta y + (d + u_{1})(1 + \alpha y^2\ )\right\} ^2 } \right] . \nonumber \\ \end{aligned}$$
(17)

Since, from second equation of (5), we have

$$\begin{aligned} x&= \dfrac{A (1+ \alpha y^2)}{\beta y + (d+u_{1})(1+\alpha y^2)} \nonumber \\&= \dfrac{(1+ \alpha y^2)}{\beta }\left[ (d +\mu +\delta )+ \dfrac{a u_{2}}{\left( 1+ b u_{2}y\right) }\right] . \end{aligned}$$
(18)

Now, we consider a function f(y) defined as

$$\begin{aligned} f(y)&= \dfrac{A}{\beta y + (d+u_{1})(1+\alpha y^2)} - \dfrac{(d +\mu +\delta )}{\beta } \nonumber \\&\quad -\dfrac{a u_{2}}{\beta \left( 1+ b u_{2}y\right) }. \end{aligned}$$
(19)

So, for endemic equilibria, y component is the solution of the equation \(f(y) = 0\) with \( f(0) {=} \dfrac{(d {+} \mu {+}\delta {+} a u_{2})(R_{0} - 1)}{\beta }\).

It follows from the expression of f(0) that

$$\begin{aligned} \begin{matrix} R_{0}> 1&{}\; {\mathrm {if}} &{}\; f(0) > 0\\ R_{0}< 1 &{}\; {\mathrm {if}} &{}\; f(0) < 0 \end{matrix} \end{aligned}$$
(20)

Differentiating f(y) in (19) w.r.t. y, we get

$$\begin{aligned} f'(y) = \dfrac{a b u_{2}^2}{\beta (1 + b u_{2} y)^2} - \dfrac{A \left\{ \beta + 2 \alpha y (d + u_{1})\right\} }{\left\{ \beta y + (d + u_{1})(1+ \alpha y^2) \right\} ^2 }. \end{aligned}$$
(21)

From (17) and (21), we can easily draw a relationship between F(0) and \(f'(y)\), which gives

$$\begin{aligned} F(0)= - \left( \dfrac{A \beta y}{x}\right) f'(y). \end{aligned}$$
(22)

Thus, from (22), we see the following statements hold:

$$\begin{aligned} \ \ F(0)> 0 \ \ \ \text {if} \ \ f'(y)< 0\nonumber \\ \ \ F(0) < 0 \ \ \ \text {if} \ \ f'(y) > 0 \end{aligned}$$
(23)

Here, we take \(R_{0} > 1\) this implies \(f(0) > 0\) (by (20)). We have already seen that if \(R_{0} > 1\), then there is only one endemic equilibrium \(E(x^*, y^*)\). Since, \(f(0) > 0\) hence f(y) should decrease in some neighborhood of \(y^*\). So, in this case, we have \( f'(y) <0 \) and so \(F(0) > 0\).

Now, eigenvalues of the Jacobian matrix are solutions of the equation \( F(\lambda ) = 0 \), i.e.,

$$\begin{aligned} \lambda ^2 + C_{1} \lambda + C_{2} = 0, \end{aligned}$$
(24)

where

$$\begin{aligned} C_{2}&= F(0) > 0,\\ C_{1}&= - \left[ (d+u_{1}) + \dfrac{ 2 \alpha \beta x y^2}{(1+ \alpha y^2\ )^2} \right. \\&\qquad \qquad \qquad \left. + \dfrac{(\beta -a b u_{2}^2)+ 2 \alpha \beta u_{2} y + b u_{2}^2 y^2(b \beta - a \alpha )}{(1+ \alpha y^2)(1+ b u_{2} y)^2} \right] . \end{aligned}$$

It is clear from the expression of \( C_1 \) that \( C_{1} \le 0\) if \(\beta \ge \max \left\{ a b u_{2}^2, \dfrac{a \alpha }{b}\right\} \).

Therefore, all eigenvalues of Jacobian matrix have negative real part. So, for \(R_{0}\) > 1, we see that \(E(x^*, y^*)\) is locally asymptotically stable if \(\beta \) \(\ge \) \( \max \ \left\{ a b u_{2}^2, \ \dfrac{a \alpha }{b}\right\} . \) \(\square \)

Theorem 8

For \(R_0 > 1\), endemic equilibrium point is globally asymptotically stable if \(u_1 > abu_2^2\) holds.

Proof

To prove global stability of the endemic equilibrium point \(E(x^*, y^*)\), here we use Dulac criterion [28, 39]. First, we write our model system (2) in the following form:

$$\begin{aligned} \dfrac{{\mathrm{d}}x}{{\mathrm{d}}t}&= A-\dfrac{\beta xy}{1+\alpha y^{2}}-\left( d+u_{1}\right) x \equiv F(x, y) \nonumber \\ \dfrac{{\mathrm{d}}y}{{\mathrm{d}}t}&= \dfrac{\beta xy}{1+\alpha y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y} \equiv G(x, y) \end{aligned}$$
(25)

Now, we construct Dulac function as \(D(x, y) = \dfrac{1}{y},\) then we have

$$\begin{aligned}&\dfrac{\partial }{\partial x}(DF) + \dfrac{\partial }{\partial y}(DG) = - \dfrac{\beta }{1 + \alpha y^2} - (d + u_1)\\&\qquad \quad -\, \dfrac{2 \alpha \beta x y}{(1 + \alpha y^2)^2} + \dfrac{abu_2^2}{(1 + bu_2 y)^2}\\ \text {i.e.,}\; \dfrac{\partial }{\partial x}(DF) + \dfrac{\partial }{\partial y}(DG)&< - \dfrac{\beta }{1 + \alpha y^2} - (d + u_1)\\&\quad -\, \dfrac{2 \alpha \beta x y}{(1 + \alpha y^2)^2} + abu_2^2\\ \text {i.e.,} \; \dfrac{\partial }{\partial x}(DF) + \dfrac{\partial }{\partial y}(DG)&< - \dfrac{\beta }{1 + \alpha y^2} - d - \dfrac{2 \alpha \beta x y}{(1 + \alpha y^2)^2} \\&\quad -\, (u_1 - abu_2^2) < 0 \;{\text {if}}\; u_1 > abu_2^2. \end{aligned}$$

Therefore by Dulac criterion for \(R_0 > 1\), endemic equilibrium point \(E(x^*, y^*)\) is globally asymptotically stable if \(u_1 > abu_2^2\). \(\square \)

In view of epidemiology, above result indicates disease persists in the population. Therefore for \(R_0 >1\), disease cannot be eradicated from the population when \(u_1 > abu_2^2\).

5 Bifurcation analysis

Bifurcation means qualitative change of the solution of a model system. In this section, we shall discuss different kinds of bifurcations of the solutions in the neighborhood of different equilibrium points. Among them, some are co-dimension 1 such as transcritical bifurcation, backward bifurcation, saddle-node bifurcation, Hopf bifurcation and the co-dimension two bifurcation likely the Bogdanov–Takens bifurcation of co-dimension 2.

5.1 Transcritical bifurcation

First of all, we discuss the transcritical bifurcation for model (2). Since for \(u_{2}\) = \(u_{2}^0\) = \( \left\{ \dfrac{\beta A}{a (d + u_{1})} - \dfrac{d +\mu +\delta }{a}\right\} \), i.e., \( R_0=1 \) one eigenvalue of \( J(E_0) \) is zero, and consequently, the usual eigenanalysis method fails here. Using Sotomayor theorem [39, 43], we shall now establish the system experiences transcritical bifurcation when the control parameter (\( u_2\)) crosses the critical value \(u_{2}\) = \(u_{2}^0\).

Theorem 9

The model system (2) undergoes a transcritical bifurcation at disease-free equilibrium point (\(E_{0}\)) as the control parameter \(u_{2}\) passes through the bifurcation value \(u_{2} = u_{2}^0\).

Proof

Here, we use Sotomayor’s theorem [39, 43] to verify the transversality conditions for transcritical bifurcation; we denote \( f(x,y) = (f_{1}(x,y), f_{2}(x,y))^t \) where

$$\begin{aligned} f_{1}(x,y)= & {} A-\dfrac{\beta xy}{1+\alpha y^{2}}-\left( d+u_{1}\right) x \nonumber \\ f_{2}(x,y)= & {} \dfrac{\beta xy}{1+\alpha y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y} \end{aligned}$$
(26)

Then,

$$\begin{aligned}&J(E^*)=Df(x^*,y^*)\nonumber \\&\quad =\left( \begin{array}{ccc} - \dfrac{\beta y^*}{1+\alpha {y^*}^2}-(d+ u_{1}) &{} -\dfrac{\beta x^* \left( 1 - \alpha {y^*}^2 \right) }{\left( 1+\alpha {y^*}^2\right) ^2} \\ \dfrac{\beta y^*}{1+\alpha {y^*}^2} &{} \dfrac{\beta x^* \left( 1 - \alpha {y^*}^2\right) }{\left( 1+\alpha {y^*}^2\right) ^2}-(d+\mu +\delta )-\dfrac{au_{2}}{(1+bu_{2} y^*)^2} \end{array} \right) , \end{aligned}$$
(27)

and therefore,

$$\begin{aligned} J(E_{0};u_{2}^0)=Df(E_{0};u_{2}^0)=\left( \begin{array}{ccc} - (d+ u_{1}) &{} -\dfrac{\beta A }{d + u_{1}}\\ 0 &{} 0 \end{array} \right) \end{aligned}$$

has one negative eigenvalue and other eigenvalue is zero.

Let, V and W be the two eigenvectors corresponding to the zero eigenvalue of the matrix \(J(E_{0})\) and \([J(E_{0})]^ \mathrm{T} \), respectively, where V and W are given by \( V = \left( \begin{array}{ccc} 1 \\ - \dfrac{\left( d + u_{1}\right) ^2}{\beta A} \end{array} \right) \) and \(W = \left( \begin{array}{ccc} 0 \\ 1 \end{array} \right) .\)

Furthermore, differentiating partially (26) and (27) w.r.t. \(u_{2}\), we get

$$\begin{aligned}&f_{u_{2}}(x,y) = \left( \begin{array}{ccc} 0 \\ - \dfrac{ a y }{(1 + b u_{2} y)^2} \end{array} \right) , \\&Df_{u_{2}}(x,y) = \left( \begin{array}{ccc} 0 &{} 0 \\ 0 &{} - \dfrac{a}{(1 + b u_{2} y)^3} \end{array} \right) . \end{aligned}$$

Therefore, \( f_{u_{2}}(E_{0};u_{2}^0) = \left( \begin{array}{ccc} 0 \\ 0 \end{array} \right) ,\)

$$\begin{aligned} Df_{u_{2}}(E_{0};u_{2}^0) V&= \left( \begin{array}{ccc} 0 &{} 0 \\ 0 &{} - a \end{array} \right) \left( \begin{array}{ccc} 1 \\ - \dfrac{\left( d + u_{1}\right) ^2}{\beta A} \end{array} \right) \\&= \left( \begin{array}{ccc} 0 \\ \dfrac{ a \left( d + u_{1}\right) ^2}{\beta A} \end{array} \right) . \end{aligned}$$

and \( D^2f(E_{0};u_{2}^0) (V,V)\)

$$\begin{aligned}&= \left( \begin{array}{ccc} \dfrac{\partial ^2 f_{1}}{\partial x^2} V_{1}^{2} + 2 \dfrac{\partial ^2 f_{1}}{\partial x \partial y} V_{1} V_{2} + \dfrac{\partial ^2 f_{1}}{\partial y^2} V_{2}^{2} \\ \dfrac{\partial ^2 f_{2}}{\partial x^2} V_{1}^{2} + 2 \dfrac{\partial ^2 f_{2}}{\partial x \partial y} V_{1} V_{2} + \dfrac{\partial ^2 f_{2}}{\partial y^2} V_{2}^{2} \end{array} \right) \\&= \left( \begin{array}{ccc} 0 \\ \dfrac{4 b (d + u_{1})^4}{\beta ^2 A^2} \left( \dfrac{\beta A}{d + u_{1}} - (d+ \mu + \delta ) \right) \end{array} \right) . \end{aligned}$$

Thus, we have \(W^T f_{u_{2}}(E_{0};u_{2}^0) = 0\),

$$\begin{aligned}&W^T \left( Df_{u_{2}}(E_{0};u_{2}^0) V\right) = \dfrac{ a \left( d + u_{1}\right) ^2}{\beta A}\ne 0,\\&W^T \left( D^2f(E_{0};u_{2}^0) (V,V)\right) \\&= \dfrac{4 b (d + u_{1})^4}{\beta ^2 A^2} \left( \dfrac{\beta A}{d+ u_{1}} - (d+ \mu + \delta ) \right) \ne 0 \end{aligned}$$

if \(\dfrac{\beta A}{d + u_{1}} \ne (d+ \mu + \delta )\).

Thus, all conditions of Sotomayor’s theorem for transcritical bifurcation are satisfied; therefore, the model system (2) experiences transcritical bifurcation at \(E_{0}\) when control parameter passes through the critical value \(u_{2}\) = \(u_{2}^0\). Here, the stable disease-free equilibrium point become unstable and one stable endemic equilibrium point generates through transcritical bifurcation. \(\square \)

5.2 Backward bifurcation

According to epidemiology, for \(R_{0} <1 \), disease can be eradicated from the system. But due to saturated treatment, the model system (2) may exhibit backward bifurcation. Epidemiologically the backward bifurcation is most important [28, 30] because in this case the system generates another stable endemic equilibrium point for \( R_0<1 \). In this situation, eradication of disease not only depends on the value of \( R_0 \) but it also depends on the initial number of infected populations. If system undergoes through backward bifurcation, we cannot conclude about eradication of disease for \(R_0< 1.\) For backward bifurcation, there exists bistability of equilibrium points (one stable disease-free equilibrium and stable endemic equilibrium) for \(R_0< 1\). Therefore, if \(R_0< 1\), eradication of disease depends on initial infected population.

Theorem 10

The model system (2) experiences a backward bifurcation at \(R_{0}\) = 1 when \((d+ \mu +\delta +a u_{2})< Aab u_{2}^2\).

Proof

In Sect. 3, we have already seen that infected component y satisfies Eq. (7). Since from the expression of \(R_{0} \), we have

$$\begin{aligned} \beta = \dfrac{(d+u_{1})(d+\mu +\delta +a u_{2})R_{0}}{A}, \end{aligned}$$

so the coefficients of the above equation can be expressed as a function of \( R_0 \) putting value of \( \beta \). So, the solution of the equation is a function of \( R_0 \). Now, to derive a necessary and sufficient condition on the parameters such that the system experiences backward bifurcation with respect to \( R_0 \), we have to compute value of \(\dfrac{\partial y}{\partial R_{0}}\vert _{R_{0}=1,y=0}\).

Differentiating partially Eq. (4) w.r.to. \(R_{0}\) and putting \( R_0=1, y=0 \), we get

$$\begin{aligned} \dfrac{\partial y}{\partial R_{0}}\vert _{R_{0}=1,y=0} = \dfrac{A}{(d+ \mu + \delta + a u_{2})- Aabu_{2}^2}. \end{aligned}$$

Thus, the endemic equilibrium curve bifurcates backward direction if the slope of the solution curve has negative value at \( R_0=1 \) which is equivalent to \(\dfrac{\partial y}{\partial R_{0}}\vert _{R_{0}=1,y=0} < 0\). Thus, the necessary and sufficient condition for backward bifurcation is \((d+ \mu +\delta +a u_{2})< Aab u_{2}^2\) . \(\square \)

From above, it is clear that the model system (2) has backward bifurcation due to saturated treatment. So, there is a range \(R_{0}^*\) < \(R_{0}\) < 1, for which model system (2) has two endemic equilibria. Now, we concentrate on discussing the stability of these two endemic equilibria when backward bifurcation condition is satisfied. So, we have the following theorem:

Theorem 11

If \(R_{0}^*< R_{0} < 1\) and \((d+ \mu +\delta +a u_{2})\) < \(Aabu_{2}^2\) hold, then model system (2) has two endemic equilibria. The one with smaller number of infected (\(E_{1}\)) is unstable, while the other with higher number of infected (\(E_{2}\)) is locally asymptotically stable if \(\beta \) \(\ge \) \(\max \ \left\{ a b u_{2}^2, \ \dfrac{a \alpha }{b}\right\} . \)

Proof

Since, the equation \( f(y) = 0 \) (where f(y) is given by (19)) has two distinct real roots \( y_{1}^{*}\), \(y_{2}^{*}\) with \(y_{1}^{*}\) < \(y_{2}^{*}\), then f(y) is increasing in neighborhood of \( y_{1}^{*} \) that means \( f'(y_{1}^{*})> 0\) and consequently f(y) is decreasing in neighborhood of \(y_{2}^{*}\) that means \(f'(y_{1}^{*})< 0\). So, we have the following two cases.

Case-1 For the equilibrium \(E_{1}(x_{1}^*,y_{1}^*)\), we have \(f'(y_{1}^{*})\) > 0, So F(0) < 0 (by (23)) . Since \(F(\lambda )\) is a quadratic equation, we have \(F(\lambda )\) \(\rightarrow \) \(\infty \) as \(\lambda \) \(\rightarrow \) \(\infty \) . Hence, there must be a real positive \(\lambda ^{*}\) such that F(\(\lambda ^{*}\)) = 0. So Jacobian matrix has at least one positive eigenvalue. Therefore, \(E_{1}(x_{1}^*,y_{1}^*)\) is unstable.

Case-2 For the equilibrium \(E_{2}(x_{2}^*,y_{2}^*)\), we have \(f'(y_{2}^{*})\) < 0, So F(0) > 0 (by Eq. (23)) . In this case, we proceed same as Theorem 7 and obtain that \(E_{2}(x_{2}^*,y_{2}^*)\) is locally asymptotically stable if \(\beta \) \(\ge \) \(\max \ \left\{ a b u_{2}^2, \ \dfrac{a \alpha }{b}\right\} . \)

Thus, we prove the above stated theorem. \(\square \)

In the next theorem, we shall establish the condition of saddle-node bifurcation. This type of bifurcation occurs when two fixed points create and annihilate depending on the model parameter.

5.3 Saddle-node bifurcation

Suppose at \(\alpha = \alpha ^{[{\mathrm{SN}}]}\), Eq. (7) has coincident real positive root, and the corresponding endemic equilibrium point is \(E(x^*,y^*)\).

Theorem 12

The model system (2) experiences saddle-node bifurcation at the coincident endemic equilibrium point \( E(x^*,y^*) \) when the system parameter \( \alpha \) crosses the critical value \(\alpha = \alpha ^{[{\mathrm{SN}}]}\) and

$$\begin{aligned}&A \left( - 2 + 9 \alpha ^{[{\mathrm{SN}}]} {y^*}^2 - 3 {\alpha ^{[{\mathrm{SN}}]}}^2 {y^*}^4 \right) \\&\qquad + x^* (d + u_{1}) \left( 1 - \alpha ^{[{\mathrm{SN}}]} {y^*}^2 \right) \ne 0 . \end{aligned}$$

Proof

It can be easily shown that for \(\alpha \) = \(\alpha ^{[SN]}\), we have \( {\mathrm{det}}(J(E))\vert _{\alpha = \alpha ^{[{\mathrm{SN}}]}}\) = 0 and \({\mathrm{tr}}(J(E))\vert _{\alpha = \alpha ^{[{\mathrm{SN}}]}}\)\(\ne \ 0 \) at the coincident endemic equilibrium point \(E(x^*,y^*)\). Hence, one eigenvalue of the J(E) will be zero at \( \alpha = \alpha ^{[{\mathrm{SN}}]} \). If V and W are eigenvectors corresponding to eigenvalue zero for the matrices J(E) and \(J^T({E})\), respectively, then

$$\begin{aligned} V&= \left( \begin{array}{ccc} \dfrac{\beta x^* \left( 1 - \alpha ^{[{\mathrm{SN}}]} {y^*}^2 \right) }{\left( 1+\alpha ^{[SN]} {y^*}^2\right) }\\ \beta y^*+ \left( d + u_{1}\right) \left( 1+\alpha ^{[SN]} {y^*}^2\right) \end{array} \right) \; {\text {and}} \\ W&= \ \left( \begin{array}{ccc} - \left( d + u_{1}\right) \left( 1+\alpha ^{[{\mathrm{SN}}]} {y^*}^2\right) \\ \beta y^*+ \left( d + u_{1}\right) \left( 1+\alpha ^{[{\mathrm{SN}}]} {y^*}^2\right) \end{array} \right) . \end{aligned}$$

Furthermore, we have

$$\begin{aligned} f_{\alpha }(E;\alpha ^{[{\mathrm{SN}}]}) = \left( \begin{array}{ccc} \dfrac{ 2 \alpha ^{[{\mathrm{SN}}]} \beta x^* {y^*}^2}{\left( 1+ \alpha ^{[{\mathrm{SN}}]} {y^*}^2\ \right) ^2} \\ - \dfrac{ 2 \alpha ^{[{\mathrm{SN}}]} \beta x^* { y^*}^2}{\left( 1+ \alpha ^{[{\mathrm{SN}}]} {y^*}^2 \ \right) ^2} \end{array} \right) \end{aligned}$$

and \( D^2f(E;\alpha ^{[{\mathrm{SN}}]}) (V,V)\) = \(\dfrac{2 A}{ x^* \left( 1 + \alpha ^{[{\mathrm{SN}}]}{y^*}^2\right) ^2 }\) \(\times \) [A (\(2 - 9 \alpha ^{[{\mathrm{SN}}]} {y^*}^2\) + \(3 {\alpha ^{[{\mathrm{SN}}]}}^2 {y^*}^4\)) + \(x^* (d + u_{1}) ( 1 - \alpha ^{[{\mathrm{SN}}]} {y^*}^2\)) ] \(\times \) \(\left( \begin{array}{ccc} 1 \ 1 \end{array} \right) \). Thus, we have

$$\begin{aligned} W^T f_{\alpha }(E;\alpha ^{[{\mathrm{SN}}]}) = - \dfrac{2 \alpha ^{[{\mathrm{SN}}]} \beta {y^*}^2}{\left( 1 + \alpha ^{[{\mathrm{SN}}]} {y^*}^2\right) }[A + (d+ u_{1})]\ne 0, \end{aligned}$$

and

$$\begin{aligned}&W^T \left( D^2f(E;\alpha ^{[{\mathrm{SN}}]})(V,V)\right) = \dfrac{2 A \left( A + (d+ u_{1})x^*\right) }{ {x^*}^2 \left( 1 + \alpha ^{[{\mathrm{SN}}]}{y^*}^2\right) } \\&\quad \times \left[ A \left( - 2 + 9 \alpha ^{[{\mathrm{SN}}]} {y^*}^2 - 3 {\alpha ^{[{\mathrm{SN}}]}}^2 {y^*}^4 \right) \right. \\&\quad \left. + x^* (d + u_{1})\left( 1 - \alpha ^{[{\mathrm{SN}}]} {y^*}^2 \right) \right] \ne 0. \end{aligned}$$

Thus, the transversality condition for the occurrence of saddle-node bifurcation at coincident equilibrium \(E(x^*,y^*)\) for \(\alpha \) = \(\alpha ^{[{\mathrm{SN}}]}\) is satisfied for system (2). Hence, by Sotomayor’s theorem [39, 43], we can conclude that system (2) has saddle-node bifurcation when model parameter \(\alpha \) passes through its critical value \(\alpha \) = \(\alpha ^{[{\mathrm{SN}}]}\). \(\square \)

5.4 Hopf bifurcation

In this section, we consider the case when \( J(E(x^*,y^*)) \) has a pair of purely imaginary eigenvalues. Hopf bifurcation occurs when this pair of complex conjugate eigenvalues around non-trivial equilibrium \(E(x^*,y^*)\) cross-imaginary axis [44]. In this case, endemic equilibria \(E(x^*,y^*)\) loses stability through Hopf bifurcation under some parametric restriction. Considering A as the bifurcation parameter, suppose at \( A=A^{[\mathrm{HB}]}\), we have \(\mathrm{tr}(J(E(x^*,y^*)))\vert _{A = A^{[\mathrm{HB}]}}\) = 0 and \(\mathrm{det}(J(E(x^*,y^*)))\vert _{A = A^{[\mathrm{HB}]}}>\) 0. The existence of a Hopf bifurcation around endemic equilibrium \(E(x^*,y^*)\) is established in the following theorem:

Theorem 13

The model system (2) experiences a Hopf bifurcation at any endemic equilibrium point \(E(x^*,y^*)\) when the model parameter A crosses the critical value \(A = A^{[\mathrm{HB}]}\) . Moreover,

  1. (i)

    If \(\zeta < 0\), there is a family of stable periodic orbits of model system (2) as A decreases from \( A^{[\mathrm{HB}]}\),

  2. (ii)

    If \( \zeta >0 \), there is a family of unstable periodic orbits of model system (2) as A increases from \( A^{[\mathrm{HB}]}\), where \( \zeta \) will be defined in the proof of the theorem.

Proof

From above discussions, it is clear that \(\mathrm{tr}(J (E))= 0 \) and \( \mathrm{det}(J(E))> 0\) at \( A=A^{[\mathrm{HB}]}\) when endemic equilibria \(E(x^*,y^*)\) exists. That means the corresponding Jacobian matrix J(E) has a pair of purely imaginary eigenvalues if \( A =A^{[\mathrm{HB}]}\) .

Now, in order to ensure the changes of stability through Hopf bifurcation, we have to check the transversality condition for Hopf bifurcation . The transversality condition is

$$\begin{aligned} \dfrac{\mathrm{d}}{\mathrm{d}A}\left( \mathrm{tr} J_{E}\right) \vert _{A = A^{[\mathrm{HB}]}} \ne 0 . \end{aligned}$$

Our next target is to find the direction of Hopf bifurcation in the neighborhood of the endemic equilibrium point \(E(x^*,y^*)\) for the critical value \( A = A^{[\mathrm{HB}]}\). First, we translate the endemic equilibrium \(E(x^*,y^*)\) to origin using transformation \( X = x - x^*\), \( Y = y - y^*\), then model system (2) becomes

$$\begin{aligned} \dfrac{\mathrm{d}X}{\mathrm{d}t}&= a_{11} X + a_{12} Y + a_{13} X Y + a_{14} Y^ 2 \nonumber \\&\quad +\, a_{15} X Y^ 2 + a_{16} Y^ 3 \nonumber \\ \dfrac{\mathrm{d}Y}{\mathrm{d}t}&= a_{21} X + a_{22} Y + a_{23} X Y + a_{24} Y^ 2 \nonumber \\&\quad +\, a_{25} X Y^ 2 + a_{26} Y^ 3 \end{aligned}$$
(28)

where the coefficients \( a_{ij }\) is given in “Appendix-I.”

Let, \(\Sigma \) denote the origin of the XY plane, then \(\mathrm{det}(J(\Sigma ))\) = \(a_{11}\) \(a_{22}\) \( - \) \(a_{12}\) \(a_{21}\) > 0 and let \(\varOmega \) = \(\sqrt{\mathrm{det}(J(\Sigma ))}\). Then, under the transformation \( u = -X \) and \( v = \left( a_{11} \dfrac{X}{\ \varOmega } + a_{12} \dfrac{Y}{\ \varOmega }\right) \), the model system (2) reduces to the normal form:

$$\begin{aligned} \dfrac{{\mathrm{d}}u}{{\mathrm{d}}t} = - \varOmega \ v + f(u,v) \nonumber \\ \dfrac{{\mathrm{d}}v}{{\mathrm{d}}t} = \ \varOmega \ u + \ g(u,v) \end{aligned}$$
(29)

where

$$\begin{aligned} f(u,v)&= \left( \dfrac{a_{11} \ a_{13}}{a_{12}} - \dfrac{a_{11}^2 a_{14}}{a_{12}^2} \right) u ^2 \\&\quad + \left( \dfrac{\ a_{13} }{a_{12}} - \dfrac{ 2 a_{11} \ a_{14}}{a_{12}^2} \right) \varOmega u v \\&\quad +\, \left( - \dfrac{a_{14} }{a_{12}^2 }\right) \varOmega ^2 v^2 + \left( \dfrac{ 2 \ a_{11} \ a_{15} }{a_{12}^2} - \dfrac{ 3 a_{11}^2 a_{16}}{a_{12}^3} \ \right) \varOmega u^2v \\&\quad + \, \left( \dfrac{ \ a_{15} }{a_{12}^2} - \dfrac{ 3\ a_{11} \ a_{16}}{a_{12}^3} \ \right) \varOmega ^2 u v^2 \\&\quad + \left( \dfrac{ \ a_{11}^2 \ a_{15} }{a_{12}^2} - \dfrac{a_{11}^3 a_{16}}{a_{12}^3} \ \right) u^3 \\&\quad +\, \left( - \dfrac{a_{16} }{a_{12}^3 }\right) \varOmega ^3 \ v^3, \end{aligned}$$

and

$$\begin{aligned} g(u,v)&= \left( \dfrac{ a_{11}^2 a_{24}}{a_{12}} + \dfrac{a_{11}^3 a_{14}}{a_{12}^2} - \dfrac{ \ a_{11}^2 a_{13}}{a_{12}} - a_{11} a_{23} \right) \dfrac{u ^2}{\varOmega } \\&\quad + \left( \dfrac{ 2 a_{11} a_{24}}{a_{12}}+ \dfrac{2 a_{11}^2 a_{14}}{a_{12}^2} - \dfrac{ a_{11} a_{13}}{a_{12}} - a_{23} \right) u v\\&\quad + \left( \dfrac{ a_{11} a_{14}}{a_{12}^2} + \dfrac{a_{24}}{a_{12}} \right) \varOmega v ^2 + \left( \dfrac{ a_{11} a_{16}}{a_{12}^3} + \dfrac{a_{26}}{a_{12}^2} \right) \varOmega ^2 v ^3 \\&\quad +\, \left( \dfrac{3 a_{11}^3 a_{16}}{a_{12}^3} - \dfrac{2 a_{11}^2 a_{15}}{a_{12}^2} - \dfrac{ 2 a_{11} a_{25}}{a_{12}} + \dfrac{3 a_{11}^2 a_{26}}{a_{12}^2} \right) u ^2 v \\&\quad +\, \left( \dfrac{3 a_{11} a_{26}}{a_{12}^2} - \dfrac{ a_{11} a_{15}}{a_{12}^2} + \dfrac{ 3 a_{11} ^2 a_{16}}{a_{12} ^3} - \dfrac{ a_{25}}{a_{12}} \right) \varOmega u v ^2 \\&\quad +\, \left( \dfrac{ a_{11}^3 \ a_{26}}{a_{12}^2} + \dfrac{a_{11}^4 a_{16}}{a_{12}^3} - \dfrac{ a_{11} ^2 a_{25}}{a_{12}} - \dfrac{ a_{11}^3 a_{15}}{a_{12}^2} \right) \dfrac{u^3}{\varOmega }. \end{aligned}$$

The discriminating quantity (\(\varGamma \)) for finding the direction of Hopf bifurcation is defined below:

$$\begin{aligned} \varGamma&= \dfrac{1}{16}[ \ f_{uuu}+ f_{uvv} +g_{uuv} + g_{vvv}\ ] \\&\quad +\, \dfrac{1}{16 \ \varOmega } [\ f_{uv}(\ f_{uu}+f_{vv}) - g_{uv}(\ g_{uu}+ g_{vv})\\&\quad -\, f_{uu} \ g_{uu} + f_{vv} \ g_{vv} \ ] = \dfrac{\zeta }{\ 8 \ \varOmega ^2 \ a_{12} ^2} \end{aligned}$$

where \(f_{u v}\) denotes \(\dfrac{\partial \ ^2 f}{\partial u \partial v}(0,0)\) and

$$\begin{aligned} \zeta&= \left[ (a_{15} + 3 a_{26}) - \dfrac{a_{14} ( a_{13} + 2 a_{24})}{a_{12}}\right] \varOmega ^4 \\&\quad + \left[ \left\{ ( a_{15} + 3 \ a_{26}) - \dfrac{ 2 a_{14} ( a_{13} + 2 a_{24})}{ a_{12}} \right\} a_{11}^2\right. \\&\quad \left. + \left\{ a_{13}^2 - 2 a_{12} a_{25} + a_{13} a_{24} - 2 a_{14}^2 \right\} a_{11} + a_{12} a_{23} a_{24} \right] \varOmega ^2 \\&\quad +\, \left[ a_{23} - \dfrac{a_{11}(a_{13} + 2 a_{24})}{a_{12}}\right] \\&\quad \left[ a_{11}^2 (a_{12} a_{24} - a_{12} a_{13}) + \ a_{11} (a_{11}^2 a_{14} - a_{12}^2 a_{23})\right] . \end{aligned}$$

By [11, 45], proof of the theorem is complete. \(\square \)

5.5 Bogdanov–Takens bifurcation

As so far, we have studied the bifurcations of co-dimension 1, i.e., considering only one parameter as the bifurcation parameter. The model system (2) experiences the bifurcations of co-dimension 1 such as transcritical bifurcation, saddle-node bifurcation, and Hopf bifurcation. We are now interested to investigate co-dimension 2 bifurcation such as Bogdanov–Takens bifurcation of co-dimension 2. Under the condition \(\mathrm{tr}(J(E)) = 0\) and \(\mathrm{det}(J(E)) = 0\), the Jacobian matrix of the coincident endemic equilibrium \(E(x^*,y^*)\) has a double zero eigenvalue. Suppose these two conditions occur for the parameter values \( A=A_{0} \) and \( \alpha =\alpha _0 \) when other parameters are keeping as constant. So the model system (2) may admit a Bogdanov–Takens (BT) bifurcation in vicinity of the coincident endemic equilibrium point. Now, first we investigate the conditions for which BT bifurcation will occur.

5.5.1 Condition for existence of Bogdanov–Takens bifurcation

To ensure the existence of BT bifurcation for the critical values of the parameters ( i.e., \( A=A_{0} \) and \( \alpha =\alpha _0 \)), we are using the transformation \( X = x - x^*\), \( Y = y - y^*\) and the condition \(\mathrm{tr}(J(E)) = 0\), \(\mathrm{det}(J(E)) = 0\), the model system (2) reduces to

$$\begin{aligned} \dfrac{\mathrm{d}X}{\mathrm{d}t}&= a_{11} X + a_{12} Y + a_{13} X Y + a_{14} Y^ 2\nonumber \\&\quad +\, Q_{1} (X,Y) \nonumber \\ \dfrac{\mathrm{d}Y}{\mathrm{d}t}&= - \dfrac{a_{11}^2}{a_{12}} X - a_{11} Y + a_{23} X Y + a_{24} Y^ 2 \nonumber \\&\quad +\, Q_{2} (X,Y) \end{aligned}$$
(30)

where \(Q_{i} (X,Y), i = 1, 2 \) are smooth functions of (XY) at least of order three and the coefficients are given in “Appendix-I” (using maple software [46] ).

We set \( x = X, \ y = a_{11} X + a_{12} Y \), then system (30) transforms to

$$\begin{aligned} \dfrac{\mathrm{d}x}{\mathrm{d}t}&= y + c_{1} \ x^ 2 + c_{2} \ x \ y + c_{3} \ y^ 2 + R_{1} (x,y) \nonumber \\ \dfrac{\mathrm{d}y}{\mathrm{d}t}&= c_{4} \ x^2 + c_{5} \ x \ y + c_{6} \ y^ 2 + R_{2} (x,y) \end{aligned}$$
(31)

where \(R_{i} (x,y), i = 1, 2 \) is a smooth function in \( ( x,\ y )\) at least of order three and

$$\begin{aligned} c_{1}&= \dfrac{ a_{11} (a_{11} a_{14} - a_{12} a_{13})}{a_{12}^2}, \; c_{2} = \dfrac{ a_{12} a_{13} - 2 a_{11} a_{14} }{a_{12}^2},\\ c_{3}&= dfrac{ a_{14} }{a_{12}^2},\; c_{6} = dfrac{ a_{11} a_{14} + a_{12} a_{24} } {a_{12}^2},\\ c_{4}&= \dfrac{ a_{11}^2 (a_{11} a_{14} - a_{12} a_{13}) + a_{11} a_{12} (a_{11} a_{24} - a_{12} a_{23})}{a_{12}^2},\\ c_{5}&= \dfrac{ a_{12} (a_{12} a_{23} + a_{11} a_{13}) - 2 a_{11} (a_{12} a_{24} + a_{11} a_{14}) }{a_{12}^2}. \end{aligned}$$

Next, letting \( x = x, s = y +c_{3}y^2\) and neglecting higher terms of order greater than 2 the system (31) becomes,

$$\begin{aligned} \dfrac{\mathrm{d}x}{\mathrm{d}t}&= s + c_{1} x^ 2 + c_{2} x s + R_{3} (x,s) \nonumber \\ \dfrac{\mathrm{d}s}{\mathrm{d}t}&= c_{4} x^2 + c_{5} x s + c_{6} s^ 2 + R_{4} (x,s) \end{aligned}$$
(32)

where \(R_{i} (x,s), i = 3, 4\) is a smooth function in (xs) at least of order three. One more time, we set the variables \( x = x, z = s - c_{6} x s,\) system (32) becomes

$$\begin{aligned} \dfrac{\mathrm{d}x}{\mathrm{d}t}&= z + c_{1} \ x^ 2 + ( c_{2} + c_{6} ) \ x \ z + R_{5} (x,z) \nonumber \\ \dfrac{\mathrm{d}z}{\mathrm{d}t}&= c_{4} \ x^2 + c_{5} x z + R_{6} (x,z) \end{aligned}$$
(33)

where \(R_{i} (x,z), i = 5, 6\) is a smooth function in (xz) at least of order three. Finally, to obtain the normal form of Bogdanov–Takens bifurcation [47, 48], we perform the transformation \(u = x - \dfrac{c_{2} + c_{6}}{2} \ x^2,\) \( v = z + c_{1} x^2\), then system (33) transforms to

$$\begin{aligned} \dfrac{\mathrm{d}u}{\mathrm{d}t}&= v + P_{1} (u,v) \nonumber \\ \dfrac{\mathrm{d}v}{\mathrm{d}t}&= c_{4} u^2 + (c_{5} + 2 c_{1} \ ) uv + P_{2} (x,z) \end{aligned}$$
(34)

where \(P_{i} (u,v)\), \(i = 1, 2\) is a smooth function in (uv) at least of order three. From above, it is clear that \(c_{4}\) \(\ne \) 0 and \(c_{5} + 2 \ c_{1}\) \(\ne \) 0 which implies model system (2) admits Bogdanov–Takens bifurcation.

Thus, we obtain the following theorem:

Theorem 14

Suppose that \(E(x^*,y^*)\) is an endemic equilibrium of model system (2) such that \(\mathrm{tr}(J_{E})\) = 0 and \(\mathrm{det}(J_{E})\) = 0 with \(c_{4}\) \(\ne \) 0, \(c_{5} + 2 \ c_{1}\) \(\ne \) 0 . Then, \(E(x^*,y^*)\) is a Bogdanov–Takens singularity of co-dimension 2 .

5.5.2 Bogdanov–Takens bifurcation of co-dimension 2

Now, we find analytical expressions for saddle-node bifurcation curve, Hopf bifurcation curve and homoclinic bifurcation curve in a small neighborhood of BT point, by determining the versal unfolding of the model system (2) as discussed in [49,50,51]. First, we give a small perturbation to bifurcation parameters A and \(\alpha \) by \( A = \left( A_{0} + \lambda _{1}\right) \) and \(\alpha = \left( \alpha _{0} + \lambda _{2} \right) \) where \( |\lambda _{i}|<<1, i=1,2\) around their critical values then the model system (2) becomes:

$$\begin{aligned} \dfrac{\mathrm{d}x}{\mathrm{d}t}&= \left( A_{0} + \lambda _{1}\right) -\dfrac{\beta xy}{1 + \left( \alpha _{0} + \lambda _{2}\right) y^{2}}-\left( d+u_{1}\right) x \nonumber \\ \dfrac{\mathrm{d}y}{\mathrm{d}t}&= \dfrac{\beta xy}{1+ \left( \alpha _{0} + \lambda _{2}\right) y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y} \end{aligned}$$
(35)

Now translating endemic equilibrium point \(E(x^*,y^*)\) to origin using the transformation \( X = x - x^*\), Y = \(y - y^*\), then system (35) becomes

$$\begin{aligned} \dfrac{\mathrm{d}X}{\mathrm{d}t}&= \left( \left( A_{0} + \lambda _{1}\right) -\dfrac{\beta x^* y^*}{1 + \left( \alpha _{0} + \lambda _{2}\right) {y^*}^{2}}-\left( d+u_{1}\right) x^*\right) \nonumber \\&\quad + b_{1} X + b_{2} Y + b_{3} X Y + b_{4} Y^ 2 + \phi _{1} ( X, Y, \lambda _{1}, \lambda _{2}) \nonumber \\ \dfrac{\mathrm{d}Y}{\mathrm{d}t}&= \left( \dfrac{\beta x^* y^*}{1+ \left( \alpha _{0} + \lambda _{2}\right) {y^*}^{2}}-(d+\mu +\delta )y^*-\dfrac{au_{2}y^*}{1+bu_{2}y^*} \right) \nonumber \\&\quad + b_{5} X + b_{6} Y - b_{3} X Y + b_{7} Y^ 2 + \phi _{2} ( X, Y, \lambda _{1}, \lambda _{2}) \end{aligned}$$
(36)

where \(\phi _{i}(X, Y), i = 1, 2\) is a smooth function in (XY) at least of order 3 and the coefficients \( b_{i} \) are given in “Appendix-II” (using maple software [46] ).

Letting \( u = Y\),

$$\begin{aligned} v&= \left( \dfrac{\beta x^* y^*}{1+ \left( \alpha _{0} + \lambda _{2}\right) {y^*}^{2}}-(d+\mu +\delta )y^*-\dfrac{au_{2}y^*}{1+bu_{2}y^*} \right) \\&\quad + b_{5} X + b_{6} Y - b_{3} X Y + b_{7} Y^ 2 + \phi _{2} ( X, Y, \lambda _{1}, \lambda _{2}), \end{aligned}$$

we get

$$\begin{aligned} \dfrac{\mathrm{d}u}{\mathrm{d}t}&= v \nonumber \\ \dfrac{\mathrm{d}v}{\mathrm{d}t}&= d_{0} + d_{1} u + d_{2} v + d_{3} u ^2 + d_{4} u v + d_{5} v^ 2\nonumber \\&\quad +\, \phi _{3} ( u, v, \lambda _{1}, \lambda _{2}) \end{aligned}$$
(37)

where \(\phi _{3}(u, v, \lambda _{1}, \lambda _{2})\) is a smooth function in (uv) at least of order 3 and the coefficients \(d_j \)’s are given in “Appendix-III.”

Next, we introduce a new time variable \(\tau \) by \( \mathrm{d}t = (1 - d_{5} u) \ \mathrm{d} \tau \), and rewriting \(\tau \) as t, we get from (37)

$$\begin{aligned} \dfrac{\mathrm{d}u}{\mathrm{d}t}&= (1 - d_{5} u ) v \nonumber \\ \dfrac{\mathrm{d}v}{\mathrm{d}t}&= (1 - d_{5} u ) (d_{0} + d_{1} u + d_{2} v + d_{3} u ^2 \nonumber \\&\quad +\, d_{4} u v + d_{5} v^ 2 + \phi _{3} ( u, v, \lambda _{1}, \lambda _{2}) ) \end{aligned}$$
(38)

Let \( X = u, Y = (1 - d_{5} u) v\), then system (38) reformats as,

$$\begin{aligned} \dfrac{\mathrm{d}X}{\mathrm{d}t}&= Y \nonumber \\ \dfrac{\mathrm{d}Y}{\mathrm{d}t}&= d_{0} + e_{1} X + d_{2} Y + e_{2} X ^2 + e_{3} X Y + \phi _{4} ( X, Y, \lambda _{1}, \lambda _{2}) \end{aligned}$$
(39)

where \(\phi _{4}( X, Y, \lambda _{1} , \lambda _{2})\) is a smooth function in (XY) at least of order 3 and \(e_{1}\) = \(( \ d_{1} - 2 \ d_{0} \ d_{5} \ )\), \(e_{2}\) = \(( \ d_{3} - 2 \ d_{1} \ d_{5} \ + d_{0} \ d_{5} ^2 \ )\), \(e_{3}\) = \(( \ d_{4} - \ d_{2} \ d_{5} \ )\). Now, we set \( x = \left( X + \ \dfrac{d_{2}}{e_{3}} \right) , y = Y\), then the system (44) takes the form :

$$\begin{aligned} \dfrac{\mathrm{d}x}{\mathrm{d}t}&= y \nonumber \\ \dfrac{\mathrm{d}y}{\mathrm{d}t}&= f_{1} + f_{2} x + e_{2} x^2 + e_{3} x y + \phi _{4} ( x, y, \lambda _{1} , \lambda _{2}) \end{aligned}$$
(40)

where \(\phi _{4}(x, y, \lambda _{1}, \lambda _{2})\) is a smooth function in (xy) at least of order 3 and \(f_{1}\) = \( d_{0} - \dfrac{e_{1} \ d_{2}}{e_{3}} + \dfrac{e_{2} \ d_{2}^2}{e_{3}^2}\) , \(f_{2} = e_{1} - \dfrac{ 2 \ d_{2} \ e_{2}}{e_{3}}.\) Finally, we change variables using X = \(\dfrac{e_{3}^2}{e_{2}} \ x, \)Y = \(\dfrac{e_{3}^3}{e_{2}^2} \ y \), \(\tau \) = \(\dfrac{e_{2}}{e_{3}} \ t \) and denoting \(\tau \) by t, system (40) becomes

$$\begin{aligned} \dfrac{\mathrm{d}X}{\mathrm{d}t}&= Y \nonumber \\ \dfrac{\mathrm{d}Y}{\mathrm{d}t}&= \psi _{1}(\lambda _{1} , \lambda _{2}) + \psi _{2}(\lambda _{1} , \lambda _{2}) + X ^2 + X Y \nonumber \\&\quad + \phi _{5} ( X, Y, \lambda _{1} , \lambda _{2}) \end{aligned}$$
(41)

where \(\phi _{5}( X, Y, \lambda _{1} , \lambda _{2})\) is a smooth function in (X , Y ) at least of order 3 and \(\psi _{1}(\lambda _{1}, \lambda _{2})\) = \(\dfrac{f_{1} \ e_{3}^4}{e_{2}^3}\),

\(\psi _{2}(\lambda _{1} , \lambda _{2})\) = \(\dfrac{f_{2} \ e_{3}^2}{e_{2}^2}\). If the transversality condition

$$\begin{aligned} \begin{vmatrix} \dfrac{ \partial \psi _{1}(\lambda _{1} , \lambda _{2})}{\partial \lambda _{1} }&\dfrac{ \partial \psi _{1}(\lambda _{1} , \lambda _{2})}{\partial \lambda _{2} }\\ \dfrac{ \partial \psi _{2}(\lambda _{1} , \lambda _{2})}{\partial \lambda _{1} }&\dfrac{ \partial \psi _{2}(\lambda _{1} , \lambda _{2})}{\partial \lambda _{2} } \end{vmatrix} \ne 0 \end{aligned}$$
(42)

holds, then the system (35) undergoes through Bogdanov–Takens bifurcation when \( (\lambda _1,\lambda _2) \) is in the small neighborhood of (0, 0) . The local representation of the bifurcation curves are given below:

  1. (1)

    The saddle-node bifurcation curve is given by \({\mathrm{SN}} = \lbrace ( \lambda _{1} , \lambda _{2})\): \(\psi _{1} ( \lambda _{1} , \lambda _{2}) = 0 , \psi _{2} (\lambda _{1} , \lambda _{2}) \ne 0 \rbrace \).

  2. (2)

    The Hopf bifurcation curve is given by \(H = \lbrace ( \lambda _{1} , \lambda _{2})\): \(\psi _{1} ( \lambda _{1} , \lambda _{2}) = - \ \psi _{2} ^2 (\lambda _{1} , \lambda _{2}) ,\ \psi _{2} (\lambda _{1} , \lambda _{2}) > 0 \rbrace \).

  3. (3)

    The Homoclinic bifurcation curve is given by \(\mathrm{HL} = \lbrace ( \lambda _{1} , \lambda _{2})\) : \(\psi _{1} (\lambda _{1} , \lambda _{2})\) = \(- \dfrac{49}{25} \ \psi _{2} ^2 (\lambda _{1} , \lambda _{2}) + o(\psi _{2}^\frac{5}{2}),\ \psi _{2} (\lambda _{1} , \lambda _{2}) > 0 \rbrace \).

6 Numerical simulation

In this section, we shall justify the theoretical findings of our proposed model using some numerical approaches. For this purpose, first, we draw one parameter bifurcation diagram w.r.t. \(R_0\) to justify the Backward bifurcation (see Fig. 2).

From Fig. 2, it is clear that disease-free equilibrium point is stable if \(R_0 < 1\), and it is unstable for \(R_0 > 1\). Further, for \(R_0 < 1\), there exists a range \(R_{0}^{*}< R_{0} < 1\) where two endemic equilibrium points exist. For \(R_0 < 1\), the endemic equilibrium point with lower infected density is unstable and the endemic equilibrium point with higher infected density is stable. Therefore for \(R_0 < 1\), there exists bistability of equilibrium points (one stable disease-free equilibrium point and one stable endemic equilibrium point). At \(R_0 = 1\), disease-free equilibrium changes its stability from stable to unstable with creation of one stable endemic equilibrium point; therefore, model system (2) undergoes through transcritical bifurcation at disease-free equilibrium for \(R_0 = 1\).

Next, we draw schematic bifurcation diagram for Bogdanov–Takens bifurcation (using [52]) as given in Fig. 3 for model system (2) which divides first quadrant of A\(\alpha \) plane (considering other parameters values in Table 2) in six sub-regions, namely \(R_{1}, R_{2}, R_{3}, R_{4}, R_{5}, R_{6}\). In Fig. 3, the red line is saddle-node bifurcation (SN) line, black line represents Hopf bifurcation (H) line, cyan line indicates homoclinic bifurcation (HL) line, and magenta line denotes transcritical bifurcation line. Now, we draw the phase portraits for each of the sub-region of A\(\alpha \) plane considering different parameter values for interpreting number of equilibrium points and their behavior.

Table 2 Model parameters and their respective values
Fig. 2
figure 2

Backward bifurcation diagram with respect to \( R_{0} \) and other parameters are given in Table 2 with \(A = 3.24\), \(\alpha = 0.35\), the blue line corresponds to stable branch and red line is unstable

Fig. 3
figure 3

Schematic bifurcation diagram in A\(\alpha \) plane fixing other variables as given in Table 2

In region \(R_{1}\), there is one disease-free equilibrium \((E_{0})\) point which is stable node (see the Fig. 4a). From epidemiological point of view, this region is most important because disease is not persisting here.

Next, we consider the parameter values on saddle-node line (the red line). For parametric values on the SN-line, along with the stable disease-free equilibrium point, one coincident endemic equilibrium point arises here which is unstable (saddle point) in nature (see Fig. 4b). Since in this case, disease-free equilibrium is stable, disease eradicates from the population, and also endemic equilibrium is unstable; therefore, disease does not persist in the community.

Fig. 4
figure 4

Phase portrait for the parameter values a In region \(R_{1}\) (\(A= 3.22\), \(\alpha = 0.5 \)) b On Saddle-node line(\(A =3.22\), \(\alpha = 0.3344\)) c In region \(R_{2}\) (\(A= 3.25\), \(\alpha = 0.35 \)) d In region \(R_{3}\) (\(A= 3.25\), \(\alpha = 0.12 \)) and other parameters are given in Table 2

Now, crossing SN-line, we enter in region \(R_{2}\) from region \(R_{1}\). In the region \(R_{2}\), there exists two endemic equilibrium points and the stable disease-free equilibrium point (\(E_{0}\)). Among endemic equilibrium points, one is stable spiral and another is saddle (see the phase portrait in Fig. 4c). Therefore, in \(R_{2}\) region, there exist bi-stability behavior of equilibrium points, one is disease-free equilibrium and other stable point is endemic equilibrium point. Stable manifold of saddle endemic equilibrium point (the green line in Fig. 4c) divides the whole \(R_{2}\) region into two sub-region. We see, if we take initial point inside the basin of attractor then solution of curve approaches to stable endemic equilibrium point, and if we take the initial point outside, the basin of attractor then solutions are converging to disease-free equilibrium point. Thus, we can conclude that for lower or much higher number of infected population at an initial stage in a system, disease will be eliminated and for a moderate number of infected population at an initial stage disease will persist in the system.

Fig. 5
figure 5

Phase portrait for the parameter value a On Homoclinic line (\(A= 3.25\), \(\alpha = 0.054 \)) b In region \({R_{4}}\) (\(A=3.22\), \(\alpha = 0.1 \)) c In region \(R_{5}\) (\(A= 3.31\), \(\alpha = 0.4\)) d In region \(R_{6}\) (\(A= 3.306\), \(\alpha = 0.003\)) and other parameters are given in Table 2

Next, we enter region \(R_{3}\) by crossing Hopf bifurcation line (black line) from region \(R_{2}\). Here, we see that the number of equilibrium points is same as in region \(R_{2}\) . But stable endemic equilibrium point becomes unstable and there arises a stable limit cycle. Here, also bistability arises: one is the stable disease free equilibrium and other is stable limit cycle around the endemic equilibrium point (see the phase portrait in Fig. 4d). Stable manifold divides basin of attractor into two region, outside basin of attractor disease will eradicate and inside basin of attractor number of infected oscillates which means it is difficult to control the disease. That means for elimination of disease our aim is to minimize the density of infected individuals such that at any time it falls outside the basin of attraction.

Now, we consider parameter values on the homoclinic line (cyan line). For these values of the parameters, the stable limit cycle connect the saddle endemic point, therefore a homoclinic loop appears (See the phase portrait in Fig. 5a). Nature of other equilibrium points remain same. Thus, if initial density of populations are inside homoclinic loop, disease will persist in the system with oscillatory behavior and for lying outside homoclinic loop disease will eliminate from the system.

Next, we enter in region \(R_{4}\) by crossing homoclinic bifurcation line from region \(R_{3}\). In region \(R_{4}\), stable limit cycle vanishes through homoclinic bifurcation, and it becomes unstable spiral. Only disease-free equilibrium is stable here. If we take initial point anywhere in region \(R_{4}\), solution of the system (2) converges to disease-free equilibrium, i.e., disease-free equilibrium is globally stable here (see the phase portrait in Fig. 5b). That means disease will eliminate in the system if we choose the parameters in this region.

Now, crossing transcritical bifurcation line (magenta line), we enter in region \(R_{5}\) from region \(R_{2}\). Here saddle endemic equilibrium disappears, other endemic equilibrium point stay stable and the disease-free equilibrium becomes unstable (saddle point). As there is no any other stable equilibrium, the endemic equilibrium point is globally stable here (see the phase portrait in Fig. 5c). From biological point of view, this region is dangerous because disease permanently stay in the system.

At last, we enter in region \(R_{6}\) from region \(R_{5}\) by crossing Hopf bifurcation line (black line). Here, disease-free equilibrium is unstable (saddle point) and endemic equilibrium is unstable spiral. A stable limit cycle arises around endemic equilibrium (see the phase portrait in Fig. 5d). Therefore, disease will persist here like the region \( R_5\) but the density of the infected population will oscillate in this case.

7 Sensitivity analysis

By sensitivity analysis, we can identify the influential model parameters which have more impact on the basic reproduction number \(R_0\) of the proposed model. The model parameters having higher sensitive index have more significant role in disease spreading dynamics among the population. By controlling such more sensitive parameters, the impact of the infection can be diminished or controlled. In order to identify such sensitive model parameters, we need to estimate the variation of basic reproduction number \(R_0\) with respect to different model parameters. For calculating sensitivity index, we use normalized forward sensitivity index method [53,54,55]. According to normalized forward sensitivity index theory, sensitivity index of the basic reproduction number \(R_0\) with respect to model parameter \(\phi \) is defined by \(\varGamma ^{\phi }_{R_0} = \dfrac{\partial R_0}{\partial \phi }.\dfrac{\phi }{R_0}.\) Positive sign of the sensitive index of the model parameters indicates basic reproduction number increases with model parameter increases and negative sign indicates basic reproduction number decreases with model parameter increases. In our proposed model most sensitive model parameters are susceptible recruitment rate(A), disease transmission rate \((\beta )\), treatment control parameter \((u_2)\), vaccinated control parameter \((u_1)\), and cure rate (a). We enlist sensitivity indexes of all model parameters in Table 3.

Table 3 Sensitivity indexes of the model parameters
Fig. 6
figure 6

Diagram of sensitivity index of each model parameter on \(R_0\) using PRCC

To perform sensitivity analysis numerically, we have used partial rank correlation coefficient (PRCC) method [56]. This method correlates between output and input. Here, basic production number (\(R_0\)) is treated as output, and all model parameters are treated as inputs. PRCC measures robustness of sensitivity to determine the monotonic and nonlinear relationship between output and input; therefore, it plays a crucial role in sensitivity analysis. To carry out PRCC, we have taken sample size 100 and the corresponding PRCC diagram is placed in Fig. 6. From Fig. 6, it is clear that PRCC values lie from \(-1\) to \(+1\), which indicates that corresponding model parameters induce positive or negative impact on output \(R_0\). The PRCC values of model parameters imply how much model parameters are sensitive to \(R_0\) and the sign assigns a relation between model parameters (input) and \(R_0\) (output). Positive sign implies relation between input and output is proportional and negative sign implies relation is disproportional. It is obvious from Fig. 6 that model parameters A, \(\beta \) have positive influence on \(R_0\) and model parameters \(d, u_1, \mu , \delta , a, u_2\) have negative influence on \(R_0\).

8 Application of optimal control technique

Our intention is to prevent spread of infection and eradicate disease from the population. For this purpose, we need to adopt suitable treatment policy and perfect vaccination campaign. In this paper, we consider two controls namely vaccination \((u_1)\) and treatment \((u_2)\). Now, our target is to identify best control among vaccination and treatment, i.e., we try to find out an optimal values of applied controls. Therefore, we have to use optimal control theory to get optimal system. To verify the existence of optimal control problem, we use Pontryagin’s maximum principle. The main motive for applying optimal control is to decrease infected population, at the same time reduce applied control cost [32]. For analyzing purpose, we consider controls as constant but implemented cost may be high for constant controls; therefore, we have to use control functions as function of time t. In theory of optimal control, we consider time-dependent control functions. Now, we formulate an optimal control problem.

8.1 Formulation of optimal control problem

In this section, we formulate an optimal control problem of the proposed model. First, we consider an cost functional which will be minimum in the considered interval. Here, we construct integral of the cost functional as L(wu) = \(A_1 x + A_2 y + \dfrac{1}{2} B_1 u_1^2 + \dfrac{1}{2} B_2 u_2^2\), where L(wu) is Lagrangian of optimal control problem. Therefore, optimal control problem is given by:

$$\begin{aligned}&{\text {Min}}\ J(x,u) = \int _{0}^{T} L(t, w(t), u(t)) {\mathrm{d}}t \nonumber \\&{\text {subject to}} \nonumber \\&w^{'}(t) = f(w(t)) + g(w(t))u(t), \forall t\in [0, T] \nonumber \\&u(t) \in U(t), \forall t\in [0, T]\nonumber \\&w(0) = w_{0} \end{aligned}$$
(43)

where

$$\begin{aligned}&w(0)=\left( x(0),y(0),z(0) \right) \ge 0, w(t) = \left( \begin{array}{c} x(t) \\ y(t) \\ z(t) \end{array} \right) ,\\&u(t)= \left( \begin{array}{c} u_1(t) \\ u_2(t) \end{array} \right) , g(w) = \left( \begin{array}{cc} - x &{} 0 \\ 0 &{} - \dfrac{a y}{1 + b u_{2} y} \\ x &{} \dfrac{a y}{1 + b u_{2} y} \end{array} \right) , \\&f(w)= \left( \begin{array}{c} A - \dfrac{\beta x y}{1 + \alpha y^2} - d x \\ \dfrac{\beta x y}{1 + \alpha y^2} - (d + \mu + \delta )y \\ \mu y - d z \end{array} \right) . \end{aligned}$$

In L(tw(t), u(t)), \( A_1(> 0),\) \(A_2(> 0) \) are loss due to susceptible and infected populations, respectively, whereas \(B_1, B_2\) are weight costs for control functions \(u_1(t),\) \(u_2(t),\) respectively. Here, we consider quadratic term in control to present severity of side effect of vaccination and treatment [57]. The control functions are Lebesgue measurable which are given by,

$$\begin{aligned} U = \left\{ (u_1, u_2): 0\le u_i \le 1, i= 1,2 \; {\text {and}} \; t\in [0, T]\right\} \end{aligned}$$
(44)

8.2 Existence and uniqueness of optimal control problem

Theorem 15

For given an optimal control problem (43) subject to (1) admits an optimal pair \(u_1^*(t)\) and \(u_2^*(t)\) such that

\(J(u_1^{*}(t), u_2^{*}(t)) = min \left\{ J(u_1(t), u_2(t)): (u_1, u_2)\in U \right\} \) where U is given by (44).

Proof

The state and control variables are non-empty and also nonnegative. Control constraint U is also convex.

Adding three equations of model (1), we have

$$\begin{aligned}&\frac{\mathrm{d}N}{\mathrm{d}t} = A - \delta y - d N, \ {\text {where}} \ N = x + y + z \\&{\text {i.e.,}}\; \frac{\mathrm{d}N}{\mathrm{d}t} \le A - d N \end{aligned}$$

Therefore, \(N(t) \le \dfrac{A}{d} + \left( N_0 - \dfrac{A}{d} \right) e^{- d t}\), i.e., \(N(t) \longrightarrow \dfrac{A}{d}\) as \(t \longrightarrow \infty \), i.e., all state variables are bounded. Also integrand of objective functional \(J(u_1(t), u_2(t))\) is convex w.r.t. \(u_1\) and \(u_2\).

Now, we rewrite system (1) as:

\(\phi _{t}(t) = E \phi + F(\phi )\) where \(\phi (t) = \left( \begin{array}{c} x(t) \\ y(t) \\ z(t) \end{array} \right) ,\) \(\phi _{t}(t) = \left( \begin{array}{c} x^{'}(t) \\ y^{'}(t) \\ z^{'}(t) \end{array} \right) ,\) \(F(\phi ) = \left( \begin{array}{c} - \dfrac{\beta x y}{1 + \alpha y^2} \\ \dfrac{\beta x y}{1 + \alpha y^2} \\ 0 \end{array} \right) ,\)

\(E = \left( \begin{array}{ccccc} - (\mu + u_1) &{} 0 &{} 0 \\ 0 &{} -(d + \mu + \delta ) - \dfrac{a u_2}{1 + b u_{2}y} &{} 0 \\ u_1 &{} \dfrac{a u_2}{1 + b u_{2}y}+ \mu &{} - d \end{array} \right) \). Therefore,

$$\begin{aligned}&|| F(\phi _1) - F(\phi _2)|| \\&= \left|\dfrac{\beta x_2 y_2}{1 + \alpha y^2_2}- \dfrac{\beta x_1 y_1}{1 + \alpha y^2_1} \right|+ \left|\dfrac{\beta x_1 y_1}{1 + \alpha y^2_1} - \dfrac{\beta x_2 y_2}{1 + \alpha y^2_2} \right|\\&\le | \beta x_2 y_2 - \beta x_1 y_1| + | \beta x_1 y_1 - \beta x_2 y_2 | \\&\le 2 \beta |x_1 y_1 - x_2 y_2 | \le 2 \beta |y_1||x_1-x_2|+ 2 \beta |x_2||y_1-y_1|, \end{aligned}$$

i.e., \(|| F(\phi _1) - F(\phi _2) || < C || \phi _1 - \phi _2 || \ where \ C= 2 \beta \).

Now, if \(G(\phi )=E \phi + F(\phi )\), then \(||G(\phi _1) - G(\phi _2) || \le || E|| || \phi _1 - \phi _2 || + C || \phi _1 - \phi _2 || \le B || \phi _1 - \phi _2 ||, \ where\ \left( || E || + C \right) \le B < \infty \). Thus, Lipschitz condition is satisfied for all state variables. Hence, there exists optimal control pair \(u_1^*(t)\) and \(u_2^*(t)\) such that

$$\begin{aligned} J(u_1^{*}(t), u_2^{*}(t)) = min \left\{ J(u_1(t), u_2(t)): (u_1, u_2)\in U \right\} . \end{aligned}$$
Fig. 7
figure 7

Time series of the population with control (blue line), without control (red line) and control variables; a Susceptible population b Infected population c Recovered population d \(Control u_{1}\) e \(Control u_{2}\). (Color figure online)

This completes proof of the theorem. \(\square \)

8.3 Characterization of optimal control problem

Theorem 16

Optimal control pair \(u_1^*(t),\) \(u_2^*(t)\) which minimizes \(J(u_1(t), u_2(t))\) over U is given by

$$\begin{aligned} u_1^*(t)= & {} \min \left( \max \left( 0, \dfrac{(\rho _1 - \rho _3)x}{B_1} \right) , 1 \right) ,\\&u_2^*(t) = \min \left( \max \left( 0, {\overline{u}}_{2} \right) , 1 \right) , \end{aligned}$$

where \({\overline{u}}_{2}\) is the nonnegative root of \(B_2 u_2 ( 1 + b u_2 y)^2 = a y(\rho _2 - \rho _3)\).

Proof

For characterization of optimal control problem, we apply Pontryagin’s maximum principle [13, 58]. First, we define Hamiltonian H as

$$\begin{aligned}&H(x, y, z, u_1, u_2, \rho _1, \rho _2, \rho _3) = A_1 x + A_2 y \\&\quad +\, \dfrac{1}{2} B_1 u_1^2 + \dfrac{1}{2} B_2 u_2^2 + \rho _1 \left\{ A-\dfrac{\beta xy}{1+\alpha y^{2}}-\left( d+u_{1}\right) x \right\} \\&\quad +\, \rho _2 \left\{ \dfrac{\beta xy}{1+\alpha y^{2}}-(d+\mu +\delta )y-\dfrac{au_{2}y}{1+bu_{2}y} \right\} \\&\quad +\, \rho _3 \left\{ \dfrac{au_{2}y}{1+bu_{2}y}+\mu y+u_{1}x-dz \right\} \end{aligned}$$

satisfying

$$\begin{aligned} \dfrac{\mathrm{d} \rho _1(t)}{\mathrm{d}t} = - \dfrac{\partial H}{\partial x}, \dfrac{\mathrm{d} \rho _2(t)}{\mathrm{d}t} = - \dfrac{\partial H}{\partial y}, \dfrac{\mathrm{d} \rho _3(t)}{\mathrm{d}t} = - \dfrac{\partial H}{\partial z} \end{aligned}$$
(45)

with \(\rho _i(T) = 0, i = 1, 2, 3\).

Solving (45), we observe adjoint variables \(\rho _1, \rho _2, \rho _3\) satisfy

$$\begin{aligned} \dfrac{\mathrm{d} \rho _1}{\mathrm{d}t}&= - A_1 + (\rho _1 - \rho _2) \dfrac{\beta y}{(1 + \alpha y)^2} + (\rho _1 - \rho _3)u_1 + \rho _1 d \nonumber \\ \dfrac{\mathrm{d} \rho _2}{\mathrm{d}t}&= - A_2 + (\rho _1 - \rho _2) \beta x \dfrac{1 - \alpha y^2}{(1 + \alpha y^2)^2} + \rho _2 (d + \delta )\nonumber \\&\quad + (\rho _2 - \rho _3) \dfrac{a u_2}{(1 + b u_2 y)^2}+ (\rho _2 - \rho _3) \mu \nonumber \\ \dfrac{\mathrm{d} \rho _3}{\mathrm{d}t}&= \rho _3 d \end{aligned}$$
(46)

with

$$\begin{aligned} \rho _1(T) = 0, \rho _2(T) = 0, \rho _3(T) = 0. \end{aligned}$$
(47)

At \((u_1^{*}, u_2^{*})\), we have \(\dfrac{\partial H}{\partial u_1} = 0,\) \(\dfrac{\partial H}{\partial u_2} = 0\). Thus, we get \(u_1^*(t) = \min \left( \max \left( 0, \dfrac{(\rho _1 - \rho _3)x}{B_1} \right) , 1 \right) \), \(u_2^*(t) = \min \left( \max \left( 0, {\overline{u}}_{2} \right) , 1 \right) \), where \({\overline{u}}_{2}\) is the nonnegative root of \(B_2 u_2 ( 1 + b u_2 y)^2 = a y(\rho _2 - \rho _3)\).

Now, \(\dfrac{\partial ^{2} H}{\partial u_{1}^{2}} > 0\), \(\dfrac{\partial ^{2} H}{\partial u_{2}^{2}} > 0\) and \(\dfrac{\partial ^{2} H}{\partial u_{1}^{2}}\). \(\dfrac{\partial ^{2} H}{\partial u_{2}^{2}} > \left( \dfrac{\partial ^{2} H}{\partial u_{1} \partial u_{2}} \right) ^2 \) at \((u_{1}^{*}, u_{2}^{*})\).

Hence, optimality condition is satisfied at \(u^*(t) = (u_1{^*}(t), u_2{^*}(t))\). \(\square \)

For numerical simulation of optimal control problem, here we use forward–backward sweep method where in this method forward application of fourth-order Runge–kutta method of system (1) is combined with backward application of fourth-order Runge–kutta method of system (46) with transversality condition (47). We take time interval as [0, 20], i.e., after 20 units time all controls will be stopped. To draw numerically, we use parameter values as in Table 2 with \(A = 100\), \(\alpha = 0.5\) and \(A_1 = 0.05\), \(A_2 = 0.05\), \(B_1 = 0.6\) and \(B_2 = 0.8\) with initial conditions \(x(0) = 100\), \(y(0) = 10\), and \(z(t) = 0\). In Fig. 7a–c, solution curves of susceptible, infected and recovered populations, respectively, are given with and without controls and time series of control \(u_1\), \(u_2\) are shown in Fig. 7d and e. Our proposed model can be used for describing disease dynamics such as SARS, MERS, Dengue disease spreading. Since these types of disease spread rapidly in community at first, but after someday, the disease transmission starts to decrease; therefore, the consideration of such model is appropriate for such diseases.

9 Efficiency analysis

Efficiency analysis is performed to find out best effective strategy when more than one control is applied. In this paper, we consider two controls namely vaccination for susceptibles \((u_1)\) and treatment for infectives \((u_2)\). Now, we have to know the effective strategy among applied controls to reduce the density of infectives and minimize the implemented control cost. So now, we perform efficiency analysis. Since two controls are used, therefore three cases may arise: (i) \(u_1 \ne 0, u_2 \ne 0\) (ii) \(u_1 \ne 0, u_2 = 0\) (iii) \(u_1 = 0, u_2 \ne 0\). We call this cases as Strategy 1, Strategy 2 and Strategy 3, respectively. Calculating efficiency index, we compare the applied strategies. The strategy with higher index is the best strategy [59]. Efficiency index (E. I.) is \(\left( 1 - \dfrac{A^{c}}{A^{0}} \right) \times 100\) where \(A^0\) and \(A^c\) denote cumulative density of infected population without and with controls, respectively. We calculate value of \(A^0\) and \(A^c\) using Simpson’s \(\dfrac{1}{3}\) rule. The value of \(A^0\) is determined by \(A^{0} = \int _{0}^{20} y(t) {\mathrm{d}}t\). We get \(A^0 = 495.0097\) and values of \(A^c\) and E.I. are given in Table 4.

Table 4 Values of efficiency indexes

From our analysis, Strategy 1 is the best strategy among three strategies that if we give both vaccine and treatment to population then disease will easily die out. But between vaccination and treatment, treatment is the more effective strategy, i.e., biologically if we use treatment to infected population, then density of infectives is more reduced.

10 Conclusion

In this paper, we have considered a deterministic epidemic model with constant birth rate and a non-monotone incidence rate affected by inhibitory factors, crowding effects, etc. Saturated type of treatment rate is taken into consideration for limited hospital resources. To discuss the complex dynamical behavior of the model system, we compute basis reproduction number \(R_{0}\), which has crucial role for eradicating the disease infection. We have established that when basic reproduction number is less than unity, then the disease will eradicate from the system for any initial population size, i.e., disease-free equilibrium is globally asymptotically stable under certain parametric restriction. Further we have seen that for \(R_{0} > 1\), endemic equilibrium point is globally asymptotically stable under some restrictions, i.e., disease will persist in the system. We use center manifold theorem to study the dynamical behavior of the disease-free equilibrium point at \(R_0 = 1.\) In our study, we observe that disease-free equilibrium point may be a saddle-node of co-dimension 1 or may be a semi-hyperbolic attracting node of co-dimension 2.

For discussing complex dynamical phenomenon, we examine the transcritical bifurcation, backward bifurcation, saddle-node bifurcation, Hopf bifurcation and Bogdanov–Takens bifurcation of co-dimension 2. When backward bifurcation occurs, then the system shows bistability, one is stable disease-free equilibrium point and other is the stable endemic equilibrium point. By Hopf bifurcation, a stable limit cycle arises around the endemic equilibrium point. There exists a basin of attractor inside which all solution curves trend to that limit cycle and outside which all solution curves trend to disease-free equilibrium point which indicates disease oscillates inside the basin of attractor and outside of the basin of attractor disease eradicates from the community. Depending on the parameter value, the system experiences the homoclinic bifurcation, i.e., the stable limit cycle connecting the saddle point disappear through homoclinic bifurcation. Finally, numerical simulations are used to justify the theoretical findings. The numerical simulations show that there is a threshold range of the initial density of the infected population outside of which the disease will eradicate from the population.

Finally sensitivity analysis is performed to find out the influential model parameters which have most impact on the basic reproduction number \(R_0.\) In order to identify most significant model parameters, we estimate variation of the basic reproduction number \(R_0\) with respect to different model parameters. To control or eradicate the influence of the emerging disease, we need to take proper preventive measures to control the most sensitive model parameters. In our study, we observe that disease can be controlled or eradicated in some regions. Also there is some region where disease persists in the population. Optimal control is studied to reduce the density of infected population and minimize control cost. Using efficiency analysis, we identify most effective strategy. In our study, the most effective strategy is treatment strategy. Therefore, if we use treatment to infected population, then density of infected population is more reduced. In our view, study of controls may give some light in the layout.