1 Introduction

It has been nearly thirty years since the first cases of human immunodeficiency virus (HIV) garnered the world’s attention. Without treatment and awareness, the virus slowly debilitates a person’s immune system until they fall prey to illness. The epidemic has claimed the lives of nearly 35.0 million worldwide [1] and has affected many more. In India, approximately 3.5 million [3.3 to 4.1 million] people are acquiring HIV every year and more than 1.93 to 3.04 million people [2] are living with HIV. Unless we take bold actions, however, we anticipate a new era of rising infections and even greater challenges in serving people living with HIV and higher health care costs.

HIV/AIDS epidemic is a serious, growing public health problem worldwide. The cause is known and the principal routes of transmission understood, but resources for treating HIV infected patients and for combating the spread of the virus are limited. Though the best way of disease control is mass vaccination, but for the case of HIV, the usage of vaccination is very costly and conferred immunities are temporary. We have observed that basic mathematical models [38] mainly deal with the interaction between susceptible and infective individuals. In view of controlling HIV/AIDS, there are many antiretroviral therapies (ART) available nowadays [914] which help the immune system in preventing the infection even though it is not possible to cure it. There are many other factors such as unsafe sex and low condom usage, injecting drug use, widespread stigma, education, migration and mobility etc. which also affect the wideness of this infectious disease. Awareness programs introduce people with the disease and help them to take precautions to reduce the chances of being infected. Moreover, it is important to mention that awaking people through media in the population results in less interaction between susceptible and infected individuals, which lowers the disease transmission rate into susceptible individuals. So, awareness factor can be considered as a strong tool against the expansion of HIV/AIDS. Saying particulary, awareness factor has a great visitation not only on the behavioral changes in individuals, but it also helps governmental health care interventions to control the spread of the disease HIV/AIDS.

Many mathematical models have been developed to monitor HIV/AIDS and explore the impact of intervention strategies that are being implemented. Misra et al. [15, 16] studied the effect of awareness programs through media on disease dynamics in a variable population with immigration. Van Segbroeck et al. [17] analytically studied the disease dynamics of a well-mixed population with rescaled infectiousness where the contact network reshaping occurs much faster than disease spreading. Recently, Samanta and Chattopadhyay [18] have studied a slow-fast dynamics with the effect of awareness program in disease outbreak.

In this present study, we have analyzed a simple SI network epidemic model to study the impact of awareness programs conducted through media campaigning on the spread of HIV/AIDS in a variable population with immigration. However, these results will fall into the non-network epidemic models category. We assume that the growth rate of the cumulative density of awareness programs driven by media is proportional to the number of the infective present in the population. We also assume that the awareness programs against the disease will alert the susceptible to isolate themselves from infective individuals and to form a separate class. We have formulated the mathematical model and also discuss the equilibrium points and their stability. Lastly, we use an optimal control theory paradigm to our mathematical model, in which the contact processes between unaware susceptible and infected classes with aware susceptible and infected classes are controlled. We also solve the model numerically and then discuss the analytical and numerical results according to biological aspect.

2 Formulation of mathematical model with basic assumptions

We consider a basic HIV model

$$ \begin{aligned} &\frac{dS}{dt}= \Pi-\beta S I-d S, \\ &\frac{dI}{dt}=\beta S I-(d+\delta_{I}) I, \end{aligned} $$
(1)

where \(S(t)\) and \(I(t)\) are the densities of the susceptible and infected populations, respectively, in the region under consideration at any time t. Therefore, the total population is \(N(t)=S(t)+I(t)\). Π is the constant recruitment rate in the susceptible population either by birth or immigration; β is the disease transmission rate; d is the natural death rate of the population and \(\delta_{I}\) is the additional death rate due to HIV infection. It is assumed that the disease spreads due to direct contact between the susceptible and the infective only.

Furthermore, the total susceptible and infected, each population is divided into two sub-classes such that \(S(t)=S_{-}(t)+S_{+}(t)\) and \(I(t)=I_{-}(t)+I_{+}(t)\) due to the awareness programs driven by the media \(M(t)\), where the ‘−’ sign denotes the unaware class and ‘+’ sign represents the aware class. As the awareness disperses, people respond to it and change their behavior to alter their susceptibility. It is also assumed that due to awareness programs, aware susceptible and infected individuals avoid being in contact with the infective, form separate ‘aware classes’ (\(S_{+}(t)\) and \(I_{+}(t)\)) and do not get involved in sexual relations or in any other means that causes AIDS. But due to lack of their memory, a portion of aware people ignore the indemnities and become unaware again. We assume that this transfer rate from aware class to unaware class is \(\lambda_{i} \) (\(i=1,2\)).

On the basis of the above assumptions, system (1) reduces to

$$ \begin{aligned} &\frac{dS_{-}}{dt}= \Pi-\beta S_{-} I_{-}- \alpha_{1} S_{-} M+\lambda_{1} S_{+}-d S_{-}, \\ &\frac{dS_{+}}{dt}= \alpha_{1} S_{-} M-\lambda_{1} S_{+}-d S_{+}, \\ &\frac{dI_{-}}{dt}= \beta S_{-} I_{-}-\alpha_{2} I_{-} M+ \lambda_{2} I_{+}-(d+\delta _{I}) I_{-}, \\ &\frac{dI_{+}}{dt}= \alpha_{2} I_{-} M-\lambda_{2} I_{+}-(d+ \delta_{I}) I_{+}, \\ &\frac{dM}{dt}= \eta I_{-} -\eta_{0} M \end{aligned} $$
(2)

with the initial conditions \(S_{-}(0)=S_{0-}\), \(S_{+}(0)=S_{0+}\), \(I_{-}(0)=I_{0-}\), \(I_{+}(0)=I_{0+}\), \(M(0)=M_{0}\), where \(\alpha_{i} \) (\(i=1,2\)) is the contact rate between unaware individuals and media. Awareness programs are implemented proportionally with the change of unaware infective individuals at a rate η and cut down at a rate \(\eta_{0}\) due to their ineffectiveness. Note that aware individuals die out with a lower, but very smaller rate than that of unaware individuals. So, with no loss of generality, we assume that aware susceptible and aware infected classes of people die at the same rates as unaware classes, i.e., d and \(d+\delta_{I}\), respectively. Therefore, \(N=S_{-}+S_{+}+I_{-}+I_{+}\), and using this fact, the above system (2) reduces to

$$\begin{aligned} &\frac{dN}{dt}= \Pi-d N-\delta_{I} (I_{-}+I_{+}), \\ &\frac{dS_{+}}{dt}= \alpha_{1} (N-S_{+}-I_{-}-I_{+}) M- \lambda_{1} S_{+}-d S_{+}, \\ &\frac{dI_{-}}{dt}= \beta(N-S_{+}-I_{-}-I_{+}) I_{-}-\alpha_{2} I_{-} M + \lambda_{2} I_{+}-(d+\delta_{I}) I_{-}, \\ &\frac{dI_{+}}{dt}= \alpha_{2} I_{-} M-\lambda_{2} I_{+}-(d+ \delta_{I}) I_{+}, \\ &\frac{dM}{dt}= \eta I_{-}-\eta_{0} M. \end{aligned}$$
(3)

Since the model monitors human populations and media coverage, it is assumed that all the state variables are non-negative at time \(t = 0\). All parameters of the model are assumed to be non-negative. It then follows from the differential equations that the variables are non-negative for all \(t \geq0\). In the absence of HIV infection, \(N \rightarrow\frac{\Pi}{d}\) as \(t \rightarrow\infty\), provided \(N(0) < \frac{\Pi}{d}\). Thus, the following feasible region

$$\mathcal{D} = \biggl\{ (N, S_{+}, I_{-}, I_{+}, M) \in\Re^{5}_{+} : S_{+}\leq N\leq \frac{\Pi}{d}; 0\leq I_{-}, I_{+}; 0\leq M \leq\frac{\eta\Pi}{\eta_{0} d} \biggr\} $$

is positively invariant. It is therefore sufficient to consider solutions in \(\mathcal{D}\). In this region, the usual existence, uniqueness and continuation results hold for the system.

3 Equilibrium points and stability analysis

In order to obtain equilibrium points, we set all equations of (3) equal to zero as

$$\begin{aligned} &\Pi-d N-\delta_{I} (I_{-}+I_{+})=0, \end{aligned}$$
(4)
$$\begin{aligned} &\alpha_{1} (N-S_{+}-I_{-}-I_{+}) M-\lambda_{1} S_{+}-d S_{+}=0, \end{aligned}$$
(5)
$$\begin{aligned} &\beta(N-S_{+}-I_{-}-I_{+}) I_{-}-\alpha_{2} I_{-} M+\lambda_{2} I_{+}-(d+\delta_{I})I_{-}=0, \end{aligned}$$
(6)
$$\begin{aligned} &\alpha_{2} I_{-} M-\lambda_{2} I_{+}-(d+\delta_{I}) I_{+}=0, \end{aligned}$$
(7)
$$\begin{aligned} &\eta I_{-}-\eta_{0} M=0. \end{aligned}$$
(8)

System (3) has two non-negative equilibria.

3.1 Disease-free equilibrium point

The disease-free equilibrium always exists and is of the form

$$E^{0}\bigl(N^{0},S^{0}_{+}, I^{0}_{-}, I^{0}_{+}, M^{0}\bigr)=\biggl(\frac{\Pi}{d},0,0,0,0 \biggr). $$

The linearization of the second, third, fourth and fifth equations of model (3) at the disease-free state \(E^{0}\) can be rewritten in the following form:

$$\frac{dX}{dt}=(F-V) X, $$

where

$$\begin{aligned}& X=[S_{+},I_{-},I_{+},M]^{T}, \\& F= \begin{bmatrix} 0 & 0 & 0 & \frac{\alpha_{1} \Pi}{d} \\ 0 & \frac{\beta\Pi}{d} & \lambda_{2} & 0 \\ 0 & 0 & 0 & 0\\ 0 & \eta& 0 & 0 \end{bmatrix}, \\& V= \begin{bmatrix} \lambda_{1}+d & 0 & 0 & 0 \\ 0 & d+\delta_{I} & 0 & 0 \\ 0 & 0 & \lambda_{2}+d+\delta_{I} & 0\\ 0 & 0 & 0 & \eta_{0} \end{bmatrix}. \end{aligned}$$

A threshold criterion \(R_{0}\), which is referred to as basic reproductive number, can be derived using the spectral radius of the next-generation matrix [19]. Therefore, to find \(R_{0}\), we must find the largest eigenvalue of \(FV^{-1}\). Thus,

$$R_{0}= \max_{|\kappa|} \det\bigl(\kappa I-FV^{-1}\bigr) =\max_{|\kappa|} \det \begin{bmatrix} \kappa& 0 & 0 & \frac{\alpha_{1} \Pi}{\eta_{0} d} \\ 0 & (\kappa-\frac{\beta\Pi}{d (d+\delta_{I})}) & \frac{\lambda_{2}}{\lambda _{2}+d+\delta_{I}} & 0 \\ 0 & 0 & \kappa& 0\\ 0 & \frac{\eta}{d+\delta_{I}} & 0 & \kappa \end{bmatrix}, $$

where I is a \(4\times4\) identity matrix.

The characteristic equation of \(FV^{-1}\) is

$$\begin{aligned} \kappa^{3} \biggl(\kappa-\frac{\Pi\beta}{d (d+\delta_{I})}\biggr)=0. \end{aligned}$$
(9)

Thus,

$$\begin{aligned} R_{0}=\frac{\Pi\beta}{d (d+\delta_{I})}. \end{aligned}$$
(10)

The disease-free equilibrium point \(E^{0}\) always exists, and it is locally asymptotically stable when \(R_{0}<1\) and unstable when \(R_{0}>1\) [20]. We use the threshold \(R_{0}\) to answer the question of whether the infection can be established. When \(R_{0}>1\), HIV infection can take hold. Otherwise the infection will be eliminated.

3.2 Endemic equilibrium point

The endemic equilibrium for system (3) should be in the form \(E^{*}(N^{*}, S^{*}_{+}, I^{*}_{-}, I^{*}_{+}, M^{*})\), in which

$$\begin{aligned}& S^{*}_{+}=\frac{\alpha_{1} \eta(d+\delta_{I})}{\eta_{0} \beta(\lambda_{1}+d)} I^{*}_{-}+\frac{\alpha_{1} \eta^{2} \alpha_{2} (d+\delta_{I})}{\eta^{2}_{0} \beta (\lambda_{1}+d)(\lambda_{2}+d+\delta_{I})} {I^{*}_{-}}^{2}, \\& I^{*}_{+}=\frac{\alpha_{2} \eta}{\eta_{0}(\lambda_{2}+d+\delta_{I})} {I^{*}_{-}}^{2}, \\& M^{*}=\frac{\eta}{\eta_{0}} I^{*}_{-}. \end{aligned}$$

\(I^{*}_{-}\) is a positive, real root of the quadratic equation

$$\begin{aligned} A_{1} {I^{*}_{-}}^{2}+A_{2} I^{*}_{-}+A_{3}=0, \end{aligned}$$
(11)

where

$$\begin{aligned}& A_{1}=\frac{\eta\alpha_{2} (d+\delta_{I})}{\eta_{0} (\lambda_{2}+d+\delta_{I})} \biggl[\frac{1}{d}+\frac{\alpha_{1} \eta}{\eta_{0} \beta(\lambda_{1}+d)} \biggr], \\& A_{2}=(d+\delta_{I}) \biggl[\frac{1}{d}+ \frac{\alpha_{1} \eta}{\eta_{0} \beta (\lambda_{1}+d)} \biggl(1+\frac{\alpha_{2} (\lambda_{1}+d)}{\alpha_{1} (\lambda _{2}+d+\delta_{I})} \biggr) \biggr], \\& A_{3}= \biggl[ \frac{(d+\delta_{I})}{\beta}-\frac{\Pi}{d} \biggr]. \end{aligned}$$

It is clear that if \(A_{3}<0\), then one of the roots of equation (11) will be positive. Also, it is interesting to note that if \(A_{3}<0\) then \(R_{0}\) becomes greater than unity, i.e., \(R_{0}>1\).

Thus, we can conclude the following theorem.

Theorem 3.1

The disease-free equilibrium (DFE) \(E^{0}\) exists and is locally asymptotically stable when \(R_{0}=\frac{\Pi\beta}{d (d+\delta_{I})}<1\). Whenever \(R_{0}=\frac{\Pi\beta}{d (d+\delta_{I})}>1\), \(E^{0}\) becomes unstable and the endemic equilibrium point \(E^{*}\) exists.

4 Optimal control strategy

Optimal control is useful for controlling an epidemiological system. Generally, we solve these types of problems by finding the time-dependent profiles of the control variable to optimize a particular performance. Our target is to maximize the aware susceptible (or aware infected) population by controlling the interaction between unaware susceptible (or unaware infected) through media. Thus, the optimal control problem, where the state system is given by

$$\begin{aligned} &\frac{dS_{-}}{dt}= \Pi-\beta S_{-} I_{-}-\alpha_{1} \bigl(1-u_{1}(t)\bigr) S_{-} M+\lambda _{1} S_{+}-d S_{-}, \\ &\frac{dS_{+}}{dt}= \alpha_{1} \bigl(1-u_{1}(t)\bigr) S_{-} M-\lambda_{1} S_{+}-d S_{+}, \\ &\frac{dI_{-}}{dt}= \beta S_{-} I_{-}-\alpha_{2} u_{2}(t) I_{-} M+ \lambda_{2}I_{+}-(d+\delta_{I}) I_{-}, \\ &\frac{dI_{+}}{dt}= \alpha_{2} u_{2}(t) I_{-} M- \lambda_{2} I_{+}-(d+\delta_{I}) I_{+}, \\ &\frac{dM}{dt}= \eta I_{-}-\eta_{0} M. \end{aligned}$$
(12)

Here, \(u_{1}(t)\) and \(u_{2}(t)\) represent the control parameters for the interactions between \(S_{-}\) with M and \(I_{-}\) with M, respectively, by invoking awareness among people through media.

Also it is our objective to keep the cost function, measured in terms of time, as low as possible. Thus, the objective function to be minimized is

$$\begin{aligned} J\bigl[u_{1}(t),u_{2}(t)\bigr]=\int ^{t_{f}}_{t_{0}}\bigl[P u_{1}^{2}(t)+Q u_{2}^{2}(t)- S_{+}^{2}(t)- I_{+}^{2}(t) \bigr]\,dt. \end{aligned}$$
(13)

The parameters \(P>0\) and \(Q>0\) are dimensionless weight functions on the benefit of the cost. These are the costs of per media campaigning for unaware susceptible and unaware infected populations, respectively, per unit time.

In this problem, we are seeking the optimal control pair \((u_{1}^{*},u_{2}^{*})\) such that

$$J\bigl(u_{1}^{*},u_{2}^{*}\bigr)=\min\bigl\{ J(u_{1},u_{2}): (u_{1},u_{2})\in U\bigr\} , $$

where \(U=\{(u_{1}, u_{2}): u_{i} \mbox{ measurable}, 0 \leq u_{i}(t) \leq1, t \in[t_{0}, t_{f} ]\mbox{ for }i=1,2\}\).

To determine the optimal control \(u_{1}^{*}\) and \(u_{2}^{*}\), we use the Pontryagin minimum principle [21]. To solve the problem, we use the Hamiltonian [22, 23] given by

$$\begin{aligned} H =&P u_{1}^{2}(t)+Q u_{2}^{2}(t)- S_{+}^{2}- I_{+}^{2} \\ &{}+\xi_{1}\bigl\{ \Pi-\beta S_{-} I_{-}-\alpha_{1} \bigl(1-u_{1}(t)\bigr) S_{-} M+\lambda_{1} S_{+}-d S_{-}\bigr\} \\ &{}+\xi_{2}\bigl\{ \alpha_{1} \bigl(1-u_{1}(t) \bigr) S_{-} M-\lambda_{1} S_{+}-d S_{+}\bigr\} \\ &{}+\xi_{3}\bigl\{ \beta S_{-} I_{-}-\alpha_{2} u_{2}(t) I_{-} M+\lambda_{2} I_{+}-(d+\delta _{I}) I_{-} \bigr\} \\ &{}+\xi_{4}\bigl\{ \alpha_{2} u_{2}(t) I_{-} M- \lambda_{2} I_{+}-(d+\delta_{I}) I_{+}\bigr\} \\ &{}+\xi_{5}\{\eta I_{-}-\eta_{0} M\}, \end{aligned}$$
(14)

where \(\xi_{i} \) (\(i=1,2,3,4,5\)) are the adjoint variables.

By using the Pontryagin minimum principle and for the existence condition of the optimal control theory [24], we obtain the following theorem.

Theorem 4.1

If the objective cost function \(J(u_{1},u_{2})\) over U attains its minimum for the optimal control \(u^{*}=(u_{1}^{*},u_{2}^{*})\) corresponding to the endemic equilibrium \((S_{-}^{*}, S^{*}_{+}, I^{*}_{-}, I^{*}_{+}, M^{*})\), then there exist adjoint functions \(\xi_{1}\), \(\xi_{2}\), \(\xi_{3}\), \(\xi_{4}\), \(\xi_{5}\) satisfying the equations

$$\begin{aligned} &\frac{d\xi_{1}}{dt}= \xi_{1} \bigl\{ \beta I_{-}+ \alpha_{1} \bigl(1-u_{1}(t)\bigr) M+d\bigr\} - \xi_{2} \alpha_{1} \bigl(1-u_{1}(t)\bigr) M- \xi_{3} \beta I_{-}, \\ &\frac{d\xi_{2}}{dt}= 2 S_{+} - \xi_{1} \lambda_{1}+ \xi_{2} (\lambda_{1}+d), \\ &\frac{d\xi_{3}}{dt}= \xi_{1} \beta S_{-}-\xi_{3}\bigl\{ \beta S_{-}-\alpha_{2} u_{2}(t) M-(d+\delta_{I})\bigr\} - \xi_{4} \alpha_{2} u_{2}(t) M-\xi_{5} \eta, \\ &\frac{d\xi_{4}}{dt}= 2 I_{+}-\xi_{3} \lambda_{2}+ \xi_{4}(\lambda_{2}+d+\delta _{I}), \\ &\frac{d\xi_{5}}{dt}= \xi_{1} \alpha_{1} \bigl(1-u_{1}(t)\bigr) S_{-}-\xi_{2} \alpha_{1} \bigl(1-u_{1}(t)\bigr) S_{-}+\xi_{3} \alpha_{2} u_{2}(t) I_{-}-\xi_{4} \alpha_{2} u_{2}(t) I_{-} +\xi_{5} \eta_{0}, \end{aligned}$$
(15)

along with the transversality condition \(\xi_{i}(t_{f})=0\) (\(i=1,\ldots,5\)).

Proof

According to the Pontryagin minimum principle, the unconstrained optimal control variables \(u_{1}^{*}\) and \(u_{2}^{*}\) satisfy

$$\begin{aligned} \frac{\partial H}{\partial u^{*}_{1}}=\frac{\partial H}{\partial u^{*}_{2}}=0. \end{aligned}$$
(16)

Equation (14) can be rewritten as

$$\begin{aligned} H =&\bigl[P u_{1}^{2}(t)-\xi_{1} \alpha_{1} \bigl(1-u_{1}(t)\bigr) S_{-} M+\xi_{2} \alpha_{1} \bigl(1-u_{1}(t)\bigr) S_{-} M\bigr]+\bigl[Q u_{2}^{2}(t) \\ &{}-\xi_{3} \alpha_{2} u_{2}(t) I_{-} M+ \xi_{4} \alpha_{2} u_{2}(t) I_{-} M\bigr]+ \mbox{other terms without }u_{1}\mbox{ and }u_{2}. \end{aligned}$$

We can obtain from (16)

$$\begin{aligned} &\frac{\partial H}{\partial u^{*}_{1}}=2 P u^{*}_{1}(t)+\xi_{1} \alpha_{1} S_{-} M-\xi_{2} \alpha_{1} S_{-} M=0, \end{aligned}$$
(17)
$$\begin{aligned} &\frac{\partial H}{\partial u^{*}_{2}}=2 Q u^{*}_{2}(t)-\xi_{3} \alpha_{2} I_{-} M+\xi_{4} \alpha_{2} I_{-} M=0. \end{aligned}$$
(18)

We have also obtained \(u^{*}_{1}\) and \(u^{*}_{2}\) in the following form:

$$\begin{aligned} &u^{*}_{1}(t)= \frac{ \alpha_{1} ( \xi_{2} - \xi_{1} ) S_{-} M}{2 P}, \end{aligned}$$
(19)
$$\begin{aligned} &u^{*}_{2}(t)= - \frac{ \alpha_{2} ( \xi_{4} - \xi_{3} ) I_{-} M}{2 Q}. \end{aligned}$$
(20)

Since the standard control is bounded, we conclude for the control \(u_{1}\) that

$$\begin{aligned} u^{*}_{1}(t)= \left \{ \textstyle\begin{array}{@{}l@{\quad}l} 0 & \mbox{if } \frac{ \alpha_{1} ( \xi_{2} - \xi_{1} ) S_{-} M}{2 P}\leq0;\\ \frac{ \alpha_{1} (\xi_{2} - \xi_{1}) S_{-} M}{2 P} & \mbox{if } 0< \frac{ \alpha _{1} ( \xi_{2} - \xi_{1} ) S_{-} M}{2 P}< 1; \\ 1 & \mbox{if } \frac{ \alpha_{1} (\xi_{2} - \xi_{1}) S_{-} M}{2 P}\geq1. \end{array}\displaystyle \displaystyle \displaystyle \right . \end{aligned}$$
(21)

Hence the compact form of \(u^{*}_{1}\) is

$$\begin{aligned} u^{*}_{1}=\max \biggl[\min \biggl[1, \frac{\alpha_{1} (\xi_{2} - \xi_{1}) S_{-} M}{2 P} \biggr],0 \biggr]. \end{aligned}$$
(22)

In a similar manner we can get the compact form of \(u^{*}_{2}\)

$$\begin{aligned} u^{*}_{2}=\max \biggl[\min \biggl[1, -\frac{\alpha_{2} (\xi_{4} - \xi_{3}) I_{-} M}{2 Q} \biggr],0 \biggr]. \end{aligned}$$
(23)

According to the Pontryagin minimum principle, it can be written as follows:

$$\begin{aligned} \frac{d\xi_{i}}{dt}=-\frac{\partial H}{\partial x_{i}} \quad \mbox{for } i=1,2,3,4,5, \end{aligned}$$
(24)

where \(x_{i}\equiv(S_{-}, S_{+}, I_{-}, I_{+}, M)\).

The necessary conditions satisfying the optimal control \(u^{*}(t)\) are

$$\begin{aligned} H\bigl(x_{i}(t),u^{*}(t),\xi_{i}(t),t\bigr)= \min_{u\in U}H\bigl(x_{i}(t),u(t),\xi_{i}(t),t \bigr). \end{aligned}$$
(25)

Using (24), we can get the adjoint system (15) corresponding to systems (12) and (13). The optimal system consists of the state system (12), satisfying the initial condition \(S_{-}(0)=S_{0-}\), \(S_{+}(0)=S_{0+}\), \(I_{-}(0)=I_{0-}\), \(I_{+}(0)=I_{0+}\), \(M(0)=M_{0}\), and the adjoint system (15), satisfying transversality condition \(\xi_{i}(t_{f})=0\) for \(i=1,2,3,4,5\). □

4.1 Uniqueness of the optimal control

To prove the uniqueness of the optimality system, we use a simple lemma for the small time interval.

Lemma 1

[25]

The function \(u^{*}(s)=\min(\max (s,a),b)\) is Lipschitz continuous in s, where \(a< b\) are some fixed positive constants.

Theorem 4.2

The solution of the optimal system is unique for sufficiently small \([t_{0},t_{f}]\).

Proof

Suppose that \((S_{-}, S_{+}, I_{-}, I_{+}, M, \xi_{1}, \xi_{2}, \xi_{3},\xi_{4}, \xi_{5})\) and \((\bar{S}_{-},\bar{S}_{+}, \bar{I}_{-},\bar{I}_{+},\bar{M},\bar{\xi}_{1},\bar{\xi}_{2},\bar{\xi}_{3}, \bar{\xi}_{4},\bar{\xi}_{5})\) are two solutions of systems (12) and (15) such that for \(\theta>0\), \(S_{-}=e^{\theta t}p_{1}\), \(S_{+}=e^{\theta t}p_{2}\), \(I_{-}=e^{\theta t}p_{3}\), \(I_{+}=e^{\theta t}p_{4}\), \(M=e^{\theta t}p_{5}\), \(\xi_{1}=e^{-\theta t}q_{1}\), \(\xi_{2}=e^{-\theta t}q_{2}\), \(\xi_{3}=e^{-\theta t}q_{3}\), \(\xi_{4}=e^{-\theta t}q_{4}\), \(\xi_{5}=e^{-\theta t}q_{5}\) and \(\bar{S}_{-}=e^{\theta t}\bar{p}_{1}\), \(\bar{S}_{+}=e^{\theta t}\bar{p}_{2}\), \(\bar{I}_{-}=e^{\theta t}\bar{p}_{3}\), \(\bar{I}_{+}=e^{\theta t}\bar{p}_{4}\), \(\bar{M}=e^{\theta t}\bar{p}_{5}\), \(\bar{\xi}_{1}=e^{-\theta t}\bar{q}_{1}\), \(\bar{\xi}_{2}=e^{-\theta t}\bar{q}_{2}\), \(\bar{\xi}_{3}=e^{-\theta t}\bar{q}_{3}\), \(\bar{\xi}_{4}=e^{-\theta t}\bar{q}_{4}\), \(\bar{\xi}_{5}=e^{-\theta t}\bar{q}_{5}\).

Again, let

$$\begin{aligned}& u^{*}_{1}= \max\biggl[\min\biggl[1,\frac{ \alpha_{1} (q_{2} - q_{1}) p_{1} p_{5} e^{\theta t}}{2 P}\biggr],0\biggr], \\& u^{*}_{2}= \max\biggl[\min\biggl[1, - \frac{ \alpha_{2} (q_{4} - q_{3}) p_{3} p_{5} e^{\theta t} }{2 Q}\biggr],0\biggr] \end{aligned}$$

and

$$\begin{aligned}& \bar{u}^{*}_{1}= \max\biggl[\min\biggl[1,\frac{ \alpha_{1} (\bar{q}_{2} - \bar{q}_{1}) \bar{p}_{1} \bar{p}_{5} e^{\theta t}}{2 P}\biggr],0 \biggr], \\& \bar{u}^{*}_{2}= \max\biggl[\min\biggl[1,- \frac{ \alpha_{2} (\bar{q}_{4} - \bar{q}_{3}) \bar{p}_{3} \bar{p}_{5} e^{\theta t} }{2 Q}\biggr],0 \biggr]. \end{aligned}$$

Substituting the values of \(S_{-}\), \(S_{+}\), \(I_{-}\), \(I_{+}\), M, \(\xi_{1}\), \(\xi_{2}\), \(\xi_{3}\), \(\xi_{4}\), \(\xi_{5}\), \(u^{*}_{1}\) and \(u^{*}_{2}\) in equations (12) and (15), we get

$$\begin{aligned} &\dot{p_{1}}+\theta p_{1}= \Pi e^{-\theta t} -\beta p_{1} p_{3} e^{\theta t}- \alpha_{1} \biggl(1-\frac{ (q_{2} - q_{1}) \alpha_{1} p_{1} p_{5} e^{\theta t}}{2 P}\biggr) p_{1} p_{5} e^{\theta t}+\lambda_{1} p_{2}-d p_{1}, \\ &\dot{p_{2}}+\theta p_{2}=\alpha_{1} \biggl(1- \frac{ (q_{2} - q_{1}) \alpha_{1} p_{1} p_{5} e^{\theta t}}{2 P}\biggr) p_{1} p_{5} e^{\theta t}-( \lambda_{1} +d) p_{2}, \\ &\dot{p_{3}}+\theta p_{3}=\beta p_{1} p_{3} e^{\theta t}+\alpha_{2} \frac{ (q_{4} - q_{3}) \alpha_{2} p_{3} p_{5} e^{\theta t}}{2 Q} p_{2} p_{5} e^{\theta t} + \lambda_{2} p_{4}-(d+\delta_{I}) p_{3}, \\ &\dot{p_{4}}+\theta p_{4}=-\alpha_{2} \frac{ (q_{4} - q_{3}) \alpha_{2} p_{3} p_{5} e^{\theta t}}{2 Q} p_{2} p_{5} e^{\theta t} - ( \lambda_{2}+d+\delta_{I}) p_{4}, \\ &\dot{p_{5}}+\theta p_{5}=\eta p_{3}- \eta_{0} p_{5}, \\ &\dot{q_{1}}-\theta q_{1}= q_{1} \biggl\{ \beta p_{3} e^{\theta t}+\alpha_{1} \biggl(1- \frac { \alpha_{1} (q_{2} - q_{1}) p_{1} p_{5} e^{\theta t}}{2 P}\biggr) p_{5} e^{\theta t}+d\biggr\} \\ &\hphantom{\dot{q_{1}}-\theta q_{1}=}{} -\alpha_{1} q_{2} \biggl(1- \frac{ \alpha_{1} (q_{2} - q_{1}) p_{1} p_{5} e^{\theta t}}{2 P}\biggr) p_{5} e^{\theta t}-\beta p_{3} q_{3} e^{\theta t}, \\ &\dot{q_{2}}-\theta q_{2}= 2 e^{2\theta t}p_{2} - \lambda_{1} q_{1} + (\lambda _{1}+d) q_{2}, \\ &\dot{q_{3}}-\theta q_{3}= \beta p_{1} q_{1} e^{\theta t} -q_{3}\biggl\{ \beta p_{1} e^{\theta t} +\alpha_{2} \frac{ (q_{4} - q_{3}) \alpha_{2} p_{3} p_{5}}{2 Q} p_{5} e^{2\theta t}-(d+\delta_{I})\biggr\} \\ &\hphantom{\dot{q_{3}}-\theta q_{3}=}{} +\alpha_{2} \frac{ (q_{4} - q_{3}) \alpha_{2} p_{3} p_{5}}{2 Q} p_{5} q_{4} e^{2\theta t} - \eta q_{5}, \\ &\dot{q_{4}}-\theta q_{4}= 2 p_{4} e^{2\theta t}- \lambda_{2} q_{3} +(\lambda _{2}+d+\delta_{I}) q_{4}, \\ &\dot{q_{5}}-\theta q_{5}= \alpha_{1} \biggl(1- \frac{ \alpha_{1} (q_{2} - q_{1}) p_{1} p_{5} e^{\theta t}}{2 P}\biggr) p_{1} q_{1} e^{\theta t} - \alpha_{1} \biggl(1-\frac{ \alpha_{1} (q_{2} - q_{1}) p_{1} p_{5} e^{\theta t}}{2 P}\biggr) p_{1} q_{2} e^{\theta t} \\ &\hphantom{\dot{q_{5}}-\theta q_{5}=}{} - \alpha_{2} \frac{ (q_{4} - q_{3}) \alpha_{2} p_{3} p_{5}}{2 Q} p_{3} q_{3} e^{2\theta t}+ \alpha_{2} \frac{ (q_{4} - q_{3}) \alpha_{2} p_{3} p_{5}}{2 Q} p_{3} q_{4} e^{2\theta t} + \eta_{0} q_{5}. \end{aligned}$$
(26)

The equations for \(S_{-}\) and \(\bar{S}_{-}\), \(S_{+}\) and \(\bar{S}_{+}\), \(I_{-}\) and \(\bar{I}_{-}\), \(I_{+}\) and \(\bar{I}_{+}\) and lastly M and \(\bar{M}\) are subtracted. Each equation is multiplied by an appropriate function and then integrated from \(t_{0}\) to \(t_{f}\).

$$\begin{aligned}& \int^{t_{f}}_{t_{0}}(u_{1}-\bar{u}_{1})^{2}\,dt\leq C_{1}e^{2\theta t_{f}} \int ^{t_{f}}_{t_{0}} \bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{5}-\bar{p}_{5}|^{2}+|q_{1}- \bar{q}_{1}|^{2}+|q_{2}-\bar{q}_{2}|^{2} \bigr]\,dt, \\& \int^{t_{f}}_{t_{0}}(u_{2}-\bar{u}_{2})^{2}\,dt\leq C_{2}e^{2\theta t_{f}} \int ^{t_{f}}_{t_{0}} \bigl[|p_{3}-\bar{p}_{3}|^{2}+|p_{5}-\bar{p}_{5}|^{2}+|q_{3}- \bar{q}_{3}|^{2}+|q_{4}-\bar{q}_{4}|^{2} \bigr]\,dt. \end{aligned}$$

By using the above results, we get ten integral equations as follows:

$$\begin{aligned} &\frac{1}{2}(p_{1}-\bar{p}_{1})^{2}(t_{f})+ \theta\int^{t_{f}}_{t_{0}}|p_{1}-\bar{p}_{1}|^{2}\,dt \\ &\quad\leq\beta e^{\theta t_{f}}\int^{t_{f}}_{t_{0}} \bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{3}-\bar{p}_{3}|^{2}\bigr]\,dt + \alpha_{1} e^{\theta t_{f}} \int^{t_{f}}_{t_{0}}\bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{5}-\bar{p}_{5}|^{2} \bigr]\,dt \\ &\qquad{}+ \alpha_{1} C_{1} e^{3\theta t_{f}} \int ^{t_{f}}_{t_{0}}\bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{5}-\bar{p}_{5}|^{2}+|q_{1}- \bar{q}_{1}|^{2}+|q_{2}-\bar{q}_{2}|^{2} \bigr]\,dt \\ &\qquad{}+ \lambda _{1} \int^{t_{f}}_{t_{0}} \bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{2}-\bar{p}_{2}|^{2}\bigr]\,dt + d \int^{t_{f}}_{t_{0}}|p_{1}- \bar{p}_{1}|^{2}\,dt, \\ &\frac{1}{2}(p_{2}-\bar{p}_{2})^{2}(t_{f})+ \theta\int^{t_{f}}_{t_{0}}|p_{2}-\bar{p}_{2}|^{2}\,dt \\ &\quad\leq\alpha_{1} e^{\theta t_{f}} \int^{t_{f}}_{t_{0}} \bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{2}-\bar{p}_{2}|^{2}+|p_{5}-\bar{p}_{5}|^{2} \bigr]\,dt \\ &\qquad{}+ \alpha_{1} C_{1} e^{3\theta t_{f}} \int ^{t_{f}}_{t_{0}}\bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{2}-\bar{p}_{2}|^{2}+|p_{5}- \bar{p}_{5}|^{2}+|q_{1}-\bar{q}_{1}|^{2}+|q_{2}- \bar{q}_{2}|^{2}\bigr]\,dt \\ &\qquad{}+ (\lambda_{1} +d) \int ^{t_{f}}_{t_{0}}|p_{2}-\bar{p}_{2}|^{2}\,dt, \\ &\frac{1}{2}(p_{3}-\bar{p}_{3})^{2}(t_{f})+ \theta\int^{t_{f}}_{t_{0}}|p_{3}-\bar{p}_{3}|^{2}\,dt \\ &\quad\leq\beta e^{\theta t_{f}}\int^{t_{f}}_{t_{0}} \bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{3}-\bar{p}_{3}|^{2}\bigr]\,dt \\ &\qquad{}+ \alpha_{2} C_{2} e^{3\theta t_{f}} \int^{t_{f}}_{t_{0}}\bigl[|p_{2}- \bar{p}_{2}|^{2}+|p_{3}-\bar{p}_{3}|^{2}+|p_{5}- \bar{p}_{5}|^{2}+|q_{3}-\bar{q}_{3}|^{2}+|q_{4}- \bar{q}_{4}|^{2}\bigr]\,dt \\ &\qquad{}+ \lambda _{2} \int ^{t_{f}}_{t_{0}}\bigl[|p_{3}-\bar{p}_{3}|^{2}+|p_{4}-\bar{p}_{4}|^{2} \bigr]\,dt + (d+\delta_{I}) \int^{t_{f}}_{t_{0}}|p_{3}- \bar{p}_{3}|^{2}\,dt, \\ &\frac{1}{2}(p_{4}-\bar{p}_{4})^{2}(t_{f})+ \theta\int^{t_{f}}_{t_{0}}|p_{4}-\bar{p}_{4}|^{2}\,dt \\ &\quad\leq\alpha_{2} C_{2} e^{3\theta t_{f}} \int ^{t_{f}}_{t_{0}}\bigl[|p_{2}-\bar{p}_{2}|^{2}+|p_{3}-\bar{p}_{3}|^{2}+|p_{5}- \bar{p}_{5}|^{2}+|q_{3}-\bar{q}_{3}|^{2}+|q_{4}- \bar{q}_{4}|^{2}\bigr]\,dt \\ &\qquad{}+ (\lambda_{2}+d+ \delta_{I}) \int^{t_{f}}_{t_{0}}|p_{4}- \bar{p}_{4}|^{2}\,dt, \\ &\frac{1}{2}(p_{5}-\bar{p}_{5})^{2}(t_{f})+ \theta\int^{t_{f}}_{t_{0}}|p_{5}-\bar{p}_{5}|^{2}\,dt \\ &\quad\leq\eta\int^{t_{f}}_{t_{0}}\bigl[|p_{3}- \bar{p}_{3}|^{2}+|p_{5}-\bar{p}_{5}|^{2} \bigr]\,dt + \eta_{0} \int^{t_{f}}_{t_{0}} |p_{5}-\bar{p}_{5}|^{2}\,dt, \\ &\frac{1}{2}(q_{1}-\bar{q}_{1})^{2}(t_{0})+ \theta\int^{t_{f}}_{t_{0}}|q_{1}-\bar{q}_{1}|^{2}\,dt \\ &\quad\leq\beta e^{\theta t_{f}} \int^{t_{f}}_{t_{0}} \bigl[|p_{3}-\bar{p}_{3}|^{2}+|q_{1}-\bar{q}_{1}|^{2}\bigr]\,dt + \alpha_{1} e^{\theta t_{f}} \int^{t_{f}}_{t_{0}}\bigl[|p_{5}-\bar{p}_{5}|^{2}+|q_{1}-\bar{q}_{1}|^{2} \bigr]\,dt \\ &\qquad{}+ \alpha_{1} C_{3} e^{3\theta t_{f}} \int ^{t_{f}}_{t_{0}}\bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{5}-\bar{p}_{5}|^{2}+|q_{1}- \bar{q}_{1}|^{2}+|q_{2}-\bar{q}_{2}|^{2} \bigr]\,dt \\ &\qquad{}+ d \int^{t_{f}}_{t_{0}} |q_{1}-\bar{q}_{1}|^{2}\,dt +C_{4} e^{\theta t_{f}} \int ^{t_{f}}_{t_{0}}\bigl[|p_{5}-\bar{p}_{5}|^{2}+|q_{1}-\bar{q}_{1}|^{2}+|q_{2}- \bar{q}_{2}|^{2}\bigr]\,dt \\ &\qquad{}+ \beta e^{3\theta t_{f}} \int ^{t_{f}}_{t_{0}}\bigl[|p_{3}-\bar{p}_{3}|^{2}+|q_{1}-\bar{q}_{1}|^{2}+|q_{3}- \bar{q}_{3}|^{2}\bigr]\,dt, \\ &\frac{1}{2}(q_{2}-\bar{q}_{2})^{2}(t_{0})+ \theta\int^{t_{f}}_{t_{0}}|q_{2}-\bar{q}_{2}|^{2}\,dt \\ &\quad\leq2 e^{2 \theta t_{f}} \int^{t_{f}}_{t_{0}} \bigl[|p_{2}-\bar{p}_{2}|^{2}+|q_{2}-\bar{q}_{2}|^{2}\bigr]\,dt + \lambda_{1} \int ^{t_{f}}_{t_{0}}\bigl[|q_{1}-\bar{q}_{1}|^{2}+|q_{2}-\bar{q}_{2}|^{2} \bigr]\,dt \\ &\qquad{}+ (\lambda _{2}+d) \int^{t_{f}}_{t_{0}}|q_{2}- \bar{q}_{2}|^{2}\,dt, \\ &\frac{1}{2}(q_{3}-\bar{q}_{3})^{2}(t_{0})+ \theta\int^{t_{f}}_{t_{0}}|q_{3}-\bar{q}_{3}|^{2}\,dt \\ &\quad\leq\beta C_{5} e^{\theta t_{f}} \int^{t_{f}}_{t_{0}} \bigl[|p_{1}-\bar{p}_{1}|^{2}+|q_{1}- \bar{q}_{1}|^{2}+|q_{3}-\bar{q}_{3}|^{2} \bigr]\,dt \\ &\qquad{}+ \alpha_{2} C_{6} e^{3 \theta t_{f}}\int ^{t_{f}}_{t_{0}}\bigl[|p_{3}-\bar{p}_{3}|^{2}+|p_{5}-\bar{p}_{5}|^{2}+|q_{3}- \bar{q}_{3}|^{2}+|q_{4}-\bar{q}_{4}|^{2} \bigr]\,dt \\ &\qquad{}+ (d+\delta_{I}) \int^{t_{f}}_{t_{0}} |q_{3}-\bar{q}_{3}|^{2}\,dt + \eta\int ^{t_{f}}_{t_{0}}\bigl[|q_{3}-\bar{q}_{3}|^{2}+|q_{5}-\bar{q}_{5}|^{2} \bigr]\,dt, \\ &\frac{1}{2}(q_{4}-\bar{q}_{4})^{2}(t_{0})+ \theta\int^{t_{f}}_{t_{0}}|q_{4}-\bar{q}_{4}|^{2}\,dt \\ &\quad\leq2 e^{2\theta t_{f}}\int^{t_{f}}_{t_{0}} \bigl[|p_{4}-\bar{p}_{4}|^{2} + |q_{4}- \bar{q}_{4}|^{2}\bigr]\,dt + \lambda_{2} \int ^{t_{f}}_{t_{0}} \bigl[|q_{3}-\bar{q}_{3}|^{2}+|q_{4}-\bar{q}_{4}|^{2} \bigr]\,dt \\ &\qquad{}+ (\lambda _{2}+d+\delta_{I}) \int ^{t_{f}}_{t_{0}} |q_{4}-\bar{q}_{4}|^{2}\,dt, \\ &\frac{1}{2}(q_{5}-\bar{q}_{5})^{2}(t_{0})+ \theta\int^{t_{f}}_{t_{0}}|q_{5}-\bar{q}_{5}|^{2}\,dt \\ &\quad\leq\alpha_{1} e^{\theta t_{f}} \int^{t_{f}}_{t_{0}}\bigl[|p_{1}- \bar{p}_{1}|^{2}+|q_{1}-\bar{q}_{1}|^{2}+|q_{5}- \bar{q}_{5}|^{2}\bigr]\,dt \\ &\qquad{}+ C_{7} \alpha_{1} e^{3\theta t_{f}} \int^{t_{f}}_{t_{0}}\bigl[|p_{1}- \bar{p}_{1}|^{2}+|p_{5}-\bar{p}_{5}|^{2}+|q_{1}- \bar{q}_{1}|^{2}+|q_{2}-\bar{q}_{2}|^{2} + |q_{5}-\bar{q}_{5}|^{2}\bigr]\,dt\\ &\qquad{} + \alpha_{1} e^{\theta t_{f}} \int^{t_{f}}_{t_{0}} \bigl[|p_{1}-\bar{p}_{1}|^{2}+|q_{2}-\bar{q}_{2}|^{2}+|q_{5}-\bar{q}_{5}|^{2} \bigr]\,dt \\ &\qquad{}+ C_{8} \alpha_{2} e^{3\theta t_{f}} \int ^{t_{f}}_{t_{0}}\bigl[|p_{3}-\bar{p}_{3}|^{2}+|p_{5}-\bar{p}_{5}|^{2}+|q_{3}- \bar{q}_{3}|^{2}+|q_{4}-\bar{q}_{4}|^{2}+|q_{5}- \bar{q}_{5}|^{2}\bigr]\,dt \\ &\qquad{}+ \eta_{0} \int ^{t_{f}}_{t_{0}}|q_{5}-\bar{q}_{5}|^{2}\,dt. \end{aligned}$$

From the above expressions, all these ten integrals are added and estimated to obtain the result. After combination, we get

$$\begin{aligned} &\frac{1}{2}\bigl[(p_{1}-\bar{p}_{1})^{2}(t_{f})+(p_{2}-\bar{p}_{2})^{2}(t_{f})+(p_{3}-\bar{p}_{3})^{2}(t_{f})+(p_{4}-\bar{p}_{4})^{2}(t_{f})+(p_{5}-\bar{p}_{5})^{2}(t_{f}) \\ &\qquad{}+(q_{1}-\bar{q}_{1})^{2}(t_{0})+(q_{2}- \bar{q}_{2})^{2}(t_{0})+(q_{3}-\bar{q}_{3})^{2}(t_{0})+(q_{4}-\bar{q}_{4})^{2}(t_{0})+(q_{5}-\bar{q}_{5})^{2}(t_{0})\bigr] \\ &\qquad{}+\theta\int^{t_{f}}_{t_{0}}\bigl[|p_{1}- \bar{p}_{1}|^{2}+|p_{2}-\bar{p}_{2}|^{2}+|p_{3}- \bar{p}_{3}|^{2}+|p_{4}-\bar{p}_{4}|^{2}+|p_{5}- \bar{p}_{5}|^{2} \\ &\qquad{}+|q_{1}-\bar{q}_{1}|^{2}+|q_{2}- \bar{q}_{2}|^{2}+|q_{3}-\bar{q}_{3}|^{2}+|q_{4}- \bar{q}_{4}|^{2}+|q_{5}-\bar{q}_{5}|^{2} \bigr]\,dt \\ &\quad\leq\bigl(\tilde{K}+\tilde{K_{1}} e^{3 \theta t_{f}}\bigr) \int ^{t_{f}}_{t_{0}}\bigl[|p_{1}-\bar{p}_{1}|^{2}+|p_{2}-\bar{p}_{2}|^{2}+|p_{3}- \bar{p}_{3}|^{2}+|p_{4}-\bar{p}_{4}|^{2} + |p_{5}-\bar{p}_{5}|^{2} \\ &\qquad{}+|q_{1}-\bar{q}_{1}|^{2}+|q_{2}- \bar{q}_{2}|^{2}+|q_{3}-\bar{q}_{3}|^{2} + |q_{4}-\bar{q}_{4}|^{2}+|q_{5}-\bar{q}_{5}|^{2}\bigr]\,dt < 0. \end{aligned}$$
(27)

Here \(\tilde{K}\) and \(\tilde{K_{1}}\) depend on the model parameters and constant terms.

We choose θ such that \(\theta>\tilde{K}+\tilde{K_{1}}\) and \(t_{f}<\frac{1}{3 \theta}\ln(\frac{\theta-\tilde{K}}{\tilde{K_{1}}})\), then

$$ \begin{aligned} &p_{1}=\bar{p_{1}},\qquad p_{2}=\bar{p_{2}},\qquad p_{3}= \bar{p_{3}},\qquad p_{4}=\bar{p_{4}},\qquad p_{5}=\bar {p_{5}}, \\ &q_{1}=\bar{q_{1}},\qquad q_{2}= \bar{q_{2}},\qquad q_{3}=\bar{q_{3}},\qquad q_{4}=\bar{q_{4}},\qquad q_{5}= \bar{q_{5}}. \end{aligned} $$
(28)

Thus, the system has a unique optimal solution for a small time interval. If the state equation has the initial condition and the adjoint equation has the final time condition, then the optimal controls \(u^{*}_{1}\) and \(u^{*}_{1}\) give a unique and optimal control strategy for the density of awareness programs driven by the media in the region under consideration. □

5 Numerical simulation

To study the dynamical behavior of model (2), we perform numerical computations with initial values \(S_{-}(0)=150\), \(S_{+}(0)=15\), \(I_{-}(0)=20\), \(I_{+}(0)=5\) and \(M(0)=8\). The set of parameter values is given in Table 1. These values are collected from different peer reviewed international journals, and the rest are hypothetical parameters relevant to HIV/AIDS. This set of parameter values is constant throughout the numerical experiments except the values of \(\lambda_{1}\) and \(\lambda_{2}\). Numerical simulations are done using MATLAB (version 7.6.0).

Table 1 List of parameters used for system ( 2 )

Trajectories for all populations and for cumulative awareness programs are drawn in Figure 1, which shows the changes in their behavior at any time t. From this figure, we can see that aware populations start increasing in nature from their initial values and other populations approach their equilibrium values. The non-linear stabilities of \((S_{-}^{*},I^{*}_{-})\) in \(S_{-}^{*}-I^{*}_{-}\) plane and \((S_{+}^{*},I^{*}_{+})\) in \(S_{+}^{*}-I^{*}_{+}\) plane are shown in Figure 2 and Figure 3, respectively. We can see that all trajectories initiating inside the region of attraction approach towards the equilibrium values \((S_{-}^{*},I^{*}_{-})\) and \((S_{+}^{*},I^{*}_{+})\). It is also worthy to mention that the rate of transfer from aware class to unaware class, i.e., \(\lambda_{1}\) and \(\lambda_{2}\), play an important role in the system. In Figure 4 and Figure 5, trajectories are drawn for different values of \(\lambda_{1}\) and \(\lambda_{2}\) (\(\lambda_{1}=0.003,0.005\) and \(\lambda _{2}=0.01,0.05\)), respectively, and it is found that value changes in \(\lambda_{1}\) and \(\lambda_{2}\) alter different population behavioral changes. Despite constant awareness being transferred to masses via media campaign, some fractions of the aware populations neglect the risks associated with the unaware susceptible and the infective, but try to be bold by their own beliefs and eventually cater to the infective class through awareness failures. This fraction is always a threat to the success rate of any mass awareness program and the disease persists as endemic. To test the effectiveness of the media awareness potential, we run our simulation experimentation considering a real life scenario, where we deliberately consider the infection rate at a higher prevalence (\(0.002<\beta<0.007\)) and compare increased ‘media awareness campaign’ to become aware of the potential risks and hazards of STD (sexually transmitted disease) like HIV and its countermeasures. Our simulation study reveals that increasing the awareness campaign during high infection period has the potential to delay the onset of infection among aware people for longer period compared to unaware class of population (Figure 6). It is also to be noted that the population which has already been infected may show some revival prospect due to the acquired informational wealth, but the response is very slow as they loose self-confidence to recover completely once they become infected.

Figure 1
figure 1

Variations of populations with time.

Figure 2
figure 2

Non-linear stability of \(\pmb{(S_{-}^{*},I^{*}_{-})}\) in \(\pmb{S_{-}^{*}-I^{*}_{-}}\) plane.

Figure 3
figure 3

Non-linear stability of \(\pmb{(S_{+}^{*},I^{*}_{+})}\) in \(\pmb{S_{+}^{*}-I^{*}_{+}}\) plane.

Figure 4
figure 4

Variation of infected population and other populations with different values of \(\pmb{\lambda_{1}}\) .

Figure 5
figure 5

Variation of infected population and other populations with different values of \(\pmb{\lambda_{2}}\) .

Figure 6
figure 6

Awareness performance for different values of media campaign, \(\pmb{M_{0}}\) under high infection scenario ( \(\pmb{0.002 < \beta < 0.007}\) ).

Numerical illustrations for optimal control induced problems (12) and (13) are done, and we perform optimality test of the system by making the changes of the variable \(\tau= t/t_{f}\) and transferring the interval \([0, 1]\). Here τ represents the step size, used for better strategy with a line search method, which will maximize the reduction of performance measure. We choose \(t_{f} = 1 + \Delta t_{f}\) and initially \(t_{f} = 1\). We also assume that \(\Delta t_{f} = 0.1\) and our desired value of \(t_{f} = 100\). The solution of the optimal system (12) and ‘control measures’ for different populations are displayed in Figure 7. In this figure, we can see that the control pair, \((u_{1}, u_{2})\) has a meaningful bearing on the system. Note that after inducing control approach, unaware susceptible and infected population sizes are decreasing, while the sizes of aware susceptible and infected populations are increasing as per our expectation. In Figure 8, control performances of media coverage \(u_{1}\) and \(u_{2}\) indicate that the frequency of awareness programs should be increasing with the change of time. Moreover, execution of more awareness campaigns and faster dissemination of awareness mobilize a large fraction of the population. This in turn will widen the aware population and hence slow down the prevalence of the disease HIV.

Figure 7
figure 7

The system behavior for optimal control when final time is \(\pmb{t_{f}=1}\) (keeping all other parameters same as before considered).

Figure 8
figure 8

Optimal control parameters \(\pmb{u_{1}^{*}}\) and \(\pmb{u_{2}^{*}}\) are plotted as the functions of time, keeping all other parameters same.

6 Discussion and conclusion

In this research article, we deal with a non-linear SI mathematical model reflecting the effect of awareness programs on a certain population with constant recruitment rate. We have studied the impact of awareness as a novel intervention for the control of epidemiological diseases. In the modeling process, it is assumed that media campaigns create awareness regarding personal protection as well as control AIDS. As a result, behavioral changes (transfer from unaware to aware) occur within the human population, which results in the formation of a new class, i.e., aware class. Individuals of this class not only protect themselves from the infection, but being aware they also take part in reducing AIDS by taking precautions. Our analytical study shows that the basic reproduction number \(R_{0}\), which determines the existence of the disease, does not contain any awareness related terms. As a result, the persistence of the disease does not depend on awareness programs. However, the awareness programs reduce the infection rate, shorten the rate of disease transmission and cut down the size of the disease. Numerical simulations, which are very realistic, add an extra dimension to our analytic conclusions. On the basis of the existence conditions and analytical results, we showed that the system has a unique optimal control pair \((u^{*}_{1},u_{2}^{*})\) for which the cost function will be minimum and outcomes will be time worthy. This implies that the presence of awareness in the population makes the disease expedition difficult and shorter. But in practical sense, disease remains endemic, because factors like low education, ignorance in taking precautions, social problems, immigration etc. play negative roles in the system. We discuss a simple model that captures some important features, and we believe these findings may help in controlling AIDS through awareness.