1 Introduction

The World Health Organization (WHO) reports that the pandemic of COVID-19 has affected over 200 countries across the globe, with over 219 million people infected and 4.5 million people dead (WHO, 2021). The particularly high number of patients in this unprecedented experience necessitates proper resource management and an effective supply chain network (SCN) (Sharma et al., 2020; Eshkiti, Bozorgi-Amiri, and Sabouhi 2022). The leading implications triggered by this pandemic are the lack of hospital beds, insufficient medical supplies required for patients and the healthcare system, and waste management in hospitals, which has caused global complications (Rowan & Laffey, 2020; Sun et al., 2021). As a result, quick responses and solutions are vital to respond to these challenges.

As COVID-19 cases continue to rise, one of the major worries is the inadequacy of the medical infrastructure to fulfill patients' needs, particularly in the case of hospital beds and ICU beds; the lack of these facilities can lead to a dramatically high mortality rate associated with the disease (Choi, 2021; Volpato et al., 2020). This issue is considered a crisis even in some developed countries with competent healthcare systems (Gatto et al., 2020; Patel et al., 2020). Additionally, COVID-19 patients in hospitals have occupied hospital beds, reducing the capacity of hospitals, as a result of which patients suffering from other diseases cannot be hospitalized. The lack of empty hospital beds also exposes non-COVID-19 patients to the infection because they are more vulnerable to the virus. Moreover, as the number of patients increases, the health workers, who have the most contact with COVID-19 patients, are more likely to become infected with the virus. The absence of health workers due to being sick decreases the service capacity of many sectors, such as hospitals.

Prior to the COVID-19 epidemic, medicine shortages were a severe concern. However, the increased demand for certain medications and government restrictions implemented in some cases as a result of the pandemic aggravated the problem. Due to the pandemic, an increase in demand for medications and medical equipment is inevitable. Hence, paying extra attention to the supply of medical equipment and the medications needed by patients is of particular necessity (Goodarzian et al., 2021a, b; Shuman et al., 2020).

Along with the aforementioned factors, another issue that is equally crucial is the proper handling of infectious medical waste (IMW) within hospital. During the COVID-19 outbreak, various types of medical and hazardous waste were generated, such as infected protective equipment like masks and gloves (Sharma et al., 2020). Handling IMW is a critical part of managing a infectious disease as inappropriate collection and mismanagement of these wastes can exacerbate the spread of the infection and put patients and the healthcare system under a great deal of pressure (Kargar et al., 2020a). Earlier studies have demonstrated that one of the primary ways hospital staff are infected with diseases is the unwanted contact with infectious waste at the point of generation of the IMW. A hospital in a developed country, such as the US or the UK, has a legal duty to prevent infections caused by improper handling of IMW (WHO, 2014). As a result, the disposal and safe processing of IMW is a critical component of an efficient emergency response. Adequate healthcare waste management encompasses suitable procedures for collecting, segregating, storing, transporting, treating, and disposing of waste. For this reason, given the high risks of IMW, which can infect people and impair other actions for controlling the pandemic to, governments declare IMW management as necessary and of high priority (UNEP, 2020). Thus, considering an integrated approach to allocating patients to hospitals, supplying their medicinal items, and managing hospital waste is of particular importance (Goodarzian et al., 2021a).

The main challenges faced in the management of this pandemic are the uncertainty over the number of patients and disruptions in the capacity of facilities due to this uncertainty which is believed to be a burden regarding patients’ hospitalization, medical items distribution, and waste management (Koffman et al., 2020).

Disruption in the capacity of facilities could have various justifications, including the followings:

  • Owing to the consequences of the COVID-19 pandemic concerning the SC of personal protective equipment (PPE) and surges in the number of patients requiring hospitalization, many healthcare professionals are dying in hospitals, and physicians have become infected with this virus because of the lack of PPE, causing hospitals to have reduced their capacity (Yoshida et al., 2020).

  • As a result of the infection of people working in waste treatment and disposal facilities resulting from close contact with waste infected with the COVID-19 virus, as well as a significant rise in the volume of generated waste (such as PPEs and masks, for example, which are disposable), waste management facilities have faced a reduction in capacity (Teymourian et al., 2021).

  • The disruption in PPE supply exacerbated by the healthcare crisis caused by a demand shock, the government's failure to manage and distribute domestic supplies, and significant interruptions to PPE's worldwide SC (Cohen & Rodgers, 2020).

Numerous approaches exist to manage the uncertainty in an optimization problem (Sadrabadi et al. 2023). The method of stochastic programming has been frequently employed to tackle an optimization problem with uncertainty. When coping with uncertainty via the stochastic method a set of scenarios must be created to resemble the uncertain parameters. These created scenarios ought to correspond to possible occurrences of the uncertain parameters and their dependent probabilities (Desi−Nezhad, Sabouhi, and Sadrabadi 2022). Machine Learning (ML) is one of the most widely utilized approaches to forecasting time series and has been proposed as an alternative statistical time series forecasting method. ML forecasting is one of the most effective approaches to capturing patterns in the sequence data, forecasting time series analysis, and obtaining scenarios for planning (Makridakis et al., 2018).

To respond to the pandemic of COVID-19, temporary facilities (for example temporary hospitals), the distribution of relief items and medicines, and waste management must be consideration. Thus, in this paper, a hybrid approach was proposed. In the first phase, by training number of Artificial Neural Networks (ANN), the number of patients in future periods and disruption scenarios were forecasted. Subsequently, using the K-Means method, these obtained scenarios were reduced. In the second phase, a data-driven two-stage stochastic model was proposed. In this model, given the fact that several hospitals in many countries mostly function based on their maximum patient's capacity, locating temporary hospitals to allocate patients were also considered. Furthermore, the proposed mathematical model considers the distribution of the required medications to hospitals by locating distribution centers. Due to the significant amount of IMW produced in hospitals and their high risk of disease transmission, waste management was also considered in the model by taking into account the existing waste disposal centers and setting up temporary disposal centers along with temporary transfer centers.

The following sections of the article are organized as such: In Sect. 2, an overview of the literature on the subject is presented. The two-phase approach to supply chain network design (SCND) is presented in Sect. 3. Sect. 4 presents the computational results of the case study. Sect. 5 discusses the findings and managerial insights. Finally, Sect. 6 concludes the paper.

2 Literature review

Within this section of the article, the papers conducted in the four following areas are reviewed: models for forecasting the number of COVID-19 patients, data-driven optimization and scenario generation and reduction in SCND, supply chain networks for resource allocation and distribution during COVID-19 outbreak, and medical waste management during COVID-19 outbreak. Afterward, research gaps are presented.

2.1 Models for forecasting the number of COVID-19 patients

Due to the significant number of individuals affected by COVID-19 and the elevated rate of fatalities, one of the utmost important tasks is to make highly accurate forecasts regarding the number of patients. An accurate prediction can contribute to revealing the surge in demands; as a result, governments will be able to take preventive actions more efficiently (Abbasimehr & Paki, 2021). ML models and ANN have been widely used to forecast the number of confirmed COVID-19 patients and have shown good performance in this regard. (Kırbaş et al., 2020), employing Nonlinear Autoregression Neural Network (NARNN) model, Auto-Regressive Integrated Moving Average (ARIMA) approach, and Long-Short Term Memory (LSTM) model, predicted the number of confirmed COVID-19 cases in various countries. They found that the LSTM model has the greatest performance among the other approaches. Considering Polynomial Regression and hyperparameter optimization while using the Gaussian process regression, (Dhamodharavadhani & Rathipriya, 2021) estimated the mortality rate of COVID-19 in India. (Wang et al., 2020) trained an LSTM model on the data of daily confirmed cases by Johns Hopkins University to forecast 150 consecutive days in Russia, Peru, and Iran. Other researchers have published similar articles in this regard, such as (Abbasimehr & Paki, 2021; Arora et al., 2020; Melin et al., 2020; Shahid et al., 2020). In addition to using ML methods for forecasting the cases of COVID-19, other models, such as SEIR, have been utilized to estimate the epidemiological parameters of COVID-19, and the obtained predicted parameters are utilized to forecast future cases (Godio et al., 2020; Gopal et al., 2021; Kumar et al., 2021; Malik et al., 2021; Zhan et al., 2021).

2.2 Data-driven optimization, scenario generation and reduction in supply chain network design

Primary efforts in data-driven supply chains focus on Robust optimization and ML techniques for identifying the uncertainty sets. Using Support vector clustering for obtaining the uncertainty set for uncertain parameters, a data-driven model that systemically designs and optimizes the supply chain of biodiesel production from the wastewater sludge by (Mohseni & Pishvaee, 2020). In a similar approach, (K. M. Gumte et al., 2021a, 2021b) used a neuro-fuzzy clustering mechanism for clustering the uncertain space to identify the optimal uncertainty regions. After clustering, by calculating the local density of points, creating the boundary of the acquired cluster, generating a hypercube with the use of the minimum and maximum boundaries of each dimension of the cluster, and employing the Sobol sampling approach, they calculated the new uncertain dataset used for the worst- and best-case scenarios in their proposed Robust optimization. Similarly, (K. Gumte et al., 2021a, 2021b) used neuro-fuzzy C-means clustering for obtaining the uncertainty sets.

In the problems of SCND considering uncertainty, the stochastic programming approach has been frequently employed to tackle an optimization problem under uncertainty. Stochastic parameters are usually represented by discrete scenarios with known probabilities. Research studies have allocated significant efforts to creating effective scenarios in stochastic programming. (Govindan & Fattahi, 2017) used the Latin Hypercube Sampling method for scenario generation and the backward scenario reduction method for decreasing the scenarios' number. In order to make decisions about expanding a food company, according to the available historical data and considering market intelligence, business sense, and the intuition of the management, (Aras & Bilge, 2018) considered different scenarios for demand. In another study, through Cholesky's factorization method for generating the scenarios and by decreasing the number of generated scenarios with the K-Means algorithm, (Khatami et al., 2015) proposed a closed-loop SCND with uncertainty in return and demand. In this regard, (Snoeck et al., 2019), using the Monte Carlo method, generated scenarios for parameters of selling price, demand, transportation costs, and feedstock price to assess the value of investments in mitigation and the costs of disruptions in the chemical supply chain. Similarly, (Gholami-Zanjani et al., 2021) utilized the Monte Carlo method for generating the demand scenarios for a resilient food SCND. Afterwards, they used Fuzzy Bezdek Clustering method for reducing the number of generated scenarios. In a similar context, to generate scenarios related to operating costs and environmental impacts parameters, (Feitó-Cespón et al., 2021) used a Fuzzy Inference System methodology. Subsequently, using the obtained scenarios, they proposed a reverse SCND in the context of the plastic recycling process.

2.3 Supply chain networks for resource allocation and distribution during COVID-19 outbreak

Because of the severe damage wrought by COVID-19, the demand for resources has risen dramatically, and the need to optimize resource allocation and distribution of relief items and medicines has become increasingly important (Jordan et al., 2021; Yaqoubi et al., 2022). Because transfusing plasma from severe COVID-19 cases can reduce the effects of the disease on new patients, (Shirazi et al., 2021) developed a bi-objective stochastic optimization-simulation model for the plasma supply chain during the COVID-19 pandemic. By simulating the quantity of plasma demand, they developed a supply chain to find the location of blood collection centers, locate temporary plasma processing centers, allocate them beside the existing process centers to collection centers, and allocate facilities to the hospitals. Other respiratory diseases, such as the flu, could put extra pressure on the health services during the COVID-19 pandemic. In this regard, (Rastegar et al., 2021) attempted to optimize the allocation of influenza vaccines by locating distribution centers and considering allocating demands to the hospitals. (Sarkar et al., 2021) developed a data-driven optimization model for the allocation of patients to the hospitals by developing a compartmental model to delineate the spread of the COVID-19 virus and applying Pareto analysis to categorize the cities that have been affected the most by the COVID-19 pandemic.

(Goodarzian et al., 2021b) developed a sustainable medical SCN considering the production of COVID-19-related medicines, the distribution centers’ location and distribution of medications, facility allocation, and inventory control. They also used a simulation-based approach to determine the number of medicines required for COVID-19 patients. Finally, they used three meta-heuristics: Firefly algorithm hybrid with Variable Neighborhood Search, Ant Colony Optimization, and Fish Swarm Algorithm in a case study. Concerning the resource sharing for critical medical resources, (Mehrotra et al., 2020) developed a stochastic model for allocating ventilators to demand regions and then used the model to solve a case study of different states in the United States. (Gilani & Sahebi, 2022) proposed a vaccine distribution supply chain model by considering economic, social, and environmental dimensions. This model considers domestic or foreign vaccine production, vaccine distribution centers, different modes of transportation, types of vaccines, and the amount of vaccine sent to medical centers. Similarly, (Shiri & Ahmadizar, 2022) proposed a vaccine SCN for the COVID-19 pandemic. In this study, they first generated the demand scenarios using the Monte Carlo method; then, to distribute vaccines, by locating distribution centers, healthcare infrastructures are allocated to these hubs by considering minimization of the total cost as the objective function.

2.4 Medical waste management during COVID-19 outbreak

Another important aspect of planning the COVID-19 pandemic response is the management of IMW (Peng et al., 2020). Owing to the great number of people who are infected with the virus, the IMW generated in hospitals has faced dramatic growth. Therefore, the waste management is believed to be a vital part of managing the COVID-19 pandemic. (Tirkolaee et al., 2021) developed a location-routing mathematical model considering sustainability and time windows for management of medical waste during the COVID-19 pandemic. This model aims to minimize three objectives: total violation from time windows, total traveling time, and set up of disposal centers-associated infection risk. In another study, (Kargar et al., 2020b) presented a reverse logistics network design for waste management amid the COVID-19 pandemic. The IMW generated is handled with the landfills and existing and temporary treatment centers. This model tries to minimize three objectives: the total costs, the associated infection risk from transportation and processing of IMW in treatment centers, and uncollected wastes. Similarly, (H. Yu et al., 2020b) proposed a reverse logistics model for waste management during an epidemic outbreak. In this study, hospitals and healthcare centers are considered as the source of generating IMW. For managing these wastes, they have considered temporary waste transit centers, existing and temporary waste treatment centers, and waste disposal centers. They aim to minimize the infection risk of uncollected wastes, the infection risk of transportation and treatment of the wastes, and the total costs. Table 1 depicts the specific features of the reviewed studies.

Table 1 The classification of reviewed papers

2.5 Research gaps

As stated by the review of literature, it can be primarily seen that no studies included an integrated SCND to respond to the pandemic of COVID-19 in terms of patient allocation to the existing and temporary hospitals, medication distribution, waste management, infection risk of constructing temporary facilities and waste management, and disruption in the capacity of facilities altogether. Secondly, there is a lack of data-driven modeling to make use of available data from COVID-19 disease; this is particularly important since many of the components associated with the pandemic of COVID-19, including the number of patients in future periods, are unknown, can be accurately addressed using certain methods (ML and ANN). Many countries around the world provide daily updates on the number of people infected and deaths by COVID-19 amid the ongoing pandemic. Through time, and due to the persistence of this epidemic, it could be seen that there is a lot of time series data available. The use of historical data and tools like ML or time series analysis can contribute to accurate forecasting of the number of infected people in future periods. One of the vital problems of supply chains (commercial and humanitarian) throughout the COVID-19 pandemic is the uncertainty in demand (Raj et al., 2022). When uncertainty is present in decision-making, it can lead to certain problems, such as medical resource distribution, reducing or limiting the ability to provide care to patients, providing temporary healthcare solutions, and many other problems. Data-driven SCNDs, such as the proposed two-phase approach in this study, can positively impact the financial performance, responsiveness, and resource management of the supply chains (W. Yu et al., 2018). Thirdly, a few papers have considered the disruption in the capacity of facilities, such as hospitals, distribution centers, as well as waste transfer and disposal centers simultaneously. Finally, investigation of the specific features of COVID-19 disease, such as the probability of different demand scenarios, prioritizing the allocation of severe patients to the existing and more advanced facilities and non-severe patients to temporary ones with a lower service level, and the specific medicine requirements for each group of patients has not been comprehensively carried out in the literature.

In this study, a two-phase approach is presented by mathematical modeling the specific characteristics of the COVID-19 pandemic. As part of the first phase, using the ANN, the number of patients in the future periods was forecasted, and scenarios were generated. Afterward, using the K-Means method, the obtained scenarios were reduced. Ultimately, in the second phase, a data-driven two-stage stochastic programming (TSSP) model was proposed using the obtained scenarios in the first phase, which has three objectives, namely maximizing the minimum allocation of COVID-19 patients to hospitals, minimizing the total transportation time, and minimizing the total infection risk associated with the temporary facility installation and transportation of medical supplies and IMW. In the proposed model, the following factors were considered: locating temporary facilities (temporary hospitals and temporary waste transfer and disposal centers), disruption in the capacity of facilities, medication distribution by locating distribution centers, and waste management.

3 A two-phase approach for supply chain network design

Amidst the COVID-19 pandemic, one of the biggest challenges is the lack of adequate hospital beds for patients. Given the fact that the lack of medical care for patients can raise the chance of an increase in the severity of their disease, locating temporary hospitals is essential. However, COVID-19 disease is more severe in some and milder in others. Therefore, patients’ hospitalization should be prioritized, and people with more severe conditions should be prioritized. In addition, they should be allocated to the existing hospitals (which are more equipped than temporary ones). For this purpose, a multi-objective, three-echelon, multi-product, multi-period, data-driven mixed-integer two-stage stochastic linear programming model is proposed. As shown in Fig. 1, COVID-19 patients with any level of severity of the disease are initially allocated to the existing hospitals. Thereafter, if there is no more bed capacity in the existing hospitals, with locating temporary hospitals, patients with lower disease severity are assigned to these established temporary hospitals. Depending on the patients' need for medications, the required medications from distribution centers are then provided to the hospitals by locating distribution centers. Subsequently, considering the IMW produced in hospitals, infectious wastes are handled in two ways. The first strategy involves transferring waste directly to the existing disposal facilities for waste. Afterward, with the completion of the processing capacity of the existing waste disposal centers, if necessary, temporary waste disposal centers are set up. In the second strategy, by establishing temporary waste transfer centers, the wastes are first transferred to these centers and then sent to the existing and temporary waste disposal centers. Among all the above-mentioned challenges, the most pivotal part of supply chain management to respond to this pandemic is knowing the number of patients for the planning horizon. Therefore, forecasting the number of patients for future periods is known to be one of the main components of effective disease management; this is because of the growing number of infected cases, as a result of which, more medical staff and health workers could become infected, causing disruptions and underperformance in distribution centers, hospitals, and waste management centers due to their absence.

Fig. 1
figure 1

The proposed SCN

This problem necessitates making efficient decisions about the following:

  • The location of the distribution centers

  • The location of the temporary hospitals

  • The quantity of medications transported from distribution centers to medical facilities

  • The number of patients allocated to hospitals

  • The location of temporary disposal and transfer centers

  • The amount of uncollected IMW in hospitals

  • The quantity of waste transferred from hospitals to the waste disposal or transfer centers

This research is divided into two phases in this regard. In the first phase, several recurrent neural networks with LSTM architecture were trained on the historical data to forecast the number of patients in future periods and generate demand scenarios. The latest daily data of COVID-19 allows accurate forecasts using state-of-the-art tools, such as ML and deep learning (DL). Additionally, the daily data of the number of infected people available is suitable for ML owing to the long time since the pandemic started. On the other hand, time series data that cover a broad period capture more trends and seasonality embedded in them, making the forecasts more accurate. Via the K-Means clustering method (Jain et al., 1999), the created scenarios were then clustered, and the final scenarios were obtained. During the second phase, a TSSP mathematical model was developed using the demand scenarios obtained in the first phase. Subsequently, following the conversion of the suggested model into a linear form, the multi-objective model was turned into a single-objective model employing augmented \(\varepsilon \)-constraint method (AUGMECON2).

Figure 2 depicts the steps of the two-phase approach. In the following sections, we elaborate on each phase.

Fig. 2
figure 2

The steps of the proposed approach

3.1 Phase one: forecasting the demand using LSTM-RNN

DL methods, especially RNNs, are suitable for processing sequences like time-series and have been widely utilized in this area. However, the major drawback of recurrent neural networks is the issue of vanishing or exploding gradients, which makes training challenging (Abbasimehr & Paki, 2021). As an extension of RNN, LSTM excels in forecasting time series data owing to its capability of maintaining long-term temporal dependencies (allowing it to store information for extended periods) (Greff et al., 2017). LSTM-RNNs are more accurate than traditional ANNs for time series data because they can capture long-term dependencies. This makes them ideal for predicting future trends and forecasting time series. It handles the vanishing gradient problem because of which learning is terminated in neural networks. Additionally, LSTM-RNNs are becoming increasingly popular for their ability to resist noise interference and their capacity to handle higher levels of complexity than traditional RNNs.

Every LSTM block comprises of a cell state, a forget gate, an input gate, and an output gate. The structure of an LSTM block is illustrated in Fig. 3. In LSTM, \(x({t}_{i})\) is the input value, \(h\left({t}_{i}\right)\) and \(h\left({t}_{i-1}\right)\) are the output values at the time points \({t}_{i}\) and \({t}_{i-1}\), respectively, and \(c\left({t}_{i}\right)\) and \(c\left({t}_{i-1}\right)\) represent the cell states at the time points \({t}_{i}\) and \({t}_{i-1}\), respectively. The functioning of each LSTM block is as follows:

  • The forget gate \(f\left({t}_{i}\right)\) computes and preserves the information in \(c\left({t}_{i-1}\right)\) using \(x\left({t}_{i}\right)\) and \(h({t}_{i-1})\) along with sigmoid activation.

  • Using the \(x({t}_{i})\) and \(h\left({t}_{i-1}\right)\), the input gate \(a({t}_{i})\) computes the value of \(c({t}_{i})\).

  • By applying tanh and sigmoid on \(c\left({t}_{i}\right)\), the output gate \(o\left({t}_{i}\right)\) implements regulation.

Fig. 3
figure 3

Architecture of LSTM (Greff et al., 2017)

The mathematical expression for the forward learning process of an LSTM block is as follows:

$$a\left({t}_{i}\right)=\sigma ({w}_{a}x\left({t}_{i}\right)+{w}_{ha}h\left({t}_{i-1}\right)+{b}_{a})$$
(1)
$$f\left({t}_{i}\right)=\sigma ({w}_{f}x\left({t}_{i}\right)+{w}_{hf}h\left({t}_{i-1}\right)+{b}_{f})$$
(2)
$$c\left({t}_{i}\right)={f}_{t}\times c\left({t}_{i-1}\right)+{a}_{t}\times tanh({w}_{x}x\left({t}_{i}\right)+{w}_{hc}\left(h\left({t}_{i-1}\right)+{b}_{c}\right))$$
(3)
$$o\left({t}_{i}\right)=\sigma ({w}_{o}x\left({t}_{i}\right)+{w}_{ho}h\left({t}_{i-1}\right)+{b}_{o})$$
(4)
$$h\left({t}_{i}\right)=o\left({t}_{i}\right)\times tanh(c\left({t}_{i}\right))$$
(5)

Here, the activation functions are sigmoid (\(\sigma \)) and tanh, and the symbol \(\times \) denotes point-wise multiplication, \(\left\{{{b}_{c}, b}_{f},{b}_{a},,{b}_{o}\right\}\) denote the biases of the cell state, forget gate, input gate, and output gate, respectively, \(\left\{{w}_{c},{w}_{f},{w}_{a},{w}_{o}\right\}\) are the weights matrix of cell state, forget gate, input gate, and output gate, respectively, \(\left\{{w}_{ha},{w}_{hf},{w}_{hc},{w}_{ho}\right\}\) are the recurrent weights, and finally, \(\left\{a\left({t}_{i}\right), f\left({t}_{i}\right), c\left({t}_{i}\right), o\left({t}_{i}\right)\right\}\) show the output results of the gates. The learning steps of an LSTM are as below:

  1. 1.

    Computing the output using Eqs. (1) to (5) (forward learning).

  2. 2.

    Calculating the error between the output and input data labels.

  3. 3.

    Backpropagation and updating each gate's weight using an optimization algorithm.

At the end of the process, the optimal weights and biases are determined by repeating the aforementioned steps for a given number of iterations.

The architecture used for the proposed neural networks is shown in Fig. 4, consisting of LSTM neurons with Rectified Linear Unit (ReLU) activation function for the first layer, which gets the data as input. In the next layer, fully connected Dense neurons with ReLU activation function are utilized. Regularization, particularly the appropriate use of the drop-out layer, is critical to the performance, given the intricacy of the proposed models. Studies have shown that with this type of artificial neural network architecture and through the use of this network for forecasting time series, drop-out layers with a rate of 0.5 are of great importance to prevent overfitting and enhance the generalization of the resulted model (Lipton et al., 2016). Therefore, a drop-out layer with a rate of 0.5 is applied. Eventually, fully connected Dense neurons with ReLU activation are used in the final layer for providing the output.

Fig. 4
figure 4

The proposed ANN architecture

According to Fig. 4, the inputs with the size of \(5\times 1\) (the number of periods specified) are entered into 100 LSTM blocks. Moreover, in the final layer, there are five fully connected Dense neurons giving an output with the size of \(5\times 1\) (as much as the number of periods desired for forecasting). It should be noted that this architecture is based on the actions taken during data preparation. Therefore, the input and output data sizes depend on how the data and the desired outcomes from running this network are labeled in this research, since the data are considered to be five time periods as input and the subsequent five time periods as labels, the proposed network will finally forecast the demand for five periods.

3.1.1 Data preparation

Three fundamental data preparation tasks used to prepare data for forecasting are data cleaning, normalization, and data labeling. Given a time series of length N \(\left\{x\left({t}_{i}\right), i=1, 2, \dots , N\right\}\), the preprocessing steps are as follows:

  • Data cleaning

    In the case of noisy and missing values in the given time series, the noisy values need to be smoothed out, and the missing values must be replaced using a suitable methodology.

  • Data normalization

    The given \(x\left({t}_{i}\right)\) time series is normalized as follow, and the result represented as \(\left\{{x}_{n}\left({t}_{i}\right), i=1, 2, \dots , N\right\}\):

    $${x}_{n}\left({t}_{i}\right)= \frac{x\left({t}_{i}\right)-\mathrm{min}(x)}{\mathrm{max}\left(x\right)-\mathrm{min}(x)}$$
    (6)

    where \(\mathrm{max}\left(x\right)\) and \(\mathrm{min}(x)\) respectively are the maximum and the minimum values of \(x\).

  • Data labeling

    For its input, the proposed ANN model requires examples in the input–output format. As a result, each example must be a pair of an input object and the desired output value (label). For obtaining the mentioned format from the given normalized time series \({x}_{n}\left({t}_{i}\right)\), we should divide the \({x}_{n}\left({t}_{i}\right)\) into examples with the desired length of \(K\). For this purpose, by considering \(L\) as the total length of \({x}_{n}\left({t}_{i}\right)\), input–output pairs (\({I}_{i}\)) are as follows:

    $$ I_{1} = \left\{ {\begin{array}{*{20}l} {input:\left\{ {x_{n} (t_{1} ),x_{n} (t_{2} ),...,x_{n} (t_{K} )} \right\}} \hfill \\ {output:\left\{ {x_{n} (t_{{K + 1}} ),x_{n} (t_{{K + 2}} ),...,x_{n} (t_{{2K}} )} \right\}} \hfill \\ \end{array} } \right. $$
    (7)
    $$ I_{2} = \left\{ {\begin{array}{*{20}l} {input:\left\{ {x_{n} (t_{2} ),x_{n} (t_{3} ),...,x_{n} (t_{{K + 1}} )} \right\}} \hfill \\ {output:\left\{ {x_{n} (t_{{K + 2}} ),x_{n} (t_{{K + 3}} ),...,x_{n} (t_{{2K + 1}} )} \right\}} \hfill \\ \end{array} } \right. $$
    (8)
    $$ I_{L - 2K + 1} = \left\{ {\begin{array}{*{20}c} {input:\left\{ {x_{n} (t_{L - 2K + 1} ),x_{n} (t_{L - K} ),...,x_{n} (t_{L - K} )} \right\}} \\ {output:\left\{ {x_{n} (t_{L - K + 1} ),x_{n} (t_{L - K + 2} ),...,x_{n} (t_{L} )} \right\}} \\ \end{array} } \right. $$
    (9)

This procedure gives the total number of \(L-2K+1\) instances for the input of the proposed ANN. An example of the mentioned procedure for a simple time series with a length of 10 and considering \(K=4\) is presented in Fig. 5 which makes. \(L-2K+1=10-2\times \left(4\right)+1=3\)

Fig. 5
figure 5

A simple example of the procedure of data labeling

labeled instances.

3.1.2 K-Means clustering method

There are many methods and algorithms for clustering data. The clustering algorithm of K-Means is amongst the most prevalent and frequently used algorithms (Sabouhi et al., 2021). This methos attempts to divide \(n\) observations into \(k\) clusters. (James et al., 1967). Let \({t}_{m}\) be the vector of \(mth\) observation (forecasted time series) where each observation has \(e\) elements. Now, the subsequent actions have been finished:

  • Step 1 Randomly choose \(k\) data points as the initial Centroids (cluster centers) and call them \({c}_{k}\).

  • Step 2 Find the distance between each observation (\({t}_{m}\)) and the selected \(k\) centroids as follows:

    $$ \begin{array}{*{20}c} {d_{k}^{m} = \sqrt {\sum\limits_{e = 1}^{e} {\left( {t_{m}^{e} - c_{k}^{e} } \right)^{2} } } } & \begin{gathered}\quad \forall m \in 1,2,...,n \hfill \\ \quad\forall k \in 1,2,...,k \hfill \\ \end{gathered} \\ \end{array} $$
    (10)

    where \({t}_{m}^{e}\) is the \(eth\) element of vector \({t}_{m}\) and \({c}_{k}^{e}\) is the \(eth\) element of vector \({c}_{k}\). \({d}_{k}^{m}\) will be the distance between observation \(m\) and centroid \(k\).

  • Step 3 Now, based on the distance discovered, assign each data point to the nearest centroid. If observation \({t}_{m}\) is assigned to \({c}_{k}\) centroid, set \({m}_{mk}=1\), 0 otherwise.

  • Step 4 Take the average of the observations in each cluster group to update the centroid position as follows:

    $$ \begin{array}{*{20}c} {c_{k} = \frac{{\sum\limits_{m = 1}^{n} {m_{mk} t_{i} } }}{{\sum\limits_{m = 1}^{n} {m_{mk} } }}} & {\forall k \in 1,2,...,k} \\ \end{array} $$
    (11)
  • Step 5 Repeat Steps 2 to 4 until our centroids remain steady.

    For selecting the ideal value for clusters’ number (\(k\)), the Elbow method was used (Bholowalia & Kumar, 2014). In this approach, we adjust the clusters’ number (\(k\)) from 1 to a chosen number. Afterward, for each value of \(k\), we calculate the total distance between the observations and the centroids. Finally, when we plot the obtained values with the \(k\) value, the resulting plot exhibits an Elbow shape. The sum of the distances remains almost the same from the elbow point, Hence, the \(k\) value that corresponds to this point represents the optimal number of clusters or the optimal \(k\) value.

3.2 Developing two-stage stochastic data-driven programming

The second phase of the proposed two-phase approach develops a multi-objective stochastic optimization model for locating distribution centers, temporary hospitals, temporary waste disposal centers, and temporary waste transfer centers. Furthermore, considering disruption in the capacity of the mentioned facilities, the proposed model considers patients’ allocation to hospitals, distribution of drugs to hospitals, and waste management. The created model has three objectives: (1) maximizing the minimum patients' allocation to hospitals within all regions, (2) minimizing the sum of transportation times, and (3) minimizing the total risk of disease outbreak by establishing temporary facilities, waste transportation, and uncollected waste in hospitals.

In TSSP, the model makes two sets of decisions: first-stage decisions and second-stage decisions (Birge & Louveaux, 2011). The decisions of first-stage are made prior to comprehending the scenarios obtained from the first phase, including setting up distribution centers, locating temporary hospitals, and locating temporary waste disposal and transfer centers (Sabouhi & Jabalameli, 2019). The decisions in the second stage are related to the scenarios obtained from the first phase; considering the impact of these scenarios on disruption of the capacity of facilities and the demand, these decisions include the allocation of patients to hospitals, the medications’ number transferred to hospitals from distribution centers, the amount of medications inventory at hospitals, the amount of waste transferred from hospitals to waste disposal and transfer centers, as well as the quantity of uncollected waste in hospitals. Followed by the mathematical model, the next section outlines the sets, parameters, and decision variables.

3.2.1 Mathematical model

Sets and indices

Set of regions,\(m\in M\)

M

Set of existing hospitals,\(h\in P\)

P

Set of potential locations for temporary hospitals,\(h{^{\prime}}\in S\)

S

Set of all hospitals, \(H=P\cup S\),\(h\in H\)

H

Set of existing waste disposal centers,\(q\in U\)

U

Set of potential locations for temporary waste disposal centers,\(q\in R\)

R

Set of all waste disposal centers, \(Q=U\cup R\),\(q\in Q\)

\(Q\)

Set of potential locations for temporary transfer centers \(l\in L\)

\(L\)

Set of different disease severity in patients,\(i\in I\)

\(I\)

Set of potential locations for distribution centers,\(k\in K\)

\(K\)

Set of time periods,\(t\in T\)

\(T\)

Set of medications,\(d\in D\)

\(D\)

Set of demand scenarios,\(v\in V\)

\(V\)


Parameters

\({de}_{imt}^{v}\)

Forecasted patients’ number (demand) with disease severity of \(i\in I\) in region \(m\in M\) in time period \(t\in T\) under scenario \(v\in V\)

\({st}_{k}\)

Set-up cost of distribution center \(k\in K\)

\({pr}_{d}\)

Medication's purchase price per unit \(d\in D\)

\({tr}_{kh}\)

Per-unit transportation time of the medication \(d\in D\) from distribution center \(k\in K\) to hospital \(h\in H\)

\({hl}_{h}\)

Per-unit holding cost of medication in warehouse of hospital \(h\in H\) for each time-period

\({\theta }_{i}\)

The lowest fraction of group \(i\in I\) that must be included (coverage rate)

\({\beta }_{ktd}\)

The maximum capacity of distribution center \(k\in K\) for supplying medication \(d\in D\) in time-period \(t\in T\)

\({cp}_{h}\)

Capacity of patients in hospital \(h\in H\)

\({cw}_{h}\)

Warehouse capacity of hospital \(h\in H\) for medication storage

\({ch}_{h\mathrm{^{\prime}}}\)

Set-up cost of temporary hospital \(h\mathrm{^{\prime}}\in S\)

\({cq}_{q}\)

Set-up cost of temporary disposal center \(q\in R\)

\({cl}_{l}\)

Set-up cost of temporary transfer center \(l\in L\)

\({th}_{mh}\)

Travel time between region \(m\) and existed hospital \(h\in P\)

\({ts}_{hh\mathrm{^{\prime}}}\)

Travel time between existed hospital \(h\in P\) and temporary hospital \(h\mathrm{^{\prime}}\in S\)

\({tl}_{hl}\)

Travel time between hospital \(h\in H\) and transfer center \(l\in L\)

\({tc}_{lq}\)

Travel time between transfer center \(l\in L\) and disposal center \(q\in Q\)

\({tq}_{hq}\)

Travel time between hospital \(h\in H\) and disposal center \(q\in Q\)

\(\varphi \)

Conversion factor of time travel to cost

\({dn}_{id}\)

1 if group type \(i\in I\) need medication type \(d\in D\), 0 otherwise

\({hp}_{iht}\)

1 if patient from group type \(i\in I\) can be served by hospital \(h\in H\) at time period \(t\in T\), 0 otherwise

\(p{h}_{h\mathrm{^{\prime}}}\)

Population exposure around temporary hospital \(h\mathrm{^{\prime}}\in S\)

\({pa}_{q}\)

Population exposure around temporary disposal center \(q\in R\)

\({el}_{l}\)

Population exposure around temporary transfer center \(l\in L\)

\({rq}_{hq}\)

Population exposure around transportation route of waste from hospital \(h\in H\) to disposal center \(q\in Q\)

\({rl}_{hl}\)

Population exposure around transportation route of waste from hospital \(h\in H\) to transfer center \(l\in L\)

\({rc}_{lq}\)

Population exposure around transportation route of waste from transfer center \(l\in L\) to disposal center

\(q\in Q\)

\(a{r}_{h}\)

Probability of accidental risk at hospital \(h\in H\)

\(ri\)

Reproduction number of the disease

\(wr\)

Waste generation rate per person

\({pq}_{qt}\)

Unit processing cost in disposal center \(q\in Q\) at period \(t\in T\)

\({pl}_{lt}\)

Unit processing cost in transfer center \(l\in L\) at period \(t\in T\)

\({sq}_{q}\)

Capacity of disposal center \(q\in Q\)

\({sl}_{l}\)

Capacity of transfer center \(l\in L\)

\({BG}_{1}\)

Budget available for set-up costs of temporary hospitals and distribution centers, and transportation of patients and medications

\({BG}_{2}\)

Budget available for set-up costs of temporary waste disposal and transfer centers, and waste transportations, and waste processing costs

\({\pi }_{v}\)

Probability of occurrence of scenario \(v\in V\)

\({dh}_{hv}\)

Percentage of capacity disruption in hospital \(h\in H\) under scenario \(v\in V\)

\({ds}_{lv}\)

Percentage of capacity disruption in transfer center \(l\in L\) under scenario \(v\in V\)

\({dt}_{qv}\)

Percentage of capacity disruption in disposal center \(q\in Q\) under scenario \(v\in V\)

\({dk}_{kv}\)

Percentage of capacity disruption in distribution center \(k\in K\) scenario \(v\in V\)

Decision variables

\({\omega }_{k}\)

1 if distribution center \(k\) is set up, 0 otherwise

\({W}_{htd}^{v}\)

Number of medications \(d\in D\) stored in hospital \(h\in H\) warehouse in time-period \(t\in T\) under scenario \(v\in V\)

\({X}_{khtd}^{v}\)

Number of medications \(d\in D\) transported from distribution center \(k\in K\) to hospital \(h\in H\) in time-period

\(t\in T\) under scenario \(v\in V\)

\({EH}_{h\mathrm{^{\prime}}t}\)

1 if temporary hospital \(h\mathrm{^{\prime}}\in S\) is set up in period \(t\in T\), 0 otherwise

\({Z}_{imht}^{v}\)

Number of patients with disease severity of \(i\in I\) at region \(m\in M\) that are assigned to existed hospital

\(h\in P\) in period \(t\in T\) under scenario \(v\in V\)

\({Y}_{ihh\mathrm{^{\prime}}t}^{v}\)

Number of patients with disease severity of \(i\in I\) at existed hospital \(h\in P\) that are assigned to temporary hospital \(h\mathrm{^{\prime}}\in S\) in time period \(t\in T\) under scenario \(v\in V\)

\({O}_{qt}\)

1 if temporary disposal center \(q\in R\) is set up in time period \(t\in T\), 0 otherwise

\({E}_{lt}\)

1 if temporary transfer center \(l\in L\) is set up in time period \(t\in T\), 0 otherwise

\({ML}_{hlt}^{v}\)

Waste amount transported from hospital \(h\in H\) to temporary transfer center \(l\in L\) in time period \(t\in T\) under scenario \(v\in V\)

\({MQ}_{hqt}^{v}\)

Waste amount transported from hospital \(h\in H\) to disposal center \(q\in Q\) in time period \(t\in T\) under scenario \(v\in V\)

\({MC}_{lqt}^{v}\)

Waste amount transported from transfer center \(l\in L\) to disposal center \(q\in Q\) in time period \(t\in T\) under scenario \(v\in V\)

\({UC}_{ht}^{v}\)

Amount of uncollected of waste at hospital \(h\in H\) in time period \(t\in T\) under scenario \(v\in V\)

Objective functions

$$ Max \, Z_{1} = \mathop {Min}\limits_{i,m,t} \left\{ {\sum\limits_{v \in V} {\pi_{v} \left( {\frac{{\sum\limits_{h \in P} {Z_{imht}^{v} } }}{{de_{imt}^{v} }}} \right)} } \right\} $$
(12)
$$ Min \, Z_{2} = \sum\limits_{v} {\pi_{v} } \left[ \begin{gathered} \sum\limits_{h \in H} {\sum\limits_{k \in K} {\sum\limits_{t \in T} {\sum\limits_{d \in D} {X_{khtd}^{v} \times tr_{kh} } } } } + \sum\limits_{h \in P} {\sum\limits_{i \in I} {\sum\limits_{t \in T} {\sum\limits_{m \in M} {Z_{imht}^{v} \times th_{mh} } } } } + \hfill \\ \sum\limits_{h \in P} {\sum\limits_{h^{\prime} \in S} {\sum\limits_{t \in T} {\sum\limits_{i \in I} {Y_{ihh^{\prime}t}^{v} \times ts_{hh^{\prime}} } } } } + \, \sum\limits_{t \in T} {\sum\limits_{q \in Q} {\sum\limits_{h \in H} {MQ_{hqt}^{v} \times tq_{hq} } } } \, + \hfill \\ \, \sum\limits_{t \in T} {\sum\limits_{h \in H} {\sum\limits_{l \in L} {ML_{hlt}^{v} \times tl_{hl} } } } { + }\sum\limits_{l \in L} {\sum\limits_{q \in Q} {\sum\limits_{t \in T} {MC_{lqt}^{v} \times tc_{lq} } } } \, \hfill \\ \end{gathered} \right] $$
(13)
$$ \begin{gathered} Min \, Z_{3} = \, \sum\limits_{t \in T} {\sum\limits_{q \in R} {O_{qt} \times pa_{q} } } + \sum\limits_{l \in L} {\sum\limits_{t \in T} {E_{lt} \times el_{l} } } + \sum\limits_{h^{\prime} \in S} {\sum\limits_{t \in t} {{\text{EH}}_{h^{\prime}t} } } \times ph_{h^{\prime}} + \hfill \\ \, \sum\limits_{v} {\pi_{v} } \left[ \begin{gathered} \, \sum\limits_{h \in H} {\sum\limits_{t \in T} { \, ar_{h} \times UC_{ht}^{v} \times ri} } \, + \sum\limits_{t \in T} {\sum\limits_{h \in H} {\sum\limits_{q \in Q} {MQ_{hqt}^{v} \times rq_{hq} } } } + \hfill \\ \sum\limits_{h \in H} {\sum\limits_{l \in L} {\sum\limits_{t \in T} {ML_{hlt}^{v} \times rl_{hl} } } } \, + \sum\limits_{t \in T} {\sum\limits_{l \in L} {\sum\limits_{q \in Q} {MC_{lqt}^{v} \times rc_{lq} } } } \hfill \\ \end{gathered} \right] \hfill \\ \end{gathered} $$
(14)

The first objective function, Eq. (12), allocates patients by maximizing the minimum allocation-to-demand ratio per group in each region and each period. Furthermore, Eq. (13) expresses the second objective function and attempts to minimize the sum of transportation times, including the patients' transportation to hospitals, transportation time of medications to hospitals, and waste transportation time to transfer and disposal centers. Finally, The third objective function, given by Eq. (14) seeks to minimize the total risk of disease outbreaks by considering population exposure to establishing temporary facilities and waste transportation and the risk of disease outbreaks caused by uncollected waste in hospitals.

Constraints

$$ \begin{array}{*{20}c} {\sum\limits_{h \in P} {Z_{imht}^{v} } \ge \theta_{i} \times de_{imt}^{v} } & \begin{gathered} \forall i \in I,m \in M, \hfill \\ \forall t \in T,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(15)
$$ \begin{array}{*{20}c} {\sum\limits_{m \in M} {Z_{imht}^{v} } \le \sum\limits_{m \in M} {de_{imt}^{v} } } & \begin{gathered} \forall h \in P,i \in I, \hfill \\ \forall v \in V,t \in T \hfill \\ \end{gathered} \\ \end{array} $$
(16)
$$ \begin{array}{*{20}c} {\sum\limits_{h \in P} {Z_{imht}^{v} } \le de_{imt}^{v} } & \begin{gathered} \forall i \in I,m \in M, \hfill \\ \forall t \in T,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(17)

Constraints (15) to (17) make the lower and upper bound for the variable of patient allocation to hospitals (\({Z}_{imht}^{v}\)). Constraint (15) guarantees that patients with different levels of severity must be allocated to the hospitals at least at the coverage rate, which ensures that the number of patients allocated to hospitals cannot be less than a threshold. Constraint (16) ensures that the total allocation of patients in all regions per severity group must be less than or equal to the total number of patients in all regions, which guarantees that the patients’ allocation to hospitals cannot exceed the total demand. Similarly, constraint (17) also ensures that the allocation of patients from regions to hospitals must be less than or equal to the number of patients in each region.

$$ \begin{array}{*{20}c} {\sum\limits_{h^{\prime} \in S} {Y_{ihh^{\prime}t}^{v} } \le \sum\limits_{m \in M} {Z_{imht}^{v} } } & \begin{gathered} \forall h \in P,i \in I, \hfill \\ \forall t \in T,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(18)

Based on the Constraint (18), the allocation of patients from the existing hospitals to temporary hospitals should not exceed the number of them in the existing hospital.

$$ \begin{array}{*{20}c} {EH_{h^{\prime}t} \le \frac{{\sum\limits_{i \in I} {\sum\limits_{m \in M} {Z_{imht}^{v} } } - \sum\limits_{i \in I} {Y_{ihh^{\prime}t}^{v} } }}{{(1 - dh_{hv} )cp_{h} }}} & \begin{gathered} \forall h \in P,h^{\prime} \in S, \hfill \\ \forall t \in T,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(19)

Constraint (19) ensures that a temporary hospital is set up and used if only the existing hospitals’ capacity is completed (considering the capacity reduction due to disruption in the existing hospitals).

$$ \begin{array}{*{20}c} {\sum\limits_{h \in P} {\sum\limits_{i \in I} {Y_{ihh^{\prime}t}^{v} } } \le (1 - dh_{h^{\prime}v} ) \times EH_{h^{\prime}t} \times cp_{h^{\prime}} } & {\forall h^{\prime} \in S,t \in T,v \in V} \\ \end{array} $$
(20)
$$ \begin{array}{*{20}c} {\sum\limits_{m \in M} {\sum\limits_{i \in I} {Z_{imht}^{v} } } - \sum\limits_{h^{\prime} \in S} {\sum\limits_{i \in I} {Y_{ihh^{\prime}t}^{v} } } \le (1 - dh_{hv} ) \times cp_{h} } & {\forall h \in P,v \in V,t \in T} \\ \end{array} $$
(21)
$$ \begin{array}{*{20}c} {\sum\limits_{h \in H} {X_{khtd}^{v} } \le (1 - dk_{kv} ) \times \beta_{ktd} \times \omega_{k} } & \begin{gathered} \forall k \in K,d \in D, \hfill \\ \forall t \in T,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(22)

Constraints (20), (21) and (22) respectively define the available capacity in the existing hospitals, temporary hospitals, and distribution centers considering the disruption in their capacity.

$$ \begin{array}{*{20}c} {\sum\limits_{k \in K} {X_{khtd}^{v} } \ge \sum\limits_{h^{\prime} \in S} {\sum\limits_{m \in M} {\left( {Z_{imht}^{v} - Y_{ihh^{\prime}t}^{v} } \right) \times dn_{id} - W_{h,t - 1,d}^{v} } } } & \begin{gathered} \forall i \in I,h \in P, \hfill \\ \forall d \in D,v \in V, \hfill \\ \forall t \in T \hfill \\ \end{gathered} \\ \end{array} $$
(23)
$$ \begin{array}{*{20}c} {\sum\limits_{k \in K} {X_{kh^{\prime}td}^{v} } \ge \sum\limits_{h \in P} {Y_{ihh^{\prime}t}^{v} } \times dn_{id} - W_{h^{\prime},t - 1,d}^{v} } & \begin{gathered} \forall i \in I,h^{\prime} \in S, \hfill \\ \forall d \in D,t \in T, \hfill \\ \forall v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(24)
$$ \begin{array}{*{20}c} {\sum\limits_{k \in K} {\sum\limits_{d \in D} {X_{kh^{\prime}td}^{v} } } \le cw_{h^{\prime}} \times EH_{h^{\prime}t} } & {\forall h^{\prime} \in S,v \in V,t \in T} \\ \end{array} $$
(25)
$$ \begin{array}{*{20}c} {\sum\limits_{k \in K} {\sum\limits_{d \in D} {X_{khtd}^{v} } } \le cw_{h} } & {\forall h \in P,v \in V,t \in T} \\ \end{array} $$
(26)

Constraints (23) and (24) indicate that the allocation of medications from distribution centers to hospitals should be greater than or equal to the number of medications patients need, considering the amount of medication stock in the hospitals' warehouse. Constraints (25) and (26) guarantee that the number of medications sent to hospitals does not exceed their storage capacity.

$$ \begin{array}{*{20}c} \begin{aligned} W_{htd}^{v} = & W_{h,t - 1,d}^{v} + \sum\limits_{k \in K} {X_{khtd}^{v} } \\ & - \left( {\sum\limits_{i \in I} {\sum\limits_{m \in M} {Z_{imht}^{v} \times dn_{id} } } - \sum\limits_{i \in I} {\sum\limits_{h^{\prime} \in S} {Y_{ihh^{\prime}t}^{v} \times dn_{id} } } } \right) \\ \end{aligned} & \begin{gathered} \forall h \in P,d \in D, \hfill \\ \forall t \in T,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(27)
$$ \begin{array}{*{20}c} {W_{h^{\prime}td}^{v} = W_{h^{\prime},t - 1,d}^{v} + \sum\limits_{k \in K} {X_{kh^{\prime}td}^{v} } - \sum\limits_{i \in I} {\sum\limits_{h \in P} {Y_{ihh^{\prime}t}^{v} \times dn_{id} } } } & \begin{gathered} \forall h^{\prime} \in S,d \in D, \hfill \\ \forall t \in T,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(28)
$$ \begin{array}{*{20}c} {\sum\limits_{d \in D} {W_{htd}^{v} } \le cw_{h} } & {\forall h \in H,t \in T,v \in V} \\ \end{array} $$
(29)

Constraints (27) and (28) calculate the medication inventory at hospitals at the end of each period. Additionally, Constraint (29) ensures that at the end of each period, the hospitals’ stock level does not exceed the storage capacity of that hospital.

$$ \begin{array}{*{20}c} {\sum\limits_{m \in M} {Z_{imht}^{v} } - \sum\limits_{h^{\prime} \in S} {Y_{ihh^{\prime}t}^{v} } \le hp_{iht} \times (1 - dh_{hv} ) \times cp_{h} } & \begin{gathered} \forall h \in P,t \in T, \hfill \\ \forall i \in I,v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(30)
$$ \begin{array}{*{20}c} {\sum\limits_{h \in P} {Y_{ihh^{\prime}t}^{v} } \le hp_{ih^{\prime}t} \times (1 - dh_{h^{\prime}v} ) \times cp_{h^{\prime}} } & \begin{gathered} \forall h^{\prime} \in S,v \in V, \hfill \\ \forall t \in T,i \in I \hfill \\ \end{gathered} \\ \end{array} $$
(31)

Given the fact that some hospitals cannot serve a group of patients in some periods (such as temporary hospitals that cannot serve patients with high disease severity in any period), Constraints (30) and (31) ensure that patients with a specific severity group can only be hospitalized provided that the hospital can accept patients with that level of severity.

$$ \begin{array}{*{20}c} {\left( \begin{gathered} \, \sum\limits_{k \in K} {st_{k} \times \omega_{k} } \, + \sum\limits_{h \in H} {\sum\limits_{k \in K} {\sum\limits_{t \in T} {\sum\limits_{d \in D} {pr_{d} \times X_{khtd}^{v} } } } } \, + \hfill \\ \, \sum\limits_{h \in H} {\sum\limits_{k \in K} {\sum\limits_{t \in T} {\sum\limits_{d \in D} {tr_{kh} \times X_{khtd}^{v} \times \varphi } } } } \, + \, \sum\limits_{h \in H} {\sum\limits_{t \in T} {\sum\limits_{d \in D} {hl_{h} \times W_{htd}^{v} } } } \, + \hfill \\ \, \sum\limits_{h^{\prime} \in S} {\sum\limits_{t \in T} {ch_{h^{\prime}} \times EH_{h^{\prime}t} } } \, + \sum\limits_{h \in P} {\sum\limits_{t \in T} {\sum\limits_{i \in I} {\sum\limits_{m \in M} {th_{mh} \times Z_{imht}^{v} \times \varphi } } } } \, + \hfill \\ \, \sum\limits_{h^{\prime} \in S} {\sum\limits_{h \in H} {\sum\limits_{t \in T} {\sum\limits_{i \in I} {ts_{hh^{\prime}} \times Y_{ihh^{\prime}t}^{v} \times \varphi } } } } \hfill \\ \end{gathered} \right) \le BG_{1} } & {\forall v \in V} \\ \end{array} $$
(32)
$$ \begin{array}{*{20}c} {\left( \begin{gathered} \sum\limits_{t \in T} {\sum\limits_{q \in Q} {\sum\limits_{h \in H} {MQ_{hqt}^{v} \times tq_{hq} \times \varphi } } } \, + \sum\limits_{t \in T} {\sum\limits_{h \in H} {\sum\limits_{l \in L} {ML_{hlt}^{v} \times tl_{hl} \times \varphi } } } + \hfill \\ \sum\limits_{t \in T} {\sum\limits_{l \in L} {\sum\limits_{q \in Q} {MC_{lqt}^{v} \times tc_{lq} \times \varphi } } } \, + \sum\limits_{h \in H} {\sum\limits_{q \in Q} {\sum\limits_{t \in T} {MQ_{hqt}^{v} \times pq_{qt} } } } \, + \hfill \\ \sum\limits_{l \in L} {\sum\limits_{t \in T} {\sum\limits_{q \in Q} {MC_{lqt}^{v} \times pq_{qt} } } } \, + \sum\limits_{l \in L} {\sum\limits_{h \in H} {\sum\limits_{t \in T} {ML_{hlt}^{v} \times pl_{lt} } } } \, + \hfill \\ \, \sum\limits_{t \in T} {\sum\limits_{q \in R} {O_{qt} \times cq_{q} } } \, + \, \sum\limits_{t \in T} {\sum\limits_{l \in L} {E_{lt} \times cl_{l} } } \hfill \\ \end{gathered} \right) \le BG_{2} } & {\forall v \in V} \\ \end{array} $$
(33)

Constraints (32) and (33) determine the maximum budget. Constraint (32) includes the total cost of constructing temporary hospitals and distribution centers, transporting patients to hospitals, sending medications to hospitals, and the inventory costs in hospitals. Moreover, Constraint (33) comprises the total temporary disposal and transfer centers setup costs, waste transportation costs, and waste processing costs in the disposal and transfer centers.

$$ \begin{array}{*{20}c} {\sum\limits_{q \in Q} {MQ_{hqt}^{v} } + \sum\limits_{l \in L} {ML_{hlt}^{v} } \le wr \times \left( {\sum\limits_{i \in I} {\sum\limits_{m \in M} {Z_{imht}^{v} } } - \sum\limits_{i \in I} {\sum\limits_{h^{\prime} \in S} {Y_{ihh^{\prime}t}^{v} } } } \right)} & \begin{gathered} \forall h \in P,t \in T, \hfill \\ \forall v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(34)
$$ \begin{array}{*{20}c} {\sum\limits_{q \in Q} {MQ_{h^{\prime}qt}^{v} } + \sum\limits_{l \in L} {ML_{h^{\prime}lt}^{v} } \le wr \times \sum\limits_{i \in I} {\sum\limits_{h \in P} {Y_{ihh^{\prime}t}^{v} } } } & {\forall h^{\prime} \in S,t \in T,v \in V} \\ \end{array} $$
(35)

Constraints (34) and (35), by calculating the amount of waste generated in each hospital, ensure that the total amount of waste sent from a hospital to waste transfer and disposal centers should not exceed the amount of waste in that hospital.

$$ \begin{array}{*{20}c} \begin{aligned} UC_{ht}^{v} =\, & UC_{h,t - 1}^{v} + wr \times \left( {\sum\limits_{m \in M} {\sum\limits_{i \in I} {Z_{imht}^{v} } } - \sum\limits_{i \in I} {\sum\limits_{h^{\prime} \in S} {Y_{ihh^{\prime}t}^{v} } } } \right) \\ & - \left( {\sum\limits_{q \in Q} {MQ_{hqt}^{v} } + \sum\limits_{l \in L} {ML_{hlt}^{v} } } \right) \\ \end{aligned} & \begin{gathered} \forall h \in P,t \in T, \hfill \\ \forall v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(36)
$$ \begin{array}{*{20}c} {UC_{h^{\prime}t}^{v} = UC_{h^{\prime},t - 1}^{v} + wr \times \sum\limits_{i \in I} {\sum\limits_{h \in P} {Y_{ihh^{\prime}t}^{v} } } - \left( {\sum\limits_{q \in Q} {MQ_{h^{\prime}qt}^{v} } + \sum\limits_{l \in L} {ML_{h^{\prime}lt}^{v} } } \right)} & \begin{gathered} \forall h^{\prime} \in S,t \in T, \hfill \\ \forall v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(37)

Constraints (36) and (37) calculate the amount of uncollected waste in each hospital.

$$ \begin{array}{*{20}c} {\sum\limits_{h \in H} {ML_{hlt}^{v} } \le (1 - ds_{lv} ) \times sl_{l} \times E_{lt} } & \begin{gathered}\quad\quad \forall l \in L,t \in T, \hfill \\\quad\quad \forall v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(38)
$$ \begin{array}{*{20}c} {\sum\limits_{h \in H} {MQ_{hqt}^{v} } + \sum\limits_{l \in L} {MC_{lqt}^{v} } \le (1 - dt_{qv} ) \times sq_{q} \times O_{qt} } & \begin{gathered}\quad\quad \forall t \in T,q \in R, \hfill \\\quad\quad \forall v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(39)
$$ \begin{array}{*{20}c} {\sum\limits_{h \in H} {MQ_{hqt}^{v} } + \sum\limits_{l \in L} {MC_{lqt}^{v} } \le (1 - dt_{qv} ) \times sq_{q} } & \begin{gathered}\quad\quad \forall q \in U,\forall t \in T, \hfill \\\quad\quad \forall v \in V \hfill \\ \end{gathered} \\ \end{array} $$
(40)
$$ \begin{array}{*{20}c} {\sum\limits_{q \in Q} {MC_{lqt}^{v} } = \sum\limits_{h \in H} {ML_{hlt}^{v} } } & \begin{gathered} \forall t \in T,v \in V, \hfill \\ \forall l \in L \hfill \\ \end{gathered} \\ \end{array} $$
(41)

Constraints (38), (39), and (40) are respectively the capacity constraints of temporary transfer centers, temporary disposal centers, and the existing disposal centers. Constraint (41) is the flow balance constraint of the temporary transfer centers.

$$ \begin{array}{*{20}c} {\omega_{k} ,EH_{h^{\prime}t} ,O_{qt} ,E_{lt} \in \left\{ {0,1} \right\}} & {\forall k \in K,h^{\prime} \in S,t \in T,q \in R,l \in L} \\ \end{array} $$
(42)
$$ \begin{array}{*{20}c} {W_{htd}^{v} ,X_{khtd}^{v} ,ML_{hlt}^{v} ,MQ_{hqt}^{v} ,MC_{lqt}^{v} ,UC_{ht}^{v} \ge 0} & \begin{gathered} \forall h \in H,t \in T, \hfill \\ \forall d \in D,v \in V, \hfill \\ \forall k \in K,l \in L, \hfill \\ \forall q \in Q \hfill \\ \end{gathered} \\ \end{array} $$
(43)
$$ \begin{array}{*{20}c} {Z_{imht}^{v} ,Y_{ihh^{\prime}t}^{v} \ge 0} & {\forall m \in M,i \in I,h \in P,h^{\prime} \in S,t \in T,v \in V} \\ \end{array} $$
(44)

The constraints (42) to (44) show the properties of the decision variables.

3.2.2 Linearization

Due to the nonlinearity of the objective function (12), we deal with its linearization method in this section. The new free variable \(\mu \) will be replaced with \(Min\left\{ {\sum\limits_{v \in V} {\pi_{v} \times \frac{{\sum\limits_{h \in P} {Z_{imht}^{v} } }}{{de_{imt}^{v} }}} } \right\}\) in the objective function (12). Therefore, the following statement is correct:

$$ \begin{array}{*{20}c} {\mu = Min\left\{ {\sum\limits_{v \in V} {\pi_{v} \times \frac{{\sum\limits_{h \in P} {Z_{imht}^{v} } }}{{de_{imt}^{v} }}} } \right\}} & {\forall i \in I,t \in T,m \in M} \\ \end{array} $$
(45)

Therefore, considering \(\mu \le \sum\limits_{v \in V} {\pi_{v} } \times \frac{{\sum\limits_{h \in P} {Z_{imht}^{v} } }}{{de_{imt}^{v} }}\) and also \(\mu\) seeks to maximize in the first objective function, then it always takes the lowest value of \(\sum\limits_{v \in V} {\pi_{v} } \times \frac{{\sum\limits_{h \in P} {Z_{imht}^{v} } }}{{de_{imt}^{v} }}\)(Asghari et al., 2022; Rastegar et al., 2021). According to Eq. (45), the linear model presented will be as follows:

$$ MaxZ_{1} = \mu $$
(46)

Objective function (13) and (14)

$$ \begin{array}{*{20}c} {s.t.} \\ {\begin{array}{*{20}c} {\mu \le \sum\limits_{v \in V} {\pi_{v} } \times \frac{{\sum\limits_{h \in P} {Z_{imht}^{v} } }}{{de_{imt}^{v} }}} & {\forall i \in I,t \in T,m \in M} \\ \end{array} } \\ \end{array} $$
(47)

Constraints (15) to (44)

3.2.3 Multi-objective solution technique

One of the frequently employed methods for multi-objective programming is \(\varepsilon \)-constraint. This study employs an refined form of the augmented \(\varepsilon \)-constraint method, called AUGMECON2, was utilized to find the exact Pareto set of the multi-objective problem (Mavrotas & Florios, 2013). The \(\varepsilon \)-constraint method involves optimizing one objective function and converting the others into constraints, setting an upper bound limit for each. By altering the boundaries and solving the single-objective model, efficient solutions can be obtained. In the following, the AUGMECON2 method is explained:

$$ \max \left( {f_{1} (x) + \varepsilon \times \left( {\frac{{L_{2} }}{{d_{2} }} + \frac{{L_{3} }}{{d_{3} }} \times 10^{ - 1} + ... + \frac{{L_{p} }}{{d_{p} }} \times 10^{ - (p - 2)} } \right)} \right) $$
(48)
$$ \begin{array}{*{20}c} {s.t.} \\ {f_{2} (x) - L_{2} = e_{2} } \\ \vdots \\ \end{array} $$
(49)
$$ f_{p} (x) - L_{p} = e_{p} $$
(50)
$$ x \in S,{\text{ L}}_{i} \in R^{ + } $$
(51)

In Eqs. (48) to (51), \({e}_{1}, {e}_{2}, \dots , {e}_{p}\) are the right-hand side parameters acquired from each specific iteration of the grid points of the \({f}_{2}\left(x\right),{f}_{3}\left(x\right), \dots , {f}_{p}(x)\) objective functions. Parameters \({d}_{2}, {d}_{3}, \dots , {d}_{p}\) are the difference between the best and worst values of the respective objective functions and the variables \({L}_{2}, {L}_{3}, \dots , {L}_{p}\) are the surplus variables of the corresponding constraints. In addition, \(\varepsilon \in [{10}^{-6}, {10}^{-3}]\) and \(S\) is the feasible region. Figure 6 illustrates the steps of this method.

Fig. 6
figure 6

Steps of the AUGMECON2 method

Accordingly, the proposed mathematical model using the AUGMECON2 method will be as follows:

$$ Max\left( {\mu + \varepsilon \times \left( {(\frac{{L_{2} }}{{d_{2} }}) + (\frac{{L_{3} }}{{d_{3} }} \times 10^{ - 1} )} \right)} \right) $$
(52)
$$ s.t. $$
$$ \sum\limits_{v} {\pi_{v} } \left[ \begin{gathered} \sum\limits_{h \in H} {\sum\limits_{t \in T} {\sum\limits_{k \in K} {\sum\limits_{d \in D} {X_{khtd}^{v} \times tr_{kh} } } } } + \sum\limits_{t \in T} {\sum\limits_{i \in I} {\sum\limits_{h \in P} {\sum\limits_{m \in M} {Z_{imht}^{v} \times th_{mh} } } } } + \hfill \\ \sum\limits_{h \in P} {\sum\limits_{h^{\prime} \in S} {\sum\limits_{t \in T} {\sum\limits_{i \in I} {Y_{ihh^{\prime}t}^{v} \times ts_{hh^{\prime}} } } } } + \, \sum\limits_{q \in Q} {\sum\limits_{h \in H} {\sum\limits_{t \in T} {MQ_{hqt}^{v} \times tq_{hq} } } } \, + \hfill \\ \, \sum\limits_{h \in H} {\sum\limits_{l \in L} {\sum\limits_{t \in T} {ML_{hlt}^{v} \times tl_{hl} } } } { + }\sum\limits_{l \in L} {\sum\limits_{t \in T} {\sum\limits_{q \in Q} {MC_{lqt}^{v} \times tc_{lq} } } } \, \hfill \\ \end{gathered} \right] - L_{2} = e_{2} $$
(53)
$$ \left\{ \begin{gathered} \sum\limits_{t \in T} {\sum\limits_{q \in R} {O_{qt} \times pa_{q} } } + \sum\limits_{t \in T} {\sum\limits_{l \in L} {E_{lt} \times el_{l} } } + \sum\limits_{h^{\prime} \in S} {\sum\limits_{t \in t} {{\text{EH}}_{h^{\prime}t} } } \times ph_{h^{\prime}} + \hfill \\ \, \sum\limits_{v} {\pi_{v} } \left[ \begin{gathered} \, \sum\limits_{h \in H} {\sum\limits_{t \in T} { \, ar_{h} \times UC_{ht}^{v} \times ri} } \, + \sum\limits_{h \in H} {\sum\limits_{t \in T} {\sum\limits_{q \in Q} {MQ_{hqt}^{v} \times rq_{hq} } } } + \hfill \\ \sum\limits_{h \in H} {\sum\limits_{l \in L} {\sum\limits_{t \in T} {ML_{hlt}^{v} \times rl_{hl} } } } \, + \sum\limits_{q \in Q} {\sum\limits_{t \in T} {\sum\limits_{l \in L} {MC_{lqt}^{v} \times rc_{lq} } } } \hfill \\ \end{gathered} \right] \hfill \\ \end{gathered} \right\} - L_{3} = e_{3} $$
(54)

Constraints (15) to (44) and Constraint (47).

4 Computational results

In this section, in addition to explaining the case study, we present the results obtained from two phases of the suggested method in Sect. 3. To realize this, we first present the results obtained from the demand forecast by the proposed ANNs (first phase). Then, in the second phase, we will solve the proposed mathematical model using the results obtained from the first phase.

4.1 Case study

Tehran is the capital of Iran, which has one of the highest mortality rates for COVID-19 patients in Iran. The increase in the number of patients has led to the shortage of hospital beds for patients. Tehran has 22 districts. After the surveys, 15 hospitals in this city were found to be currently considered for the hospitalization of COVID-19 patients. The information about these hospitals was obtained from the Ministry of Health and Medical Education (MOHME). The location and bed capacity of these hospitals are shown in Table 6 of Appendix 1.

Moreover, to find the distances between the districts of Tehran, the center of each district was selected, which is shown in the Table 7 of Appendix 1. Furthermore, 10 waste disposal centers dedicated to waste management of the COVID-19 were identified in Tehran, whose locations are presented in Table 8 of Appendix 1. Finally, this research identified and considered six candidate locations for temporary hospitals, four candidate locations for temporary waste disposal centers, four potential locations for temporary waste transfer centers, and seven candidate locations for distribution centers. Figure 7 exhibits all the locations in this case study.

Fig. 7
figure 7

Location of hospitals, distribution centers, and waste disposal and transfer centers on the map

In the following, the Neshan map API has been used under the average traffic conditions of Tehran city to obtain the time interval between network points. Furthermore, for the set of medications in this case study, two medications, Naproxen and Famotidine, the most prescribed by physicians in Iran for COVID-19 patients, have been considered. The risk of a facility (population exposure around facilities) was calculated using (Zhao et al., 2016):

$$Risk= \pi {r}^{2}\left(k{m}^{2}\right)\times \text{Population} \text{density}(\text{People}/\text{k{m}}^{2})$$
(55)

where the impacted radius (\(r)\) is set to 1 km for transfer centers, 3 km for disposal centers, and 4 km for hospitals. Similarly, the risk of disease spread associated with transportation routes was calculated as the population's exposure within a 500 m range along the transportation route. The risk of probable accidents at the hospitals (\(a{r}_{h}\)) was also considered to be \(0.0007\) (Yu et al., 2020a). Finally, based on several studies, different values have been reported for the reproduction number of COVID-19 (\(ri\)), and herein, we used the value of \(2.5\); the majority of studies have reported a number close to it (Achaiah et al., 2020). The values of other important parameters of this case study are shown in Table 2.

Table 2 Parameter values of this case study ($: Dollars, $/S: Dollar/seconds, M3: Cubic meters)

4.1.1 Phase one results—demand forecasting using LSTM-RNN

Management of the patients’ allocation to hospitals necessitates proper and accurate forecasting concerning the number of patients. For this purpose, ANNs were trained using the historical data obtained from MOHME. The data timespan used for training is from 19 February 2020 to 26 October 2021. These data include the number of patients admitted to hospitals, classified into two groups: patients with mild and severe disease conditions. Accordingly, for each group of severity levels, 100 ANNs were trained to forecast the number of patients of that group in future periods. Given the fact that the duration of hospitalization of patients in the hospital is approximately 14 days for COVID-19 (Vekaria et al., 2021), each period in this study consists of a total of 14 days. We considered \(k=5\), which divides the data into labeled instances with a length of 5 according to Eq. (9). Primarily, the data concerning the daily number of patients were cleaned and normalized. Subsequently, the data of each of the 14 consecutive days were added together to form the suggested time period in this study. According to the start and end time of the collected data, a total of 615 days of data was available. As a result, the input to the model was five consecutive periods, with the label being five consecutive periods following the previous five.

To measure the effectiveness of the proposed ANNs for demand forecasting, four measures, namely Root-mean-square deviation (RMSE), Mean squared error (MSE), Mean absolute error (MAE), and Mean absolute percentage error (MAPE), were employed. The following equations give the definitions of RMSE, MSE, MAE, and MAPE:

$$ RMSE = \sqrt {\frac{1}{M}\sum\limits_{i = 1}^{M} {(U_{i} - R_{i} )^{2} } } $$
(56)
$$ MSE = \frac{1}{M}\sum\limits_{i = 1}^{M} {(U_{i} - R_{i} )^{2} } $$
(57)
$$ MAE = \frac{1}{M}\sum\limits_{i = 1}^{M} {\left| {U_{i} - R_{i} } \right|} $$
(58)
$$ MAPE = \frac{1}{M}\sum\limits_{i = 1}^{M} {\left| {\frac{{U_{i} - R_{i} }}{{U_{i} }}} \right|} $$
(59)

where \({U}_{i}\) is the actual value at time point \(i\) and \({R}_{i}\) is the predicted value of the time point \(i\). The performance of the ANNs is shown in Fig. 19 of Appendix 2 for predicting severe cases and Fig. 20 of Appendix 2 for predicting less severe cases.

After training ANNs on the data of severe and less severe patients, 100 ANNs were prepared for each group. Subsequently, the number of patients in each group for future periods was forecasted using these trained machines. The forecasting results show that areas with a higher population density will have more infected people in future periods. The obtained results indicate that according to the positive correlation between the population density of the districts and the number of people infected, the regions whose number of infected people is forecasted to be the highest in Fig. 8 are the those having a high population density in Tehran city (Bhadra et al., 2021).

Fig. 8
figure 8

Forecasted number of demands per district

Ultimately, via the K-Means clustering method, all the forecasts for each group of patients' severity were clustered into three clusters, giving the final three demand scenarios. To obtain the number of clusters, we used the Elbow method. As seen in Fig. 9, the optimal value of \(k\) (Number of clusters) is 3.

Fig. 9
figure 9

The Elbow method’s result

All the clustered scenarios for the number of severe and less severe patients in future periods are represented in Figs. 21, 22, 23, 24 of Appendix 3. Finally, as the representative of each cluster, the center of each cluster was selected as the demand forecast used in the mathematical model. The numerical results of the forecasts are given in Table 9 of Appendix 3. It should be noted that as a result of insufficient disclosure of the number of patients in each district separately by MOHME, the forecasted number of patients for each district was obtained using the population density of each district. Eventually, the probability of occurrence of each scenario (cluster) was calculated according to the number of members in each cluster divided by the total number of forecasts made, which is presented in Table 3.

Table 3 Probability of scenarios' occurrence

4.1.2 Phase two results—solving the two-stage stochastic programming mathematical model

The solution time of the \(\varepsilon \)-constraint method for solving the proposed model with the GUROBI solver of GAMS software was 2 min and 23 s; the objective values are reported in Table 4. Additionally, Fig. 10 shows the values of objective functions.

Table 4 Corresponding values of objective functions of the Pareto solutions
Fig. 10
figure 10

Values of objective functions

The Pareto frontier of solving the presented case study using the proposed model is plotted in Fig. 11.

Fig. 11
figure 11

Pareto frontier of solving the case study

Table 5 depicts the located distribution centers, temporary hospitals, temporary disposal centers, and temporary transfer centers for each Pareto solution. As can be seen, since the first objective function was chosen as the primary objective function, and the other objective functions were transferred to the constraints, with the minimum allocation-to-demand ratio reduction, fewer temporary hospitals were established. Moreover, in the first few of Pareto's solutions, the waste management budget was not fully utilized, leaving room for locating temporary transfer centers. Since these temporary transfer centers were not used, their selection was solely in view of sufficient funding. The locations chosen for temporary waste disposal centers are generally locations 12 and 15. These choices were made due to the lower capacity of the existing waste disposal centers near these temporary facilities. Figure 12 demonstrates the located temporary hospitals along with temporary disposal and transfer centers.

Table 5 Location of facilities in each Pareto solution
Fig. 12
figure 12

Located temporary facilities

According to Fig. 12, temporary hospitals were located where there are either fewer existing hospitals available or locations where there are no existing hospitals. Similarly, temporary disposal centers were established in the areas with no existing waste disposal centers in their vicinity. Also, according to the results of the forecasts, shown in Fig. 8 and Table 9 of Appendix 3, it can be concluded that another main reason behind choosing these areas as the location of temporary facilities is the high number of forecasted patients in them; for example, the fourth district of Tehran city has the highest number of patients according to the data obtained from MOHME and the forecasts made; as a result, a temporary hospital has been located in this area, as well as temporary disposal centers near this area. It should be noted that population density alone is not the reason for locating temporary facilities, and some other aspects, such as the insufficient capacity of hospitals in some areas (such as areas 18, 19, and 22), should also be taken into consideration.

Figure 13 shows the percentage of the demand allocated to each hospital in the chosen Pareto solutions for each scenario. As seen, the percentage of the patients allocated to the existing and temporary hospitals and the percentage of those not allocated to hospitals vary in each Pareto solution and each scenario. With the decrease in the minimum allocation-to-demand ratio, the allocation of patients to temporary hospitals declines and eventually reaches zero. The number of patients not allocated to hospitals, on the other hand, increases. For note, the proposed mathematical model locates temporary hospitals and allocates patients to them only when filling the capacity of existing hospitals. For this reason, the allocation priority always goes to existing hospitals, which is indicated through the low percentage of allocated patients to temporary hospitals.

Fig. 13
figure 13

Percentage of patients’ allocation to each hospital in Pareto solutions for each scenario

In Pareto solution #1, the uncovered demand obviously peaked at scenario #1 and reached the lowest in scenario #3. According to Table 9 of Appendix 3, the demand forecasts of scenario #1 were higher than those of scenarios #2 and #3 on average; the demand forecasts of scenario 3 were also lower on average than those of the other two scenarios, which apparent from the percentage of demand that remains uncovered in each scenario. As both total time and total risk decreased, the minimum allocation-to-demand ratio was reduced until the threshold for covering each group of patients' severity was obtained. For this reason, following Pareto solution #4, temporary hospitals were not considered, and allocation to temporary hospitals reached zero.

Figure 14 represents the total amount of uncollected waste. The amount of waste not collected decreased with the reduction in the total risk. In addition, the establishment of waste disposal centers reduced the total risk, which resulted in a reduction in uncollected waste.

Fig. 14
figure 14

Total amount of the uncollected waste in each scenario

4.2 Sensitivity analyses

Various sensitivity analyses on certain key parameters were undertaken and presented in this section to assess the performance of the proposed mathematical model. To this end, primarily, the adjustments to the results made by decreasing and increasing the forecasted demand were examined. As seen in Fig. 15, when the demand declines, the minimum allocation-to-demand ratio increases; for example, if we lowered the demand by 20% of its actual value, the model maintained a higher allocation-to-demand ratio overall. Furthermore, in cases where we increased the demand, the minimum allocation of patients to hospitals generally decreased.

Fig. 15
figure 15

The sensitivity analysis of demand parameter on the first objective function

The sensitivity analysis of the demand parameter on the second and third objective functions is shown in Fig. 16. As can be seen, the overall risk of the disease outbreak is reduced with a decrease in demand, which is due to less transportation and lower levels of uncollected waste in hospitals. Furthermore, with a reduction in the demand, the total time also decreases. The main reasons behind this behavior include the lower transport of patients between hospitals and the lower level of medication distribution.

Fig. 16
figure 16

The sensitivity analysis of demand parameter on the second and third objective functions

Disruptions in the capacity of the proposed network facilities are one of the main parameters that can impact all the objective functions. Therefore, the proposed model was solved by considering different combinations of disruptions in order to identify the most effective ones. Initially, the model was solved without considering any disruptions. Afterward, the effects of each disruption were examined separately, and in the end, the model was solved by considering all the disruptions simultaneously. The results of this proposed process are shown in Fig. 17.

Fig. 17
figure 17

Values of objective functions by considering different combinations of disruption parameters

According to the results obtained from Fig. 17, the simultaneous influence of all disruptions leads to the lowest allocation-to-demand ratio and causes high total time and total risk. Moreover, since no disruptions lead to the highest allocation-to-demand ratio, the total time and total risk are minimized. It can also be concluded that hospital disruption significantly impacts the allocation-to-demand ratio. Correspondingly, disruption of distribution centers, waste disposal centers, and waste transfer centers has a considerable impact on the total time and total risk. This impact stems from less efficient medication distribution and waste management. Thus, hospital disruption could be an important parameter of the proposed model.

Figure 18 illustrates the sensitivity analysis of hospital disruption on objective functions. Based on this figure, with greater disruption of hospitals, the minimum allocation-to-demand ratio increased. Correspondingly, the minimum allocation-to-demand ratio decreased with less disruption, which was predictable. Nonetheless, by changing the disruption parameter, different behaviors of the second objective function were observed. The total time slightly increased with a sharp decrease in the disruption parameter (Section a). This increase is due to the rise in hospitals' capacity, which increases the allocation of patients to hospitals. On the other hand, due to the decrease in transportation time owing to greater access to closer hospitals, the total time increase still remains slight. With a minor decrease in hospital disruption (Section b), the capacity of hospitals slightly increased, which augmented the allocation of patients to hospitals. However, this rise in the capacity of hospitals was not effective enough for a more optimal patient allocation (allocation to closer hospitals). Hence, the total time saw a more significant increase. With a slight increase in hospital disruption, patients’ allocation decreased, which reduced the total time (Section c). Nevertheless, with further growth in this disruption, the second objective function showed a different behavior. With a significant increase in disruption parameter, since the capacity of hospitals is much more impaired, and due to the lower optimized allocations (allocation of patients to distant hospitals due to the lack of capacity in nearby hospitals), the total time considerably increased (Section d). The lowest sensitivity to hospital disruption belonged to the third objective function. By allocating patients to other hospitals in a situation where the hospital disruption did not change, the transportation-related risks changed; they were smaller in comparison with the risk of establishing temporary facilities. They, therefore, made minor changes in the overall risk.

Fig. 18
figure 18

Sensitivity analysis of hospitals disruption

5 Discussion

This section presents the main findings and outputs of our study. In the first phase of our two-phase approach, the number of patients in future periods was forecasted using historical data utilizing various neural networks with LSTM-RNN architecture to cope the uncertainty in demand. After generating scenarios and reducing them using the K-Means algorithm, the output obtained from the first phase was three scenarios for each group of severe and less severe patients. Using the forecasted scenarios and developing the mathematical model by monitoring the response to COVID-19 in most parts of the world, particularly the countries with the highest number of COVID-19 cases, using expert opinions, and reviewing the literature review, the output of the second phase showed that temporary hospitals could reduce the dramatically high pressure on the existing hospitals. It is feasible to transport less severe patients to temporary hospitals and vacate equipped beds in the existing ones to treat patients with severe conditions. This is particularly important since a high percentage of COVID-19 patients can recover by standard care (Baud et al., 2020).

A higher and better demand coverage leads to the following problems: the supply of medications and waste management. Solving the proposed mathematical model, the necessity to set up distribution centers is undeniable. This issue is important since by using of temporary hospitals, a higher percentage of patients can be hospitalized, increasing the demand for medications. On the other hand, due to the increased patient coverage, the amount of waste generated in hospitals also rises. Therefore, locating distribution and temporary transfer and disposal centers is highly important. For example, the sensitivity analysis showed that disruption in distribution centers could reduce demand coverage and significantly increase risk and time in the system.

One of the issues that has been less addressed in the literature is the consideration of an objective function to minimize the network's total transportation time. Taking transportation time of patients and medications distribution and waste management into account is highly impressive due to the high risk of disease spread and, consequently, the increase in the number of patients. Managerial insights could be summarized as follows:

  • Data-driven decision-making and using tools such as ANN to forecast demand can help decision-makers overcome uncertainties more efficiently and improve the quality of answers. Also, a more accurate forecast of the demand makes planning the resources better and more equitable.

  • By assigning an appropriate budget to set up temporary hospitals, the coverage of patients can be maintained at a high level.

  • Temporary waste management facilities such as temporary transfer and disposal centers can help achieve improved waste management by reducing the amount of waste that is not collected in hospitals.

  • Facilities disruption can reduce patient coverage and increase transportation time and the risk of disease outbreak. However, considering different demand and disruption scenarios can reduce the destructive effects of disruption.

  • Due to the tremendous increase in demand for medication needed by COVID-19 patients and PPE, the location of distribution centers is very important. The lack of these items can lead to an increase in the probability of death of patients and the probability of medical staff infection and could disrupt the healthcare services.

6 Conclusion

COVID-19 has caused disease and a high rate of disease-related death worldwide. Following the widespread vaccination in most countries, it seemed as though the international community could effectively deal with the outbreak of this disease. Meanwhile, various virus mutations have repeatedly shown up (as of this writing, the Delta and Omicron mutations), indicating that we have a long way to go (Karim & Karim, 2021). One of the main concerns with the widespread prevalence of this disease is the lack of adequate capacity in hospitals to provide medical care to the affected people. Capacity completion of hospitals increases the risk of mortality and, on the other hand, puts inordinate pressure on the medical staff.

Additionally, due to the high number of COVID-19 patients, other significant problems, such as the supply of required medications and the management of waste generated by hospitals, have become pivotal and require planning. Nevertheless, a missing link is the uncertainty regarding the number of patients in future periods. Accordingly, this paper proposed a two-phase approach for the SCND to respond to the COVID-19 pandemic. This approach could contribute to the following decisions:

  • Forecasting the number of patients in future time periods.

  • Allocation of patients to hospitals.

  • Locating temporary hospitals if the existing hospitals are under the condition of full capacity.

  • Distribution of required medications by locating distribution centers.

  • Waste management with locating temporary facilities for waste transfer and disposal.

Given the fact that each neural network fits the data differently and delivers a distinct forecast, the K-Means method was employed to obtain the demand scenarios. It is noteworthy that methodologies like ML can make the decision process swifter and more accurate. In the second phase, using the results obtained from the first stage, the mixed integer linear TSSP model was proposed to optimally allocate patients to hospitals, establish temporary hospitals, distribute required medications, and manage wastes. The proposed model also considered different types of patient groups based on the level of disease severity, a minimum threshold for covering these groups, and disruption of facilities. The objectives of the proposed model include maximizing the minimum allocation-to-demand ratio, minimizing the total transportation time, and minimizing the risk of the disease spread.

Despite the utility of the proposed two-phase approach, our investigation is not without limitations. It is important to acknowledge that our approach may not have addressed all potential challenges, such as issues related to vehicle routing, supplier selection for medications, supply of raw materials and products during an epidemic outbreak, or considering sustainability aspects in mathematical modeling. Therefore, to address the limitations of the presented approach, we suggest the following future research directions:

  • Applying the two-phase approach outlined in this research to respond to another epidemic disease, such as the monkeypox outbreak.

  • Employing robust optimization methods to address uncertainty in modeling

  • Considering the global supply chain for the supply of raw materials and products during epidemic conditions

  • Incorporate the supplier selection and order allocation problems in the proposed model

  • Including vehicles routing and scheduling decisions in the modeling

  • Considering sustainability aspects in the mathematical modeling.

Owing to the adaptability of the proposed method, the findings of this study can be used to manage similar situations, such as the case of another infectious disease. Also, despite considering the specific features of the COVID-19 disease in the presented modeling, the presented framework can be used in other areas, such as forecasting other goods or services demand (first phase) and supply chains with commercial approaches (second phase). Furthermore, it seems vital to investigate the COVID-19 vaccine supply chain similar to this article.