Introduction

From the beginning of civilizations, the human race has witnessed epidemics of various forms and degrees. The recorded history of Greek epidemic dated back to 430–427 BCE which claimed the lives of an estimated 25–35 Greeks according to the contemporary historian Thucydides. Before the outbreak of COVID-19, there were many occurrences of epidemics, most notably The Black Death (1346–1353) that wiped out nearly half the population of Europe and Spanish Influenza (1918–1919) caused about 500 million deaths around the world. A detailed account of the history of epidemic and pandemic can be found in [1]. Throughout history, it has been observed that inadequate knowledge about the disease, misinformation and misconception among the populous, and improper handling of the situation caused more damage than it could have [2].

No wonder, in the case of COVID-19, we have experienced various confusing or sometimes conflicting information about the epidemic that affects the proper functioning of government. As succeeding populations are getting infected through social contact, the countries who have taken early measures have turned out to be the successful ones in containing the outbreak. Before any scientific correlations could be found among factors of contagion, basic measures could have saved most of the lives that were lost. Moreover, government’s inaction and citizen’s ignorance have turned the current situation at stake. While the nature of the virus and its contagiousness are yet to be determined, still proper actions by the legislative body or government and compliance of citizens could significantly reduce the spread and have better containment. A study on social distancing policies driven by public awareness was conducted and voluntary actions were found to have the strongest causal impact on reducing social interactions which resulted in the decline of the rate of infection by 37% after 15 days [3]. Mahmud et al. [4] considered their model on social dynamics that excludes government roles in control. The number of tests per population size is very important for allowing authorities to isolate and treat the infected individual avoiding further spread. In this paper, we cast down our eyes on an epidemic prediction model for COVID-19 that considers the role of governance and citizen’s consciousness along with other necessary variables [5]. This model apprises the concerned audience how their actions and behaviors change the outcome of the onrush by getting user inputs of the aggregated parameters on an interactive Web user console available at http://covid19.aiub.edu. Therefore, this work is not just a model that works on a computer and reports the result in an ambiguous way that the general public do not understand, it is also a functional system that communicates information with the concerned parties.

The hard part of the model is to initialize the parameter values accurately enough to fit the reported data that can reasonably project the future. Though accuracy at an early phase is not what we are concerned with, we have focused on its correct behavior based on social and pathological dynamics. The model updates its parameter on a regular basis when new data are received. Testing of hypothesis and methodology have been described in the “Methodology” section followed by model inception and implementation.

Social Dynamics

Scherer and Cho (2003) showed how one’s social networking behavior, as well as perception, impacts the health behavior of the respondent [5]. The government of several countries remain unsuccessful in assuring their people that they have control over the situation. In the interim, both the number of newly infected cases and deaths continue to increase every day. Pessimistic decisions may ascend either when a hazard is publicized aggressively or when a dreadful circumstance is presented with extravagant believability [6]. The decelerate response of several countries to the COVID-19 sternness can be explained through geographic collocation. Findings of a research work led by Fischhoff et al. (2003) revealed that the hazard discernment concerning a hostile affair is induced by propinquity to the hazard [7]. In another study, it was observed that the perceptions and behaviors toward the hazard concerning avian influenza (H5N1) virus striking Europe between 2005 and 2006 were significantly associated with the proximity to hazards [8].

Besides, the inability of the political leaders to have people in their confidence to combat COVID-19 would have an influential impact on people to be defiant to the government’s order regarding social distancing and lockdown [9]. The government should clarify the motives of a lockdown to its people, particularly to the younger ones who have been found to breach the lockdown rules constantly [10]. Confidence in mainstream media has been found to be correlated with support for lockdown measures as well [11]. In a country like Bangladesh having misgovernance, the government has traditionally been failing to build proper awareness among the people during any emergency period and manage the crisis politically [12].

Prediction Models and Their Components

The SIR model [13] is one of the olden compartmental models in epidemiology projecting infectious diseases like COVID-19 [14] and numerous diversified derivations came out from it. The principal SIR model comprises three compartments.

S: The number of susceptible individuals. It is the number of a population who is at risk of being infected and after being infected. The susceptible individuals are then shifted to the next infectious compartment I after being infected. The transmission rate from susceptible to infected is assumed to be βSI/N2 where N is the total population and the transmission parameter β is the average number of individuals that one infected individual will infect per unit time.

I: The number of infectious individuals. The population of this compartment are the individuals who have been infected and are capable of infecting susceptible individuals. A segment of the population of this infected class will be shifted to the next removal compartment at a recovery rate γ, so that 1/γ is the average period during which an infected individual remains infectious.

R: The number of removed individuals. The individuals who have been infected and have either recovered from the disease or died entered the removed compartment.

The basic reproductive number R0, which is inured to quantify the transmission of pathogens, is the ratio β/γ. That means R0 is the average number of people infected by an infected individual over the disease infectivity period, in a susceptible population [15]. The model assumes that each transferring person has an equal probability of being contracted by others regardless of social practice and rule in place. The model considers the infected people residing in quarantine as similar as those who are not in quarantine. Hence, both are assumed to have equivalent transmission rates β which might not be true in real cases. Finally, the assumption that β stays constant all the way through the duration of pandemic also might not be veritable. The SIR model is weak to translate the gestation period when an affected person is moving ubiquitously without showing any symptoms whatsoever.

SEIR model is the most widely used epidemic model derived from the SIR model. In addition to SIR, the SEIR model introduced an intermediate compartment E for exposed population for the incubation period during which susceptible individuals have been infected but are not yet infectious themselves, and σ is the incubation rate at which the dormant individuals become infectious. The SEIRS model is used to allow recovered individuals to return to a susceptible state at a rate ξ at which they return to the susceptible status due to the loss of immunity [16].

Whether a person is susceptible to a specific pathogen partly depends on the person’s immune response. The person could have natural active immunity or might have developed antibodies through a non-deliberate contact with the pathogen. Natural passive immunity that comes from the mother is also possible given the length of the COVID-19 epidemic [18]. Furthermore, overcoming the population’s natural immunity, the type of strain is another numerator for the susceptible. Body mass index (BMI) is said to have an influence on developing or aggravating the disease, but the studies are complicated by the nutrition factors that should oppose the effect of BMI, and we chose not to consider this in our model [19].

After getting infected (I in SIR model), an uninfected person contributes to the transmission rate in a population N, where N should be a population in proximity and have means to come in close contact, either active or passive and not just a mass in a location. By passive contact, we mean places and artifacts shared by multiple persons at different times within an effective interval: this is how long the SARS-CoV-2 can survive given surface and aerosol stability [20]. R, the removal by recovery or death, appears to have little significance since the necessary transmission is already made within the I compartment, and N has to be observed in a real social context. In our computation, we have considered the median incubation period 5.2 days when the infected I infects others [21]. In the SEIR model, instead of susceptible (N), exposed individual (E) bears greater significance in calculating the rate of infection. Analyzing all conceivable notions and dynamics, the basic reproductive quantity has been fragmented into several practical factors described in kinetics-modeling-fitting.

A popular way of risk assessment was proposed by statistician Fergus Simpson who attempted to estimate the risk of being captured by the virus in terms of the number of dice [22]. According to him, in the first week when a virus initiates to spread across a country, one person out of ten million catches the virus each day, equivalent to a nine-dice risk for being infected by the virus and which is very low. But, if the virus can spread unrestrictedly, the accompanying risk will increase fast. By week 2, the risk increases approximately six times compared to week 1, equivalent to an eight-dice risk. In the third week, it parallels seven-dice risk, equals six-dice risk by week 4, and so on. But he also cautioned that his formula might not work where the checkout process of positive cases being performed each day is very low with respect to the population size. Hence, his approach has not been taken into consideration in this study. Although it was anticipated by several researchers initially that the density of the population had a positive correlation with the spread of COVID-19 [23], some recent studies show that the assumption might not be completely evident [17, 24]. Health expenditure by the government was identified as a significant determinant of deaths caused from COVID-19 [25, 26], whereas topographical locations along with their climatic circumstances were detected as prominent factors for the diffusion of the virus [27]. Although the temperature has been observed to be a relevant factor of COVID-19 transmission according to the results of some studies [28], there are some counter-arguments as well [29, 30]. Males, as well as older people, were observed to be exceedingly vulnerable to COVID-19 [31, 32]. Even though a few studies have been conducted on the overall COVID-19 situation along with its various aspects regarding Bangladesh so far [33], none of them has unambiguously focused on the projection of it. In their study, Azad and Hussain (2020) have attempted to fit the available data to date in Bangladesh applying several conventional models like the exponential model, Richards model, logistic model, compartmental model, and Gompertz model, but these traditional models do not take crucial parameters associated with the actions and behaviors [34].

LTSM Network

LSTM: Among the state-of-the-art deep learning methods, recurrent neural network (RNN) has been proven to be the most robust for prediction as it can automatically excerpt the necessary features from the training samples, delivering the activation from the previous time step as the load for the present time step and the network self-connections. Long short-term memory (LSTM) is one of the most powerful and well-known RNN which is skilled to learn order dependence in sequence prediction state. We have tried out many other models along with the LSTM model, e.g., adaptive neuro fuzzy inference system (ANFIS) and multilayer perceptron (MLP) models provide some results but not as good as LSTM models. From Table 1, we can clearly understand that the ANFIS model is not so accurate compared to the LSTM model.

Table 1 MLP training results

On the other hand, Table 2 is for the result of the MLP model.

Table 2 ANFIS training results

Although the error rate is lower in MLP model, the resulting correlation coefficients indicate negative association among the parameters. We have also checked with the STGCN model that works better in link prediction on adjacent matrix, but it has not fitted our dataset. Due to inadequate accuracy of both the MLP and ANFIS models, we have finally decided to apply the LSTM model for making prediction based on our dataset as shown in Table 5.

Methodology

In conventional compartmental models in epidemiology like SIR, SEIR initiates with the size and density of the population [35]. But after the outbreak of COVID-19, questions are being raised by the epidemiologists and the public health experts whether the population size as well as its density along with urbanization matter or not. The population size of the USA is 5.5 times more than Italy. If the population size underwrites the infected numbers, then nearly 5.5 times more infected cases would be expected in the USA in comparison to Italy. Likewise, Sweden has experienced almost a similar death rate as Ireland despite of having lesser population density. Similarly, Spain has undergone about an identical mortality rate as Italy although the population density of Italy is more than twice that of Spain. Moreover, urban cities like Shanghai, Seoul, and Singapore with enormous population densities have shown better performance in combatting the COVID-19 situation than many other cities which have comparatively a low population density. In this study, three different measures of correlation coefficients, namely, Pearson’s product-moment correlation coefficient, Kendall’s correlation coefficient, and Spearman’s rank correlation coefficient, are used to evaluate the strength of the bivariate relationship between the confirmed COVID-19 positive cases as well as deaths, population density along with size, and urbanization. Results are displayed graphically in Fig. 1, indicating weak correlations between the variables pairwise.

Fig. 1
figure 1

Correlation coefficients between the selected variables

To discard the aforementioned infeasibility of population size and density as well as difficulties in estimating the transmission rate kinetics in SIR and SEIR models, a probabilistic approach has been adopted in this study where government control, people’s acquiescence to the norms and rules of COVID-19, test positivity, and infection transmission frequency have been encompassed. Updated values from statistical survey and analysis (until 03 Aug 2020) of all four parameters can be found on our Web portalFootnote 1 with downloadable daily data sheet at Dashboard bar. The secondary data used in this study have been extracted from the WHO (url: http://www.who.int) and the Institute of Epidemiology, Disease Control and Research (IEDCR) (url: https://iedcr.gov.bd), while the primary data regarding people’s compliance along with government’s control have been collected from the respondents through a sample survey. A separate survey has been utilized to collect the information about the number of people expected to come in contact with an infected person if s/he moves outside home for whatever reason. Finally, a modified exponential regression model has been developed to fit the observed data encompassing the probability of citizen’s compliance accompanied by the government’s control, test positivity, and infection transmission frequency and to prove the dynamicity of all the probabilistic parameters. LSTM is used to train a neural network making a comparison with the cumulative positive COVID-19 cases of Bangladesh.

Kinetics-Modeling-Fitting

Identification of the relevant causes accompanied by interventions is the must to limit the spread of an epidemic. Epidemiological models work as a guide to plan regarding the outbreak of an epidemic. In the case of extremely infectious COVID-19, most of the affected nations have planned social distancing along with self-isolation measures after vigilant observation of the kinetics of the virus’s evolution. With the multifaceted socio- and pathological dynamics of COVID-19, it is hard to predict when and how it grows and ends [36]. To date, different groups of scientists have came up with many different models with wide deviations [37]. Selecting too many parameters would increase the risk of non-sampling error due to imprecise and inadequate data, e.g., to estimate the number of people commuting en masse at a given time and location with acceptable accuracy in a country like Bangladesh could be very difficult. As mentioned earlier, this issue has been encountered through a sample survey covering the people aged between 21 and 50 which is the highest COVID-19 affected age group in Bangladesh. The survey also covers the working people of both sexes who have to go out for their job. The principal focus of this study has been on pandemic predictability by factors that influence the outcome of an exponential model by fitting four probabilities (4P) which can be adjusted over time when a new observation comes in. State’s control and citizen’s compliance:

P(G) = Probability of the state’s control over the situation.

This encompasses:

  1. 1.

    State making the necessary laws to contain the pandemic.

  2. 2.

    State enforces the law in strict order.

  3. 3.

    State’s healthcare infrastructure and improvement where necessary.

  4. 4.

    Public awareness program for motivation.

P(P) = Probability of the citizen’s compliance to the imposed laws and health regulations.

This includes:

  1. 1.

    Complying to laws regulating social distancing, such as the use of public transport, social gathering, and shopping.

  2. 2.

    The practice of sanitization.

  3. 3.

    Wearing protective gears when necessary.

  4. 4.

    Perception of the situation.

P(G) has direct control over P(P) whereas P(P) has passive control over P(G), which indicates an asymmetric relationship between them. However, poor scores of both P(G) and P(P) would only degrade the situation. Hence, both must function together in order to have an impact on the overall situation. Therefore, a joint probability of P(G) and P(P), namely P(C), has been designed for the study which is inversely proportional to the propagation rate of a pandemic.

$$\textit{P(C) = P(G)P(P)}$$

Where P(C) is the joint factor of state and its citizens. To calculate the probability of overall control P(C), data collected through sample survey have been utilized as mentioned before. The survey provides data about the public compliance (P), and government control (G) being scaled between 1 and 10 (1 indicates the lowest degree of acquiescence, whereas 10 indicates the highest one) along with information whether the respective respondent is affected with COVID-19 or not. Exactly 1119 respondents have participated in the survey, based on which, P(C) has been estimated as follows:

$$\textit{P(C)=P(G)P(P)} $$

Where P(C) is the joint factor of state and its citizens.

P(P) = weighted probability of public compliance

\(=\frac {\sum \limits _{i=1}^{10} d_{i} p_{i}}{10}\); di = 1,2,3,……,10

Where, pi = unweighted probability of public compliance:

\(=\frac {{{number of respondents on compliance having specific scale value}}}{{{total number of respondents}}}\)

Also,

P(G) = weighted probability of government control

\(=\frac {\sum \limits _{i=1}^{10} d_{i} p_{i}}{10}\); di = 1,2,3,……,10

Where, pi = unweighted probability of government control:

\(=\frac {\textit {number of respondents on control having specific scale value}}{\textit {total number of respondents}}\)

Applying the aforementioned formulas, we have estimated P(P) = 0.324664879 and P(G) = 0.295084897 from the survey data of this study. Finally, we have estimated

$$ \textit{P(C) = P(P)P(G) =} 0.095803703. $$

Pathogen’s Reproduction Number and Contagiousness

The reproduction number, R0, is a complicated factor to compute that indicates how contagious an infectious disease is [14, 38]. It states how many individuals can get contracted by a single infected person with a previously unknown disease. There are three known variants of SARS-CoV-2 (A, B, C) and it is changing the nucleotide sequence of the genome, and we do not know which variant is prevailing in the South Asian region [39]. Nine new mutations have been reported in Bangladesh according to an unpublished report [40]. Also, it is not determined yet why some people who are more affected are showing serious symptoms, while others remain immune or asymptomatic. It also remains undiscovered why and how this has formed serious havoc in some parts of the world whereas the continent like Africa is only mildly affected. So, it is appallingly difficult to determine the contagiousness here in Bangladesh. Since it depends on the type of pathogen and how people are interacting with each other in certain social settings, we have considered this part in P(C). Many scientists have hypothesized the possibility of herd immunity [41] or active natural immunity (ANI), which is very hard to quantify as a factor of R0. By far, since the virus is new to our understanding and no vaccine (natural artificial immunity) has yet been developed, we have no idea how an individual’s immune would respond at first exposure [42]. On the other hand, the probability of a person catching infection once being exposed, designated as P(I), is an essential factor to understand the velocity and magnitude of the contagion.

P(I) = Probability of a person getting infected after being exposed.

Where, P(I) is primarily a function of:

  1. 1.

    Strength of pathogen type, SV: Value not known

  2. 2.

    Human natural immunity, IN: Value not known

  3. 3.

    Viral load, LV: Value not known

VL is the threshold of viral load to which a target is exposed to be infected. Viral load is a measure of virus particles, also called infectious dose, that is the amount of viruses needed to establish an infection. For influenza viruses, people need to be exposed to as little as 10 virus particles to get infected, while as many as thousands for other human viruses. W. David Hardy mentioned that “The virus is spread through very casual interpersonal contact [43].” We did not find an estimation of how many virus particles of SARS-CoV-2 are needed to trigger the infection, but COVID-19 is clearly very contagious, probably because few particles are needed for causing the infection leading to a low infectious dose or viral load [44]. Since the values of the above components are yet to be reliably established, we have passively determined the P(I) through statistical studies on various data, which is proportional to the growth of a pandemic and varies from region to region.

Thus, we can say:

P(I) is the configuration of the strength of pathogen type, human natural immunity, and viral load and we have tried to determine P(I) by the rate of propagation in a sample population who are possibly being exposed and part of them got infected.

The probability of infection P(I) has been estimated through:

\(P(I)=\frac {f_{i+IP}}{N \varphi _{i}}; i=1,2,3, {\ldots } \ldots \)

Where, N = number of people expected to come in contact to an infected person:

$$ \varphi_{i}={\sum}_{i^{th}\text {day}}^{(i+IP-1)^{th}\text {day}} \text{+'ve cases}$$
$$ f_{i+IP}=\text{+'ve cases of} (i+IP)^{th} \text{day}$$

In this study, the value of N is estimated to be 10 from a different survey conducted on 359 respondents who have gone outside the home for some/any reason with the risk of being infected by an infected person and IP is the incubation period.

Test Positivity

Test positivity is the ratio between the number of daily positive cases and tests done. A high percentage of the population being tested positive may assume that the right people are being tested and more tests would accumulate more positive results. Though the association between the number of tests and positive cases is not linear, it helps to contain the disease at an early stage [45]. This proportion will be headed down from either upsurge in the number of tests increases or drop in contagiousness. Two types of testing approach have been observed around the world depending on the state’s capacity and policy: (1) reactive tests where only people with acute symptoms are being tested and (2) proactive tests where subjects are being tested at random. Countries that have gone through reactive testing obtained high test positivity by testing only people with acute symptoms while less symptomatic or asymptomatic cases are free to roam contracting others upon contact. Conversely, countries with proactive testing, where people are being tested in every suspected scenario at random got a lower test positivity. The results in Tables 3 and 4 show how the containment of the pandemic is proportional to the test positivity. Table 4 is exhibiting an increase over total cases for 29 days. Table entries in bold indicate high Test Positivity resulting in higher number of cases. Besides, it also shows that the accomplishing situations in some countries have strong correlations with the percentage of test positivity. The daily trend can be found on various authentic data sources like in [46, 47].

Table 3 List of countries with high to low test positivity
Table 4 List of countries with high to low test positivity

Here are the following tables. Footnote 2

The World Health Organization (WHO) reported incubation period (time from exposure to the development of symptoms) for COVID-19 between 2 and 10 days [46]. The mean incubation period was 5.2 days (95% confidence interval) [21]. In our model, test positivity P(T) acts as the probability of getting positive cases out of the total number of tested cases and it is regularly updated with the moving average test positivity of incubation period (IP).

P(T) = Moving average of the probability of getting positive cases out of total samples being tested (Fig. 2).

Fig. 2
figure 2

Test positivity, moving average from April 1 to July 12, 2020

Without running into a complex set of incomprehensible parameters, we have tried to keep the model simple with a core equation apprehended by 4Ps that can be learned over time reflecting a closer prediction of reality.

Finally, the regression model fitted with the data becomes:

$$ I(t)=\alpha \Big(1-e^{-\mu \frac{P(I) P(T)}{P(g)P(p)}t^{n}} \Big)$$

here α, μ, and n are constants.

LSTM

LSTM network is an enhanced version of RNN. It has memory blocks instead of neurons that are connected through layers. Each block has its own components which make it smarter than classical neurons. It also has memory to store the recent sequences. There are some gates in the block that actually control the block’s current state and the output. Block uses the sigmoid activation function to check that the gates are triggered or not (Fig. 3).

Fig. 3
figure 3

The structure of LSTM unit [51]

This equation basically selects information which can be passed to the cell. The data from the input side of previous memory which is to be ignored is decided by the forget gate by the following equation:

$$ f_{t}=\sigma(W_{i} * [h_{t-1},x_{t}] + b_{i}) $$
(1)

The control gate controls the update of the cell by the following formula:

$$ \Tilde{C} = tanh(W_{c} * [h_{t-1},x_{t}] + b_{c}) $$
$$ C_{t}=f_{t} * C_{t-1} + i_{t} * \Tilde{C}_{t} $$
(2)

The output layer updates both hidden layer ht− 1 and output and is given by:

$$ o_{t}=\sigma(W_{o} * [h_(t-1,x_{t}] + b_{o}) $$
$$ h_{t}=o_{t} * tanh(C_{t}) $$
(3)

tanh is used to normalize the values into range − 1 to 1. The weight matrices are W and the activation function is σ which is taken as the sigmoid.

Results and Discussions

Scientists are working on different facts of COVID-19 in Bangladesh as well as other countries of the world to analyze and forecast the propagation using the conventional SIR and SEIR models initializing some implicit compartmental rate constants [48, 49]. Hazardous civic interaction plays the role of diffusing transmissible viral particles which is possible to control by being acquiescent with the health norms and law and order of the government. Test positivity is the vibrant contagion trend indicator which indicates how obliviously the suspected are being infected. But these vital factors have been ignored by most pandemic models. In this proposed model, these key factors have not only been addressed but have also been integrated with their implication in fitting the propagation trail. Those who worked with traditional compartmental models considering the whole population as susceptible and density as a multiplier of the rate constants overlooked the correlation between the factors and the eruptions. Some studies showed the correlation between the degree of contagion and body mass index (BMI), but they ignored the nutrition facts. Evidently in this study, the population size and density are ignored due to the insignificant correlation aspect. The reproduction number and implicit compartmental rate constants which are certainly difficult to measure due to the insufficiency of reliable data in Bangladesh are circumvented. We have explicitly addressed the impact of citizen’s awareness about the hazard of COVID-19, norms and rules of health concern, government laws, obligations, and citizen’s compliance with them. Moreover, we have also considered the test positivity and the probability of being infected.

Figure 4 shows how precisely the model is superimposing with real values. The growth rates of new positive cases, both estimated from model and real cases, are shown in Fig. 5. Though the growth rate is declining, its sluggish gradient indicates the lengthier presence of coronavirus here in Bangladesh.

Fig. 4
figure 4

Projection coherent with real positive cases (April 1 to June 29, 2020)

Fig. 5
figure 5

Growth rate of positive cases

The scenario may change upon how cognizant the citizens are about this contagious microbe. As mentioned earlier, the joint probability P(C) of state’s control and citizen’s compliance regulates the propagation rate. The propagation has been estimated in ± 5% controlled compliance situation. Figure 6 shows how only 5% decrease in P(C) and 5% increase in P(T) force the growth curve upward. Similarly, Fig. 7 shows exactly the opposite: 5% increase in P(C) and 5% decrease in P(T) from the estimated values. We have kept P(I) unchanged in both cases.

Fig. 6
figure 6

5% decrease in P(C) and 5% increase in P(T) from the estimated values (April 1 to June 29, 2020)

Fig. 7
figure 7

5% increase in P(C) and 5% decrease in P(T) from the estimated values (April 1 to June 29, 2020)

The daily positive cases are estimated from the projected propagation by the first derivative and considered an average of the incubation period. Figure 8 shows the comprehensive scenario of daily positive cases with the estimated one. An estimated mean absolute prediction error (MAPE) of 0.20 indicates reasonable prediction by our 4P model.

Fig. 8
figure 8

Daily new positive cases, reported vs. projected

Figure 9 shows the estimated flattening of the curve on last calculated P(C), P(T), and P(I) which is subject to change. On June 26, the USA observed an all-time surge in new coronavirus cases which were about to get flattened during the first quarter of June following some premature reopening, public unrest, civil disobedience, etc., that evidently lowered the country’s P(C) score. Reports on critical events are available on the JHU Timeline of COVID-19 [50]. Although improving, the test positivity in the USA is still above 12% on average during this writing. In Bangladesh, a similar surge could be expected during the Islamic festival Eid-Ul-Adha at the end of July. However, in every possible scenario, we continue conducting our study to get the values of the determinants and update the projection accordingly. Since there will no periodic version of the same paper, we have deployed our model available at http://covid19.aiub.edu.

Fig. 9
figure 9

Propagation projection with reported cases until July 12, 2020

The LSTM method is used to forecast the COVID-19 cases in Bangladesh. From Table 5, we can see that there are three results with its different parameters where the prediction accuracy is quite high. Numerous settings have been tested but not all of them have provided satisfactory results. Reasonable results have only been provided by the number 3 settings. From Fig. 10, it is also noticeable that the accuracy of number 3 result is good. In that, the 1st layer unit is 15, 20, 25, and 30 with the drop out 0.1, 0.1, 0.25, and 0.2, respectively.

Table 5 LSTM training results
Fig. 10
figure 10

Plot diagrams for the prediction of cumulative positive cases

Conclusion

Nature plays its role on mankind as history tells us and human wisdom and action that ultimately determine the outcome. In this study, the emphasis has been given more on human actions rather than on the SARS-CoV-2 as well as a human as a host for which very little is known so far. Empirical determination of its association with the probability of an exposed person to get infected omitting ambiguous factors has been accomplished. For successful containment of the epidemic, effective social distancing, hygiene, large-scale testing, and isolation are recommended to ensure at the earliest possible time. The 4P model provides a strong premise in decision making by demonstrating the causality of the epidemic over which the state alongside its citizens has control and the machine learning outcome concretely concludes that here four probabilistic parameters are not just some fitting parameters, rather it is sufficient to train a machine learning model. The roles of the four parameters are globally acknowledged and can be used for other countries as well given the parameter values are determined in their own context. In our experiments, the CDR (case, death, recovery)-only data did not show any promising result; hence, we suggest considering the dynamics we mentioned in their research settings.