Introduction

COVID-19, which was originated in the area of Wuhan, China, late December of 2019, is believed to be associated with the coronavirus 2 (SARS-CoV-2) (Zhou et al. 2020; Lam et al. 2020). SARS-CoV-2, which is believed to be from a bat origin (Huang et al. 2020; Wu et al. 2020), is a single-stranded positive-sense RNA virus (Satija and Lal 2007) with a nanosize ranging from 60 to 140 nm in diameter (Satija and Lal 2007; Hsiao et al. 2020). Currently, there are over sixty-seven million confirmed COVID-19 cases throughout the globe, and this number is expected to escalate due to the highly contagious nature of the virus.

As a result, early and accurate diagnosis of COVID-19 is of great importance for controlling the spread of the disease and to reduce its mortality. Emerging laboratory and epidemiological data suggest that environmental conditions may affect the current COVID-19 pandemic (Brassey et al. 2020). Consequently, the effect of meteorological parameters on the spread of COVID-19 should be explored to help in the prediction of its development over the coming months, although numerous factors may affect the progression of the COVID-19 pandemic.

Extensive research evolved to investigate the relationship between different metrological parameters and the COVID-19 daily cases (Cucinotta and Vanelli 2020). Pani et al. (2020) inspected the role of hot tropical weather in the transmission of COVID-19 by investigating the correlation between meteorological parameters and the COVID-19 pandemic cases in Singapore using Spearman and Kendall rank correlation. A positive significant correlation with the COVID-19 pandemic was found with temperature, dew point, relative humidity, absolute humidity, and water vapor.

Gupta et al. (2020) collected daily data of new cases of COVID-19 in 50 US states from January 1 to April 9, 2020, and two weather data (temperature and absolute humidity). They validated the results for 10-day intervals against monthly observations and also with worldwide trends. The obtained results were used to predict Indian regions which would be vulnerable to weather-based spread in the upcoming months of 2020. It was noticed that COVID-19 spread in the USA is significant for states characterized by a humidity ratio values in the range from 4 to 6 g/m3. The number of new cases exceeds 10,000, irrespective of the chosen time intervals for the study parameters. These developments are consistent with worldwide observations but do not correlate well with Indian regions so far possibly due to the total cases reported per interval < 10,000. The results explained the relationship between weather parameters and COVID-19 spread.

Iqbal et al. (2020) researched the connection between the fast spread of COVID-19 and regional climate parameters over a global scale. They collected data of COVID-19 cases from January 2020 to 5th of June 2020 in different affected countries and regional climatic parameters data. They found that the countries located in the lower temperature region had a rapid increase in the COVID-19 cases than the countries located in the warmer climatic regions. Also, they concluded that the average daylight hours and average high temperature were correlated to total COVID-19 cases, while average high temperature showed a correlation with total COVID-19 death cases.

Menebo (2020) obtained data of COVID-19 from the Norwegian public health institute and weather data from the Norwegian Meteorological Institute. He analyzed the correlation between weather and COVID-19 in the city of Oslo, Norway. He concluded that maximum temperature, normal temperature (°C), and precipitation level (mm) significantly affect the incidence rate of COVID-19.

Ahmadi et al. (2020) conducted a study to investigate the effect of the number of infected people with COVID-19 based on the population density, intra-provincial movement, and infection days to the end of the study period. The study investigated the average temperature, average precipitation, humidity, wind speed, and average solar radiation effect on COVID-19 spreading in Iran. They used the partial correlation coefficient (PCC) and Sobol-Jansen methods for analyzing the effect and correlation of variables with the COVID-19 spreading rate. Sensitivity analysis showed that the population density and intra-provincial movement have a direct relationship with the infection outbreak. However, areas with low values of wind speed, humidity, and solar radiation were found to be of high rate of infection that supports the virus’ survival.

Based on the above literature review, it is of importance to correlate weather variables (including average daily temperature, maximum temperature, relative humidity, wind speed, pressure, and average daily solar radiation) to COVID-19 active cases. Having achieved this, sensitivity analysis between variables was conducted using Sobol method, and independent variable importance of multilayer perceptron was found.

Methods

Study area

Jordan is located between longitudes 350 40′ and 390 E and between latitudes 290 30′ and 340 N in the northern hemisphere between the zone biome (moist cold winters and dry hot summers) and the arid one (Al-Quraín 2008). The total area of Jordan is 89,342 km2 and according to the General Census of Population and Housing Institute the total population of Jordan exceeds ten million.

Data collection

In this work, the daily data on confirmed COVID-19 cases in Jordan over the period from March 15 to August 31, 2020, was obtained from the Jordanian Ministry of Health, while the weather data including the average daily temperature (°C), maximum ambient temperature (°C), relative humidity (%), wind speed (m/s), atmospheric pressure (kPa), and average daily solar radiation (W/m2) were obtained from Jordan Metrological Department.

Since the announcement made by WHO that COVID-19 is a pandemic, the number of confirmed cases is continuously escalating worldwide (Cucinotta and Vanelli 2020). Officially, the first confirmed COVD-19 case was declared by the Jordanian Health Ministry on March 2. The number of confirmed cases increased to 2739 on September 10, 2020, while the number of recovered cases was 1981, and 688 cases were still under treatment, and the total death numbers was 20 cases. Figure 1 shows the number of confirmed COVID-19 cases over the period from March 15 to August 31, 2020.

Fig. 1
figure 1

Confirmed daily cases of the COVID-19 in Jordan over the period from March 15 to August 31, 2020

Results

The initial wave of COVID-19 transmission

Figure 2 shows the number of confirmed cases during the initial wave of COVID-19 transmission, which was over the period from March 15 to April 30, 2020. As shown in the figure, there exists a general trend that indicates a decrease in the positive cases over the test period.

Fig. 2
figure 2

Confirmed daily cases of the COVID-19 in the initial wave of COVID-19 transmission

Normality test

The normality test results in the initial wave COVID-19 transmission are shown in Table 1. The basic assumption of Pearson’s correlation for the variable (number of daily new cases of COVID-19 in Jordan) was not achieved. The p value from the Kolmogorov-Smirnov test of normality was found to be 0.022, which is less than 0.05. The p value from the Shapiro-Wilk test of normality was found to be 0.00, which is less than 0.01. Based on these results, there was an assumption that the normal distribution of daily new cases of COVID-19 in the initial wave transmission in Jordan was found to be not acceptable; as a result, the Spearman correlation test (non-parametric correlation estimation) was used to examine and determine the relationships between the COVID-19 pandemic in the initial wave transmission and weather variables. A bivariate, two-tailed analysis at α = 95% as a confidence interval was applied.

Table 1 Normality test results

Correlation test

Since normality assumption was not fulfilled, Spearman correlation test was used for data analysis. Table 2 shows Spearman correlation test results. Daily temperature (r = − 0.283; p = 0.054), maximum temperature (r = − 0.245; p = 0.098), relative humidity (r = 0.092; p = 0.540), wind speed (r = 0.041; p = 0.783), and pressure (r = − 0.015; p = 0.920) were not significantly correlated with COVID-19 pandemic positive cases because the p value is more than 0.05. Average daily solar radiation (r = − 0.503; p = 0.000) was negatively strong and significantly correlated with COVID-19 pandemic because the p value is less than 0.05.

Table 2 Spearman correlation test results

Multilayer perceptron (MLP)

Multilayer perceptron (MLP) is a popular feed-forward neural network architecture. MLP consists of three layers, namely, input, output, and hidden layers (Ahmadi et al. 2019). More details about MLP is found in Ahmadi et al. (2019).

Based on trial and error method, MLP technique, with 10 hidden neurons and using 70% of the data for training, 15% testing and 15% for validation, was conducted to obtain the best model with R = 70.5% for the initial wave of COVID-19 transmission as shown in Fig. 3.

Fig. 3
figure 3

Regression plot of predicted values against actual ones for the MLP model in the initial wave of COVID-19 transmission

Table 3 shows independent variable importance, which represents the measuring of how much the network’s model-predicted value changes for different values of the independent variable.

Table 3 Independent variable importance

The results show that daily solar radiation and daily temperature are the most effective variables on the spread rate of COVID-19 virus with wind speed and pressure being the least effect.

Sobol sensitivity analysis

Sensitivity analysis between variables is conducted using Sobol sensitivity method to examine the importance of variables. More details about sensitivity analysis are outlined in Dorosti et al. (2020). It is based on the decomposition of model output variance into summands of variances of the input parameters in increasing dimensionality (Saltelli et al. 1999; Sobol 2001).

Sobol sensitivity analysis is applied in quantifying how much variability in the model output is depending on each of the input parameter or the interactions between parameters. It can be achieved by computing the first-order, second-order, higher-order, and the overall sensitivity indices.

In this research, the Sobol sensitivity analysis is conducted using a global sensitivity analysis toolbox in MATLAB. Figure 4 shows the significance and sensitivity analysis of the Sobol method. The vertical axis indicates the first-order effect of sensitivity. The results show that the most important variable in COVID-19 virus infection rate is daily solar radiation at the initial wave transmission.

Fig. 4
figure 4

First-order effect of Sobol sensitivity analysis during the initial wave of COVID-19 transmission

The second wave of COVID-19 transmission

Figure 5 shows the number of confirmed cases during the second wave of COVID-19 transmission, which was over the period from August 1 to August 31, 2020. As shown in the figure, there exists a general trend that indicates an increase in the positive cases over the test period.

Fig. 5
figure 5

Confirmed daily cases of the COVID-19 during the second wave of COVID-19 transmission

Normality test

The normality test results in the second wave COVID-19 transmission are shown in Table 4. The basic assumption of Pearson’s correlation for the variable (number of daily new cases of COVID-19 in Jordan) was not achieved. The p value from the Kolmogorov-Smirnov test of normality was 0.007, which is less than 0.05. The p value from the Shapiro-Wilk test of normality was found to be 0.001, which is less than 0.01. Based on these results, there was an assumption that the normal distribution of daily new cases of COVID-19 in the second wave COVID-19 transmission in Jordan was found to be not acceptable. As a result, the Spearman correlation test (non-parametric correlation estimation) was used to examine and determine the relationships between the COVID-19 pandemic spread and weather variables during the second wave transmission. A bivariate, two-tailed analysis at α = 95% as a confidence interval was applied.

Table 4 Normality test results

Correlation test

Since normality assumption was not fulfilled, Spearman correlation test was used for data analysis. Table 5 shows Spearman correlation test results. Daily temperature (r = 0.339; p = 0.062) and relative humidity (r = − 0.211; p = 0.255) were not significantly correlated with COVID-19 pandemic spread because the p value is more than 0.05. Maximum temperature (r = 0.394; p = 0.028), wind speed (r = 0.477; p = 0.007), pressure (r = − 0.429; p = 0.016), and average daily solar radiation (r = − 0.757; p = 0.000) were significantly correlated with COVID-19 pandemic spread because the p value is less than 0.05.

Table 5 Spearman correlation test results

It may be noticed that maximum temperature and wind speed were positively moderate correlated. The pressure was negatively moderate correlated, and the average daily temperature was negatively strong correlated.

Multilayer perceptron (MLP)

Based on trial and error method, MLP technique with 10 hidden neurons and using 60% of the data for training, 20% testing, and 20% for validation was used to obtain the best model with R = 83.8% for the second wave of COVID-19 transmission as shown in Fig. 6.

Fig. 6
figure 6

Regression plot of predicted values against actual ones for the MLP model in the second wave of COVID-19 transmission

Table 6 shows independent variable importance. The results show that daily solar radiation and pressure are the most important variables on the COVID-19 virus infection rate, with the least important variables being wind speed and maximum temperature.

Table 6 Independent variable importance

Sobol sensitivity analysis

The Sobol sensitivity analysis in this research was conducted using a global sensitivity analysis toolbox in MATLAB. Figure 7 shows the significance and sensitivity analysis of the Sobol method. The vertical axis indicates the first-order effect of sensitivity. The results show that the most important variable in COVID-19 virus infection rate is daily solar radiation at the second wave transmission.

Fig. 7
figure 7

First-order effect of Sobol sensitivity analysis during the second wave of COVID-19 transmission

Discussion

In this study, weather variables including average daily temperature, maximum temperature, relative humidity, wind speed, pressure, and average daily solar radiation were correlated with COVID-19 active cases in Jordan during the first and second waves. This was achieved by using non-parametric correlation test.

It was found that the most effective weather parameters on the active cases of COVID-19 in the initial wave transmission was the average daily solar radiation (r = − 0.503; p = 0.000), while all other tests for other parameters failed as shown in Table 2.

In the second wave COVID-19 transmission, it was found that the most effective weather parameter on the active cases of COVID-19 was the maximum temperature (r = 0.394; p = 0.028). This was followed by wind speed (r = 0.477; p = 0.007), pressure (r = − 0.429; p = 0.016), and average daily solar radiation (r = − 0.757; p = 0.000) as shown in Table 5.

Independent variable importance of multilayer perceptron to examine the importance of variables was conducted. The best coefficient of determination (R) was 70.5% and 83.8% in the initial and second wave COVID-19 transmission, respectively. During the initial wave of COVID-19 transmission, the results showed that daily solar radiation and daily temperature were the most important variables on the COVID-19 virus infection rate, while the least important variables were wind speed and pressure.

In the second wave of COVID-19 transmission, the results showed that daily solar radiation and pressure were the most important variables in the COVID-19 virus infection rate, and the least important variables were wind speed and maximum temperature.

The Sobol sensitivity analysis was also conducted using a global sensitivity analysis toolbox in MATLAB. The results showed that the most important variable in COVID-19 virus infection rate was daily solar radiation with a value equal to 0.3491 as shown in Fig. 4 at the initial wave transmission. Also, the results showed that the most important variable in COVID-19 virus infection rate was daily solar radiation with a value equal to 0.7452 as shown in Fig. 7 at the initial wave transmission.

Based on the above statistical analysis, correlation test, independent variable importance of Multilayer Perceptron, and Sobol sensitivity analysis, it may be noticed that the most effective weather parameter on the active cases of COVID-19 in the initial and the second waves transmission was the average daily solar radiation. It was found that the average daily solar radiation was negatively strong and significantly correlated, which means that the number of daily confirmed cases in Jordan decreases with the daily solar radiation.

Time complexity of algorithm

In the presented algorithm used, three main approaches related to best model and time complexity were calculated. Xiao (2017) proved that the time complexity of the algorithm is O(m1, m2) for Spearman’s rank correlation, where m1 and m2 are fixed integers. The running time of MLP approach with maximum number of indicators is O(p.n.h.o.i), where h and i are the numbers of hidden layer and iteration, respectively (Serpen and Gao 2014). Using Monte-Carlo sampling technique, time complexity of computing an estimation of E[Eyξu]2 expressions for one Sobol’ index term becomes O(2n) (Kourany et al. 2017).

Conclusion

In this study, weather variables including average daily temperature, maximum temperature, relative humidity, wind speed, pressure, and average daily solar radiation were correlated with COVID-19 active cases in Jordan during the first and second waves. This was achieved by using non-parametric correlation test.

It was found that the most effective weather parameters on the active cases of COVID-19 in the initial wave transmission is the average daily solar radiation (r = − 0.503; p = 0.000), while all other tests for other parameters failed.

In the second wave COVID-19 transmission, it was found that the most effective weather parameter on the active cases of COVID-19 is the maximum temperature (r = 0.394; p = 0.028). This was followed by wind speed (r = 0.477; p = 0.007), pressure (r = − 0.429; p = 0.016), and average daily solar radiation (r = − 0.757; p = 0.000).

In order to examine the importance of variables, independent variable importance of multilayer perceptron was conducted, and global sensitivity analysis by using Sobol analysis approved that the average daily solar radiation plays an important role in the COVID-19 outbreak.