Introduction

The severe acute respiratory syndrome coronavirus (SARS-CoV-2), called COVID-19, is an infectious disease, which was initially reported in Wuhan, China in December 2019 (Rendana 2020). The signs of COVID-19 were high fever, sore throat, fatigue, shortness of breath, dry cough, and pneumonia (Rendana 2020). The COVID-19 was also accountable for kidney failure, pneumonia, acute respiratory syndrome, and death in severe cases (Raza et al. 2020).

The World Health Organization (WHO) declared a public health emergency due to the COVID-19 outbreak. The continuous rise in the COVID-19 was reported all over the world (Rendana 2020). The COVID-19 was accountable for 39,196,259 cases and 1,101,298 deaths in more than 200 countries (WHO 2020). Due to the COVID-19, the governments tried to stop the spread through different measures like social distancing, lockdowns, ban on public events or gatherings, closure of academic institutions, closures of transport, and closure of non-essential business (Siciliano et al. 2020). Considering the extra-human factors, environmental factors could influence the COVID-19 transmission, which makes it necessary to investigate the drivers of the COVID-19 transmission (Rosario et al 2020).

The literature stated that the COVID-19 was spread from one to another person through respiratory droplets and physical contact (Huang et al. 2020; Shereen et al. 2020). The COVID-19 virus was found in aerosols up to 3 h (post-aerosolization), 4 h (copper), 24 h to 37 h (cardboard), and 2–3 days (stainless and plastic steel) (Rendana 2020). Due to this, Chan et al. (2020) reported that environmental factor like wind speed was a reliable indicator to investigate the viability, spread, and transmission of COVID-19. The variations in climatic indicators were associated with the transmission of infectious diseases (Stott 2016). Many infectious diseases were influenced by environmental indicators such as the SARS outbreak had linked with environmental indicators (Raza et al. 2020). Xu et al. (2020) also revealed the association of meteorological indicators and air quality with the transmission of SARS and MERS.

The polluted air was responsible for about 4.3 million deaths in the world. The concentration of particulate matter (PM) was associated with the increase in cardiovascular diseases, diabetes, asthma, and cancers. Air pollution was linked with influenza, pneumonia and acute lower respiratory infections, and SARS. The PM is a basic component of air pollution and can remain in the air for hours. The airborne PM2.5 has been linked with the transmission of influenza and respiratory infection (Xu et al. 2020).

In Pakistan, the COVID-19 was initially reported on 26th February 2020 (Syed and Sibgatullah 2020). In Pakistan, the COVID-19 was accountable for 323,019 cases and 6654 deaths, as of October 18, 2020 (Government of Pakistan 2020). According to the WHO, the government of Pakistan should take necessary precautionary steps to stop COVID-19 (Khan et al. 2020). The government of Pakistan directed the general public to stay in homes, wear a mask, hand washing, avoid face touching, avoid public places, and maintain social distance to stop the COVID-19 transmission (Raza et al. 2020). The statistics of COVID-19 are still serious in Pakistan (Fig. 1).

Fig. 1
figure 1

National and provincial COVID-19 statistics (as of October 19, 2020)

Cai et al. (2007) investigated the effect of meteorological variables (wind speed, sunshine, temperature, and humidity) and air pollution on the SARS virus. The atmospheric pollution, like particulate matter (PM), was responsible for serious diseases in human beings (Toppi et al. 2020). Andree (2020) reported that PM2.5 is a factor behind COVID-19 transmission in the Netherlands. The expected increase in the COVID-19 was 100% for 20% increase in the concentrations of pollution. In Pakistan, air pollution is accountable for 22% of premature deaths. Different cities in Pakistan were included in the list of polluted cities in the world (IQAir 2019). Bashir et al. (2020) examined the association of climate indicators (wind speed, humidity, temperature, rainfall, and air quality) with COVID-19 in United States. Correlation analysis confirmed the linkages between COVID-19 and climate indicators.

The present study aims to examine the impact of wind speed and air pollution (PM2.5) on COID-19 cases in Pakistan, its provinces, and the Islamabad Capital Territory. The contribution of this study in the literature is (a) it first time examines the association of wind speed and PM2.5 with the COVID-19 in Pakistan, (b) it first time examines the association of wind speed and PM2.5 with the COVID-19 in Punjab, Sind, Khyber Pakhtunkhwa (KPK), and Islamabad Capital Territory (ICT), and (c) it used generalized Additive Model to construct dose–response curve between environmental variables (wind speed and PM2.5) and COVID-19 cases, (d) it used generalized Poisson regression technique to empirically find the effect of wind speed and PM2.5 on COVID-19 cases.

Pakistan was ranked in the 23rd position according to the total COVID-19 cases. Pakistan ranked at 6th and 36th position with respect to the population and land area, respectively. Pakistan has 2.08 billion population, showing 2.4% annual population growth rate. The economy of Pakistan was primarily based on the agriculture sector and has 1629 USD per capita annual income (Hussain et al. 2018). Pakistan is located between latitudes 24° N–37° N and longitudes 60° E–75° E. The climate of Pakistan is generally arid to semiarid, having both monsoon and western system rainfall (Raza et al. 2020).

This study used data on daily COVID-19 statistics from March 10, 2020, to October 04, 2020, in the case of Pakistan, its provinces (Punjab, Sindh, KPK), and ICT. The COVID-19 related data were obtained from the Government of Pakistan (Government of Pakistan 2020), and environmental data like wind speed (km/h) and particulate matter (PM2.5) were obtained from the Air Quality Open Data Platform (Air Quality Open Data Platform 2020). This research investigated the link between environmental variables and the infection rate of COVID-19, which was estimated as (Raza et al. 2020):

$$ {\text{Infection}}\;{\text{Rate}} = \frac{{{\text{Number}}\;{\text{of}}\;{\text{COVID}} - 19\;{\text{cases }}}}{{{\text{Days}}\;{\text{of}}\;{\text{COVID}} - 19\;{\text{Infection}}\;\left( {{\text{Outbreak}}} \right)}}. $$
(1)

Different software (R software, STATA 15, and Eviews 10) were used for the empirical analysis.

Materials and methods

Correlation analysis

The degree of correlation among environmental variables and COVID-19 was investigated using both Spearman’s and Kendall’s rank correlation tests. These tests were nonparametric tests, computed on ranks (Hamadeh et al. 2017; Raza et al. 2020). Spearman's rank correlation analysis used a monotonic function to find the correlation between two variables. The mathematical expression of Spearman's rank correlation coefficient (Raza et al. 2020) is expressed as:

$$ r_{s} = 1 - 6\frac{{\sum d_{i}^{2} }}{{n\left( {n^{2} - 1} \right)}} $$
(2)

where n is the number of alternatives, and \(d_{i}\) shows the difference between the ranks of two parameters (Raza et al. 2020). This study also used Kendall’s rank correlation coefficient (τ) (Kocsis and Anda 2018), which was expressed as (Raza et al. 2020):

$$ \tau = \frac{S}{D} $$
(3)

where S is the trend and D is the possible data pairs by using n datasets (Raza et al. 2020):

$$ D = \left( {\begin{array}{*{20}c} n \\ 2 \\ \end{array} } \right). $$
(4)

Dose–response curve

A generalized linear model was used to find the association of various environmental indicators with the COVID-19. The generalized additive model (GAM) shows a relation between the predictor variables (Xi) and the univariate response variable (Y) (Raza et al. 2020). However, the exponential family of distribution for Y was normal, binomial, and Poisson (Goswami et al. 2020). The GAM was used to fit equations and splines to investigate the dose–response impact between environmental indicators and the COVID-19. The GAM was applied to explore the link for environmental indicators and health outcomes (Raza et al. 2020). This study used an additive and semi-parametric (Raza et al. 2020) polynomial model, as:

$$ \log y_{t} = \beta_{0} + \beta_{1} x_{t}^{3} - \beta_{2} x_{t}^{2} + \beta_{3} x_{t} + s\left( {{\text{WS}}_{t} } \right) + s\left( {{\text{PM}}_{t} } \right) + \varepsilon_{t} $$
(5)

where y represents the COVID-19 infection on day t, β0 is the intercept, s(∙) is a spline function, having two degrees of freedom to avoid over-fitting, β is the parameter of x, and x denotes the outbreak days, WS shows the wind speed (km/h), and PM shows the concentration of Particulate Matter in the atmosphere. The expression s(∙) was used to control the confounding of explanatory variables because a smooth spline function can control the impact of confounding factors (Raza et al. 2020). However, Raza et al. (2020) reported the advantage of the linear function between environmental indicators and COVID-19, expressed as:

$$ \log y_{t} = \beta_{0} + \beta_{1} x_{t}^{3} - \beta_{2} x_{t}^{2} + \beta_{3} x_{t} + \beta_{4} {\text{WS}}_{t} + \beta_{5} {\text{PM}}_{t} + \varepsilon_{t} . $$
(6)

A dose–response curve linked multiple dose-variables (x) with the probability of response-variable (infection) (Brouwer et al. 2017; Raza et al. 2020). In other words, the dose–response curve revealed the expectation of potential loss due to various environmental variables (Raza et al. 2020; Ye et al. 2020). This study applied the GAM to draw a dose–response curve between dose variables (outbreak days, wind speed, and PM2.5) and response variables (COVID-19 infection).

Regression analysis

The generalized Poisson regression is the generalization of standard Poisson regression (Maxwell et al. 2018; Raza et al. 2020), used to find the link between environmental indicators and COVID-19. The generalized Poisson regression used positive as well as negative correlations between the variables (Consul and Famoye 1992; Famoye 2015; Raza et al. 2020). The Poisson model was used to find the link between environmental indicators and total cases (Li et al. 2019).

It is a widely used procedure based on classical assumptions. The Poisson distribution was assumed for the dependent variable, having values like 0, 1, 2, 3,… n. It explained the expected value \(y_{i}\) by the explanatory variables \(x_{i}\) (Cupal, Deev and Linnertova 2015; Raza et al. 2020) as:

$$ E\left\{ {y_{i} |x_{i} } \right\} = \exp \left\{ {x_{i}^{T} \beta } \right\}. $$
(7)

The Poisson distribution was assumed for the count variable \(y_{i}\) for given \(x_{i}\) in the count data models (Raza et al. 2020).

$$ P\left\{ {y_{i} = y|x_{i} } \right\} = \exp \left\{ { - \lambda_{i} } \right\}\lambda_{i}^{y} /y!,\quad y = 0,1,2,3, \ldots $$
(8)

where \(y!\) is y factorial. A suitable functional form for \(\lambda_{i}\) gives expressions for the probabilities, used to build the log-likelihood function for the model, called a Poisson regression model (Raza et al. 2020). The generalized Poisson regression was better for over-dispersion \(\left( {{\text{Var}}\left( {y_{i} } \right) > E\left( {y_{i} } \right)} \right)\) and under-dispersion \({\text{Var}}\left( {y_{i} } \right) < E\left( {y_{i} } \right)\). According to this, the probability density function of \(y_{i}\) (Famoye 1993; Maxwell et al. 2018) was expressed as:

$$ f_{i} \left( {y_{i} ,\mu_{i} ,\alpha } \right) = \left( {\frac{{\mu_{i} }}{{1 + \alpha \mu_{i} }}} \right)\frac{{\left( {1 + \alpha y_{i} } \right)^{{y_{i} - 1}} }}{{y_{i} !}}\exp \left[ {\frac{{\mu_{i} \left( {1 + \alpha y_{i} } \right)}}{{1 + \alpha \mu_{i} }}} \right] $$
(9)
$$ y_{i} = 0,1,2, \ldots \;{\text{ and}}\;\mu_{i} = \mu_{i} \left( {x_{i} } \right) = \exp \left( {x_{i} \beta } \right) $$

where \(x_{i}\) and \(\beta\) were (k − 1) dimensional vector of covariates and regression parameters, respectively. The mean and variance were expressed as (Raza et al. 2020):

$$ {\text{Mean}}:E\left( {Y_{i} |x + i} \right) = \mu_{i} $$
$$ {\text{Variance}}:V\left( {Y_{i} |x_{i} } \right) = \mu_{i} \left( {1 + \alpha \mu_{i} } \right)^{2} . $$

Results and discussion

Descriptive analysis of COVID-19 and environmental indicators

Table 1 shows the descriptive statistic of environmental indicators and COVID-19 cases at the national and provincial levels in Pakistan. The average wind speed was 3.285 km/h in Pakistan. At the provincial level, the wind speed was more in Sindh (4.499 km/h) followed by KPK (3.230 km/h), Punjab (3.062 km/h), and ICT (2.350 km/h). The air pollution, amount of particulate matter (PM2.5) in the atmosphere, which was 95.435 in Pakistan. However, the PM2.5 was more in Punjab (116.895) followed by KPK (107.529), ICT (80.574), and Sindh (76.742). The average daily COVID-19 cases were 1507-957 in Pakistan, 477.756 per day in the Punjab, 661.206 per day in the Sindh, 182.182 per day in the KPK, and 80.321 in the ICT.

Table 1 Descriptive analysis of variables

Figures 2, 3 and 4 show the trend of environmental indicators (wind speed and PM2.5) and COVID-19 in Pakistan from March 10, 2020, to October 04, 2020. Figure 2 shows that COVID-19 infections were continuously increased in Pakistan. On average, the COVID-19 cases were 1507.957 per day, showing a maximum value of 6825 cases on June 13, 2020. After that, the curve showed a decrease in COVID-19 cases. However, the recent situation was not satisfactory in Pakistan due to the confirmation of 600–700 COVID-19 cases per day.

Fig. 2
figure 2

Daily COVID-19 statistics in Pakistan

Fig. 3
figure 3

The trend of wind Speed in Pakistan

Fig. 4
figure 4

The trend of air pollution (PM2.5) in Pakistan

Figure 3 shows that the wind speed increased between March 10, 2020, and July 21, 2020. After that, there was some reduction in wind speed. However, wind speed showed fluctuations in Pakistan from March 10, 2020, to October 04, 2020. During this period, the average wind speed was 3.285 km/h, showing a minimum (1.85 km/h) and maximum (5.225 km/h). Figure 4 shows the trend of air pollution in Pakistan, which was measured using the amount of PM2.5. On average, the mean of PM2.5 was 95.135 in Pakistan. The amount of PM2.5 showed fluctuation in Pakistan at both national and provincial levels. However, it was comparatively higher at the start of the COVID-19 in Pakistan. It was lowest on August 28, 2020. After that, the trend of PM2.5 was increasing in Pakistan.

Correlation analysis

The link between environmental variables and COVID-19 was assessed by using Spearman's rank and Kendall’s rank correlation tests. The correlation analysis explored the association of wind speed and PM2.5 with COVID-19 cases (Table 2) and the COVID-19 infection rate (Table 3). In general, the results of both tests showed a similar conclusion. The results showed that wind speed has a positive and significant correlation coefficient in Pakistan and its provinces (Punjab, Sindh, and KPK). It explains the rise in COVID-19 due to the rise in wind speed. Literature showed a positive correlation between wind speed and COVID-19 in Beijing, China (Yuan et al. 2006), and India (Gupta et al. 2020). However, the correlation between COVID-19 and wind speed was insignificant in the case of ICT. Contrary to these results, the negative correlation for COVID-19 and wind speed was reported in Indonesia (Rendana 2020) and Turkey (Şahin 2020). It is because the dynamics of the COVID-19 outbreaks were found highly variable in different countries or regions (Dong et al. 2020). A positive correlation was found for PM2.5 and COVID-19 in Pakistan, Punjab province, and ICT. It shows that the rise in PM2.5 concentration was also responsible for the COVID-19 in Pakistan, Punjab province, and ICT. However, a negative and significant correlation exists in the Sindh province. The wind speed was positively and significantly correlated with the COVID-19 infection rate in Pakistan and its provinces (Punjab, Sindh, and KPK). However, the correlation between PM2.5 and the COVID-19 infection rate was only significant in Sindh province (negative), KPK province (positive), and the ICT (positive). However, Zoran et al. (2020) showed a positive link between pollutants and COVID-19 in Italy. Along with these environmental indicators, people's endurance, health facilities, social distancing (Bashir et al. 2020), population density, and medical facilities (Wang et al. 2020) were also possible factors to control the COVID-19 transmission.

Table 2 Correlation coefficients between environmental variables and COVID-19
Table 3 Results of generalized Poisson regression

Dose–response analysis

The GAM was used to construct the dose–response curve for wind speed and COVID-19 (Fig. 5). In Punjab, there was an inverted U-shaped curve between wind speed and COVID-19, which depicts that COVID-19 were initially increased due to an increase in wind speed, but the COVID-19 cases showed reduction when wind speed was more than 4 km/h. Initially, the dose–response curve showed a direct link between wind speed and COVID-19 in Pakistan, Sindh, KPK, and ICT. After that, it becomes flattered or downward sloped in Sindh, KPK, and ICT. According to Zhang et al. (2020), the correlation between wind speed and COVID-19 was positive, negative, and insignificant in the literature.

Fig. 5
figure 5

The dose–response curves for wind speed and COVID-19

Figure 6 shows the dose–response curve for PM2.5 and COVID-19. It showed an inverted U-shaped link between PM2.5 and COVID-19 cases per day in Pakistan, Punjab, and KPK. It explains that the COVID-19 initially increased due to a rise in PM2.5, but COVID-19 cases reduced when the amount of PM2.5 was more than the threshold level. The higher concentration of PM2.5 was also responsible to wear face masks and restricted mobility. The dose–response curve for PM2.5 and COVID-19 was positively sloped in the ICT, which means that the COVID-19 increased due to air pollution. On the other hand, the dose–response curve showed a negative link between PM2.5 and COVID-19 cases in Sindh province, showing a fall in COVID-19 due to a higher concentration of PM2.5 in the atmosphere.

Fig. 6
figure 6

The dose–response curves for air pollution (PM2.5) and COVID-19

Generalized Poisson regression analysis

Table 3 reveals the empirical results of generalized Poisson regression to assess the impact of wind speed and air pollution (PM2.5) on COVID-19 at the national and provincial levels in Pakistan. Yuan et al. (2006) also used a regression procedure to find the influence of wind speed, temperature, and humidity on the SARS outbreak. Chen et al. (2020) reported that environmental indicators (wind speed and air quality) can affect the transmission of infectious diseases. According to Zhang et al. (2020), the correlation between wind speed and COVID-19 was positive, negative, and insignificant in the literature. The virus was found in the air for many hours, which implies that the transmission of the virus was influenced by wind speed (Rendana 2020). By holding the airborne factor of coronaviruses, the wind speed was accountable for the spread of the virus, by accelerating the traveling from one place to another place. Therefore, the use of face masks was suggested to slow the virus spread in different countries such as Czech Republic, Hong Kong, Slovakia, Taiwan, Thailand, and Vietnam (Sarkodie and Owusu 2020). The increase in the expected log count of COVID-19 was 0.113 times for a 1 km/h rise in the wind speed in Pakistan. It means that the wind speed was not favorable to control COVID-19. The same conclusion was reported by Yuan et al. (2006) in Beijing, China. Yuan et al. (2006) reported that the rise in COVID-19 cases was 2.28 units for 1 unit increase in the wind speed in Beijing, China. Moreover, Bolano-Ortiz et al. (2020) also revealed a positive association of wind speed with COVID-19 infection and mortality in the Latin American and Caribbean countries. The regression analysis of provinces and ICT revealed that the rise in expected log count of COVID-19 was 0.074 times in Punjab, 0.042 times in Sindh, and 0.082 times in KPK for a 1 km/h increase in the wind speed. However, there was no significant association between wind speed and COVID1-19 in ICT. However, the impact of PM2.5 on the COVID-19 was not significant in Pakistan. It means that the rise in PM2.5 was not responsible for the COVID-19 in Pakistan. The impact of PM2.5 on the COVID-19 transmission was also insignificant in Punjab, KPK, and ICT. However, the fall in the expected log count of COVID-19 cases was 0.005 times in Sindh. Wu et al. (2020) mentioned that the PM2.5 in the atmosphere was linked with the rise in the COVID-19 deaths in the United States. The PM2.5 showed adverse health impact through inhalation because PM2.5 can be deposited in the lungs, which further increased the probability of cardio-respiratory diseases (Zoran et al. 2020).

Conclusion

The world economy and public health faced huge problems due to the COVID-19 outbreak. This study investigated the impact of wind speed and air pollution (PM2.5) on COVID-19 in Pakistan from March 10, 2020, to October 04, 2020. Wind speed has a positive and significant correlation coefficient in Pakistan and its provinces (Punjab, Sindh, and KPK). The dose–response curve for wind speed and COVID-19 was like an inverted U-shaped curve, which means that COVID-19 initially increased due to wind speed, but the COVID-19 cases showed reduction when wind speed was more than 4 km/h. The dose–response curve initially reflected a positive link between wind speed and COVID-19 infection in Pakistan, Sindh, KPK, and ICT. After that, it becomes flattered or downward sloped in Sindh, KPK, and ICT. The expected log count of COVID-19 infection was increased by 0.113 times (Pakistan), 0.074 times (Punjab), 0.042 times (Sindh), and 0.082 times (KPK) for a 1 km/h increase in the wind speed. The correlation between PM2.5 and COVID-19 was also positive in Pakistan, Punjab, and ICT. However, a negative and significant correlation exists in the Sindh. The dose–response curve for PM2.5 and COVID-19 was inverted U-shaped in Pakistan, Punjab province, and KPK province. It reflects the initial increase in the COVID-19 for an increase in PM2.5, but COVID-19 cases reduced when the amount of PM2.5 was more than the threshold level. The higher concentration of PM2.5 was also responsible to wear face masks and restricted mobility. The dose–response curve for PM2.5 and COVID-19 was positively sloped in ICT, which depicts the rise in COVID-19 infection due to higher air pollution. On the other hand, the dose–response curve showed a negative link between PM2.5 and COVID-19 in Sindh province. However, the regression impact of PM2.5 on the COVID-19 was insignificant in Pakistan. The regression impact of PM2.5 on the COVID-19 transmission was also insignificant in Punjab, KPK, and ICT. However, a fall in the expected log count of COVID-19 was 0.005 times in Sindh for a 1 unit rise in PM2.5. This research has some limitations such as (a) the COVID-19 can be affected by some other indicators like population density, mobility, age of the person, people endurance, and biological properties of virus (b) it is also important to find the impact of some other air pollutants (CO, SO2, NO2, PM 10) on the COVID-19 transmission.