Introduction

The newly discovered coronavirus COVID-19 was first observed in Wuhan City, Hubei Province, China in late December 2019 (Ahmadi et al. 2020; Jahangiri et al. 2020; Lu et al. 2020; Xu et al. 2020; Yongjian et al. 2020). The coronavirus spread rapidly in just a few days out of China’s boundaries and spread to almost all countries of the world (Chen et al. 2020; Jahangiri et al. 2020; Lu et al. 2020; Ogen 2020; Phan et al. 2020; WHO 2020b; Xu et al. 2020). Data from the World Health Organization (WHO) show that from February 19, 2020 to March 22, 2020, 257,182 new coronavirus cases were identified worldwide, and globally confirmed cases reached almost 333 thousand, with 14,509 deaths (WHO 2020a, c). In the same period, throughout Iran’s provinces, 21,638 COVID-19 cases have been confirmed, including 1682 deaths (https://www.behdasht.gov.ir). Symptoms of fever, dry cough, and sore throat have been observed in many infected patients (Huang et al. 2020; Jiang et al. 2020; Ogen 2020; Sohrabi et al. 2020; Yongjian et al. 2020), and in some cases, a severe acute respiratory syndrome was observed which led to death (Chen et al. 2020; Guo et al. 2020; Mehta et al. 2020; Rodriguez-Morales et al. 2020; Sohrabi et al. 2020). Therefore, to determine the role of the most significant factors affecting COVID-19 contagion in urban areas, especially in megacities, multiple studies have been conducted (Bontempi 2020; Chan et al. 2020; Domingo and Rovira 2020; Li et al. 2020; Rastogi et al. 2020; Wang et al. 2020a). Many factors, including climatic parameters such as temperature and relative humidity (Zhu and Xie 2020), population density (Kraemer et al. 2020), and quality of medical care (Wang et al. 2020b) affect the virus’s transmission. To pursue the matter further, the impact of short-term exposure to air pollutants should be particularly considered (Bashir et al. 2020).

Many studies have shown that ambient air pollutants have adverse effects on the human respiratory system, and have reported that chronic respiratory illnesses can also be caused by long-term exposure to major air pollutants (Becker and Soukup 1999; Cai et al. 2007; Chauhan and Johnston 2003; Chen et al. 2007; Ciencewicki and Jaspers 2007; Horne et al. 2018; Mehta et al. 2013; Phosri et al. 2019; Xie et al. 2019; Xu et al. 2016). Carbon monoxide (CO), nitrogen dioxide (NO2), ozone (O3), and sulfur dioxide (SO2) are significant harmful ambient air pollutants in many urban regions. The risk of adverse cardiopulmonary events, including death, can be increased by continuous exposure to CO (Chen et al. 2007). High concentrations of NO2, as a highly reactive gas produced during combustion, cause catastrophic injury to the human lung (Beelen et al. 2008; Chen et al. 2007; Hoek et al. 2013). NO2 also contributes to the formation of some harmful secondary air pollutants such as O3 (Khoder 2002). Tropospheric O3 levels harm public health in areas throughout the world. Many studies have shown that respiratory irritation and decreased lung function from damaged lung tissue, causing pulmonary and heart disease, are caused by short-term exposure to high levels of O3 (Bell et al. 2004; Filippidou and Koukouliata 2011; Gryparis et al. 2004; Parodi et al. 2005). SO2 can also lead to respiratory problems, especially in patients with pulmonary underlying disease (Smith et al. 1977).

COVID-19 can disperse in aerosol form and remain viable for hours in ambient air (Van Doremalen et al. 2020). Hence, a tangible understanding of the impact of ambient air pollutants on the novel coronavirus infection seems necessary. This study aims to assess the relationship between the satellite-based concentrations of four air pollutants and COVID-19 cases in 31 provinces in Iran by combining two databases: the tropospheric concentration of four air pollutants (i.e., CO, NO2, O3, and SO2) and the number of confirmed cases between February 19 to March 22, 2020. As well, the impact coefficients of each pollutant are introduced separately for Iran’s provinces, and the most vulnerable provinces are determined. Finally, discussion and concluding remarks are provided.

Materials and methods

Study area

The Islamic Republic of Iran, with over 80 million inhabitants and an area of 1648,195 km2, is located in western Asia in a transcontinental region, i.e., the Middle East. Iran lies within latitudes 24° and 40°N and longitudes 44° and 64°E. It consists of 31 provinces with a wide range of population densities (Fig. 1). The climate in Iran is classed as between arid and semiarid. Topography, meteorological conditions, and emissions from mobile and stationary sources in many megacities in Iran contribute to poor air quality on many days of the year.

Fig. 1
figure 1

Population densities (ratios of population size to area) in Iran’s provinces

Data collection

The daily numbers of coronavirus cases for each province were collected from the official website of the Ministry of Health and Medical Education (https://www.behdasht.gov.ir). The provincial population sizes were obtained from the Statistical Center of Iran (SCI) (https://www.amar.org.ir). It should be noted that the number of cases in each province is only reported from February 19, as the starting point of the outbreak in Iran, to March 22, 2020, as shown in Table 1. The maximum numbers of confirmed cases are in Tehran, Isfahan, and Mazandaran provinces, while the minimum numbers of confirmed cases belong to Bushehr, Chaharmahal and Bakhtiari, and Kohgiluyeh and Boyer-Ahmad provinces.

Table 1 Statistical data from February 19 to March 22, 2020

Satellite-based data can be used for monitoring air quality and performing health research (Omrani et al. 2020; Putrenko and Pashynska 2017). To obtain the daily tropospheric concentrations of CO, NO2, O3, and SO2 from the surface up to ~ 10 km, satellite data were extracted from the Sentinel-5P Pre-Operations Data Hub (https://s5phub.copernicus.eu/). The Sentinel-5P, a satellite with a spectral resolution of 0.25–0.55 nm and global daily coverage of 7 × 7 km2, is an Earth-observation satellite. It was developed and launched in October 2017 by the European Commission under the “Copernicus” Program. The satellite carries a TROPOspheric Monitoring Instrument (TROPOMI) that provides a 3.5 × 7 km2 spatial resolution of the measurements for all trace gases, except for CO, which has a 7 × 7 km2 spatial resolution. In this study, Level 2 geophysical products were collected using “wget,” which is a computer program that retrieves content from web servers (https://scihub.copernicus.eu/userguide/BatchScripting).

Daily air pollutant data in a netCDF format were used for both spatial and temporal observations. Since TROPOMI measurements have a varied spatial distribution of grid cells and depend on the viewing zenith angle in every orbit at the moment of observation, each product was resampled for the study area on the regular grid with the same spatial resolution. The dataset was re-projected by latitude/longitude WGS84 using R statistical software (version 3.6.3) (Omrani et al. 2020). Some factors such as cloud cover, saturation, and geometry affect the data quality of individual observations. Therefore, to rectify errors and incomplete retrievals, the data were cleaned using a quality assurance value marked by a ‘qa_value’ which ranges between 0 (no data) and 1 (full data). Measurements with a ‘qa_value’ above 0.5 provided by TROPOMI were used, and outliers were eliminated based on Copernicus specifications (Copernicus 2019). For further analysis, spatial assessment of the air pollutant density was done by separately using the monthly raw data for the mean concentrations of each pollutant for all provinces, which was processed by a Python code. Table 1 shows the statistics for the satellite-based mean concentrations of the four air pollutants in each province during the observation period. The mean concentration values for CO, NO2, O3, and SO2 varied between 0.58 and 1.12 mmol/m2, 11.57 and 13.75 µmol/m2, 0.59 and 0.72 mmol/m2, and 0.40 and 0.66 mmol/m2, respectively.

Multiple linear regression

In this study multiple linear regression (MLR) (İçağa and Sabah 2009; Ng and Awang 2018; Sousa et al. 2007; Ul-Saufie et al. 2012) was used to determine the relationship between daily COVID-19 confirmed cases density (CCD) and the average concentrations of the four criteria air pollutants (i.e., CO, NO2, O3, and SO2). CCD is obtained by dividing the number of total confirmed cases by the population size. The basic model using the least square method was defined as follows:

$$ N_{\text{CCD}} = a{\text{C}}_{\text{CO}} + \, b{\text{C}}_{{{\text{NO}}_{ 2} }} + \, c{\text{C}}_{{{\text{O}}_{3} }} + d{\text{C}}_{{{\text{SO}}_{2} }} $$
(1)

where NCCD indicates the normalized CCD, , b, c, and d are the impact coefficients of CO, NO2, O3, and SO2, respectively. CCO, CNO2, CO3, and CSO2 indicate the daily normalized average concentrations of CO, NO2, O3, and SO2, respectively. The normalized values were obtained by dividing the difference between the daily and the average values by the standard deviations of their daily values.

The above four air pollutants collectively covered 83,624 grid cells over Iran. The impact coefficients determine the contribution level of each pollutant to the occurrence of COVID-19 infections. These coefficients vary between − 1 and 1 to, respectively, indicate relative decreasing and increasing effects.

Results and discussion

Results

Descriptive analysis

The spatial distribution of CCD is shown in Fig. 2. Mapping the CCD over Iran revealed major COVID-19 ‘hotspots’ in Semnan and Qom (which include nearly 9% of all confirmed cases). Aside from these two provinces, nine other provinces located close to the center of Iran had high CCDs and collectively included 64% of all cases. In contrast, five southern provinces, particularly Bushehr and Sistan and Balouchestan, had the lowest CCDs due to their lower case numbers and populations.

Fig. 2
figure 2

Confirmed COVID-19 Case Densities (CCD) from February 19 to March 22, 2020

The provincial mean concentrations of CO, NO2, O3, and SO2 and the corresponding CCDs are shown in Fig. 3. The mean CO concentrations in 13 provinces which collectively included more than 63% of all cases were over 1 mmol/m2. The mean NO2 concentrations were higher than 13 µmol/m2 for 10 provinces which collectively included 58% of all cases. The average O3 concentrations in 11 provinces which collectively included 20% of total cases were more than 0.65 mmol/m2, and 12 provinces with average SO2 concentrations over 0.6 mmol/m2 encompassed 55% of confirmed cases. It should be noted that Semnan and Qom had both high CCDs and high concentrations of the four pollutants. Moreover, three provinces which collectively included 4% of total confirmed cases had mean CO concentrations lower than 0.7 mmol/m2. Six provinces had average NO2 concentrations lower than 12 µmol/m2 and collectively included 5% of all confirmed cases. The mean O3 concentrations were less than 0.6 mmol/m2 for two provinces that together encompassed only 3% of all confirmed cases. There were also two provinces with average SO2 concentrations lower than 0.45 mmol/m2 that included only one percent of total confirmed cases.

Fig. 3
figure 3

Average provincial air pollutant concentrations vs. CCDs from February 19 to March 22, 2020

Air pollutant impact coefficients

The spatial distribution of the four impact coefficients a, b, c, and d, corresponding to CO, NO2, O3, and SO2, respectively, are shown in Fig. 4. The results show that the impact coefficients a, b, c, and d varied between − 0.4 and 0.6, − 0.3 and 0.8, − 0.5 and 0.6, and − 0.6 and 0.8, respectively. There were five provinces with a coefficients higher than 0.4, with Tehran having the largest value, and nine provinces with a coefficients smaller than − 0.2, with South Khorasan having the lowest. The impact coefficient b was larger than 0.4 for seven provinces and smaller than 0 for nine provinces, with Golestan having the highest and North Khorasan having the lowest. The impact coefficient c was over 0.4 for four provinces and less than − 0.25 for five provinces, of which Kohgiluyeh and Boyer-Ahmad had the highest and Markazi the lowest. The impact coefficient d was larger than 0.4 for four provinces, of which Chaharmahal and Bakhtiari had the highest, and smaller than − 0.2 for four provinces, of which Golestan had the lowest. Also, the provinces with the highest a, b, c, and d coefficients had over 31%, 13%, 3%, and 3% of all confirmed cases, respectively. Conversely, the provinces with the lowest a, b, c, and d coefficients had over 22%, 37%, 33%, and 9% of total confirmed cases, respectively.

Fig. 4
figure 4

Impact coefficients for: a CO, b NO2, c O3, and d SO2 during February 19 to March 22, 2020

Sensitivity analysis

In order to perform a provincial sensitivity analysis and to figure out the different effects of the impact coefficients on each province, the influences of each impact coefficient on the density of confirmed COVID-19 cases along with the CCDs are shown in Fig. 5. Each province had different impact coefficients during the study period. Accordingly, the absolute (regardless of positive or negative sign) value of the impact coefficient a was higher than other coefficients in 10 provinces which included 51% of all confirmed cases. The absolute values of the impact coefficient b outnumbered the other three coefficients in 10 provinces which collectively included 32% of total confirmed cases, while the impact coefficient c had the highest absolute value among the other coefficients in eight provinces that collectively encompassed 15% of all confirmed cases. Among the four impact coefficients, the absolute value of coefficient d was highest in merely three provinces that collectively included 2% of confirmed cases. However, in comparison with the other coefficients, the absolute value of impact coefficient a was lowest in only five provinces that collectively had 11% of all cases. The absolute value of impact coefficient b was lower than all other coefficients in seven provinces which collectively had 14% of total cases. The impact coefficient c had the lowest value compared with the other coefficients in 12 provinces which collectively had 41% of all confirmed cases. The absolute value of coefficient d was lowest in seven provinces which collectively had 34% of total confirmed cases compared with the other impact coefficients. Therefore, the overall trend of the four impact coefficients was such that the absolute values of the coefficients a, b and d, and c were significant in many provinces, respectively. It should be noted that the impact coefficients had negative or positive associations in various provinces. Positive values imply relative increasing effect, while negative values indicate a relative decreasing impact on the density of COVID-19 confirmed cases. Among all provinces, the impact coefficients a, b, c, and d had a positive correlation in 16, 22, 17, and 13 provinces, respectively.

Fig. 5
figure 5

Provincial impact coefficients

The value of impact coefficient a outnumbered the other coefficients’ values, regardless of positive or negative influences, in Qom, Yazd, and Markazi as the three regions with high CCDs. Exceptionally, in Semnan, which had the highest CCD, the value of impact coefficient b was larger than that of a.

Discussion

The results of the investigation into the COVID-19 outbreak in Iran’s provinces show that provinces such as Semnan, Qom, Yazd, and Markazi had the highest CCDs and means that such provinces are affected more than others by the novel coronavirus considering their population sizes. Based on the results, it seems that restricting movement and following health instructions should be considered more in these provinces. In addition, provinces located mostly in the center and northern center of Iran with significant population densities had relatively high CCDs. Among these provinces, Tehran, with the largest population, is reckoned to have a potential for COVID-19 infection that may lead to a critical situation unless strict health regulations are put in place. Given that pollutant levels have a prominent role in the virus outbreak, emissions reductions might be a useful approach to control the infection in these provinces. Therefore, some decisions such as setting low-emission zones to reduce daily commutes, prohibiting heavy-duty vehicles entry to high-risk regions, and limiting the work hours of power plants and refineries should be taken. The results reveal that air pollutant levels were influential in the period of the disease outbreak. The mean concentration of CO was high in 13 out of 31 provinces implying that there are more cases in areas with higher concentrations of CO, and a similar trend for NO2 can be observed. Unlike CO and NO2, O3 had a somewhat different trend. Given the results, southern provinces that had lower CCDs were exposed to higher concentrations of O3 during the study period, even though it should be mentioned that the COVID-19 outbreak was assessed in the winter when surface ozone was more likely to have been destroyed by nitrogen oxides. Similar to CO and NO2, mean SO2 concentrations were significant in provinces. This shows that high-risk provinces were subjected to more SO2 pollution. It can be concluded that provinces with larger CCDs, such as Semnan and Qom, had higher mean concentrations of almost all the pollutants.

In this study, significant negative and positive linear correlations have been found between the impact coefficients a, b, d, and c and confirmed COVID-19 cases. However, the impact coefficient d was correlated more negatively than positively with the density of confirmed cases among the provinces. These findings provide evidence that the air pollutants CO, NO2, and O3 unlike SO2, mostly had relatively increasing rather than decreasing effects, even though all of them could be important contributing factors in COVID-19 infections.

Although almost all the air pollutants could be risk factors for the respiratory system, results demonstrated that impact coefficient d had more decreasing effects on COVID-19 cases. Thus, to figure out the effect of SO2 precisely and comprehensively, additional research should be done. Based on the study results, in order to control COVID-19 infections, reducing emissions of CO, NO2, and O3 could be effective, especially in provinces that have experienced larger COVID-19 outbreaks. Meanwhile, further research is needed to understand the effect of SO2 on infections and to illuminate its correlation with confirmed COVID-19 cases.

The current study has several constraints. First of all, only the association between air pollutants and the density of COVID-19 cases was considered, and the results did not address the effects of air pollutants on the disease transmission rate. Second, no distinction was considered for the gender or ages of confirmed cases, nor was the presence of background diseases taken into account. Also, various features of pollutant emission sources were not considered in the data used. Furthermore, the results of this study are mainly focused on Iran, and subsequent studies are required to generalize the conclusions.

Conclusion

The COVID-19 pandemic has affected many countries throughout the world, including both developed and developing countries, since late December 2019. Iran has varied provincial population densities and many highly air-polluted megacities that have different infection rates. Therefore, understanding the effects of major air pollutants on the spread of the virus is vital. Tehran, Isfahan, and Mazandaran provinces had the most COVID-19 confirmed cases, while Bushehr, Chaharmahal and Bakhtiari, and Kohgiluyeh and Boyer-Ahmad provinces had the least confirmed cases.

The COVID-19 CCDs showed that Semnan and Qom were affected by the virus more than other provinces. Nine central provinces had high CCDs, and collectively had 64% of all confirmed cases, while several southern provinces such as Bushehr and Sistan and Balouchestan had the lowest CCDs.

The Sentinel-5P data showed that the largest values of the mean concentrations of CO, NO2, O3, and SO2 were measured in provinces, which had 63%, 58%, 20%, and 55% of all COVID-19 cases. This shows that short-term exposure to larger concentrations of these pollutants could be an important factor leading to an increase in the risk of COVID-19 infection.

Based on the results of modeling, there is a considerable statistical relationship between air pollutants and COVID-19 infection. The provinces with the largest values for the impact coefficients a, b, c, and d held over 31%, 13%, 3%, and 3% of all confirmed COVID-19 cases, while the provinces with the lowest coefficients for a, b, c, and d comprised over 22%, 37%, 33%, and 9% of total confirmed cases. The provincial sensitivity analysis demonstrated that the influences of each impact coefficient on the density of confirmed COVID-19 cases were different. Among all provinces, the impact coefficients a, b, c, and d had positive correlations, i.e., an increasing effect, in 16, 22, 17, and 13 provinces, respectively. The value of impact coefficient a was larger than that of the other coefficients in Qom, Yazd, and Markazi, which had high CCD values. This means that CO had the most significant effect on the density of confirmed cases. In Semnan, as a top hot spot province, the effect of NO2 was greater than that of CO.

Since there is not yet any scientific reason which explains why SO2 had a decreasing effect on the density of confirmed virus cases, conducting more studies to shed light on the underlying mechanisms seems to be necessary.