1 Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), an extremely transmissible and pathogenic coronavirus, caused the novel coronavirus disease (COVID-19), a highly pathogenic, infectious, and invasive pneumococcal disease. Two highly transmissible and pathogenic viruses known as severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV) emerged in humans at the beginning of the twenty-first century. COVID-19, a pandemic of acute respiratory disease, threatens human health and public safety (Cui et al., 2019; Hu et al., 2021).

The novel coronavirus disease (COVID-19) outbreak spread rapidly to Europe, the Middle East, the USA, and other places of the world in a short time, just after it started as an epidemic in Wuhan, China, in late December 2019. As a result of this event, World Health Organization announced on March 11, 2020, that: “We expect to see the number of cases, the number of deaths, and the number of affected countries climb even higher. We are deeply concerned both by the alarming levels of spread and severity and by the alarming levels of inaction. We have therefore made the assessment that COVID-19 can be characterized as a pandemic.”

The first COVID-19 case was reported on 11 March 2020, and the first death from COVID-19 was announced on 17 March 2020 in Turkey. Therefore, Turkey took emergency measures to combat the epidemic as quickly as possible. January 10, 2020, a science council was established to evaluate and manage all COVID-19 cases over the country. Passengers were checked for COVID-19 at important points such as the airport before entering the country, and a quarantine period was applied. On 3 February 2020, in Turkey, all flights from China stopped temporarily. After Iran approved its first case, the borders between the two countries were closed. Hygiene, physical distancing, using a mask, and staying at home were advised by the government to reduce or stop transmission of the virus. Some restrictive measures such as education from home and preventing people over 65 from leaving their homes were taken. In addition, the curfew on weekends and official and religious holidays continues throughout the country, especially in metropolitan cities such as Istanbul, Ankara, and Konya. Despite drastic measures, the number of confirmed cases continued to increase (Aydın et al., 2021; Şahin, 2020).

Air pollution has been identified as the largest environmental problem and cause of premature death in the world. Because the body’s immunity is affected by air pollution, humans remain more vulnerable to pathogens (Copat et al., 2020). World Health Organization estimates that around 7 million premature death happens every year caused by exposure to polluted air, due to exposure to highly polluted air by 9 out of 10 people all over the World (WHO, 2018). Epidemiological and toxicological studies show a link between urban air pollutants that can induce airway inflammation and airway hyper-responsiveness and the increasing rate and/or severity of respiratory and cardiovascular diseases (Koren, 1995; Madureira et al., 2019; Schwela, 2000; Zoran et al., 2020).

There are many studies to investigate the relationship between the epidemiological factors and the concentration of air pollutants and meteorological factors. In the study by Yao et al., 2020, a positive relationship was found between PM2.5 and PM10 concentrations and the case fatality rate (CFR) of COVID-19 in Wuhan (Yao et al., 2020). It has been observed that when PM2.5 and PM10 increase by 10 μg/m3, CFR of COVID-19 increases by 0.86% (0.50%–1.22%) and 0.83% (0.49%–1.17%), respectively. According to the result of the review study (Mehmood et al., 2020), both short- and long-term exposure to PM2.5 support the higher incidence of the lethality of COVID-19. In another study by Zoran et al., 2020, analysis results showed a negative correlation between COVID-19 and relative air humidity, and the high impact of the daily air quality index (AQI) on the COVID-19 cases outbreak in Milan (Zoran et al., 2020). In the study conducted by Li et al., 2020, it was stated that AQI had an important role in COVID-19 transmission as it was found that there was a statistically significant and positive correlation between AQI and the number of daily COVID-19 incidence in both Wuhan and Xiaogan (Li et al., 2020). NO2, PM2.5, and CO showed a statistically significant and positive correlation with the daily incidence of COVID-19, while temperature and daily sunshine duration correlated negatively. In the study by Biktasheva, 2020, it was determined that local air humidity was statistically negatively correlated with the COVID-19 mortality rate in the German Federal States (Biktasheva, 2020). The study by Bashir et al., 2020 is noted that PM2.5, O3, and NO2 are significantly associated with total cases, recovered cases, active cases, and deaths from COVID-19. There was also a statistically significant correlation between temperature and total incidence, recovery, and deaths (Bashir et al., 2020). Finally, PM10, humidity, and EQI were only significantly associated with active cases from the COVID-19 pandemic in Germany. These results reveal that while air pollutants such as PM2.5, NO2, and O3 are the main determinants of COVID-19 in Germany, temperature is the only climate indicator. The study by Hendryx and Luo, 2020, showed that a higher risk of COVID-19 may be associated with higher diesel exhaust concentrations, as significant correlations with diesel PM, COVID-19 prevalence, and mortality have been detected (Hendryx and Luo, 2020). However, more studies are needed to confirm these relationships. Another study by Iqbal et al., 2020, showed a correlation between average daylight hours, total COVID-19 cases, average high temperatures, total COVID-19 cases, and deaths (Iqbal et al., 2020). These results are expected to support organizations such as WHO and local governments in their fight against the spread of COVID-19. The study by Srivastava, 2020, comprehensively reveals that SARS-CoV-2 viruses are associated with aerosols from sneezing, coughing, and speech (Srivastava, 2021). In addition, the study revealed that the aerosol containing SARS-CoV-2 produced by sneezing and coughing was the main source of the spread of the virus, particulate matter and gaseous pollutants caused more COVID-19 cases and deaths, and increased temperature/humidity decreased the number of cases. In a study conducted by Şahin in Turkey, COVID-19 cases and air pollutants were associated only; meteorological factors have not been evaluated (Şahin, 2020). It was found that SO2 had a statistically significant positive correlation with the number of COVID-19 cases, O3 had a high effect on the number of COVID-19, NO, NO2, and NOx had a negative effect, and there was a correlation between PM10 and the number of COVID-19 cases. The study by Zhu et al., 2020, in eight regions of South American countries, exhibited a statistically highly significant negative correlation between daily incubate cases and absolute humidity throughout the selected regions (Zhu et al., 2020).

COVID-19 is known to be transmitted from human to human person through close contact (about 2 m) by aerosol containing SARS-CoV-2 produced by sneezing and coughing respiratory droplets smaller than 5 μm in diameter. Therefore, it has been one of the important issues investigated the role of environmental factors in transmission, as particulate matter and gaseous pollutants can cause more COVID-19 cases and deaths (Copat et al., 2020; Li et al., 2020; Srivastava, 2021; WHO, 2018). In addition, due to the effects of air pollution on the immune system, the relationship between air quality parameters and epidemiological factors should be examined. Since this issue concerns the whole society, it has serious importance in terms of public health management.

In this study, the air quality parameters in two different climatic and geographical areas were statistically evaluated and compared. The relationship between the air quality parameters and the new COVID-19 cases was conducted and discussed. Two regression models for the effect of air quality parameters on new cases were developed and discussed.

2 Material and methods

2.1 Study area

The study consists of two regions from turkey, which are the Western Black Sea and Western Anatolia. The Western Black Sea Region lay down along the Black Sea at the middle north boundary of Turkey (See Fig. 1). This area consists of 10 districts, which are Amasya, Çankırı, Kastamonu, Samsun, Tokat, Zonguldak, Karabük, Sinop, Bartın, and Çorum. The total area is 73,914.95 km2. This area considers a low-density population area in Turkey 61 km2. The population in the area is 4,493,559 people (2014).

Fig. 1
figure 1

The location of Western Anatolia and Western Black Sea regions. The map consisting the Köppen-Geiger climate classification (Öztürk et al., 2017)

The Western Anatolia region is located in the internal part of Turkey (See Fig. 1). This area consists of three big districts, which are Ankara, Konya, and Karaman. The total area of this region is 75,096.90 km2. This area considers a low-density population area in Turkey 100 km2. The population in the area is 7,499,242 people (2014).

To find the differences between the Western Anatolia and Western Black Sea regions, these areas were evaluated based on the Köppen-Geiger climate classification. The Köppen-Geiger climate classes represent a reflection of topography structure and vegetation characteristics. Considering vegetation as visible climate, different regions in a similar class share common vegetation characteristics. The Köppen-Geiger system separates climate into five major classes and 30 sub-types. The classification is based on threshold values and seasonality of monthly precipitation and air temperature patterns. The five major classes are A (tropical), B (dry), C (moderate), D (continental), and E (polar) (Öztürk et al. 2017). The study area is mainly consisting three main groups out of the five groups; these three classes are B (dry), C (moderate), and D (continental).

2.2 Group B: dry climates

This region is characterized by an increased rate of evaporation and transpiration than the annual rate of rain, making its plants of the type that tolerate heat and drought. One of the sub-classifications within the study area that appeared within this group is the cold semi-arid climate (BSk).

2.3 Group C: moderate climates

This region predominates in areas with moderate temperatures. The average temperature in the coldest months of the year in winter is less than 18 °C and not less than 3 °C below zero. Within the study area, there are the following sub-classifications found within this group: Csa has a Mediterranean climate characterized by hot and dry summer; Csb has a Mediterranean climate characterized by warm and dry summer; Cfa has a humid subtropical climate characterized by hot summers and no dry season; and Cfb has a temperate oceanic climate characterized by warm summers and no dry season.

2.4 Group D: continental climates

This climatic region is characterized by frozen soil in winter and the persistence of snow cover for several months of the year, and its average temperature in the coldest months of the year is less than 3 °C below zero, and its average exceeds 10 °C in the warmest of those months. Within the study area, the following sub-regions and sub-classifications appeared within this group:

Dsa: hot, summer, humid continental climate influenced by the Mediterranean region. The coldest month has an average of less than 0 °C, the average temperature of the warmest month is above 22 °C, and at least 4 months have an average of 10 °C. Precipitation comes in the winter months, and the summer months are dry, with the driest month in the summer receiving less than 30 mm.

Dsb: a continental climate with warm, humid summers influenced by the Mediterranean region. The coldest month has an average of less than 0 °C, the average temperature of the warmest month is less than 22 °C, and at least 4 months have an average above 10 °C. The precipitation comes during the winter months, and the summer months are the driest. The driest month in the summer receives less than 30 mm of precipitation.

Dsc: subarctic climate influenced by the Mediterranean region. The coldest months average below 0 °C and 1–3 months average above 10 °C. The precipitation is coming in the winter months, while the driest months are coming during the summer. The driest month in summer receives less than 30 mm of precipitation.

Dfb: warm-summer humid continental climate. The coldest months average less than −0 °C, all months have average temperatures below 22 °C, and at least 4 months average 10 °C. There is not much difference in precipitation between seasons.

Dfc: arctic climate. The coldest months average below 0 °C and 1–3 months average above 10 °C. There is no significant difference in precipitation between seasons.

The Western Anatolia region comprises (See Fig. 1) the sub-classes such as B (dry) BSk, C (moderate) Csa, Csb, and D (continental) Dsa, Dsb, Dsc, Dfb. Around 70.06% (52,615.16 km2) from the Western Anatolia region appeared in the B dry group, 23.36% (17,545.09 km2) appeared in the D (continental) group, while only 6.57% (4936.64 km2) from the area appeared C (moderate) group. On the other hand, The Western Black Sea region (See Fig. 1) contains the same climate groups (B (dry), C (moderate), and D (continental)), but consisting of different sub-classes such as BSk, Csa, Csb, Cfa, Cfb, Dsb, Dsc, Dfb, and Dfc. Approximately, 31.56% (23,332.30 km2) from the Western Black Sea Region appeared in the B dry climate group, 26.26% (19,410.84 km2) from the area appeared in the C (moderate) climate group, and 42.17 % (31,170.42 km2) from the region appeared in the D (continental) climate group.

In addition, the internal location of the Western Anatolia region and located in the shadow area of the continentals worked as a barrier against air movement. Thus, the air contamination is assumed to be different from the Western Black Sea Region. The Western Black Sea Region is generally located in the face of the air source coming from the Black Sea.

2.5 Data collection

Meteorological data such as net short wave radiation flux, temperature, relative humidity, wind speed, and pressure were gathered from the Global Land Data Assimilation System (GLDAS). The GLDAS meteorological data is hosted in Google Earth Engine (GEE) platform (Rodell et al., 2004). The climatological data were downloaded using Javascript language from the gridded images for the study of the Western Anatolia and Western Black Sea regions.

The data on air quality parameters were acquired from the Ministry of Environment and Urbanization (Ministry of Environment and Urbanization, 2021). Air quality parameters such as particulate matter (PM10) having an aerodynamic diameter of less than or equal to 10 m (PM10), nitrogen dioxide (NO2), sulfur dioxide (SO2), carbon monoxide (CO), nitrogen oxides (NOx), and ground-level ozone pollution (O3). The data on confirmed cases and new hospitalizations on COVID-19 were obtained from the COVID-19 information page of the Republic of Turkey Ministry of Health on December 12, 2020 (https://covid19.saglik.gov.tr/). The daily data were collected from June 29, 2020, to November 23, 2020, for the meteorological data and air quality parameters which are equal to the available data on the number of COVID-19 cases

2.6 Statistical analysis

The data for the two regions regarding selected environmental parameters and the number of new cases of COVID-19 were analyzed using descriptive statistics and graphical presentation (Alkarkhi and Low, 2012). Two more techniques were used to find the effect of environmental parameters on the new cases of COVID-19; the two techniques are principal component analysis and regression model to describe the effect of environmental parameters on the number of new cases (Al-Karkhi and Alqaraghuli, 2020).

3 Result and discussion

3.1 Descriptive statistics

Descriptive statistics for the environmental variables in the two selected regions are presented in Table 1. The difference between the average values, standard deviation, and minimum and maximum values is high which is reflected by the values of standard deviation for most of the selected parameters which indicates that the level of the selected parameters has highly fluctuated. The distribution of meteorological and air pollutant parameters in the two study areas is visualized in Fig. 2 and Fig. 3, respectively. The collected data for the number of new COVID-19 cases in the two regions, Western Black Sea and Western Anatolia, are presented as a graphical summary in Fig. 4, including the mean, standard deviation, skewness kurtosis, minimum, maximum, first and third quartile, median, confidence interval, and histogram with the normal curve. The mean value for the Western Black Sea is less than Western Anatolia (90.365 < 247.68), while the skewness value for Western Black Sea is higher than Western Anatolia (3.3695 > 0.6851), both groups of data are skewed. The high variation is attributed to the difference between the two geographical and topographical locations, as Western Anatolia is a continental area and the Western Black Sea is a coastal area.

Table 1 Showing descriptive statistics for the selected environmental variables
Fig. 2
figure 2

The distribution of meteorological parameters a specific humidity, b wind speed, c air temperature, and d pressure in the two study areas

Fig. 3
figure 3

The distribution of air pollutant parameters (μg/m3) a PM10, b SO2, c CO, d NO2, e)NOX, and f O3 in the two study areas

Fig. 4
figure 4

Graphical summary for the new number of cases in the two regions a Western Black Sea and b Western Anatolia

3.2 Principal component regression

Principal component regression was used to extract new variables from environmental parameters to be used as independent variables and the number of new cases of COVID-19 in the two selected regions as a dependent variable. Ten environmental variables were used to extract the independent variables that influence the number of new COVID-19 cases and model the relationship in terms of regression model. Three components were extracted with an eigenvalue of more than 1 explaining more than 85% of the total variance (Table 2 shows the contribution (percentage) of each component). The loading of each environmental variable with each component is measured by the coefficient associated with the variable (Table 3), loadings take values from −1 to 1, the high value of loading close to −1 or 1 gives an indication that the selected variable has a strong influence on the component, while close to 0 indicates that the variable has a weak influence on the component. The values of the first and second principal components are presented in Fig. 5, clearly showing the difference between the two regions. The values of each component represent an independent variable; thus, three independent variables can be used based on the number of components. Based on the graphical summary of the number of new cases of COVID-19 (Fig. 4a, b), the variable is skewed, thus normality testing should be carried out. The result of the normality assumption for the number of new cases is presented in Fig. 6a, b, showing that the data did not meet the normality assumption as presented by the value of the Anderson-Darling (AD) 12.832 and 3.048 for Western Black Sea and Western Anatolia with P-value < 0.005. Transformation of the number of new cases was carried out in two steps, the first step is to transform the original data into fractional rank and the second step is to use inverse DF and IDF normal; the result of the transformed data is presented in Fig. 7a, b, showing the transformed data are normally distributed as appeared by the small value of AD, 0.055 and 0.026 for Western Black Sea and Western Anatolia respectively.

Table 2 Showing the extracted components including eigenvalues, percentage, and cumulative variance explained by each component
Table 3 Showing the loadings for the loading for the extracted components
Fig. 5
figure 5

The first and second components

Fig. 6
figure 6

Probability plot for normality assumption test for number of new cases a Western Black Sea and b Western Anatolia

Fig. 7
figure 7

Probability plot for transformed data (normality assumption test) a Western Black Sea and b Western Anatolia

The relationship between the number of new cases (transformed variable) as dependent variable and the three variables that resulted from principal component analysis (each component produced one variable), X1, X2, and X3 are given in Eqs. 1 and 2 for Western Black Sea and Western Anatolia respectively.

$$\textrm{New}\ \textrm{cases}\ \textrm{Western}\ \textrm{Black}\ \textrm{Sea}=\kern0.5em 270+1.8\ {X}_1+191\ {X}_2-71.8\ {X}_3$$
(1)
$$\textrm{New}\ \textrm{cases}\ \textrm{Western}\ \textrm{Anatolia}\kern0.5em =319+49.8\ {X}_1-55.8\ {X}_2+6.72\ {X}_3$$
(2)

Where X1 represents the effective values for the parameters: wind speed, PM10, SO2, CO, NOX, and O3, X2 represents the effective values for pressure, humidity, and NO2, and X3 represents the effective value for temperature.

The results of the analysis showed that the model in Eqs. 1 and 2 is significant which indicates the existence of relationship between the number of new cases and the selected environmental parameters. Furthermore, testing of individual variable showed that the variable X1 which represents the values of the first components (wind speed, SO2, CO, NOX, and O3) is not an influential variable in the Western Black Sea area (Table 4); this indicates that the parameters represented in X1 are not influential parameters and does not affect the number of new cases of COVID-19, while X1 and X2 are influential variables. On the other hand, the variables X2 and X3 are not influential variables in Western Anatolia (Table 4); this indicates that only wind speed, SO2, CO, NOX, and O3 which represent X1 are influential parameters and affect the number of new cases in Western Anatolia. Accordingly, the number of new cases is affected by different parameters and based on the region which indicates the difference in the number of new COVID-19 cases. The model in Eq. 1 explains only 59.3% of the total variance in the number of new cases while the model in Eq. 2 explains only 13.7% of the total variance.

Table 4 t-test for regression coefficient for Black Sea and Anatolia

The data for the number of new cases of COVID-19 were further analyzed using t-test. The results of the analysis showed that the number of new cases of COVID-19 is different in the two regions as the t-test statistic is −16.58 and P-value < 0.0001. These results confirmed that the environmental parameters behave differently in each region. This could be due to the difference in the climate classes that are explained in section 2.1.

4 Conclusion

Evaluation of the effect of air quality parameters on new COVID-19 cases between two different climatic and geographical regions is expected to contribute to the determination of appropriate health management strategies. Based on the abovementioned findings and discussion, it can be concluded that the number of new cases of COVID-19 and the selected air quality parameters in both areas (continental and coastal areas) has a significant relationship. Furthermore, more than 85% of the total variation in the data was explained by only three components which represent the new variables in regression analysis. The Tow regression model that describes the relationship between the new independent variables (X1, X2, and X3) and the number of new cases of COVID-19 (response variable) was significant. In the continental area, wind speed, SO2, CO, NOX, and O3 which represent X1 are influential parameters and affect the number of new cases. The model in Eq. 1 explains only 59.3% of the total variance in the number of new cases, while the model in Eq. 2 explains only 13.7% of the total variance. The results of the analysis of variance for regression analysis showed a strong significant effect on the number of new cases (P-value < 0.0001). Furthermore, X1 showed a negative effect on the number of cases, while X2 and X3 showed a positive effect on the number of new COVID-19 cases.