1 Introduction

Atmospheric Sulfur dioxide (SO2), emitting from anthropogenic and natural sources, is one of the most important air pollutants leading to climate change, environmental problems and health risks (Braga et al. 2006; Gonçalves et al. 2005; Krotkov et al. 2016; Mallik and Lal 2014; Pereira et al. 2009; Hoveidi et al. 2017; Borhani and Noorpoor 2020). Sulfur dioxide can cause a number of harmful effects on the lungs, (i.e., Wheezing, breathing problems, chest tightness) (EPA 2008; Yun et al. 2015). While PM2.5 (Particulate Matter with 2.5 μm or less in aerodynamic diameter) is deposited on the surface of the deeper parts of the lungs and PM10 (Particulate Matter with 10 μm or less in aerodynamic diameter) on the surface of the larger airways in the upper part of the lungs. Particulate matter deposited on the lung surface can lead to tissue damage and lung inflammation (Wei and Tang 2018; Losacco and Perillo 2018; Happo 2009). SO2 is one of the five main pollutants that together account for more than 90% of air pollution in the world (Sharma 2001). Sulfur oxides in combination with particulate matter and moisture (e.g. humidity) in ambient air produce the most harmful effects associated with atmospheric air pollution (Dutkiewicz et al. 2000; Moldan et al. 2001). Numerous studies to show that exposure to SO2, PM2.5, and PM10 Periodontitis increases the risk of respiratory diseases in adults, during the COVID-19 pandemic (Borhani et al. 2021a; Zoran et al. 2020; Zou et al. 2020; Wu et al. 2020). Sulfur dioxide (SO2) is one of the Short-lived climate forcers (SLCF). Short-lived climate forcers (SLCF) are also known to as short-lived climate pollutants (SLCP) (Randall et al. 2008; Retama et al. 2015; Stohl et al. 2015; Kindbom et al. 2019). SO2 remains in the air for an average of 2 to 4 days (Turalıoğlu et al. 2005). SO2, PM2.5, PM10 are precursors of aerosols (Wang et al. 2015).

Although the relationships between meteorological conditions and SO2 air pollution have been investigated, such as, Kalkstein and Corrigan (1986) examined a synoptic climatological approach to assessing the effects of climate on sulfur dioxide concentrations. Elminir (2005) mentioned dependence of air pollutants on meteorological conditions over Cairo in Egypt. And Several works have been carried out to study SO2 air pollution in Iran. Such as, Geravandi et al. (2015) investigated the effects of nitrogen dioxide (NO2) and sulfur dioxide (SO2) exposure related to health and increased premature deaths in Ahvaz, the capital of Khuzestan province, Iran. The results showed that about 5.6% of respiratory mortality and chronic obstructive pulmonary disease cases occurred when nitrogen dioxide (NO2) and sulfur dioxide (SO2) concentrations were above 20 μg m−3. Masoudi et al. (2018) studied the effects of meteorological factors on the prediction of SO2 and the relationships between air pollutants and some meteorological parameters in Tehran from 2009 to 2010. Shen et al. (2020) proposed a machine learning-based extreme learning model to predict the concentration and ratio of air pollutants (O3, CO, NO2, SO2, PM2.5, PM10) in Seoul, the capital of South Korea. Bhatti et al. (2021) analyzed the relationship between pollutants (CO, NO, O3, SO2) using a time series model to predict subsequent concentrations. Naseem et al. (2018) used an autoregressive integrated moving average (ARIMA) model to predict air quality time series data and then evaluated its application in air quality management decision making.

We still have very little information on the dependence of sulfur dioxide emissions on the geographic and meteorological conditions of the city and the emission rate from stationary and mobile sources. This study aimed to examine the relationship between SO2, PM10 and PM2.5 concentrations in ambient air and meteorological conditions in Tehran using the auto-regressive integrated moving average (ARIMA) method and statistical tools of analysis. we tried to make a time series model to forecast the environmental air quality time series data. Therefore, first, exhibits monthly, seasonal and annual variations of concentration of sulfur dioxide and particulates (either PM10 and PM2.5) in the Tehran air over ten years (2011–2020). In the second step, the effect of particulate matter (i.e., PM10 and PM2.5), wind speed, WS, and relative humidity, (RH in %)) on variations of sulfur dioxide concentration based on Pearson correlations between sulfur dioxide and its precursors were also analyzed. In the third step, the concentration distribution of the air pollutants in 21 air quality monitoring stations of the regions were zoned based on the measurements with the ArcGIS software using Inverse distance weighting (IDW) interpolation method. Finally, we use a time series model for the prediction of sulfur dioxide and particulate matter concentrations and wind speed and relative humidity. Figure 1 shows a flowchart of the steps followed during the research.

Fig. 1
figure 1

Flow chart of research methodology

2 Materials and methods

2.1 The study area

Tehran, the largest metropolitan center in Iran, is located at about 35° 41′ 21″ N and 51° 23′ 20″ E (see Fig. 2). The total area covered by the city is about 751 km2. The average annual precipitation amounts to 245.8 mm. The highest temperature in Tehran is 43 °C and the lowest temperature is − 15 °C. The average relative humidity is about 40%. The prevailing wind is West Tehran (270°) and its average speed is 5.5 m/s. Tehran city has a population of 8.7 million approximately. There are a number of reasons for air pollution in Tehran, some of which are in the industrial sector, the transportation sector, and the geographical location of the city, with the large Alborz Mountains in the north of the city contributing to pollution levels. Tehran is thus one of the most polluted cities in Iran, which is why an analysis of air quality in this city must be conducted.

Fig. 2
figure 2

The locations of the air quality monitoring stations and the Mehrabad synoptic station in Tehran. a Iran, b Tehran Province, c Tehran City

2.2 Field measurement

In Tehran city, the automatic measurement of air quality is performed by 21 measuring stations of Air Quality Control Company (AQCC 2020). Figure 2 shows the location of the air pollution monitoring stations. The paper presents the analysis of SO2, PM10 and PM2.5 concentrations measurements collected at 21 stations which provided the information for a database covering a period of ten years: 2011–2020 (Table 1). all air quality monitoring stations with > 75% of the hourly data over the year were used for further calculations, as less than 0.1% of the SO2 data for each station was removed from the data set and subsequent calculations. The incoming information was converted into standard concentrations based on the Clean Discussion Guidelines issued by the US Natural Security Organization (USEPA 1997). In this standard, a maximum concentration of 24 h was used for SO2, PM10, and PM2.5.

Table 1 Annual average of SO2, PM2.5, PM10 and RH and WS recorded from 2011 to 2020 in Tehran

The instruments used to measure the concentrations of SO2, PM10 and PM2.5 were continuous automatic analyzers manufactured by Thermo Fisher Scientific Inc. (Waltham, MA, USA) and Teledyne API Inc. (San Diego, CA, USA). The PM monitor was used to measure the concentrations of both PM10 and PM2.5 in ambient air based on light scattering and β-ray attenuation. The SO2 analyzer employed pulse fluorescence detection to determine the SO2 concentration in the ambient air. Data must be evaluated in the wider context of the checks and calibrations employed during their use and in the production of published data, a process usually known as quality assurance and quality control (QA/QC). Recording Air Quality Control Company systems follows QA/QC (2008/50/EC). Then, in the ArcGIS software, the IDW method was used to obtain air pollution zoning maps. As a result, spatial analysis of distribution of SO2, PM2.5 and PM10 contaminants in Tehran was made. Also, the Spearman correlation coefficient model was used to examine the relationship between sulfur dioxide and PM10, PM2.5, RH, and WS (Ozbay 2012). Correlation coefficients (r) significant at the 0.05 level (P value ≤ 0.05) are identified with a single asterisk (significant).

$$r = \frac{{\sum \left( {\hat{y}_{i } - \overline{{\hat{y}_{i } }} } \right)\left( {y_{i} - \overline{{y_{i} }} } \right)}}{{\sqrt {\sum \left( {\hat{y}_{i } - \overline{{\hat{y}_{i } }} } \right)^{2} \sum \left( {y_{i} - \overline{{y_{i} }} } \right)^{2} } }}$$
(1)

2.3 ARIMA model

Time series forecasts including automatic ARIMA modeling was carried out using the ‘forecast’ package. A differenced process that depends upon the p lags of the series values and q lags of the error terms is termed as ARIMA (p = autoregressive, d = differencing, q = moving average). we apply these processes on this series to find out the best and most appropriate ARIMA model. The ARIMA model can be considered as a “cascade” of the two models. The first part of this model is suitable for non-stationary conditions, the equation of which is as follows:

$$Y_{t} = \left( {1 - L} \right)^{d} X_{t} .$$
(2)

The second part is wide-sense stationary, the equation of which is as follows:

$$\left( {1 - \mathop \sum \limits_{i = 1}^{p} \varphi_{i} L^{i} } \right)Y_{t} = \left( {1 + \mathop \sum \limits_{i = 1}^{q} \theta_{i} L^{i} } \right)\varepsilon_{t} .$$
(3)

The variables of this equation are: L is the lag operator, \(\theta_{i}\) the parameters of the moving average part, \(\varepsilon_{t}\) is error term, and the three variables p, d, q mentioned earlier.

The values of the p and q variables can be found by one of the functions: autocorrelation function (ACF), or partial autocorrelation function (PACF). We used both of these functions here to find the most suitable order (Agrawal et al. 2017). To establish whether an ARIMA model qualifies to predict the concentration of sulfur dioxide, PM2.5 and PM10 and wind Speed and relative humidity parameters in Tehran, statistical errors through determination coefficient (R2), mean square error (MSE) and root mean square error (RMSE) were applied as follows (Beckerman et al. 2013; Elavarasan et al. 2018; Goap et al. 2018):

$$R^{2} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \overline{{y_{i} }} } \right)}}{{\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \overline{y}} \right)}}$$
(4)
$$MSE = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} (y_{i} - \hat{y}_{{i{ }}} )^{2}$$
(5)
$$RMSE = \sqrt {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} (y_{i} - \hat{y}_{i } )^{2} }$$
(6)

where \(y_{{\varvec{i}}}\) and \(\hat{y}_{i }\) are the forecasted and observed \(i\) values of \(y\), \(\overline{{y_{i} }}\) and \(\overline{{\hat{y}_{i } }}\) represent the mean \(y\) values of the forecasted and observed in the tested sample set and \(n\) denotes the number of datum points in the set.

In this study, Meteorological data from the Mehrabad synoptic station in Tehran during a period of 10 statistical years (2011–2020) was used to recorded wind speed and relative humidity. Mehrabad Meteorological Station is located at 35° 41′ N and 51° 19′ E. The average annual temperature, average annual relative humidity, average annual precipitation, and annual sunshine hours are 17.7 °C, 41%, 238.9 mm, and 3025 h, respectively. The prevailing wind is from the west, as calculated at three times of record (morning, noon, and evening) at a speed of 5.5 m/s (IRIMO 2020).

3 Result and discussion

3.1 Sampling data analysis

The results show that the trend of changes in PM2.5, PM10, and sulfur dioxide in Tehran decreased from 2011 to 2020 (Fig. 4a–c) (Skrobacki et al. 2019). Torbatian et al. (2020) investigated air pollution variations in Tehran. Their results show that the PM2.5 and sulfur dioxide concentrations decreased from 2005 to 2016. The average concentrations of SO2, PM2.5, and PM10 during the last 10 Years, are 12.77 ppb, 32.14 µg m–3 and 81.94 µg m–3, respectively. In December and January 2019 and 2020, the concentrations of PM2.5, PM10, and sulfur dioxide increased (Table 1). It was found that the maximum concentration for the examined period was recorded in January 2012 with an average value of 29.91 ppb and the lowest average monthly concentration for SO2 was in 2018 (3.00 ppb) (Table 1). Sulfur dioxide pollutants also cause the formation of secondary suspended particles in the atmosphere. in winter and autumn, inversion is often observed in the city, which leads to an increase in the concentration of pollutants in the city. Kaushik and Melwani (2007), observed high SO2 concentrations during the colder months in Delhi. However, in another study in Kerman, Iran, SO2 concentrations were shown to have a stable trend (Mansouri et al. 2013).

In this study, PM10 also showed a decreasing trend. The highest average monthly concentration for PM10 was in 2013 (112.2 μg m−3) and the lowest in 2020 (44.44 μg m−3) and maximum values are seen in summer (Table 2). The results described in the present study are consistent with several earlier investigations (Draxler et al. 2001; Shahsavani et al. 2012). According to Fig. 3b, the highest amounts of PM10 were recorded in the Shad Abad station.

Table 2 The average monthly SO2, PM2.5 and PM10 concentration and RH and WS in the city Tehran from 2011 to 2020
Fig. 3
figure 3

Distribution of annual average concentration in air quality monitoring stations in Tehran from 2011 to 2020, a PM2.5, b PM10, c SO2

The highest average monthly concentrations for PM2.5 were in 2017 (52.81 μg m−3) and the lowest in 2019 (17.00 μg m−3). PM2.5 reaches its maximum concentration in Tehran in December and January (Table 2). Zhang et al. (2021) examining the trend of PM2.5 concentration, they also concluded that the average PM2.5 concentration reaches its maximum in December and January. The zoning map shows that in regions 3 and 18 (Darrous and Shadabad monitoring stations), concentrations of PM2.5 are high and in the central regions is less (Fig. 3a).

Two monitoring stations, Darrous and Aqdasiyeh, are located in the north of the city, and higher sulfur dioxide concentration is observed at these sites. After that, the District 16 monitoring station (in the southwest of Tehran) shows the highest sulfur dioxide concentration (Fig. 3c). The primary and secondary standard for sulfur dioxide according to the NAAQS for 24 h was 140 ppb (Goudarzi et al. 2014; WHO 2003; US EPA 2011). Table 3 shows that the annual average concentration of sulfur dioxide in air quality monitoring stations in Tehran was below the NAAQS standards from 2011 to 2020. One of the factors that can be effective in the heterogeneous distribution of pollutants in the city of Tehran is the rapid population growth and development of construction activity in some areas compared to other areas. This means that development has not been continuously distributed throughout the city during these ten years.

Table 3 Detailed information on AQMSs and the hourly average of SO2, PM2.5 and PM10 data recorded from 2011 to 2020 in Tehran

3.2 Correlation analysis

Table 4 shows the relationships between sulfur dioxide and PM2.5, PM10, RH, WS using the average Pearson correlation coefficient (r). The results indicated that the Sulfur oxides, in combination with particulate matter and relatively high relative humidity, have the strongest negative impact on the environment (P value > 0.05) (Schwartz 1996; Asadifard 2013). While the concentration of sulfur oxides decreases with increasing wind speed (P value < 0.05) (Unal et al. 2000). Venners et al. (2003) also investigated the concentration of sulfur oxides in Chongqing, China, showing that there is a positive correlation between sulfur dioxide concentrations and particulates (either PM2.5 and PM10) concentrations. wind speed (WS) has a negative correlation with PM2.5 and PM10 concentrations (r = − 0.808 and − 0.496 respectively). In addition, a significant positive correlation was observed between PM2.5 and PM10, which shows that PM2.5 and PM10 have the same origin (r = 0.561).

Table 4 Spearman’s rank correlations between SO2, PM2.5, PM10 and RH, WS data from 2011 to 2020

3.3 Results of the ARIMA model

For the purpose of time series modelling in this study, the first 12 observations (January 2020 to December 2020) were used to fit the ARIMA model while the subsequent 7 observations (from January 2021 to July 2021) were kept for the post sample forecast accuracy check. As presented in Table 5, the value of SO2 is decreases till the ARIMA (3,1,4) but at ARIMA (2,1,4) it again increases. So, the smallest significant value of SO2 for the ARIMA process is at ARIMA (3,1,4). So, we may conclude that the ARIMA (3, 1, 4) gives better results on forecasting the concentration of sulfur dioxide. Pohoata and Lungu (2017) also used the ARIMA model to forecast SO2 concentration in Ploiesti, Romania, showing that the best result was obtained with ARIMA (3, 1, 3). Das et al. (2019) also showed the best fitted ARIMA (p, d, q) was selected on the basis of the maximum value of R2 and minimum value of RMSE. Similarly, ARIMA (2, 1, 3), ARIMA (3, 2, 3), ARIMA (3, 1, 4) and ARIMA (3, 1, 3) are the best model for predicting the concentration of PM2.5, PM10, relative humidity and wind speed, respectively. The ARIMA model was able to predict the 7 months from 2021 relatively accurately, with only 1 year (2020) of data (Table 6). The ARIMA statistical forecasting method did not provide satisfactory results to predict PM10 concentrations in 2021, the relative errors are bigger than 10% (Fig. 4c). The forecasting of the PM10 concentrations is difficult because of the uncertainties in describing the meteorological conditions and emission variability (Zhang et al. 2017). This error rate may be associated with a rapid change in wind direction. The prevailing wind flows from the west in Tehran. main winds from west, south and southwest which are industrial places are the main sources of Tehran air pollution (Borhani et al. 2017, 2021b). The increasing error in forecasting is due to the unpredictability of new trends that may occur in the future. In April 2021 prediction for SO2 concentration, the Actual Value was 4.68 ppb, while the Predicted Value was 4.61 ppb, a difference of 0.07 ppb. For the data corresponding to SO2, PM2.5, WS and RH time series, the determination coefficient (R2) and RMSE equal to (0.74, 2.18), (0.73, 9.49), (0.88, 0.67) and (0.82, 9.66), respectively (as can be seen in Fig. 4a–e and Table 7). Therefore, the ARIMA model provided satisfactory results.

Table 5 Square root of the MSE for Algorithm test the average monthly SO2, PM2.5 and PM10 concentration and RH and WS in Tehran from January 2020 to December 2020
Table 6 Comparison of model predictions and actual values for average monthly SO2, PM2.5 and PM10 concentration and RH and WS in Tehran from January 2020 to December 2020 (Algorithm test)
Fig. 4
figure 4

Time-series analysis (Train set,Forecast set, Algorithm forecast, Test set and Algorithm test) model in air quality monitoring stations in Tehran from 2011 to 2020, a SO2, b PM2.5, c PM10, d WS, e RH

Table 7 Comparison of model predictions and actual values for average monthly SO2, PM2.5 and PM10 concentration and RH and WS in Tehran from January 2021 to July 2021 (Algorithm forecast)

In general, the ARIMA model is the most suitable model for predicting unexpected values, but drastic changes in the data and, of course, the monthly step and averaging have caused a high error for some pollutants, even when the noise has been removed.

4 Conclusion

The paper presents the results of comparative analysis of measuring and monitoring the concentration of sulfur dioxide, based on, the changes in sulfur dioxide concentration due to changes in particulates (either PM10 and PM2.5) concentrations and meteorological conditions (relative humidity and wind speed) in the air in the city of Tehran from 2011 to 2020. In this study used an ARIMA modelling approach for short-term forecasting Sulfur dioxide (SO2), Particulate Matter (PM10 and PM2.5) concentrations, and meteorological conditions (relative humidity and wind speed). According to the measurements results of SO2, PM10 and PM2.5 concentrations from 21 monitoring stations of Air Quality Control Company in Tehran and the data of Mehrabad Meteorological Station (RH and WS), it can be deduced, average SO2, PM10, and PM2.5 concentrations have decreased significantly over the past decade. And as for the seasonal concentrations, SO2 concentrations in the air are increased in the winter, which is affected not only by weather but also by other pollutants (production of thermal energy for city heating, combustion of fuel oil in power plants, small boiler rooms for certain economic entities, companies, old vehicles, etc.). The results show that the monthly concentration of sulfur dioxide is lowest in June and highest in January. The distribution of annual average concentration in air quality monitoring stations shows that the highest average SO2 concentration in zoning map is related to the north and southwest of Tehran. The Correlations of SO2 with PM2.5 and PM10 and with relative humidity and wind speed were examined using Pearson correlation. The results showed that SO2 has a positive and significant correlation with PM2.5, PM10 and relative humidity and a negative correlation with relative wind speed. These results come from the fact that more than 80% of sulfur oxides are produced by burning fossil fuels from stationary pollution sources, followed by the most important cause of particulate matter emissions, the high volume of traffic in Tehran. The Alborz Mountains have made Tehran’s climate very humid and as a result, sulfur oxides combined with suspended particulate matter and humidity have the most harmful effects on air pollution in Tehran.

However, for quality improvement, it is necessary to take many additional measures, such as the expansion of the public transport system, especially the metro system, the establishment of technical control centers and the obligation of technical inspection of vehicles, the control and monitoring of car traffic, especially in the last hours of the night, the change of fuel consumption of suburban power plants and the monitoring of pollution standards, and better distribution of fuel for intercity buses, because without this update, no concrete measures can be taken to reduce SO2.

In addition, relatively accurate prediction of air pollutant concentrations, both short and long term, is an important step in mitigating the damage causes of poor air quality. For example, public and private institutions can plan for the economic, health and environmental impacts of air pollution by predicting the period when air pollution levels may be particularly high.