1 Introduction

Tehran is known to experience high concentrations of particulate matter, particularly during cold seasons. The polluted episodes have Air Quality Index (AQI) higher than the moderate range (Heger and Sarraf 2018; Ashrafi 2012; World Bank 2020; Yousefian et al. 2020). Particulate matters with a diameter less than 10 μm, known as PM10, can get deep into the lungs, and particle matters with a diameter less than 2.5 μm, known as PM2.5, can even get into the bloodstream that poses a greater risk to health than PM10 (EPA 2018). According to Heger and Sarraf (2018), Tehran is annually experiencing more than 4000 premature deaths from ambient PM2.5 air pollution and more than USD 2.6 billion economy costs associated with air pollutions, only considering the human health effect that underestimates the total economic costs from air pollution. Pishgar et al. (2020) have shown a dramatic increase in respiratory diseases from 2008 to 2018; and about 43,000 deaths due to respiratory diseases over the 10 years. Bayat et al. (2019) determined the benefits associated with the decline of PM2.5 levels based on health targets and BenMAP-CE modeling. Their results indicate that economic benefits from reducing the annual concentration of PM2.5 to 10 μgm−3 could be USD 1.9 billion.

Different types of emission sources and their link to the high concentrations of PM2.5 and PM10 have been identified by previous studies. Farahani and Arhami (2020) studied the contribution of local PM10 and PM2.5 sources and regional dust storms to the ambient air of Tehran using air pollution modeling. They found that the contribution of local emission sources as high as 75% and 82% of ambient PM10 and PM2.5 levels, respectively. Ashrafi et al. (2017) studied the potential of changes in radiation budgets by the high level of particulate matter (PM) from different PM10 sources of dust storms in Iran, using the WRF-Chem model. They found that dust aerosol led to short-wave radiation perturbation in both the earth’s surface and top of the atmosphere. For instance, during one of the simulations, the domain-averaged short-wave radiation perturbation was—24.2 Wm−2, and perturbations at the top of the atmosphere are lower than those on the surface and strongly dependent on surface albedo.

Based on the local emission inventory of Tehran, which is derived from major air pollution sources in Tehran, vehicle sources are known as the dominant source of PM2.5, with a contribution of 85% among all sources (Shahbazi et al. 2016), due to their outdated technology and abundance in the city (Heger and Sarraf 2018). However, Arhami et al. (2017) determined that re-suspended soil has a more significant contribution to the PM level of Tehran, using source apportionment method and chemical analysis of whole year measurement samples. They also showed the ratio of Dust/PM2.5 was lower in the cold months than in the warm ones.

The health effects of PM2.5 depend on its concentration that is highly controlled by emission sources and atmospheric conditions. Regardless of emission sources, the atmospheric parameters such as planetary boundary later height (PBLH) and Ventilation Coefficient (VC) affect the concentration level of air pollutants in the ambient air significantly (Sujatha et al. 2016; Du et al. 2013). The atmospheric boundary layer controls the maximum mixing depth, corresponding with a more polluted region near-surface and relatively cleaner free atmosphere (Krishnan and Kunhikrishnan 2004). Mixing height alongside wind speed profile corresponding in VC, which is the power of pollution discharge or ventilation from captured air beyond the inversion layer (Cheremisinoff 2002). In this context, we refer to maximum mixing height as PBLH. Ashrafi et al. (2009) estimated PBLH and VC from 1998 to 2008 using sounding measurements during different seasons for Tehran. They reported that PBLH shows the lowest values during December and January. They measured VC to be about 5000 m2s−1, due to the low PBLH values, which has the lowest value compared to other months. (Ashrafi et al. 2009). These studies are limited, and the dynamics and particularly the spatial patterns of PBLH and VC are, to the most part, unknown in the city.

Furthermore, the impact of heat mitigation measures on the PBLH and air quality has been examined by previous studies (Arghavani et al. 2019; Vahmani et al. 2016). It is reported that although heat mitigation strategies such as cool and green roofs highly affect the heat budget and are suitable for reducing the ambient temperature, they can have a negative effect on severe air pollution episodes due to the reduction of PBLH and change of wind flow patterns (Epstein et al. 2017; Arghavani et al. 2019).

Although the government applied restriction on the nighttime traffic from 2020 November due to control on the spread of COVID-19, which could reduce emission from mobile sources, the concentration of pollutants in the city was elevated with respect to past years. Su et al. (2020) also linked this abnormal high PM level during lockdowns in northern China to low PBLH. It could be concluded that low PBLH and VC probably play a significant role in the air pollution levels of Tehran, whereas some emission control could not even have an effect. This is important to understand since lockdowns have reduced air pollution levels in other cities (He et al. 2020).

The formation and dynamics of PBLH and VC in Tehran are not well understood although they were measured by Ashrafi et al. (2009). This study aims to address this knowledge gap by investigating the roles of dynamic PBLH and VC in a severe air pollution episode that occurred in the winter of 2021. To investigate these roles in the air pollution of Tehran, in this study, we used the WRF model to investigate the role of dynamic PBLH and VC in the severe air pollution episode of winter 2021. To do this, first, we set up the model for the Tehran boundary. Then we model the PBLH and VC for the study period using the WRF model outputs. We examined the model performance using meteorology datasets and satellite surface temperature products. To understand the relation of PBLH and VC to PM2.5 levels, we extract PM2.5 concentrations from available air pollution measurement stations. Then we looked for the possible cause of changes in PBLH and VC that might have led to high levels of PM2.5 in the study period..

2 Materials and methods

2.1 Study domain

Tehran, the capital of Iran, faces the Alborz Mountain in the north and flat plains of agricultural lands in the south (Fig. 1). According to the 2016 census, the population of the city is 8,737,510 people (Statistical Center of Iran 2016). The average annual temperature is 19.1 ℃, and the average minimum and maximum temperatures are 14.2 oC and 36 ℃, respectively. Also, the average rainfall of this city is 209.3 mm (Statistical Center of Iran 2021).

Fig. 1
figure 1

The study area map. Red circles show locations of 17 air pollution measurement stations within the Tehran boundary. Green squares show locations of two synoptic stations, Mehrabad within the Tehran boundary and Imam-Khomeini outside of the city (Google Earth)

2.2 Polluted and clean episode

Severe high air pollution of PM2.5 happened from 30th December 2020 to 15th January 2021, which in fourteen days, Air Quality Index (AQI) exceeded 150 (unhealthy according to EPA (EPA 2021)). Such a long-lasting highly polluted episode has not happened from 2014 to 2021. According to ground measurements in the study period in all stations (Fig. 1), the maximum hourly means of PM10 and PM2.5 reached approximately 170 μgm−3, and 100 μgm−3, respectively (Fig. 2a). Therefore, we selected the period of 27th December 2020 to 15th January 2021 as the polluted episode. We choose this time to determine the role of PBLH and VC in these high PM2.5 level episodes. To assess PBLH and VC roles in clean days, we select 17th January 2021 to 23rd January 2021 as the clean episode.

Fig. 2
figure 2

Time series of a PM10 and PM2.5 concentrations from stationary measurements with errors (maximums and minimums), b PBLH, and c VC from the WRF model within Tehran boundaries throughout the study period

2.3 Model configuration

In this study, we used the state-of-the-art Weather Research and Forecasting (WRF) version 4.3, the non-hydrostatic, an Eulerian Mesoscale NWP model, capable of using different physic parameterization (Skamarock et al. 2019; Wang et al. 2018), to simulate the atmosphere condition and weather parameters for Tehran domain. WRF is coupled to an urban canopy model alongside the land surface model that resolves urban canopy processes and surface energy balance accounting for the nature of urban surfaces (Chen et al. 2011).

We used three nested domains to calculate the urban meteorology parameters of Tehran, as shown in Fig. 3a. The first (d01), second (d02), and third (d03) domains consisted of 90 × 70, 64 × 52, and 46 × 46 grid cells and grid resolution of 18, 6, and 2 km, respectively. We used the 6 h FNL datasets of the National Center of Environmental Prediction (NCEP), with the resolution of 0.5° for boundary and initial conditions of the WRF model.

Fig. 3
figure 3

a Topography of three nested domains in WRF modeling. b The local climate zone classes dataset (Izadi 2021) is used to generate urban categories

The physic parameterization schemes used in this study were the Lin scheme (Janjić 1994) for microphysics, the Kain-Fritsch scheme (Kain 2004) for cumulus parameterization, the Rapid Radiative Transfer Model (RRTM) scheme (Mlawer et al. 1997) for long-wave radiation, the Goddard scheme (Chou and Suarez 1999) for short-wave radiation, the MYJ scheme (Chen and Sun 2002) for the planetary boundary layer, and the atmosphere model is coupled to the Noah land surface model (Niu et al. 2011) including single-layer urban canopy model (Kusaka and Kimura 2004). For regions outside of the United States of America, the default land-use and urban fraction are inaccurate and have missing values. Therefore, we extract land use categories, urban fraction (FRC_URB), and types (Demuzere et al. 2021) based on the local climate zone (LCZ) classes dataset (Izadi 2021), as shown in Fig. 3b. These extracted parameters were used as terrestrial data input. We also used urban parameters as shown in Table 1, proposed by Arghavani et al. 2019, in the urban parameter table (URBPARM.TBL). Based on the extracted urban types, we updated both land use index and green fraction for both January and December according to the LCZ dataset, as shown in Figs. 4 and 5, respectively.

Table 1 Single layer urban canopy model parameters (Arghavani et al. 2019)
Fig. 4
figure 4

(Left column) Land use index (top) default, and (bottom) updated. (Top center) urban fraction. (Bottom center) land use fraction for high residential urban type. (Top right) land use fraction for low residential urban type. (Bottom right) land use fraction for commercial/industry urban type

Fig. 5
figure 5

Green fraction of default (left column) and updated (right column) in all simulations in January (bottom row) and December (top row)

Furthermore, we developed a method to define land use fraction for three urban types according to LCZ classes, as shown in Fig. 4. It must be remarked that for two outer domains, 21 modified MODIS land use cover classes dataset was used in all simulations. Twelve hours were discarded for the spin-up of modeling. Model configurations are summarized in Table 2. To have a more accurate prediction of meteorology conditions, we separated the length of the study period and ran the model every three days. The purpose of the developed methodology is to have a better land surface temperature and meteorology representation compared to the default method.

Table 2 Model configuration summary

Although the latest version of the WRF model, WRF 4.3, introduced a new feature called WRF-BEP that allows for the use of Building Energy Parametrization with Local Climate Zone (LCZ) input data, we opted to use WRF coupled to single layer Urban Canopy Model (WRF-UCM) due to a lack of detailed information on the city’s buildings. We performed all simulations with the modified land use type, land use fraction, and green fraction (as presented in Figs. 4 and 5). However, there were some uncertainties in urban parameterization, as we discussed later in the Discussion chapter.

2.4 Calculation of PBLH and VC

PBLH is simulated by WRF-UCM over the city (domain d03). Then, we used PBLH to calculate the VC using the integration of wind speed across the PBLH. Since the wind speed profile is needed to be calculated in the PBLH using the WRF outputs, we interpolated the wind speed in the PBLH. Then we calculated the VC using Eq. (1),

$${\text{VC}}\left( {x,y} \right) = \sum\limits_{1}^{{{\text{PBLH}}}} {f({\text{wind}}_{i} ,z_{i} )} = \left\{ {\begin{array}{*{20}l} {\frac{{z_{i}^{2} {\text{wind}}_{i} }}{2}} \hfill & {i = 1} \hfill \\ {\frac{{({\text{wind}}_{i} z_{i} + {\text{wind}}_{i - 1} z_{i - 1} )(z_{i} - z_{i - 1} )}}{2}} \hfill & {1 < i < {\text{PBLH}}} \hfill \\ {\frac{{({\text{wind}}_{{{\text{PBL}}}} z_{{{\text{PBL}}}} + {\text{wind}}_{i - 1} z_{i - 1} )(z_{{{\text{PBL}}}} - z_{i - 1} )}}{2}} \hfill & {i = {\text{PBLH}}} \hfill \\ \end{array} } \right.$$
(1)

where \({\text{VC}}(x,y)\) is the VC at each cell of the study domain at a time, i is the model’s layer number from ground, \(z_{i}\) presents the model’s layer height, \(wind_{i}\) presents the model’s layer horizontal wind speed (calculated from \(\sqrt {u_{i}^{2} + v_{i}^{2} }\)), \({\text{wind}}_{{{\text{PBL}}}}\) and \(z_{{{\text{PBL}}}}\) are wind speed at PBLH from previous step, and height of PBLH, respectively.

2.5 Statistical analysis method

To perform statistical analysis of the model performance, we used Pearson correlation coefficient (R), root means squared error (RMSE), and mean absolute error (MAE) using Eqs. (2, 3, 4) and, respectively.

$$R = \frac{{{\text{cov}} (P,O)}}{{\sigma_{P} \sigma_{O} }}$$
(2)
$${\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {P_{i} - O_{i} } \right)^{2} } }$$
(3)
$${\text{MAE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left| {P_{i} - O_{i} } \right|}$$
(4)

where P and O are predictions and observations, respectively, cov is the covariance function, σP and σO are standard deviations of predicted and observed samples, respectively, and n is the number of samples.

2.6 Meteorological and pollutants dataset

We obtained the hourly measurements of meteorological parameters, relative humidity at 2 m (RH2), wind speed at 10 m (WIND10), and temperature at 2 m (T2), for use in model validation from the ASOS Network of IOWA State University, which is available online through their website (Iowa Environmental Mesonet 2021). The ground observations were obtained for two available synoptic stations in d03 of modeling, Mehrabad and Imam-Khomeini airports, with station codes of OIII and OIIE, respectively. Mehrabad station is located at 51°18′12.96″ east and 35°41′8.52″ north, inside the city. Imam-Khomeini station is located at 51°8′60.00″ east and 35°25′0.12″ north, outside the city (as shown in Fig. 1 with green square). Mehrabad and Imam-Khomeini stations were chosen due to their representation of the urban and rural areas of d03, respectively.

We obtained hourly mean air pollutants (PM10 and PM2.5) measurements through the Tehran Air Quality Control Company (AQCC) online website, which has an archive of available records. We extracted and averaged PM10 and PM2.5 measured for all available stations (seventeen stations shown in Fig. 1 with red circles) in Tehran for each hour of the modeling period (AQCC 2021). Then, we averaged all parameters for each day to better judge the impact of PBLH and VC on the concentrations.

Furthermore, we used land surface temperature products from MODIS onboard Terra and Aqua (MOD11_L2.v006) to evaluate the model performance in the second and third domains. Also, we used Wyoming university upper air dataset for the Mehrabad Station to evaluate the model performance vertically.

2.7 Understanding the causes of PBLH and VC fluctuations

In Causality of PBLH and VC fluctuations, we extracted different parameters, such as sensible heat flux (HFX), latent heat flux (LH), T2, first layer cloud fraction (CF), and sea level pressure (SLP), from the WRF model and mapped over d03 within the Tehran boundaries. We did so to have a better understanding of the causality of these changes in PBLH and VC. We also calculated the daily average of these parameters within the Tehran boundaries from the model. Moreover, we looked for the formation of high- and low-pressure systems in the first domain at 500 millibar level. We mapped the PMs spatial distribution during the study period, the clean and polluted episodes in the city. Finally, we performed statistical analysis of mean, standard deviation (std), first, second, and third quartile (Q1, Q2, and Q3, respectively) on the extracted parameters and pollutions.

3 Results

3.1 Model validation

We validated the modified model with three different methods. First, we used RH2, WIND10, and T2 parameters of two synoptic stations of Mehrabad and Imam-Khomeini airports to validate the WRF model in the study period. Then, we used land surface temperature to evaluate the performance of the model for both the second (Fig. 6) and third domain (Fig. 7). Afterall, we used temperature, wind speed, potential temperature Wyoming dataset for the Mehrabad station to evaluate the vertical performance of the model (Fig. 8).

Fig. 6
figure 6

Land surface temperature of the second domain. (top row) the WRF model, (bottom row) the MODIS product (MOD11_L2.v006). The corresponding times to the columns are (left column) 2021-01-01 18:00, (middle column) 2021-01-06 08:00, and (right column) 2021-01-13 19:00

Fig. 7
figure 7

Land surface temperature of the third domain. (top row) the WRF model, (bottom row) the MODIS product (MOD11_L2.v006). The corresponding times to the columns are (left column) 2021-01-02 08:00, (middle column) 2021-01-08 18:00, and (right column) 2021-01-22 19:00

Fig. 8
figure 8

Vertical evaluation of the model for the potential temperature (top row), wind speed (middle row), and temperature (bottom row) parameters. From left the time steps are 2020-12-21 00:00, 2020-01-01 00:00, 2021-01-08 00:00, 2021-01-24 00:00. the y-axes are the height in meters

We observed an acceptable performance of hourly variation of mentioned parameters over the Tehran modeling domain from the WRF model on both stations, as shown in Fig. 9 for time series plots and Fig. 10 for model performance and its accuracy correlations. In Table 3, we presented the statistical analysis of the model.

Fig. 9
figure 9

Times series plots of model performance and stationary measurements for the period of 2020-12-21_00:00 to 2021-01-26_00:00, top row temperature at 2 m, middle row relative humidity at 2 m, bottom row wind speed at 10 m, left column Mehrabad station, and right column Imam-Khomeini station

Fig. 10
figure 10

Model performance and correlations of modeling versus Observations, top row Imam-Khomeini synoptic station, bottom row Mehrabad synoptic station, left column Temperature at 2 m, middle column relative humidity at 2 m, and right column wind speed at 10 m for the period of study

Table 3 Statistical analysis of model

Although the model could capture all three parameters patterns in the Imam-Khomeini station with acceptable performance (Figs. 9 and 10), the model underestimates the 2 m temperature at Mehrabad station (mainly during the nights). The model could not capture the RH in the Mehrabad station; since RH correlates to the temperature, this could explain the model performance in the Mehrabad station for RH. The wind speed MSE at the Mehrabad station was measured to be 2.4 ms−1. However, it is important to note that this measurement may contain some errors that could be attributed to the method of wind speed measurement utilized at the station. Specifically, wind speeds are reported as rounded numbers, and any wind speeds below 2 ms−1 are recorded as 0. These factors may contribute to the observed errors in the wind speed measurement at the station. We looked through the vertical profile of the Mehrabad station (Fig. 8) for the development of temperature profile in the urban area. The model could capture all available parameters vertical profile well for the duration of the modeling. We associated this underestimation in the 2 m temperature with land use type (which is in this case the urban type) that is discussed in details by Molnár et al. (2019). Unfortunately, other weather stations were not accessible to the authors for comparison.

The model could capture land surface temperature in the second (Fig. 6) and third (Fig. 7) domains when we set up the model with the modified green fraction and land use type (figures are not shown). Vahmani and Ban-Weiss (2016) also showed modified remote sensing green fraction yield better performance than the model’s default. While the model could capture the spatial pattern and variation during the modeling period, sometimes the model could not capture the minimum temperature well, mainly in the high elevation of the Alborz mountains (figures are not shown).

3.2 Impact of VC and PBLH on PM levels

To assess the impact of VC and PBLH on PM levels, we used only cells within the Tehran boundaries (Fig. 4) to calculate the average PBLH and VC of the city. Also, we averaged PM2.5 and PM10 concentrations for all seventeen stations, as shown in Fig. 1, to correlate the PM levels with PBLH and VC, as shown in Fig. 2.

PM2.5 and PM10 levels, averaged over all stations, reached higher levels during the period of the polluted episode than the period of the clean one, as shown in Fig. 2a. During the polluted episode, maximum and minimum PM2.5 reached 130.4 and 24.7 μgm−3, respectively. In the clean episode, the maximum and minimum were 40.9 and 4.6 μgm−3, respectively. Differences between PM10 and PM2.5 in the polluted episode period were higher than in the clean episode one, with the mean of 56.5 and 33.6 μgm−3, respectively. The maximum average PM10 concentration in the polluted reached 209.3 μgm−3. Table 4 shows other statistical analyses of the polluted and the clean episode and how the PMs are distributed during both episodes. Also, we mapped the spatial pattern of PM2.5 and PM10 and their ratio (PM2.5/PM10) over Tehran (as shown in Fig. 11). PM10 was the dominant pollutant in the southwest of Tehran in the clean and polluted episodes, while PM2.5 was the dominant pollutant in the inner and north of the city (Fig. 11).

Table 4 Parameters description of mean, std, min, Q1, Q2, Q3, and max for PM2.5, PM10, PBLH, VC, sensible heat flux, latent heat flux, air temperature, cloud fraction, and sea level pressure for the polluted and clean episodes
Fig. 11
figure 11

Spatial pattern of averaged PM2.5 (top row), PM10 (middle row), and the ratio of PM2.5 to PM10 (bottom row) in the clean episode (left column), the polluted episode (middle column), and the period of study (right column)

Boundary layer dynamics and wind strength govern the concentration of the pollutants at any given location. Hence, VC plays a significant role in the dispersion of pollutants (Iyer and Raj 2013; Mahalakshmi et al. 2011). Over the simulated 37 days, PBLH in the period of the polluted episode was significantly lower than in the clean episode, as shown in Fig. 2b. As a result of the decrease in PBLH alongside low dominant wind speed, a huge decline in VC occurred in the polluted episode (Fig. 2c). When VC dropped under 0.7 × 105 m2s−1 in the period of the polluted episode, the accumulation of both PM2.5 and PM10 occurred. In the middle of the polluted episode, a slight increase in PBLH and VC occurred, leading to a decrease in PM levels. We further linked this increase in the middle of the polluted episode to the presence of the low-pressure system.

The average VC above 4.7 × 106 m2s−1 favored effective dispersion and reduction of PM2.5 and PM10 levels in the period of the clean episode (Fig. 2). On the other hand, the high VC and PBLH during the clean episode show a significant correlation with the drop in both PM10 and PM2.5 levels. The low VC and PBLH in the polluted episode show a significant correlation with the PM2.5 and PM10 accumulation in the city. As shown in Fig. 2, an average PBLH of less than 1000 m correlates with an average VC of less than 1 × 105 m2s−1. Also, the average PBLH higher than 1000 m correlates with the average VC higher than 2 × 106 m2s−1. We showed the statistical analysis of PBLH and VC in Table 4.

3.3 Causality of PBLH and VC fluctuations

We, in Fig. 12, depict the daily average of different meteorology parameters that might affect the PBLH and VC in the period of study. Differences between daily PM10 and PM2.5 increased significantly during the polluted episode, while during the clean episode, these differences were at their minimum (Fig. 12a). The spatial pattern of PM2.5/PM10 (PM ratio) in the city (Fig. 11) illustrates that during the polluted episode, inner-city PM2.5 had high pollution portion. Also, the west and south side of the city had a higher PM ratio than the north and east side. Figure 12 illustrates our findings regarding the influence of wind speed on the formation of VC and its impact on pollutant dispersion. Specifically, we observed that during the clean episode, when wind speeds were notably higher, VC played a more significant role in facilitating the dispersion of pollutants. This highlights the importance of considering wind speed as a key factor in understanding the dynamics of air pollution and the role of various compounds in this process.

Fig. 12
figure 12

a Daily averaged PM10 and PM2.5 levels from stationary measurements. Daily average of b PBLH and VC, c sensible heat flux and latent heat flux, d air temperature, e cloud fraction, and f sea level pressure from the WRF model within Tehran boundaries throughout the study period

At the beginning of the polluted episode, the daily average PM levels rose. In the middle of the polluted episode, two drops in PM levels resulted in a decrease in PM levels. There was a decline in PM levels for two days at the late polluted episode despite increasing VC (Fig. 12a, b). This pollution is due to the time that wind needs to transport and disperse accumulated pollutants. Also, prior to the polluted episode, although VC dropped significantly, one day of accumulation of pollutants was required to increase the city’s pollutants to the unhealthy range (2020–12-26). Since sensible heat flux, latent heat flux, cloud fraction, temperature, and sea level pressure play essential roles in the formation of PBLH, we investigate the causality of reduction in PBLH in polluted episode period by these parameters.

The cloud’s presence in the middle of the polluted episode leads to a decrease in temperature and an increase in HFX. As a result of higher HFX, PBLH rose above 200 m, and favorable VC provides suitable conditions for PM dispersion (Fig. 12c, d, e). Further, we linked this pollution discharge to the presence of a low-pressure system for a short time over the northeast of Iran. The presence of a high-pressure system from 25th December 2020 until 13th January 2021 accounted for the polluted episode, while a low-pressure system from 14th January 2021 until 21st January 2021 accounted for favorable conditions in the clean episode, as shown in Fig. 12f. In Fig. 12f, these periods are separated by brown and green dashed lines, respectively.

Although the first two days of the high-pressure system resulted in the accumulation of PM levels in the city, PM levels did not reach the harmful range. Even though the low-pressure system replaced the high-pressure system in the late polluted episode, two days were still unhealthy AQI range, as shown in Fig. 12a–f. Moreover, the polluted episode experienced a mean average SLP of 1024.8 hPa. However, the mean average SLP of the clean episode was 1018.8 hPa. It should be noted that the polluted and clean episodes both had high- and low-pressure systems due to the delay in the accumulation and discharge of pollutants at the beginning of the high-pressure system and the beginning of the low-pressure one, respectively.

We, in Fig. 13, depict the three columns from the left as a presentation of the high-pressure system and two columns from the right as a presentation of the low-pressure system. As shown in Fig. 13, the urban areas experienced warmer temperatures during the high-pressure system than the rural area. High HFX correspond with higher PBLH, and as a result, higher VC occurred in the city. On the south of the northern mountains in the plain area, sea level pressure was higher than the high altitudes of mountains. Similar to 2021-01-09, the south and west-south of the city experience higher VC than other parts of the city during the most high-pressure system. Also, the wind direction was toward the city. The wind direction and speed can bring the outside produced pollutants to the city. During the low-pressure system, the temperature is distributed evenly in both rural and urban areas. Higher PBLH in this system happened compared to those in the high-pressure system. Most of the time, the south and west–south of the city had much higher VC than other parts of the city. In the part of the city where the cloud was present, VC decreased compared to cloudless parts of the city.

Fig. 13
figure 13

Spatial patterns of different parameters captured by the model. (from the top row to bottom) PBLH, VC, sensible heat flux, latent heat flux, T2, cloud fraction, sea level pressure, respectively; From left, the first, second, and third columns represent the high-pressure system with the time of 2020-12-27_12:00, 2021-01-04_06:00, and 2021-01-09_10:00, respectively. From left, fourth and fifth columns represent the low-pressure system with the time of 2021-01-15_16:00 and 2021-01-20_12:00, respectively

In comparison, the polluted episode was warmer than the clean episode. Both HFX and LH were higher in the clean episode than the polluted episode. Statistical analysis of sensible heat flux, latent heat flux, temperature, cloud fraction, and sea level pressure are presented in Table 4.

We investigated more in the formation of low- and high-pressure systems in the first domain at 500 mb pressure level, as shown in Fig. 14. Prior to the polluted episode, a low-pressure system across from the northwest of the domain toward the east. Afterward, a high-pressure system from the west with low wind speeds dominated Iran for several days. In the middle of the polluted episode (Fig. 14, 2021-01-06), a slight low-pressure with high wind speed across from the north of the domain toward the east. Afterward, the high-pressure system was dominant again. A low-pressure system with high wind speed was the key feature of the clean episode.

Fig. 14
figure 14

Geopotential height (orange contour line), wind speed, and direction (barbs and color contour) at 500 mb of the first domain at different times

4 Discussion

4.1 Model’s uncertainties

We configured WRF-UCM over Tehran to simulate PBLH and VC. Validations of the model showed reasonable performance. The T2 uncertainties were found to be higher in urban areas compared to rural areas. However, the model used in this study was able to accurately capture the overall land surface temperature. Salamanca et al. (2018) discussed that while the WRF-UCM model performed well in simulating summertime temperatures (due to calibration to represent the urban heat islands during summer time), it had limitations in capturing wintertime temperatures (in the case of this study in December and January) which were not well represented in the model. On the other hand, Lin et al. (2016) found that the WRF-UCM2D model performed better than the WRF-UCM model, and both studies showed that the Noah-MP surface layer parameterization led to improved wind and temperature performance. Both the Noah and Noah-MP models were able to accurately capture the skin surface temperature. Molnár et al. (2019) showed that based on the defined LCZ in the model, the model’s performance varies from cell to cell. This means the accuracy of the training dataset can affect the model’s performance. According to the Izadi (2021) dataset, the cell associated with Mehrabad station has the industrial/commercial urban type with higher anthropogenic heat. Therefore, the model’s overestimation in T2 can be expected since the land use type is not correctly trained in that specific urban area.

Furthermore, the model calculated the minimum PBLH (mainly happening during the night) about 50 m which is unrealistic. As Lopez-Coto et al. (2020) reported, most physic parameterization configurations underestimate the PBLH during the night. Therefore, the PBLH during the night needs a better representation. In the case of this study, nighttime PBLH and VC are underestimated by the model. There were some uncertainties in the UCM parameterization and urban-type that we used due to the number of samples and the training accuracy. A more accurate and well-trained dataset could improve the performance of the model. Also, we used default urban surfaces albedo and anthropogenic heat. It should be noted that the WRF-BEP model is mainly used and calibrated over the summertime (Ribeiro et al. 2021; Molnár et al. 2019).

4.2 Impact of climate change and meteorology conditions on air pollution

Typical high VCs in the southwest of the city have significant potential to transport local and abroad pollution sources to the inner city. The wind field was also in favor of transporting the non-local pollutants to the city. Therefore, with the advent of a stable high-pressure system, the PBLH and VC will decrease dramatically. This decline is in favor of accumulation and high PM concentrations in the south and southwest. Also, PM levels will increase due to the wind field of the city. On the other hand, high wind speed during the clean episode could contribute to the dust emission (Chen et al. 2021; Csavina et al. 2014; Kurosaki and Mikami 2007). However, as Fu et al. (2017) have shown, narrow streets with high buildings along heavy traffics could accentuate the PM levels in the city.

Some studies in other parts of the world showed that the increase in air pollution levels might be the effect of climate change due to changes in the pattern of cyclones and natural ventilation mechanisms (Mickley et al. 2004). Since one of the consequences of change in the pattern and extremes of weather parameters such as PBLH, VC, temperature, etc., could result from climate change, severe air pollution episodes will be expected under the same emission rate in the future. However, more studies are required to determine if this persistent high-pressure system resulted from climate change.

Furthermore, extreme dryness in recent years (Ashraf et al. 2021) led to unstable soil as a result of groundwater depletion, more soil erosion, and dust storms with local sources (Ebrahimi Khusfi et al. 2020) will be expected to happen. They will be brought to the city by high potential VC in the southwest of the city, since the gap of PM10 and PM2.5 have increased in the period of the polluted episode. As higher temperature results in lower PM2.5/PM10, warmer days of the modeling periods showed a positive correlation with lower PM2.5/PM10, where traffic emissions contributed to air pollution with a higher PM2.5/PM10 ratio in the cold days of modeling episodes (Xu et al. 2017; Puxbaum et al. 2004). Therefore, resilience in climate change is seemed to have an essential role in air pollution management(Bidokhti et al. 2016; McCollum et al. 2013; Dulal 2017). Since December and January have the lowest PBLH and VC among other months, the necessity of more control and managing over emissions of pollutions become more evident in these months.

5 Conclusion

In this study, we set up the WRF-UCM model to investigate the impact of meteorology on the severe episode of air pollution episode in Tehran from 30th December 2020 to 15th January 2021, which resulted in fourteen days with an AQI over 150 (unhealthy). To improve the representation of urban areas in our study, we utilized the available LCZ dataset for Tehran and extracted urban features. The developed methodology using the extracted urban features showed acceptable performance in providing meteorological conditions using the WRF-UCM model. This approach is particularly useful for future studies, as the default settings of the WRF model do not include urban fraction or urban types as input, particularly for Tehran. By incorporating this information, our methodology provides a more accurate representation of urban areas and their impact on meteorology, which in turn can improve our understanding and prediction of air pollution episodes in urban environments.

During the modeling period, the WRF-UCM model demonstrated acceptable performance in capturing 2 m temperature and humidity, as well as 10 m wind speed. Additionally, the model accurately captured land surface temperature in agreement with the satellite observation. However, the model needs improvement in presenting winter temperatures since it is primarily calibrated for summer temperatures. Furthermore, the model underestimates nighttime Planetary Boundary Layer Height (PBLH) and requires better representation.

The low PBLH and VC values in the polluted episode period highly impacted the concentrations of PM2.5 and PM10 in the city. High stable meteorological conditions and a long-lasting high-pressure system intensified the air pollution levels in the urban area of Tehran. The low PBLH and VC result in the accumulation of high concentration in the city, given the complex terrain of the urban area. Notably, this accumulation is highly correlated to wind speed reduction due to an increase in surface roughness as a result of urbanization.

It was observed that pollutants took almost two days to accumulate, resulting in AQI levels higher than moderate during the polluted episode. Similarly, during the clean episode, it took two days for pollutants to be removed by the low-pressure system. Furthermore, non-local pollutants, whether anthropogenic or dust, could significantly impact air pollution in Tehran.

In summary, our study provides valuable insights into the impact of meteorology on air pollution in Tehran. Our methodology provides a useful tool for future research. However, improvements to the model are necessary, particularly in representing winter temperatures and nighttime PBLHs, to enhance the accuracy of the results.