Introduction

High urban air pollution has become one of the main concerns of citizens of Macedonian cities due to the elevated levels of pollutants, especially during the winter season, thus having a negative impact on human health (Mindell and Joffe 2004). The problem associated with the urban air pollution is very complex, as many factors contribute to increased concentrations of the main pollutants (e.g. urbanization, specific topography, emissions sources, environmental factors, meteorology). The main pollutants emitted into the atmosphere in urban areas such as Skopje are sulfur oxides (SO2), nitrogen oxides (NOx), carbon monoxide (CO), and particulate matter (PM/aerosols), mostly consisting of black carbon, sulfates, nitrates, and organic matter. For these reasons, it is necessary to carry out advanced research based on observational, theoretical, and modeling studies in evaluation and estimation of the impact of different emissions (anthropogenic, biogenic, and especially mobile emissions) on air pollutant concentrations in urban areas, since the urban sources of variety of gaseous and particulate matters have the largest contribution to anthropogenic emission (Seinfeld and Pandis 1998). These chemical pollutants include primary PM10 and PM2.5, which are directly emitted in the atmosphere from various sources (e.g., road traffic, construction sites, soil dust, fires, fuel consumption), and secondary PM, which is formed in the atmosphere via chemical reactions in gas and aqueous phases, leading to the oxidation of precursors such as sulfur dioxide (SO2) and nitrogen oxides (NOx). The recent economic developments in North Macedonia and the increasing urbanization will potentially cause wide-ranging consequences in terms of environmental and health problems in the urban areas, especially in the larger cities like Skopje.

Real-time forecast of urban air quality is a relatively new discipline in atmospheric science and highly important to the public as an advanced information for both air quality and safety assessment. The ever-increasing computational power of modern computers over the past years allowed the use of advanced offline deterministic 3-D chemical transport models (CTM), leading to more accurate air-quality forecast, replacing the previous empirical and statistical approach with its limitations of adequately representing the physical and chemical processes of pollutants. Nevertheless, the use of increased computational resources of these new models is justified as they show significant advances and are capable of simulating the complex physical-chemical processes (Zhang et al. 2012; Spiridonov et al. 2019a). The details describing the original approach of developing the air quality forecast and alert system for North Macedonia based on the WRF-Chem model are given by Spiridonov et al. (2019b).

The goal of the research described in this paper is to find the optimal pollutant input parameters for air pollution prediction that uses a WRF-Chem weather model that combines various sources of air pollution emission repositories. We will start with a brief description of the novel approach and the evaluation method with comparison of results and verification. At the end, we will conclude our findings with the optimal pollution source inputs that produce the best results.

Air quality forecast system of North Macedonia

The air quality forecast system has been developed and extensively tested to simulate the atmospheric physical and chemical transport processes over North Macedonia. The main design of the system is based on using the Weather Research Forecast, a non-hydrostatic meso-scale model WRF-ARW (see Skamarock et al. 2008), and chemical module WRF-Chem developed by Grell et al. (2005). An integral part of the system are global data from emissions from various sources (e.g. Stockwell et al. 1990; Ackermann et al. 1998; Schell et al. 2001). For simulation of the urban air quality, WRF-Chem utilizes a set of inventories that are processed with PREP-CHEM-SRC algorithm created by Freitas et al. (2017). The global anthropogenic emissions utilize the Emission Database for Global Atmospheric Research (EDGAR) (e.g. Olivier et al. 2005; Muntean et al. 2014; Crippa et al. 2016a, b) and upgraded data set, the EDGAR-HTAP, with resolution of 0.1° × 0.1° according to Janssens-Maenhout et al. (2015). The WRF-Chem model also incorporates the RETRO, a 1960–2000 global anthropogenic emission base, and GOCART background emission data. A more detailed information about the system can be found in Emmons et al. (2010) and Spiridonov et al. (2019b).

Improving the chemical initialization of the WRF-Chem model

We started by using the same configuration of the WRF-Chem model as described in our previous research (Spiridonov et al. 2019b), with a modified approach in sourcing the urban mobile emissions. Our hypothesis is that the pollution emissions in the atmosphere strongly depend on seasonal variations due to the household heating emissions being concentrated in winter. Given that the majority of the households in North Macedonia use heating wood or other household combustion stovesFootnote 1, we expect that these sources would have significant environmental effects and contribution to increased air pollution levels during the winter period. The high intensity and scale of the measured pollution values during winter further supports our hypothesis that the household heating emissions form a significant component to the total air pollution. Therefore, any model designed for our region should put special emphasis in correctly incorporating these emissions. Our main contribution is to perform a thorough model evaluation with different pollutant emission levels in different seasons. Thus, we would be able to find the optimal mobile pollution sources for each season that should be incorporated into the model chemistry input. This novel approach will significantly improve the model performance and its precision for air quality forecast in urban areas.

Emission sources

In addition to the standard sources integrated in the PREP-CHEM-SRC preprocessor, we use additional data composed by the inventory of gas emissions updated by the Ministry of Environmental and Physical Planning (MOEPP). This inventory represents the total yearly emissions of gases by different GNFR defined over the entire area of the country in geographic rectangles (grid boxes) with size of 0.1 × 0.1 lat/lon deg. For each of these rectangles, we have the total emission of some significant chemical compounds (e.g. nitrogen oxides (NOx), carbon monoxide (CO), and non-methane volatile organic compounds (NVOC)). Each of these pollutants is classified (grouped) by source within the following categories (GNFR sectors): public power, industry, other stationary combined, fugitive, solvents, road transport, shipping, aviation, off road, waste, agricultural livestock, other agriculture, and others. The emissions are given as the total annual emissions in (kiloton/year) for each category and each rectangle on the map. These emissions are later added to the default inventories of the Prep-Chem-Src preprocessor and are compiled and used as initial chemical conditions in the WRF-Chem simulation. As Prep-Chem-Src requires us to overlay the emissions to cities, with their rural and urban areas given, we took a rather different adaptation of this approach. We took the urban areas of North Macedonia and overlapped them with the mobile emission data defined in a grid boxes. We have arbitrary added an additional 1.0 km2 of urban area to each rectangle without an urban area overlap or given the overlapping area of the city with that rectangle if an overlap exists (Fig. 1). The rural area was calculated as the remainder of the area of the square. These areas, together with the emissions (scaled by the factors described in the next chapter) were inserted into Prep-Chem-Src for preprocessing.

Fig. 1
figure 1

Graphical illustration of the adjustment method of the urban mobile emission data pollutants. a) The mobile data emission provided by the Ministry of Environmental and Physical Planning (MOEPP) with a grid resolution of 0.1° × 0.1° lat/long. The shaded orange area represents the municipality of Skopje with location of all available air quality measuring stations from the state network, and private stations. b) Same as Fig. 1a but only for the mobile data emission provided by the (MOEPP). c) Same as Fig. 1b, but for the main municipalities of North Macedonia

Scaling of the chemical inputs

The model chemistry initialization employs an ensemble technique with two different approaches of scaling the inputs:

  1. a)

    The first scaling was done independently of the pollution emission source, by scaling factor of the total emissions for every grid box in the model domain in factors ranging from 0 to 200% of the original input with rate of 10%, The scaling was done uniformly across the entire domain.

  2. b)

    The second approach was done by taking into account the pollution source by scaling only the sources we considered having a seasonal variability. Namely, the column of emission data termed “other stationary combined” sources mostly consist of the pollution emissions from household heating for which the peak occurs in winter and has a significant impact to the urban air quality over that period (see Mirakovski et al. 2018). The justification for this consideration is based on the fact of seasonal variability. For these reasons, it is necessary to delineate emissions and to apply scaling in order to avoid overestimating emission values in summer and underestimation in winter. Furthermore, as these sources are the most significant source of CO and a quite insignificant one for NO, the importance of a separate scaling of these sources is more pronounced.

As for the other pollution sources (traffic, power generation, waste) we do not expect a significant seasonal variation; therefore, these sources were not a subject to scaling in the second approach.

Numerical experiments and model evaluations

WRF-Chem predictions

The best approach for evaluation of any model will be achieved on a wide variety of atmospheric conditions. Given the large number of ensemble members for simulations (required about 40 per run), the computational time and computer performance needed to simulate each day would be significant. This would require us to limit the simulation span and prioritize certain types of atmospheric conditions that we want to evaluate on. As our goal is basically the best estimation of the input parameters for pollutant emissions, we chose to conduct evaluations on days with stable atmospheric conditions that would yield the most reliable results. This way, by comparing our results with the measured results from the network of air quality measurements stations, we could estimate the pollutant inputs on those specific dates in the area around the measurement stations. Numerical experiments have been conducted for 3 winter cases 3 Jan 2020, 17 Jan 2019, and 27 Jan 2017, two summer cases 20 Sep 2019 and 25 Jun 2017, and one autumn case 17 Oct 2017. WRF-Chem model runs were initialized with GFS 0.25 deg data taken at 00:00 UTC for 4-day advance forecasts. All selected dates are specified for the stable atmospheric conditions. However, we have intentionally chosen evaluation time frames over the different seasons, with widely different measured pollutant concentrations, in order to test our hypotheses that these differences are mostly due to the seasonal distribution of the pollutants inputs to the atmosphere from the local sources.

GIS processing of the data and point-based evaluation

The evaluation of the model is done using the publicly available data from a network of measurement stations. This network consists of a combination of government-operated stations by the MOEPP, plus a supplementary network of home installed stations from the Pulse project. We expect that the MOEPP stations are producing better accuracy than that by the home-based stations, as the latter ones are usually non-certified cheap, household devices, without any guarantee on their measurement precision and calibration and the correctness of their placement and installation. Therefore, we think that evaluation results against each of these stations should be taken with caution and would be chosen only to evaluate against the reliable measurements of the MOEPP stations. The only feasible way to successfully evaluate on these home-installed measurements would be to average and use them as a stack of sensor evaluation, which would hopefully smooth out the high-frequency noise. In order to evaluate the simulation against the in situ measurements by the MOEPP stations, we need to transform the model predictions for the specific geographic point of the sensor locations. For that, we first calculated the interpolated simulated data at each of the stations. The geographic interpolation is performed using a GIS database extension for Postgres, named PostGIS. It was done in such way that for each of the measuring points of the location, we took all the simulation sampling points in a radius of 5 km around the measurement point. Then, we calculate weighted average of the sampled simulation points within this radius, where the weighted factor is calculated to be proportional to the inverse distance between the measured point and the simulation sampling point. Finally, the measured and the simulated results for each point were compared with the statistical verification scores explained in the next paragraph.

Comparison with measurements

WRF-Chem predictions have been performed for each of the selected cases using this novel ensemble technique regarding to the mobile emissions data sets. We used two different evaluation strategies regarding these measures: first, we evaluated for each separate day, and then we evaluated the data for the entire simulation period of 4 days ahead for each previously defined intervals of the simulation. The model initialization has been performed using the two scaling methods: The first one utilizes an overall scaling of total annual mobile emissions regardless of the source. The second approach is based on scaling in a wider percentage range only on the total mobile emissions arising from household heating. In addition, it should be emphasized that in several cases, there are stations that were turned off at some time and they either do not provide any data or are constantly zero. Due to the automated way of generating graphical representations, it was more difficult to omit them. Hence, if the measured value is a constant zero, or a time series graph is missing, that means we don't have measured data for that station at that time. Figure 2 a shows a time series of PM10 predictions using overall scaling factor regardless the emission source, while Fig. 2b shows WRF-Chem predictions initialized with scaling only of the mobile emissions, which were generated from heating and other non-industrial pollutants. The results using two different approaches are compared with corresponding measurements obtained at three urban stations. Looking closely at the results, we found that for a winter case study on 3 Jan 2020, WRF-Chem initialization with scaling only mobile emissions caused by heating, provides a better agreement and better fit with measurement series compared with the first model chemistry initialization approach. It is more evident in evaluating the results shown on Supplemental Material Fig. S1, for other winter case (left panels) and summer event (right panels). One sees that WRF-Chem predictions of PM10 concentration for all measured stations are more sensitive to the mobile emissions during a winter time when home heating plays significant role in pollution accumulation in urban areas under stable, high-pressure anticyclonic weather conditions. The findings confirmed our hypothesis that there is a strong seasonal irregular distribution of atmospheric pollutant emissions. It suggests that the seasonal variation of mobile emission input should be considered in the chemical model initialization.

Fig. 2
figure 2

Time series of WRF-Chem ensemble forecast of \(\mathrm{PM}\left(\frac{\upmu \mathrm{g}}{{\mathrm{m}}^3}\right)\) for a three urban air quality monitoring stations. Model is initialized on 3 Jan 2020, with a two different scaling approaches of the initial mobile data emissions. a Overall (bulk) scaling; b scaling of pollution emission as the result of heating and other non-industrial pollutants

WRF-Chem model verification

A number of different verification scores were used for evaluating the predictions. The data from every MOEPP station and for every day of the simulation was used to compute the following statistical tests: mean error (BIAS), mean absolute error (MAE), root mean square error (RMSE), and correlation coefficient (CC) as well as an additional metrics like difference between the predicted and measured maximum for a given period (maximum difference), difference between the predicted and measured minimum for a given period (minimum difference), difference between the predicted and measured average for a given period (difference of averages), and difference between the sum of all predicted values and the sum of all measured values for a given period (difference of sums), which should be proportional to the difference of the areas of the curves for a given period. The first four measures are standard evaluation metrics, while the latter four are additional, more intuitive metrics that were used for evaluation of the model, which in our opinion are more relevant from human perspective, in case the model data would be made available to the general public. However, due to the sheer volume of data collected, the charts presented here will be focused on the four standard measures: Mean bias error (BIAS), mean absolute error (MAE), root mean square error (RMSE), and the correlation coefficient of the PM10 concentrations. The distribution of all metrics as a function of the scaling factors for the 3 Jan 2020 event using a different chemical initialization is shown in Supplemental Material Fig. S2. The chemical initialization is performed using the overall mobile emissions, regardless of the pollution emission sources, while Supplemental Material Fig. S3 shows metrics dependence on scaling factor when ensemble forecasts utilize chemical initialization with scaling only performed for mobile emissions generated by heating. As it is expected, the best accuracy across all scores during the winter months was achieved when scaling up the baseline pollutants input by a higher factor, while in the summer (see Supplemental Material Fig. S4), late spring, and early autumn (not shown), the best results were achieved when scaling down the baseline pollutant emissions by a significant factor. Additionally, we noticed that the scaling factors are not uniform across the entire domain, which might come from the incorrectness of some of the pollutant inputs. The spatial distribution of the predicted PM10 concentrations over the Skopje area in winter is given in Fig. 3, where Fig. 3a shows the predictions of the initial model and Fig. 3b shows the predictions with the optimal parameter chosen by our approach. The same information is presented in Fig. 4 for the summer event. By doing the comparison on days with stable atmospheric conditions and weak wind, we tried to eliminate the influence of the atmospheric factors in our experiment.

Fig. 3
figure 3

Spatial distribution of PM10 pollution prediction in the Skopje area for the 17 Jan 2019 event. a Forecast of the original model. b Forecast of the model with optimal scaling factors

Fig. 4
figure 4

Spatial distribution of PM10 pollution prediction in the Skopje area for the 20 Sep 2019 event. a Forecast of the original model. b Forecast of the model with optimal scaling factors

Conclusion and the future work

The main goal of the present research is to develop and test a novel chemical initialization input that is being implemented in the air quality forecast system in North Macedonia. It is performed using ensemble approach regarding the treatment of the mobile emissions data and modeling of the appropriate scaling factor. The WRF-Chem model forecasts have shown a high sensitivity to different chemical initialization scaling methods. While scaling the overall (total) mobile annual emissions, regardless their emissions sources, tends to produce some differences regarding to PM10 concentration level. Our results show some overestimation during summer and underestimation during the winter season. More precisely, the scaling approach that utilizes scaling in a wider range only of the mobile emissions originated from heating for household needs offers a better flexibility for more accurate pinpointing of the optimal parameters, relative to the method that utilizes indiscriminate scaling of the overall mobile emissions. The verification results indicate that the best accuracy across all scores during the winter months was achieved when scaling up the pollutant input by a higher factor, while in the other seasons, the best results were achieved when scaling down the pollutant emissions by a significant factor. Namely, using our evaluation metrics, we can conclude that the best results are obtained if initializing the model for the Skopje region with values around 50–70% of the total urban mobile emissions for the chosen intervals in January (peak of heating season), with certain days where the optimal range is up to 130% of the original emissions. Values of 10% or less of total urban mobile emissions give the optimal parameters for late summer and early autumn (out of heating season). Taking all into account, we can conclude that the seasonal variation in the pollutant input to the atmosphere is a significant factor in simulating the pollution in this region. Therefore, these seasonal variations must be taken into account when fitting the pollutant emissions input to any model. The seasonal variation of the chemical initialization is modeled even better with selective scaling only of the domestic heating sources. These findings have the potential of future work that may focus on producing a better estimation of the pollutant inputs at any time. A promising area for future work would be investigating the dependence of the temperature and the pollutant inputs, which would be expected due to the higher use of heating fuels with lower temperatures. Our future work will be focused on this area, in producing a better prediction of the emission of pollutants based on the temperature and even coupling the emissions with the temperature predictions of the model.