Air pollution forecasting application based on deep learning model and optimization algorithm

Heydari, Azim; Majidi Nezhad, Meysam; Astiaso Garcia, Davide; Keynia, Farshid; De Santoli, Livio

doi:10.1007/s10098-021-02080-5

Air pollution forecasting application based on deep learning model and optimization algorithm

Original Paper
Open access
Published: 14 April 2021

Volume 24, pages 607–621, (2022)
Cite this article

Download PDF

You have full access to this open access article

Clean Technologies and Environmental Policy Aims and scope Submit manuscript

Air pollution forecasting application based on deep learning model and optimization algorithm

Download PDF

Azim Heydari ORCID: orcid.org/0000-0003-1352-3209¹,
Meysam Majidi Nezhad¹,
Davide Astiaso Garcia²,
Farshid Keynia³ &
…
Livio De Santoli¹

7776 Accesses
51 Citations
1 Altmetric
Explore all metrics

Abstract

Air pollution monitoring is constantly increasing, giving more and more attention to its consequences on human health. Since Nitrogen dioxide (NO₂) and sulfur dioxide (SO₂) are the major pollutants, various models have been developed on predicting their potential damages. Nevertheless, providing precise predictions is almost impossible. In this study, a new hybrid intelligent model based on long short-term memory (LSTM) and multi-verse optimization algorithm (MVO) has been developed to predict and analysis the air pollution obtained from Combined Cycle Power Plants. In the proposed model, long short-term memory model is a forecaster engine to predict the amount of produced NO₂ and SO₂ by the Combined Cycle Power Plant, where the MVO algorithm is used to optimize the LSTM parameters in order to achieve a lower forecasting error. In addition, in order to evaluate the proposed model performance, the model has been applied using real data from a Combined Cycle Power Plant in Kerman, Iran. The datasets include wind speed, air temperature, NO₂, and SO₂ for five months (May–September 2019) with a time step of 3-h. In addition, the model has been tested based on two different types of input parameters: type (1) includes wind speed, air temperature, and different lagged values of the output variables (NO₂ and SO₂); type (2) includes just lagged values of the output variables (NO₂ and SO₂). The obtained results show that the proposed model has higher accuracy than other combined forecasting benchmark models (ENN-PSO, ENN-MVO, and LSTM-PSO) considering different network input variables.

Graphic abstract

Pragmatic Method Based on Intelligent Big Data Analytics to Prediction Air Pollution

A novel hybrid model for six main pollutant concentrations forecasting based on improved LSTM neural networks

Article Open access 24 August 2022

Air quality prediction for Chengdu based on long short-term memory neural network with improved jellyfish search optimizer

Article 17 April 2023

Find the latest articles, discoveries, and news in related topics.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

High levels of air pollutants are added by industries, vehicles and different natural/anthropogenic sources leading to air quality degradation. As discussed in (Linares et al. 2018), institutions, like World Health Organization (WHO) or the European Environment Agency have reported that being exposed to pollutants increases the risk of early death (García and Aznarte 2020). Air pollution causes harmful effects on the human’s and other animals’ health, together with damages to plants and monuments. In addition, particulate air pollution is directly related to cardiovascular (Brook et al. 2004) and respiratory (Gorai et al. 2014) disorders.

Nitrogen dioxide (NO₂) commonly is derived from fossil fuel combustion and causes bronchitis, pneumonia, emphysema, etc., when entered the alveoli. Sulfur dioxide (SO₂) comes from the fuel combustion, including sulfur, like coal and petroleum. At high levels, it irritates the respiratory tract make breathing difficult (Chen et al. 2012). Sulfur dioxide (SO₂) is one of highly reactive gases called “oxides of sulfur” that is emitted into the atmosphere by fossil fuels (coal and petroleum products), power and industrial plants, and industrial processes, like steel and mining, burning fuels consisting of high sulfur amount because of transportation vehicles, including locomotives, ships, pulp industries, and natural sources, like volcanic emissions (Andersson et al. 2013).

Global energy consumption is constantly increasing (from 3728 Mtoe in 1965 to 12,928 Mtoe in 2014) due to an increase in population and economic growth (Aydin 2015a). The use of different types of energy sources has increased the global primary energy consumption provided by fossil sources, particularly oil, coal, and natural gas, up to 87%. The increase in energy demand for the next years (Aydin 2014) would be met not only by the increase in renewable energy source, but also by fossil fuels, mainly oil and gas (Aydin 2015b). Countries with the highest energy consumption currently constitute about 62% of the world's energy consumption. Thus, modeling their energy consumption for obtaining an estimated model of future world energy consumption is of great importance (Gokhan Aydin et al. 2016). Cities as high-density urban areas consume a high amount of energy. They occupy only 2% of the land, but use about 75% of the global energy consumption, and are responsible for 80% of the world’s greenhouse gas emission (Feng and Zhang 2012). Moreover, due the increase in fossil fuels use from the Industrial Revolution, greenhouse gases have significantly increased. Consequentially, global warming and climate changes have been the main concerns, particularly during the past two decades. The effect of global warming-related consequences affecting the global economy has been widely studied since the 1990s. International organizations aimed at decreasing the adverse effects of global warming using intergovernmental and binding laws. Carbon dioxide (CO₂) has been known as the most important greenhouse gases in the Earth's atmosphere. The energy sector has been the first by the direct burning of fuels, by which a high amount of CO₂ is emitted. CO₂ from energy indicates nearly 60% of the emissions of anthropogenic greenhouse gas, which varies highly by country, because of diverse national energy structures (Köne and Büke 2010). Oil plays an important role in the world economy and natural gas (NG) has become a direct competitor for the first because of its environmental benefits and its effect on electricity production. In the next two decades, there will be a rise in energy demand provided by fossil fuels, and oil still plays a main role complemented with the increasing effect of NG mostly on electricity production (Aydin 2014).

In the current century, the rarity of the fossil fuels to generate electricity and the high population make energy as one of the basic needs of developing human societies. However, increased awareness and concern regarding the environmental issues resulted in the increased willingness for using energy-efficient technologies. Thus, energy is one of the most important global challenges for all countries (Mohammadi et al. 2018). The highest approved oil reserves, such as non-conventional oil deposits can be found in Venezuela, Saudi Arabia, Canada and Iran (20%, 18%, 13%, 10.6% of global reserves, respectively) (Tofigh and Abedian 2016). Based on Oil & Gas Journal, from January 2013, Iran has been estimated to have 155 billion barrels of oil reserves, over 10% of the global total reserves and 13% of Organization of the Petroleum Exporting Countries (OPEC) reserves (Tofigh and Abedian 2016) that remains approximately for 94 years (Nejat et al. 2013). The highest proved NG reserves can be found in Russia, Iran, and Qatar (24%, 16.8%, and 12.5% of global reserves), respectively. Also, Iran has been the third-largest NG reserves in Asia because of the South Pars field development. Nonetheless, it will be the seventh NG producer by 2035 (Tofigh and Abedian 2016).

The air quality forecast issue is addressed by time series assessment, in which the model selection and parametrization are highly regarded. Recently, using machine learning approaches, such as neural networks, supported via deep learning are producible (Valput et al. 2019). Gennaro et al. (2013), proposed the artificial neural network (ANN) to forecast daily PM₁₀ concentration at regional as well as urban background areas. Feng et al. (2015), presented air mass trajectory assessment as well as wavelet transform for improving ANN prediction accuracy of daily mean PM_2.5 concentration and it was applied for recognizing marked corridors, whereas wavelet transform was used for coping with the PM_2.5 concentration fluctuation effectively. In (Sun and Sun 2017), authors proposed a new combined forecasting model according to principal component analysis (PCA) and least squares support vector machine (LSSVM) optimized with cuckoo search algorithm regarding the prediction of PM_2.5 concentrations. In (Madaan 2019), authors provided a novel forecasting model according to Bidirectional Long Short-Term Memory Networks for the prediction of air quality via estimation of the concentration levels for different pollutants (NO2), particulate matter (PM_2.5 and PM₁₀), by which threat level can be classified for them in the next 24 h. Prasad et al. (2016), designed adaptive neuro-fuzzy inference system (ANFIS) in order to predict the daily air pollution levels of SO₂, NO₂, CO, ozone (O₃), and PM₁₀ in the climate of a Megacity (Howrah). In addition, Li et al. (2018) provided an intelligent model for air pollutant concentration prediction according to the weighted extreme learning machine (WELM) as well as the adaptive neuro-fuzzy inference system (ANFIS). García and Aznarte (2020) have proposed a forecasting model based on Shapley additive explanations to predict NO2 time series. Li and Jin (2018) presented a combined intelligent model based on fuzzy synthetic evaluation for the early warning monitoring of air pollutants in China. Sen et al. (2016) proposed an ARIMA forecasting model for predicting energy consumption and greenhouse gas (GHG) emission. The model applied for an Indian pig iron manufacturing organization. Ding et al. (2017) provided a new grey multivariable model to predict CO₂ emission form fuel combustion in China. Wang and Ye (2017) presented a forecasting method based on nonlinear grey multivariable method for predicting Chinese carbon emission from fossil energy use. Furthermore, a discrete grey forecasting model for energy-related CO₂ emissions prediction in China from 2011 to 2015 implemented by (Ding et al. 2020). A hybrid intelligent prediction model based on semi-experimental regression approach as well as ANFIS for air pollution forecasting has been applied by (Zeinalnezhad et al. 2020). Maleki et al. (2019) applied a forecasting model based on ANN for criteria air pollutant concentrations forecasting such as O₃, NO₂, SO₂, PM₁₀, PM_2.5, CO, AQI, and AQHI. In (Say and Yücel 2006), Turkey’s energy sector reviewed from 1970 to 2002. The total energy consumption (TEC) was modeled through the economic growth (proxied by gross national product—GNP) as well as population growth as the main factors for determining the energy consumption in developing countries. Also, they reviewed the relationship between the TEC and total CO₂ (TCO₂) emission. Accordingly, they modeled, the potent association between TEC and TCO₂ (R² = 0.998) using regression analysis. In addition, a regression model can predict the TEC according to the population as well as the GNP with high confidence (R² = 0.996). Kumar et al. (2017) provided American Meteorological Society/Environmental Policy Agency Regulatory Model (AERMOD) in order to predict short-term air quality considering weather forecasting based on WRF method. The comprehensive emission inventory was provided regarding the sources in Chembur, Mumbai. In (Kumar et al. 2016)), vehicular pollution modeling presented with AERMOD using simulated wheatear meteorology through Weather Research and Forecasting method. NOx and PM levels were 3.6 and 1.45 times greater in peak time compared with off-peak and evening peak, respectively. Tao et al. (2019), proposed a new short-term forecasting model considering deep learning to PM2.5 concentration. The proposed model has been applied for the Beijing PM2.5 dataset. Liu et al. (2020) presented a new wind-sensitive attention structure using the long short-term memory (LSTM) neural network method for forecasting the air pollution—PM2.5 level. A combined air quality early warning model that includes estimation, forecasting, and assessment is provided by (Jiang et al. 2019). Hähnela et al. (2020) developed a deep-learning framework to monitor and forecast air-pollution that can train across various model domains. In (Ding et al. 2021), a new selection model considering the cooperative data index has been developed to characterize the optimum forecasting group from different forecasts. Agarwal et al. (2020) provided a prediction method based on artificial neural networks for forecasting PM₁₀, PM_2.5, NO₂, and O₃ pollutant level for the current day and the next four days in an area with high pollution.

In the light of the abovementioned state of the art, the main contributions and novelty of this paper are presented as follows:

(a)
The air pollution from manufacturing industry production has been analyzed in different months based on various input variables.
(b)
Since input variables plays very important roles in the performance of the forecasting model, the best features as input variable have been selected using mutual information model in this paper.
(c)
A novel forecasting application (long short-term memory network and multi-verse optimization algorithm) has been proposed to predict the amount of produced NO₂ and SO₂ by the Combined Cycle Power Plant. In this model, the LSTM parameters are optimized using the MVO metaheuristic algorithm.
(d)
The proposed forecasting application has been successfully verified on the real datasets. In addition, the forecasting model is compared with the other valid benchmark models such as (LSTM-PSO, ENN-MVO, and ENN-PSO).
(e)
The best advantage of the proposed prediction application compared with the methods presented in recent studies is the use of a combined deep learning method. In particular, in the proposed application, deep learning models and optimization algorithms have been used and integrated with the method of mutual information to select the best features as the model inputs.

This paper is organized as follows: Sect. 2 explains the case study and artificial intelligence methods. Section 3 describes the proposed air pollution forecasting application. The results and discussion provided in Sect. 4. Finally, Sect. 5 presents research conclusions.

Materials and methods

After a brief explanation of the case study and data gathering, this section explains the artificial intelligence methods developed in the paper.

Case study

The Kerman Combined Cycle Power Plant is located in Kerman, Iran. The latitude and longitude are 56.7904° and 30.2091°, respectively.

The Kerman Combined Cycle Power Plant, with a rated power equivalent to 1912 (MW), includes eight gas units, each with a rated power of 159 (MW) along with 4 steam units, each with a rated power of 160 (MW). Each steam contains two vertical heat recovery steam generator. The exhaust steam is transferred to a steam turbine from two boilers, each boiler consisting of two high and low pressure drums as well as a deaerator drum.

The main fuel of this power plant is natural gas, while its support fuel is gas oil (diesel), which is stored in 20 million liter tanks. In this regard, gas at 70% and diesel at 30% are the fuel of this power plant. Figure 1 shows the Kerman Combined Cycle Power Plant. The data has been collected based on the different stacks output. Moreover, the proposed forecasting model has been tested by two different datasets: (Case A) is the NO₂ and SO₂ produced by stack 1; (Case B) is the NO₂ and SO₂ produced by stack 2. Based on these two cases, we can analysis the level of air pollution and better evaluate the proposed model. The datasets including the wind speed, air temperature, NO₂, and SO₂ for five months (May–September 2019) with a time step of 3-h. The location of Kerman Combined Cycle Power Plant is indicated in Fig. 1. In addition, Fig. 2 represents the Windrose diagram based on wind speed and wind direction data.

Mutual Information

Various indicators are developed to calculate statistical dependence, including Mutual information (MI). Shannon, as the creators of the entropy (Shannon 1948), proposed the following formula to calculate the Shannon entropy (which is also known as information content).

$$ H\left( X \right) = - \mathop \sum \limits_{i = 1}^{N} p\left( {x_{i} } \right)\log \left[ {p\left( {x_{i} } \right)} \right] $$

(1)

Yang et al. (2000) determined the MI between two randomly selected variables.

$$ {\text{MI}}\left( {X,Y} \right) = H\left( X \right) + H\left( Y \right) - H\left( {X,Y} \right) $$

(2)

Here $H\left(X\right), H\left(Y\right)$, and $H\left(X,Y\right)$ are equal to the entropy of X, Y, and their common entropy, as follows:

$$ H\left( {X,Y} \right) = - \mathop \sum \limits_{x \in X} \mathop \sum \limits_{y \in Y} p_{XY} \left( {x,y} \right) $$

(3)

Selecting appropriate variables is a crucial step in developing models. Analyzing the previously observed trends is a valuable source for identifying important factors. Babel and colleagues argued that access to mutual information is a vital source to identify appropriate factors (Babel et al. 2015). In the present research, MI was used to select the best features of the model and to evaluate the appropriateness of used lagged values. Table 1 pinpoints strong correlations between NO₂ and SO₂ and all the other indicators.

Table 1 The correlation coefficient of NO₂ and SO₂ with other indicators

Full size table

Elman neural network

Elman neural network (ENN) is a kind of self-recursive neural network that is usually composed of two hidden layers and output (Elman 1990). This network is combined by a hidden layer and an output layer. In Elman neural network, due to the recurrent nature, the output of the hidden layer will be returned by the feedback loop to input of the hidden layer. The most important part of neural network is determining the most optimal values of parameters in the network. Network parameters are the weights and biases. Since the Elman neural network has recursive feature, if two different Elman networks with the same weight and biases and with a specified period and specific input enter into the network, they may produce a different output. Figure 3 shows the Elman neural network and recursive loops. This network, like other neural networks for optimal calculation of the parameters and training the network needs a way to train the network to obtain the optimum value of parameters.

Long short-term memory neural network

LSTM is a unique recurrent neural network (RNN) method with the RNN features, in which a series of memory cells are used for coping with the arbitrary input data and enhancing the learning process of time series. Also, it captures the long-term dependence of the input information for preventing the gradient disappearing of data transmission, leading to enhancing its ability for capturing the dynamic alterations of the time series (Yuan et al. 2019). Hochreiter and Schmidhuber (1997) developed the long short-term memory (LSTM) architecture and it was improved by Gers et al. (1999) using an extra forget gate. It is the most effective RNN architecture and is widely used. In the LSTM method, gate unites, output and input units, and memory cells, are combined (Yuan et al. 2019).

Gate units:

$$ {\text{Input}}\;{\text{gate:}}\;g_{i} = \sigma \left( {W_{ix} \cdot X_{t} + W_{ih} \cdot h_{t - 1} + b_{i} } \right) $$

(4)

$$ {\text{Forget}}\;{\text{gate:}}\;g_{f} = \sigma \left( {W_{fx} \cdot X_{t} + W_{fh} \cdot h_{t - 1} + b_{f} } \right) $$

(5)

$$ {\text{Output}}\;{\text{gate:}}\;g_{o} = \sigma \left( {W_{ox} \cdot X_{t} + W_{oh} \cdot h_{t - 1} + b_{o} } \right) $$

(6)

Input units:

$$ C\_{\text{in}} = \beta \left( {W_{cx} \cdot X_{t} + W_{ch} \cdot h_{t - 1} + b_{c} } \right) $$

(7)

Memory cells:

$$ C_{t} = g_{f} \cdot C_{t - 1} + g_{i} \cdot C\_{\text{in}} $$

(8)

Output unit:

$$ h_{t} = g_{o} \cdot \theta \left( {C_{t} } \right) $$

(9)

$$ y = \varphi \left( {W_{y} \cdot h_{t} + b_{y} } \right) $$

(10)

where $\sigma \left(\bullet \right)$, $\beta \left(\bullet \right)$, and $\theta \left(\bullet \right)$ indicate nonlinear activation functions, respectively; $\varphi \left(\bullet \right)$ represents the output unit function; the weight coefficient W as well as bias coefficient b are applied for establishing the relationship among the units made the model. Figure 4 shows the main structure of the LSTM network.

Multi-verse optimization algorithm

As a nature-based algorithm, the multi-verse optimization algorithm (MVO) is developed by Mirjalili et al. (2016). This algorithm is the main inspiration of developing the theory of multi-verse in astrophysics. Based on the MVO, several big bangs form various universes, while white holes, black holes, and wormholes connect these universes. Mirjalili argued that matters of the MVO move from a universe to another by white/black holes so that black and white holes attract and emit matters, respectively. Wormholes connect two sides of a universe. The major terms of this theory are as follows: each universe is a solution, while each solution is comprised of a series of objects, generations or iterations that are used to demonstrate the time, and the inflation rate is used to demonstrate the value of each object in a particular universe. In this theory, a solution is equal to a universe with various white/black/wormholes. To enhance the value of objects, white holes are assumed to be more likely in a particular solution, which indicates a higher value. On the other hand, black holes are more likely to be formed in objects with the worst values, which causes the transmission of values from variables with acceptable solutions. This enhances the likelihood of improving weak solutions, which in turn translates into an improved mean value of all solutions. In Eqs. 3 and 4 the core structure of the algorithm is described:

$$ x_{i}^{j} = \left\{ {\begin{array}{*{20}c} {x_{k}^{j} \quad r_{1} < {\text{NI}}\left( {U_{i} } \right)} \\ {x_{i}^{j} \quad r_{1} \ge {\text{NI}}\left( {U_{i} } \right)} \\ \end{array} } \right. $$

(11)

where ${X}_{i}^{j}$ represents the jth object of the ith universe, r₁ is a random number in a predefined spectrum ranging from zero to one, NI(Ui) is equal to the normalized inflation rate of the ith universe and ${X}_{k}^{j}$ represents the jth object of the kth universe.

$$ x_{i}^{j} = \left\{ {\begin{array}{*{20}l} {x_{j} + {\text{TDR}} \times \left( {\left( {{\text{Ub}}_{b} - {\text{Lb}}_{b} } \right) \times r_{4} \times Lb_{b} } \right)\quad r_{3} < 0.5|r_{2} < {\text{WEP}}} \\ {x_{j} - {\text{TDR}} \times \left( {\left( {{\text{Ub}}_{b} - {\text{Lb}}_{b} } \right) \times r_{4} \times Lb_{b} } \right)\quad r_{3} \ge 0.5|r_{2} < {\text{WEP}}} \\ {x_{i}^{j} \quad r_{2} \ge {\text{WEP}}} \\ \end{array} } \right. $$

(12)

where Xj is the jth centroid of the best universe obtained so far, UB represents the upper bound, LB equals the minimum bound, Traveling Distance Rate (TDR) and Wormhole Existence Probability (WEP) are coefficients, r₂, r₃, and r₄ represent random values ranging from zero to one.

Also, the algorithm of MVO represents the ideal solution to optimize and apply it to influence other solutions. In the original research, the authors argued that wormholes can be found in all universes. Again, in turn, it enhances the likelihood of having access to better solutions and maintaining the ideal solution which is reached in the optimization process. At the end of optimization, the ideal solution can be achieved as a global optimum for a particular problem. The prerequisite of solving the abovementioned equations is exchanging variables between various solutions. It worth noting that if these equations be applied in similar patterns (either exploitative or exploratory) yield similar results. The MVO contains the following processes to appropriately focus on various patterns during the optimization pattern, which is as follows:

$$ {\text{WEP}} = {\text{Min}} + {\text{Iteration}} \times \left( {\frac{{{\text{Max}} - {\text{Min}}}}{L}} \right) $$

(13)

$$ {\text{TDR}} = 1 - \frac{{{\text{Iteration}}^{{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 p}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$p$}}}} }}{{L^{{{\raise0.7ex\hbox{$1$} \!\mathord{\left/ {\vphantom {1 p}}\right.\kern-\nulldelimiterspace} \!\lower0.7ex\hbox{$p$}}}} }} $$

(14)

where p is the exploitation component. Two types of adaptive variables are available in MVO: WEP and TDR. WEP increases based on the frequency of iterations to increase exploitation. To enhance the precision of exploitation/local in the process of finding the best solution, TDR should be increased in various iterations. Hence, MVO can be considered as a revolutionary algorithm to exchange matters. This issue indicates the crossover operator, a known revolutionary operator to find the optimum solution. It leads to an unexpected alternation of universes, enhances the exploration, and keeps the diverseness of the universes while performing the iterations. After identifying the best universe, each universe takes a series of variables in a random process. This issue indicates the mutation, a revolutionary algorithm. In turn, the mutation operator results in slight alternations in best solutions and exploitation. Elitism is a revolutionary operator to maintain the best solution achieved during the optimization process. The elitism achieves through finding the best universe (Fathy and Rezk 2018).

Proposed air pollution forecasting application

The proposed strategy acts in such a way that the data are firstly evaluated basing on the mutual information statistical method; the parameters that have a good correlation move to the next stage; then, input variables (best features) are classified to predict two output variables (NO₂ and SO₂).

After selecting the feature, the future amount of NO₂ and SO₂ will be predicted by using combined intelligent forecasting application. The proposed air forecasting application phases are as follows:

Phase 1 Determining the effect coefficient of effective indicators on the NO₂ and SO₂ emissions, by using the Pearson correlation.
Phase 2 Designing the main engine forecaster based on long short-term memory neural network considering the obtained best features in phase 1.
Phase 3 Optimizing the parameters of LSTM and ENN neural network, by using the different metaheuristic optimization algorithms such as multi-verse optimization algorithm and particle swarm optimization algorithm.
Phase 4 Testing and training of the combined models, by using the total data as well as the classified data in phase 1.

Figure 5 presents the overall framework of the proposed forecasting application.

Results and discussions

This section summarizes the error measurement indicators and the “NO₂ and SO₂ forecasting results” section.

Error measurement indicators

After optimizing the neural network parameters, the hybrid forecasting models are trained and tested. The optimal parameter values should be integrated in the neural network. The network should be trained, using some of the input data (in this study, 80% of the data is used for training). Afterwards, different error indicators are used to assess the trained network. In this study, to evaluate the performance of forecasting models, three error indicators were incorporated, i.e., RMSE (Eq. 15), MAE (Eq. 16), and MAPE (Eq. 17).

$$ {\text{RMSE}} = \sqrt {\frac{1}{m}\mathop \sum \limits_{i = 1}^{m} \left( {x_{{{\text{acti}}}} - x_{{{\text{fori}}}} } \right)^{2} } $$

(15)

$$ {\text{MAE}} = \frac{1}{m}\mathop \sum \limits_{i = 1}^{m} \left| {x_{{{\text{acti}}}} - x_{{{\text{fori}}}} } \right| $$

(16)

$$ {\text{MAPE}} = \frac{100}{m}\mathop \sum \limits_{i = 1}^{m} \left| {\frac{{x_{{{\text{acti}}}} - x_{{{\text{fori}}}} }}{{x_{{{\text{acti}}}} }}} \right| $$

(17)

NO₂ and SO₂ forecasting results

Since the environmental pollutants play a very important role in global warming as well as climate change, predicting their growth rate in the near future will help to manage the control of these pollutants. Nowadays, artificial intelligent methods have a very acceptable accuracy in predictive problems. As discussed previously, the aim of this paper is to predict the amount of produced NO₂ and SO₂ by the Combined Cycle Power Plant. In this study, various intelligent models (ENN-PSO, ENN-MVO, and LSTM-PSO) are applied as predictive intelligence models. Table 2 shows the predicted results of the proposed model and the other provided model in this study using the data of Case A from May to September in two different types of input data, respectively. In this Table, two types of input variables are considered. It means, type (1) includes: wind speed, air temperature, and three lagged values of each output variables (${\text{NO}}_{{2_{t - 1} }}$, ${\text{NO}}_{{2_{t - 2} }}$, and ${\text{NO}}_{{2_{t - 3} }}$ for NO₂ forecasting; ${\text{SO}}_{{2_{t - 1} }}$, ${\text{SO}}_{{2_{t - 2} }}$, and ${\text{SO}}_{{2_{t - 3} }}$ for SO₂ forecasting), type (2) includes: just lagged values of each output variables.

Table 2 Performance of the proposed model and other combined forecasting models to predict NO₂ and SO₂ based on type (1) and (2) input variables (Case A)

Full size table

As indicated in Table 2, the performance of the proposed approach is very better than the other combined forecasting models for two output variables (NO₂ and SO₂). In addition, to better monitor the results of the Table, the forecasted value and real value of two different months have been selected and are shown in Fig. 6.

Furthermore, the two output variables (NO₂ and SO₂) have been predicted based on two different types of data for Case B. The results are shown in Table 3.

Table 3 Performance of the proposed model and other combined forecasting models to predict NO₂ and SO₂ based on type (1) and (2) input variables (Case B)

Full size table

As shown in Tables 2 and 3, the proposed model (MI-LSTM-MVO) is presented to predict NO₂ and SO₂. The results illustrate that the efficiency and stability of the proposed model is better than the other provided model for both outputs forecasting. Based on the accuracy of the proposed forecasting model, the air pollution produced by a Combined Cycle Power Plant can be predicted and managed. Additionally, accurate and reliable forecasting methods can be used by policy makers as useful tool to accurately define strategies, plans and rules for decreasing atmospheric pollution. Figure 7 shows the MAPE error of the provided models for NO₂ and SO₂ forecasting.

Conclusion

As a globally important phenomenon, more attention is paying to air pollution, mainly because predicting the polluted days both can prevent negative health outcomes and provides the necessary information to increase policy-makers’ awareness. Identifying factors that contribute to air pollution and their trend over time has a crucial role in developing effective models aimed to reduce air pollution. In this study, a forecasting method more efficient and effective has been developed, in order to predict air pollution from manufacturing industry. The major contribution of the paper is combining the long short-term memory based on optimizing the hyper parameters of the LSTM using multi-verse optimization metaheuristic algorithm to predict NO₂ and SO₂. In addition, the proposed method is tested with the real manufacturing-related air pollution dataset from Combined Cycle Power Plant in Kerman, Iran from May to September 2019. Also, the forecasting performances of the proposed method have been compared with some benchmark methods (LSTM-PSO, ENN-MVO, and ENN-PSO). According to the results section, the proposed method (MI-LSTM-MVO) has better performances than other methods considering different months and network input variables.

Nonetheless, despite the homogenous representation of meteorology, forecasts are well linked to the observations. Compared with the uncertainty of the emission inventories, such ANN-oriented forecasting model is rapid and less resource-intensive to generate daily forecasts. The forecasting results are applicable for scientific purposes and also to take short-term corrective plans for air quality control in cities with high pollution. It can be replicated in different cities following the needed setup, optimization, and validation for emergency planning and short-term air quality control.

References

Agarwal S, Sharma S, Suresh R, Rahman MdH, Vranckx S, Maiheu B, Blythb L, Janssen S, Gargava P, Shukl VK, Batra S (2020) Air quality forecasting using artificial neural networks with real time dynamic error correction in highly polluted regions. Sci Total Environ 735:139454. https://doi.org/10.1016/j.scitotenv.2020.139454
Article CAS Google Scholar
Andersson SM, Martinsson BG, Friberg J, Brenninkmeijer CAM, Rauthe-Schöch A, Hermann M, Van Velthoven PFJ, Zahn A (2013) Composition and evolution of volcanic aerosol from eruptions of Kasatochi, Sarychev and Eyjafjallajökull in 2008–2010 based on CARIBIC observations. Atmos Chem Phys 13(4):1781–1796. https://doi.org/10.5194/acp-13-1781-2013
Article CAS Google Scholar
Aydin G (2014) Production modeling in the oil and natural gas industry: an application of trend analysis. Pet Sci Technol 32(5):555–564. https://doi.org/10.1080/10916466.2013.825271
Article CAS Google Scholar
Aydin G (2015a) Forecasting natural gas production using various regression models. Pet Sci Technol 33(15–16):1486–1492. https://doi.org/10.1080/10916466.2015.1076842
Article CAS Google Scholar
Aydin G (2015b) Regression models for forecasting global oil production. Pet Sci Technol 33(21–22):1822–1828. https://doi.org/10.1080/10916466.2015.1101474
Article CAS Google Scholar
Aydin G, Jang H, Topal E (2016) Energy consumption modeling using artificial neural networks: the case of the world’s highest consumers. Energy Sources Part B 11(3):212–219. https://doi.org/10.1080/15567249.2015.1075086
Article Google Scholar
Babel MS, Badgujar GB, Shinde VR (2015) Using the mutual information technique to select explanatory variables in artificial neural networks for rainfall forecasting. Meteorol Appl 616:610–616. https://doi.org/10.1002/met.1495
Article Google Scholar
Brook RD, Franklin B, Cascio W, Hong Y, Howard G, Lipsett M, Luepker R, Mittleman M, Samet J, Smith SC, Tager I (2004) Air pollution and cardiovascular disease: a statement for healthcare professionals from the expert panel on population and prevention science of the American heart association. Circulation 109(21):2655–2671. https://doi.org/10.1161/01.CIR.0000128587.30041.C8
Article Google Scholar
Chen C, Zhao B, Weschler CJ (2012) Assessing the influence of indoor exposure to “outdoor ozone” on the relationship between ozone and short-term mortality in US communities. Environ Health Perspect 120(2):235–240. https://doi.org/10.1289/ehp.1103921
Article CAS Google Scholar
Ding S, Dang YG, Li XM, Wang JJ, Zhao K (2017) Forecasting Chinese CO₂ emissions from fuel combustion using a novel grey multivariable model. J Clean Prod 162:1527–1538. https://doi.org/10.1016/j.jclepro.2017.06.167
Article Google Scholar
Ding S, Xu N, Ye J, Zhou W, Zhang X (2020) Estimating Chinese energy-related CO₂ emissions by employing a novel discrete grey prediction model. J Clean Prod 259:120793. https://doi.org/10.1016/j.jclepro.2020.120793
Article CAS Google Scholar
Ding Z, Chen H, Zhou L (2021) Optimal group selection algorithm in air quality index forecasting via cooperative information criterion. J Clean Prod 283:125248. https://doi.org/10.1016/j.jclepro.2020.125248
Article CAS Google Scholar
Elman JL (1990) Finding structure in time. Cogn Sci 14(2):179–211
Article Google Scholar
Fathy A, Rezk H (2018) Multi-verse optimizer for identifying the optimal parameters of PEMFC model. Energy 143:634–644. https://doi.org/10.1016/j.energy.2017.11.014
Article Google Scholar
Feng YY, Zhang LX (2012) Scenario analysis of urban energy saving and carbon abatement policies: A case study of Beijing city, China. Procedia Environ Sci 13:632–644. https://doi.org/10.1016/j.proenv.2012.01.055
Article CAS Google Scholar
Feng X, Li Q, Zhu Y, Hou J, Jin L, Wang J (2015) Artificial neural networks forecasting of PM_2.5 pollution using air mass trajectory based geographic model and wavelet transformation. Atmos Environ 107:118–128. https://doi.org/10.1016/j.atmosenv.2015.02.030
Article CAS Google Scholar
García MV, Aznarte JL (2020) Ecological Informatics Shapley additive explanations for NO₂ forecasting. Eco Inform 56(2):101039. https://doi.org/10.1016/j.ecoinf.2019.101039
Article Google Scholar
Gennaro G, Trizio L, Di A, Pey J, Pérez N, Cusack M, Alastuey A, Querol X (2013) Neural network model for the prediction of PM₁₀ daily concentrations in two sites in the Western Mediterranean. Sci Total Environ 463–464:875–883. https://doi.org/10.1016/j.scitotenv.2013.06.093
Article CAS Google Scholar
Gers FA, Schmidhuber J, Cummins F (1999) Learning to forget: continual prediction with LSTM. In: 9th International conference on artificial neural networks: ICANN, pp 850–855
Gorai AK, Tuluri F, Tchounwou PB (2014) A GIS based approach for assessing the association between air pollution and asthma in New York State, USA. Int J Environ Res Public Health 11(5):4845–4869. https://doi.org/10.3390/ijerph110504845
Article CAS Google Scholar
Hähnela P, Mareček J, Monteil J, O’Donncha F (2020) Using deep learning to extend the range of air pollution monitoring and forecasting. J Comput Phys 408:109278. https://doi.org/10.1016/j.jcp.2020.109278
Article CAS Google Scholar
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780
Article CAS Google Scholar
Jiang P, Li C, Li R, Yang H (2019) An innovative hybrid air pollution early-warning system based on pollutants forecasting and Extenics evaluation. Knowl-Based Syst 164:174–192. https://doi.org/10.1016/j.knosys.2018.10.036
Article Google Scholar
Köne AÇ, Büke T (2010) Forecasting of CO₂ emissions from fuel combustion using trend analysis. Renew Sustain Energy Rev 14:2906–2915. https://doi.org/10.1016/j.rser.2010.06.006
Article CAS Google Scholar
Kumar A, Patil RS, Kumar A, Rakesh D (2016) Comparison of predicted vehicular pollution concentration with air quality standards for different time periods. Clean Technol Environ Policy 18(7):2293–2303. https://doi.org/10.1007/s10098-016-1147-6
Article CAS Google Scholar
Kumar A, Patil RS, Kumar A, Rakesh D (2017) Application of AERMOD for short-term air quality prediction with forecasted meteorology using WRF model. Clean Technol Environ Policy 19(7):1955–1965. https://doi.org/10.1007/s10098-017-1379-0
Article CAS Google Scholar
Li R, Jin Y (2018) The early-warning system based on hybrid optimization algorithm and fuzzy synthetic evaluation model. Inf Sci 435:296–319. https://doi.org/10.1016/j.ins.2017.12.040
Article Google Scholar
Li Y, Jiang P, She Q, Lin G (2018) Research on air pollutant concentration prediction method based on self-adaptive neuro-fuzzy weighted extreme learning machine. Environ Pollut 241:1115–1127. https://doi.org/10.1016/j.envpol.2018.05.072
Article CAS Google Scholar
Linares C, Falcón I, Ortiz C, Díaz J (2018) An approach estimating the short-term effect of NO₂ on daily mortality in Spanish cities. Environ Int 116(2):18–28. https://doi.org/10.1016/j.envint.2018.04.002
Article CAS Google Scholar
Liu D-R, Lee S-J, Huang Y, Chiu C-J (2020) Air pollution forecasting based on attention-based LSTM neural network and ensemble learning. Expert Syst 37(3):1–16. https://doi.org/10.1111/exsy.12511
Article CAS Google Scholar
Madaan D (2019) Real time attention based bidirectional long short-term memory networks for air pollution forecasting. In: 2019 IEEE fifth international conference on Big Data computing service and applications (BigDataService), pp 151–158. https://doi.org/https://doi.org/10.1109/BigDataService.2019.00027
Maleki H, Sorooshian A, Goudarzi G, Baboli Z, Tahmasebi Y (2019) Air pollution prediction by using an artificial neural network model. Clean Technol Environ Policy 21(6):1341–1352. https://doi.org/10.1007/s10098-019-01709-w
Article CAS Google Scholar
Mirjalili S, Mirjalili SM, Hatamlou A (2016) Multi-Verse optimizer: a nature-inspired algorithm for global optimization. Neural Comput Appl 27(30):495–513. https://doi.org/10.1007/s00521-015-1870-7
Article Google Scholar
Mohammadi M, Ghasempour R, Astaraei FR, Ahmadi E, Aligholian A, Toopshekan A (2018) Optimal planning of renewable energy resource for a residential house considering economic and reliability criteria. Electr Power Energy Syst 96:261–273. https://doi.org/10.1016/j.ijepes.2017.10.017
Article Google Scholar
Nejat P, Kasir A, Jomehzadeh F, Behzad H, Saeed M, Majid MZA (2013) Iran’s achievements in renewable energy during fourth development program in comparison with global trend. Renew Sustain Energy Rev 22:561–570. https://doi.org/10.1016/j.rser.2013.01.042
Article Google Scholar
Prasad K, Gorai AK, Goyal P (2016) Development of ANFIS models for air quality forecasting and input optimization for reducing the computational cost and time. Atmos Environ 128:246–262. https://doi.org/10.1016/j.atmosenv.2016.01.007
Article CAS Google Scholar
Say NP, Yücel M (2006) Energy consumption and CO₂ emissions in Turkey: empirical analysis and future projection based on an economic growth. Energy Policy 34:3870–3876. https://doi.org/10.1016/j.enpol.2005.08.024
Article Google Scholar
Sen P, Roy M, Pal P (2016) Application of ARIMA for forecasting energy consumption and GHG emission: a case study of an Indian pig iron manufacturing organization. Energy 116:1031–1038. https://doi.org/10.1016/j.energy.2016.10.068
Article Google Scholar
Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27(3):379–423
Article Google Scholar
Sun W, Sun J (2017) Daily PM_2.5 concentration prediction based on principal component analysis and LSSVM optimized by cuckoo search algorithm. J Environ Manag 188:144–152. https://doi.org/10.1016/j.jenvman.2016.12.011
Article CAS Google Scholar
Tao Q, Liu F, Li Y, Sidorov D (2019) Air pollution forecasting using a deep learning model based on 1D convnets and bidirectional GRU. IEEE Access 7:76690–76698. https://doi.org/10.1109/ACCESS.2019.2921578
Article Google Scholar
Tofigh AA, Abedian M (2016) Analysis of energy status in Iran for designing sustainable energy roadmap. Renew Sustain Energy Rev 57:1296–1306. https://doi.org/10.1016/j.rser.2015.12.209
Article Google Scholar
Valput D, Navares R, Aznarte JL (2019) Forecasting hourly NO₂ concentrations by ensembling neural networks and mesoscale models. Neural Comput Appl 32:9331–9342. https://doi.org/10.1007/s00521-019-04442-z
Article Google Scholar
Wang ZX, Ye DJ (2017) Forecasting Chinese carbon emissions from fossil energy consumption using non-linear grey multivariable models. J Clean Prod 142:600–612. https://doi.org/10.1016/j.jclepro.2016.08.067
Article Google Scholar
Yang HH, Van Vuuren S, Sharma S, Hermansky H (2000) Relevance of time–frequency features for phonetic and speaker-channel classification. Speech Commun 31(1):35–50. https://doi.org/10.1016/S0167-6393(00)00007-8
Article Google Scholar
Yuan X, Chen C, Jiang M, Yuan Y (2019) Prediction interval of wind power using parameter optimized Beta distribution based LSTM model. Appl Soft Comput J 82:105550. https://doi.org/10.1016/j.asoc.2019.105550
Article Google Scholar
Zeinalnezhad M, Gholamzadeh A, Kleme J (2020) Air pollution prediction using semi-experimental regression model and adaptive neuro-fuzzy inference system. J Clean Prod 261:121218. https://doi.org/10.1016/j.jclepro.2020.121218
Article CAS Google Scholar

Download references

Funding

Open access funding provided by Università degli Studi di Roma La Sapienza within the CRUI-CARE Agreement.

Author information

Authors and Affiliations

Department of Astronautical, Electrical and Energy Engineering (DIAEE), Sapienza University, Rome, Italy
Azim Heydari, Meysam Majidi Nezhad & Livio De Santoli
Department of Planning, Design, and Technology of Architecture, Sapienza University of Rome, Rome, Italy
Davide Astiaso Garcia
Department of Energy Management and Optimization, Institute of Science and High Technology and Environmental Sciences, Graduate University of Advanced Technology, Kerman, Iran
Farshid Keynia

Authors

Azim Heydari
View author publications
You can also search for this author in PubMed Google Scholar
Meysam Majidi Nezhad
View author publications
You can also search for this author in PubMed Google Scholar
Davide Astiaso Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Farshid Keynia
View author publications
You can also search for this author in PubMed Google Scholar
Livio De Santoli
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Azim Heydari.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Heydari, A., Majidi Nezhad, M., Astiaso Garcia, D. et al. Air pollution forecasting application based on deep learning model and optimization algorithm. Clean Techn Environ Policy 24, 607–621 (2022). https://doi.org/10.1007/s10098-021-02080-5

Download citation

Received: 03 November 2020
Accepted: 01 April 2021
Published: 14 April 2021
Issue Date: March 2022
DOI: https://doi.org/10.1007/s10098-021-02080-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Air pollution forecasting application based on deep learning model and optimization algorithm