A novel hybrid model for multi-step daily AQI forecasting driven by air pollution big data

  • 38 Accesses


Air pollution is considered as a serious issue for the society and economy. Air pollution forecasting can offer a timely and effective early warning. In the current mainstream studies, the information from high-resolution air pollution data is usually ignored. In this study, the useful information from high-resolution big data (1 h) is utilized sufficiently. In the proposed model, high-resolution data (1 h) of different air pollution indices (AQI, PM2.5, PM10, SO2, NO2, O3, and CO concentrations) are utilized to enhance the performance, including 7 × 24,000 data samples. Besides, the variational mode decomposition (VMD) and the least absolute shrinkage and selector operation (LASSO) are proposed to preprocess and reshape the input data. The stacked auto-encoder (SAE) is proposed to reduce dimension and extract features. The deep echo state network (DESN) is trained to generate daily forecasting results. The high-resolution input is converted to low-resolution output in this way. Air pollution data of four polluted Chinese cities are utilized to verify the effectiveness, stability, and universality of the proposed model. The proposed model can outperform low-resolution forecasting models and other benchmark models.

This is a preview of subscription content, log in to check access.

Access options

Buy single article

Instant unlimited access to the full article PDF.

US$ 39.95

Price includes VAT for USA

Subscribe to journal

Immediate online access to all issues from 2019. Subscription will auto renew annually.

US$ 99

This is the net price. Taxes to be calculated in checkout.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5


  1. Al-Ali A, Zualkernan I, Aloul F (2010) A mobile GPRS-sensors array for air pollution monitoring. IEEE Sensors J 10:1666–1671

  2. Bai Y, Zeng B, Li C, Zhang J (2019) An ensemble long short-term memory neural network for hourly PM2.5 concentration forecasting. Chemosphere 222:286–294.

  3. Błaszczyk E, Rogula-Kozłowska W, Klejnowski K, Kubiesa P, Fulara I, Mielżyńska-Švach D (2017) Indoor air quality in urban and rural kindergartens: short-term studies in Silesia, Poland. Air Qual Atmos Health 10:1207–1220.

  4. Dragomiretskiy K, Zosso D (2014) Variational mode decomposition. IEEE Trans Signal Process 62:531–544

  5. Gallicchio C, Micheli A (2017) Deep echo state network (deepesn): a brief survey. arXiv preprint arXiv:171204323

  6. Lei MT, Monjardino J, Mendes L, Gonçalves D, Ferreira F (2019) Macao air quality forecast using statistical methods. Air Qual Atmos Health 12:1049–1057.

  7. Liu H, Chen C (2019) Prediction of outdoor PM2.5 concentrations based on a three-stage hybrid neural network model. Atmos Pollut Res.

  8. Liu H, Duan Z, Chen C (2019a) A hybrid framework for forecasting PM2.5 concentrations using multi-step deterministic and probabilistic strategy. Air Qual Atmos Health 12:785–795.

  9. Liu H, Jin K, Duan Z (2019b) Air PM2.5 concentration multi-step forecasting using a new hybrid modeling method: comparing cases for four cities in China. Atmos Pollut Res 10:1588–1600.

  10. Liu H, Wu H, Lv X, Ren Z, Liu M, Li Y, Shi H (2019c) An intelligent hybrid model for air pollutant concentrations forecasting: case of Beijing in China. Sustain Cities Soc 47:101471.

  11. Liu H, Xu Y, Chen C (2019d) Improved pollution forecasting hybrid algorithms based on the ensemble method. Appl Math Model 73:473–486

  12. Murillo-Escobar J, Sepulveda-Suescun JP, Correa MA, Orrego-Metaute D (2019) Forecasting concentrations of air pollutants using support vector regression improved with particle swarm optimization: case study in Aburrá Valley, Colombia. Urban Clim 29:100473.

  13. Niu M, Wang Y, Sun S, Li Y (2016) A novel hybrid decomposition-and-ensemble model based on CEEMD and GWO for short-term PM2. 5 concentration forecasting. Atmos Environ 134:168–180

  14. Robichaud A (2017) Surface data assimilation of chemical compounds over North America and its impact on air quality and Air Quality Health Index (AQHI) forecasts. Air Qual Atmos Health 10:955–970.

  15. Sun K, Chen X, Wang J, Zhang T, Zhu Z (2018) Investigation of air quality over the largest city in central China using high resolution satellite derived aerosol optical depth data. Atmos Pollut Res 9:584–593.

  16. Tibshirani R (1996) Regression shrinkage and selection via the lasso: a retrospective. J R Stat Soc 58:267–288

  17. Ventura LMB, de Oliveira PF, Soares LM, Luna AS, Gioda A (2019) Forecast of daily PM2.5 concentrations applying artificial neural networks and Holt–Winters models. Air Qual Atmos Health 12:317–325.

  18. Wang Y-d, X-k F, Jiang W, Wang T, Tsou M-H, X-y Y (2017) Inferring urban air quality based on social media. Comput Environ Urban Syst 66:110–116

  19. Wu B, Li T, Baležentis T, Štreimikienė D (2019) Impacts of income growth on air pollution-related health risk: exploiting objective and subjective measures. Resour Conserv Recycl 146:98–105

  20. Wu Q, Lin H (2019) Daily urban air quality index forecasting based on variational mode decomposition, sample entropy and LSTM neural network. Sustain Cities Soc 50:101657.

  21. Yuan X, Huang B, Wang Y, Yang C, Gui W (2018) Deep learning-based feature representation and its application for soft sensor modeling with variable-wise weighted SAE. IEEE Trans Ind Inform 14:3235–3243

  22. Zhai B, Chen J (2018) Development of a stacked ensemble model for forecasting and analyzing daily average PM 2.5 concentrations in Beijing, China. Sci Total Environ 635:644–658

  23. Zhang L et al (2018) Trend analysis and forecast of PM2. 5 in Fuzhou, China using the ARIMA model. Ecol Indic 95:702–710

  24. Zhu S, Lian X, Liu H, Hu J, Wang Y, Che J (2017) Daily air quality index forecasting with hybrid models: a case in China. Environ Pollut 231:1232–1244

Download references

Author information

Correspondence to Hui Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix. Forecasting results

Appendix. Forecasting results

Table 4 Evaluation indices of all models in Experiment 1
Table 5 Evaluation indices of different models in Experiment 2
Fig. 6

Evaluation indices of all models in Experiment 1

Fig. 7

Scatter diagrams of forecasting results in Experiment 2 (Xiamen)

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Xu, Y., Liu, H. & Duan, Z. A novel hybrid model for multi-step daily AQI forecasting driven by air pollution big data. Air Qual Atmos Health (2020).

Download citation


  • Big data driven
  • Multi-step AQI forecasting
  • High/low-resolution conversion
  • Multiple index input