Att-ConvLSTM: PM2.5 Prediction Model and Application

  • Zhe Xu
  • Yi LvEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1074)


Environment protection department need to grasp the concentration of PM2.5 in a future moment when monitoring. However, the existing PM2.5 prediction studies only forecast short-term time points, and cannot accurately give the trend of the next period of time. In this paper, a PM2.5 prediction model based on Att-ConvLSTM model integrated training method is established by the advantage of ConvLSTM to obtain spatiotemporal information. Then, Experiments were performed using DNN, ARIMA and LSTM as control model with Att-ConvLSTM model and used it for application test. The result demonstrated that prediction model can extract spatiotemporal features with attention mechanism and ConvLSTM. The model can reduce the generalization error of the model when predicting other observation points.


Attention mechanism Spatiotemporal information Att-ConvLSTM Prediction Generalization error 


  1. 1.
    Zhou, S., Li, W., Qiao, J.: Prediction of PM2.5 concentration based on self-organizing recurrent fuzzy neural network. CAAI Trans. Intell. Syst. 13(04), 509–516 (2018)Google Scholar
  2. 2.
    Chen, J., Lu, J., Avise, J.C., et al.: Seasonal modeling of PM2.5 in California’s San Joaquin Valley. Atmos. Environ. 92, 182–190 (2014)CrossRefGoogle Scholar
  3. 3.
    Fan, J., Li, Q., Zhu, Y., Hou, J., Feng, X.: Aspatio-temporal prediction framework for air pollution based on deep RNN. Sci. Surv. Mapp. 42(7), 76–83Google Scholar
  4. 4.
    Li, X., Peng, L., Yao, X., et al.: Long short-term memory neural network for air pollutant concentration predictions: method development and evaluation. Environ. Pollut. 231(Pt 1), 997–1004 (1987)Google Scholar
  5. 5.
    Zheng, Y., Liu, F., Hsieh, H.-P.: U-air: when urban air quality inference meets big data. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1436–1444. ACM, New York (2013)Google Scholar
  6. 6.
    Li, X., Peng, L., Hu, Y., et al.: Deep learning architecture for air quality predictions. Environ. Sci. Pollut. Res. 23(22), 22408–22417 (2016)CrossRefGoogle Scholar
  7. 7.
    Yan, L., Wu, Y., Yan, L., et al.: Encoder-decoder model for forecast of PM2.5 concentration per hour. In: 2018 1st International Cognitive Cities Conference (IC3), pp. 45–50 (2018)Google Scholar
  8. 8.
    Wen, C., Liu, S., Yao, X., et al.: A novel spatiotemporal convolutional long short-term neural network for air pollution prediction. Sci. Total Environ. 654, 1091–1099 (2019)CrossRefGoogle Scholar
  9. 9.
    Shi, X., Chen, Z., Wang, H., et al.: Convolutional LSTM network: a machine learning approach for precipitation nowcasting. arXiv:1506.04214 (2015)
  10. 10.
    Mnih, V., Heess, N., Graves, A., et al.: Recurrent models of visual attention. In: Advances in Neural Information Processing Systems, pp. 2204–2212 (2014)Google Scholar
  11. 11.
    Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997)CrossRefGoogle Scholar
  12. 12.
    Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. In: 31st Annual Conference on Neural Information Processing Systems, NIPS 2017, 4–9 December 2017, pp. 5999–6009. Neural Information Processing Systems Foundation, December 2017Google Scholar
  13. 13.
    Yang, J., Zhang, Y., Zhu, Y.: Classification performance of support vector machine with ε-insensitive loss function. J. Xi’an Jiaotong Univ. 41(11), 1315–1320 (2007)zbMATHGoogle Scholar
  14. 14.
    Lang, Y., Xiao, L.: Forecasting concentrations of PM2.5 in main urban area of Hangzhou and mapping using SARIMA model and ordinary Kringing method. Acta Sci. Circumstantiae 38, 62–70 (2018)Google Scholar
  15. 15.
    Zheng, G., Wang, T., Yang Y., Zhang, X.: PM2.5 concentration prediction model based on genetic wavelet neural network. Geomat. Spat. Inf. Technol. 41(09), 248–250+256 (2018)Google Scholar
  16. 16.
    Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45(11), 2673–2681 (1997)CrossRefGoogle Scholar
  17. 17.
    Wang, X., Girshick, R., Gupta, A., et al.: Non-local neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7794–7803 (2018)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Faculty of Information TechnologyBeijing University of TechnologyBeijingChina

Personalised recommendations