Skip to main content

Advertisement

Log in

Integration of complete ensemble empirical mode decomposition with deep long short-term memory model for particulate matter concentration prediction

  • Research Article
  • Published:
Environmental Science and Pollution Research Aims and scope Submit manuscript

Abstract

The atmospheric particulate matter (PM) with a diameter of 2.5 μm or less (PM2.5) is one of the key indicators of air pollutants. Accurate prediction of PM2.5 concentration is very important for air pollution monitoring and public health management. However, the presence of noise in PM2.5 data series is a major challenge of its accurate prediction. A novel hybrid PM2.5 concentration prediction model is proposed in this study by combining complete ensemble empirical mode decomposition (CEEMD) method, Pearson’s correlation analysis, and a deep long short-term memory (LSTM) method. CEEMD was employed to decompose historical PM2.5 concentration data to different frequencies in order to enhance the timing characteristics of data. Pearson’s correlation was used to screen the different frequency intrinsic-mode functions of decomposed data. Finally, the filtered enhancement data were inputted to a deep LSTM network with multiple hidden layers for training and prediction. The results evidenced the potential of the CEEMD-LSTM hybrid model with a prediction accuracy of approximately 80% and model convergence after 700 training epochs. The secondary screening of Pearson’s correlation test improved the model (CEEMD-Pearson) accuracy up to 87% but model convergence after 800 epochs. The hybrid model combining CEEMD-Pearson with the deep LSTM neural network showed a prediction accuracy of nearly 90% and model convergence after 650 interactions. The results provide a clear indication of higher prediction accuracy of PM2.5 with less computation time through hybridization of CEEMD-Pearson with deep LSTM models and its potential to be employed for air pollution monitoring.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Ali M, Prasad R, Xiang Y, Yaseen ZM (2020) Complete ensemble empirical mode decomposition hybridized with random forest and kernel ridge regression model for monthly rainfall forecasts. J Hydrol 584:124647. https://doi.org/10.1016/j.jhydrol.2020.124647

    Article  Google Scholar 

  • Bokde N, Feijóo A, Al-Ansari N et al (2020) The hybridization of ensemble empirical mode decomposition with forecasting models: application of short-term wind speed and power modeling. Energies 13:1666

    Article  Google Scholar 

  • Cabaneros SM, Calautit JK, Hughes BR (2019) A review of artificial neural network models for ambient air pollution prediction. Environ Model Softw 119:285–304

    Article  Google Scholar 

  • Chen P-H, Lin C-J, Schölkopf B (2005) A tutorial onν-support vector machines. Appl Stoch Model Bus Ind 21:111–136. https://doi.org/10.1002/asmb.537

    Article  Google Scholar 

  • Cheng M, Wang B, Yang M, Ma J, Ye Z, Xie L, Zhou M, Chen W (2020) microRNAs expression in relation to particulate matter exposure: a systematic review. Environ Pollut 260:113961

    Article  CAS  Google Scholar 

  • Considine EM, Reid CE, Ogletree MR, Dye T (2020) Improving accuracy of air pollution exposure measurements: statistical correction of a municipal low-cost airborne particulate matter sensor network. Environ Pollut 268:115833. https://doi.org/10.1016/j.envpol.2020.115833

    Article  CAS  Google Scholar 

  • Dhingra S, Madda RB, Gandomi AH, Patan R, Daneshmand M (2019) Internet of Things mobile–air pollution monitoring system (IoT-Mobair). IEEE Internet Things J 6:5577–5584

    Article  Google Scholar 

  • Fei S (2016) A hybrid model of EMD and multiple-kernel RVR algorithm for wind speed prediction. Int J Electr Power Energy Syst 78:910–915. https://doi.org/10.1016/j.ijepes.2015.11.116

    Article  Google Scholar 

  • Fu M, Fan T, Ding Z, Salih SQ, Al-Ansari N, Yaseen ZM (2020) Deep learning data-intelligence model based on adjusted forecasting window scale: application in daily streamflow simulation. IEEE Access 8:32632–32651. https://doi.org/10.1109/ACCESS.2020.2974406

  • Geng J, Huang M-L, Li M-W, Hong W-C (2015) Hybridization of seasonal chaotic cloud simulated annealing algorithm in a SVR-based load forecasting model. Neurocomputing 151:1362–1373. https://doi.org/10.1016/j.neucom.2014.10.055

    Article  Google Scholar 

  • Gers FA, Schmidhuber J, Cummins F (2000) Learning to forget: continual prediction with LSTM. Neural Comput 12:2451–2471. https://doi.org/10.1162/089976600300015015

    Article  CAS  Google Scholar 

  • Goyal P, Kumar A (2011) Mathematical modeling of air pollutants: an application to Indian urban city. https://doi.org/10.5772/16840

  • Hashim BM, Al-Naseri SK, Al Maliki A et al (2021) On the investigation of COVID-19 lockdown influence on air pollution concentration: regional investigation over eighteen provinces in Iraq. Environ Sci Pollut Res 1–19. https://doi.org/10.1007/s11356-021-13812-x

  • Hochreiter S, Schmidhuber JJ (1997) Long short-term memory. Neural Comput 9:1–32. https://doi.org/10.1162/neco.1997.9.8.1735

    Article  CAS  Google Scholar 

  • Huang C-J, Kuo P-H (2018) A deep CNN-LSTM model for particulate matter (PM2. 5) forecasting in smart cities. Sensors 18:2220

    Article  Google Scholar 

  • Huang K, Bi J, Meng X, Geng G, Lyapustin A, Lane KJ, Gu D, Kinney PL, Liu Y (2019) Estimating daily PM2.5 concentrations in New York City at the neighborhood-scale: implications for integrating non-regulatory measurements. Sci Total Environ 697:134094. https://doi.org/10.1016/j.scitotenv.2019.134094

    Article  CAS  Google Scholar 

  • Khosravi K, Daggupati P, Alami MT, Awadh SM, Ghareb MI, Panahi M, Pham BT, Rezaie F, Qi C, Yaseen ZM (2019) Meteorological data mining and hybrid data-intelligence models for reference evaporation simulation: a case study in Iraq. Comput Electron Agric 167:105041

    Article  Google Scholar 

  • Liu X, Liu Q, Zou Y, Wang G (2018) A self-organizing LSTM-based approach to PM2.5 forecast. Cloud Computing and Security, pp 683–693

  • Liu H, Jin K, Duan Z (2019a) Air PM2.5 concentration multi-step forecasting using a new hybrid modeling method: comparing cases for four cities in China. Atmos Pollut Res 10:1588–1600. https://doi.org/10.1016/j.apr.2019.05.007

    Article  CAS  Google Scholar 

  • Liu K, Wu Q, Wang L, Wang S, Liu T, Ding D, Tang Y, Li G, Tian H, Duan L, Wang X, Fu X, Feng X, Hao J (2019b) Measure-specific effectiveness of air pollution control on China’s atmospheric mercury concentration and deposition during 2013–2017. Environ Sci Technol 53:8938–8946

    Article  CAS  Google Scholar 

  • Liu H, Duan Z, Chen C (2020) A hybrid multi-resolution multi-objective ensemble model and its application for forecasting of daily PM2.5 concentrations. Inf Sci 516:266–292. https://doi.org/10.1016/j.ins.2019.12.054

    Article  Google Scholar 

  • Loy-Benitez J, Vilela P, Li Q, Yoo C (2019) Sequential prediction of quantitative health risk assessment for the fine particulate matter in an underground facility using deep recurrent neural networks. Ecotoxicol Environ Saf 169:316–324. https://doi.org/10.1016/j.ecoenv.2018.11.024

    Article  CAS  Google Scholar 

  • Malhotra P, Vig L, Shroff G, Agarwal P (2015) Long short term memory networks for anomaly detection in time series. In: 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2015 - Proceedings.

  • Mihăiţă AS, Dupont L, Chery O, Camargo M, Cai C (2019) Evaluating air quality by combining stationary, smart mobile pollution monitoring and data-driven modelling. J Clean Prod 221:398–418

    Article  Google Scholar 

  • Milad A, Adwan I, Majeed SA, Yusoff NIM, al-Ansari N, Yaseen ZM (2021) Emerging technologies of deep learning models development for pavement temperature prediction. IEEE Access 9:23840–23849

    Article  Google Scholar 

  • Pak U, Ma J, Ryu U, Ryom K, Juhyok U, Pak K, Pak C (2020) Deep learning-based PM2.5 prediction considering the spatiotemporal correlations: a case study of Beijing, China. Sci Total Environ 699:133561. https://doi.org/10.1016/j.scitotenv.2019.07.367

    Article  CAS  Google Scholar 

  • Pallozzi E, Guidolotti G, Mattioni M, Calfapietra C (2020) Particulate matter concentrations and fluxes within an urban park in Naples. Environ Pollut 266:115134. https://doi.org/10.1016/j.envpol.2020.115134

    Article  CAS  Google Scholar 

  • Qi C, Zhou W, Lu X, Luo H, Pham BT, Yaseen ZM (2020) Particulate matter concentration from open-cut coal mines: a hybrid machine learning estimation. Environ Pollut 263:114517. https://doi.org/10.1016/j.envpol.2020.114517

    Article  CAS  Google Scholar 

  • Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323:533–536. https://doi.org/10.1038/323533a0

    Article  Google Scholar 

  • Soh P-W, Chang J-W, Huang J-W (2018) Adaptive deep learning-based air quality prediction model using the most relevant spatial-temporal relations. IEEE Access 6:38186–38199. https://doi.org/10.1109/access.2018.2849820

    Article  Google Scholar 

  • Takahashi M, Feng Z, Mikhailova TA et al (2020) Air pollution monitoring and tree and forest decline in East Asia: a review. Sci Total Environ 742:140288

  • Tzanis CG, Alimissis A, Philippopoulos K, Deligiorgi D (2019) Applying linear and nonlinear models for the estimation of particulate matter variability. Environ Pollut 246:89–98. https://doi.org/10.1016/j.envpol.2018.11.080

    Article  CAS  Google Scholar 

  • Wang P, Zhang H, Qin Z, Zhang G (2017) A novel hybrid-Garch model based on ARIMA and SVM for PM 2.5 concentrations forecasting. Atmos Pollut Res 8:850–860. https://doi.org/10.1016/j.apr.2017.01.003

    Article  Google Scholar 

  • Wen C, Liu S, Yao X, Peng L, Li X, Hu Y, Chi T (2019) A novel spatiotemporal convolutional long short-term neural network for air pollution prediction. Environment Sci Total Environ 654:1091–1099

    Article  CAS  Google Scholar 

  • Yaseen ZM (2021) An insight into machine learning models era in simulating soil, water bodies and adsorption heavy metals: review, challenges and solutions. Chemosphere 277:130126

  • Yaseen ZM, Sulaiman SO, Deo RC, Chau K-W (2018) An enhanced extreme learning machine model for river flow forecasting: state-of-the-art, practical applications in water resource engineering area and future research direction. J Hydrol 569:387–408. https://doi.org/10.1016/j.jhydrol.2018.11.069

    Article  Google Scholar 

  • Yeh J-R, Shieh J-S, Huang NE (2010) Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method. Adv Adapt Data Anal 02:135–156. https://doi.org/10.1142/s1793536910000422

    Article  Google Scholar 

  • Yi X, Zhang J, Wang Z et al (2018) Deep distributed fusion network for air quality prediction. In: Proceedings of the 24th ACM SIGKDD. International Conference on Knowledge Discovery & Data Mining:965–973

  • Yuchi W, Gombojav E, Boldbaatar B, Galsuren J, Enkhmaa S, Beejin B, Naidan G, Ochir C, Legtseg B, Byambaa T, Barn P, Henderson SB, Janes CR, Lanphear BP, McCandless LC, Takaro TK, Venners SA, Webster GM, Allen RW (2019) Evaluation of random forest regression and multiple linear regression for predicting indoor fine particulate matter concentrations in a highly polluted city. Environ Pollut 245:746–753. https://doi.org/10.1016/j.envpol.2018.11.034

    Article  CAS  Google Scholar 

  • Zhou Y, Chang FJ, Chang LC, Kao IF, Wang YS, Kang CC (2019) Multi-output support vector machine for regional multi-step-ahead PM2.5 forecasting. Sci Total Environ 651:230–240. https://doi.org/10.1016/j.scitotenv.2018.09.111

    Article  CAS  Google Scholar 

Download references

Acknowledgements

We would like to thank Editage for English language editing.

Availability of data and materials

Data were obtained from the online website: https://www.aqistudy.cn/.

Funding

This work was supported by the National Natural Science Foundation of China (grant number 61973275).

Author information

Authors and Affiliations

Authors

Contributions

MF: data curation; formal analysis; methodology; investigation. CL: visualization; writing—original draft, review & editing draft preparation; resources; software. TF: visualization; writing—original draft, review & editing draft preparation. RP: formal analysis, project administration; writing—review & editing. DM: data curation; formal analysis; methodology; investigation; visualization; writing—original draft, review & editing draft preparation. OD: writing—original draft, review & editing draft preparation. DL: supervision, conceptualization; project administration; writing—review & editing. SS: validation, manuscript revision, analysis, investigation, and writing up. ZMY: supervision, conceptualization; project administration; writing—review & editing.

Corresponding author

Correspondence to Zaher Mundher Yaseen.

Ethics declarations

Ethics approval

The manuscript is conducted within the ethical manner advised by the Environmental Science and Pollution Research.

Consent to participate

Not applicable.

Consent for publication

The research is scientifically consented to be published.

Conflict of interest

The authors declare no competing interests.

Additional information

Responsible Editor: Marcus Schulz

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fu, ., Le, C., Fan, . et al. Integration of complete ensemble empirical mode decomposition with deep long short-term memory model for particulate matter concentration prediction. Environ Sci Pollut Res 28, 64818–64829 (2021). https://doi.org/10.1007/s11356-021-15574-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11356-021-15574-y

Keywords

Navigation