Abstract
The objective of this work is to predict daily PM2.5 air quality in Dakar, Senegal using data from an automated measurement station integrated into a server using a data assimilation model. Initially, a 3-year data set was used to identify and validate an appropriate ARIMA data assimilation model. The data was split into an 80% training set and a 20% test set. The Augmented Dickey-Fuller (ADF) test was used to check the normality of the data series. Subsequently, we used the AutoArima method to determine the optimal model to represent the time series. Preliminary results show that a model with order (2,1,1) accurately represents the series. Additional analysis using model fit tests showed that the (3, 0, 1) model was most effective in representing and predicting the data. The statistical validation performance of this model demonstrates its capability to forecast PM2.5 concentrations for up to 72 h (3 days), achieving correlation coefficients exceeding 80%. However, after three days, the predictions returned to background levels. In the final stage of the study, data from automatic stations were integrated into a server hosting the assimilation model to improve daily PM2.5 forecasts for Dakar. An interactive platform was developed to visualize measurements and forecasts over two days. The results show that by integrating the data with the assimilation model, predictions are significantly improved.
Similar content being viewed by others
Data availability
The data underlying our article will be available upon request, in accordance with the publisher’s policies and subject to relevant ethical and legal considerations. To access the data, please contact ahmed.gueye@ucad.edu.sn.
References
Agence Internationale de l’énergie (AIE) (2020) World Energy Outlook 2020. https://www.iea.org/reports/world-energy-outlook-2020, Paris. Licence : CC BY 4.0
Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19(6):716–723. https://doi.org/10.1109/TAC.1974.1100705
Arku R, Birch A, Shupler M, Yusuf S, Hystad P, Brauer M (2018) Characterizing exposure to household air pollution within the prospective urban rural epidemiology (pure) study. Environ Int 114:307–317. https://doi.org/10.1016/j.envint.2018.02.033
Bhatti UA, Yan Y, Zhou M, Ali S, Hussain A, Qingsong H, Yu Z, Yuan L (2021) Time series analysis and forecasting of air pollution particulate matter (pm2.5): an Sarima and factor analysis approach. IEEE Access 9:41019–41031. https://doi.org/10.1109/ACCESS.2021.3060744
Burnham PK, Anderson R (2004) D: Multimodel inference: understanding AIC and BIC in model selection. Sociol Methods Res 2(33):261–304. https://doi.org/10.1177/0049124104268644
Chang Y, Park JY (2002) On the asymptotics of adf tests for unit roots. Economet Rev 21(4):431–447. https://doi.org/10.1081/ETC-120015385
Cheng Y, Zhang H, Liu Z, Chen L, Wang P (2019) Hybrid algorithm for short-term forecasting of pm2.5 in China. Atmos Environ 200:264–279. https://doi.org/10.1016/j.atmosenv.2018.12.025
Diaz-Robles LA, Ortega JC, Fu JS, Reed GD, Chow JC, Watson JG, Moncada-Herrera JA (2008) A hybrid Arima and artificial neural networks model to forecast particulate matter in urban areas: the case of temuco, chile. Atmos Environ 42(35):8331–8340. https://doi.org/10.1016/j.atmosenv.2008.07.020
Dieme D, Cabral-Ndior M, Garçon G, Verdin A, Billet S, Cazier F, Courcot D, Diouf A, Shirali P (2012) Relationship between physicochemical characterization and toxicity of fine particulate matter (pm2.5) collected in Dakar city (Senegal). Environ Res 113:1–13. https://doi.org/10.1016/j.envres.2011.11.009
Drame M, Jenkins G, Camara M, Robjhon M (2011) Observations and simulation of a Saharan air layer event with a midtropospheric dust layer at Dakar. Senegal 116:6–7. https://doi.org/10.1029/2011JD016368. (6-7 July 2010)
European Environment Agency, González Ortiz A, Guerreiro C, Soares J (2020) Air quality in Europe—2020 Report. https://doi.org/10.2800/786656
Fu S, Deng X, Shupe MD, Xue H (2019) A modelling study of the continuous ice formation in an autumnal arctic mixed-phase cloud case. Atmos Res 228:77–85. https://doi.org/10.1016/j.atmosres.2019.05.021
Gakidou E, Afshin A, Abajobir AA, Abate KH (2017) All: Global, regional, and national comparative risk assessment of 84 behavioural, environmental and occupational, and metabolic risks or clusters of risks, 1990–2016: a systematic analysis for the global burden of disease study 2016. The Lancet 390(10100):1345–1422. https://doi.org/10.1016/S0140-6736(17)32366-8
Gopu P, Panda RR, Nagwani NK (2021) Time series analysis using Arima model for air pollution prediction in hyderabad city of India. In: Soft Computing and Signal Processing, pp 47–56. Springer, Singapore. https://doi.org/10.1007/978-981-33-6912-2_5
Gueye A, Drame MS, Diallo M, Ngom B, Ndao Niang D, Faye A, Pusede S (2023) Pm2.5 hotspot identification in Dakar area: an innovative IoT and mapping-based approach for effective air quality management. In: 2023 IEEE 9th International Conference on Smart Instrumentation, Measurement and Applications (ICSIMA), pp 302–307. https://doi.org/10.1109/ICSIMA59853.2023.10373491
Hyndman RJ, Khandakar Y (2008) Automatic time series forecasting: the forecast package for R. J Stat Softw 27(3):1–22. https://doi.org/10.18637/jss.v027.i03
Hyndman R, Athanasopoulos G (2018) Forecasting: Principles and Practice, 2nd edn. OTexts, Australia. https://otexts.org/fpp2/
Hyndman R, Athanasopoulos G (2021) Forecasting: Principles and Practice, 3nd edn. OTexts, Australia. https://otexts.org/fpp3/
Kadri F, Abdennbi K, Chaabane S, Nouaouri I (2020) A recurrent neural network approach to forecasting demand for care in hospital emergency departments. In: 10th Francophone Conference on Hospital Systems Management and Engineering. https://uphf.hal.science/hal-03199566
Karroum K, Lin Y, Chiang Y (2020) coll: A review of air quality modeling. MAPAN 35:287–300. https://doi.org/10.1007/s12647-020-00371-8
Kelly FJ, Fussell JC (2015) Air pollution and public health: emerging hazards and improved understanding of risk. J Stat Softw 37:631–649. https://doi.org/10.1007/s10653-015-9720-1
Khan S, Alghulaiakh H (2020) Arima model for accurate time series stocks forecasting. Int J Adv Comput Sci Appl. https://doi.org/10.14569/IJACSA.2020.0110765
Lelieveld J, Klingmüller K, Pozzer A, Pöschl U, Fnais M, Daiber A, Münzel T (2019) Cardiovascular disease burden from ambient air pollution in Europe reassessed using novel hazard ratio functions. Eur Heart J 40(20):1590–1596. https://doi.org/10.1093/eurheartj/ehz135
Lingxiao Z, Li Z, Leilei Q (2002) Forecasting of Beijing pm2.5 with a hybrid Arima model based on integrated AIC and improved GS fixed-order methods and seasonal decomposition. Heliyon. https://doi.org/10.1016/j.heliyon.2022.e12239
Makridakis S, Hibon M (2000) The m3-competition: results, conclusions and implications. Int J Forecast 16(4):451–476. https://doi.org/10.1016/S0169-2070(00)00057-1. (The M3- Competition)
Mehndiratta P, Jain A, Srivastava S, Gupta N (2013) Environmental pollution and nanotechnology. Environ Pollut 2:49. https://doi.org/10.5539/ep.v2n2p49
Murthiand MN, Srinivasulu A, Kumari VP, Naidu MB (2021) Comparing two Arima models for daily stock price data. J Stat Softw 9(11):1606–1610. https://doi.org/10.22214/ijraset.2021.38943
Ngom B, Diallo M, Marilleau N (2020) Medart-mas: meta-model of data assimilation on real-time multi-agent simulation. In: 2020 IEEE/ACM 24th International Symposium on Distributed Simulation and Real Time Applications (DS-RT), pp 1–7. https://doi.org/10.1109/DS-RT50469.2020.9213694
Ngom B, Diallo M, Seyc MR, Drame MS, Cambier C, Marilleau N (2021) Pm10 data assimilation on real-time agent-based simulation using machine learning models: case of Dakar urban air pollution study. In: 2021 IEEE/ACM 25th International Symposium on Distributed Simulation and Real Time Applications (DS-RT), pp 1–4. https://doi.org/10.1109/DS-RT52167.2021.9576143
Shen Z, Sun J, Cao J, Zhang L, Zhang Q, Lei Y, Gao J, Huang R-J, Liu S, Huang Y, Zhu C, Xu H, Zheng C, Liu P, Xue Z (2016) Chemical profiles of urban fugitive dust pm2.5 samples in northern Chinese cities. Sci Total Environ 569–570:619–626. https://doi.org/10.1016/j.scitotenv.2016.06.156
Sun Q, Zhuang J, Du Y, Xu D, Li T (2018) Design and application of a web-based real-time personal pm2.5 exposure monitoring system. Sci Total Environ 627:852–859. https://doi.org/10.1016/j.scitotenv.2018.01.299
Sun J, Shen Z, Zhang L, Lei Y, Gong X, Zhang Q, Zhang T, Xu H, Cui S, Wang Q, Cao J, Tao J, Zhang N, Zhang R (2019) Chemical source profiles of urban fugitive dust pm2.5 samples from 21 cities across china. Sci Total Environ 649:1045–1053. https://doi.org/10.1016/j.scitotenv.2018.08.374
Xie Y, Chen X, Zhang L (2023) Prediction of pm2.5 concentration based on cnnlstm deep learning model. In: 2023 Asia-Europe Conference on Electronics, Data Processing and Informatics (ACEDPI). https://doi.org/10.1109/ACEDPI58926.2023.00051
Yang Y, Wang Z, Cao C, Xu M, Yang X, Wang K, Guo H, Gao X, Li J, Shi Z (2024) Estimation of pm2.5 concentration across china based on multi-source remote sensing data and machine learning methods. Remote Sens. https://doi.org/10.3390/rs16030467
Acknowledgements
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest relevant to this study.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gueye, A., Drame, M.S., Niang, S.A.A. et al. Enhancing PM2.5 Predictions in Dakar Through Automated Data Integration into a Data Assimilation Model. Aerosol Sci Eng (2024). https://doi.org/10.1007/s41810-024-00230-y
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s41810-024-00230-y