Abstract
This paper comprehensively reviews and compares methodologies used to monitor air quality and their impact on human health. With urbanization and industrialization increasing in emerging nations, air pollution levels have become a significant threat to human well-being. The study highlights the importance of reducing exposure to air pollution for the improvement of public health. The paper focuses on the comparative analysis of measuring the Air Quality Index (AQI) using deep learning algorithms like Long Short-Term Memory (LSTM) and classical machine learning models such as Autoregressive Integrated Moving Average (ARIMA), Decision Tree, K-Nearest Neighbour, Extreme Gradient Boosting, Gradient Boosting, Adaptive Boosting, Huber Regressor, and Dummy Regressor for AQI prediction. The performance of these models is evaluated using daily and hourly time series data from 2014 to 2018, with the Root Mean Squared Error (RMSE) used as the performance indicator. The results demonstrate that LSTM outperforms ARIMA, particularly with hourly data. For daily data, ARIMA achieved an RMSE of 97.88, whereas LSTM obtained an RMSE of 143.07. On the other hand, for hourly data, ARIMA yielded an RMSE of 69.65, while LSTM achieved a lower RMSE of 44.6539. These findings highlight the potential of deep learning algorithms, specifically LSTM, in accurately forecasting air quality.
Similar content being viewed by others
References
Quah, S. R. (2016). International encyclopedia of public health. Academic press.
Chen, S., & Sizhu, Wu. (2020). Deep learning for identifying environmental risk factors of acute respiratory diseases in Beijing, China: Implications for population with different age and gender. International Journal of Environmental Health Research, 30(4), 435–446.
Wang, D., Wei, S., Luo, H., Yue, C., & Grunder, O. (2017). A novel hybrid model for AQI forecasting based on two-phase decomposition technique and modified extreme learning machine. Science of the Total Environment, 580, 719–733.
Kleine Deters, J., Zalakeviciute, R., Gonzalez, M., & Rybarczyk, Y. (2017). Modeling PM 2.5 urban pollution using machine learning and selected meteorological parameters. Journal of Electrical and Computer Engineering. https://doi.org/10.1155/2017/5106045
AlJanabi, S., Yaqoob, A., & Mohammad, M. (2020). Pragmatic method based on intelligent big data analytics to prediction air pollution. Big data and networks technologies (pp. 84–109). Springer International Publishing.
Al-Janabi, S., & Alkaim, A. F. (2020). A nifty collaborative analysis to predicting a novel tool (DRFLLS) for missing values estimation. Soft Computing, 24(1), 555–569.
Al-Janabi, S., & Mahdi, M. A. (2019). Evaluation prediction techniques to achievement an optimal biomedical analysis. International Journal of Grid and Utility Computing, 10(5), 512–527.
Tsan, Y.-T., Chen, D.-Y., Liu, P.-Y., Kristiani, E., Lan, K., Nguyen, P., & Yang, C.-T. (2022). The prediction of influenza-like illness and respiratory disease using LSTM and ARIMA. International Journal of Environmental Research and Public Health, 19, 1858. https://doi.org/10.3390/ijerph
Sethi, J. K., & Mittal, M. (2019). A new feature selection method based on machine learning technique for air quality dataset. Journal of Statistics and Management Systems, 22(4), 697–705.
Adhikary, R., & Batra, N. (2020). Computational tools for understanding air pollution. In Adjunct Proceedings of the 2020 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2020 ACM International Symposium on Wearable Computers, pp 199–203.
Janarthanan, R., Partheeban, P., Somasundaram, K., & Elamparithi, P. N. (2021). A deep learning approach for prediction of air quality index in a metropolitan city. Sustainable Cities and Society, 67, 102720.
Han, Y., Lam, J. C., Li, V. O., & Reiner, D. (2021). A Bayesian LSTM model to evaluate the effects of air pollution control regulations in Beijing, China. Environmental Science and Policy, 115, 26–34.
Ma, J., Cheng, J. C., Lin, C., Tan, Y., & Zhang, J. (2019). Improving air quality prediction accuracy at larger temporal resolutions using deep learning and transfer learning techniques. Atmospheric Environment, 214, 116885.
Saravanan, D., & Kumar, K. S. (2021). Improving air pollution detection accuracy and quality monitoring based on bidirectional RNN and the Internet of Things. Materials Today: Proceedings., 81, 791–796.
Bousquet, J., Anto, J. M., Annesi-Maesano, I., Dedeu, T., Dupas, E., Pépin, J. L., Eyindanga, L. S. Z., Arnavielhe, S., Ayache, J., Basagana, X., & Benveniste, S. (2018). POLLAR: impact of air POLLution on Asthma and Rhinitis; a European Institute of Innovation and Technology Health (EIT Health) project. Clinical and Translational Allergy, 8, 1–13.
Betetta-Gomez, J., Medina-Ramos, C., Tafur-Anzualdo, I., & de María Diaz-Diaz, F. (2020). Deep learning to predict outpatient visits by respiratory illnesses in a high PM10 environment. In 2020 IEEE XXVII International Conference on Electronics, Electrical Engineering and Computing (INTERCON) (pp. 1–4). IEEE.
Giri, N., Chavan, S., Heda, R., Israni, R., & Sethiya, R. (2019, December). Disease Migration, Mitigation, and Containment: Impact of Climatic Conditions & Air Quality on Tuberculosis for India. In 2019 IEEE Pune Section International Conference (PuneCon) (pp. 1–6). IEEE.
Klompmaker, J. O., Hart, J. E., Peter James, M., Sabath, B., Xiao, W., Zanobetti, A., Dominici, F., & Laden, F. (2021). Air pollution and cardiovascular disease hospitalization–Are associations modified by greenness, temperature and humidity? Environment International, 156, 106715.
Liu, D.-R., Lee, S.-J., Huang, Y., & Chiu, C.-J. (2020). Air pollution forecasting based on attention-based LSTM neural network and ensemble learning. Expert Systems, 37(3), e12511.
Alharbi, E., Nadeem, F., & Cherif, A. (2021). Smart healthcare framework for asthma attack prediction and prevention. In 2021 National Computing Colleges Conference (NCCC) (pp. 1–6). IEEE.
Wang, Ce., Feng, L., & Qi, Yi. (2021). Explainable deep learning predictions for illness risk of mental disorders in Nanjing, China. Environmental Research, 202, 111740.
Khatibi, T., & Karampour, N. (2021). Predicting the number of hospital admissions due to mental disorders from air pollutants and weather condition descriptors using stacked ensemble of Deep Convolutional models and LSTM models (SEDCMLM). Journal of Cleaner Production, 280, 124410.
Mao, W., Jiao, L., Wang, W., Wang, J., Tong, X., & Zhao, S. (2021). A hybrid integrated deep learning model for predicting various air pollutants. GIScience and Remote Sensing, 58(8), 1395–1412.
Gatti, R. C., Velichevskaya, A., Tateo, A., Amoroso, N., & Monaco, A. (2020). Machine learning reveals that prolonged exposure to air pollution is associated with SARS-CoV-2 mortality and infectivity in Italy. Environmental Pollution, 267, 115471.
Suleiman, A., Tight, M. R., & Quinn, A. D. (2019). Applying machine learning methods in managing urban concentrations of traffic-related particulate matter (PM10 and PM2.5). Atmospheric Pollution Research, 10(1), 134–144.
Maleki, H., Sorooshian, A., Goudarzi, G., Baboli, Z., TahmasebiBirgani, Y., & Rahmati, M. (2019). Air pollution prediction by using an artificial neural network model. Clean Technologies and Environmental Policy, 21(6), 1341–1352.
Berkowicz, R. (2000). OSPM-A parameterised street pollution model. Environmental Monitoring and Assessment, 65, 323–331.
Funding
Not applicable.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author(s) declare(s) that there is no conflict of interest.
Human and animal rights
This study does not include any human participants or animals; hence, any informed consent or animal welfare statement does not apply to this study.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mishra, A., Gupta, Y. Comparative analysis of Air Quality Index prediction using deep learning algorithms. Spat. Inf. Res. 32, 63–72 (2024). https://doi.org/10.1007/s41324-023-00541-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41324-023-00541-1