Abstract
Air pollution poses a serious threat to public health and for the environment, thus predicting air quality is very crucial for the health and well-being of individuals and the environment. Economic development drives rapid industrialization and urbanization, which are significant sources of air pollution in developing countries. Kuwait’s rapid urbanization and vehicular traffic, along with dust storms, make it a prime location for research for environmental pollution. Keeping this in view, a study was designed to evaluates various machine learning prediction methods for particulate matter concentrations (\(PM_{10}\)) in Kuwait. The prediction models were developed using three different algorithms, including k-nearest neighbor (KNN), artificial neural network (ANN) and support vector regression (SVR). The models were developed using a 3-year dataset collected by Kuwait Environmental Public Authority (K-EPA) for two stations selected in this study (Al-Ahmadi and Al-Salam). The performance of the models was evaluated using various metrics, including Mean Biased Error (MBE), Root Mean Squared Error (RMSE), Normalized Root Mean Squared Error (nRMSE) and Coefficient of Determination (\(R^{2}\)). The results show that both stations experienced severe air quality issues and that particulate matter concentrations (PM10) are strongly influenced by the different meteorological and pollutant variables. The findings show that for the Al-Ahmadi location, artificial neural networks (ANN) (\(R_{cal}^{2}\) = 0.885, \(R_{val}^{2}\) = 0.775) and K-nearest neighbor (KNN) (\(R_{cal}^{2}\) = 0.895, \(R_{val}^{2}\) = 0.613) were good, while for Al-Salam, KNN (\(R_{cal}^{2}\) = 0.945, \(R_{val}^{2}\) = 0.715) was a better choice to predict \(PM_{10}\). These models can be used by the decision-makers to impose pollution controls, evaluate policies, or plan targeted actions to reduce particle matter.
Similar content being viewed by others
Data availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
References
Chen B, Kan H (2008) Air pollution and population health: a global challenge. Environ Health Prev Med 13:94–101
Draxler RR, Gillette DA, Kirkpatrick JS, Heller J (2001) Estimating PM10 air concentrations from dust storms in Iraq. Kuwait and Saudi Arabia. Atmos. Environ. 35:4315–4330
Heal MR, Kumar P, Harrison RM (2012) Particles, air quality, policy and health. Chem Soc Rev 41:6606–6630
Wilson AM, Salloway JC, Wake CP, Kelly T (2004) Air pollution and the demand for hospital services: a review. Environ Int 30:1109–1118
Hand J, Gill T, Schichtel B (2019) Urban and rural coarse aerosol mass across the United States: Spatial and seasonal variability and long-term trends. Atmospheric Environment. 218, https://doi.org/10.1016/j.atmosenv.2019.117025
Tsiouri V, Kakosimos K, Kumar P (2014) Concentrations, physicochemical characteristics and exposure risks associated with particulate matter in the Middle East Area-A review. Air Qual Atmos Health 8:67–80
Althuwaynee OF, Balogun AL, Al Madhoun W (2020) Air pollution hazard assessment using decision tree algorithms and bivariate probability cluster polar function: evaluating inter-correlation clusters of PM10 and other air pollutants. GIScience Remote Sens. 57:207–226
Hewson EW (1956) Meteorological factors affecting causes and controls of air pollution. J Air Pollut Control Assoc 5:235–241
Tian G, Qiao Z, Xu X (2014) Characteristics of particulate matter (PM10) and its relationship with meteorological factors during 2001–2012 in Beijing. Environ Pollut 192:266–274
Qiu H, Yu IT, Tian L, Wang X, Tse LA, Tam W, Wong TW (2012) Effects of coarse particulate matter on emergency hospital admissions for respiratory diseases: A time-series analysis in Hong Kong. Environ Health Perspect 120:572–576
Hassan H, Latif MT, Juneng L, Amil N, Khan MF, Yik DJ, Abdullah NA (2020) Interaction of PM10 concentrations with local and synoptic meteorological conditions at different temporal scales. Atmos Res 241:104975
Jacob DJ, Winner DA (2009) Effect of climate change on air quality. Atmos Environ 43:51–63
Liu Y, Wang T (2020) Worsening urban ozone pollution in China from 2013 to 2017-Part 1: The complex and varying roles of meteorology. Atmos Chem Phys 20:6305–6321
Querol X, Alastuey A, Ruiz C, Artiñano B, Hansson H, Harrison R, Buringh E, Ten Brink H, Lutz M, Bruckmann P et al (2004) Speciation and origin of PM10 and PM2.5 in selected European cities. Atmos Environ 38:6547–6555
Liu CN, Chen SC, Tsai CJ (2011) A novel multifilter PM10-PM2. 5 sampler (MFPPS). Aerosol Sci Technol 45:1480–1487
Mok KM, Hoi KI (2005) Effects of meteorological conditions on PM 10 concentrations-A study in Macau. Environ Monit Assess 102:201–223
Oanh NK, Chutimon P, Ekbordin W, Supat W (2005) Meteorological pattern classification and application for forecasting air pollution episode potential in a mountain-valley area. Atmos Environ 39:1211–1225
Hao J, Wang L (2005) Improving urban air quality in China: Beijing case study. J Air Waste Manag Assoc 55:1298–1305
Kumar P, Robins A, Vardoulakis S, Britter RBR, Gurjar AN, Harrison RM (2011) Preliminary estimates of nanoparticle number emissions from road vehicles in megacity Delhi and associated health impacts. Environ Sci Technol 2011(45):5514–5521
Kumar P, Ketzel M, Vardoulakis S, Pirjola L, Britter R (2011) Dynamics and dispersion modelling of nanoparticles from road traffic in the urban atmospheric environment-A review. J Aerosol Sci 42:580–603
Al-Hurban A, Khader S, Alsaber A, Pan J (2021) Air Quality Assessment in the State of Kuwait during 2012 to 2017. Atmosphere 12:678
Al-Awadhi JM, AlShuaibi AA (2013) Dust fallout in Kuwait city: deposition and characterization. Sci Total Environ 461:139–148
Wang S, Wang J, Zhou Z, Shang K (2005) Regional characteristics of three kinds of dust storm events in China. Atmos Environ 39:509–520
Alolayan MA, Brown KW, Evans JS, Bouhamra WS, Koutrakis P (2013) Source apportionment of fine particles in Kuwait City. Sci Total Environ 448:14–25
Brown KW, Bouhamra W, Lamoureux DP, Evans JS, Koutrakis P (2008) Characterization of particulate matter for three sites in Kuwait. J Air Waste Manag Assoc 58:994–1003
Holmes NS, Morawska L (2006) A review of dispersion modelling and its application to the dispersion of particles: An overview of different dispersion models available. Atmos Environ 40:5902–5928
Kang GK, Gao JZ, Chiao S, Lu S, Xie G (2018) Air quality prediction: Big data and machine learning approaches. Int. J. Environ. Sci. Dev. 9:8–16
Al-Shayji K, Lababidi H, Al-Rushoud D, Al-Adwani H (2008) Development of a fuzzy air quality performance indicator. Kuwait J. Sci. Eng. 35:101–126
Fitz-Simons T (1999 Jul 1) Guideline for Reporting of Daily Air Quality: Air Quality Index (AQI); Technical Report; Environmental Protection Agency, Office of Air Quality Planning and and Standards, Research Triangle Park, NC (United States);
Norazian MN, Shukri YA, Azam RN, Al Bakri AMM (2008) Estimation of missing values in air pollution data using single imputation techniques. ScienceAsia 34:341–345
Alsaber AR, Pan J, Al-Hurban A (2021) Handling complex missing data using random forest approach for an air quality monitoring dataset: A case study of Kuwait environmental data (2012 to 2018). Int J Environ Res Public Health 18:1333
Shafique MA (2022) Imputing missing data in hourly traffic counts. Sensors 22:9876
Alsaber A, Pan J, Al-Herz A, Alkandary DS, Al-Hurban A, Setiya P, Group K, et al. (2020) Influence of ambient air pollution on rheumatoid arthritis disease activity score Index. Int. J. Environ. Res. Public Health, 17, 416
Andrew AM (2000) An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods by Nello Christianini and John Shawe-Taylor, Cambridge University Press, Cambridge, 2000, xiii+ 189 pp., ISBN 0-521-78019-5 (Hbk, £27.50). Robotica, 18, 687–689
Ortiz-García E, Salcedo-Sanz S, Pérez-Bellido Á, Portilla-Figueras J, Prieto L (2010) Prediction of hourly O3 concentrations using support vector regression algorithms. Atmos Environ 44:4481–4488
Sánchez AS, Nieto PG, Fernández PR, del Coz Díaz J, Iglesias-Rodríguez FJ (2011) Application of an SVM-based regression model to the air quality study at local scale in the Avilés urban area (Spain). Math Comput Model 54:1453–1466
Chang CC, Lin CJ (2002) Training v-support vector regression: theory and algorithms. Neural Comput 14:1959–1977
Schölkopf B, Smola AJ, Williamson RC, Bartlett PL (2000) New support vector algorithms. Neural Comput 12:1207–1245
Huang Q, Mao J, Liu Y (November 2012) An improved grid search algorithm of SVR parameters optimization. In Proceedings of the 2012 IEEE 14th International Conference on Communication Technology, Chengdu, China, 9–11; pp. 1022–1026
Bai Y, Li Y, Wang X, Xie J, Li C (2016) Air pollutants concentrations forecasting using back propagation neural network based on wavelet decomposition with meteorological conditions. Atmos Pollut Res 7:557–566
Moazami S, Noori R, Amiri BJ, Yeganeh B, Partani S, Safavi S (2016) Reliable prediction of carbon monoxide using developed support vector machine. Atmos Pollut Res 7:412–418
Emamgholizadeh S, Kashi H, Marofpoor I, Zalaghi E (2014) Prediction of water quality parameters of Karoon River (Iran) by artificial intelligence-based models. Int J Environ Sci Technol 11:645–656
Dahikar SS, Rode SV (2014) Agricultural crop yield prediction using artificial neural network approach. Int. J. Innov. Res. Electr. Electron. Instrum. Control. Eng. 2:683–686
Kakati N, Deka RL, Das P, Goswami J, Khanikar PG, Saikia H (2022) Forecasting yield of rapeseed and mustard using multiple linear regression and ANN techniques in the Brahmaputra valley of Assam. North East India. Theor. Appl. Climatol. 150:1201–1215
BARAN B (2021) Air quality Index prediction in besiktas district by artificial neural networks and k nearest neighbors. Mühendislik Bilimleri ve Tasarım Dergisi, 9, 52–63
Ul-Saufie AZ, Yahya AS, Ramli NA, Hamid HA (2011) Comparison between multiple linear regression and feed forward back propagation neural network models for predicting PM10 concentration level based on gaseous and meteorological parameters. Int. J. Appl. 1:42–49
Willmott CJ, Robeson SM, Matsuura K (2012) A refined index of model performance. Int J Climatol 32:2088–2094
Cacciola R, Sarva M, Polosa R Adverse respiratory effects and allergic susceptibility in relation to particulate air pollution: flirting with disaster
Bu-Olayan A, Thomas B (2012) Dispersion model on PM 2.5 fugitive dust and trace metals levels in Kuwait Governorates. Environ Monit Assess 184:1731–1737
Munir S, Gabr S, Habeebullah TM, Janajrah MA (2016) Spatiotemporal analysis of fine particulate matter (PM2. 5) in Saudi Arabia using remote sensing data. Egypt. J. Remote Sens. Space Sci. 19:195–205
Jayamurugan R, Kumaravel B, Palanivelraja S, Chockalingam M (2013) Influence of temperature, relative humidity and seasonal variability on ambient air quality in a coastal urban area. Int. J. Atmos. Sci. 2013:1–7
Ganguly R, Sharma D, Kumar P (2019) Trend analysis of observational PM10 concentrations in Shimla city. India. Sustain. Cities Soc. 51:101719
Chaloulakou A, Kassomenos P, Spyrellis N, Demokritou P, Koutrakis P (2003) Measurements of PM10 and PM2.5 particle concentrations in Athens. Greece. Atmos. Environ. 37:649–660
Suleiman A, Tight M, Quinn A (2019) Applying machine learning methods in managing urban concentrations of traffic-related particulate matter (PM10 and PM2.5). Atmos Pollut Res 10:134–144
Künzli N, Kaiser R, Medina S, Studnicka M, Chanel O, Filliger P, Herry M, Horak F Jr, Puybonnieux-Texier V, Quénel P et al (2000) Public-health impact of outdoor and traffic-related air pollution: a European assessment. Lancet 356:795–801
He HD, Pan W, Lu WZ, Xue Y, Peng GH (2016) Multifractal property and long-range cross-correlation behavior of particulate matters at urban traffic intersection in Shanghai. Stoch. Environ. Res. Risk Assess. 30:1515–1525
Park DU, Ha KC (2008) Characteristics of PM10, PM2.5, CO2 and CO monitored in interiors and platforms of subway train in Seoul. Korea. Environ. Int. 34:629–634
Plocoste T, Laventure S (2023) Forecasting PM10 Concentrations in the Caribbean Area Using Machine Learning Models. Atmosphere 14:134
Kujawska J, Kulisz M, Oleszczuk P, Cel W (2022) Machine Learning Methods to Forecast the Concentration of PM10 in Lublin. Poland. Energies 15:6428
Shaziayani WN, Ul-Saufie AZ, Mutalib S, Mohamad Noor N, Zainordin NS (2022) Classification Prediction of PM10 Concentration Using a Tree-Based Machine Learning Approach. Atmosphere 13:538
Peng J, Han H, Yi Y, Huang H, Xie L (2022) Machine learning and deep learning modeling and simulation for predicting PM2.5 concentrations. Chemosphere308, 136353
Rahi P, Sood SP, Bajaj R, Kumar Y (2021) Air quality monitoring for Smart eHealth system using firefly optimization and support vector machine. Int J Inf Technol 13:1847–1859
Ghose U, Bisht U (2020) Tailored feedforward artificial neural network based link prediction. Int J Inf Technol 12:757–765
Bozdağ A, Dokuz Y, Gökcçek ÖB (2020) Spatial prediction of PM10 concentration using machine learning algorithms in Ankara. Turkey. Environmental Pollution 263:114635
Masood A, Ahmad K (2020) A model for particulate matter (PM2. 5) prediction for Delhi based on machine learning approaches. Procedia Computer Science 167:2101–2110
Ramessur MA, Nagowah SD (2021) A predictive model to estimate effort in a sprint using machine learning techniques. Int J Inf Technol 13:1101–1110
Mittal K, Aggarwal G, Mahajan P (2019) Performance study of K-nearest neighbor classifier and K-means clustering for predicting the diagnostic accuracy. Int J Inf Technol 11:535–540
Hamid Y, Shah FA, Sugumaran M (2019) Wavelet neural network model for network intrusion detection system. Int J Inf Technol 11:251–263
Acknowledgements
The authors would like to gratefully acknowledge the pollutant data provided by the Data Management Department of the K-EPA through the Environmental Monitoring Information System of Kuwait (eMISK).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Alsaber, A., Alsahli, R., Al-Sultan, A. et al. Evaluation of various machine learning prediction methods for particulate matter \(PM_{10}\) in Kuwait. Int. j. inf. tecnol. 15, 4505–4519 (2023). https://doi.org/10.1007/s41870-023-01521-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-023-01521-2