Abstract
Particulate matter has a detrimental consequence on the health of living organisms throughout the world and predicting their concentration is very imperative to assess their impact on human health. Faridabad is the most populated and largest city of Haryana, India, and the current study was designed to foresee the PM2.5 content by different modeling techniques: (1) support vector machine (SVM), (ii) random forest (RF), (iii) artificial neural network (ANN), (iv) M5P model, and (v) Gaussian process regression (GP). Collected data (659 observations) from May 2015 to May 2018 were used to develop the models. Parameters such as temperature (T), ground-level ozone (O3), sulfur dioxide (SO2), nitrogen dioxide (NO2), nitric oxide (NO), NOx, wind speed (WS), wind direction (WD), relative humidity (RH), bar pressure (BP), and solar radiation (SR) are used as input parameters for prediction of PM2.5. The results of all the models suggested that RF model with testing correlation coefficient (CC) 0.8312, mean absolute error (MAE) 30.7757, R2 (correlation of determination) 0.6909, and root mean square error (RMSE) 44.6947 is the best estimator for appraisal of PM2.5 followed by SVM, GP, M5P, and ANN models. The sensitivity analysis results indicated that wind speed is the utmost influencing parameter for the estimation of PM2.5. Abilities of different models were compared and RF was established as the best technique based on assessment criteria. We recommend more studies employing RF and other techniques as hybrids that lead to better models.
Similar content being viewed by others
References
Bao CK, Lu YS, Shang JC (2004) Framework and operational procedure for implementing strategic environmental assessment in China. Environ Impact Assess Rev 24(1):27–46. https://doi.org/10.1016/S0195-9255(03)00137-9
Borja-Aburto VH, Castillejos M, Gold DR, Bierzwinski S, Loomis D (1998) Mortality and ambient fine particles in Southwest Mexico City, 1993-1995. Environ Health Perspect 106:849–855. https://doi.org/10.2307/3434129
Canu S, Smola A (2006) Kernel methods and the exponential family. Neurocomputing 69(7–9):714–720
Chow JC (1995) Measurement methods to determine compliance with ambient air quality standards for suspended particles. J Air Waste Manage Assoc 45(5):320–382. https://doi.org/10.1080/10473289.1995.10467369
Cortes C, Vapnik V (1995) Support-vector networks. Machine learning 20(3):273–297. https://doi.org/10.1007/BF00994018
Davidson CI, Phalen RF, Solomon PA (2005) Airborne particulate matter and human health: a review. Aerosol Sci Technol 39:737–749. https://doi.org/10.1080/02786820500191348
Dunea D, Iordache S, Liu H-Y, Bøhler T, Pohoata A, Radulescu C (2016) Quantifying the impact of PM2.5 and associated heavy metals on respiratory health of children near metallurgical facilities. Environ Sci Pollut Res Int 23:15395–15406. https://doi.org/10.1007/s11356-016-6734-x
Fattore E, Paiano V, Borgini A, Tittarelli A, Bertoldi M, Crosignani P, Fanelli R (2011) Human health risk in relation to air quality in two municipalities in an industrialized area of Northern Italy. Environmental research 111(8):1321–1327
Feng X, Li Q, Zhu Y, Hou J, Jin L, Wang J (2015) Artificial neural networks forecasting of PM 2.5 pollution using air mass trajectory based geographic model and wavelet transformation. Atmos Environ 107:118–128. https://doi.org/10.1016/j.atmosenv.2015.02.030
Ghorani-Azam A, Riahi-Zanjani B, Balali-Mood M (2016) Effects of air pollution on human health and practical measures for prevention in Iran. J Res Med Sci 21:65–72. https://doi.org/10.4103/1735-1995.189646
Hulin M, Caillaud D, Annesi Maesano I (2010) Indoor air pollution and childhood asthma: variations between urban and rural areas. Indoor Air 20(6):502–514. https://doi.org/10.1111/j.1600-0668.2010.00673.x
Kim S, Shiri J, Singh VP, Kisi O, Landeras G (2015) Predicting daily pan evaporation by soft computing models with limited climatic data. Hydrol Sci J 60:1120–1136. https://doi.org/10.1080/02626667.2014.945937
Kisi O, Parmar KS, Soni K, Demir V (2017) Modeling of air pollutants using least square support vector regression, multivariate adaptive regression spline, and M5 model tree models. Air QualAtmos Health 10:873–883. https://doi.org/10.1007/s11869-017-0477-9
Kuss M (2006) Gaussian process models for robust regression, classification, and reinforcement learning (Doctoral dissertation, Technische Universität)
Li Q, Guo Y, Song J-Y, Song Y, Ma J, Wang HJ (2018) Impact of long term exposure to local PM10 on children’s blood pressure: a Chinese national cross-sectional study. Air QualAtmos Health 11:705–713. https://doi.org/10.1007/s11869-018-0577-1
Liu JC, Peng RD (2018) Health effect of mixtures of ozone, nitrogen dioxide, and fine particulates in 85 US counties. Air QualAtmos Health 11:311–324. https://doi.org/10.1007/s11869-017-0544-2
Liu KF-R, Lu C-F, Chen C-W, Shen Y-S (2012) Applying Bayesian belief networks to health risk assessment. Stoch Env Res Risk A 26:451–465. https://doi.org/10.1007/s00477-011-0470-z
Marzouni MB, Alizadeh T, Banafsheh MR, Khorshiddoust AM, Ghozikali MG, Akbaripoor S, Mehdipour V, Memarianfard M (2017) Application of support vector machine and gene expression programming on tropospheric ozone prognosticating for Tehran metropolitan. CivEng J 3:557. https://doi.org/10.28991/cej-030984
Mehdipour V, Stevenson DS, Memarianfard M, Sihag P (2018) Comparing different methods for statistical modeling of particulate matter in Tehran, Iran. Air Qual Atmos Health 11(10):1155–1165
Parsaie A, Haghiabi AH (2016) Numerical routing of tracer concentrations in rivers with stagnant zones. Water Supply 17(3):825–834. https://doi.org/10.2166/ws.2016.175
Parsaie A, Azamathulla HM, Haghiabi AH (2017) Physical and numerical modeling of performance of detention dams. J Hydrol. https://doi.org/10.1016/j.jhydrol.2017.01.018
Pope CA, Ezzati M, Cannon JB, Allen RT, Jerrett M, Burnett RT (2018) Mortality risk and PM 2.5 air pollution in the USA: an analysis of a national prospective cohort. Air Quality, Atmosphere & Health. 11(3):245–252
Quinlan JR (1992) Learning with continuous classes. In 5th Australian joint conference on artificial intelligence. 92:343–348
Rasmussen CE, Williams CK (2006) Gaussian processes for machine learning, vol. 1. MIT press, 39, 40–43
Schweitzer L, Zhou J (2010) Neighborhood air quality, respiratory health, and vulnerable populations in compact and sprawled regions. J Am Plan Assoc 76:363–371. https://doi.org/10.1080/01944363.2010.486623
Singh B, Sihag P, Singh K (2017) Modelling of impact of water quality on infiltration rate of soil by random forest regression. Modeling Earth Systems and Environment 3(3):999–1004
Sharifi SS, Rezaverdinejad V, Nourani V (2016) Estimation of daily global solar radiation using wavelet regression, ANN, GEP and empirical models: a comparative study of selected temperature based approaches. J Atmos Sol TerrPhys 149:131–145. https://doi.org/10.1016/j.jastp.2016.10.008
Turner MC, Krewski D, Pope CA III, Chen Y, Gapstur SM, Thun MJ (2011) Long-term ambient fine particulate matter air pollution and lung cancer in a large cohort of never-smokers. Am J Respir Crit Care Med 184(12):1374–1381. https://doi.org/10.1164/rccm.201106-1011OC
Vapnik V (1998) The support vector method of function estimation. In: Suykens J.A.K., Vandewalle J. (eds) Nonlinear Modeling. Springer, Boston, MA
Xing YF, Xu YH, Shi MH, Lian YX (2016) The impact of PM2.5 on the human respiratory system. J Thorac Dis 8:E69–E74. https://doi.org/10.3978/j.issn.2072-1439.2016.01.19
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Sihag, P., Kumar, V., Afghan, F.R. et al. Predictive modeling of PM2.5 using soft computing techniques: case study—Faridabad, Haryana, India. Air Qual Atmos Health 12, 1511–1520 (2019). https://doi.org/10.1007/s11869-019-00755-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11869-019-00755-z