Abstract
Multivariate cluster analysis suffers from subjectivity associated with selecting input parameters and the related weights known as “Priority Rates” (PRs). This paper aims to formulate an algorithm referred to as “GA-SOM” to reduce this subjectivity by a hybrid of genetic algorithm (GA) and self-organizing map (SOM), in which GA optimizes the PRs of the input parameters by maximizing silhouette index (SI) along with interrogating SOM to calculate SI. GA-SOM was first applied to clustering groundwater level (GWL) data to identify less heterogeneous clusters in a large and heterogeneous aquifer. Then, GWL fluctuations were predicted by support vector machine (SVM) in 12 representative observation wells (OWs) by GA-SOM from 48 OWs. The results show a significant improvement in the quality of clustering so that the SI value equals 0.61 by using optimized PRs, while it did not exceed 0.38 without using PRs. Performances of SVMs are evaluated using RMSE and NSE, where for all OWs, RMSE and NSE vary in the range of 0.32–0.71 m and 0.91–0.97, respectively. Therefore, the models are found to be fit-for-purpose for predicting GWL in the aquifer.
Similar content being viewed by others
References
Bowden GJ, Maier HR, Dandy GC (2002) Optimal division of data for neural network models in water resources applications. Water Resour Res 38:2–1. https://doi.org/10.1029/2001WR000266
Chang FJ, Chang LC, Huang CW, Kao IF (2016) Prediction of monthly regional groundwater levels through hybrid soft-computing techniques. J Hydrol 541:965–976. https://doi.org/10.1016/j.jhydrol.2016.08.006
Chaudhary V, Bhatia RS, AhlawatAK (2014) A novel Self-Organizing Map (SOM) learning algorithm with nearest and farthest neurons. AEJ, 53: 827–831. https://www.sciencedirect.com/science/article/pii/S1110016814000970
Chen LH, Chen CT, Pan YG (2009) Groundwater level prediction using SOM-RBFN multisite model. J Hydrol Eng 15:624–631
Chen LH, Chen CT, Lin DW (2010) Application of integrated back-propagation network and self-organizing map for groundwater level forecasting. J Water Res Plan Man 137:352–365
Coley DA (1999) An introduction to genetic algorithms for scientists and engineers. World Scientific Publishing Co Inc.
Davies DL, Bouldin DW (1979) A cluster separation measure. IEEE T Pattern Anal. https://doi.org/10.1109/TPAMI.1979.4766909
Emberger L (1930) Sur une formule applicable en géographie botanique. Cah Herb Seanc Acad Sci 191:389–390
Fang HT, Jhong BC, Tan YC, Ke KY, Chuang MH (2019) A two-stage approach integrating SOM-and MOGA-SVM-based algorithms to forecast spatial-temporal groundwater level with meteorological factors. Water Resour Manag 33:797–818. https://doi.org/10.1007/s11269-018-2143-x
Ghazi B, Jeihouni E, Kalantari Z (2021) Predicting groundwater level fluctuations under climate change scenarios for Tasuj plain. Arab J Geosci 14:1–12. https://doi.org/10.1007/s12517-021-06508-6
Han JC, Huang Y, Li Z, Zhao C, Cheng G, Huang P (2016) Groundwater level prediction using a SOM-aided stepwise cluster inference model. J Environ Manage 182:308–321. https://doi.org/10.1016/j.jenvman.2016.07.069
Huang F, Huang J, Jiang SH, Zhou C (2017) Prediction of groundwater levels using evidence of chaos and support vector machine. J Hydroinform 19:586–606. https://doi.org/10.2166/hydro.2017.102
Hsu KC, Li ST (2010) Clustering spatial–temporal precipitation data using wavelet transform and self-organizing map neural network. Adv Water Resour 33:190–200. https://doi.org/10.1016/j.advwatres.2009.11.005
Isa NM, Aris AZ, Narany TS, Sulaiman WNA (2017) Applying the scores of multivariate statistical analyses to characterize the relationships between the hydrochemical properties and groundwater conditions in respect of the monsoon variation in Kapas Island, Terengganu Malaysia. Environ Earth Sci 76:169. https://doi.org/10.1007/s12665-017-6487-y
Khatibi R, Nadiri AA (2021) Inclusive multiple models (IMM) for predicting groundwater levels and treating heterogeneity. Geosci Front 12:713–724. https://doi.org/10.1016/j.gsf.2020.07.011
Kohonen T (1997) Self-organizing maps. Springer, Berlin
Kohonen T (1982) Self-organized formation of topologically correct feature maps. Biol Cybern 43:59–69. https://doi.org/10.1007/BF00337288
Lin GF, Chen LH (2005) Time series forecasting by combining the radial basis function network and the self organizing map. Hydrol Process 19:1925–1937. https://doi.org/10.1002/hyp.5637
Lin GF, Chen LH (2006) Identification of homogeneous regions for regional frequency analysis using the self-organizing map. J Hydrol 324:1–9. https://doi.org/10.1016/j.jhydrol.2005.09.009
Mangiameli P, Chen SK, West DA (1996) Comparison of SOM neural network and hierarchical clustering. Eur J Oper Res 93:402–417. https://doi.org/10.1016/0377-2217(96)00038-0
Moazamnia M, Hassanzadeh Y, Nadiri AA, Khatibi R, Sadeghfam S (2019) Formulating a strategy to combine artificial intelligence models using Bayesian model averaging to study a distressed aquifer with sparse data availability. J Hydrol 57:765–781. https://doi.org/10.1016/j.jhydrol.2019.02.011
Moghaddam HK, Milan SG, Kayhomayoon Z, Azar NA (2021) The prediction of aquifer groundwater level based on spatial clustering approach using machine learning. Environ Monit Assess 193:1–20. https://doi.org/10.1007/s10661-021-08961-y
Nadiri AA, Moghaddam AA, Tsai FT, Fijani E (2013) Hydrogeochemical analysis for Tasuj plain aquifer. Iran J Earth Syst Sci 122:1091–1105. https://doi.org/10.1007/s12040-013-0329-4
Nguyen TT, Kawamura A, Tong TN, Nakagawa N, Amaguchi H, Gilbuena R (2015) Clustering spatio–seasonal hydrogeochemical data using self-organizing maps for groundwater quality assessment in the Red River Delta. Vietnam J Hydrol 522:661–673. https://doi.org/10.1016/j.jhydrol.2015.01.023
Nourani V, Ejlali RG, Alami MT (2011) Spatiotemporal groundwater level forecasting in coastal aquifers by hybrid artificial neural network-geostatistics model: a case study. Environ Eng Sci 28:217–228. https://doi.org/10.1089/ees.2010.0174
Nourani V, Baghanam AH, Vousoughi VD, Alami MT (2012) Classification of groundwater level data using SOM to develop ANN-based forecasting model. Int J Soft Comput Eng 2:2231–2307
Nourani V, Alami MT, Vousoughi FD (2016) Self-organizing map clustering technique for ANN-based spatiotemporal modeling of groundwater quality parameters. J Hydroinf 18:288–309
Peeters L, Bação F, Lobo V, Dassargues A (2007) Exploratory data analysis and clustering of multivariate spatial hydrogeological data by means of GEO3DSOM, a variant of Kohonen’s Self-Organizing Map. Hydrol Earth Syst Sci Discuss 11:1309–1321. https://doi.org/10.5194/hess-11-1309-2007
Peng K, Li X, Wang Z (2015) Hydrochemical characteristics of groundwater movement and evolution in the Xinli deposit of the Sanshandao gold mine using FCM and PCA methods. Environ Earth Sci 73:7873–7888. https://doi.org/10.1007/s12665-014-3938-6
Roshni T, Jha MK, Drisya J (2020) Neural network modeling for groundwater-level forecasting in coastal aquifers. Neural Comput Appl. https://doi.org/10.1007/s00521-020-04722-z
Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65. https://doi.org/10.1016/0377-0427(87)90125-7
Sadeghfam S, Ehsanitabar A, Khatibi R, Daneshfaraz R (2018) Investigating ‘risk’ of groundwater drought occurrences by using reliability analysis. Ecol Indic 94:170–184. https://doi.org/10.1016/j.ecolind.2018.06.055
Sadeghfam S, Hassanzadeh Y, Khatibi R, Nadiri AA, Moazamnia, M (2019) Groundwater remediation through pump-treat-inject technology using optimum control by artificial intelligence (OCAI). Water Resour Manag 33:1123–1145. https://doi.org/10.1007/s11269-018-2171-6
Sanchez-martos F, Aguilera PA, Garrido-frenich A, Torres JA, Pulido-Bosch A, (2002) Assessment of groundwater quality by means of self-organizing maps: application in a semiarid area. Environ Manage 30:0716–0726. https://doi.org/10.1007/s00267-002-2746-z
Springer AE, Boldt EM, Junghans KM (2017) Local vs. regional groundwater flow delineation from stable isotopes at western North America springs. Groundwater 55:100–109. https://doi.org/10.1111/gwat.12442
Suykens JA, De Brabanter J, Lukas L, Vandewalle J (2002) Weighted least squares support vector machines: robustness and sparse approximation. Neurocomputing 48:85–105
Todd DK, Mays LW (2005) Groundwater hydrology, 3rd edn. Wiley, New Jersey
Vapnik V (1998) The support vector method of function estimation. In Nonlinear modeling 55–85. Springer, Boston, MA. https://link.springer.com/chapter/https://doi.org/10.1007/978-1-4615-5703-6_3
Yoon H, Jun SC, Hyun Y, Bae GO, Lee KK (2011) A comparative study of artificial neural networks and support vector machines for predicting groundwater levels in a coastal aquifer. J Hydrol 396:128–138. https://doi.org/10.1016/j.jhydrol.2010.11.002
Acknowledgements
The authors would like to thank West Azerbaijan Regional Water Authority for their cooperation in data preparation.
Author information
Authors and Affiliations
Corresponding author
Appendix
Rights and permissions
About this article
Cite this article
Moazamnia, M., Hassanzadeh, Y., Sadeghfam, S. et al. Formulating GA-SOM as a Multivariate Clustering Tool for Managing Heterogeneity of Aquifers in Prediction of Groundwater Level Fluctuation by SVM Model. Iran J Sci Technol Trans Civ Eng 46, 555–571 (2022). https://doi.org/10.1007/s40996-021-00759-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40996-021-00759-9