Abstract
The purpose of current study is to predict Snow Water Equivalent (SWE) in Sohrevard watershed, Iran, using different machine learning algorithms such as Bayesian Artificial Neural Network (BANN), Support Vector Machine (SVM), Cubist and Random Forest (RF) with Latin Hypercube Sampling (LHS). In this regard, nine geo-environmental variables—altitude, slope, eastness, profile curvature, plan curvature, solar radiation, Topographic Position Index (TPI), Topographic Wetness Index (TWI) and wind exposition index—were used as SWE influencing factors. Based on the results obtained from the error metrics, the RF algorithm (train and testing stages, r = 0.96 and 0.76; Root Mean Square Error (RMSE) = 2.54 and 5.46 cm; Mean Absolute Error (MAE) = 1.74 and 4.05 cm; Percent BIAS (PBIAS) = 0.4 and 2.3 respectively) was selected as the best model. Based on our findings, the highest amount of SWE was concentrated in the eastern part of the watershed. SWE modeling is a useful tool for optimal and integrated management of water resources.
Similar content being viewed by others
Data availability
The datasets generated during the current study are available from the corresponding author on reasonable request.
References
Allahbakhshian-Farsani P, Vafakhah M, Khosravi-Farsani H, Hertig E (2020) Regional flood frequency analysis through some machine learning models in semi-arid regions. Water Resour Manag 34(9):2887–2909
Allison P (1999) Multiple regression: A primer Pine Forge Press. Thousand Oaks, CA
Appelhans T, Mwangomo E, Hardy DR, Hemp A, Nauss T (2015) Evaluating machine learning approaches for the interpolation of monthly air temperature at Mt. Kilimanjaro. Tanzania Spatial Statistics 14:91–113. https://doi.org/10.1016/j.spasta.2015.05.008
Avand M, Janizadeh S, Tien Bui D, Pham VH, Ngo PTT, Nhu VH (2020) A tree-based intelligence ensemble approach for spatial prediction of potential groundwater. Int J Digit Earth 13(12):1408–1429. https://doi.org/10.1080/17538947.2020.1718785
Bai Y, Fernald A, Tidwell V, Gunda T (2019) Reduced and earlier snowmelt runoff impacts traditional irrigation systems. J Contemp Water Res Education 168(1):10–28
Bair EH, Calfa AA, Rittger K, Dozier J (2018) Using machine learning for real-time estimates of snow water equivalent in the watersheds of Afghanistan. Cryosphere. 12(5):1579–1594
Berezowski T, Chybicki A (2018) High-resolution discharge forecasting for snowmelt and rainfall mixed events. Water (Switzerland) 10(1):56. https://doi.org/10.3390/w10010056
Botsis D, Latinopoulos P, Diamantaras K (2011) Rainfall–runoff modeling using support vector regression and artificial neural networks. In 12th International Conference on Environmental Science and Technology, Rhodes, Greece. http://aetos.it.teithe.gr/~kdiamant/docs/CEST2011.pdf
Breiman L (2001) Random forests Machine learning 45(1):5–32
Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees Chapman and Hall, New York
Bui DT, Ngo P-TT, Pham TD, Jaafari A, Minh NQ, Hoa PV, Samui P (2019) A novel hybrid approach based on a swarm intelligence optimized extreme learning machine for flash flood susceptibility mapping. Catena. 179:184–196
Cortes C, Vapnik V (1995) Support-vector networks. Machine learning 20(3):273–297
Costache R (2019) Flood susceptibility assessment by using bivariate statistics and machine learning models-a useful tool for flood risk management. Water Resour Manag 33(9):3239–3256
Darabi H, Choubin B, Rahmati O, Haghighi AT, Pradhan B, Kløve B (2019) Urban flood risk mapping using the GARP and QUEST models: a comparative study of machine learning techniques. J Hydrol 569:142–154
DeWalle DR, Rango A (2008) Principles of snow hydrology. Cambridge University Press
Essenfelder AH, Giupponi C (2020) A coupled hydrologic-machine learning modelling framework to support hydrologic modelling in river basins under Interbasin water transfer regimes. Environ Model Softw 131:104779. https://doi.org/10.1016/j.envsoft.2020.104779
Fathololoumi S, Vaezi AR, Alavipanah SK, Ghorbani A, Saurette D, Biswas A (2021) Effect of multi-temporal satellite images on soil moisture prediction using a digital soil mapping approach. Geoderma 385:114901
Fernández-Delgado M, Sirsat MS, Cernadas E, Alawadi S, Barro S, Febrero-Bande M (2019) An extensive experimental survey of regression methods. Neural Networks 111:11–34. https://doi.org/10.1016/j.neunet.2018.12.010
Ganjkhanlo H, Vafakhah M, Zeinivand H, Fathzadeh A (2020a) The effect of different sampling schemes on estimation precision of snow water equivalent (SWE) using geostatistics techniques in a semi-arid region of Iran. Geocarto Int 35(16):1769–1782. https://doi.org/10.1080/10106049.2019.1581267
Ganjkhanlo H, Vafakhah M, Zeinivand H, Fathzadeh A (2020b) Prediction of snow water equivalent using artificial neural network and adaptive neuro-fuzzy inference system with two sampling schemes in semi-arid region of Iran. J Mt Sci 17(7):1712–1723. https://doi.org/10.1007/s11629-018-4875-8
Ghanbarpour MR, Saghafian B, Saravi MM, Abbaspour KC (2007) Evaluation of spatial and temporal variability of snow cover in a large mountainous basin in Iran. Hydrol Res 38(1):45–58
Gholami H, Dolat Kordestani M, Li J, Telfer MW, Fathabadi A (2019) Diverse sources of aeolian sediment revealed in an arid landscape in southeastern Iran using a modified Bayesian un-mixing model. Aeolian Res 41(December 2018):100547. https://doi.org/10.1016/j.aeolia.2019.100547
Goodfellow I, Bengio Y, Courville A, Bengio Y (2016) Deep learning. MIT press, Cambridge
Granata F, Gargano R, de Marinis G (2016) Support vector regression for rainfall-runoffmodeling in urban drainage: a comparison with the EPA’s storm water management model. Water (Switzerland) 8(3):69. https://doi.org/10.3390/w8030069
Harvey AC (1977) Some comments on multicollinearity in regression. J R Stat Soc: Ser C: Appl Stat 26(2):188–191
Hatta S, Nishimura T, Saga H, Fujita M (1995) Study on snowmelt runoff prediction using weekly weather forecast. Environ Int 21(5):501–507
Hong H, Pradhan B, Xu C, Bui DT (2015) Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. Catena. 133:266–281
Hsieh WW (2009) Machine learning methods in the environmental sciences: neural networks and kernels. [place unknown]: Cambridge university press
Jaafari A, Zenner EK, Pham BT (2018) Wildfire spatial pattern analysis in the Zagros Mountains, Iran: a comparative study of decision tree based classifiers. Ecological informatics 43:200–211
Janizadeh S, Avand M, Jaafari A, Van Phong T, Bayat M, Ahmadisharaf E, Prakash I, Pham BT, Lee S (2019) Prediction success of machine learning methods for flash flood susceptibility mapping in the Tafresh watershed. Iran Sustainability (Switzerland) 11(19)
Koopialipoor M, Armaghani DJ, Hedayat A, Marto A, Gordan B (2019a) Applying various hybrid intelligent systems to evaluate and predict slope stability under static and dynamic conditions. Soft Comput 23(14):5913–5929
Koopialipoor M, Fallah A, Armaghani DJ, Azizi A, Mohamad ET (2019b) Three hybrid intelligent models in estimating flyrock distance resulting from blasting. Eng Comput 35(1):243–256
Kotsiantis SB, Pintelas PE (2004) Combining bagging and boosting. Comput Intell 1(4):324–333. https://doi.org/10.1103/PhysRevD.77.085025
Kroll CN, Song P (2013) Impact of multicollinearity on small sample hydrologic regression models. Water Resour Res 49(6):3756–3769
Kuhn M, Johnson K, et al. (2013) Applied predictive modeling. [place unknown]: Springer
LaMalfa EM, Ryle R (2008) Differential snowpack accumulation and water dynamics in aspen and conifer communities: implications for water yield and ecosystem function. Ecosystems. 11(4):569–581
Lianjun C (2016) Research on snow extracting methods on the basis of random forests algorithm. International Journal of Simulation: Systems, Science and Technology 17(19):3.1–3.6
Ließ M, Glaser B, Huwe B (2012) Uncertainty in the spatial prediction of soil texture: comparison of regression tree and random Forest models. Geoderma. 170:70–79
Ma Y, Huang Y, Chen X, Li Y, Bao A. 2013. Modelling snowmelt runoff under climate change scenarios in an ungauged mountainous watershed, Northwest China. Mathematical Problems in Engineering. 2013
MacKay DJC (1992) Bayesian interpolation. Neural Comput 4(3):415–447
Marwala T (2007) Bayesian training of neural networks using genetic programming. Pattern Recogn Lett 28(12):1452–1458
Maurer EP, Rhoads JD, Dubayah RO, Lettenmaier DP (2003) Evaluation of the snow-covered area data product from MODIS. Hydrol Process 17(1):59–71
McBratney AB, Santos MM, Minasny B (2003) On digital soil mapping. Geoderma 117(1–2):3–52
Mekanik F, Imteaz MA, Gato-Trinidad S, Elmahdi A (2013) Multiple regression and artificial neural network for long-term rainfall forecasting using large scale climate modes. J Hydrol 503:11–21
Mosavi A, Ozturk P, Chau K (2018) Flood prediction using machine learning models: literature review. Water. 10(11):1536
Mote PW, Hamlet AF, Clark MP, Lettenmaier DP (2005) Declining mountain snowpack in western North America. Bull Am Meteorol Soc 86(1):39–50
Nandhini M, Sivanandam SN (2015) An improved predictive association rule based classifier using gain ratio and T-test for health care data diagnosis. Sadhana. 40(6):1683–1699
Nguyen PT, Ha DH, Avand M, Jaafari A, Nguyen HD, Al-Ansari N, Van PT, Sharma R, Kumar R, Van LH et al (2020) Soft computing ensemble models based on logistic regression for groundwater potential mapping. Appl Sci 10(7):2469
Nhu VH, Janizadeh S, Avand M, Chen W, Farzin M, Omidvar E, Shirzadi A, Shahabi H, Clague JJ, Jaafari A et al (2020) GIS-based gully erosion susceptibility mapping: a comparison of computational ensemble data mining models. Applied Sciences (Switzerland). 10(6)
Nikoo M, Ramezani F, Hadzima-Nyarko M, Nyarko EK, Mohammad N (2016) Flood-routing modeling with neural network optimized by social-based algorithm. Nat Hazards 82(1):1–24
Noi PT, Degener J, Kappas M (2017) Comparison of multiple linear regression, cubist regression, and random forest algorithms to estimate daily air surface temperature from dynamic combinations of MODIS LST data. Remote Sens 9(5)
O’brien RM (2007) A caution regarding rules of thumb for variance inflation factors. Qual Quant 41(5):673–690
Peters J, Verhoest NEC, Samson R, Boeckx P, De Baets B (2008) Wetland vegetation distribution modelling for the identification of constraining environmental variables. Landsc Ecol 23(9):1049–1065
Pham L, Luo L, Finley A (2020) Evaluation of random Forest for short-term daily streamflow forecast in rainfall and snowmelt driven watersheds. Hydrol Earth Syst Sci Discuss(June):1–33
Rahman MM, Karunasinghe J, Clifford S, Knibbs LD, Morawska L (2020) New insights into the spatial distribution of particle number concentrations by applying non-parametric land use regression modelling. Sci Total Environ 702:134708
Rahmati O, Pourghasemi HR, Zeinivand H (2016) Flood susceptibility mapping using frequency ratio and weights-of-evidence models in the Golastan Province. Iran Geocarto International 31(1):42–70
Rango A, Steele CM, Elias E, Mejia J, Fernald A. 2013. Potential impacts of climate warming on runoff from snowmelt: A case study of two mountainous basins in the Upper Rio Grande. In: AGU Fall Meeting Abstracts. Vol. 2013. [place unknown]; p. H23A--1217
Revuelto J, López-Moreno JI, Azorin-Molina C, Vicente-Serrano SM (2014) Topographic control of snowpack distribution in a small catchment in the central Spanish Pyrenees: intra-and inter-annual persistence. Cryosphere 8(5):1989–2006
Robinson C, Schumacker RE (2009) Interaction effects: centering, variance inflation factor, and interpretation issues. Multiple linear regression viewpoints 35(1):6–11
Rozos E (2019) Machine learning, urban water resources management and operating policy. Resources. 8(4):173
Sahoo GB, Ray C, De Carlo EH (2006) Calibration and validation of a physically distributed hydrological model, MIKE SHE, to predict streamflow at high frequency in a flashy mountainous Hawaii stream. J Hydrol 327(1–2):94–109
Shabani S, Yousefi P, Adamowski J, Naser G. 2016. Intelligent soft computing models in water demand forecasting. Water Stress in Plants:99–117
Sharifi Garmdareh E, Vafakhah M, Eslamian SS (2018) Regional flood frequency analysis using support vector regression in arid and semi-arid regions of Iran. Hydrol Sci J 63(3):426–440. https://doi.org/10.1080/02626667.2018.1432056
Sharifi Garmdareh E, Vafakhah M, Eslamian SS, Khosrobeigi Bozchaloei S, Allahbakhshian-Farsani P, Vafakhah M, Khosravi-Farsani H, Hertig E, Appelhans T, Mwangomo E et al (2020) A coupled hydrologic-machine learning modelling framework to support hydrologic modelling in river basins under interbasin water transfer regimes. Water (Switzerland) 34(5):283–294. https://doi.org/10.1016/j.asoc.2016.12.052
Starzyk J (2010) Water resource planning and management using motivated machine learning. IAHS-AISH Publication 338(July):214–220
Sun M, Chen T, Yu Y, Wang Z, Chi D, others. 2014. Extreme learning machine application in flood forecasting. Journal of Shenyang Agricultural University 45(2):245–248
Tahmasebi P, Kamrava S, Bai T, Sahimi M (2020) Machine learning in geo- and environmental sciences: from small to large scale. Adv Water Resour 142:103619. https://doi.org/10.1016/j.advwatres.2020.103619
Talei A, Chua LHC, Quek C, Jansson P-E (2013) Runoff forecasting using a Takagi--Sugeno neuro-fuzzy model with online learning. J Hydrol 488:17–32
Thapa S, Zhao Z, Li B, Lu L, Fu D, Shi X, Tang B, Qi H (2020) Snowmelt-driven streamflow prediction using machine learning techniques (LSTM, NARX, GPR, and SVR). Water (Switzerland). 12(6)
Tien Bui D, Shirzadi A, Shahabi H, Chapi K, Omidavr E, Pham BT, Talebpour Asl D, Khaledian H, Pradhan B, Panahi M et al (2019) A novel ensemble artificial intelligence approach for gully erosion mapping in a semi-arid watershed (Iran). Sensors. 19(11):2444
Vafakhah M, Khosrobeigi BS (2020) Regional analysis of flow duration curves through support vector regression. Water Resour Manag 34(1):283–294
Vafakhah M, Mohseni SM, Mahdavi M, Alavipanah SK (2011) Snowmelt runoff prediction by using artificial neural network and adaptive neuro-fuzzy inference system in Taleghan watershed. Iranian J Watershed Manag Sci Eng 5(14):23–35
Vafakhah M, Nouri A, Alavipanah SK (2015) Snowmelt-runoff estimation using radiation SRM model in Taleghan watershed. Environ Earth Sci 73(3):993–1003
Viswesvaran C (1998) Multiple regression in behavioral research: explanation and prediction. Pers Psychol 51(1):223
Vorpahl P, Elsenbeer H, Märker M, Schröder B (2012) How can statistical models help to determine driving factors of landslides? Ecol Model 239:27–39
Wester P, Mishra A, Mukherji A, Shrestha AB. 2019. The Hindu Kush Himalaya assessment: Mountains, climate change, sustainability and people. [place unknown]: Springer Nature
Wheeler D, Tiefelsdorf M (2005) Multicollinearity and correlation among local regression coefficients in geographically weighted regression. J Geogr Syst 7(2):161–187
Winstral A, Elder K, Davis RE (2002 Oct) Spatial snow modeling of wind-redistributed snow using terrain-based parameters. J Hydrometeorol 3(5):524–538
Xie Z, Lou I, Ung WK, Mok KM (2012) Freshwater algal bloom prediction by support vector machine in Macau storage reservoirs. Math Probl Eng 2012
Yariyan P, Janizadeh S, Van Phong T, Nguyen HD, Costache R, Van Le H, Pham BT, Pradhan B, Tiefenbacher JP (2020) Improvement of best first decision trees using bagging and Dagging ensembles for flood probability mapping. Water Resour Manag 34(9):3037–3053
Yoo C, Cho E (2019) Effect of multicollinearity on the bivariate frequency analysis of annual maximum rainfall events. Water (Switzerland) 11(5):905. https://doi.org/10.3390/w11050905
Young CC, Liu WC, Wu MC (2017) A physically based and machine learning hybrid approach for accurate rainfall-runoff modeling during extreme typhoon events. App Soft Comput 53:205–216. https://doi.org/10.1016/j.asoc.2016.12.052
Zheng X, Wang Q, Zhou L, Sun Q, Li Q (2018) Predictive contributions of snowmelt and rainfall to streamflow variations in the Western United States. Adv Meteorol 2018:3765098. https://doi.org/10.1155/2018/3765098
Zhou J, Li E, Wei H, Li C, Qiao Q, Armaghani DJ (2019) Random forests and cubist algorithms for predicting shear strengths of rockfill materials. Applied Sciences (Switzerland) 9(8):1–16
Author information
Authors and Affiliations
Contributions
Mehdi Vafakhah: assisted in running the program and data collection, revised manuscript; Ali Nasiri Khiavi: drafted the manuscript; Saeid Janizadeh: conducted data analysis; Hojatolah Ganjkhanlo: collected data and provided maps. The authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Human and animal rights
This article does not contain any studies with animals performed by any of the authors.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Additional information
Communicated by: H. Babaie
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Vafakhah, M., Nasiri Khiavi, A., Janizadeh, S. et al. Evaluating different machine learning algorithms for snow water equivalent prediction. Earth Sci Inform 15, 2431–2445 (2022). https://doi.org/10.1007/s12145-022-00846-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-022-00846-z