Abstract
Lakes play an important role in hydrologic cycle, and water level forecasting can provide vital information for future management of lakes and their ecosystem. In the present study, five standalone tree-based models including alternating model tree (AMT), M5 Prime (M5P), random forest (RF), random tree (RT) and reduced error pruning tree (REPT) with their ensemble with Bagging algorithm were applied to 1, 2, 3 and 6-months ahead water level forecasting at Lake Urmia, northwest of Iran. Water level time series data from 2001 to 2020 were obtained and divided into two sub-groups for model building (from 2001 to 2014) and validation (from 2015 to 2020). Different input scenarios were constructed to find the most effective input scenario. Finally developed models were evaluated through several visually-based and quantitative-based criteria. The obtained results indicated that for 1 and 2-months ahead, ensemble model of BA-AMT has a higher performance, while for 3 and 6-months ahead, BA-RF outperform other models. Overall, ensemble models enhanced forecasting power of standalone models. Also, developed models were able to forecast water level for up to 3-months ahead, and by increasing the scale for more months ahead, forecasting accuracy decreased and model were found to be of high uncertainty and low forecasting power.
Similar content being viewed by others
References
Abpeykar S, Ghatee M (2019) Neural trees with peer-to-peer and server-to-client knowledge transferring models for high-dimensional data classification. Expert Syst Appl 137:281–291. https://doi.org/10.1016/j.eswa.2019.07.003
Barzegar R, Adamowski J, Asghari Moghaddam A (2016) Application of wavelet-artificial intelligence hybrid models for water quality prediction: a case study in Aji-Chay River, Iran. Stoch Environ Res Risk Assess 30(7):1797–1819
Breiman L (1999) Random forests-random feature, UC Berkeley, Technical Report 567, pp. 29
Breiman L (2001) Random forests. Mach Learn 45:5–32
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140. https://doi.org/10.1007/BF00058655
Breslow LA, Aha DW (1997) Simplifying decision trees: A survey. Knowl Eng Rev 12:1–40
Bui DT, Khosravi K, Tiefenbacher J, Nguyen H, Kazakis N (2020) Improving prediction of water quality indices using novel hybrid machine-learning algorithms. Sci Total Environ 721:137612
Carranza EJM, Laborte AG (2016) Data-driven predictive modeling of mineral prospectivity using random forests: a case study in Catanduanes Island (Philippines) Nat Resour Res 25:35–50. https://doi.org/10.1007/s11053-015-9268-x
Chang FJ, Chang YT (2006) Adaptive neuro-fuzzy inference system for prediction of water level in reservoir. Adv Water Resour 29(1):1–10. https://doi.org/10.1016/j.advwatres.2005.04.015
Chen W, Panahi M, Pourghasemi HR (2017) Performance evaluation of GIS-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (ANFIS) with genetic algorithm (GA), differential evolution. Catena 157:310–324
Chen X, Quan Q, Zhang K, Wei J (2021) Spatiotemporal characteristics and attribution of dry/wet conditions in the Weihe River Basin within a typical monsoon transition zone of East Asia over the recent 547 years. Environ Model Softw 143:105116. https://doi.org/10.1016/j.envsoft.2021.105116
Chen Z, Liu Z, Yin L, Zheng W (2022) Statistical analysis of regional air temperature characteristics before and after dam construction. Urban Clim 41. https://doi.org/10.1016/j.uclim.2022.101085
Choi C, Kim J, Han H, Han D, Kim HS (2019) Development of water level prediction models using machine learning in wetlands: a case study of Upo Wetland in South Korea. Water 2020 12:93. https://doi.org/10.3390/w12010093
Choubin B, Zehtabian GR, Azareh A, Rafiei-Sardooi E, Sajedi-Hosseini F, Kisi O (2016) Precipitation forecasting using classification and regression trees (CART) model: a comparative study of different approaches. Environ Earth Sci 77(8):1–13
Coulibaly P (2010) Reservoir computing approach to Great Lakes water level forecasting. J Hydrol 381(1–2):76–88
Drmota M, Gittenberger B (1997) On the profile of random trees. Random Structures & Algorithms 10:421–451
Ehteram M, Ferdowsi A, Faramarzpour M, Mohammed Sami Al-Janabi A, Al-ansari N, Dhanraj Bokde N, Yaseen ZM (2021) Hybridization of artificial intelligence models with nature inspired optimization algorithms for lake water level prediction and uncertainty analysis. Alex Eng J 60:2193–2208
Fang X, Wang Q, Wang J, Xiang Y, Wu, Y, ..., Zhang Y (2021) Employing extreme value theory to establish nutrient criteria in bay waters: A case study of Xiangshan Bay. J Hydrol. https://doi.org/10.1016/j.jhydrol.2021.127146
Frank E, Mayo M, Kramer S (2015) Alternating model trees. In: Proceedings of the 30th annual 589 ACM symposium on applied computing, pp 871–878. https://doi.org/10.1145/2695664.2695848
Freund Y, Mason L (1999) The alternating decision tree learning algorithm. In Proc 16th International Conf on Machine Learning, pages 124–133, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc
Gislason PO, Benediktsson JA, Sveinsson JR (2006) Random forests for land cover classification. Pattern Recogn Lett 27:294–300
Hrnjica B, Bonacci O (2019) Lake level prediction using feed forward and recurrent neural networks. Water Resour Manage. https://doi.org/10.1007/s11269-019-02255-2
Hussain D, Khan AA (2020) Machine learning techniques for monthly river flow forecasting of Hunza River, Pakistan. Earth Sci Inform. https://doi.org/10.1007/s12145-020-00450-z
Jiang S, Zuo Y, Yang M, Feng R (2021) Reconstruction of the Cenozoic tectono-thermal history of the Dongpu Depression, Bohai Bay Basin, China: Constraints from apatite fission track and vitrinite reflectance data. J Petrol Sci Eng 205:108809. https://doi.org/10.1016/j.petrol.2021.108809
Kapoor P, Rani R (2015) Efficient decision tree algorithm using J48 and reduced error pruning. Int J Eng Res Gen Sci 3(3):1–9
Khatibi R, Ghorbani MA, Naghipour L, Jothiprakash V, Fathima TA, Fazelifard MH (2014) Inter-613 comparison of time series models of lake levels predicted by several modeling strategies. J Hydrol 511:530–545
Khosravi K, Panahi M, Tien Bui D (2018) Spatial prediction of groundwater spring potential mapping based on an adaptive neuro-fuzzy inference system and metaheuristic optimization. Hydrol Earth Syst Sci 22(9):4771–4792
Khosravi K, Daggupati P, Alami MT, Awadh SM, Ghareb MI, Panahi M (2019) Meteorological data mining and hybrid data-intelligence models for reference evaporation simulation: A case study in Iraq. Comput Electron Agric 167:105041
Khosravi K, Cooper JR, Daggupati P, Pham BT, Bui DT (2020a) Bedload transport rate prediction: Application of novel hybrid data mining techniques. J Hydrol 585:124774
Khosravi K, Barzegar R, Miraki S, Adamowski J, Daggupati P (2020b) Stochastic modeling of groundwater fluoride contamination: Introducing lazy learners. Groundwater 58(5):723–734
Kisi O, Shiri J, Nikoofar B (2012) Forecasting daily lake levels using artificial intelligence approaches. Comput Geosci 41:169–180
LaValle SM (1998) Rapidly-exploring random trees: A new tool for path planning. The Annu Res Rep pp. 1–4
Liu H, Chen SM (2020) Heuristic creation of deep rule ensemble through iterative expansion of feature space. Inf Sci. https://doi.org/10.1016/j.ins.2020.02.001
Liu B, Spiekermann R, Zhao C, Püttmann W, Sun Y, Jasper A, ..., Uhl D (2022a) Evidence for the repeated occurrence of wildfires in an upper Pliocene lignite deposit from Yunnan, SW China. Int J Coal Geol 250. https://doi.org/10.1016/j.coal.2021.103924
Liu S, Liu Y, Wang C, Dang X (2022b) The distribution characteristics and human health risks of high- fluorine groundwater in coastal plain: A case study in Southern Laizhou Bay, China. Front Environ Sci. https://doi.org/10.3389/fenvs.2022.901637
Liu Y, Zhang K, Li Z, Liu Z, Wang J, ... Huang P (2020) A hybrid runoff generation modelling framework based on spatial combination of three runoff generation schemes for semi-humid and semi-arid watersheds. J Hydrol (Amsterdam) 590. https://doi.org/10.1016/j.jhydrol.2020.125440
Melesse AM, Ahmad S, McClain ME, Wang X, Lim YH (2011) Suspended sediment load prediction of river systems: An artificial neural network approach. Agric Water Manag. https://doi.org/10.1016/j.agwat.2010.12.012
Mohebzadeh H (2018) Extracting A-L Relationship for Urmia Lake, Iran Using MODIS NDVI/NDWI Indices. J Hydrogeol Hydrol Eng 7:1. https://doi.org/10.4172/2325-9647.1000164
Mohamed WNHW, Salleh MNM, Omar AH (2013) A comparative study of Reduced Error Pruning method in decision tree algorithms. In: Proceedings - 2012 IEEE International Conference on Control System, Computing and Engineering, ICCSCE 2012
Moriasi DN, Arnold JG, Van Liew MW, Binger RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50:885–900. https://doi.org/10.13031/2013.23153
Ngo PTT, Panahi M, Khosravi K, Ghorbanzadeh O, Kariminejad N (2021) Evaluation of deep learning algorithms for national scale landslide susceptibility mapping of Iran. Geosci Front 12(2):505–519
Onan A, Korukoğlu S, Bulut H (2016) A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification. Expert Syst Appl 62:1–16. https://doi.org/10.1016/j.eswa.2016.06.005
Onyari E, Ilunga F (2013) Application of MLP neural network and M5P model tree in predicting streamflow: A case study of luvuvhu catchment, South Africa. Int J Innov Technol Manag 4(1):11–15
Quinlan JR (1992) Learning with continuous classes. Mach Learn 92:343–348
Razmkhah A, Alvankar SR, Kakahaji A (2016) Modeling lake Urmia water-level changes using local linear neuro-fuzzy method. J Water Sci Res 9(1, Spring 2017):47–61
Sajjad-Khan M, Coulibaly P (2006) Application of support vector machine in lake water level prediction. J Hydrol Eng 11(3). https://doi.org/10.1061/(ASCE)1084-0699(2006)11:3(199)
Shiri J, Shamshirband S, Kisi O, Karimi S, Bateni SM, Nezhad SHH (2016) Prediction of water-level in the Urmia Lake using the extreme learning machine approach. Water Resour Manage 30(14):5217–5229
Tama BA, Comuzzi M (2019) An empirical comparison of classification techniques for next event prediction using business process event logs. Expert Syst Appl 129:233–245. https://doi.org/10.1016/j.eswa.2019.04.016
Tian H, Qin Y, Niu Z, Wang L, Ge S (2021) Summer maize mapping by compositing time series sentinel-1A imagery based on crop growth cycles. J Indian Soc Remote Sens 49(11):2863–2874. https://doi.org/10.1007/s12524-021-01428-0
Tian H, Wang Y, Chen T, Zhang L, Qin Y (2021) Early-season mapping of winter crops using sentinel-2 optical imagery. Remote Sens (Basel, Switzerland) 13(19):3822. https://doi.org/10.3390/rs13193822
Wang J, Zuo R, Xiong Y (2019) Mapping mineral prospectivity via semi-supervised random forest. Nat Resour Res. https://doi.org/10.1007/s11053-019-09510-8
Wang Y, Witten IH (1997) Induction of model trees for predicting continuous classes. Proc. 9th Eur. Conf. Mach. Learn. Poster Pap. 128–137
Wang Z, Zuo R, Dong Y (2019) Mapping geochemical anomalies through integrating random forest and metric learning methods. Nat Resour Res 28:1285–1298. https://doi.org/10.1007/s11053-019-09471-y
Wang S, Zhang K, Chao L, Li D, Tian X, Bao H, ... Xia Y (2021) Exploring the utility of radar and satellite-sensed precipitation and their dynamic bias correction for integrated prediction of flood and landslide hazards. J Hydrol (Amsterdam) 603. https://doi.org/10.1016/j.jhydrol.2021.126964
Wang Y, Cheng H, Hu Q, Liu L, Jia L, Gao S, ... Wang Y (2022) Pore structure heterogeneity of Wufeng-Longmaxi shale, Sichuan Basin, China: Evidence from gas physisorption and multifractal geometries. J Petrol Sci Eng 208. https://doi.org/10.1016/j.petrol.2021.109313
Waseem Ahmad M, Reynolds J, Rezgui Y (2018) Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees. J Clean Prod 203:810–821. https://doi.org/10.1016/j.jclepro.2018.08.207
Xie W, Li X, Jian W, Yang Y, Liu H, Robledo LF, ... Nie W (2021) A novel hybrid method for landslide susceptibility mapping-based Geodetector and machine learning cluster: A case of Xiaojin County. China ISPRS Int J Geo-Inf 10(2):93. https://doi.org/10.3390/ijgi10020093
Xie W, Nie W, Saffari P, Robledo LF, Descote P, ... Jian W (2021) Landslide hazard assessment based on Bayesian optimization–support vector machine in Nanping City. China Natural Hazards (Dordrecht) 109(1):931–948. https://doi.org/10.1007/s11069-021-04862-y
Yarar A, Onucyildiz M, Copty NK (2009) Modelling level change in lakes using neuro-fuzzy and artificial neural networks. J Hydrol 365(3–4):329–334
Yin L, Wang L, Keim BD, Konsoer K, Zheng W (2022a) Wavelet analysis of dam injection and discharge in three gorges dam and reservoir with precipitation and river discharge. Water 14(4):567. https://doi.org/10.3390/w14040567
Yin L, Wang L, Zheng W, Ge L, Tian J, Liu Y, ... Liu S (2022b) Evaluation of empirical atmospheric models using swarm-c satellite data. Atmosphere 13(2):294. https://doi.org/10.3390/atmos13020294
Young CC, Liu WC, Hshieh WL (2015) Predicting the water level fluctuation in an Alpine Lake using physically based, artificial neural network, and time series forecasting models. Math Probl Eng, Article ID708204. https://doi.org/10.1155/2015/708204
Yucalar F, Ozcift A, Borandag E, Kilinc D (2020) Multiple-classifiers in software quality engineering: Combining predictors to improve software fault prediction ability. Eng Sci Technol. https://doi.org/10.1016/j.jestch.2019.10.005
Zhang S, Xiao K, Carranza EJM, Yang F (2019) Maximum entropy and random forest modeling of mineral potential: Analysis of gold prospectivity in the Hezuo-Meiwu District West Qinling Orogen. China Nat Resour Res 28:645–664. https://doi.org/10.1007/s11053-018-9425-0
Zhang K, Ali A, Antonarakis A, Moghaddam M, Saatchi S, Tabatabaeenejad A, ... Moorcroft P (2019) The Sensitivity of North American Terrestrial Carbon Fluxes to Spatial and Temporal Variation in Soil Moisture: An analysis using radar-derived estimates of root-zone soil moisture. J Geophys Res Biogeosci 124(11):3208–3231. https://doi.org/10.1029/2018JG004589
Zhang K, Shalehy MH, Ezaz GT, Chakraborty A, Mohib KM, ... Liu L (2022) An integrated flood risk assessment approach based on coupled hydrological-hydraulic modeling and bottom-up hazard vulnerability analysis. Environ Model Softw 148. https://doi.org/10.1016/j.envsoft.2021.105279
Zhang K, Wang S, Bao H, Zhao X (2019) Characteristics and influencing factors of rainfall-induced landslide and debris flow hazards in Shaanxi Province, China. Nat Hazard 19(1):93–105. https://doi.org/10.5194/nhess-19-93-2019
Zhao F, Song L, Peng Z, Yang J, Luan G, Chu C, ... Xie Z (2021a) Night-time light remote sensing mapping: Construction and analysis of ethnic minority development index. Remote Sens (Basel, Switzerland) 13(11):2129. https://doi.org/10.3390/rs13112129
Zhao F, Zhang S, Du Q, Ding J, Luan G, ... Xie Z (2021b) Assessment of the sustainable development of rural minority settlements based on multidimensional data and geographical detector method: A case study in Dehong, China. Socio-Econ Plann Sci. https://doi.org/10.1016/j.seps.2021b.101066
Zhao X, Xia H, Pan L, Song H, Niu W, Wang R, ... Qin Y (2021c) Drought monitoring over Yellow River Basin from 2003–2019 using reconstructed MODIS land surface temperature in Google Earth Engine. Remote Sens (Basel, Switzerland) 13(18):3748. https://doi.org/10.3390/rs13183748
Zhu S, Lu H, Ptak M, Dai J, Ji Q (2020a) Lake water-level fluctuation forecasting using machine 703 learning models: a systematic review. Environ Sci Pollut Res 27(44807–704):44819
Zhu S, Hrnjica B, Ptak M, Choiński A, Sivakumar B (2020b) Forecasting of water level in multiple 701 temperate lakes using machine learning models. J Hydrol 585:124819
Zhu Z, Zhu Z, Wu Y, Han J (2022) A prediction method of coal burst based on analytic hierarchy process and fuzzy comprehensive evaluation. Front Earth Sci (Lausanne) 9. https://doi.org/10.3389/feart.2021.834958
Zounemat-Kermani M, Stephan D, Barjenbruch M, Hinkelmann R (2020) Ensemble data mining modeling in corrosion of concrete sewer: A comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models. Adv Eng Inform 43:101030. https://doi.org/10.1016/j.aei.2019.101030
Zuo Y, Jiang S, Wu S, Xu W, Zhang J, Feng R, ... Santosh M (2020) Terrestrial heat flow and lithospheric thermal structure in the Chagan Depression of the Yingen-Ejinaqi Basin, north central China. Basin Res 32(6):1328–1346. https://doi.org/10.1111/bre.12430
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
There is not any conflict of interest among authors.
Additional information
Communicated by H. Babaie.
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hajian, R., Jalali, M.R. & Mastouri, R. Multi-step Lake Urmia water level forecasting using ensemble of bagging based tree models. Earth Sci Inform 15, 2515–2543 (2022). https://doi.org/10.1007/s12145-022-00857-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-022-00857-w