Skip to main content

Advertisement

Log in

Multi-step Lake Urmia water level forecasting using ensemble of bagging based tree models

  • Research Article
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

Lakes play an important role in hydrologic cycle, and water level forecasting can provide vital information for future management of lakes and their ecosystem. In the present study, five standalone tree-based models including alternating model tree (AMT), M5 Prime (M5P), random forest (RF), random tree (RT) and reduced error pruning tree (REPT) with their ensemble with Bagging algorithm were applied to 1, 2, 3 and 6-months ahead water level forecasting at Lake Urmia, northwest of Iran. Water level time series data from 2001 to 2020 were obtained and divided into two sub-groups for model building (from 2001 to 2014) and validation (from 2015 to 2020). Different input scenarios were constructed to find the most effective input scenario. Finally developed models were evaluated through several visually-based and quantitative-based criteria. The obtained results indicated that for 1 and 2-months ahead, ensemble model of BA-AMT has a higher performance, while for 3 and 6-months ahead, BA-RF outperform other models. Overall, ensemble models enhanced forecasting power of standalone models. Also, developed models were able to forecast water level for up to 3-months ahead, and by increasing the scale for more months ahead, forecasting accuracy decreased and model were found to be of high uncertainty and low forecasting power.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

  • Abpeykar S, Ghatee M (2019) Neural trees with peer-to-peer and server-to-client knowledge transferring models for high-dimensional data classification. Expert Syst Appl 137:281–291. https://doi.org/10.1016/j.eswa.2019.07.003

    Article  Google Scholar 

  • Barzegar R, Adamowski J, Asghari Moghaddam A (2016) Application of wavelet-artificial intelligence hybrid models for water quality prediction: a case study in Aji-Chay River, Iran. Stoch Environ Res Risk Assess 30(7):1797–1819

    Article  Google Scholar 

  • Breiman L (1999) Random forests-random feature, UC Berkeley, Technical Report 567, pp. 29

  • Breiman L (2001) Random forests. Mach Learn 45:5–32

    Article  Google Scholar 

  • Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140. https://doi.org/10.1007/BF00058655

  • Breslow LA, Aha DW (1997) Simplifying decision trees: A survey. Knowl Eng Rev 12:1–40

    Article  Google Scholar 

  • Bui DT, Khosravi K, Tiefenbacher J, Nguyen H, Kazakis N (2020) Improving prediction of water quality indices using novel hybrid machine-learning algorithms. Sci Total Environ 721:137612

    Article  Google Scholar 

  • Carranza EJM, Laborte AG (2016) Data-driven predictive modeling of mineral prospectivity using random forests: a case study in Catanduanes Island (Philippines) Nat Resour Res 25:35–50. https://doi.org/10.1007/s11053-015-9268-x

  • Chang FJ, Chang YT (2006) Adaptive neuro-fuzzy inference system for prediction of water level in reservoir. Adv Water Resour 29(1):1–10. https://doi.org/10.1016/j.advwatres.2005.04.015

    Article  Google Scholar 

  • Chen W, Panahi M, Pourghasemi HR (2017) Performance evaluation of GIS-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (ANFIS) with genetic algorithm (GA), differential evolution. Catena 157:310–324

    Article  Google Scholar 

  • Chen X, Quan Q, Zhang K, Wei J (2021) Spatiotemporal characteristics and attribution of dry/wet conditions in the Weihe River Basin within a typical monsoon transition zone of East Asia over the recent 547 years. Environ Model Softw 143:105116. https://doi.org/10.1016/j.envsoft.2021.105116

    Article  Google Scholar 

  • Chen Z, Liu Z, Yin L, Zheng W (2022) Statistical analysis of regional air temperature characteristics before and after dam construction. Urban Clim 41. https://doi.org/10.1016/j.uclim.2022.101085

  • Choi C, Kim J, Han H, Han D, Kim HS (2019) Development of water level prediction models using machine learning in wetlands: a case study of Upo Wetland in South Korea. Water 2020 12:93. https://doi.org/10.3390/w12010093

  • Choubin B, Zehtabian GR, Azareh A, Rafiei-Sardooi E, Sajedi-Hosseini F, Kisi O (2016) Precipitation forecasting using classification and regression trees (CART) model: a comparative study of different approaches. Environ Earth Sci 77(8):1–13

    Google Scholar 

  • Coulibaly P (2010) Reservoir computing approach to Great Lakes water level forecasting. J Hydrol 381(1–2):76–88

    Article  Google Scholar 

  • Drmota M, Gittenberger B (1997) On the profile of random trees. Random Structures & Algorithms 10:421–451

    Article  Google Scholar 

  • Ehteram M, Ferdowsi A, Faramarzpour M, Mohammed Sami Al-Janabi A, Al-ansari N, Dhanraj Bokde N, Yaseen ZM (2021) Hybridization of artificial intelligence models with nature inspired optimization algorithms for lake water level prediction and uncertainty analysis. Alex Eng J 60:2193–2208

    Article  Google Scholar 

  • Fang X, Wang Q, Wang J, Xiang Y, Wu, Y, ..., Zhang Y (2021) Employing extreme value theory to establish nutrient criteria in bay waters: A case study of Xiangshan Bay. J Hydrol. https://doi.org/10.1016/j.jhydrol.2021.127146

  • Frank E, Mayo M, Kramer S (2015) Alternating model trees. In: Proceedings of the 30th annual 589 ACM symposium on applied computing, pp 871–878. https://doi.org/10.1145/2695664.2695848

  • Freund Y, Mason L (1999) The alternating decision tree learning algorithm. In Proc 16th International Conf on Machine Learning, pages 124–133, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc

  • Gislason PO, Benediktsson JA, Sveinsson JR (2006) Random forests for land cover classification. Pattern Recogn Lett 27:294–300

    Article  Google Scholar 

  • Hrnjica B, Bonacci O (2019) Lake level prediction using feed forward and recurrent neural networks. Water Resour Manage. https://doi.org/10.1007/s11269-019-02255-2

    Article  Google Scholar 

  • Hussain D, Khan AA (2020) Machine learning techniques for monthly river flow forecasting of Hunza River, Pakistan. Earth Sci Inform. https://doi.org/10.1007/s12145-020-00450-z

  • Jiang S, Zuo Y, Yang M, Feng R (2021) Reconstruction of the Cenozoic tectono-thermal history of the Dongpu Depression, Bohai Bay Basin, China: Constraints from apatite fission track and vitrinite reflectance data. J Petrol Sci Eng 205:108809. https://doi.org/10.1016/j.petrol.2021.108809

    Article  Google Scholar 

  • Kapoor P, Rani R (2015) Efficient decision tree algorithm using J48 and reduced error pruning. Int J Eng Res Gen Sci 3(3):1–9

  • Khatibi R, Ghorbani MA, Naghipour L, Jothiprakash V, Fathima TA, Fazelifard MH (2014) Inter-613 comparison of time series models of lake levels predicted by several modeling strategies. J Hydrol 511:530–545

    Article  Google Scholar 

  • Khosravi K, Panahi M, Tien Bui D (2018) Spatial prediction of groundwater spring potential mapping based on an adaptive neuro-fuzzy inference system and metaheuristic optimization. Hydrol Earth Syst Sci 22(9):4771–4792

    Article  Google Scholar 

  • Khosravi K, Daggupati P, Alami MT, Awadh SM, Ghareb MI, Panahi M (2019) Meteorological data mining and hybrid data-intelligence models for reference evaporation simulation: A case study in Iraq. Comput Electron Agric 167:105041

    Article  Google Scholar 

  • Khosravi K, Cooper JR, Daggupati P, Pham BT, Bui DT (2020a) Bedload transport rate prediction: Application of novel hybrid data mining techniques. J Hydrol 585:124774

    Article  Google Scholar 

  • Khosravi K, Barzegar R, Miraki S, Adamowski J, Daggupati P (2020b) Stochastic modeling of groundwater fluoride contamination: Introducing lazy learners. Groundwater 58(5):723–734

    Google Scholar 

  • Kisi O, Shiri J, Nikoofar B (2012) Forecasting daily lake levels using artificial intelligence approaches. Comput Geosci 41:169–180

    Article  Google Scholar 

  • LaValle SM (1998) Rapidly-exploring random trees: A new tool for path planning. The Annu Res Rep pp. 1–4

  • Liu H, Chen SM (2020) Heuristic creation of deep rule ensemble through iterative expansion of feature space. Inf Sci. https://doi.org/10.1016/j.ins.2020.02.001

    Article  Google Scholar 

  • Liu B, Spiekermann R, Zhao C, Püttmann W, Sun Y, Jasper A, ..., Uhl D (2022a) Evidence for the repeated occurrence of wildfires in an upper Pliocene lignite deposit from Yunnan, SW China. Int J Coal Geol 250. https://doi.org/10.1016/j.coal.2021.103924

  • Liu S, Liu Y, Wang C, Dang X (2022b) The distribution characteristics and human health risks of high- fluorine groundwater in coastal plain: A case study in Southern Laizhou Bay, China. Front Environ Sci. https://doi.org/10.3389/fenvs.2022.901637

  • Liu Y, Zhang K, Li Z, Liu Z, Wang J, ... Huang P (2020) A hybrid runoff generation modelling framework based on spatial combination of three runoff generation schemes for semi-humid and semi-arid watersheds. J Hydrol (Amsterdam) 590. https://doi.org/10.1016/j.jhydrol.2020.125440

  • Melesse AM, Ahmad S, McClain ME, Wang X, Lim YH (2011) Suspended sediment load prediction of river systems: An artificial neural network approach. Agric Water Manag. https://doi.org/10.1016/j.agwat.2010.12.012

  • Mohebzadeh H (2018) Extracting A-L Relationship for Urmia Lake, Iran Using MODIS NDVI/NDWI Indices. J Hydrogeol Hydrol Eng 7:1. https://doi.org/10.4172/2325-9647.1000164

  • Mohamed WNHW, Salleh MNM, Omar AH (2013) A comparative study of Reduced Error Pruning method in decision tree algorithms. In: Proceedings - 2012 IEEE International Conference on Control System, Computing and Engineering, ICCSCE 2012

  • Moriasi DN, Arnold JG, Van Liew MW, Binger RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50:885–900. https://doi.org/10.13031/2013.23153

  • Ngo PTT, Panahi M, Khosravi K, Ghorbanzadeh O, Kariminejad N (2021) Evaluation of deep learning algorithms for national scale landslide susceptibility mapping of Iran. Geosci Front 12(2):505–519

    Article  Google Scholar 

  • Onan A, Korukoğlu S, Bulut H (2016) A multiobjective weighted voting ensemble classifier based on differential evolution algorithm for text sentiment classification. Expert Syst Appl 62:1–16. https://doi.org/10.1016/j.eswa.2016.06.005

    Article  Google Scholar 

  • Onyari E, Ilunga F (2013) Application of MLP neural network and M5P model tree in predicting streamflow: A case study of luvuvhu catchment, South Africa. Int J Innov Technol Manag 4(1):11–15

    Google Scholar 

  • Quinlan JR (1992) Learning with continuous classes. Mach Learn 92:343–348

    Google Scholar 

  • Razmkhah A, Alvankar SR, Kakahaji A (2016) Modeling lake Urmia water-level changes using local linear neuro-fuzzy method. J Water Sci Res 9(1, Spring 2017):47–61

  • Sajjad-Khan M, Coulibaly P (2006) Application of support vector machine in lake water level prediction. J Hydrol Eng 11(3). https://doi.org/10.1061/(ASCE)1084-0699(2006)11:3(199)

  • Shiri J, Shamshirband S, Kisi O, Karimi S, Bateni SM, Nezhad SHH (2016) Prediction of water-level in the Urmia Lake using the extreme learning machine approach. Water Resour Manage 30(14):5217–5229

    Article  Google Scholar 

  • Tama BA, Comuzzi M (2019) An empirical comparison of classification techniques for next event prediction using business process event logs. Expert Syst Appl 129:233–245. https://doi.org/10.1016/j.eswa.2019.04.016

  • Tian H, Qin Y, Niu Z, Wang L, Ge S (2021) Summer maize mapping by compositing time series sentinel-1A imagery based on crop growth cycles. J Indian Soc Remote Sens 49(11):2863–2874. https://doi.org/10.1007/s12524-021-01428-0

    Article  Google Scholar 

  • Tian H, Wang Y, Chen T, Zhang L, Qin Y (2021) Early-season mapping of winter crops using sentinel-2 optical imagery. Remote Sens (Basel, Switzerland) 13(19):3822. https://doi.org/10.3390/rs13193822

    Article  Google Scholar 

  • Wang J, Zuo R, Xiong Y (2019) Mapping mineral prospectivity via semi-supervised random forest. Nat Resour Res. https://doi.org/10.1007/s11053-019-09510-8

    Article  Google Scholar 

  • Wang Y, Witten IH (1997) Induction of model trees for predicting continuous classes. Proc. 9th Eur. Conf. Mach. Learn. Poster Pap. 128–137

  • Wang Z, Zuo R, Dong Y (2019) Mapping geochemical anomalies through integrating random forest and metric learning methods. Nat Resour Res 28:1285–1298. https://doi.org/10.1007/s11053-019-09471-y

    Article  Google Scholar 

  • Wang S, Zhang K, Chao L, Li D, Tian X, Bao H, ... Xia Y (2021) Exploring the utility of radar and satellite-sensed precipitation and their dynamic bias correction for integrated prediction of flood and landslide hazards. J Hydrol (Amsterdam) 603. https://doi.org/10.1016/j.jhydrol.2021.126964

  • Wang Y, Cheng H, Hu Q, Liu L, Jia L, Gao S, ... Wang Y (2022) Pore structure heterogeneity of Wufeng-Longmaxi shale, Sichuan Basin, China: Evidence from gas physisorption and multifractal geometries. J Petrol Sci Eng 208. https://doi.org/10.1016/j.petrol.2021.109313

  • Waseem Ahmad M, Reynolds J, Rezgui Y (2018) Predictive modelling for solar thermal energy systems: A comparison of support vector regression, random forest, extra trees and regression trees. J Clean Prod 203:810–821. https://doi.org/10.1016/j.jclepro.2018.08.207

    Article  Google Scholar 

  • Xie W, Li X, Jian W, Yang Y, Liu H, Robledo LF, ... Nie W (2021) A novel hybrid method for landslide susceptibility mapping-based Geodetector and machine learning cluster: A case of Xiaojin County. China ISPRS Int J Geo-Inf 10(2):93. https://doi.org/10.3390/ijgi10020093

  • Xie W, Nie W, Saffari P, Robledo LF, Descote P, ... Jian W (2021) Landslide hazard assessment based on Bayesian optimization–support vector machine in Nanping City. China Natural Hazards (Dordrecht) 109(1):931–948. https://doi.org/10.1007/s11069-021-04862-y

  • Yarar A, Onucyildiz M, Copty NK (2009) Modelling level change in lakes using neuro-fuzzy and artificial neural networks. J Hydrol 365(3–4):329–334

    Article  Google Scholar 

  • Yin L, Wang L, Keim BD, Konsoer K, Zheng W (2022a) Wavelet analysis of dam injection and discharge in three gorges dam and reservoir with precipitation and river discharge. Water 14(4):567. https://doi.org/10.3390/w14040567

    Article  Google Scholar 

  • Yin L, Wang L, Zheng W, Ge L, Tian J, Liu Y, ... Liu S (2022b) Evaluation of empirical atmospheric models using swarm-c satellite data. Atmosphere 13(2):294. https://doi.org/10.3390/atmos13020294

  • Young CC, Liu WC, Hshieh WL (2015) Predicting the water level fluctuation in an Alpine Lake using physically based, artificial neural network, and time series forecasting models. Math Probl Eng, Article ID708204. https://doi.org/10.1155/2015/708204

  • Yucalar F, Ozcift A, Borandag E, Kilinc D (2020) Multiple-classifiers in software quality engineering: Combining predictors to improve software fault prediction ability. Eng Sci Technol. https://doi.org/10.1016/j.jestch.2019.10.005

  • Zhang S, Xiao K, Carranza EJM, Yang F (2019) Maximum entropy and random forest modeling of mineral potential: Analysis of gold prospectivity in the Hezuo-Meiwu District West Qinling Orogen. China Nat Resour Res 28:645–664. https://doi.org/10.1007/s11053-018-9425-0

    Article  Google Scholar 

  • Zhang K, Ali A, Antonarakis A, Moghaddam M, Saatchi S, Tabatabaeenejad A, ... Moorcroft P (2019) The Sensitivity of North American Terrestrial Carbon Fluxes to Spatial and Temporal Variation in Soil Moisture: An analysis using radar-derived estimates of root-zone soil moisture. J Geophys Res Biogeosci 124(11):3208–3231. https://doi.org/10.1029/2018JG004589

  • Zhang K, Shalehy MH, Ezaz GT, Chakraborty A, Mohib KM, ... Liu L (2022) An integrated flood risk assessment approach based on coupled hydrological-hydraulic modeling and bottom-up hazard vulnerability analysis. Environ Model Softw 148. https://doi.org/10.1016/j.envsoft.2021.105279

  • Zhang K, Wang S, Bao H, Zhao X (2019) Characteristics and influencing factors of rainfall-induced landslide and debris flow hazards in Shaanxi Province, China. Nat Hazard 19(1):93–105. https://doi.org/10.5194/nhess-19-93-2019

    Article  Google Scholar 

  • Zhao F, Song L, Peng Z, Yang J, Luan G, Chu C, ... Xie Z (2021a) Night-time light remote sensing mapping: Construction and analysis of ethnic minority development index. Remote Sens (Basel, Switzerland) 13(11):2129. https://doi.org/10.3390/rs13112129

  • Zhao F, Zhang S, Du Q, Ding J, Luan G, ... Xie Z (2021b) Assessment of the sustainable development of rural minority settlements based on multidimensional data and geographical detector method: A case study in Dehong, China. Socio-Econ Plann Sci. https://doi.org/10.1016/j.seps.2021b.101066

  • Zhao X, Xia H, Pan L, Song H, Niu W, Wang R, ... Qin Y (2021c) Drought monitoring over Yellow River Basin from 2003–2019 using reconstructed MODIS land surface temperature in Google Earth Engine. Remote Sens (Basel, Switzerland) 13(18):3748. https://doi.org/10.3390/rs13183748

  • Zhu S, Lu H, Ptak M, Dai J, Ji Q (2020a) Lake water-level fluctuation forecasting using machine 703 learning models: a systematic review. Environ Sci Pollut Res 27(44807–704):44819

    Google Scholar 

  • Zhu S, Hrnjica B, Ptak M, Choiński A, Sivakumar B (2020b) Forecasting of water level in multiple 701 temperate lakes using machine learning models. J Hydrol 585:124819

    Article  Google Scholar 

  • Zhu Z, Zhu Z, Wu Y, Han J (2022) A prediction method of coal burst based on analytic hierarchy process and fuzzy comprehensive evaluation. Front Earth Sci (Lausanne) 9. https://doi.org/10.3389/feart.2021.834958

  • Zounemat-Kermani M, Stephan D, Barjenbruch M, Hinkelmann R (2020) Ensemble data mining modeling in corrosion of concrete sewer: A comparative study of network-based (MLPNN & RBFNN) and tree-based (RF, CHAID, & CART) models. Adv Eng Inform 43:101030. https://doi.org/10.1016/j.aei.2019.101030

    Article  Google Scholar 

  • Zuo Y, Jiang S, Wu S, Xu W, Zhang J, Feng R, ... Santosh M (2020) Terrestrial heat flow and lithospheric thermal structure in the Chagan Depression of the Yingen-Ejinaqi Basin, north central China. Basin Res 32(6):1328–1346. https://doi.org/10.1111/bre.12430

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Reza Jalali.

Ethics declarations

Conflict of interest

There is not any conflict of interest among authors.

Additional information

Communicated by H. Babaie.

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hajian, R., Jalali, M.R. & Mastouri, R. Multi-step Lake Urmia water level forecasting using ensemble of bagging based tree models. Earth Sci Inform 15, 2515–2543 (2022). https://doi.org/10.1007/s12145-022-00857-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12145-022-00857-w

Keywords

Navigation